a software division of
QuerySurge™ a software division of
the smart Data Testing solution
QuerySurgeTM
™
Mike Calabrese
Senior Solutions Architect
QuerySurge DevOps for Data
Chris Thompson
Senior Solutions Architect
a software division of
QuerySurge™
SENIOR DOMAIN EXPERT
DATA TESTING PRACTICE
• Joined RTTS as a Test Engineer in 2009
• Over a decade of experience successfully implementing
automated functional, data validation and ETL testing solutions
for multiple clients across many industry verticals.
• Mike is a technical expert on QuerySurge, RTTS’ flagship data
testing solution, and supports clients around the world with their
QuerySurge implementations.
• BS in Computer Engineering from Hofstra University
MIKE CALABRESE
a software division of
QuerySurge™
SENIOR DOMAIN EXPERT
DATA TESTING PRACTICE
• Military veteran - Aviation electronics technician in the U.S Navy
• BS in computer science from the University of Delaware
• Successful implementations of QA projects in the Data space for ove
15 years
• Employee for RTTS for the past 21 years
• Started with RTTS as an entry level Test Engineer
• Worked in numerous fields including Pharmaceutical, Utilities and
Retail
CHRIS THOMPSON
Data Warehouse
ETL
Data Warehouse
ETL
Mainframe
Business Intelligence
& Analytics
C-level executives are using BI &
Analytics to make critical business
decisions with the assumption
that the underlying data is fine
ETL
We know it is not
Typical data
issue areas
The Executive Office and Critical Data
a software division of
Patterned after CMMI
Source: QuerySurge
Data Maturity Model for Validation & Testing
Data Testing & Commercial Solution
Level
4
Fully repeatable test automation, centralized reporting, training,
knowledge base, professional support, partner ecosystem.
DevOps-for-Data Testing Solution
Level
5
CI/CD pipeline test automation, RESTful API, minimum human
interaction, predictive data issues, auditable history & automated
reports. Business value is fully supported by management.
1%
of companies
9%
of companies
Data Sampling & Excel
Level
1
Using Excel spreadsheets to compare a small % of data sets
visually. Ad-hoc reporting. Doesn’t scale & isn’t repeatable.
Data Compares & Minus Queries
Level
2
Utilizing SQL editor & minus queries to test data. Ad hoc reporting.
Data Testing & Home-Grown Framework
Level
3
Built internally or by a system integrator. Expensive upfront costs to
build and maintain. No documentation, limited or custom reporting.
90%
of companies
on the
Data Maturity
Level
QuerySurge connects
to any 2 points
at one time
Compares every data set to
find anomalies
…and provides
Data Validation Analytics
How QuerySurgeTM
Works
Source Data Target Data
• Databases
• Data Warehouses
• Flat Files & Excel
• JSON files
• Web Services
• Mainframe files
• Big Data Stores
(Hadoop & NoSQL)
• Data Warehouses
• Business Intelligence
reports
Source
Data
Target
Data
Data Warehouse
ETL
Data Mart
ETL
Source Data Big Data lake BI & Analytics
Testing Point #1 Testing Point #2 Testing Point #3
Data Tester: Tests data movement based on Mapping Requirements
Testing Point #4
Tester tests BI
Reports
ETL Developer: Codes data movement based on Mapping Requirements
BI User extracts
data for reports
Data Process: Developer & Tester
QuerySurgeTM
Environment
Connects through…
…to any JDBC compliant data source
Installs...
…in the Cloud
…on a VM
…on a Bare Metal Server
Supported OS...
QuerySurge™ a software division of
Web-based…
QuerySurge
Controller
QuerySurge Server
DB Server (MySQL)
App Server (Tomcat)
QuerySurge Agents
(Ships with 10 Agents)
Supported Technologies: 200+ data stores
QuerySurge supports all the data stores below (and more) as either a source or target.
Supported Technologies
Amazon Athena
Amazon DynamoDB
Apache Drill
Apache Hadoop
Avro
Azure Data Catalog
Azure Data Lake
Storage
Azure Table
BigInsights
BigQuery
Cassandra
Cloudant
Cloudera
CockroachDB
Cosmos DB
CouchDB
Couchbase
Databricks
Dremio
Elasticsearch
Google Data Catalog
Google Spanner
GraphQL
HBase
HPCC Systems
HarperDB
Hive
IBM Cloud SQL Query
Kafka
MarkLogic
MongoDB
Parquet
Phoenix
Power BI XMLA
Presto
Redis
Redshift
SAS Data Sets
SAS xpt
Snowflake
Spark
TigerGraph
Vertica
Databases
Access
Azure Analysis Services
Azure Synapse
Btrieve
DB2
EnterpriseDB
Exasol
Greenplum
Impala
Informix
JDBC-ODBC Bridge
MariaDB
MySQL
Nonstop SQL
Oracle
PostgreSQL
SAP Business One
SAP HANA
SAP Hybris C4C
SQL Analysis Services
SQL Server
SingleStore
Sybase
Sybase IQ
Teradata
xBase
Files & APIs
Amazon S3
Avro
Box
CSV
DigitalOcean
Dropbox
FTP
Google Cloud Storage
Google Drive
HDFS
IBM Cloud Object Storage
JSON
LDAP
Microsoft Dataverse
Microsoft OneDrive
OData
Parquet
Power BI XMLA
REST
RSS
SAS Data Sets
SAS xpt
SFTP
Twilio
Wasabi
XML
Accounting
ADP
Avalara AvaTax
Blackbaud FE NXT
Dynamics GP
Dynamics NAV
Exact Online
FreshBooks
Freshdesk
MYOB AccountRight
QuickBooks
QuickBooks Online
QuickBooks POS
Reckon
SAP Concur
Sage 200
Sage 50 UK
Sage Cloud Accounting
Sage Intacct
TSheets
TaxJar
Workday
Xero
Zoho Books
eCommerce
ADP
Amazon Marketplace
Authorize.Net
BigCommerce
Blackbaud FE NXT
EDGAR Online
Magento
Open Exchange Rates
PayPal
Paylocity
Quandl
SAP Fieldglass
ShipStation
Shopify
Square
Stripe
Wave Financial
WooCommerce
X-Cart
Zuora
eBay
eBay Analytics
Big Data & NoSQL Marketing
CRM
Workday
Marketing
Act-On
Adobe Analytics
Bing Ads
Facebook
Facebook Ads
Google Ad Manager
Google Ads
Google Analytics
Google Campaign Manager
HubSpot
Instagram
LinkedIn
LinkedIn Ads
Magento
MailChimp
Marketo
Oracle Eloqua
Oracle Sales
Pinterest
SAP BusinessObjects BI
Salesforce Chatter,Einstein,
Marketing,Pardot
SendGrid
Snapchat Ads
Splunk
SurveyMonkey
Twitter
Twitter Ads
YouTube Analytics
BI Reports
Microsoft Power BI
Tableau
IBM Cognos
Microstrategy
Oracle OBIEE
SAP Business Objects
Collaboration
AWS Management
Active Directory
Airtable
Alfresco
Asana
Autify
Azure DevOps
Azure Management
Basecamp
Bugzilla
Confluence
DocuSign
Email
Evernote
Excel
Excel Online
Excel Services
Gmail
Google Calendar
Google Contacts
Google Sheets
HCL Domino
Jira
Jira Service Desk
Kintone
Microsoft Exchange
Microsoft Planner
Microsoft Project
Microsoft Teams
Office 365
OneNote
QuickBase
Raisers Edge NXT
SAP BusinessObjects BI
SAP SuccessFactors
SharePoint
Slack
Smartsheet
Trello
WordPress
Xero WorkflowMax
Zendesk
QuerySurge DevOps for Data
• First full DevOps for Data testing solution
• Both RESTful and command line APIs
• Improves Data Quality at Speed
DevOps for Data integrates with:
• Continuous integration/ETL solutions
• Automated build/release/deployment solutions
• Operations and DevOps monitoring solutions
• Test management/issue tracking solutions
• Scheduling and workload automation solutions
60+ API calls with almost 100 different properties that
users can utilize to retrieve, edit, update, or delete
information.
QuerySurgeTM
DevOps
QuerySurge™ a software division of
Swagger is an open-source Interface
Description Language (IDL) for
describing RESTful APIs expressed using
JSON.
Swagger is used together with a set of
open-source software tools to design,
build, document, and use RESTful web
services.
And Swagger is embedded in
QuerySurge.
QuerySurgeTM
Swagger
QuerySurge™ a software division of
Swagger is an open-source Interface
Description Language (IDL) for
describing RESTful APIs expressed using
JSON.
Swagger is used together with a set of
open-source software tools to design,
build, document, and use RESTful web
services.
And Swagger is embedded in
QuerySurge.
QuerySurgeTM
Swagger
QuerySurge™ a software division of
QuerySurgeTM
QuerySurge™
a software division of

TestGuild and QuerySurge Presentation -DevOps for Data Testing

  • 1.
    a software divisionof QuerySurge™ a software division of the smart Data Testing solution QuerySurgeTM ™ Mike Calabrese Senior Solutions Architect QuerySurge DevOps for Data Chris Thompson Senior Solutions Architect
  • 2.
    a software divisionof QuerySurge™ SENIOR DOMAIN EXPERT DATA TESTING PRACTICE • Joined RTTS as a Test Engineer in 2009 • Over a decade of experience successfully implementing automated functional, data validation and ETL testing solutions for multiple clients across many industry verticals. • Mike is a technical expert on QuerySurge, RTTS’ flagship data testing solution, and supports clients around the world with their QuerySurge implementations. • BS in Computer Engineering from Hofstra University MIKE CALABRESE
  • 3.
    a software divisionof QuerySurge™ SENIOR DOMAIN EXPERT DATA TESTING PRACTICE • Military veteran - Aviation electronics technician in the U.S Navy • BS in computer science from the University of Delaware • Successful implementations of QA projects in the Data space for ove 15 years • Employee for RTTS for the past 21 years • Started with RTTS as an entry level Test Engineer • Worked in numerous fields including Pharmaceutical, Utilities and Retail CHRIS THOMPSON
  • 4.
    Data Warehouse ETL Data Warehouse ETL Mainframe BusinessIntelligence & Analytics C-level executives are using BI & Analytics to make critical business decisions with the assumption that the underlying data is fine ETL We know it is not Typical data issue areas The Executive Office and Critical Data
  • 5.
    a software divisionof Patterned after CMMI Source: QuerySurge Data Maturity Model for Validation & Testing Data Testing & Commercial Solution Level 4 Fully repeatable test automation, centralized reporting, training, knowledge base, professional support, partner ecosystem. DevOps-for-Data Testing Solution Level 5 CI/CD pipeline test automation, RESTful API, minimum human interaction, predictive data issues, auditable history & automated reports. Business value is fully supported by management. 1% of companies 9% of companies Data Sampling & Excel Level 1 Using Excel spreadsheets to compare a small % of data sets visually. Ad-hoc reporting. Doesn’t scale & isn’t repeatable. Data Compares & Minus Queries Level 2 Utilizing SQL editor & minus queries to test data. Ad hoc reporting. Data Testing & Home-Grown Framework Level 3 Built internally or by a system integrator. Expensive upfront costs to build and maintain. No documentation, limited or custom reporting. 90% of companies on the Data Maturity Level
  • 6.
    QuerySurge connects to any2 points at one time Compares every data set to find anomalies …and provides Data Validation Analytics How QuerySurgeTM Works Source Data Target Data • Databases • Data Warehouses • Flat Files & Excel • JSON files • Web Services • Mainframe files • Big Data Stores (Hadoop & NoSQL) • Data Warehouses • Business Intelligence reports Source Data Target Data
  • 7.
    Data Warehouse ETL Data Mart ETL SourceData Big Data lake BI & Analytics Testing Point #1 Testing Point #2 Testing Point #3 Data Tester: Tests data movement based on Mapping Requirements Testing Point #4 Tester tests BI Reports ETL Developer: Codes data movement based on Mapping Requirements BI User extracts data for reports Data Process: Developer & Tester
  • 8.
    QuerySurgeTM Environment Connects through… …to anyJDBC compliant data source Installs... …in the Cloud …on a VM …on a Bare Metal Server Supported OS... QuerySurge™ a software division of Web-based… QuerySurge Controller QuerySurge Server DB Server (MySQL) App Server (Tomcat) QuerySurge Agents (Ships with 10 Agents)
  • 9.
    Supported Technologies: 200+data stores QuerySurge supports all the data stores below (and more) as either a source or target. Supported Technologies Amazon Athena Amazon DynamoDB Apache Drill Apache Hadoop Avro Azure Data Catalog Azure Data Lake Storage Azure Table BigInsights BigQuery Cassandra Cloudant Cloudera CockroachDB Cosmos DB CouchDB Couchbase Databricks Dremio Elasticsearch Google Data Catalog Google Spanner GraphQL HBase HPCC Systems HarperDB Hive IBM Cloud SQL Query Kafka MarkLogic MongoDB Parquet Phoenix Power BI XMLA Presto Redis Redshift SAS Data Sets SAS xpt Snowflake Spark TigerGraph Vertica Databases Access Azure Analysis Services Azure Synapse Btrieve DB2 EnterpriseDB Exasol Greenplum Impala Informix JDBC-ODBC Bridge MariaDB MySQL Nonstop SQL Oracle PostgreSQL SAP Business One SAP HANA SAP Hybris C4C SQL Analysis Services SQL Server SingleStore Sybase Sybase IQ Teradata xBase Files & APIs Amazon S3 Avro Box CSV DigitalOcean Dropbox FTP Google Cloud Storage Google Drive HDFS IBM Cloud Object Storage JSON LDAP Microsoft Dataverse Microsoft OneDrive OData Parquet Power BI XMLA REST RSS SAS Data Sets SAS xpt SFTP Twilio Wasabi XML Accounting ADP Avalara AvaTax Blackbaud FE NXT Dynamics GP Dynamics NAV Exact Online FreshBooks Freshdesk MYOB AccountRight QuickBooks QuickBooks Online QuickBooks POS Reckon SAP Concur Sage 200 Sage 50 UK Sage Cloud Accounting Sage Intacct TSheets TaxJar Workday Xero Zoho Books eCommerce ADP Amazon Marketplace Authorize.Net BigCommerce Blackbaud FE NXT EDGAR Online Magento Open Exchange Rates PayPal Paylocity Quandl SAP Fieldglass ShipStation Shopify Square Stripe Wave Financial WooCommerce X-Cart Zuora eBay eBay Analytics Big Data & NoSQL Marketing CRM Workday Marketing Act-On Adobe Analytics Bing Ads Facebook Facebook Ads Google Ad Manager Google Ads Google Analytics Google Campaign Manager HubSpot Instagram LinkedIn LinkedIn Ads Magento MailChimp Marketo Oracle Eloqua Oracle Sales Pinterest SAP BusinessObjects BI Salesforce Chatter,Einstein, Marketing,Pardot SendGrid Snapchat Ads Splunk SurveyMonkey Twitter Twitter Ads YouTube Analytics BI Reports Microsoft Power BI Tableau IBM Cognos Microstrategy Oracle OBIEE SAP Business Objects Collaboration AWS Management Active Directory Airtable Alfresco Asana Autify Azure DevOps Azure Management Basecamp Bugzilla Confluence DocuSign Email Evernote Excel Excel Online Excel Services Gmail Google Calendar Google Contacts Google Sheets HCL Domino Jira Jira Service Desk Kintone Microsoft Exchange Microsoft Planner Microsoft Project Microsoft Teams Office 365 OneNote QuickBase Raisers Edge NXT SAP BusinessObjects BI SAP SuccessFactors SharePoint Slack Smartsheet Trello WordPress Xero WorkflowMax Zendesk
  • 10.
    QuerySurge DevOps forData • First full DevOps for Data testing solution • Both RESTful and command line APIs • Improves Data Quality at Speed DevOps for Data integrates with: • Continuous integration/ETL solutions • Automated build/release/deployment solutions • Operations and DevOps monitoring solutions • Test management/issue tracking solutions • Scheduling and workload automation solutions 60+ API calls with almost 100 different properties that users can utilize to retrieve, edit, update, or delete information. QuerySurgeTM DevOps QuerySurge™ a software division of
  • 11.
    Swagger is anopen-source Interface Description Language (IDL) for describing RESTful APIs expressed using JSON. Swagger is used together with a set of open-source software tools to design, build, document, and use RESTful web services. And Swagger is embedded in QuerySurge. QuerySurgeTM Swagger QuerySurge™ a software division of
  • 12.
    Swagger is anopen-source Interface Description Language (IDL) for describing RESTful APIs expressed using JSON. Swagger is used together with a set of open-source software tools to design, build, document, and use RESTful web services. And Swagger is embedded in QuerySurge. QuerySurgeTM Swagger QuerySurge™ a software division of
  • 13.