This slide deck is from one of the four webinars in our half-day series, held in conjunction with Test Guild.
Chris Thompson and Mike Calabrese, Senior Solutions Architects and QuerySurge experts, provide great information, a demo and lots of humor in this webinar on how to implement DevOps for Data in your DataOps pipeline.
To watch the video, go to:
https://youtu.be/1ihuRPgY_rs
TestGuild and QuerySurge Presentation - DevOps for Data Testing
1. QuerySurge DevOps for Data
QuerySurge™, the smart Data Testing solution
a software division of
Mike Calabrese, Senior Solutions Architect
Chris Thompson, Senior Solutions Architect
2. MIKE CALABRESE
SENIOR DOMAIN EXPERT, DATA TESTING PRACTICE
• Joined RTTS as a Test Engineer in 2009
• Over a decade of experience successfully implementing automated functional, data validation and ETL testing solutions for multiple clients across many industry verticals.
• Mike is a technical expert on QuerySurge, RTTS’ flagship data testing solution, and supports clients around the world with their QuerySurge implementations.
• BS in Computer Engineering from Hofstra University
3. CHRIS THOMPSON
SENIOR DOMAIN EXPERT, DATA TESTING PRACTICE
• Military veteran: aviation electronics technician in the U.S. Navy
• BS in Computer Science from the University of Delaware
• Successful implementations of QA projects in the data space for over 15 years
• RTTS employee for the past 21 years
• Started with RTTS as an entry-level Test Engineer
• Worked in numerous fields including Pharmaceutical, Utilities and Retail
4. The Executive Office and Critical Data
C-level executives are using BI & Analytics to make critical business decisions with the assumption that the underlying data is fine. We know it is not.
[Diagram labels: Mainframe, ETL, Data Warehouse, Business Intelligence & Analytics. Callout: Typical data issue areas.]
5. Data Maturity Model for Validation & Testing
Patterned after CMMI. Source: QuerySurge
Level 1: Data Sampling & Excel
Using Excel spreadsheets to compare a small % of data sets visually. Ad hoc reporting. Doesn’t scale & isn’t repeatable.
Level 2: Data Compares & Minus Queries
Utilizing SQL editor & minus queries to test data. Ad hoc reporting.
Level 3: Data Testing & Home-Grown Framework
Built internally or by a system integrator. Expensive upfront costs to build and maintain. No documentation, limited or custom reporting.
Level 4: Data Testing & Commercial Solution
Fully repeatable test automation, centralized reporting, training, knowledge base, professional support, partner ecosystem.
Level 5: DevOps-for-Data Testing Solution
CI/CD pipeline test automation, RESTful API, minimum human interaction, predictive data issues, auditable history & automated reports. Business value is fully supported by management.
Share of companies on the Data Maturity Level scale, as called out on the slide: 90% of companies, 9% of companies, 1% of companies.
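The minus queries mentioned at Level 2 can be illustrated with a small sketch. It assumes an in-memory SQLite database and two hypothetical tables, `source_customers` and `target_customers`, invented here for illustration. SQLite spells the operator `EXCEPT`; Oracle uses `MINUS` for the same set difference.

```python
import sqlite3


def minus_query(conn, left, right):
    """Return rows present in table `left` but absent from table `right`."""
    # Table names are trusted constants in this sketch, not user input.
    sql = f"SELECT * FROM {left} EXCEPT SELECT * FROM {right} ORDER BY 1"
    return conn.execute(sql).fetchall()


# Hypothetical sample data standing in for a source system and its target.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE source_customers (id INTEGER, name TEXT);
    CREATE TABLE target_customers (id INTEGER, name TEXT);
    INSERT INTO source_customers VALUES (1, 'Ann'), (2, 'Bob'), (3, 'Cy');
    INSERT INTO target_customers VALUES (1, 'Ann'), (2, 'Rob');
""")

# A minus query is one-directional, so it is run both ways.
missing_in_target = minus_query(conn, "source_customers", "target_customers")
unexpected_in_target = minus_query(conn, "target_customers", "source_customers")
```

Running the query in both directions matters: the first result lists source rows that never arrived intact, the second lists target rows with no matching source row.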
6. How QuerySurge™ Works
QuerySurge connects to any 2 points at one time, compares every data set to find anomalies, and provides Data Validation Analytics.
Source Data / Target Data
• Databases
• Data Warehouses
• Flat Files & Excel
• JSON files
• Web Services
• Mainframe files
• Big Data Stores (Hadoop & NoSQL)
• Data Warehouses
• Business Intelligence reports
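The idea of comparing a source data set against a target data set can be sketched in a few lines. This is a generic illustration and not QuerySurge's implementation: the `CompareResult` class, the `compare_rows` function and the sample rows are all hypothetical, and rows are assumed to share a unique key column.

```python
from dataclasses import dataclass, field


@dataclass
class CompareResult:
    missing_in_target: list = field(default_factory=list)
    extra_in_target: list = field(default_factory=list)
    # Each mismatch is (key, column, source value, target value).
    mismatches: list = field(default_factory=list)

    @property
    def passed(self):
        return not (self.missing_in_target or self.extra_in_target or self.mismatches)


def compare_rows(source_rows, target_rows, key):
    """Compare two lists of dict rows that share a unique `key` column."""
    source = {row[key]: row for row in source_rows}
    target = {row[key]: row for row in target_rows}
    result = CompareResult()
    result.missing_in_target = sorted(source.keys() - target.keys())
    result.extra_in_target = sorted(target.keys() - source.keys())
    # For keys present on both sides, check every source column.
    for k in sorted(source.keys() & target.keys()):
        for column, expected in source[k].items():
            actual = target[k].get(column)
            if actual != expected:
                result.mismatches.append((k, column, expected, actual))
    return result


source_rows = [{"id": 1, "amount": 10.0}, {"id": 2, "amount": 20.0}, {"id": 3, "amount": 30.0}]
target_rows = [{"id": 1, "amount": 10.0}, {"id": 2, "amount": 25.0}, {"id": 4, "amount": 40.0}]
result = compare_rows(source_rows, target_rows, key="id")
```

Separating missing rows, extra rows and column-level mismatches keeps the report actionable, since each category usually points to a different kind of defect in the data movement.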
7. Data Process: Developer & Tester
[Diagram labels: Source Data, Big Data lake, ETL, Data Warehouse, ETL, Data Mart, BI & Analytics; Testing Points #1, #2, #3 and #4]
ETL Developer: Codes data movement based on Mapping Requirements
Data Tester: Tests data movement based on Mapping Requirements
Tester tests BI Reports
BI User extracts data for reports
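Testing data movement against mapping requirements can be sketched as follows. The tester encodes each mapping rule independently of the ETL code, derives the expected target row from the source row, and compares it with what was actually loaded. The `MAPPING` rules, column names and sample rows are hypothetical and chosen only for illustration.

```python
# Hypothetical mapping requirements: target column -> rule applied to a source row.
MAPPING = {
    "customer_id": lambda src: src["id"],
    "full_name": lambda src: f"{src['first']} {src['last']}".strip(),
    "country": lambda src: src["country"].upper(),
}


def expected_target_row(source_row):
    """Apply every mapping rule to one source row."""
    return {column: rule(source_row) for column, rule in MAPPING.items()}


def check_mapping(source_rows, loaded_rows):
    """Return (customer_id, column, expected, actual) for every rule violation."""
    loaded = {row["customer_id"]: row for row in loaded_rows}
    failures = []
    for src in source_rows:
        expected = expected_target_row(src)
        # A row missing from the target fails on every mapped column.
        actual = loaded.get(expected["customer_id"], {})
        for column, value in expected.items():
            if actual.get(column) != value:
                failures.append((expected["customer_id"], column, value, actual.get(column)))
    return failures


source_rows = [
    {"id": 1, "first": "Ada", "last": "Lovelace", "country": "gb"},
    {"id": 2, "first": "Alan", "last": "Turing", "country": "gb"},
]
# Rows as loaded by a (hypothetical) ETL job; row 2 skipped the upper-casing rule.
loaded_rows = [
    {"customer_id": 1, "full_name": "Ada Lovelace", "country": "GB"},
    {"customer_id": 2, "full_name": "Alan Turing", "country": "gb"},
]
failures = check_mapping(source_rows, loaded_rows)
```

Because developer and tester both work from the same mapping requirements but write their logic separately, a disagreement between expected and loaded values flags either an ETL defect or an ambiguous requirement.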
8. QuerySurge™ Environment
Web-based…
Installs...
…in the Cloud
…on a VM
…on a Bare Metal Server
Connects through…
…to any JDBC compliant data source
Supported OS...
Components:
• QuerySurge Controller
• QuerySurge Server: DB Server (MySQL), App Server (Tomcat)
• QuerySurge Agents (ships with 10 Agents)
9. Supported Technologies: 200+ data stores
QuerySurge supports all the data stores below (and more) as either a source or target.
Big Data & NoSQL
Amazon Athena, Amazon DynamoDB, Apache Drill, Apache Hadoop, Avro, Azure Data Catalog, Azure Data Lake Storage, Azure Table, BigInsights, BigQuery, Cassandra, Cloudant, Cloudera, CockroachDB, Cosmos DB, CouchDB, Couchbase, Databricks, Dremio, Elasticsearch, Google Data Catalog, Google Spanner, GraphQL, HBase, HPCC Systems, HarperDB, Hive, IBM Cloud SQL Query, Kafka, MarkLogic, MongoDB, Parquet, Phoenix, Power BI XMLA, Presto, Redis, Redshift, SAS Data Sets, SAS xpt, Snowflake, Spark, TigerGraph, Vertica
Databases
Access, Azure Analysis Services, Azure Synapse, Btrieve, DB2, EnterpriseDB, Exasol, Greenplum, Impala, Informix, JDBC-ODBC Bridge, MariaDB, MySQL, Nonstop SQL, Oracle, PostgreSQL, SAP Business One, SAP HANA, SAP Hybris C4C, SQL Analysis Services, SQL Server, SingleStore, Sybase, Sybase IQ, Teradata, xBase
Files & APIs
Amazon S3, Avro, Box, CSV, DigitalOcean, Dropbox, FTP, Google Cloud Storage, Google Drive, HDFS, IBM Cloud Object Storage, JSON, LDAP, Microsoft Dataverse, Microsoft OneDrive, OData, Parquet, Power BI XMLA, REST, RSS, SAS Data Sets, SAS xpt, SFTP, Twilio, Wasabi, XML
Accounting
ADP, Avalara AvaTax, Blackbaud FE NXT, Dynamics GP, Dynamics NAV, Exact Online, FreshBooks, Freshdesk, MYOB AccountRight, QuickBooks, QuickBooks Online, QuickBooks POS, Reckon, SAP Concur, Sage 200, Sage 50 UK, Sage Cloud Accounting, Sage Intacct, TSheets, TaxJar, Workday, Xero, Zoho Books
eCommerce
ADP, Amazon Marketplace, Authorize.Net, BigCommerce, Blackbaud FE NXT, EDGAR Online, Magento, Open Exchange Rates, PayPal, Paylocity, Quandl, SAP Fieldglass, ShipStation, Shopify, Square, Stripe, Wave Financial, WooCommerce, X-Cart, Zuora, eBay, eBay Analytics
CRM
Workday
Marketing
Act-On, Adobe Analytics, Bing Ads, Facebook, Facebook Ads, Google Ad Manager, Google Ads, Google Analytics, Google Campaign Manager, HubSpot, Instagram, LinkedIn, LinkedIn Ads, Magento, MailChimp, Marketo, Oracle Eloqua, Oracle Sales, Pinterest, SAP BusinessObjects BI, Salesforce (Chatter, Einstein, Marketing, Pardot), SendGrid, Snapchat Ads, Splunk, SurveyMonkey, Twitter, Twitter Ads, YouTube Analytics
BI Reports
Microsoft Power BI, Tableau, IBM Cognos, Microstrategy, Oracle OBIEE, SAP Business Objects
Collaboration
AWS Management, Active Directory, Airtable, Alfresco, Asana, Autify, Azure DevOps, Azure Management, Basecamp, Bugzilla, Confluence, DocuSign, Email, Evernote, Excel, Excel Online, Excel Services, Gmail, Google Calendar, Google Contacts, Google Sheets, HCL Domino, Jira, Jira Service Desk, Kintone, Microsoft Exchange, Microsoft Planner, Microsoft Project, Microsoft Teams, Office 365, OneNote, QuickBase, Raisers Edge NXT, SAP BusinessObjects BI, SAP SuccessFactors, SharePoint, Slack, Smartsheet, Trello, WordPress, Xero WorkflowMax, Zendesk
10. QuerySurge DevOps for Data
• First full DevOps for Data testing solution
• Both RESTful and command line APIs
• Improves Data Quality at Speed
DevOps for Data integrates with:
• Continuous integration/ETL solutions
• Automated build/release/deployment solutions
• Operations and DevOps monitoring solutions
• Test management/issue tracking solutions
• Scheduling and workload automation solutions
60+ API calls with almost 100 different properties that users can utilize to retrieve, edit, update, or delete information.
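One way a CI/CD pipeline might consume a RESTful data testing API is as a quality gate: a pipeline step asks for the outcome of a test run and stops the release when failures exceed a threshold. The sketch below is a generic illustration only. The endpoint path, the bearer-token header, the base URL and the JSON shape are all hypothetical and are not taken from the QuerySurge API; the request is built but never sent.

```python
import json
import urllib.request


def build_status_request(base_url, run_id, token):
    """Build (but do not send) a GET request for a test run summary."""
    # HYPOTHETICAL endpoint and auth header, for illustration only;
    # the real paths and authentication come from the product's API documentation.
    url = f"{base_url.rstrip('/')}/hypothetical/test-runs/{run_id}"
    return urllib.request.Request(
        url, headers={"Authorization": f"Bearer {token}"}, method="GET"
    )


def gate(summary_json, max_failures=0):
    """Return a process exit code: 0 lets the pipeline continue, 1 stops it."""
    # Assumed response shape: {"passed": int, "failed": int}
    summary = json.loads(summary_json)
    return 0 if summary["failed"] <= max_failures else 1


request = build_status_request("https://datatest.example.com/api/", 42, "TOKEN")
clean_run = gate('{"passed": 120, "failed": 0}')
broken_run = gate('{"passed": 118, "failed": 2}')
```

In a real pipeline the step would send the request, pass the response body to `gate`, and hand the result to `sys.exit` so that the build tool treats a non-zero code as a failed stage.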
11. QuerySurge™ Swagger
Swagger is an open-source Interface Description Language (IDL) for describing RESTful APIs expressed using JSON.
Swagger is used together with a set of open-source software tools to design, build, document, and use RESTful web services.
And Swagger is embedded in QuerySurge.
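To make the idea of a JSON description of a RESTful API concrete, here is a minimal sketch that parses a small Swagger 2.0 document and lists the operations it declares. The document itself is hypothetical: the title, paths and summaries are invented for illustration and do not describe the QuerySurge API.

```python
import json

# Hypothetical Swagger 2.0 document; the paths are illustrative, not a real API.
SPEC_JSON = """
{
  "swagger": "2.0",
  "info": {"title": "Example Data Testing API", "version": "1.0"},
  "paths": {
    "/suites": {
      "get": {"summary": "List test suites"},
      "post": {"summary": "Create a test suite"}
    },
    "/suites/{id}": {
      "delete": {"summary": "Delete a test suite"}
    }
  }
}
"""

# HTTP methods that a Swagger 2.0 path item may define as operations.
HTTP_METHODS = {"get", "put", "post", "delete", "options", "head", "patch"}


def list_operations(spec):
    """Return sorted (METHOD, path, summary) tuples from a Swagger document."""
    operations = []
    for path, item in spec.get("paths", {}).items():
        for method, operation in item.items():
            if method in HTTP_METHODS:  # skip non-operation keys such as "parameters"
                operations.append((method.upper(), path, operation.get("summary", "")))
    return sorted(operations)


operations = list_operations(json.loads(SPEC_JSON))
```

Because the description is plain JSON, the same document can drive interactive documentation, client generation and simple tooling like this listing.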