SlideShare a Scribd company logo
1 of 60
Download to read offline
COVID-19 - Building Trust in Data
To Save Lives
Paul Balas
25+ Years Experience Leading Digital
Transformations
Multiple MDM Implementations, Data
Governance, and Data Warehouse Initiatives
q Digital Transformation Consultant
q IT Executive
q Enterprise Architect
q Developer
paul.balas@revisioninc.com
THE REVISION ANALYTICS DIFFERENCE
Founded in 1998, REVISION is a management consulting firm that marries domain and technology expertise
to achieve amazing results for our clients through digital transformations that deliver value.
§ REVISION Senior Consultants
Depth Of Experience –
“We’ve Walked The Walk.”
§ 20+ Years C-level Advisory
Roles In Performance
Excellence
§ 20+ Years IT Operations
& Security Experience
§ Airport and Transportation
Experience
§ Culture & Operations
State, Local & Federal
§ DoD Secret & Top Secret
Cleared Staff
Data
Strategy
Data
Ops
Data
Governance
MDM
Advanced
Analytics
Assessment & Implementation
4
REVISION – LEADING IN RESPONSE TO THE COVID PANDEMIC
We built a system to enable John F. Kennedy International (JFK),
Newark Liberty International (EWR) and LaGuardia Airport (LGA) to
aid in response to the COVID pandemic
“… The Aviation Strategy Unit, created a custom
dashboard that estimates and predicts how many
passengers will arrive from states with travel
advisories. The new tool combines API data and
information within the Port Authority’s data
warehouse with updates from the COVID Tracking
Project…”
This is a presentation to uncover the systemic
data challenges in our Government’s response to
The COVID Pandemic
And a proposal to improve data quality and build trust
Agenda
6
What has happened since we started this journey?
Why is the Data Wrong?
Why the New HHS System Isn’t Working
How To Fix It - a POC
The Journey
7
Project
Start
Dr. Deborah Birx said “there is nothing from the CDC that I can
trust” in a White House coronavirus task force meeting
8
May 10, 2020
TRUST
I’m going to show you four news articles that
speak to four different types of challenges
with achieving good data governance and
delivering trusted data.
Do you notice any governance challenges we
encounter in our own companies?
9
10
Medical examiner offices
had struggled to keep up
with a recent spike in
COVID-19 deaths, with
more than 2,300 deaths
reported statewide in just
the past two weeks,
according to the Florida
Department of Health.
11
"In reviewing the analysis obtained by NPR, Panchadsaram says the local
and hospital-level data HHS is collecting would be very useful to
researchers and health leaders. "That stuff isn't easy to find at a national
level," he says. "There's no one place [publicly] you can go to get all that
data."
October 30, 2020
12
“During a news conference Saturday morning, Perna
explained that he had not taken into consideration the time
it would take for completed vaccine to go through the full
Food and Drug Administration quality control process, which
can take 48 hours.”
Dec. 19, 2020
13
”I saw the President presenting graphs
that I never made. I know that someone
out there or someone inside was creating
a parallel set of data and graphics that
were shown to the President.”
January 25, 2021
Increasing
volumes and
process
inefficiencies
Data silos and
protectionism
Poor data
governance
and unclear
definitions
Dueling
dashboards
14
Common Challenges in Data Governance
1: What are your top Data Governance
Challenges?
15
a) Increasing Volumes
b) Data Silos
c) Unclear Definitions
d) Dueling Dashboards
e) Lack of Executive Alignment
Why is COVID-19
Data Wrong?
No published standards on how to collect
and present the information
The CDC had an aging system which
wasn’t agile to change
The virus is spreading quickly –
huge volumes of new data to process
We weren’t capturing needed data
16
How Do We Need
to Use The Data?
PPE, Hospital Capacity, Testing
Supplies
Ensure we have enough doctors,
nurses, and other healthcare
professionals
Issuing protective orders to stop the
spread of the virus (Shelter-in-place,
Social Distancing, Shuttering
Business)
17
We’ve been asking ourselves, ”Could we have saved more lives?”
Without trusted data,
no decision will be driven
with conviction
18
19
The Data is Inconsistent - States Are Reporting to Differing Standards
https://www.beckershospitalreview.com/data-analytics/states-are-reporting-inconsistent-incomplete-covid-19-data-analysis-
finds.html#:~:text=The%20federal%20government%20has%20left,former%20CDC%20Director%20Tom%20Friedan.
20
The Data Isn’t Standardized - How Many Tests Have Been Given?
21
22
There are literally 1,000’s of
examples of incorrectly
reported data
The HHS System isn’t
solving the problem, and
It’s a Big Problem
CDC has 1,200 Users and about 950
State CDC Partners
There are over 6,000 Hospitals in the
US
About 2,000 Hospitals provide Covid
data directly to HH Protect
TeleTracking provides data from
about 1,100 additional Hospitals
23
HHS Challenges
● “Use of data across programs… remains a challenge.”
● “Data are often housed in … data silos”
24
25
NNDSS is focused on speed, data
format standardization, and
validation
They need to focus on
improving Data Governance
and Collaboration
26
TeleTracking, is just a data entry app
that is driven by procedural rules
The main problem in achieving Data
Quality is PEOPLE
27
Definitions translate into
procedural business logic to
ensure quality data.
But does this approach work?
The Fallacy of Standardized Data Entry Solutions
28
If you’re one of those managers who
believes that a standardized data entry
screen fixes your organizations data
quality problem...
You haven’t done much data analysis
(go speak with your data team)
A Novel Way to Solve
The Data Governance Challenge
29
Imagine This is Your Problem to Solve
You are the CIO of the CDC tasked to improve our Nation’s ability to
better manage this and the next pandemic through the use of data.
Your first goal is to understand the key issues in the current system
(“as-is”), and develop a roadmap to address them.
The stakes are high.
30
Architectural Principles
Build trust in the data
Identify which issues to fix first
The system should be agile to change
Efficient collaboration
31
Two Paths
Standardize data entry systems
Identify and address systemic
governance challenges
32
Identify and Address Systemic Governance Challenges
Master Data Management
Taxonomy Management
Data Quality
Cloud Data Integration
Data Transformation
Orchestration
Natural Language / Feature Extraction
Data Lake
Compute and Storage
Understand
Systemic Data
Challenges
You choose to look at issues around
‘Testing’ as you believe you can get
immediate benefits for public health if
you can build confidence in the test
data
What problems are states having in
processing Test Data?
Is Test Data being reported
consistently, accurately?
34
Compare 3 Websites
Standards on
Test Data
35
Completely Different Standards
(For The Same Data)
36
Three Weeks Later
37
Integrate and process the self-reported
data quality issues from your three test
sites
Create data quality taxonomies to
report on the issue types
Extract entities from the quality issues
and classify them using our
taxonomies
Virtual
Standardization
Meetings
You use your findings to review issues
and publish recommendations to
conform to new standards for each of
the data providers.
You use the platform to manage
quality on an on-going basis.
You provide the DataOps platform to
data providers to self-manage their
quality issues.
39
Can We Do More?
You can now see the systemic data quality issues and
communicate with stakeholders effectively to get alignment.
Next, classify news by Public Influencers, Events, and Locations
because you want to see if what our leaders say has an impact
on testing and infection rates.
40
Public Influencers - New Jersey
Public Influencers - New York
2. Do you have confidence in the COVID
data?
43
a. High Confidence
b. Neutral
c. Low Confidence
How Was It Built?
44
Google Cloud Platform
InfoWorks
Logical workflow
Internet
data sources
Data orchestration Tableau
TAMR
Big Query VM
instances
Google
cloud
storage
Natural
language
Python
Twitter API
News API
State Health
Department
Web Pages
JHU Github
I Had To Extract Meaning From Text
Data Experts - Spend More Time Analyzing/Strategizing
Before: Experts spend too much
Time manually fixing data
Today: ML can do 80% of
data mastering lift...
…. Enabling experts to put final
touches on the last 20%.
48
The TAMR Agile Approach to Data Mastering
Mastered data
OLD WAY
Rules-based
Source data
Mastered data
Time
Quality
Months to years
60%–80% Accuracy
Modify rules, create
exceptions
Months 1–4
Months 5–12+
Iterate
Machine-driven
NEW WAY
Days to weeks
90%+ accuracy
Source data
Weeks 1–12
Iterate with human-
guided machine
learning
Identify developers
Get business input
Write rules
Review with business
Unified data
Rules
Taxonomies: Before vs. After Tamr
TAMR enabled us to create standardized taxonomies that can be managed by a
networked group of hospitals, labs, health officials.
These taxonomies are critical to having good quality and conformed data across a
widely distributed data network.
There is an efficient mechanism for building consensus across experts at the same
time as fixing the data.
There is no solution like it in the market.
MDM - People Mastering
People Master Workflow
Mastering People:
530K to 9K in a
Few Days
Using TAMR, I was able to take a
corpus of over 500k entity records
identified by Google Natural Language
across 60,000 news articles, hundreds
of web pages, thousands of tweets in a
few weeks, reducing it to a about 10k
Golden Master People Records
I estimate the system can be
maintained in one to two hours a week
at scale, decreasing to minutes a week
as the model learns
53
Events - Before and After TAMR
Event Classification
7 TAMR Projects Built in a Few Weeks
Event
Classification
Events are hard to identify in text
I first created a taxonomy of events
tagged by Google Natural Language to
limit those of interest.
I loaded those into TAMR
And then classified them in a few hours
57
3: Which of these capabilities are you
most interested in learning more
about?
58
Agile Data Governance
Google Cloud Platform
Google Natural Language Processing API
TAMR
InfoWorks
DataOps
Conclusion
The COVID Pandemic data challenges
are a macro-view of the same
challenges we all face in our own
companies as we use data as
information to improve outcomes
The hardest problem to solve is
injecting Subject Matter Expertise into
a flexible data processing system that
can help us align SME perspectives,
and respond to changing needs with
agility
59
The Team!
60
Mingo Sanchez Keith Worfolk
Elizabeth Michel
Katie Everett
Kamal Maheshwari

More Related Content

What's hot

MDM is Still Failing 2020
MDM is Still Failing 2020MDM is Still Failing 2020
MDM is Still Failing 2020303Computing
 
Master Data Management's Place in the Data Governance Landscape
Master Data Management's Place in the Data Governance Landscape Master Data Management's Place in the Data Governance Landscape
Master Data Management's Place in the Data Governance Landscape CCG
 
Data Quality Strategy: A Step-by-Step Approach
Data Quality Strategy: A Step-by-Step ApproachData Quality Strategy: A Step-by-Step Approach
Data Quality Strategy: A Step-by-Step ApproachFindWhitePapers
 
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...DATAVERSITY
 
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...DATAVERSITY
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...Christopher Bradley
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBigDataExpo
 
Data quality and data profiling
Data quality and data profilingData quality and data profiling
Data quality and data profilingShailja Khurana
 
Slides: Data Governance Reality Check
Slides: Data Governance Reality CheckSlides: Data Governance Reality Check
Slides: Data Governance Reality CheckDATAVERSITY
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bijeffd00
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data GovernanceJohn Bao Vuu
 
Selecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachSelecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachChristopher Bradley
 
Business Semantics for Data Governance and Stewardship
Business Semantics for Data Governance and StewardshipBusiness Semantics for Data Governance and Stewardship
Business Semantics for Data Governance and StewardshipPieter De Leenheer
 
Sound Data Quality for CRM
Sound Data Quality for CRMSound Data Quality for CRM
Sound Data Quality for CRMDivya Malik
 
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...Denodo
 
Data Management Meets Human Management - Why Words Matter
Data Management Meets Human Management - Why Words MatterData Management Meets Human Management - Why Words Matter
Data Management Meets Human Management - Why Words MatterDATAVERSITY
 
MLOps - Getting Machine Learning Into Production
MLOps - Getting Machine Learning Into ProductionMLOps - Getting Machine Learning Into Production
MLOps - Getting Machine Learning Into ProductionMichael Pearce
 
Enterprise Data World Webinar: A Strategic Approach to Data Quality
Enterprise Data World Webinar: A Strategic Approach to Data Quality Enterprise Data World Webinar: A Strategic Approach to Data Quality
Enterprise Data World Webinar: A Strategic Approach to Data Quality DATAVERSITY
 
Competing IT Priorities - An Operating Model for Data Stewardship and Busines...
Competing IT Priorities - An Operating Model for Data Stewardship and Busines...Competing IT Priorities - An Operating Model for Data Stewardship and Busines...
Competing IT Priorities - An Operating Model for Data Stewardship and Busines...Jaleann M McClurg MPH, CSPO, CSM, DTM
 
Data quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeData quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeStefan Kühn
 

What's hot (20)

MDM is Still Failing 2020
MDM is Still Failing 2020MDM is Still Failing 2020
MDM is Still Failing 2020
 
Master Data Management's Place in the Data Governance Landscape
Master Data Management's Place in the Data Governance Landscape Master Data Management's Place in the Data Governance Landscape
Master Data Management's Place in the Data Governance Landscape
 
Data Quality Strategy: A Step-by-Step Approach
Data Quality Strategy: A Step-by-Step ApproachData Quality Strategy: A Step-by-Step Approach
Data Quality Strategy: A Step-by-Step Approach
 
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
 
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
 
Data quality and data profiling
Data quality and data profilingData quality and data profiling
Data quality and data profiling
 
Slides: Data Governance Reality Check
Slides: Data Governance Reality CheckSlides: Data Governance Reality Check
Slides: Data Governance Reality Check
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bi
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data Governance
 
Selecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachSelecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approach
 
Business Semantics for Data Governance and Stewardship
Business Semantics for Data Governance and StewardshipBusiness Semantics for Data Governance and Stewardship
Business Semantics for Data Governance and Stewardship
 
Sound Data Quality for CRM
Sound Data Quality for CRMSound Data Quality for CRM
Sound Data Quality for CRM
 
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
 
Data Management Meets Human Management - Why Words Matter
Data Management Meets Human Management - Why Words MatterData Management Meets Human Management - Why Words Matter
Data Management Meets Human Management - Why Words Matter
 
MLOps - Getting Machine Learning Into Production
MLOps - Getting Machine Learning Into ProductionMLOps - Getting Machine Learning Into Production
MLOps - Getting Machine Learning Into Production
 
Enterprise Data World Webinar: A Strategic Approach to Data Quality
Enterprise Data World Webinar: A Strategic Approach to Data Quality Enterprise Data World Webinar: A Strategic Approach to Data Quality
Enterprise Data World Webinar: A Strategic Approach to Data Quality
 
Competing IT Priorities - An Operating Model for Data Stewardship and Busines...
Competing IT Priorities - An Operating Model for Data Stewardship and Busines...Competing IT Priorities - An Operating Model for Data Stewardship and Busines...
Competing IT Priorities - An Operating Model for Data Stewardship and Busines...
 
Data quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeData quality - The True Big Data Challenge
Data quality - The True Big Data Challenge
 

Similar to COVID Data Challenges - Updated 2021

Impact of DDOD on Data Quality - White House 2016
Impact of DDOD on Data Quality -  White House 2016Impact of DDOD on Data Quality -  White House 2016
Impact of DDOD on Data Quality - White House 2016David Portnoy
 
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analyticsBill Jacobs
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Health
 
The Digital Procurement Era
The Digital Procurement EraThe Digital Procurement Era
The Digital Procurement EraTejari
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overviewjkvr
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareDale Sanders
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareHealth Catalyst
 
MedTech Quality in the Age of Big Data - Are you ready?
MedTech Quality in the Age of Big Data - Are you ready?MedTech Quality in the Age of Big Data - Are you ready?
MedTech Quality in the Age of Big Data - Are you ready?Naveen Agarwal
 
The Role of Community-Driven Data Curation for Enterprises
The Role of Community-Driven Data Curation for EnterprisesThe Role of Community-Driven Data Curation for Enterprises
The Role of Community-Driven Data Curation for EnterprisesEdward Curry
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBala Iyer
 
Microsoft: A Waking Giant in Healthcare Analytics and Big Data
Microsoft: A Waking Giant in Healthcare Analytics and Big DataMicrosoft: A Waking Giant in Healthcare Analytics and Big Data
Microsoft: A Waking Giant in Healthcare Analytics and Big DataDale Sanders
 
Analytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics SymposiumAnalytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics SymposiumCartegraph
 
Data centric security key to digital business success - ulf mattsson - bright...
Data centric security key to digital business success - ulf mattsson - bright...Data centric security key to digital business success - ulf mattsson - bright...
Data centric security key to digital business success - ulf mattsson - bright...Ulf Mattsson
 
Developing A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product DataDeveloping A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product DataFindWhitePapers
 
Microsoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big DataMicrosoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big DataHealth Catalyst
 
Rapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROIRapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROIHealth Catalyst
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallTrillium Software
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesT.S. Lim
 
Novetta Entity Analytics
Novetta Entity AnalyticsNovetta Entity Analytics
Novetta Entity AnalyticsNovetta
 

Similar to COVID Data Challenges - Updated 2021 (20)

Impact of DDOD on Data Quality - White House 2016
Impact of DDOD on Data Quality -  White House 2016Impact of DDOD on Data Quality -  White House 2016
Impact of DDOD on Data Quality - White House 2016
 
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_Health
 
The Digital Procurement Era
The Digital Procurement EraThe Digital Procurement Era
The Digital Procurement Era
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overview
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of Healthcare
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of Healthcare
 
MedTech Quality in the Age of Big Data - Are you ready?
MedTech Quality in the Age of Big Data - Are you ready?MedTech Quality in the Age of Big Data - Are you ready?
MedTech Quality in the Age of Big Data - Are you ready?
 
The Role of Community-Driven Data Curation for Enterprises
The Role of Community-Driven Data Curation for EnterprisesThe Role of Community-Driven Data Curation for Enterprises
The Role of Community-Driven Data Curation for Enterprises
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
 
Microsoft: A Waking Giant in Healthcare Analytics and Big Data
Microsoft: A Waking Giant in Healthcare Analytics and Big DataMicrosoft: A Waking Giant in Healthcare Analytics and Big Data
Microsoft: A Waking Giant in Healthcare Analytics and Big Data
 
Analytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics SymposiumAnalytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics Symposium
 
Data centric security key to digital business success - ulf mattsson - bright...
Data centric security key to digital business success - ulf mattsson - bright...Data centric security key to digital business success - ulf mattsson - bright...
Data centric security key to digital business success - ulf mattsson - bright...
 
Developing A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product DataDeveloping A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product Data
 
Microsoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big DataMicrosoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big Data
 
Big Data Analytics (1).ppt
Big Data Analytics (1).pptBig Data Analytics (1).ppt
Big Data Analytics (1).ppt
 
Rapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROIRapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROI
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They Fall
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in Businesses
 
Novetta Entity Analytics
Novetta Entity AnalyticsNovetta Entity Analytics
Novetta Entity Analytics
 

Recently uploaded

Leading transformational change: inner and outer skills
Leading transformational change: inner and outer skillsLeading transformational change: inner and outer skills
Leading transformational change: inner and outer skillsHelenBevan4
 
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service GoaRussian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goanarwatsonia7
 
Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949ps5894268
 
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call NowKukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call NowHyderabad Call Girls Services
 
Call Girls Kukatpally 7001305949 all area service COD available Any Time
Call Girls Kukatpally 7001305949 all area service COD available Any TimeCall Girls Kukatpally 7001305949 all area service COD available Any Time
Call Girls Kukatpally 7001305949 all area service COD available Any Timedelhimodelshub1
 
Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...ggsonu500
 
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...Russian Call Girls Amritsar
 
Russian Call Girls in Raipur 9873940964 Book Hot And Sexy Girls
Russian Call Girls in Raipur 9873940964 Book Hot And Sexy GirlsRussian Call Girls in Raipur 9873940964 Book Hot And Sexy Girls
Russian Call Girls in Raipur 9873940964 Book Hot And Sexy Girlsddev2574
 
College Call Girls Mumbai Alia 9910780858 Independent Escort Service Mumbai
College Call Girls Mumbai Alia 9910780858 Independent Escort Service MumbaiCollege Call Girls Mumbai Alia 9910780858 Independent Escort Service Mumbai
College Call Girls Mumbai Alia 9910780858 Independent Escort Service Mumbaisonalikaur4
 
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...ggsonu500
 
Russian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service availableRussian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service availablesandeepkumar69420
 
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...High Profile Call Girls Chandigarh Aarushi
 
Call Girl Gurgaon Saloni 9711199012 Independent Escort Service Gurgaon
Call Girl Gurgaon Saloni 9711199012 Independent Escort Service GurgaonCall Girl Gurgaon Saloni 9711199012 Independent Escort Service Gurgaon
Call Girl Gurgaon Saloni 9711199012 Independent Escort Service GurgaonCall Girls Service Gurgaon
 
Call Girls LB Nagar 7001305949 all area service COD available Any Time
Call Girls LB Nagar 7001305949 all area service COD available Any TimeCall Girls LB Nagar 7001305949 all area service COD available Any Time
Call Girls LB Nagar 7001305949 all area service COD available Any Timedelhimodelshub1
 
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabaddelhimodelshub1
 
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service HyderabadCall Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabaddelhimodelshub1
 
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...High Profile Call Girls Chandigarh Aarushi
 

Recently uploaded (20)

Leading transformational change: inner and outer skills
Leading transformational change: inner and outer skillsLeading transformational change: inner and outer skills
Leading transformational change: inner and outer skills
 
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service GoaRussian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
 
Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949
 
Call Girl Lucknow Gauri 🔝 8923113531 🔝 🎶 Independent Escort Service Lucknow
Call Girl Lucknow Gauri 🔝 8923113531  🔝 🎶 Independent Escort Service LucknowCall Girl Lucknow Gauri 🔝 8923113531  🔝 🎶 Independent Escort Service Lucknow
Call Girl Lucknow Gauri 🔝 8923113531 🔝 🎶 Independent Escort Service Lucknow
 
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call NowKukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
 
Call Girls Kukatpally 7001305949 all area service COD available Any Time
Call Girls Kukatpally 7001305949 all area service COD available Any TimeCall Girls Kukatpally 7001305949 all area service COD available Any Time
Call Girls Kukatpally 7001305949 all area service COD available Any Time
 
Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 68 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
 
VIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service Lucknow
VIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service LucknowVIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service Lucknow
VIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service Lucknow
 
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
 
Russian Call Girls in Raipur 9873940964 Book Hot And Sexy Girls
Russian Call Girls in Raipur 9873940964 Book Hot And Sexy GirlsRussian Call Girls in Raipur 9873940964 Book Hot And Sexy Girls
Russian Call Girls in Raipur 9873940964 Book Hot And Sexy Girls
 
College Call Girls Mumbai Alia 9910780858 Independent Escort Service Mumbai
College Call Girls Mumbai Alia 9910780858 Independent Escort Service MumbaiCollege Call Girls Mumbai Alia 9910780858 Independent Escort Service Mumbai
College Call Girls Mumbai Alia 9910780858 Independent Escort Service Mumbai
 
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
 
Russian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service availableRussian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service available
 
Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...
Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...
Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...
 
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
 
Call Girl Gurgaon Saloni 9711199012 Independent Escort Service Gurgaon
Call Girl Gurgaon Saloni 9711199012 Independent Escort Service GurgaonCall Girl Gurgaon Saloni 9711199012 Independent Escort Service Gurgaon
Call Girl Gurgaon Saloni 9711199012 Independent Escort Service Gurgaon
 
Call Girls LB Nagar 7001305949 all area service COD available Any Time
Call Girls LB Nagar 7001305949 all area service COD available Any TimeCall Girls LB Nagar 7001305949 all area service COD available Any Time
Call Girls LB Nagar 7001305949 all area service COD available Any Time
 
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
 
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service HyderabadCall Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
 
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
 

COVID Data Challenges - Updated 2021

  • 1. COVID-19 - Building Trust in Data To Save Lives
  • 2. Paul Balas 25+ Years Experience Leading Digital Transformations Multiple MDM Implementations, Data Governance, and Data Warehouse Initiatives q Digital Transformation Consultant q IT Executive q Enterprise Architect q Developer paul.balas@revisioninc.com
  • 3. THE REVISION ANALYTICS DIFFERENCE Founded in 1998, REVISION is a management consulting firm that marries domain and technology expertise to achieve amazing results for our clients through digital transformations that deliver value. § REVISION Senior Consultants Depth Of Experience – “We’ve Walked The Walk.” § 20+ Years C-level Advisory Roles In Performance Excellence § 20+ Years IT Operations & Security Experience § Airport and Transportation Experience § Culture & Operations State, Local & Federal § DoD Secret & Top Secret Cleared Staff Data Strategy Data Ops Data Governance MDM Advanced Analytics Assessment & Implementation
  • 4. 4 REVISION – LEADING IN RESPONSE TO THE COVID PANDEMIC We built a system to enable John F. Kennedy International (JFK), Newark Liberty International (EWR) and LaGuardia Airport (LGA) to aid in response to the COVID pandemic “… The Aviation Strategy Unit, created a custom dashboard that estimates and predicts how many passengers will arrive from states with travel advisories. The new tool combines API data and information within the Port Authority’s data warehouse with updates from the COVID Tracking Project…”
  • 5. This is a presentation to uncover the systemic data challenges in our Government’s response to The COVID Pandemic And a proposal to improve data quality and build trust
  • 6. Agenda 6 What has happened since we started this journey? Why is the Data Wrong? Why the New HHS System Isn’t Working How To Fix It - a POC
  • 8. Dr. Deborah Birx said “there is nothing from the CDC that I can trust” in a White House coronavirus task force meeting 8 May 10, 2020
  • 9. TRUST I’m going to show you four news articles that speak to four different types of challenges with achieving good data governance and delivering trusted data. Do you notice any governance challenges we encounter in our own companies? 9
  • 10. 10 Medical examiner offices had struggled to keep up with a recent spike in COVID-19 deaths, with more than 2,300 deaths reported statewide in just the past two weeks, according to the Florida Department of Health.
  • 11. 11 "In reviewing the analysis obtained by NPR, Panchadsaram says the local and hospital-level data HHS is collecting would be very useful to researchers and health leaders. "That stuff isn't easy to find at a national level," he says. "There's no one place [publicly] you can go to get all that data." October 30, 2020
  • 12. 12 “During a news conference Saturday morning, Perna explained that he had not taken into consideration the time it would take for completed vaccine to go through the full Food and Drug Administration quality control process, which can take 48 hours.” Dec. 19, 2020
  • 13. 13 ”I saw the President presenting graphs that I never made. I know that someone out there or someone inside was creating a parallel set of data and graphics that were shown to the President.” January 25, 2021
  • 14. Increasing volumes and process inefficiencies Data silos and protectionism Poor data governance and unclear definitions Dueling dashboards 14 Common Challenges in Data Governance
  • 15. 1: What are your top Data Governance Challenges? 15 a) Increasing Volumes b) Data Silos c) Unclear Definitions d) Dueling Dashboards e) Lack of Executive Alignment
  • 16. Why is COVID-19 Data Wrong? No published standards on how to collect and present the information The CDC had an aging system which wasn’t agile to change The virus is spreading quickly – huge volumes of new data to process We weren’t capturing needed data 16
  • 17. How Do We Need to Use The Data? PPE, Hospital Capacity, Testing Supplies Ensure we have enough doctors, nurses, and other healthcare professionals Issuing protective orders to stop the spread of the virus (Shelter-in-place, Social Distancing, Shuttering Business) 17
  • 18. We’ve been asking ourselves, ”Could we have saved more lives?” Without trusted data, no decision will be driven with conviction 18
  • 19. 19
  • 20. The Data is Inconsistent - States Are Reporting to Differing Standards https://www.beckershospitalreview.com/data-analytics/states-are-reporting-inconsistent-incomplete-covid-19-data-analysis- finds.html#:~:text=The%20federal%20government%20has%20left,former%20CDC%20Director%20Tom%20Friedan. 20
  • 21. The Data Isn’t Standardized - How Many Tests Have Been Given? 21
  • 22. 22 There are literally 1,000’s of examples of incorrectly reported data
  • 23. The HHS System isn’t solving the problem, and It’s a Big Problem CDC has 1,200 Users and about 950 State CDC Partners There are over 6,000 Hospitals in the US About 2,000 Hospitals provide Covid data directly to HH Protect TeleTracking provides data from about 1,100 additional Hospitals 23
  • 24. HHS Challenges ● “Use of data across programs… remains a challenge.” ● “Data are often housed in … data silos” 24
  • 25. 25 NNDSS is focused on speed, data format standardization, and validation They need to focus on improving Data Governance and Collaboration
  • 26. 26 TeleTracking, is just a data entry app that is driven by procedural rules The main problem in achieving Data Quality is PEOPLE
  • 27. 27 Definitions translate into procedural business logic to ensure quality data. But does this approach work? The Fallacy of Standardized Data Entry Solutions
  • 28. 28 If you’re one of those managers who believes that a standardized data entry screen fixes your organizations data quality problem... You haven’t done much data analysis (go speak with your data team)
  • 29. A Novel Way to Solve The Data Governance Challenge 29
  • 30. Imagine This is Your Problem to Solve You are the CIO of the CDC tasked to improve our Nation’s ability to better manage this and the next pandemic through the use of data. Your first goal is to understand the key issues in the current system (“as-is”), and develop a roadmap to address them. The stakes are high. 30
  • 31. Architectural Principles Build trust in the data Identify which issues to fix first The system should be agile to change Efficient collaboration 31
  • 32. Two Paths Standardize data entry systems Identify and address systemic governance challenges 32
  • 33. Identify and Address Systemic Governance Challenges Master Data Management Taxonomy Management Data Quality Cloud Data Integration Data Transformation Orchestration Natural Language / Feature Extraction Data Lake Compute and Storage
  • 34. Understand Systemic Data Challenges You choose to look at issues around ‘Testing’ as you believe you can get immediate benefits for public health if you can build confidence in the test data What problems are states having in processing Test Data? Is Test Data being reported consistently, accurately? 34
  • 35. Compare 3 Websites Standards on Test Data 35
  • 37. Three Weeks Later 37 Integrate and process the self-reported data quality issues from your three test sites Create data quality taxonomies to report on the issue types Extract entities from the quality issues and classify them using our taxonomies
  • 38.
  • 39. Virtual Standardization Meetings You use your findings to review issues and publish recommendations to conform to new standards for each of the data providers. You use the platform to manage quality on an on-going basis. You provide the DataOps platform to data providers to self-manage their quality issues. 39
  • 40. Can We Do More? You can now see the systemic data quality issues and communicate with stakeholders effectively to get alignment. Next, classify news by Public Influencers, Events, and Locations because you want to see if what our leaders say has an impact on testing and infection rates. 40
  • 41. Public Influencers - New Jersey
  • 43. 2. Do you have confidence in the COVID data? 43 a. High Confidence b. Neutral c. Low Confidence
  • 44. How Was It Built? 44
  • 45. Google Cloud Platform InfoWorks Logical workflow Internet data sources Data orchestration Tableau TAMR Big Query VM instances Google cloud storage Natural language Python Twitter API News API State Health Department Web Pages JHU Github
  • 46. I Had To Extract Meaning From Text
  • 47.
  • 48. Data Experts - Spend More Time Analyzing/Strategizing Before: Experts spend too much Time manually fixing data Today: ML can do 80% of data mastering lift... …. Enabling experts to put final touches on the last 20%. 48
  • 49. The TAMR Agile Approach to Data Mastering Mastered data OLD WAY Rules-based Source data Mastered data Time Quality Months to years 60%–80% Accuracy Modify rules, create exceptions Months 1–4 Months 5–12+ Iterate Machine-driven NEW WAY Days to weeks 90%+ accuracy Source data Weeks 1–12 Iterate with human- guided machine learning Identify developers Get business input Write rules Review with business Unified data Rules
  • 50. Taxonomies: Before vs. After Tamr TAMR enabled us to create standardized taxonomies that can be managed by a networked group of hospitals, labs, health officials. These taxonomies are critical to having good quality and conformed data across a widely distributed data network. There is an efficient mechanism for building consensus across experts at the same time as fixing the data. There is no solution like it in the market.
  • 51. MDM - People Mastering
  • 53. Mastering People: 530K to 9K in a Few Days Using TAMR, I was able to take a corpus of over 500k entity records identified by Google Natural Language across 60,000 news articles, hundreds of web pages, thousands of tweets in a few weeks, reducing it to a about 10k Golden Master People Records I estimate the system can be maintained in one to two hours a week at scale, decreasing to minutes a week as the model learns 53
  • 54. Events - Before and After TAMR
  • 56. 7 TAMR Projects Built in a Few Weeks
  • 57. Event Classification Events are hard to identify in text I first created a taxonomy of events tagged by Google Natural Language to limit those of interest. I loaded those into TAMR And then classified them in a few hours 57
  • 58. 3: Which of these capabilities are you most interested in learning more about? 58 Agile Data Governance Google Cloud Platform Google Natural Language Processing API TAMR InfoWorks DataOps
  • 59. Conclusion The COVID Pandemic data challenges are a macro-view of the same challenges we all face in our own companies as we use data as information to improve outcomes The hardest problem to solve is injecting Subject Matter Expertise into a flexible data processing system that can help us align SME perspectives, and respond to changing needs with agility 59
  • 60. The Team! 60 Mingo Sanchez Keith Worfolk Elizabeth Michel Katie Everett Kamal Maheshwari