SlideShare a Scribd company logo
1 of 43
Graphs Opening Medical Care
Information
@davefauth
www.intelliwareness.org
About Me
•
•
•
•

My Blog: http://www.intelliwareness.org
Find me on Twitter: @davefauth
Email me: dsfauth@gmail.com
GitHub: http://github.com/davidfauth

2
Not talking about this….
Or this….
But we want to talk about this:
And this:

Ryan Weald – isurfsoftware.com
I’ll try not to do this…
Or this….
Where we are today
Healthcare Data
• Recommend watching Fred Trotter speak at
GraphConnect – SF
• Moving from no data -> bad data -> better
data -> good data
• Claims Data
– Hard to accurately describe what a doctor is
doing and how they are getting paid without
claims data
– Limited and not a good data set by any standard
Examples of Bad Data
• Not enough data – More transparency
without having to FOIA
• State level data is hard to get
Better Data Sets
• DocGraph Data
– One of the “best” available
– “Best” does not mean “good”

• DocGraph Rx
– Prescribing patterns for Medicare Part D patients

• NPPES
• NUCC
DocGraph Dataset
• DocGraph by the numbers
– Directed graph
– Average total degree 52.8
– 940,492 providers (graph nodes/vertices)
– 49,685,810 shared edges
DocGraph Data
Doctor Detail (docNPI.com)
Doctor Detail
NPPES
•
•
•
•

National Plan and Provider Enumeration System
Source of NPI (National Provider Identifier)
No cost download 
Information is entered and updated by provider
Data quality is good to poor 

• CSV file with 314 columns 
NUCC
• National Uniform Claim Committee
– Healthcare Provider Taxonomy
– No cost download 

• CSV file with 5 columns and 830 rows
– Link taxonomy to NPPES reported taxonomy
DocGraph Data
Nodes
Organizations
Specialties
Providers
Locations
CountiesZip
Census

Relationships
* Organizations -[:PARENT_OF] – Providers -[:SPECIALTY]Specialties
* Lcations-[:LOCATED_FOR]-Providers
* Providers -[:REFERRED]-Providers
* Counties -[:INCOME_IN]- CountiesZip
* Locations – [:LOCATED_IN]-CountiesZip
DocGraph Data

Provider

refers
DocGraph Data
Specialty
Specializes_in
Provider

refers
DocGraph Data
Specialty
Specializes_in
Parent_Of

Provider

Parent
Org
Location_In

Location

refers
DocGraph Data
Specialty
Specializes_in
Parent_Of

Provider

Parent
Org
Location_In

Location

refers
DocGraph Data
Specialty
Specializes_in
Parent_Of

refers

Provider

Income

Parent
Org

Income_In
Location_For
Located_In
Location

Counties
Zip
DocGraph RX Data
• Reinforcing Jonathan Freeman’s talk on
Hadoop and Neo4J
Time for Analysis
Fraud Referrals
April 2013 - The owner and another
senior executive of Sacred Heart
Hospital and four physicians
affiliated with the west side facility
were arrested today for allegedly
conspiring to pay and receive illegal
kickbacks, including more than
$225,000 in cash, along with other
forms of payment, in exchange for
the referral of patients insured by
Medicare and Medicaid to the
hospital, announced U.S. Attorney
for the Northern District of Illinois
Gary S. Shapiro.
Hadoop Page Rank
DocGraph RX Data
• Originally obtained by ProPublica
• Prescribing pattern for all physicians for
Medicare Part D – 2011
• Largest public released prescribing database
• 2 sets of data - 30M edges each
• Related to business name and NDC-9 code
– NDC 9 code allows for aggregation of drugs
DocGraph RX Data
DocGraph RX Data
DocGraph RX Data
DocGraphRx Data
Drugs
Specialty
prescribes
Specializes_in
Parent_Of

refers

Provider

Income

Parent
Org

Income_In
Location_For
Located_In
Location

Counties
Zip
DocGraph RX Data
• http://whnt.com/2013/03/27/follow-updecatur-family-claims-prescription-drugsfrom-dr-shelinder-aggarwal-killed-their-son/
• http://www.palmbeachpost.com/news/news/
state-regional/doctors-booted-fom-medicaidfor-massive-oxy-doses-/nPpMf/
DocGraph RX Data
• Back to “bad data”
• http://www.albme.org/actions.html
Combine additional datasets
• Medical data
– Doctor referral data
– Medicare doctor prescription practices
– “Dollars for Doctors” – Drug company promotional
payments

• Census Data
– Income data
– Poverty data
Recommendation Engine?
• Build a graph model of the data
• Build a recommender model from the graph
model
• Graphs can be visualized, explained, discussed
and debugged collaboratively
GraphConnect NYC

More Related Content

What's hot

CDISC2RDF overview with examples
CDISC2RDF overview with examplesCDISC2RDF overview with examples
CDISC2RDF overview with examples
Kerstin Forsberg
 
2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search
2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search
2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search
Crossref
 
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Alasdair Gray
 
Research aarkstoreenterprise disease and therapy review crohn's disease
Research aarkstoreenterprise   disease and therapy review  crohn's diseaseResearch aarkstoreenterprise   disease and therapy review  crohn's disease
Research aarkstoreenterprise disease and therapy review crohn's disease
Neel Terde
 
An Identifier Scheme for the Digitising Scotland Project
An Identifier Scheme for the Digitising Scotland ProjectAn Identifier Scheme for the Digitising Scotland Project
An Identifier Scheme for the Digitising Scotland Project
Alasdair Gray
 
CrossRef Annual Meeting 2012 ORCID Laure Haak
CrossRef Annual Meeting 2012 ORCID Laure HaakCrossRef Annual Meeting 2012 ORCID Laure Haak
CrossRef Annual Meeting 2012 ORCID Laure Haak
Crossref
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

What's hot (20)

CDISC2RDF overview with examples
CDISC2RDF overview with examplesCDISC2RDF overview with examples
CDISC2RDF overview with examples
 
2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search
2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search
2014 CrossRef Annual Meeting Flash Update: CrossRef Metadata Search
 
Crossref webinar - Maintaining your metadata - latest
Crossref webinar - Maintaining your metadata - latestCrossref webinar - Maintaining your metadata - latest
Crossref webinar - Maintaining your metadata - latest
 
CHORUS: A Collaborative Approach to Public Access
CHORUS: A Collaborative Approach to Public AccessCHORUS: A Collaborative Approach to Public Access
CHORUS: A Collaborative Approach to Public Access
 
Your Work is Distinctive, What about Your Name? Japan Library Fair 2014
Your Work is Distinctive, What about Your Name? Japan Library Fair 2014Your Work is Distinctive, What about Your Name? Japan Library Fair 2014
Your Work is Distinctive, What about Your Name? Japan Library Fair 2014
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Supporting Dataset Descriptions in the Life Sciences
Supporting Dataset Descriptions in the Life SciencesSupporting Dataset Descriptions in the Life Sciences
Supporting Dataset Descriptions in the Life Sciences
 
Your Work is Distinctive, What about Your Name?
Your Work is Distinctive, What about Your Name?Your Work is Distinctive, What about Your Name?
Your Work is Distinctive, What about Your Name?
 
Doctor mailing database
Doctor mailing databaseDoctor mailing database
Doctor mailing database
 
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
Research aarkstoreenterprise disease and therapy review crohn's disease
Research aarkstoreenterprise   disease and therapy review  crohn's diseaseResearch aarkstoreenterprise   disease and therapy review  crohn's disease
Research aarkstoreenterprise disease and therapy review crohn's disease
 
Jisc UK ORCID Support: onboarding webinar
Jisc UK ORCID Support: onboarding webinarJisc UK ORCID Support: onboarding webinar
Jisc UK ORCID Support: onboarding webinar
 
An Identifier Scheme for the Digitising Scotland Project
An Identifier Scheme for the Digitising Scotland ProjectAn Identifier Scheme for the Digitising Scotland Project
An Identifier Scheme for the Digitising Scotland Project
 
Drug and Vaccine Discovery: Knowledge Graph + Apache Spark
Drug and Vaccine Discovery: Knowledge Graph + Apache SparkDrug and Vaccine Discovery: Knowledge Graph + Apache Spark
Drug and Vaccine Discovery: Knowledge Graph + Apache Spark
 
2014 CrossRef Workshops: Support Update and Multiple Resolution Overview
2014 CrossRef Workshops: Support Update and Multiple Resolution Overview2014 CrossRef Workshops: Support Update and Multiple Resolution Overview
2014 CrossRef Workshops: Support Update and Multiple Resolution Overview
 
CrossRef Annual Meeting 2012 ORCID Laure Haak
CrossRef Annual Meeting 2012 ORCID Laure HaakCrossRef Annual Meeting 2012 ORCID Laure Haak
CrossRef Annual Meeting 2012 ORCID Laure Haak
 
ORCID: An Overview - Alice Meadows
ORCID: An Overview - Alice MeadowsORCID: An Overview - Alice Meadows
ORCID: An Overview - Alice Meadows
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Linked Vitals-20141112-v1a
Linked Vitals-20141112-v1aLinked Vitals-20141112-v1a
Linked Vitals-20141112-v1a
 

Viewers also liked (7)

Портфолио
ПортфолиоПортфолио
Портфолио
 
Project Manager
Project ManagerProject Manager
Project Manager
 
Presentations Tips
Presentations  TipsPresentations  Tips
Presentations Tips
 
Fec graph connect_2012
Fec graph connect_2012Fec graph connect_2012
Fec graph connect_2012
 
Analyzing FEC Data with NEO4J
Analyzing FEC Data with NEO4JAnalyzing FEC Data with NEO4J
Analyzing FEC Data with NEO4J
 
Fl@World™ overview presentation
Fl@World™ overview presentationFl@World™ overview presentation
Fl@World™ overview presentation
 
Hype vs. Reality: The AI Explainer
Hype vs. Reality: The AI ExplainerHype vs. Reality: The AI Explainer
Hype vs. Reality: The AI Explainer
 

Similar to GraphConnect NYC

Smart big data's new role in optimizing clinical 4
Smart big data's new role in optimizing clinical 4Smart big data's new role in optimizing clinical 4
Smart big data's new role in optimizing clinical 4
sapenov
 

Similar to GraphConnect NYC (20)

Rx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichting
Rx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichtingRx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichting
Rx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichting
 
Rx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichting
Rx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichtingRx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichting
Rx16 pdmp wed_330_1_hoppe_2sun_3baumgartner-leichting
 
Data Scraping from Doctors Directories with Email List
Data Scraping from Doctors Directories with Email List Data Scraping from Doctors Directories with Email List
Data Scraping from Doctors Directories with Email List
 
Flextracker dal2013
Flextracker   dal2013Flextracker   dal2013
Flextracker dal2013
 
Smart big data's new role in optimizing clinical 4
Smart big data's new role in optimizing clinical 4Smart big data's new role in optimizing clinical 4
Smart big data's new role in optimizing clinical 4
 
Inside Outcomes - Managing Data
Inside Outcomes - Managing DataInside Outcomes - Managing Data
Inside Outcomes - Managing Data
 
Data Science presentation for elementary school students
Data Science presentation for elementary school studentsData Science presentation for elementary school students
Data Science presentation for elementary school students
 
Big Data: How does it fit in your data strategy?
Big Data: How does it fit in your data strategy?Big Data: How does it fit in your data strategy?
Big Data: How does it fit in your data strategy?
 
Data analytics in Healthcare
Data analytics in HealthcareData analytics in Healthcare
Data analytics in Healthcare
 
The Innovator’s Journey: Alternative Asset Managers
The Innovator’s Journey: Alternative Asset ManagersThe Innovator’s Journey: Alternative Asset Managers
The Innovator’s Journey: Alternative Asset Managers
 
Dev days 2017 referrals (brian postlethwaite)
Dev days 2017 referrals (brian postlethwaite)Dev days 2017 referrals (brian postlethwaite)
Dev days 2017 referrals (brian postlethwaite)
 
Monitoring, data management, and impact assessment in Africa RISING
Monitoring, data management, and impact assessment in Africa RISINGMonitoring, data management, and impact assessment in Africa RISING
Monitoring, data management, and impact assessment in Africa RISING
 
The Innovator’s Journey: Asset Manager Insights
The Innovator’s Journey: Asset Manager InsightsThe Innovator’s Journey: Asset Manager Insights
The Innovator’s Journey: Asset Manager Insights
 
Data For Good - Regina - Geoff Zakaib (DfG YYC) Presentation
Data For Good - Regina - Geoff Zakaib (DfG YYC) PresentationData For Good - Regina - Geoff Zakaib (DfG YYC) Presentation
Data For Good - Regina - Geoff Zakaib (DfG YYC) Presentation
 
Building a Data Warehouse at Clover
Building a Data Warehouse at CloverBuilding a Data Warehouse at Clover
Building a Data Warehouse at Clover
 
Statistics — Your Friend, Not Your Foe
Statistics — Your Friend, Not Your Foe Statistics — Your Friend, Not Your Foe
Statistics — Your Friend, Not Your Foe
 
Big Data - Harisfazillah Jamel - Startup and Developer 4th Meetup 5th Novembe...
Big Data - Harisfazillah Jamel - Startup and Developer 4th Meetup 5th Novembe...Big Data - Harisfazillah Jamel - Startup and Developer 4th Meetup 5th Novembe...
Big Data - Harisfazillah Jamel - Startup and Developer 4th Meetup 5th Novembe...
 
Go Code Colorado and The Data Liaison
Go Code Colorado and The Data LiaisonGo Code Colorado and The Data Liaison
Go Code Colorado and The Data Liaison
 
Building a Data Warehouse at Clover (PDF)
Building a Data Warehouse at Clover (PDF)Building a Data Warehouse at Clover (PDF)
Building a Data Warehouse at Clover (PDF)
 
Using Data to Support Partner Coordination
Using Data to Support Partner CoordinationUsing Data to Support Partner Coordination
Using Data to Support Partner Coordination
 

Recently uploaded

Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
UK Journal
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 

Recently uploaded (20)

Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4j
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 

GraphConnect NYC