Comparative Analysis of Association Rule Mining, Crowdsourcing, and NDF-RT Knowledge Bases for Problem-Medication Pair Generation

Allison McCoy
Allison McCoyAssistant Professor at Tulane University School of Public Health and Tropical Medicine
1. Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsyvlania
2. National Center for Cognitive Decision Making in Healthcare
and School of Biomedical Informatics, University of Texas Health Science Center at Houston
Karthik Sethuraman,1 Allison B. McCoy, PhD2, Dean F. Sittig, PhD2
Comparative Analysis of Association Rule Mining, Crowdsourcing,
and NDF-RT Knowledge Bases for Problem-Medication Pair Generation
Data and Knowledge Bases
We sorted each KB by appropriateness measures
and examined the top 5,000 pairs per KB to
eliminate spurious problem relations. We assessed
similarities and differences between three KBs by:
i) constructing contingency tables by KB with
presence/absence in two remaining KBs as
margin criteria to determine degree of overlap
(Fisher's test)
ii) measuring Spearman’s rank correlation of
overlapping problem-medication pairs across KBs
to check if KBs ranks pairs in the same way
indicating similar KB construction
iii) individual assessing the top fifty problem-
medication pairs per KB
Objective
Conclusions
Acknowledgements
Association rule mining and crowdsourcing are
remarkably similar in pair generation.
The NDF-RT doesn't overlap with either.
Significant Spearman's σ and pair overlap
between association rule mining and
crowdsourcing indicate similar operation.
Similar relation formation may be because both
depend on the local EHR dataset.
NDF-RT presents a distinct, expert-generated,
non-dataset relation source that can be
exploited for automatic summarization.
Generally, expert-generated (or reference)
information may provide an underexploited
source of relations for auto-summarization.
This project was supported by Grant No. 10510592 for Patient-
Centered Cognitive Support under the Strategic Health IT
Advanced Research Projects (SHARP) from the Office of the
National Coordinator for Health Information Technology and by
the CPRIT Summer Undergraduate Cancer Research
Experience.
Please contact the corresponding author at:
amccoy1@tulane.edu
Fisher's Test and Spearman's σ
Fisher's test:
Overlap between association rule mining and
crowdsourcing was very significant (p<0.001)
while overlap between either of the two and
NDF-RT was not (p>0.10).
Spearman's σ:
Association rule mining and crowdsourcing
overlapping pairs displayed significant monotonic
correlation (for ranks) with σ of 0.315 (p<0.001),
but neither displayed correlation with NDF-RT.
Overlapping Pairs between KBs
Association Rule Mining and crowdsourcing have
more overlapping pairs (771) than either with the
NDF-RT (145, 153).
Analysis
Data: Electronic health record (EHR) data from a
large, multi-specialty, ambulatory academic practice
that provides medical care for all ages in Houston.
One year study period from June 1, 2010 to May 30,
2011, including 418,221 medications and 1,222,308
problems logged for 53,108 patients.
Knowledge Base Generation: We independently
applied association rule mining and crowdsourcing to
the dataset to generate problem-medication
knowledge bases (KBs) and mapped local EHR
terminology to NDF-RT terminology for 3rd KB.
Top Ten Problem-
Medication Pairs
Association rule mining
forms very sensitive but
low frequency pairs. It
finds very specific but
more rare relationships
like Scabies to
Permethrin.
NDF-RT gives high-
frequency pairs – all
problems listed are
hypertension.
Crowdsourcing provides
a mixture of association
rule mining and NDF-RT
relations. 3/10 problems
are hypertension but
more rare relations like
Allergic Rhinitis and
Fluticasone Propionate
are also present.
Generally, to better compile and organize the
growing patient information confronting
healthcare providers through automatic
summarization.
Specifically, to evaluate three different
approaches, association rule mining,
crowdsourcing, and the National Drug File-
Reference Terminology (NDF-RT), to problem-
medication pair generation – a key auto-
summarization task for clinical datasets.
Association
Rule Mining
Data mining
technique that looks
for relationships as
co-occurrence of
pairs in a database,
here logged
medication-problem
information
Crowdsourcing
Using information
gathered from a
group or crowd, here
the medications and
problems entered by
a set of identified
physicians
NDF-RT
Curated reference of
medications and
associated
information,
including related
problems/diseases.
NDF-RT
4771
Crowdsourcing
4153
Association
Rule Mining
4145
84
69
76 702
Association Rule Mining Crowdsourcing NDF-RT
Rank Problem Medication Problem Medication Problem Medication
1 Scabies Permethrin Hypo-
thyroidism
Levothyroxine
Sodium
Hypertension Cardizem SR
2 Bacterial
Vaginosis
Metronidazole Hyperlipid-
emia
Simvastatin Hypertension Cardene
3 Motor Neuron
Disease
Rilutek Hypertension Lisinopril Hypertension Reserpine
4 Vaginal
Candidiasis
Terconazole Bacterial
Vaginosis
Metronidazole Hypertension Triamterene
5 Pseudo-
monas
Amikacin
Sulfate
Hyperlipidimia Lipitor Hypertension Innopran XL
6 Type 1
Diabetes
Glucagon Hypertension Hydrochloro-
thiazide
Hypertension Bendroflu-
methiazide
7 Hypothyroid-
ism
Levothyroxine
Sodium
Hypertension Amlodipine
Besylate
Hypertension Isradipine
8 Mitochondrial
Metabolism
Disorder
Levocarnitine Allergic
Rhinitis
Fluticasone
Propionate
Hypertension Eplerenone
9 Tinea Capitis Griseofulvin
(Microsize)
Esophageal
Reflux
Nexium Hypertension L-Tyrosine
10 Congenital
Adrenal
Hyperplasia
Solu-Cortef Type 1
Diabetes
Metformin HCI Hypertension Diltiazem HCl
CR
1 of 1

Recommended

IUPHAR/BPS Guide to Pharmacology by
IUPHAR/BPS Guide to PharmacologyIUPHAR/BPS Guide to Pharmacology
IUPHAR/BPS Guide to PharmacologyGuide to PHARMACOLOGY
387 views1 slide
Analysing curated protein targets: Partitioning the drugged and the druggable by
Analysing curated protein targets: Partitioning the drugged and the druggable Analysing curated protein targets: Partitioning the drugged and the druggable
Analysing curated protein targets: Partitioning the drugged and the druggable Chris Southan
358 views1 slide
IUPHAR/BPS Guide to Pharmacology in 2018 by
IUPHAR/BPS Guide to Pharmacology in 2018IUPHAR/BPS Guide to Pharmacology in 2018
IUPHAR/BPS Guide to Pharmacology in 2018Guide to PHARMACOLOGY
555 views1 slide
BLAST Search tool by
BLAST Search toolBLAST Search tool
BLAST Search toolRajpal Choudhary
2.1K views13 slides
PubChem Database by
PubChem DatabasePubChem Database
PubChem DatabaseLucia Ravi
2.1K views7 slides

More Related Content

What's hot

GroupA_BSIM0007 by
GroupA_BSIM0007GroupA_BSIM0007
GroupA_BSIM0007hfcheng7
204 views30 slides
Controlled vocabularies for medical and health research by
Controlled vocabularies for medical and health researchControlled vocabularies for medical and health research
Controlled vocabularies for medical and health researchARDC
1.2K views15 slides
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates by
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updatesThe IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updatesGuide to PHARMACOLOGY
107 views1 slide
Fundamentals of Health Insurance Big Data Management: Master Patient Index De... by
Fundamentals of Health Insurance Big Data Management: Master Patient Index De...Fundamentals of Health Insurance Big Data Management: Master Patient Index De...
Fundamentals of Health Insurance Big Data Management: Master Patient Index De...Salus One Ed
180 views27 slides

Viewers also liked

Data mining fp growth by
Data mining fp growthData mining fp growth
Data mining fp growthShihab Rahman
65.2K views23 slides
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD... by
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...shibbirtanvin
660 views18 slides
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P... by
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...shibbirtanvin
862 views23 slides
Machine Learning and Data Mining: 06 Clustering: Partitioning by
Machine Learning and Data Mining: 06 Clustering: PartitioningMachine Learning and Data Mining: 06 Clustering: Partitioning
Machine Learning and Data Mining: 06 Clustering: PartitioningPier Luca Lanzi
5.7K views48 slides
Association Rule Mining in Data Mining by
Association Rule Mining in Data Mining Association Rule Mining in Data Mining
Association Rule Mining in Data Mining Ayesha Ali
631 views16 slides
3.1 mining frequent patterns with association rules-mca4 by
3.1 mining frequent patterns with association rules-mca43.1 mining frequent patterns with association rules-mca4
3.1 mining frequent patterns with association rules-mca4Azad public school
501 views15 slides

Viewers also liked(20)

Data mining fp growth by Shihab Rahman
Data mining fp growthData mining fp growth
Data mining fp growth
Shihab Rahman65.2K views
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD... by shibbirtanvin
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
shibbirtanvin660 views
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P... by shibbirtanvin
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...
shibbirtanvin862 views
Machine Learning and Data Mining: 06 Clustering: Partitioning by Pier Luca Lanzi
Machine Learning and Data Mining: 06 Clustering: PartitioningMachine Learning and Data Mining: 06 Clustering: Partitioning
Machine Learning and Data Mining: 06 Clustering: Partitioning
Pier Luca Lanzi5.7K views
Association Rule Mining in Data Mining by Ayesha Ali
Association Rule Mining in Data Mining Association Rule Mining in Data Mining
Association Rule Mining in Data Mining
Ayesha Ali631 views
3.1 mining frequent patterns with association rules-mca4 by Azad public school
3.1 mining frequent patterns with association rules-mca43.1 mining frequent patterns with association rules-mca4
3.1 mining frequent patterns with association rules-mca4
Machine Learning and Data Mining: 04 Association Rule Mining by Pier Luca Lanzi
Machine Learning and Data Mining: 04 Association Rule MiningMachine Learning and Data Mining: 04 Association Rule Mining
Machine Learning and Data Mining: 04 Association Rule Mining
Pier Luca Lanzi7.8K views
Weka project - Classification & Association Rule Generation by rsathishwaran
Weka project - Classification & Association Rule GenerationWeka project - Classification & Association Rule Generation
Weka project - Classification & Association Rule Generation
rsathishwaran21.9K views
The comparative study of apriori and FP-growth algorithm by deepti92pawar
The comparative study of apriori and FP-growth algorithmThe comparative study of apriori and FP-growth algorithm
The comparative study of apriori and FP-growth algorithm
deepti92pawar58.5K views
Association rule mining by Acad
Association rule miningAssociation rule mining
Association rule mining
Acad41.9K views
Association Rule Mining with R by Yanchang Zhao
Association Rule Mining with RAssociation Rule Mining with R
Association Rule Mining with R
Yanchang Zhao75.8K views
Apriori and Eclat algorithm in Association Rule Mining by Wan Aezwani Wab
Apriori and Eclat algorithm in Association Rule MiningApriori and Eclat algorithm in Association Rule Mining
Apriori and Eclat algorithm in Association Rule Mining
Wan Aezwani Wab44.2K views
Context Aware Services in Connected Homes Using OSGi Service Infrastructure -... by mfrancis
Context Aware Services in Connected Homes Using OSGi Service Infrastructure -...Context Aware Services in Connected Homes Using OSGi Service Infrastructure -...
Context Aware Services in Connected Homes Using OSGi Service Infrastructure -...
mfrancis570 views
PR Strategy (GIC) by Freda Luo
PR Strategy (GIC)PR Strategy (GIC)
PR Strategy (GIC)
Freda Luo2.5K views
Teach Smarter, Not Harder by David Freeman
Teach Smarter, Not HarderTeach Smarter, Not Harder
Teach Smarter, Not Harder
David Freeman712 views
An Evaluation of the Comprehensiveness of the BSc Honours in Psychology Degre... by iosrjce
An Evaluation of the Comprehensiveness of the BSc Honours in Psychology Degre...An Evaluation of the Comprehensiveness of the BSc Honours in Psychology Degre...
An Evaluation of the Comprehensiveness of the BSc Honours in Psychology Degre...
iosrjce306 views

Similar to Comparative Analysis of Association Rule Mining, Crowdsourcing, and NDF-RT Knowledge Bases for Problem-Medication Pair Generation

Indications discovery and drug repurposing by
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposingSean Ekins
728 views31 slides
Social Cost Of Cancer by
Social Cost Of CancerSocial Cost Of Cancer
Social Cost Of CancerJen Williams
2 views43 slides
Clinical Research Informatics (CRI) Year-in-Review 2014 by
Clinical Research Informatics (CRI) Year-in-Review 2014Clinical Research Informatics (CRI) Year-in-Review 2014
Clinical Research Informatics (CRI) Year-in-Review 2014Peter Embi
2.3K views66 slides
Annotation Editorial by
Annotation EditorialAnnotation Editorial
Annotation EditorialAbhishekPuri27
45 views3 slides
Prospective identification of drug safety signals by
Prospective identification of drug safety signalsProspective identification of drug safety signals
Prospective identification of drug safety signalsIMSHealthRWES
506 views3 slides
Big medicals record data by
Big medicals record dataBig medicals record data
Big medicals record dataErrepe
62 views4 slides

Similar to Comparative Analysis of Association Rule Mining, Crowdsourcing, and NDF-RT Knowledge Bases for Problem-Medication Pair Generation(20)

Indications discovery and drug repurposing by Sean Ekins
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposing
Sean Ekins728 views
Clinical Research Informatics (CRI) Year-in-Review 2014 by Peter Embi
Clinical Research Informatics (CRI) Year-in-Review 2014Clinical Research Informatics (CRI) Year-in-Review 2014
Clinical Research Informatics (CRI) Year-in-Review 2014
Peter Embi2.3K views
Prospective identification of drug safety signals by IMSHealthRWES
Prospective identification of drug safety signalsProspective identification of drug safety signals
Prospective identification of drug safety signals
IMSHealthRWES506 views
Big medicals record data by Errepe
Big medicals record dataBig medicals record data
Big medicals record data
Errepe62 views
Annotating The Biomedical Literature For The Human Variome by Shannon Green
Annotating The Biomedical Literature For The Human VariomeAnnotating The Biomedical Literature For The Human Variome
Annotating The Biomedical Literature For The Human Variome
Shannon Green3 views
Data Mining and Big Data Analytics in Pharma by Ankur Khanna
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
Ankur Khanna7.5K views
Nursing Research Resources by Ann Celestine
Nursing Research Resources Nursing Research Resources
Nursing Research Resources
Ann Celestine5.3K views
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R... by Sean Ekins
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Sean Ekins1K views
A method for mining infrequent causal associations and its application in fin... by JPINFOTECH JAYAPRAKASH
A method for mining infrequent causal associations and its application in fin...A method for mining infrequent causal associations and its application in fin...
A method for mining infrequent causal associations and its application in fin...
Are most positive findings in psychology false or exaggerated? An activist's ... by James Coyne
Are most positive findings in psychology false or exaggerated? An activist's ...Are most positive findings in psychology false or exaggerated? An activist's ...
Are most positive findings in psychology false or exaggerated? An activist's ...
James Coyne2.2K views
Summarization Techniques in Association Rule Data Mining For Risk Assessment ... by IJTET Journal
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
IJTET Journal648 views
Putting Evidence Into Practice1 A by lvandill
Putting Evidence Into Practice1 APutting Evidence Into Practice1 A
Putting Evidence Into Practice1 A
lvandill963 views
Heart Disease Prediction Using Data Mining Techniques by IJRES Journal
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
IJRES Journal6K views
SLC CME- Evidence based medicine 07/27/2007 by cddirks
SLC CME- Evidence based medicine 07/27/2007SLC CME- Evidence based medicine 07/27/2007
SLC CME- Evidence based medicine 07/27/2007
cddirks896 views

More from Allison McCoy

Clinicians Satisfaction Before and After Transition from a Basic to a Compreh... by
Clinicians Satisfaction Before and After Transition from a Basic to a Compreh...Clinicians Satisfaction Before and After Transition from a Basic to a Compreh...
Clinicians Satisfaction Before and After Transition from a Basic to a Compreh...Allison McCoy
222 views1 slide
Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview... by
Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview...Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview...
Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview...Allison McCoy
277 views1 slide
Clinician Satisfaction Before and After Transition from a Basic to a Comprehe... by
Clinician Satisfaction Before and After Transition from a Basic to a Comprehe...Clinician Satisfaction Before and After Transition from a Basic to a Comprehe...
Clinician Satisfaction Before and After Transition from a Basic to a Comprehe...Allison McCoy
300 views26 slides
Evaluation of Clinical Decision Support Alerts for Medications Contraindicate... by
Evaluation of Clinical Decision Support Alerts for Medications Contraindicate...Evaluation of Clinical Decision Support Alerts for Medications Contraindicate...
Evaluation of Clinical Decision Support Alerts for Medications Contraindicate...Allison McCoy
190 views1 slide
Improving Lab Order, Verification, and Follow-up Processes at UT Physicians by
Improving Lab Order, Verification, and Follow-up Processes at UT PhysiciansImproving Lab Order, Verification, and Follow-up Processes at UT Physicians
Improving Lab Order, Verification, and Follow-up Processes at UT PhysiciansAllison McCoy
219 views1 slide
Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn... by
Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn...Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn...
Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn...Allison McCoy
645 views1 slide

More from Allison McCoy(11)

Clinicians Satisfaction Before and After Transition from a Basic to a Compreh... by Allison McCoy
Clinicians Satisfaction Before and After Transition from a Basic to a Compreh...Clinicians Satisfaction Before and After Transition from a Basic to a Compreh...
Clinicians Satisfaction Before and After Transition from a Basic to a Compreh...
Allison McCoy222 views
Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview... by Allison McCoy
Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview...Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview...
Computer-Based Clinical Decision Support for Colorectal Diseases: An Overview...
Allison McCoy277 views
Clinician Satisfaction Before and After Transition from a Basic to a Comprehe... by Allison McCoy
Clinician Satisfaction Before and After Transition from a Basic to a Comprehe...Clinician Satisfaction Before and After Transition from a Basic to a Comprehe...
Clinician Satisfaction Before and After Transition from a Basic to a Comprehe...
Allison McCoy300 views
Evaluation of Clinical Decision Support Alerts for Medications Contraindicate... by Allison McCoy
Evaluation of Clinical Decision Support Alerts for Medications Contraindicate...Evaluation of Clinical Decision Support Alerts for Medications Contraindicate...
Evaluation of Clinical Decision Support Alerts for Medications Contraindicate...
Allison McCoy190 views
Improving Lab Order, Verification, and Follow-up Processes at UT Physicians by Allison McCoy
Improving Lab Order, Verification, and Follow-up Processes at UT PhysiciansImproving Lab Order, Verification, and Follow-up Processes at UT Physicians
Improving Lab Order, Verification, and Follow-up Processes at UT Physicians
Allison McCoy219 views
Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn... by Allison McCoy
Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn...Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn...
Use of the Crowdsourcing Methodology to Generate a Problem-Laboratory Test Kn...
Allison McCoy645 views
A Prototype Knowledge Base and SMART App to Facilitate Organization of Patien... by Allison McCoy
A Prototype Knowledge Base and SMART App to Facilitate Organization of Patien...A Prototype Knowledge Base and SMART App to Facilitate Organization of Patien...
A Prototype Knowledge Base and SMART App to Facilitate Organization of Patien...
Allison McCoy436 views
Automated Inference of Patient Problems from Medications using NDF-RT and the... by Allison McCoy
Automated Inference of Patient Problems from Medications using NDF-RT and the...Automated Inference of Patient Problems from Medications using NDF-RT and the...
Automated Inference of Patient Problems from Medications using NDF-RT and the...
Allison McCoy389 views
The Greasemonkey Firefox Add-On for Altering Display of Data in a Web-Based E... by Allison McCoy
The Greasemonkey Firefox Add-On for Altering Display of Data in a Web-Based E...The Greasemonkey Firefox Add-On for Altering Display of Data in a Web-Based E...
The Greasemonkey Firefox Add-On for Altering Display of Data in a Web-Based E...
Allison McCoy1.2K views
A System to Improve Medication Safety in the Setting of Acute Kidney Injury by Allison McCoy
A System to Improve Medication Safety in the Setting of Acute Kidney InjuryA System to Improve Medication Safety in the Setting of Acute Kidney Injury
A System to Improve Medication Safety in the Setting of Acute Kidney Injury
Allison McCoy208 views
Real-Time Surveillance for Rapid Correction of Clinical Decision Support Fail... by Allison McCoy
Real-Time Surveillance for Rapid Correction of Clinical Decision Support Fail...Real-Time Surveillance for Rapid Correction of Clinical Decision Support Fail...
Real-Time Surveillance for Rapid Correction of Clinical Decision Support Fail...
Allison McCoy387 views

Recently uploaded

homedoctorbook-com-book- (1).pdf by
homedoctorbook-com-book- (1).pdfhomedoctorbook-com-book- (1).pdf
homedoctorbook-com-book- (1).pdffatimasahar769
6 views14 slides
swarasa kalpana .pptx by
swarasa kalpana .pptxswarasa kalpana .pptx
swarasa kalpana .pptxAparnaNandakumar12
6 views30 slides
Lifestyle Measures to Prevent Brain Diseases.pptx by
Lifestyle Measures to Prevent Brain Diseases.pptxLifestyle Measures to Prevent Brain Diseases.pptx
Lifestyle Measures to Prevent Brain Diseases.pptxSudhir Kumar
627 views23 slides
Nidanarthakara Roga.pptx by
Nidanarthakara Roga.pptxNidanarthakara Roga.pptx
Nidanarthakara Roga.pptxAkshay Shetty
21 views23 slides
T1DM case example.pptx by
T1DM case example.pptxT1DM case example.pptx
T1DM case example.pptxNguyễn đình Đức
40 views17 slides
sedative and hypnotics by
sedative and hypnoticssedative and hypnotics
sedative and hypnoticsP.N.DESHMUKH
6 views10 slides

Recently uploaded(20)

Lifestyle Measures to Prevent Brain Diseases.pptx by Sudhir Kumar
Lifestyle Measures to Prevent Brain Diseases.pptxLifestyle Measures to Prevent Brain Diseases.pptx
Lifestyle Measures to Prevent Brain Diseases.pptx
Sudhir Kumar627 views
The Art of naming drugs.pptx by DanaKarem1
The Art of naming drugs.pptxThe Art of naming drugs.pptx
The Art of naming drugs.pptx
DanaKarem112 views
Pulmonary Embolism for Nurses.pptx by Asraf Hussain
Pulmonary Embolism for Nurses.pptxPulmonary Embolism for Nurses.pptx
Pulmonary Embolism for Nurses.pptx
Asraf Hussain32 views
Asthalin Inhaler (Generic Albuterol Sulfate Inhaler) by The Swiss Pharmacy
Asthalin Inhaler (Generic Albuterol Sulfate Inhaler) Asthalin Inhaler (Generic Albuterol Sulfate Inhaler)
Asthalin Inhaler (Generic Albuterol Sulfate Inhaler)
Cholera Romy W. (3).pptx by rweth613
Cholera Romy W. (3).pptxCholera Romy W. (3).pptx
Cholera Romy W. (3).pptx
rweth61353 views
24th oct Pulp Therapy In Young Permanent Teeth.pptx by ismasajjad1
24th oct Pulp Therapy In Young Permanent Teeth.pptx24th oct Pulp Therapy In Young Permanent Teeth.pptx
24th oct Pulp Therapy In Young Permanent Teeth.pptx
ismasajjad113 views
Top Ayurvedic PCD Companies in India Riding the Wave of Wellness Trends by muskansbl01
Top Ayurvedic PCD Companies in India Riding the Wave of Wellness TrendsTop Ayurvedic PCD Companies in India Riding the Wave of Wellness Trends
Top Ayurvedic PCD Companies in India Riding the Wave of Wellness Trends
muskansbl0141 views

Comparative Analysis of Association Rule Mining, Crowdsourcing, and NDF-RT Knowledge Bases for Problem-Medication Pair Generation

  • 1. 1. Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsyvlania 2. National Center for Cognitive Decision Making in Healthcare and School of Biomedical Informatics, University of Texas Health Science Center at Houston Karthik Sethuraman,1 Allison B. McCoy, PhD2, Dean F. Sittig, PhD2 Comparative Analysis of Association Rule Mining, Crowdsourcing, and NDF-RT Knowledge Bases for Problem-Medication Pair Generation Data and Knowledge Bases We sorted each KB by appropriateness measures and examined the top 5,000 pairs per KB to eliminate spurious problem relations. We assessed similarities and differences between three KBs by: i) constructing contingency tables by KB with presence/absence in two remaining KBs as margin criteria to determine degree of overlap (Fisher's test) ii) measuring Spearman’s rank correlation of overlapping problem-medication pairs across KBs to check if KBs ranks pairs in the same way indicating similar KB construction iii) individual assessing the top fifty problem- medication pairs per KB Objective Conclusions Acknowledgements Association rule mining and crowdsourcing are remarkably similar in pair generation. The NDF-RT doesn't overlap with either. Significant Spearman's σ and pair overlap between association rule mining and crowdsourcing indicate similar operation. Similar relation formation may be because both depend on the local EHR dataset. NDF-RT presents a distinct, expert-generated, non-dataset relation source that can be exploited for automatic summarization. Generally, expert-generated (or reference) information may provide an underexploited source of relations for auto-summarization. This project was supported by Grant No. 10510592 for Patient- Centered Cognitive Support under the Strategic Health IT Advanced Research Projects (SHARP) from the Office of the National Coordinator for Health Information Technology and by the CPRIT Summer Undergraduate Cancer Research Experience. Please contact the corresponding author at: amccoy1@tulane.edu Fisher's Test and Spearman's σ Fisher's test: Overlap between association rule mining and crowdsourcing was very significant (p<0.001) while overlap between either of the two and NDF-RT was not (p>0.10). Spearman's σ: Association rule mining and crowdsourcing overlapping pairs displayed significant monotonic correlation (for ranks) with σ of 0.315 (p<0.001), but neither displayed correlation with NDF-RT. Overlapping Pairs between KBs Association Rule Mining and crowdsourcing have more overlapping pairs (771) than either with the NDF-RT (145, 153). Analysis Data: Electronic health record (EHR) data from a large, multi-specialty, ambulatory academic practice that provides medical care for all ages in Houston. One year study period from June 1, 2010 to May 30, 2011, including 418,221 medications and 1,222,308 problems logged for 53,108 patients. Knowledge Base Generation: We independently applied association rule mining and crowdsourcing to the dataset to generate problem-medication knowledge bases (KBs) and mapped local EHR terminology to NDF-RT terminology for 3rd KB. Top Ten Problem- Medication Pairs Association rule mining forms very sensitive but low frequency pairs. It finds very specific but more rare relationships like Scabies to Permethrin. NDF-RT gives high- frequency pairs – all problems listed are hypertension. Crowdsourcing provides a mixture of association rule mining and NDF-RT relations. 3/10 problems are hypertension but more rare relations like Allergic Rhinitis and Fluticasone Propionate are also present. Generally, to better compile and organize the growing patient information confronting healthcare providers through automatic summarization. Specifically, to evaluate three different approaches, association rule mining, crowdsourcing, and the National Drug File- Reference Terminology (NDF-RT), to problem- medication pair generation – a key auto- summarization task for clinical datasets. Association Rule Mining Data mining technique that looks for relationships as co-occurrence of pairs in a database, here logged medication-problem information Crowdsourcing Using information gathered from a group or crowd, here the medications and problems entered by a set of identified physicians NDF-RT Curated reference of medications and associated information, including related problems/diseases. NDF-RT 4771 Crowdsourcing 4153 Association Rule Mining 4145 84 69 76 702 Association Rule Mining Crowdsourcing NDF-RT Rank Problem Medication Problem Medication Problem Medication 1 Scabies Permethrin Hypo- thyroidism Levothyroxine Sodium Hypertension Cardizem SR 2 Bacterial Vaginosis Metronidazole Hyperlipid- emia Simvastatin Hypertension Cardene 3 Motor Neuron Disease Rilutek Hypertension Lisinopril Hypertension Reserpine 4 Vaginal Candidiasis Terconazole Bacterial Vaginosis Metronidazole Hypertension Triamterene 5 Pseudo- monas Amikacin Sulfate Hyperlipidimia Lipitor Hypertension Innopran XL 6 Type 1 Diabetes Glucagon Hypertension Hydrochloro- thiazide Hypertension Bendroflu- methiazide 7 Hypothyroid- ism Levothyroxine Sodium Hypertension Amlodipine Besylate Hypertension Isradipine 8 Mitochondrial Metabolism Disorder Levocarnitine Allergic Rhinitis Fluticasone Propionate Hypertension Eplerenone 9 Tinea Capitis Griseofulvin (Microsize) Esophageal Reflux Nexium Hypertension L-Tyrosine 10 Congenital Adrenal Hyperplasia Solu-Cortef Type 1 Diabetes Metformin HCI Hypertension Diltiazem HCl CR