SlideShare a Scribd company logo
1 of 28
© Columbia University
MedLEE, a Natural Language
Processing Service, on the Public
Health Research Grid
Albert Lai, Ph.D.
Carol Friedman, Ph.D.
Department of Biomedical Informatics,
Columbia University
© Columbia University
Goals
 Better understand grid technology
 Provide NLP service to members of the
Public Health Research Grid
© Columbia University
Motivation
 Coded information is urgently needed for
clinical applications that improve care and
lower costs
 Coded data other than laboratory data and
pharmacy data - very scarce and difficult
to obtain
© Columbia University
Why NLP?
 NLP technology offers high throughput
method for automatic encoding of clinical
information in narrative patient reports
 Applicable to a broad range of clinical
domains
 Appropriate for diverse applications
© Columbia University
Outline
 MedLEE Overview
 Getting on the Grid
 Service Generalization Issues
© Columbia University
MedLEE Overview
 Medical Language Extraction and
Encoding
Extracts, structures, and encodes clinical
information in narrative patient reports
Comprehensive coverage
Can be used for diverse clinical applications
Development started in 1991
Used at Columbia University Medical Center
since 1995
Numerous independent evaluations
....New maculopapular rash
on trunk ….
MedLEE
Problem:rash
Status:new
Descriptor:maculopapular
Bodyloc:trunk
Code:C0460005 (trunk)
Code:C0241488 (trunk
maculopapular rash)
Analytics
Patient report
Coded data
Clinical Guidance
-- Indicate potential notifiable
disease for reporting.
-- Inform of local outbreak and
indicate appropriate tests.
-- Indicate need for vaccination.
Surveillance
-- Indicate potential bioterrorist
event.
-- Transmit syndromic event to
health dept for surveillance.
Quality Assurance
Detect potential cases of
medication reaction.
Clinical Research
-- Detect cases of rash for
inclusion in trial of new
treatment.
-- Find genetic associations
with atopic rash.
Applications
© Columbia University
MedLEE Architecture
Text
Report
MedLEE Application
Postprocessor
Preprocessor
standardize
Input to MedLEE
transform output
for application
© Columbia University
Text Reports Processed
 Radiology Reports
 Cardiology Reports
 Pathology Reports
 Admission notes
 Discharge Summaries
 Resident Sign out notes
 Office Visits
 Telephone encounters
© Columbia University
Applications using MedLEE
 Biosurveillance
 Syndromic surveillance
 Adverse Drug Event detection
 Decision Support
 Clinical Research
 Clinical Trials
 Quality Assurance
 Automated Encoding
 Patient Management
 Data mining – finding trends and associations
 Linking patient record to the literature
 Summarization
© Columbia University
Adverse Event Detection
 Use NLP system to detect events defined in the
New York Patient Occurrence Reporting and
Tracking System (NYPORTS)
 System: MedLEE +
 Queries consisiting of criteria mapping output to
NYPORTS event
 Results: system outperformed traditional and
previous automated methods  NLP effective
method for automated adverse event detection
Melton GB, Hripcsak G. Automated detection of adverse events using natural language processing of discharge summaries.
J Am Med Inform Assoc. 2005 Jul-Aug;12(4):448-57.
© Columbia University
Syndromic Surveillance
 EHR from Institute of Family Health (IFH)
 13 community health centers
 System: MedLEE+queries detecting flu-like
illness
 Processed all types of notes:
 nursing and physician notes
 several types of encounters
 Results:
 System correlated well with a proven syndromic
surveillance system based on chief complaints
© Columbia University
Challenges for Grid-enabling MedLEE
 Setting up a grid node
 Firewalls
 Security / PHI concerns
 Wrapping MedLEE into a grid service
© Columbia University
Getting on the Grid
 CUMC on hospital network
Setting up Globus node required numerous
port openings
 No DMZ available
 Solution: install main campus w/ no
firewall
© Columbia University
Security Concerns
 No firewall
 Medical data being sent to node
 Medical data remains on node for some
period of time
 Currently, we do not send data containing
PHI to the Grid service, only in
demonstration mode
© Columbia University
Wrapping MedLEE
 Introduce framework from caGrid / OSU
 Using schema for web service,
automatically generates custom beans
 Automatically generates method stubs for
service operations
 Can configure security parameters for
service
© Columbia University
© Columbia University
© Columbia University
MedLEE Example
Pre-
processor
[new,maculopapular,rash,on,trunk,’.’]
Parser [problem,rash,[status,new],[descriptor,maculopapular],[bodyloc,tru
nk]]
Encoder [problem,rash,[status,new],[descriptor,maculopapular],[bodyloc,tru
nk,[code,C0460005^trunk]],[code,C0241488^trunk maculopapular
rash]
XML
Translator
<problem v = “rash”>
<status v = “new”></status>
<descriptor v = “maculopapular”></descriptor>
<bodyloc v = ”trunk”>
<code v = “C046005^trunk”</code>
</bodyloc>
<code v = “C0241488^trunk maculopapular rash” </code>
</problem>
“New maculopapular rash on trunk.”
© Columbia University
Service Generalization Issues
 Preprocessing needed to standardize
narrative prior to MedLEE
 Allow user-specified customization
© Columbia University
Preprocessor – Standardize Report for
MedLEE
 Handle problematic text (formatted data in narrative)
Her2/Neu-2+  Her2/Neu measured 2+.
 Add punctuations (e.g., ‘.’) for run on sentences
 Add ignore tags to indicate text that should be ignored
This neoplasm is strongly estrogen positive <ign> To
date an indicator reliable in predicting tumor recurrence
in node-negative patients is not available. </ign>
 Add tags from external processor
The patient had fever <phr sem=“date” t=“19991002”>2
days before admission</phr>
© Columbia University
Postprocessor – Transform output
 Modify and transform output for different
views/applications
Map into tabular form for
spreadsheet/database
Problem-oriented/summarization/highlighting
Execute queries to interpret output according
to needs of application
© Columbia University
User-Specified Knowledge
 Section headings
New sections/abbreviated section headers:
 PMH|Past Medical History
 New abbreviations containing ‘.’ (O.D.)
 Avoids breaking up sentence at abbreviation
 Add/replace lexical definitions
 OD|bodyloc|right eye
© Columbia University
Example of Need for Custom Lexicon
 “27 yo male no sig PMH presented to ED
on 4/23 w/ fevers, backache,headache,
pharyngitis, no rhinorrhea, no cough,
lasting 1 day; Now sxs improved.”
 Add/replace lexical definitions
 ed|service|emergency department
 sxs|cfinding|symptom
finding:demo
age>> [27,[idref,2],year,[idref,4]]
parsemode>> mode2
sectname>> report clinical information item
sex>> male
idref>> 6
sid>> 1
status:past medical history
certainty>> no
idref>> 8
idref>> 12
timeper>> presentation
idref>> 14
sectname>> report clinical information item
parsemode>> mode4
sid>> 1
.....
finding:better
idref>> 63
parsemode>> mode4
sectname>> report clinical information item
sid>> 2
timeper>> now
idref>> 59
finding:demo
age>> [27,[idref,2],year,[idref,4]]
parsemode>> mode2
sectname>> report clinical information item
sex>> male
idref>> 6
sid>> 1
status:past medical history
certainty>> no
idref>> 8
idref>> 12
timeper>> presentation
idref>> 14
service>> emergency department
idref>> 18
location>> to
idref>> 16
date>> 00000423
idref>> 22
sectname>> report clinical information item
parsemode>> mode4
sid>> 1
…..
problem:symptom
change>> better
idref>> 63
idref>> 61
parsemode>> mode1
sectname>> report clinical information item
sid>> 2
timeper>> now
idref>> 59
© Columbia University
Where are we at now?
 Basic MedLEE node
 Encrypted data transport using TLS
 Authentication using gridmap file
© Columbia University
Next Steps
 Addition of customization features to
GridMedLEE for greater generalization
User-defined / domain-specific lexicons,
terminology, section headings, etc.
Additional output formats
 Investigate other security mechanisms
 Investigate real usage on grid in research
applications
© Columbia University
Acknowledgments
 Ken Hall
 Dan Washington
 Brian Lee
 Shannon Hastings
 Philip Payne
This work was supported by Centers for Disease Control
and Prevention grant P01 HK000029

More Related Content

Similar to MEDLEE: natural language processing on the public health grid

Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...lucenerevolution
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health recordNursing353
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health recordtschenf
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health recordtschenf
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health recordNursing353
 
2004 10-19 rudi vdv
2004 10-19 rudi vdv2004 10-19 rudi vdv
2004 10-19 rudi vdvguest3cf4991
 
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...Kaela Johnson
 
A Standards-based Approach to Development of Clinical Registries - NZ Gestati...
A Standards-based Approach to Development of Clinical Registries - NZ Gestati...A Standards-based Approach to Development of Clinical Registries - NZ Gestati...
A Standards-based Approach to Development of Clinical Registries - NZ Gestati...Koray Atalag
 
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...openEHR-Japan
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...Servio Fernando Lima Reina
 
Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...Koray Atalag
 
Health Science Data and Metadata: Trends and Needs
Health Science Data and Metadata: Trends and NeedsHealth Science Data and Metadata: Trends and Needs
Health Science Data and Metadata: Trends and NeedsLynne Frederickson
 
Health Information Standards - Kevin O'Carroll
Health Information Standards - Kevin O'CarrollHealth Information Standards - Kevin O'Carroll
Health Information Standards - Kevin O'Carrollhealthcareisi
 
An infrastructure for clinical data extraction, medical knowledge and mining ...
An infrastructure for clinical data extraction, medical knowledge and mining ...An infrastructure for clinical data extraction, medical knowledge and mining ...
An infrastructure for clinical data extraction, medical knowledge and mining ...pingxiaoou
 
Simplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSimplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSemantic Web San Diego
 
Healthstory Project Overview - Dictation To Clinical Data For AHDI
Healthstory Project Overview - Dictation To Clinical Data For AHDIHealthstory Project Overview - Dictation To Clinical Data For AHDI
Healthstory Project Overview - Dictation To Clinical Data For AHDINick van Terheyden
 

Similar to MEDLEE: natural language processing on the public health grid (20)

Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health record
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health record
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health record
 
Building a consensus for the electronic health record
Building a consensus for the electronic health recordBuilding a consensus for the electronic health record
Building a consensus for the electronic health record
 
2004 10-19 rudi vdv
2004 10-19 rudi vdv2004 10-19 rudi vdv
2004 10-19 rudi vdv
 
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
 
A Standards-based Approach to Development of Clinical Registries - NZ Gestati...
A Standards-based Approach to Development of Clinical Registries - NZ Gestati...A Standards-based Approach to Development of Clinical Registries - NZ Gestati...
A Standards-based Approach to Development of Clinical Registries - NZ Gestati...
 
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
 
Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...
 
Health Science Data and Metadata: Trends and Needs
Health Science Data and Metadata: Trends and NeedsHealth Science Data and Metadata: Trends and Needs
Health Science Data and Metadata: Trends and Needs
 
Health Information Standards - Kevin O'Carroll
Health Information Standards - Kevin O'CarrollHealth Information Standards - Kevin O'Carroll
Health Information Standards - Kevin O'Carroll
 
clinphone.ppt
clinphone.pptclinphone.ppt
clinphone.ppt
 
An infrastructure for clinical data extraction, medical knowledge and mining ...
An infrastructure for clinical data extraction, medical knowledge and mining ...An infrastructure for clinical data extraction, medical knowledge and mining ...
An infrastructure for clinical data extraction, medical knowledge and mining ...
 
Simplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSimplifying semantics for biomedical applications
Simplifying semantics for biomedical applications
 
Registers-2012-Funding-Proposal-Form
Registers-2012-Funding-Proposal-FormRegisters-2012-Funding-Proposal-Form
Registers-2012-Funding-Proposal-Form
 
From professional standards to information standards
From professional standards to information standardsFrom professional standards to information standards
From professional standards to information standards
 
Evidence-based Healthcare IT
Evidence-based Healthcare ITEvidence-based Healthcare IT
Evidence-based Healthcare IT
 
Healthstory Project Overview - Dictation To Clinical Data For AHDI
Healthstory Project Overview - Dictation To Clinical Data For AHDIHealthstory Project Overview - Dictation To Clinical Data For AHDI
Healthstory Project Overview - Dictation To Clinical Data For AHDI
 

Recently uploaded

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 

Recently uploaded (20)

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 

MEDLEE: natural language processing on the public health grid

  • 1. © Columbia University MedLEE, a Natural Language Processing Service, on the Public Health Research Grid Albert Lai, Ph.D. Carol Friedman, Ph.D. Department of Biomedical Informatics, Columbia University
  • 2. © Columbia University Goals  Better understand grid technology  Provide NLP service to members of the Public Health Research Grid
  • 3. © Columbia University Motivation  Coded information is urgently needed for clinical applications that improve care and lower costs  Coded data other than laboratory data and pharmacy data - very scarce and difficult to obtain
  • 4. © Columbia University Why NLP?  NLP technology offers high throughput method for automatic encoding of clinical information in narrative patient reports  Applicable to a broad range of clinical domains  Appropriate for diverse applications
  • 5. © Columbia University Outline  MedLEE Overview  Getting on the Grid  Service Generalization Issues
  • 6. © Columbia University MedLEE Overview  Medical Language Extraction and Encoding Extracts, structures, and encodes clinical information in narrative patient reports Comprehensive coverage Can be used for diverse clinical applications Development started in 1991 Used at Columbia University Medical Center since 1995 Numerous independent evaluations
  • 7. ....New maculopapular rash on trunk …. MedLEE Problem:rash Status:new Descriptor:maculopapular Bodyloc:trunk Code:C0460005 (trunk) Code:C0241488 (trunk maculopapular rash) Analytics Patient report Coded data Clinical Guidance -- Indicate potential notifiable disease for reporting. -- Inform of local outbreak and indicate appropriate tests. -- Indicate need for vaccination. Surveillance -- Indicate potential bioterrorist event. -- Transmit syndromic event to health dept for surveillance. Quality Assurance Detect potential cases of medication reaction. Clinical Research -- Detect cases of rash for inclusion in trial of new treatment. -- Find genetic associations with atopic rash. Applications
  • 8. © Columbia University MedLEE Architecture Text Report MedLEE Application Postprocessor Preprocessor standardize Input to MedLEE transform output for application
  • 9. © Columbia University Text Reports Processed  Radiology Reports  Cardiology Reports  Pathology Reports  Admission notes  Discharge Summaries  Resident Sign out notes  Office Visits  Telephone encounters
  • 10. © Columbia University Applications using MedLEE  Biosurveillance  Syndromic surveillance  Adverse Drug Event detection  Decision Support  Clinical Research  Clinical Trials  Quality Assurance  Automated Encoding  Patient Management  Data mining – finding trends and associations  Linking patient record to the literature  Summarization
  • 11. © Columbia University Adverse Event Detection  Use NLP system to detect events defined in the New York Patient Occurrence Reporting and Tracking System (NYPORTS)  System: MedLEE +  Queries consisiting of criteria mapping output to NYPORTS event  Results: system outperformed traditional and previous automated methods  NLP effective method for automated adverse event detection Melton GB, Hripcsak G. Automated detection of adverse events using natural language processing of discharge summaries. J Am Med Inform Assoc. 2005 Jul-Aug;12(4):448-57.
  • 12. © Columbia University Syndromic Surveillance  EHR from Institute of Family Health (IFH)  13 community health centers  System: MedLEE+queries detecting flu-like illness  Processed all types of notes:  nursing and physician notes  several types of encounters  Results:  System correlated well with a proven syndromic surveillance system based on chief complaints
  • 13. © Columbia University Challenges for Grid-enabling MedLEE  Setting up a grid node  Firewalls  Security / PHI concerns  Wrapping MedLEE into a grid service
  • 14. © Columbia University Getting on the Grid  CUMC on hospital network Setting up Globus node required numerous port openings  No DMZ available  Solution: install main campus w/ no firewall
  • 15. © Columbia University Security Concerns  No firewall  Medical data being sent to node  Medical data remains on node for some period of time  Currently, we do not send data containing PHI to the Grid service, only in demonstration mode
  • 16. © Columbia University Wrapping MedLEE  Introduce framework from caGrid / OSU  Using schema for web service, automatically generates custom beans  Automatically generates method stubs for service operations  Can configure security parameters for service
  • 19. © Columbia University MedLEE Example Pre- processor [new,maculopapular,rash,on,trunk,’.’] Parser [problem,rash,[status,new],[descriptor,maculopapular],[bodyloc,tru nk]] Encoder [problem,rash,[status,new],[descriptor,maculopapular],[bodyloc,tru nk,[code,C0460005^trunk]],[code,C0241488^trunk maculopapular rash] XML Translator <problem v = “rash”> <status v = “new”></status> <descriptor v = “maculopapular”></descriptor> <bodyloc v = ”trunk”> <code v = “C046005^trunk”</code> </bodyloc> <code v = “C0241488^trunk maculopapular rash” </code> </problem> “New maculopapular rash on trunk.”
  • 20. © Columbia University Service Generalization Issues  Preprocessing needed to standardize narrative prior to MedLEE  Allow user-specified customization
  • 21. © Columbia University Preprocessor – Standardize Report for MedLEE  Handle problematic text (formatted data in narrative) Her2/Neu-2+  Her2/Neu measured 2+.  Add punctuations (e.g., ‘.’) for run on sentences  Add ignore tags to indicate text that should be ignored This neoplasm is strongly estrogen positive <ign> To date an indicator reliable in predicting tumor recurrence in node-negative patients is not available. </ign>  Add tags from external processor The patient had fever <phr sem=“date” t=“19991002”>2 days before admission</phr>
  • 22. © Columbia University Postprocessor – Transform output  Modify and transform output for different views/applications Map into tabular form for spreadsheet/database Problem-oriented/summarization/highlighting Execute queries to interpret output according to needs of application
  • 23. © Columbia University User-Specified Knowledge  Section headings New sections/abbreviated section headers:  PMH|Past Medical History  New abbreviations containing ‘.’ (O.D.)  Avoids breaking up sentence at abbreviation  Add/replace lexical definitions  OD|bodyloc|right eye
  • 24. © Columbia University Example of Need for Custom Lexicon  “27 yo male no sig PMH presented to ED on 4/23 w/ fevers, backache,headache, pharyngitis, no rhinorrhea, no cough, lasting 1 day; Now sxs improved.”  Add/replace lexical definitions  ed|service|emergency department  sxs|cfinding|symptom
  • 25. finding:demo age>> [27,[idref,2],year,[idref,4]] parsemode>> mode2 sectname>> report clinical information item sex>> male idref>> 6 sid>> 1 status:past medical history certainty>> no idref>> 8 idref>> 12 timeper>> presentation idref>> 14 sectname>> report clinical information item parsemode>> mode4 sid>> 1 ..... finding:better idref>> 63 parsemode>> mode4 sectname>> report clinical information item sid>> 2 timeper>> now idref>> 59 finding:demo age>> [27,[idref,2],year,[idref,4]] parsemode>> mode2 sectname>> report clinical information item sex>> male idref>> 6 sid>> 1 status:past medical history certainty>> no idref>> 8 idref>> 12 timeper>> presentation idref>> 14 service>> emergency department idref>> 18 location>> to idref>> 16 date>> 00000423 idref>> 22 sectname>> report clinical information item parsemode>> mode4 sid>> 1 ….. problem:symptom change>> better idref>> 63 idref>> 61 parsemode>> mode1 sectname>> report clinical information item sid>> 2 timeper>> now idref>> 59
  • 26. © Columbia University Where are we at now?  Basic MedLEE node  Encrypted data transport using TLS  Authentication using gridmap file
  • 27. © Columbia University Next Steps  Addition of customization features to GridMedLEE for greater generalization User-defined / domain-specific lexicons, terminology, section headings, etc. Additional output formats  Investigate other security mechanisms  Investigate real usage on grid in research applications
  • 28. © Columbia University Acknowledgments  Ken Hall  Dan Washington  Brian Lee  Shannon Hastings  Philip Payne This work was supported by Centers for Disease Control and Prevention grant P01 HK000029