SlideShare a Scribd company logo
1 of 5
Download to read offline
MyHarmony: Generating Statistics from Clinical Text for Monitoring Clinical Quality Indicators
Md. Khadzir Sheikh Ahmad1
; Mohd Syazrin Mohd Sakri1
; „Ismat Mohd Sulaiman1
; Syirahaniza Mohd
Salleh1
; Dickson Lukose2
; Omar Ismail1
; Abd Aziz Latip3
; Muhammad Aiman Mazlan3
.
1
Health Informatics Centre, Planning Division, Ministry of Health, Malaysia
2
GCS Agile Pty. Ltd.
3
MIMOS Bhd.
Abstract:
The Ministry of Health developed MyHarmony with MIMOS Bhd. as part of the Malaysian Health Data
Warehouse (MyHDW) initiative. MyHDW aims to be a trusted source of truth of comprehensive health
data structured for analysis. MyHarmony fulfills that criteria by enabling data and information from
unstructured form to be mined, such as texts, images, and sound. MyHarmony's first deliverable is the
ability to mine clinical texts using Natural Language Processing (NLP) with SNOMED CT as its
knowledge-base of clinical terms. MyHarmony is the engine the retrieves data and information into
computer processable form by assigning SNOMED CT codes, which can then be further analysed
statistically. MyHarmony is able to recognise and harmonise different terms that means the same. It also
understands context for a more accurate coding; such as negations (no, not known, unknown) and
conditionals (past history, symptoms of, previous). Using SNOMED CT, MyHarmony's ability is further
advanced by using subsumption technique for a more comprehensive statistical results. This study will
present a use case where clinical text from anonymized hospital discharge summaries can generate
clinical indicators using MyHarmony for health managers. An added benefit to the operational (hospital)
staff is the ability to produce such indicators is an efficient and timely manner by reducing workload for
data collection and submission. MyHarmony could be the new and improved way to provide important
statistical measures for evidence-based health planning, leading to improved healthcare services and
health as a whole.
Keywords:
MyHarmony; MyHDW; Text mining; Quality Indicators; SNOMED CT
1. Introduction:
The Ministry of Health developed MyHarmony with MIMOS Bhd. as part of the Malaysian Health Data
Warehouse (MyHDW) initiative. MyHDW aims to be a trusted source of truth of comprehensive health
data structured for analysis. MyHarmony is an application in the Malaysian Health Data Warehouse
(MyHDW) that aims to analyse semi-structured and unstructured data. The unstructured data can be in the
form of free-text, visual, audio and machine generated data. Unstructured data does not have
predetermined values and not stored in an organized manner to be analysed by a conventional data
warehouse. Therefore, other techniques need to be applied. MyHarmony aims to address this and be
included as part of MyHDW.
There were three (3) major deliverables in the conceptual stage. The first part refers to the development
and implementation of health terminology standards, namely SNOMED CT, which will be the knowledge
bases for MyHarmony. The second part was harmonization of the medical terminology to SNOMED CT
terms by way of mapping. The last part was about the development and implementation of MyHarmony
to show that the application can codify relevant terms in free-text using Natural Language Processing
(NLP) technique. The SNOMED CT codified data can then be analysed for information generation.
2. Methodology:
The development was first started in 2014 with the development of Cardiology Refset. Cardiology Refset
was the terminology reference for the MyHarmony engine during the harmonisation/mapping and
codification process. Cardiology Refset (version 1.0) was completed and released in 2014. It is a simple
reference set [1] containing about 600 terms related to Cardiology speciality including signs and
symptoms, diagnoses, procedures, body structures, medical devices and medications. It was delivered in
time to be tested on MyHarmony standalone system to generate National Cardiovascular Disease
(NCVD) registries.
The draft Refset and method was presented during IHTSDO meetings and Expo in succession on
September 2013, October 2013, and April 2014 to gain feedback from experts in the international
community. The finalized method was presented during SNOMED CT Expo, October 2014 [2] .
The Cardiology Refset was then expanded to include all cardiology related terms and Cardiology Refset
v1.1 was completed in July 2016 containing more than 6000 concepts. First, more than 300,000
SNOMED CT concepts (Fully Specified Names) were extracted and reviewed by PIK using eyeballing
technique. About 12,000 concepts that were believed to be related to Cardiology specialty was given to
the clinicians for review. The clinicians reduced the number of concepts to about 6,000. Additionally, the
Refset included local terms and common abbreviations which were mapped to existing concepts.
Next, the team utilise MyHarmony to generate the analysis. There were 4 main functions in MyHarmony:
1. Terminology Management– to allow user to upload the SNOMED CT International Release
content into MyHarmony, and upload SNOMED CT Refset in reference to the SNOMED CT
International Release.
2. Data Management– to allow user to upload the data that will be harmonized and codified. This
function also allows user to view the content of the data.
3. Codification Management– to allow user to codify the dataset according to the selected
SNOMED CT Refset and view the codified data for validation purpose.
4. Query Management– to allow user to explore the data by generating queries using Structured
Query Language (SQL).
The functions were arranged according to the work process. First, the SNOMED CT International
Release, SNOMED CT Cardiology Refset, and the dataset needs to be uploaded. Then, the dataset is
codified and saved. Using the Query Management, the codified data can then be explored via data
profiling and query generation.
For initial analysis, SNOMED CT International release version 20160731 was used as the Cardiology
Reference Set was developed using this version of SNOMED CT.
The team received a set of database from a hospital with cardiology service which consists of 16224
discharge summaries from year 2017. The database was then uploaded into MyHarmony. The personally
identified information (patient names, ID, and street address) were anonymised prior to codification and
analysis. The output is a codified dataset, which enable information processing and analysis by machines.
Using the Query Management, the codified data was then be explored via data profiling and query
generation.
3. Result:
The team conducted several data profiling queries to ensure that MyHarmony were able to capture the
data correctly. For example, the number of records by month between Raw data (MyHarmony without
SNOMED CT) and Harmonized data (MyHarmony with SNOMED CT) should return the same result.
Other examples of data profiling queries are the number of records by gender, by specialty, and by
ethnicity.
Next, the team developed queries required by the National Cardiovascular Disease (NCVD) registries and
compare the results with published registry reports. For example, querying the number of Ischaemic Heart
Disease (IHD) by gender shows 1:4 female to male ratio, which is a similar ratio in the registry reports.
Furthermore, the query also shows that Harmonized data captures more result compare to Raw data due to
SNOMED CT relationship structure, thus capturing all the subtypes of IHD and its synonyms or ways of
writing. The registry, however, only captures three (3) diagnosis due to its structured format, which are
ST Elevation Myocardial Infarction, Non-ST Elevation Myocardial Infarction, and Unstable Angina. This
trend and pattern comparison allow validation by the Clinicians and gains their buy-in in using
MyHarmony.
The team also tried to generate more queries required by the NCVD registry. However, it was limited by
the documentation in the discharge summary. Registry queries requires more detail information that may
often not documented in a discharge summary, such as information on smoking status and complications
of procedures.
After that, the team was tasked to generate National Cardiology Key Performance Indicators (KPIs).
MyHarmony are able to generate 7 out of the 8 KPIs (KPI 2 to 8). The first KPI was excluded because the
data are available at the clinic and not documented in inpatient discharge summaries. The Health
Information Framework (HIF) was developed for the 7 KPIs, which detailed out the inclusion and
exclusion criteria, the target, the formula, the terms used by MyHarmony, and query, and lastly a section
for additional notes.
Preliminary manual validation on the completeness and accuracy of codified data shows 90% precision
and 70% recall. The content of those records is complex as it does not follow grammar rules, and contains
a large number of short forms, abbreviations, acronyms and analogous terms (e.g., synd, ACS, CCS IV,
NYHA 2). One example of record is “2VD with RCA culprit lesion - Ad hoc PCI DES to RCA and LAD”
which is challenging to codify using approaches based on strict grammar. The revised version of
MyHarmony uses a different approach based on shallow parsing and the consideration of multiple
suitable combinations of words in a sentence. With further iterations and improvement in the mapping,
these challenges were overcome[3].
From the SNOMED CT codified database, the system was able to show a more accurate result during
analysis . This is because MyHarmony capitalises on the existing SNOMED CT relationships structure
between concepts. In this case, when querying “Number of Ischaemic Heart Disease cases per year”,
MyHarmony search the code and term for “Ischaemic heart disease”, its synonyms and accepted
abbreviations, and all the subtypes of Ischaemic heart disease such as all subtypes of “Myocardial
infarction” and “Angina”. MyHarmony aggregates these records resulting in a more accurate analysis.
Usually, the result where MyHarmony utilise SNOMED CT‟s relationship structure would show more
records. This is because Mi-Harmony was able to aggregate data not just through String match, but also
utilize the IS-A hierarchy structure in SNOMED CT. For example, querying “Ischemic heart disease” will
gather clinical records with synonymous terms like “Ischaemic heart disease” and “IHD”; as well as
clinical records containing all the subtypes of “Ischemic heart disease” such as “Myocardial Infarction”
and “Unstable angina”.
Context awareness such as negation and pasts events were also applied. For example, the term “No chest
pain”, “No known history of diabetes mellitus”, and “Symptoms of heart failure” will not be coded as the
presenting condition. Additionally, terms like “Previous history of”, “Previous admission of”, and
“Family history of” within the same sentence as a clinical condition will not be coded as the current
condition for the record.
4. Discussion and Conclusion:
When showcasing these abilities to the clinicians, the team agreed that MyHarmony was able to:
(i) Generate more information from free-text utilising the SNOMED CT structure, thus,
reducing the effort needed to collect data in a structured manner such as in a registry
and indicator reports;
(ii) Able to generate new information by retrospectively running new queries on old
discharge summary records; thus, reducing the effort and time to collect data in a
prospective manner when new questions arise, such as for indicator reports that often
change on a yearly basis;
(iii) Able to deliver information in a timelier fashion; thus clinicians and health managers
are able to plan and take action without waiting for a 1 to 3 yearly report;
(iv) Improve documentation of clinicians when they are aware of MyHarmony‟s ability
during roadshows.
Generating indicators for monitoring and evaluation can be a burden even for healthcare facilities
equipped with EHR. Conventionally, collecting data for indicators requires multiple data entries in
aggregated manner, with manual submission to central agencies, where the results are only published on a
yearly basis. Introducing MyHarmony may reduce these burdens. Capturing data from the source in an
automated way, i.e. free text documented by doctors, would reduce duplication of work and the amount of
resources to capture the data into manual form. Having the data in granular form would allow a more
dynamic analysis and prevents dishonesty. Information required, whether old or new information, can be
formulated and disseminated back to the clinicians and health managers in a timelier fashion.
MyHarmony has the potential to expand further in its implementation and technology. However, there are
still challenges to be addressed. Currently, MyHarmony has been developed to mine free-text for
Cardiology via a back-end approach. It uses a single version of SNOMED CT International. The team is
still researching the best approach to manage SNOMED CT versions and its codified data, which may
impact the resulting analysis in an inconsistent way. The team are also seeking international experience
for this matter.
Other challenges include researching a more efficient and effective method to develop SNOMED CT
Refsets. The initial method by referencing terms required by registries or indicators has been established.
However, expanding the SNOMED CT Refsets to include relevant terms for a specific clinical specialty
or domain needs to be refined further. Eye-balling technique to search the entire SNOMED CT content
have its strength and weaknesses. Even though it is a very thorough method, there are possibilities of
missed terms and very time consuming. Despite these challenges, the journey in developing MyHarmony
and the lessons learnt has allowed the team to refine the methods and processes to expand the use of
MyHarmony to other clinical specialties.
Analysis from unstructured data would hope to complement analysis from structured data (like census and
registries), with the additional benefit workload reduction to provide timelier, trusted, and dynamic
information.
References:
1. SNOMED CT Simple Reference Set:
https://confluence.ihtsdotools.org/display/DOCRFSPG/5.1.+Simple+Reference+Set
2. Mohd Sulaiman I, Sheikh Ahmad MK. SNOMED CT Cardiology Reference Set Development,
Malaysia. Proceedings of the SNOMED CT Implementation Showcase [Internet]. Amsterdam: IHTSDO;
2014. Retrieved from:
http://ihtsdo.org/fileadmin/user_upload/doc/showcase/show14/SnomedCtShowcase2014_Abstract_14058
.pdf
3. Abdul Manaf NA, Mohamed K, Lukose D. Harmonizing EHR Databases with SNOMED CT. Proceedings of the
SNOMED CT Implementation Showcase 2014 [Internet]. Amsterdam: IHTSDO; 2014.
https://confluence.ihtsdotools.org/display/FT/SNOMED+CT+Implementation+Showcase+2014

More Related Content

What's hot

Fakebuster fake news detection system using logistic regression technique i...
Fakebuster   fake news detection system using logistic regression technique i...Fakebuster   fake news detection system using logistic regression technique i...
Fakebuster fake news detection system using logistic regression technique i...Conference Papers
 
Implementation of blood donation application using android smartphone
Implementation of blood donation application using android smartphoneImplementation of blood donation application using android smartphone
Implementation of blood donation application using android smartphoneIJARIIT
 
DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...
DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...
DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...ijsptm
 
Authenticated Medical Documents Releasing with Privacy Protection and Release...
Authenticated Medical Documents Releasing with Privacy Protection and Release...Authenticated Medical Documents Releasing with Privacy Protection and Release...
Authenticated Medical Documents Releasing with Privacy Protection and Release...JAYAPRAKASH JPINFOTECH
 
Health data mining
Health data miningHealth data mining
Health data miningsidra ali
 
A Study on the Applications and Impact of Artificial Intelligence in E Commer...
A Study on the Applications and Impact of Artificial Intelligence in E Commer...A Study on the Applications and Impact of Artificial Intelligence in E Commer...
A Study on the Applications and Impact of Artificial Intelligence in E Commer...ijtsrd
 
Understanding the Need of Data Integration in E Healthcare
Understanding the Need of Data Integration in E HealthcareUnderstanding the Need of Data Integration in E Healthcare
Understanding the Need of Data Integration in E Healthcareijtsrd
 
Ai driven occupational skills generator
Ai driven occupational skills generatorAi driven occupational skills generator
Ai driven occupational skills generatorConference Papers
 
Making isp business profitable using data mining
Making isp business profitable using data miningMaking isp business profitable using data mining
Making isp business profitable using data miningijitcs
 
archenaa2015-survey-big-data-government.pdf
archenaa2015-survey-big-data-government.pdfarchenaa2015-survey-big-data-government.pdf
archenaa2015-survey-big-data-government.pdfAkuhuruf
 
Harmony With Healthstory Clinical Narrative And Structured Data In The Ehr ...
Harmony With Healthstory   Clinical Narrative And Structured Data In The Ehr ...Harmony With Healthstory   Clinical Narrative And Structured Data In The Ehr ...
Harmony With Healthstory Clinical Narrative And Structured Data In The Ehr ...Nick van Terheyden
 
Local and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering ApproachLocal and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering ApproachIRJET Journal
 
Interoperability Between Healthcare Applications
Interoperability Between Healthcare ApplicationsInteroperability Between Healthcare Applications
Interoperability Between Healthcare ApplicationsJohn Gillson
 
Interoperability & standards
Interoperability & standardsInteroperability & standards
Interoperability & standardsJ. Don Soriano
 
Medical Assistant Design during this Pandemic Like Covid-19
Medical Assistant Design during this Pandemic Like Covid-19Medical Assistant Design during this Pandemic Like Covid-19
Medical Assistant Design during this Pandemic Like Covid-19AI Publications
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetIRJET Journal
 

What's hot (18)

Fakebuster fake news detection system using logistic regression technique i...
Fakebuster   fake news detection system using logistic regression technique i...Fakebuster   fake news detection system using logistic regression technique i...
Fakebuster fake news detection system using logistic regression technique i...
 
Implementation of blood donation application using android smartphone
Implementation of blood donation application using android smartphoneImplementation of blood donation application using android smartphone
Implementation of blood donation application using android smartphone
 
DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...
DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...
DE-IDENTIFICATION OF PROTECTED HEALTH INFORMATION PHI FROM FREE TEXT IN MEDIC...
 
Authenticated Medical Documents Releasing with Privacy Protection and Release...
Authenticated Medical Documents Releasing with Privacy Protection and Release...Authenticated Medical Documents Releasing with Privacy Protection and Release...
Authenticated Medical Documents Releasing with Privacy Protection and Release...
 
Health data mining
Health data miningHealth data mining
Health data mining
 
A Study on the Applications and Impact of Artificial Intelligence in E Commer...
A Study on the Applications and Impact of Artificial Intelligence in E Commer...A Study on the Applications and Impact of Artificial Intelligence in E Commer...
A Study on the Applications and Impact of Artificial Intelligence in E Commer...
 
Understanding the Need of Data Integration in E Healthcare
Understanding the Need of Data Integration in E HealthcareUnderstanding the Need of Data Integration in E Healthcare
Understanding the Need of Data Integration in E Healthcare
 
Ai driven occupational skills generator
Ai driven occupational skills generatorAi driven occupational skills generator
Ai driven occupational skills generator
 
B05840510
B05840510B05840510
B05840510
 
Making isp business profitable using data mining
Making isp business profitable using data miningMaking isp business profitable using data mining
Making isp business profitable using data mining
 
archenaa2015-survey-big-data-government.pdf
archenaa2015-survey-big-data-government.pdfarchenaa2015-survey-big-data-government.pdf
archenaa2015-survey-big-data-government.pdf
 
Harmony With Healthstory Clinical Narrative And Structured Data In The Ehr ...
Harmony With Healthstory   Clinical Narrative And Structured Data In The Ehr ...Harmony With Healthstory   Clinical Narrative And Structured Data In The Ehr ...
Harmony With Healthstory Clinical Narrative And Structured Data In The Ehr ...
 
Research-KS-Jun2015
Research-KS-Jun2015Research-KS-Jun2015
Research-KS-Jun2015
 
Local and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering ApproachLocal and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering Approach
 
Interoperability Between Healthcare Applications
Interoperability Between Healthcare ApplicationsInteroperability Between Healthcare Applications
Interoperability Between Healthcare Applications
 
Interoperability & standards
Interoperability & standardsInteroperability & standards
Interoperability & standards
 
Medical Assistant Design during this Pandemic Like Covid-19
Medical Assistant Design during this Pandemic Like Covid-19Medical Assistant Design during this Pandemic Like Covid-19
Medical Assistant Design during this Pandemic Like Covid-19
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease Dataset
 

Similar to Generating Clinical Quality Indicators from Clinical Text

Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...Conference Papers
 
Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...Conference Papers
 
Health management information systems for universal health coverage
Health management information systems for universal health coverageHealth management information systems for universal health coverage
Health management information systems for universal health coverageMEDx eHealthCenter
 
Automated clinical documentation improvement
Automated clinical documentation improvementAutomated clinical documentation improvement
Automated clinical documentation improvementNehal (Neil) Shah
 
Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...Pubrica
 
Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...Pubrica
 
Clinical Decision Support System Essay
Clinical Decision Support System EssayClinical Decision Support System Essay
Clinical Decision Support System EssayMary Brown
 
Health information standars
Health information standarsHealth information standars
Health information standarsRubashkyn
 
Health Informatics- Module 2-Chapter 2.pptx
Health Informatics- Module 2-Chapter 2.pptxHealth Informatics- Module 2-Chapter 2.pptx
Health Informatics- Module 2-Chapter 2.pptxArti Parab Academics
 
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...Kaela Johnson
 
Heart Disease Prediction Using Data Mining
Heart Disease Prediction Using Data MiningHeart Disease Prediction Using Data Mining
Heart Disease Prediction Using Data MiningIRJET Journal
 
Intelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosisIntelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosisIRJET Journal
 
Advantages And Disadvantages Of EHR
Advantages And Disadvantages Of EHRAdvantages And Disadvantages Of EHR
Advantages And Disadvantages Of EHRCarla Jardine
 

Similar to Generating Clinical Quality Indicators from Clinical Text (20)

What is Clinical Data Mining?
What is Clinical Data Mining?What is Clinical Data Mining?
What is Clinical Data Mining?
 
Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...
 
Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...Automated snomed ct mapping of clinical discharge summary data for cardiology...
Automated snomed ct mapping of clinical discharge summary data for cardiology...
 
Health management information systems for universal health coverage
Health management information systems for universal health coverageHealth management information systems for universal health coverage
Health management information systems for universal health coverage
 
Automated clinical documentation improvement
Automated clinical documentation improvementAutomated clinical documentation improvement
Automated clinical documentation improvement
 
Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...
 
Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...Systematic review of quality standards for medical devices and practice measu...
Systematic review of quality standards for medical devices and practice measu...
 
Startup bootcamp 2
Startup bootcamp 2Startup bootcamp 2
Startup bootcamp 2
 
Healthcare terminologies recommendations
Healthcare terminologies recommendationsHealthcare terminologies recommendations
Healthcare terminologies recommendations
 
Clinical Decision Support System Essay
Clinical Decision Support System EssayClinical Decision Support System Essay
Clinical Decision Support System Essay
 
Health information standars
Health information standarsHealth information standars
Health information standars
 
Imr V3
Imr V3Imr V3
Imr V3
 
Health Informatics- Module 2-Chapter 2.pptx
Health Informatics- Module 2-Chapter 2.pptxHealth Informatics- Module 2-Chapter 2.pptx
Health Informatics- Module 2-Chapter 2.pptx
 
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
Automated Generation Of Synoptic Reports From Narrative Pathology Reports In ...
 
Heart Disease Prediction Using Data Mining
Heart Disease Prediction Using Data MiningHeart Disease Prediction Using Data Mining
Heart Disease Prediction Using Data Mining
 
Intelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosisIntelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosis
 
Ex33900906
Ex33900906Ex33900906
Ex33900906
 
Advantages And Disadvantages Of EHR
Advantages And Disadvantages Of EHRAdvantages And Disadvantages Of EHR
Advantages And Disadvantages Of EHR
 
Ex33900906
Ex33900906Ex33900906
Ex33900906
 
Dss
DssDss
Dss
 

More from Conference Papers

Ai driven occupational skills generator
Ai driven occupational skills generatorAi driven occupational skills generator
Ai driven occupational skills generatorConference Papers
 
Advanced resource allocation and service level monitoring for container orche...
Advanced resource allocation and service level monitoring for container orche...Advanced resource allocation and service level monitoring for container orche...
Advanced resource allocation and service level monitoring for container orche...Conference Papers
 
Adaptive authentication to determine login attempt penalty from multiple inpu...
Adaptive authentication to determine login attempt penalty from multiple inpu...Adaptive authentication to determine login attempt penalty from multiple inpu...
Adaptive authentication to determine login attempt penalty from multiple inpu...Conference Papers
 
Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...
Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...
Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...Conference Papers
 
A deployment scenario a taxonomy mapping and keyword searching for the appl...
A deployment scenario   a taxonomy mapping and keyword searching for the appl...A deployment scenario   a taxonomy mapping and keyword searching for the appl...
A deployment scenario a taxonomy mapping and keyword searching for the appl...Conference Papers
 
Automated login method selection in a multi modal authentication - login meth...
Automated login method selection in a multi modal authentication - login meth...Automated login method selection in a multi modal authentication - login meth...
Automated login method selection in a multi modal authentication - login meth...Conference Papers
 
Atomization of reduced graphene oxide ultra thin film for transparent electro...
Atomization of reduced graphene oxide ultra thin film for transparent electro...Atomization of reduced graphene oxide ultra thin film for transparent electro...
Atomization of reduced graphene oxide ultra thin film for transparent electro...Conference Papers
 
An enhanced wireless presentation system for large scale content distribution
An enhanced wireless presentation system for large scale content distribution An enhanced wireless presentation system for large scale content distribution
An enhanced wireless presentation system for large scale content distribution Conference Papers
 
An analysis of a large scale wireless image distribution system deployment
An analysis of a large scale wireless image distribution system deploymentAn analysis of a large scale wireless image distribution system deployment
An analysis of a large scale wireless image distribution system deploymentConference Papers
 
Validation of early testing method for e government projects by requirement ...
Validation of early testing method for e  government projects by requirement ...Validation of early testing method for e  government projects by requirement ...
Validation of early testing method for e government projects by requirement ...Conference Papers
 
The design and implementation of trade finance application based on hyperledg...
The design and implementation of trade finance application based on hyperledg...The design and implementation of trade finance application based on hyperledg...
The design and implementation of trade finance application based on hyperledg...Conference Papers
 
Towards predictive maintenance for marine sector in malaysia
Towards predictive maintenance for marine sector in malaysiaTowards predictive maintenance for marine sector in malaysia
Towards predictive maintenance for marine sector in malaysiaConference Papers
 
The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...
The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...
The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...Conference Papers
 
Searchable symmetric encryption security definitions
Searchable symmetric encryption security definitionsSearchable symmetric encryption security definitions
Searchable symmetric encryption security definitionsConference Papers
 
Super convergence of autonomous things
Super convergence of autonomous thingsSuper convergence of autonomous things
Super convergence of autonomous thingsConference Papers
 
Study on performance of capacitor less ldo with different types of resistor
Study on performance of capacitor less ldo with different types of resistorStudy on performance of capacitor less ldo with different types of resistor
Study on performance of capacitor less ldo with different types of resistorConference Papers
 
Stil test pattern generation enhancement in mixed signal design
Stil test pattern generation enhancement in mixed signal designStil test pattern generation enhancement in mixed signal design
Stil test pattern generation enhancement in mixed signal designConference Papers
 
On premise ai platform - from dc to edge
On premise ai platform - from dc to edgeOn premise ai platform - from dc to edge
On premise ai platform - from dc to edgeConference Papers
 
Review of big data analytics (bda) architecture trends and analysis
Review of big data analytics (bda) architecture   trends and analysis Review of big data analytics (bda) architecture   trends and analysis
Review of big data analytics (bda) architecture trends and analysis Conference Papers
 
Rapid reduction of ultrathin films of graphene oxide on large area silicon su...
Rapid reduction of ultrathin films of graphene oxide on large area silicon su...Rapid reduction of ultrathin films of graphene oxide on large area silicon su...
Rapid reduction of ultrathin films of graphene oxide on large area silicon su...Conference Papers
 

More from Conference Papers (20)

Ai driven occupational skills generator
Ai driven occupational skills generatorAi driven occupational skills generator
Ai driven occupational skills generator
 
Advanced resource allocation and service level monitoring for container orche...
Advanced resource allocation and service level monitoring for container orche...Advanced resource allocation and service level monitoring for container orche...
Advanced resource allocation and service level monitoring for container orche...
 
Adaptive authentication to determine login attempt penalty from multiple inpu...
Adaptive authentication to determine login attempt penalty from multiple inpu...Adaptive authentication to determine login attempt penalty from multiple inpu...
Adaptive authentication to determine login attempt penalty from multiple inpu...
 
Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...
Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...
Absorption spectrum analysis of dentine sialophosphoprotein (dspp) in orthodo...
 
A deployment scenario a taxonomy mapping and keyword searching for the appl...
A deployment scenario   a taxonomy mapping and keyword searching for the appl...A deployment scenario   a taxonomy mapping and keyword searching for the appl...
A deployment scenario a taxonomy mapping and keyword searching for the appl...
 
Automated login method selection in a multi modal authentication - login meth...
Automated login method selection in a multi modal authentication - login meth...Automated login method selection in a multi modal authentication - login meth...
Automated login method selection in a multi modal authentication - login meth...
 
Atomization of reduced graphene oxide ultra thin film for transparent electro...
Atomization of reduced graphene oxide ultra thin film for transparent electro...Atomization of reduced graphene oxide ultra thin film for transparent electro...
Atomization of reduced graphene oxide ultra thin film for transparent electro...
 
An enhanced wireless presentation system for large scale content distribution
An enhanced wireless presentation system for large scale content distribution An enhanced wireless presentation system for large scale content distribution
An enhanced wireless presentation system for large scale content distribution
 
An analysis of a large scale wireless image distribution system deployment
An analysis of a large scale wireless image distribution system deploymentAn analysis of a large scale wireless image distribution system deployment
An analysis of a large scale wireless image distribution system deployment
 
Validation of early testing method for e government projects by requirement ...
Validation of early testing method for e  government projects by requirement ...Validation of early testing method for e  government projects by requirement ...
Validation of early testing method for e government projects by requirement ...
 
The design and implementation of trade finance application based on hyperledg...
The design and implementation of trade finance application based on hyperledg...The design and implementation of trade finance application based on hyperledg...
The design and implementation of trade finance application based on hyperledg...
 
Towards predictive maintenance for marine sector in malaysia
Towards predictive maintenance for marine sector in malaysiaTowards predictive maintenance for marine sector in malaysia
Towards predictive maintenance for marine sector in malaysia
 
The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...
The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...
The new leaed (ii) ion selective electrode on free plasticizer film of pthfa ...
 
Searchable symmetric encryption security definitions
Searchable symmetric encryption security definitionsSearchable symmetric encryption security definitions
Searchable symmetric encryption security definitions
 
Super convergence of autonomous things
Super convergence of autonomous thingsSuper convergence of autonomous things
Super convergence of autonomous things
 
Study on performance of capacitor less ldo with different types of resistor
Study on performance of capacitor less ldo with different types of resistorStudy on performance of capacitor less ldo with different types of resistor
Study on performance of capacitor less ldo with different types of resistor
 
Stil test pattern generation enhancement in mixed signal design
Stil test pattern generation enhancement in mixed signal designStil test pattern generation enhancement in mixed signal design
Stil test pattern generation enhancement in mixed signal design
 
On premise ai platform - from dc to edge
On premise ai platform - from dc to edgeOn premise ai platform - from dc to edge
On premise ai platform - from dc to edge
 
Review of big data analytics (bda) architecture trends and analysis
Review of big data analytics (bda) architecture   trends and analysis Review of big data analytics (bda) architecture   trends and analysis
Review of big data analytics (bda) architecture trends and analysis
 
Rapid reduction of ultrathin films of graphene oxide on large area silicon su...
Rapid reduction of ultrathin films of graphene oxide on large area silicon su...Rapid reduction of ultrathin films of graphene oxide on large area silicon su...
Rapid reduction of ultrathin films of graphene oxide on large area silicon su...
 

Recently uploaded

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

Generating Clinical Quality Indicators from Clinical Text

  • 1. MyHarmony: Generating Statistics from Clinical Text for Monitoring Clinical Quality Indicators Md. Khadzir Sheikh Ahmad1 ; Mohd Syazrin Mohd Sakri1 ; „Ismat Mohd Sulaiman1 ; Syirahaniza Mohd Salleh1 ; Dickson Lukose2 ; Omar Ismail1 ; Abd Aziz Latip3 ; Muhammad Aiman Mazlan3 . 1 Health Informatics Centre, Planning Division, Ministry of Health, Malaysia 2 GCS Agile Pty. Ltd. 3 MIMOS Bhd. Abstract: The Ministry of Health developed MyHarmony with MIMOS Bhd. as part of the Malaysian Health Data Warehouse (MyHDW) initiative. MyHDW aims to be a trusted source of truth of comprehensive health data structured for analysis. MyHarmony fulfills that criteria by enabling data and information from unstructured form to be mined, such as texts, images, and sound. MyHarmony's first deliverable is the ability to mine clinical texts using Natural Language Processing (NLP) with SNOMED CT as its knowledge-base of clinical terms. MyHarmony is the engine the retrieves data and information into computer processable form by assigning SNOMED CT codes, which can then be further analysed statistically. MyHarmony is able to recognise and harmonise different terms that means the same. It also understands context for a more accurate coding; such as negations (no, not known, unknown) and conditionals (past history, symptoms of, previous). Using SNOMED CT, MyHarmony's ability is further advanced by using subsumption technique for a more comprehensive statistical results. This study will present a use case where clinical text from anonymized hospital discharge summaries can generate clinical indicators using MyHarmony for health managers. An added benefit to the operational (hospital) staff is the ability to produce such indicators is an efficient and timely manner by reducing workload for data collection and submission. MyHarmony could be the new and improved way to provide important statistical measures for evidence-based health planning, leading to improved healthcare services and health as a whole. Keywords: MyHarmony; MyHDW; Text mining; Quality Indicators; SNOMED CT 1. Introduction: The Ministry of Health developed MyHarmony with MIMOS Bhd. as part of the Malaysian Health Data Warehouse (MyHDW) initiative. MyHDW aims to be a trusted source of truth of comprehensive health data structured for analysis. MyHarmony is an application in the Malaysian Health Data Warehouse (MyHDW) that aims to analyse semi-structured and unstructured data. The unstructured data can be in the form of free-text, visual, audio and machine generated data. Unstructured data does not have predetermined values and not stored in an organized manner to be analysed by a conventional data warehouse. Therefore, other techniques need to be applied. MyHarmony aims to address this and be included as part of MyHDW. There were three (3) major deliverables in the conceptual stage. The first part refers to the development and implementation of health terminology standards, namely SNOMED CT, which will be the knowledge bases for MyHarmony. The second part was harmonization of the medical terminology to SNOMED CT
  • 2. terms by way of mapping. The last part was about the development and implementation of MyHarmony to show that the application can codify relevant terms in free-text using Natural Language Processing (NLP) technique. The SNOMED CT codified data can then be analysed for information generation. 2. Methodology: The development was first started in 2014 with the development of Cardiology Refset. Cardiology Refset was the terminology reference for the MyHarmony engine during the harmonisation/mapping and codification process. Cardiology Refset (version 1.0) was completed and released in 2014. It is a simple reference set [1] containing about 600 terms related to Cardiology speciality including signs and symptoms, diagnoses, procedures, body structures, medical devices and medications. It was delivered in time to be tested on MyHarmony standalone system to generate National Cardiovascular Disease (NCVD) registries. The draft Refset and method was presented during IHTSDO meetings and Expo in succession on September 2013, October 2013, and April 2014 to gain feedback from experts in the international community. The finalized method was presented during SNOMED CT Expo, October 2014 [2] . The Cardiology Refset was then expanded to include all cardiology related terms and Cardiology Refset v1.1 was completed in July 2016 containing more than 6000 concepts. First, more than 300,000 SNOMED CT concepts (Fully Specified Names) were extracted and reviewed by PIK using eyeballing technique. About 12,000 concepts that were believed to be related to Cardiology specialty was given to the clinicians for review. The clinicians reduced the number of concepts to about 6,000. Additionally, the Refset included local terms and common abbreviations which were mapped to existing concepts. Next, the team utilise MyHarmony to generate the analysis. There were 4 main functions in MyHarmony: 1. Terminology Management– to allow user to upload the SNOMED CT International Release content into MyHarmony, and upload SNOMED CT Refset in reference to the SNOMED CT International Release. 2. Data Management– to allow user to upload the data that will be harmonized and codified. This function also allows user to view the content of the data. 3. Codification Management– to allow user to codify the dataset according to the selected SNOMED CT Refset and view the codified data for validation purpose. 4. Query Management– to allow user to explore the data by generating queries using Structured Query Language (SQL). The functions were arranged according to the work process. First, the SNOMED CT International Release, SNOMED CT Cardiology Refset, and the dataset needs to be uploaded. Then, the dataset is codified and saved. Using the Query Management, the codified data can then be explored via data profiling and query generation. For initial analysis, SNOMED CT International release version 20160731 was used as the Cardiology Reference Set was developed using this version of SNOMED CT.
  • 3. The team received a set of database from a hospital with cardiology service which consists of 16224 discharge summaries from year 2017. The database was then uploaded into MyHarmony. The personally identified information (patient names, ID, and street address) were anonymised prior to codification and analysis. The output is a codified dataset, which enable information processing and analysis by machines. Using the Query Management, the codified data was then be explored via data profiling and query generation. 3. Result: The team conducted several data profiling queries to ensure that MyHarmony were able to capture the data correctly. For example, the number of records by month between Raw data (MyHarmony without SNOMED CT) and Harmonized data (MyHarmony with SNOMED CT) should return the same result. Other examples of data profiling queries are the number of records by gender, by specialty, and by ethnicity. Next, the team developed queries required by the National Cardiovascular Disease (NCVD) registries and compare the results with published registry reports. For example, querying the number of Ischaemic Heart Disease (IHD) by gender shows 1:4 female to male ratio, which is a similar ratio in the registry reports. Furthermore, the query also shows that Harmonized data captures more result compare to Raw data due to SNOMED CT relationship structure, thus capturing all the subtypes of IHD and its synonyms or ways of writing. The registry, however, only captures three (3) diagnosis due to its structured format, which are ST Elevation Myocardial Infarction, Non-ST Elevation Myocardial Infarction, and Unstable Angina. This trend and pattern comparison allow validation by the Clinicians and gains their buy-in in using MyHarmony. The team also tried to generate more queries required by the NCVD registry. However, it was limited by the documentation in the discharge summary. Registry queries requires more detail information that may often not documented in a discharge summary, such as information on smoking status and complications of procedures. After that, the team was tasked to generate National Cardiology Key Performance Indicators (KPIs). MyHarmony are able to generate 7 out of the 8 KPIs (KPI 2 to 8). The first KPI was excluded because the data are available at the clinic and not documented in inpatient discharge summaries. The Health Information Framework (HIF) was developed for the 7 KPIs, which detailed out the inclusion and exclusion criteria, the target, the formula, the terms used by MyHarmony, and query, and lastly a section for additional notes. Preliminary manual validation on the completeness and accuracy of codified data shows 90% precision and 70% recall. The content of those records is complex as it does not follow grammar rules, and contains a large number of short forms, abbreviations, acronyms and analogous terms (e.g., synd, ACS, CCS IV, NYHA 2). One example of record is “2VD with RCA culprit lesion - Ad hoc PCI DES to RCA and LAD” which is challenging to codify using approaches based on strict grammar. The revised version of MyHarmony uses a different approach based on shallow parsing and the consideration of multiple
  • 4. suitable combinations of words in a sentence. With further iterations and improvement in the mapping, these challenges were overcome[3]. From the SNOMED CT codified database, the system was able to show a more accurate result during analysis . This is because MyHarmony capitalises on the existing SNOMED CT relationships structure between concepts. In this case, when querying “Number of Ischaemic Heart Disease cases per year”, MyHarmony search the code and term for “Ischaemic heart disease”, its synonyms and accepted abbreviations, and all the subtypes of Ischaemic heart disease such as all subtypes of “Myocardial infarction” and “Angina”. MyHarmony aggregates these records resulting in a more accurate analysis. Usually, the result where MyHarmony utilise SNOMED CT‟s relationship structure would show more records. This is because Mi-Harmony was able to aggregate data not just through String match, but also utilize the IS-A hierarchy structure in SNOMED CT. For example, querying “Ischemic heart disease” will gather clinical records with synonymous terms like “Ischaemic heart disease” and “IHD”; as well as clinical records containing all the subtypes of “Ischemic heart disease” such as “Myocardial Infarction” and “Unstable angina”. Context awareness such as negation and pasts events were also applied. For example, the term “No chest pain”, “No known history of diabetes mellitus”, and “Symptoms of heart failure” will not be coded as the presenting condition. Additionally, terms like “Previous history of”, “Previous admission of”, and “Family history of” within the same sentence as a clinical condition will not be coded as the current condition for the record. 4. Discussion and Conclusion: When showcasing these abilities to the clinicians, the team agreed that MyHarmony was able to: (i) Generate more information from free-text utilising the SNOMED CT structure, thus, reducing the effort needed to collect data in a structured manner such as in a registry and indicator reports; (ii) Able to generate new information by retrospectively running new queries on old discharge summary records; thus, reducing the effort and time to collect data in a prospective manner when new questions arise, such as for indicator reports that often change on a yearly basis; (iii) Able to deliver information in a timelier fashion; thus clinicians and health managers are able to plan and take action without waiting for a 1 to 3 yearly report; (iv) Improve documentation of clinicians when they are aware of MyHarmony‟s ability during roadshows. Generating indicators for monitoring and evaluation can be a burden even for healthcare facilities equipped with EHR. Conventionally, collecting data for indicators requires multiple data entries in aggregated manner, with manual submission to central agencies, where the results are only published on a yearly basis. Introducing MyHarmony may reduce these burdens. Capturing data from the source in an automated way, i.e. free text documented by doctors, would reduce duplication of work and the amount of resources to capture the data into manual form. Having the data in granular form would allow a more
  • 5. dynamic analysis and prevents dishonesty. Information required, whether old or new information, can be formulated and disseminated back to the clinicians and health managers in a timelier fashion. MyHarmony has the potential to expand further in its implementation and technology. However, there are still challenges to be addressed. Currently, MyHarmony has been developed to mine free-text for Cardiology via a back-end approach. It uses a single version of SNOMED CT International. The team is still researching the best approach to manage SNOMED CT versions and its codified data, which may impact the resulting analysis in an inconsistent way. The team are also seeking international experience for this matter. Other challenges include researching a more efficient and effective method to develop SNOMED CT Refsets. The initial method by referencing terms required by registries or indicators has been established. However, expanding the SNOMED CT Refsets to include relevant terms for a specific clinical specialty or domain needs to be refined further. Eye-balling technique to search the entire SNOMED CT content have its strength and weaknesses. Even though it is a very thorough method, there are possibilities of missed terms and very time consuming. Despite these challenges, the journey in developing MyHarmony and the lessons learnt has allowed the team to refine the methods and processes to expand the use of MyHarmony to other clinical specialties. Analysis from unstructured data would hope to complement analysis from structured data (like census and registries), with the additional benefit workload reduction to provide timelier, trusted, and dynamic information. References: 1. SNOMED CT Simple Reference Set: https://confluence.ihtsdotools.org/display/DOCRFSPG/5.1.+Simple+Reference+Set 2. Mohd Sulaiman I, Sheikh Ahmad MK. SNOMED CT Cardiology Reference Set Development, Malaysia. Proceedings of the SNOMED CT Implementation Showcase [Internet]. Amsterdam: IHTSDO; 2014. Retrieved from: http://ihtsdo.org/fileadmin/user_upload/doc/showcase/show14/SnomedCtShowcase2014_Abstract_14058 .pdf 3. Abdul Manaf NA, Mohamed K, Lukose D. Harmonizing EHR Databases with SNOMED CT. Proceedings of the SNOMED CT Implementation Showcase 2014 [Internet]. Amsterdam: IHTSDO; 2014. https://confluence.ihtsdotools.org/display/FT/SNOMED+CT+Implementation+Showcase+2014