SlideShare a Scribd company logo
ARTIFICIAL INTELLIGENCE
DIAGNOSIS
(AID)
Group Members: Supervisor: Dr. Muhammad Sarim
Mahira Akhtar
Aly Akbar Ali Hirji
Hammad Ahmed
Muhammad Hassan Siddiqui
18 December 2017 1
OBJECTIVES
• Assist medical professionals in diagnosis
• Predict probable disease and diagnosis
• Provide personalized healthcare to patients
2
MOTIVATION & BACKGROUND
• Too many patients but very few doctors
• Doctors short on time and overlook details
• Lab tests end up in false diagnosis
• Diagnosis is dependent on Doctor’s mood
3
MOTIVATION & BACKGROUND
• EMR data is not utilized properly
– Patient’s personal information and medical history
not taken in account
– Patients are often prescribed unnecessary tests
• Demographic characteristics ignored
– Existing expert systems do not take them into account
– These account for significant differences in baselines
4
METHODOLOGY
• Extract rules from data provided by UMDC
– This process will make use of Data mining methods
such as Neural Fuzzy learners
• Extract rules from medical literature
– Online repositories such as PubMed, Medscape, and
Wikipedia
– Crawl data from them using web crawlers such as
PHPcrawl
• Take baseline differences in account during rule
generation.
5
METHODOLOGY
● Generated rules will be accessible to doctors
–Through an excel spreadsheet containing results
values of lab tests
–Rules presented in a table with each row
denoting test result parameter values for each
disease
–Doctors could add and edit parameter values and
diseases without need for any programming skills
● The rules will then be converted into XML for
updating the expert system
6
METHODOLOGY
• Ranked list of possible diseases based on rules
and scoring
• Storage and retrieval of previous diagnosis of
patients to improve accuracy of prediction
7
SYSTEM DESIGN OVERVIEW
8
EXTENSIONS
• Use of Symptoms during the prediction
• Medical Analysis based on demographic characteristics such
as gender, residential address etc.
• Integration of expert system with an existing Hospital EMR
• Risk monitoring system to identify patients at risk
9
DATA UNDERSTANDING
▪ The Blood Test Data provided by UMDC contains about 200,000 records
▪ Multiple test of about 54,000 patients
▪ Out of these, diagnosis of only 3000 is recorded
▪ Patient Tests:
10
Test
Code
Test name Normal values range
1 Haemoglobin 11.5 – 18 (mg/dl)
17 Urea 10 – 50 (mg%)
18 Creatinine 0.5 – 1.5 (mg%)
25 Potassium 3.8 – 5.2 (ME q/L)
47 Glucose Fasting 70 – 110 (mg%)
48 Glucose Random 80 – 180 (mg%)
DATA UNDERSTANDING
11
•Actual Data – Test Results Table
DATA UNDERSTANDING
•Actual Data – Vitals Table
12
DATA UNDERSTANDING
•Actual Data – Diagnosis Table
13
DATA UNDERSTANDING
•Correlation Matrix
14
DATA UNDERSTANDING
• Problems with the data
― Multiple diagnosis of patients at the same date and time
― Test codes inconsistent with the test names
e.g. Haemoglobin records are classified under test code 1 and most of the
Glucose (fasting) records are classified under test code 47. However, a few of
the Glucose (fasting) records are misclassified under test code 1
― Some of the test names are not consistent
e.g Haemoglobin test name is recorded as “Haemoglobin”, “Hb”, and
“Haemoglobin %”
― Human Errors in data entry. E.g. Temperature recorded as 980 *F (prob he
was trying to record 98.0)
15
DATA UNDERSTANDING
•Problems with the data
16
DATA UNDERSTANDING
•Problems with the data
– Multiple test results values are recorded against the same registration number and the same
date and time.
17
DATA UNDERSTANDING
–Test Value Inconsistency- above 800 cells found with text such as ‘127 (AFTER GLOCOUSE 01
HR)’ and ‘AFTER 75GRM GLOCOUSE 01HR (92)’
18
DATA UNDERSTANDING
–Test Code and Test Name inconsistency problem solved by Excel formulas such
as:=IF(OR(P2="Haemoglobin %",P2="Hb"),"Haemoglobin",P2)
–And
=IF(N2="true",(MID(L2,SEARCH("(",L2)+1,SEARCH(")",L2,SEARCH("(",L2)+1)-SEARCH("(",L2)-1)),N2)
19
DATA UNDERSTANDING
•Pivot Table: Test Codes as Columns, grouped by date and regno
20
DATA CLEANING
•Handling Missing Values
21
MODELLING – Naïve Bayes
22
DATA CLEANING
•Binning: Out of range column added for test values using normal ranges
23
DATA CLEANING
•Handling missing values: Since a patient whose test reports are cleared will have normal test range
values. So we handled those missing values by inserting the average of normal test range values
24
KNIME Workflows
25
KNIME Workflows
26
CONCLUSION
• Aim to build a Medical Expert System to assist medical
professionals especially doctors in diagnosis
• Want to make medical literature as a direct support for
diagnosis
• Want to allow patients to be provided personalised treatment
using their medical history
• Wish to serve the medical community as Computer Scientists,
considering the field’s interdisciplinary nature
27
QUESTIONS?
28

More Related Content

Similar to Fyp presentation (1)

Presentation
PresentationPresentation
Presentation
CHANDRAJEETJHA4
 
6-Presentation shared.pdf
6-Presentation shared.pdf6-Presentation shared.pdf
6-Presentation shared.pdf
RehamAbuzaid1
 
6 Epilepsy Use Cases for NLP
6 Epilepsy Use Cases for NLP6 Epilepsy Use Cases for NLP
6 Epilepsy Use Cases for NLP
CitiusTech
 
Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)
Bioinformatics and Computational Biosciences Branch
 
MedDRA Coding - Katalyst HLS
MedDRA Coding - Katalyst HLSMedDRA Coding - Katalyst HLS
MedDRA Coding - Katalyst HLS
Katalyst HLS
 
1Big Data Analytics forHealthcareChandan K. ReddyD.docx
1Big Data Analytics forHealthcareChandan K. ReddyD.docx1Big Data Analytics forHealthcareChandan K. ReddyD.docx
1Big Data Analytics forHealthcareChandan K. ReddyD.docx
aulasnilda
 
Clinical Validation of Copy Number Variants Using the AMP Guidelines
Clinical Validation of Copy Number Variants Using the AMP GuidelinesClinical Validation of Copy Number Variants Using the AMP Guidelines
Clinical Validation of Copy Number Variants Using the AMP Guidelines
Golden Helix
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Luis Marco Ruiz
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Luis Marco Ruiz
 
Research team meeting mrs
Research team meeting mrsResearch team meeting mrs
Research team meeting mrs
Marion Sills
 
BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...
BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...
BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...
BioGearsEngine
 
Readmission of Diabetes Patients Report
Readmission of Diabetes Patients ReportReadmission of Diabetes Patients Report
Readmission of Diabetes Patients Report
Hong Lu
 
The End of the Drug Development Casino?
The End of the Drug Development Casino?The End of the Drug Development Casino?
The End of the Drug Development Casino?
Paul Agapow
 
Data-driven Disease Phenotyping and Bulk Learning
Data-driven Disease Phenotyping and Bulk LearningData-driven Disease Phenotyping and Bulk Learning
Data-driven Disease Phenotyping and Bulk Learning
Po-Hsiang (Barnett) Chiu
 
A Value-Based Approach to Clinical Pathology and Informatics
A Value-Based Approach to Clinical Pathology and InformaticsA Value-Based Approach to Clinical Pathology and Informatics
A Value-Based Approach to Clinical Pathology and Informatics
Cirdan
 
Natural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsNatural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health Records
MMS Holdings
 
Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...
Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...
Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...
NHS England
 
Metabolic Profiling_techniques and approaches.ppt
Metabolic Profiling_techniques and approaches.pptMetabolic Profiling_techniques and approaches.ppt
Metabolic Profiling_techniques and approaches.ppt
Sachin Teotia
 
Metabolic Profiling: Limitations, Challenges.ppt
Metabolic Profiling: Limitations, Challenges.pptMetabolic Profiling: Limitations, Challenges.ppt
Metabolic Profiling: Limitations, Challenges.ppt
Sachin Teotia
 
predictionofheartdiseaseusingmachinelearning.pdf
predictionofheartdiseaseusingmachinelearning.pdfpredictionofheartdiseaseusingmachinelearning.pdf
predictionofheartdiseaseusingmachinelearning.pdf
DasariSeshadri
 

Similar to Fyp presentation (1) (20)

Presentation
PresentationPresentation
Presentation
 
6-Presentation shared.pdf
6-Presentation shared.pdf6-Presentation shared.pdf
6-Presentation shared.pdf
 
6 Epilepsy Use Cases for NLP
6 Epilepsy Use Cases for NLP6 Epilepsy Use Cases for NLP
6 Epilepsy Use Cases for NLP
 
Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)
 
MedDRA Coding - Katalyst HLS
MedDRA Coding - Katalyst HLSMedDRA Coding - Katalyst HLS
MedDRA Coding - Katalyst HLS
 
1Big Data Analytics forHealthcareChandan K. ReddyD.docx
1Big Data Analytics forHealthcareChandan K. ReddyD.docx1Big Data Analytics forHealthcareChandan K. ReddyD.docx
1Big Data Analytics forHealthcareChandan K. ReddyD.docx
 
Clinical Validation of Copy Number Variants Using the AMP Guidelines
Clinical Validation of Copy Number Variants Using the AMP GuidelinesClinical Validation of Copy Number Variants Using the AMP Guidelines
Clinical Validation of Copy Number Variants Using the AMP Guidelines
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
 
Research team meeting mrs
Research team meeting mrsResearch team meeting mrs
Research team meeting mrs
 
BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...
BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...
BioGears Overview for SSIH Healthcare Systems Modeling & Simulation Affinity ...
 
Readmission of Diabetes Patients Report
Readmission of Diabetes Patients ReportReadmission of Diabetes Patients Report
Readmission of Diabetes Patients Report
 
The End of the Drug Development Casino?
The End of the Drug Development Casino?The End of the Drug Development Casino?
The End of the Drug Development Casino?
 
Data-driven Disease Phenotyping and Bulk Learning
Data-driven Disease Phenotyping and Bulk LearningData-driven Disease Phenotyping and Bulk Learning
Data-driven Disease Phenotyping and Bulk Learning
 
A Value-Based Approach to Clinical Pathology and Informatics
A Value-Based Approach to Clinical Pathology and InformaticsA Value-Based Approach to Clinical Pathology and Informatics
A Value-Based Approach to Clinical Pathology and Informatics
 
Natural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsNatural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health Records
 
Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...
Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...
Data in genomics: Dr Richard Scott, Clinical Lead for Rare Disease, 100,000 G...
 
Metabolic Profiling_techniques and approaches.ppt
Metabolic Profiling_techniques and approaches.pptMetabolic Profiling_techniques and approaches.ppt
Metabolic Profiling_techniques and approaches.ppt
 
Metabolic Profiling: Limitations, Challenges.ppt
Metabolic Profiling: Limitations, Challenges.pptMetabolic Profiling: Limitations, Challenges.ppt
Metabolic Profiling: Limitations, Challenges.ppt
 
predictionofheartdiseaseusingmachinelearning.pdf
predictionofheartdiseaseusingmachinelearning.pdfpredictionofheartdiseaseusingmachinelearning.pdf
predictionofheartdiseaseusingmachinelearning.pdf
 

More from KayDrive

Microfluidic Flow Control using Magnetohydrodynamics
Microfluidic Flow Control using  Magnetohydrodynamics Microfluidic Flow Control using  Magnetohydrodynamics
Microfluidic Flow Control using Magnetohydrodynamics
KayDrive
 
4 city spot
4 city spot4 city spot
4 city spot
KayDrive
 
Ring Spinning Machine
Ring Spinning Machine Ring Spinning Machine
Ring Spinning Machine
KayDrive
 
FYP presentation - Saad
FYP presentation - SaadFYP presentation - Saad
FYP presentation - Saad
KayDrive
 
Pegasus fyp presentation
Pegasus fyp presentationPegasus fyp presentation
Pegasus fyp presentation
KayDrive
 
V track ppt
V track pptV track ppt
V track ppt
KayDrive
 
Presentation (1)qq
Presentation (1)qqPresentation (1)qq
Presentation (1)qq
KayDrive
 
Fyp presentationa
Fyp presentationaFyp presentationa
Fyp presentationa
KayDrive
 
Presentation
PresentationPresentation
Presentation
KayDrive
 
Go spark presentation fyp fall 2017
Go spark presentation  fyp fall 2017Go spark presentation  fyp fall 2017
Go spark presentation fyp fall 2017
KayDrive
 
Presentation
PresentationPresentation
Presentation
KayDrive
 
Presentationa
PresentationaPresentationa
Presentationa
KayDrive
 
Fyp final presentation
Fyp final presentationFyp final presentation
Fyp final presentation
KayDrive
 
Shift presentation
Shift presentationShift presentation
Shift presentation
KayDrive
 
Final year project
Final year projectFinal year project
Final year project
KayDrive
 
Presentation
PresentationPresentation
Presentation
KayDrive
 
Project presentation
Project presentationProject presentation
Project presentation
KayDrive
 
Fyp presentation
Fyp presentationFyp presentation
Fyp presentation
KayDrive
 
Marketmizer
MarketmizerMarketmizer
Marketmizer
KayDrive
 
Pegasus fyp presentation
Pegasus fyp presentationPegasus fyp presentation
Pegasus fyp presentation
KayDrive
 

More from KayDrive (20)

Microfluidic Flow Control using Magnetohydrodynamics
Microfluidic Flow Control using  Magnetohydrodynamics Microfluidic Flow Control using  Magnetohydrodynamics
Microfluidic Flow Control using Magnetohydrodynamics
 
4 city spot
4 city spot4 city spot
4 city spot
 
Ring Spinning Machine
Ring Spinning Machine Ring Spinning Machine
Ring Spinning Machine
 
FYP presentation - Saad
FYP presentation - SaadFYP presentation - Saad
FYP presentation - Saad
 
Pegasus fyp presentation
Pegasus fyp presentationPegasus fyp presentation
Pegasus fyp presentation
 
V track ppt
V track pptV track ppt
V track ppt
 
Presentation (1)qq
Presentation (1)qqPresentation (1)qq
Presentation (1)qq
 
Fyp presentationa
Fyp presentationaFyp presentationa
Fyp presentationa
 
Presentation
PresentationPresentation
Presentation
 
Go spark presentation fyp fall 2017
Go spark presentation  fyp fall 2017Go spark presentation  fyp fall 2017
Go spark presentation fyp fall 2017
 
Presentation
PresentationPresentation
Presentation
 
Presentationa
PresentationaPresentationa
Presentationa
 
Fyp final presentation
Fyp final presentationFyp final presentation
Fyp final presentation
 
Shift presentation
Shift presentationShift presentation
Shift presentation
 
Final year project
Final year projectFinal year project
Final year project
 
Presentation
PresentationPresentation
Presentation
 
Project presentation
Project presentationProject presentation
Project presentation
 
Fyp presentation
Fyp presentationFyp presentation
Fyp presentation
 
Marketmizer
MarketmizerMarketmizer
Marketmizer
 
Pegasus fyp presentation
Pegasus fyp presentationPegasus fyp presentation
Pegasus fyp presentation
 

Recently uploaded

How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 

Recently uploaded (20)

How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 

Fyp presentation (1)

  • 1. ARTIFICIAL INTELLIGENCE DIAGNOSIS (AID) Group Members: Supervisor: Dr. Muhammad Sarim Mahira Akhtar Aly Akbar Ali Hirji Hammad Ahmed Muhammad Hassan Siddiqui 18 December 2017 1
  • 2. OBJECTIVES • Assist medical professionals in diagnosis • Predict probable disease and diagnosis • Provide personalized healthcare to patients 2
  • 3. MOTIVATION & BACKGROUND • Too many patients but very few doctors • Doctors short on time and overlook details • Lab tests end up in false diagnosis • Diagnosis is dependent on Doctor’s mood 3
  • 4. MOTIVATION & BACKGROUND • EMR data is not utilized properly – Patient’s personal information and medical history not taken in account – Patients are often prescribed unnecessary tests • Demographic characteristics ignored – Existing expert systems do not take them into account – These account for significant differences in baselines 4
  • 5. METHODOLOGY • Extract rules from data provided by UMDC – This process will make use of Data mining methods such as Neural Fuzzy learners • Extract rules from medical literature – Online repositories such as PubMed, Medscape, and Wikipedia – Crawl data from them using web crawlers such as PHPcrawl • Take baseline differences in account during rule generation. 5
  • 6. METHODOLOGY ● Generated rules will be accessible to doctors –Through an excel spreadsheet containing results values of lab tests –Rules presented in a table with each row denoting test result parameter values for each disease –Doctors could add and edit parameter values and diseases without need for any programming skills ● The rules will then be converted into XML for updating the expert system 6
  • 7. METHODOLOGY • Ranked list of possible diseases based on rules and scoring • Storage and retrieval of previous diagnosis of patients to improve accuracy of prediction 7
  • 9. EXTENSIONS • Use of Symptoms during the prediction • Medical Analysis based on demographic characteristics such as gender, residential address etc. • Integration of expert system with an existing Hospital EMR • Risk monitoring system to identify patients at risk 9
  • 10. DATA UNDERSTANDING ▪ The Blood Test Data provided by UMDC contains about 200,000 records ▪ Multiple test of about 54,000 patients ▪ Out of these, diagnosis of only 3000 is recorded ▪ Patient Tests: 10 Test Code Test name Normal values range 1 Haemoglobin 11.5 – 18 (mg/dl) 17 Urea 10 – 50 (mg%) 18 Creatinine 0.5 – 1.5 (mg%) 25 Potassium 3.8 – 5.2 (ME q/L) 47 Glucose Fasting 70 – 110 (mg%) 48 Glucose Random 80 – 180 (mg%)
  • 11. DATA UNDERSTANDING 11 •Actual Data – Test Results Table
  • 12. DATA UNDERSTANDING •Actual Data – Vitals Table 12
  • 13. DATA UNDERSTANDING •Actual Data – Diagnosis Table 13
  • 15. DATA UNDERSTANDING • Problems with the data ― Multiple diagnosis of patients at the same date and time ― Test codes inconsistent with the test names e.g. Haemoglobin records are classified under test code 1 and most of the Glucose (fasting) records are classified under test code 47. However, a few of the Glucose (fasting) records are misclassified under test code 1 ― Some of the test names are not consistent e.g Haemoglobin test name is recorded as “Haemoglobin”, “Hb”, and “Haemoglobin %” ― Human Errors in data entry. E.g. Temperature recorded as 980 *F (prob he was trying to record 98.0) 15
  • 17. DATA UNDERSTANDING •Problems with the data – Multiple test results values are recorded against the same registration number and the same date and time. 17
  • 18. DATA UNDERSTANDING –Test Value Inconsistency- above 800 cells found with text such as ‘127 (AFTER GLOCOUSE 01 HR)’ and ‘AFTER 75GRM GLOCOUSE 01HR (92)’ 18
  • 19. DATA UNDERSTANDING –Test Code and Test Name inconsistency problem solved by Excel formulas such as:=IF(OR(P2="Haemoglobin %",P2="Hb"),"Haemoglobin",P2) –And =IF(N2="true",(MID(L2,SEARCH("(",L2)+1,SEARCH(")",L2,SEARCH("(",L2)+1)-SEARCH("(",L2)-1)),N2) 19
  • 20. DATA UNDERSTANDING •Pivot Table: Test Codes as Columns, grouped by date and regno 20
  • 23. DATA CLEANING •Binning: Out of range column added for test values using normal ranges 23
  • 24. DATA CLEANING •Handling missing values: Since a patient whose test reports are cleared will have normal test range values. So we handled those missing values by inserting the average of normal test range values 24
  • 27. CONCLUSION • Aim to build a Medical Expert System to assist medical professionals especially doctors in diagnosis • Want to make medical literature as a direct support for diagnosis • Want to allow patients to be provided personalised treatment using their medical history • Wish to serve the medical community as Computer Scientists, considering the field’s interdisciplinary nature 27