1
DATA MINING IN HEALTHCARE
Dr. V. Subha, B.E., M.E., Ph.D.,
Assistant Professor,
Department of Computer Science & Engineering,
Manonmaniam Sundaranar University,
Tirunelveli.
2
CONTENTS
1. Introduction to Data Mining
2. Data Mining techniques
3. Data Mining in Healthcare
4. Data Mining Resources
3
1. INTRODUCTION TO DATA MINING
4
5
6
Data Vs Knowledge
7
9
Definition of Data Mining
• Process of identifying valid, novel,
potentially useful, and ultimately
understandable patterns in data.
- Predicts outcomes of future
observations.
10
Confluence of different disciplines
11
Steps in Data Mining Evolution
12
Data Mining - Motivation
• Growth in data.
• High dimensionality of data.
• Heterogeneous and complex data.
• Development of commercial data mining software.
• Growth of computing power and storage capacity.
• Limits of human ability to analyze large volumes of data manually.
13
Why Data Mining?
• Credit ratings:
- Given a database of 100,000 names, which persons are the least likely to default on their credit cards?
• Fraud detection:
- Which types of transactions are likely to be fraudulent, given the demographics and transactional history of a particular customer?
• Customer relationship management:
- Which of my customers are likely to be the most loyal, and which are most likely to leave for a competitor?
Data mining helps extract such information.
14
Data Mining for decision making
15
Types of data analysis
Predictive models help diagnose diseases and then identify a suitable treatment path.
16
Knowledge Discovery in Databases (KDD)
17
Steps of KDD Process
1. Selection - Data relevant to the analysis is decided and retrieved from the database.
2. Data cleansing - Noisy data and missing data are handled appropriately.
3. Transformation - Processed data is transformed into appropriate forms for mining.
4. Data mining - Clever techniques are applied to extract patterns that are potentially useful.
5. Evaluation - Mined data patterns are evaluated.
18
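The KDD steps can be sketched as a chain of small functions. This is a minimal illustration under stated assumptions, not a prescribed implementation: the records, the field names, the "drop rows with missing values" cleaning policy, the age band at 53, and the `min_support` threshold are all hypothetical choices made for the example.

```python
# Hypothetical patient records; None marks a missing value.
RAW = [
    {"id": 1, "age": 63, "chest_pain": "atyp_angina", "disease": "yes"},
    {"id": 2, "age": None, "chest_pain": "asympt", "disease": "yes"},
    {"id": 3, "age": 37, "chest_pain": "non_anginal", "disease": "no"},
    {"id": 4, "age": 41, "chest_pain": "atyp_angina", "disease": "no"},
    {"id": 5, "age": 62, "chest_pain": "asympt", "disease": "yes"},
]

def select(records, fields):
    """Selection: keep only the fields relevant to the analysis."""
    return [{f: r[f] for f in fields} for r in records]

def clean(records):
    """Cleansing: drop records with missing values (one simple policy among many)."""
    return [r for r in records if all(v is not None for v in r.values())]

def transform(records):
    """Transformation: derive a coarse age band that is easy to count."""
    return [
        {**r, "age_band": "over_53" if r["age"] > 53 else "53_or_under"}
        for r in records
    ]

def mine(records):
    """Mining: count how often each age band co-occurs with each outcome."""
    counts = {}
    for r in records:
        key = (r["age_band"], r["disease"])
        counts[key] = counts.get(key, 0) + 1
    return counts

def evaluate(counts, min_support=2):
    """Evaluation: keep only patterns seen at least min_support times."""
    return {k: v for k, v in counts.items() if v >= min_support}

selected = select(RAW, ["age", "chest_pain", "disease"])
patterns = evaluate(mine(transform(clean(selected))))
print(patterns)
```

Each function maps to one step, so a different cleaning policy or mining technique can be swapped in without touching the others.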
Applications of data mining
Popular applications of data mining
19
2. DATA MINING TECHNIQUES
20
Data Mining Techniques
21
Data Mining Techniques…
• Unsupervised learning
- No knowledge of output
- Self-guided learning algorithm.
• Supervised learning
- Knowledge of output
- Learning with an expert/teacher
22
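The contrast between the two learning styles can be shown with a toy sketch. The ages and labels below are hypothetical, nearest-neighbour lookup stands in for supervised learning, and cutting a sorted list at its widest gap is only a simplified stand-in for clustering, not a standard named algorithm.

```python
def nearest_neighbour(train, x):
    """Supervised: predict the label of the closest labelled example."""
    return min(train, key=lambda pair: abs(pair[0] - x))[1]

def split_at_largest_gap(values):
    """Unsupervised: form two groups by cutting at the widest gap; no labels are used."""
    ordered = sorted(values)
    gaps = [b - a for a, b in zip(ordered, ordered[1:])]
    cut = gaps.index(max(gaps)) + 1
    return ordered[:cut], ordered[cut:]

# Supervised: each age comes with a known output (the "teacher").
labelled = [(37, "no"), (41, "no"), (62, "yes"), (63, "yes"), (67, "yes")]
label = nearest_neighbour(labelled, 60)

# Unsupervised: the same ages without any output labels.
groups = split_at_largest_gap([37, 41, 62, 63, 67])

print(label)
print(groups)
```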
Data Mining Techniques…
23
Classification – Prediction Model
Data Mining Algorithms: Classification, Prediction
• Process of finding a model that describes the data classes or concepts.
• Purpose is to use this model to predict the class of instances whose class
label is unknown.
• This derived model is based on the analysis of sets of training data.
24
Example Dataset
25
Dataset Splitting
26
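A simple holdout split can be sketched as follows. The slide does not state a split ratio, so the 70/30 proportion, the fixed seed, and the function name `train_test_split` are assumptions made for illustration.

```python
import random

def train_test_split(rows, test_fraction=0.3, seed=0):
    """Shuffle a copy of the rows and hold out a fraction for testing."""
    rng = random.Random(seed)      # fixed seed so the split is repeatable
    shuffled = rows[:]             # copy, so the caller's list is not reordered
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

rows = list(range(10))             # stand-in for ten dataset records
train, test = train_test_split(rows)
print(len(train), len(test))
```

The training part is used to build the model and the held-out part is used only to check its predictions.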
27
Model Construction
Training Data:
AGE  GENDER  CHEST PAIN   DISEASE
63   Male    atyp_angina  yes
67   Male    asympt       yes
67   Male    asympt       yes
37   Male    non_anginal  no
41   Female  atyp_angina  no
62   Female  asympt       yes
Classification Algorithms
Classifier (Model):
IF chest pain = ‘asympt’ OR age > 53 THEN disease = ‘yes’
28
Using the Model in Prediction
Classifier (Model)
Testing Data: (50, male, asympt)
Disease?
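The rule learned on the Model Construction slide can be written as a small function and applied to the test record. This is a sketch: the function name is hypothetical, and returning "no" when the rule does not fire is an assumption, since the slide only states the "yes" branch.

```python
def predict_disease(age, gender, chest_pain):
    """Apply the slide's rule; gender is accepted but not used by this rule."""
    if chest_pain == "asympt" or age > 53:
        return "yes"
    return "no"  # assumption: the slide gives no explicit "no" branch

# Training rows from the Model Construction slide.
training = [
    (63, "Male", "atyp_angina", "yes"),
    (67, "Male", "asympt", "yes"),
    (67, "Male", "asympt", "yes"),
    (37, "Male", "non_anginal", "no"),
    (41, "Female", "atyp_angina", "no"),
    (62, "Female", "asympt", "yes"),
]

# The rule reproduces every training label.
matches = [predict_disease(a, g, c) == d for a, g, c, d in training]

# The test record from this slide.
answer = predict_disease(50, "male", "asympt")
print(all(matches), answer)  # prints: True yes
```

The test patient is 50, so the age condition does not fire, but the chest pain value 'asympt' does, and the model predicts disease = 'yes'.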
Classification Techniques
• Neural Networks
• Bayesian Networks
• Decision Trees
• Support Vector Machines
29
Confusion Matrix
• Contains information about actual and predicted classifications
done by a classification system.
• Performance of such systems is commonly evaluated using the
data in the confusion matrix.
30
Confusion Matrix…
• TP – Patient with heart disease correctly identified as having heart disease.
• FP – Patient without heart disease wrongly identified as having heart disease.
• FN – Patient with heart disease wrongly identified as healthy, and so left out of treatment.
• TN – Patient without heart disease correctly identified as not having heart disease.

                     Actual Positive       Actual Negative
Predicted Positive   True Positive (TP)    False Positive (FP)
Predicted Negative   False Negative (FN)   True Negative (TN)
31
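The four cells of the confusion matrix can be counted directly from paired actual and predicted labels. The label lists below are hypothetical and only serve to exercise each cell; the function name is likewise an illustration.

```python
def confusion_counts(actual, predicted, positive="yes"):
    """Count TP, FP, FN and TN for a two-class problem."""
    tp = fp = fn = tn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1   # has the disease, predicted as diseased
        elif p == positive:
            fp += 1   # healthy, but predicted as diseased
        elif a == positive:
            fn += 1   # has the disease, but predicted as healthy
        else:
            tn += 1   # healthy, predicted as healthy
    return {"TP": tp, "FP": fp, "FN": fn, "TN": tn}

actual    = ["yes", "yes", "no", "no",  "yes", "no"]
predicted = ["yes", "no",  "no", "yes", "yes", "no"]
counts = confusion_counts(actual, predicted)
print(counts)  # prints: {'TP': 2, 'FP': 1, 'FN': 1, 'TN': 2}
```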
Performance Metrics
• Accuracy = $\frac{TP + TN}{TP + TN + FP + FN}$
• Sensitivity = $\frac{TP}{TP + FN}$
• Specificity = $\frac{TN}{TN + FP}$
32
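The three metrics follow directly from the confusion matrix counts. The sketch below uses hypothetical counts for 100 patients; the function name and the numbers are assumptions chosen for illustration, not results from the slides.

```python
def performance_metrics(tp, fp, fn, tn):
    """Compute accuracy, sensitivity and specificity from confusion matrix counts."""
    return {
        # share of all patients classified correctly
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        # share of diseased patients the classifier catches
        "sensitivity": tp / (tp + fn),
        # share of healthy patients the classifier clears
        "specificity": tn / (tn + fp),
    }

# Hypothetical counts for 100 patients.
m = performance_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in m.items():
    print(f"{name}: {value:.3f}")
```

With these counts, accuracy is 85/100, sensitivity is 40/45 and specificity is 45/55. In a diagnosis setting a false negative means a missed patient, so sensitivity is often watched more closely than accuracy alone.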
3. DATA MINING IN HEALTHCARE
33
Data mining in healthcare
• Hospitals deal with a lot of data on a day-to-day basis.
• This volume is difficult for a single person to handle.
• Data mining techniques help in analyzing patients' health data.
• Patients' healthcare data can be sorted and many kinds of illness can be predicted, which helps in treating the patients.
34
Disease diagnosis and effective treatment
35
Healthcare applications with Data mining
36
Disease diagnosis using Data Mining
37
4. DATA MINING - RESOURCES
38
Data mining Tools - Open Source
DataMelt
39
Data mining Tools - Proprietary License
• Oracle Data Mining
• IBM Cognos
• IBM SPSS Modeler
• SAS Data Mining
• Sisense
• SSDT
• Teradata
• Board toolkit
• Dundas BI
40
Data Mining Tutorials
www.tutorialpoint.com
www.guru99.com
www.tutorialride.com
www.zentut.com
www.cs.cmu.edu
www.javatpoint.com
www.analyticsvidhya.com
41
Dataset Repository - UCI Repository
https://archive.ics.uci.edu
42
Books
• Han, Jiawei, Jian Pei, and Micheline Kamber. Data Mining: Concepts and Techniques. Elsevier, 2011.
• Zaki, Mohammed J., and Wagner Meira Jr. Data Mining and Analysis: Fundamental Concepts and Algorithms. Cambridge University Press, 2014.
• Gorunescu, Florin. Data Mining: Concepts, Models and Techniques. Vol. 12. Springer Science & Business Media, 2011.
• Zacharski, Ron. A Programmer's Guide to Data Mining: The Ancient Art of the Numerati. 2013.
43
44
45
