SlideShare a Scribd company logo
1 of 19
MAMMOGRAPHIC MASS
DATASET
PRACTICAL WORK FOR DATA MINING
PROF. KASRALIKAR
SEAN REZVANI
DESCRIPTION OF DATA
• This data set can be used to predict the severity (benign or malignant) of a
mammographic mass lesion from BI-RADS attributes and the patient’s age.
• 6 Attributes in total (1 goal field, 1 non-predictive, 4 predictive attributes)
1. BI-RADS assessment: 1 to 5 (ordinal, non-predictive!)
2. Age: patient's age in years (integer)
3. Shape: mass shape: round=1 oval=2 lobular=3 irregular=4 (nominal)
4. Margin: mass margin: circumscribed=1 microlobulated=2 obscured=3 ill-defined=4
spiculated=5 (nominal)
5. Density: mass density high=1 iso=2 low=3 fat-containing=4 (ordinal)
6. Severity: benign=0 or malignant=1 (binominal, goal field!)
A LITTLE MORE ABOUT BI-RADS
• BI-RADS stands for Breast Imaging Reporting and Data System and was
established by the American College of Radiology.
The BI-RADS assessment categories are:
•0- incomplete
•1- negative
•2- benign findings
•3- probably benign
•4- suspicious abnormality
•5- highly suspicious of malignancy
•6- known biopsy with proven malignancy
DATA PREPROCESSING TASKS PERFORMED
• Missing Attribute Values:
- BI-RADS assessment: 2
- Age: 5
- Shape: 31
- Margin: 48
- Density: 76
- Severity: 0
•
ASSOCIATION RULES
PRUNING AND SCATTER PLOTS FOR A
RULES
A RULES PLOTS
A RULES GRAPH
ECLAT (FREQUENT ITEMSET USING)
DECISION TREE (WITH PACKAGE PARTY) & TEST
PREDICTION
KNN & NAÏVE BAYES CLASSIFICATIONS
KNN
NAÏVE BAYES
CLUSTERING, K MEAN, K MEDOID
CLUSTERING PAM
CLUSTERING PAMK
CLUSTERING DBSCAN
CLUSTERING DBSCAN
HIERARCHICAL CLUSTERING
RESULTS
This data set shows that age and BI-RADS have a direct relation with
severity and these two increase the chance of breast cancer.
THANK YOU FOR YOUR ATTENTION
ANY QUESTION?
• Reference data set:
• https://archive.ics.uci.edu/ml/datasets/Mammographic+Mass

More Related Content

What's hot

Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learningAntonio Rueda-Toicen
 
Cut mix: Regularization strategy to train strong classifiers with localizable...
Cut mix: Regularization strategy to train strong classifiers with localizable...Cut mix: Regularization strategy to train strong classifiers with localizable...
Cut mix: Regularization strategy to train strong classifiers with localizable...Changjin Lee
 
Linked List Implementation of Deque in C
Linked List Implementation of Deque in CLinked List Implementation of Deque in C
Linked List Implementation of Deque in CKasun Ranga Wijeweera
 
Feature selection
Feature selectionFeature selection
Feature selectionDong Guo
 
Optimizers
OptimizersOptimizers
OptimizersIl Gu Yi
 
2.5 backpropagation
2.5 backpropagation2.5 backpropagation
2.5 backpropagationKrish_ver2
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesFellowship at Vodafone FutureLab
 
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...Sunghoon Joo
 
“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...
“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...
“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...Edge AI and Vision Alliance
 
Feature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive modelsFeature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive modelsGabriel Moreira
 
Heuristic or informed search
Heuristic or informed searchHeuristic or informed search
Heuristic or informed searchHamzaJaved64
 
밑바닥부터 시작하는딥러닝 8장
밑바닥부터 시작하는딥러닝 8장밑바닥부터 시작하는딥러닝 8장
밑바닥부터 시작하는딥러닝 8장Sunggon Song
 
Feature Engineering
Feature EngineeringFeature Engineering
Feature EngineeringHJ van Veen
 
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)Parth Khare
 
Classification
ClassificationClassification
ClassificationCloudxLab
 

What's hot (20)

Robustness in deep learning
Robustness in deep learningRobustness in deep learning
Robustness in deep learning
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
 
Cut mix: Regularization strategy to train strong classifiers with localizable...
Cut mix: Regularization strategy to train strong classifiers with localizable...Cut mix: Regularization strategy to train strong classifiers with localizable...
Cut mix: Regularization strategy to train strong classifiers with localizable...
 
Linked List Implementation of Deque in C
Linked List Implementation of Deque in CLinked List Implementation of Deque in C
Linked List Implementation of Deque in C
 
Feature selection
Feature selectionFeature selection
Feature selection
 
Optimizers
OptimizersOptimizers
Optimizers
 
2.5 backpropagation
2.5 backpropagation2.5 backpropagation
2.5 backpropagation
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
 
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
 
“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...
“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...
“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...
 
Yolo
YoloYolo
Yolo
 
You only look once
You only look onceYou only look once
You only look once
 
Feature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive modelsFeature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive models
 
Heuristic or informed search
Heuristic or informed searchHeuristic or informed search
Heuristic or informed search
 
밑바닥부터 시작하는딥러닝 8장
밑바닥부터 시작하는딥러닝 8장밑바닥부터 시작하는딥러닝 8장
밑바닥부터 시작하는딥러닝 8장
 
Feature Engineering
Feature EngineeringFeature Engineering
Feature Engineering
 
Deep learning
Deep learning Deep learning
Deep learning
 
LeNet-5
LeNet-5LeNet-5
LeNet-5
 
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
 
Classification
ClassificationClassification
Classification
 

Similar to Mammographic mass dataset

Birads classification of breast.pptx
Birads classification of breast.pptxBirads classification of breast.pptx
Birads classification of breast.pptxkantaDey
 
Luc Rotenberg, Tomosynthese et densité mammaire 2015
Luc Rotenberg, Tomosynthese et densité mammaire 2015Luc Rotenberg, Tomosynthese et densité mammaire 2015
Luc Rotenberg, Tomosynthese et densité mammaire 2015Luc ROTENBERG
 
Mammography-a techniques to identifiy breast cancer.ppt
Mammography-a techniques to identifiy breast cancer.pptMammography-a techniques to identifiy breast cancer.ppt
Mammography-a techniques to identifiy breast cancer.pptomkarnunna1
 
BIRADS_Decoded.pdf
BIRADS_Decoded.pdfBIRADS_Decoded.pdf
BIRADS_Decoded.pdfssuser40fd68
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveySOYEON KIM
 
3 image based_breast_analysis_john_winder
3 image based_breast_analysis_john_winder3 image based_breast_analysis_john_winder
3 image based_breast_analysis_john_winderIñaki Meis Morante
 
Avances sobre las proximas actualizacion a incluir en BI-RADS 6 ED
Avances sobre las proximas actualizacion a incluir en BI-RADS 6 EDAvances sobre las proximas actualizacion a incluir en BI-RADS 6 ED
Avances sobre las proximas actualizacion a incluir en BI-RADS 6 EDAdlitteram1
 
Advanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjadAdvanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjadHeadDPT
 
02 birads mammography reporting
02  birads mammography reporting02  birads mammography reporting
02 birads mammography reportinghoussein.hajj.md
 
Predicting Breast Cancer
Predicting Breast CancerPredicting Breast Cancer
Predicting Breast CancerAARollason
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Tomosynthesis vs digital mammography efficacy comparison using birads.pptx
Tomosynthesis vs digital mammography efficacy comparison using birads.pptxTomosynthesis vs digital mammography efficacy comparison using birads.pptx
Tomosynthesis vs digital mammography efficacy comparison using birads.pptxNagasai Pelala
 
The association between an unhealthy childhood diet and body composition depe...
The association between an unhealthy childhood diet and body composition depe...The association between an unhealthy childhood diet and body composition depe...
The association between an unhealthy childhood diet and body composition depe...CLOSER
 
Randall Ellis: Risk-based comprehensive payment for primary care
Randall Ellis: Risk-based comprehensive payment for primary careRandall Ellis: Risk-based comprehensive payment for primary care
Randall Ellis: Risk-based comprehensive payment for primary careNuffield Trust
 
Breast prognostic factors,imaging,diagnosis,staging
Breast prognostic factors,imaging,diagnosis,stagingBreast prognostic factors,imaging,diagnosis,staging
Breast prognostic factors,imaging,diagnosis,stagingNilesh Kucha
 

Similar to Mammographic mass dataset (20)

Birads classification of breast.pptx
Birads classification of breast.pptxBirads classification of breast.pptx
Birads classification of breast.pptx
 
Luc Rotenberg, Tomosynthese et densité mammaire 2015
Luc Rotenberg, Tomosynthese et densité mammaire 2015Luc Rotenberg, Tomosynthese et densité mammaire 2015
Luc Rotenberg, Tomosynthese et densité mammaire 2015
 
Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selectio...
Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selectio...Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selectio...
Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selectio...
 
Mammography-a techniques to identifiy breast cancer.ppt
Mammography-a techniques to identifiy breast cancer.pptMammography-a techniques to identifiy breast cancer.ppt
Mammography-a techniques to identifiy breast cancer.ppt
 
BIRADS_Decoded.pdf
BIRADS_Decoded.pdfBIRADS_Decoded.pdf
BIRADS_Decoded.pdf
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a survey
 
3 image based_breast_analysis_john_winder
3 image based_breast_analysis_john_winder3 image based_breast_analysis_john_winder
3 image based_breast_analysis_john_winder
 
Breast Density Notification Law Information For Primary Care Providers
Breast Density Notification Law Information For Primary Care ProvidersBreast Density Notification Law Information For Primary Care Providers
Breast Density Notification Law Information For Primary Care Providers
 
Avances sobre las proximas actualizacion a incluir en BI-RADS 6 ED
Avances sobre las proximas actualizacion a incluir en BI-RADS 6 EDAvances sobre las proximas actualizacion a incluir en BI-RADS 6 ED
Avances sobre las proximas actualizacion a incluir en BI-RADS 6 ED
 
Advanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjadAdvanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjad
 
02 birads mammography reporting
02  birads mammography reporting02  birads mammography reporting
02 birads mammography reporting
 
CUN-BAE PDF
CUN-BAE PDFCUN-BAE PDF
CUN-BAE PDF
 
Predicting Breast Cancer
Predicting Breast CancerPredicting Breast Cancer
Predicting Breast Cancer
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
Tomosynthesis vs digital mammography efficacy comparison using birads.pptx
Tomosynthesis vs digital mammography efficacy comparison using birads.pptxTomosynthesis vs digital mammography efficacy comparison using birads.pptx
Tomosynthesis vs digital mammography efficacy comparison using birads.pptx
 
The association between an unhealthy childhood diet and body composition depe...
The association between an unhealthy childhood diet and body composition depe...The association between an unhealthy childhood diet and body composition depe...
The association between an unhealthy childhood diet and body composition depe...
 
Randall Ellis: Risk-based comprehensive payment for primary care
Randall Ellis: Risk-based comprehensive payment for primary careRandall Ellis: Risk-based comprehensive payment for primary care
Randall Ellis: Risk-based comprehensive payment for primary care
 
Breast Cancer Prediction - Arwa Marfatia.pptx
Breast Cancer Prediction - Arwa Marfatia.pptxBreast Cancer Prediction - Arwa Marfatia.pptx
Breast Cancer Prediction - Arwa Marfatia.pptx
 
Breast prognostic factors,imaging,diagnosis,staging
Breast prognostic factors,imaging,diagnosis,stagingBreast prognostic factors,imaging,diagnosis,staging
Breast prognostic factors,imaging,diagnosis,staging
 
BI-RADS (MAMMOGRAPHY)
BI-RADS (MAMMOGRAPHY)BI-RADS (MAMMOGRAPHY)
BI-RADS (MAMMOGRAPHY)
 

Recently uploaded

Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfRobertoOcampo24
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeBoston Institute of Analytics
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证acoha1
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证pwgnohujw
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证pwgnohujw
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives23050636
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"John Sobanski
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsBrainSell Technologies
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchersdarmandersingh4580
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token PredictionNABLAS株式会社
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationmuqadasqasim10
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancingmohamed Elzalabany
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...yulianti213969
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证ju0dztxtn
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证dq9vz1isj
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...BabaJohn3
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一fztigerwe
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.pptRachmaGhifari
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...yulianti213969
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjadimosmejiaslendon
 

Recently uploaded (20)

Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdf
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data Analytics
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchers
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic information
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 

Mammographic mass dataset

  • 1. MAMMOGRAPHIC MASS DATASET PRACTICAL WORK FOR DATA MINING PROF. KASRALIKAR SEAN REZVANI
  • 2. DESCRIPTION OF DATA • This data set can be used to predict the severity (benign or malignant) of a mammographic mass lesion from BI-RADS attributes and the patient’s age. • 6 Attributes in total (1 goal field, 1 non-predictive, 4 predictive attributes) 1. BI-RADS assessment: 1 to 5 (ordinal, non-predictive!) 2. Age: patient's age in years (integer) 3. Shape: mass shape: round=1 oval=2 lobular=3 irregular=4 (nominal) 4. Margin: mass margin: circumscribed=1 microlobulated=2 obscured=3 ill-defined=4 spiculated=5 (nominal) 5. Density: mass density high=1 iso=2 low=3 fat-containing=4 (ordinal) 6. Severity: benign=0 or malignant=1 (binominal, goal field!)
  • 3. A LITTLE MORE ABOUT BI-RADS • BI-RADS stands for Breast Imaging Reporting and Data System and was established by the American College of Radiology. The BI-RADS assessment categories are: •0- incomplete •1- negative •2- benign findings •3- probably benign •4- suspicious abnormality •5- highly suspicious of malignancy •6- known biopsy with proven malignancy
  • 4. DATA PREPROCESSING TASKS PERFORMED • Missing Attribute Values: - BI-RADS assessment: 2 - Age: 5 - Shape: 31 - Margin: 48 - Density: 76 - Severity: 0 •
  • 6. PRUNING AND SCATTER PLOTS FOR A RULES
  • 10. DECISION TREE (WITH PACKAGE PARTY) & TEST PREDICTION
  • 11. KNN & NAÏVE BAYES CLASSIFICATIONS KNN NAÏVE BAYES
  • 18. RESULTS This data set shows that age and BI-RADS have a direct relation with severity and these two increase the chance of breast cancer.
  • 19. THANK YOU FOR YOUR ATTENTION ANY QUESTION? • Reference data set: • https://archive.ics.uci.edu/ml/datasets/Mammographic+Mass