Sales of Orthopedic Equipment
Xiaomeng (Mina) Chai
11/25/2014
Client’s Background
• Client:
a large manufacturer of orthopedic
equipment in the United States
• Customer base:
almost all hospitals across the 50 states
Client’s Products
• Orthopedic parts and equipment
• Medications administered in the process of
surgery, rehabilitation, and recovery
The Company Thinks …
• SALES!
– High sales
– Moderate sales (further sales potential)
– Little or no sales (substantial potential gain)
Imagine …
We think…
• ORTHOPEDIC ACTIVITIES!
– Small general hospitals (little or no interest)
– Large general hospitals (moderate interest)
– Specialized hospitals (main target group!)
Objective
• Increase sales...
…in the more desirable groups!
• How?
– Identify target hospitals
– Study them individually
• Another objective: other ways to classify
hospitals?
Dataset
All U.S. hospitals are in the dataset:
Variables
A subset of variables is already selected
Methodology
• Data Mining
– Dimension Reduction
• Factor Analysis
• Principal Component Analysis
– Cluster Analysis
• Hierarchical Clustering
• Centroid Methods
• Regression analysis
Data Mining
• Overall goal—to extract information from a data set and
transform it into an understandable structure for further
use. (Wikipedia)
• The objective of data mining is to identify nuggets, small
clusters of observations in these data that contain
unexpected, yet potentially valuable, information. (The
author)
Data Mining
Approach to data mining
1. Dimension (variable) reduction
– Principal components
– Factor analysis
2. Data segmentation and selection
– Cluster analysis
– Tree methods
– Neural nets
3. Data analysis of interesting segments
This case study
Part 1: Select Market Segments
• Find a state or group of states (at least 300 hospitals)
– IL, IN, MI, WI are selected (590 hospitals)
Transformation
Log or square root transformations are performed
Transformation
Before After
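As a rough illustration of the transformation step, the sketch below applies a log and a square-root transform to a right-skewed column and compares how lopsided the range is before and after. The values, the name `beds`, and the helper `spread_ratio` are made up for illustration; `log1p` is an assumption used to keep zero counts finite. Python is used here rather than the R shown in the slides.

```python
import math

# Hypothetical right-skewed counts (not taken from the hospital dataset).
beds = [0, 12, 25, 40, 90, 400, 1500]

# log(1 + x) keeps zeros finite; the square root is a milder alternative.
log_beds = [math.log1p(x) for x in beds]
sqrt_beds = [math.sqrt(x) for x in beds]

def spread_ratio(values):
    """Ratio of (max - median) to (median - min); 1.0 means a symmetric range."""
    ordered = sorted(values)
    median = ordered[len(ordered) // 2]
    return (ordered[-1] - median) / (median - ordered[0])

# A ratio closer to 1 indicates a less skewed variable.
print(round(spread_ratio(beds), 2),
      round(spread_ratio(sqrt_beds), 2),
      round(spread_ratio(log_beds), 2))
```

Which transform suits a given variable is a judgment call; comparing histograms before and after, as the slide does, is the usual check.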
so far…
Dimension Reduction
• Two-stage factor analysis
– Operational factor (HIP95, KNEE95, HIP96, KNEE96,
and FEMUR96)
– Size factor (BEDS, OUTV, ADM, SIR, TH, and TRAUMA)
and rehab factor (RBEDS and REHAB)
Factor Analysis: stage 1
Factor Analysis: stage 2
Factor Analysis: Rotate?
• More interpretable results.
• An orthogonal rotation method (VARIMAX) is commonly
used.
e.g. Look at variable X33 here:
Factor Analysis: stage 2
Principal Component Analysis: stage 1
Principal Component Analysis: stage 2
R
Factor Analysis in R
PCA in R
Factor Analysis
13 variables are divided into 3 factors:
Textbook Question:
Graph the main principal components. Are there any visible clusters?
The banding is relatively vertical; REHAB is affecting factor 2 (RBEDS and REHAB).
so far…
Cluster Analysis
• To determine the best cluster to concentrate on
for improving sales.
• Two popular methods
– Hierarchical Clustering (interpoint distance)
• Single linkage
• Average linkage
• Ward
– Centroid Methods
• K-means algorithm
• Partitioning Around Medoids (PAM)
Cluster Analysis
• Hierarchical Clustering:
1. Start with a cluster at each sample point
2. At each stage of building the tree, the two closest clusters are joined
to form a new cluster
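The two steps above can be sketched as follows. This is a minimal single-linkage version in Python (not the R used in the slides); the function name `single_linkage` and the toy points are assumptions for illustration, and the brute-force pair search is only sensible for small data.

```python
import math

def single_linkage(points, n_clusters):
    """Merge the two closest clusters until n_clusters remain."""
    # Step 1: start with one cluster per sample point (stored as index lists).
    clusters = [[i] for i in range(len(points))]

    def distance(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > n_clusters:
        # Step 2: find the closest pair of clusters and join them.
        pairs = [(distance(clusters[p], clusters[q]), p, q)
                 for p in range(len(clusters))
                 for q in range(p + 1, len(clusters))]
        _, p, q = min(pairs)
        clusters[p] = clusters[p] + clusters[q]
        del clusters[q]
    return [sorted(c) for c in clusters]

# Hypothetical 2-D points forming two obvious groups.
points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9)]
print(single_linkage(points, 2))
```

Average linkage or Ward's method would change only the `distance` helper; the merge loop stays the same.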
Cluster Analysis
• Centroid Methods (K-means algorithm)
1. K seed points are chosen and the data are distributed
among the k clusters
2. At each step, switch a point from one cluster to
another if R^2 is increased
3. Clusters are slowly optimized by switching points
until no improvement of R^2 is possible
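A small sketch of the idea, under stated assumptions: it uses the standard Lloyd-style k-means update (reassign every point to its nearest center, then recompute the centers) rather than the one-point-at-a-time switching described above. Both stop when no reassignment improves the fit, and R^2 is computed here as 1 minus the within-cluster sum of squares over the total sum of squares. The function name `kmeans_1d`, the seeds, and the data are hypothetical; Python is used instead of R.

```python
def kmeans_1d(values, seeds, max_iter=100):
    """Lloyd-style k-means on 1-D data; returns (labels, centers, r_squared)."""
    centers = list(seeds)
    labels = None
    for _ in range(max_iter):
        # Assign each value to its nearest center.
        new_labels = [min(range(len(centers)), key=lambda c: (v - centers[c]) ** 2)
                      for v in values]
        if new_labels == labels:
            break  # no point changed cluster, so the fit cannot improve further
        labels = new_labels
        # Move each center to the mean of its members (keep it if the cluster is empty).
        for c in range(len(centers)):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centers[c] = sum(members) / len(members)
    grand_mean = sum(values) / len(values)
    total_ss = sum((v - grand_mean) ** 2 for v in values)
    within_ss = sum((v - centers[lab]) ** 2 for v, lab in zip(values, labels))
    return labels, centers, 1 - within_ss / total_ss

# Hypothetical data with two well-separated groups and two arbitrary seeds.
values = [1.0, 1.2, 0.8, 9.0, 9.5, 10.0]
labels, centers, r_squared = kmeans_1d(values, seeds=[0.0, 5.0])
print(labels, centers, round(r_squared, 4))
```

The result depends on the seed points, so in practice the algorithm is usually run from several starts and the solution with the highest R^2 is kept.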
Cluster Analysis
• Partitioning Around Medoids (PAM)
1. Search for k representative medoids
2. K clusters are constructed by assigning each point
to the nearest medoid
3. The goal is to find k medoids which minimize the
sum of the dissimilarities of the observations to their
closest representative medoid.
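To make the objective concrete, the sketch below finds the k medoids that minimize the total dissimilarity by trying every combination. This exhaustive search is an assumption made for clarity: it optimizes the same criterion as PAM but is not PAM's build-and-swap heuristic, and it is only feasible for tiny datasets. The function name `k_medoids_exhaustive` and the data are hypothetical; Python is used instead of R.

```python
from itertools import combinations

def k_medoids_exhaustive(dissim, k):
    """Return (medoids, labels, cost) minimizing total dissimilarity to the nearest medoid."""
    n = len(dissim)
    best = None
    for medoids in combinations(range(n), k):
        # Cost: each observation contributes its dissimilarity to the closest medoid.
        cost = sum(min(dissim[i][m] for m in medoids) for i in range(n))
        if best is None or cost < best[1]:
            best = (medoids, cost)
    medoids, cost = best
    # Each observation is labeled with the index of its nearest medoid.
    labels = [min(medoids, key=lambda m: dissim[i][m]) for i in range(n)]
    return medoids, labels, cost

# Hypothetical 1-D observations; the dissimilarity is the absolute difference.
obs = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
dissim = [[abs(a - b) for b in obs] for a in obs]
medoids, labels, cost = k_medoids_exhaustive(dissim, 2)
print(medoids, labels, cost)
```

Because only the dissimilarity matrix is used, any dissimilarity measure can be plugged in, not just Euclidean distance.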
Cluster Analysis
• PAM vs. K-means
– PAM operates on the dissimilarity matrix
– PAM minimizes a sum of dissimilarities instead of a
sum of squared Euclidean distances
– Silhouette plot (select the optimal number of clusters)
Cluster Tree
PAM in R
• Silhouette width:
s(i) = (b(i) - a(i)) / \max(a(i), b(i))
Observations with a large s(i) (close to 1) are very well clustered
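The silhouette width can be computed directly from a dissimilarity matrix and a set of cluster labels. The sketch below is a minimal version in Python (not the R output shown in the slides), where a(i) is the average dissimilarity of i to the other members of its own cluster and b(i) is the smallest average dissimilarity of i to any other cluster. The function name `silhouette_widths` and the toy data are assumptions, and at least two clusters are required.

```python
def silhouette_widths(dissim, labels):
    """s(i) = (b(i) - a(i)) / max(a(i), b(i)) for each observation i."""
    n = len(labels)
    clusters = sorted(set(labels))
    widths = []
    for i in range(n):
        own = [j for j in range(n) if labels[j] == labels[i] and j != i]
        if not own:
            widths.append(0.0)  # singleton cluster: s(i) is defined as 0
            continue
        # a(i): average dissimilarity to the other members of i's cluster.
        a = sum(dissim[i][j] for j in own) / len(own)
        # b(i): smallest average dissimilarity to any other cluster.
        b = min(
            sum(dissim[i][j] for j in range(n) if labels[j] == c)
            / sum(1 for j in range(n) if labels[j] == c)
            for c in clusters if c != labels[i]
        )
        widths.append((b - a) / max(a, b))
    return widths

# Hypothetical 1-D observations in two well-separated clusters.
obs = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
dissim = [[abs(x - y) for y in obs] for x in obs]
widths = silhouette_widths(dissim, [0, 0, 0, 1, 1, 1])
print([round(w, 3) for w in widths])
```

Averaging the widths over all observations for several values of k, and picking the k with the largest average, is one common way to choose the number of clusters.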
Cluster Analysis
Cluster Analysis in R
Cluster of Interest
so far…
Part 2: Estimate Potential Sales
• Part 1 – Select Market Segments: DONE
• Part 2 – Estimate Potential Sales
Regression Analysis
• Hospitals with large negative residuals:
HID     CITY        STATE  RESIDUAL  GAIN
087043  Chicago     IL     -2.8766   68.590
915042  South Bend  IN     -1.7989   16.440
016045  Beloit      WI     -2.5633   24.893
020042  Columbus    IN     -2.5146   34.710
078045  Madison     WI     -2.2309   59.362
109043  Chicago     IL     -1.9317   47.980
262043  Peoria      IL     -2.5952   90.593
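The idea behind the table is that a hospital whose actual sales fall well below the regression prediction has a large negative residual, and so may have room for more sales. The sketch below shows that screening step with a one-predictor least-squares fit in Python (not the R or model used in the slides). The data, the cutoff of -1.5, and the names `fit_line` and `scaled_residuals` are all hypothetical, and the Gain column is not reproduced because the slides do not say how it was computed.

```python
import math

def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def scaled_residuals(x, y):
    """Residuals divided by the residual standard error (n - 2 degrees of freedom)."""
    intercept, slope = fit_line(x, y)
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    sigma = math.sqrt(sum(r * r for r in residuals) / (len(x) - 2))
    return [r / sigma for r in residuals]

# Hypothetical predictor (e.g. a size score) and response (e.g. transformed sales).
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2.1, 3.9, 6.2, 8.0, 4.0, 12.1, 13.8, 16.1]

scaled = scaled_residuals(x, y)
# Flag observations whose sales are far below what the model predicts.
flagged = [i for i, r in enumerate(scaled) if r < -1.5]
print(flagged)
```

In the case study the same screening would be applied within the cluster of interest, using the factors from Part 1 as predictors.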
Thank you and Happy Holiday!

Data Mining Case Study

Editor's Notes

  1. Orthopedic equipment refers to a variety of structural devices designed to stabilize, protect, and/or correct orthopedic disorders. Common medications used to treat orthopedic conditions include nonsteroidal anti-inflammatory medications (e.g. Motrin, Aleve, Naprosyn, Celebrex), Glucosamine, and others.
  2. From the point of view of sales
  3. From the point of view of activities
  4. 4703 hospitals and 19 variables. Chicago has 45 hospitals.
  5. From raw data to small dataset
  6. Independent variables: linear trend. Dependent variable: normality.
  7. The elements of the Factor Pattern reflect the unique variance each factor contributes to the variance of an observed variable. The reason factor analysis is not stopped after this initial factoring stage, without rotating the factors, is that the factors as they currently exist are not easily interpretable. In an ideal solution, the variables should “load” highly (have a high value that approaches 1) on just one factor each.
  8. Final communality estimates: these can be derived by taking the sum of squares of each row of the factor pattern. This is the variance of the observed variable that is accounted for by the common factors.
  9. The left and bottom axes show the loadings; the top and right axes show the principal component scores. The plot gives a meaningful visual representation of the structure of cases and variables.
  10. The Cluster History section starts out with n (590) clusters of size 1 and continues until all the observations are included in one cluster. R^2 is the proportion of variance explained by the clusters at a given step. In the first step, n-1 clusters are formed by joining the pair of observations whose merger gives the largest R^2. Thus, at each step of the algorithm, clusters or observations are combined in such a way as to maximize the R^2 value. The biggest jump in R^2 is between 5 and 4 clusters, with a difference of almost 0.1. Therefore, I chose 5 clusters for my further analysis.
  11. Put a(i) = average dissimilarity between i and all other points of the cluster to which i belongs (if i is the only observation in its cluster, s(i) := 0 without further calculations). For all other clusters C, put d(i,C)= average dissimilarity of i to all observations of C. The smallest of these d(i,C) is b(i) := \min_C d(i,C), and can be seen as the dissimilarity between i and its “neighbor” cluster, i.e., the nearest one to which it does not belong. Finally,