SlideShare a Scribd company logo
1 of 20
Decision Support Systems Design for
Data-Driven Management
PhD Candidate: Ningrong LEI
Supervisor: Dr. Seung Ki MOON
School of Mechanical & Aerospace Engineering
Division of Systems & Engineering Management
DETC2014-34871
ASME 2014 International Design & Engineering Technical Conferences, Buffalo, New York, USA
Outline
 Data-driven dilemma.
 Construct the decision support systems.
 Case study and discussion.
 Conclusion and future work.
1
DETC2014-34871
Data alone will not improve management
decisions
Source: :
www.v1shal.com
2
Market Segmentation
Source: .
:Cameron Industrials
Platform Strategy Advisory
3
Decision support methods for market
segmentation
 Non data-driven
– Ground on “common sense” rather than on a solid
empirical base
 Data-driven
– Generally lack of reliability assessments
– Restrict to a few comparison
4
When data-driven management goes bad…
Source: :
www.v1shal.com
5
The success of the data-driven management
relies on:
 Quality of the gathered data
 Reliable model
 Effectiveness of data analysis
 Objectiveness of results interpretation
6
Objectives
Develop a reliable decision support system to:
 Identify market segmentation based on market data
 Determine product positioning
7
Online system
Block diagram of the proposed DSS
Intrinsic dimension
estimation
Dimension
reduction
Performance
evaluation
Clustering
Dimension
reduction
Automated
classification
Clustering
Market Data
Offline system
Best dimension
Reduction algorithm
Best clustering
algorithm
Best training
model
Product Data
8
Case study: automobile market
 34 brands,639 models,31 model properties
9
Online system
Dimension reduction steps
Intrinsic dimension
estimation
Dimension
reduction
Performance
evaluation
Clustering
Dimension
reduction
Automated
classification
Clustering
Market Data
Offline system
Best dimension
Reduction algorithm
Best clustering
algorithm
Best training
model
Product Data
10
Dimension reduction techniques
Intrinsic dimensionality estimation:
• Correlation Dimension Estimator
• Eigenvalue-Based Estimator
• Maximum Likelihood Estimator
• Geodesic Minimum Spanning Tree
Dimension reduction:
• Principle Component Analysis
• Multidimensional Scaling
• Local Linear Embedding
11
Online system
Block diagram of the proposed DSS
Intrinsic dimension
estimation
Dimension
reduction
Performance
evaluation
Clustering
Dimension
reduction
Product Positioning
Market
Segmentation
Market Data
Offline system
Best dimension
Reduction algorithm
Best clustering
algorithm
Best training
model
Product Data
12
Clustering
 K-means
 Fuzzy C-Means
 Hierarchical Clustering
13
Performance Evaluation
 Statistical tests:
– median and standard deviation
 Silhouette Means:
14
Performance Evaluation
How accurately a predictive model will perform in
practice?
 Stratified 10-Fold Cross-Validation
 Classification:
– Gentle AdaBoost (GA)
– Nearest Neighbour (NN)
– Support Vector Machine (SVM)
15
Online decision support system
16
Clustering and performance evaluation results
Clustering Performance evaluation
Cluster-
ing
Dimension
reduction
Dimension
estimation
Number
of
clusters
Silhouette
mean
GA
in %
NN
in %
SVM
in %
K-
means
PCA
2 2 0.67 25.24 98.25 99.68
3 2 0.61 25.08 96.83 99.52
10 11 0.40 95.50 95.50 96.50
MDS
2 2 0.67 74.13 97.78 99.68
3 2 0.61 25.08 96.03 99.52
10 2 0.48 25.71 94.13 100.00
LLE
2 2 0.89 97.38 99.76 100.00
3 3 0.81 99.02 98.05 99.76
10 2 0.85 0.95 99.76 100.00
17
Classification accuracy over all clusters
• On average, SVM outperforms NN and GA
18
Conclusion
 Deliver a blueprint on how to construct a decision
support system.
– Offline system: find the most suitable algorithms structure;
– Online system: deliver objective and reliable decision support.
 Data-driven management: right data, proven statistics,
logical explanations.
19

More Related Content

Similar to ASME14_Ningrong

Presentation Title
Presentation TitlePresentation Title
Presentation Titlebutest
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j
 
Solving churn challenge in Big Data environment - Jelena Pekez
Solving churn challenge in Big Data environment  - Jelena PekezSolving churn challenge in Big Data environment  - Jelena Pekez
Solving churn challenge in Big Data environment - Jelena PekezInstitute of Contemporary Sciences
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...ChemAxon
 
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...Rising Media Ltd.
 
Data analytics and visualization
Data analytics and visualizationData analytics and visualization
Data analytics and visualizationVini Vasundharan
 
Using Data Visualization to Improve Your Data Balance Sheet
Using Data Visualization to Improve Your Data Balance SheetUsing Data Visualization to Improve Your Data Balance Sheet
Using Data Visualization to Improve Your Data Balance SheetAntea Group
 
GraphTour London 2020 - Graphs for AI, Amy Hodler
GraphTour London 2020  - Graphs for AI, Amy HodlerGraphTour London 2020  - Graphs for AI, Amy Hodler
GraphTour London 2020 - Graphs for AI, Amy HodlerNeo4j
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalHarvinder Atwal
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxJadna Almeida
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxJadna Almeida
 
Building Predictive Analytics on Big Data Platforms
Building Predictive Analytics on Big Data PlatformsBuilding Predictive Analytics on Big Data Platforms
Building Predictive Analytics on Big Data PlatformsOlha Hrytsay
 
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation ApproachYONG ZHENG
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentationNeerajNishad4
 
Big data: status atual e tendências
Big data: status atual e tendênciasBig data: status atual e tendências
Big data: status atual e tendênciasCezar Taurion
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxdongchangim30
 
All sight overview presenation discovery jan 23rd 2018
All sight overview presenation   discovery jan 23rd 2018All sight overview presenation   discovery jan 23rd 2018
All sight overview presenation discovery jan 23rd 2018Michael Harris
 

Similar to ASME14_Ningrong (20)

Presentation Title
Presentation TitlePresentation Title
Presentation Title
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - Webinar
 
Solving churn challenge in Big Data environment - Jelena Pekez
Solving churn challenge in Big Data environment  - Jelena PekezSolving churn challenge in Big Data environment  - Jelena Pekez
Solving churn challenge in Big Data environment - Jelena Pekez
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
 
Data analytics and visualization
Data analytics and visualizationData analytics and visualization
Data analytics and visualization
 
Using Data Visualization to Improve Your Data Balance Sheet
Using Data Visualization to Improve Your Data Balance SheetUsing Data Visualization to Improve Your Data Balance Sheet
Using Data Visualization to Improve Your Data Balance Sheet
 
Mark Dean Notes
Mark Dean NotesMark Dean Notes
Mark Dean Notes
 
GraphTour London 2020 - Graphs for AI, Amy Hodler
GraphTour London 2020  - Graphs for AI, Amy HodlerGraphTour London 2020  - Graphs for AI, Amy Hodler
GraphTour London 2020 - Graphs for AI, Amy Hodler
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptx
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptx
 
Building Predictive Analytics on Big Data Platforms
Building Predictive Analytics on Big Data PlatformsBuilding Predictive Analytics on Big Data Platforms
Building Predictive Analytics on Big Data Platforms
 
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentation
 
Customer_Experience_for_Telco
Customer_Experience_for_TelcoCustomer_Experience_for_Telco
Customer_Experience_for_Telco
 
Dss nitin chugh
Dss nitin chughDss nitin chugh
Dss nitin chugh
 
Big data: status atual e tendências
Big data: status atual e tendênciasBig data: status atual e tendências
Big data: status atual e tendências
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptx
 
All sight overview presenation discovery jan 23rd 2018
All sight overview presenation   discovery jan 23rd 2018All sight overview presenation   discovery jan 23rd 2018
All sight overview presenation discovery jan 23rd 2018
 

ASME14_Ningrong

  • 1. Decision Support Systems Design for Data-Driven Management PhD Candidate: Ningrong LEI Supervisor: Dr. Seung Ki MOON School of Mechanical & Aerospace Engineering Division of Systems & Engineering Management DETC2014-34871 ASME 2014 International Design & Engineering Technical Conferences, Buffalo, New York, USA
  • 2. Outline  Data-driven dilemma.  Construct the decision support systems.  Case study and discussion.  Conclusion and future work. 1 DETC2014-34871
  • 3. Data alone will not improve management decisions Source: : www.v1shal.com 2
  • 4. Market Segmentation Source: . :Cameron Industrials Platform Strategy Advisory 3
  • 5. Decision support methods for market segmentation  Non data-driven – Ground on “common sense” rather than on a solid empirical base  Data-driven – Generally lack of reliability assessments – Restrict to a few comparison 4
  • 6. When data-driven management goes bad… Source: : www.v1shal.com 5
  • 7. The success of the data-driven management relies on:  Quality of the gathered data  Reliable model  Effectiveness of data analysis  Objectiveness of results interpretation 6
  • 8. Objectives Develop a reliable decision support system to:  Identify market segmentation based on market data  Determine product positioning 7
  • 9. Online system Block diagram of the proposed DSS Intrinsic dimension estimation Dimension reduction Performance evaluation Clustering Dimension reduction Automated classification Clustering Market Data Offline system Best dimension Reduction algorithm Best clustering algorithm Best training model Product Data 8
  • 10. Case study: automobile market  34 brands,639 models,31 model properties 9
  • 11. Online system Dimension reduction steps Intrinsic dimension estimation Dimension reduction Performance evaluation Clustering Dimension reduction Automated classification Clustering Market Data Offline system Best dimension Reduction algorithm Best clustering algorithm Best training model Product Data 10
  • 12. Dimension reduction techniques Intrinsic dimensionality estimation: • Correlation Dimension Estimator • Eigenvalue-Based Estimator • Maximum Likelihood Estimator • Geodesic Minimum Spanning Tree Dimension reduction: • Principle Component Analysis • Multidimensional Scaling • Local Linear Embedding 11
  • 13. Online system Block diagram of the proposed DSS Intrinsic dimension estimation Dimension reduction Performance evaluation Clustering Dimension reduction Product Positioning Market Segmentation Market Data Offline system Best dimension Reduction algorithm Best clustering algorithm Best training model Product Data 12
  • 14. Clustering  K-means  Fuzzy C-Means  Hierarchical Clustering 13
  • 15. Performance Evaluation  Statistical tests: – median and standard deviation  Silhouette Means: 14
  • 16. Performance Evaluation How accurately a predictive model will perform in practice?  Stratified 10-Fold Cross-Validation  Classification: – Gentle AdaBoost (GA) – Nearest Neighbour (NN) – Support Vector Machine (SVM) 15
  • 18. Clustering and performance evaluation results Clustering Performance evaluation Cluster- ing Dimension reduction Dimension estimation Number of clusters Silhouette mean GA in % NN in % SVM in % K- means PCA 2 2 0.67 25.24 98.25 99.68 3 2 0.61 25.08 96.83 99.52 10 11 0.40 95.50 95.50 96.50 MDS 2 2 0.67 74.13 97.78 99.68 3 2 0.61 25.08 96.03 99.52 10 2 0.48 25.71 94.13 100.00 LLE 2 2 0.89 97.38 99.76 100.00 3 3 0.81 99.02 98.05 99.76 10 2 0.85 0.95 99.76 100.00 17
  • 19. Classification accuracy over all clusters • On average, SVM outperforms NN and GA 18
  • 20. Conclusion  Deliver a blueprint on how to construct a decision support system. – Offline system: find the most suitable algorithms structure; – Online system: deliver objective and reliable decision support.  Data-driven management: right data, proven statistics, logical explanations. 19