SlideShare a Scribd company logo
Course : ISYE8015_Selected Topic in
Industrial Engineering
Period : June 2020
Business analytics using data
science techniques
Topic
1. What is business analytics ?
2. Data preparation for business analytics
3. Business analytics and intelligence application
framework
Business Intelligence and
Analytic
• Business intelligence
– Acquisition of data and information for use
in decision-making activities
• Business analytics
– Models and solution methods
• Data mining
– Applying models and methods to data to
identify patterns and trends
Data, Information, Knowledge
• Data
– Items that are the most elementary descriptions of
things, events, activities, and transactions
– May be internal or external
• Information
– Organized data that has meaning and value
• Knowledge
– Processed data or information that conveys
understanding or learning applicable to a problem or
activity
Data problems
Database Model
• Hierarchical
– Top down, like inverted tree
– Fields have only one “parent”, each “parent” can have multiple
“children”
– Fast
• Network
– Relationships created through linked lists, using pointers
– “Children” can have multiple “parents”
– Greater flexibility, substantial overhead
• Relational
– Flat, two-dimensional tables with multiple access queries
– Examines relations between multiple tables
– Flexible, quick, and extendable with data independence
• Object oriented
– Data analyzed at conceptual level
– Inheritance, abstraction, encapsulation
Migrating Data
• Business rules
– Stored in metadata repository
– Applied to data warehouse centrally
• Data extracted from all relevant sources
– Loaded through data-transformation tools or
programs
– Separate operation and decision support
environments
• Correct problems in quality before data
stored
– Cleanse and organize in consistent manner
Business Analytics and
intelligence to support
visualization
• Technologies supporting visualization and
interpretation
– Digital imaging, GIS, GUI, tables,
multidimensions, graphs, VR, 3D, animation
– Identify relationships and trends
• Data manipulation allows real time look at
performance data
Data Analytic System
• Real-time queries and analysis
• Real-time decision-making
• Real-time data warehouses updated
daily or more frequently
–Updates may be made while queries
are active
–Not all data updated continuously
• Deployment of business analytic
applications
Business Analytics : GIS
• Computerized system for managing and
manipulating data with digitized maps
– Geographically oriented
– Geographic spreadsheet for models
– Software allows web access to maps
– Used for modeling and simulations
Business Analytics: Web
• Web analytics
– Application of business analytics to Web
sites
• Web intelligence
– Application of business intelligence
techniques to Web sites
Business Analytics and Intelligence
Application using Pentaho
About Pentaho
• Recognized leader in business analytics & data integration
• Subscription-based business model
• Achieved critical mass:
• Over 1,200 commercial customers
• Over 10,000 production deployments
• Over 185 countries
• Stewardship of most important open source analytics
projects
INDUSTRY RECOGNITION OVER 160 PARTNERS GLOBALLY
Pentaho for Big Data Analytic
Big
Data
Mgmt
Hadoop
Java MapReduce, Pig
Pentaho MapReduce
NoSQL Databases Analytic Databases
Data Integration
Job Orchestration
Workflow
Scheduling
High Performance
Visual IDE
Data
Integration
Pentaho Business Analytics
•
R
•
3rd Party BI Tools
•
Applications
3rd Party Tools
Big
Analytics
Business Analytic Model
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Components
are
independent
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Example for Dashboards
Dashboards & Interactive Dashboards
for Business Analytics
Dashboards – Geo Location-Based
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Mining
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Reports – Interactive, Static, Distributed
Reports – Reporting Pack & House Styles
Reports – Reporting Pack & House Styles
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Enhanced In-Memory Analytics
• Enhanced in-memory caching for speed of
thought visualization & analysis
– More re-usability of in-memory data
– Fewer trips to the database/disk
• Builds on existing unique extreme-scale in-
memory analytics
– Support for external data grids
• Infinispan / JBoss Enteprise Data Grid
and Memcached
• Scale to caching hundreds of GBs
(potentially TBs) of data in-memory
• Competition
– Java heap or C++ memory space (a few GB
at most (most BI products)
or
– Proprietary (hard to manage) in-memory
technology (e.g. Qlikview, Microstrategy)
Analyzer – Table format
Analyzer – Chart format
Analyzer: Geo Location-Based Analysis
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
“Traditional” Database Support
DATA INTEGRATION
DATA ANALYSIS
Broadest Support for Big Data
Platforms
Hadoop NoSQL Analytic Databases

More Related Content

Similar to 20200713152029_PPT4-Business analytics using data science techniques and case study-R1.PPT

SG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptxSG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptx
ssuser57f752
 
Big data
Big dataBig data
Big data
Srinivasa Reddy
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence Architecture
Philippe Julio
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework
Vishwanath Ramdas
 
The Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive SolutionsThe Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive Solutions
The Data Science Institute
 
Modern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceModern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance Excellence
ICFAI Business School
 
KIT601 Unit I.pptx
KIT601 Unit I.pptxKIT601 Unit I.pptx
KIT601 Unit I.pptx
LBSIMDS, Lucknow
 
Business Analytics Training
Business Analytics TrainingBusiness Analytics Training
Business Analytics Training
Natalija Pavic
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08
Amit Fogla
 
Chapter 10 supporting decision making
Chapter 10  supporting decision makingChapter 10  supporting decision making
Chapter 10 supporting decision making
Advance Saraswati Prakashan Pvt Ltd
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
SPAN Infotech (India) Pvt Ltd
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DATAVERSITY
 
Strategy session 5 - unlocking the data dividend - andy steer
Strategy   session 5 - unlocking the data dividend - andy steerStrategy   session 5 - unlocking the data dividend - andy steer
Strategy session 5 - unlocking the data dividend - andy steer
Andy Steer
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
Databricks
 
Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization
iyke ezeugo
 
Business intelligence techniques U2.pptx
Business intelligence techniques U2.pptxBusiness intelligence techniques U2.pptx
Business intelligence techniques U2.pptx
RenuLamba8
 
Business Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptxBusiness Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptx
RupaRani28
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics Platform
Arvind Sathi
 
Prez szabolcs
Prez szabolcsPrez szabolcs
Prez szabolcs
Doina Draganescu
 
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient..."Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
Dataconomy Media
 

Similar to 20200713152029_PPT4-Business analytics using data science techniques and case study-R1.PPT (20)

SG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptxSG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptx
 
Big data
Big dataBig data
Big data
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence Architecture
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework
 
The Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive SolutionsThe Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive Solutions
 
Modern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceModern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance Excellence
 
KIT601 Unit I.pptx
KIT601 Unit I.pptxKIT601 Unit I.pptx
KIT601 Unit I.pptx
 
Business Analytics Training
Business Analytics TrainingBusiness Analytics Training
Business Analytics Training
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08
 
Chapter 10 supporting decision making
Chapter 10  supporting decision makingChapter 10  supporting decision making
Chapter 10 supporting decision making
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
 
Strategy session 5 - unlocking the data dividend - andy steer
Strategy   session 5 - unlocking the data dividend - andy steerStrategy   session 5 - unlocking the data dividend - andy steer
Strategy session 5 - unlocking the data dividend - andy steer
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization
 
Business intelligence techniques U2.pptx
Business intelligence techniques U2.pptxBusiness intelligence techniques U2.pptx
Business intelligence techniques U2.pptx
 
Business Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptxBusiness Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptx
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics Platform
 
Prez szabolcs
Prez szabolcsPrez szabolcs
Prez szabolcs
 
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient..."Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
 

Recently uploaded

spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
jpsjournal1
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
HODECEDSIET
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
mahammadsalmanmech
 
Recycled Concrete Aggregate in Construction Part II
Recycled Concrete Aggregate in Construction Part IIRecycled Concrete Aggregate in Construction Part II
Recycled Concrete Aggregate in Construction Part II
Aditya Rajan Patra
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
IJNSA Journal
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
NazakatAliKhoso2
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
gerogepatton
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
gerogepatton
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
nooriasukmaningtyas
 
Engine Lubrication performance System.pdf
Engine Lubrication performance System.pdfEngine Lubrication performance System.pdf
Engine Lubrication performance System.pdf
mamamaam477
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Christina Lin
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
IJECEIAES
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
sachin chaurasia
 

Recently uploaded (20)

spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
 
Recycled Concrete Aggregate in Construction Part II
Recycled Concrete Aggregate in Construction Part IIRecycled Concrete Aggregate in Construction Part II
Recycled Concrete Aggregate in Construction Part II
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
 
Engine Lubrication performance System.pdf
Engine Lubrication performance System.pdfEngine Lubrication performance System.pdf
Engine Lubrication performance System.pdf
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
 

20200713152029_PPT4-Business analytics using data science techniques and case study-R1.PPT

  • 1. Course : ISYE8015_Selected Topic in Industrial Engineering Period : June 2020 Business analytics using data science techniques
  • 2. Topic 1. What is business analytics ? 2. Data preparation for business analytics 3. Business analytics and intelligence application framework
  • 3. Business Intelligence and Analytic • Business intelligence – Acquisition of data and information for use in decision-making activities • Business analytics – Models and solution methods • Data mining – Applying models and methods to data to identify patterns and trends
  • 4. Data, Information, Knowledge • Data – Items that are the most elementary descriptions of things, events, activities, and transactions – May be internal or external • Information – Organized data that has meaning and value • Knowledge – Processed data or information that conveys understanding or learning applicable to a problem or activity
  • 6. Database Model • Hierarchical – Top down, like inverted tree – Fields have only one “parent”, each “parent” can have multiple “children” – Fast • Network – Relationships created through linked lists, using pointers – “Children” can have multiple “parents” – Greater flexibility, substantial overhead • Relational – Flat, two-dimensional tables with multiple access queries – Examines relations between multiple tables – Flexible, quick, and extendable with data independence • Object oriented – Data analyzed at conceptual level – Inheritance, abstraction, encapsulation
  • 7.
  • 8. Migrating Data • Business rules – Stored in metadata repository – Applied to data warehouse centrally • Data extracted from all relevant sources – Loaded through data-transformation tools or programs – Separate operation and decision support environments • Correct problems in quality before data stored – Cleanse and organize in consistent manner
  • 9. Business Analytics and intelligence to support visualization • Technologies supporting visualization and interpretation – Digital imaging, GIS, GUI, tables, multidimensions, graphs, VR, 3D, animation – Identify relationships and trends • Data manipulation allows real time look at performance data
  • 10. Data Analytic System • Real-time queries and analysis • Real-time decision-making • Real-time data warehouses updated daily or more frequently –Updates may be made while queries are active –Not all data updated continuously • Deployment of business analytic applications
  • 11. Business Analytics : GIS • Computerized system for managing and manipulating data with digitized maps – Geographically oriented – Geographic spreadsheet for models – Software allows web access to maps – Used for modeling and simulations
  • 12.
  • 13. Business Analytics: Web • Web analytics – Application of business analytics to Web sites • Web intelligence – Application of business intelligence techniques to Web sites
  • 14. Business Analytics and Intelligence Application using Pentaho
  • 15. About Pentaho • Recognized leader in business analytics & data integration • Subscription-based business model • Achieved critical mass: • Over 1,200 commercial customers • Over 10,000 production deployments • Over 185 countries • Stewardship of most important open source analytics projects INDUSTRY RECOGNITION OVER 160 PARTNERS GLOBALLY
  • 16. Pentaho for Big Data Analytic Big Data Mgmt Hadoop Java MapReduce, Pig Pentaho MapReduce NoSQL Databases Analytic Databases Data Integration Job Orchestration Workflow Scheduling High Performance Visual IDE Data Integration Pentaho Business Analytics • R • 3rd Party BI Tools • Applications 3rd Party Tools Big Analytics
  • 17. Business Analytic Model Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation Components are independent
  • 18. High Level Feature/Functions Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 20. Dashboards & Interactive Dashboards for Business Analytics
  • 21. Dashboards – Geo Location-Based
  • 22. High Level Feature/Functions Advanced Power Users & Viewers Data Mining Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 23. Reports – Interactive, Static, Distributed
  • 24. Reports – Reporting Pack & House Styles
  • 25. Reports – Reporting Pack & House Styles
  • 26. High Level Feature/Functions Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 27. Enhanced In-Memory Analytics • Enhanced in-memory caching for speed of thought visualization & analysis – More re-usability of in-memory data – Fewer trips to the database/disk • Builds on existing unique extreme-scale in- memory analytics – Support for external data grids • Infinispan / JBoss Enteprise Data Grid and Memcached • Scale to caching hundreds of GBs (potentially TBs) of data in-memory • Competition – Java heap or C++ memory space (a few GB at most (most BI products) or – Proprietary (hard to manage) in-memory technology (e.g. Qlikview, Microstrategy)
  • 31. High Level Feature/Functions Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 32. “Traditional” Database Support DATA INTEGRATION DATA ANALYSIS
  • 33. Broadest Support for Big Data Platforms Hadoop NoSQL Analytic Databases

Editor's Notes

  1. Even the best interactive visualization is frustrating if the end-user has to sit there interminably waiting for the system to respond. Testing has shown that usage of BI systems drops dramatically once response time starts to exceed 5 seconds, as users tend to lose their train of thought. This is what we mean by “speed of thought” response times – a snappy system that keeps up with thought-train of the user. By avoiding database round-trips, in-memory data caching is a popular and growing approach to providing this performance, But with the dramatic growth of data volumes it is become more and more challenging for traditional BI applications to keep-up. Pentaho is the “only” business analytics provider to use the extreme-scale in-memory caching technology used to power some of the world’s highest volume consumer websites such as Youtube and Amazon.com. That technology is known as data grids – a way of caching large amounts of data across an inexpensive cluster of commodity servers. Pentaho’s analytics supports two of the leading data grids – Infinispan (also known as JBoss Enterprise Data Grid) and Memcached. Traditional in-memory products written in either Java or C++ are constrained to using a limited amount of memory on the server on which they are executing – at most a few GBs. By contrast a data grid can be distributed across a cluster of commodity servers and can address hundreds of GBs, and potentially TBs of memory in future as hardware memory sizes get larger and less expensive. This allows customers to load all or most of their data into memory, so delivering consistent speed of thought responses times, orders of magnitude faster than needing to query a database because the data the user needed was not in-memory. A couple of vendors, Qliktech and Microstrategy, do provide proprietary in-memory caching capabilities. But these require special training and skills to use and maintain – and they are single server solutions so constrained to the amount of physical memory that can be installed on a single server, typically no more than 64GB.