SlideShare a Scribd company logo
1 of 33
Course : ISYE8015_Selected Topic in
Industrial Engineering
Period : June 2020
Business analytics using data
science techniques
Topic
1. What is business analytics ?
2. Data preparation for business analytics
3. Business analytics and intelligence application
framework
Business Intelligence and
Analytic
• Business intelligence
– Acquisition of data and information for use
in decision-making activities
• Business analytics
– Models and solution methods
• Data mining
– Applying models and methods to data to
identify patterns and trends
Data, Information, Knowledge
• Data
– Items that are the most elementary descriptions of
things, events, activities, and transactions
– May be internal or external
• Information
– Organized data that has meaning and value
• Knowledge
– Processed data or information that conveys
understanding or learning applicable to a problem or
activity
Data problems
Database Model
• Hierarchical
– Top down, like inverted tree
– Fields have only one “parent”, each “parent” can have multiple
“children”
– Fast
• Network
– Relationships created through linked lists, using pointers
– “Children” can have multiple “parents”
– Greater flexibility, substantial overhead
• Relational
– Flat, two-dimensional tables with multiple access queries
– Examines relations between multiple tables
– Flexible, quick, and extendable with data independence
• Object oriented
– Data analyzed at conceptual level
– Inheritance, abstraction, encapsulation
Migrating Data
• Business rules
– Stored in metadata repository
– Applied to data warehouse centrally
• Data extracted from all relevant sources
– Loaded through data-transformation tools or
programs
– Separate operation and decision support
environments
• Correct problems in quality before data
stored
– Cleanse and organize in consistent manner
Business Analytics and
intelligence to support
visualization
• Technologies supporting visualization and
interpretation
– Digital imaging, GIS, GUI, tables,
multidimensions, graphs, VR, 3D, animation
– Identify relationships and trends
• Data manipulation allows real time look at
performance data
Data Analytic System
• Real-time queries and analysis
• Real-time decision-making
• Real-time data warehouses updated
daily or more frequently
–Updates may be made while queries
are active
–Not all data updated continuously
• Deployment of business analytic
applications
Business Analytics : GIS
• Computerized system for managing and
manipulating data with digitized maps
– Geographically oriented
– Geographic spreadsheet for models
– Software allows web access to maps
– Used for modeling and simulations
Business Analytics: Web
• Web analytics
– Application of business analytics to Web
sites
• Web intelligence
– Application of business intelligence
techniques to Web sites
Business Analytics and Intelligence
Application using Pentaho
About Pentaho
• Recognized leader in business analytics & data integration
• Subscription-based business model
• Achieved critical mass:
• Over 1,200 commercial customers
• Over 10,000 production deployments
• Over 185 countries
• Stewardship of most important open source analytics
projects
INDUSTRY RECOGNITION OVER 160 PARTNERS GLOBALLY
Pentaho for Big Data Analytic
Big
Data
Mgmt
Hadoop
Java MapReduce, Pig
Pentaho MapReduce
NoSQL Databases Analytic Databases
Data Integration
Job Orchestration
Workflow
Scheduling
High Performance
Visual IDE
Data
Integration
Pentaho Business Analytics
•
R
•
3rd Party BI Tools
•
Applications
3rd Party Tools
Big
Analytics
Business Analytic Model
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Components
are
independent
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Example for Dashboards
Dashboards & Interactive Dashboards
for Business Analytics
Dashboards – Geo Location-Based
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Mining
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Reports – Interactive, Static, Distributed
Reports – Reporting Pack & House Styles
Reports – Reporting Pack & House Styles
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
Enhanced In-Memory Analytics
• Enhanced in-memory caching for speed of
thought visualization & analysis
– More re-usability of in-memory data
– Fewer trips to the database/disk
• Builds on existing unique extreme-scale in-
memory analytics
– Support for external data grids
• Infinispan / JBoss Enteprise Data Grid
and Memcached
• Scale to caching hundreds of GBs
(potentially TBs) of data in-memory
• Competition
– Java heap or C++ memory space (a few GB
at most (most BI products)
or
– Proprietary (hard to manage) in-memory
technology (e.g. Qlikview, Microstrategy)
Analyzer – Table format
Analyzer – Chart format
Analyzer: Geo Location-Based Analysis
High Level Feature/Functions
Advanced
Power Users
& Viewers
Data Science
Information
Consumers
Dashboards
Knowledge
Workers/
Business
Users
Analysis
Business
Users
Reporting
Power Users,
Developers &
DBAs
Data
Advanced
Predictive
Analysis
Self-service Interactive
KPI & Metrics and
Visualization
Self-service Interactive and
Ad Hoc Analysis
Ad hoc and
Operational
Reports
High Performance Data Integration,
BIG DATA, Cleansing
and Presentation
“Traditional” Database Support
DATA INTEGRATION
DATA ANALYSIS
Broadest Support for Big Data
Platforms
Hadoop NoSQL Analytic Databases

More Related Content

Similar to 20200713152029_PPT4-Business analytics using data science techniques and case study-R1.PPT

SG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptxSG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptxssuser57f752
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence ArchitecturePhilippe Julio
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework Vishwanath Ramdas
 
The Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive SolutionsThe Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive SolutionsThe Data Science Institute
 
Modern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceModern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceICFAI Business School
 
Business Analytics Training
Business Analytics TrainingBusiness Analytics Training
Business Analytics TrainingNatalija Pavic
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Amit Fogla
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DATAVERSITY
 
Strategy session 5 - unlocking the data dividend - andy steer
Strategy   session 5 - unlocking the data dividend - andy steerStrategy   session 5 - unlocking the data dividend - andy steer
Strategy session 5 - unlocking the data dividend - andy steerAndy Steer
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseDatabricks
 
Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization iyke ezeugo
 
Business intelligence techniques U2.pptx
Business intelligence techniques U2.pptxBusiness intelligence techniques U2.pptx
Business intelligence techniques U2.pptxRenuLamba8
 
Business Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptxBusiness Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptxRupaRani28
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformArvind Sathi
 
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient..."Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...Dataconomy Media
 

Similar to 20200713152029_PPT4-Business analytics using data science techniques and case study-R1.PPT (20)

SG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptxSG Data Mgt - Findings and Recommendations.pptx
SG Data Mgt - Findings and Recommendations.pptx
 
Big data
Big dataBig data
Big data
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence Architecture
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework
 
The Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive SolutionsThe Data Science Institute-Cognitive Solutions
The Data Science Institute-Cognitive Solutions
 
Modern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceModern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance Excellence
 
KIT601 Unit I.pptx
KIT601 Unit I.pptxKIT601 Unit I.pptx
KIT601 Unit I.pptx
 
Business Analytics Training
Business Analytics TrainingBusiness Analytics Training
Business Analytics Training
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08
 
Chapter 10 supporting decision making
Chapter 10  supporting decision makingChapter 10  supporting decision making
Chapter 10 supporting decision making
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
 
Strategy session 5 - unlocking the data dividend - andy steer
Strategy   session 5 - unlocking the data dividend - andy steerStrategy   session 5 - unlocking the data dividend - andy steer
Strategy session 5 - unlocking the data dividend - andy steer
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization Business intelligence and data analytic for value realization
Business intelligence and data analytic for value realization
 
Business intelligence techniques U2.pptx
Business intelligence techniques U2.pptxBusiness intelligence techniques U2.pptx
Business intelligence techniques U2.pptx
 
Business Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptxBusiness Intelligence and Analytics .pptx
Business Intelligence and Analytics .pptx
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics Platform
 
Prez szabolcs
Prez szabolcsPrez szabolcs
Prez szabolcs
 
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient..."Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
 

Recently uploaded

Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxMustafa Ahmed
 
15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon15-Minute City: A Completely New Horizon
15-Minute City: A Completely New HorizonMorshed Ahmed Rahath
 
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdfInvolute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdfJNTUA
 
Seizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networksSeizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networksIJECEIAES
 
Filters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsFilters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsMathias Magdowski
 
Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...IJECEIAES
 
History of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & ModernizationHistory of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & ModernizationEmaan Sharma
 
Software Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfSoftware Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfssuser5c9d4b1
 
Adsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) pptAdsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) pptjigup7320
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Ramkumar k
 
21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological universityMohd Saifudeen
 
Artificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdfArtificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdfKira Dess
 
What is Coordinate Measuring Machine? CMM Types, Features, Functions
What is Coordinate Measuring Machine? CMM Types, Features, FunctionsWhat is Coordinate Measuring Machine? CMM Types, Features, Functions
What is Coordinate Measuring Machine? CMM Types, Features, FunctionsVIEW
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...drjose256
 
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024EMMANUELLEFRANCEHELI
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxMustafa Ahmed
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptamrabdallah9
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxCHAIRMAN M
 
Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1T.D. Shashikala
 
Raashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashidFaiyazSheikh
 

Recently uploaded (20)

Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
 
15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon
 
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdfInvolute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
 
Seizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networksSeizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networks
 
Filters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsFilters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility Applications
 
Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...
 
History of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & ModernizationHistory of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & Modernization
 
Software Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfSoftware Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdf
 
Adsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) pptAdsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) ppt
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)
 
21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university
 
Artificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdfArtificial intelligence presentation2-171219131633.pdf
Artificial intelligence presentation2-171219131633.pdf
 
What is Coordinate Measuring Machine? CMM Types, Features, Functions
What is Coordinate Measuring Machine? CMM Types, Features, FunctionsWhat is Coordinate Measuring Machine? CMM Types, Features, Functions
What is Coordinate Measuring Machine? CMM Types, Features, Functions
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
 
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.ppt
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
 
Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1
 
Raashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashid final report on Embedded Systems
Raashid final report on Embedded Systems
 

20200713152029_PPT4-Business analytics using data science techniques and case study-R1.PPT

  • 1. Course : ISYE8015_Selected Topic in Industrial Engineering Period : June 2020 Business analytics using data science techniques
  • 2. Topic 1. What is business analytics ? 2. Data preparation for business analytics 3. Business analytics and intelligence application framework
  • 3. Business Intelligence and Analytic • Business intelligence – Acquisition of data and information for use in decision-making activities • Business analytics – Models and solution methods • Data mining – Applying models and methods to data to identify patterns and trends
  • 4. Data, Information, Knowledge • Data – Items that are the most elementary descriptions of things, events, activities, and transactions – May be internal or external • Information – Organized data that has meaning and value • Knowledge – Processed data or information that conveys understanding or learning applicable to a problem or activity
  • 6. Database Model • Hierarchical – Top down, like inverted tree – Fields have only one “parent”, each “parent” can have multiple “children” – Fast • Network – Relationships created through linked lists, using pointers – “Children” can have multiple “parents” – Greater flexibility, substantial overhead • Relational – Flat, two-dimensional tables with multiple access queries – Examines relations between multiple tables – Flexible, quick, and extendable with data independence • Object oriented – Data analyzed at conceptual level – Inheritance, abstraction, encapsulation
  • 7.
  • 8. Migrating Data • Business rules – Stored in metadata repository – Applied to data warehouse centrally • Data extracted from all relevant sources – Loaded through data-transformation tools or programs – Separate operation and decision support environments • Correct problems in quality before data stored – Cleanse and organize in consistent manner
  • 9. Business Analytics and intelligence to support visualization • Technologies supporting visualization and interpretation – Digital imaging, GIS, GUI, tables, multidimensions, graphs, VR, 3D, animation – Identify relationships and trends • Data manipulation allows real time look at performance data
  • 10. Data Analytic System • Real-time queries and analysis • Real-time decision-making • Real-time data warehouses updated daily or more frequently –Updates may be made while queries are active –Not all data updated continuously • Deployment of business analytic applications
  • 11. Business Analytics : GIS • Computerized system for managing and manipulating data with digitized maps – Geographically oriented – Geographic spreadsheet for models – Software allows web access to maps – Used for modeling and simulations
  • 12.
  • 13. Business Analytics: Web • Web analytics – Application of business analytics to Web sites • Web intelligence – Application of business intelligence techniques to Web sites
  • 14. Business Analytics and Intelligence Application using Pentaho
  • 15. About Pentaho • Recognized leader in business analytics & data integration • Subscription-based business model • Achieved critical mass: • Over 1,200 commercial customers • Over 10,000 production deployments • Over 185 countries • Stewardship of most important open source analytics projects INDUSTRY RECOGNITION OVER 160 PARTNERS GLOBALLY
  • 16. Pentaho for Big Data Analytic Big Data Mgmt Hadoop Java MapReduce, Pig Pentaho MapReduce NoSQL Databases Analytic Databases Data Integration Job Orchestration Workflow Scheduling High Performance Visual IDE Data Integration Pentaho Business Analytics • R • 3rd Party BI Tools • Applications 3rd Party Tools Big Analytics
  • 17. Business Analytic Model Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation Components are independent
  • 18. High Level Feature/Functions Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 20. Dashboards & Interactive Dashboards for Business Analytics
  • 21. Dashboards – Geo Location-Based
  • 22. High Level Feature/Functions Advanced Power Users & Viewers Data Mining Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 23. Reports – Interactive, Static, Distributed
  • 24. Reports – Reporting Pack & House Styles
  • 25. Reports – Reporting Pack & House Styles
  • 26. High Level Feature/Functions Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 27. Enhanced In-Memory Analytics • Enhanced in-memory caching for speed of thought visualization & analysis – More re-usability of in-memory data – Fewer trips to the database/disk • Builds on existing unique extreme-scale in- memory analytics – Support for external data grids • Infinispan / JBoss Enteprise Data Grid and Memcached • Scale to caching hundreds of GBs (potentially TBs) of data in-memory • Competition – Java heap or C++ memory space (a few GB at most (most BI products) or – Proprietary (hard to manage) in-memory technology (e.g. Qlikview, Microstrategy)
  • 31. High Level Feature/Functions Advanced Power Users & Viewers Data Science Information Consumers Dashboards Knowledge Workers/ Business Users Analysis Business Users Reporting Power Users, Developers & DBAs Data Advanced Predictive Analysis Self-service Interactive KPI & Metrics and Visualization Self-service Interactive and Ad Hoc Analysis Ad hoc and Operational Reports High Performance Data Integration, BIG DATA, Cleansing and Presentation
  • 32. “Traditional” Database Support DATA INTEGRATION DATA ANALYSIS
  • 33. Broadest Support for Big Data Platforms Hadoop NoSQL Analytic Databases

Editor's Notes

  1. Even the best interactive visualization is frustrating if the end-user has to sit there interminably waiting for the system to respond. Testing has shown that usage of BI systems drops dramatically once response time starts to exceed 5 seconds, as users tend to lose their train of thought. This is what we mean by “speed of thought” response times – a snappy system that keeps up with thought-train of the user. By avoiding database round-trips, in-memory data caching is a popular and growing approach to providing this performance, But with the dramatic growth of data volumes it is become more and more challenging for traditional BI applications to keep-up. Pentaho is the “only” business analytics provider to use the extreme-scale in-memory caching technology used to power some of the world’s highest volume consumer websites such as Youtube and Amazon.com. That technology is known as data grids – a way of caching large amounts of data across an inexpensive cluster of commodity servers. Pentaho’s analytics supports two of the leading data grids – Infinispan (also known as JBoss Enterprise Data Grid) and Memcached. Traditional in-memory products written in either Java or C++ are constrained to using a limited amount of memory on the server on which they are executing – at most a few GBs. By contrast a data grid can be distributed across a cluster of commodity servers and can address hundreds of GBs, and potentially TBs of memory in future as hardware memory sizes get larger and less expensive. This allows customers to load all or most of their data into memory, so delivering consistent speed of thought responses times, orders of magnitude faster than needing to query a database because the data the user needed was not in-memory. A couple of vendors, Qliktech and Microstrategy, do provide proprietary in-memory caching capabilities. But these require special training and skills to use and maintain – and they are single server solutions so constrained to the amount of physical memory that can be installed on a single server, typically no more than 64GB.