SlideShare a Scribd company logo
Big Data Insight




Blueprint for Integrating Big Data Analytics and BI
Abe Taha, VP Engineering
abetaha@karmasphere.com




www.karmasphere.com
Big Data Insight


>  Agenda



ü  Where does Big Data Analytics fit in the BI ecosystem
ü  How does Big Data Analytics complement the type of analysis we do today using BI
ü  What are clients doing with Big Data Analytics that they couldn’t do with BI
ü  What do we need to think about to make Hadoop deployments successful




2                                                Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


>  Hadoop not standing alone
Big Data Insight


>  Parallel and Complementary Stacks
Big Data Insight


    >  The Best of Both Worlds = Big Data Analytics + Traditional BI


                           Traditional BI                     Big Data Analytics
       Purpose             Reporting on business              Optimizing the business
       Paradigm            Ask a specific question            Ask any question
       Format              Look at structured data            Look at all data
       Setup               Pre-engineered                     On-the-fly
       Data locations       Siloed                            One place
       Agility              Weeks to months                   Almost Immediate




5                                                    Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


    >  Big Data Analytics on Hadoop Use Cases



        Product
      Optimization      •  Insight to usage patterns, bug paths, quality outages
                        •  Outline new features, improve product roadmap and process
                        •  Enhance customer service, quality and product “stickiness”


     Unified Customer
           View         •  Insight to correlations across product lines and interaction channels
                        •  Personalize offers, services and customer experience
                        •  Reduce churn and increase customer satisfaction


       Marketing
      Performance       •  Insight to market program attribution and ROI
                        •  Increase customer targeting through micro-segmentation
                        •  Optimize online ads and cross channel programs




6                                                                                   © Karmasphere 2012
Big Data Insight


    >  What Hadoop Adopters Are Saying



      “The kind of new stuff
         we want to do
       can’t get done with
                BI“
           Large Hi Tech Chip Manufacturer

7                                        Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


>  How to make Hadoop successful with BI



1.  Employ All Data
2.  Use All Analytic Assets
3.  Provide Self-Service Access for All Users
4.  Build a Collaborative Environment
5.  Be Open and Extensible
6.  Populate Best-of-Breed Reporting Tools
Big Data Insight


>  Cornerstone 1: Employ All Data



ü  Leave No Data Behind
  •    Raw unstructured – Web logs, machine /
       sensor data, mobile social, video, etc.

  •    Structured data – traditional RDMBS, EDW’s

  •    Streaming vs. batch oriented

  •    Data governance and quality
Big Data Insight


>  Cornerstone 2: Use All Analytic Assets



ü  Employ All Analytic Assets
   •    Traditional models and assets

   •    Standard Hadoop components including
        UDFs and SerDes

   •    Custom algorithms

   •    Models created in other systems such as
        SAS/R
Big Data Insight


>  Cornerstone 3: Provide Self-Service Access for All Users



ü  Self-Service
•    BYOD: Bring Your Own Data
•    Ingest custom functions and algorithms
•    Intuitive, no special skill sets required

ü  Empower All Users and Skill Sets
•    Business User
     •    Easy-to-use ad-hoc analysis, web-based forms
     •    Drag and drop

•    Data Analysts
     •    Common skills: SQL
     •    Powerful iterative analysis
     •    Analytical models and algorithms

•    Customers and Partners for ecosystem
Big Data Insight


>  Cornerstone 4: Build a Collaborative Environment



ü  Collaborative
•  Project-based environment

•  Leverage cross-functional skills

•  Security and isolation

ü  Social
•  Share data and insights across teams
   •    Metadata, Queries, Results and Visualizations

•  View colleague’s activities

•  Usage feedback and metrics
Big Data Insight


>  Cornerstone 5: Be Open and Extensible



ü  Open
•  Active community, rapid innovation

•  Vendor commitment

•  Standards based
•  Portable - No vendor lock-in

•  Expose standard API’s and interfaces


ü  Extensible
•  Add custom functions

•  Reuse existing analytic models
•  Add additional data sources by defining custom parsers
Big Data Insight


>      Cornerstone 6: Populate Best-of-Breed Reporting Tools



ü  Best-Of-Breed Reporting tools
•  Ingest data from existing BI systems and ad hoc data including
     Spreadsheet data

•  Automate delivery of insights

•  Push insights to RDBMS, EDW’s and MPP

•  Expose standards APIs for programmability
Big Data Insight


     >  How would an architecture look




15                                       Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


      >  Summary


1.  Implement Big Data Analytics and BI co-existence   Hadoop at your fingertips
2.  Leverage all your assets
3.  Use and build on open and extensible solutions     across your company…
4.  Build social and collaborative in early            	
  




                                                                            Private and Confidential
Big Data Insight


>  Summary Get the Best of Both Worlds – Build a Bridge Inside Your Company


                                           Big Data Analytics on Hadoop
                                           Future, see intent
                                           Drives Optimization
   BI                                      Just getting started
   Historical
   Drives reporting
   Entrenched
   Be around for a long time
Questions?
abetaha@karmasphere.com	
  
www.karmasphere.com	
  
	
  

More Related Content

What's hot

Datamensional Business Intelligence and Data Services
Datamensional Business Intelligence and Data ServicesDatamensional Business Intelligence and Data Services
Datamensional Business Intelligence and Data Services
Datamensional
 
Need of business intelligence
Need of business intelligenceNeed of business intelligence
Need of business intelligence
Vivek Mohan
 
Location Intelligence - the Next Evolution of Business Applications
Location Intelligence - the Next Evolution of Business ApplicationsLocation Intelligence - the Next Evolution of Business Applications
Location Intelligence - the Next Evolution of Business Applications
MISNet - Integeo SE Asia
 
How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?
Thanakrit Lersmethasakul
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
Sukirti Garg
 
Overview of Business Intelligence
Overview of Business IntelligenceOverview of Business Intelligence
Overview of Business Intelligence
Parthiv Dixit
 
Self-Service BI Trends
Self-Service BI TrendsSelf-Service BI Trends
Self-Service BI Trends
Netwoven Inc.
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bank
Chungsik Yun
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in Manufacturing
Nitesh Khilwani
 
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNLInstant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
Richard Neale
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer Stories
Yellowfin
 
Spring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users GroupSpring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users Group
Gross, Mendelsohn & Associates
 
New Approach to Supply Chain Analytics
New Approach to Supply Chain AnalyticsNew Approach to Supply Chain Analytics
New Approach to Supply Chain Analytics
demando
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
The Evolution of Business Intelligence
The Evolution of Business IntelligenceThe Evolution of Business Intelligence
The Evolution of Business Intelligence
Call Sumo
 
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
MITX
 
Tools and techniques for predictive analytics
Tools and techniques for predictive analyticsTools and techniques for predictive analytics
Tools and techniques for predictive analytics
RohanKumarJumnani
 
Data analytics as a service
Data analytics as a serviceData analytics as a service
Data analytics as a service
Stanley Wang
 
The Present - the History of Business Intelligence
The Present - the History of Business IntelligenceThe Present - the History of Business Intelligence
The Present - the History of Business Intelligence
Phocas Software
 

What's hot (20)

Datamensional Business Intelligence and Data Services
Datamensional Business Intelligence and Data ServicesDatamensional Business Intelligence and Data Services
Datamensional Business Intelligence and Data Services
 
Need of business intelligence
Need of business intelligenceNeed of business intelligence
Need of business intelligence
 
Location Intelligence - the Next Evolution of Business Applications
Location Intelligence - the Next Evolution of Business ApplicationsLocation Intelligence - the Next Evolution of Business Applications
Location Intelligence - the Next Evolution of Business Applications
 
The evolution of Business Intelligence
The evolution of Business IntelligenceThe evolution of Business Intelligence
The evolution of Business Intelligence
 
How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Overview of Business Intelligence
Overview of Business IntelligenceOverview of Business Intelligence
Overview of Business Intelligence
 
Self-Service BI Trends
Self-Service BI TrendsSelf-Service BI Trends
Self-Service BI Trends
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bank
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in Manufacturing
 
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNLInstant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer Stories
 
Spring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users GroupSpring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users Group
 
New Approach to Supply Chain Analytics
New Approach to Supply Chain AnalyticsNew Approach to Supply Chain Analytics
New Approach to Supply Chain Analytics
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
 
The Evolution of Business Intelligence
The Evolution of Business IntelligenceThe Evolution of Business Intelligence
The Evolution of Business Intelligence
 
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
 
Tools and techniques for predictive analytics
Tools and techniques for predictive analyticsTools and techniques for predictive analytics
Tools and techniques for predictive analytics
 
Data analytics as a service
Data analytics as a serviceData analytics as a service
Data analytics as a service
 
The Present - the History of Business Intelligence
The Present - the History of Business IntelligenceThe Present - the History of Business Intelligence
The Present - the History of Business Intelligence
 

Viewers also liked

Malaysia Big Data Analytics Initiative: 2015 Imperatives
Malaysia Big Data Analytics Initiative: 2015 ImperativesMalaysia Big Data Analytics Initiative: 2015 Imperatives
Malaysia Big Data Analytics Initiative: 2015 Imperatives
Peter Kua
 
Text visualization - by Jeff Clark
Text visualization -  by Jeff ClarkText visualization -  by Jeff Clark
Text visualization - by Jeff Clark
Cindy Xiao
 
Bi on Big Data - Strata 2016 in London
Bi on Big Data - Strata 2016 in LondonBi on Big Data - Strata 2016 in London
Bi on Big Data - Strata 2016 in London
Dremio Corporation
 
Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)
mark madsen
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BICC Thomas More
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BI
DeZyre
 
What is bi analytics and big data
What is bi analytics and big dataWhat is bi analytics and big data
What is bi analytics and big data
galiasisense
 
Big Data and BI Best Practices
Big Data and BI Best PracticesBig Data and BI Best Practices
Big Data and BI Best Practices
Yellowfin
 
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
Amazon Web Services
 
Analytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolutionAnalytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolution
Deloitte United States
 
Big Data visualization with Apache Spark and Zeppelin
Big Data visualization with Apache Spark and ZeppelinBig Data visualization with Apache Spark and Zeppelin
Big Data visualization with Apache Spark and Zeppelin
prajods
 

Viewers also liked (11)

Malaysia Big Data Analytics Initiative: 2015 Imperatives
Malaysia Big Data Analytics Initiative: 2015 ImperativesMalaysia Big Data Analytics Initiative: 2015 Imperatives
Malaysia Big Data Analytics Initiative: 2015 Imperatives
 
Text visualization - by Jeff Clark
Text visualization -  by Jeff ClarkText visualization -  by Jeff Clark
Text visualization - by Jeff Clark
 
Bi on Big Data - Strata 2016 in London
Bi on Big Data - Strata 2016 in LondonBi on Big Data - Strata 2016 in London
Bi on Big Data - Strata 2016 in London
 
Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BI
 
What is bi analytics and big data
What is bi analytics and big dataWhat is bi analytics and big data
What is bi analytics and big data
 
Big Data and BI Best Practices
Big Data and BI Best PracticesBig Data and BI Best Practices
Big Data and BI Best Practices
 
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
 
Analytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolutionAnalytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolution
 
Big Data visualization with Apache Spark and Zeppelin
Big Data visualization with Apache Spark and ZeppelinBig Data visualization with Apache Spark and Zeppelin
Big Data visualization with Apache Spark and Zeppelin
 

Similar to Blueprint for integrating big data analytics and bi

Karmasphere bdabi blueprint- final
Karmasphere bdabi blueprint- finalKarmasphere bdabi blueprint- final
Karmasphere bdabi blueprint- finalAbe Taha
 
Big data and bi best practices slidedeck
Big data and bi best practices slidedeckBig data and bi best practices slidedeck
Big data and bi best practices slidedeck
Actian Corporation
 
Time to Fly - Why Predictive Analytics is Going Mainstream
Time to Fly - Why Predictive Analytics is Going MainstreamTime to Fly - Why Predictive Analytics is Going Mainstream
Time to Fly - Why Predictive Analytics is Going Mainstream
Inside Analysis
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the Organization
Seeling Cheung
 
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
BigMine
 
Big data analytics - hadoop
Big data analytics - hadoopBig data analytics - hadoop
Big data analytics - hadoop
Vishwajeet Jadeja
 
Hot Technologies of 2013: Hadoop 2.0
Hot Technologies of 2013: Hadoop 2.0Hot Technologies of 2013: Hadoop 2.0
Hot Technologies of 2013: Hadoop 2.0
Inside Analysis
 
Big Data in Azure
Big Data in AzureBig Data in Azure
New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013
IBM Sverige
 
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data:  InterConnect 2016 Session on Getting Started with Big Data AnalyticsBig Data:  InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cynthia Saracco
 
Bi 4.0 Migration Strategy and Best Practices
Bi 4.0 Migration Strategy and Best PracticesBi 4.0 Migration Strategy and Best Practices
Bi 4.0 Migration Strategy and Best PracticesEric Molner
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
Mark Kromer
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
Cloudera, Inc.
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
Sri Kanth
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
Scaling Data overview
Scaling Data overviewScaling Data overview
Scaling Data overview
Wade Malone
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfully
Adir Sharabi
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 

Similar to Blueprint for integrating big data analytics and bi (20)

Karmasphere bdabi blueprint- final
Karmasphere bdabi blueprint- finalKarmasphere bdabi blueprint- final
Karmasphere bdabi blueprint- final
 
Big data and bi best practices slidedeck
Big data and bi best practices slidedeckBig data and bi best practices slidedeck
Big data and bi best practices slidedeck
 
Time to Fly - Why Predictive Analytics is Going Mainstream
Time to Fly - Why Predictive Analytics is Going MainstreamTime to Fly - Why Predictive Analytics is Going Mainstream
Time to Fly - Why Predictive Analytics is Going Mainstream
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the Organization
 
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
 
Big data analytics - hadoop
Big data analytics - hadoopBig data analytics - hadoop
Big data analytics - hadoop
 
Hot Technologies of 2013: Hadoop 2.0
Hot Technologies of 2013: Hadoop 2.0Hot Technologies of 2013: Hadoop 2.0
Hot Technologies of 2013: Hadoop 2.0
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013
 
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data:  InterConnect 2016 Session on Getting Started with Big Data AnalyticsBig Data:  InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
 
Bi 4.0 Migration Strategy and Best Practices
Bi 4.0 Migration Strategy and Best PracticesBi 4.0 Migration Strategy and Best Practices
Bi 4.0 Migration Strategy and Best Practices
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Scaling Data overview
Scaling Data overviewScaling Data overview
Scaling Data overview
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfully
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
 

More from DataWorks Summit

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
DataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 

Recently uploaded (20)

PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 

Blueprint for integrating big data analytics and bi

  • 1. Big Data Insight Blueprint for Integrating Big Data Analytics and BI Abe Taha, VP Engineering abetaha@karmasphere.com www.karmasphere.com
  • 2. Big Data Insight >  Agenda ü  Where does Big Data Analytics fit in the BI ecosystem ü  How does Big Data Analytics complement the type of analysis we do today using BI ü  What are clients doing with Big Data Analytics that they couldn’t do with BI ü  What do we need to think about to make Hadoop deployments successful 2 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 3. Big Data Insight >  Hadoop not standing alone
  • 4. Big Data Insight >  Parallel and Complementary Stacks
  • 5. Big Data Insight >  The Best of Both Worlds = Big Data Analytics + Traditional BI Traditional BI Big Data Analytics Purpose Reporting on business Optimizing the business Paradigm Ask a specific question Ask any question Format Look at structured data Look at all data Setup Pre-engineered On-the-fly Data locations Siloed One place Agility Weeks to months Almost Immediate 5 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 6. Big Data Insight >  Big Data Analytics on Hadoop Use Cases Product Optimization •  Insight to usage patterns, bug paths, quality outages •  Outline new features, improve product roadmap and process •  Enhance customer service, quality and product “stickiness” Unified Customer View •  Insight to correlations across product lines and interaction channels •  Personalize offers, services and customer experience •  Reduce churn and increase customer satisfaction Marketing Performance •  Insight to market program attribution and ROI •  Increase customer targeting through micro-segmentation •  Optimize online ads and cross channel programs 6 © Karmasphere 2012
  • 7. Big Data Insight >  What Hadoop Adopters Are Saying “The kind of new stuff we want to do can’t get done with BI“ Large Hi Tech Chip Manufacturer 7 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 8. Big Data Insight >  How to make Hadoop successful with BI 1.  Employ All Data 2.  Use All Analytic Assets 3.  Provide Self-Service Access for All Users 4.  Build a Collaborative Environment 5.  Be Open and Extensible 6.  Populate Best-of-Breed Reporting Tools
  • 9. Big Data Insight >  Cornerstone 1: Employ All Data ü  Leave No Data Behind •  Raw unstructured – Web logs, machine / sensor data, mobile social, video, etc. •  Structured data – traditional RDMBS, EDW’s •  Streaming vs. batch oriented •  Data governance and quality
  • 10. Big Data Insight >  Cornerstone 2: Use All Analytic Assets ü  Employ All Analytic Assets •  Traditional models and assets •  Standard Hadoop components including UDFs and SerDes •  Custom algorithms •  Models created in other systems such as SAS/R
  • 11. Big Data Insight >  Cornerstone 3: Provide Self-Service Access for All Users ü  Self-Service •  BYOD: Bring Your Own Data •  Ingest custom functions and algorithms •  Intuitive, no special skill sets required ü  Empower All Users and Skill Sets •  Business User •  Easy-to-use ad-hoc analysis, web-based forms •  Drag and drop •  Data Analysts •  Common skills: SQL •  Powerful iterative analysis •  Analytical models and algorithms •  Customers and Partners for ecosystem
  • 12. Big Data Insight >  Cornerstone 4: Build a Collaborative Environment ü  Collaborative •  Project-based environment •  Leverage cross-functional skills •  Security and isolation ü  Social •  Share data and insights across teams •  Metadata, Queries, Results and Visualizations •  View colleague’s activities •  Usage feedback and metrics
  • 13. Big Data Insight >  Cornerstone 5: Be Open and Extensible ü  Open •  Active community, rapid innovation •  Vendor commitment •  Standards based •  Portable - No vendor lock-in •  Expose standard API’s and interfaces ü  Extensible •  Add custom functions •  Reuse existing analytic models •  Add additional data sources by defining custom parsers
  • 14. Big Data Insight >  Cornerstone 6: Populate Best-of-Breed Reporting Tools ü  Best-Of-Breed Reporting tools •  Ingest data from existing BI systems and ad hoc data including Spreadsheet data •  Automate delivery of insights •  Push insights to RDBMS, EDW’s and MPP •  Expose standards APIs for programmability
  • 15. Big Data Insight >  How would an architecture look 15 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 16. Big Data Insight >  Summary 1.  Implement Big Data Analytics and BI co-existence Hadoop at your fingertips 2.  Leverage all your assets 3.  Use and build on open and extensible solutions across your company… 4.  Build social and collaborative in early   Private and Confidential
  • 17. Big Data Insight >  Summary Get the Best of Both Worlds – Build a Bridge Inside Your Company Big Data Analytics on Hadoop Future, see intent Drives Optimization BI Just getting started Historical Drives reporting Entrenched Be around for a long time