SlideShare a Scribd company logo
Big Data and BI
Best Practices
Your presenters Year
          | Last

Yellowfin CEO, Glen Rabie




VP Sales & Services in APAC, Actian
Corporation, Jason Leonidas
Your presenters Year
          | Last

Yellowfin CEO, Glen Rabie




General Manager, Actian
Vectorwise, Fred Gallagher
About Actian and Yellowfin



Making Business Intelligence easy                     Taking Action on Big Data

                                    History of 100GB TPC-H Performance Benchmarks
                                                     Composite Queries Per Hour (Non-Clustered)

                                                  500,000.00

                                                  400,000.00




                                     QphH@100GB
                                                  300,000.00

                                                  200,000.00

                                                  100,000.00

                                                          -

                                                          Non-Vectorwise                 Vectorwise
Data is
the new
oil
David McCandles
Data Journalist
The rise and rise of Big Data
There has always been Big Data…

   Its just that now we can actually capture and mine it
   effectively.




Canadian Tar Fields
Not all Big Data is created Equal

Planet Google and friends are the outliers



                         Large Telco             The Norm

                                             .
Do you have a Big Data problem?
Big Data for Everyone

• Big data is not just for data scientists and bespoke
  projects
• Its for decision makers and data consumers
• It needs to be anchored in the real world




                                                Analyst
                                                Consumers
Who is benefitting from Big Data?
Who is benefitting from Big Data?
Why bother with Big Data?



          of organizations collect
60%       more data than they can
          effectively use
          (MIT Sloan Management Review)
Why bother with Big Data?


          of organizations see
70%       Big Data as a big
          business opportunity
          (Harris Interactive)

          of organizations investing
70%       in Big Data initiatives
          expect ROI within 1 year
          (Harris Interactive)
Why bother with Big Data?



          of organizations that
84%       actively leverage Big Data
          say they can now make
          better decisions
          (Avanade)
Best Practices in
   Big Data
Jason Leonidas, Actian Corporation
Best Practices in
   Big Data
Fred Gallagher, Actian Corporation
What is Big Data?
Best Practice #1




       Focus on what
     you want to achieve
It’s all about driving value
Big Data Levers


1.   Personalization
2.   Social
3.   Search
4.   Find opportunities
5.   Actionable Insights
Best Practice #2




 Identify the data you have
              vs
    The data you need
Does your data match what you want
to achieve?
What data do you need?
Best Practice #3




    Use the right Big Data
       tool for the job
Big Data and Hadoop
Big Data Eco-system

            Social
            Media                              Analytic
                                 Hadoop
                                              Databases


                       Storage
                                   BIG
                     Search       DATA          NewSQL
                                              “as-a-service”

                                  NoSQL
                                  Document
  Operational                      BigTable
   Database                       Key Value
                                    Graph
Best Practice #4




     Use a fast database
Slow Query Performance is the
#1 issue in BI

BI Survey 10: Why BI Projects Fail?
1. Query Performance Too Slow


TDWI Best Practices Report
“45% Poor Query Response the top problem that will eventually
drive users to replace their current data warehouse platform.”


Gartner Magic Quadrant Data Warehousing
70% of data warehouses experience performance
constrained issues of various types
User Expectations

                        Web-Based
                        Business Intelligence
                        Users expect results in
                        less than 10 seconds



                Mobile BI
Users expect results in less
           than 3 seconds
Use a fast database




Traditional Database   Analytical Database   Clustered Database
Consider the hidden costs

      Spend Less on Hardware
      Get faster results on smaller
      hardware configurations.



                    Spend Less Time
                    Database Tuning
                 Faster deployment and BI
                         projects. No more
               aggregates, cubes, complex
                              schemas, etc
Best Practice #5




        Plan for a mixed
          architecture
Hadoop and BI architecture



  Hadoop



Transactional

                Fast Database   BI Tool


  External
Best Practice #6




Ensure mass distribution of
        your data
Big Data for Everyone


        Visualizations


         Alerts


        Access Anywhere
Best Practice #7




    Tailor data delivery to
        each audience
Give your audience what they want

Demographics   Interactive Reports      Statistics




      KPIs           Maps            Collaboration
Visualization is powerful

     Looks like Pac-man   Does not look like Pac-man
            169                      41




                               Looks like Pac-man

                               Does not look like
                               Pac-man
Big Data Visualization Tips


•   More data requires more focus
•   Interactivity is essential
•   Select the right metrics
•   Provide context
•   Support and prompt action
Demonstration
Big Data and BI Best Practices

1. Focus on what you want to achieve
2. Identify the data you have vs The data you
   need
3. Use the right Big Data tool for the job
4. Use a fast database
5. Plan for a mixed architecture
6. Ensure mass distribution of your data
7. Tailor data delivery to each audience
Conclusion
Questions
| Last Year
 More Information

Yellowfin
www.yellowfinbi.com

Vectorwise
www.actian.com/products/vectorwise        @YellowfinBI
                                          @ActianCorp




Feedback & Questions
pr@yellowfin.bi               Yellowfin LinkedIn User Group
                             Vectorwise LinkedIn User Group

More Related Content

What's hot

Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Simplilearn
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
Md. Salman Ahmed
 
Databricks Delta Lake and Its Benefits
Databricks Delta Lake and Its BenefitsDatabricks Delta Lake and Its Benefits
Databricks Delta Lake and Its Benefits
Databricks
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Hadi Fadlallah
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
James Serra
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
Adam Doyle
 
Snowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for EveryoneSnowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for Everyone
Angel Abundez
 
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei VaranovichLambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Databricks
 
Clickstream Analysis with Apache Spark
Clickstream Analysis with Apache SparkClickstream Analysis with Apache Spark
Clickstream Analysis with Apache Spark
QAware GmbH
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
Databricks
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture Strategies
DATAVERSITY
 
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Simplilearn
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture Design
Kujambu Murugesan
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
Sergio Zenatti Filho
 
Delta lake and the delta architecture
Delta lake and the delta architectureDelta lake and the delta architecture
Delta lake and the delta architecture
Adam Doyle
 
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DATAVERSITY
 
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDriving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
DATAVERSITY
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
DATAVERSITY
 

What's hot (20)

Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Databricks Delta Lake and Its Benefits
Databricks Delta Lake and Its BenefitsDatabricks Delta Lake and Its Benefits
Databricks Delta Lake and Its Benefits
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
 
Snowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for EveryoneSnowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for Everyone
 
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei VaranovichLambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
 
Clickstream Analysis with Apache Spark
Clickstream Analysis with Apache SparkClickstream Analysis with Apache Spark
Clickstream Analysis with Apache Spark
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture Strategies
 
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture Design
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
 
Delta lake and the delta architecture
Delta lake and the delta architectureDelta lake and the delta architecture
Delta lake and the delta architecture
 
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
 
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDriving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 

Viewers also liked

How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?
Thanakrit Lersmethasakul
 
Big Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsBig Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data Analytics
Systems Limited
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
Bernard Marr
 
Big Data vs Data Warehousing
Big Data vs Data WarehousingBig Data vs Data Warehousing
Big Data vs Data Warehousing
Thomas Kejser
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BICC Thomas More
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BI
DeZyre
 
What is big data?
What is big data?What is big data?
What is big data?
David Wellman
 
SAP’s vision and strategy on BI & BIG (and small) data
SAP’s vision and strategy on BI & BIG (and small) dataSAP’s vision and strategy on BI & BIG (and small) data
SAP’s vision and strategy on BI & BIG (and small) data
Waldemar Adams
 
Implementing business intelligence
Implementing business intelligenceImplementing business intelligence
Implementing business intelligence
Alistair Sergeant
 
Laura Madsen Healthcare Business Intelligence & Big Data Analytics
Laura Madsen Healthcare Business Intelligence & Big Data AnalyticsLaura Madsen Healthcare Business Intelligence & Big Data Analytics
Laura Madsen Healthcare Business Intelligence & Big Data Analytics
Pivotal Analytics (Cetas Analytics)
 
Stunning, multi-device, responsive websites
Stunning, multi-device, responsive websitesStunning, multi-device, responsive websites
Stunning, multi-device, responsive websites
Inventeam Solutions Pvt. Ltd.
 
BSI Teradata: The Case of the Dropped Mobile Calls
BSI Teradata: The Case of the Dropped Mobile CallsBSI Teradata: The Case of the Dropped Mobile Calls
BSI Teradata: The Case of the Dropped Mobile Calls
Teradata
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
James Hendler
 
Data Exploration & BI
Data Exploration & BIData Exploration & BI
Data Exploration & BI
Cristian Guajardo-Garcia
 
TEI of IBM Information Management Solutions
TEI of IBM Information Management SolutionsTEI of IBM Information Management Solutions
TEI of IBM Information Management Solutions
IBM Analytics
 
Self Service Buisness Intelligence - Tech Talk
Self Service Buisness Intelligence - Tech TalkSelf Service Buisness Intelligence - Tech Talk
Self Service Buisness Intelligence - Tech Talk
Brandix i3
 
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics
 
General Presentation The Selfservice company
General Presentation The Selfservice companyGeneral Presentation The Selfservice company
General Presentation The Selfservice company
The Selfservice Company
 

Viewers also liked (20)

How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?
 
Big Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsBig Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data Analytics
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Big Data vs Data Warehousing
Big Data vs Data WarehousingBig Data vs Data Warehousing
Big Data vs Data Warehousing
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BI
 
What is big data?
What is big data?What is big data?
What is big data?
 
Mind reading-computer
Mind reading-computerMind reading-computer
Mind reading-computer
 
SAP’s vision and strategy on BI & BIG (and small) data
SAP’s vision and strategy on BI & BIG (and small) dataSAP’s vision and strategy on BI & BIG (and small) data
SAP’s vision and strategy on BI & BIG (and small) data
 
Implementing business intelligence
Implementing business intelligenceImplementing business intelligence
Implementing business intelligence
 
Laura Madsen Healthcare Business Intelligence & Big Data Analytics
Laura Madsen Healthcare Business Intelligence & Big Data AnalyticsLaura Madsen Healthcare Business Intelligence & Big Data Analytics
Laura Madsen Healthcare Business Intelligence & Big Data Analytics
 
Stunning, multi-device, responsive websites
Stunning, multi-device, responsive websitesStunning, multi-device, responsive websites
Stunning, multi-device, responsive websites
 
BSI Teradata: The Case of the Dropped Mobile Calls
BSI Teradata: The Case of the Dropped Mobile CallsBSI Teradata: The Case of the Dropped Mobile Calls
BSI Teradata: The Case of the Dropped Mobile Calls
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
 
Data Exploration & BI
Data Exploration & BIData Exploration & BI
Data Exploration & BI
 
TEI of IBM Information Management Solutions
TEI of IBM Information Management SolutionsTEI of IBM Information Management Solutions
TEI of IBM Information Management Solutions
 
Self Service Buisness Intelligence - Tech Talk
Self Service Buisness Intelligence - Tech TalkSelf Service Buisness Intelligence - Tech Talk
Self Service Buisness Intelligence - Tech Talk
 
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
 
General Presentation The Selfservice company
General Presentation The Selfservice companyGeneral Presentation The Selfservice company
General Presentation The Selfservice company
 

Similar to Big Data and BI Best Practices

Big data and bi best practices slidedeck
Big data and bi best practices slidedeckBig data and bi best practices slidedeck
Big data and bi best practices slidedeck
Actian Corporation
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You Want
Stuart Miniman
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
Caserta
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
Bob Hardaway
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
Tony Bain
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
Caserta
 
Blueprint for integrating big data analytics and bi
Blueprint for integrating big data analytics and biBlueprint for integrating big data analytics and bi
Blueprint for integrating big data analytics and biDataWorks Summit
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
Doug Denton
 
Big data
Big dataBig data
Big data
nikki135
 
What_BigData_means_to_your_organization
What_BigData_means_to_your_organizationWhat_BigData_means_to_your_organization
What_BigData_means_to_your_organizationAttila Barta
 
Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019
Arcadia Data
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
Mark Kromer
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your business
Acunu
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data mining
Emran Hossain
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Sreedhar Chowdam
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic Architecture
Caserta
 

Similar to Big Data and BI Best Practices (20)

Big data and bi best practices slidedeck
Big data and bi best practices slidedeckBig data and bi best practices slidedeck
Big data and bi best practices slidedeck
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You Want
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
 
Blueprint for integrating big data analytics and bi
Blueprint for integrating big data analytics and biBlueprint for integrating big data analytics and bi
Blueprint for integrating big data analytics and bi
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
 
Big data
Big dataBig data
Big data
 
What_BigData_means_to_your_organization
What_BigData_means_to_your_organizationWhat_BigData_means_to_your_organization
What_BigData_means_to_your_organization
 
Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your business
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data mining
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic Architecture
 

More from Yellowfin

Yellowfin 7.3+ launch presentation slides
Yellowfin 7.3+ launch presentation slidesYellowfin 7.3+ launch presentation slides
Yellowfin 7.3+ launch presentation slides
Yellowfin
 
Yellowfin 7.3 launch presentation slides
Yellowfin 7.3 launch presentation slidesYellowfin 7.3 launch presentation slides
Yellowfin 7.3 launch presentation slides
Yellowfin
 
BI Dashboard Best Practices Webinar 2016 (Slides)
BI Dashboard Best Practices Webinar 2016 (Slides) BI Dashboard Best Practices Webinar 2016 (Slides)
BI Dashboard Best Practices Webinar 2016 (Slides)
Yellowfin
 
Data Visualization Best Practice Webinar presentation slides
Data Visualization Best Practice Webinar presentation slidesData Visualization Best Practice Webinar presentation slides
Data Visualization Best Practice Webinar presentation slides
Yellowfin
 
Making healthcare analytics fast, easy and flexible
Making healthcare analytics fast, easy and flexibleMaking healthcare analytics fast, easy and flexible
Making healthcare analytics fast, easy and flexible
Yellowfin
 
Governed Data Discovery best practices webinar slides
Governed Data Discovery best practices webinar slidesGoverned Data Discovery best practices webinar slides
Governed Data Discovery best practices webinar slidesYellowfin
 
Data-driven Storytelling Best Practices Webinar (presentation slides)
Data-driven Storytelling Best Practices Webinar (presentation slides)Data-driven Storytelling Best Practices Webinar (presentation slides)
Data-driven Storytelling Best Practices Webinar (presentation slides)
Yellowfin
 
Embedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slidesEmbedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slides
Yellowfin
 
Yellowfin 7.1 launch webinar slides
Yellowfin 7.1 launch webinar slidesYellowfin 7.1 launch webinar slides
Yellowfin 7.1 launch webinar slidesYellowfin
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer Stories
Yellowfin
 
Data Sourcing Best Practices for Reporting (Webinar slides)
Data Sourcing Best Practices for Reporting (Webinar slides)Data Sourcing Best Practices for Reporting (Webinar slides)
Data Sourcing Best Practices for Reporting (Webinar slides)
Yellowfin
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Yellowfin
 
Business Intelligence Dashboard best practice webinar (2013)
Business Intelligence Dashboard best practice webinar (2013)Business Intelligence Dashboard best practice webinar (2013)
Business Intelligence Dashboard best practice webinar (2013)Yellowfin
 
Real-world state of the BI market: Webinar presentation slides
Real-world state of the BI market: Webinar presentation slidesReal-world state of the BI market: Webinar presentation slides
Real-world state of the BI market: Webinar presentation slides
Yellowfin
 
Yellowfin 6.3 webinar launch presentation slides
Yellowfin 6.3 webinar launch presentation slidesYellowfin 6.3 webinar launch presentation slides
Yellowfin 6.3 webinar launch presentation slides
Yellowfin
 
SaaS data access & integration best practices for Business Intelligence
SaaS data access & integration best practices for Business IntelligenceSaaS data access & integration best practices for Business Intelligence
SaaS data access & integration best practices for Business Intelligence
Yellowfin
 
Yellowfin BI Dashboard Best Practices
Yellowfin BI Dashboard Best PracticesYellowfin BI Dashboard Best Practices
Yellowfin BI Dashboard Best PracticesYellowfin
 
Yellowfin Location Intelligence Best Practices Webinar
Yellowfin Location Intelligence Best Practices WebinarYellowfin Location Intelligence Best Practices Webinar
Yellowfin Location Intelligence Best Practices WebinarYellowfin
 
Wisdom of crowds business intelligence market study findings overview
Wisdom of crowds business intelligence market study findings overviewWisdom of crowds business intelligence market study findings overview
Wisdom of crowds business intelligence market study findings overview
Yellowfin
 
Yellowfin BI: 6.1 launch slides
Yellowfin BI: 6.1 launch slidesYellowfin BI: 6.1 launch slides
Yellowfin BI: 6.1 launch slides
Yellowfin
 

More from Yellowfin (20)

Yellowfin 7.3+ launch presentation slides
Yellowfin 7.3+ launch presentation slidesYellowfin 7.3+ launch presentation slides
Yellowfin 7.3+ launch presentation slides
 
Yellowfin 7.3 launch presentation slides
Yellowfin 7.3 launch presentation slidesYellowfin 7.3 launch presentation slides
Yellowfin 7.3 launch presentation slides
 
BI Dashboard Best Practices Webinar 2016 (Slides)
BI Dashboard Best Practices Webinar 2016 (Slides) BI Dashboard Best Practices Webinar 2016 (Slides)
BI Dashboard Best Practices Webinar 2016 (Slides)
 
Data Visualization Best Practice Webinar presentation slides
Data Visualization Best Practice Webinar presentation slidesData Visualization Best Practice Webinar presentation slides
Data Visualization Best Practice Webinar presentation slides
 
Making healthcare analytics fast, easy and flexible
Making healthcare analytics fast, easy and flexibleMaking healthcare analytics fast, easy and flexible
Making healthcare analytics fast, easy and flexible
 
Governed Data Discovery best practices webinar slides
Governed Data Discovery best practices webinar slidesGoverned Data Discovery best practices webinar slides
Governed Data Discovery best practices webinar slides
 
Data-driven Storytelling Best Practices Webinar (presentation slides)
Data-driven Storytelling Best Practices Webinar (presentation slides)Data-driven Storytelling Best Practices Webinar (presentation slides)
Data-driven Storytelling Best Practices Webinar (presentation slides)
 
Embedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slidesEmbedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slides
 
Yellowfin 7.1 launch webinar slides
Yellowfin 7.1 launch webinar slidesYellowfin 7.1 launch webinar slides
Yellowfin 7.1 launch webinar slides
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer Stories
 
Data Sourcing Best Practices for Reporting (Webinar slides)
Data Sourcing Best Practices for Reporting (Webinar slides)Data Sourcing Best Practices for Reporting (Webinar slides)
Data Sourcing Best Practices for Reporting (Webinar slides)
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
 
Business Intelligence Dashboard best practice webinar (2013)
Business Intelligence Dashboard best practice webinar (2013)Business Intelligence Dashboard best practice webinar (2013)
Business Intelligence Dashboard best practice webinar (2013)
 
Real-world state of the BI market: Webinar presentation slides
Real-world state of the BI market: Webinar presentation slidesReal-world state of the BI market: Webinar presentation slides
Real-world state of the BI market: Webinar presentation slides
 
Yellowfin 6.3 webinar launch presentation slides
Yellowfin 6.3 webinar launch presentation slidesYellowfin 6.3 webinar launch presentation slides
Yellowfin 6.3 webinar launch presentation slides
 
SaaS data access & integration best practices for Business Intelligence
SaaS data access & integration best practices for Business IntelligenceSaaS data access & integration best practices for Business Intelligence
SaaS data access & integration best practices for Business Intelligence
 
Yellowfin BI Dashboard Best Practices
Yellowfin BI Dashboard Best PracticesYellowfin BI Dashboard Best Practices
Yellowfin BI Dashboard Best Practices
 
Yellowfin Location Intelligence Best Practices Webinar
Yellowfin Location Intelligence Best Practices WebinarYellowfin Location Intelligence Best Practices Webinar
Yellowfin Location Intelligence Best Practices Webinar
 
Wisdom of crowds business intelligence market study findings overview
Wisdom of crowds business intelligence market study findings overviewWisdom of crowds business intelligence market study findings overview
Wisdom of crowds business intelligence market study findings overview
 
Yellowfin BI: 6.1 launch slides
Yellowfin BI: 6.1 launch slidesYellowfin BI: 6.1 launch slides
Yellowfin BI: 6.1 launch slides
 

Big Data and BI Best Practices

  • 1. Big Data and BI Best Practices
  • 2. Your presenters Year | Last Yellowfin CEO, Glen Rabie VP Sales & Services in APAC, Actian Corporation, Jason Leonidas
  • 3. Your presenters Year | Last Yellowfin CEO, Glen Rabie General Manager, Actian Vectorwise, Fred Gallagher
  • 4. About Actian and Yellowfin Making Business Intelligence easy Taking Action on Big Data History of 100GB TPC-H Performance Benchmarks Composite Queries Per Hour (Non-Clustered) 500,000.00 400,000.00 QphH@100GB 300,000.00 200,000.00 100,000.00 - Non-Vectorwise Vectorwise
  • 5. Data is the new oil David McCandles Data Journalist
  • 6. The rise and rise of Big Data
  • 7. There has always been Big Data… Its just that now we can actually capture and mine it effectively. Canadian Tar Fields
  • 8. Not all Big Data is created Equal Planet Google and friends are the outliers Large Telco The Norm .
  • 9. Do you have a Big Data problem?
  • 10. Big Data for Everyone • Big data is not just for data scientists and bespoke projects • Its for decision makers and data consumers • It needs to be anchored in the real world Analyst Consumers
  • 11. Who is benefitting from Big Data?
  • 12. Who is benefitting from Big Data?
  • 13. Why bother with Big Data? of organizations collect 60% more data than they can effectively use (MIT Sloan Management Review)
  • 14. Why bother with Big Data? of organizations see 70% Big Data as a big business opportunity (Harris Interactive) of organizations investing 70% in Big Data initiatives expect ROI within 1 year (Harris Interactive)
  • 15. Why bother with Big Data? of organizations that 84% actively leverage Big Data say they can now make better decisions (Avanade)
  • 16. Best Practices in Big Data Jason Leonidas, Actian Corporation
  • 17. Best Practices in Big Data Fred Gallagher, Actian Corporation
  • 18. What is Big Data?
  • 19. Best Practice #1 Focus on what you want to achieve
  • 20. It’s all about driving value
  • 21. Big Data Levers 1. Personalization 2. Social 3. Search 4. Find opportunities 5. Actionable Insights
  • 22. Best Practice #2 Identify the data you have vs The data you need
  • 23. Does your data match what you want to achieve?
  • 24. What data do you need?
  • 25. Best Practice #3 Use the right Big Data tool for the job
  • 26. Big Data and Hadoop
  • 27. Big Data Eco-system Social Media Analytic Hadoop Databases Storage BIG Search DATA NewSQL “as-a-service” NoSQL Document Operational BigTable Database Key Value Graph
  • 28. Best Practice #4 Use a fast database
  • 29. Slow Query Performance is the #1 issue in BI BI Survey 10: Why BI Projects Fail? 1. Query Performance Too Slow TDWI Best Practices Report “45% Poor Query Response the top problem that will eventually drive users to replace their current data warehouse platform.” Gartner Magic Quadrant Data Warehousing 70% of data warehouses experience performance constrained issues of various types
  • 30. User Expectations Web-Based Business Intelligence Users expect results in less than 10 seconds Mobile BI Users expect results in less than 3 seconds
  • 31. Use a fast database Traditional Database Analytical Database Clustered Database
  • 32. Consider the hidden costs Spend Less on Hardware Get faster results on smaller hardware configurations. Spend Less Time Database Tuning Faster deployment and BI projects. No more aggregates, cubes, complex schemas, etc
  • 33. Best Practice #5 Plan for a mixed architecture
  • 34. Hadoop and BI architecture Hadoop Transactional Fast Database BI Tool External
  • 35. Best Practice #6 Ensure mass distribution of your data
  • 36. Big Data for Everyone Visualizations Alerts Access Anywhere
  • 37. Best Practice #7 Tailor data delivery to each audience
  • 38. Give your audience what they want Demographics Interactive Reports Statistics KPIs Maps Collaboration
  • 39. Visualization is powerful Looks like Pac-man Does not look like Pac-man 169 41 Looks like Pac-man Does not look like Pac-man
  • 40. Big Data Visualization Tips • More data requires more focus • Interactivity is essential • Select the right metrics • Provide context • Support and prompt action
  • 42. Big Data and BI Best Practices 1. Focus on what you want to achieve 2. Identify the data you have vs The data you need 3. Use the right Big Data tool for the job 4. Use a fast database 5. Plan for a mixed architecture 6. Ensure mass distribution of your data 7. Tailor data delivery to each audience
  • 44. | Last Year More Information Yellowfin www.yellowfinbi.com Vectorwise www.actian.com/products/vectorwise @YellowfinBI @ActianCorp Feedback & Questions pr@yellowfin.bi Yellowfin LinkedIn User Group Vectorwise LinkedIn User Group

Editor's Notes

  1. Point of slide – introduce presenters Glen introduces himself Glen introduces Jason
  2. Point of slide – introduce presenters Glen introduces himself Glen introduces Fred
  3. Point of slide – Introduce our companies and why we can talk about this topic (some attendees will not have heard of us)A little bit about our 2 companiesYellowfin – mention awards#1 in BI vendor in global Wisdom of Crowds survey#1 Mobile BI by Dresner Advisory Service#1 location Intelligence by Ventana Research Actian – mention record-breaking benchmarks Broken performance and price/performance TPC-H benchmarks by the largest margin’s ever recorded for every benchmark they have entered. And today we are going to talk about the best practices for Big Data and BI
  4. Why Big Data? Data is the new oil. A big opportunity.
  5. Point of slide – Establish how quickly data is growing. If you don’t have big data now you might soon.Data only grows. And Big Data is growing exponentially.Why? Growth of existing data sources, with sophistocation of computer tracking of shipments, sales, suppliers, and customers, as well as e-mail, and web traffic. Growth of new data sources and types such as geospatial, social media comments, mobile, etc
  6. Point of slide – Communicate Big Data didn’t suddenly appear, but now technology exists to leverage it. We’ve always had big data, but now we have the tools and the cost has come down enough to harvest and make value from it. Why is Big Data a Big Deal Now…
  7. Point of slide – Don’t confuse your Big Data problem with Googles. They are not the same.But not all Big Data is created Equal. And Big Data is relative. Google,Facebook, Twitter –are outliers that are in a class of their own. And their requirements are significantly different to large enterprise businesses, let alone the normal enterprise business and SME. And you don’t need to have petabytes of data to have a Big Data problem.
  8. Point of slide – Define Big Data, and what to look for to see if you have a Big Data problem.The 3 V’s fromGartners 3 is probably the most accepted definition of Big Data because it addresses the pain points … Volume – people think terabytes or petabytes Variety – structured and unstructured data such as… Velocity – includes fast query time, and also streaming data. And for BI, this is by far the most important which we will focus a lot on today.And these are important points because you can suffer from Big Data problems without having much data at all as it’s all relative to your hardware and the tools you are using.So if you have any of these pain points, your data is too big – hence Big Data.
  9. Point of slide – Framing slide, we are talking about Big Data for consumers, not analysts.This webinar is about Big Data and BI – and therefore the focus is on assisting decision makers. Too much of the Big Data discussion focuses on data scientists with bespoke projects (hypothesis, hadoop, partitioning, etc). Today we want to focus on data consumers using this in the real world. More data consumers than there are analysts – how can we empower the masses to add value from Big Data. Big Data for everyone
  10. Point of slide – Where big data can add value. It’s mostly marketing. What is the opportunity?Over 45% of big data deployments are spent on marketing, with spending on digital marketing set to grow form $34B to $76B by 2016This slide is about use cases
  11. Point of slide – Show what industries are using Big Data, and how easy it is for these industries to do it.
  12. Point of slide – 60% are already collecting more data than they can effectively use.
  13. Point of slide Big data is an opportunity, not a burden. And 70% of businesses see it that way. And 70% also expect an ROI within 1 year of investing in Big Data initiatives – hmmm, is that a bit optimistic if it takes years to build a data warehouse?
  14. Point of slide – And most importantly, 84% of organizations using Big Data today say they can now make better decisions – which is what it is all about.
  15. So what is big data? Why the hype?This tongue in cheek sketch that highlights the point that there is hype around big data.Roman Stanek, founder and CEO of Good Data – “Today, the difference between success and failure is the ability to monetize a new class of data. It’s ironic that, despite billions of dollars spent on business intelligence systems, we are still data-bankrupt.What that tells us is current skills and technologies is unable to deliver on the business opportunities that can be realized by Big Data. With such high stakes, its no wonder there is hype.
  16. The success of any Big Data project hinges on delivering greater business value.Many focus on the monetization of Big data which means driving greater revenue or creating new revenue opportunitiesBut, depending on the industry sector it also can deliver operational efficiencies and increased services levels and customer satisfaction.The potential trap for new entrants into the Big Data arena is the temptation to develop a Big Data infrastructure for all possibilities or contingencies. As we heard from Glen earlier, the ROI window is 12 months.We must maintain a strong focus on delivering against the specific business objectives and not let the technology drive the direction of the project.
  17. To realize the potential of big data, what are the levers available you? Which will you use?Personalization – Offering a better, more targeted serviceSocial – Allowing users to communicate and share with other community membersSearch – Making it easier for customers to find what they are looking for (save time improving customer sat)Finding opportunities – How do exploit the data and drive opportunities in the business. By understanding what customers, competitors, and the market are doing we can find new opportunities to exploit. Actionable insights – As Glenn stated earlier, - making it easy for your customers, suppliers, and staff to make better decisions – traditional BI. This is the lever we are focusing on today.Badoo – Collect 10million Records each Day. They had now way of quickly identifying which cThey now run std queries in 10 to 30sec wichhalps them determine which marketing campains are converting customers.
  18. Does the data you have match what you want to achieve? There is typically a huge gap between what we have and what we need. What additional data is required outside the normal corporate data? External feeds can make a critical difference to monetizing big DataThen there are governance issues to consider. Who owns the data Vs Who needs the data ? Are there Security & Privacy issues at play?Then there are the Physical considerations - are there documents, email images, or video, which might take a lot of space, but isn’t traditionally used for analysis. What proportion is Structured Vs Unstructured?How much of your data is just indexing to improve performance?Again, the focus must be on collecting the data you need to answer the specific business questions you have?
  19. Every industry has very specific use cases that drive Big Data Success.In areas such as…Transportation & Logistics that are detecting Fraudbefore it happens– (Timocom)Driving Sales by incorporating Environmental Data such as weather with PoS data (Sheets)Web Traffic Monitoring to determine customer behavior– GSI Commerce When you know your goals and fully understand your data requirements then you know what data you need to collect.It is then you can make a decision on what infrastructure you need.
  20. Again some more humor to get the message across, Hadoop is one of the most well know Big Data solutions. And many of you in the audience today will be using it or considering it for future projects.Hadoop makes a fantastic data store for web traffic and machine data because of it’s unmatched scalability, speed and fault tolerance. However it isn’t always the best for business intelligence were the majority of uses cases are SQL or relational database type applications. So when considering deploying BI for the masses you don’t want to ask them to learn a new skill-set or have deep technical know how.One important lesson we have learned is that creating reports from Hadoop was quite time consuming, and then the query performance was actually quite slow.Today most BI tools do connect to Hadoop (through HIVE) So the key take-away from this Best practice is that the Big Data ecosystem is much bigger than just Hadoop.
  21. So what does this eco system look like?Its a huge ecosystem,with many varied solutions that don’t necessarily address all of the 3V’s – Volume, Variety and Velocity.And obviously it’s impossible to accommodate for everything in a single product.With today’s webinar being focused on Big Data for BI and analytics we will focus on that analytical database space. Its built specifically for addressing Business Intelligences and tackles the velocity (speed) issue better than the any of the others.Hadoop makes a fantastic Big Data store, and there are many other Big Data solutions outside of Hadoop in the NoSQL and NewSQL area which solve different pain points, but again are not best practice for BI.Actian has a many customers who started with Hadoop and have incorporated Vectorwise because of its speed – designed if you like for the 3 V’s.
  22. Performance is the number 1 issue in BI today. And with rapidly growing data its only worsening.There is a lot of evidence to support this. BI Survey – say every year slow query performance is the number 1 reason by BI projects fail. TDWI Best Practices Report – Almost half (45%), said that poor query response was the top problem that will drive them to replace their current data warehouse. And Gartner – say 70% of data warehouses experience performance constrained issues
  23. Just why is it so important to have fast performance? We are the Google generation blessed with instant answers and we have become impatient.Today User Expectations are very demanding. Studies show that BI Project value & adoption drops off dramatically when queries take longer than 10 seconds to run. And on mobile devices that drops to just only 3 seconds. The bigger your data, the slower your reports will run – a huge concern.
  24. The solution to this is to use the right tool for the job - It can make a dramatic difference.1. Traditional databases can perform a wide variety of different workloads, but they were never designed for the challenges and complexities of Big Data – Particularity the job of slicing and dicing data. With the amount of additional hardware and BI tuning you require to get better performance, you’d much better served getting a fast, purpose built database.2. Analytical databases ARE purpose built for slicing and dicing data. They are quick, agile, and easy to get started with. 3. Clustered databases are an option as data volumes grow but they aren’t as agile, require substantially more resources and expertise to manage and implement.
  25. What are the other benefits you gain from using a fast database?1. Slash the cost of the hardware – In many recent tests and Proof of Concepts Vectorwise consistently outperforms other databases on very small servers compared with much larger racks of servers. The hardware you see on this slide is from the 1 TB TPCH benchmark – Oracle used the large Server and VW used the small 2U =Dell Server.2. Dramatically less maintenance – Take out the cost and burden of having teams of DBAs to tune the database3. Time – Deliver usable BI in much less time without the need for deep technical know how.
  26. Planning for a mixed architecture will allow you to bring in the variety of data sources but still deliver on user exceptions of fast BI.And this is an example of how Hadoop might be a part of it. You will certainly need to include your operational data and data from external feeds - all critical components in the big data recipe.However, without the underlying database performance to support the BI tool, even the most brilliant tool will struggle to deliver satisfactory end user adoption.IsCool Entertainment use Hadoop and Vectorwise. They are a European leader in social gaming on Facebook (number 1 in France) with 1.2 million active monthly users. Their gaming platform is built on Hadoop, and they use Vectorwise to analyze user experience. Below is the press release with the quote.“We’re using Vectorwise to investigate consumer behavior to better understand what makes our users play, interact and recommend. Fast and actionable business analytics from Vectorwise will allow us to deliver tailored offers to customers and advertising partners, and thus improve monetization of the games we develop.” – FlorianDouetteau, CTO of IsCool Entertainment. Badoo – global dating site with 150 million members. Use Hadoop for the web application and Vectorwise for the analysis. Before Vectorwise they hard-coded a custom-built analytics solution that was limited in functionality and unable to provide the level of detail their marketing and finance teams needed. “Vectorwise gives us unfettered access to our data and the ability to run ad hoc analyses without the need to have thought of the question before we asked it. This means we can now ask anything of our data and our users’ activity and get answers in just seconds.” – Ian Broadhead, BI lead at BadooNK – Socal media site that has more users in Poland than Facebook has there. 14 million active monthly users. Use Hadoop for click-stream data, such as POST and GET requests, and AdServer logs, and ad hoc queries took sometimes days to design, build and execute. Use Vectorwise for 50-90 of their largest daily queries such as banner optimization (advertising based on user/friend preference) and gaming usage (moving around the buttons/colors, etc to see how changes users). “We looked to solutions from other vendors with analytic databases, but selected Vectorwise for its superior performance and cost-effective model.”
  27. Now you have the data you need, you need to get it into the hands of the people who really need it Big Data is a big investment, and there is no point giving only a few people in your organization access to the data. The more you share data, the more value you get from it (it doesn’t lose its value) It needs to be fast, agile, drag and drop, etc – require very little training.And then finally you can get to … next slide (the magic art of making sure your data tells a story).
  28. If we are going to ensure mass distribution, then we need delivery tailored for each audiences needs. farmer see map of farm (agriculture), marketing see market segmentation, transportation, etcSo when thinking about Visualization then we need it to make sense for them. Multiple use casesData visualization is critically for people to consume it. Nobody sends people on a course to understand a graph. Build for level of skill of audience PacMan – Visualization best practice points (last webinar)Powerful visualization is the best way to express data – the more data you have, the more focus you need.Yellowfin to list best practice for visualizing huge amounts of data.Storytelling
  29. Following slides make points of… If we are going to ensure mass distribution, then we need delivery tailored for each audiences needs. Marketers are going to want to see segmentation of demographics Managers are going to want to interactively drill down into their reports Data scientists are going to want to do statistical analysis Executives are going to want to keep a close eye on KPIs Demographers, people in agriculture are going to want to see things in maps So when thinking about visualizations we need to always keep the audience in mind. Data visualization is critically for people to consume it. Nobody sends people on a course to understand a graph. So build for level of skill of audience
  30. But when you visualize it, you can get your point across much better.Should re-do this in Yellowfin.
  31. More data requires more focusLink to clearly defined business objectivesOnly include actionable informationInteractivity is essentialStart big, drill to detailMore data doesn’t mean more reports and visualizations, it means deeper insightSelect the right metricsIt’s not enough just to decide on what aspects of your business Big Data analytics allows you to monitor. You need to decide how you’re going to track and measure those chosen aspects, and communicate them to end-users via an agreed form of measurement.Provide contextWithout additional contextual information to help users understand data visualizations, it’s impossible for a user to understand the true meaning of the results presented, what action it requires, or whether it demands any action at all. Effectively highlight the most important information:Draw the users attention to the most pertinent pieces of information firstThe most important data should occupy the most screen real estateSelect the best, not the best looking, visualization.: The data; not the visualizations, should always be made the center of attention. Never use flashy visuals and chart types when simple alternatives are capable of conveying the same message – does the third dimension on that pie chart really add to its meaning?Avoid all design aspects that are unconnected to the task of analytic communication."Perfection is achieved, not when there is nothing left to add, but when there is nothing left to remove” -- Antoine de Saint-ExuperyUse colour appropriately and sparingly to achieve maximum impact and contrastIf all colors chosen to represent different metrics or values within a chart are eye-catching, no single point will standout above the othersSelect colours based on a clear understanding of their inherent or commonly accepted symbolic or metaphoric meaning (red = bad, etc)Be consistent. For example, if data relating to second quarter sales is displayed in purple in one chart, all other charts that display data relating to second quarter sales result should also be displayed in purpleAvoid visual clutter Avoid visually gratuitous chart typesSelect the right visualization for the data and the contextSelecting the most context appropriate visualization for a particularly metric or measure requires the judicious application of a little common sense. For example, if you’re attempting to monitor or track the change in something over time, a line graph will almost always work best. Likewise, if tracking several metrics of similar proportions – a potential example might be new leads generated for the current year by marketing category (Google Ads, LinkedIn, print media, banner advertising, etc) – using a column chart or bar graph would be an effective way to visualize the minor differences in performance between each marketing channel. Conversely, a pie chart would deliver a poor user experience as, at first glance, all the portions would seem equal. Layered maps are criticalDisplays large volumes of data efficiently and helps explain the relationships between different types of dataConsider the unique informational requirements of each defined user groupWhat information are they already aware of? What information would enable them to make more efficient and effective decisions? Support and prompt actionUsers must be enabled with a range of options to share the new information and their associated thoughts with others, in order to drive appropriate resultant action. Such information collaboration and decision-making options should include, but are certainly not limited to, the ability to:Email the relevant report to pertinent and affected stakeholdersAdd contextual knowledge to the reports in question via annotations and comments (discussion threads) and have relevant users with access to those reports notifiedAdd decision-widgets to discussion threads to facilitate voting and polling to enable fast and effective collective decision-makingEmbed fully interactive dashboards and reports externally to the BI tool, on any third-party Web-based platform, to allow external stakeholders to understand and act on the emergent issue
  32. Glen does the demo…Drive point home – lets assume we have built a dashboard for a user – dashboard. Real value is that I can browse, un-aggregated. If it was to be traditional, then (show comparison).
  33. Summary of what we learned today