UK Hadoop User Group Meeting <br />Davy Nys,  RVP of Enterprise Sales EMEA<br />October, 2010<br />© 2010, Pentaho. All Ri...
About Pentaho<br />Recognized leader in open source BI & Data Integration<br />Average one download every 30 seconds<br />...
Driven by Customer and Market Need<br />Pentaho has been an industry pioneer and innovator since its founding in 2004. As ...
Why Pentaho BI for Hadoop?<br />Pentaho offers full BI Suite<br />Data to dashboards (ETL, OLAP, reporting, dashboards, mi...
Pentaho for Hadoop Download Capability<br />Includes support for development, production support will follow with GA<br />...
Pentaho for Hadoop Announcements (cont)<br />Pentaho and Cloudera Partnership<br />Combines Pentaho ‘s business intelligen...
Hadoop and BI?<br />90% of new Hadoop use cases <br />are transformation of <br />semi/structured data*<br />* of those co...
Big Data<br />Terabytes and petabytes of data<br />Sometimes per day<br />US and Worldwide: +1 (866) 660-7555  |  Slide  <...
?<br />?<br />?<br />?<br />?<br />?<br />?<br />Traditional BI<br />Data Mart(s)<br />Tape/Trash<br />Data<br />Source<br...
Data Lake<br /><ul><li>Single source
Large volume
Not distilled</li></ul>US and Worldwide: +1 (866) 660-7555  |  Slide  <br />© 2010, Pentaho. All Rights Reserved. www.pent...
Data Lakes<br /><ul><li>0-2 lakes per company
Known and unknown questions
Multiple user communities
$1-10k questions, not $1m ones
Don’t fit in traditional RDBMS with a reasonable cost</li></ul>US and Worldwide: +1 (866) 660-7555  |  Slide  <br />© 2010...
Data Lake Requirements<br /><ul><li>Store all the data
Satisfy routine reporting and analysis
Satisfy ad-hoc query / analysis / reporting
Balance performance and cost</li></ul>US and Worldwide: +1 (866) 660-7555  |  Slide  <br />© 2010, Pentaho. All Rights Res...
Tape/Trash<br />Ad-Hoc<br />Data Lake(s)<br />Data Warehouse<br />What if...<br />Data Mart(s)<br />Data<br />Source<br />...
Pentaho BI Suite for Hadoop<br />Data Marts, Data Warehouse, <br />Analytical Applications<br />Design<br />Deploy<br />Or...
Big Data Does Not Replace Data Marts<br /><ul><li>It’s not a database
Upcoming SlideShare
Loading in...5
×

Hadoop uk user group meeting final

3,157

Published on

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
3,157
On Slideshare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
0
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide
  • Pentaho provides a complete, enterprise BI suite from ETL and data integration, through OLAP as well as reporting, dashboards and ad hoc analysis. Our Enterprise Edition BI Suite is modular, enabling users to use the entire set of functionality or to start anywhere that may be a priority such as building and deploying a data warehouse or providing management dashboards. And because Pentaho’s BI Suite is modular, users can easily deploy additional functionality as their needs grow or change. Individually, Pentaho’s BI and data integration applications , Pentaho Data Integration, Pentaho Analysis, Pentaho Reporting, Pentaho Dashboards and Pentaho Analyzer are purpose built and best of breed, providing users with world class BI and data integration functionality to meet the needs of customers ranging from innovative new companies to Fortune 1000.At the most basic level, Pentaho helps you to turn your data, stored throughout your organization. into actionable business intelligence. This functionality can be divided into three core areas: Accessing data, Optimizing and analyzing data and then visualizing information via reports or dashboards.In terms of accessing data, we integrate with both structured data, such as data stored in a relational database or coming from a core business applications such as CRM or ERP , as well as unstructured complex data via our integration with Apache Hadoop. We offer a graphical interface that allows you to quickly connect and transform data sources simply by dragging and dropping them into the Pentaho development environment.Optimizing data means you can slice and dice data to find meaningful trends, uncover root causes or other business-relevant information. It allows you to “have a conversation with the data”, interactively exploring data as you see fit. Pentaho also provides data mining capabilities to discover hidden patterns in the data for purposes of identifying indicators for predicting future performance.Visualization consists of reports and dashboards. Reporting is often where organizations start with business intelligence, trying to get business information out of existing systems to make it available to business users in an attractive, easy-to-consume format. Pentaho reporting provides both operational reporting such as for invoices or bills of lading, as well as historical and analytical reports.Dashboards have become a very popular BI capability because it lets end users easily see their key performance indicators and business metrics in a very easy-to-consume format. Rather than combing through large volumes of reports, users can immediately see what metrics are on track and which ones require immediate attention.The underlying BI server integrates all of these end-user capabilities, providing developers a single view of data across the entire suite. No other BI vendor offers the unique combination of a comprehensive BI suite with the breadth of Pentaho combined with a single, intuitive development interface that greatly simplifies the creation of new BI applications.To meet the range of user needs, Pentaho can be deployed either as an on-premise or on-demand application. In either deployment scenario, you have the same exact product set and functionality. So it is very easy to migrate in either direction. So if you decide to deploy an initial project in the cloud via the on-demand offering in order to deliver business value more quickly, you can then move it back on-premise at a later date very easily.
  • Hadoop uk user group meeting final

    1. 1. UK Hadoop User Group Meeting <br />Davy Nys, RVP of Enterprise Sales EMEA<br />October, 2010<br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    2. 2. About Pentaho<br />Recognized leader in open source BI & Data Integration<br />Average one download every 30 seconds<br />Over 8,000 active production deployments<br />Over 1,200 customers in 65 countries<br />Saved customers >$2 billion in cumulative licenses and maintenance costs<br />Backed by Benchmark Capital, Index Ventures and NEA<br />
    3. 3. Driven by Customer and Market Need<br />Pentaho has been an industry pioneer and innovator since its founding in 2004. As an OSBI company since its start, Pentaho continues to be driven by customer and market need.<br />2004 - Founded<br />2005 - First open source BI Platform<br />2006 - First to offer live integration with Google Maps<br />2008 - First BI company to integrate with the iPhone<br />2009 - Announced groundbreaking Agile BI Initiative to address the market need of brining BI closer to business users. <br />Customers approached Pentaho with big data problems<br />2010 - First to offer ad hoc analytics to iPad<br />2010 - First to announce and deliver code to support Hadoop and big data analytics<br />
    4. 4. Why Pentaho BI for Hadoop?<br />Pentaho offers full BI Suite<br />Data to dashboards (ETL, OLAP, reporting, dashboards, mining)<br />Pentaho lowers on-ramp for Hadoop users<br />Lowers complexity and learning curve for Big Data analytics<br />Enables users to combine structured and unstructured data<br />Few Hadoop applications available, critical need<br />Rapidly integrate Hadoop into existing data architectures by easily moving data between Hadoop and databases, data warehouses and other enterprise data stores;<br />Agile BI and modern platform, deployed on-premise or on-demand<br />Pentaho brings scalability, clustering and deployment options <br />100% Java<br />Commitment to open source<br />COSS frees up $$ for more servers, CPUs<br />
    5. 5. Pentaho for Hadoop Download Capability<br />Includes support for development, production support will follow with GA<br />Collaborative effort between Pentaho and the Pentaho Community<br />60+ beta sites over three month beta cycle<br /> Pentaho contributed code for API integration with HIVE to the open source Apache Foundation<br />Pentaho and Amazon Web Services Partnership<br />Combines Pentaho Data Integration for Hadoop with Amazon’s Elastic Map Reduce (EMR) to facilitate easy integration with Hadoop data stored in EC2<br />Enables hybrid data model between EMR, databases, data warehouses and other on-premise data stores<br />Pentaho’s Amazon EC2 offering includes tightly integrated report design for building production or ad hoc reports from data spanning cloud and on-premise data sources (available November, 2010)<br />Pentaho for Hadoop Announcements<br />
    6. 6. Pentaho for Hadoop Announcements (cont)<br />Pentaho and Cloudera Partnership<br />Combines Pentaho ‘s business intelligence and data integration capabilities with Cloudera’s Distribution for Hadoop (CDH)<br />Enables business users to take advantage of Hadoop with ability to easily and cost-effectively mine, visualize and analyze their Hadoop data<br />Pentaho and Impetus Technologies Partnership<br />Incorporates Pentaho Agile BI and Pentaho BI Suite for Hadoop into Impetus Large Data Analytics practice<br />First major SI to adopt Pentaho for Hadoop<br />Facilitates large data analytics projects including expert consulting services, best practices support in Hadoop implementations and nCluster including deployment on private and public clouds<br />
    7. 7. Hadoop and BI?<br />90% of new Hadoop use cases <br />are transformation of <br />semi/structured data*<br />* of those companies we’ve talked to...<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    8. 8. Big Data<br />Terabytes and petabytes of data<br />Sometimes per day<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    9. 9. ?<br />?<br />?<br />?<br />?<br />?<br />?<br />Traditional BI<br />Data Mart(s)<br />Tape/Trash<br />Data<br />Source<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    10. 10. Data Lake<br /><ul><li>Single source
    11. 11. Large volume
    12. 12. Not distilled</li></ul>US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    13. 13. Data Lakes<br /><ul><li>0-2 lakes per company
    14. 14. Known and unknown questions
    15. 15. Multiple user communities
    16. 16. $1-10k questions, not $1m ones
    17. 17. Don’t fit in traditional RDBMS with a reasonable cost</li></ul>US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    18. 18. Data Lake Requirements<br /><ul><li>Store all the data
    19. 19. Satisfy routine reporting and analysis
    20. 20. Satisfy ad-hoc query / analysis / reporting
    21. 21. Balance performance and cost</li></ul>US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    22. 22. Tape/Trash<br />Ad-Hoc<br />Data Lake(s)<br />Data Warehouse<br />What if...<br />Data Mart(s)<br />Data<br />Source<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    23. 23. Pentaho BI Suite for Hadoop<br />Data Marts, Data Warehouse, <br />Analytical Applications<br />Design<br />Deploy<br />Orchestrate<br />Pentaho Data Integration<br />Hadoop<br />Pentaho Data Integration<br />Pentaho Data Integration<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    24. 24. Big Data Does Not Replace Data Marts<br /><ul><li>It’s not a database
    25. 25. High latency
    26. 26. Optimized for massive data-crunching
    27. 27. Databases are immature
    28. 28. Databases are no-SQL</li></ul>US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    29. 29. Reporting / Dashboards / Analysis<br />Web Tier<br />DM & DW<br />RDBMS<br />Metadata<br />Hive<br />Hadoop<br />Files / HDFS<br />Applications & Systems<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    30. 30. Data Lake(s)<br />Data Mart(s)<br />Data Warehouse<br />Ad-Hoc<br />Data<br />Source<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    31. 31. Data <br />Lake<br />Reporting / Dashboards / Analysis<br />Web Tier<br />RDBMS<br />Hadoop<br />Applications & Systems<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    32. 32. Visualize<br />Reporting / Dashboards / Analysis<br />Web Tier<br />DM & DW<br />RDBMS<br />Optimize<br />Hive<br />Hadoop<br />Files / HDFS<br />Access<br />Applications & Systems<br />US and Worldwide: +1 (866) 660-7555 | Slide <br />© 2010, Pentaho. All Rights Reserved. www.pentaho.com. <br />
    33. 33. Pentaho Turns Data into Information<br />
    34. 34. Pentaho BI Suite 3.7<br />Data Integration 4.1<br />Hadoop integration<br />Simple file management for HDFS<br />Input data from and output to HDFS<br />Use PDI Jobs to coordinate Hadoop job execution<br />Transformations as MapReduce jobs in Hadoop<br />Integration with Amazon Elastic MapReduce<br />User Console Improvements<br />Thin client Agile BI Wizard<br />Upload and stage data<br />Simple generation of reporting/OLAP metadata<br />Immediate access to self-service BI<br />Analyzer/Mondrian <br />Drill through to underlying details<br />Conditional formatting (traffic lighting)<br />Localization Support<br />iPad integration<br />
    35. 35. Benefits for Users<br />Pentaho application tools far easier than native Hadoop<br />Enables combined hybrid model of structured and unstructured data <br />Faster Time-to-Value<br />Widens the potential user base of Hadoop<br />Commercial Open Source Software (COSS) economics<br />Pentaho’s data integration, reporting and analytical capabilities enable Hadoop developers and business analysts to quickly and easily create BI applications without coding<br />Pentaho Data Integration (PDI) is a natural fit for Hadoop given its rich design tools, scalable architecture, open source distribution and adoption at a large number of Hadoop sites<br />
    36. 36. Pentaho BI Suite Resources & Events<br />Resources<br />Pentaho BI Suite landing page: www.pentaho.com/hadoop<br />Upcoming resources<br />Agile BI White Paper by Joshua Greenbaum. In-depth look at why Agile BI is important and how it is changing the BI industry. <br />Technical Agile BI White paper from Pentaho CTO, James Dixon<br />Events<br />Agile BI Tour: Data to Dashboards in Minutes<br />October 13, Oslo, NO<br />October 15, Barcelona, ES<br />October 19, Seattle, WA<br />October 20, Portland, OR<br />October 21, San Mateo, CA<br />October 22, Kontich, BE<br />October 27, Houston, TX<br />October 27, Florence, IT<br />
    37. 37. Questions and Answers<br />Davy Nys <br />dnys@pentaho.com or +32 498 160 363<br />Join the conversation. You can find us on:<br />http://blog.pentaho.com<br />@Pentaho<br />Pentaho Facebook Group<br />Pentaho - Open Source Business Intelligence Group<br />

    ×