Webinar: Open Source Business Intelligence Intro


This presentation supported the webinar delivered by Stefano Scamuzzo, SpagoBI International Manager, on 22 December 2010 at the SpagoWorld Webinar Center. http://www.spagoworld.org/


  1. Open Source Business Intelligence Intro. Stefano Scamuzzo, SpagoBI International Manager, Senior Technical Manager. SpagoWorld. Architecture & Consulting, Research & Innovation Division, Engineering Group.
  2. The Open Source Question. In many cases, the question is "when" to focus on open-source alternatives to traditional closed-source solutions, not "if" you should focus on them. (Gartner, Hype Cycle for Open-Source Software, 2005)
  3. The discovery of OSBI
  4. Reasons to adopt OSBI. According to Gartner analysis (2008): reducing costs; embedding BI functionalities into existing applications; complementing the current BI infrastructure to extend BI usage to more users. We should add to Gartner's arguments: flexibility, innovation, better reactivity.
  5. … and now we are not alone … (figure; labels: 2010, 2009, 2008). Source: North Bridge, “2010 Future of Open Source Survey Results”, futureofopensource.drupalgardens.com
  6. And more …
  7. The current situation: OSBI gaining attention
  8. The typical Business Intelligence layers: Data Warehouse platforms; Extract, Transform, Load (ETL) solutions; Business Intelligence platforms (analytical tools, document lifecycle management, security, integration).
  9. DWH layer: Data Warehouse products
  10. Data Warehousing. Data Warehouse: a reference database structured for analysis; non-transactional; contents harmonized and comprehensive; partitioning, bitmap indexes, materialized views, SMP support. DWH vendors: Teradata is the first DWH pure player, followed by DW appliance vendors (Microsoft DATAllegro, IBM Netezza and EMC Greenplum); every DBMS vendor supports DWH (Oracle, SAP/Sybase, IBM, Microsoft); specialized vendors: ParAccel, Kognitio. DW techniques are portable to any DBMS platform.
  11. Data Warehousing (figure). Source: Gartner (January 2010)
  12. Open Source Data Warehousing. Three leading open source DBMS players: Ingres, MySQL, PostgreSQL. Ingres is possibly the most enterprise-worthy. MySQL is popular but had limited DW capabilities before version 5.1; its strong point is the multi-engine architecture (look at MyISAM and InfoBright). PostgreSQL is a robust enterprise platform; EnterpriseDB is the commercialization of PostgreSQL.
  13. DWH Recommendations. Technological evolution: MPP; column stores (InfoBright, Ingres VectorWise); distributed data warehouses; search-reliant data warehouses; data stream management (Truviso); appliances. Open source option: Ingres Icebreaker or InfoBright vs Netezza or DATAllegro; adopt MySQL but evaluate performance and scalability, considering enhancements such as InfoBright; enterprises should consider supported RDBMSs such as Ingres and EnterpriseDB; consider MonetDB.
  14. BI layer: Business Intelligence tools and platforms
  15. Business Intelligence. More than just software: integration with operational systems; embedding analytics in business applications; collaboration. BI tools: reporting, dashboards, ad-hoc query; OLAP analysis; advanced analytics (data mining, statistics, geospatial analytics); application integration.
  16. Business Intelligence Scene. Many BI vendors. Dominators: SAP BusinessObjects, IBM Cognos, Oracle Hyperion, Microsoft. Pure players: MicroStrategy, SAS, SPSS. Visualization specialists: Actuate, TIBCO Spotfire, Tableau, QlikView.
  17. OS BI Analytical Tools. Reporting: JasperReports; Eclipse BIRT from Actuate; Pentaho Report Designer. OLAP: Mondrian, a relational OLAP (ROLAP) server, plus the JPivot tag library; Palo, a multidimensional OLAP (MOLAP) server.
  18. Reporting
  19. BIRT Report Engine. Eclipse project including a graph generator, a report generator and a design environment (Eclipse based). Managed by Actuate, which commercializes a BI offering whose only open source solution is BIRT. A library for generating reports in different formats. A report can mix data, graphics and images. Can be integrated into any Java application.
  20. BIRT Report Engine
  21. BIRT Report Engine. Essentially oriented to developers: requests must be written in SQL. It is possible to make BIRT accessible to less technical users. It is possible to create resource libraries containing the basic elements needed to produce a report. Strengths: the Eclipse community, the ease of use.
  22. JasperReports. Report engine developed by JasperSoft and distributed as open source. Reports are described as XML files that can be built manually or using ad-hoc tools (e.g. iReport). Generates reports in different formats: HTML, PDF, XML, CSV. The layout of the report is composed of layers: title, page header, column headings, details, column footers, page footer, last page, summary page. It is possible to use subreports.
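The layered layout listed on slide 22 can be pictured as a skeleton report definition. The sketch below builds one with Python's standard library, assuming the section element names used in JRXML files (`title`, `pageHeader`, `columnHeader`, `detail`, `columnFooter`, `pageFooter`, `lastPageFooter`, `summary`). It is illustrative only: the report name and band height are made up, namespaces and required attributes are omitted, and the result is not a complete report that JasperReports could load.

```python
import xml.etree.ElementTree as ET

# Layers named on the slide, mapped to the assumed JRXML section names.
SECTIONS = [
    ("title", "Title"),
    ("pageHeader", "Page header"),
    ("columnHeader", "Column headings"),
    ("detail", "Details"),
    ("columnFooter", "Column footers"),
    ("pageFooter", "Page footer"),
    ("lastPageFooter", "Last page"),
    ("summary", "Summary page"),
]


def build_skeleton(report_name):
    """Return a bare report tree with one empty band per layout section."""
    root = ET.Element("jasperReport", {"name": report_name})
    for tag, label in SECTIONS:
        section = ET.SubElement(root, tag)
        section.append(ET.Comment(label))  # human-readable layer name
        ET.SubElement(section, "band", {"height": "20"})  # hypothetical height
    return root


def section_names(root):
    """List the section tags in document order."""
    return [child.tag for child in root]


skeleton = build_skeleton("sales_report")  # hypothetical report name
xml_text = ET.tostring(skeleton, encoding="unicode")
```

Printing `xml_text` shows the sections in the order a designer such as iReport would present them; real report files add fields, expressions and styling inside each band.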
  23. iReport. Tool to design Jasper reports. Oriented to report developers. Less intuitive than BIRT.
  24. Pentaho Report Designer. Formerly known as JFreeReport. Joined Pentaho in 2006. It allows reports to be deployed directly to the Pentaho platform. It supports different formats: PDF, HTML, CSV. Reports are developed in layers, as in JasperReports. Wizards are available.
  25. Pentaho Report Designer
  26. OLAP: multidimensional analysis
  27. Mondrian. OLAP server. It belongs to the ROLAP (Relational OLAP) category, since it accesses a relational database. Mondrian executes requests written in the MDX language. Mondrian can be used together with its client, JPivot. It also exposes an XMLA interface, allowing access by other clients (e.g. JPalo). The Mondrian project has joined Pentaho and been renamed Pentaho Analysis.
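The ROLAP idea on slide 27 can be illustrated without Mondrian itself: a multidimensional request is answered by aggregating rows of a relational star schema with SQL. The sketch below uses Python's built-in `sqlite3` module. The table and column names (`sales_fact`, `time_dim`, `product_dim`) and the data are hypothetical, and the SQL is written by hand rather than generated from an MDX query, which is the part a real ROLAP server would take care of.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE time_dim (time_id INTEGER PRIMARY KEY, year INTEGER, quarter TEXT);
    CREATE TABLE product_dim (product_id INTEGER PRIMARY KEY, family TEXT);
    CREATE TABLE sales_fact (time_id INTEGER, product_id INTEGER, amount REAL);
    INSERT INTO time_dim VALUES (1, 2010, 'Q1'), (2, 2010, 'Q2');
    INSERT INTO product_dim VALUES (1, 'Food'), (2, 'Drink');
    INSERT INTO sales_fact VALUES
        (1, 1, 100.0), (1, 2, 40.0), (2, 1, 60.0), (2, 2, 50.0);
""")


def sales_by(level_columns):
    """Aggregate the sales measure over the requested dimension levels.

    Column names cannot be bound as SQL parameters, so they are checked
    against a fixed whitelist before being placed in the statement.
    """
    allowed = {"year", "quarter", "family"}
    if not level_columns or not set(level_columns) <= allowed:
        raise ValueError("unknown level")
    cols = ", ".join(level_columns)
    sql = f"""
        SELECT {cols}, SUM(f.amount)
        FROM sales_fact f
        JOIN time_dim t ON t.time_id = f.time_id
        JOIN product_dim p ON p.product_id = f.product_id
        GROUP BY {cols}
        ORDER BY {cols}
    """
    return conn.execute(sql).fetchall()


by_family = sales_by(["family"])
by_quarter_family = sales_by(["quarter", "family"])
```

Each call corresponds to one view of the cube; the data stays in the relational tables and is aggregated at query time, which is what distinguishes this approach from a server that preloads a dedicated structure.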
  28. JPivot. OLAP client. It allows users to represent an OLAP cube and to navigate it: drill down, drill up; drill across, drill through; slice and dice. It allows a graph to be associated with the dimensional table. It exports to PDF or Excel. The user interface can be customized using style sheets.
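Some of the navigation operations named on slide 28 can be sketched on a tiny in-memory fact table. This is a plain Python illustration of the concepts, not JPivot code; the dimension names, the data and the helper functions (`aggregate`, `slice_`, `dice`) are all hypothetical.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, quarter, region, product, units)
FACTS = [
    (2010, "Q1", "North", "Food", 10),
    (2010, "Q1", "South", "Food", 5),
    (2010, "Q2", "North", "Drink", 7),
    (2010, "Q2", "South", "Food", 3),
    (2009, "Q4", "North", "Drink", 4),
]
DIMS = ("year", "quarter", "region", "product")


def aggregate(rows, by):
    """Sum units grouped by the given dimension names."""
    idx = [DIMS.index(d) for d in by]
    totals = defaultdict(int)
    for row in rows:
        totals[tuple(row[i] for i in idx)] += row[-1]
    return dict(totals)


def slice_(rows, dim, value):
    """Slice: fix one dimension to a single value."""
    i = DIMS.index(dim)
    return [r for r in rows if r[i] == value]


def dice(rows, **allowed):
    """Dice: keep rows whose dimensions fall within the given value sets."""
    return [r for r in rows
            if all(r[DIMS.index(d)] in vals for d, vals in allowed.items())]


by_year = aggregate(FACTS, ["year"])                      # rolled-up view
by_year_quarter = aggregate(FACTS, ["year", "quarter"])   # drill down one level
north_only = aggregate(slice_(FACTS, "region", "North"), ["year"])
diced = aggregate(dice(FACTS, region={"North"}, product={"Drink"}), ["quarter"])
```

Going from `by_year_quarter` back to `by_year` is the drill-up direction; drill through, in the same terms, would return the underlying rows of `FACTS` instead of their totals.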
  29. JPivot - Screenshot
  30. Palo. OLAP server. It belongs to the MOLAP (Multidimensional OLAP) category, since it loads data into a dedicated structure. A plugin is available to access the Palo server from Excel. It can be accessed by a JPalo client (now Palo Pivot). In the commercial version it is possible to select and change values and to spread aggregated data through the details.
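Spreading an aggregated value through the details (slide 30) means that a change entered at a consolidated cell is pushed down to the cells beneath it. The following is a minimal sketch, assuming a proportional allocation rule; the slide does not say which rules Palo offers, and the function name `spread` and the figures are hypothetical.

```python
def spread(details, new_total):
    """Distribute new_total over detail cells in proportion to their values.

    If all current values are zero, fall back to an equal split.
    """
    if not details:
        raise ValueError("no detail cells to spread over")
    current_total = sum(details.values())
    if current_total == 0:
        share = new_total / len(details)
        return {key: share for key in details}
    return {key: new_total * value / current_total
            for key, value in details.items()}


# The yearly total is changed from 1000 to 2000; quarters keep their weights.
quarterly = {"Q1": 100.0, "Q2": 300.0, "Q3": 400.0, "Q4": 200.0}
updated = spread(quarterly, 2000.0)
```

A proportional rule preserves the relative weight of each detail cell, which is usually what a planner expects when revising a budget total; an equal split or a copy from a reference period would be alternative rules.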
  31. Palo Pivot. OLAP client by JPalo. Web interface to access both Palo and Mondrian. As an alternative you can use Palo Eclipse Client, a thick client based on Eclipse.
  32. BI Platforms: Business Intelligence platforms
  33. Pentaho BI Suite. Product suite to distribute analytical functionalities and documents through portals (JBoss Portal) or a web application. It has a dual-license (open core) model: Community Edition, free and open source; Enterprise Edition, license fee. Community Edition functionalities: Pentaho Server (reporting, analysis, dashboard), Pentaho Report Designer, Pentaho Design Studio, Pentaho Data Integration, Pentaho Metadata Editor.
  34. Pentaho - Architecture
  35. Pentaho Community Edition
  36. Pentaho Enterprise Edition. The main modules are “certified”. Professional support, software maintenance and assurance. Main enhanced functionalities: Pentaho Analyzer, Dashboard Designer, Enterprise Console, SSO, security configuration, repository utilities, lifecycle management, audit reports, clustering, performance monitoring, ETL management and monitoring, enterprise security.
  37. Pentaho Enterprise Edition
  38. Pentaho: main components. Workflow engine: it structures a decision process by means of actions; each action is described in an XML file; the XML files are created in Pentaho Design Studio, an Eclipse-based user interface. Task scheduler: based on Quartz; it can schedule any Pentaho action; it can periodically send reports by mail; task control can be manual or linked to an action.
  39. Pentaho: user interface. Web application: it manages user roles in accessing functionalities; it is the preferred way to access Pentaho. Portal: it manages portlets in JBoss Portal (EmbeddedReportPortlet, ChartPortlet); security is managed by the portal.
  40. SpagoBI. Business Intelligence suite. Totally free and open source, with only one version and one license (LGPL). It has an open architecture that allows new components, both open source and proprietary, to be integrated. It integrates some open source solutions (Jasper, BIRT, Mondrian) and provides original ones (Geo, QbE, KPI).
  41. SpagoBI: modules. SpagoBI Server (12 analytical areas / 21 engines): SpagoBI Reporting (4 engines), SpagoBI OLAP (3 engines), SpagoBI Free Inquiry (3 engines), SpagoBI Chart, SpagoBI GEO (2 engines), SpagoBI KPI, SpagoBI Real Time Dashboards, SpagoBI Interactive Dashboards, SpagoBI Data Mining, SpagoBI Analytical Dossier, SpagoBI Office, SpagoBI ETL – Talend, SpagoBI Console. Further modules: SpagoBI Studio, SpagoBI Metadata, SpagoBI SDK, SpagoBI Applications.
  42. SpagoBI - Architecture (diagram). Labels as extracted, in no recoverable order: Administrative Enterprise application integration Cross Tools Services BI Webapp BI Portlet BI Service BI Tag BI Snapshot BI Api Scheduler SSO Role synchronization Analytical Engine (AE) Alert & Notify Accessible reporting Profiling system Workflow External processes Import/export Search Data mining Free Inquiry RT Console Dashboard Reporting Data filter Dossier Cockpit Menu designer Collaboration OLAP Office Chart GEO ETL KPI Map catalogue Rule engine Repository mng Mail AE management Rank BM management Exporters Engine mng RT Events Data sources Behavioural Model (BM) Document Browser Data Sets Analytical Driver ROLE LOV CHECK Personal folders Audit & Monitoring Cross Navigation Subscription mng Metadata Data & Metadata Layer Subscriptions Metadata mng repository DWH Metadata ETL/EAI External data/applications
  43. SpagoBI: the user interface. Web application: can be deployed on any web container, such as Tomcat, JBoss or WebSphere; security is managed by the integrated CAS module. Portal: can be deployed on any portal container compliant with the JSR 168 standard, such as eXo WebOS or Liferay; security is managed by the portal. The source code is the same: deploying as a web application or as a portal is a matter of configuration.
  44. SpagoBI User Interface
  45. SpagoBI Studio
  46. JasperSoft BI Suite. The BI platform of JasperSoft. Main modules: Jasper Server, Jasper Analysis, Jasper Reports + iReport, Jasper ETL. Three editions available: Community Edition (GPL or LGPL license), Professional Edition (commercial license), Enterprise Edition (commercial license). Users can build their own reports. The user interface is based on a dedicated web application; no portal is used.
  47. JasperSoft Suite - Architecture
  48. Jasper Suite – Community Edition
  49. Jasper Intelligence: commercial version and ETL. The commercial version includes: certified support, release cycle management, support guarantees, legal matters and indemnity. Functionalities added in the Professional version: web reporting and analysis, dashboards, Flash-based charts, security. Functionalities added in the Enterprise version: multi-tenancy, advanced OLAP services, audit logging, ETL with activity monitoring.
  50. Jasper Suite – Professional Edition
  51. The future directions. Business Intelligence trends: Cloud BI, predictive analysis, real-time BI. (Image: http://www.flickr.com/photos/bitterjug)
  52. Thanks …