Open Source BI Overview

Proof that an entire data-driven Business Intelligence stack can be successfully implemented with open source software.

Published in: Technology

Transcript

  • 1. Open Source Business Intelligence Overview: From Data Source to Analytics and Beyond
  • 2. Agenda
      ● Open Source and BI
      ● Data sources
      ● Data Integration
      ● Reporting/Frontend
      ● Analytics
      ● Data Quality
      ● Data Governance
  • 3. Source: https://www.informs.org/ORMS-Today/Public-Articles/October-Volume-37-Number-5/Back-in-Business
  • 4. Data Sources
      Traditional
        ○ PostgreSQL - http://www.postgresql.org/
          ■ Pivotal Greenplum - http://gopivotal.com/
        ○ MySQL - http://www.mysql.com/
          ■ Percona - http://www.percona.com/
          ■ MariaDB - https://mariadb.org/
      Columnar
        ○ MySQL Derivatives
          ■ InfiniDB - http://infinidb.org/
          ■ Infobright - https://www.infobright.com/
        ○ MonetDB - http://www.monetdb.org/Home
  • 5. Relational vs Columnar Source: http://www.calpont.com/images/column-oriented-database.jpg
  • 6. Data Sources
      NoSQL
        ○ Cassandra - http://cassandra.apache.org/
        ○ MongoDB - http://www.mongodb.org/
        ○ CouchDB - http://couchdb.apache.org/
        ○ Infinispan - http://www.jboss.org/infinispan/
        ○ Hadoop - http://hadoop.apache.org/
          ■ HBase - http://hbase.apache.org/
          ■ Hive - http://hive.apache.org/
      OLAP
        ○ Mondrian - http://mondrian.pentaho.com/
  • 7. Source: http://gerardnico.com/wiki/database/oracle/oracle_olap
  • 8. The Next Wave of Data Sources
      Virtualization
        ○ Teiid - http://www.jboss.org/teiid/
      Semantic Web/Graph
        ○ Sesame - http://www.openrdf.org/
        ○ Neo4j - http://www.neo4j.org/
        ○ OrientDB - http://www.orientdb.org/
        ○ Infogrid - http://infogrid.org/trac/
  • 9. Source: http://www.ebizq.net/blogs/guest_session/2009/12/putting-data-to-work-for-cloud-bpm-mdm-and-soa-projects.php
  • 10. Graph Database Source: http://en.wikipedia.org/wiki/Graph_database
  • 11. Data Integration
        ○ Kettle - http://kettle.pentaho.com/
        ○ Talend - http://www.talend.com/
        ○ CloverETL - http://www.cloveretl.com/
  • 12. Reporting
        ○ BIRT (Actuate) - http://www.eclipse.org/birt/phoenix/
        ○ Pentaho - http://reporting.pentaho.com/
        ○ Jaspersoft - http://community.jaspersoft.com/
        ○ Saiku - http://meteorite.bi/saiku
  • 13. Full Stacks
        ○ SpagoBI - http://www.spagoworld.org/xwiki/bin/view/SpagoBI/#
        ○ Pentaho - http://www.pentaho.com/
        ○ Jaspersoft - http://www.jaspersoft.com/
  • 14. Analytics
        ○ R - http://www.r-project.org/
        ○ Weka - http://www.cs.waikato.ac.nz/ml/weka/
        ○ RapidMiner - http://rapid-i.com/content/view/181/
  • 15. Data Quality
      Profiling
        ○ DataCleaner - http://datacleaner.org/
        ○ DQGuru - http://www.sqlpower.ca/page/dqguru
      Suites
        ○ Talend - http://www.talend.com/products/data-quality
      Testing
        ○ SQLUnit - http://sqlunit.sourceforge.net/
        ○ dbFit - http://benilovj.github.io/dbfit/
        ○ etlUnit - https://github.com/dbaAlex/etlUnit (shameless plug :p )
  • 16. Data Governance
      MDM
        ○ Talend - http://www.talend.com/resource/data-governance.html
      Business Rules Engine
        ○ JBoss Drools - http://www.jboss.org/drools/
        ○ Open Rules - http://openrules.com/
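
Slide 5 contrasts relational (row-oriented) and columnar storage without spelling out the difference. The following is a minimal Python sketch of the idea, not a model of how any of the listed databases is implemented: the sample records and the function names (`to_columns`, `total_row_store`, `total_column_store`) are hypothetical and do not come from the slides. It shows why an aggregate over a single field touches less data when values are grouped by column.

```python
# Hypothetical sample data; not taken from the slides.
rows = [
    {"id": 1, "region": "east", "sales": 100},
    {"id": 2, "region": "west", "sales": 250},
    {"id": 3, "region": "east", "sales": 175},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


def total_row_store(records, field):
    """Row layout: each whole record is visited to read one field."""
    return sum(record[field] for record in records)


def total_column_store(columns, field):
    """Column layout: only the requested column is scanned."""
    return sum(columns[field])


columns = to_columns(rows)
print(total_row_store(rows, "sales"))
print(total_column_store(columns, "sales"))
```

Both functions return the same total; the difference is only in how much data each one has to walk through, which is the usual argument for columnar engines in analytic, aggregate-heavy workloads.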