SlideShare a Scribd company logo
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
SpagoBI and Talend jointly support Big Data scenarios
Monica Franceschini - SpagoBI Architect
SpagoBI Competency Center - Engineering Group
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Big-data
• Agenda
– Intro & definitions
– Layers
– Talend & SpagoBI
– SpagoBI big-data roadmap
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Big Data - 3Vs
"Big data" is high-volume, high-velocity and high-variety information assets that
demand cost-effective, innovative forms of information processing for enhanced
insight and decision making.
Source: The Importance of 'Big Data': A Definition, Mark Beyer, Douglas. Gartner, 21 June 2012.
VOLUME The increase in data volumes within enterprise systems is
caused by transaction volumes and other traditional data types, as
well as by new types of data. Too much volume is a storage issue, but
too much data is also a massive analysis issue
VARIETY IT leaders have always had an issue translating large volumes
of transactional information into decisions — now there are more
types of information to analyze — mainly coming from social media
and mobile (context-aware). Variety includes tabular data (databases),
hierarchical data, documents, e-mail, metering data, video, still
images, audio, stock ticker data, financial transactions and more.
VELOCITY This involves streams of data, structured record creation, and
availability for access and delivery. Velocity means both how fast data
is being produced and how fast the data must be processed to meet
demand
Gartner Press Release, “Gartner Says Solving ‘Big Data’ Challenge Involves More Than Just Managing Volumes of Data”, June
27, 2011
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Big Data- 3Vs & more
VARIABILITY
variance in meaning, in lexicon
VERACITY
1 in 3 business leaders don’t trust the information they use to make
decisions. How can you act upon information if you don’t trust it?
Establishing trust in big data presents a huge challenge as the
variety and number of sources grows.
VALUE
The economic value of different data varies significantly. Typically
there is good information hidden amongst a larger body of non-
traditional data; the challenge is identifying what is valuable and
then transforming and extracting that data for analysis.
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Big data - Layers
• Infastructure
– On-site
– IaaS
• Data management:
– capture
– cleaning
– loading
– store
• View and Analyse
– Text analysis
– Text mining
– exploration, navigation, presentation
• Application
– Cloud
– SaaA
ETL
Business Intelligence
Services
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Big data & Businessn Intelligence
• Tasks:
– Manage big-data (ETL) Talend→
– Read, interpret and show big-data (BI) SpagoBI→
– Big-data and real-time (BI) SpagoBI→
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Talend - Big Data Management
Big Data
Production
Big Data Management
Big Data
Consumption
Storage
Processing
Filtering
Mining
Analytics
Search
Enrichment
RDBMS
Analytical DB
NoSQL DB
ERP/CRM
SaaS
Social Media
Web Analytics
Log Files
RFID
Call Data Records
Sensors
Machine-Generated
Big Data
Integration
Big Data
Quality
Turn Big Data into
actionable information
Parsing
Checking
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Talend Goal: democratize Big Data
…an open source
ecosystem
Talend Open Studio for Big Data
“Big Data for the Masses”
 Improves efficiency of big data job design with
graphic interface
 Abstracts and generates code
 Run transforms inside Hadoop
 Native support for HDFS, Sqoop, HBase,
Mahout, Pig, Hive & MapReduce code generat°
 Apache License 2.0
 Embedded in Hortonworks Data Platform
 Certifed with Cloudera, MapR and Grenplum
HCatalog
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
ETL: Analytical databases & appliances
Connectors from/to:
‗Greenplum
‗Netezza
‗Sybase
‗Teradata
‗VectorWise
‗Vertica
‗HDFS
‗HBase
‗Hive
‗Cassandra
‗MongoDB
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
SpagoBI - load
Certified appliances:
‗Teradata
‗VectorWise
Connectors from:
‗Cassandra
‗HBase
‗Hive
‗Impala
‗Hadoop
RT with:
‗Storm
‗WSO2
More:
‗Scheduled data-set
‗In-memory data set
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
SpagoBI - meaning
Support for open standards:
‗RDF (Resource Description Framework) http://www.w3.org/RDF/
‗OWL (Web Ontology Language) http://www.w3.org/OWL/
‗R
‗Mahout
‗Text mining
Connectors from:
‗Neo4J
‗Freebase
‗OrientDB
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
SpagoBI - show
Explorative front-end
‗Network analysis
‗Exploration
‗In-memory
‗Data visualization
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
SpagoBI - roadmap
• Capture / Store
– Talend, connector to/from:
• Greenplum
• Netezza
• Sybase
• Teradata
• VectorWise
• Vertica
• HDFS
• HBase
• Hive
• Cassandra
• MongoDB
• …
• LOAD
– Certified appliances:
• Teradata
• VectorWise
– Connectors from:
• Cassandra
• HBase
• Hive
• Impala
• Hadoop
• MongoDB
– RT with:
• Storm
• WS02
– More:
• Scheduled data-set
• In-memory data set
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
SpagoBI - roadmap
• Meaning
– Connectors from:
• Neo4J
• Freebase
• OrientDB
– Support for open standards:
• RDF
• OWL
– Mining
• R
• MashR
• Text mining
• Show
– Explorative front-end
– Network analysis
– Data visualization
• Services
– Big data as a service
• Multitenant
• Cloud
• BI as a service (ad-hoc+self-service)
Data scientist
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
Bundle Talend -SpagoBI
The bundle will provide:
a distribution of both tools
interacting one with each other
a use-case that can be run to explore
their functionalities
SpagoBI and Talend announce their bundle!
www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
@twittmonique
Monica.franceschini@eng.it

More Related Content

What's hot

Semantic Technologies for Big Data
Semantic Technologies for Big DataSemantic Technologies for Big Data
Semantic Technologies for Big Data
Marin Dimitrov
 
Do I need a Graph Database?
Do I need a Graph Database?Do I need a Graph Database?
Do I need a Graph Database?
Juan Sequeda
 
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
DataWorks Summit/Hadoop Summit
 
Big Data Landscape 2016
Big Data Landscape 2016 Big Data Landscape 2016
Big Data Landscape 2016
Matt Turck
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Connected Data World
 
Big data Big Analytics
Big data Big AnalyticsBig data Big Analytics
Big data Big Analytics
Ajay Ohri
 
Conclusions - Linked Data
Conclusions - Linked DataConclusions - Linked Data
Conclusions - Linked Data
Juan Sequeda
 
Data analytics using the cloud challenges and opportunities for india
Data analytics using the cloud   challenges and opportunities for india Data analytics using the cloud   challenges and opportunities for india
Data analytics using the cloud challenges and opportunities for india
Ajay Ohri
 
The New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the CloudThe New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the Cloud
Inside Analysis
 
PoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic LadderPoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic Ladder
Semantic Web Company
 
متن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌دادهمتن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌داده
جشنوارهٔ روز آزادی نرم‌افزار تهران
 
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
GetInData
 
Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge
Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge
Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge
SpagoWorld
 
Accelerating Big Data Implementations for the Connected World
Accelerating Big Data Implementations for the Connected WorldAccelerating Big Data Implementations for the Connected World
Accelerating Big Data Implementations for the Connected World
DataWorks Summit/Hadoop Summit
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
Dr. Haxel Consult
 
Marketing vs Technology
Marketing vs TechnologyMarketing vs Technology
Marketing vs Technology
Nguyen Ngoc Hoai Aan
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing Platform
Ontotext
 
Using the Semantic Web Stack to Make Big Data Smarter
Using the Semantic Web Stack to Make  Big Data SmarterUsing the Semantic Web Stack to Make  Big Data Smarter
Using the Semantic Web Stack to Make Big Data Smarter
Matheus Mota
 
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Databricks
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Cambridge Semantics
 

What's hot (20)

Semantic Technologies for Big Data
Semantic Technologies for Big DataSemantic Technologies for Big Data
Semantic Technologies for Big Data
 
Do I need a Graph Database?
Do I need a Graph Database?Do I need a Graph Database?
Do I need a Graph Database?
 
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
 
Big Data Landscape 2016
Big Data Landscape 2016 Big Data Landscape 2016
Big Data Landscape 2016
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
 
Big data Big Analytics
Big data Big AnalyticsBig data Big Analytics
Big data Big Analytics
 
Conclusions - Linked Data
Conclusions - Linked DataConclusions - Linked Data
Conclusions - Linked Data
 
Data analytics using the cloud challenges and opportunities for india
Data analytics using the cloud   challenges and opportunities for india Data analytics using the cloud   challenges and opportunities for india
Data analytics using the cloud challenges and opportunities for india
 
The New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the CloudThe New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the Cloud
 
PoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic LadderPoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic Ladder
 
متن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌دادهمتن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌داده
 
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
 
Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge
Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge
Webinar: SpagoBI & Big Data, a smart approach to turn data into knowledge
 
Accelerating Big Data Implementations for the Connected World
Accelerating Big Data Implementations for the Connected WorldAccelerating Big Data Implementations for the Connected World
Accelerating Big Data Implementations for the Connected World
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
 
Marketing vs Technology
Marketing vs TechnologyMarketing vs Technology
Marketing vs Technology
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing Platform
 
Using the Semantic Web Stack to Make Big Data Smarter
Using the Semantic Web Stack to Make  Big Data SmarterUsing the Semantic Web Stack to Make  Big Data Smarter
Using the Semantic Web Stack to Make Big Data Smarter
 
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
 

Similar to Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios

BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
Bigdata
BigdataBigdata
Big data oracle_introduccion
Big data oracle_introduccionBig data oracle_introduccion
Big data oracle_introduccion
Fran Navarro
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Scott Mitchell
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
Jürgen Ambrosi
 
VLDB 2013: How to maximize the value of Big Data with SpagoBI suite
VLDB 2013: How to maximize the value of Big Data with SpagoBI suiteVLDB 2013: How to maximize the value of Big Data with SpagoBI suite
VLDB 2013: How to maximize the value of Big Data with SpagoBI suite
SpagoWorld
 
Chug building a data lake in azure with spark and databricks
Chug   building a data lake in azure with spark and databricksChug   building a data lake in azure with spark and databricks
Chug building a data lake in azure with spark and databricks
Brandon Berlinrut
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Neo4j
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
FredReynolds2
 
Solutions Linux 2013: Extracting value from Big Data through a new informatio...
Solutions Linux 2013: Extracting value from Big Data through a new informatio...Solutions Linux 2013: Extracting value from Big Data through a new informatio...
Solutions Linux 2013: Extracting value from Big Data through a new informatio...
SpagoWorld
 
SpagoBI 5 Demo Day and Workshop : Technology Applications and Uses
SpagoBI 5 Demo Day and Workshop : Technology Applications and UsesSpagoBI 5 Demo Day and Workshop : Technology Applications and Uses
SpagoBI 5 Demo Day and Workshop : Technology Applications and Uses
SpagoWorld
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companies
Robert Smith
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life Revolution
Capgemini
 
2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation
Felix Liao
 
DOAG Big Data Days 2017 - Cloud Journey
DOAG Big Data Days 2017 - Cloud JourneyDOAG Big Data Days 2017 - Cloud Journey
DOAG Big Data Days 2017 - Cloud Journey
Harald Erb
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San Jose
Jeffrey T. Pollock
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Data
jdijcks
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
Caserta
 
Big Data
Big DataBig Data
Big Data
Ben Duan
 

Similar to Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios (20)

BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Bigdata
BigdataBigdata
Bigdata
 
Big data oracle_introduccion
Big data oracle_introduccionBig data oracle_introduccion
Big data oracle_introduccion
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
 
VLDB 2013: How to maximize the value of Big Data with SpagoBI suite
VLDB 2013: How to maximize the value of Big Data with SpagoBI suiteVLDB 2013: How to maximize the value of Big Data with SpagoBI suite
VLDB 2013: How to maximize the value of Big Data with SpagoBI suite
 
Chug building a data lake in azure with spark and databricks
Chug   building a data lake in azure with spark and databricksChug   building a data lake in azure with spark and databricks
Chug building a data lake in azure with spark and databricks
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
Solutions Linux 2013: Extracting value from Big Data through a new informatio...
Solutions Linux 2013: Extracting value from Big Data through a new informatio...Solutions Linux 2013: Extracting value from Big Data through a new informatio...
Solutions Linux 2013: Extracting value from Big Data through a new informatio...
 
SpagoBI 5 Demo Day and Workshop : Technology Applications and Uses
SpagoBI 5 Demo Day and Workshop : Technology Applications and UsesSpagoBI 5 Demo Day and Workshop : Technology Applications and Uses
SpagoBI 5 Demo Day and Workshop : Technology Applications and Uses
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companies
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life Revolution
 
2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation
 
DOAG Big Data Days 2017 - Cloud Journey
DOAG Big Data Days 2017 - Cloud JourneyDOAG Big Data Days 2017 - Cloud Journey
DOAG Big Data Days 2017 - Cloud Journey
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San Jose
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Data
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
Big Data
Big DataBig Data
Big Data
 

More from SpagoWorld

[SFScon'17] More than a decade with free open source software
[SFScon'17] More than a decade with free open source software[SFScon'17] More than a decade with free open source software
[SFScon'17] More than a decade with free open source software
SpagoWorld
 
EclipseDay Milano 2017 - How to make Data Science appealing with open source ...
EclipseDay Milano 2017 - How to make Data Science appealing with open source ...EclipseDay Milano 2017 - How to make Data Science appealing with open source ...
EclipseDay Milano 2017 - How to make Data Science appealing with open source ...
SpagoWorld
 
Parametric report slide support
Parametric report slide supportParametric report slide support
Parametric report slide support
SpagoWorld
 
My First Report slide support
My First Report slide supportMy First Report slide support
My First Report slide support
SpagoWorld
 
My First Worksheet slide support
My First Worksheet slide supportMy First Worksheet slide support
My First Worksheet slide support
SpagoWorld
 
Starting with SpagoBI Slide Support
Starting with SpagoBI Slide SupportStarting with SpagoBI Slide Support
Starting with SpagoBI Slide Support
SpagoWorld
 
SpagoBI Suite Slide Support
SpagoBI Suite Slide SupportSpagoBI Suite Slide Support
SpagoBI Suite Slide Support
SpagoWorld
 
Architectural Evolution Starting from Hadoop
Architectural Evolution Starting from HadoopArchitectural Evolution Starting from Hadoop
Architectural Evolution Starting from Hadoop
SpagoWorld
 
Openness as the Engine for Digital Innovation
Openness as the Engine for Digital InnovationOpenness as the Engine for Digital Innovation
Openness as the Engine for Digital Innovation
SpagoWorld
 
HUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions Architect
HUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions ArchitectHUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions Architect
HUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions Architect
SpagoWorld
 
HUG Italy meet-up with Tugdual Grall, MapR Technical Evangelist
HUG Italy meet-up with Tugdual Grall, MapR Technical EvangelistHUG Italy meet-up with Tugdual Grall, MapR Technical Evangelist
HUG Italy meet-up with Tugdual Grall, MapR Technical Evangelist
SpagoWorld
 
Data Mining with SpagoBI suite
Data Mining with SpagoBI suiteData Mining with SpagoBI suite
Data Mining with SpagoBI suite
SpagoWorld
 
Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...
Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...
Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...
SpagoWorld
 
Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?
Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?
Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?
SpagoWorld
 
Webinar - SpagoBI 5: here comes the Social Network analysis
Webinar - SpagoBI 5: here comes the Social Network analysis Webinar - SpagoBI 5: here comes the Social Network analysis
Webinar - SpagoBI 5: here comes the Social Network analysis
SpagoWorld
 
Webinar - What's new with SpagoBI 5: presentation and demo
Webinar - What's new with SpagoBI 5: presentation and demoWebinar - What's new with SpagoBI 5: presentation and demo
Webinar - What's new with SpagoBI 5: presentation and demo
SpagoWorld
 
SpagoBI 5 Demo Day and Workshop : Business Applications and Uses
SpagoBI 5 Demo Day and Workshop : Business Applications and UsesSpagoBI 5 Demo Day and Workshop : Business Applications and Uses
SpagoBI 5 Demo Day and Workshop : Business Applications and Uses
SpagoWorld
 
Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...
Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...
Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...
SpagoWorld
 
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
SpagoWorld
 
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
SpagoWorld
 

More from SpagoWorld (20)

[SFScon'17] More than a decade with free open source software
[SFScon'17] More than a decade with free open source software[SFScon'17] More than a decade with free open source software
[SFScon'17] More than a decade with free open source software
 
EclipseDay Milano 2017 - How to make Data Science appealing with open source ...
EclipseDay Milano 2017 - How to make Data Science appealing with open source ...EclipseDay Milano 2017 - How to make Data Science appealing with open source ...
EclipseDay Milano 2017 - How to make Data Science appealing with open source ...
 
Parametric report slide support
Parametric report slide supportParametric report slide support
Parametric report slide support
 
My First Report slide support
My First Report slide supportMy First Report slide support
My First Report slide support
 
My First Worksheet slide support
My First Worksheet slide supportMy First Worksheet slide support
My First Worksheet slide support
 
Starting with SpagoBI Slide Support
Starting with SpagoBI Slide SupportStarting with SpagoBI Slide Support
Starting with SpagoBI Slide Support
 
SpagoBI Suite Slide Support
SpagoBI Suite Slide SupportSpagoBI Suite Slide Support
SpagoBI Suite Slide Support
 
Architectural Evolution Starting from Hadoop
Architectural Evolution Starting from HadoopArchitectural Evolution Starting from Hadoop
Architectural Evolution Starting from Hadoop
 
Openness as the Engine for Digital Innovation
Openness as the Engine for Digital InnovationOpenness as the Engine for Digital Innovation
Openness as the Engine for Digital Innovation
 
HUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions Architect
HUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions ArchitectHUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions Architect
HUG Italy meet-up with Fabian Wilckens, MapR EMEA Solutions Architect
 
HUG Italy meet-up with Tugdual Grall, MapR Technical Evangelist
HUG Italy meet-up with Tugdual Grall, MapR Technical EvangelistHUG Italy meet-up with Tugdual Grall, MapR Technical Evangelist
HUG Italy meet-up with Tugdual Grall, MapR Technical Evangelist
 
Data Mining with SpagoBI suite
Data Mining with SpagoBI suiteData Mining with SpagoBI suite
Data Mining with SpagoBI suite
 
Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...
Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...
Webinar: SpagoBI 5 - Self-build your interactive cockpits, get instant insigh...
 
Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?
Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?
Webinar - SpagoBI 5 and what-if analytics: is your business strategy effective?
 
Webinar - SpagoBI 5: here comes the Social Network analysis
Webinar - SpagoBI 5: here comes the Social Network analysis Webinar - SpagoBI 5: here comes the Social Network analysis
Webinar - SpagoBI 5: here comes the Social Network analysis
 
Webinar - What's new with SpagoBI 5: presentation and demo
Webinar - What's new with SpagoBI 5: presentation and demoWebinar - What's new with SpagoBI 5: presentation and demo
Webinar - What's new with SpagoBI 5: presentation and demo
 
SpagoBI 5 Demo Day and Workshop : Business Applications and Uses
SpagoBI 5 Demo Day and Workshop : Business Applications and UsesSpagoBI 5 Demo Day and Workshop : Business Applications and Uses
SpagoBI 5 Demo Day and Workshop : Business Applications and Uses
 
Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...
Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...
Engineering and OW2 Big Data Initiative: an open approach to the data-driven ...
 
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
 
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
OW2Con’14 – OW2 Big Data initiative: leveraging the data-driven economy with ...
 

Recently uploaded

Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
LucaBarbaro3
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 

Recently uploaded (20)

Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 

Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios

  • 1. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. SpagoBI and Talend jointly support Big Data scenarios Monica Franceschini - SpagoBI Architect SpagoBI Competency Center - Engineering Group
  • 2. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Big-data • Agenda – Intro & definitions – Layers – Talend & SpagoBI – SpagoBI big-data roadmap
  • 3. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Big Data - 3Vs "Big data" is high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making. Source: The Importance of 'Big Data': A Definition, Mark Beyer, Douglas. Gartner, 21 June 2012. VOLUME The increase in data volumes within enterprise systems is caused by transaction volumes and other traditional data types, as well as by new types of data. Too much volume is a storage issue, but too much data is also a massive analysis issue VARIETY IT leaders have always had an issue translating large volumes of transactional information into decisions — now there are more types of information to analyze — mainly coming from social media and mobile (context-aware). Variety includes tabular data (databases), hierarchical data, documents, e-mail, metering data, video, still images, audio, stock ticker data, financial transactions and more. VELOCITY This involves streams of data, structured record creation, and availability for access and delivery. Velocity means both how fast data is being produced and how fast the data must be processed to meet demand Gartner Press Release, “Gartner Says Solving ‘Big Data’ Challenge Involves More Than Just Managing Volumes of Data”, June 27, 2011
  • 4. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Big Data- 3Vs & more VARIABILITY variance in meaning, in lexicon VERACITY 1 in 3 business leaders don’t trust the information they use to make decisions. How can you act upon information if you don’t trust it? Establishing trust in big data presents a huge challenge as the variety and number of sources grows. VALUE The economic value of different data varies significantly. Typically there is good information hidden amongst a larger body of non- traditional data; the challenge is identifying what is valuable and then transforming and extracting that data for analysis.
  • 5. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Big data - Layers • Infastructure – On-site – IaaS • Data management: – capture – cleaning – loading – store • View and Analyse – Text analysis – Text mining – exploration, navigation, presentation • Application – Cloud – SaaA ETL Business Intelligence Services
  • 6. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Big data & Businessn Intelligence • Tasks: – Manage big-data (ETL) Talend→ – Read, interpret and show big-data (BI) SpagoBI→ – Big-data and real-time (BI) SpagoBI→
  • 7. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Talend - Big Data Management Big Data Production Big Data Management Big Data Consumption Storage Processing Filtering Mining Analytics Search Enrichment RDBMS Analytical DB NoSQL DB ERP/CRM SaaS Social Media Web Analytics Log Files RFID Call Data Records Sensors Machine-Generated Big Data Integration Big Data Quality Turn Big Data into actionable information Parsing Checking
  • 8. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Talend Goal: democratize Big Data …an open source ecosystem Talend Open Studio for Big Data “Big Data for the Masses”  Improves efficiency of big data job design with graphic interface  Abstracts and generates code  Run transforms inside Hadoop  Native support for HDFS, Sqoop, HBase, Mahout, Pig, Hive & MapReduce code generat°  Apache License 2.0  Embedded in Hortonworks Data Platform  Certifed with Cloudera, MapR and Grenplum HCatalog
  • 9. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. ETL: Analytical databases & appliances Connectors from/to: ‗Greenplum ‗Netezza ‗Sybase ‗Teradata ‗VectorWise ‗Vertica ‗HDFS ‗HBase ‗Hive ‗Cassandra ‗MongoDB
  • 10. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. SpagoBI - load Certified appliances: ‗Teradata ‗VectorWise Connectors from: ‗Cassandra ‗HBase ‗Hive ‗Impala ‗Hadoop RT with: ‗Storm ‗WSO2 More: ‗Scheduled data-set ‗In-memory data set
  • 11. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. SpagoBI - meaning Support for open standards: ‗RDF (Resource Description Framework) http://www.w3.org/RDF/ ‗OWL (Web Ontology Language) http://www.w3.org/OWL/ ‗R ‗Mahout ‗Text mining Connectors from: ‗Neo4J ‗Freebase ‗OrientDB
  • 12. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. SpagoBI - show Explorative front-end ‗Network analysis ‗Exploration ‗In-memory ‗Data visualization
  • 13. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. SpagoBI - roadmap • Capture / Store – Talend, connector to/from: • Greenplum • Netezza • Sybase • Teradata • VectorWise • Vertica • HDFS • HBase • Hive • Cassandra • MongoDB • … • LOAD – Certified appliances: • Teradata • VectorWise – Connectors from: • Cassandra • HBase • Hive • Impala • Hadoop • MongoDB – RT with: • Storm • WS02 – More: • Scheduled data-set • In-memory data set
  • 14. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. SpagoBI - roadmap • Meaning – Connectors from: • Neo4J • Freebase • OrientDB – Support for open standards: • RDF • OWL – Mining • R • MashR • Text mining • Show – Explorative front-end – Network analysis – Data visualization • Services – Big data as a service • Multitenant • Cloud • BI as a service (ad-hoc+self-service) Data scientist
  • 15. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. Bundle Talend -SpagoBI The bundle will provide: a distribution of both tools interacting one with each other a use-case that can be run to explore their functionalities SpagoBI and Talend announce their bundle!
  • 16. www.spagobi.orgCopyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved. @twittmonique Monica.franceschini@eng.it