Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios

This presentation supported the speech entitled "SpagoBI and Talend jointly support Big Data scenarios" delivered by Monica Franceschini, SpagoBI Architect, during the OW2 track at Solutions Linux 2013 (Paris, 28th-29th May 2013).

    Presentation Transcript

    • www.spagobi.org
      Copyright © 2013 Engineering Group, SpagoBI Competency Center. All rights reserved.
      SpagoBI and Talend jointly support Big Data scenarios
      Monica Franceschini - SpagoBI Architect
      SpagoBI Competency Center - Engineering Group
    • Big-data
      • Agenda
        – Intro & definitions
        – Layers
        – Talend & SpagoBI
        – SpagoBI big-data roadmap
    • Big Data - 3Vs
      "Big data" is high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.
      Source: The Importance of Big Data: A Definition, Mark Beyer, Douglas. Gartner, 21 June 2012.
      – VOLUME: The increase in data volumes within enterprise systems is caused by transaction volumes and other traditional data types, as well as by new types of data. Too much volume is a storage issue, but too much data is also a massive analysis issue.
      – VARIETY: IT leaders have always had an issue translating large volumes of transactional information into decisions; now there are more types of information to analyze, mainly coming from social media and mobile (context-aware). Variety includes tabular data (databases), hierarchical data, documents, e-mail, metering data, video, still images, audio, stock ticker data, financial transactions and more.
      – VELOCITY: This involves streams of data, structured record creation, and availability for access and delivery. Velocity means both how fast data is being produced and how fast the data must be processed to meet demand.
      Source: Gartner Press Release, "Gartner Says Solving 'Big Data' Challenge Involves More Than Just Managing Volumes of Data", June 27, 2011
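
The velocity definition above compares two rates: how fast data is produced and how fast it must be processed. A minimal Python sketch of that comparison follows. The function name `backlog_after` and the rates are hypothetical, chosen only for illustration, and the sketch assumes constant rates and an initially empty queue.

```python
def backlog_after(produced_per_sec: float, processed_per_sec: float, seconds: float) -> float:
    """Records still waiting after `seconds`, assuming constant rates
    and an empty queue at the start (both are simplifying assumptions)."""
    surplus = produced_per_sec - processed_per_sec
    # A processing rate at or above the production rate leaves no backlog.
    return max(0.0, float(surplus * seconds))


if __name__ == "__main__":
    # Hypothetical rates, not figures from the presentation.
    print(backlog_after(5000, 4000, 60))
```

When the processing rate falls below the production rate, the backlog grows linearly with time, which is why velocity is about processing capacity as much as about the incoming stream.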
    • Big Data - 3Vs & more
      – VARIABILITY: variance in meaning, in lexicon.
      – VERACITY: 1 in 3 business leaders don't trust the information they use to make decisions. How can you act upon information if you don't trust it? Establishing trust in big data presents a huge challenge as the variety and number of sources grows.
      – VALUE: The economic value of different data varies significantly. Typically there is good information hidden amongst a larger body of non-traditional data; the challenge is identifying what is valuable and then transforming and extracting that data for analysis.
    • Big data - Layers
      • Infrastructure
        – On-site
        – IaaS
      • Data management:
        – capture
        – cleaning
        – loading
        – store
      • View and Analyse
        – Text analysis
        – Text mining
        – exploration, navigation, presentation
      • Application
        – Cloud
        – SaaS
      Diagram labels: ETL, Business Intelligence, Services
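
The data-management steps listed above (capture, cleaning, loading, store) can be sketched in a few lines of standard-library Python. This is an illustration only, not how Talend implements them: the sample data, the function names, and the use of an in-memory SQLite database as the store are all assumptions.

```python
import csv
import io
import sqlite3

# Hypothetical raw input with typical defects: stray spaces, missing values.
RAW = """sensor_id,reading
s1, 21.5
s2,
s1,22.0
,19.0
"""


def capture(text):
    """Capture: parse raw CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Cleaning: trim whitespace and drop rows missing an id or a reading."""
    cleaned = []
    for row in rows:
        sensor = (row["sensor_id"] or "").strip()
        reading = (row["reading"] or "").strip()
        if sensor and reading:
            cleaned.append((sensor, float(reading)))
    return cleaned


def load(conn, rows):
    """Loading and store: write the cleaned rows into a table."""
    conn.execute("CREATE TABLE IF NOT EXISTS readings (sensor_id TEXT, reading REAL)")
    conn.executemany("INSERT INTO readings VALUES (?, ?)", rows)
    conn.commit()


def average_by_sensor(conn):
    """A simple query standing in for the 'View and Analyse' layer."""
    cursor = conn.execute(
        "SELECT sensor_id, AVG(reading) FROM readings GROUP BY sensor_id"
    )
    return dict(cursor.fetchall())


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    load(connection, clean(capture(RAW)))
    print(average_by_sensor(connection))
```

Keeping each step as its own function mirrors the layer list: the cleaning rule can change without touching capture or storage.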
    • Big data & Business Intelligence
      • Tasks:
        – Manage big-data (ETL) → Talend
        – Read, interpret and show big-data (BI) → SpagoBI
        – Big-data and real-time (BI) → SpagoBI
    • Talend - Big Data Management
      Turn Big Data into actionable information
      [Diagram: Big Data Production, Big Data Management, Big Data Consumption]
      Diagram labels: Storage, Processing, Filtering, Mining, Analytics, Search, Enrichment; RDBMS, Analytical DB, NoSQL DB, ERP/CRM, SaaS, Social Media, Web Analytics, Log Files, RFID, Call Data Records, Sensors, Machine-Generated; Big Data Integration, Big Data Quality, Parsing, Checking
    • Talend Goal: democratize Big Data
      …an open source ecosystem
      Talend Open Studio for Big Data: "Big Data for the Masses"
        – Improves efficiency of big data job design with graphic interface
        – Abstracts and generates code
        – Run transforms inside Hadoop
        – Native support for HDFS, Sqoop, HBase, Mahout, Pig, Hive & MapReduce code generation
        – Apache License 2.0
        – Embedded in Hortonworks Data Platform
        – Certified with Cloudera, MapR and Greenplum
        – HCatalog
    • ETL: Analytical databases & appliances
      Connectors from/to:
        – Greenplum
        – Netezza
        – Sybase
        – Teradata
        – VectorWise
        – Vertica
        – HDFS
        – HBase
        – Hive
        – Cassandra
        – MongoDB
    • SpagoBI - load
      Certified appliances:
        – Teradata
        – VectorWise
      Connectors from:
        – Cassandra
        – HBase
        – Hive
        – Impala
        – Hadoop
      RT with:
        – Storm
        – WSO2
      More:
        – Scheduled data-set
        – In-memory data set
    • SpagoBI - meaning
      Support for open standards:
        – RDF (Resource Description Framework) http://www.w3.org/RDF/
        – OWL (Web Ontology Language) http://www.w3.org/OWL/
        – R
        – Mahout
        – Text mining
      Connectors from:
        – Neo4J
        – Freebase
        – OrientDB
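
To make the RDF item above concrete: RDF describes data as subject-predicate-object triples. The sketch below models triples as plain Python tuples and matches them against a pattern. It is not SpagoBI's implementation and uses no RDF library; the `http://example.org/` namespace, the predicate names, and the `match` helper are hypothetical.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

EX = "http://example.org/"  # hypothetical namespace, for illustration only

# Illustrative triples; the predicates are invented for this sketch.
TRIPLES: List[Triple] = [
    (EX + "SpagoBI", EX + "type", EX + "BISuite"),
    (EX + "Talend", EX + "type", EX + "ETLTool"),
    (EX + "SpagoBI", EX + "readsFrom", EX + "Hive"),
    (EX + "Talend", EX + "writesTo", EX + "Hive"),
]


def match(triples: List[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> List[Triple]:
    """Return the triples that agree with every non-None field of the pattern."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


if __name__ == "__main__":
    # Which resources are connected to Hive, and through which predicate?
    for subject, predicate, _ in match(TRIPLES, o=EX + "Hive"):
        print(subject, predicate)
```

Leaving a pattern field as `None` acts as a wildcard, which is the basic idea behind triple-pattern queries over RDF data.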
    • SpagoBI - show
      Explorative front-end
        – Network analysis
        – Exploration
        – In-memory
        – Data visualization
    • SpagoBI - roadmap
      • Capture / Store
        – Talend, connector to/from:
          • Greenplum
          • Netezza
          • Sybase
          • Teradata
          • VectorWise
          • Vertica
          • HDFS
          • HBase
          • Hive
          • Cassandra
          • MongoDB
          • …
      • LOAD
        – Certified appliances:
          • Teradata
          • VectorWise
        – Connectors from:
          • Cassandra
          • HBase
          • Hive
          • Impala
          • Hadoop
          • MongoDB
        – RT with:
          • Storm
          • WSO2
        – More:
          • Scheduled data-set
          • In-memory data set
    • SpagoBI - roadmap
      • Meaning
        – Connectors from:
          • Neo4J
          • Freebase
          • OrientDB
        – Support for open standards:
          • RDF
          • OWL
        – Mining
          • R
          • MashR
          • Text mining
      • Show
        – Explorative front-end
        – Network analysis
        – Data visualization
      • Services
        – Big data as a service
          • Multitenant
          • Cloud
          • BI as a service (ad-hoc + self-service)
      Diagram label: Data scientist
    • Bundle Talend - SpagoBI
      The bundle will provide:
        – a distribution of both tools interacting with each other
        – a use-case that can be run to explore their functionalities
      SpagoBI and Talend announce their bundle!
    • @twittmonique
      Monica.franceschini@eng.it