Openbravo in the Ecosystem: Integrate it into the Information System - Talend - Sandra Masse - Presentation Transcript
Openbravo World Conference
Open Source Data Integration for Openbravo
Barcelona, April 17-19 2009
Speaker: Sandra Massé, Channel Manager EMEA, Talend
Why Data Integration
• ERP systems need to interoperate with the rest of the information system
– Data volumes in exponential growth
– Diversity, heterogeneity and quality of data sources (internal & external)
– Upgrades by successive additions of layers
• Import/export capabilities in Openbravo
– No advanced, enterprise-capable data integration features
• Typical data integration requirements:
– Migration of data to Openbravo
– Extraction of Openbravo data for reporting, business intelligence, etc.
– Synchronization of Openbravo with other applications (ERP, shipping, accounting, invoicing, etc.)
Page 2
Data Integration Spectrum
[Diagram labels: Data integration (center); Data Quality; Data profiling & cleansing; Analytics (ETL); Loading of decision support systems; Operational Integration; Data replication & synchronization; Application upgrade; Data migration & capture]
Page 3
Corporate Overview
• Leading provider of open source data integration software
– Making data integration solutions available to organizations of all sizes and for all integration needs
• Venture-backed
• Worldwide operations and users
Page 4
Business Highlights
• A high adoption rate
– 900,000 core product downloads
– 4 million lifetime downloads
– 20% register as users
– 30% get automatic product upgrades
• 1 download of Talend Open Studio per minute
• 50 new customers per month
Page 5
Solution
• Easy to download, evaluate, use, and maintain
• Low initial cost of ownership
• Scalable, flexible pricing and technology
• Powerful functionality and superior performance
• Accelerated development of new functionality through user community
• Robust open source data integration solution built on the latest technologies
Page 6
Migration of Data to Openbravo
• Migrate data from:
– Legacy ERP/CRM
– Reference data
– Files
– Emails
– Etc.
[Diagram labels: Legacy CRM, Reference Data, External Data, Files, Emails]
Page 7
Challenges of Data Migration
• High volumes of data
– Processing of large amounts of individual data
• Heterogeneity of the environment
– Often involves legacy systems
– Packaged applications, RDBMS, mainframes, files, etc.
– Can involve external data sources (reference data)
• Differences in data structures
– Map the sources to the target
– Aggregations, calculations, etc.
• Maintain the consistency of old and new systems
– May involve complex bi-directional synchronization
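The "map the sources to the target" and "aggregations, calculations" points above can be sketched in a few lines. This is a minimal illustration only, not Talend-generated code and not the Openbravo schema: the legacy field names (CUST_NO, CUST_NAME, AMT) and the target field names (partner_key, name, total_ordered) are hypothetical.

```python
from collections import defaultdict

# Hypothetical legacy rows; the field names are illustrative assumptions.
LEGACY_ORDERS = [
    {"CUST_NO": "C001", "CUST_NAME": " acme corp ", "AMT": "120.50"},
    {"CUST_NO": "C001", "CUST_NAME": " acme corp ", "AMT": "79.50"},
    {"CUST_NO": "C002", "CUST_NAME": "Globex", "AMT": "10.00"},
]


def map_customer(row):
    """Map one legacy row to the (assumed) target structure."""
    return {
        "partner_key": row["CUST_NO"],
        # Cleanse the name: trim whitespace and normalize capitalization.
        "name": row["CUST_NAME"].strip().title(),
    }


def migrate(rows):
    """Return one target record per customer with an aggregated order total."""
    partners = {}
    totals = defaultdict(float)
    for row in rows:
        target = map_customer(row)
        partners[target["partner_key"]] = target
        # Calculation step: convert the text amount and sum it per customer.
        totals[target["partner_key"]] += float(row["AMT"])
    return [
        {**partner, "total_ordered": round(totals[key], 2)}
        for key, partner in sorted(partners.items())
    ]
```

Calling migrate(LEGACY_ORDERS) yields one record per customer. A real migration would also have to deal with the volume and consistency issues listed above, which this sketch ignores.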
Page 8
Extraction of Openbravo Data (ETL)
• Load Openbravo Data to:
– Data Warehouse
– Data Marts
– OLAP Applications / Cubes
• Along with all the other data
[Diagram labels: ERP/CRM, Finance, Production, External Data, Files, Data Warehouse, Data Mart (twice), OLAP]
Page 9
Challenges of ETL
• High volumes of data
– Processing of large amounts of granular data (phone calls, banking transactions…)
– Incremental load vs. complete reload
• Disparity of sources
– Numerous production systems
– Packaged applications, RDBMS, mainframes, files, etc.
– Internal and external data sources (reference data)
• Multiplicity of targets
– Data warehouse, data marts, OLAP, etc.
• Complexity of transformations
– Aggregations, calculations, statistical functions, etc.
– Slowly Changing Dimensions
• Trend toward real-time
– Refresh more often
– Ever-shorter load windows
Page 10
Synchronization of Openbravo Data
• Synchronize data between Openbravo and other systems
– Real-time or batch
– Operational data (e.g. orders, inventory…)
– Reference/Master data (e.g. product prices, territories…)
[Diagram labels: ERP, Inventory, Database]
Page 11
Challenges of Synchronization
• Restrict volumes of processed data
– Requires Change Data Capture (CDC)
• Heterogeneity of the environment
– Often involves legacy systems
– Packaged applications, RDBMS, mainframes, files, etc.
• Differences in data structures
– Map the sources and the targets
– Aggregations, calculations, etc.
• Conflict management and resolution
– Record update precedence or “record owner”
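Two of the points above, restricting volumes through change capture and resolving conflicts by update precedence or record owner, can be sketched as follows. This is an illustration under stated assumptions: change capture is done here by comparing two snapshots, a simple substitute for the log-based or trigger-based capture that production tools typically use, and the record fields (system, updated_at) are hypothetical.

```python
def capture_changes(previous, current):
    """Compare two snapshots keyed by record id and return only the differences."""
    changes = []
    for key, row in current.items():
        if key not in previous:
            changes.append(("insert", key, row))
        elif row != previous[key]:
            changes.append(("update", key, row))
    for key in previous:
        if key not in current:
            changes.append(("delete", key, None))
    return changes


def resolve_conflict(local, remote, owner=None):
    """Pick the winning version of a record that changed on both sides.

    If an owning system is declared and matches one side, that side wins.
    Otherwise the most recent update wins (ISO-8601 text timestamps assumed).
    """
    if owner is not None:
        if local["system"] == owner:
            return local
        if remote["system"] == owner:
            return remote
    return local if local["updated_at"] >= remote["updated_at"] else remote
```

Only the tuples returned by capture_changes need to be sent to the other system, which keeps the processed volume proportional to the number of changes rather than to the table size.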
Page 12
Talend’s Integration Solutions
• Fully integrated with Openbravo
• Connects to 400 different sources and targets
– Covers all systems Openbravo may need to interact with
• Supports all data integration needs
– Migration
– ETL for Business Intelligence
– Synchronization
• Supports all latencies and volumes
• Advanced transformations and mappings
• Built-in data quality
Page 13
Open Source Changes the Game
• Remove barriers to entry
• Sharply reduce upfront costs
• Leverage more available resources
• Remove obstacles to deployment
• Leverage more data
Page 14
Remove Barriers to Entry
• Solution available for free, unrestricted download
– Trial/proof-of-concept performed by the user
– No need to allocate budget (use the free license)
– Self-service community assistance (forum, bugtracker, wiki, etc.)
• Buy only what you need, when you need it
– No vendor lock-in
– No pressure from vendor
– No per-CPU, per-source, or per-target pricing
• When developing: buy Training, Support, Expertise, Enterprise features
• When in production: buy Quality Assurance, Support, etc.
– Only when actually needed
Page 15
Sharply Reduce Upfront Costs
• No perpetual license costs
– Core features available under the GPL
– Open Source subscription for Enterprise features
– Services subscription for value added services
• Significantly lower costs than proprietary solutions
– Ratio: from 1-to-5 to 1-to-20 or more
Page 16
Leverage More Available Resources
• Standards-based solution
– Code generator (no black box engine)
– Standard and easy-to-learn user interface (Eclipse)
– Standards-based connectivity
• Distributed execution
– Grid of commodity hardware
– Leverage under-utilized systems
– Leverage standard execution engines (Java, Perl, RDBMS)
• Use existing resources
– Human – no lengthy training required
– Servers and systems – limit capital expenditures
Page 17
Remove Obstacles to Deployment
• Get rid of:
– Need for dedicated hardware
– Oversized servers
– Per CPU costs
– Per source cost
– Per target cost
– Vendor pressure
• Deploy freely, without restrictions
• Expand the use of the solution as needed
• Start new projects without needing specific funding
Page 18
Leverage More Data
• Data integration is about interoperability
– Versatility and scalability
– Access all data sources, even the most exotic ones
– Leverage all the data
• Supports any data source:
– RDBMS – proprietary and open source
– Packaged applications
– Files
– Web sources
– <insert_your_data_source_here>
• Must include native support: optimized, dedicated protocols
• Can be extensible
– Without vendor dependency
Page 19