Comparison CriteriaThis document provides various criteria to be considered while evaluatingETL tool such as Informatica, IBM DataStage, AbInitio, SAP BODI, PentahoKettel, Microsoft SSIS, Oracle ODI ..etc Comparison is based on following Parameters • Architecture • Metadata Support • Ease of Support • Transformations • Performance /Management • Data Quality & MDM • Support for Growth • Advance Data Transformation • 3rd Party Compatibility • License and Pricing • Vendor Information
ArchitectureCategory Criteria Scalable and Extensible Technology Client Platform Server Platforms Which DBMS are supported for extraction and loading Support for ERP Sources Architecture Support for complex event processing XML Support Web Services Pre built libraries to handle industry messaging formats like SWIFT, ISO15022 Real Time feature Real Time CDC Code Reusability capability within the product Parallelism Code Generator
Architecture (Conn..)Category Criteria Data Transformation Method (Engine Based ?) Building & Managing Aggregates Support for various data types Data Quality Check functionality or feature Debugging and logging features Architecture Exception Handling How Tool Provides information about exception Data Archival functionality Ease of integration with external rules engines like Pega Restarting an aborted ETL process Memory (Minimum/ Recommended) requirement at client machine Memory (Minimum/ Recommended) requirement at Server machine Repository Backup and Recovery Cloud Integration
Metadata and SetupCategory Criteria Metadata Capture Business View meta data Meta data security Web Integration support Metadata Versioning Support Metadata repositorys compliance to one of the industry meta data standards Meta data views using query toolsCategory Criteria Easy installation procedure Ability to generate Data mart schema similar to source Ease of setup database Support for designing data mart Importing data models from modeling tools
TransformationsCategory Criteria Filter Format conversion Lookup User Defined / Custom Transformations Scope for user defined fields Transformation Joins Support for external procedures Support for XML Support for BIG Data Integration Support for Hadoop
Management & DQCategory Criteria Scheduling feature Workflow Capability Defining calendar and using it for ad-hoc scheduling Performance monitoring of ETL process Management Performance Options Specifying the atomicity of the updates Security –Encryption Impact analysis in-built toolCategory Criteria Data Profiling Data Cleansing Data Quality and MDM MDM Integration with external DQ Tool
Growth & Advance TransformationCategory Criteria Ability to handle various source types from flat to files to major RDBMS Incremental upload Support for External loader Support for Growth Intermediate file generation during loading Event based loading Support for wide range of databases for storing (Target) information Familarity with the Tool Support for multi-user development environmentCategory Criteria Re-usability Advance Data Support for built in functions Transformation Handling duplicate records Lookup cache
3rd Party Integration & PricingCategory Criteria Compatibility with third Compatibility of ETL Tools with EAI tools like IBM MQ Series, party tools TIBCO, Vitria and webMethods as source/ target for the data.Category Criteria Consistency and re-use Global Meta dataCategory Criteria Server Licensing Licensing & Pricing Client Licensing Cost saving due to Re-use of Existing license Package Licensing
Vendor InfoCategory Criteria 2 consecutive years of profitability Significant third party partner support Global presence and support Number of Customers Vendor Info Company financial info readily available Company focus on ETL segment for the future Client Base Gartner, Forrester’s recommendations
About the AuthorAsis Mohanty has more than 12 Years of Industry experience on DataWarehousing and Business Intelligence field. He is a Certified BusinessIntelligence Professional from www.tdwi.org and Certified DataManagement Professional from www.dama.org . Asis has worked withFortune 100 & IT Service organizations (IBM, Target Corporation, Infosys &Wipro Technologies) in leadership role.Email Id: email@example.com
A particular slide catching your eye?
Clipping is a handy way to collect important slides you want to go back to later.