SlideShare a Scribd company logo
Leverage Open Source for Data Quality
Welcome to this Talend Webinar ,[object Object],All participants are muted You may ask questions using the Q&A panel, located on the right hand side of your screen, at the bottom of the GoToWebinar applet ,[object Object]
If time is too short to address all questions, you will receive an answer via emailIf you are having connection problems, please use also the Q&A panel
Agenda Talend Introduction Data Cleansing with Talend Data Cleansing Why data cleansing? The importance of data cleansing in data integration
Agenda Talend Introduction Data Cleansing with Talend Data Cleansing Why data cleansing? The importance of data cleansing in data integration
© Talend 2010 5 Corporate Overview Leading provider of open source data management software Venture-backed Worldwide operations and users London (Maidenhead) Business   Tech Support Nuremberg Business   Tech Support Tokyo Business   Tech Support Beijing R&D   Tech Support San Francisco (Los Altos)   Corporate Orange County (Irvine) Business   R&D   Tech Support New York (Tarrytown) Business   Tech Support Paris (Suresnes)   Corporate  Business   R&D   Tech Support Milan (Curno)   Business   Tech Support UtrechtBusiness 5 © Talend 2010
© Talend 2010 6 Business Highlights A high adoption rate ,[object Object]
450,000 users
1,500 customers1 download of Talend Open Studio per minute 100 new customers per month
© Talend 2010 7 Strong Industry Recognition Data Integration Magic Quadrant ,[object Object]
Technical & business criteria
Direct entry as Visionary
Only open source vendorAlso mentioned in Gartner’s Data Quality & Master Data Management research, and by other analyst firms (Forrester, IDC, Bloor Research, etc.)
© Talend 2010 8 Company History IntegrationSuite RTx MDMEnterprise Edition Open Studiov1.0 Products Open Profiler Integration Suite MPx MDMCommunity Edition Data Quality IntegrationSuite Talend Open Studio  Beta 1 2002 2006 2007 2008 2009 2010      2005 Operations Operations R&D Company creation Third round of financing: existing investors & Balderton Capital Launched US operations First round of financing: AGF Private Equity & Galileo Partners MDM acquisition Closed second round of financing Closed fourth round of financing © Talend 2010
© Talend 2010 9 Open Source History 2002 1970 1984 1998 2000 2003 2006 2009 Open source is created Free access to code Free  Software  Foundation (Richard Stallman) Launching GPL and GNU Open source  solutions emerge Apache… Software companies come out MySQL, JBoss, SugarCRM… More and more mature key players stand out JBoss, SugarCRM , Jaspersoft, Talend… Economic crisis favors enterprise-ready open source players Open Source  Initiative is created Implementing a policy meeting  economical and technical realities
© Talend 2010 10 Market Positioning MDM Data Quality Reference data management Data profiling & data cleansing Data Integration Analytics (ETL) Operational Integration Data replication & synchronization, data migration & capture, application upgrade, etc. Extract, Transform & Load for decision support systems
© Talend 2010 11 Solutions Positioning Talend MDM Enterprise Edition Deploy large scale MDM  - Full permissions management  - Validation rules  - Complex workflows Talend Open Profiler Identify data quality problems - Free, GPL, no limitations - Custom indicators Talend MDM Community Edition Manage master data ,[object Object]
 Active data model- Lightweight business user UI Talend Data Quality Cleanse & track - Specific components - Reports ,[object Object],Talend Unified Platform Common, unified environment  - Front end: UI (Eclipse, Web)  - Back end: repository Talend LCp Manage best practices ,[object Object]
 Repository Manager
 Project AuditTalend Open Studio Create data flows ,[object Object]
 Unlimited data flows- 450+ components included Talend Integration Suite Deploy data integration ,[object Object]

More Related Content

What's hot

Data Quality for Non-Data People
Data Quality for Non-Data PeopleData Quality for Non-Data People
Data Quality for Non-Data People
DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingEmerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big Thing
DATAVERSITY
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
DATAVERSITY
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
Tyler Wishnoff
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
James Serra
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Nathan Bijnens
 
Data Mesh
Data MeshData Mesh
Data platform architecture
Data platform architectureData platform architecture
Data platform architecture
Sudheer Kondla
 
Data quality architecture
Data quality architectureData quality architecture
Data quality architecture
anicewick
 
Reference master data management
Reference master data managementReference master data management
Reference master data management
Dr. Hamdan Al-Sabri
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
Kent Graziano
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation Criteria
ScyllaDB
 
Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality Strategies
DATAVERSITY
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020
Julien Le Dem
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceData Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and Governance
DATAVERSITY
 
Making Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse TechnologyMaking Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse Technology
Matei Zaharia
 

What's hot (20)

Data Quality for Non-Data People
Data Quality for Non-Data PeopleData Quality for Non-Data People
Data Quality for Non-Data People
 
Emerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingEmerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big Thing
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Mdm: why, when, how
Mdm: why, when, howMdm: why, when, how
Mdm: why, when, how
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
MDM and Reference Data
MDM and Reference DataMDM and Reference Data
MDM and Reference Data
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Data platform architecture
Data platform architectureData platform architecture
Data platform architecture
 
Data quality architecture
Data quality architectureData quality architecture
Data quality architecture
 
Reference master data management
Reference master data managementReference master data management
Reference master data management
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation Criteria
 
Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality Strategies
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceData Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and Governance
 
Making Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse TechnologyMaking Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse Technology
 

Viewers also liked

Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data Integration
Roberto Marchetto
 
Industrializing Data Integration
Industrializing Data IntegrationIndustrializing Data Integration
Industrializing Data Integration
Talend
 
Talend MDM
Talend MDMTalend MDM
Talend MDM
Talend
 
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Gabriele Baldassarre
 
Talend Data Quality - Customer Data Management platform
Talend Data Quality - Customer Data Management platformTalend Data Quality - Customer Data Management platform
Talend Data Quality - Customer Data Management platform
Максим Остархов
 
Talend for big_data_intorduction
Talend for big_data_intorductionTalend for big_data_intorduction
Talend for big_data_intorduction
Lakshman Dhullipalla
 
Talend勉強会 20150414
Talend勉強会 20150414Talend勉強会 20150414
Talend勉強会 20150414
kuroiwa
 
Talendビッグデータインテグレーション製品ご紹介
Talendビッグデータインテグレーション製品ご紹介Talendビッグデータインテグレーション製品ご紹介
Talendビッグデータインテグレーション製品ご紹介
Talend KK
 
Who is Talend?
Who is Talend?Who is Talend?
Who is Talend?
Talend
 
Unleashing the value of metadata with Talend
Unleashing the value of metadata with Talend Unleashing the value of metadata with Talend
Unleashing the value of metadata with Talend
Jean-Michel Franco
 
JobSchedulerアップデート2016
JobSchedulerアップデート2016JobSchedulerアップデート2016
JobSchedulerアップデート2016
OSSラボ株式会社
 
ビッグデータ関連Oss動向調査とニーズ分析
ビッグデータ関連Oss動向調査とニーズ分析ビッグデータ関連Oss動向調査とニーズ分析
ビッグデータ関連Oss動向調査とニーズ分析
Yukio Yoshida
 
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven EnterpriseMike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Talend
 
大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情
大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情
大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情
nagix
 
Data Governance Overview - Doreen Christian
Data Governance Overview - Doreen ChristianData Governance Overview - Doreen Christian
Data Governance Overview - Doreen ChristianDoreen Christian
 
Talend winter 2017 overview webinar
Talend winter 2017 overview webinarTalend winter 2017 overview webinar
Talend winter 2017 overview webinar
Jean-Michel Franco
 
Présentation de Talend Winter 2017
Présentation de Talend Winter 2017 Présentation de Talend Winter 2017
Présentation de Talend Winter 2017
Jean-Michel Franco
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big Data
IBM Analytics
 
Gouvernance et architecture des données de l’Entreprise Digitale
Gouvernance et architecture des données de l’Entreprise DigitaleGouvernance et architecture des données de l’Entreprise Digitale
Gouvernance et architecture des données de l’Entreprise Digitale
Antoine Vigneron
 
An Introduction to Talend Integration Cloud
An Introduction to Talend Integration CloudAn Introduction to Talend Integration Cloud
An Introduction to Talend Integration Cloud
Talend
 

Viewers also liked (20)

Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data Integration
 
Industrializing Data Integration
Industrializing Data IntegrationIndustrializing Data Integration
Industrializing Data Integration
 
Talend MDM
Talend MDMTalend MDM
Talend MDM
 
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
 
Talend Data Quality - Customer Data Management platform
Talend Data Quality - Customer Data Management platformTalend Data Quality - Customer Data Management platform
Talend Data Quality - Customer Data Management platform
 
Talend for big_data_intorduction
Talend for big_data_intorductionTalend for big_data_intorduction
Talend for big_data_intorduction
 
Talend勉強会 20150414
Talend勉強会 20150414Talend勉強会 20150414
Talend勉強会 20150414
 
Talendビッグデータインテグレーション製品ご紹介
Talendビッグデータインテグレーション製品ご紹介Talendビッグデータインテグレーション製品ご紹介
Talendビッグデータインテグレーション製品ご紹介
 
Who is Talend?
Who is Talend?Who is Talend?
Who is Talend?
 
Unleashing the value of metadata with Talend
Unleashing the value of metadata with Talend Unleashing the value of metadata with Talend
Unleashing the value of metadata with Talend
 
JobSchedulerアップデート2016
JobSchedulerアップデート2016JobSchedulerアップデート2016
JobSchedulerアップデート2016
 
ビッグデータ関連Oss動向調査とニーズ分析
ビッグデータ関連Oss動向調査とニーズ分析ビッグデータ関連Oss動向調査とニーズ分析
ビッグデータ関連Oss動向調査とニーズ分析
 
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven EnterpriseMike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
 
大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情
大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情
大規模データ分析を支えるインフラ系オープンソースソフトウェアの最新事情
 
Data Governance Overview - Doreen Christian
Data Governance Overview - Doreen ChristianData Governance Overview - Doreen Christian
Data Governance Overview - Doreen Christian
 
Talend winter 2017 overview webinar
Talend winter 2017 overview webinarTalend winter 2017 overview webinar
Talend winter 2017 overview webinar
 
Présentation de Talend Winter 2017
Présentation de Talend Winter 2017 Présentation de Talend Winter 2017
Présentation de Talend Winter 2017
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big Data
 
Gouvernance et architecture des données de l’Entreprise Digitale
Gouvernance et architecture des données de l’Entreprise DigitaleGouvernance et architecture des données de l’Entreprise Digitale
Gouvernance et architecture des données de l’Entreprise Digitale
 
An Introduction to Talend Integration Cloud
An Introduction to Talend Integration CloudAn Introduction to Talend Integration Cloud
An Introduction to Talend Integration Cloud
 

Similar to Talend Data Quality

Sugar con 2010
Sugar con 2010Sugar con 2010
Sugar con 2010
pparvizi
 
Curing dataheadachesv2 with sugarcrm levementum and talend
Curing dataheadachesv2 with sugarcrm levementum and talendCuring dataheadachesv2 with sugarcrm levementum and talend
Curing dataheadachesv2 with sugarcrm levementum and talendGeoffrey Mobisson
 
Q1 2015 Investor PresentationQ1 2015 Investor Presentation
Q1 2015 Investor PresentationQ1 2015 Investor PresentationQ1 2015 Investor PresentationQ1 2015 Investor Presentation
Q1 2015 Investor PresentationQ1 2015 Investor Presentation
teradata2014
 
Turn Five Communications Industry Challenges into Real Competitive Opportunities
Turn Five Communications Industry Challenges into Real Competitive OpportunitiesTurn Five Communications Industry Challenges into Real Competitive Opportunities
Turn Five Communications Industry Challenges into Real Competitive OpportunitiesPerficient, Inc.
 
MDM for product data with Talend
MDM for product data with Talend MDM for product data with Talend
MDM for product data with Talend
Jean-Michel Franco
 
Réinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoRéinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de Denodo
Denodo
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Rittman Analytics
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
 
The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.
Richard Vermillion
 
M_Amjad_Khan_resume
M_Amjad_Khan_resumeM_Amjad_Khan_resume
M_Amjad_Khan_resumeAmjad Khan
 
8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare
Julianna DeLua
 
Talend Data Preparation Overview
Talend Data Preparation OverviewTalend Data Preparation Overview
Talend Data Preparation Overview
Jean-Michel Franco
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA
 
Key Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo PlatformKey Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo Platform
Denodo
 
Data Quality Everywhere
Data Quality EverywhereData Quality Everywhere
Data Quality Everywhere
Jean-Michel Franco
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about Data
BigDataExpo
 
Data Design - the x factor for a successful data migration v1.3
Data Design - the x factor for a successful data migration v1.3Data Design - the x factor for a successful data migration v1.3
Data Design - the x factor for a successful data migration v1.3
Richard Neale
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Denodo
 
Semantix Data Platform - 2022.pdf
Semantix Data Platform - 2022.pdfSemantix Data Platform - 2022.pdf
Semantix Data Platform - 2022.pdf
Lucas Panchorra
 
IdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs CloudIdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs Cloud
cbiddle2
 

Similar to Talend Data Quality (20)

Sugar con 2010
Sugar con 2010Sugar con 2010
Sugar con 2010
 
Curing dataheadachesv2 with sugarcrm levementum and talend
Curing dataheadachesv2 with sugarcrm levementum and talendCuring dataheadachesv2 with sugarcrm levementum and talend
Curing dataheadachesv2 with sugarcrm levementum and talend
 
Q1 2015 Investor PresentationQ1 2015 Investor Presentation
Q1 2015 Investor PresentationQ1 2015 Investor PresentationQ1 2015 Investor PresentationQ1 2015 Investor Presentation
Q1 2015 Investor PresentationQ1 2015 Investor Presentation
 
Turn Five Communications Industry Challenges into Real Competitive Opportunities
Turn Five Communications Industry Challenges into Real Competitive OpportunitiesTurn Five Communications Industry Challenges into Real Competitive Opportunities
Turn Five Communications Industry Challenges into Real Competitive Opportunities
 
MDM for product data with Talend
MDM for product data with Talend MDM for product data with Talend
MDM for product data with Talend
 
Réinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoRéinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de Denodo
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.
 
M_Amjad_Khan_resume
M_Amjad_Khan_resumeM_Amjad_Khan_resume
M_Amjad_Khan_resume
 
8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare
 
Talend Data Preparation Overview
Talend Data Preparation OverviewTalend Data Preparation Overview
Talend Data Preparation Overview
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Key Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo PlatformKey Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo Platform
 
Data Quality Everywhere
Data Quality EverywhereData Quality Everywhere
Data Quality Everywhere
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about Data
 
Data Design - the x factor for a successful data migration v1.3
Data Design - the x factor for a successful data migration v1.3Data Design - the x factor for a successful data migration v1.3
Data Design - the x factor for a successful data migration v1.3
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Semantix Data Platform - 2022.pdf
Semantix Data Platform - 2022.pdfSemantix Data Platform - 2022.pdf
Semantix Data Platform - 2022.pdf
 
IdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs CloudIdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs Cloud
 

Recently uploaded

Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
Abida Shariff
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 

Recently uploaded (20)

Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 

Talend Data Quality

  • 1. Leverage Open Source for Data Quality
  • 2.
  • 3. If time is too short to address all questions, you will receive an answer via emailIf you are having connection problems, please use also the Q&A panel
  • 4. Agenda Talend Introduction Data Cleansing with Talend Data Cleansing Why data cleansing? The importance of data cleansing in data integration
  • 5. Agenda Talend Introduction Data Cleansing with Talend Data Cleansing Why data cleansing? The importance of data cleansing in data integration
  • 6. © Talend 2010 5 Corporate Overview Leading provider of open source data management software Venture-backed Worldwide operations and users London (Maidenhead) Business Tech Support Nuremberg Business Tech Support Tokyo Business Tech Support Beijing R&D Tech Support San Francisco (Los Altos) Corporate Orange County (Irvine) Business R&D Tech Support New York (Tarrytown) Business Tech Support Paris (Suresnes) Corporate Business R&D Tech Support Milan (Curno) Business Tech Support UtrechtBusiness 5 © Talend 2010
  • 7.
  • 9. 1,500 customers1 download of Talend Open Studio per minute 100 new customers per month
  • 10.
  • 12. Direct entry as Visionary
  • 13. Only open source vendorAlso mentioned in Gartner’s Data Quality & Master Data Management research, and by other analyst firms (Forrester, IDC, Bloor Research, etc.)
  • 14. © Talend 2010 8 Company History IntegrationSuite RTx MDMEnterprise Edition Open Studiov1.0 Products Open Profiler Integration Suite MPx MDMCommunity Edition Data Quality IntegrationSuite Talend Open Studio Beta 1 2002 2006 2007 2008 2009 2010 2005 Operations Operations R&D Company creation Third round of financing: existing investors & Balderton Capital Launched US operations First round of financing: AGF Private Equity & Galileo Partners MDM acquisition Closed second round of financing Closed fourth round of financing © Talend 2010
  • 15. © Talend 2010 9 Open Source History 2002 1970 1984 1998 2000 2003 2006 2009 Open source is created Free access to code Free Software Foundation (Richard Stallman) Launching GPL and GNU Open source solutions emerge Apache… Software companies come out MySQL, JBoss, SugarCRM… More and more mature key players stand out JBoss, SugarCRM , Jaspersoft, Talend… Economic crisis favors enterprise-ready open source players Open Source Initiative is created Implementing a policy meeting economical and technical realities
  • 16. © Talend 2010 10 Market Positioning MDM Data Quality Reference data management Data profiling & data cleansing Data Integration Analytics (ETL) Operational Integration Data replication & synchronization, data migration & capture, application upgrade, etc. Extract, Transform & Load for decision support systems
  • 17.
  • 18.
  • 20.
  • 21.
  • 22.
  • 23. Agenda Talend Introduction Data Cleansing with Talend Data Cleansing Why data cleansing? The importance of data cleansing in data integration
  • 24.
  • 25. Data lacks granularity to fulfill the requirements of the applications
  • 26.
  • 27.
  • 28.
  • 29.
  • 31.
  • 33. Identify potential problems before beginning data projects
  • 34. Reduce time and resources needed to find problematic data
  • 35. Allow business analysts to have more control of maintenance and data managementData Quality
  • 36.
  • 37.
  • 38. Eliminate or recycle erroneous records
  • 41. Name, address, & phone validation databases
  • 42. Synonym tables and lookup data
  • 44. Types of processing: Filtering / Correction / EnrichmentData Quality
  • 45.
  • 47. Measure and track level of quality
  • 48. Preserve historical records for measuring improvement or degradation
  • 53. Complete data quality lifecycle managementNatively integrated with Data Integration Implement a "Data Quality Firewall" in data integration processes Data Quality
  • 54. Agenda Talend Introduction Data Cleansing with Talend Data Cleansing Why data cleansing? The importance of data cleansing in data integration
  • 55.
  • 56.
  • 57. Watch anotherWebinar on Demand : http://nxy.in/hkidj
  • 58. Watch our Live Webinars : http://nxy.in/pjeph© Talend 2010 21