SlideShare a Scribd company logo
1 of 18
Download to read offline
Orchestrate data with agility and
responsiveness.
Learn how to manage a common data
integration project
by SKENDER KOLLCAKU
Milan, 07/2017
keywords:
iPaaS, data integration, Talend, Salesforce, data-driven, use case, migration, cloud computing, SaaS, CRM, database,
real-time, open-source, java, professional services, on-premise, mainframe, data quality, hybrid, repository, metadata,
reusable job, data validation, bi-directional sync, design pattern, agile, business, ETL, project management, customer,
The scenario: Manage a typical data
integration project
Consider the following business requirements:
 Manage successfully and keep on track the project considering budget, cost,
time and stakeholders’concerns.
 (1) Provide a customers data migration from a mainframe to a Cloud SaaS
CRM (Salesforce: https://www.salesforce.com) respecting Address
format/values according to some business requirements
 (2) Set up a bi-directional integration between two systems
 (3) Identify what added values the integration and data-driven culture make
available
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
2
Agenda
 Data availability, iPaaS and why data-driven culture is the new norm for
organizations
 Data asset requires Governance, but also Agility and Responsiveness
 Define a roadmap to manage and close successfully the project (business
case)
 How to identify business-related data and valuable Customers records
 Talend (https://www.talend.com/) as the unified leader platform for the
solution
 Data validation and initial load (migration as the pattern design)
 Bi-directional synchronization to automate jobs in real-time
 Added values and future implementations
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
3
The importance of data availability
Data is one of the most important assets an organization
has because it defines each organization’s uniqueness.
Being a data-driven organization is not the final objective,
but it represents a crucial process
in the innovation challenge.
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
4
Data requires Governance, but also Agility and
Responsiveness
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
5
Collaborate
in an open
manner
BE AGILE AND
ADAPT TO
CHANGES
Agility
Start with
business-
related data
FAST TIME TO
MARKET
Share process
to engage
Inspire
through
Talend
SHARE,
DEMOCRATIZE
AND INSPIRE FOR
THE FUTURE
Short and
fast deliveries
3-steps Project plan starts comunicating with
the stakeholders
(1) Comunicate
with decision-
making players
(2) Identify
candidate data for
business-related
value
(3) Model and
implement design
pattern for the
specific process
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
6
Talend is the leading open source integration
software provider to data-driven enterprises
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
7
Open-Source leader
Eclipse-similar IDE
unified platform
Java-based code
generator
Visual job design
Graphical business
process modeller
(100% graphical)
Smart product
subscription
Big Data native in
real-time
Reusable metadata
elements
+1000 built-in
drag & drop
connectors and
components
Determine customers containing business-
related value
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
8
Prospects/Leads (potential Customers)
Filter by fastest closed deals
Particular industry (life science, manufacturing or finance...)
Recent closed deals (filter by time range)
Largest revenue generated streams
Interested geographical area
(1) Project phase: initial load prior ETL
operations
Once available the input flat files from the mainframe, the ETL (Extract, Transform and Load)
operations to be executed could be the following:
 Cleanse
 Validate
 Format
 Unify
 Standardize
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
9
pull data from MF
cleanse, validate,
format
unify or standardize
provision DB schema
compatibility
upload into SaaS
CRM
Data quality includes data validation
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
10
DATAVALIDATION
NULL HANDLING
STRING HANDLING
DATE HANDLING
THIRD-PARTY
VALIDATION LIBRARIES
Talend Data Preparation self-service free tool
Business process model definition before we
start implementing the job
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
11
Use of Talend DI canvas to model the business process. Flow of data will satisfy the
following business requirement: only matched/validated Customers address records will
be loaded into the SaaS CRM.
Use Talend to set up the data migration between
On-Premise input files and target SaaS CRM object
(Account in Salesforce)
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
12
prior to Address
validation
Simplified job which uses tMap “magical”
component to validate Address
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
13
Simplified job which uses tMap component to validate Customer address.
The output are (1) loaded into Salesforce Account object as records and (2) rejected
Customers with invalid addresses in an Excel spreadsheet for future analysis
(2) Project phase: bi-directional
synchronization between mainframe and SaaS
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
14
Talend built-in component tSalesforceGetUpdated_1 used for tracking changes (update,
insert, upsert) in the Salesforce Account object and propagate them in real-time to a DB2
mainframe’s table. This component can work in background given a past Start and End
time range.
Another mechanism is the CDC (Change Data Capture).
Bi-directional integration means real-time
synchronization between the two databases
There are some key issues to consider:
 How similar are the schemas of the databases to be kept in sync (this helps for
eventual JOIN operations)?
 How often do the databases need to be synched (performance query…)?
 How will we resolve situations in which the same data has been modified in both
of databases since the last sync session (conflict based on the “record owner” or
“last modified” solution to be described)?
 How much effort and/or money are we willing to invest in developing our sync
system (“keep project budget on track”)?
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
15
(3) Added values: technical perspective
 External lookup with any other data sources (supply chain, e-commerce, BI
(analysis of ROIs, deals/opportunities), DW, Marketing, social networks
activity/engagement, distributed and cross-platform applications… )
 Reusable jobs, thanks to repository metadata
 Versioning of the Java generated code (Github, Maven…)
 Statistical reports about job execution (performance)
 Other applications can trigger the job (example: collecting data for reports
and dashboards…)
 Unified and scalable integration platform (Data Preparation, DI, Cloud
integration, ESB, MDM, Big Data, Fabric…)
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
16
(3) Added values: business perspective
 Give real value to the data asset (“enable data-driven organizations”)
 Support for decisions (“how to use the information obtained?”) and provide
them in advance (apply automatically and review rules regularly)
 Remove data management risk when modernizing systems
 Consolidate applications
 Smooth subscription model (start with free open-source tool and then
upgrade in a predictable fashion depending on business needs – pay only for
the number of developers…)
 Optimize processes by keeping comprehensive, relevant and consistent data
everywhere.
 Deliveries in real-time and analytics prediction!
 Big Data native suite of products
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
17
Thank you!
"Orchestrate data with agility and
responsiveness" - by Skender Kollcaku
18

More Related Content

What's hot

Enterprise 360 - Graphs at the Center of a Data Fabric
Enterprise 360 - Graphs at the Center of a Data FabricEnterprise 360 - Graphs at the Center of a Data Fabric
Enterprise 360 - Graphs at the Center of a Data FabricPrecisely
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & InsightsLoQutus
 
5 Pillars of API Management
5 Pillars of API Management5 Pillars of API Management
5 Pillars of API ManagementRich Graham
 
LoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BILoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BILoQutus
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Dataconomy Media
 
CI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. HuntCI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. HuntDatabricks
 
KidsLife & LoQutus: A real-life case study of digital enablement
KidsLife & LoQutus: A real-life case study of digital enablementKidsLife & LoQutus: A real-life case study of digital enablement
KidsLife & LoQutus: A real-life case study of digital enablementLoQutus
 
Qlik web connectors
Qlik web connectorsQlik web connectors
Qlik web connectorsSand
 
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...Databricks
 
Power BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics ResearchPower BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics ResearchLuciano Vilas Boas
 
Graphically understand and interactively explore your Data Lineage
Graphically understand and interactively explore your Data LineageGraphically understand and interactively explore your Data Lineage
Graphically understand and interactively explore your Data LineageMohammad Ahmed
 
Modernizing the Finance Function with Qlik
Modernizing the Finance Function with QlikModernizing the Finance Function with Qlik
Modernizing the Finance Function with QlikPaul Van Siclen
 
Afternoons with Azure - Power BI and Azure Analysis Services
Afternoons with Azure - Power BI and Azure Analysis ServicesAfternoons with Azure - Power BI and Azure Analysis Services
Afternoons with Azure - Power BI and Azure Analysis ServicesCCG
 
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMAUnlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMAPrecisely
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAmazon Web Services
 
Visualr - Client Presentation
Visualr - Client PresentationVisualr - Client Presentation
Visualr - Client PresentationMayank Chaubey
 
MapInfo Pro v2021 - Next Generation Location Analytics Made Easy
MapInfo Pro v2021 - Next Generation Location Analytics Made EasyMapInfo Pro v2021 - Next Generation Location Analytics Made Easy
MapInfo Pro v2021 - Next Generation Location Analytics Made EasyPrecisely
 
Cepta The Future of Data with Power BI
Cepta The Future of Data with Power BICepta The Future of Data with Power BI
Cepta The Future of Data with Power BIKellyn Pot'Vin-Gorman
 
StreamCentral Information System Overview
StreamCentral Information System OverviewStreamCentral Information System Overview
StreamCentral Information System OverviewRaheel Retiwalla
 

What's hot (20)

Enterprise 360 - Graphs at the Center of a Data Fabric
Enterprise 360 - Graphs at the Center of a Data FabricEnterprise 360 - Graphs at the Center of a Data Fabric
Enterprise 360 - Graphs at the Center of a Data Fabric
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & Insights
 
5 Pillars of API Management
5 Pillars of API Management5 Pillars of API Management
5 Pillars of API Management
 
LoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BILoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BI
 
AI is a Team Sport
AI is a Team SportAI is a Team Sport
AI is a Team Sport
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
CI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. HuntCI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. Hunt
 
KidsLife & LoQutus: A real-life case study of digital enablement
KidsLife & LoQutus: A real-life case study of digital enablementKidsLife & LoQutus: A real-life case study of digital enablement
KidsLife & LoQutus: A real-life case study of digital enablement
 
Qlik web connectors
Qlik web connectorsQlik web connectors
Qlik web connectors
 
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
 
Power BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics ResearchPower BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics Research
 
Graphically understand and interactively explore your Data Lineage
Graphically understand and interactively explore your Data LineageGraphically understand and interactively explore your Data Lineage
Graphically understand and interactively explore your Data Lineage
 
Modernizing the Finance Function with Qlik
Modernizing the Finance Function with QlikModernizing the Finance Function with Qlik
Modernizing the Finance Function with Qlik
 
Afternoons with Azure - Power BI and Azure Analysis Services
Afternoons with Azure - Power BI and Azure Analysis ServicesAfternoons with Azure - Power BI and Azure Analysis Services
Afternoons with Azure - Power BI and Azure Analysis Services
 
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMAUnlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
 
Visualr - Client Presentation
Visualr - Client PresentationVisualr - Client Presentation
Visualr - Client Presentation
 
MapInfo Pro v2021 - Next Generation Location Analytics Made Easy
MapInfo Pro v2021 - Next Generation Location Analytics Made EasyMapInfo Pro v2021 - Next Generation Location Analytics Made Easy
MapInfo Pro v2021 - Next Generation Location Analytics Made Easy
 
Cepta The Future of Data with Power BI
Cepta The Future of Data with Power BICepta The Future of Data with Power BI
Cepta The Future of Data with Power BI
 
StreamCentral Information System Overview
StreamCentral Information System OverviewStreamCentral Information System Overview
StreamCentral Information System Overview
 

Similar to Orchestrate data with agility and responsiveness. Learn how to manage a common data integration project

Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionDenodo
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...DATAVERSITY
 
Accenture-Cloud-Data-Migration-POV-Final.pdf
Accenture-Cloud-Data-Migration-POV-Final.pdfAccenture-Cloud-Data-Migration-POV-Final.pdf
Accenture-Cloud-Data-Migration-POV-Final.pdfRajvir Kaushal
 
Informatica 5+years of experince
Informatica 5+years of experinceInformatica 5+years of experince
Informatica 5+years of experinceDharma Rao
 
Informatica_5+years of experince
Informatica_5+years of experinceInformatica_5+years of experince
Informatica_5+years of experinceDharma Rao
 
Informatica 5+years of experince
Informatica 5+years of experinceInformatica 5+years of experince
Informatica 5+years of experinceDharma Rao
 
Integrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast Iron
Integrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast IronIntegrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast Iron
Integrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast IronProlifics
 
Integrating SFDC and Oracle ERP with IBM Websphere CastIron Appliance
Integrating SFDC and Oracle ERP with IBM Websphere CastIron ApplianceIntegrating SFDC and Oracle ERP with IBM Websphere CastIron Appliance
Integrating SFDC and Oracle ERP with IBM Websphere CastIron ApplianceSandeep Chellingi
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Denodo
 
Data virtualization an introduction
Data virtualization an introductionData virtualization an introduction
Data virtualization an introductionDenodo
 
Ashish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_AnalystAshish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_AnalystAshish Maheshwari
 
Dynamics 365 saturday 2018 - data migration story
Dynamics 365 saturday   2018 - data migration storyDynamics 365 saturday   2018 - data migration story
Dynamics 365 saturday 2018 - data migration storyAndre Margono
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationDenodo
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesJames Serra
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Confluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointConfluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointconfluent
 
Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Denodo
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchSheetal Pratik
 
Data and Application Modernization in the Age of the Cloud
Data and Application Modernization in the Age of the CloudData and Application Modernization in the Age of the Cloud
Data and Application Modernization in the Age of the Cloudredmondpulver
 

Similar to Orchestrate data with agility and responsiveness. Learn how to manage a common data integration project (20)

Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
Accenture-Cloud-Data-Migration-POV-Final.pdf
Accenture-Cloud-Data-Migration-POV-Final.pdfAccenture-Cloud-Data-Migration-POV-Final.pdf
Accenture-Cloud-Data-Migration-POV-Final.pdf
 
Informatica 5+years of experince
Informatica 5+years of experinceInformatica 5+years of experince
Informatica 5+years of experince
 
Informatica_5+years of experince
Informatica_5+years of experinceInformatica_5+years of experince
Informatica_5+years of experince
 
Informatica 5+years of experince
Informatica 5+years of experinceInformatica 5+years of experince
Informatica 5+years of experince
 
Integrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast Iron
Integrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast IronIntegrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast Iron
Integrating Salesforce.com and Oracle ERP Using IBM WebSphere Cast Iron
 
Integrating SFDC and Oracle ERP with IBM Websphere CastIron Appliance
Integrating SFDC and Oracle ERP with IBM Websphere CastIron ApplianceIntegrating SFDC and Oracle ERP with IBM Websphere CastIron Appliance
Integrating SFDC and Oracle ERP with IBM Websphere CastIron Appliance
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Data virtualization an introduction
Data virtualization an introductionData virtualization an introduction
Data virtualization an introduction
 
Ashish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_AnalystAshish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_Analyst
 
Dynamics 365 saturday 2018 - data migration story
Dynamics 365 saturday   2018 - data migration storyDynamics 365 saturday   2018 - data migration story
Dynamics 365 saturday 2018 - data migration story
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use Cases
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Confluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointConfluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPoint
 
Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbench
 
Data and Application Modernization in the Age of the Cloud
Data and Application Modernization in the Age of the CloudData and Application Modernization in the Age of the Cloud
Data and Application Modernization in the Age of the Cloud
 

Recently uploaded

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 

Recently uploaded (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 

Orchestrate data with agility and responsiveness. Learn how to manage a common data integration project

  • 1. Orchestrate data with agility and responsiveness. Learn how to manage a common data integration project by SKENDER KOLLCAKU Milan, 07/2017 keywords: iPaaS, data integration, Talend, Salesforce, data-driven, use case, migration, cloud computing, SaaS, CRM, database, real-time, open-source, java, professional services, on-premise, mainframe, data quality, hybrid, repository, metadata, reusable job, data validation, bi-directional sync, design pattern, agile, business, ETL, project management, customer,
  • 2. The scenario: Manage a typical data integration project Consider the following business requirements:  Manage successfully and keep on track the project considering budget, cost, time and stakeholders’concerns.  (1) Provide a customers data migration from a mainframe to a Cloud SaaS CRM (Salesforce: https://www.salesforce.com) respecting Address format/values according to some business requirements  (2) Set up a bi-directional integration between two systems  (3) Identify what added values the integration and data-driven culture make available "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 2
  • 3. Agenda  Data availability, iPaaS and why data-driven culture is the new norm for organizations  Data asset requires Governance, but also Agility and Responsiveness  Define a roadmap to manage and close successfully the project (business case)  How to identify business-related data and valuable Customers records  Talend (https://www.talend.com/) as the unified leader platform for the solution  Data validation and initial load (migration as the pattern design)  Bi-directional synchronization to automate jobs in real-time  Added values and future implementations "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 3
  • 4. The importance of data availability Data is one of the most important assets an organization has because it defines each organization’s uniqueness. Being a data-driven organization is not the final objective, but it represents a crucial process in the innovation challenge. "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 4
  • 5. Data requires Governance, but also Agility and Responsiveness "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 5 Collaborate in an open manner BE AGILE AND ADAPT TO CHANGES Agility Start with business- related data FAST TIME TO MARKET Share process to engage Inspire through Talend SHARE, DEMOCRATIZE AND INSPIRE FOR THE FUTURE Short and fast deliveries
  • 6. 3-steps Project plan starts comunicating with the stakeholders (1) Comunicate with decision- making players (2) Identify candidate data for business-related value (3) Model and implement design pattern for the specific process "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 6
  • 7. Talend is the leading open source integration software provider to data-driven enterprises "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 7 Open-Source leader Eclipse-similar IDE unified platform Java-based code generator Visual job design Graphical business process modeller (100% graphical) Smart product subscription Big Data native in real-time Reusable metadata elements +1000 built-in drag & drop connectors and components
  • 8. Determine customers containing business- related value "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 8 Prospects/Leads (potential Customers) Filter by fastest closed deals Particular industry (life science, manufacturing or finance...) Recent closed deals (filter by time range) Largest revenue generated streams Interested geographical area
  • 9. (1) Project phase: initial load prior ETL operations Once available the input flat files from the mainframe, the ETL (Extract, Transform and Load) operations to be executed could be the following:  Cleanse  Validate  Format  Unify  Standardize "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 9 pull data from MF cleanse, validate, format unify or standardize provision DB schema compatibility upload into SaaS CRM
  • 10. Data quality includes data validation "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 10 DATAVALIDATION NULL HANDLING STRING HANDLING DATE HANDLING THIRD-PARTY VALIDATION LIBRARIES Talend Data Preparation self-service free tool
  • 11. Business process model definition before we start implementing the job "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 11 Use of Talend DI canvas to model the business process. Flow of data will satisfy the following business requirement: only matched/validated Customers address records will be loaded into the SaaS CRM.
  • 12. Use Talend to set up the data migration between On-Premise input files and target SaaS CRM object (Account in Salesforce) "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 12 prior to Address validation
  • 13. Simplified job which uses tMap “magical” component to validate Address "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 13 Simplified job which uses tMap component to validate Customer address. The output are (1) loaded into Salesforce Account object as records and (2) rejected Customers with invalid addresses in an Excel spreadsheet for future analysis
  • 14. (2) Project phase: bi-directional synchronization between mainframe and SaaS "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 14 Talend built-in component tSalesforceGetUpdated_1 used for tracking changes (update, insert, upsert) in the Salesforce Account object and propagate them in real-time to a DB2 mainframe’s table. This component can work in background given a past Start and End time range. Another mechanism is the CDC (Change Data Capture).
  • 15. Bi-directional integration means real-time synchronization between the two databases There are some key issues to consider:  How similar are the schemas of the databases to be kept in sync (this helps for eventual JOIN operations)?  How often do the databases need to be synched (performance query…)?  How will we resolve situations in which the same data has been modified in both of databases since the last sync session (conflict based on the “record owner” or “last modified” solution to be described)?  How much effort and/or money are we willing to invest in developing our sync system (“keep project budget on track”)? "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 15
  • 16. (3) Added values: technical perspective  External lookup with any other data sources (supply chain, e-commerce, BI (analysis of ROIs, deals/opportunities), DW, Marketing, social networks activity/engagement, distributed and cross-platform applications… )  Reusable jobs, thanks to repository metadata  Versioning of the Java generated code (Github, Maven…)  Statistical reports about job execution (performance)  Other applications can trigger the job (example: collecting data for reports and dashboards…)  Unified and scalable integration platform (Data Preparation, DI, Cloud integration, ESB, MDM, Big Data, Fabric…) "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 16
  • 17. (3) Added values: business perspective  Give real value to the data asset (“enable data-driven organizations”)  Support for decisions (“how to use the information obtained?”) and provide them in advance (apply automatically and review rules regularly)  Remove data management risk when modernizing systems  Consolidate applications  Smooth subscription model (start with free open-source tool and then upgrade in a predictable fashion depending on business needs – pay only for the number of developers…)  Optimize processes by keeping comprehensive, relevant and consistent data everywhere.  Deliveries in real-time and analytics prediction!  Big Data native suite of products "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 17
  • 18. Thank you! "Orchestrate data with agility and responsiveness" - by Skender Kollcaku 18