SlideShare a Scribd company logo
1 of 33
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Andrew McIntyre
Director of Strategic ISV Alliances, Informatica
Accelerate Digital Transformation
through AI-powered Cloud Analytics
Modernization with Informatica
New
technology architecture
New users
and influencers
New
consumption
models
New ecosystems
Four
Challenges
3
Data explosion beyond data warehouse
No clear view of data relationships and semantic meaning of data
The Challenges with Changing Data Landscape….
Organizations are unable to maximize business value from their data assets
Growing number of users using self-service analytics
4 © Informatica. Proprietary and Confidential.4 © Informatica. Proprietary and Confidential.
58% of companies have a hybrid strategy with
footprints on both cloud and on-premises
Sources: Rightscale Cloud Computing Trends: 2018 State of the Cloud
Survey – February 2018, BetterCloud Monitor: The 2017 State of the
SaaS-Powered Workplace Report
Companies use 16 SaaS apps on average today
Informatica
Accelerates Your
Data-driven Digital
Transformation
Enterprise
Cloud Data
Management
1#
These graphics were published by Gartner, Inc. as part of larger research documents and should be evaluated in the context of the entire document. The Gartner documents are available upon request from Informatica.
Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest ratings or other designation. Gartner
research publications consist of the opinions of Gartner's research organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research,
including any warranties of merchantability or fitness for a particular purpose.
Gartner MQ for
Enterprise iPaaS
March 2017
Gartner MQ for
MDM Solutions
Oct 2017
Gartner MQ for
Data Quality Tools
Oct 2017
Gartner MQ for Metadata
Management Solutions
Aug 2017
Gartner MQ for Data
Integration Tools
Aug 2017
The Leader in Five Gartner Magic Quadrants
Informatica Intelligent Cloud Services
Connecting 100,000 applications,
databases and other end points
2 Trillion
Transactions a month
>300%
Growth of
API volume
3M+
Integrations/day
200%+ growth YoY
8,000+
Customers
150+
iPaaS Connectors
>50%
Annual revenue
growth
Volumes of data
2x
Every 6 months
A future-proof data foundation for your enterprise
Any Integration Pattern
Data, Steaming, Applications,
APIs, & Processes
Any User
For IT & Business Users
Any Data
Cloud, On-premises, IoT, Big
Data
Enterprise Unified Metadata Intelligence
A Single, Modular , Hybrid, Secure and Trusted, Platform
1
Cloud/Hybrid
360
Engagement
Data
Governance
Next Generation
Analytics
One Data Management Partner
Many Journeys
11 © Informatica. Proprietary and Confidential.
Integrate and manage your hybrid world
ica
Intelligent
Cloud
Services
AWS RedshiftAmazon Aurora
Integrating business
processes that span
clouds
Data management
challenges amplify
with multiple
environments
Understanding where all
of your data resides
Challenges
Adobe Cloud Platform
12
Cloud Apps (SaaS)Data Stores
DBs, DWs, Big Data, Cloud
Enterprise Systems B2B
Middleware and Tech
Analytics
Social Apps
Any Data - Broadest connectivity across cloud and on-premises
Redshift
13 © Informatica. Proprietary and Confidential.
Informatica Support Key AWS Services
DynamoDB S3Aurora RedshiftEMR Kinesis QuickSight
Informatica for AWS
Migration
15 © Informatica. Proprietary and Confidential.15 © Informatica. Proprietary and Confidential.15
Typical use cases on cloud journey
Cloud Data
Warehousing
AWS
Redshift
1 Cloud Application
Integration
2 Self-Service
Analytics
Amazon Web
Services QuickSight
3
Cloud Data
Lake
AWS
EC2 & EMR
Amazon Web
Services S3
4 Hybrid Data
Management
5 Cloud
Migration
SaaS PaaS IaaS
All ecosystem vendors across Cloud DW, Cloud Application Integration,
Cloud Data Lake/Big Data use cases
6
16 © Informatica. Proprietary and Confidential.
Four Most Common Cloud Journey Types
RE-HOST
RE-PLATFORM
RE-ARCHITECT
RE-BUY
17
Key Challenge – What Data to Modernize First?
Typical Large Enterprise
• 10000 – 50000 Database Schemas
• 1000 – 5000 Applications
• 10M – 100M Columns
• 1 – 5 Hadoop Data Lakes
• Multi-vendor IT
• Exponentially expanding data volumes
18
• Help organizations to democratize data
ü Enable governed self-service analytics
ü Take inventory of all data assets
ü Explore and understand data assets and data beyond data warehouses
• Discover data assets and relationships to make sense of data
ü Technical and business metadata
ü Data lineage, impact analysis
ü Data relationships
• Manage data as a strategic asset
Maximize the Value of Data Assets with a Data Catalog
19
Enterprise Data Catalog
Powered by CLAIRE engine
• Easily find and discover trusted data
• Explore holistic data relationships
• End-to-End data lineage & impact analysis
• Automatically catalog and classify data assets
• Curate data assets – governed or crowdsourced
data assets
• Machine-learning-based
semantic inference and recommendations
• Enhance Classification with entity recognition
• Broad Connectivity, Big Data Scale
Artificial-Intelligence based data discovery and visibility
to all data assets across the enterprise
20
Machine Curated Catalog
Auto-Scan
Source Metadata
Profiling and Domain DiscoveryMachine Learning
Curated Catalog
Business Glossary AssociationsCrowd Sourced AnnotationsGoverned Curation
Enterprise Data Catalog
Applications &
Databases
Internet of Things
3rd Party Data
Data Modeling
Tools
BI Tools CustomCloud
Enterprise Data Catalog
Data
Relationships
Data
Profile
Data
Lineage
Data
Classification
Data
Discovery
Informatica for AWS
Data Warehouse
Modernization
22 © Informatica. Proprietary and Confidential.
Cloud Data Warehouse / Analytics Modernization
Migrate Extend Born in the Cloud
• Agile self-service analytics
• Highly scalable & elastic
• Quickly scale
• Variety & volumes
of data for analysis
• Improve performance
• Reduce costs
23
Informatica for Redshift Data Warehouse Modernization
Data
Quality
Master Data Management
24
Fast Data Ingestion into Amazon Redshift
Amazon
Redshift
Batch Read Source Data
Number of Local
Staging Files =
multiple of RedShift
Slices
Files
encrypted
compressed
S3 VPC Endpoint
for performance
Parallel or
Multipart Upload
S3
Bucket
Key Range Partition
Source Key = Distribution Key
No of partitions = no of Redshift
slices
Optimized Copy
Command
Redshift Slices
Informatica for AWS
Next-Generation Data Lakes
on AWS
26 © Informatica. Proprietary and Confidential.
Data Lake on AWS
• Immediate Availability. Deploy instantly. No hardware to procure, no
infrastructure to maintain & scale
• Broad & Deep Capabilities. Over 70 services and 100s of features to support
virtually any big data application & workload
• Trusted & Secure. Designed to meet the strictest requirements. Continuously
audited, including certifications such as ISO 27001, FedRAMP, DoD CSM, and
PCI DSS.
• Hundreds of Partners & Solutions. Get help from a consulting partner or
choose from hundreds of tools and applications across the entire data
management stack.
27 © Informatica. Proprietary and Confidential.
Building a Data Lake on AWS
28 © Informatica. Proprietary and Confidential.
Building a Data Lake on AWS with Informatica
29 © Informatica. Proprietary and Confidential.
AIntelligent Data
Lake Management
STREAM INTEGRATE ENRICH PREPARE CATALOG RELATE PROTECT DELIVERINGEST DEFINE
Orchestrate
data flows
and provision
data to the
enterprise
Prepare data
for analytics
& collaborate
on projects
Streaming
analytics &
event
processing
Integrate all
types of data
of any volume
at scale
Cleanse and
enrich trusted
data
Define and
verify data
governance
policies
Discover,
catalog,and
curate all
enterprise data
Match
and relate
identities
& entities
at scale
Data security
intelligence to
Detect &
protect
sensitive data
Multi-latency
data ingestion
& edge
computing
CATALOG SEARCH LINEAGE RECOMMENDATIONSPARSE MATCH
Informatica’s Enterprise-Class Cloud Data Management Solution for Big Data
Amazon
S3
Amazon
Redshift
Amazon
EMR
30 © Informatica. Proprietary and Confidential.
Informatica Big Data Management Reference
Architecture for Amazon Data Lakes
Ingest
3rd Party Data
On-Prem
Data Center
Database &
Application Servers
Social & Web Logs
Landing
Zone
Raw Data
Amazon S3
Buckets
Data Curation
Amazon EMR
Compute
Cluster
Temporary S3 Storage
Enterprise
Zone
Hive on S3
Buckets
Discovery
Zone
Hive on S3
Buckets
Delivery
Amazon
Redshift
Amazon
RDS
Amazon
DynamoDB
On-Premises
Data Warehouse or
Master Data Management Hub
VPN
Gateway
customer
gateway
VPN
connection
VPN
Gateway
customer
gateway
VPN
connection
Analytic/BI
Tools
PWX
BDM
Big Data
Parser BDM
Informatica for AWS
Getting Started
32 © Informatica. Proprietary and Confidential.
Informatica Data Lake Solution for AWS Quick Start
Find out more:
https://www.informatica.com/solutions/explore-
ecosystems/aws/aws-data-lakes
High-performance codeless
integration of multiple clouds,
on-premises systems and AWS
Accelerate enterprise data lake
management deployment to
power next generation analytics
Catalog, understand and manage
data assets across clouds and
on-premises systems
Rapid Deployment with AWS
Quick Start
33
Learn more
Learn & Prepare
• Data Lake Management on
AWS
• Cloud Analytics with
Informatica Intelligent Cloud
Services & Amazon Redshift
• PowerCenter on AWS
Deep Dive Get Started
www.informatica.com/AWS
AWS Marketplace & Quick StartsReference Architecture GuidesWhitepapers & Workbooks

More Related Content

What's hot

Big Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning associationBig Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning associationJean-Michel Franco
 
Slides: The Business Value of Data Modeling
Slides: The Business Value of Data ModelingSlides: The Business Value of Data Modeling
Slides: The Business Value of Data ModelingDATAVERSITY
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleVasu S
 
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...Denodo
 
Using neo4j for enterprise metadata requirements
Using neo4j for enterprise metadata requirementsUsing neo4j for enterprise metadata requirements
Using neo4j for enterprise metadata requirementsNeo4j
 
IBM Governed Data Lake
IBM Governed Data LakeIBM Governed Data Lake
IBM Governed Data LakeKaran Sachdeva
 
Graph Databases for Master Data Management
Graph Databases for Master Data ManagementGraph Databases for Master Data Management
Graph Databases for Master Data ManagementNeo4j
 
The Evolution of Self-Service Analytics
The Evolution of Self-Service AnalyticsThe Evolution of Self-Service Analytics
The Evolution of Self-Service AnalyticsEckerson Group
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Cloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummitCloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummitMing Yuan
 
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...DLT Solutions
 
Best Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesBest Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesDenodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000Kartik Padmanabhan
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
Why My Wife Loves Data Governance
Why My Wife Loves Data GovernanceWhy My Wife Loves Data Governance
Why My Wife Loves Data GovernancePaul Boal
 
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360Databricks
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaCaserta
 

What's hot (20)

Big Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning associationBig Data and MDM altogether: the winning association
Big Data and MDM altogether: the winning association
 
Slides: The Business Value of Data Modeling
Slides: The Business Value of Data ModelingSlides: The Business Value of Data Modeling
Slides: The Business Value of Data Modeling
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | Qubole
 
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
 
Using neo4j for enterprise metadata requirements
Using neo4j for enterprise metadata requirementsUsing neo4j for enterprise metadata requirements
Using neo4j for enterprise metadata requirements
 
IBM Governed Data Lake
IBM Governed Data LakeIBM Governed Data Lake
IBM Governed Data Lake
 
Graph Databases for Master Data Management
Graph Databases for Master Data ManagementGraph Databases for Master Data Management
Graph Databases for Master Data Management
 
The Evolution of Self-Service Analytics
The Evolution of Self-Service AnalyticsThe Evolution of Self-Service Analytics
The Evolution of Self-Service Analytics
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Cloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummitCloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummit
 
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
 
Best Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesBest Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best Practices
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
Mdm And Ref Data
Mdm And Ref DataMdm And Ref Data
Mdm And Ref Data
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
Why My Wife Loves Data Governance
Why My Wife Loves Data GovernanceWhy My Wife Loves Data Governance
Why My Wife Loves Data Governance
 
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with Cloudera
 

Similar to AWS Summit Singapore - Accelerate Digital Transformation through AI-powered Cloud Analytics Modernization with Informatica

Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...Amazon Web Services
 
Big Data LDN 2017: Data Governance Reimagined
Big Data LDN 2017: Data Governance ReimaginedBig Data LDN 2017: Data Governance Reimagined
Big Data LDN 2017: Data Governance ReimaginedMatt Stubbs
 
20181212 AWS NL - Informatica Cloud Overview
20181212 AWS NL - Informatica Cloud Overview20181212 AWS NL - Informatica Cloud Overview
20181212 AWS NL - Informatica Cloud OverviewGreg Rakers
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingAmazon Web Services
 
IICS_Capabilities.pptx
IICS_Capabilities.pptxIICS_Capabilities.pptx
IICS_Capabilities.pptxNandan Kumar
 
Turn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWSTurn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWSAmazon Web Services
 
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...MongoDB
 
Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users Senturus
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationCambridge Semantics
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsPrecisely
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Cambridge Semantics
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyNeo4j
 
When SAP alone is not enough
When SAP alone is not enoughWhen SAP alone is not enough
When SAP alone is not enoughCloudera, Inc.
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyArcadia Data
 
Impulser la digitalisation et modernisation de la fonction Finance grâce à la...
Impulser la digitalisation et modernisation de la fonction Finance grâce à la...Impulser la digitalisation et modernisation de la fonction Finance grâce à la...
Impulser la digitalisation et modernisation de la fonction Finance grâce à la...Denodo
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleBardess Group
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikBardess Group
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends? Karan Sachdeva
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DATAVERSITY
 

Similar to AWS Summit Singapore - Accelerate Digital Transformation through AI-powered Cloud Analytics Modernization with Informatica (20)

Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
 
Big Data LDN 2017: Data Governance Reimagined
Big Data LDN 2017: Data Governance ReimaginedBig Data LDN 2017: Data Governance Reimagined
Big Data LDN 2017: Data Governance Reimagined
 
20181212 AWS NL - Informatica Cloud Overview
20181212 AWS NL - Informatica Cloud Overview20181212 AWS NL - Informatica Cloud Overview
20181212 AWS NL - Informatica Cloud Overview
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data Warehousing
 
IICS_Capabilities.pptx
IICS_Capabilities.pptxIICS_Capabilities.pptx
IICS_Capabilities.pptx
 
Turn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWSTurn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWS
 
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
 
Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog Requirements
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 
When SAP alone is not enough
When SAP alone is not enoughWhen SAP alone is not enough
When SAP alone is not enough
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics Strategy
 
Impulser la digitalisation et modernisation de la fonction Finance grâce à la...
Impulser la digitalisation et modernisation de la fonction Finance grâce à la...Impulser la digitalisation et modernisation de la fonction Finance grâce à la...
Impulser la digitalisation et modernisation de la fonction Finance grâce à la...
 
IBM Cloud pak for data brochure
IBM Cloud pak for data   brochureIBM Cloud pak for data   brochure
IBM Cloud pak for data brochure
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus Example
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess Qlik
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends?
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

AWS Summit Singapore - Accelerate Digital Transformation through AI-powered Cloud Analytics Modernization with Informatica

  • 1. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Andrew McIntyre Director of Strategic ISV Alliances, Informatica Accelerate Digital Transformation through AI-powered Cloud Analytics Modernization with Informatica
  • 2. New technology architecture New users and influencers New consumption models New ecosystems Four Challenges
  • 3. 3 Data explosion beyond data warehouse No clear view of data relationships and semantic meaning of data The Challenges with Changing Data Landscape…. Organizations are unable to maximize business value from their data assets Growing number of users using self-service analytics
  • 4. 4 © Informatica. Proprietary and Confidential.4 © Informatica. Proprietary and Confidential. 58% of companies have a hybrid strategy with footprints on both cloud and on-premises Sources: Rightscale Cloud Computing Trends: 2018 State of the Cloud Survey – February 2018, BetterCloud Monitor: The 2017 State of the SaaS-Powered Workplace Report Companies use 16 SaaS apps on average today
  • 7. These graphics were published by Gartner, Inc. as part of larger research documents and should be evaluated in the context of the entire document. The Gartner documents are available upon request from Informatica. Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest ratings or other designation. Gartner research publications consist of the opinions of Gartner's research organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose. Gartner MQ for Enterprise iPaaS March 2017 Gartner MQ for MDM Solutions Oct 2017 Gartner MQ for Data Quality Tools Oct 2017 Gartner MQ for Metadata Management Solutions Aug 2017 Gartner MQ for Data Integration Tools Aug 2017 The Leader in Five Gartner Magic Quadrants
  • 8. Informatica Intelligent Cloud Services Connecting 100,000 applications, databases and other end points 2 Trillion Transactions a month >300% Growth of API volume 3M+ Integrations/day 200%+ growth YoY 8,000+ Customers 150+ iPaaS Connectors >50% Annual revenue growth Volumes of data 2x Every 6 months
  • 9. A future-proof data foundation for your enterprise Any Integration Pattern Data, Steaming, Applications, APIs, & Processes Any User For IT & Business Users Any Data Cloud, On-premises, IoT, Big Data Enterprise Unified Metadata Intelligence A Single, Modular , Hybrid, Secure and Trusted, Platform
  • 11. 11 © Informatica. Proprietary and Confidential. Integrate and manage your hybrid world ica Intelligent Cloud Services AWS RedshiftAmazon Aurora Integrating business processes that span clouds Data management challenges amplify with multiple environments Understanding where all of your data resides Challenges Adobe Cloud Platform
  • 12. 12 Cloud Apps (SaaS)Data Stores DBs, DWs, Big Data, Cloud Enterprise Systems B2B Middleware and Tech Analytics Social Apps Any Data - Broadest connectivity across cloud and on-premises Redshift
  • 13. 13 © Informatica. Proprietary and Confidential. Informatica Support Key AWS Services DynamoDB S3Aurora RedshiftEMR Kinesis QuickSight
  • 15. 15 © Informatica. Proprietary and Confidential.15 © Informatica. Proprietary and Confidential.15 Typical use cases on cloud journey Cloud Data Warehousing AWS Redshift 1 Cloud Application Integration 2 Self-Service Analytics Amazon Web Services QuickSight 3 Cloud Data Lake AWS EC2 & EMR Amazon Web Services S3 4 Hybrid Data Management 5 Cloud Migration SaaS PaaS IaaS All ecosystem vendors across Cloud DW, Cloud Application Integration, Cloud Data Lake/Big Data use cases 6
  • 16. 16 © Informatica. Proprietary and Confidential. Four Most Common Cloud Journey Types RE-HOST RE-PLATFORM RE-ARCHITECT RE-BUY
  • 17. 17 Key Challenge – What Data to Modernize First? Typical Large Enterprise • 10000 – 50000 Database Schemas • 1000 – 5000 Applications • 10M – 100M Columns • 1 – 5 Hadoop Data Lakes • Multi-vendor IT • Exponentially expanding data volumes
  • 18. 18 • Help organizations to democratize data ü Enable governed self-service analytics ü Take inventory of all data assets ü Explore and understand data assets and data beyond data warehouses • Discover data assets and relationships to make sense of data ü Technical and business metadata ü Data lineage, impact analysis ü Data relationships • Manage data as a strategic asset Maximize the Value of Data Assets with a Data Catalog
  • 19. 19 Enterprise Data Catalog Powered by CLAIRE engine • Easily find and discover trusted data • Explore holistic data relationships • End-to-End data lineage & impact analysis • Automatically catalog and classify data assets • Curate data assets – governed or crowdsourced data assets • Machine-learning-based semantic inference and recommendations • Enhance Classification with entity recognition • Broad Connectivity, Big Data Scale Artificial-Intelligence based data discovery and visibility to all data assets across the enterprise
  • 20. 20 Machine Curated Catalog Auto-Scan Source Metadata Profiling and Domain DiscoveryMachine Learning Curated Catalog Business Glossary AssociationsCrowd Sourced AnnotationsGoverned Curation Enterprise Data Catalog Applications & Databases Internet of Things 3rd Party Data Data Modeling Tools BI Tools CustomCloud Enterprise Data Catalog Data Relationships Data Profile Data Lineage Data Classification Data Discovery
  • 21. Informatica for AWS Data Warehouse Modernization
  • 22. 22 © Informatica. Proprietary and Confidential. Cloud Data Warehouse / Analytics Modernization Migrate Extend Born in the Cloud • Agile self-service analytics • Highly scalable & elastic • Quickly scale • Variety & volumes of data for analysis • Improve performance • Reduce costs
  • 23. 23 Informatica for Redshift Data Warehouse Modernization Data Quality Master Data Management
  • 24. 24 Fast Data Ingestion into Amazon Redshift Amazon Redshift Batch Read Source Data Number of Local Staging Files = multiple of RedShift Slices Files encrypted compressed S3 VPC Endpoint for performance Parallel or Multipart Upload S3 Bucket Key Range Partition Source Key = Distribution Key No of partitions = no of Redshift slices Optimized Copy Command Redshift Slices
  • 26. 26 © Informatica. Proprietary and Confidential. Data Lake on AWS • Immediate Availability. Deploy instantly. No hardware to procure, no infrastructure to maintain & scale • Broad & Deep Capabilities. Over 70 services and 100s of features to support virtually any big data application & workload • Trusted & Secure. Designed to meet the strictest requirements. Continuously audited, including certifications such as ISO 27001, FedRAMP, DoD CSM, and PCI DSS. • Hundreds of Partners & Solutions. Get help from a consulting partner or choose from hundreds of tools and applications across the entire data management stack.
  • 27. 27 © Informatica. Proprietary and Confidential. Building a Data Lake on AWS
  • 28. 28 © Informatica. Proprietary and Confidential. Building a Data Lake on AWS with Informatica
  • 29. 29 © Informatica. Proprietary and Confidential. AIntelligent Data Lake Management STREAM INTEGRATE ENRICH PREPARE CATALOG RELATE PROTECT DELIVERINGEST DEFINE Orchestrate data flows and provision data to the enterprise Prepare data for analytics & collaborate on projects Streaming analytics & event processing Integrate all types of data of any volume at scale Cleanse and enrich trusted data Define and verify data governance policies Discover, catalog,and curate all enterprise data Match and relate identities & entities at scale Data security intelligence to Detect & protect sensitive data Multi-latency data ingestion & edge computing CATALOG SEARCH LINEAGE RECOMMENDATIONSPARSE MATCH Informatica’s Enterprise-Class Cloud Data Management Solution for Big Data Amazon S3 Amazon Redshift Amazon EMR
  • 30. 30 © Informatica. Proprietary and Confidential. Informatica Big Data Management Reference Architecture for Amazon Data Lakes Ingest 3rd Party Data On-Prem Data Center Database & Application Servers Social & Web Logs Landing Zone Raw Data Amazon S3 Buckets Data Curation Amazon EMR Compute Cluster Temporary S3 Storage Enterprise Zone Hive on S3 Buckets Discovery Zone Hive on S3 Buckets Delivery Amazon Redshift Amazon RDS Amazon DynamoDB On-Premises Data Warehouse or Master Data Management Hub VPN Gateway customer gateway VPN connection VPN Gateway customer gateway VPN connection Analytic/BI Tools PWX BDM Big Data Parser BDM
  • 32. 32 © Informatica. Proprietary and Confidential. Informatica Data Lake Solution for AWS Quick Start Find out more: https://www.informatica.com/solutions/explore- ecosystems/aws/aws-data-lakes High-performance codeless integration of multiple clouds, on-premises systems and AWS Accelerate enterprise data lake management deployment to power next generation analytics Catalog, understand and manage data assets across clouds and on-premises systems Rapid Deployment with AWS Quick Start
  • 33. 33 Learn more Learn & Prepare • Data Lake Management on AWS • Cloud Analytics with Informatica Intelligent Cloud Services & Amazon Redshift • PowerCenter on AWS Deep Dive Get Started www.informatica.com/AWS AWS Marketplace & Quick StartsReference Architecture GuidesWhitepapers & Workbooks