Copyright © 2020 Impetus Technologies, Inc.
You are prohibited from making a copy or modification of, or from redistributing, rebroadcasting, or
re-encoding of this content without the prior written consent of Impetus.
This presentation may include images from other products and services. These images are used for
illustrative purposes only. Unless explicitly stated there is no implied endorsement or sponsorship of
these products by Impetus. All copyrights and trademarks are property of their respective owners.
Best Practices to Build a Sustainable Data Lake on Cloud
Our mission
Enabling a unified, clear, and present view of your business
Agenda
Data Lake Essentials
The Cloud Advantage
Cloud Adoption Strategy
Keys to Building a Robust Data Lake on Cloud
Q&A
Poll
Speakers
Mita Baxi
Solutions Architect
Chetan Kalanki
Sr. Director
Big Data and Cloud Engineering
The Cloud Opportunity
"The worldwide public cloud service market will grow to $331.2 B in 2022" - Gartner
"Data Lakes Market will touch an aggregate of $12.01 B by 2024" - MarketWatch
"If you have not developed a cloud-first strategy yet, you are likely falling behind your competitors" - Gartner
What’s in it for me?
Key influencers in building a data lake on cloud
Best practices for building a sustainable data lake on cloud
What is a Data Lake?
Data Lake
Database
Logs
XML Data
Spreadsheet
Text
Data lake: Key functional aspects
Ingest | Store | Process | Analyze | Consume
Data Privacy, Security and Access Management
Data Management
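As a rough illustration of how the five functional stages hand data to one another, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the `key=value` input format, the in-memory list standing in for a raw zone, and all function names are hypothetical and do not come from the presentation or from any particular product.

```python
from typing import Dict, List, Tuple

Record = Dict[str, str]


def ingest(raw_lines: List[str]) -> List[Record]:
    """Ingest: parse raw 'key=value' lines (assumed format) into records."""
    records = []
    for line in raw_lines:
        key, _, value = line.partition("=")
        records.append({"key": key.strip(), "value": value.strip()})
    return records


def store(records: List[Record], raw_zone: List[Record]) -> None:
    """Store: append records unchanged to the raw zone (a plain list here)."""
    raw_zone.extend(records)


def process(raw_zone: List[Record]) -> List[Tuple[str, float]]:
    """Process: keep only records whose value parses as a number."""
    processed = []
    for record in raw_zone:
        try:
            processed.append((record["key"], float(record["value"])))
        except ValueError:
            continue  # non-numeric rows remain in the raw zone only
    return processed


def analyze(processed: List[Tuple[str, float]]) -> Dict[str, float]:
    """Analyze: total the numeric values per key."""
    totals: Dict[str, float] = {}
    for key, value in processed:
        totals[key] = totals.get(key, 0.0) + value
    return totals


def consume(totals: Dict[str, float]) -> List[str]:
    """Consume: render the totals as report lines, sorted by key."""
    return [f"{key}: {total}" for key, total in sorted(totals.items())]


def run_pipeline(raw_lines: List[str]) -> List[str]:
    """Chain the five stages in order."""
    raw_zone: List[Record] = []
    store(ingest(raw_lines), raw_zone)
    return consume(analyze(process(raw_zone)))
```

Calling `run_pipeline(["a=1", "a=2", "b=x", "b=3"])` keeps the unparseable `b=x` row in the raw zone but leaves it out of the totals. In a real lake each stage would be a separate service, with security and data management applied across all of them.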
Data lake: Governance essentials
Ingress
Data Discovery and Curation
Profiling | Classification | Lineage | Prepare
Metadata Catalog | Catalog | MDM | Archive
Quality
Physical | Encryption | Access | Audit
Data Discovery, Reporting and Visualization
End-to-end big data lake capability view
Metadata / Governance Layer
Streaming
Structured (ODS) / Unstructured
Geospatial / Machine / Time Series
External Social
Raw Layer
Processed Layer
Landing/Staging
Active Archive
Common Data Model
ODL – Rest / Motion
Master
Reference Data
API Data
Mart
Stewardship/Policies
QuickSight
Athena
SLA-Based Data Vending Layer
Sandbox
Stores
Post Analytics Store
Search
ODL-End User
Lineage
Service
Catalog
Audit / Monitoring
Workflow/DQ
Workflow/DQ
Why cloud?
Agility
Scalability
Security
Economics
Advanced analytics
Success criteria
Pace of adoption
Footprint
Time-to-market
Innovation
ROI and TCO
Cloud adoption cycle
Excitement
Realization of Complexity, Effort and Impact on Individual
Transformation
Deployment
Chaos
Performance
Time
Integration
Key influencers
Process
Technology
People
Critical concerns
Knowledge
Funding and cost
Purchasing
Resistance
Plan
Poll Results
Best Practices
Build a solution from an enterprise view
Goal alignment
Budgeting
Chargebacks
ROI assessment
Assess reskill needs
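One simple way to reason about chargebacks is to split a shared cloud bill across teams in proportion to their measured usage. The sketch below assumes a single usage number per team (for example, TB scanned or compute hours). The function name, team names, and figures are hypothetical and are not taken from the presentation.

```python
from typing import Dict


def chargeback(total_cost: float, usage_by_team: Dict[str, float]) -> Dict[str, float]:
    """Allocate total_cost to teams in proportion to their usage."""
    total_usage = sum(usage_by_team.values())
    if total_usage <= 0:
        raise ValueError("total usage must be positive to allocate cost")
    return {
        team: round(total_cost * usage / total_usage, 2)
        for team, usage in usage_by_team.items()
    }


# Hypothetical monthly bill of 1000.0 split across three teams.
shares = chargeback(1000.0, {"marketing": 30, "finance": 10, "data-science": 60})
```

Proportional allocation is only one possible policy. Fixed platform fees or tiered rates may suit some organizations better, and rounding each share to cents can leave a small remainder that has to be assigned somewhere.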
Focus on value delivery
Adopt a cloud-first commitment
Have flexibility to change
Use accelerators
Build a foundation first
Security should be your top priority
Address key challenges first
Reusable templates
Logging and monitoring
Financial controls
Select the right tooling
Data integration
Data catalog
Data quality
Data governance
Fail fast
Start small
Perform proof of concept
Extend and enhance
Fail fast, recover fast
Measure
Assess lock-ins
Which layer do you want to lock in?
- Tool
- Framework
- Platform
- Cloud
- Solution
Build for expansion
Business expansion
Geographical compliance
Integrations
Build for speed of change
DevOps
Automate
Cloud-native tools
Engage experts
Know and optimize expenditure – Always
Hard cost
Soft cost
Measure everything you can
Usage data
Security metrics
Cost parameters
Skill levels
Recap
Our Services
Advisory and Consulting
Cloud Enablement and Migration, Big Data Enablement and Migration, Data Lake, DevOps, Usability, Mobility, etc.
Architecture, Design and Engineering
Technology Evaluation and Benchmarking, Solution Design and Architecture, Engineering, Quality Assurance, NFRs and Performance Engineering, etc.
Enterprise Data Management
Data Lake, Data Modelling, Data Migration, Data Visualization, Data Democratization and Governance, etc.
DevOps and Productionization
Capacity Planning, Infra as Code, Infra Provisioning, Automation and Administration, Ops Support, etc.
Data Science and Analytics
NLP and NLG, AI and Deep Learning, Descriptive-Prescriptive-Predictive Analytics, Sentiment Analysis, etc.
User Experience
UX Design and Architecture, Rich Media Design, Mobile App Design, Responsive Design, UX Lab Assessment, etc.
Impetus cloud practice competencies
Amazon Web Services
Microsoft Azure
Google Cloud
Salesforce
Advisory, strategy, TCO
Architecture evaluation
Cloud infrastructure realization
Cloud cost optimization
Workload assessment and transformation
Capacity planning
Automation and orchestration
Security and governance
DevOps
Maintenance and administration
Thank you. Questions?
Visit www.impetus.com or get in touch with us at inquiry@impetus.com

More Related Content

What's hot

5 benefits of real-time visibility
5 benefits of real-time visibility5 benefits of real-time visibility
5 benefits of real-time visibilityAna Paula Hurtado
 
Increase your collaboration with Azure Automation
Increase your collaboration with Azure AutomationIncrease your collaboration with Azure Automation
Increase your collaboration with Azure AutomationMike Maadarani
 
Io t a_de_techgigwebinar_04nov2016
Io t a_de_techgigwebinar_04nov2016Io t a_de_techgigwebinar_04nov2016
Io t a_de_techgigwebinar_04nov2016Dr. Aloknath De
 
Desktop-as-a-Service: Flexible Application Delivery to Cloud-Native Desktops
Desktop-as-a-Service: Flexible Application Delivery to Cloud-Native DesktopsDesktop-as-a-Service: Flexible Application Delivery to Cloud-Native Desktops
Desktop-as-a-Service: Flexible Application Delivery to Cloud-Native DesktopsAmazon Web Services
 
Introduction to vRealize Suite Messaging
Introduction to vRealize Suite MessagingIntroduction to vRealize Suite Messaging
Introduction to vRealize Suite MessagingJennifer Stern
 
SporTech BI Presentation
SporTech BI PresentationSporTech BI Presentation
SporTech BI PresentationSporTechBI
 
Lessons Learned from Building a CSB Part III
Lessons Learned from Building a CSB Part IIILessons Learned from Building a CSB Part III
Lessons Learned from Building a CSB Part IIIGravitant, Inc.
 
Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?
Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?
Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?Capgemini
 
Global Cloud Migration Market (2020 - 2025) - Mordor Intelligence
Global Cloud Migration Market (2020 - 2025) - Mordor IntelligenceGlobal Cloud Migration Market (2020 - 2025) - Mordor Intelligence
Global Cloud Migration Market (2020 - 2025) - Mordor IntelligenceSampath pogula
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Solutionpath - HPE Discover 2015
Solutionpath - HPE Discover 2015Solutionpath - HPE Discover 2015
Solutionpath - HPE Discover 2015Gemma Wilson
 
The Journey to Digital Enterprise, presented by CSC
The Journey to Digital Enterprise, presented by CSCThe Journey to Digital Enterprise, presented by CSC
The Journey to Digital Enterprise, presented by CSCAmazon Web Services
 
Lessons Learned from building a CSB Part I
Lessons Learned from building a CSB Part ILessons Learned from building a CSB Part I
Lessons Learned from building a CSB Part IGravitant, Inc.
 
Cloud ROI and Implementation - A TechBlocks Solutions Guide
Cloud ROI and Implementation - A TechBlocks Solutions GuideCloud ROI and Implementation - A TechBlocks Solutions Guide
Cloud ROI and Implementation - A TechBlocks Solutions GuideTechBlocks
 
Hybrid Cloud Essential for Success
Hybrid Cloud Essential for SuccessHybrid Cloud Essential for Success
Hybrid Cloud Essential for SuccessNetApp
 
Modern IT: Keeping Pace in a Cloud-First World
Modern IT: Keeping Pace in a Cloud-First WorldModern IT: Keeping Pace in a Cloud-First World
Modern IT: Keeping Pace in a Cloud-First WorldSWC Technology Partners
 
CWIN17 san francisco-kiran murthy-cloud native - sf v4
CWIN17 san francisco-kiran murthy-cloud native - sf v4CWIN17 san francisco-kiran murthy-cloud native - sf v4
CWIN17 san francisco-kiran murthy-cloud native - sf v4Capgemini
 
CSC Journey to the Digital Enterprise
CSC Journey to the Digital EnterpriseCSC Journey to the Digital Enterprise
CSC Journey to the Digital EnterpriseKristof Breesch
 

What's hot (20)

5 benefits of real-time visibility
5 benefits of real-time visibility5 benefits of real-time visibility
5 benefits of real-time visibility
 
Increase your collaboration with Azure Automation
Increase your collaboration with Azure AutomationIncrease your collaboration with Azure Automation
Increase your collaboration with Azure Automation
 
Io t a_de_techgigwebinar_04nov2016
Io t a_de_techgigwebinar_04nov2016Io t a_de_techgigwebinar_04nov2016
Io t a_de_techgigwebinar_04nov2016
 
Desktop-as-a-Service: Flexible Application Delivery to Cloud-Native Desktops
Desktop-as-a-Service: Flexible Application Delivery to Cloud-Native DesktopsDesktop-as-a-Service: Flexible Application Delivery to Cloud-Native Desktops
Desktop-as-a-Service: Flexible Application Delivery to Cloud-Native Desktops
 
Introduction to vRealize Suite Messaging
Introduction to vRealize Suite MessagingIntroduction to vRealize Suite Messaging
Introduction to vRealize Suite Messaging
 
SporTech BI Presentation
SporTech BI PresentationSporTech BI Presentation
SporTech BI Presentation
 
Lessons Learned from Building a CSB Part III
Lessons Learned from Building a CSB Part IIILessons Learned from Building a CSB Part III
Lessons Learned from Building a CSB Part III
 
Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?
Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?
Failing and Failing Fast in AppDev – How Do We Keep up in AppSec?
 
Global Cloud Migration Market (2020 - 2025) - Mordor Intelligence
Global Cloud Migration Market (2020 - 2025) - Mordor IntelligenceGlobal Cloud Migration Market (2020 - 2025) - Mordor Intelligence
Global Cloud Migration Market (2020 - 2025) - Mordor Intelligence
 
App Modernization
App ModernizationApp Modernization
App Modernization
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Solutionpath - HPE Discover 2015
Solutionpath - HPE Discover 2015Solutionpath - HPE Discover 2015
Solutionpath - HPE Discover 2015
 
The Journey to Digital Enterprise, presented by CSC
The Journey to Digital Enterprise, presented by CSCThe Journey to Digital Enterprise, presented by CSC
The Journey to Digital Enterprise, presented by CSC
 
Marketing concepts
Marketing conceptsMarketing concepts
Marketing concepts
 
Lessons Learned from building a CSB Part I
Lessons Learned from building a CSB Part ILessons Learned from building a CSB Part I
Lessons Learned from building a CSB Part I
 
Cloud ROI and Implementation - A TechBlocks Solutions Guide
Cloud ROI and Implementation - A TechBlocks Solutions GuideCloud ROI and Implementation - A TechBlocks Solutions Guide
Cloud ROI and Implementation - A TechBlocks Solutions Guide
 
Hybrid Cloud Essential for Success
Hybrid Cloud Essential for SuccessHybrid Cloud Essential for Success
Hybrid Cloud Essential for Success
 
Modern IT: Keeping Pace in a Cloud-First World
Modern IT: Keeping Pace in a Cloud-First WorldModern IT: Keeping Pace in a Cloud-First World
Modern IT: Keeping Pace in a Cloud-First World
 
CWIN17 san francisco-kiran murthy-cloud native - sf v4
CWIN17 san francisco-kiran murthy-cloud native - sf v4CWIN17 san francisco-kiran murthy-cloud native - sf v4
CWIN17 san francisco-kiran murthy-cloud native - sf v4
 
CSC Journey to the Digital Enterprise
CSC Journey to the Digital EnterpriseCSC Journey to the Digital Enterprise
CSC Journey to the Digital Enterprise
 

Similar to Best practices to build a sustainable data lake on cloud - Impetus Webinar

Get ahead of the cloud or get left behind
Get ahead of the cloud or get left behindGet ahead of the cloud or get left behind
Get ahead of the cloud or get left behindMatt Mandich
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Planning A Cloud Implementation
Planning A Cloud ImplementationPlanning A Cloud Implementation
Planning A Cloud ImplementationRex Wang
 
Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...
Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...
Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...Global Business Events
 
How to Utilize Cloud in Your Corporate IT Strategy
How to Utilize Cloud in Your Corporate IT StrategyHow to Utilize Cloud in Your Corporate IT Strategy
How to Utilize Cloud in Your Corporate IT StrategyVISIHOSTING
 
Augmenting IT strategy with Enterprise architecture assessment
Augmenting IT strategy with Enterprise architecture assessmentAugmenting IT strategy with Enterprise architecture assessment
Augmenting IT strategy with Enterprise architecture assessmentPrashanth Panduranga
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXtsigitnist02
 
Business agility imperatives smarter solutions-transformation-icty 2011-1
Business agility imperatives smarter solutions-transformation-icty 2011-1Business agility imperatives smarter solutions-transformation-icty 2011-1
Business agility imperatives smarter solutions-transformation-icty 2011-1zslmarketing
 
Enterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessEnterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessAmazon Web Services
 
Enterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessEnterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessAmazon Web Services
 
Webinar: Make Your Cloud Strategy Work for 2016
Webinar: Make Your Cloud Strategy Work for 2016Webinar: Make Your Cloud Strategy Work for 2016
Webinar: Make Your Cloud Strategy Work for 2016Alexandra Sasha Tchulkova
 
Enterprise IT Cloud Segmentation Model
Enterprise IT Cloud Segmentation ModelEnterprise IT Cloud Segmentation Model
Enterprise IT Cloud Segmentation ModelAtchison Frazer
 
tero-peltola-serverlessMeetup-10.11.2022.ppt
tero-peltola-serverlessMeetup-10.11.2022.ppttero-peltola-serverlessMeetup-10.11.2022.ppt
tero-peltola-serverlessMeetup-10.11.2022.pptTero Peltola
 
Developing Your Cloud Strategy
Developing Your Cloud StrategyDeveloping Your Cloud Strategy
Developing Your Cloud StrategyVISI
 
I360 Gov Kaplan Presentation V03 15 10
I360 Gov Kaplan Presentation V03 15 10I360 Gov Kaplan Presentation V03 15 10
I360 Gov Kaplan Presentation V03 15 10Jeffrey Kaplan
 
Building the Agile Enterprise - Cloud Computing
Building the Agile Enterprise - Cloud ComputingBuilding the Agile Enterprise - Cloud Computing
Building the Agile Enterprise - Cloud ComputingSrinivas Koushik
 

Similar to Best practices to build a sustainable data lake on cloud - Impetus Webinar (20)

Get ahead of the cloud or get left behind
Get ahead of the cloud or get left behindGet ahead of the cloud or get left behind
Get ahead of the cloud or get left behind
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Planning A Cloud Implementation
Planning A Cloud ImplementationPlanning A Cloud Implementation
Planning A Cloud Implementation
 
Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...
Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...
Riaan Van Nierkirk, CIO at Mc Gregor - Can business transformation be success...
 
How to Utilize Cloud in Your Corporate IT Strategy
How to Utilize Cloud in Your Corporate IT StrategyHow to Utilize Cloud in Your Corporate IT Strategy
How to Utilize Cloud in Your Corporate IT Strategy
 
Augmenting IT strategy with Enterprise architecture assessment
Augmenting IT strategy with Enterprise architecture assessmentAugmenting IT strategy with Enterprise architecture assessment
Augmenting IT strategy with Enterprise architecture assessment
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 
Business agility imperatives smarter solutions-transformation-icty 2011-1
Business agility imperatives smarter solutions-transformation-icty 2011-1Business agility imperatives smarter solutions-transformation-icty 2011-1
Business agility imperatives smarter solutions-transformation-icty 2011-1
 
Enterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessEnterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - Business
 
Enterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessEnterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - Business
 
Getting ready for the cloud sukhbir jasuja
Getting ready for the cloud sukhbir jasujaGetting ready for the cloud sukhbir jasuja
Getting ready for the cloud sukhbir jasuja
 
Webinar: Make Your Cloud Strategy Work for 2016
Webinar: Make Your Cloud Strategy Work for 2016Webinar: Make Your Cloud Strategy Work for 2016
Webinar: Make Your Cloud Strategy Work for 2016
 
Make your cloud strategy work for 2016 webinar 1.13.16
Make your cloud strategy work for 2016 webinar 1.13.16Make your cloud strategy work for 2016 webinar 1.13.16
Make your cloud strategy work for 2016 webinar 1.13.16
 
Enterprise IT Cloud Segmentation Model
Enterprise IT Cloud Segmentation ModelEnterprise IT Cloud Segmentation Model
Enterprise IT Cloud Segmentation Model
 
tero-peltola-serverlessMeetup-10.11.2022.ppt
tero-peltola-serverlessMeetup-10.11.2022.ppttero-peltola-serverlessMeetup-10.11.2022.ppt
tero-peltola-serverlessMeetup-10.11.2022.ppt
 
Cloud webinar final
Cloud webinar finalCloud webinar final
Cloud webinar final
 
Developing Your Cloud Strategy
Developing Your Cloud StrategyDeveloping Your Cloud Strategy
Developing Your Cloud Strategy
 
I360 Gov Kaplan Presentation V03 15 10
I360 Gov Kaplan Presentation V03 15 10I360 Gov Kaplan Presentation V03 15 10
I360 Gov Kaplan Presentation V03 15 10
 
Building the Agile Enterprise - Cloud Computing
Building the Agile Enterprise - Cloud ComputingBuilding the Agile Enterprise - Cloud Computing
Building the Agile Enterprise - Cloud Computing
 

More from Impetus Technologies

The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...
The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...
The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...Impetus Technologies
 
Eliminate cyber-security threats using data analytics – Build a resilient ent...
Eliminate cyber-security threats using data analytics – Build a resilient ent...Eliminate cyber-security threats using data analytics – Build a resilient ent...
Eliminate cyber-security threats using data analytics – Build a resilient ent...Impetus Technologies
 
Automated EDW Assessment and Actionable Recommendations - Impetus Webinar
Automated EDW Assessment and Actionable Recommendations - Impetus WebinarAutomated EDW Assessment and Actionable Recommendations - Impetus Webinar
Automated EDW Assessment and Actionable Recommendations - Impetus WebinarImpetus Technologies
 
Building a mature foundation for life in the cloud
Building a mature foundation for life in the cloudBuilding a mature foundation for life in the cloud
Building a mature foundation for life in the cloudImpetus Technologies
 
Automate and Optimize Data Warehouse Migration to Snowflake
Automate and Optimize Data Warehouse Migration to SnowflakeAutomate and Optimize Data Warehouse Migration to Snowflake
Automate and Optimize Data Warehouse Migration to SnowflakeImpetus Technologies
 
Instantly convert Teradata ETL and EDW to Spark- Impetus webinar
Instantly convert Teradata ETL and EDW to Spark- Impetus webinarInstantly convert Teradata ETL and EDW to Spark- Impetus webinar
Instantly convert Teradata ETL and EDW to Spark- Impetus webinarImpetus Technologies
 
Keys to establish sustainable DW and analytics on the cloud -Impetus webinar
Keys to establish sustainable DW and analytics on the cloud -Impetus webinarKeys to establish sustainable DW and analytics on the cloud -Impetus webinar
Keys to establish sustainable DW and analytics on the cloud -Impetus webinarImpetus Technologies
 
Solving the EDW transformation conundrum - Impetus webinar
Solving the EDW transformation conundrum - Impetus webinarSolving the EDW transformation conundrum - Impetus webinar
Solving the EDW transformation conundrum - Impetus webinarImpetus Technologies
 
Anomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleAnomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleImpetus Technologies
 
Keys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of DataKeys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of DataImpetus Technologies
 
Build Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in MinutesBuild Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in MinutesImpetus Technologies
 
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...Impetus Technologies
 
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Impetus Technologies
 
Streaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache SparkStreaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache SparkImpetus Technologies
 
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationAnomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationImpetus Technologies
 
The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...Impetus Technologies
 
Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...Impetus Technologies
 
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxAnomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxImpetus Technologies
 

More from Impetus Technologies (19)

The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...
The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...
The fastest way to convert etl analytics and data warehouse to AWS- Impetus W...
 
Eliminate cyber-security threats using data analytics – Build a resilient ent...
Eliminate cyber-security threats using data analytics – Build a resilient ent...Eliminate cyber-security threats using data analytics – Build a resilient ent...
Eliminate cyber-security threats using data analytics – Build a resilient ent...
 
Automated EDW Assessment and Actionable Recommendations - Impetus Webinar
Automated EDW Assessment and Actionable Recommendations - Impetus WebinarAutomated EDW Assessment and Actionable Recommendations - Impetus Webinar
Automated EDW Assessment and Actionable Recommendations - Impetus Webinar
 
Building a mature foundation for life in the cloud
Building a mature foundation for life in the cloudBuilding a mature foundation for life in the cloud
Building a mature foundation for life in the cloud
 
Automate and Optimize Data Warehouse Migration to Snowflake
Automate and Optimize Data Warehouse Migration to SnowflakeAutomate and Optimize Data Warehouse Migration to Snowflake
Automate and Optimize Data Warehouse Migration to Snowflake
 
Instantly convert Teradata ETL and EDW to Spark- Impetus webinar
Instantly convert Teradata ETL and EDW to Spark- Impetus webinarInstantly convert Teradata ETL and EDW to Spark- Impetus webinar
Instantly convert Teradata ETL and EDW to Spark- Impetus webinar
 
Keys to establish sustainable DW and analytics on the cloud -Impetus webinar
Keys to establish sustainable DW and analytics on the cloud -Impetus webinarKeys to establish sustainable DW and analytics on the cloud -Impetus webinar
Keys to establish sustainable DW and analytics on the cloud -Impetus webinar
 
Solving the EDW transformation conundrum - Impetus webinar
Solving the EDW transformation conundrum - Impetus webinarSolving the EDW transformation conundrum - Impetus webinar
Solving the EDW transformation conundrum - Impetus webinar
 
Anomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleAnomaly detection with machine learning at scale
Anomaly detection with machine learning at scale
 
Keys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of DataKeys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of Data
 
Build Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in MinutesBuild Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in Minutes
 
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
 
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
 
Streaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache SparkStreaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache Spark
 
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationAnomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
 
The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...
 
Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...
 
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxAnomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
 
Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
 


Best practices to build a sustainable data lake on cloud - Impetus Webinar

  • 1. Copyright © 2020 Impetus Technologies, Inc. You are prohibited from making a copy or modification of, or from redistributing, rebroadcasting, or re-encoding of this content without the prior written consent of Impetus. This presentation may include images from other products and services. These images are used for illustrative purposes only. Unless explicitly stated there is no implied endorsement or sponsorship of these products by Impetus. All copyrights and trademarks are property of their respective owners.
  • 2. Best Practices to Build a Sustainable Data Lake on Cloud IMP
  • 3. Our mission Enabling a unified, clear, and present view of your business
  • 4. Agenda Data Lake Essentials The Cloud Advantage Cloud Adoption Strategy Keys to Building a Robust Data Lake on Cloud Q&A
  • 6. Speakers Mita Baxi Solutions Architect Chetan Kalanki Sr. Director Big Data and Cloud Engineering
  • 8. -Gartner The worldwide public cloud service market will grow to $331.2 B in 2022
  • 9. -MarketWatch Data Lakes Market will touch an aggregate of $12.01 B by 2024
  • 10. -Gartner If you have not developed a cloud-first strategy yet, you are likely falling behind your competitors
  • 11. What’s in it for me? Key influencers in building a data lake on cloud Best practices for building a sustainable data lake on cloud
  • 12. What is a Data Lake? Data Lake Database Logs XML Data Spreadsheet Text
  • 13. Data lake: Key functional aspects Ingest Store Process Analyze Consume
  • 14. Data Privacy, Security and Access Management Data Management Data lake: Governance essentials Ingress Data Discovery and Curation Profiling | Classification | Lineage | Prepare Metadata Catalog | Catalog | MDM | Archive Quality Physical | Encryption | Access | Audit Data Discovery, Reporting and Visualization
  • 15. End-to-end big data lake capability view Metadata / Governance Layer Streaming Structured (ODS) / Unstructured Geospatial / Machine / Time Series External Social Raw Layer Processed Layer Landing / Staging Active Archive Common Data Model ODL – Rest / Motion Master Reference Data API Data Mart Stewardship / Policies QuickSight Athena SLA-Based Data Vending Layer Sandbox Stores Post Analytics Store Search ODL – End User Lineage Service Catalog Audit / Monitoring Workflow / DQ Workflow / DQ
  • 17. Success criteria Pace of adoption Footprint Time-to-market Innovation ROI and TCO
  • 18. Cloud adoption cycle Excitement Realization of Complexity, Effort and Impact on Individual Transformation Deployment Chaos Performance Time Integration
  • 20. Critical concerns Knowledge Funding and cost Purchasing Resistance Plan
  • 23. Build a solution from an enterprise view Goal alignment Budgeting Chargebacks ROI assessment Assess reskill needs
  • 24. Focus on value delivery Adopt cloud first commitment Have flexibility to change Use accelerators
  • 25. Build a foundation first Security should be your top priority Address key challenges first Reusable templates Logging and monitoring Financial controls
  • 26. Select the right tooling Data integration Data catalog Data quality Data governance
  • 27. Fail fast Start small Perform proof of concept Extend and enhance Fail fast, recover fast Measure
  • 28. Assess lock-ins Which layer do you want to lock-in? - Tool - Framework - Platform - Cloud - Solution
  • 29. Build for expansion Business expansion Geographical compliances Integrations
  • 30. Build for speed of change DevOps Automate Cloud-native tools Engage experts
  • 31. Know and optimize expenditure – Always Hard cost Soft cost
  • 32. Measure everything you can Usage data Security metrics Cost parameter Skills level
  • 33. Recap
  • 34. Our Services Advisory and Consulting: Cloud Enablement and Migration, Big Data Enablement and Migration, Data Lake, DevOps, Usability, Mobility etc. Architecture, Design and Engineering: Technology Evaluation and Benchmarking, Solution Design and Architecture, Engineering, Quality Assurance, NFRs and Performance Engineering etc. Enterprise Data Management: Data Lake, Data Modelling, Data Migration, Data Visualization, Data Democratization and Governance etc. DevOps and Productionization: Capacity Planning, Infra as Code, Infra Provisioning, Automation and Administration, Ops Support etc. Data Science and Analytics: NLP and NLG, AI and Deep Learning, Descriptive-Prescriptive-Predictive Analytics, Sentiment Analysis etc. User Experience: UX Design and Architecture, Rich Media Design, Mobile App Design, Responsive Design, UX Lab Assessment etc.
  • 35. Impetus cloud practice competencies Amazon Web Services Microsoft Azure Google Cloud Salesforce Advisory, strategy, TCO Architecture evaluation Cloud infrastructure realization Cloud cost optimization Workload assessment and transformation Capacity planning Automation and orchestration Security and governance DevOps Maintenance and administration
  • 36. Thank you. Questions? Visit www.impetus.com or get in touch with us at inquiry@impetus.com