Achieving Agility and Scale for Your Data Lake - Talend

Most organizations going through digital transformation need to break down their data silos and make use of both existing and new data sources. Here is how to build a data lake for data change in your organization.

Published in: Technology
  1. @isanuage Achieving Agility and Scale for Your Data Lake. Isabelle Nuage, Product Marketing; Cyril Sonnefraud, Product Management
  2. ©2017 Talend Inc #TalendConnect Poll • Who's using Talend Big Data today? • Who has a data lake in production? • Who is deploying or planning a data lake project within 12 months? • Who is implementing a data lake in the Cloud?
  3. <Digital Transformation Stats> Digital Transformation is No Longer an Option. By end of 2017, > 70% of G500. By 2020, 50% of the G2000. Are You Prepared? But only 26% of organizations. Accenture and Forrester, Digital Transformation in the Age of the Customer study; IDC FutureScape
  4. The Data Lake is the New Digital Backbone • Break down data silos • Structured and unstructured • Granular data • Machine learning
  5. Why Create Data Lakes? Business Value. Reduce costs • Offload EDW • Cheaper storage • Access to archived data
  6. Why Create Data Lakes? Business Value. Reduce costs • Offload EDW • Cheaper storage • Access to archived data. Generating new opportunities • Customer acquisition, retention... • Real-time engagement • Pricing optimization • Demand forecasting • Risk and fraud • Predictive maintenance • Smart products...
  7. Challenges • Complex Technology • Limited Access • Data Swamps. How to achieve Agility & Scale? DATA LAKES
  8. People Doing it the OLD Way...
  9. 2017 Lenovo Internal. All rights reserved. Change is the Only Constant. Business Value over Time: Reporting, Measurement, Business Insights, Optimization, Predictive Analytics, Automation, Prescriptive Analytics, Cognitive Analytics. Pre FY-07, FY-07/10, FY-11/12, FY-13/14, FY-15/17, FY-17/18
  10. The Agile Data Lake • Any innovation • Any platform • Any use case • Any speed • Any user
  11. The Path to Agility: Ingestion + basic visualization, Data Quality, Self-Service, Data Governance, Real-time, Machine Learning
  12. Examples: Smart Data Quality, Smart Data Pipelines
  13. Demo flow: Data Lake Incoming Lead Data (Raw); Amazon EMR Cluster; Data Lake Output Lead Data (Processed) With Segmentation. (1) Ingestion with Smart Data Quality (2) Smart Data Pipeline with Machine Learning
  14. Architecture Guidelines
  15. On-premise Data Lakes: On-Premise Data Sources, Cloud Data Sources; Ingest, Prepare, Process, Access, Consume; Governance; On-prem Data Lake (Processing, Storage)
  16. Hybrid Data Lakes: On-Premise Data Sources, Cloud Data Sources; Ingest, Prepare, Process, Access, Consume; Governance; On-prem Data Lake (Processing, Storage); Cloud Data Lake (Cloud Processing, Cloud Storage); Distribute
  17. Cloud Data Lakes: A Concrete Example. On-Premise Data Sources, Cloud Data Sources; Ingest, Prepare, Process, Access, Consume; Governance; Cloud Processing, Cloud Storage; S3, EMR; Cloud Storage, Cloud Dataflow; Azure DL Store, HDInsight
  18. The Path to Agility: Ingestion + basic visualization, Data Quality, Self-Service, Data Governance, Real-time, Machine Learning
  19. Deliver Value Along The Way • Start with quick wins & business outcome in mind • Get a cadence of constantly delivering value • Focus on game changer value drivers • Get the company onboard
  20. Be Eligible to Win Prizes at the End of the Show!
