Hal’s Headache               Data Tuesday       02/25/2013       Florian Douetteau
Meet Hal Alowne                                        Dim Sum                                 ‟                          ...
CHOOSE TECHNOLOGYNoSQL-Slavia                               Scalability Central                Machine LearningElastic Sea...
LEARN MACHINE                           LEARNING STUFF Try to understand              Find People that understand machin...
DO IT                     Open Data           StormMegabytes                     CRM                 Hadoop               ...
MERIT = TIME + ROI  TIME : 6 MONTHS                                          ROI : APPS2013                               ...
DataikuOne Goal                           One platform with an open source core‟Help you build your data lab in     less t...
3/8/2013   8Dataiku - Data Tuesday
Upcoming SlideShare
Loading in …5
×

8 douetteau-dataiku-datatuesdayopensource-130228102749-phpapp01

476 views
383 views

Published on

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
476
On SlideShare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
2
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

8 douetteau-dataiku-datatuesdayopensource-130228102749-phpapp01

  1. 1. Hal’s Headache Data Tuesday  02/25/2013  Florian Douetteau
  2. 2. Meet Hal Alowne Dim Sum ‟ CEO & Founder Dim’s Private Showroom Hey Hal ! We need a big data platform like the big guys. ” Hal Alowne BI Manager Let’s just do as they do! Dim’s Private ShowroomEuropean E-commerce Web site Big Guys• 100M$ Revenue Big Data • 10B$+ Revenue• 1 Million customer Copy Cat • 100M+ customers• 1 Data Analyst (Hal Himself) Project • 100+ Data ScientistDataiku - Data Tuesday 3/8/2013 2
  3. 3. CHOOSE TECHNOLOGYNoSQL-Slavia Scalability Central Machine LearningElastic Search Mystery Land Hadoop Scikit-LearnSOLR Ceph MongoDB Cassandra Sphere Mahout WEKARiak MLBase Membase SparkSQL Colunnar RepublicInfiniDB RapidMiner R LucidDB Pig PandaImpala Hive D3 Cascading Statistician Old Crossfilter Talend House Vizualization County Data Clean Wasteland Dataiku - Data Tuesday 3/8/2013 3
  4. 4. LEARN MACHINE LEARNING STUFF Try to understand  Find People that understand machine learning myself and all this stuff Dataiku - Data Tuesday 3/8/2013 4
  5. 5. DO IT Open Data StormMegabytes CRM Hadoop RGigabytes Elastic Search Web LogsTerabytes SQL D3  Connect things together  Pour Data in  Clean Data  Fix the leaks  Start again Dataiku - Data Tuesday 3/8/2013 5
  6. 6. MERIT = TIME + ROI TIME : 6 MONTHS ROI : APPS2013 2014 Targeted Find the right Choose the Make it work Newsletter people technology (6 months?) (6 months?) (6 months?) Recommender 2013 System Build the lab (6 months) • Train People • Reuse working patterns Dynamic Pricing  Build a lab in 6 months  Deploy apps (rather than 18 months) that actually deliver value Dataiku - Data Tuesday 3/8/2013 6
  7. 7. DataikuOne Goal One platform with an open source core‟Help you build your data lab in less than six months Export Predictions Manage datasets and transformations Impact Flow Feedback Doctor Continuous Loopback Diagnose Data ” all-in-one data scientists D1 Shaker Prepare Data distributionOne fake customer A few real ones Data Is Money Dataiku - Data Tuesday 3/8/2013 7
  8. 8. 3/8/2013 8Dataiku - Data Tuesday

×