Dataiku, Pitch at Data-Driven NYC, New York City, September 17th 2013
 

Our pitch at Data-Driven NYC meetup on September 17th (http://datadrivennyc.com).

A talk about data scientists' pain points and how Dataiku Data Science Studio can help them be more than Data Cleaners and Data Leak Fixers!

Usage Rights

© All Rights Reserved


    Presentation Transcript

    • DATA SCIENTIST is NOT a defined term
      This is not… a Data Scientist
      www.dataiku.com - @dataiku - @battymarc
    • MACHINE LEARNING EXPERT
    • DATA CLEANER
    • DATA LEAK FIXER
    • END OF (HADOOP) JOB DATA WAITER
    • How can we HELP DATA SCIENTISTS to FOCUS on the REAL PROBLEMS?
    • Pain points
      • Data preparation is time-consuming
      • Machine learning is hard to understand
      • Insights and models (almost) never reach production
    • Data Science Studio
      • A democratic & ready to use Data Science Studio to start innovating with data!
      • Ready to Use Data Science Platform
      • Common playground for innovation
      • Accessible Statistics & Machine Learning for everyone
      • Handle real-life data
    • Data Science Studio
      • Visual and Interactive Data Preparation: For Data Cleaners
      • Guided Machine Learning: For non Machine Learning Experts
      • Production ready: For Data Leak Fixers
    • Visual Data Preparation
    • Visual Data Preparation
      • Interactive UI with instant feedback and suggestions
      • Reversibility of the script, data integrity
      • Exploration of data: quick analysis, facets
      • Cleansing: missing values, outliers, parsing
      • Enrichment: GeoIP, Holidays, joins
      • Production-ready: integration within a flow
    • Guided Machine Learning
    • Production & orchestration
    • Data Science Studio: benefits
      • Real-time and interactive
        – Transformation effects can be previewed in real time
      • Transparent and traceable
        – Keep the full history of your data transformation logic and model designs
      • Easy access to machine learning
        – Get started with our app templates, bootstrap your model and feature selection, then go further!
      • Scalable and Production Ready
        – Apply your recipes on your cluster on terabytes of data
    • Dataiku at a glance
      • Founded in 2013 by Data and Search Engine veterans
      • From “data” and “haïku”
        “data can be big
        solution would be small
        feel the hot wind”
      • 1 goal: make Data Science accessible to anyone!
      Contact: marc.batty@dataiku.com - @battymarc - github.com/dataiku
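
The "Visual Data Preparation" slide mentions a reversible script with cleansing steps (parsing, missing values, outliers) that leaves the data intact. As a rough illustration of that idea only, and not Dataiku's actual API, the sketch below models a preparation script as an ordered list of steps that is replayed over untouched raw rows. Every name here (`Step`, `parse_number`, `fill_missing`, `clip_outliers`, `run_script`) is hypothetical.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical sketch: a "script" is an ordered list of named steps.
# Each step returns new rows and never mutates its input, so removing
# a step and replaying the script is enough to undo it.

@dataclass(frozen=True)
class Step:
    name: str
    apply: Callable[[list], list]

def parse_number(column: str) -> Step:
    """Parsing: turn numeric strings into floats; empty strings become None."""
    def run(rows: list) -> list:
        out = []
        for r in rows:
            v = r.get(column)
            out.append({**r, column: float(v) if v not in (None, "") else None})
        return out
    return Step(f"parse_number({column})", run)

def fill_missing(column: str, default: float) -> Step:
    """Missing values: replace None with a default."""
    def run(rows: list) -> list:
        return [{**r, column: default if r.get(column) is None else r[column]}
                for r in rows]
    return Step(f"fill_missing({column})", run)

def clip_outliers(column: str, low: float, high: float) -> Step:
    """Outliers: clamp values into the range [low, high]."""
    def run(rows: list) -> list:
        return [{**r, column: min(max(r[column], low), high)} for r in rows]
    return Step(f"clip_outliers({column})", run)

def run_script(rows: list, steps: list) -> list:
    """Replay every step in order, starting from the raw rows."""
    for step in steps:
        rows = step.apply(rows)
    return rows

# Made-up sample data, for illustration only.
raw = [
    {"user": "a", "amount": "12.5"},
    {"user": "b", "amount": ""},
    {"user": "c", "amount": "9000"},
]
script = [
    parse_number("amount"),
    fill_missing("amount", 0.0),
    clip_outliers("amount", 0.0, 100.0),
]

cleaned = run_script(raw, script)
undone = run_script(raw, script[:-1])  # "undo" the last step by replaying without it
history = [s.name for s in script]     # the script doubles as a traceable history
```

Because the raw rows are never modified, the list of step names also serves as the kind of transformation history the benefits slide describes; a real tool would additionally persist the script and run it on a cluster.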