Good evening everyone. I’m Marc Batty, cofounder of DATAIKU. I would like to speak about Data Science tonight. Even though Data Scientist is a buzzword these days, almost nobody knows what they do!
You may think about the Machine Learning expert. He is going to answer all your business questions and maybe save the world.
But we mostly see a lot of Data Cleaners, and they’re not quite happy about their jobs.
Or also the Data Leak Fixer, you know, when you have to do all the plumbing between all your databases and Hadoop clusters.
And even the Data Waiter, waiting for his endless Hadoop job to finish before getting the first insight on his data.
So the question is …
They all have this in common: They spend too much time preparing their data to go from raw data to usable data. Machine learning is hard to understand if you don’t have a PhD in statistics. In most companies, insights and models (almost) never reach production because it’s hard to integrate all the required big data technologies.
So at DATAIKU we built a Data Science Studio. It’s a ready-to-use Data Science platform with all the tools you need to create your Data Science Apps. It’s accessible, so you don’t need to be an experienced Data Scientist to start building models. It’s a common playground for your team, who can share datasets, models and insights.
In our studio we’ve got a whole range of tools to help all Data Scientists be more productive. Visual Data Preparation for Data Cleaners, for instant feedback. Guided Machine Learning for non Machine Learning experts, to quickly start building models. Production tools to integrate all the required Big Data technologies. Now Data Scientists can focus on being innovative and creative with their data.
Dataiku has 1 goal: make Data Science accessible to anyone. I’ll be happy to continue this discussion and show you a demo after this pitch, so don’t hesitate to come see me. Thank you very much.
Dataiku, Pitch Data Innovation Night, Boston, September 16th
a defined term This is not… a Data Scientist
www.dataiku.com - @dataiku
How can we
REAL PROBLEMS ?
• Data preparation is time-consuming
• Machine learning is hard to understand
• Insights and models (almost) never reach production
Data Science Studio
• A democratic & ready to use Data Science
Studio to start innovating with data!
Ready to Use Data
Common playground for
Accessible Statistics &
Machine Learning for
Handle real-life data
Data Science Studio
Visual and Interactive Data
For Data Cleaners
Guided Machine Learning
For non Machine Learning Experts
For Data Leak Fixers
Dataiku at a glance
• Founded in 2013 by Data and Search Engine veterans
• From “data” and “haïku”
“data can be big
solution would be small
feel the hot wind”
• 1 goal: make Data Science accessible to anyone!
Contact: firstname.lastname@example.org / @battymarc