2. SELF INTRODUCTION
➤ Dinh Khac Thanh
➤ thanh@holistics.io
➤ Co-founder and Chief Engineer at Holistics Software, a data
reporting and infrastructure company
➤ Previously a Data Engineer at Revolution Analytics (acquired by
Microsoft), a data science company
3. DATA SCIENTIST
➤ Harvard Business Review: The Sexiest Job of the 21st Century
➤ Data scientist salary (Business Insider)
➤ Facebook: $133,841
➤ Apple: $149,963
➤ Airbnb: $117,229
➤ Twitter: $134,861
➤ Named the best job in the US for 2016
4. WHAT IS DATA SCIENCE?
➤ Examples
➤ Real life
➤ Amazon (recommendation)
➤ Facebook (face recognition)
➤ Google (spam detection, Google Translate, etc.)
➤ Nate Silver
➤ From Revolution Analytics
➤ Seagate
➤ Intel
➤ NLP project
➤ Kaggle https://www.kaggle.com/
7. DATA SCIENCE PROCESS
➤ Getting data
➤ Data scraping
➤ Data collection
➤ Exploring data
➤ Data exploration
➤ Data visualisation
➤ Building models
➤ Machine learning models (SVM, decision trees, deep learning, etc.)
➤ Presenting data
➤ Data visualisation
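The four stages above can be sketched end to end in a few lines of Python. This is only a minimal illustration using the standard library: the `RAW_CSV` data, the column names `ad_spend` and `sales`, and the function names are all hypothetical, and a simple least-squares line stands in for the richer models (SVM, decision trees, deep learning) listed on the slide.

```python
import csv
import io
import statistics

# Hypothetical data standing in for scraped or collected records.
RAW_CSV = """ad_spend,sales
1,3
2,5
3,7
4,9
5,11
"""

def get_data(text):
    """Getting data: parse CSV text into a list of (x, y) float pairs."""
    reader = csv.DictReader(io.StringIO(text))
    return [(float(row["ad_spend"]), float(row["sales"])) for row in reader]

def explore(rows):
    """Exploring data: basic summary statistics for each column."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    return {
        "n": len(rows),
        "mean_x": statistics.mean(xs),
        "mean_y": statistics.mean(ys),
    }

def build_model(rows):
    """Building models: fit y = slope * x + intercept by least squares."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def present(rows, slope, intercept):
    """Presenting data: a text table of actual vs. predicted values."""
    lines = ["x  actual  predicted"]
    for x, y in rows:
        lines.append(f"{x:g}  {y:g}  {slope * x + intercept:g}")
    return "\n".join(lines)

rows = get_data(RAW_CSV)
summary = explore(rows)
slope, intercept = build_model(rows)
report = present(rows, slope, intercept)
print(summary)
print(report)
```

In a real project each function would be replaced by heavier tooling (a scraper or database query, plotting libraries, a machine learning library), but the order of the steps stays the same.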
9. DATA SCIENTISTS’ KNOWLEDGE
➤ Fundamental requirements
➤ Linear Algebra
➤ Multivariable Calculus
➤ Probability and Statistics
➤ Python and/or R
➤ SQL
➤ Excel
➤ Machine learning algorithms (Regression, random forest, boosting, etc.)
➤ Advanced tools
➤ Hadoop/Spark
➤ Data warehouses such as Redshift, Vertica, and Teradata
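To see why the mathematical fundamentals matter, here is a rough sketch of how calculus and regression meet in practice: fitting a line by gradient descent on the mean squared error. The data points, the function names `mse_gradient` and `fit`, and the learning rate and step count are all assumptions chosen for illustration, not a recommended setup.

```python
# Hypothetical points lying exactly on y = 2x + 1.
points = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]

def mse_gradient(w, b, data):
    """Partial derivatives of the mean squared error with respect to w and b."""
    n = len(data)
    grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
    grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
    return grad_w, grad_b

def fit(data, learning_rate=0.05, steps=5000):
    """Repeatedly step against the gradient, starting from w = b = 0."""
    w, b = 0.0, 0.0
    for _ in range(steps):
        grad_w, grad_b = mse_gradient(w, b, data)
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b

w, b = fit(points)
print(round(w, 3), round(b, 3))
```

The partial derivatives come from multivariable calculus, the error being minimised is a statistical quantity, and machine learning libraries run the same idea at scale using linear algebra.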
10. HOW TO BECOME A DATA SCIENTIST?
➤ Learn
➤ Courses
➤ Harvard’s CS109 Data Science course (http://cs109.github.io/2014/)
➤ Coursera
➤ Udacity
➤ Books
➤ http://www.wzchen.com/data-science-books
➤ Practice
➤ Kaggle competitions