DATA SCIENCE
SELF INTRODUCTION
➤ Dinh Khac Thanh
➤ thanh@holistics.io
➤ Co-founder and Chief Engineer at Holistics Software, a data reporting and infrastructure company
➤ Previously a Data Engineer at Revolution Analytics (acquired by Microsoft), essentially a data science company
DATA SCIENTIST
➤ Harvard Business Review: "The Sexiest Job of the 21st Century"
➤ Data scientist salary (Business Insider)
➤ Facebook: $133,841
➤ Apple: $149,963
➤ Airbnb: $117,229
➤ Twitter: $134,861
➤ Best job of 2016 in the US
WHAT IS DATA SCIENCE?
➤ Examples
➤ Real life
➤ Amazon (recommendation)
➤ Facebook (face recognition)
➤ Google (spam detection, Google Translate, etc.)
➤ Nate Silver
➤ From Revolution Analytics
➤ Seagate
➤ Intel
➤ NLP project
➤ Kaggle https://www.kaggle.com/
WHAT IS DATA SCIENCE?
WHAT IS A DATA SCIENTIST?
DATA SCIENCE PROCESS
➤ Getting data
➤ Data scraping
➤ Data collection
➤ Exploring data
➤ Data exploration
➤ Data visualisation
➤ Building models
➤ Machine learning models (SVM, decision trees, deep learning, etc.)
➤ Presenting data
➤ Data visualisation
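
The four steps above can be sketched end to end in a few lines of Python. This is only a minimal illustration under stated assumptions: the data (hours studied versus exam score) is made up, the model is a plain least-squares line rather than one of the models named on the slide, and the function names `fit_line`, `predict`, and `text_chart` are hypothetical helpers, not part of any library.

```python
import statistics

# Step 1: getting data.
# Hypothetical, made-up observations; a real project would scrape or collect them.
hours = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = [52.0, 54.0, 61.0, 63.0, 70.0]

# Step 2: exploring data with simple summary statistics.
mean_hours = statistics.mean(hours)
mean_score = statistics.mean(scores)


# Step 3: building a model.
# Ordinary least squares for a single feature: score = intercept + slope * hours.
def fit_line(xs, ys):
    mx, my = statistics.mean(xs), statistics.mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    return slope, intercept


slope, intercept = fit_line(hours, scores)


def predict(x):
    """Predicted score for x hours, using the fitted line."""
    return intercept + slope * x


# Step 4: presenting data.
# A plain-text bar chart stands in for a real visualisation library.
def text_chart(xs, ys, units_per_mark=2.0):
    lines = []
    for x, y in zip(xs, ys):
        bar = "#" * int(y / units_per_mark)
        lines.append(f"{x:>4.1f} h | {bar} {y:.0f}")
    return lines


if __name__ == "__main__":
    print(f"mean hours = {mean_hours:.1f}, mean score = {mean_score:.1f}")
    print(f"fitted line: score = {intercept:.1f} + {slope:.1f} * hours")
    print("\n".join(text_chart(hours, scores)))
```

In practice each step would use a dedicated tool (a scraper or database query, a plotting library, a machine learning library), but the shape of the workflow stays the same.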
DATA SCIENTISTS’ KNOWLEDGE
➤ Fundamental requirements
➤ Linear Algebra
➤ Multivariable Calculus
➤ Probability and Statistics
➤ Python and/or R
➤ SQL
➤ Excel
➤ Machine learning algorithms (Regression, random forest, boosting, etc.)
➤ Advanced tools
➤ Hadoop/Spark
➤ Data warehouses (DW) such as Redshift, Vertica, Teradata, etc.
HOW TO BECOME A DATA SCIENTIST?
➤ Learn
➤ Courses
➤ Harvard’s Data Science (http://cs109.github.io/2014/)
➤ Coursera
➤ Udacity
➤ Books
➤ http://www.wzchen.com/data-science-books
➤ Practice
➤ Kaggle competitions
QUESTIONS AND ANSWERS
