4. Data Scientist asks relevant
real world questions
Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
And hopefully,
discovers
actionable
recommendations
from data
6. WHAT IS
PYTHON?
“THE NAME PYTHON COMES
FROM THE SURREAL BRITISH
COMEDY GROUP MONTY PYTHON,
NOT FROM THE SNAKE. PYTHON
PROGRAMMERS ARE
AFFECTIONATELY CALLED
PYTHONISTAS, AND BOTH MONTY
PYTHON AND SERPENTINE
REFERENCES USUALLY PEPPER
PYTHON TUTORIALS AND
DOCUMENTATION.”
Automate the Boring Stuff with Python
10. When is data ready and
prepared for analysis ?
Image source: http://blog.kaggle.com/2016/07/21/approaching-almost-any-machine-learning-problem-abhishek-thakur/
13. Pandas: Python Data Analysis
Library
Import pandas library
Reading/Writing Data
Series
DataFrame
Selecting Internal Elements
Assigning Values to Elements
14. Pandas: Python Data Analysis
Library
Evaluating Values (unique, isin, value_counts,
NaN)
Filtering Values
Transpose
Operations between DataFrame and Series
Statistics Functions, Correlation/Covariance
15. Scikit-learn & ML Basics
... learning from experience either
with or without supervision of
humans
Mastering Machine Learning with scikit-learn
16. ML Flow
Image source: http://blog.kaggle.com/2016/07/21/approaching-almost-any-machine-learning-problem-abhishek-thakur/
18. A bit of Big Data Processing
Source: Python Data Analytics
19. Creative Commons License
Python in Data Science Work by Rick
Bahague is licensed under a Creative Commons
Attribution-NonCommercial-ShareAlike 4.0
International License.
Based on a work at https://medium.com/
@rbahaguejr.
Permissions beyond the scope of this license
may be available at https://medium.com/
@rbahaguejr.