This talk covers what data science is and how
you can start working with data in
your role. It is for anyone interested in data
science who is unsure how to
begin. Learn the core
concepts of data science and how you can
start learning it pain-free!
2. About Me
● Data Scientist @ CometML
● MS in Data Science from Regis University
● Teaching Explainable ML
● Author of Getting Started in Data Science
● Currently writing Uncovering Bias in
Machine Learning
4. What is Data Science?
Data science is an interdisciplinary field that uses (somewhat) scientific methods, processes,
algorithms and systems to extract knowledge and insights from structured and
unstructured data.
6. What Data Projects Include
Identify a Problem
Assess the org’s incentives
Gather & clean data
Data documentation
Exploratory Analysis
Inferential Statistics
Data Storytelling
Harm identification and mitigation
Creating ML Models
Building User Recourse Frameworks
15. Goals
● Predict future events given past data
● Find anomalies in our datasets
● Make recommendations based on someone’s interests
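As one illustration of the "find anomalies" goal, here is a minimal sketch using Tukey's interquartile-range rule. The `daily_orders` numbers and the `find_anomalies` helper are hypothetical, made up for this example; real projects often need a method suited to their own data.

```python
import statistics


def find_anomalies(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical daily order counts with one suspicious spike.
daily_orders = [102, 98, 105, 110, 97, 101, 430, 99, 104, 100]
anomalies = find_anomalies(daily_orders)
print(anomalies)
```

A flagged value is only a candidate: whether it is an error or a real event still takes domain knowledge to decide.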
16. Methods
1. Clean data so it's in a format we can model
2. Understand data distributions to inform model selection
3. Perform Exploratory Data Analysis to grasp data
4. Choose modeling techniques that help us solve problems
5. Measure how well our models perform and optimize them
6. Iterate!
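The steps above can be sketched end to end on a toy dataset. Everything here is an assumption made for illustration: the `raw` records are invented, a least-squares line stands in for "a modeling technique", and mean absolute error against a predict-the-mean baseline stands in for "measure".

```python
import statistics

# Hypothetical records: (hours_studied, exam_score); None marks a missing value.
raw = [(1, 52), (2, 55), (3, None), (4, 61), (5, 66),
       (6, 68), (None, 70), (8, 75), (9, 79), (10, 83)]

# 1. Clean: drop records with a missing field.
clean = [(x, y) for x, y in raw if x is not None and y is not None]

# 2-3. Look at the data before modeling.
scores = [y for _, y in clean]
print("score mean:", statistics.fmean(scores), "stdev:", statistics.stdev(scores))


# 4. Choose a model: a straight line fitted by ordinary least squares.
def fit_line(points):
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in points)
             / sum((x - mean_x) ** 2 for x in xs))
    return slope, mean_y - slope * mean_x


# Hold out the last two records so the model is scored on unseen data.
train, test = clean[:-2], clean[-2:]
slope, intercept = fit_line(train)


# 5. Measure: mean absolute error, compared with a naive baseline.
def mae(points, predict):
    return statistics.fmean(abs(y - predict(x)) for x, y in points)


train_mean = statistics.fmean(y for _, y in train)
model_mae = mae(test, lambda x: slope * x + intercept)
baseline_mae = mae(test, lambda x: train_mean)
print("model MAE:", model_mae, "baseline MAE:", baseline_mae)
```

Step 6, iterating, would mean going back to change the cleaning rule, the model, or the metric and comparing the results again.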
17. Exploratory What?
In statistics, exploratory data analysis is an approach to analyzing data sets to summarize their
main characteristics, often with visual methods. A statistical model can be used or not, but
primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis
testing task.
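A minimal sketch of what a first EDA pass can look like, using only Python's standard library. The `ages` values are hypothetical, and the text histogram is a stand-in for the charts a plotting library would normally draw.

```python
import statistics
from collections import Counter

# Hypothetical sample: ages of ten survey respondents.
ages = [23, 25, 31, 35, 35, 38, 41, 44, 52, 67]

# Summarize the main characteristics of the column.
summary = {
    "count": len(ages),
    "min": min(ages),
    "median": statistics.median(ages),
    "mean": statistics.fmean(ages),
    "max": max(ages),
    "stdev": statistics.stdev(ages),
}
for name, value in summary.items():
    print(f"{name:>6}: {value:.1f}")

# A rough visual: count values per decade and draw one '#' per value.
bins = Counter((age // 10) * 10 for age in ages)
for start in sorted(bins):
    print(f"{start}-{start + 9}: {'#' * bins[start]}")
```

Comparing the mean with the median, and glancing at the histogram shape, is often enough to spot skew before any formal model is chosen.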
25. Cleaning & Manipulating Data
Grasp the basic techniques
Build intuition for when to use certain methods
Understand pros and cons of each
Tools:
Excel
Python & R
SQL
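A few of the basic cleaning techniques can be sketched in Python as follows. The CSV content, the column names, and the `clean_rows` helper are all hypothetical; the same steps could be done in Excel, R, or SQL.

```python
import csv
import io

# Hypothetical messy export: stray spaces, inconsistent casing,
# a missing income, and a duplicate row.
raw_csv = """name,city,income
 Ada ,denver,52000
Grace,Boulder,
Ada,Denver,52000
Linus, boulder ,61000
"""


def clean_rows(text):
    """Trim whitespace, normalize casing, parse numbers, drop duplicates."""
    rows = []
    seen = set()
    for row in csv.DictReader(io.StringIO(text)):
        name = row["name"].strip()
        city = row["city"].strip().title()
        income_text = row["income"].strip()
        # Keep missing values as None instead of guessing a number.
        income = int(income_text) if income_text else None
        key = (name, city, income)
        if key in seen:
            continue  # an exact duplicate after normalization
        seen.add(key)
        rows.append({"name": name, "city": city, "income": income})
    return rows


cleaned = clean_rows(raw_csv)
for row in cleaned:
    print(row)
```

Each choice has a trade-off: keeping a missing income as `None` preserves the row but pushes the decision to later analysis, while dropping the row would be simpler but loses data.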
30. Communicate Your Value
How have you impacted past businesses?
How would your relevant projects help a company?
Do you know how to quantify your value?
31. GitHub
Showing off code projects
Connecting with other developers
Collaborating and proving technical skills
32. Blog / Personal Website
Share Expertise
Show off Portfolio
Provide insight into your thought process
33. Thank You!
25% off Getting Started in Data Science
Code: VBROWNBAG
@DataSciBae
ayodeleodubela.com