SlideShare a Scribd company logo
1 of 33
Download to read offline
Intro to Python for DSIntro to Python for DS
2 birds. One stone.
 
WIFI: MakeO ces 5Ghz
Password: Internet!23
 
http://bit.ly/thinkful-dc-python
TJ Stalcup
Lead DC Mentor @Thinkful
API Evangelist @540
Pokemon Master
About UsAbout Us
***see slide***
Speaker notes
What's your name?
What do you do?
Why are you interested in data science or python?
About youAbout you
Online Bootcamp since 2012. We have worked
with over 6000 students around the world
paired up with over 400 mentors. 
 
We get you ready for a career and guarantee
your rst job
 
92% success rate
About ThinkfulAbout Thinkful
Local DC Crew
 
Learn why DS is a thing
 
What is Python
 
How do we use it with a real world project?
 
How do I learn more?
TONIGHT: Learn Python by DoingTONIGHT: Learn Python by Doing
What is a Data Scientist?What is a Data Scientist?
Example: LinkedIn 2006Example: LinkedIn 2006
“[LinkedIn] was like arriving at a conference reception
and realizing you don’t know anyone. So you just stand
in the corner sipping your drink—and you probably
leave early.”
-LinkedIn Manager, June 2006
Enter: Jonathan GoldmanEnter: Jonathan Goldman
Data Scientist
Joined LinkedIn in 2006, only 8M users (450M in 2016)
Started experiments to predict people’s networks
Engineers were dismissive: “you can already import your address
book”
DS ProcessDS Process
Frame the question
Collect the raw data
Process the data
Explore the data
Communicate results
Frame the QuestionFrame the Question
What questions do we want to answer?
What connections (type and number) lead to higher
user engagement?
Which connections do people want to make but are
currently limited from making?
How might we predict these types of connections with
limited data from the user?
Collect the DataCollect the Data
What data do we need to answer these questions?
Connection data (who is who connected to?)
Demographic data (what is the pro le of the
connection)
Engagement data (how do they use the site)
Process the DataProcess the Data
How is the data “dirty” and how can we clean it?
• User input
• Redundancies
• Feature changes
• Data model changes
Explore the DataExplore the Data
What are the meaningful patterns in the data?
• Triangle closing
• Time overlaps
• Geographic overlaps
Communicate FindingsCommunicate Findings
How do we communicate this? To whom?
Marketing - this will enable us to sell X more ad space. Results in X
more impressions per day
Product - this will allow us to build X more features
Development - this will allow us to grow our team by X
Sales - this will attract X more premium accounts
C-Level - this will result in $$$ more revenue
8M - 450M in 10 years
The ResultThe Result
 
 
Career Whack-A-Mole
Why DS now?Why DS now?
Big Data: datasets whose size is beyond the
ability of typical database software tools to
capture, store, manage, and analyze
Big DataBig Data
Trend "started" in 2005
Web 2.0 - Majority of content is created by users
Mobile accelerates this — data/person skyrockets
The Data ProblemThe Data Problem
We are generating more data
every year than existed
before.........
The SolutionThe Solution
There goes my hero....
 
watch 'em as they code....
Just need to do everything....Just need to do everything....
Just need to do everything....Just need to do everything....
Knowledge of statistics, algorithms, & software
Comfort with languages & tools (Python, SQL, Tableau)
Inquisitiveness and intellectual curiosity
Strong communication skills
It’s all Teachable!
Coming Soon....Coming Soon....
Intro to SQL
Intro to Tableau
Intro to Statistics
 
http://meetup.com/Thinkful-DC
Let's Learn Python TonightLet's Learn Python Tonight
Python for Programming
Great for Data Science
Robotics
Web Development (Python/Django)
Automation
Let's Learn Python TonightLet's Learn Python Tonight
firstName = 'TJ'
lastName = "Stalcup"
age = 34 // wow, much old
print firstName // TJ
print firstName + lastName // TJStalcup
print firstName + ' ' + lastName // TJ Stalcup
print lastName + ', ' + firstName // Stalcup, TJ
print age * 2 // 68, hopefully retired
def greet(name):
print 'Hello', name
greet('Jack') // Hello, Jack
greet('Jill') // Hello, Jill
greet('Bob') // Hello, Bob
greet(firstName) // Hello, TJ
greet(firstName + ' ' + lastName) // Hello, TJ Stalcup
The ModelThe Model
Our model is going to be a Decision Tree.
 
Decision trees predict the most likely outcome based on input.
 
You can think of it like a computer building a version of 20 questions.
Decision Trees - Golf?Decision Trees - Golf?
The NotebookThe Notebook
We're going to use a Google hosted Python to build this
model. This app is called Colaboratory (Collaboration + Laboratory)
 
http://colab.research.google.com
 
New Notebook > New Python3 Notebook
notebook
ShortcomingsShortcomings
Our model has a few weaknesses:
 
-Limited inputs
-Assumptions
Data Science @ ThinkfulData Science @ Thinkful
Flexible, project-based curriculum to help you become the data
scientist you want to be
You don’t just learn skills, you get to make things
Mentor support from experts in the industry
Also, there's a job guarantee
Link for the third party audit jobs report:
https://www.thinkful.com/bootcamp-jobs-stats
Thinkful Graduates 92% Job Placement Rate
Learning Mentor
Career MentorProgram Manager
Local Community
You
Unprecedented SupportUnprecedented Support
http://bit.ly/dc-ds-trial
Initial 2-week trial course
Start with Python and Statistics
Unlimited Q&A Sessions
Option to continue with full bootcamp
Financing & scholarships available
O er valid for tonight only
Aaron Lamphere
Trial Program Manager
 
Thinkful Two Week TrialThinkful Two Week Trial

More Related Content

What's hot

Cheat sheets for data scientists
Cheat sheets for data scientistsCheat sheets for data scientists
Cheat sheets for data scientists
Ajay Ohri
 
Promoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices EnvironmentPromoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices Environment
PyData
 
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkReproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Adaryl "Bob" Wakefield, MBA
 

What's hot (20)

Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Unit 3 part 2
Unit  3 part 2Unit  3 part 2
Unit 3 part 2
 
Introduction to Data Science (Data Science Thailand Meetup #1)
Introduction to Data Science (Data Science Thailand Meetup #1)Introduction to Data Science (Data Science Thailand Meetup #1)
Introduction to Data Science (Data Science Thailand Meetup #1)
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist?
 
Big Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsBig Data Science: Intro and Benefits
Big Data Science: Intro and Benefits
 
Data Science: Past, Present, and Future
Data Science: Past, Present, and FutureData Science: Past, Present, and Future
Data Science: Past, Present, and Future
 
Agile Data Science
Agile Data ScienceAgile Data Science
Agile Data Science
 
Visualization of Information (ProQuest)
Visualization of Information (ProQuest)Visualization of Information (ProQuest)
Visualization of Information (ProQuest)
 
Data Science: Not Just For Big Data
Data Science: Not Just For Big DataData Science: Not Just For Big Data
Data Science: Not Just For Big Data
 
Big Data and Data Science for traditional Swiss companies
Big Data and Data Science for traditional Swiss companiesBig Data and Data Science for traditional Swiss companies
Big Data and Data Science for traditional Swiss companies
 
Data science and_analytics_for_ordinary_people_ebook
Data science and_analytics_for_ordinary_people_ebookData science and_analytics_for_ordinary_people_ebook
Data science and_analytics_for_ordinary_people_ebook
 
Session 10 handling bigger data
Session 10 handling bigger dataSession 10 handling bigger data
Session 10 handling bigger data
 
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
 
Cheat sheets for data scientists
Cheat sheets for data scientistsCheat sheets for data scientists
Cheat sheets for data scientists
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Promoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices EnvironmentPromoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices Environment
 
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkReproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Embracing the Monolith
Embracing the MonolithEmbracing the Monolith
Embracing the Monolith
 

Similar to Intro to Python for Data Science

Similar to Intro to Python for Data Science (20)

Intro to Python for Data Science
Intro to Python for Data ScienceIntro to Python for Data Science
Intro to Python for Data Science
 
Tf itpbapm
Tf itpbapmTf itpbapm
Tf itpbapm
 
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
 
Blacks In Technology for BMI Technology Workshop Presentation
Blacks In Technology for BMI Technology Workshop PresentationBlacks In Technology for BMI Technology Workshop Presentation
Blacks In Technology for BMI Technology Workshop Presentation
 
Blacks In Technology BMI Tech Workshop preso
Blacks In Technology BMI Tech Workshop presoBlacks In Technology BMI Tech Workshop preso
Blacks In Technology BMI Tech Workshop preso
 
Final Blacks In Tech /BMI presentation
Final Blacks In Tech /BMI presentationFinal Blacks In Tech /BMI presentation
Final Blacks In Tech /BMI presentation
 
IIPGH Webinar 1: Getting Started With Data Science
IIPGH Webinar 1: Getting Started With Data ScienceIIPGH Webinar 1: Getting Started With Data Science
IIPGH Webinar 1: Getting Started With Data Science
 
Tf itpbapm
Tf itpbapmTf itpbapm
Tf itpbapm
 
Introduction to python
Introduction to pythonIntroduction to python
Introduction to python
 
Analyzing social media with Python and other tools (1/4)
Analyzing social media with Python and other tools (1/4)Analyzing social media with Python and other tools (1/4)
Analyzing social media with Python and other tools (1/4)
 
Cloud AI GenAI Overview.pptx
Cloud AI GenAI Overview.pptxCloud AI GenAI Overview.pptx
Cloud AI GenAI Overview.pptx
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Deep Learning Lightning Talk
Deep Learning Lightning TalkDeep Learning Lightning Talk
Deep Learning Lightning Talk
 
Big Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-AriBig Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-Ari
 
Tactical Information Gathering
Tactical Information GatheringTactical Information Gathering
Tactical Information Gathering
 
Industry of Things World - Berlin 19-09-16
Industry of Things World - Berlin 19-09-16Industry of Things World - Berlin 19-09-16
Industry of Things World - Berlin 19-09-16
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
 
The New IT
The New ITThe New IT
The New IT
 
Big Data Oslo v 4 Sci Code: "Current Industry Projects within AI and the Best...
Big Data Oslo v 4 Sci Code: "Current Industry Projects within AI and the Best...Big Data Oslo v 4 Sci Code: "Current Industry Projects within AI and the Best...
Big Data Oslo v 4 Sci Code: "Current Industry Projects within AI and the Best...
 
Continuum Analytics and Python
Continuum Analytics and PythonContinuum Analytics and Python
Continuum Analytics and Python
 

More from TJ Stalcup

More from TJ Stalcup (20)

Intro to JavaScript - Thinkful DC
Intro to JavaScript - Thinkful DCIntro to JavaScript - Thinkful DC
Intro to JavaScript - Thinkful DC
 
Frontend Crash Course
Frontend Crash CourseFrontend Crash Course
Frontend Crash Course
 
Build Your Own Website - Intro to HTML & CSS
Build Your Own Website - Intro to HTML & CSSBuild Your Own Website - Intro to HTML & CSS
Build Your Own Website - Intro to HTML & CSS
 
Intro to Python
Intro to PythonIntro to Python
Intro to Python
 
Intro to Python
Intro to PythonIntro to Python
Intro to Python
 
Predict the Oscars using Data Science
Predict the Oscars using Data SciencePredict the Oscars using Data Science
Predict the Oscars using Data Science
 
Thinkful DC - Intro to JavaScript
Thinkful DC - Intro to JavaScriptThinkful DC - Intro to JavaScript
Thinkful DC - Intro to JavaScript
 
Data Science Your Vacation
Data Science Your VacationData Science Your Vacation
Data Science Your Vacation
 
Data Science Your Vacation
Data Science Your VacationData Science Your Vacation
Data Science Your Vacation
 
Build a Game with Javascript
Build a Game with JavascriptBuild a Game with Javascript
Build a Game with Javascript
 
Thinkful DC FrontEnd Crash Course - HTML & CSS
Thinkful DC FrontEnd Crash Course - HTML & CSSThinkful DC FrontEnd Crash Course - HTML & CSS
Thinkful DC FrontEnd Crash Course - HTML & CSS
 
Build Your Own Instagram Filters
Build Your Own Instagram FiltersBuild Your Own Instagram Filters
Build Your Own Instagram Filters
 
Choosing a Programming Language
Choosing a Programming LanguageChoosing a Programming Language
Choosing a Programming Language
 
Frontend Crash Course
Frontend Crash CourseFrontend Crash Course
Frontend Crash Course
 
Thinkful FrontEnd Crash Course - HTML & CSS
Thinkful FrontEnd Crash Course - HTML & CSSThinkful FrontEnd Crash Course - HTML & CSS
Thinkful FrontEnd Crash Course - HTML & CSS
 
Thinkful FrontEnd Crash Course - HTML & CSS
Thinkful FrontEnd Crash Course - HTML & CSSThinkful FrontEnd Crash Course - HTML & CSS
Thinkful FrontEnd Crash Course - HTML & CSS
 
Build a Virtual Pet with JavaScript
Build a Virtual Pet with JavaScriptBuild a Virtual Pet with JavaScript
Build a Virtual Pet with JavaScript
 
Intro to Javascript
Intro to JavascriptIntro to Javascript
Intro to Javascript
 
DC jQuery App
DC jQuery AppDC jQuery App
DC jQuery App
 
Thinkful DC - Intro to JavaScript
Thinkful DC - Intro to JavaScriptThinkful DC - Intro to JavaScript
Thinkful DC - Intro to JavaScript
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Recently uploaded (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 

Intro to Python for Data Science

  • 1. Intro to Python for DSIntro to Python for DS 2 birds. One stone.   WIFI: MakeO ces 5Ghz Password: Internet!23   http://bit.ly/thinkful-dc-python
  • 2. TJ Stalcup Lead DC Mentor @Thinkful API Evangelist @540 Pokemon Master About UsAbout Us
  • 4. What's your name? What do you do? Why are you interested in data science or python? About youAbout you
  • 5. Online Bootcamp since 2012. We have worked with over 6000 students around the world paired up with over 400 mentors.    We get you ready for a career and guarantee your rst job   92% success rate About ThinkfulAbout Thinkful Local DC Crew
  • 6.   Learn why DS is a thing   What is Python   How do we use it with a real world project?   How do I learn more? TONIGHT: Learn Python by DoingTONIGHT: Learn Python by Doing
  • 7. What is a Data Scientist?What is a Data Scientist?
  • 8. Example: LinkedIn 2006Example: LinkedIn 2006 “[LinkedIn] was like arriving at a conference reception and realizing you don’t know anyone. So you just stand in the corner sipping your drink—and you probably leave early.” -LinkedIn Manager, June 2006
  • 9. Enter: Jonathan GoldmanEnter: Jonathan Goldman Data Scientist Joined LinkedIn in 2006, only 8M users (450M in 2016) Started experiments to predict people’s networks Engineers were dismissive: “you can already import your address book”
  • 10. DS ProcessDS Process Frame the question Collect the raw data Process the data Explore the data Communicate results
  • 11. Frame the QuestionFrame the Question What questions do we want to answer? What connections (type and number) lead to higher user engagement? Which connections do people want to make but are currently limited from making? How might we predict these types of connections with limited data from the user?
  • 12. Collect the DataCollect the Data What data do we need to answer these questions? Connection data (who is who connected to?) Demographic data (what is the pro le of the connection) Engagement data (how do they use the site)
  • 13. Process the DataProcess the Data How is the data “dirty” and how can we clean it? • User input • Redundancies • Feature changes • Data model changes
  • 14. Explore the DataExplore the Data What are the meaningful patterns in the data? • Triangle closing • Time overlaps • Geographic overlaps
  • 15. Communicate FindingsCommunicate Findings How do we communicate this? To whom? Marketing - this will enable us to sell X more ad space. Results in X more impressions per day Product - this will allow us to build X more features Development - this will allow us to grow our team by X Sales - this will attract X more premium accounts C-Level - this will result in $$$ more revenue 8M - 450M in 10 years
  • 17. Why DS now?Why DS now? Big Data: datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze
  • 18. Big DataBig Data Trend "started" in 2005 Web 2.0 - Majority of content is created by users Mobile accelerates this — data/person skyrockets
  • 19. The Data ProblemThe Data Problem We are generating more data every year than existed before.........
  • 20. The SolutionThe Solution There goes my hero....   watch 'em as they code....
  • 21. Just need to do everything....Just need to do everything....
  • 22. Just need to do everything....Just need to do everything.... Knowledge of statistics, algorithms, & software Comfort with languages & tools (Python, SQL, Tableau) Inquisitiveness and intellectual curiosity Strong communication skills It’s all Teachable!
  • 23. Coming Soon....Coming Soon.... Intro to SQL Intro to Tableau Intro to Statistics   http://meetup.com/Thinkful-DC
  • 24. Let's Learn Python TonightLet's Learn Python Tonight Python for Programming Great for Data Science Robotics Web Development (Python/Django) Automation
  • 25. Let's Learn Python TonightLet's Learn Python Tonight firstName = 'TJ' lastName = "Stalcup" age = 34 // wow, much old print firstName // TJ print firstName + lastName // TJStalcup print firstName + ' ' + lastName // TJ Stalcup print lastName + ', ' + firstName // Stalcup, TJ print age * 2 // 68, hopefully retired def greet(name): print 'Hello', name greet('Jack') // Hello, Jack greet('Jill') // Hello, Jill greet('Bob') // Hello, Bob greet(firstName) // Hello, TJ greet(firstName + ' ' + lastName) // Hello, TJ Stalcup
  • 26. The ModelThe Model Our model is going to be a Decision Tree.   Decision trees predict the most likely outcome based on input.   You can think of it like a computer building a version of 20 questions.
  • 27. Decision Trees - Golf?Decision Trees - Golf?
  • 28. The NotebookThe Notebook We're going to use a Google hosted Python to build this model. This app is called Colaboratory (Collaboration + Laboratory)   http://colab.research.google.com   New Notebook > New Python3 Notebook notebook
  • 29. ShortcomingsShortcomings Our model has a few weaknesses:   -Limited inputs -Assumptions
  • 30. Data Science @ ThinkfulData Science @ Thinkful Flexible, project-based curriculum to help you become the data scientist you want to be You don’t just learn skills, you get to make things Mentor support from experts in the industry Also, there's a job guarantee
  • 31. Link for the third party audit jobs report: https://www.thinkful.com/bootcamp-jobs-stats Thinkful Graduates 92% Job Placement Rate
  • 32. Learning Mentor Career MentorProgram Manager Local Community You Unprecedented SupportUnprecedented Support
  • 33. http://bit.ly/dc-ds-trial Initial 2-week trial course Start with Python and Statistics Unlimited Q&A Sessions Option to continue with full bootcamp Financing & scholarships available O er valid for tonight only Aaron Lamphere Trial Program Manager   Thinkful Two Week TrialThinkful Two Week Trial