SlideShare a Scribd company logo
1 of 60
Download to read offline
Math & Data Science
Dr June Andrews
July 29, 2015
Dr June Andrews Math & Data Science July 29, 2015 1 / 59
Table of contents
1 Data Science
Origins
People
Work
2 Math Behind Data Science
Experimentation
Growth Normalization
If Time
Dr June Andrews Math & Data Science July 29, 2015 2 / 59
First Data Science Job Rec
Be challenged at LinkedIn. We’re looking for superb analytical
minds of all levels to expand our small team that will build some
of the most innovative products at LinkedIn.
No specific technical skills are required (we’ll help you learn
SQL, Python, and R). You should be extremely intelligent, have
quantitative background, and be able to learn quickly and work
independently. This is the perfect job for someone who’s really
smart, driven, and extremely skilled at creatively solving
problems. You’ll learn statistics, data mining, programming, and
product design, but you’ve gotta start with what we can’t teach
- intellectual sharpness and creativity.
Figure: LinkedIn Job Posting April 2008
Dr June Andrews Math & Data Science July 29, 2015 3 / 59
Latest Data Science Job Rec
Data Scientist – Growth Analytics at LinkedIn
Data Scientists on our team partner with product managers, engineers and a cross-functional team to drive
LinkedIn membership growth and connectivity. We inform product strategy and product decisions by:
Extracting and analyzing LinkedIn data to derive actionable insights.
Formulating success metrics for completely novel products and creating dashboards/reports to monitor
them.
Designing and analyzing experiments to test new product ideas.
Developing models and data-driven solutions that add material lift to principal performance metrics.
LinkedIn member data is amazingly rich and provides a fantastic opportunity for Data Scientists to explore and
create, ultimately developing ways for members to improve their professional lives. Youll have the opportunity to
work with some of the best data people anywhere in an environment which truly values data-driven decisions.
Required qualifications include:
BS/MS in a quantitative discipline: Statistics, Applied Mathematics, Operations Research, Computer
Science, Engineering, Economics, etc.
1+ years experience working with large amounts of real data with SQL (Teradata, Oracle, or MySQL)
and R, or other statistical package.
1+ years work experience programming in Java or Python - Pig experience desired.
Proficiency in a Unix/Linux environment for automating processes with shell scripting.
Able to translate business objectives into actionable analyses.
Able to communicate findings clearly to both technical and non-technical audiences
Preferred Qualifications include:
Experience with Consumer Internet products.
Knowledge in one of the following areas is a strong plus: Viral Growth mechanisms, user acquisition in
International markets, Search Engine Optimization (SEO)
Expertise in applied statistics, understanding of controlled experiments.
Figure: LinkedIn Job Posting July 2015
Dr June Andrews Math & Data Science July 29, 2015 4 / 59
Latest Data Science Job Rec - Applicants
Figure: Applicants now have SQL, Python, and R. 702 applicants in 5 months.
Dr June Andrews Math & Data Science July 29, 2015 5 / 59
Trend is to Demand More
Definition (Data Science as a Victim of Success)
When use of a skill demonstrates improvements in support and innovation,
it is added to the next job rec.
Rule of thumb when hiring, does your favorite colleague pass your
interview?
Dr June Andrews Math & Data Science July 29, 2015 6 / 59
Goals
Invariant
Use data to support colleagues: marketing, finance, engineering, . . .
Use data to innovate: products, strategies, performance, . . .
Cherry on Top
Do what it takes to drive company success.
Dr June Andrews Math & Data Science July 29, 2015 7 / 59
Progress
1 Data Science
Origins
People
Work
2 Math Behind Data Science
Experimentation
Growth Normalization
If Time
Dr June Andrews Math & Data Science July 29, 2015 8 / 59
LinkedIn Data
Dr June Andrews Math & Data Science July 29, 2015 9 / 59
Source of 125k Data Professionals
Figure: Incredibly diverse.
Dr June Andrews Math & Data Science July 29, 2015 10 / 59
Data Professionals on LinkedIn
> 2k degree fields (after standardization)
16% are Unique Degrees:
Oral Surgery
Phytopathology
Wedding Planning
Ground Transportation
Library Sciences
Turfgrass Management
Embryology
Fire Fighting
Stagecraft
Art Conservation
Dr June Andrews Math & Data Science July 29, 2015 11 / 59
Data Science Homogenization Trend
Dr June Andrews Math & Data Science July 29, 2015 12 / 59
Uneven Growth of Top 10 Backgrounds
Dr June Andrews Math & Data Science July 29, 2015 13 / 59
Uneven Growth of Top 10 Backgrounds
Figure: Increased recruitment of economists and statisticans.
Dr June Andrews Math & Data Science July 29, 2015 14 / 59
Destinations of Data Professionals
Dr June Andrews Math & Data Science July 29, 2015 15 / 59
Industry Diversification of Data Professionals
Dr June Andrews Math & Data Science July 29, 2015 16 / 59
Uneven Growth of Top 10 Industries
Dr June Andrews Math & Data Science July 29, 2015 17 / 59
Trends
Homogenization of Sources of Data Professionals
Diversification of Industry Destinations of Data Professionals
Dr June Andrews Math & Data Science July 29, 2015 18 / 59
Progress
1 Data Science
Origins
People
Work
2 Math Behind Data Science
Experimentation
Growth Normalization
If Time
Dr June Andrews Math & Data Science July 29, 2015 19 / 59
Product Cycle
Figure: What portion of work data scientists do on a daily basis depends on
product life cycle.
Dr June Andrews Math & Data Science July 29, 2015 20 / 59
Content
Ask - Make content go big.
Dr June Andrews Math & Data Science July 29, 2015 21 / 59
Connection Network
Figure: Content spreads along existing connection network.
Dr June Andrews Math & Data Science July 29, 2015 22 / 59
Follow Network
Figure: Change the game. Increase readership and visibilty via follows.
Dr June Andrews Math & Data Science July 29, 2015 23 / 59
Product Cycle - Follow Network
Stage Work Time
Ideation
Explore how to make content go big.
Follows.
2 weeks
Design & Spec
Define a Follow for security, PR,
marketing, all teams possibly affected.
3 weeks
Development
Database engineering, rollback safe,
experimental framework.
6 months
Test & Iterate Slow release experiment. 3 months
Release Clean up code, outline fast follows 1 month
Table: Follow Network, slow and steady development cycle.
Dr June Andrews Math & Data Science July 29, 2015 24 / 59
Types of Work
Area of Data Goal
Analyze Understand
Visualize Communicate
Business Decisions Orchestrate Action
Prototype Product Demonstrate Usefulness
Refine Product Maximize Usefulness
Design Experiment Measure Changes
Analyze Experiment Learn
Log Save Everything
Process Make Data Useable
Load to Server/DB Make Data Accessible
Table: General data science stack.
Dr June Andrews Math & Data Science July 29, 2015 25 / 59
Who does What
Figure: Depth v. breadth of different fields.
Dr June Andrews Math & Data Science July 29, 2015 26 / 59
Skills of Data Professionals
Languages Tools Hard Skills Soft Skills
SQL
Microsoft (Office,
Excel, SQL, Visio)
Research Management
Java Oracle Statistics Leadership
Matlab SAS ETL Process Improvement
Javascript SharePoint Data Modeling Customer Service
R SAP Software Dev Software Docs
Python Cisco Data Mining Strategy
C++ Salseforce Forecasting Public Speaking
XML Six Sigma Database Design Team Leadership
Table: From LinkedIn’s 125k Data Professionals.
Dr June Andrews Math & Data Science July 29, 2015 27 / 59
Network Product Development
1 Data Science
Origins
People
Work
2 Math Behind Data Science
Experimentation
Growth Normalization
If Time
Dr June Andrews Math & Data Science July 29, 2015 28 / 59
Traditional A/B Testing
Figure: Traditional ab testing. [Salesforce]
High Level
Randomly divides users into two groups for different treatments.
Dr June Andrews Math & Data Science July 29, 2015 29 / 59
Social Influence
Figure: Users can communicate experiences in social networks.
Cross Over
Testing interaction features such as messaging, connections, and profile
views inherently have cross cohort communication.
Dr June Andrews Math & Data Science July 29, 2015 30 / 59
Elegant Solution
Figure: See geographical bounds. [Ugander et al]
High Level
Partition network into relatively low intra communication groups.
Dr June Andrews Math & Data Science July 29, 2015 31 / 59
Elegant Solution
Downside
Costly to implement and assign elegant solution. Limited number of
experiments can run simultaneousl.
Cohort Actual Performance Observed Performance Observed Diff
A x z
B y c · z c − 1
Table: What exists and is observed.
2 equations, 3 variables, can compute upper bound for x
y
Dr June Andrews Math & Data Science July 29, 2015 32 / 59
Elegant v. Brute Force Tradeoff
Bound
Actual impact a is bounded by observed impact c & viral coefficient V :
a = c−V
1−cV
Figure: Small impact for low viral products. [Andrews]
Dr June Andrews Math & Data Science July 29, 2015 33 / 59
Alternative Brute Force
Control Interactions
Split on the interaction at the cost of inconsistent user experience. Benefit
is test the impact of sending or receiving.
Sender / Receiver A B
A Treatment Control
B Control Control
Dr June Andrews Math & Data Science July 29, 2015 34 / 59
Progress
1 Data Science
Origins
People
Work
2 Math Behind Data Science
Experimentation
Growth Normalization
If Time
Dr June Andrews Math & Data Science July 29, 2015 35 / 59
Health Care’s Relations with other Industries
Figure: Since 2008 Health Care has increased relationships with Recruiters.
Dr June Andrews Math & Data Science July 29, 2015 36 / 59
Not so Fast
Figure: Growth of relationships is dominated by LinkedIn’s growth.
Dr June Andrews Math & Data Science July 29, 2015 37 / 59
Confounding or Masking Variables
Control Confounding Variables
Data quality and growth can dominate underlying
trends.
LinkedIn’s Network Growth is massive and
diverse
Venture Capitalists and Recruiters are hyper
connectors
Figure: Stan Lee
Dr June Andrews Math & Data Science July 29, 2015 38 / 59
Control for Growth and Behavioral Variables
Approach
Set as constants the number of users in an industry and how many
connections they have. Then reconnect connections at random.
Figure: Break edges and reconnect randomly.
Dr June Andrews Math & Data Science July 29, 2015 39 / 59
Expected Connections
Closed Form Solution
Reducible to pulling red and blue balls from a bag without replacement.
The solution is the expectation of the Hypergeometric distribution.
E[ Edges(Health Care, I) ] = Edges(Health Care) Edges(I)
i,j Edges(i,j) − Edges(Health Care)
Dr June Andrews Math & Data Science July 29, 2015 40 / 59
Expected Connections
Figure: Given growth and behavioral patterns, we expect some industries to have
a dramatic number of connections to health care professionals.
Dr June Andrews Math & Data Science July 29, 2015 41 / 59
Significant Relations with Health Care Appear
Figure: Venture Capitalists and Recruiters are no longer in the top rankings.
Dr June Andrews Math & Data Science July 29, 2015 42 / 59
Significant Relations with Health Care Appear
Relations Now Reflect the Larger
Economy
City programs have increased
inhome and preventative care
Many hospitals are named after
Saints and affliated with
Religious Denominations
Medical Devices and
Pharmaceuticals have and have
always had strong connection
to Health Care
Figure: Industries with Significant
Connections to Health Care
Dr June Andrews Math & Data Science July 29, 2015 43 / 59
Significant Relations with Realtors
Figure: Period of dramatic growth for real estate
Dr June Andrews Math & Data Science July 29, 2015 44 / 59
Significant Relations with Realtors
Figure: Period of economic change
Dr June Andrews Math & Data Science July 29, 2015 45 / 59
Significant Relations with Construction
Figure: Symmetric relationship between real estate and construction.
Construction workers migrate between real estate and oil and mining.
Dr June Andrews Math & Data Science July 29, 2015 46 / 59
Industry Migration - Mechanics
Figure: Construction workers connecting with Oil & Mining over Real Estate
Dr June Andrews Math & Data Science July 29, 2015 47 / 59
Industry Migration - Mechanics
How?
Is migration prompted by influential people?
Is migration independent pockets of movement?
Dr June Andrews Math & Data Science July 29, 2015 48 / 59
Industry Migration - Cascades
Figure: Median is 4 neighbors migrated before conversion
Dr June Andrews Math & Data Science July 29, 2015 49 / 59
Industry Migration - Mechanics
Figure: Size of bubble is proportional to size of complete cascade.
How?
Migration is largely independent, with some cascades.
Dr June Andrews Math & Data Science July 29, 2015 50 / 59
Wrap
1 Data Science
Origins
People
Work
2 Math Behind Data Science
Experimentation
Growth Normalization
If Time
Dr June Andrews Math & Data Science July 29, 2015 51 / 59
Data & Computing Growth
Figure: Data growth is exponential. Rule of thumb is doubles every 4-8 months.
Dr June Andrews Math & Data Science July 29, 2015 52 / 59
Linearity Wins
Figure: Linear algorithms are fast, predictable, and complete.
Dr June Andrews Math & Data Science July 29, 2015 53 / 59
Takes a Village - Thank You!
Figure: Every project involved at least 3 people.
Dr June Andrews Math & Data Science July 29, 2015 54 / 59
Progress
1 Data Science
Origins
People
Work
2 Math Behind Data Science
Experimentation
Growth Normalization
If Time
Dr June Andrews Math & Data Science July 29, 2015 55 / 59
MAP
MAP combines:
Precision - Give me only what I want
Recall - Give me everything I want
Figure: (Precision, Recall) values with same MAP score.
Dr June Andrews Math & Data Science July 29, 2015 56 / 59
MAP
Figure: Two sets of (Precision, Recall) values with same MAP score.
Dr June Andrews Math & Data Science July 29, 2015 57 / 59
MAP
Figure: Improve a search algorithm from point a with either small increase in
Recall or a large increase in Precision.
Dr June Andrews Math & Data Science July 29, 2015 58 / 59
MAP
Figure: Additional points.
Dr June Andrews Math & Data Science July 29, 2015 59 / 59
MAP
Snake Oil
When Precision and Recall values are not balanced, MAP only responds
to changes in the lower one.
North Star
When Precision and Recall values are balanced, promotes improvement of
both Precision and Recall.
Dr June Andrews Math & Data Science July 29, 2015 60 / 59

More Related Content

What's hot

Introduction and Applications of Discrete Mathematics
Introduction and Applications of Discrete MathematicsIntroduction and Applications of Discrete Mathematics
Introduction and Applications of Discrete Mathematicsblaircomp2003
 
Application of calculus in cse
Application of calculus in cseApplication of calculus in cse
Application of calculus in cseMd. Janibul Hoque
 
Machine Learning and its Applications
Machine Learning and its ApplicationsMachine Learning and its Applications
Machine Learning and its ApplicationsDr Ganesh Iyer
 
Applications of Linear Algebra in Computer Sciences
Applications of Linear Algebra in Computer SciencesApplications of Linear Algebra in Computer Sciences
Applications of Linear Algebra in Computer SciencesAmir Sharif Chishti
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learningbutest
 
Mathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML EngineeringMathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML EngineeringGopi Krishna Nuti
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningKoundinya Desiraju
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Venkata Reddy Konasani
 
Intro & Applications of Discrete Math
Intro & Applications of Discrete MathIntro & Applications of Discrete Math
Intro & Applications of Discrete MathBilal Khan
 
Intro to Discrete Mathematics
Intro to Discrete MathematicsIntro to Discrete Mathematics
Intro to Discrete Mathematicsasad faraz
 
Supervised Machine Learning Techniques
Supervised Machine Learning TechniquesSupervised Machine Learning Techniques
Supervised Machine Learning TechniquesTara ram Goyal
 
Discrete Math in Real Life
Discrete Math in Real LifeDiscrete Math in Real Life
Discrete Math in Real Lifesulaiman hridoy
 
Applications of linear algebra
Applications of linear algebraApplications of linear algebra
Applications of linear algebraPrerak Trivedi
 
Role of Mathematics in everyday life
Role of Mathematics in everyday lifeRole of Mathematics in everyday life
Role of Mathematics in everyday lifeKajal Satija
 
1_Introduction to Artificial Intelligence.pdf
1_Introduction to Artificial Intelligence.pdf1_Introduction to Artificial Intelligence.pdf
1_Introduction to Artificial Intelligence.pdfasxc1
 

What's hot (20)

Introduction and Applications of Discrete Mathematics
Introduction and Applications of Discrete MathematicsIntroduction and Applications of Discrete Mathematics
Introduction and Applications of Discrete Mathematics
 
Application of calculus in cse
Application of calculus in cseApplication of calculus in cse
Application of calculus in cse
 
Machine Learning and its Applications
Machine Learning and its ApplicationsMachine Learning and its Applications
Machine Learning and its Applications
 
Applications of Linear Algebra in Computer Sciences
Applications of Linear Algebra in Computer SciencesApplications of Linear Algebra in Computer Sciences
Applications of Linear Algebra in Computer Sciences
 
2.mathematics for machine learning
2.mathematics for machine learning2.mathematics for machine learning
2.mathematics for machine learning
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
 
Mathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML EngineeringMathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML Engineering
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
 
Sadiq maths ppt
Sadiq maths pptSadiq maths ppt
Sadiq maths ppt
 
Intro & Applications of Discrete Math
Intro & Applications of Discrete MathIntro & Applications of Discrete Math
Intro & Applications of Discrete Math
 
Intro to Discrete Mathematics
Intro to Discrete MathematicsIntro to Discrete Mathematics
Intro to Discrete Mathematics
 
Supervised Machine Learning Techniques
Supervised Machine Learning TechniquesSupervised Machine Learning Techniques
Supervised Machine Learning Techniques
 
Discrete Math in Real Life
Discrete Math in Real LifeDiscrete Math in Real Life
Discrete Math in Real Life
 
Artificial Intelligence and Mathematics
Artificial Intelligence and MathematicsArtificial Intelligence and Mathematics
Artificial Intelligence and Mathematics
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Applications of linear algebra
Applications of linear algebraApplications of linear algebra
Applications of linear algebra
 
Role of Mathematics in everyday life
Role of Mathematics in everyday lifeRole of Mathematics in everyday life
Role of Mathematics in everyday life
 
1_Introduction to Artificial Intelligence.pdf
1_Introduction to Artificial Intelligence.pdf1_Introduction to Artificial Intelligence.pdf
1_Introduction to Artificial Intelligence.pdf
 
Terminology Machine Learning
Terminology Machine LearningTerminology Machine Learning
Terminology Machine Learning
 

Viewers also liked

10 Crucial Tips You Need to Learn to Keep Your Customers Happy
10 Crucial Tips You Need to Learn to Keep Your Customers Happy10 Crucial Tips You Need to Learn to Keep Your Customers Happy
10 Crucial Tips You Need to Learn to Keep Your Customers HappyRichard Kimber (CMICS)
 
Start15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaStart15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaMiha Zoubek
 
Empowering Publishing Production
Empowering Publishing ProductionEmpowering Publishing Production
Empowering Publishing ProductionDeanta
 
Benefits realization - May 2016
Benefits realization - May 2016Benefits realization - May 2016
Benefits realization - May 2016Haresh Desai
 
Видатки бюджету
Видатки бюджетуВидатки бюджету
Видатки бюджетуmynizhyn
 
osteosentez sonrası akut enfeksiyonlar
osteosentez sonrası akut enfeksiyonlarosteosentez sonrası akut enfeksiyonlar
osteosentez sonrası akut enfeksiyonlargökhan tevfik ateş
 
Ретро Fm презентация декабрь 2014
Ретро Fm презентация декабрь 2014Ретро Fm презентация декабрь 2014
Ретро Fm презентация декабрь 2014mynizhyn
 
Пропозиція щодо перейменування вулиць. 2013 р.
Пропозиція щодо перейменування вулиць. 2013 р.Пропозиція щодо перейменування вулиць. 2013 р.
Пропозиція щодо перейменування вулиць. 2013 р.mynizhyn
 

Viewers also liked (17)

Data+Design
Data+DesignData+Design
Data+Design
 
10 Crucial Tips You Need to Learn to Keep Your Customers Happy
10 Crucial Tips You Need to Learn to Keep Your Customers Happy10 Crucial Tips You Need to Learn to Keep Your Customers Happy
10 Crucial Tips You Need to Learn to Keep Your Customers Happy
 
Video Analysis
Video AnalysisVideo Analysis
Video Analysis
 
Start15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaStart15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_lina
 
Survival
SurvivalSurvival
Survival
 
Converted_file_43d2657c
Converted_file_43d2657cConverted_file_43d2657c
Converted_file_43d2657c
 
NYCLA Linked In
NYCLA Linked InNYCLA Linked In
NYCLA Linked In
 
Applications Presentation
Applications PresentationApplications Presentation
Applications Presentation
 
Empowering Publishing Production
Empowering Publishing ProductionEmpowering Publishing Production
Empowering Publishing Production
 
Survival
SurvivalSurvival
Survival
 
Benefits realization - May 2016
Benefits realization - May 2016Benefits realization - May 2016
Benefits realization - May 2016
 
Видатки бюджету
Видатки бюджетуВидатки бюджету
Видатки бюджету
 
Filo
FiloFilo
Filo
 
CHOOSE CHET
CHOOSE CHETCHOOSE CHET
CHOOSE CHET
 
osteosentez sonrası akut enfeksiyonlar
osteosentez sonrası akut enfeksiyonlarosteosentez sonrası akut enfeksiyonlar
osteosentez sonrası akut enfeksiyonlar
 
Ретро Fm презентация декабрь 2014
Ретро Fm презентация декабрь 2014Ретро Fm презентация декабрь 2014
Ретро Fm презентация декабрь 2014
 
Пропозиція щодо перейменування вулиць. 2013 р.
Пропозиція щодо перейменування вулиць. 2013 р.Пропозиція щодо перейменування вулиць. 2013 р.
Пропозиція щодо перейменування вулиць. 2013 р.
 

Similar to Math in data

How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceJuuso Parkkinen
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science LandscapePhilip Bourne
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and AnalyticsDhruv Saxena
 
Better Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsBetter Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsProduct School
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfvishal choudhary
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Joanne Luciano
 
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
Careers in Data Science _  Navigating the Digital Frontier (1).pptxCareers in Data Science _  Navigating the Digital Frontier (1).pptx
Careers in Data Science _ Navigating the Digital Frontier (1).pptx2075AAGEPRATIK
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxwahiba ben abdessalem
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxssuser1a4f0f
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistLisa Cohen
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallDATAVERSITY
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist prateek kumar
 
Introduction to Data Science.pptx
Introduction to Data Science.pptxIntroduction to Data Science.pptx
Introduction to Data Science.pptxDr.Shweta
 
Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewUC San Diego Rady School of Management
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxsumitkumar600840
 

Similar to Math in data (20)

How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data Science
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and Analytics
 
Better Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsBetter Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data Decisions
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdf
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020
 
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
Careers in Data Science _  Navigating the Digital Frontier (1).pptxCareers in Data Science _  Navigating the Digital Frontier (1).pptx
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data Scientist
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the table
 
Data Analytics Course in Noida. pptx
Data Analytics  Course in Noida.     pptxData Analytics  Course in Noida.     pptx
Data Analytics Course in Noida. pptx
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal Ball
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist
 
Introduction to Data Science.pptx
Introduction to Data Science.pptxIntroduction to Data Science.pptx
Introduction to Data Science.pptx
 
Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program Overview
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
 

More from June Andrews

Scaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will loveScaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will loveJune Andrews
 
The Uncanny Valley of ML
The Uncanny Valley of MLThe Uncanny Valley of ML
The Uncanny Valley of MLJune Andrews
 
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...June Andrews
 
Push & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & AcademiaPush & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & AcademiaJune Andrews
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsJune Andrews
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsJune Andrews
 
Replication in Data Science
Replication in Data ScienceReplication in Data Science
Replication in Data ScienceJune Andrews
 
Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...June Andrews
 
Trends on Pinterest
Trends on PinterestTrends on Pinterest
Trends on PinterestJune Andrews
 
Growth, Engagement & Search Metrics: Snake Oil or North Stars
Growth, Engagement & Search Metrics: Snake Oil or North StarsGrowth, Engagement & Search Metrics: Snake Oil or North Stars
Growth, Engagement & Search Metrics: Snake Oil or North StarsJune Andrews
 
Predictive Analytics & Business Insights
Predictive Analytics & Business InsightsPredictive Analytics & Business Insights
Predictive Analytics & Business InsightsJune Andrews
 

More from June Andrews (14)

Scaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will loveScaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will love
 
The Uncanny Valley of ML
The Uncanny Valley of MLThe Uncanny Valley of ML
The Uncanny Valley of ML
 
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
 
Data Competitive
Data CompetitiveData Competitive
Data Competitive
 
Push & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & AcademiaPush & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & Academia
 
ML Playbook
ML PlaybookML Playbook
ML Playbook
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of Things
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of Things
 
Replication in Data Science
Replication in Data ScienceReplication in Data Science
Replication in Data Science
 
Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...
 
Trends on Pinterest
Trends on PinterestTrends on Pinterest
Trends on Pinterest
 
Growth, Engagement & Search Metrics: Snake Oil or North Stars
Growth, Engagement & Search Metrics: Snake Oil or North StarsGrowth, Engagement & Search Metrics: Snake Oil or North Stars
Growth, Engagement & Search Metrics: Snake Oil or North Stars
 
Economic Insights
Economic InsightsEconomic Insights
Economic Insights
 
Predictive Analytics & Business Insights
Predictive Analytics & Business InsightsPredictive Analytics & Business Insights
Predictive Analytics & Business Insights
 

Recently uploaded

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 

Recently uploaded (20)

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 

Math in data

  • 1. Math & Data Science Dr June Andrews July 29, 2015 Dr June Andrews Math & Data Science July 29, 2015 1 / 59
  • 2. Table of contents 1 Data Science Origins People Work 2 Math Behind Data Science Experimentation Growth Normalization If Time Dr June Andrews Math & Data Science July 29, 2015 2 / 59
  • 3. First Data Science Job Rec Be challenged at LinkedIn. We’re looking for superb analytical minds of all levels to expand our small team that will build some of the most innovative products at LinkedIn. No specific technical skills are required (we’ll help you learn SQL, Python, and R). You should be extremely intelligent, have quantitative background, and be able to learn quickly and work independently. This is the perfect job for someone who’s really smart, driven, and extremely skilled at creatively solving problems. You’ll learn statistics, data mining, programming, and product design, but you’ve gotta start with what we can’t teach - intellectual sharpness and creativity. Figure: LinkedIn Job Posting April 2008 Dr June Andrews Math & Data Science July 29, 2015 3 / 59
  • 4. Latest Data Science Job Rec Data Scientist – Growth Analytics at LinkedIn Data Scientists on our team partner with product managers, engineers and a cross-functional team to drive LinkedIn membership growth and connectivity. We inform product strategy and product decisions by: Extracting and analyzing LinkedIn data to derive actionable insights. Formulating success metrics for completely novel products and creating dashboards/reports to monitor them. Designing and analyzing experiments to test new product ideas. Developing models and data-driven solutions that add material lift to principal performance metrics. LinkedIn member data is amazingly rich and provides a fantastic opportunity for Data Scientists to explore and create, ultimately developing ways for members to improve their professional lives. Youll have the opportunity to work with some of the best data people anywhere in an environment which truly values data-driven decisions. Required qualifications include: BS/MS in a quantitative discipline: Statistics, Applied Mathematics, Operations Research, Computer Science, Engineering, Economics, etc. 1+ years experience working with large amounts of real data with SQL (Teradata, Oracle, or MySQL) and R, or other statistical package. 1+ years work experience programming in Java or Python - Pig experience desired. Proficiency in a Unix/Linux environment for automating processes with shell scripting. Able to translate business objectives into actionable analyses. Able to communicate findings clearly to both technical and non-technical audiences Preferred Qualifications include: Experience with Consumer Internet products. Knowledge in one of the following areas is a strong plus: Viral Growth mechanisms, user acquisition in International markets, Search Engine Optimization (SEO) Expertise in applied statistics, understanding of controlled experiments. Figure: LinkedIn Job Posting July 2015 Dr June Andrews Math & Data Science July 29, 2015 4 / 59
  • 5. Latest Data Science Job Rec - Applicants Figure: Applicants now have SQL, Python, and R. 702 applicants in 5 months. Dr June Andrews Math & Data Science July 29, 2015 5 / 59
  • 6. Trend is to Demand More Definition (Data Science as a Victim of Success) When use of a skill demonstrates improvements in support and innovation, it is added to the next job rec. Rule of thumb when hiring, does your favorite colleague pass your interview? Dr June Andrews Math & Data Science July 29, 2015 6 / 59
  • 7. Goals Invariant Use data to support colleagues: marketing, finance, engineering, . . . Use data to innovate: products, strategies, performance, . . . Cherry on Top Do what it takes to drive company success. Dr June Andrews Math & Data Science July 29, 2015 7 / 59
  • 8. Progress 1 Data Science Origins People Work 2 Math Behind Data Science Experimentation Growth Normalization If Time Dr June Andrews Math & Data Science July 29, 2015 8 / 59
  • 9. LinkedIn Data Dr June Andrews Math & Data Science July 29, 2015 9 / 59
  • 10. Source of 125k Data Professionals Figure: Incredibly diverse. Dr June Andrews Math & Data Science July 29, 2015 10 / 59
  • 11. Data Professionals on LinkedIn > 2k degree fields (after standardization) 16% are Unique Degrees: Oral Surgery Phytopathology Wedding Planning Ground Transportation Library Sciences Turfgrass Management Embryology Fire Fighting Stagecraft Art Conservation Dr June Andrews Math & Data Science July 29, 2015 11 / 59
  • 12. Data Science Homogenization Trend Dr June Andrews Math & Data Science July 29, 2015 12 / 59
  • 13. Uneven Growth of Top 10 Backgrounds Dr June Andrews Math & Data Science July 29, 2015 13 / 59
  • 14. Uneven Growth of Top 10 Backgrounds Figure: Increased recruitment of economists and statisticans. Dr June Andrews Math & Data Science July 29, 2015 14 / 59
  • 15. Destinations of Data Professionals Dr June Andrews Math & Data Science July 29, 2015 15 / 59
  • 16. Industry Diversification of Data Professionals Dr June Andrews Math & Data Science July 29, 2015 16 / 59
  • 17. Uneven Growth of Top 10 Industries Dr June Andrews Math & Data Science July 29, 2015 17 / 59
  • 18. Trends Homogenization of Sources of Data Professionals Diversification of Industry Destinations of Data Professionals Dr June Andrews Math & Data Science July 29, 2015 18 / 59
  • 19. Progress 1 Data Science Origins People Work 2 Math Behind Data Science Experimentation Growth Normalization If Time Dr June Andrews Math & Data Science July 29, 2015 19 / 59
  • 20. Product Cycle Figure: What portion of work data scientists do on a daily basis depends on product life cycle. Dr June Andrews Math & Data Science July 29, 2015 20 / 59
  • 21. Content Ask - Make content go big. Dr June Andrews Math & Data Science July 29, 2015 21 / 59
  • 22. Connection Network Figure: Content spreads along existing connection network. Dr June Andrews Math & Data Science July 29, 2015 22 / 59
  • 23. Follow Network Figure: Change the game. Increase readership and visibilty via follows. Dr June Andrews Math & Data Science July 29, 2015 23 / 59
  • 24. Product Cycle - Follow Network Stage Work Time Ideation Explore how to make content go big. Follows. 2 weeks Design & Spec Define a Follow for security, PR, marketing, all teams possibly affected. 3 weeks Development Database engineering, rollback safe, experimental framework. 6 months Test & Iterate Slow release experiment. 3 months Release Clean up code, outline fast follows 1 month Table: Follow Network, slow and steady development cycle. Dr June Andrews Math & Data Science July 29, 2015 24 / 59
  • 25. Types of Work Area of Data Goal Analyze Understand Visualize Communicate Business Decisions Orchestrate Action Prototype Product Demonstrate Usefulness Refine Product Maximize Usefulness Design Experiment Measure Changes Analyze Experiment Learn Log Save Everything Process Make Data Useable Load to Server/DB Make Data Accessible Table: General data science stack. Dr June Andrews Math & Data Science July 29, 2015 25 / 59
  • 26. Who does What Figure: Depth v. breadth of different fields. Dr June Andrews Math & Data Science July 29, 2015 26 / 59
  • 27. Skills of Data Professionals Languages Tools Hard Skills Soft Skills SQL Microsoft (Office, Excel, SQL, Visio) Research Management Java Oracle Statistics Leadership Matlab SAS ETL Process Improvement Javascript SharePoint Data Modeling Customer Service R SAP Software Dev Software Docs Python Cisco Data Mining Strategy C++ Salseforce Forecasting Public Speaking XML Six Sigma Database Design Team Leadership Table: From LinkedIn’s 125k Data Professionals. Dr June Andrews Math & Data Science July 29, 2015 27 / 59
  • 28. Network Product Development 1 Data Science Origins People Work 2 Math Behind Data Science Experimentation Growth Normalization If Time Dr June Andrews Math & Data Science July 29, 2015 28 / 59
  • 29. Traditional A/B Testing Figure: Traditional ab testing. [Salesforce] High Level Randomly divides users into two groups for different treatments. Dr June Andrews Math & Data Science July 29, 2015 29 / 59
  • 30. Social Influence Figure: Users can communicate experiences in social networks. Cross Over Testing interaction features such as messaging, connections, and profile views inherently have cross cohort communication. Dr June Andrews Math & Data Science July 29, 2015 30 / 59
  • 31. Elegant Solution Figure: See geographical bounds. [Ugander et al] High Level Partition network into relatively low intra communication groups. Dr June Andrews Math & Data Science July 29, 2015 31 / 59
  • 32. Elegant Solution Downside Costly to implement and assign elegant solution. Limited number of experiments can run simultaneousl. Cohort Actual Performance Observed Performance Observed Diff A x z B y c · z c − 1 Table: What exists and is observed. 2 equations, 3 variables, can compute upper bound for x y Dr June Andrews Math & Data Science July 29, 2015 32 / 59
  • 33. Elegant v. Brute Force Tradeoff Bound Actual impact a is bounded by observed impact c & viral coefficient V : a = c−V 1−cV Figure: Small impact for low viral products. [Andrews] Dr June Andrews Math & Data Science July 29, 2015 33 / 59
  • 34. Alternative Brute Force Control Interactions Split on the interaction at the cost of inconsistent user experience. Benefit is test the impact of sending or receiving. Sender / Receiver A B A Treatment Control B Control Control Dr June Andrews Math & Data Science July 29, 2015 34 / 59
  • 35. Progress 1 Data Science Origins People Work 2 Math Behind Data Science Experimentation Growth Normalization If Time Dr June Andrews Math & Data Science July 29, 2015 35 / 59
  • 36. Health Care’s Relations with other Industries Figure: Since 2008 Health Care has increased relationships with Recruiters. Dr June Andrews Math & Data Science July 29, 2015 36 / 59
  • 37. Not so Fast Figure: Growth of relationships is dominated by LinkedIn’s growth. Dr June Andrews Math & Data Science July 29, 2015 37 / 59
  • 38. Confounding or Masking Variables Control Confounding Variables Data quality and growth can dominate underlying trends. LinkedIn’s Network Growth is massive and diverse Venture Capitalists and Recruiters are hyper connectors Figure: Stan Lee Dr June Andrews Math & Data Science July 29, 2015 38 / 59
  • 39. Control for Growth and Behavioral Variables Approach Set as constants the number of users in an industry and how many connections they have. Then reconnect connections at random. Figure: Break edges and reconnect randomly. Dr June Andrews Math & Data Science July 29, 2015 39 / 59
  • 40. Expected Connections Closed Form Solution Reducible to pulling red and blue balls from a bag without replacement. The solution is the expectation of the Hypergeometric distribution. E[ Edges(Health Care, I) ] = Edges(Health Care) Edges(I) i,j Edges(i,j) − Edges(Health Care) Dr June Andrews Math & Data Science July 29, 2015 40 / 59
  • 41. Expected Connections Figure: Given growth and behavioral patterns, we expect some industries to have a dramatic number of connections to health care professionals. Dr June Andrews Math & Data Science July 29, 2015 41 / 59
  • 42. Significant Relations with Health Care Appear Figure: Venture Capitalists and Recruiters are no longer in the top rankings. Dr June Andrews Math & Data Science July 29, 2015 42 / 59
  • 43. Significant Relations with Health Care Appear Relations Now Reflect the Larger Economy City programs have increased inhome and preventative care Many hospitals are named after Saints and affliated with Religious Denominations Medical Devices and Pharmaceuticals have and have always had strong connection to Health Care Figure: Industries with Significant Connections to Health Care Dr June Andrews Math & Data Science July 29, 2015 43 / 59
  • 44. Significant Relations with Realtors Figure: Period of dramatic growth for real estate Dr June Andrews Math & Data Science July 29, 2015 44 / 59
  • 45. Significant Relations with Realtors Figure: Period of economic change Dr June Andrews Math & Data Science July 29, 2015 45 / 59
  • 46. Significant Relations with Construction Figure: Symmetric relationship between real estate and construction. Construction workers migrate between real estate and oil and mining. Dr June Andrews Math & Data Science July 29, 2015 46 / 59
  • 47. Industry Migration - Mechanics Figure: Construction workers connecting with Oil & Mining over Real Estate Dr June Andrews Math & Data Science July 29, 2015 47 / 59
  • 48. Industry Migration - Mechanics How? Is migration prompted by influential people? Is migration independent pockets of movement? Dr June Andrews Math & Data Science July 29, 2015 48 / 59
  • 49. Industry Migration - Cascades Figure: Median is 4 neighbors migrated before conversion Dr June Andrews Math & Data Science July 29, 2015 49 / 59
  • 50. Industry Migration - Mechanics Figure: Size of bubble is proportional to size of complete cascade. How? Migration is largely independent, with some cascades. Dr June Andrews Math & Data Science July 29, 2015 50 / 59
  • 51. Wrap 1 Data Science Origins People Work 2 Math Behind Data Science Experimentation Growth Normalization If Time Dr June Andrews Math & Data Science July 29, 2015 51 / 59
  • 52. Data & Computing Growth Figure: Data growth is exponential. Rule of thumb is doubles every 4-8 months. Dr June Andrews Math & Data Science July 29, 2015 52 / 59
  • 53. Linearity Wins Figure: Linear algorithms are fast, predictable, and complete. Dr June Andrews Math & Data Science July 29, 2015 53 / 59
  • 54. Takes a Village - Thank You! Figure: Every project involved at least 3 people. Dr June Andrews Math & Data Science July 29, 2015 54 / 59
  • 55. Progress 1 Data Science Origins People Work 2 Math Behind Data Science Experimentation Growth Normalization If Time Dr June Andrews Math & Data Science July 29, 2015 55 / 59
  • 56. MAP MAP combines: Precision - Give me only what I want Recall - Give me everything I want Figure: (Precision, Recall) values with same MAP score. Dr June Andrews Math & Data Science July 29, 2015 56 / 59
  • 57. MAP Figure: Two sets of (Precision, Recall) values with same MAP score. Dr June Andrews Math & Data Science July 29, 2015 57 / 59
  • 58. MAP Figure: Improve a search algorithm from point a with either small increase in Recall or a large increase in Precision. Dr June Andrews Math & Data Science July 29, 2015 58 / 59
  • 59. MAP Figure: Additional points. Dr June Andrews Math & Data Science July 29, 2015 59 / 59
  • 60. MAP Snake Oil When Precision and Recall values are not balanced, MAP only responds to changes in the lower one. North Star When Precision and Recall values are balanced, promotes improvement of both Precision and Recall. Dr June Andrews Math & Data Science July 29, 2015 60 / 59