SlideShare a Scribd company logo
Can you run faster?
Alexis Yelton
Runners want to run faster
How fast can I run a half marathon?
How can I improve on that time?
Demo
Data from Strava.com
Pace
Time series, demographic, and aggregated running data on
10,000 runners. 1,000 with half-marathon times.
75
100
125
150
1 2 3 4
Rests per week
HalfMarathonTime
"
1
8.4
8.6
8.8
9.0
0 2 4
Log Month Distance
LogHalfMarathonTime
10
10
10
20
clean5$mnth_pace
8.4
8.6
8.8
9.0
0 2 4
Log Month Distance
LogHalfMarathonTime
10
10
10
20
clean5$mnth_pace
Data from Strava.com
60
80
100
120
140
160
20 30 40 50 60
Age (years)
HalfMarathonTime
"blue"
blue
10
10
60
80
100
120
140
160
120 140 160 180 200
Weight (lbs)
HalfMarathonTime
Analysis
Benchmarking with a linear model 0.73 6.5 min
Reducing number of features
Ensemble partial least squares regression 0.73 6.4 min
5-fold cross-validation
Regression r2
RMSE
Validation:
72 runners 0.63 7.2 min
About me: Alexis Yelton, MIT postdoc
Chitinase in marine cyanobacteria
Chitinaseactivity
My first half
marathon:
1:56:30
Personal best:
1:47:56
22 Features
Month distance Weight Range
Month Runs Gender
Month Elevation Rest Days / Week
Month Pace Fast Days / Week
Month Time Long Days /Week
6 Month Distance 5K Time
6 Month Runs Marathon Time
6 Month Elevation Minimum Pace
6 Month Pace Minimum Pace > 2 mi
6 Month Time Minimum Pace > 3 mi
Age Range SD Pace
Results
0
500
1000
1500
4000 5000 6000 7000 8000
train_sub$HALF.MARATHON
sqrt(errorstrain^2)
Half Marathon Time
Errors vary with half
marathon time.
A larger data set
would allow for
better predictions
for faster and slower
runners.
Results
Analysis
Benchmarking with a linear model 0.73 6.5 min
Dealing with collinear features (and reducing number of features)
1. Ensemble partial least squares regression 0.72 6.6 min
2. Linear model 0.71
6.7 min
3. Lasso regression 0.69 6.8 min
4. Ridge regression 0.72 6.7 min
5. Random forest regression 0.67 7.1 min
3-fold cross-validation
Regression r2
RMSE
Validation:
69 runners 0.63 7.2 min
Analysis
Benchmarking with a linear model 0.73 6.5 min
Reducing number of features
1. Ensemble partial least squares regression 0.72 6.6 min
2. Linear model 0.71
6.7 min
3. Lasso regression 0.69 6.8 min
Other models with these features
1. Ridge regression 0.72 6.7 min
2. Random forest regression 0.67 7.1 min
3-fold cross-validation
Regression r2
RMSE
Validation:
69 runners 0.63 7.2 min
0.0e+00 4.0e+07 8.0e+07 1.2e+08
rf$finalModel
IncNodePurity
Your average pace over the past month is the most
important feature by far.
Results
Variable importance
Increase in node purity
Pace past month
5K time
Pace past month
Rest days
SD pace
Weight
Long days
Age
Gender

More Related Content

Similar to Week 5 presentation

Untitledtest6818 aef6d6c0991a013425ea
Untitledtest6818 aef6d6c0991a013425eaUntitledtest6818 aef6d6c0991a013425ea
Untitledtest6818 aef6d6c0991a013425ea
paul diya
 
3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis
3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis
3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis
Thermo Fisher Scientific
 
Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9
paul diya
 
Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9
paul diya
 
Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...
Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...
Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...
Marco Altini
 
Estimating Tail Parameters
Estimating Tail ParametersEstimating Tail Parameters
Estimating Tail Parameters
Alejandro Ortega
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methods
joycemi_la
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methods
joycemi_la
 
Survival report of 76 breast cancer patients under three different treatments
Survival report of 76 breast cancer patients under three different treatmentsSurvival report of 76 breast cancer patients under three different treatments
Survival report of 76 breast cancer patients under three different treatments
Dwaipayan Mukhopadhyay
 
Risk management
Risk managementRisk management
Risk management
Sunam Pal
 
Risk management Report
Risk management ReportRisk management Report
Risk management Report
NewGate India
 
Practical Data Science: Data Modelling and Presentation
Practical Data Science: Data Modelling and PresentationPractical Data Science: Data Modelling and Presentation
Practical Data Science: Data Modelling and Presentation
HariniMS1
 
Design of experiment methodology
Design of experiment methodologyDesign of experiment methodology
Design of experiment methodology
CHUN-HAO KUNG
 
Mnistauto 3
Mnistauto 3Mnistauto 3
Mnistauto 3
Ali Rıza SARAL
 
Enhancing the performance of kmeans algorithm
Enhancing the performance of kmeans algorithmEnhancing the performance of kmeans algorithm
Enhancing the performance of kmeans algorithm
Hadi Fadlallah
 
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Raj Kumar Thenua
 
Msa rr
Msa rrMsa rr
Msa rr
Chai Wei
 
Tools of the Trade
Tools of the TradeTools of the Trade
Tools of the Trade
Pierre-Charles Bierly
 
Models Can Lie
Models Can LieModels Can Lie
Models Can Lie
Raju Rimal
 
Lower back pain Regression models
Lower back pain Regression modelsLower back pain Regression models
Lower back pain Regression models
Alok Tayal (PMP, PMI-ACP, TOGAF)
 

Similar to Week 5 presentation (20)

Untitledtest6818 aef6d6c0991a013425ea
Untitledtest6818 aef6d6c0991a013425eaUntitledtest6818 aef6d6c0991a013425ea
Untitledtest6818 aef6d6c0991a013425ea
 
3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis
3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis
3 Ways the New Thermo Scientific LC MS Triple Quads Improve Residue Analysis
 
Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9
 
Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9Untitledtest6806 c17b226098de013425b9
Untitledtest6806 c17b226098de013425b9
 
Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...
Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...
Body Weight-Normalized Energy Expenditure Estimation Using Combined Activity ...
 
Estimating Tail Parameters
Estimating Tail ParametersEstimating Tail Parameters
Estimating Tail Parameters
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methods
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methods
 
Survival report of 76 breast cancer patients under three different treatments
Survival report of 76 breast cancer patients under three different treatmentsSurvival report of 76 breast cancer patients under three different treatments
Survival report of 76 breast cancer patients under three different treatments
 
Risk management
Risk managementRisk management
Risk management
 
Risk management Report
Risk management ReportRisk management Report
Risk management Report
 
Practical Data Science: Data Modelling and Presentation
Practical Data Science: Data Modelling and PresentationPractical Data Science: Data Modelling and Presentation
Practical Data Science: Data Modelling and Presentation
 
Design of experiment methodology
Design of experiment methodologyDesign of experiment methodology
Design of experiment methodology
 
Mnistauto 3
Mnistauto 3Mnistauto 3
Mnistauto 3
 
Enhancing the performance of kmeans algorithm
Enhancing the performance of kmeans algorithmEnhancing the performance of kmeans algorithm
Enhancing the performance of kmeans algorithm
 
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
 
Msa rr
Msa rrMsa rr
Msa rr
 
Tools of the Trade
Tools of the TradeTools of the Trade
Tools of the Trade
 
Models Can Lie
Models Can LieModels Can Lie
Models Can Lie
 
Lower back pain Regression models
Lower back pain Regression modelsLower back pain Regression models
Lower back pain Regression models
 

Recently uploaded

一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 

Recently uploaded (20)

一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 

Week 5 presentation

  • 1. Can you run faster? Alexis Yelton
  • 2. Runners want to run faster How fast can I run a half marathon? How can I improve on that time?
  • 4. Data from Strava.com Pace Time series, demographic, and aggregated running data on 10,000 runners. 1,000 with half-marathon times. 75 100 125 150 1 2 3 4 Rests per week HalfMarathonTime " 1 8.4 8.6 8.8 9.0 0 2 4 Log Month Distance LogHalfMarathonTime 10 10 10 20 clean5$mnth_pace 8.4 8.6 8.8 9.0 0 2 4 Log Month Distance LogHalfMarathonTime 10 10 10 20 clean5$mnth_pace
  • 5. Data from Strava.com 60 80 100 120 140 160 20 30 40 50 60 Age (years) HalfMarathonTime "blue" blue 10 10 60 80 100 120 140 160 120 140 160 180 200 Weight (lbs) HalfMarathonTime
  • 6. Analysis Benchmarking with a linear model 0.73 6.5 min Reducing number of features Ensemble partial least squares regression 0.73 6.4 min 5-fold cross-validation Regression r2 RMSE Validation: 72 runners 0.63 7.2 min
  • 7. About me: Alexis Yelton, MIT postdoc Chitinase in marine cyanobacteria Chitinaseactivity My first half marathon: 1:56:30 Personal best: 1:47:56
  • 8. 22 Features Month distance Weight Range Month Runs Gender Month Elevation Rest Days / Week Month Pace Fast Days / Week Month Time Long Days /Week 6 Month Distance 5K Time 6 Month Runs Marathon Time 6 Month Elevation Minimum Pace 6 Month Pace Minimum Pace > 2 mi 6 Month Time Minimum Pace > 3 mi Age Range SD Pace
  • 9. Results 0 500 1000 1500 4000 5000 6000 7000 8000 train_sub$HALF.MARATHON sqrt(errorstrain^2) Half Marathon Time Errors vary with half marathon time. A larger data set would allow for better predictions for faster and slower runners.
  • 11.
  • 12.
  • 13. Analysis Benchmarking with a linear model 0.73 6.5 min Dealing with collinear features (and reducing number of features) 1. Ensemble partial least squares regression 0.72 6.6 min 2. Linear model 0.71 6.7 min 3. Lasso regression 0.69 6.8 min 4. Ridge regression 0.72 6.7 min 5. Random forest regression 0.67 7.1 min 3-fold cross-validation Regression r2 RMSE Validation: 69 runners 0.63 7.2 min
  • 14. Analysis Benchmarking with a linear model 0.73 6.5 min Reducing number of features 1. Ensemble partial least squares regression 0.72 6.6 min 2. Linear model 0.71 6.7 min 3. Lasso regression 0.69 6.8 min Other models with these features 1. Ridge regression 0.72 6.7 min 2. Random forest regression 0.67 7.1 min 3-fold cross-validation Regression r2 RMSE Validation: 69 runners 0.63 7.2 min
  • 15. 0.0e+00 4.0e+07 8.0e+07 1.2e+08 rf$finalModel IncNodePurity Your average pace over the past month is the most important feature by far. Results Variable importance Increase in node purity Pace past month 5K time Pace past month Rest days SD pace Weight Long days Age Gender

Editor's Notes

  1. Start with asking if anyone is a runner. Be excited about the problem. Try elastic net regression (lasso and ridge combination)
  2. Motivation: Add what specifically works (main drivers of improving performance) Figure of errors vs actual times
  3. Motivation: Add what specifically works (main drivers of improving performance) Figure of errors vs actual times
  4. Start with 20 features in LM Feature selection (PLS regression, Lasso) Compare the same features in different models Improve usability Focus on the model I used. Introduce motivations before talking about the models (simplicity/usability and results)
  5. Start with 20 features in LM Feature selection (PLS regression, Lasso) Compare the same features in different models Improve usability
  6. Start with 20 features in LM Feature selection (PLS regression, Lasso) Compare the same features in different models Improve usability Focus on the model I used. Introduce motivations before talking about the models (simplicity/usability and results)