@simplystats
Tophat2 + derfinder (HMMs)
Tophat2 + derfinder (bumphunter)
Rail-RNA (cloud computing) + derfinder (bumphunter)
7/1/2012 (3:45PM)
“Hopkins has a deal w/ Coursera”
7/1/2012 (5:00PM)
Roger has bad timing
7/2/2012
Roger + Jeff sign on
7/5/2012
We make advertising videos
7/17/2012
Official JHU + Coursera announcement
“Wouldn’t it be amazing if we
got 2,000 people to learn
statistics!”
- Jeff Leek, 7/17/12
date: 7/19/12
from: jtleek@gmail.com
Roger let me know you gave him a
ballpark figure for the number of
students registered for his course
"Computing for Data Analysis".
Could you give me an idea of how
many have registered for my course
"Data Analysis"?
date: 7/19/12
from: pangwei@coursera.org
Hi Jeff,
7,000 students! It's pretty awesome.
(You'll be able to check this out
yourself next week, once the class
sites are up.)
date: 7/19/12
from: rdpeng@gmail.com

You are f**ed.
-roger
7/2012
Official JHU + Coursera announcement
9/2012
Brian/Roger run classes
1/2013
Jeff runs “data analysis”
A MOOC is
Videos
Quizzes
Forums
Peer grading
Formatting
1. Does the analysis have an introduction, analysis, and
conclusions? (wt = 10)
2. Does the analysis include references for the statistical
methods used? (wt = 2)
….
The Question
1. Is the type of question specified (exploratory, inferential,
predictive, causal)? (wt = 10)
2. Does the analysis answer the scientific question? (wt = 10)
3. Does the analysis report a measure of uncertainty about
the answer? (wt = 10)
….
…
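The weights in the rubric above suggest that some questions count more than others toward a peer-graded mark. Here is a minimal sketch of one way such weights could be combined. It assumes each question is marked 0 (no) or 1 (yes) and that the overall score is the weight-normalized sum. The slides do not state the actual formula, and the names `RubricItem` and `weighted_score` are hypothetical. Only the five questions shown on the slides are included; the elided ones are left out.

```python
from dataclasses import dataclass


@dataclass
class RubricItem:
    question: str
    weight: int  # the "wt" value shown on the slide


# Only the rubric questions visible on the slides; elided items are omitted.
RUBRIC = [
    RubricItem("Has an introduction, analysis, and conclusions?", 10),
    RubricItem("Includes references for the statistical methods used?", 2),
    RubricItem("Type of question specified?", 10),
    RubricItem("Answers the scientific question?", 10),
    RubricItem("Reports a measure of uncertainty about the answer?", 10),
]


def weighted_score(items, marks):
    """Return the weighted fraction of rubric points earned.

    ASSUMPTION: marks are 0 (no) or 1 (yes), one per rubric item, and the
    score is a weight-normalized sum. This is an illustration, not the
    course's documented grading formula.
    """
    if len(items) != len(marks):
        raise ValueError("need exactly one mark per rubric item")
    total = sum(item.weight for item in items)
    earned = sum(item.weight * mark for item, mark in zip(items, marks))
    return earned / total


# Hypothetical submission: no references, no uncertainty measure.
marks = [1, 0, 1, 1, 0]
score = weighted_score(RUBRIC, marks)
print(f"Weighted score: {score:.2f}")
```

Under this assumption, missing a weight-10 question costs five times as much as missing the weight-2 references question.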
Leek & Peng 2015 PNAS
Experiment 1
n=2,048
Fisher et al. 2014 PeerJ
Experiment 2
69% vs 40%
n=1,985
Mathematical Biostatistics Bootcamp: ~15K enrolled
Computing for Data Analysis: ~50K enrolled
Data Analysis: ~100K enrolled
Understanding scale
6,503 Data Analysis completers
6,761* M.S. in Statistics
* http://community.amstat.org/blogs/steve-pierson/2014/02/09/largest-graduate-programs-in-statistics
Understanding cost
Laptop
iPhone w/tripod mount
Tripod
Microphone (a good one)
Camtasia (screen recording)
Final Cut Pro X (video editing)
Total Cost: $2,877
7/2012
Official JHU + Coursera announcement
9/2012
Brian/Roger run classes
1/2013
Jeff runs “data analysis”
11/2013
Daphne Koller visits
We claim to have data science sequence
This is false
Failure is not an option
12/2013
We start making DSS
2/2014
We start testing DSS
4/2014
We launch!
9 classes
1 month long
Every month
Less standard content
Standard content
Github
Data cleaning
Interactive graphics
Presentations
Capstone
Probability
Inference
Regression and GLMs
EDA
Moore Data Science Environments
0/3 directors, 1/25 speakers statisticians
NAS Big Data Workshop
2/13 speakers statisticians
NIH BD2K Proposal Workshop
0/18 participants
Big Data Rollout from White House
0/4 thought leaders in statistics
(1/n) reasons:
speed
“Should we teach the Lasso?”
“No”
(2/n) reasons:
infrastructure
swirl +
Coursera
“Want to do a capstone?”
“Ok guy I just met”
LinkedIn Certification
This is not a degree!
Portfolio based
Open content
Johns Hopkins backing
Alumni “social network”
Enrollment
SigTrack
Completion percentage
SigTrack completion
Total Time Running: 13 months
Avg. Monthly Enrollment: 170,837
Avg. Monthly SigTrack: 12,486 (7.3%)
Overall Completion Rate: 10%
SigTrack Completion Rate: 85%
First Capstone Enrollment: 663
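As a quick arithmetic check on the summary figures above, the SigTrack share follows directly from the two monthly averages. The monthly completer counts derived below are rough estimates only: they assume the reported completion rates apply to the monthly averages, which the slides do not state. Variable names are illustrative.

```python
# Figures as reported on the slide.
avg_monthly_enrollment = 170_837
avg_monthly_sigtrack = 12_486
overall_completion_rate = 0.10
sigtrack_completion_rate = 0.85

# Share of enrollees on SigTrack; the slide reports this as 7.3%.
sigtrack_share = avg_monthly_sigtrack / avg_monthly_enrollment
print(f"SigTrack share of enrollment: {sigtrack_share:.1%}")

# ASSUMPTION: completion rates apply to the monthly averages.
# These are back-of-the-envelope estimates, not reported numbers.
est_completers = overall_completion_rate * avg_monthly_enrollment
est_sigtrack_completers = sigtrack_completion_rate * avg_monthly_sigtrack
print(f"Estimated completers per month: {est_completers:,.0f}")
print(f"Estimated SigTrack completers per month: {est_sigtrack_completers:,.0f}")
```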
Cost comparison
Revenue for 2014 (Q2-Q4): $1.75M
Revenue to Biostatistics: $1.24M
Resources required to date: 0.5 staff
Low overhead:
No admissions process,
no student supervision,
no administrative support
Student population: Orthogonal?
Why I think we
were successful
http://www.provost.umd.edu/announcements/new_coursera_mooc.cfm
@jtleek
jtleek.com/talks

JHU Data Science MOOCs - Behind the Scenes