Jonathan Cornelissen - Jonathan@datacamp.com - +1 857 498 4105
09/10/17
Open-Source Data Science
Crossing The Chasm
Outline
Crossing the chasm
➔ Who’s the early majority?
➔ What are they looking for?
➔ How are they driving change?
◆ Higher-level interfaces start winning
◆ Growth of Python
Who?
Registered students on DataCamp
70% professionals
~ 70% of learners are younger than 35
~ 70% of learners are male
Learning traffic last 30 days on DataCamp
Generally, the higher the GDP, the faster the adoption
Source: http://varianceexplained.org/r/nyr-conference/
North America and Europe show relatively faster adoption
Source: http://varianceexplained.org/r/nyr-conference/
What are they looking for?
What are they learning?
Newcomers want to build models and solve problems…
Confirmed by search behavior on DataCamp
… but first have to learn the basic skills
~ 145,000 course completions
Course completions on DataCamp across different topics - last 2 quarters
~ 50,000 course completions
How is this driving change?
Easier syntax and higher-level interfaces gain more traction
dplyr data.table
Difference in syntax between dplyr and data.table
More context:
https://stackoverflow.com/questions/21435339/data-table-vs-dplyr-can-one-do-something-well-the-other-cant-or-does-poorly
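To make the syntax difference concrete, here is a minimal sketch of the same query in both packages. The `flights` data frame and its `carrier` and `delay` columns are hypothetical toy data invented for illustration, not taken from the talk.

```r
library(dplyr)
library(data.table)

# Hypothetical toy data (an assumption for illustration only)
flights <- data.frame(
  carrier = c("AA", "AA", "UA", "UA", "UA"),
  delay   = c(10, 30, 5, 15, 40)
)

# dplyr: a pipeline of named verbs, read top to bottom
res_dplyr <- flights %>%
  filter(delay > 5) %>%
  group_by(carrier) %>%
  summarise(mean_delay = mean(delay)) %>%
  arrange(carrier)

# data.table: the same query as one DT[i, j, by] expression,
# followed by a chained [order(...)] to sort the result
flights_dt <- as.data.table(flights)
res_dt <- flights_dt[delay > 5,
                     .(mean_delay = mean(delay)),
                     by = carrier][order(carrier)]
```

Both versions filter rows, group by carrier, and average the delay. dplyr spells out each step as a verb, while data.table packs the steps into the positions of a single bracket call, which is terser but arguably harder for newcomers to read.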
https://insights.stackoverflow.com/trends
dplyr seems to have taken over from data.table
Growth of Python
Q&A
or
Jonathan@datacamp.com