Your SlideShare is downloading. ×
Learn Data Science
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Saving this for later?

Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime - even offline.

Text the download link to your phone

Standard text messaging rates apply

Learn Data Science

4,007
views

Published on

Big Data and Data Science are hot buzzwords right now. The buzzwords might go away but the ideas will not. This talk will explain the buzzwords, and it will cover some of the best resources for …

Big Data and Data Science are hot buzzwords right now. The buzzwords might go away but the ideas will not. This talk will explain the buzzwords, and it will cover some of the best resources for attaining data science skills.


0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
4,007
On Slideshare
0
From Embeds
0
Number of Embeds
4
Actions
Shares
0
Downloads
37
Comments
0
Likes
3
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Why Data Science isSomething You ShouldCare AboutPresented @ South Dakota Code Camp 2012Ryan Swanstrom @swgoof
  • 2. About Ryan SwanstromFind me on the web http://twitter.com/swgoof http://linkedin.com/in/ryanswanstrom http://datascience101.wordpress.com/
  • 3. Data Science"[ability to] obtain, scrub, explore, model andinterpret data, blending hacking, statistics, andmachine learning." definition by Hilary Mason, Chief Scientist @ Bit.ly
  • 4. Data Sciencehttp://www.drewconway.com/zia/?p=2378
  • 5. Who is a data scientist?http://onforb.es/WNLnRu
  • 6. Big DataAny dataset where the size or speed ofincoming data causes difficulties in processing ● Volume ● Velocity ● Variety
  • 7. Hadoop"[...] a framework that allows for the distributedprocessing of large data sets across clusters ofcomputers using simple programming models." Apache Hadoop Website ● HDFS - Hadoop Distributed File System ● MapReduce
  • 8. Lots of Data 18 Months the amount of time for digital data to double
  • 9. Data Products
  • 10. Why Do You Care?McKinsey Global Big Data Report● 140k - 190k Unfilled Jobs by 2018● 1.5M Managers & Analysts
  • 11. Indeed Data Science Job Listingshttp://www.indeed.com/jobtrends?q=Data-science&relative=1
  • 12. Now That You Care, What Skills? 1. Machine Learning 2. Statistics 3. Story Telling (Communication) 4. Big Data 5. Algorithms 6. Curiosity
  • 13. College and University http://datascience101.wordpress.com/2012/04/09/colleges-with-data-science-degrees/ http://whatsthebigdata.com/2012/08/09/graduate-programs-in-big-data-and-data-science/
  • 14. College and University Pros Cons● Credentials ● Expensive● Experts ● Not Individualized● Familiar ● School● Widely Accepted ● Lengthy● Structured ● Inflexible ● Not Real World
  • 15. Corporate Training General Assembly - Not really Corp Training, but it looks really good
  • 16. Corporate Training Pros Cons● Short Timeframe ● Expensive● Experts ● Not Individualized● Certificates ● Product Focused● Business-Savy ● Sales Pitch● Real World● Structured
  • 17. MOOCs (Massive Open OnlineCourses)
  • 18. MOOCs (Massive Open OnlineCourses) Pros Cons● Free ● No Credentials● Experts ● Single Course● Flexible ● No Programs (Yet)
  • 19. Blogs/Wikis/Other
  • 20. Blogs/Wikis/Other Pros Cons● Free ● Quality?● Very Specific ● No Credentials● Short ● No Structure● Lots of them ● Too many!
  • 21. Blogs/Wikis/Other The Problem ● What content is good? ● What order should I cover the content? ● Where do I find new content? ● Who can help me understand?
  • 22. Data Science 201 - coming soon http://www.datascience201.com Helping you find the best data science learning content! Thank You