Big Data and Data Science are hot buzzwords right now. The buzzwords might go away but the ideas will not. This talk will explain the buzzwords, and it will cover some of the best resources for attaining data science skills.
Why Data Science isSomething You ShouldCare AboutPresented @ South Dakota Code Camp 2012Ryan Swanstrom @swgoof
About Ryan SwanstromFind me on the web http://twitter.com/swgoof http://linkedin.com/in/ryanswanstrom http://datascience101.wordpress.com/
Data Science"[ability to] obtain, scrub, explore, model andinterpret data, blending hacking, statistics, andmachine learning." definition by Hilary Mason, Chief Scientist @ Bit.ly
Who is a data scientist?http://onforb.es/WNLnRu
Big DataAny dataset where the size or speed ofincoming data causes difficulties in processing ● Volume ● Velocity ● Variety
Hadoop"[...] a framework that allows for the distributedprocessing of large data sets across clusters ofcomputers using simple programming models." Apache Hadoop Website ● HDFS - Hadoop Distributed File System ● MapReduce
Lots of Data 18 Months the amount of time for digital data to double