View stunning SlideShares in full-screen with the new iOS app!Introducing SlideShare for AndroidExplore all your favorite topics in the SlideShare appGet the SlideShare app to Save for Later — even offline
View stunning SlideShares in full-screen with the new Android app!View stunning SlideShares in full-screen with the new iOS app!
Big Data is Getting Bigger 2.7 Zetabytes in 2012 Over 90% will be unstructured Data spread across a wide array of silos
Why is Big Data Hard (and Getting Harder)? Changing Data Requirements Faster response time of fresher dataSampling is not good enough & history is important Increasing complexity of analytics Users demand inexpensive experimentation
Where is it Coming From?Computer Generated Human Generated• Application server logs • Twitter “Fire Hose” 50m (web sites, games) tweets/day 1,400% growth• Sensor data (weather, per year water, smart grids) • Blogs/Reviews/Emails/Pict• Images/videos (traffic, ures security cameras) • Social Graphs: Facebook, Linked-in, Contacts
We know we want collect, store,organize, analyze and share it.But we have limited resources.
The Cloud OptimizesPrecious IT Resources i.e. Skilled People
“Over the next decade, the number of ﬁles or containersthat encapsulate the information in the digital universewill grow by 75x.While the pool of IT staff available to manage them willgrow only slightly. At 1.5x” - 2011 IDC Digital Universe Study
Big Data Verticals SocialMedia/Adverti Financial Oil & Gas Retail Life Sciences Security Network/Gami sing Services ng User Anti-virus Targeted Monte Carlo Demographics Recommend Advertising Simulations Seismic Genome Fraud Usage analysis Analysis Analysis Detection Image and Transactions Video Risk Analysis Analysis Image In-game Processing Recognition metrics
Bank – Monte Carlo Simulations “The AWS platform was a good fit for its unlimited and flexible computational power to23 Hours to our risk-simulation process requirements. With AWS, we now have the power to decide20 Minutes how fast we want to obtain simulation results, and, more importantly, we have the ability to run simulations not possible before due to the large amount of infrastructure required.” – Castillo, Director, Bankinter
Recommendations The Taste Testhttp://www.etsy.com/tastetest
RecommendationsGift Ideas for Facebook Friends etsy.com/gifts
Click Stream Analysis User recently purchased a Targeted Adsports movie and (1.7 Million per day) is searching for video games
Characteristics of Big Data How the Cloud Is Big Data’s Best Friend Big Data on the Cloud In the Real World