Introduction to Big Data for LABDUG

573 views

Published on

My slides for the 1st meeting of the LA Big Data User Group meetup event: "Introduction to Big Data"

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
573
On SlideShare
0
From Embeds
0
Number of Embeds
5
Actions
Shares
0
Downloads
12
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Introduction to Big Data for LABDUG

  1. 1. Introduction to Big Data Daniel D. Gutierrez, Data Scientist AMULET Analytics March 2014
  2. 2. / page 2
  3. 3. / page 3 Not Everyone Likes the “Big Data” Hype
  4. 4. / page 4 Volume is a Big Reason for Big Data
  5. 5. / page 5
  6. 6. / page 6 Economist February 27, 2010 Profiled “Big Data”
  7. 7. / page 7
  8. 8. Big Data – “large data sets so big that commonly-used software tools are unable to capture, curate, manage, and process the data within a tolerable elapsed time.” Hadoop Dominates Big Data market – Used widely by some of the world's largest websites, such as Facebook, eBay, Amazon and Yahoo – Moving into the enterprise – Invented by developers at Yahoo! / page 8 What is Big Data? Apache Hadoop
  9. 9. / page 9
  10. 10. / page 10 Characteristics of Big Data Component Parts Big Data is facilitated by Data Science Data Science is facilitated by Machine Learning Machine Learning is a confluence of disciplines: computer science, mathematical statistics, probability theory, visualization, etc. What is the “New” Part of Big Data “Big” is new, more data to manage than ever before Traditional data content is now coupled with internal and external sources of unstructured data via social media New forms of analysis such as sentiment and credibility analysis Bubble Brewing? Circa 2000 and the Internet bubble event. Will it occur again? A bubble may occur, but not because of Big Data
  11. 11. / page 11 Applications for Big Data Smarter Healthcare Multi-channel sales Financial Services Log Analysis Homeland Security Traffic Control Telecom Search Quality Manufacturing Trading Analytics Fraud and Risk Retail: Churn “Big Data is the definitive source of competitive advantage across all industries. For those organizations that understand and embrace the new reality of Big Data, the possibilities for new innovation, improved agility, and increased profitability are nearly endless.” Source: Wikibon 2012
  12. 12. / page 12
  13. 13. © 2014 AMULET Analytics. All rights reserved.
  14. 14. Thank you! Follow me: @AMULETAnalytics Contact me: daniel@amuletanalytics.com www.amuletanalytics.com

×