Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Copyright © 2013 Splunk Inc.
Big Data at the
Speed of Business
Isaac Mosquera
Director of Mobile, ShareThis
Clint Sharp
Pr...
What We’ll Talk About
• Our quest for visibility
• Analyzing at scale
• Splunk and Big Data
• Where do you start?
• Q&A
About Splunk
Company (NASDAQ: SPLK)
Founded 2004, first software release in 2006
HQ: San Francisco
Business Model / Produc...
About ShareThis and Socialize
ShareThis makes the world more
connected, trusted and valuable through sharing
Powers the so...
Evaluating 20 Billion
Ad Impressions Monthly
Copyright © 2013 Splunk Inc.
Copyright © 2013 Splunk Inc.
Copyright © 2013 Splunk Inc.
Copyright © 2013 Splunk Inc.
Copyright © 2013 Splunk Inc.
Copyright © 2013 Splunk Inc.
Copyright © 2013 Splunk Inc.
Final Architecture
RDBMS
(Generated
Reports)
S3
Snapshots
Search
Head
Socialize Bidder
Splunk
Indexer
Indexer
Indexer
Cach...
So, What is
Splunk?
14
Expanding Universe of Data Sources
Machine-generated DataBusiness Application Data Human-generated Data
Highly Structured ...
Industry Leading Platform for Machine Data
Any Machine Data Operational Intelligence
HA Indexes
and Storage
Commodity
Serv...
Analyzing Heterogeneous Data
Universal Index Schema-on-the-fly Flexibility and
Fast Time to Value
• No data normalization
...
Gain Critical Insights … in Real-time
Order ID
Customer’s Tweet
Time Waiting On Hold
Product ID
Company’s Name
Sources
Twi...
Deep Visibility and Insight for IT and Business
IT Operations Management Web Intelligence
Business AnalyticsApplication Ma...
Driving Insights
from Big Data
Hadoop
The ShareThis Insights Platform
On Father’s day:
“Who were the most shared about topics?”
“What type of type of bee...
Finding the Optimal Approach
Hadoop and MapReduce are great for complex data science on data
at rest – the previous archit...
What About
Ad Hoc Analysis?
PR Insights Example
What was the situation? (e.g. fast moving business, needed
real-time insights)
What was the PR team st...
PR Insights Dashboard
Let’s not forget
The low-hanging fruit
Operational Analytics for an Online World
website
API Notification
Google (GCM)
Feedback
Processor
Apple (APNS)
? !
Notifi...
One More Thing …
28
Copyright © 2013 Splunk Inc.
New product from Splunk
delivers interactive data
exploration, analysis and
visualizations fo...
Derive Actionable Insights from Raw Data
Hadoop
Storage
Immediately
start
exploring, analyz
ing and
visualizing raw
data i...
Learn More
31
splunk.com/bigdata
Copyright © 2013 Splunk Inc.
Questions?
Upcoming SlideShare
Loading in …5
×

Hadoop summit socialize_v1.0

456 views

Published on

Published in: Technology
  • Login to see the comments

  • Be the first to like this

Hadoop summit socialize_v1.0

  1. 1. Copyright © 2013 Splunk Inc. Big Data at the Speed of Business Isaac Mosquera Director of Mobile, ShareThis Clint Sharp Principal Big Data Product Manager, Splunk Copyright © 2013 Splunk Inc.
  2. 2. What We’ll Talk About • Our quest for visibility • Analyzing at scale • Splunk and Big Data • Where do you start? • Q&A
  3. 3. About Splunk Company (NASDAQ: SPLK) Founded 2004, first software release in 2006 HQ: San Francisco Business Model / Products Industry-leading machine data platform On-premise, in the cloud and SaaS 5,600+ Customers 63 of the Fortune 100 Largest license: 100 Terabytes per day #1 Big Data Innovator* * Fast Company's Most Innovative Companies Issue (March 2013)
  4. 4. About ShareThis and Socialize ShareThis makes the world more connected, trusted and valuable through sharing Powers the social web, touching the lives of 95 percent of U.S. Acquires Socialize, which makes mobile and social more engaging Socialized integrated into thousands of iOS and Android Apps Installed on 80M+ devices
  5. 5. Evaluating 20 Billion Ad Impressions Monthly
  6. 6. Copyright © 2013 Splunk Inc.
  7. 7. Copyright © 2013 Splunk Inc.
  8. 8. Copyright © 2013 Splunk Inc.
  9. 9. Copyright © 2013 Splunk Inc.
  10. 10. Copyright © 2013 Splunk Inc.
  11. 11. Copyright © 2013 Splunk Inc.
  12. 12. Copyright © 2013 Splunk Inc.
  13. 13. Final Architecture RDBMS (Generated Reports) S3 Snapshots Search Head Socialize Bidder Splunk Indexer Indexer Indexer Cache Cluster Memcache Memcache Memcache
  14. 14. So, What is Splunk? 14
  15. 15. Expanding Universe of Data Sources Machine-generated DataBusiness Application Data Human-generated Data Highly Structured Arbitrarily Structured 2012-12-05 07:04:44 Id=00Q000000Rd910EAJ City=New York Country=US CreatedDate=“2012-12-05 07:06:44” Email.jdoe@gmail.com Email_Opt_In_c Customer_Street _Address_c=“123 Main St.” purchased_product_id= product_i BD-01 twitter_username john_t_doe
  16. 16. Industry Leading Platform for Machine Data Any Machine Data Operational Intelligence HA Indexes and Storage Commodity Servers Developer Platform Custom dashboards Monitor and alert Ad hoc search Report and analyze
  17. 17. Analyzing Heterogeneous Data Universal Index Schema-on-the-fly Flexibility and Fast Time to Value • No data normalization • Automatically handles timestamps • Parsers not required • Index every term & pattern “blindly” • No attempt to “understand” up front • Structure applied at search-time • No brittle schema to work around • Automatically find transactions, patterns and trends • Normalization as it’s needed • Faster implementation • Easy search language • Multiple views into the same data
  18. 18. Gain Critical Insights … in Real-time Order ID Customer’s Tweet Time Waiting On Hold Product ID Company’s Name Sources Twitter Care IVR Middleware Error Order Processing Order ID Customer ID Twitter ID Customer ID Customer ID
  19. 19. Deep Visibility and Insight for IT and Business IT Operations Management Web Intelligence Business AnalyticsApplication Management Security and Compliance Industrial Data / Internet of Things Over 5,600 organizations using Splunk across IT and business users
  20. 20. Driving Insights from Big Data
  21. 21. Hadoop The ShareThis Insights Platform On Father’s day: “Who were the most shared about topics?” “What type of type of beers do people drink?” API ETL Pre- aggregation Analytics ?
  22. 22. Finding the Optimal Approach Hadoop and MapReduce are great for complex data science on data at rest – the previous architecture took 9 months with a team of engineers, data architects, etc. The Splunk platform delivers real-time, interactive analysis – we can build many of the same insights within 1 hour What should be the core focus or competency of your team? Conclusion: find the most optimal approach for the business
  23. 23. What About Ad Hoc Analysis?
  24. 24. PR Insights Example What was the situation? (e.g. fast moving business, needed real-time insights) What was the PR team struggling with? Difficult to find useful data to build interesting use-cases What did they want? They wanted a flexible real-time reporting environment to extract insights useful for the market How my team helped? Delivered a single dashboard that contained real-time data into the sharing behaviors across our network
  25. 25. PR Insights Dashboard
  26. 26. Let’s not forget The low-hanging fruit
  27. 27. Operational Analytics for an Online World website API Notification Google (GCM) Feedback Processor Apple (APNS) ? ! Notifications Systems Driving Superior Customer Experience How many 500 errors have I had over time? Look for anomalies and spikes! Zone in directly to the customer!! Online Device Notifications
  28. 28. One More Thing … 28
  29. 29. Copyright © 2013 Splunk Inc. New product from Splunk delivers interactive data exploration, analysis and visualizations for Hadoop Announcing Hunk Beta Splunk Analytics for Hadoop
  30. 30. Derive Actionable Insights from Raw Data Hadoop Storage Immediately start exploring, analyz ing and visualizing raw data in Hadoop 1 2Point Splunk at Hadoop Cluster Explore Analyze Visualize Dashboards Share
  31. 31. Learn More 31 splunk.com/bigdata
  32. 32. Copyright © 2013 Splunk Inc. Questions?

×