© This document contains confidential and proprietary information of Adroitent. It is furnished for evaluation purposes
only. Except with the express prior written permission of Adroitent, this document and the information contained herein
may not be published, disclosed, or used for any other purpose. | www.adroitent.com
Understanding Social
Media Analytics
SANDEEP SEERAPU
• Web is no longer a static library that people passively browse
• Web is a place where people:
o Consume and create content
o Interact with other people:
▪ Internet forums, Blogs, Social networks, Twitter, Wikis, Podcasts, Slide
sharing, Bookmark sharing, Product reviews, Comments, …
• DATA POINT: Facebook traffic tops Google (for USA)
• March 2010: FB > 7% of US traffic
http://money.cnn.com/2010/03/16/technology/facebook_most_visited
Social Media : Big Change
• Rich and big data:
• Billions of users, billions of pieces of content
• Textual, Multimedia (image, videos, etc.)
• Billions of connections
• Behaviours, preferences, trends...
• Data is open and easy to access
• It’s easy to get data from Social Media
• Datasets
• Developer APIs
• Spidering the Web
Social Media : Rich and Big data
Social Media : Opportunities
Any user can share and contribute content, express opinions, and link to others.
This means we can data-mine the opinions and behaviours of millions of users to gain
insights into:
• Human behaviour
• Marketing analytics
• Product sentiment
What can we do with this data?
• Consumer Brand Analytics
• What are people saying about our brand?
• Marketing Communications
• Significant spending on marketing, advertising:
• Companies trying to position their products
• Brand analytics helps to determine whether such campaigns are effective
• Product reviews
• Automatically mine product reviews for information on product features, new
requests, …
• Easy to use, Comfortable chair, Light weight, Sturdy, Good price
Applications: Reputation Management
• Citizen response
• Solicit citizen feedback on bills debated in Congress
• What new issues are being raised, and which aspects of a bill are popular or unpopular
• Political Campaigns
• Why do people support a candidate?
• Law enforcement
• Gang members boast about their activities on Facebook
• Protests being planned through Twitter
• NYT: Sending the Police Before There’s a Crime
http://www.nytimes.com/2011/08/16/us/16police.html?_r=1
Applications: Citizen Response
• Viral marketing:
• Personalized recommendations
• Online forum users are brand advocates:
• 79.2% of forum contributors help a friend to make a decision about a product
purchase (47.6% of non-contributors).
• 65% of forum contributors share advice (offline and in person) based on
information that they’ve read online (35% of non-contributors)
http://www.socialmediaexaminer.com/new-studies-show-value-of-social-media
Applications: Social Media Marketing
Information Flow
How do we capture and model
the flow of information?
Given that social media generate a wealth of consumer data, how can brands turn raw
social media comment data from Twitter, Facebook, blogs, and forums into actionable
business insights? The answer lies in the application of text-mining and semantic
technology to these new sources of unstructured data.
How does it work?
• Text mining is similar to data mining in that it is aimed at identifying interesting patterns
in data
• The first step in any text-mining effort is to identify the text-based sources to be
analysed and to gather this material, either through information retrieval or by selecting
the corpus: the set of textual files and content of interest.
• Extensive NLP is then applied: tokenizing the text, using part-of-speech tagging and text
sequencing to parse for syntax, and applying Named Entity Recognition (that is,
identifying mentions of brands, people's names, places, common abbreviations, and
so on).
Text mining and semantic methods
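The steps above (select a corpus, tokenize, recognize named entities) can be sketched in a few lines of Python. This is only a toy illustration: the two-document corpus and the `GAZETTEER` lookup table are hypothetical, and a simple dictionary match stands in for the statistical Named Entity Recognition that a real NLP toolkit would provide.

```python
import re

# Hypothetical gazetteer: entity surface form -> entity type.
# Real NER models learn this from data; a lookup table is a simplification.
GAZETTEER = {
    "twitter": "BRAND",
    "facebook": "BRAND",
    "new york": "PLACE",
}


def tokenize(text):
    """Split raw text into lowercase word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def find_entities(tokens, gazetteer=GAZETTEER, max_len=2):
    """Return (entity, type) pairs, preferring the longest match at each position."""
    found = []
    i = 0
    while i < len(tokens):
        for n in range(max_len, 0, -1):
            if i + n > len(tokens):
                continue  # not enough tokens left for an n-word phrase
            candidate = " ".join(tokens[i:i + n])
            if candidate in gazetteer:
                found.append((candidate, gazetteer[candidate]))
                i += n
                break
        else:
            i += 1  # no entity starts at this token
    return found


# Hypothetical corpus: the "set of textual content of interest".
corpus = [
    "Protests in New York were planned through Twitter.",
    "I read the product reviews on Facebook.",
]

for doc in corpus:
    print(find_entities(tokenize(doc)))
```

Matching the longest phrase first keeps "New York" from being missed because "new" alone is not in the table.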
Unique challenges exist when setting out to apply text mining to social media
data. The data that social networking sites, blogs, and forums generate falls in
the category of what is commonly referred to as big data. The data is
unstructured and semi-structured, petabytes are generated around larger
brands on a daily basis, and traditional relational databases cannot efficiently
scale to support real-time analytics on this data. Big data platforms and NoSQL
databases are therefore typically used instead.
Social media datamarts and big data
There are several commercial and open source options for text-mining software and
applications.
Of the open source text mining tools, RapidMiner and R appear to be two of the most
popular. R has a wider user base and a large selection of algorithms, but as a programming
language it requires writing source code. Scalability is also an issue with R, so it's
not ideal for large datasets without workarounds. RapidMiner has a smaller user base, but
it doesn't require source code and has a powerful user interface (UI).
Embedded is a list of other Text Mining tools:
Text mining tools
Who does this Text Mining?
Spinn3r is a web service that provides raw access to posts, articles, tweets, status
updates, etc. as they are published, in real or near real time, allowing you to focus on building
your application, mashup, or search engine. Spinn3r finds the sources, indexes their content and
takes care of all the heavy lifting around delivering large amounts of relevant data.
It publishes an API that companies can use to build analytic products on top of this data.
• Spinn3r Dataset: http://spinn3r.com
• 30 million articles/day (50GB of data)
• 20,000 news sources + millions of blogs and forums
• And lots of Tweets and public Facebook posts
Gnip and DataSift are among the many others who provide this
kind of dataset
Dataset Providers
Now that you have the Datasets,
What Next?
Product Companies
There are many product companies that use these datasets to build analytical products
for organizations:
InsideView
With InsideView CRM+, your marketing, sales, and service teams can:
• Research market, company, contact, and competitor information
• Use real-time news and social network connections to target new leads and engage with
customers
• Enrich leads to help sales move from lead to win
• Use one-click CRM integration to update leads and contacts in your CRM
Tealeaf
Tealeaf's Customer Behavior Analysis Suite
• Improving online customer experience is a top priority for many organizations, and
Tealeaf's Customer Behavior Analysis Suite was created with this goal in mind. By
using cxImpact, cxResults and cxView in concert, companies have both the
quantitative data and the qualitative experience information necessary to
understand customers' true experiences
And similarly
A further list of product companies that provide analytical tools built on datasets:
www.sprinklr.com
www.leadformix.com
www.xactlycorp.com
www.moxiesoft.com
www.synaptris.com
www.quinstreet.com
www.enirogroup.com/en
www.saama.com
www.mu-sigma.com
And many more..
Conceptually, what do these
tools provide?
Sentiment analysis depends on an appropriate subjectivity lexicon that captures the
relative positive, neutral or negative polarity of a word or expression in context. It is both
language and context specific.
A good example can be seen below:
I find PRODUCTX to be very good and useful, but it is a bit too expensive.
The expression (and therefore PRODUCTX) is rated as positive, since there are two
positive words, “good” and “useful”, and only one negative word, “expensive”. In addition, one
of the positive words is strengthened by the word “very”, while the negative word is put
into perspective by the qualifier “a bit”. The more advanced the lexica, the more detailed
the analysis and the findings can be.
Sentiment analysis is a well-established, stand-alone predictive analytic technique.
Sentiment Analysis: Predictive Analytic Technique
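The PRODUCTX example above can be reproduced with a minimal lexicon-based scorer. This is a sketch under stated assumptions, not how any particular product works: the word weights in `LEXICON`, the multipliers in `MODIFIERS` (1.5 for “very”, 0.5 for “a bit”) and the two-token `WINDOW` are all illustrative values chosen for this example.

```python
import re

# Hypothetical subjectivity lexicon: word -> polarity weight (assumed values).
LEXICON = {"good": 1.0, "useful": 1.0, "expensive": -1.0}

# Hypothetical modifiers: "very" strengthens, "(a) bit" weakens (assumed values).
MODIFIERS = {"very": 1.5, "bit": 0.5}

# A modifier affects a sentiment word at most this many tokens later.
WINDOW = 2


def score(text):
    """Sum the modifier-adjusted polarity of every lexicon word in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = 0.0
    multiplier, ttl = 1.0, 0
    for tok in tokens:
        if tok in MODIFIERS:
            multiplier, ttl = MODIFIERS[tok], WINDOW
        elif tok in LEXICON:
            total += LEXICON[tok] * multiplier
            multiplier, ttl = 1.0, 0  # modifier is consumed
        elif ttl > 0:
            ttl -= 1
            if ttl == 0:
                multiplier = 1.0  # modifier expired without a target
    return total


def label(text):
    """Map the numeric score to a positive / neutral / negative label."""
    s = score(text)
    return "positive" if s > 0 else "negative" if s < 0 else "neutral"


SENTENCE = "I find PRODUCTX to be very good and useful, but it is a bit too expensive."
print(score(SENTENCE), label(SENTENCE))  # 2.0 positive
```

Here “very good” contributes 1.5, “useful” 1.0 and “a bit too expensive” -0.5, giving 2.0 overall. A real lexicon would also need to handle negation (“not good”), which this sketch ignores.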
These tools are generally cloud-based applications that pull together many different social media
data sources (datasets), including communities and blogs. They are able to do
this because they generally incorporate a massive back-end infrastructure that constantly
crawls and captures new data from the APIs as it appears.
They all provide an interface to filter the data and enter selection criteria to look across a
broad range of channel choices. The results usually take some form of a visual scorecard
that combines different graphical and tabular techniques for displaying the summarized
information. Many allow an interactive “drill down” to see further details, and most of them
let you drill right through to the original source of the data.
Social Media Scorecards
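The filter, summarize and drill-through flow described above can be sketched as follows. Everything here is hypothetical: the `POSTS` records, their field names and the `example.com` URLs are made up for illustration, and the sentiment labels are assumed to have been assigned by an earlier analysis step.

```python
from collections import Counter, defaultdict

# Hypothetical, hand-labelled sample records; a real tool would pull these
# from dataset providers and score them automatically.
POSTS = [
    {"channel": "twitter", "text": "PRODUCTX is very good",
     "sentiment": "positive", "url": "https://example.com/1"},
    {"channel": "twitter", "text": "PRODUCTX is a bit too expensive",
     "sentiment": "negative", "url": "https://example.com/2"},
    {"channel": "blog", "text": "Long review of PRODUCTX, useful overall",
     "sentiment": "positive", "url": "https://example.com/3"},
    {"channel": "forum", "text": "Unrelated post about chairs",
     "sentiment": "neutral", "url": "https://example.com/4"},
]


def build_scorecard(posts, keyword, channels=None):
    """Filter posts by keyword and channel, then summarize sentiment per channel.

    Returns (counts, sources): counts[channel][sentiment] is the summary table,
    and sources[(channel, sentiment)] lists the original URLs for drill-through.
    """
    keyword = keyword.lower()
    counts = defaultdict(Counter)
    sources = defaultdict(list)
    for post in posts:
        if channels is not None and post["channel"] not in channels:
            continue
        if keyword not in post["text"].lower():
            continue
        counts[post["channel"]][post["sentiment"]] += 1
        sources[(post["channel"], post["sentiment"])].append(post["url"])
    return counts, sources


counts, sources = build_scorecard(POSTS, "productx")
for channel, tally in sorted(counts.items()):
    print(channel, dict(tally))

# Drill through from a summary cell to the original posts.
print(sources[("twitter", "negative")])
```

Keeping the source URLs next to the aggregated counts is what makes the drill-through possible; a summary that stored only totals could not link back to the original posts.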
Technologies Used by these Product Companies
Big Data Technologies:
• Hadoop ecosystem (HDFS, Pig, Hive, Oozie, HBase, Mahout),
• Cloudera (CDH3 & CDH4) distributions,
• PostgreSQL + PostGIS,
• Cassandra
Languages:
• Java,
• Perl
Cloud computing technologies:
• Amazon Web Services (AWS) / Amazon EC2,
• Amazon S3,
• Amazon EMR,
• Amazon CloudWatch

Understanding Social Media Analytics : Big Picture

  • 1. © This document contains confidential and proprietary information of Adroitent. It is furnished for evaluation purposes only. Except with the express prior written permission of Adroitent, this document and the information contained herein may not be published, disclosed, or used for any other purpose. | www.adroitent.com Understanding Social Media Analytics SANDEEP SEERAPU
  • 2. • Web is no longer a static library that people passively browse • Web is a place where people: o Consume and create content o Interact with other people:  Internet forums, Blogs, Social networks, Twitter, Wikis, Podcasts, Slide sharing, Bookmark sharing, Product reviews, Comments, … • DATA POINT: Facebook traffic tops Google (for USA) • March 2010: FB > 7% of US traffic http://money.cnn.com/2010/03/16/technology/facebook_most_visited Social Media : Big Change
  • 3. • Rich and big data: • Billions users, billions contents • Textual, Multimedia (image, videos, etc.) • Billions of connections • Behaviours, preferences, trends... • Data is open and easy to access • It’s easy to get data from Social Media • Datasets • Developers APIs • Spidering the Web Social Media : Rich and Big data
  • 4. Social Media : Opportunities Any user can share and contribute content, express opinions, link to others This means: Can data-mine opinions and behaviours of millions of users to gain insights into: • Human behaviour • Marketing analytics • Product sentiment
  • 5. What can we do with this data?
  • 6. • Consumer Brand Analytics • What are people saying about our brand? • Marketing Communications • Significant spending on marketing, advertising: • Companies trying to position their products • Brand analytics helps to determine whether such campaigns are effective • Product reviews • Automatically mine product reviews for information on product features, new requests, … • Easy to use, Comfortable chair, Light weight, Sturdy, Good price Applications: Reputation Management
  • 7. • Citizen response • Solicit citizen feedback on bills debated in Congress • What new issues are being raised, what aspects of bill are popular, unpopular • Political Campaigns • Why do people support a candidate? • Law enforcement • Gang members boast about their activities on Facebook • Protests being planned through Twitter • NYT: Sending the Police Before There’s a Crime http://www.nytimes.com/2011/08/16/us/16police.html?_r=1 Applications: Citizen Response
  • 8. • Viral marketing: • Personalized recommendations Online forum users are • Brand advocates: • 79.2% of forum contributors help a friend to make a decision about a product • purchase (47.6% of non-contributors). • 65% of forum contributors share advice (offline and in person) based on information that they’ve read online (35% of non-contributors) http://www.socialmediaexaminer.com/new-studies-show-value-of-social-media Applications: Social Media Marketing
  • 9. Information Flow How do we capture and model the flow of information?
  • 10. Given that social media generate a wealth of consumer data, how can brands turn raw social media comment data from Twitter, Facebook, blogs, and forums into actionable business insights? The answer lies in the application of text-mining and semantic technology to these new sources of unstructured data. How does it work? • Text mining is similar to data mining in that it is aimed at identifying interesting patterns in data • The first step in any text-mining effort is to identify the text-based sources to be analysed and gather this material through information retrieval or selecting the corpus that comprises the set of textual files and content of interest. • Extensive NLP is deployed that invokes "part of speech tagging" and text sequencing to parse for syntax (that is, tokenizing text) and applying Named Entity Recognition (that is, identifying the mention of brands, people's names, places, common abbreviations, and so on). Text mining and semantic methods
  • 11. Unique challenges exist when setting out to apply text mining to social media data. The data that social networking sites, blogs, and forums generate falls in the category of what is commonly referred to as big data. The data is unstructured and semi-structured, petabytes are generated around larger brands on a daily basis, and traditional relational databases cannot efficiently scale to support real-time analytics based on the data. Big data and NoSQL database solutions are therefore required. Social media datamarts and big data
  • 12. There are several commercial and open source options for text-mining software and applications. Of the open source text mining tools, RapidMiner and R appear to be two of the most popular. R has a wider user base; a programming language in which source code is required, it has a large selection of algorithms. However, scalability is an issue with R so it's not ideal for large datasets without workarounds. RapidMiner has a smaller user base, but it doesn't require source code and has a powerful user interface (UI). Embedded is a list of other Text Mining tools: Text mining tools
  • 13. Who does this text mining?
  • 14. Spinn3r is a web service that provides raw access to posts, articles, tweets, status updates, etc. as they are published, in real or near real time, allowing you to focus on building your application, mashup, or search engine. Spinn3r finds the sources, indexes their content, and takes care of all the heavy lifting around delivering large amounts of relevant data. It publishes an API for companies to build analytics products on top of this data. • Spinn3r Dataset: http://spinn3r.com • 30 million articles/day (50GB of data) • 20,000 news sources + millions of blogs and forums • And lots of tweets and public Facebook posts Gnip and DataSift are among the many other providers of these kinds of datasets. Dataset Providers
  • 15. Now that you have the datasets, what next?
  • 16. Product Companies There are many product companies that use these datasets to build analytical products for organizations: InsideView With InsideView CRM+, your marketing, sales, and service teams can: • Research market, company, contact, and competitor information • Use real-time news and social network connections to target new leads and engage with customers • Enrich leads to help sales move from lead to win • Use one-click CRM integration to update leads and contacts in your CRM Tealeaf Tealeaf's Customer Behavior Analysis Suite • Improving online customer experience is a top priority for many organizations, and Tealeaf's Customer Behavior Analysis Suite was created with this goal in mind. By using cxImpact, cxResults and cxView in concert, companies have both the quantitative data and the qualitative experience information necessary to understand customers' true experiences
  • 17. Similarly, a further list of product companies that provide analytical tools built on these datasets: www.sprinklr.com www.leadformix.com www.xactlycorp.com www.moxiesoft.com www.synaptris.com www.quinstreet.com www.enirogroup.com/en www.saama.com www.mu-sigma.com And many more.
  • 18. Conceptually, what do these tools provide?
  • 19. Sentiment analysis depends on an appropriate subjectivity lexicon that captures the relative positive, neutral or negative sense of a word or expression. It is both language and context specific. For example: “I find PRODUCTX to be very good and useful, but it is a bit too expensive.” The expression (and therefore PRODUCTX) is rated as positive, since there are two positive words, “good” and “useful”, and only one negative word, “expensive”. In addition, one of the positive words is intensified by the word “very”, while the negative word is softened by the qualifier “a bit”. The more advanced the lexica, the more detailed the analysis and the findings can be. Sentiment analysis is a well-established, stand-alone predictive analytic technique. Sentiment Analysis: Predictive Analytic Technique
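The PRODUCTX example from slide 19 can be reproduced with a minimal lexicon-based scorer. This is a sketch, not how any particular product works: the `LEXICON` and `MODIFIERS` tables, the function names, and all numeric weights (1.5 for an intensifier, 0.5 for a softener) are assumptions chosen only to illustrate the idea.

```python
import re

# Hypothetical subjectivity lexicon (assumed values): word -> polarity.
LEXICON = {"good": 1.0, "useful": 1.0, "expensive": -1.0}

# Hypothetical modifiers (assumed weights): the phrase directly before a
# sentiment word scales its polarity. Longer phrases are listed first.
MODIFIERS = {
    ("very",): 1.5,            # intensifier
    ("a", "bit", "too"): 0.5,  # softener
    ("a", "bit"): 0.5,         # softener
}


def score(text):
    """Sum the modifier-weighted polarities of all lexicon words in text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = 0.0
    for i, token in enumerate(tokens):
        if token not in LEXICON:
            continue
        weight = 1.0
        for phrase, factor in MODIFIERS.items():
            start = i - len(phrase)
            if start >= 0 and tuple(tokens[start:i]) == phrase:
                weight = factor
                break
        total += LEXICON[token] * weight
    return total


def label(text):
    """Map the numeric score to positive, negative, or neutral."""
    value = score(text)
    if value > 0:
        return "positive"
    if value < 0:
        return "negative"
    return "neutral"


sentence = "I find PRODUCTX to be very good and useful, but it is a bit too expensive."
print(score(sentence), label(sentence))
```

Under these assumed weights, "very good" contributes 1.5, "useful" 1.0, and "a bit too expensive" -0.5, so the sentence is labelled positive, matching the reasoning on the slide.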
  • 20. These tools are generally cloud-based applications that pull together many different social media data sources (datasets), including communities and blogs. They are able to do this because they generally incorporate a massive back-end infrastructure that constantly crawls the sources' APIs and captures new data as it appears. They all provide an interface to filter the data and enter selection criteria across a broad range of channels. The results usually take the form of a visual scorecard that combines different graphical and tabular techniques for displaying the summarized information. Many allow an interactive “drill down” to see further details, and most let you drill right through to the original source of the data. Social Media Scorecards
  • 21. Technologies Used by these Product Companies Big Data Technologies: • Hadoop frameworks (HDFS, Pig, Hive, Oozie, HBase, Mahout) • Cloudera distributions (CDH3 & CDH4) • PostgreSQL + PostGIS • Cassandra Languages: • Java • Perl Cloud computing technologies: • Amazon Web Services (AWS) / Amazon EC2 • Amazon S3 • Amazon EMR • Amazon CloudWatch