SlideShare a Scribd company logo
Social Media with Big Data Analytics
Mohammed Zuhair Al-Taie
Big Data Centre - Universiti Teknologi Malaysia - 2016
AGENDA
Web 2.0
Social Media
Big Data
Social Media with Big Data Analytics
Social Network Analysis
Sentiment Analysis
Web 2.0 is
A Complex,
Organic Online
Conversation
WHAT IS WEB 2.0?
Web 2.0 is powered by:
• Social Networks
•News and
Bookmarking
•Blogs
•Microblogging
•Video/Photo-sharing
•Message Boards
•Wikis
•Virtual reality
•Social gaming
•Podcasts
•Real Simple
syndication (RSS)
•Social Media Press
Release
TECHNOLOGY OVERVIEW
Search: The ease of finding information through keyword search
Links: Ad-hoc guides to other relevant information
Authoring: The ability to create constantly updating content over a platform
that is shifted from being the creation of a few to being constantly updated,
interlinked work.
Tags: Categorization of content by creating tags: simple,one-word user-
determined descriptions to facilitate searching and avoid rigid, pre-made
categories
Extensions: Powerful algorithms that leverage the Web as an application
platform as well as a documentserver
Signals: The use of RSS technology to rapidly notify users of content changes
Web 2.0 websites typically include some of the following features/techniques-
SLATES
Social media:
is an umbrella
term that
defines the
various activities
that integrate
technology,
social
interaction, and
the construction
of words,
pictures, videos
and audio.
WEB 2.0 TECHNOLOGIES:
SOCIAL MEDIA
“Creation of web content, by the
people, for the people”
In Simple Language…
SOCIAL MEDIA PLATFORMS
WHAT HAPPENS EVERY 1 MIN?
 Variety of sources from where data is being
generated has also undergone a shift
 The types of data being created has changed
from structured to semi-structured to
unstructured data
Structured
Data
Semi-
Structured
Data
Unstructured
Data Need to manage broad range of data types
 Process analytic queries across numerous data
types
 Need to extract meaningful analysis from this
data has led to several technologies to gain
traction
 Examples include NoSQL databases to store
unstructured data as well as innovative
processing methods like Hadoop and massive
parallel processing (MPP)
Today 80% Of Data Existing In
Any Enterprise Is Unstructured
Data
Unstructured data from social
media has to be approached in a
non traditional manner.
UNSTRUCTURED DATA
Facebook
- User Likes and
Favorites
- Article/Video/Link
Shares
- Views
- Comments
- Location / Geospatial
Twitter
Tweet Characteristics
- Length
- Language Model
- Semantics
- Emoticons
- Location / Geospatial
Google / You Tube
- Blogs
- Comments
- Search Statistics
- Likes vs Dislikes
- Shares / Views /
Comments
IDENTIFYING UNSTRUCTURED DATA
SOURCES
“Big Data”
is data whose
scale, diversity,
and complexity
require new
architecture,
techniques,
algorithms, and
analytics to
manage it and
extract value and
hidden knowledge
from it…
BIG DATA IS…
BIG DATA =
BIG DATA VS USUAL DATA
Implication for an organization
2009 2011 2015 2020
0.8
1.9
7.9
35.0
CAGR
(2009-2020)
41.0%
Zetabytes
THE GLOBAL DATA GROWTH
>3,500
>40
>2,000
>200
>400
 Key verticals: Healthcare,
Manufacturing, Retail, Digital
Marketing
 Demand trend: High demand
of Big Data analytics
>250
 Key verticals: Telecom, Retail, Banking
 Demand trend: Still embryonic; most
organizations have wait and watch approach
 Demand trend: Current demand
appears to be limited, however,
lack of skills may drive
outsourcing of Big Data analytics
 Low awareness levels
 Key verticals: Technology, Financial services,
Oil & Gas, Utilities, Manufacturing
 Demand trend: European MNC’s are still in
the early stages of the adoption cycle
North
America
South America
Europe
Middle East
India
China
Japan
 Key verticals: Manufacturing,
Telecom, Health & Life Sciences
 Demand trend: Demand for BI
to derive operational efficiency
 Key verticals: Telecom, Bioinformatics,
Retail
 Demand trend: Industry is in nascent stage
with demand catching up, particularly in retail
>50
16
NORTH AMERICA & EUROPE DRIVES THE BIG DATA
OPPORTUNITY WITH OVER 85%
OF THE WORLD’S DATA
Tools Description
The Hadoop
Distributed
File System
(HDFS)
HDFS divides the data into smaller parts and distributes
it across the various servers/nodes
SQL Server
Integration
Service
These tools allow posts can be downloaded and loaded
into Hadoop
Apache
Flume
MapReduce
MapReduce is a process that transforms data loaded
into Hadoop into a format that can be used for analysis.
Hive
a runtime Hadoop support architecture that leverages
Structure Query Language (SQL) with the Hadoop
platform.
Jaql Jaql converts high-level queries into low-level queries
and
Zookeeper Zookeeper coordinate parallel processing across big
clusters
HBase HBase is a column-oriented database management
system that sits on top of HDFS by using a non-SQL
approach.
BIG DATA TOOLS
Variety
Veracity
Value
BIG DATA IS OFTEN DESCRIBED USING
FIVE Vs
Volume
refers to the vast amounts of
data generated every second.
We are not talking Terabytes
but Zettabytes or Brontobytes.
If we take all the data
generated in the world
between the beginning of time
and 2008, the same amount of
data will soon be generated
every minute.
This makes most data sets too
large to store and analyse
using traditional database
technology.
Variety
Veracity
Value
BIG DATA: VOLUME
BIG DATA: VELOCITY
Variety
Veracity
Value
Velocity
refers to the speed at which
new data is generated and
the speed at which data
moves around. Just think of
social media messages
going viral in seconds.
Technology allows us now to
analyse the data while it is
being generated
(sometimes referred to as
in-memory analytics),
without ever putting it into
databases.
Variety
Veracity
Value
Variety
refers to the different types
of data we can now use. In
the past we only focused on
structured data that neatly
fitted into tables or
relational databases, such
as financial data. In fact,
80% of the world’s data is
unstructured (text, images,
video, voice, etc.)
BIG DATA: VARIETY
Variety
Veracity
Value
Veracity
refers to the messiness or
trustworthiness of the data.
With many forms of big
data quality and accuracy
are less controllable (just
think of Twitter posts with
hash tags, abbreviations,
typos and colloquial speech
as well as the reliability and
accuracy of content) but
technology now allows us to
work with this type of data.
BIG DATA: VERACITY
Variety
Veracity
Value
VALUE
Then there is another V to
take into account when
looking at Big Data: Value!
Having access to big data is
no good unless we can turn
it into value.
Companies are starting to
generate amazing value
from their big data.
BIG DATA: VALUE
THE INTERSECTION OF SOCIAL MEDIA
AND BIG DATA
 Big Data is also characterized by
velocity or speed i.e. frequency of
data generation or the frequency of
data delivery
 New age communication channels
such as mobile phones, emails, social
networking has increased the rate of
information flows
Examples:
 Telcos adopting location based
marketing based on user location
sensed by mobile towers
 Satellite images can help monitor
and analyze troop movements, a
flood plane, cloud patterns, or forest
fires
 Video analysis systems could monitor
a sensitive or valuable facility,
watching for possible intruders and
alert authorities in real time
Big Data velocity enabling real
time use of data
Data
velocity
per
minute
600+
videos on
YouTube
200
million+
emails sent
2
million+
Google
search
queries
400,000+
minutes of
Skype
calling
400,000+
tweets on
Twitter
US$
300,000+
are spent
on online
shopping
700,000+
Facebook
updates
7,000+
photos on
flickr
1,500+
blog posts
3500+
ticks per
minute in
securities
trading
BIG DATA & REAL TIME USE
BIG DATA FOR SOCIAL MEDIA ANALYTICS
PROCESS MODEL
CONCEPTUAL VIEW OF FRAMEWORK FOR BIG DATA
EXTRACTION, MESSAGING AND STORE
This phase has a composite pattern that is
based on the store-and-explore and focuses on
obtaining and storing the relevant data from
sources outside our establishment.
CONCEPTUAL VIEW OF DISCUSSION TOPIC AND
OPINION ANALYSIS COMPONENT
This phase has a composite pattern that is based on
purposeful-and-predictive analytics to gain advanced
insight.
WHAT IS HADOOP?
*Hadoop is an open source
framework which is used for
storing and processing the
large scale of data sets on
large clusters of hardware.
*The specialty of Hadoop
involves in HDFS which is used
for storing data on large
commodity machines and
provides very huge bandwidth
for the cluster.
CONCEPTUAL VIEW OF APACHE HADOOP
ARCHITECTURE
CONCEPTUAL VIEW OF DATA VISUALIZATION AND
DECISION-MAKING COMPONENT
This project has a composite pattern based on
actionable-analysis with the aim of taking the next best
actions that leads to take appropriate actions by
related customers.
SOCIAL NETWORK ANALYSIS
A GLOBAL SOCIAL NETWORK
NETWORK PERSPECTIVE
WHY SOCIAL NETWORK ANALYSIS
MATTERS?
SOCIAL NETWORK ANALYSIS: THE NEW
SCIENCE OF NETWORKS
Sentiment analysis…
• Analyzes people’s sentiments,
opinions, appraisals, attitudes,
evaluations, and emotions
• Towards entities such as
organizations, products,
services, individuals, topics,
issues, events, and their
attributes
• As presented online via text,
video and other means of
communication.
• These communications can fall
into three broad categories:
positive, neutral or negative.
SENTIMENT ANALYSIS
We can inquire about sentiment at
various linguistic levels:
O Words – objective, positive,
negative, neutral
O Clauses – “going out of my
mind”
O Sentences – possibly multiple
sentiments
O Documents
LEVEL OF ANALYSIS
Elections 2012 Dashboard
FILTER BY:
Facebook
Twitter
Google
Mitt Romney
RepublicanPrimary
Democratic Vote
Republican Vote
Democratic Sentiment
Republican Sentiment
TRUTHY: A SOCIAL MEDIA RESEARCH
PROJECT
Truthy is a research project to study how memes spread on social
media. A meme is a transmissible unit of information, such as a hashtag,
phrase, or link. This website highlights some of the research coming from
this effort and showcases some visualizations, tools, and data resources
demonstrating broader impacts of the project.
Social media with big data analytics

More Related Content

What's hot

Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
Impetus Technologies
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
Md. Salman Ahmed
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Ghulam Imaduddin
 
Machine learning with Big Data power point presentation
Machine learning with Big Data power point presentationMachine learning with Big Data power point presentation
Machine learning with Big Data power point presentation
David Raj Kanthi
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
Randy L. Archambault
 
Big data
Big dataBig data
Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data PresentationMatthew Urdan
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture Capabilities
Ashraf Uddin
 
Big data
Big dataBig data
Big data
Ami Redwan Haq
 
Social Media Mining and Analytics
Social Media Mining and AnalyticsSocial Media Mining and Analytics
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
Mohit Saini
 
Social media data analytics
Social media data analyticsSocial media data analytics
Social media data analytics
AujaswiAgarwal1
 
Big Data
Big DataBig Data
Big Data
Seminar Links
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
Shatavisha Roy Chowdhury
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
Vikram Nandini
 

What's hot (20)

Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big data
Big dataBig data
Big data
 
Machine learning with Big Data power point presentation
Machine learning with Big Data power point presentationMachine learning with Big Data power point presentation
Machine learning with Big Data power point presentation
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
 
Big data
Big dataBig data
Big data
 
Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data Presentation
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture Capabilities
 
Big data
Big dataBig data
Big data
 
Social Media Mining and Analytics
Social Media Mining and AnalyticsSocial Media Mining and Analytics
Social Media Mining and Analytics
 
Big data
Big dataBig data
Big data
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Data analytics
Data analyticsData analytics
Data analytics
 
Social media data analytics
Social media data analyticsSocial media data analytics
Social media data analytics
 
Big Data
Big DataBig Data
Big Data
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 

Similar to Social media with big data analytics

Big data
Big dataBig data
Big data
Pietro Nardone
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache Hadoop
Suman Saurabh
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
Ajay Ohri
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
Rajesh Kumar
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
Nitesh Ghosh
 
Big Data
Big DataBig Data
Big Data
Kirubaburi R
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdf
rajsharma159890
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Scott Mitchell
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
Bigdata Meetup Kochi
 
Bigdata overview
Bigdata overviewBigdata overview
Bigdata overview
AllsoftSolutions
 
BigData
BigDataBigData
BigData
Viveka Sharma
 
Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!
Memoori
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
TDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DWTDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DW
Jeannette Browning
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
IRJET Journal
 
Introduction to BIG DATA
Introduction to BIG DATA Introduction to BIG DATA
Introduction to BIG DATA
Zeeshan Khan
 
Cloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdfCloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdf
kalai75
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
himanshu arora
 
Real-time Analytics in Big data
Real-time Analytics in Big dataReal-time Analytics in Big data
Real-time Analytics in Big data
Pratiksha Manan
 

Similar to Social media with big data analytics (20)

Big data
Big dataBig data
Big data
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache Hadoop
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
Big Data
Big DataBig Data
Big Data
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdf
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Bigdata overview
Bigdata overviewBigdata overview
Bigdata overview
 
BigData
BigDataBigData
BigData
 
Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
TDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DWTDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DW
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
 
Introduction to BIG DATA
Introduction to BIG DATA Introduction to BIG DATA
Introduction to BIG DATA
 
Cloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdfCloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdf
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Real-time Analytics in Big data
Real-time Analytics in Big dataReal-time Analytics in Big data
Real-time Analytics in Big data
 

More from Universiti Technologi Malaysia (UTM)

A self organizing communication model for disaster risk management
A self organizing communication model for disaster risk managementA self organizing communication model for disaster risk management
A self organizing communication model for disaster risk management
Universiti Technologi Malaysia (UTM)
 
Spark Working Environment in Windows OS
Spark Working Environment in Windows OSSpark Working Environment in Windows OS
Spark Working Environment in Windows OS
Universiti Technologi Malaysia (UTM)
 
Python networkx library quick start guide
Python networkx library quick start guidePython networkx library quick start guide
Python networkx library quick start guide
Universiti Technologi Malaysia (UTM)
 
Python 3.x quick syntax guide
Python 3.x quick syntax guidePython 3.x quick syntax guide
Python 3.x quick syntax guide
Universiti Technologi Malaysia (UTM)
 
Predicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systemsPredicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systems
Universiti Technologi Malaysia (UTM)
 
Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...
Universiti Technologi Malaysia (UTM)
 
Nation building current trends of technology use in da’wah
Nation building current trends of technology use in da’wahNation building current trends of technology use in da’wah
Nation building current trends of technology use in da’wah
Universiti Technologi Malaysia (UTM)
 
Flight MH370 community structure
Flight MH370 community structureFlight MH370 community structure
Flight MH370 community structure
Universiti Technologi Malaysia (UTM)
 
Visualization of explanations in recommender systems
Visualization of explanations in recommender systemsVisualization of explanations in recommender systems
Visualization of explanations in recommender systems
Universiti Technologi Malaysia (UTM)
 
Explanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research ApproachesExplanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research Approaches
Universiti Technologi Malaysia (UTM)
 
Factors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraqFactors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraq
Universiti Technologi Malaysia (UTM)
 

More from Universiti Technologi Malaysia (UTM) (11)

A self organizing communication model for disaster risk management
A self organizing communication model for disaster risk managementA self organizing communication model for disaster risk management
A self organizing communication model for disaster risk management
 
Spark Working Environment in Windows OS
Spark Working Environment in Windows OSSpark Working Environment in Windows OS
Spark Working Environment in Windows OS
 
Python networkx library quick start guide
Python networkx library quick start guidePython networkx library quick start guide
Python networkx library quick start guide
 
Python 3.x quick syntax guide
Python 3.x quick syntax guidePython 3.x quick syntax guide
Python 3.x quick syntax guide
 
Predicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systemsPredicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systems
 
Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...
 
Nation building current trends of technology use in da’wah
Nation building current trends of technology use in da’wahNation building current trends of technology use in da’wah
Nation building current trends of technology use in da’wah
 
Flight MH370 community structure
Flight MH370 community structureFlight MH370 community structure
Flight MH370 community structure
 
Visualization of explanations in recommender systems
Visualization of explanations in recommender systemsVisualization of explanations in recommender systems
Visualization of explanations in recommender systems
 
Explanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research ApproachesExplanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research Approaches
 
Factors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraqFactors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraq
 

Recently uploaded

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 

Recently uploaded (20)

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 

Social media with big data analytics

  • 1. Social Media with Big Data Analytics Mohammed Zuhair Al-Taie Big Data Centre - Universiti Teknologi Malaysia - 2016
  • 2. AGENDA Web 2.0 Social Media Big Data Social Media with Big Data Analytics Social Network Analysis Sentiment Analysis
  • 3. Web 2.0 is A Complex, Organic Online Conversation WHAT IS WEB 2.0? Web 2.0 is powered by: • Social Networks •News and Bookmarking •Blogs •Microblogging •Video/Photo-sharing •Message Boards •Wikis •Virtual reality •Social gaming •Podcasts •Real Simple syndication (RSS) •Social Media Press Release
  • 4. TECHNOLOGY OVERVIEW Search: The ease of finding information through keyword search Links: Ad-hoc guides to other relevant information Authoring: The ability to create constantly updating content over a platform that is shifted from being the creation of a few to being constantly updated, interlinked work. Tags: Categorization of content by creating tags: simple,one-word user- determined descriptions to facilitate searching and avoid rigid, pre-made categories Extensions: Powerful algorithms that leverage the Web as an application platform as well as a documentserver Signals: The use of RSS technology to rapidly notify users of content changes Web 2.0 websites typically include some of the following features/techniques- SLATES
  • 5. Social media: is an umbrella term that defines the various activities that integrate technology, social interaction, and the construction of words, pictures, videos and audio. WEB 2.0 TECHNOLOGIES: SOCIAL MEDIA
  • 6. “Creation of web content, by the people, for the people” In Simple Language…
  • 9.  Variety of sources from where data is being generated has also undergone a shift  The types of data being created has changed from structured to semi-structured to unstructured data Structured Data Semi- Structured Data Unstructured Data Need to manage broad range of data types  Process analytic queries across numerous data types  Need to extract meaningful analysis from this data has led to several technologies to gain traction  Examples include NoSQL databases to store unstructured data as well as innovative processing methods like Hadoop and massive parallel processing (MPP) Today 80% Of Data Existing In Any Enterprise Is Unstructured Data Unstructured data from social media has to be approached in a non traditional manner. UNSTRUCTURED DATA
  • 10. Facebook - User Likes and Favorites - Article/Video/Link Shares - Views - Comments - Location / Geospatial Twitter Tweet Characteristics - Length - Language Model - Semantics - Emoticons - Location / Geospatial Google / You Tube - Blogs - Comments - Search Statistics - Likes vs Dislikes - Shares / Views / Comments IDENTIFYING UNSTRUCTURED DATA SOURCES
  • 11.
  • 12.
  • 13. “Big Data” is data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it… BIG DATA IS… BIG DATA =
  • 14. BIG DATA VS USUAL DATA
  • 15. Implication for an organization 2009 2011 2015 2020 0.8 1.9 7.9 35.0 CAGR (2009-2020) 41.0% Zetabytes THE GLOBAL DATA GROWTH
  • 16. >3,500 >40 >2,000 >200 >400  Key verticals: Healthcare, Manufacturing, Retail, Digital Marketing  Demand trend: High demand of Big Data analytics >250  Key verticals: Telecom, Retail, Banking  Demand trend: Still embryonic; most organizations have wait and watch approach  Demand trend: Current demand appears to be limited, however, lack of skills may drive outsourcing of Big Data analytics  Low awareness levels  Key verticals: Technology, Financial services, Oil & Gas, Utilities, Manufacturing  Demand trend: European MNC’s are still in the early stages of the adoption cycle North America South America Europe Middle East India China Japan  Key verticals: Manufacturing, Telecom, Health & Life Sciences  Demand trend: Demand for BI to derive operational efficiency  Key verticals: Telecom, Bioinformatics, Retail  Demand trend: Industry is in nascent stage with demand catching up, particularly in retail >50 16 NORTH AMERICA & EUROPE DRIVES THE BIG DATA OPPORTUNITY WITH OVER 85% OF THE WORLD’S DATA
  • 17. Tools Description The Hadoop Distributed File System (HDFS) HDFS divides the data into smaller parts and distributes it across the various servers/nodes SQL Server Integration Service These tools allow posts can be downloaded and loaded into Hadoop Apache Flume MapReduce MapReduce is a process that transforms data loaded into Hadoop into a format that can be used for analysis. Hive a runtime Hadoop support architecture that leverages Structure Query Language (SQL) with the Hadoop platform. Jaql Jaql converts high-level queries into low-level queries and Zookeeper Zookeeper coordinate parallel processing across big clusters HBase HBase is a column-oriented database management system that sits on top of HDFS by using a non-SQL approach. BIG DATA TOOLS
  • 18. Variety Veracity Value BIG DATA IS OFTEN DESCRIBED USING FIVE Vs
  • 19. Volume refers to the vast amounts of data generated every second. We are not talking Terabytes but Zettabytes or Brontobytes. If we take all the data generated in the world between the beginning of time and 2008, the same amount of data will soon be generated every minute. This makes most data sets too large to store and analyse using traditional database technology. Variety Veracity Value BIG DATA: VOLUME
  • 20. BIG DATA: VELOCITY Variety Veracity Value Velocity refers to the speed at which new data is generated and the speed at which data moves around. Just think of social media messages going viral in seconds. Technology allows us now to analyse the data while it is being generated (sometimes referred to as in-memory analytics), without ever putting it into databases.
  • 21. Variety Veracity Value Variety refers to the different types of data we can now use. In the past we only focused on structured data that neatly fitted into tables or relational databases, such as financial data. In fact, 80% of the world’s data is unstructured (text, images, video, voice, etc.) BIG DATA: VARIETY
  • 22. Variety Veracity Value Veracity refers to the messiness or trustworthiness of the data. With many forms of big data quality and accuracy are less controllable (just think of Twitter posts with hash tags, abbreviations, typos and colloquial speech as well as the reliability and accuracy of content) but technology now allows us to work with this type of data. BIG DATA: VERACITY
  • 23. Variety Veracity Value VALUE Then there is another V to take into account when looking at Big Data: Value! Having access to big data is no good unless we can turn it into value. Companies are starting to generate amazing value from their big data. BIG DATA: VALUE
  • 24.
  • 25. THE INTERSECTION OF SOCIAL MEDIA AND BIG DATA
  • 26.  Big Data is also characterized by velocity or speed i.e. frequency of data generation or the frequency of data delivery  New age communication channels such as mobile phones, emails, social networking has increased the rate of information flows Examples:  Telcos adopting location based marketing based on user location sensed by mobile towers  Satellite images can help monitor and analyze troop movements, a flood plane, cloud patterns, or forest fires  Video analysis systems could monitor a sensitive or valuable facility, watching for possible intruders and alert authorities in real time Big Data velocity enabling real time use of data Data velocity per minute 600+ videos on YouTube 200 million+ emails sent 2 million+ Google search queries 400,000+ minutes of Skype calling 400,000+ tweets on Twitter US$ 300,000+ are spent on online shopping 700,000+ Facebook updates 7,000+ photos on flickr 1,500+ blog posts 3500+ ticks per minute in securities trading BIG DATA & REAL TIME USE
  • 27. BIG DATA FOR SOCIAL MEDIA ANALYTICS PROCESS MODEL
  • 28. CONCEPTUAL VIEW OF FRAMEWORK FOR BIG DATA EXTRACTION, MESSAGING AND STORE This phase has a composite pattern that is based on the store-and-explore and focuses on obtaining and storing the relevant data from sources outside our establishment.
  • 29. CONCEPTUAL VIEW OF DISCUSSION TOPIC AND OPINION ANALYSIS COMPONENT This phase has a composite pattern that is based on purposeful-and-predictive analytics to gain advanced insight.
  • 30. WHAT IS HADOOP? *Hadoop is an open source framework which is used for storing and processing the large scale of data sets on large clusters of hardware. *The specialty of Hadoop involves in HDFS which is used for storing data on large commodity machines and provides very huge bandwidth for the cluster.
  • 31. CONCEPTUAL VIEW OF APACHE HADOOP ARCHITECTURE
  • 32. CONCEPTUAL VIEW OF DATA VISUALIZATION AND DECISION-MAKING COMPONENT This project has a composite pattern based on actionable-analysis with the aim of taking the next best actions that leads to take appropriate actions by related customers.
  • 34. A GLOBAL SOCIAL NETWORK
  • 36. WHY SOCIAL NETWORK ANALYSIS MATTERS?
  • 37. SOCIAL NETWORK ANALYSIS: THE NEW SCIENCE OF NETWORKS
  • 38. Sentiment analysis… • Analyzes people’s sentiments, opinions, appraisals, attitudes, evaluations, and emotions • Towards entities such as organizations, products, services, individuals, topics, issues, events, and their attributes • As presented online via text, video and other means of communication. • These communications can fall into three broad categories: positive, neutral or negative. SENTIMENT ANALYSIS
  • 39. We can inquire about sentiment at various linguistic levels: O Words – objective, positive, negative, neutral O Clauses – “going out of my mind” O Sentences – possibly multiple sentiments O Documents LEVEL OF ANALYSIS
  • 40. Elections 2012 Dashboard FILTER BY: Facebook Twitter Google Mitt Romney RepublicanPrimary Democratic Vote Republican Vote Democratic Sentiment Republican Sentiment
  • 41. TRUTHY: A SOCIAL MEDIA RESEARCH PROJECT Truthy is a research project to study how memes spread on social media. A meme is a transmissible unit of information, such as a hashtag, phrase, or link. This website highlights some of the research coming from this effort and showcases some visualizations, tools, and data resources demonstrating broader impacts of the project.