Trend Analysis

TREND
DETECTION, TRACKING & TRANSITION
in Social Networks
1. Definition & General Idea
2. Web Samples in Trend Hunting
3. Detection Approches
4. Architecture: TwitterMonitor
5. Detection: MemeTracker
6. Classification: ExoEndo
SemioNet: Semantic Social Network Analysis

REFERENCES
Mathioudakis, Michael, and Nick Koudas. "Twittermonitor: trend
detection over the twitter stream." Proceedings of the 2010 ACM
SIGMOD International Conference on Management of data. ACM,
2010.
Leskovec, Jure, Lars Backstrom, and Jon Kleinberg. "Meme-tracking
and the dynamics of the news cycle." Proceedings of the 15th ACM
SIGKDD international conference on Knowledge discovery and
data mining. ACM, 2009.
Naaman, Mor, Hila Becker, and Luis Gravano. "Hip and trendy:
Characterizing emerging trends on Twitter." Journal of the American
Society for Information Science and Technology 62.5 (2011): 902-
918.
Becker, Hila, Mor Naaman, and Luis Gravano. "Beyond Trending
Topics: Real-World Event Identification on Twitter." ICWSM 11 (2011):
438-441.

Trend Analysis
The Science of Studying
Changes in Social Patterns,
Including Fashion, Technology
& Consumer Behavior
Horizontal Analysis
The General Movement
over TIME of a
Statistically Detectable
Change
Fundamentally, a Method
for Understanding HOW &
WHY Things have Changed
– or will Change – over TIME

APPROCH
Text Mining
Topic Ident. & Clust.
"Kilroy was here" was a
piece of graffiti that
became popular in the
1940s, and existed under
various names in
different countries,
illustrating how a meme
can be modified through
replication
Memes
(/ˈmiːm/) is "an idea, behavior, or
style that spreads from person to person
within a culture.“ … through writing,
speech, gestures, rituals, or other
imitable phenomena with a mimicked
theme. … cultural analogues to genes in that
they self-replicate, mutate, and
respond to selective pressures.

GroupBurst: Assesses Co-occurrences
One-pass
of Bursty
Real-time
Keyword in Recent Tweets
Adjustable against spam
Theoretically sound!
Adjustable against SPURIOUS Bursts. Coincidental Burst of Keyword over a short period of time
Context Extraction Algorithms (PCA,
SVD) & Grapevine’s Entity Extractor
to Add more
271 Million Monthly Active Users
500 Million Tweets (140 ch) Per Day
78% Active Users on Mobile
77% Accounts Outside U.S.
Supports 35+ languages

MemeTracking
News Cycle
Tracking News Evolution
Quotes & Memes
Integral Part of Journalistic Practice
Travel Relatively Intact with Mutational Variants
Clustering by Graph

Item: Each News Article/Blog Post
Phrase: A Quoted String Occurs in Items
MemeTracking …

Phrase Graph
DAG
|P| < |Q|
“senseless killing”
“enough of senseless
killing”
“Hear our voice. We have had enough of this
senseless killing”
Directed Edit Distance(P, Q) < δ
Word Consecutive Overlap(P, Q) > k
P  Q
푊푃,푄 ∝
1
퐷푖푟푒푐푡푒푑 퐸푑푖푡 퐷푖푠푡푎푛푐푒(푃,푄)
∝ 푇표푡푎푙 푁푢푚푏푒푟 표푓 푄 푖푛 퐶표푟푝푢푠
MemeTracking …

Phrase Clusters
Directed Acyclic Graph (DAG) Partitioning
Given a Weighted DAG, Delete a Set of Edges of
Min Total Weight So That Each of the Resulting
Components is Single-Rooted.
NP-hard
Heuristic
1.Start from the Roots
2.Down the DAG & greedily Assigns each Node to the Cluster to
which it has the most Edges
MemeTracking …

Result
Volume Distribution
Dataset
3 Months Aug 1 to Oct 31 2008
~ 1M Docs per Day from 1.65 Million
Sites!
47M Phrases, 22M Distinct
9H Clustering Process Time
35, 800 Non-trivial Clusters (at least two phrases)
MemeTracking …

Other Findings
Time lag between the news media and blogs
푓 푛푗 훿 푡 − 푡푗
푛푗 = Number of Item Previously Written for Cluster j
푡 = 푡ℎ푒 푐푢푟푟푒푛푡 푡푖푚푒
푡푗 = 푡ℎ푒 푡푖푚푒 푤ℎ푒푛 푗 푤푎푠 푓푖푟푠푡 푝푟표푑푢푐푒푑
푅푒푐푒푛푐푦 → 훿 푖푠 푚표푛표푡표푛푖푐푎푙푙푦 푑푒푐푟푒푎푠푖푛푔 푖푛 푡 − 푡푗
퐼푚푖푡푎푡푖표푛 → 푓 푖푠 푚표푛표푡표푛푖푐푎푙푙푦 푖푛푐푟푒푎푠푖푛푔 푖푛 푛푗, 푓(0) > 0
푡 → 0−: 푎 = 0.076 푡 → 0+: 푎 = 0.092
푡 → 0−: 푏 = 1.77 푡 → 0+: 푏 = 2.15
Quotes migrating from blogs to news media: 3.5%
Each Cluster
Modeling the news trend
Imitation≠Recency
MemeTracking …

Characterizing Trends
“trends in trend data.”  Meta Trend
Taxonomy of the trends
Key Distinguishing Features of Trends
Not only the Textual Content
Social Network Structure
Ties
Geographic
Action  Retweet, Reply, Mention, Hashtag

Trends
Exogenous
Broadcast-media
Broadcast of local media
“fight” (boxing event)
“Ravens” (football game)
Broadcast of global/national media
“Kanye”(KanyeWest acts up at the MTVVideo MusicAwards)
“Lost Finale” (series finale of Lost).
Global News
Breaking
“earthquake” (Chile earthquake)
“Tsunami” (HawaiiTsunamiwarning)
“Beyoncé”(Beyoncé cancels Malaysia concert).
Nonbreaking
“HCR” (health care reform)
“Tiger” (Tiger Woods apologizes)
“iPad” (toward thelaunch of Apple’s popular device).
National Holidays & Memorial Days
“Halloween,” “Valentine’s.”
Local Participatory & Physical
Planned
“marathon,”
“superbowl” (Super Bowl viewing parties)
“patrick’s” (St. Patrick’s Day Parade).
Unplanned
“rainy,” “snow.”
Endogenous
Memes
#in2010 (in December 2009, users imagine their near future)
“November” (users marking the beginning of the month on November 1)
Retweets
Fan Community Activities
“2pac” (the anniversary of the death of hip-hop artist Tupac Shakur).
Characterizing Trends …

Trends from twitter.com
Trends from Simple Trend Detector
Trends for Quality Analysis  Supervised Categories
Trends for Computing Features
Tquantity
Ttwitter
Tterm freq.
Tquality

Content Features
•Average number of words/characters
•Proportion of messages with URLs, unique URLs, with hashtags ex/including trend terms
•Top unique hashtag?
•Similarity to centroid
Interaction Features
• Proportion of retweets, replies, mentions
Time-based Features
• Exponential fit head, tail
• Logarithmic fit head, tail
Participation Features
• Messages per author
• Proportion of messages from top author
• Proportion of messages from top 10% of authors
Social Network Features
•Level of reciprocity
•Maximal eigenvector centrality
•Maximal degree centrality
•Transitivity
•Density
•Average component size

Content features: Exo higher URLs, smaller hashtags
Exogenous
vs.
Endogenous
Trends
Interaction features: Exo fewer
retweets, similar number of replies
Time features: Exo different for the
head period before the trend peak
but will exhibit similar time features in
the tail period after the trend peak,
compared to endogenous trends.
Social network features: Exo fewer connections, less reciprocity
1.1
1.2
1.3
1.4

IDEA
Automatic Categorization of Trends
Photography Trend  Selfie Image
Trust Trend  Trustful Users, Trustful Twits
Untrendy People! Users Counteract the trends

Trend Analysis

More Related Content

Viewers also liked

Similar to Trend Analysis

More from Hossein Fani

Recently uploaded

Trend Analysis

Editor's Notes