1. ML & Social Media
4th Summer School on Machine Learning
July 13, 2019
Ponnurangam Kumaraguru (“PK”)
IIIT Delhi
TEDx & ACM Distinguished Speaker
Linkedin/in/ponguru/
fb/ponnurangam.kumaraguru
@ponguru
15. Credibility Modeling
15
Feature set Features (45)
Tweet meta-data
Number of seconds since the tweet; Source of tweet (mobile / web/
etc); Tweet contains geo-coordinates
Tweet content (simple)
Number of characters; Number of words; Number of URLs; Number of
hashtags; Number of unique characters; Presence of stock symbol;
Presence of happy smiley; Presence of sad smiley; Tweet contains
`via'; Presence of colon symbol
Tweet content (linguistic)
Presence of swear words; Presence of negative emotion words;
Presence of positive emotion words; Presence of pronouns; Mention
of self words in tweet (I; my; mine)
Tweet author Number of followers; friends; time since the user if on Twitter; etc.
Tweet network
Number of retweets; Number of mentions; Tweet is a reply; Tweet is a
retweet
Tweet links WOT score for the URL; Ratio of likes / dislikes for a YouTube video
32. Heuristic Identity Search
32
cerc.iiitd.ac.in
Profile
Content
Self-mention
Network
Syntactic
and Image
Search Linking
If self-identified /
returned by
more than one
search method
No
Yes
Candidate
Identities
name,
location,
username
mobile no,
post,
friends,
followers
Paridhi Jain, Ponnurangam Kumaraguru, and Anupam Joshi. 2013. @I seek ‘fb.me’: Identifying Users across Multiple Online Social
Networks. In Proceedings of the 22nd International Conference on World Wide Web, WWW ’13 Companion. ACM, New York, NY, USA,
1259- 1268. DOI=http://dx.doi.org/10.1145/2487788.2488160 [Honorable Mention Award}
47. Takeaways
– Social Media is a powerful tool
– Business / Government can use the analysis
to their benefits
– You should use it for your benefits J
-You tried doing analysis on your accounts?
– Lots of challenges and opportunities!
– Will be happy to interact more, if needed
47