SlideShare a Scribd company logo
1 of 20
Modeling Simulation & Visualization Student
Capstone Conference 2024
Web Archives for Verifying
Attribution in Twitter Screenshots
Track: AI and Autonomous Systems
Authors: Tarannum Zaki, Michael L. Nelson, and Michele C. Weigle
Presented by Tarannum Zaki
Department of Computer Science
Old Dominion University, Norfolk, Virginia
April 11, 2024
Screenshots are commonly used to annotate the social media of others
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL 2
https://twitter.com/BetteMidler/status/1541472225341198338
https://twitter.com/MahyarTousi/status/1534307163073658881 https://twitter.com/urbanachievr/status/1505944201208516612
Why screenshots?
To use as an evidence for deleted posts
3
https://web.archive.org/web/20220525125749/https://twitter.com/DanielDefense/status/1526237750277681154
Controversial posts
may be deleted.
https://twitter.com/ashtonpittman/status/1530243294868930560
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
https://twitter.com/DanielDefense/status/1526237750277681154
Other reasons: To deny cross-platform engagement, to aggregate, to mark-up etc.
Did they really post that?
Screenshots can also be used for humor, satire, and disinformation
4
https://twitter.com/Shayan86/status/1515753937139388418
https://twitter.com/paulthacker11/status/1495436489492090881
https://twitter.com/elonmusk/status/1544051155562598401
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Creating fake tweets using Tweetgen
5
https://www.tweetgen.com/
https://www.tweetgen.com/create/tweet.html
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Motivation
➢ Fake tweets can be responsible for misinformation/disinformation spread.
➢ Fake tweets are easy to create using online tools.
➢ There are no tools currently available to evaluate the authenticity of
attribution of screenshots.
6
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Aim
To develop a tool that would automatically provide a probability
whether a screenshot of a social media post is fake using the
services of web archives.
7
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
To search for a tweet in the Wayback Machine, you must first
know its URL
8
https://web.archive.org/web/20220323185843/https://twitter.com/annaturley/status/1506706947239817224
URL of the tweet:
https://twitter.com/annaturley/status/1506706947239817224
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
https://web.archive.org/
But, URL of a tweet is not present in most screenshots
9
https://twitter.com/AaronBastani/status/1507391218854117377
@annaturley
March 23, 2022
March 25, 2022
https://twitter.com/TWITTER_HANDLE/status/TWEET_ID
https://web.archive.org/web/20220323185843/https://twitter.com/annaturley/status/1506706947239817224
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Tweet ID encodes the timestamp of when
the tweet was created
Construction of a tweet URL
- Use the Twitter handle and approximate a time window based
on the timestamp.
- Construct URL for the tweet.
- Search for the tweet in the Wayback Machine using the URL.
Process to verify whether content of a screenshot exists in the
Wayback Machine
10
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Creating a dataset of screenshots collected from Twitter
11
Fields
Shared post’s URL Original post’s URL
Category Reason
Content category Structural features
Post type Social media
Search strategy Annotated images
Screenshot Remarks
- Screenshot images shared on Twitter.
- 200 examples
- Examples include both real and fake screenshots
https://ws-dl.blogspot.com/2022/12/2022-12-12-disinformation-spread-on.html
https://twitter.com/rvawonk/status/1503227687917305863
https://twitter.com/RealCandaceO/status/1501576
352587292673
Category: Real
Reason: Found in the live web
Content category: Politics
Post Type: Tweet
Structural features: Single author, single
post
Search strategy: Searched on Twitter
interface
Social media: Twitter
Original post’s URL
Shared post’s URL
Screenshot
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
OCRing screenshots: Single tweet images
12
OCR
Optical Character Recognition extracts information as text from digital image.
Example screenshot image OCR extracted output
Twitter Handle
Timestamp
Tweet Text
Zaki, T., Nelson, M.L., and Weigle, M.C. (2023, Jun 14). Extracting Information from Twitter Screenshots. Tech Report arXiv:2306.08236. https://doi.org/10.48550/arXiv.2306.08236
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Computing a time window based on the screenshot timestamp
13
The maximum difference between two time zones on Earth is 26 hours.
Example screenshot image OCR extracted output
Twitter handle and computed timestamps
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Using CDX API to retrieve archived tweets within the time window
14
request = "http://web.archive.org/cdx/search/cdx?url=" + urir + params
urir = "https://twitter.com/"+randyhillier+"/status"
params =
"&matchType=prefix&from="+20220218154100+"&to="+20220220174100
CDX API prefix search process
Twitter handle and computed timestamps
Output: Retrieved archived tweets within the timeframe (cropped).
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
https://archive.org/help/wayback_api.php
Extracting tweet IDs and determining tweet creation
timestamp using TweetedAt
15
https://web.archive.org/web/20220218163926/https://twitter.com/randyhillier/status/1006984708109099008
https://ws-dl.blogspot.com/2019/08/2019-08-03-tweetedat-finding-tweet.html
Each tweet ID encodes its
creation timestamp
An archived tweet’s URL
https://oduwsdl.github.io/tweetedat/#1006984708109099008
Tweet ID Tweet Creation Date
1006984708109099008 20180613194037
………… …………..
Mapping between all the tweet IDs and
tweet creation timestamps
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Determining the final set of archived tweets by filtering the
tweet creation timestamps within the time window
16
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
https://web.archive.org/web/20220218163926/https://twitter.com/randyhillier/status/1006984708109099008
An archived tweet’s URL
Timestamp when the tweet was archived
Tweet ID encoding the tweet creation timestamp:
20180613194037
The archived timestamp of the tweet falls within the timeframe, but the tweet creation
timestamp does not fall within the timeframe.
So, such archived tweets can be filtered out.
Extracting tweet text from archived tweets using
BeautifulSoup and Selenium
17
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
https://web.archive.org/web/20220220024223/https://twitter.com/randyhillier/status/1495226962058649603
TweetTextSize TweetTextSize--jumbo js-tweet-text tweet-text
An archived tweet’s URL
Extracted text from archived tweet
HTML tag containing
the tweet text
https://www.selenium.dev/
https://pypi.org/project/beautifulsoup4/
Selenium automates web scraping and BeautifulSoup parses text from HTML.
Computing text similarity score between tweet text from
screenshot and archived tweets using Python’s difflib library
18
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
https://docs.python.org/3/library/difflib.html
Example screenshot image Extracted text from archived tweet Extracted tweet text from screenshot
match_score(Archived_Tweet_Text, Screenshot_Tweet_Text)= 81.40%
Text similarity score is computed based on longest common subsequence
Archived_Tweet_Text1 Screenshot_Tweet_Text match _score = 81.40%
Archived_Tweet_Text2 Screenshot_Tweet_Text match_score = 30.78%
Archived_Tweet_Text3 Screenshot_Tweet_Text match_score = 5.67%
……………..
A match score of 81.40% helps us to prove the existence of the screenshot tweet posted by the alleged
author.
A threshold of 60% produced the highest F1 (0.69)
19
Threshold Value Precision Recall F1 Score
90% 1.00 0.42 0.59
80% 1.00 0.49 0.66
70% 1.00 0.51 0.67
60% 1.00 0.53 0.69
Experimented on 108 single tweet images from the collected dataset.
Performance of the overlap between the tweet text from the
screenshot and the archived tweets.
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
Summary
20
➢ Screenshots are an easy way to share content on social media.
➢ Since screenshots can be easily faked, it is a critical task to detect a fabricated post.
➢ Services of web archives could be useful to verify attribution of a screenshot by finding
an archived version of the screenshot content.
➢ Our research will mitigate misinformation and disinformation spread on social media.
Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL

More Related Content

Similar to Web Archives for Verifying Attribution in Twitter Screenshots

Building TweetEngine
Building TweetEngineBuilding TweetEngine
Building TweetEngineikailan
 
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Raphael Troncy
 
Rails 3 and OAuth for Barcamp Tampa
Rails 3 and OAuth for Barcamp TampaRails 3 and OAuth for Barcamp Tampa
Rails 3 and OAuth for Barcamp TampaBryce Kerley
 
A faster journey with HTTP
A faster journey with HTTPA faster journey with HTTP
A faster journey with HTTPMichael Ewins
 
Leveling Up Your Social Analytics Program: Resources
Leveling Up Your Social Analytics Program: ResourcesLeveling Up Your Social Analytics Program: Resources
Leveling Up Your Social Analytics Program: Resourceswordsbywallace
 
@twitter Mining #Microblogs Using #Semantic Technologies
@twitter Mining #Microblogs Using #Semantic Technologies@twitter Mining #Microblogs Using #Semantic Technologies
@twitter Mining #Microblogs Using #Semantic TechnologiesMartin Ebner
 
Swap2010 twitter minining using semantic web technologies and linked data
Swap2010 twitter minining using semantic web technologies and linked dataSwap2010 twitter minining using semantic web technologies and linked data
Swap2010 twitter minining using semantic web technologies and linked dataSelver Softic
 
The Next Big Thing is Web 3.0. Catch It If You Can
The Next Big Thing is Web 3.0. Catch It If You Can The Next Big Thing is Web 3.0. Catch It If You Can
The Next Big Thing is Web 3.0. Catch It If You Can Judy O'Connell
 
Http/2 - What's it all about?
Http/2  - What's it all about?Http/2  - What's it all about?
Http/2 - What's it all about?Andy Davies
 
WebRTC From Asterisk to Headline - MoNage
WebRTC From Asterisk to Headline - MoNageWebRTC From Asterisk to Headline - MoNage
WebRTC From Asterisk to Headline - MoNageChad Hart
 
Uncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesUncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesMichael Nelson
 
Centralise legacy auth at the ingress gateway, SREday
Centralise legacy auth at the ingress gateway, SREdayCentralise legacy auth at the ingress gateway, SREday
Centralise legacy auth at the ingress gateway, SREdayAndrew Kirkpatrick
 
Centralise legacy auth at the ingress gateway
Centralise legacy auth at the ingress gatewayCentralise legacy auth at the ingress gateway
Centralise legacy auth at the ingress gatewayAndrew Kirkpatrick
 
Twitter Bootstrap
Twitter BootstrapTwitter Bootstrap
Twitter BootstrapTechizzaa
 
Social Developers London update for Twitter Developers
Social Developers London update for Twitter Developers Social Developers London update for Twitter Developers
Social Developers London update for Twitter Developers Angus Fox
 
435752048-web-development-report.pdf
435752048-web-development-report.pdf435752048-web-development-report.pdf
435752048-web-development-report.pdfUtkarshSingh697319
 

Similar to Web Archives for Verifying Attribution in Twitter Screenshots (20)

Building TweetEngine
Building TweetEngineBuilding TweetEngine
Building TweetEngine
 
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
 
Rails 3 and OAuth for Barcamp Tampa
Rails 3 and OAuth for Barcamp TampaRails 3 and OAuth for Barcamp Tampa
Rails 3 and OAuth for Barcamp Tampa
 
Grampa, What's a deleted tweet?
Grampa, What's a deleted tweet?Grampa, What's a deleted tweet?
Grampa, What's a deleted tweet?
 
A faster journey with HTTP
A faster journey with HTTPA faster journey with HTTP
A faster journey with HTTP
 
Leveling Up Your Social Analytics Program: Resources
Leveling Up Your Social Analytics Program: ResourcesLeveling Up Your Social Analytics Program: Resources
Leveling Up Your Social Analytics Program: Resources
 
@twitter Mining #Microblogs Using #Semantic Technologies
@twitter Mining #Microblogs Using #Semantic Technologies@twitter Mining #Microblogs Using #Semantic Technologies
@twitter Mining #Microblogs Using #Semantic Technologies
 
Swap2010 twitter minining using semantic web technologies and linked data
Swap2010 twitter minining using semantic web technologies and linked dataSwap2010 twitter minining using semantic web technologies and linked data
Swap2010 twitter minining using semantic web technologies and linked data
 
The Next Big Thing is Web 3.0. Catch It If You Can
The Next Big Thing is Web 3.0. Catch It If You Can The Next Big Thing is Web 3.0. Catch It If You Can
The Next Big Thing is Web 3.0. Catch It If You Can
 
Useful Twitter Tools
Useful Twitter ToolsUseful Twitter Tools
Useful Twitter Tools
 
Http/2 - What's it all about?
Http/2  - What's it all about?Http/2  - What's it all about?
Http/2 - What's it all about?
 
WebRTC From Asterisk to Headline - MoNage
WebRTC From Asterisk to Headline - MoNageWebRTC From Asterisk to Headline - MoNage
WebRTC From Asterisk to Headline - MoNage
 
Uncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesUncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pages
 
Why Tweet?
Why Tweet?Why Tweet?
Why Tweet?
 
Centralise legacy auth at the ingress gateway, SREday
Centralise legacy auth at the ingress gateway, SREdayCentralise legacy auth at the ingress gateway, SREday
Centralise legacy auth at the ingress gateway, SREday
 
Centralise legacy auth at the ingress gateway
Centralise legacy auth at the ingress gatewayCentralise legacy auth at the ingress gateway
Centralise legacy auth at the ingress gateway
 
YQL - HackU IIT Madras 2012
YQL - HackU IIT Madras 2012YQL - HackU IIT Madras 2012
YQL - HackU IIT Madras 2012
 
Twitter Bootstrap
Twitter BootstrapTwitter Bootstrap
Twitter Bootstrap
 
Social Developers London update for Twitter Developers
Social Developers London update for Twitter Developers Social Developers London update for Twitter Developers
Social Developers London update for Twitter Developers
 
435752048-web-development-report.pdf
435752048-web-development-report.pdf435752048-web-development-report.pdf
435752048-web-development-report.pdf
 

Recently uploaded

Interpreting the brief for the media IDY
Interpreting the brief for the media IDYInterpreting the brief for the media IDY
Interpreting the brief for the media IDYgalaxypingy
 
Spotify AI DJ Deck - The Agency at University of Florida
Spotify AI DJ Deck - The Agency at University of FloridaSpotify AI DJ Deck - The Agency at University of Florida
Spotify AI DJ Deck - The Agency at University of Floridajorirz24
 
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxDickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxednyonat
 
This is a Powerpoint about research into the codes and conventions of a film ...
This is a Powerpoint about research into the codes and conventions of a film ...This is a Powerpoint about research into the codes and conventions of a film ...
This is a Powerpoint about research into the codes and conventions of a film ...samuelcoulson30
 
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779Delhi Call girls
 
Call Girls In Noida Mall Of Noida O9654467111 Escorts Serviec
Call Girls In Noida Mall Of Noida O9654467111 Escorts ServiecCall Girls In Noida Mall Of Noida O9654467111 Escorts Serviec
Call Girls In Noida Mall Of Noida O9654467111 Escorts ServiecSapana Sha
 
Top Call Girls In Charbagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Charbagh ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Charbagh ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Charbagh ( Lucknow ) 🔝 8923113531 🔝 Cash Paymentanilsa9823
 
Codes and Conventions of Artists' Websites
Codes and Conventions of Artists' WebsitesCodes and Conventions of Artists' Websites
Codes and Conventions of Artists' WebsitesLukeNash7
 
GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...
GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...
GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...Mona Rathore
 
Learn About the Rise of Instagram Pro in 2024
Learn About the Rise of Instagram Pro in 2024Learn About the Rise of Instagram Pro in 2024
Learn About the Rise of Instagram Pro in 2024Islam Fit
 
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECall girl Jaipur
 
Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...
Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...
Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...baharayali
 
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖anilsa9823
 
"Ready to elevate your Instagram? Let's go
"Ready to elevate your Instagram? Let's go"Ready to elevate your Instagram? Let's go
"Ready to elevate your Instagram? Let's goSocioCosmos
 
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Paymentanilsa9823
 
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...anilsa9823
 
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncrCall Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncrSapana Sha
 
Stunning ➥8448380779▻ Call Girls In Paharganj Delhi NCR
Stunning ➥8448380779▻ Call Girls In Paharganj Delhi NCRStunning ➥8448380779▻ Call Girls In Paharganj Delhi NCR
Stunning ➥8448380779▻ Call Girls In Paharganj Delhi NCRDelhi Call girls
 

Recently uploaded (20)

Interpreting the brief for the media IDY
Interpreting the brief for the media IDYInterpreting the brief for the media IDY
Interpreting the brief for the media IDY
 
9953056974 Young Call Girls In Kirti Nagar Indian Quality Escort service
9953056974 Young Call Girls In  Kirti Nagar Indian Quality Escort service9953056974 Young Call Girls In  Kirti Nagar Indian Quality Escort service
9953056974 Young Call Girls In Kirti Nagar Indian Quality Escort service
 
Spotify AI DJ Deck - The Agency at University of Florida
Spotify AI DJ Deck - The Agency at University of FloridaSpotify AI DJ Deck - The Agency at University of Florida
Spotify AI DJ Deck - The Agency at University of Florida
 
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxDickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
 
This is a Powerpoint about research into the codes and conventions of a film ...
This is a Powerpoint about research into the codes and conventions of a film ...This is a Powerpoint about research into the codes and conventions of a film ...
This is a Powerpoint about research into the codes and conventions of a film ...
 
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
 
Call Girls In Noida Mall Of Noida O9654467111 Escorts Serviec
Call Girls In Noida Mall Of Noida O9654467111 Escorts ServiecCall Girls In Noida Mall Of Noida O9654467111 Escorts Serviec
Call Girls In Noida Mall Of Noida O9654467111 Escorts Serviec
 
Top Call Girls In Charbagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Charbagh ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Charbagh ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Charbagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
 
Codes and Conventions of Artists' Websites
Codes and Conventions of Artists' WebsitesCodes and Conventions of Artists' Websites
Codes and Conventions of Artists' Websites
 
GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...
GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...
GREAT OPORTUNITY Russian Call Girls Kirti Nagar 9711199012 Independent Escort...
 
Learn About the Rise of Instagram Pro in 2024
Learn About the Rise of Instagram Pro in 2024Learn About the Rise of Instagram Pro in 2024
Learn About the Rise of Instagram Pro in 2024
 
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
 
Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...
Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...
Top Astrologer, Kala ilam specialist in USA and Bangali Amil baba in Saudi Ar...
 
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖
 
Russian Call Girls Rohini Sector 37 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 37 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Rohini Sector 37 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 37 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
 
"Ready to elevate your Instagram? Let's go
"Ready to elevate your Instagram? Let's go"Ready to elevate your Instagram? Let's go
"Ready to elevate your Instagram? Let's go
 
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
 
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
 
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncrCall Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
 
Stunning ➥8448380779▻ Call Girls In Paharganj Delhi NCR
Stunning ➥8448380779▻ Call Girls In Paharganj Delhi NCRStunning ➥8448380779▻ Call Girls In Paharganj Delhi NCR
Stunning ➥8448380779▻ Call Girls In Paharganj Delhi NCR
 

Web Archives for Verifying Attribution in Twitter Screenshots

  • 1. Modeling Simulation & Visualization Student Capstone Conference 2024 Web Archives for Verifying Attribution in Twitter Screenshots Track: AI and Autonomous Systems Authors: Tarannum Zaki, Michael L. Nelson, and Michele C. Weigle Presented by Tarannum Zaki Department of Computer Science Old Dominion University, Norfolk, Virginia April 11, 2024
  • 2. Screenshots are commonly used to annotate the social media of others Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL 2 https://twitter.com/BetteMidler/status/1541472225341198338 https://twitter.com/MahyarTousi/status/1534307163073658881 https://twitter.com/urbanachievr/status/1505944201208516612
  • 3. Why screenshots? To use as an evidence for deleted posts 3 https://web.archive.org/web/20220525125749/https://twitter.com/DanielDefense/status/1526237750277681154 Controversial posts may be deleted. https://twitter.com/ashtonpittman/status/1530243294868930560 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL https://twitter.com/DanielDefense/status/1526237750277681154 Other reasons: To deny cross-platform engagement, to aggregate, to mark-up etc.
  • 4. Did they really post that? Screenshots can also be used for humor, satire, and disinformation 4 https://twitter.com/Shayan86/status/1515753937139388418 https://twitter.com/paulthacker11/status/1495436489492090881 https://twitter.com/elonmusk/status/1544051155562598401 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 5. Creating fake tweets using Tweetgen 5 https://www.tweetgen.com/ https://www.tweetgen.com/create/tweet.html Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 6. Motivation ➢ Fake tweets can be responsible for misinformation/disinformation spread. ➢ Fake tweets are easy to create using online tools. ➢ There are no tools currently available to evaluate the authenticity of attribution of screenshots. 6 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 7. Aim To develop a tool that would automatically provide a probability whether a screenshot of a social media post is fake using the services of web archives. 7 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 8. To search for a tweet in the Wayback Machine, you must first know its URL 8 https://web.archive.org/web/20220323185843/https://twitter.com/annaturley/status/1506706947239817224 URL of the tweet: https://twitter.com/annaturley/status/1506706947239817224 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL https://web.archive.org/
  • 9. But, URL of a tweet is not present in most screenshots 9 https://twitter.com/AaronBastani/status/1507391218854117377 @annaturley March 23, 2022 March 25, 2022 https://twitter.com/TWITTER_HANDLE/status/TWEET_ID https://web.archive.org/web/20220323185843/https://twitter.com/annaturley/status/1506706947239817224 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL Tweet ID encodes the timestamp of when the tweet was created Construction of a tweet URL - Use the Twitter handle and approximate a time window based on the timestamp. - Construct URL for the tweet. - Search for the tweet in the Wayback Machine using the URL.
  • 10. Process to verify whether content of a screenshot exists in the Wayback Machine 10 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 11. Creating a dataset of screenshots collected from Twitter 11 Fields Shared post’s URL Original post’s URL Category Reason Content category Structural features Post type Social media Search strategy Annotated images Screenshot Remarks - Screenshot images shared on Twitter. - 200 examples - Examples include both real and fake screenshots https://ws-dl.blogspot.com/2022/12/2022-12-12-disinformation-spread-on.html https://twitter.com/rvawonk/status/1503227687917305863 https://twitter.com/RealCandaceO/status/1501576 352587292673 Category: Real Reason: Found in the live web Content category: Politics Post Type: Tweet Structural features: Single author, single post Search strategy: Searched on Twitter interface Social media: Twitter Original post’s URL Shared post’s URL Screenshot Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 12. OCRing screenshots: Single tweet images 12 OCR Optical Character Recognition extracts information as text from digital image. Example screenshot image OCR extracted output Twitter Handle Timestamp Tweet Text Zaki, T., Nelson, M.L., and Weigle, M.C. (2023, Jun 14). Extracting Information from Twitter Screenshots. Tech Report arXiv:2306.08236. https://doi.org/10.48550/arXiv.2306.08236 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 13. Computing a time window based on the screenshot timestamp 13 The maximum difference between two time zones on Earth is 26 hours. Example screenshot image OCR extracted output Twitter handle and computed timestamps Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 14. Using CDX API to retrieve archived tweets within the time window 14 request = "http://web.archive.org/cdx/search/cdx?url=" + urir + params urir = "https://twitter.com/"+randyhillier+"/status" params = "&matchType=prefix&from="+20220218154100+"&to="+20220220174100 CDX API prefix search process Twitter handle and computed timestamps Output: Retrieved archived tweets within the timeframe (cropped). Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL https://archive.org/help/wayback_api.php
  • 15. Extracting tweet IDs and determining tweet creation timestamp using TweetedAt 15 https://web.archive.org/web/20220218163926/https://twitter.com/randyhillier/status/1006984708109099008 https://ws-dl.blogspot.com/2019/08/2019-08-03-tweetedat-finding-tweet.html Each tweet ID encodes its creation timestamp An archived tweet’s URL https://oduwsdl.github.io/tweetedat/#1006984708109099008 Tweet ID Tweet Creation Date 1006984708109099008 20180613194037 ………… ………….. Mapping between all the tweet IDs and tweet creation timestamps Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 16. Determining the final set of archived tweets by filtering the tweet creation timestamps within the time window 16 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL https://web.archive.org/web/20220218163926/https://twitter.com/randyhillier/status/1006984708109099008 An archived tweet’s URL Timestamp when the tweet was archived Tweet ID encoding the tweet creation timestamp: 20180613194037 The archived timestamp of the tweet falls within the timeframe, but the tweet creation timestamp does not fall within the timeframe. So, such archived tweets can be filtered out.
  • 17. Extracting tweet text from archived tweets using BeautifulSoup and Selenium 17 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL https://web.archive.org/web/20220220024223/https://twitter.com/randyhillier/status/1495226962058649603 TweetTextSize TweetTextSize--jumbo js-tweet-text tweet-text An archived tweet’s URL Extracted text from archived tweet HTML tag containing the tweet text https://www.selenium.dev/ https://pypi.org/project/beautifulsoup4/ Selenium automates web scraping and BeautifulSoup parses text from HTML.
  • 18. Computing text similarity score between tweet text from screenshot and archived tweets using Python’s difflib library 18 Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL https://docs.python.org/3/library/difflib.html Example screenshot image Extracted text from archived tweet Extracted tweet text from screenshot match_score(Archived_Tweet_Text, Screenshot_Tweet_Text)= 81.40% Text similarity score is computed based on longest common subsequence Archived_Tweet_Text1 Screenshot_Tweet_Text match _score = 81.40% Archived_Tweet_Text2 Screenshot_Tweet_Text match_score = 30.78% Archived_Tweet_Text3 Screenshot_Tweet_Text match_score = 5.67% …………….. A match score of 81.40% helps us to prove the existence of the screenshot tweet posted by the alleged author.
  • 19. A threshold of 60% produced the highest F1 (0.69) 19 Threshold Value Precision Recall F1 Score 90% 1.00 0.42 0.59 80% 1.00 0.49 0.66 70% 1.00 0.51 0.67 60% 1.00 0.53 0.69 Experimented on 108 single tweet images from the collected dataset. Performance of the overlap between the tweet text from the screenshot and the archived tweets. Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL
  • 20. Summary 20 ➢ Screenshots are an easy way to share content on social media. ➢ Since screenshots can be easily faked, it is a critical task to detect a fabricated post. ➢ Services of web archives could be useful to verify attribution of a screenshot by finding an archived version of the screenshot content. ➢ Our research will mitigate misinformation and disinformation spread on social media. Tarannum Zaki MSVSCC 2024 Web Archives for Verifying Attribution in Twitter Screenshots @tarannum_zaki @WebSciDL