AIDR Tutorial (Artificial Intelligence for Disaster Response)

Muhammad Imran

This is a short tutorial of AIDR.

AIDR Tutorial
Muhammad Imran
Research Scientist
Qatar Computing Research Institute, HBKU
Doha, Qatar
http://aidr.qcri.org/

Outline
•  Data collection in AIDR
•  Data classification in AIDR
•  Data view/download in AIDR

Data Collection in AIDR
•  Twitter data collection strategies that AIDR supports
–  By keywords
–  By geographical regions
•  Strict: tweet coordinates fall strictly inside the geo boundaries
•  Approximate: tweets from a place that overlaps with the geo boundaries
–  By following Twitter users
–  By keywords + regions
•  Tweets that match any of the keywords and fall within the geo boundaries
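
The difference between the strict and approximate region modes can be sketched as two geometric tests. This is a minimal illustration, not AIDR's implementation: the `Box` type, the function names, the (west, south, east, north) ordering, and the inclusive treatment of boundary points are all assumptions, and the sketch ignores boxes that cross the antimeridian.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Hypothetical bounding box in degrees (longitude first, then latitude)."""
    west: float
    south: float
    east: float
    north: float


def strict_match(lon: float, lat: float, region: Box) -> bool:
    """Strict mode: the tweet's own coordinates lie inside the region."""
    return region.west <= lon <= region.east and region.south <= lat <= region.north


def approximate_match(place: Box, region: Box) -> bool:
    """Approximate mode: the tweet's place box overlaps the region at all."""
    return (
        place.west <= region.east
        and place.east >= region.west
        and place.south <= region.north
        and place.north >= region.south
    )
```

A tweet tagged with a large place (a whole county, say) can pass the approximate test while its exact coordinates, if it had any, would fail the strict one.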

Data Collection Using Keywords
•  Keyword limit = 400
•  A keyword can be a single word such as “Suffolk” or a phrase such as “Suffolk accident”
•  A keyword or phrase cannot exceed 60 bytes (1 char = 1 byte)
•  Generic keywords collect irrelevant tweets
•  Specific keywords are more likely to collect relevant tweets
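
The two limits above can be checked before a collection is started. The sketch below is only an illustration: the function name and the comma-separated input format are assumptions, and it counts UTF-8 bytes, so the "1 char = 1 byte" rule holds for ASCII text while accented or non-Latin characters count for more.

```python
MAX_KEYWORDS = 400        # keyword limit from the slide
MAX_KEYWORD_BYTES = 60    # per keyword or phrase, from the slide


def validate_keywords(raw: str) -> list[str]:
    """Split a comma-separated keyword string and check it against both limits."""
    keywords = [k.strip() for k in raw.split(",") if k.strip()]
    if len(keywords) > MAX_KEYWORDS:
        raise ValueError(f"{len(keywords)} keywords given, limit is {MAX_KEYWORDS}")
    for kw in keywords:
        size = len(kw.encode("utf-8"))  # bytes, not characters
        if size > MAX_KEYWORD_BYTES:
            raise ValueError(f"{kw!r} is {size} bytes, limit is {MAX_KEYWORD_BYTES}")
    return keywords
```

For example, `validate_keywords("Suffolk, Suffolk accident")` returns both entries unchanged, while a 61-character ASCII phrase raises `ValueError`.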

Location-based Collection
•  Bounding boxes do not act as filters for other filter parameters. For example:
keyword=twitter&locations=-122.75,36.8,-121.75,37.8
would match any tweets containing the term Twitter (even non-geo tweets) OR coming from the San Francisco area.
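
The OR behaviour described above, and the AND behaviour of the "keywords + regions" strategy, can be contrasted in a few lines. This is a sketch under assumptions: the function names are hypothetical, keyword matching is reduced to a case-insensitive substring test, and the `locations` value is read as west, south, east, north.

```python
from typing import Optional

Coords = Optional[tuple[float, float]]  # (lon, lat), or None for non-geo tweets


def parse_box(locations: str) -> tuple[float, float, float, float]:
    """Parse 'west,south,east,north' as used in the example query."""
    west, south, east, north = (float(v) for v in locations.split(","))
    return west, south, east, north


def in_box(coords: Coords, box: tuple[float, float, float, float]) -> bool:
    if coords is None:
        return False
    lon, lat = coords
    west, south, east, north = box
    return west <= lon <= east and south <= lat <= north


def has_keyword(text: str, keywords: list[str]) -> bool:
    return any(k.lower() in text.lower() for k in keywords)


def matches_or(text: str, coords: Coords, keywords: list[str], box) -> bool:
    """Keyword and bounding box given together: either one is enough."""
    return has_keyword(text, keywords) or in_box(coords, box)


def matches_and(text: str, coords: Coords, keywords: list[str], box) -> bool:
    """Keywords + regions strategy: both conditions must hold."""
    return has_keyword(text, keywords) and in_box(coords, box)
```

With the San Francisco box from the example, a non-geo tweet containing "Twitter" passes `matches_or` but fails `matches_and`.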

Following Twitter Users
For each user specified, the tool will collect:
•  Tweets created by the user.
•  Tweets which are retweeted by the user.
•  Replies to any Tweet created by the user.
•  Retweets of any Tweet created by the user.
•  Manual replies, created without pressing a reply button (e.g. “@twitterapi I agree”).
The tool will not collect:
•  Tweets mentioning the user (e.g. “Hello @twitterapi!”).
•  Manual Retweets created without pressing a Retweet button (e.g. “RT @twitterapi The API is great”).
•  Tweets by protected users.
Use a comma-separated list of Twitter user IDs (http://gettwitterid.com/)
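
The collect / do-not-collect rules above can be read as a single predicate over a tweet. The sketch below uses a hypothetical, simplified `Tweet` record; the field names are illustrative and are not AIDR's data model. It also assumes that a manual reply such as “@twitterapi I agree” carries a reply-to user reference, whereas a plain mention or a manual “RT @user” does not.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Tweet:
    # Hypothetical record for illustration only.
    author_id: int
    text: str
    author_protected: bool = False
    retweet_of_user_id: Optional[int] = None   # set by the Retweet button
    in_reply_to_user_id: Optional[int] = None  # set for replies (assumed for manual replies too)


def parse_user_ids(raw: str) -> list[int]:
    """Parse the comma-separated list of numeric user IDs."""
    return [int(part) for part in raw.split(",") if part.strip()]


def is_collected(tweet: Tweet, followed_id: int) -> bool:
    if tweet.author_protected:
        return False                      # tweets by protected users
    if tweet.author_id == followed_id:
        return True                       # the user's own tweets and retweets
    if tweet.retweet_of_user_id == followed_id:
        return True                       # retweets of the user's tweets
    if tweet.in_reply_to_user_id == followed_id:
        return True                       # replies, including manual ones
    return False                          # mentions and manual "RT @user ..."
```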

Data Classification in AIDR
•  Define classifiers (name, description)
–  Define labels (name, description)
–  Having a “miscellaneous” category will be helpful
•  Wait around 15-20 minutes (for fast collections) or 30-40 minutes (for slow collections)
•  Start tagging

Classifier Generation
•  Check the classifier status (UI)
–  The first classifier/model will be up after 50 labeled tweets, ideally equally distributed among labels
–  If no model appears after 50 tags, keep tagging
•  Human-tagged items (the more the better)
•  40 more needed to re-train (next classifier target)
•  Machine-tagged items (keep an eye on misclassifications)
•  Quality (ideally, AUC should be above 90 but not equal to 100)
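
The checks in this list can be mirrored outside the UI. The sketch below is illustrative only and does not show how AIDR computes its figures: the function names are hypothetical, and AUC is computed by its standard pairwise definition (the share of positive/negative pairs in which the positive item gets the higher score, ties counting half).

```python
from collections import Counter

FIRST_MODEL_THRESHOLD = 50  # labeled tweets before the first model, from the slide


def label_summary(labels: list[str]) -> dict:
    """Count tags per label and how many more are needed for the first model."""
    return {
        "counts": dict(Counter(labels)),
        "remaining_for_first_model": max(0, FIRST_MODEL_THRESHOLD - len(labels)),
    }


def auc(scores: list[float], truths: list[bool]) -> float:
    """Pairwise AUC in the range 0.0 to 1.0."""
    pos = [s for s, t in zip(scores, truths) if t]
    neg = [s for s, t in zip(scores, truths) if not t]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative item")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def quality_ok(auc_percent: float) -> bool:
    """Target range from the slide: above 90 but not a perfect 100."""
    return 90 < auc_percent < 100
```

A perfect 100 is excluded because, on a small tagged set, it more often signals too few or too easy evaluation items than a flawless classifier; this reading is an interpretation of the slide, not something it states.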
