Sample Twitter Data Deposit Form
                                                    Kris Kasianovitz. November, 2011


P.I. Name                                             Todd Presner

University/College Affiliation                        UCLA Germanic Languages
Address                                               BOX 951539, 212 RH, Los Angeles, CA 90095-1539
Email                                                 presner@ucla.edu

Phone                                                 310-794-6051

Research Assistant (if primary contact)
                                                      Use of Twitter in Egyptian Revolution merited some form of documentation. Wanted to
Reason for Capture                                    capture Tweets in order to have long-term as well as to display via HyperCities Platform to
                                                      show where and what people were tweeting.
                                                      Currently being displayed via HyperCities, http://egypt.hypercities.com/ that enables one to
Researcher’s use of captured data
                                                      search captured tweets by keyword or display by date on a Google earth base map.
Includes Protected Accounts?                          No
If Yes, explain any steps taken to keep these
                                                      N/A
Tweets protected in your dataset.
Did you contact Twitter or the Twitter users to
                                                      No
inform them of data capture?
If yes, please upload all related documentation.      N/A
Did you contact campus counsel, or receive any
university guidance about potential risks involved    Yes
in making this data set available?
If yes, please upload all related documentation       N/A
                                                            Open, no restrictions
                                                            Portions of data are restricted, contact PI
Restrictions on re-use/redistribution
                                                            Restricted, contact PI
                                                            Do not release. Archiving data only, not allowing re-use.
                                                            REST API
                                                            Search API (will be deprecated)
Capture Methods                                             Streaming API
                                                            Licensed Data Set
                                                            Other
                                                      Location = Center of Cairo within 200 miles
Capture Parameters                                    AND
                                                      hashtag = (#jan25 OR #egypt OR #tahrir)
Capture Dates                                         January 30-March 8, 2011
Capture Frequency
                                                      Daily within Twitter Rate Limits
(Daily, Weekly, Monthly, etc.)
Please note any anomalies or issues encountered       On February 28, when setting up the captures, ran into a rate limit issue; was not able to
with capture.                                         download data until the next day. Duplicate Tweets were removed.
Total Number of Tweets                                420,000

Total Number of Unique User IDs                       approximately 40,000
Does the data set contain Latitude/Longitude
                                                      Yes.
Data? Y/N
If yes,
                                                      1.     Lat/Lon and Twitter Location fields were captured
1. state specific fields captured
                                                      2.     The majority of Tweets did not contain any data in the Lat/Long fields.
2. how many tweets contain these fields,
                                                      3.     For display purposes in HyperCities, all locations are aggregated at the city level. We
3. did you recode, aggregate, or delete these
                                                             did not remove any Lat/Long Data from this data set.
      fields from the data set being deposited?
Have you handled the Location data as required
in the Twitter Geo Developer Guideslines?
Yes/No
                                                      Yes.
If you are unsure, please review these guidelines
https://dev.twitter.com/terms/geo-developer-
guidelines
Does the data set contain ONLY public tweets?
                                                      Yes.
Y/N
Did you capture image files locally?                  No.

Sample twitter data deposit form

  • 1.
    Sample Twitter DataDeposit Form Kris Kasianovitz. November, 2011 P.I. Name Todd Presner University/College Affiliation UCLA Germanic Languages Address BOX 951539, 212 RH, Los Angeles, CA 90095-1539 Email presner@ucla.edu Phone 310-794-6051 Research Assistant (if primary contact) Use of Twitter in Egyptian Revolution merited some form of documentation. Wanted to Reason for Capture capture Tweets in order to have long-term as well as to display via HyperCities Platform to show where and what people were tweeting. Currently being displayed via HyperCities, http://egypt.hypercities.com/ that enables one to Researcher’s use of captured data search captured tweets by keyword or display by date on a Google earth base map. Includes Protected Accounts? No If Yes, explain any steps taken to keep these N/A Tweets protected in your dataset. Did you contact Twitter or the Twitter users to No inform them of data capture? If yes, please upload all related documentation. N/A Did you contact campus counsel, or receive any university guidance about potential risks involved Yes in making this data set available? If yes, please upload all related documentation N/A  Open, no restrictions  Portions of data are restricted, contact PI Restrictions on re-use/redistribution  Restricted, contact PI  Do not release. Archiving data only, not allowing re-use.  REST API  Search API (will be deprecated) Capture Methods  Streaming API  Licensed Data Set  Other Location = Center of Cairo within 200 miles Capture Parameters AND hashtag = (#jan25 OR #egypt OR #tahrir) Capture Dates January 30-March 8, 2011 Capture Frequency Daily within Twitter Rate Limits (Daily, Weekly, Monthly, etc.) Please note any anomalies or issues encountered On February 28, when setting up the captures, ran into a rate limit issue; was not able to with capture. download data until the next day. Duplicate Tweets were removed. Total Number of Tweets 420,000 Total Number of Unique User IDs approximately 40,000 Does the data set contain Latitude/Longitude Yes. Data? Y/N If yes, 1. Lat/Lon and Twitter Location fields were captured 1. state specific fields captured 2. The majority of Tweets did not contain any data in the Lat/Long fields. 2. how many tweets contain these fields, 3. For display purposes in HyperCities, all locations are aggregated at the city level. We 3. did you recode, aggregate, or delete these did not remove any Lat/Long Data from this data set. fields from the data set being deposited? Have you handled the Location data as required in the Twitter Geo Developer Guideslines? Yes/No Yes. If you are unsure, please review these guidelines https://dev.twitter.com/terms/geo-developer- guidelines Does the data set contain ONLY public tweets? Yes. Y/N Did you capture image files locally? No.