Msm2011 Twitter Citations

Presentation at Making Sense of Microposts Workshop (#MSM2011), co-located with Extended Semantic Web Conference 2011 (ESWC2011), Crete, Greece, May 30 2011

Presentation Transcript

  • Citation Analysis in Twitter: Approaches for Defining and Measuring Information Flows within Tweets during Scientific Conferences
    Katrin Weller, Evelyn Dröge, & Cornelius Puschmann
    Presentation at Making Sense of Microposts Workshop (#MSM2011), co-located with Extended Semantic Web Conference (ESWC 2011), Crete, Greece
    May 30, 2011
  • Scientific Microblogging?
    How can scientific tweets be identified?
    Based on content
      • What is scientific content on Twitter?
      • No automatic identification.
    Based on persons
      • What is a scientist?
      • How can we find scientists on Twitter?
    Based on formatting
      • Identification via hashtags.
      • One must accept data loss.
  • Analyzing Conference Tweets
    Data Collection
      • Selection of conferences
      • Collection of tweets based on conference hashtags (www.twapperkeeper.com)
    Automatic Analysis
      • Time series
      • Most active users
      • User networks
    Manual Analysis
      • Categorization of tweet contents
      • Key question: Are tweets dealing with the scientific topics of a conference?
    Citation Analysis
      • URLs in tweets ("external citations")
      • Retweets ("internal citations")
  • Citations and References
      • Document A cites Document B = A includes a reference to B.
      • There is an information flow from B to A.
      • Document B receives a citation (and thus reputation).
    [Diagram: Cited Document B states "Everything will be all right!". Citing Document A states "As pointed out by B, everything will be all right. Reference: see Document B." Information flows from B to A; reputation flows from A to B.]
  • Citations and References
    [Diagram labels: Citation, Reference, Quotation]
  • Citations and References on Twitter?
    URLs in tweets
      • References to websites ("external citations")
      • What is cited?
    Retweets
      • Quotations, reference to twitterers ("internal citations")
      • Lack of formatting standards
    @-mentions
      • "Just read an interesting paper by @sampleuser"
      • Can hardly be identified
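The three citation-like features named on this slide can be sketched as simple pattern matches on the tweet text. This is a minimal Python illustration, not the procedure used in the study: the regular expressions, the "RT @user" convention, and the function name `classify` are all assumptions.

```python
import re

# Assumed patterns; the slides do not specify how each feature was detected.
URL_RE = re.compile(r"https?://\S+")
RT_RE = re.compile(r"\bRT\s*@(\w+)", re.IGNORECASE)
MENTION_RE = re.compile(r"@(\w+)")


def classify(tweet: str) -> dict:
    """Return the citation-like features found in one tweet text."""
    retweeted = RT_RE.findall(tweet)
    # Every remaining @name is a plain mention, which stays ambiguous:
    # it may or may not refer to that person's work.
    mentions = [m for m in MENTION_RE.findall(tweet) if m not in retweeted]
    return {
        "urls": URL_RE.findall(tweet),  # candidate external citations
        "retweet_of": retweeted,        # candidate internal citations
        "mentions": mentions,           # hard to interpret automatically
    }


print(classify("RT @alice: great paper http://example.org/p by @bob"))
```

Retweets written in other styles (for example "via @user") would be missed by this pattern, which is one way the lack of formatting standards shows up in practice.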
  • Data Collection: Selected Conferences
    Conference | World Wide Web Conference | Dublin Core and Metadata Conference | Modern Language Association | Meeting of the Institute for Enabling Geospatial Scholarship
    Hashtag | #www2010 | #dcmi2009 | #mla09 | #geoinst
    Location | Raleigh, NC, USA | Seoul, South Korea | Philadelphia, PA, USA | Charlottesville, VA, USA
    Days | April 26-30, 2010 | October 12-16, 2009 | December 27-30, 2009 | May 25-27, 2010
    Discipline | Computer science | Computer science | (Digital) Humanities | Digital Humanities
    Participants | > 1,000 | < 500 | > 1,000 | < 500
    Days for data collection | April 13 to May 14, 2010 | October 12-23, 2009 | December 15, 2009 to January 14, 2010 | May 11 to June 9, 2010
  • Basic Facts
    Conference | #www2010 | #dcmi2009 | #mla09 | #geoinst
    Number of tweets | 3,358 | 146 | 1,929 | 1,673
    Number of twitterers | 903 | 27 | 369 | 99
    Ø tweets per twitterer | 3.72 | 5.41 | 5.23 | 16.90
    Number / percentage of RTs* | 1,121/3,358 (33.38%) | 37/146 (25.34%) | 413/1,929 (21.41%) | 139/1,673 (8.31%)
    Ø RTs* per twitterer | 1.24 | 1.37 | 1.12 | 1.40
    Number / percentage of @-messages | 251/3,358 (7.47%) | 8/146 (5.48%) | 258/1,929 (13.37%) | 407/1,673 (24.33%)
    Ø @-messages per twitterer | 0.28 | 0.30 | 0.70 | 4.11
    * Automatically detected RTs
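The averages and percentages on this slide follow from the raw counts by simple division. The Python check below uses only counts shown on the slide; the dictionary layout and the function name `derive` are illustrative and not part of the presentation.

```python
# Raw counts copied from the "Basic Facts" slide.
counts = {
    "#www2010": {"tweets": 3358, "twitterers": 903, "rts": 1121, "at_messages": 251},
    "#dcmi2009": {"tweets": 146, "twitterers": 27, "rts": 37, "at_messages": 8},
    "#mla09": {"tweets": 1929, "twitterers": 369, "rts": 413, "at_messages": 258},
    "#geoinst": {"tweets": 1673, "twitterers": 99, "rts": 139, "at_messages": 407},
}


def derive(c: dict) -> dict:
    """Recompute the per-twitterer averages and percentage shares."""
    return {
        "tweets_per_twitterer": round(c["tweets"] / c["twitterers"], 2),
        "rt_share_pct": round(100 * c["rts"] / c["tweets"], 2),
        "rts_per_twitterer": round(c["rts"] / c["twitterers"], 2),
        "at_share_pct": round(100 * c["at_messages"] / c["tweets"], 2),
        "at_per_twitterer": round(c["at_messages"] / c["twitterers"], 2),
    }


for hashtag, c in counts.items():
    print(hashtag, derive(c))
```

For #www2010 this gives 3.72 tweets per twitterer and an RT share of 33.38%, matching the slide.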
  • External Citations: URLs in Tweets
    Different ways to count URLs
    Measure | #www2010 | #mla09
    Number (and %) of tweets including at least one URL | 1,338 (39.85%) | 525 (27.22%)
    Number of total URLs | 1,460 | 551
    Number of unique URLs | 574 | 199
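The three counts on this slide differ because one tweet can hold several URLs and one URL can appear in many tweets. A minimal Python sketch on made-up sample tweets shows the distinction; the URL pattern, the sample data, and the function name `url_counts` are assumptions, and resolving shortened URLs to their targets is left out.

```python
import re
from collections import Counter

URL_RE = re.compile(r"https?://\S+")  # assumed pattern, not from the slides


def url_counts(tweets: list[str]) -> dict:
    """Count URLs three ways and rank them by frequency."""
    per_tweet = [URL_RE.findall(t) for t in tweets]
    all_urls = [u for urls in per_tweet for u in urls]
    freq = Counter(all_urls)
    return {
        "tweets_with_url": sum(1 for urls in per_tweet if urls),
        "total_urls": len(all_urls),
        "unique_urls": len(freq),
        "ranked": freq.most_common(),  # basis for a rank-frequency plot
    }


# Hypothetical tweets, for illustration only.
sample = [
    "Slides are up http://example.org/slides #conf",
    "RT @a: Slides are up http://example.org/slides #conf",
    "Two links http://example.org/slides http://example.org/paper",
    "No link here #conf",
]
result = url_counts(sample)
print(result)
```

On the sample, three of four tweets contain a URL, four URLs occur in total, and two of them are unique.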
  • Frequency of URLs
    [Figure: two rank-frequency plots, "Distribution of URLs from #www2010" and "Distribution of URLs from #mla09". x-axis: URL on rank n (ranked by frequency); y-axis: frequency of URL on rank n.]
  • URL Categorization
    Categories: Blog, Conference, Error, Media, Press, Project, Publication, Slides, Twitter, Other
  • Frequent URLs and their categories: #www2010
    URL | Frequency | Category
    http://blog.marcua.net/post/566480920/twitter-papers-at-the-www-2010-conference | 41 | Blog
    http://www.danah.org/papers/talks/2010/WWW2010.html | 35 | Publication
    http://kmi.tugraz.at/staff/markus/www2010/www2010_roomstream.html | 29 | Twitter
    http://xquery.pbworks.com/rtp-meetup | 22 | Error
    http://www.elon.edu/e-web/predictions/futureweb2010/carl_malamud_www_keynote.xhtml | 22 | Conference
    http://www.elon.edu/e-web/predictions/futureweb2010/default.xhtml | 18 | Conference
    http://futureweb2010.wordpress.com/schedule/ | 16 | Conference
    http://www.slideshare.net/haewoon/what-is-twitter-a-social-network-or-a-news-media-3922095 | 13 | Slides
    http://events.linkeddata.org/ldow2010/ | 12 | Conference
    http://opengraphprotocol.org/ | 12 | Project
    http://www.websci10.org/program.html | 12 | Conference
  • Frequent URLs and their categories: #mla09
    URL | Frequency | Category
    http://amandafrench.net/2009/12/30/make-10-louder/ | 27 | Blog
    http://www.briancroxall.net/2009/12/28/the-absent-presence-todays-faculty/ | 23 | Blog
    http://nowviskie.org/2009/monopolies-of-invention/ | 22 | Blog
    http://chronicle.com/article/missing-in-action-at/63276/ | 20 | Error
    http://www.profhacker.com/?p=4448 | 18 | Press
    http://www.samplereality.com/2009/11/15/digital-humanities-sessions-at-the-2009-mla/ | 18 | Blog
    http://chronicle.com/blogpost/the-mlathe-digital/19468/ | 16 | Press
    http://www.profhacker.com/2010/01/09/academics-and-social-media-mla09-and-twitter/ | 15 | Press
    http://academhack.outsidethetext.com/home/2010/the-mla-briancroxall-and-the-non-rise-of-the-digital-humanities/ | 15 | Blog
    http://www.samplereality.com/2010/01/02/the-mla-in-tweets/ | 15 | Blog
  • URL Categories: #mla09 and #www2010
    [Figure: four pie charts of URL categories; the values are listed below.]
    Category | #mla09, all URLs (n=551) | #mla09, unique URLs (n=199) | #www2010, all URLs (n=1,460) | #www2010, unique URLs (n=574)
    Blog | 229 | 54 | 222 | 68
    Conference | 23 | 16 | 206 | 37
    Error | 34 | 28 | 201 | 92
    Media | 69 | 25 | 137 | 71
    Press | 123 | 34 | 92 | 33
    Project | 11 | 5 | 116 | 51
    Publication | 4 | 3 | 135 | 52
    Slides | 0 | 0 | 106 | 45
    Twitter | 22 | 14 | 76 | 31
    Other | 36 | 20 | 169 | 94
  • Internal Citations: Retweets
    Different ways to count retweets
    Measure | #www2010 | #mla09
    Automatically detected RTs: number and percentage of RTs in entire conference dataset | 1,121 (33.38% of 3,358) | 414 (21.46% of 1,929)
    Ø RTs per twitterer (automatically detected RTs, entire conference dataset) | 1.24 | 1.12
    Manually detected RTs: number and percentage of RTs in entire conference dataset | 1,318 (39.25% of 3,358) | 514 (26.65% of 1,929)
  • Retweets by Persons
    Top 3 highly cited and highly citing twitterers
    #www2010 RTs given | #www2010 RTs received | #mla09 RTs given | #mla09 RTs received
    laterribleliz (46) | boraz (85) | newfacmajority (25) | samplereality (49)
    uncpublichealth (42) | apisanty (61) | ryancordell (20) | briancroxall (35)
    olgag (30) | futureweb2010 (51) | jcmeloni (13) | nowviskie (33)
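A ranking of "RTs given" and "RTs received" can be produced by tallying, for each retweet, the account that posted it and the account it credits. The Python sketch below assumes the "RT @user" convention and (author, text) pairs as input; the pattern, the sample data, and the function name `retweet_tallies` are hypothetical and do not reproduce the study's detection method.

```python
import re
from collections import Counter

RT_RE = re.compile(r"\bRT\s*@(\w+)")  # assumed retweet convention


def retweet_tallies(tweets):
    """tweets: iterable of (author, text) pairs.

    Returns two Counters: retweets given per author and
    retweets received per credited account.
    """
    given, received = Counter(), Counter()
    for author, text in tweets:
        match = RT_RE.search(text)  # only the first RT marker is counted
        if match:
            given[author.lower()] += 1
            received[match.group(1).lower()] += 1
    return given, received


# Hypothetical tweets, for illustration only.
sample = [
    ("alice", "RT @bob: keynote starts now #conf"),
    ("carol", "RT @bob: keynote starts now #conf"),
    ("alice", "RT @dave: slides online #conf"),
    ("bob", "keynote starts now #conf"),
]
given, received = retweet_tallies(sample)
print(given.most_common(3))
print(received.most_common(3))
```

Sorting each Counter with `most_common` also yields the rank order needed for a distribution plot of retweets per person.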
  • Retweet Distribution: #www2010
    [Figure: "Distribution of given and received retweets for #www2010". x-axis: person on rank n; y-axis: number of RTs received for or given by person on rank n; series: RTs received, RTs given.]
  • Retweet Distribution: #mla09
    [Figure: "Distribution of given and received retweets for #mla09". x-axis: person on rank n; y-axis: number of RTs received for or given by person on rank n; series: RTs received, RTs given.]
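  • Internal and External Citations
    [Figure: overlap diagrams of URLs and RTs for each conference; the values are listed below.]
    Conference | URLs | RTs with URL | RTs
    #www2010 | 1,338 | 530 | 1,121
    #mla09 | 525 | 270 | 414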
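The overlap counts can be turned into shares to compare how strongly retweets and URLs are interwoven in each dataset. The Python sketch below rests on one reading of the diagram: that "URLs" counts tweets containing at least one URL (1,338 and 525 match the figures on the URL-counting slide) and that 530 and 270 are the intersections. The key names and the function name `shares` are illustrative.

```python
# Counts read off the overlap diagram; the interpretation is an assumption.
overlap = {
    "#www2010": {"url_tweets": 1338, "rts": 1121, "rts_with_url": 530},
    "#mla09": {"url_tweets": 525, "rts": 414, "rts_with_url": 270},
}


def shares(c: dict) -> dict:
    """Express the overlap as a percentage of each of the two sets."""
    return {
        "pct_of_rts_with_url": round(100 * c["rts_with_url"] / c["rts"], 2),
        "pct_of_url_tweets_that_are_rts": round(
            100 * c["rts_with_url"] / c["url_tweets"], 2
        ),
    }


for hashtag, c in overlap.items():
    print(hashtag, shares(c))
```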
  • Highly Cited Tweets: #mla09
    URL No. | Tweet text and ID | From user | RTs
    1 | Hey, guys, I've blogged about "the amplification of scholarly communication": Twitter, #MLA09, @briancroxall, & such: http://bit.ly/7SRgqZ (ID: 7221520139) | amandafrench | 18
    5 | New at ProfHacker: “Academics and Social Media: #mla09 and Twitter,” by @GeorgeOnline (and a bunch of you): http://wp.me/pAGUw-19K (ID: 7566711357) | profhacker | 17
    3 | "Monopolies of Invention:" text of my #MLA09 talk on labor & IP issues in humanities collaboration: http://is.gd/5Gckz (ID: 7185970970) | nowviskie | 16
  • Conclusions
      • Internal and external citations are used to similar extents and are also interwoven.
      • Citation behavior on Twitter is not equivalent to classical structures of scientific citations: new methods are needed.
      • Different disciplines seem to behave differently.
  • Outlook
    Future work:
      • Inclusion of additional conferences
      • Working with datasets based on people (selected scientists)
      • Refinement of URL classification scheme
      • Identification of "user types" and motivations for citations
  • Greetings from Düsseldorf!
    Dr. Katrin Weller
    Dept. of Information Science, Heinrich-Heine-University Düsseldorf
    Universitätsstr. 1, Geb. 23.21.04.68, D-40225 Düsseldorf
    E-Mail: weller@uni-duesseldorf.de
    Twitter: @kwelle
    Slides will be available: http://www.slideshare.net/katrinweller
    Acknowledgements: Cornelius Puschmann (@coffee001), Evelyn Dröge (@knuurps), Parinaz Maghferat (@ParrPar), Julia Verbina (@free5pirit); #iwhhu #nfgwin
  • Further Reading
      • Weller, K., & Puschmann, C. (2011, in press). Twitter for Scientific Communication: How Can Citations/References be Identified and Measured? To appear in: Proceedings of the Poster Session at the Web Science Conference 2011, Koblenz, Germany. Preprint: http://www.websci11.org/fileadmin/websci/Posters/153_paper.pdf
      • Weller, K., Dröge, E., & Puschmann, C. (2011). Citation Analysis in Twitter: Approaches for Defining and Measuring Information Flows within Tweets during Scientific Conferences. In Matthew Rowe, Milan Stankovic, Aba-Sah Dadzie, & Mariann Hardey (Eds.), Making Sense of Microposts (#MSM2011), Workshop at Extended Semantic Web Conference (ESWC 2011), Crete, Greece (pp. 1-12). CEUR Workshop Proceedings Vol. 718. http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-718/
      • Puschmann, C., Weller, K., & Dröge, E. (2011). Studying Twitter conversations as (dynamic) graphs: Visualization and structural comparison. Poster presented at General Online Research (GOR 11), 14-16 March 2011, Düsseldorf, Germany. Retrieved from http://ynada.com/posters/gor11.pdf
      • Dröge, E., Maghferat, P., Puschmann, C., Verbina, J., & Weller, K. (2011). Konferenz-Tweets. Ein Ansatz zur Analyse der Twitter-Kommunikation bei wissenschaftlichen Konferenzen. In Joachim Griesbaum, Thomas Mandl, & Christa Womser-Hacker (Eds.), Information und Wissen: global, sozial und frei? Proceedings des 12. Internationalen Symposiums für Informationswissenschaft (pp. 98-110). Boizenburg: VWH.