Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Understanding and improving Wikipedia article discussion spaces SAC2011


Published on

How can we make Wikipedia Talk pages easier for readers, editors, and administrators to use? What kind of structure can be added?

Symposium on Applied Computing (SAC 2011) paper presentation slides from Taichung, Taiwan

Wikipedia’s article discussion spaces (“Talk pages”) form a large and growing proportion of the encyclopedia, used for collaboration and article improvement. So far there is no in-depth account of how article Talk pages are used, what is wrong with them, and how they can be improved. This paper reports on three contributions promoting the under- standing of and improvement of these spaces:
(1) Wikipedia editor interviews provide an increased understanding of readers’ and editors’ needs,
(2) a large-scale comparative content analysis adds to knowledge of what kinds of discussions and coordination occur on Talk pages,
(3) a prototype bookmarklet-based system, which we test in a formative user evaluation, integrates lightweight semantics.

Full paper at

Published in: Technology
  • Login to see the comments

Understanding and improving Wikipedia article discussion spaces SAC2011

  1. 1. Understanding and Improving Wikipedia Article Discussion Spaces Jodi Schneider , Alexandre Passant, John Breslin ACM SAC 2011-03-24 Taichung, Taiwan
  2. 2. Wikipedia editors are leaving faster than they can be replaced Felipe Ortega via of 27
  3. 3. How do we turn readers into editors? <ul><li>Ensure people know they can edit </li></ul><ul><li>Make editing easier </li></ul><ul><li>Help learn how things work by reading discussions! </li></ul><ul><ul><li>“ Reading Talk pages – the behind-the-scenes discussions about Wikipedia articles – signals a transition towards more active forms of participation.” – Antin & Cheshire, CSCW 2010 </li></ul></ul><ul><li>Make more edits “stick” </li></ul><ul><ul><li>Understand what kinds of contributions are accepted </li></ul></ul><ul><ul><ul><li>Provide support for creating good arguments </li></ul></ul></ul><ul><ul><ul><li>Avoid need for reverts </li></ul></ul></ul> of 27
  4. 4. Wikipedia Discussion Space: “Talk page” of 27
  5. 5. Talk pages need semantics <ul><li>Lots of conversations </li></ul><ul><ul><li>Viégas: “the fastest growing areas of Wikipedia are devoted to coordination and organization” </li></ul></ul><ul><li>When are people agreeing/ disagreeing? </li></ul><ul><ul><li>Not well understood! </li></ul></ul><ul><li>Very little study of Talk pages </li></ul><ul><ul><li>Largest study: 60 pages, 2 types. Discovered: Featured Articles have 10x discussion! </li></ul></ul><ul><li>Immense variation between pages </li></ul> of 27 Data from Stvilia
  6. 6. Social Semantic Web of 27
  7. 7. My Research Questions <ul><li>What do Wikipedians do on Talk pages? </li></ul><ul><li>What kind of arguments happen on Talk pages? </li></ul><ul><li>Can we add structure to make pages “fit” how editors and readers use them? </li></ul> of 27
  8. 8. Three ways of understanding Talk pages <ul><li>Interviews with editors and administrators </li></ul><ul><ul><li>What do Wikipedians do on Talk pages? </li></ul></ul><ul><li>Hand content analysis of 100 Talk pages </li></ul><ul><ul><li>What kind of arguments happen on Talk pages? </li></ul></ul><ul><li>Developing & using a semantic model </li></ul><ul><ul><li>Can we add structure to make pages “fit” how editors and readers use them? </li></ul></ul> of 27
  9. 9. 1. Interviews <ul><li>Administrators </li></ul><ul><ul><li>Frequently monitor conversations </li></ul></ul><ul><ul><li>Know + meet co-editors </li></ul></ul><ul><ul><li>Make community-related edits such as adding infoboxes </li></ul></ul><ul><ul><li>More likely to move/rename articles and Talk pages </li></ul></ul><ul><li>Editors </li></ul><ul><ul><li>Mostly read Talk pages </li></ul></ul><ul><ul><li>“ Get the scoop”—what’s controversial? More details? </li></ul></ul><ul><ul><li>More likely to read older conversations </li></ul></ul><ul><ul><li>May learn policy and procedures </li></ul></ul> of 27
  10. 10. 2. Content Analysis <ul><li>100 Talk pages </li></ul><ul><li>5 categories of pages </li></ul><ul><ul><li>Most editors (of the article) </li></ul></ul><ul><ul><li>Most visits (to the article) </li></ul></ul><ul><ul><li>Controversial </li></ul></ul><ul><ul><li>Featured Articles </li></ul></ul><ul><ul><li>Random </li></ul></ul><ul><li>15 classifications </li></ul> of 27
  11. 11. of 27 Classification Example Reference to... Sources outside the wiki ... Not sure where to put it but I’ll leave it here as somebody might find it useful Reverts, removed material, or controversial edits I noticed some people edit the page into what it will be in 10 minutes but someone is reverting it...just let it be Edits the discussant made Added the review since the review was part of the reception section. Requests for... Help with another article, portal, etc. This is just to invite attention to the page Facebook statistics just created…
  12. 12. The 15 Classifications <ul><li>References to… </li></ul><ul><li>Vandalism </li></ul><ul><li>Guidelines and policies </li></ul><ul><li>Sources outside Wikipedia </li></ul><ul><li>Reverts, removed material, or controversial edits </li></ul><ul><li>Edits the discussant made </li></ul><ul><li>Internal Wikipedia resources </li></ul><ul><li>Requests for… </li></ul><ul><li>Editing coordination </li></ul><ul><li>Information </li></ul><ul><li>Help with another article </li></ul><ul><li>Peer review </li></ul><ul><li>Etc. </li></ul><ul><li>Off-topic remarks </li></ul><ul><li>Polls </li></ul><ul><li>Information boxes </li></ul><ul><li>Images </li></ul><ul><li>Other </li></ul> of 39 of 27
  13. 14. 3a. Developing a content-based semantic model <ul><li>Represent article structure </li></ul><ul><ul><li>Reuse existing ontologies (FOAF, SIOC) </li></ul></ul><ul><li>Represent content (based on the content analysis) </li></ul><ul><ul><li>Winnow the 15 classifications: relevance & plausibility </li></ul></ul><ul><ul><ul><li>“ Relevant” for querying and retrieving information </li></ul></ul></ul><ul><ul><ul><li>“ Plausible” a person would mark their own comment </li></ul></ul></ul><ul><ul><ul><ul><li>“ Off topic” </li></ul></ul></ul></ul><ul><ul><ul><ul><li>“ Request for help” </li></ul></ul></ul></ul> of 27
  14. 15. Represent thread structure of 27 sioc:Thread sioc:Post
  15. 16. sioc: links_to Express relationships of 27
  16. 17. Reuse SIOC & FOAF for structure <ul><li>Article </li></ul><ul><ul><li>sioct:WikiArticle </li></ul></ul><ul><li>Link the article to the Talk page </li></ul><ul><ul><li>sioc:has_discussion </li></ul></ul><ul><li>Discussion threads </li></ul><ul><ul><li>sioc:Thread </li></ul></ul><ul><li>Individual comments </li></ul><ul><ul><li>sioc:Post </li></ul></ul><ul><li>Commenter </li></ul><ul><ul><li>foaf:Person / sioc:UserAccount </li></ul></ul> of 27
  17. 18. Our SIOC WikiTalk ontology <ul><li>WikiDiscussionItem </li></ul><ul><ul><li>ReferenceItem </li></ul></ul><ul><ul><ul><li>ReferenceToEdit </li></ul></ul></ul><ul><ul><ul><li>ReferenceToGuidelinesOrPolicies </li></ul></ul></ul><ul><ul><ul><li>ReferenceToInternalResources </li></ul></ul></ul><ul><ul><ul><li>ReferenceToRevertsOrControversialOrRemovedMaterial </li></ul></ul></ul><ul><ul><ul><li>ReferenceToVandalism </li></ul></ul></ul><ul><ul><li>RequestItem </li></ul></ul><ul><ul><ul><li>RequestEditingCoordination </li></ul></ul></ul><ul><ul><ul><li>RequestHelpElsewhere </li></ul></ul></ul><ul><ul><ul><li>RequestInfo </li></ul></ul></ul><ul><ul><ul><li>RequestPeer-review </li></ul></ul></ul><ul><ul><ul><li> </li></ul></ul></ul><ul><ul><ul><li> </li></ul></ul></ul> of 27
  18. 19. 3b. Using our semantic model <ul><li>Hand markup Wikipedia Talk pages with RDFa </li></ul><ul><li>Query to find comments meeting specified criteria </li></ul><ul><ul><li>JavaScript and SPARQL </li></ul></ul><ul><li>Formative evaluations </li></ul><ul><ul><li>Browsing talk pages, with & without highlighting, to identify particular types of comments </li></ul></ul> of 27
  19. 20. <ul><li><p about=&quot;#Thread2Post1&quot; typeof=&quot;siocwt:RequestEditingCoordination&quot; </li></ul><ul><li>rel=&quot;sioc:has_container&quot; href=&quot;#Rule_Interchange_Format&quot;></p> </li></ul><ul><li><div about=&quot;#Thread2Post1&quot; rel=&quot;sioc:has_creator&quot; </li></ul><ul><li>href=&quot;;> </li></ul><ul><li><div about=&quot;#Thread2Post1&quot; rel=&quot;sioc:last_activity_date&quot; </li></ul><ul><li>content=&quot;20091116T0432-0000&quot; datatype=&quot;xsd:dateTime&quot;> </li></ul><ul><li><p>I'd support having <a href=&quot;;> Rule Interchange Format</a> merged into this article … <a href= title=&quot;User:Nloth&quot;> Nloth</a> (<a href=&quot;; title=&quot;User talk:Nloth&quot;>talk</a>) 04:32, 16 November 2009 (UTC) </p></div> </div> </li></ul>
  20. 21. Using the markup: JavaScript bookmarklets <ul><li>Highlight posts based on the ontology class – e.g. ReferenceToEdit </li></ul> of 27
  21. 22. Retrieve RequestInfo posts in WikiProject Computing <ul><li>We retrieve the “RequestInfo” posts with SPARQL: </li></ul><ul><li>SELECT ?commment ?page </li></ul><ul><li>WHERE </li></ul><ul><li>{ </li></ul><ul><li>?page sioc:links_to < wiki/ Template:WikiProject_Computing > . </li></ul><ul><li>?comment sioc:has_container ?page ; </li></ul><ul><li>a sioc:Post ; a siocwt:RequestInfo . </li></ul><ul><li>} </li></ul> of 27
  22. 23. Summary <ul><li>We can increase the effectiveness of Wikipedia Talk pages by understanding how they are used </li></ul><ul><li>We add semantic structure to Wikipedia Talk pages which can be used to extract socially useful info </li></ul><ul><li>Social Semantic Web expertise can benefit Wikipedia </li></ul> of 27
  23. 24. Thank You! <ul><li>Questions & Comments? </li></ul><ul><li>Contact: </li></ul><ul><li>[email_address] </li></ul><ul><li>Thanks to SAC-STAP for travel support and to Science Foundation Ireland for Ph.D. funding Grant No. SFI/09/CE/I1380 (Líon2)! </li></ul> of 27
  24. 25. Our Wikipedia-Related Research <ul><li>“ Understanding and Improving Wikipedia Article Discussion Spaces.” In SAC 2011 (Web Track), TaiChung, Taiwan, March 21-25, 2011. </li></ul><ul><li>“ Enhancing MediaWiki Talk pages with Semantics for Better Coordination - A Proposal.” In The Fifth Workshop on Semantic Wikis: Linking Data and People Workshop at 7th Extended Semantic Web Conference (ESWC), Crete, Greece, May 31, 2010. </li></ul><ul><li>“ A Content Analysis: How Wikipedia Talk Pages Are Used.” In WebSci2010, Web Science Conference. Raleigh, NC, April 26 & 27 2010. </li></ul> of 27
  25. 26. References <ul><li>Antin, J., & Cheshire, C. (2010). Readers are not free-riders: Reading as a form of participation on Wikipedia. CSCW 2010. doi: 10.1145/1718918.1718942 </li></ul><ul><li>Stvilia, Twidale, Smith & Gasser, &quot;Information Quality Work Organization in Wikipedia,&quot; JASIST 2008. doi: 10.1002/asi.2081 </li></ul><ul><li>Viégas, Wattenberg, Kriss & Ham, &quot;Talk Before You Type: Coordination in Wikipedia,&quot; HICSS 2007. doi: 10.1109/HICSS.2007.511 </li></ul> of 27
  26. 27. Further image credits <ul><li>Felipe Ortega’s dissertation research </li></ul><ul><li>Wikipedia logo </li></ul><ul><li>Talk pages screenshots from </li></ul><ul><li> : {articlename} </li></ul> of 27