Understanding and Improving Wikipedia Article Discussion Spaces Jodi Schneider , Alexandre Passant, John Breslin ACM SAC 2...

Understanding and improving Wikipedia article discussion spaces SAC2011

How can we make Wikipedia Talk pages easier for readers, editors, and administrators to use? What kind of structure can be added?

Symposium on Applied Computing (SAC 2011) paper presentation slides from Taichung, Taiwan

Abstract:
Wikipedia’s article discussion spaces (“Talk pages”) form a large and growing proportion of the encyclopedia, used for collaboration and article improvement. So far there is no in-depth account of how article Talk pages are used, what is wrong with them, and how they can be improved. This paper reports on three contributions promoting the understanding of and improvement of these spaces:
(1) Wikipedia editor interviews provide an increased understanding of readers’ and editors’ needs,
(2) a large-scale comparative content analysis adds to knowledge of what kinds of discussions and coordination occur on Talk pages,
(3) a prototype bookmarklet-based system, which we test in a formative user evaluation, integrates lightweight semantics.

Full paper at http://jodischneider.com/pubs/sac2011.pdf

Notes
  • “only 10% of participants knew that Wikipedia has a policy against posting original research.” – Antin & Cheshire, CSCW 2010
  • Talk pages are LONG! Six Talk pages can yield over 100 printed pages [3], and individual Talk pages may yield 50 printed pages.
  • Trust & credibility layer: Golbeck, Computing with Social Trust, Springer 2008; Hartig, Querying Trust in RDF Data with tSPARQL, ESWC 2009; W3C Provenance Incubator Group Final Report
  • 2 Wikipedia editors, 2 Wikipedia administrators
  • 20 pages per category
  • Partial example of RDFa markup
  • We can also retrieve posts by novices, or posts that have no replies. Or both!
      SELECT ?comment ?reply ?user ?name
      WHERE {
        ?comment a sioc:Post ;
                 sioc:has_creator ?user .
        OPTIONAL { ?user sioc:name ?name . }
        OPTIONAL { ?comment sioc:has_reply ?reply . }
        FILTER (!BOUND(?name))
        FILTER (!BOUND(?reply))
      }
  • Understanding and improving Wikipedia article discussion spaces SAC2011

    1. 1. Understanding and Improving Wikipedia Article Discussion Spaces. Jodi Schneider, Alexandre Passant, John Breslin. ACM SAC, 2011-03-24, Taichung, Taiwan
    2. 2. Wikipedia editors are leaving faster than they can be replaced (Felipe Ortega, via http://www.businessinsider.com/chart-of-the-day-wikipedia-editors-2009-11)
    3. 3. How do we turn readers into editors?
         • Ensure people know they can edit
         • Make editing easier
         • Help learn how things work by reading discussions!
             • “Reading Talk pages – the behind-the-scenes discussions about Wikipedia articles – signals a transition towards more active forms of participation.” – Antin & Cheshire, CSCW 2010
         • Make more edits “stick”
             • Understand what kinds of contributions are accepted
                 • Provide support for creating good arguments
                 • Avoid need for reverts
    4. 4. Wikipedia Discussion Space: “Talk page”
    5. 5. Talk pages need semantics
         • Lots of conversations
             • Viégas: “the fastest growing areas of Wikipedia are devoted to coordination and organization”
         • When are people agreeing/disagreeing?
             • Not well understood!
         • Very little study of Talk pages
             • Largest study: 60 pages, 2 types. Discovered: Featured Articles have 10x discussion!
         • Immense variation between pages
         (Data from Stvilia)
    6. 6. Social Semantic Web
    7. 7. My Research Questions
         • What do Wikipedians do on Talk pages?
         • What kind of arguments happen on Talk pages?
         • Can we add structure to make pages “fit” how editors and readers use them?
    8. 8. Three ways of understanding Talk pages
         • Interviews with editors and administrators
             • What do Wikipedians do on Talk pages?
         • Hand content analysis of 100 Talk pages
             • What kind of arguments happen on Talk pages?
         • Developing & using a semantic model
             • Can we add structure to make pages “fit” how editors and readers use them?
    9. 9. 1. Interviews
         • Administrators
             • Frequently monitor conversations
             • Know + meet co-editors
             • Make community-related edits such as adding infoboxes
             • More likely to move/rename articles and Talk pages
         • Editors
             • Mostly read Talk pages
             • “Get the scoop”: what’s controversial? More details?
             • More likely to read older conversations
             • May learn policy and procedures
    10. 10. 2. Content Analysis
         • 100 Talk pages
         • 5 categories of pages
             • Most editors (of the article)
             • Most visits (to the article)
             • Controversial
             • Featured Articles
             • Random
         • 15 classifications
    11. 11. Classification: Example
         • Reference to...
             • Sources outside the wiki: “... Not sure where to put it but I’ll leave it here as somebody might find it useful”
             • Reverts, removed material, or controversial edits: “I noticed some people edit the page into what it will be in 10 minutes but someone is reverting it...just let it be”
             • Edits the discussant made: “Added the About.com review since the review was part of the reception section.”
         • Requests for...
             • Help with another article, portal, etc.: “This is just to invite attention to the page Facebook statistics just created…”
    12. 12. The 15 Classifications
         • References to…
             • Vandalism
             • Guidelines and policies
             • Sources outside Wikipedia
             • Reverts, removed material, or controversial edits
             • Edits the discussant made
             • Internal Wikipedia resources
         • Requests for…
             • Editing coordination
             • Information
             • Help with another article
             • Peer review
         • Etc.
             • Off-topic remarks
             • Polls
             • Information boxes
             • Images
             • Other
    13. 14. 3a. Developing a content-based semantic model
         • Represent article structure
             • Reuse existing ontologies (FOAF, SIOC)
         • Represent content (based on the content analysis)
             • Winnow the 15 classifications: relevance & plausibility
                 • “Relevant” for querying and retrieving information
                 • “Plausible” a person would mark their own comment
                     • “Off topic”
                     • “Request for help”
    14. 15. Represent thread structure (diagram labels: sioc:Thread, sioc:Post)
    15. 16. Express relationships (diagram label: sioc:links_to http://en.wikipedia.org/wiki/Template:WikiProject_Computing)
    16. 17. Reuse SIOC & FOAF for structure
         • Article
             • sioct:WikiArticle
         • Link the article to the Talk page
             • sioc:has_discussion
         • Discussion threads
             • sioc:Thread
         • Individual comments
             • sioc:Post
         • Commenter
             • foaf:Person / sioc:UserAccount
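The structural layer on this slide can be sketched as a handful of subject/predicate/object triples. This is only an illustration in plain Python tuples, not the authors' implementation: the class and property names are taken from the slides, while the `describe_post` helper and the `http://example.org/...` article and Talk page identifiers are hypothetical. How a thread is attached to its Talk page is not stated on the slide, so that link is left out.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]


def describe_post(article: str, talk_page: str, thread: str,
                  post: str, user: str) -> List[Triple]:
    """Return structural triples for one comment (hypothetical helper)."""
    return [
        (article, "rdf:type", "sioct:WikiArticle"),
        # Link the article to its Talk page.
        (article, "sioc:has_discussion", talk_page),
        (thread, "rdf:type", "sioc:Thread"),
        (post, "rdf:type", "sioc:Post"),
        # The comment sits inside a discussion thread.
        (post, "sioc:has_container", thread),
        (post, "sioc:has_creator", user),
        (user, "rdf:type", "sioc:UserAccount"),
    ]


triples = describe_post(
    article="http://example.org/wiki/Example_Article",        # assumed URL
    talk_page="http://example.org/wiki/Talk:Example_Article",  # assumed URL
    thread="#Rule_Interchange_Format",
    post="#Thread2Post1",
    user="http://en.wikipedia.org/wiki/User:Nloth",
)

for subject, predicate, obj in triples:
    print(subject, predicate, obj)
```

A real deployment would more likely hand these statements to an RDF library; tuples are used here only to keep the sketch dependency-free.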
    17. 18. Our SIOC WikiTalk ontology
         • WikiDiscussionItem
             • ReferenceItem
                 • ReferenceToEdit
                 • ReferenceToGuidelinesOrPolicies
                 • ReferenceToInternalResources
                 • ReferenceToRevertsOrControversialOrRemovedMaterial
                 • ReferenceToVandalism
             • RequestItem
                 • RequestEditingCoordination
                 • RequestHelpElsewhere
                 • RequestInfo
                 • RequestPeer-review
         • http://rdfs.org/sioc/wikitalk
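Assuming the nesting on this slide expresses subclass relationships (the slide does not say so explicitly), the class tree can be written as a child-to-parent table and queried for ancestry. This is useful, for example, when a tool should treat every kind of request alike. The `PARENT` table and `is_a` helper below are hypothetical names for this sketch; the class names are copied from the slide.

```python
from typing import Dict, Optional

# Child class -> parent class, transcribed from the slide's nesting.
PARENT: Dict[str, Optional[str]] = {
    "WikiDiscussionItem": None,
    "ReferenceItem": "WikiDiscussionItem",
    "ReferenceToEdit": "ReferenceItem",
    "ReferenceToGuidelinesOrPolicies": "ReferenceItem",
    "ReferenceToInternalResources": "ReferenceItem",
    "ReferenceToRevertsOrControversialOrRemovedMaterial": "ReferenceItem",
    "ReferenceToVandalism": "ReferenceItem",
    "RequestItem": "WikiDiscussionItem",
    "RequestEditingCoordination": "RequestItem",
    "RequestHelpElsewhere": "RequestItem",
    "RequestInfo": "RequestItem",
    "RequestPeer-review": "RequestItem",
}


def is_a(cls: str, ancestor: str) -> bool:
    """Return True if cls equals ancestor or descends from it."""
    if cls not in PARENT:
        return False  # unknown class names never match
    current: Optional[str] = cls
    while current is not None:
        if current == ancestor:
            return True
        current = PARENT[current]
    return False


print(is_a("RequestInfo", "RequestItem"))
print(is_a("ReferenceToEdit", "RequestItem"))
```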
    18. 19. 3b. Using our semantic model
         • Mark up Wikipedia Talk pages by hand with RDFa
         • Query to find comments meeting specified criteria
             • JavaScript and SPARQL
         • Formative evaluations
             • Browsing talk pages, with & without highlighting, to identify particular types of comments
    19. 20. <p about="#Thread2Post1" typeof="siocwt:RequestEditingCoordination"
               rel="sioc:has_container" href="#Rule_Interchange_Format"></p>
            <div about="#Thread2Post1" rel="sioc:has_creator"
                 href="http://en.wikipedia.org/wiki/User:Nloth">
            <div about="#Thread2Post1" property="sioc:last_activity_date"
                 content="2009-11-16T04:32:00Z" datatype="xsd:dateTime">
            <p>I'd support having <a href="http://en.wikipedia.org/wiki/Rule_Interchange_Format">Rule Interchange Format</a> merged into this article … <a href="http://en.wikipedia.org/wiki/User:Nloth" title="User:Nloth">Nloth</a> (<a href="http://en.wikipedia.org/wiki/User_talk:Nloth" title="User talk:Nloth">talk</a>) 04:32, 16 November 2009 (UTC)</p>
            </div>
            </div>
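To illustrate how statements can be read back out of markup like this, here is a rough sketch using Python's standard `html.parser`. It is not a conforming RDFa processor: it only looks at `about`, `typeof`, `rel`, and `href` attributes that appear on the same element, and it ignores prefixes, nesting, and literal values. The `RdfaAttributeCollector` class is a hypothetical name, and `MARKUP` is a shortened copy of the first two elements of the slide's example.

```python
from html.parser import HTMLParser
from typing import List, Tuple

Triple = Tuple[str, str, str]


class RdfaAttributeCollector(HTMLParser):
    """Collect simple triples from about/typeof/rel/href attributes."""

    def __init__(self) -> None:
        super().__init__()
        self.triples: List[Triple] = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        subject = attributes.get("about")
        if subject is None:
            return  # this sketch only handles elements with an explicit subject
        if "typeof" in attributes:
            self.triples.append((subject, "rdf:type", attributes["typeof"]))
        if "rel" in attributes and "href" in attributes:
            self.triples.append((subject, attributes["rel"], attributes["href"]))


MARKUP = """
<p about="#Thread2Post1" typeof="siocwt:RequestEditingCoordination"
   rel="sioc:has_container" href="#Rule_Interchange_Format"></p>
<div about="#Thread2Post1" rel="sioc:has_creator"
     href="http://en.wikipedia.org/wiki/User:Nloth"></div>
"""

collector = RdfaAttributeCollector()
collector.feed(MARKUP)

for triple in collector.triples:
    print(triple)
```

The slides describe doing this step in the browser with JavaScript; Python is used here only to keep the sketch easy to run outside a page.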
    20. 21. Using the markup: JavaScript bookmarklets
         • Highlight posts based on the ontology class, e.g. ReferenceToEdit
    21. 22. Retrieve RequestInfo posts in WikiProject Computing
         • We retrieve the “RequestInfo” posts with SPARQL:
             SELECT ?comment ?page
             WHERE
             {
               ?page sioc:links_to <http://en.wikipedia.org/wiki/Template:WikiProject_Computing> .
               ?comment sioc:has_container ?page ;
                        a sioc:Post ; a siocwt:RequestInfo .
             }
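The graph pattern in this query amounts to a small join: find pages that link to the WikiProject Computing template, then keep the posts contained in those pages that are typed as both sioc:Post and siocwt:RequestInfo. The sketch below evaluates that pattern over an in-memory set of triples in plain Python, without a SPARQL engine. The `request_info_posts` function and every identifier in `DATA` (`talk:A`, `post:1`, the `WikiProject_Other` template, and so on) are made up for illustration.

```python
from typing import List, Set, Tuple

Triple = Tuple[str, str, str]

COMPUTING = "http://en.wikipedia.org/wiki/Template:WikiProject_Computing"


def request_info_posts(triples: Set[Triple]) -> List[Tuple[str, str]]:
    """Return sorted (comment, page) pairs matching the slide's pattern."""
    # ?page sioc:links_to <...WikiProject_Computing>
    pages = {s for s, p, o in triples
             if p == "sioc:links_to" and o == COMPUTING}
    results = []
    for s, p, o in triples:
        # ?comment sioc:has_container ?page
        if p != "sioc:has_container" or o not in pages:
            continue
        # ?comment a sioc:Post ; a siocwt:RequestInfo
        if ((s, "rdf:type", "sioc:Post") in triples
                and (s, "rdf:type", "siocwt:RequestInfo") in triples):
            results.append((s, o))
    return sorted(results)


# Hypothetical data: two Talk pages, three posts.
DATA: Set[Triple] = {
    ("talk:A", "sioc:links_to", COMPUTING),
    ("post:1", "sioc:has_container", "talk:A"),
    ("post:1", "rdf:type", "sioc:Post"),
    ("post:1", "rdf:type", "siocwt:RequestInfo"),
    ("post:2", "sioc:has_container", "talk:A"),
    ("post:2", "rdf:type", "sioc:Post"),
    ("post:2", "rdf:type", "siocwt:ReferenceToEdit"),
    ("talk:B", "sioc:links_to", "http://example.org/Template:WikiProject_Other"),
    ("post:3", "sioc:has_container", "talk:B"),
    ("post:3", "rdf:type", "sioc:Post"),
    ("post:3", "rdf:type", "siocwt:RequestInfo"),
}

print(request_info_posts(DATA))
```

Only `post:1` satisfies all three conditions: `post:2` has the wrong class and `post:3` sits on a page outside the project.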
    22. 23. Summary
         • We can increase the effectiveness of Wikipedia Talk pages by understanding how they are used
         • We add semantic structure to Wikipedia Talk pages which can be used to extract socially useful info
         • Social Semantic Web expertise can benefit Wikipedia
    23. 24. Thank You!
         • Questions & Comments?
         • Contact: [email_address]
         • Thanks to SAC-STAP for travel support and to Science Foundation Ireland for Ph.D. funding, Grant No. SFI/09/CE/I1380 (Líon2)!
    24. 25. Our Wikipedia-Related Research
         • “Understanding and Improving Wikipedia Article Discussion Spaces.” In SAC 2011 (Web Track), Taichung, Taiwan, March 21-25, 2011.
         • “Enhancing MediaWiki Talk pages with Semantics for Better Coordination - A Proposal.” In The Fifth Workshop on Semantic Wikis: Linking Data and People Workshop at 7th Extended Semantic Web Conference (ESWC), Crete, Greece, May 31, 2010.
         • “A Content Analysis: How Wikipedia Talk Pages Are Used.” In WebSci2010, Web Science Conference. Raleigh, NC, April 26 & 27, 2010.
    25. 26. References
         • Antin, J., & Cheshire, C. (2010). Readers are not free-riders: Reading as a form of participation on Wikipedia. CSCW 2010. doi: 10.1145/1718918.1718942
         • Stvilia, Twidale, Smith & Gasser, "Information Quality Work Organization in Wikipedia," JASIST 2008. doi: 10.1002/asi.2081
         • Viégas, Wattenberg, Kriss & Ham, "Talk Before You Type: Coordination in Wikipedia," HICSS 2007. doi: 10.1109/HICSS.2007.511
    26. 27. Further image credits
         • Felipe Ortega’s dissertation research
         • Wikipedia logo
         • Talk pages screenshots from http://en.wikipedia.org/Talk:{articlename}