Advertisement

Parc floss-wikipedia

Assistant Professor, ETSIT. Co-founder, Data Science Lab. Academic Director Master in Data Science at URJC
Jun. 15, 2011
Advertisement

More Related Content

Similar to Parc floss-wikipedia(20)

Recently uploaded(20)

Advertisement

Parc floss-wikipedia

  1. From libre software to Wikipedia: A tour of open collaboration Felipe Ortega Libresoft, Universidad Rey Juan Carlos e-mail: jfelipe@libresoft.es Twitter | Identi.ca: @jfelipe Xerox PARC June 14, 2011 By Diego GrezCC-BY-SA 3.0, Wikimedia Commons
  2. © 2011 Felipe Ortega. Some rights reserved. This document is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License (Logos on first slide are (TM) of their respective organizations)
  3. Open collaboration
  4. “Think of how Wikipedia works, how Amazon harnesses user annotation on its site, the way photo-sharing sites like Flickr are bleeding out into other applications... We're entering an era in which software learns from its users and all of the users are connected”. Tim O'Reilly. TIME Magazine, 24 October 2005. By Felipe Ortega, CC-BY-SA 3.0
  5. In the beginning... ● ...all started with “real programmers” and FLOSS. ● FSF, GNU, free licenses. ● Open source goes into industry. ● Libre software becomes ubiquitous. ● However ● Crowdsourced ! = Open source ● Much betters if results encourage reusing and distribution of derivative works.
  6. The “paradox” of open collaboration “Wikipedia is the best thing ever. Anyone in the world can write anything they want about any subject, so you know you are getting the best possible information.”. Michael Scott (played by Steve Carell) The Office, "The Negotiation" [3.18], 5 April 2007
  7. 3 lessons from libre software ● Onion model. ● Generational relay. ● Lasting participation. By El_T, Public Domain, from Wikimedia Commons
  8. Onion model The Social Structure of Free and Open Source Software Development Crowston & Howison, 2005
  9. Generational relay Robles, González-Barahona. Contributor Turnover in Libre Software Projects. OSS 2006.
  10. Lasting participation ● Robles, González-Barahona and Michlmayr. Evolution of Volunteer Participation in Libre Software Projects: Evidence from Debian. OSS 2005. Half-life ratio = 7.5 years! +50% maintainers in Debian 2.0 still present in Debian 3.1
  11. Thesis. Wikipedia: A quantitative analysis. ● Apply lessons from libre software to under- stand open collaborative process in Wikipedia. ● Content production. ● Effort distribution. ● Implications for quality. ● Participation and sustainability.
  12. Tool: WikiXRay Automated analysis of Wikipedia dumps. http://git.libresoft.es/WikiXRay Download Local MySQL Wikimedia Download Compressed dumps Server Center DB dumps WIKIXRAY Results evaluation Analysis (scripts + GNU R) Preparation for data mining
  13. New articles created in Wikipedia Entered steady-state in 2006, before graph of monthly edits became stable (2007)
  14. Interaction: talk pages 100% 90% 80% 70% 60% 50% no-talk 40% talk 30% 20% 10% 0% EN DE FR PL JA NL IT PT ES SV 0.0086% (old talk pages deleted)
  15. Contributions per editor ● Upper truncated Pareto distribution. ● Limit in max. number of revisions by human editors. ● Better to have more editors rather than increasing contributions per editor.
  16. Effort distribution: Gini coefficient
  17. Monthly effort distribution Wikipedia Constant over the whole history! Ortega, F., González-Barahona, J., Robles, G. On the inequality of contributions to Wikipedia. HICSS 2008.
  18. Profile editors in Featured Articles ● Most Featured Articles are at least 1,000 days old. ● 10 times more editors in FAs than in non-FAs, almost 200 times in EN (!!). ● FAs reviewed by significantly older authors (+3 years actively contributing to Wikipedia). FAs non-FAs
  19. The Digital Potlatch ● Book with J. Rodríguez (in Spanish). ● Ed. Cátedra, expected September 2011. ● Interdisciplinary. ● Anthropology + Engineering. ● Meritocracy in Wikipedia. ● Effort recognition. ● Motivations. ● Implications for quality. Public Domain, from Wikimedia Commons
  20. Future lines of work ● Study causes of change in evolution patterns and reverts. ● “The singularity is not near” By Bios, CC-BY-SA 3.0, from Wikimedia Commons ASC @PARC, WikiSym 2009. ● Edit diffs to study contribution patterns. ● Different types of content. ● Cross-relation with traffic patterns.
Advertisement