SlideShare a Scribd company logo
From libre software to Wikipedia:
 A tour of open collaboration




Felipe Ortega
Libresoft, Universidad Rey Juan Carlos
e-mail: jfelipe@libresoft.es
Twitter | Identi.ca: @jfelipe

Xerox PARC
June 14, 2011
                                         By Diego GrezCC-BY-SA 3.0, Wikimedia Commons
© 2011 Felipe Ortega.
                                          Some rights reserved.
                              This document is licensed under a
Creative Commons Attribution-ShareAlike 3.0 Unported License
 (Logos on first slide are (TM) of their respective organizations)
Open collaboration
“Think of how Wikipedia works, how Amazon harnesses
user annotation on its site, the way photo-sharing sites
like Flickr are bleeding out into other applications...
We're entering an era in which software learns from
its users and all of the users are connected”.

Tim O'Reilly.
TIME Magazine, 24 October 2005.




                                                By Felipe Ortega, CC-BY-SA 3.0
In the beginning...


●   ...all started with “real programmers” and FLOSS.
    ●   FSF, GNU, free licenses.
    ●   Open source goes into industry.
    ●   Libre software becomes ubiquitous.
●   However
    ●   Crowdsourced ! = Open source
    ●   Much betters if results encourage reusing and
        distribution of derivative works.
The “paradox” of open collaboration



“Wikipedia is the best thing ever. Anyone in the world can
write anything they want about any subject, so you know
you are getting the best possible information.”.

Michael Scott (played by Steve Carell)
The Office, "The Negotiation" [3.18], 5 April 2007
3 lessons from libre software



●   Onion model.
●   Generational relay.
●   Lasting participation.         By El_T, Public Domain,
                                from Wikimedia Commons
Onion model

The Social Structure of Free and Open Source Software Development
Crowston & Howison, 2005
Generational relay




      Robles, González-Barahona.
      Contributor Turnover in Libre Software Projects.
      OSS 2006.
Lasting participation


●   Robles, González-Barahona and Michlmayr.
    Evolution of Volunteer Participation in Libre Software
    Projects: Evidence from Debian. OSS 2005.



    Half-life ratio = 7.5 years!


+50% maintainers in Debian 2.0 still present in Debian 3.1
Thesis. Wikipedia: A quantitative
analysis.

●   Apply lessons from libre software to under-
    stand open collaborative process in Wikipedia.
    ●   Content production.
    ●   Effort distribution.
    ●   Implications for quality.
    ●   Participation and sustainability.
Tool: WikiXRay

Automated analysis of Wikipedia dumps.
http://git.libresoft.es/WikiXRay




                                      Download
                                                  Local MySQL
Wikimedia Download   Compressed        dumps
                                                     Server
      Center          DB dumps
                                      WIKIXRAY




Results evaluation   Analysis (scripts + GNU R)   Preparation for
                                                   data mining
New articles created in Wikipedia




                Entered steady-state in 2006,
                before graph of monthly edits
                    became stable (2007)
Interaction: talk pages

100%

90%

80%

70%

60%

50%                                                           no-talk
40%
                                                              talk

30%

20%

10%

 0%
       EN   DE   FR   PL   JA   NL   IT   PT   ES   SV

                           0.0086% (old talk pages deleted)
Contributions per editor

                    ●   Upper truncated Pareto
                        distribution.
                    ●   Limit in max. number of
                        revisions by human
                        editors.
                    ●   Better to have more
                        editors rather than
                        increasing contributions
                        per editor.
Effort distribution: Gini coefficient
Monthly effort distribution Wikipedia




                   Constant over the whole history!
              Ortega, F., González-Barahona, J., Robles, G.
              On the inequality of contributions to Wikipedia.
              HICSS 2008.
Profile editors in Featured Articles

●   Most Featured Articles are at least 1,000 days old.
●   10 times more editors in FAs than in non-FAs,
    almost 200 times in EN (!!).
●   FAs reviewed by significantly older authors
    (+3 years actively contributing to Wikipedia).


         FAs                                   non-FAs
The Digital Potlatch


●   Book with J. Rodríguez (in Spanish).
    ●   Ed. Cátedra, expected September 2011.
●   Interdisciplinary.
    ●   Anthropology + Engineering.
●   Meritocracy in Wikipedia.
●   Effort recognition.
●   Motivations.
●   Implications for quality.
                                        Public Domain, from Wikimedia Commons
Future lines of work


●   Study causes of change in
    evolution patterns and reverts.
    ●   “The singularity is not near”       By Bios, CC-BY-SA 3.0, from
                                                    Wikimedia Commons

        ASC @PARC, WikiSym 2009.
●   Edit diffs to study contribution patterns.
●   Different types of content.
●   Cross-relation with traffic patterns.

More Related Content

Similar to Parc floss-wikipedia

Contropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit historyContropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit history
David Laniado
 
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
Oscar Corcho
 
Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)
Takashi Iba
 
WIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologies
dgarijo
 
Editing Behavior over Time Power vs. Standard Wikidata Editors
Editing Behavior over Time  Power vs. Standard Wikidata EditorsEditing Behavior over Time  Power vs. Standard Wikidata Editors
Editing Behavior over Time Power vs. Standard Wikidata Editors
Cristina Sarasua
 
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
vbrant
 
Wmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdfWmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdf
Wikimedia Foundation
 
Practical Open Source Software for Libraries (part 1)
Practical Open Source Software for Libraries (part 1)Practical Open Source Software for Libraries (part 1)
Practical Open Source Software for Libraries (part 1)
Nicole C. Engard
 
Free For All: Getting Started in Open Source
Free For All: Getting Started in Open SourceFree For All: Getting Started in Open Source
Free For All: Getting Started in Open Source
Ali King
 
Collaborative Ontology Building Project
Collaborative Ontology Building Project  Collaborative Ontology Building Project
Collaborative Ontology Building Project
Jie Bao
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
Elena Simperl
 
BCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaarBCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaar
b p
 
Open Source: Freedom and Community
Open Source: Freedom and CommunityOpen Source: Freedom and Community
Open Source: Freedom and Community
Nicole C. Engard
 
Wikisource - Where we are, where we want to go
Wikisource  - Where we are, where we want to go Wikisource  - Where we are, where we want to go
Wikisource - Where we are, where we want to go
AubreyMcFato
 
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
helmoony
 
Wanted: Best Practices for Collaborative Translation
Wanted: Best Practices for Collaborative TranslationWanted: Best Practices for Collaborative Translation
Wanted: Best Practices for Collaborative Translation
Grupo Inmigra i+d
 
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
Cornelius Puschmann
 
OpenMinteD Project - building a TDM infrastructure
OpenMinteD Project - building a TDM infrastructureOpenMinteD Project - building a TDM infrastructure
OpenMinteD Project - building a TDM infrastructure
FutureTDM
 
Reciprocal Enrichment between Wikipedia and Machine Translators
Reciprocal Enrichment between Wikipedia and Machine TranslatorsReciprocal Enrichment between Wikipedia and Machine Translators
Reciprocal Enrichment between Wikipedia and Machine Translators
Mikel Iturbe
 
Towards a diversity-minded Wikipedia
Towards a diversity-minded WikipediaTowards a diversity-minded Wikipedia
Towards a diversity-minded Wikipedia
RENDER project
 

Similar to Parc floss-wikipedia (20)

Contropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit historyContropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit history
 
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
 
Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)
 
WIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologies
 
Editing Behavior over Time Power vs. Standard Wikidata Editors
Editing Behavior over Time  Power vs. Standard Wikidata EditorsEditing Behavior over Time  Power vs. Standard Wikidata Editors
Editing Behavior over Time Power vs. Standard Wikidata Editors
 
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
 
Wmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdfWmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdf
 
Practical Open Source Software for Libraries (part 1)
Practical Open Source Software for Libraries (part 1)Practical Open Source Software for Libraries (part 1)
Practical Open Source Software for Libraries (part 1)
 
Free For All: Getting Started in Open Source
Free For All: Getting Started in Open SourceFree For All: Getting Started in Open Source
Free For All: Getting Started in Open Source
 
Collaborative Ontology Building Project
Collaborative Ontology Building Project  Collaborative Ontology Building Project
Collaborative Ontology Building Project
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
BCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaarBCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaar
 
Open Source: Freedom and Community
Open Source: Freedom and CommunityOpen Source: Freedom and Community
Open Source: Freedom and Community
 
Wikisource - Where we are, where we want to go
Wikisource  - Where we are, where we want to go Wikisource  - Where we are, where we want to go
Wikisource - Where we are, where we want to go
 
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
 
Wanted: Best Practices for Collaborative Translation
Wanted: Best Practices for Collaborative TranslationWanted: Best Practices for Collaborative Translation
Wanted: Best Practices for Collaborative Translation
 
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
 
OpenMinteD Project - building a TDM infrastructure
OpenMinteD Project - building a TDM infrastructureOpenMinteD Project - building a TDM infrastructure
OpenMinteD Project - building a TDM infrastructure
 
Reciprocal Enrichment between Wikipedia and Machine Translators
Reciprocal Enrichment between Wikipedia and Machine TranslatorsReciprocal Enrichment between Wikipedia and Machine Translators
Reciprocal Enrichment between Wikipedia and Machine Translators
 
Towards a diversity-minded Wikipedia
Towards a diversity-minded WikipediaTowards a diversity-minded Wikipedia
Towards a diversity-minded Wikipedia
 

Recently uploaded

Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
Pixlogix Infotech
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Zilliz
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 

Recently uploaded (20)

Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 

Parc floss-wikipedia

  • 1. From libre software to Wikipedia: A tour of open collaboration Felipe Ortega Libresoft, Universidad Rey Juan Carlos e-mail: jfelipe@libresoft.es Twitter | Identi.ca: @jfelipe Xerox PARC June 14, 2011 By Diego GrezCC-BY-SA 3.0, Wikimedia Commons
  • 2. © 2011 Felipe Ortega. Some rights reserved. This document is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License (Logos on first slide are (TM) of their respective organizations)
  • 4. “Think of how Wikipedia works, how Amazon harnesses user annotation on its site, the way photo-sharing sites like Flickr are bleeding out into other applications... We're entering an era in which software learns from its users and all of the users are connected”. Tim O'Reilly. TIME Magazine, 24 October 2005. By Felipe Ortega, CC-BY-SA 3.0
  • 5. In the beginning... ● ...all started with “real programmers” and FLOSS. ● FSF, GNU, free licenses. ● Open source goes into industry. ● Libre software becomes ubiquitous. ● However ● Crowdsourced ! = Open source ● Much betters if results encourage reusing and distribution of derivative works.
  • 6. The “paradox” of open collaboration “Wikipedia is the best thing ever. Anyone in the world can write anything they want about any subject, so you know you are getting the best possible information.”. Michael Scott (played by Steve Carell) The Office, "The Negotiation" [3.18], 5 April 2007
  • 7. 3 lessons from libre software ● Onion model. ● Generational relay. ● Lasting participation. By El_T, Public Domain, from Wikimedia Commons
  • 8. Onion model The Social Structure of Free and Open Source Software Development Crowston & Howison, 2005
  • 9. Generational relay Robles, González-Barahona. Contributor Turnover in Libre Software Projects. OSS 2006.
  • 10. Lasting participation ● Robles, González-Barahona and Michlmayr. Evolution of Volunteer Participation in Libre Software Projects: Evidence from Debian. OSS 2005. Half-life ratio = 7.5 years! +50% maintainers in Debian 2.0 still present in Debian 3.1
  • 11. Thesis. Wikipedia: A quantitative analysis. ● Apply lessons from libre software to under- stand open collaborative process in Wikipedia. ● Content production. ● Effort distribution. ● Implications for quality. ● Participation and sustainability.
  • 12. Tool: WikiXRay Automated analysis of Wikipedia dumps. http://git.libresoft.es/WikiXRay Download Local MySQL Wikimedia Download Compressed dumps Server Center DB dumps WIKIXRAY Results evaluation Analysis (scripts + GNU R) Preparation for data mining
  • 13. New articles created in Wikipedia Entered steady-state in 2006, before graph of monthly edits became stable (2007)
  • 14. Interaction: talk pages 100% 90% 80% 70% 60% 50% no-talk 40% talk 30% 20% 10% 0% EN DE FR PL JA NL IT PT ES SV 0.0086% (old talk pages deleted)
  • 15. Contributions per editor ● Upper truncated Pareto distribution. ● Limit in max. number of revisions by human editors. ● Better to have more editors rather than increasing contributions per editor.
  • 17. Monthly effort distribution Wikipedia Constant over the whole history! Ortega, F., González-Barahona, J., Robles, G. On the inequality of contributions to Wikipedia. HICSS 2008.
  • 18. Profile editors in Featured Articles ● Most Featured Articles are at least 1,000 days old. ● 10 times more editors in FAs than in non-FAs, almost 200 times in EN (!!). ● FAs reviewed by significantly older authors (+3 years actively contributing to Wikipedia). FAs non-FAs
  • 19. The Digital Potlatch ● Book with J. Rodríguez (in Spanish). ● Ed. Cátedra, expected September 2011. ● Interdisciplinary. ● Anthropology + Engineering. ● Meritocracy in Wikipedia. ● Effort recognition. ● Motivations. ● Implications for quality. Public Domain, from Wikimedia Commons
  • 20. Future lines of work ● Study causes of change in evolution patterns and reverts. ● “The singularity is not near” By Bios, CC-BY-SA 3.0, from Wikimedia Commons ASC @PARC, WikiSym 2009. ● Edit diffs to study contribution patterns. ● Different types of content. ● Cross-relation with traffic patterns.