SlideShare a Scribd company logo
The structure of social collaboration on Wikipedia Sorin Adam Matei, Associate Professor of Communication, Purdue U smatei@purdue.edu David Braun, Research Scientist, Envision Lab, Purdue U dbraun@purdue.edu HoriaPetrache, Assistant Professor of Physics, IUPUI hpetrach@iupui.edu Presented at Wikimania, 2009 Buenos Aires, Argentina August 25-28 2009 http://wikimania2009.wikimedia.org/wiki/Proceedings:132
2005: A Wikipedian explains Wikipedia as Wisdom of Crowds The basic premise [of Wisdom of Crowds] that crowds of relatively ignorant individuals make better decisions than small groups of experts. I'm sure everyone here agrees with this as Wikipedia is run this way... Wikipedia displays emergent properties because each article is better than the contribution of each individual. Similarly, ants display emergence because an ant colony can accomplish things that each individual ant cannot even conceive.
Implied idea Fine grained, micro contributions, independent and decentralized and maybe equal lead to articles that are better than what each contributor can write
As expressed in this Wikipedia-l post I imagine Wikipedia as a massive, active swarm intelligence, supplemented by small roving groups of active editors who admire consistency, elegance, and reasoned discourse. (not unlike certain models of how the brain works :) The swarm does the bulk of the writing, especially finding and providing current facts, starting new articles, and adding neglected POVs. The roving groups are sensitive to dozens of policy pages, and implement them as they rove... they also take on large projects, one at a time, and try to implement certain changes across thousands of pages at once.
To which “Jimbo” (Wales) answers I should point out that I like Suroweicki'sthesis just fine, it's just that I'm not convinced that "swarm intelligence" is very helpful in understanding how Wikipedia works -- in fact, it might be an impediment, because it leads us away from thinking about how the community interacts in a process of reasoned discourse.
Jimbo concludes My research (conducted in December) showed that half the edits by logged in users belong to just 2.5% of logged in users.
Does the 80/20 applies? Power-law curves are all over the real world … Adar and Huberman (2000) found 50% of the content on Gnutella is provided by 1% of the users,  O'Mahonyand Ferraro (2003) found the curve in the Debian dev key ring, Moon and Sproul (2002) on the Linux Kernel list, Briggs et al. (1997) in group support systems, Krogh, Spaeth and Lakhani (2003) in Freenet. (By another participant to the 2005 discussion)
What would the 80/20 rule mean? Extreme inequality? Elitism? Structured collaboration? Interactive exchanges between groups of individuals?
Previous research Wikipedia contributions, in all languages, have become more skewed in favor of a small group of editors and old time users (Ortega et al., 2005)
Top contributors dominate edits and no words contributed
Our approach Increase in inequality => higher level of structuration Increasing division of labor From diffuse collaboration to structured collaboration Emergence of bureaucracy Emergence of adhocracy Groups of individuals that become article stewards
Social entropy and structuration Social Entropy As system become organized (biased) their entropy decreases Entropy is a measure of meaningful organization
Entropy and organization Meaningful messages use words and letters in uneven manner Symbol distribution in meaningful messages is uneven Information (and social) entropy are measures of organization and meaning As collaboration becomes more biased, the group becomes more organized
Shannon’s formula where the sum is over all users i, and is the fractional contribution of user i. We allow p and S  and to be functions of time (t).
Shannon’s forumal explained Social entropy reflects how uneven and lacking in diversity a group/system process is 10 users and 100 contributions,  each contributing 10 edits to a Wikipedia article =>  entropy reaches its highest level 1 contributor contributes all, entropy at the lowest value
Analytic strategy Downloaded latest available dump Trouble with unzipping (dump corrupted) Extracted  792,654 registered users 234,798 articles Calculated number of times individuals contributed to each article and how many words have they contributed (not completely finalized)
[object Object]
Orange: fit curve (takes into account the spread of values
Dotted: Maximum entropy, wisdom of crowds ceilingln(x) Intervention entropy Basic plot: Entropy increases for the first @500 interventions, then levels off…. Intervention number (events)
[object Object]

More Related Content

Similar to Wikipedia structure of collaboration

Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)
Takashi Iba
 
Offen. Divers. Inklusiv. Thinking the Future of Organizations
Offen. Divers. Inklusiv. Thinking the Future of OrganizationsOffen. Divers. Inklusiv. Thinking the Future of Organizations
Offen. Divers. Inklusiv. Thinking the Future of Organizations
Dobusch Leonhard
 
Social Technologies for Informaticians and Researchers
Social Technologies for Informaticians and ResearchersSocial Technologies for Informaticians and Researchers
Social Technologies for Informaticians and Researchers
University of Michigan Taubman Health Sciences Library
 
Social Systems Theory 2012 #1
Social Systems Theory 2012 #1Social Systems Theory 2012 #1
Social Systems Theory 2012 #1
Takashi Iba
 
Science 2.0
Science 2.0Science 2.0
Science 2.0
fridolin.wild
 
Notational systems and the abstract built environment
Notational systems and the abstract built environmentNotational systems and the abstract built environment
Notational systems and the abstract built environment
Jeff Long
 
Online information 2010_track_two_final_corrected
Online information 2010_track_two_final_correctedOnline information 2010_track_two_final_corrected
Online information 2010_track_two_final_corrected
Basset Hervé
 
Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...
Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...
Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...
Vincenzo De Florio
 
Science 2.0 and language technology
Science 2.0 and language technologyScience 2.0 and language technology
Science 2.0 and language technology
fridolin.wild
 
Openingandclosedsystems
OpeningandclosedsystemsOpeningandclosedsystems
Openingandclosedsystems
Francesca Lyn
 
Presentation
PresentationPresentation
Presentation
Mengqing Liu
 
E soc13
E soc13E soc13
Virginia Tech College Essay We Write Cu. Online assignment writing service.
Virginia Tech College Essay We Write Cu. Online assignment writing service.Virginia Tech College Essay We Write Cu. Online assignment writing service.
Virginia Tech College Essay We Write Cu. Online assignment writing service.
Elizabeth Jenkins
 
Social software for teaching and learning, mid-2008
Social software for teaching and learning, mid-2008Social software for teaching and learning, mid-2008
Social software for teaching and learning, mid-2008
Bryan Alexander
 
Objectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsObjectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative Connotations
Beth Johnson
 
Being Engelbartian
Being EngelbartianBeing Engelbartian
Being Engelbartian
John Bradley
 
BCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaarBCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaar
b p
 
Notational systems and cognitive evolution
Notational systems and cognitive evolutionNotational systems and cognitive evolution
Notational systems and cognitive evolution
Jeff Long
 
Digital Texts scholarly communication in a digital networked age
Digital Texts scholarly communication in a digital networked ageDigital Texts scholarly communication in a digital networked age
Digital Texts scholarly communication in a digital networked age
Tony Hirst
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
Silvia Puglisi
 

Similar to Wikipedia structure of collaboration (20)

Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)
 
Offen. Divers. Inklusiv. Thinking the Future of Organizations
Offen. Divers. Inklusiv. Thinking the Future of OrganizationsOffen. Divers. Inklusiv. Thinking the Future of Organizations
Offen. Divers. Inklusiv. Thinking the Future of Organizations
 
Social Technologies for Informaticians and Researchers
Social Technologies for Informaticians and ResearchersSocial Technologies for Informaticians and Researchers
Social Technologies for Informaticians and Researchers
 
Social Systems Theory 2012 #1
Social Systems Theory 2012 #1Social Systems Theory 2012 #1
Social Systems Theory 2012 #1
 
Science 2.0
Science 2.0Science 2.0
Science 2.0
 
Notational systems and the abstract built environment
Notational systems and the abstract built environmentNotational systems and the abstract built environment
Notational systems and the abstract built environment
 
Online information 2010_track_two_final_corrected
Online information 2010_track_two_final_correctedOnline information 2010_track_two_final_corrected
Online information 2010_track_two_final_corrected
 
Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...
Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...
Models and Concepts for Socio-technical Complex Systems: Towards Fractal Soci...
 
Science 2.0 and language technology
Science 2.0 and language technologyScience 2.0 and language technology
Science 2.0 and language technology
 
Openingandclosedsystems
OpeningandclosedsystemsOpeningandclosedsystems
Openingandclosedsystems
 
Presentation
PresentationPresentation
Presentation
 
E soc13
E soc13E soc13
E soc13
 
Virginia Tech College Essay We Write Cu. Online assignment writing service.
Virginia Tech College Essay We Write Cu. Online assignment writing service.Virginia Tech College Essay We Write Cu. Online assignment writing service.
Virginia Tech College Essay We Write Cu. Online assignment writing service.
 
Social software for teaching and learning, mid-2008
Social software for teaching and learning, mid-2008Social software for teaching and learning, mid-2008
Social software for teaching and learning, mid-2008
 
Objectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsObjectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative Connotations
 
Being Engelbartian
Being EngelbartianBeing Engelbartian
Being Engelbartian
 
BCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaarBCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaar
 
Notational systems and cognitive evolution
Notational systems and cognitive evolutionNotational systems and cognitive evolution
Notational systems and cognitive evolution
 
Digital Texts scholarly communication in a digital networked age
Digital Texts scholarly communication in a digital networked ageDigital Texts scholarly communication in a digital networked age
Digital Texts scholarly communication in a digital networked age
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
 

More from Sorin Adam Matei

Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...
Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...
Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...
Sorin Adam Matei
 
Visible Effort: A Social Entropy Methodology for Managing Computer-Mediated ...
Visible Effort: A Social Entropy Methodology for  Managing Computer-Mediated ...Visible Effort: A Social Entropy Methodology for  Managing Computer-Mediated ...
Visible Effort: A Social Entropy Methodology for Managing Computer-Mediated ...
Sorin Adam Matei
 
Sorin Adam Matei Curriculum Vitae
Sorin Adam Matei Curriculum VitaeSorin Adam Matei Curriculum Vitae
Sorin Adam Matei Curriculum Vitae
Sorin Adam Matei
 
Web 3.0
Web 3.0Web 3.0
Cine are carte, imparte?
Cine are carte, imparte?Cine are carte, imparte?
Cine are carte, imparte?
Sorin Adam Matei
 
The Internet is a magnifying glass
The Internet is a magnifying glassThe Internet is a magnifying glass
The Internet is a magnifying glass
Sorin Adam Matei
 
Marital Status, Individualism And On Line
Marital Status, Individualism And On LineMarital Status, Individualism And On Line
Marital Status, Individualism And On Line
Sorin Adam Matei
 
Communication As A Spatial Problem
Communication As A Spatial ProblemCommunication As A Spatial Problem
Communication As A Spatial Problem
Sorin Adam Matei
 
Convorbiri Despre Paramodernitate
Convorbiri Despre ParamodernitateConvorbiri Despre Paramodernitate
Convorbiri Despre Paramodernitate
Sorin Adam Matei
 
Nca2006
Nca2006Nca2006
Barometru Preelectoral 23 26nov
Barometru Preelectoral 23 26novBarometru Preelectoral 23 26nov
Barometru Preelectoral 23 26nov
Sorin Adam Matei
 
Barometru Preelectoral 17 21nov
Barometru Preelectoral 17 21novBarometru Preelectoral 17 21nov
Barometru Preelectoral 17 21nov
Sorin Adam Matei
 
Barometrul de opinie publica 1999
Barometrul de opinie publica 1999Barometrul de opinie publica 1999
Barometrul de opinie publica 1999
Sorin Adam Matei
 
Sondaj de opinie, Ianuarie 2007
Sondaj de opinie, Ianuarie 2007Sondaj de opinie, Ianuarie 2007
Sondaj de opinie, Ianuarie 2007
Sorin Adam Matei
 

More from Sorin Adam Matei (14)

Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...
Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...
Enhancing C-Span Video Archive with Practice Capital Metadata and data journa...
 
Visible Effort: A Social Entropy Methodology for Managing Computer-Mediated ...
Visible Effort: A Social Entropy Methodology for  Managing Computer-Mediated ...Visible Effort: A Social Entropy Methodology for  Managing Computer-Mediated ...
Visible Effort: A Social Entropy Methodology for Managing Computer-Mediated ...
 
Sorin Adam Matei Curriculum Vitae
Sorin Adam Matei Curriculum VitaeSorin Adam Matei Curriculum Vitae
Sorin Adam Matei Curriculum Vitae
 
Web 3.0
Web 3.0Web 3.0
Web 3.0
 
Cine are carte, imparte?
Cine are carte, imparte?Cine are carte, imparte?
Cine are carte, imparte?
 
The Internet is a magnifying glass
The Internet is a magnifying glassThe Internet is a magnifying glass
The Internet is a magnifying glass
 
Marital Status, Individualism And On Line
Marital Status, Individualism And On LineMarital Status, Individualism And On Line
Marital Status, Individualism And On Line
 
Communication As A Spatial Problem
Communication As A Spatial ProblemCommunication As A Spatial Problem
Communication As A Spatial Problem
 
Convorbiri Despre Paramodernitate
Convorbiri Despre ParamodernitateConvorbiri Despre Paramodernitate
Convorbiri Despre Paramodernitate
 
Nca2006
Nca2006Nca2006
Nca2006
 
Barometru Preelectoral 23 26nov
Barometru Preelectoral 23 26novBarometru Preelectoral 23 26nov
Barometru Preelectoral 23 26nov
 
Barometru Preelectoral 17 21nov
Barometru Preelectoral 17 21novBarometru Preelectoral 17 21nov
Barometru Preelectoral 17 21nov
 
Barometrul de opinie publica 1999
Barometrul de opinie publica 1999Barometrul de opinie publica 1999
Barometrul de opinie publica 1999
 
Sondaj de opinie, Ianuarie 2007
Sondaj de opinie, Ianuarie 2007Sondaj de opinie, Ianuarie 2007
Sondaj de opinie, Ianuarie 2007
 

Recently uploaded

Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
jpupo2018
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 

Recently uploaded (20)

Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 

Wikipedia structure of collaboration

  • 1. The structure of social collaboration on Wikipedia Sorin Adam Matei, Associate Professor of Communication, Purdue U smatei@purdue.edu David Braun, Research Scientist, Envision Lab, Purdue U dbraun@purdue.edu HoriaPetrache, Assistant Professor of Physics, IUPUI hpetrach@iupui.edu Presented at Wikimania, 2009 Buenos Aires, Argentina August 25-28 2009 http://wikimania2009.wikimedia.org/wiki/Proceedings:132
  • 2. 2005: A Wikipedian explains Wikipedia as Wisdom of Crowds The basic premise [of Wisdom of Crowds] that crowds of relatively ignorant individuals make better decisions than small groups of experts. I'm sure everyone here agrees with this as Wikipedia is run this way... Wikipedia displays emergent properties because each article is better than the contribution of each individual. Similarly, ants display emergence because an ant colony can accomplish things that each individual ant cannot even conceive.
  • 3. Implied idea Fine grained, micro contributions, independent and decentralized and maybe equal lead to articles that are better than what each contributor can write
  • 4. As expressed in this Wikipedia-l post I imagine Wikipedia as a massive, active swarm intelligence, supplemented by small roving groups of active editors who admire consistency, elegance, and reasoned discourse. (not unlike certain models of how the brain works :) The swarm does the bulk of the writing, especially finding and providing current facts, starting new articles, and adding neglected POVs. The roving groups are sensitive to dozens of policy pages, and implement them as they rove... they also take on large projects, one at a time, and try to implement certain changes across thousands of pages at once.
  • 5. To which “Jimbo” (Wales) answers I should point out that I like Suroweicki'sthesis just fine, it's just that I'm not convinced that "swarm intelligence" is very helpful in understanding how Wikipedia works -- in fact, it might be an impediment, because it leads us away from thinking about how the community interacts in a process of reasoned discourse.
  • 6. Jimbo concludes My research (conducted in December) showed that half the edits by logged in users belong to just 2.5% of logged in users.
  • 7. Does the 80/20 applies? Power-law curves are all over the real world … Adar and Huberman (2000) found 50% of the content on Gnutella is provided by 1% of the users, O'Mahonyand Ferraro (2003) found the curve in the Debian dev key ring, Moon and Sproul (2002) on the Linux Kernel list, Briggs et al. (1997) in group support systems, Krogh, Spaeth and Lakhani (2003) in Freenet. (By another participant to the 2005 discussion)
  • 8. What would the 80/20 rule mean? Extreme inequality? Elitism? Structured collaboration? Interactive exchanges between groups of individuals?
  • 9. Previous research Wikipedia contributions, in all languages, have become more skewed in favor of a small group of editors and old time users (Ortega et al., 2005)
  • 10. Top contributors dominate edits and no words contributed
  • 11. Our approach Increase in inequality => higher level of structuration Increasing division of labor From diffuse collaboration to structured collaboration Emergence of bureaucracy Emergence of adhocracy Groups of individuals that become article stewards
  • 12. Social entropy and structuration Social Entropy As system become organized (biased) their entropy decreases Entropy is a measure of meaningful organization
  • 13. Entropy and organization Meaningful messages use words and letters in uneven manner Symbol distribution in meaningful messages is uneven Information (and social) entropy are measures of organization and meaning As collaboration becomes more biased, the group becomes more organized
  • 14. Shannon’s formula where the sum is over all users i, and is the fractional contribution of user i. We allow p and S and to be functions of time (t).
  • 15. Shannon’s forumal explained Social entropy reflects how uneven and lacking in diversity a group/system process is 10 users and 100 contributions, each contributing 10 edits to a Wikipedia article => entropy reaches its highest level 1 contributor contributes all, entropy at the lowest value
  • 16. Analytic strategy Downloaded latest available dump Trouble with unzipping (dump corrupted) Extracted 792,654 registered users 234,798 articles Calculated number of times individuals contributed to each article and how many words have they contributed (not completely finalized)
  • 17.
  • 18. Orange: fit curve (takes into account the spread of values
  • 19. Dotted: Maximum entropy, wisdom of crowds ceilingln(x) Intervention entropy Basic plot: Entropy increases for the first @500 interventions, then levels off…. Intervention number (events)
  • 20.
  • 21. Orange: fit curve (takes into account the spread of values
  • 22. Dotted: Maximum entropy, wisdom of crowds ceilingln(x) Intervention entropy Intervention number (events) Logged plot: Average article entropy increasingly and monotonously diverges from the “wisdom of the crowds” ceiling. Wikipedia becomes “cooler” and more and more structured ….
  • 23. Standard deviation/ Int. Entropy After the 500th intervention the coefficient of variation (StDev/Mean) becomes constant; all articles tend to behave within the same limits of variability for the next 9,500 iterations n-1/2 ratio Intervention number (events)
  • 24. What remains to be done Entropy decreases, Wikipedia “hardens” Does it become more structured? In what way? Will analyze degree of structuration measuring structure of coedits (network) analysis Expectation: as entropy decreases, network structures become more hierarchical can inflexible (less degrees of freedom) Will analyze distribution of collaboration across formal and informal roles Who are the nodes of collaboration What is their contribution to cooling and hardening Wikipedia