Editorial Collaboration Networks of Wikipedia Articles in Various Languages

  • 1,197 views
Uploaded on

Our presentation about open-collaboration given at the International Conference on Collaborative Innovation Networks (COINs2011) in Basel, Switzerland, Sep. 9, 2011. The video of this presentation is …

Our presentation about open-collaboration given at the International Conference on Collaborative Innovation Networks (COINs2011) in Basel, Switzerland, Sep. 9, 2011. The video of this presentation is available at the Livestream site http://www.livestream.com/coinsconference

More in: Design , Technology , Education
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
1,197
On Slideshare
0
From Embeds
0
Number of Embeds
2

Actions

Shares
Downloads
5
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. COINs2011 Editorial Collaboration Networks ofWikipedia Articles in Various Languages Takashi Iba Ko Matsuzuka Daiki Muramatsu Faculty of Policy Management, Keio University
  • 2. Editorial Collaboration Networks ofWikipedia Articles in Various Languages • The characteristics of collaboration patterns of all articles in a certain language. • The commonality and differences of collaboration patterns among Wikipedias written in various languages.
  • 3. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  • 4. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  • 5.   Method: Sequential collaboration network Building a sequential collaboration network, connecting a relation from editor A to editor B, if editor B follows on work done by editor A. order 1 A A 2 B 3 4 A B C C 5
  • 6. Sequential Collaboration Network of Article“Collaborative Innovation Networks” in English WikipediaThe number of Nodes = 51Average path length = 6.399
  • 7. Sequential Collaboration Network of Article “Basel”in English WikipediaThe number of Nodes = 594Average path length = 6.577
  • 8. Sequential Collaboration Network of Article “Switzerland”in English WikipediaThe number of Nodes = 3998Average path length = 5.468
  • 9. Sequential Collaboration Network of Article “Fondue”in English WikipediaThe number of Nodes = 457Average path length = 10.485
  • 10.   Method: Sequential collaboration network Building a sequential collaboration network, connecting a relation from editor A to editor B, if editor B follows on work done by editor A. order 1 A A 2 B 3 4 A B C C 5
  • 11. Our Previous Study: Featured Articles in English Wikipedia each sequential collaboration network The average path length of Linear graph 2,545 articles [Jun 27 2009] The order of each sequential collaboration network (The number of editors in each article)T. Iba, K. Nemoto, B. Peters & P. Gloor, "Analyzing the Creative Editing Behavior of WikipediaEditors Through Dynamical Social Network Analysis", COINs2011, 2009T. Iba and S. Itoh, "Sequential Collaboration Network of Open Collaboration", NetSci09, 2009
  • 12. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  • 13.  Analysis 1: Comparison of 12 different languages Target Languages Rank 1: English Rank 2: German Rank 3: French Rank 4: Polish Rank 5: Italian Rank 6: Japanese Rank 7: Spanish Rank 8: Dutch Rank 9: PortugueseAnalyzing ALL articles as of Rank 10: RussianJanuary 1st, 2011 in each language. … Rank 15: FinnishThe ranking based on the data as of …January 6th, 2011. Rank 20: Turkish
  • 14. English Rank 1 3,490,325 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 15. English Rank 1 3,490,325 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 16. German Rank 2 1,155,210 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 17. French Rank 3 1,039,251 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 18. Polish Rank 4 752,734 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 19. Italian Rank 5 750,634 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 20. Japanese Rank 6 718,974 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 21. Spanish Rank 7 676,866 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 22. Dutch Rank 8 656,079 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 23. Portuguese Rank 9 638,747 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 24. Russian Rank 10 627,139 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 25. Finnish Rank 15 255,712 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 26. Turkish Rank 20 152,262 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 27. English German French PolishItalian Japanese Spanish DutchPortuguese Russian Finnish Turkish
  • 28. Result of Analysis 1: Comparison of 12 different languages • Scatter plot of all articles exhibits a tilted triangle in all languages. • The height of triangle gets shorter as the number of articles decreases.
  • 29. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  • 30.  Analysis 2: Distribution of account and IP users IP users Account users
  • 31. Scatter plot of articles in English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 32. Scatter plot of articles withnumber of IP users / number of total editors each sequential collaboration network The average path length of 0.0 PIP 1.0 Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 33. Scatter plot of articles withnumber of IP users / number of total editors PIP = 0.0 PIP = 0.1 PIP = 0.2 PIP = 0.3 PIP = 0.4 PIP = 0.5 PIP = 0.6 PIP = 0.7 PIP = 0.8
  • 34. Scatter plot of articles withnumber of IP users / number of total editors each sequential collaboration network The average path length of 0.0 PIP 1.0 Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 35. Result of Analysis 2: Distribution of account and IP users • Top and right area of the “triangle” in scatter plot consist of articles which ratios of users is high. • As a result, both the average path length and order of network can be large in these areas. PIP = 0.0 PIP = 0.6
  • 36. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  • 37.  Analysis 3: Distribution of Featured Articles 3,372 featured articles / 3,732,033 articles In English Wikipedia
  • 38. Scatter plot of all articles in English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 39. Scatter plot of featured articles on the all articlesin English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 40. Scatter plot of featured articles on the all articlesin English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  • 41. Result of Analysis 3: Distribution of Featured Articles • Features articles are located at a certain area in the scatter plot. • It implies that there would be characteristic patterns of collaboration producing good results.
  • 42. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  • 43. Editorial Collaboration Networks ofWikipedia Articles in Various Languages • Scatter plot of all articles commonly exhibits a tilted triangle in all languages, but the height of triangle gets shorter as the number of articles decreases. • Top and right area of the “triangle” in scatter plot consist of articles which the ratios of IP users are high. • Features articles are located at a certain area in the scatter plot.
  • 44. Collaborators “Editorial Collaboration Networks of Wikipedia Articles in Various Languages” Contact us: e-mail to iba@sfc.keio.ac.jp Takashi Iba Associate Professor,Faculty of Policy Management, Keio UniversityPh.D. in media and governance Natsumi Yotsumoto Ko Matsuzuka Former student of Iba Lab. Iba Lab. Faculty of Policy Management,Faculty of Policy Management, Keio University Keio University Bui Hong Ha Daiki Muramatsu Former student of Iba Lab. Iba Lab. Faculty of Policy Management,Faculty of Policy Management, Keio University Keio University