COINs2011 Editorial Collaboration Networks ofWikipedia Articles in Various Languages	                    Takashi Iba      ...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages  • The characteristics of collaboration pattern...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages     Method: Sequential collaboration network  ...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages     Method: Sequential collaboration network  ...
  Method: Sequential collaboration network    Building a sequential collaboration network, connecting a    relation from ...
Sequential Collaboration Network of Article“Collaborative Innovation Networks” in English WikipediaThe number of Nodes = 5...
Sequential Collaboration Network of Article “Basel”in English WikipediaThe number of Nodes = 594Average path length = 6.577
Sequential Collaboration Network of Article “Switzerland”in English WikipediaThe number of Nodes = 3998Average path length...
Sequential Collaboration Network of Article “Fondue”in English WikipediaThe number of Nodes = 457Average path length = 10....
  Method: Sequential collaboration network    Building a sequential collaboration network, connecting a    relation from ...
Our Previous Study: Featured Articles in English Wikipedia        each sequential collaboration network             The av...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages     Method: Sequential collaboration network  ...
 Analysis 1: Comparison of 12 different languages	                                         Target Languages              ...
English                                                                                     Rank 1                        ...
English                                                                                     Rank 1                        ...
German                                                                                      Rank 2                        ...
French                                                                                      Rank 3                        ...
Polish                                                                                       Rank 4                       ...
Italian                                                                                      Rank 5                       ...
Japanese                                                                                    Rank 6                        ...
Spanish                                                                                     Rank 7                        ...
Dutch                                                                                       Rank 8                        ...
Portuguese                                                                                  Rank 9                        ...
Russian                                                                                     Rank 10                       ...
Finnish                                                                                      Rank 15                      ...
Turkish                                                                                     Rank 20                       ...
English      German     French    PolishItalian      Japanese   Spanish   DutchPortuguese   Russian    Finnish   Turkish
Result of Analysis 1: Comparison of 12 different languages	    • Scatter plot of all articles exhibits a tilted triangle  ...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages     Method: Sequential collaboration network  ...
 Analysis 2: Distribution of account and IP users	                                                   IP users            ...
Scatter plot of articles in English Wikipedia      each sequential collaboration network           The average path length...
Scatter plot of articles withnumber of IP users / number of total editors      each sequential collaboration network      ...
Scatter plot of articles withnumber of IP users / number of total editors             PIP = 0.0         PIP = 0.1       PI...
Scatter plot of articles withnumber of IP users / number of total editors      each sequential collaboration network      ...
Result of Analysis 2: Distribution of account and IP users 	    • Top and right area of the “triangle” in scatter plot    ...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages     Method: Sequential collaboration network  ...
 Analysis 3: Distribution of Featured Articles	         3,372 featured articles / 3,732,033 articles                   In...
Scatter plot of all articles in English Wikipedia     each sequential collaboration network          The average path leng...
Scatter plot of featured articles on the all articlesin English Wikipedia      each sequential collaboration network      ...
Scatter plot of featured articles on the all articlesin English Wikipedia      each sequential collaboration network      ...
Result of Analysis 3: Distribution of Featured Articles	    • Features articles are located at a certain area in the    sc...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages     Method: Sequential collaboration network  ...
Editorial Collaboration Networks ofWikipedia Articles in Various Languages  • Scatter plot of all articles commonly  exhib...
Collaborators                         “Editorial Collaboration Networks of                                       Wikipedia...
Upcoming SlideShare
Loading in...5
×

Editorial Collaboration Networks of Wikipedia Articles in Various Languages

1,251

Published on

Our presentation about open-collaboration given at the International Conference on Collaborative Innovation Networks (COINs2011) in Basel, Switzerland, Sep. 9, 2011. The video of this presentation is available at the Livestream site http://www.livestream.com/coinsconference

Published in: Design, Technology, Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,251
On Slideshare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
6
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Editorial Collaboration Networks of Wikipedia Articles in Various Languages

  1. 1. COINs2011 Editorial Collaboration Networks ofWikipedia Articles in Various Languages Takashi Iba Ko Matsuzuka Daiki Muramatsu Faculty of Policy Management, Keio University
  2. 2. Editorial Collaboration Networks ofWikipedia Articles in Various Languages • The characteristics of collaboration patterns of all articles in a certain language. • The commonality and differences of collaboration patterns among Wikipedias written in various languages.
  3. 3. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  4. 4. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  5. 5.   Method: Sequential collaboration network Building a sequential collaboration network, connecting a relation from editor A to editor B, if editor B follows on work done by editor A. order 1 A A 2 B 3 4 A B C C 5
  6. 6. Sequential Collaboration Network of Article“Collaborative Innovation Networks” in English WikipediaThe number of Nodes = 51Average path length = 6.399
  7. 7. Sequential Collaboration Network of Article “Basel”in English WikipediaThe number of Nodes = 594Average path length = 6.577
  8. 8. Sequential Collaboration Network of Article “Switzerland”in English WikipediaThe number of Nodes = 3998Average path length = 5.468
  9. 9. Sequential Collaboration Network of Article “Fondue”in English WikipediaThe number of Nodes = 457Average path length = 10.485
  10. 10.   Method: Sequential collaboration network Building a sequential collaboration network, connecting a relation from editor A to editor B, if editor B follows on work done by editor A. order 1 A A 2 B 3 4 A B C C 5
  11. 11. Our Previous Study: Featured Articles in English Wikipedia each sequential collaboration network The average path length of Linear graph 2,545 articles [Jun 27 2009] The order of each sequential collaboration network (The number of editors in each article)T. Iba, K. Nemoto, B. Peters & P. Gloor, "Analyzing the Creative Editing Behavior of WikipediaEditors Through Dynamical Social Network Analysis", COINs2011, 2009T. Iba and S. Itoh, "Sequential Collaboration Network of Open Collaboration", NetSci09, 2009
  12. 12. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  13. 13.  Analysis 1: Comparison of 12 different languages Target Languages Rank 1: English Rank 2: German Rank 3: French Rank 4: Polish Rank 5: Italian Rank 6: Japanese Rank 7: Spanish Rank 8: Dutch Rank 9: PortugueseAnalyzing ALL articles as of Rank 10: RussianJanuary 1st, 2011 in each language. … Rank 15: FinnishThe ranking based on the data as of …January 6th, 2011. Rank 20: Turkish
  14. 14. English Rank 1 3,490,325 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  15. 15. English Rank 1 3,490,325 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  16. 16. German Rank 2 1,155,210 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  17. 17. French Rank 3 1,039,251 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  18. 18. Polish Rank 4 752,734 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  19. 19. Italian Rank 5 750,634 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  20. 20. Japanese Rank 6 718,974 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  21. 21. Spanish Rank 7 676,866 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  22. 22. Dutch Rank 8 656,079 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  23. 23. Portuguese Rank 9 638,747 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  24. 24. Russian Rank 10 627,139 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  25. 25. Finnish Rank 15 255,712 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  26. 26. Turkish Rank 20 152,262 articles each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  27. 27. English German French PolishItalian Japanese Spanish DutchPortuguese Russian Finnish Turkish
  28. 28. Result of Analysis 1: Comparison of 12 different languages • Scatter plot of all articles exhibits a tilted triangle in all languages. • The height of triangle gets shorter as the number of articles decreases.
  29. 29. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  30. 30.  Analysis 2: Distribution of account and IP users IP users Account users
  31. 31. Scatter plot of articles in English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  32. 32. Scatter plot of articles withnumber of IP users / number of total editors each sequential collaboration network The average path length of 0.0 PIP 1.0 Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  33. 33. Scatter plot of articles withnumber of IP users / number of total editors PIP = 0.0 PIP = 0.1 PIP = 0.2 PIP = 0.3 PIP = 0.4 PIP = 0.5 PIP = 0.6 PIP = 0.7 PIP = 0.8
  34. 34. Scatter plot of articles withnumber of IP users / number of total editors each sequential collaboration network The average path length of 0.0 PIP 1.0 Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  35. 35. Result of Analysis 2: Distribution of account and IP users • Top and right area of the “triangle” in scatter plot consist of articles which ratios of users is high. • As a result, both the average path length and order of network can be large in these areas. PIP = 0.0 PIP = 0.6
  36. 36. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  37. 37.  Analysis 3: Distribution of Featured Articles 3,372 featured articles / 3,732,033 articles In English Wikipedia
  38. 38. Scatter plot of all articles in English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  39. 39. Scatter plot of featured articles on the all articlesin English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  40. 40. Scatter plot of featured articles on the all articlesin English Wikipedia each sequential collaboration network The average path length of Double logarithmic graph The order of each sequential collaboration network (The number of editors in each article)
  41. 41. Result of Analysis 3: Distribution of Featured Articles • Features articles are located at a certain area in the scatter plot. • It implies that there would be characteristic patterns of collaboration producing good results.
  42. 42. Editorial Collaboration Networks ofWikipedia Articles in Various Languages   Method: Sequential collaboration network   Analysis 1: Comparison of 12 different languages   Analysis 2: Distribution of account and IP users   Analysis 3: Distribution of Featured Articles
  43. 43. Editorial Collaboration Networks ofWikipedia Articles in Various Languages • Scatter plot of all articles commonly exhibits a tilted triangle in all languages, but the height of triangle gets shorter as the number of articles decreases. • Top and right area of the “triangle” in scatter plot consist of articles which the ratios of IP users are high. • Features articles are located at a certain area in the scatter plot.
  44. 44. Collaborators “Editorial Collaboration Networks of Wikipedia Articles in Various Languages” Contact us: e-mail to iba@sfc.keio.ac.jp Takashi Iba Associate Professor,Faculty of Policy Management, Keio UniversityPh.D. in media and governance Natsumi Yotsumoto Ko Matsuzuka Former student of Iba Lab. Iba Lab. Faculty of Policy Management,Faculty of Policy Management, Keio University Keio University Bui Hong Ha Daiki Muramatsu Former student of Iba Lab. Iba Lab. Faculty of Policy Management,Faculty of Policy Management, Keio University Keio University
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×