Wikilytics. Data Summit, Feb 4, 2011. Diederik van Liere
An introduction to the Wikilytics platform for analyzing the health of a MediaWiki community.


    1. wikilytics. DATA SUMMIT, Feb 4, 2011. Diederik van Liere
    2. “Live Demo”
    3. raw data -> wikilytics platform -> answers
    4. Wikilytics: raw data -> config | download | extract | sort | store | transform -> answers
    5. configure. wikilytics: config | download | extract | sort | store | transform
    6. language support. wikilytics: config | download | extract | sort | store | transform
    7. download dump. wikilytics: config | download | extract | sort | store | transform
    8. extract variables. wikilytics: config | download | extract | sort | store | transform
    9. mergesort edits by editor. wikilytics: config | download | extract | sort | store | transform
    10. store editors in db. wikilytics: config | download | extract | sort | store | transform
    11. precompute variables. wikilytics: config | download | extract | sort | store | transform
    12. config | download | extract | sort | store | transform: from 00:00:00:000 to 01:05:29:352 (Polish wiki)
    13. “Command lines are too hard...”
    14. Launch the entire data-processing chain from your browser
    15. The entire data-processing chain is running...
    16. wikilytics platform
        • Analysis: Plugins (Python), Map / Reduce (JS)
        • Storage: MongoDB
        • Output: BSON document in a new collection in MongoDB
    17. Each analysis is a plugin...

        def new_editor_count(var, editor, **kwargs):
            '''
            Summary: This function generates an overview of the number of
            new_wikipedians for a given year / month combination.
            Purpose: This data can be used to compare with Erik Zachte's
            stats.download.org to make sure that we are using the same numbers.
            '''
            # headers = ['year', 'month', 'count']
            new_wikipedian = editor['new_wikipedian']
            var.add(new_wikipedian, {0: 1})
            return var
    18. ...and available in your browser
    19. Text
    20. Resources
        • strategy.wikimedia.org/wiki/Editor_Trends_Study
        • strategy.wikimedia.org/wiki/Editor_Trends_Study/Software
        • svn.wikimedia.org/svnroot/mediawiki/trunk/tools/editor_trends/
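
The plugin on slide 17 receives a `var` container and one `editor` record, and calls `var.add(...)`. The slides do not show what `var` is or how plugins are invoked, so the following is only a minimal sketch under stated assumptions: `MonthlyCounter` and `run_plugin` are hypothetical names, and `editor['new_wikipedian']` is assumed to hold a `datetime`. It illustrates how a plugin of that shape could produce the `[year, month, count]` rows mentioned in the slide's comment.

```python
from collections import defaultdict
from datetime import datetime


class MonthlyCounter:
    """Hypothetical stand-in for the `var` object passed to a plugin."""

    def __init__(self):
        # (year, month) -> {column index -> running total}
        self.counts = defaultdict(lambda: defaultdict(int))

    def add(self, date, values):
        # Accumulate each column's amount into the bucket for date's month.
        bucket = self.counts[(date.year, date.month)]
        for column, amount in values.items():
            bucket[column] += amount

    def rows(self):
        # One row per month, in chronological order: [year, month, count].
        return [[y, m, cols[0]] for (y, m), cols in sorted(self.counts.items())]


def new_editor_count(var, editor, **kwargs):
    # Same shape as the plugin on slide 17.
    new_wikipedian = editor['new_wikipedian']
    var.add(new_wikipedian, {0: 1})
    return var


def run_plugin(plugin, editors):
    # Hypothetical driver: feed every editor record through one plugin.
    var = MonthlyCounter()
    for editor in editors:
        var = plugin(var, editor)
    return var.rows()


# Made-up sample records, for illustration only.
editors = [
    {'new_wikipedian': datetime(2010, 1, 5)},
    {'new_wikipedian': datetime(2010, 1, 20)},
    {'new_wikipedian': datetime(2010, 3, 2)},
]
rows = run_plugin(new_editor_count, editors)
print(rows)  # [[2010, 1, 2], [2010, 3, 1]]
```

Keeping the plugin a plain function of `(var, editor)` means a new analysis only needs a new function; the driver and the container stay unchanged.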

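
Slide 9 names a "mergesort edits by editor" step without showing how it is done. One way to picture it, as a sketch rather than the platform's actual code, is to sort each chunk of extracted edits on its own and then k-way merge the sorted chunks with the standard library's `heapq.merge`, so that all edits by one editor end up adjacent. The `(editor, timestamp)` tuple layout and the function name `edits_by_editor` are assumptions made for this example.

```python
import heapq
from itertools import groupby
from operator import itemgetter


def edits_by_editor(chunks):
    """Group edits per editor using sort-then-merge.

    `chunks` is a list of lists of (editor, timestamp) tuples, an
    assumed layout; each chunk stands in for one extracted file.
    """
    # Phase 1: sort every chunk independently.
    sorted_chunks = [sorted(chunk) for chunk in chunks]
    # Phase 2: k-way merge of the already-sorted chunks.
    merged = heapq.merge(*sorted_chunks)
    # Edits by the same editor are now consecutive, so groupby works.
    return {
        editor: [timestamp for _, timestamp in group]
        for editor, group in groupby(merged, key=itemgetter(0))
    }


# Made-up sample data, for illustration only.
chunks = [
    [('bob', '2010-02-01'), ('alice', '2010-01-05')],
    [('alice', '2010-01-02'), ('carol', '2010-03-09')],
]
grouped = edits_by_editor(chunks)
print(grouped)
# {'alice': ['2010-01-02', '2010-01-05'], 'bob': ['2010-02-01'], 'carol': ['2010-03-09']}
```

Because `heapq.merge` is lazy, only one pending item per chunk is held at a time during the merge, which is the property that makes this approach attractive for dumps too large to sort in one pass.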