SlideShare a Scribd company logo
WikiBhasha From Digital Inclusion to Digital Democracy… Dr A Kumaran Microsoft Research Feb 2011
“Egypt will not become liberal democracy overnight; it has a long and difficult journey…Becoming an open and empowered democracy needs a slow process of popular self-education, and it will not come easily or naturally.” Hindustan Times Editorial Feb 14, 2011
Agenda Language Technology Research (15Min) “Digital Inclusion vs. Digital Democracy” (5 Min) WikiBhasha (5 Min)
Language Technology ResearchFrom Classical to Statistical…
Technology & Language... ,[object Object]
3 Cen. BC  ~O(KB-MB)
Scripts/Media for preservation/…
Print Era: Post-Gutenberg
15 Cen. AD  ~O(GB)?
Printing tech./coordination & distribution/…
Electronics & Computing Era
Late 20thCen.  ~O(TB-PB)
Electronic standards/Multimedia/Storage tech./…,[object Object]
Need of the hour: Language Technology Research! Language Technology is primarily concerned with processing Natural Language data ,[object Object]
Information Extraction
Machine Translation
Language Understanding
…Needs “Computational Linguistics” Research!
Linguistics & Computational Linguistics ,[object Object]
Computational Linguistics studies Computational Models for languages
Lexicography 	-  Letter-level
Morphology & Phonology  -  Word-level
Syntax -  Sentence-level
Semantics -  …
Pragmatics -  …,[object Object]
Ex #1: Which Language? (In general, Language Identification) Is a document in English or Finnish or Tamil?  “Length of words” Near-perfect identification!
IKONE-2007 Ex #2: Which is Right?(In general, Grammar Checking & Modeling) Are these correct sentences?  S1: “Yesterday evening, I will have tea”  S2: “I enjoy my tea with biscuits”   S3: “I enjoy my tea with motor oil” ,[object Object]
P(S1): 0.01 P(S2): 0.75 P(S3): 0.05,[object Object],[object Object]
Ex #5: Statistical MT President visits Chennai ஜனாதிபதிசென்னைசெல்கிறார் Statistical Models Parallel corpora visits President Chennai செல்கிறார் ஜனாதிபதி சென்னை President –ஜனாதிபதி       visits  –செல்கிறார் Chennai –சென்னை President inaugurates Tamil Conference ஜனாதிபதி தமிழ்மாநாட்டைதுவக்குகிறார்
For most Language Technologies… Statistical approaches EXIST, and are proven to be very successful! Data are Critical! Theorem: Data drivesResearch & Technology!
Where are the data? Axiom:  Web = Language data Read “Wikipedia = Language Data” ,[object Object]

More Related Content

Similar to Wikibhasha by Dr A Kumaran

Information Management Trends 2009
Information Management Trends 2009Information Management Trends 2009
Information Management Trends 2009
Christopher Eagle
 
INSC580MacasaOpenSourceSoftwareLibrariesFall2016
INSC580MacasaOpenSourceSoftwareLibrariesFall2016INSC580MacasaOpenSourceSoftwareLibrariesFall2016
INSC580MacasaOpenSourceSoftwareLibrariesFall2016Michael J. Macasa
 
Inforum 2007 Into The User environment
Inforum 2007 Into The User environmentInforum 2007 Into The User environment
Inforum 2007 Into The User environment
Guus van den Brekel
 
Oss In Libraries And We Information Professional
Oss In Libraries And We Information ProfessionalOss In Libraries And We Information Professional
Oss In Libraries And We Information Professional
Ashok Kumar Satapathy
 
Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries
mdabrowski
 
Irish Digital Libraries Summit
Irish Digital Libraries SummitIrish Digital Libraries Summit
Irish Digital Libraries Summit
Sebastian Ryszard Kruk
 
Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...
Itinera Nova
 
Institutional knowledge and information ecology in a Free Software ecosystem
Institutional knowledge and information ecology in a Free Software ecosystemInstitutional knowledge and information ecology in a Free Software ecosystem
Institutional knowledge and information ecology in a Free Software ecosystem
Derek Keats
 
Global Media Monitor - Marko Grobelnik
Global Media Monitor - Marko GrobelnikGlobal Media Monitor - Marko Grobelnik
Global Media Monitor - Marko Grobelnik
Marko Grobelnik
 
The scripting library: Combining data and information in the library
The scripting library: Combining data and information in the libraryThe scripting library: Combining data and information in the library
The scripting library: Combining data and information in the libraryBonaria Biancu
 
20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanities20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanitiesStefan Gradmann
 
HIT project - Humanities Integration Technology
HIT project - Humanities Integration TechnologyHIT project - Humanities Integration Technology
HIT project - Humanities Integration Technology
Justo Hidalgo
 
MarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meetingMarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meeting
mdabrowski
 
Hci
HciHci
New ICT Trends and Issues of Librarianship
New ICT Trends and Issues of LibrarianshipNew ICT Trends and Issues of Librarianship
New ICT Trends and Issues of Librarianship
Liaquat Rahoo
 
Web 20 E Oltre 1202297800291589 3
Web 20 E Oltre 1202297800291589 3Web 20 E Oltre 1202297800291589 3
Web 20 E Oltre 1202297800291589 3Universita' di Bari
 
Wikipedia : Workshop
Wikipedia : WorkshopWikipedia : Workshop
Wikipedia : WorkshopNIFT
 
Semantic Technolgy
Semantic TechnolgySemantic Technolgy
Semantic TechnolgyTalat Fakhri
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
sathish sak
 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using python
Hetu Bhavsar
 

Similar to Wikibhasha by Dr A Kumaran (20)

Information Management Trends 2009
Information Management Trends 2009Information Management Trends 2009
Information Management Trends 2009
 
INSC580MacasaOpenSourceSoftwareLibrariesFall2016
INSC580MacasaOpenSourceSoftwareLibrariesFall2016INSC580MacasaOpenSourceSoftwareLibrariesFall2016
INSC580MacasaOpenSourceSoftwareLibrariesFall2016
 
Inforum 2007 Into The User environment
Inforum 2007 Into The User environmentInforum 2007 Into The User environment
Inforum 2007 Into The User environment
 
Oss In Libraries And We Information Professional
Oss In Libraries And We Information ProfessionalOss In Libraries And We Information Professional
Oss In Libraries And We Information Professional
 
Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries
 
Irish Digital Libraries Summit
Irish Digital Libraries SummitIrish Digital Libraries Summit
Irish Digital Libraries Summit
 
Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...
 
Institutional knowledge and information ecology in a Free Software ecosystem
Institutional knowledge and information ecology in a Free Software ecosystemInstitutional knowledge and information ecology in a Free Software ecosystem
Institutional knowledge and information ecology in a Free Software ecosystem
 
Global Media Monitor - Marko Grobelnik
Global Media Monitor - Marko GrobelnikGlobal Media Monitor - Marko Grobelnik
Global Media Monitor - Marko Grobelnik
 
The scripting library: Combining data and information in the library
The scripting library: Combining data and information in the libraryThe scripting library: Combining data and information in the library
The scripting library: Combining data and information in the library
 
20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanities20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanities
 
HIT project - Humanities Integration Technology
HIT project - Humanities Integration TechnologyHIT project - Humanities Integration Technology
HIT project - Humanities Integration Technology
 
MarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meetingMarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meeting
 
Hci
HciHci
Hci
 
New ICT Trends and Issues of Librarianship
New ICT Trends and Issues of LibrarianshipNew ICT Trends and Issues of Librarianship
New ICT Trends and Issues of Librarianship
 
Web 20 E Oltre 1202297800291589 3
Web 20 E Oltre 1202297800291589 3Web 20 E Oltre 1202297800291589 3
Web 20 E Oltre 1202297800291589 3
 
Wikipedia : Workshop
Wikipedia : WorkshopWikipedia : Workshop
Wikipedia : Workshop
 
Semantic Technolgy
Semantic TechnolgySemantic Technolgy
Semantic Technolgy
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using python
 

Recently uploaded

CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 

Recently uploaded (20)

CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 

Wikibhasha by Dr A Kumaran

  • 1. WikiBhasha From Digital Inclusion to Digital Democracy… Dr A Kumaran Microsoft Research Feb 2011
  • 2. “Egypt will not become liberal democracy overnight; it has a long and difficult journey…Becoming an open and empowered democracy needs a slow process of popular self-education, and it will not come easily or naturally.” Hindustan Times Editorial Feb 14, 2011
  • 3. Agenda Language Technology Research (15Min) “Digital Inclusion vs. Digital Democracy” (5 Min) WikiBhasha (5 Min)
  • 4. Language Technology ResearchFrom Classical to Statistical…
  • 5.
  • 6. 3 Cen. BC ~O(KB-MB)
  • 9. 15 Cen. AD ~O(GB)?
  • 10. Printing tech./coordination & distribution/…
  • 12. Late 20thCen. ~O(TB-PB)
  • 13.
  • 14.
  • 19.
  • 20. Computational Linguistics studies Computational Models for languages
  • 21. Lexicography - Letter-level
  • 22. Morphology & Phonology - Word-level
  • 23. Syntax - Sentence-level
  • 24. Semantics -
  • 25.
  • 26. Ex #1: Which Language? (In general, Language Identification) Is a document in English or Finnish or Tamil? “Length of words” Near-perfect identification!
  • 27.
  • 28.
  • 29. Ex #5: Statistical MT President visits Chennai ஜனாதிபதிசென்னைசெல்கிறார் Statistical Models Parallel corpora visits President Chennai செல்கிறார் ஜனாதிபதி சென்னை President –ஜனாதிபதி visits –செல்கிறார் Chennai –சென்னை President inaugurates Tamil Conference ஜனாதிபதி தமிழ்மாநாட்டைதுவக்குகிறார்
  • 30. For most Language Technologies… Statistical approaches EXIST, and are proven to be very successful! Data are Critical! Theorem: Data drivesResearch & Technology!
  • 31.
  • 32.
  • 33. WikiBhashaResearch Project on Crowd-sourcing to explore collaborative data creation for Computational Linguistic research (first focus: parallel data)
  • 34. Content Creation by Infusion… Rough content using Machine Translation Appropriate community correction to create value… Article to Target Wikipedia WikiBABEL on Wikipedia CollaborativeTranslation Cache MachineTranslationSystem Linguistic Resources
  • 35.
  • 38. Little traction with Wikipedians Published in WikiSYM 2008 Conference; Adopted for some products in Microsoft
  • 39. WikiBhasha V2.0: Design Objectives #1: Focus users on their purpose (say, Wikipedia) Content Creation, and not Translation #2: In-site Solution WikiBABEL to stay on Wikipedia for the session Submit any/all contributions #3: Generic components, but specifically purposed Vendor Neutrality Componentized Architecture …
  • 41. WikiBhasha Beta WikiBABEL released as WikiBhasha… Content creator, and not Translator ‘On-Wikipedia’ Open sourced ‘Bhasha’ in Sanskrit means ‘Language’  2010 Version; Interested in open-sourcing and contributing to Wikipedia
  • 42. WikiBhasha: User View CTF Dictionary Designed WikiBhasha as a thin edit layer Stays on Wikipedia User contribution submitted to Wikipedia Cloud Services WikiBABEL UX Wikipedia User Community WikiBhasha 2.0 API’s
  • 43. WikiBhasha: Developer View WikiBhasha designed to be modular & extendible Open-sourced, so community can contribute/enhance WikiBhasha CORE Components GUI Components(Wikipedia-specific UI and Workflow) MediaWiki Software WikiBABEL [Edit] WikiBABEL-CORE User-Interface(Generic UI Components, Scratch Pad, …) User-Experience(Linguistically Aware Wiki-site Aware Workflow Engine) User Management(Authentication, User Credentials Management, User Preferences/Skills, Contributions Tracking, …) Contextual Help(Domain-specific, Context-specific, User-Contribution Aware Help…) CTF Wikipedia Communication (Message Boards, Email/Alert Mechanisms, Wikis, …) Linguistic Resources(Mono-/Bi-lingual Dictionaries, Thesauri, …) Content Management(Content Discovery, Versions, Tagging, Notification Lists, …) 3rd Party Linguistic Services Source/Target Wiki System Interface MediawikiExtensions Lang. Technology Components(Machine Translation, Transliteration, Summarization …) Source/Target Wiki System Interface(Wiki API’s for Content Pull/Push, Content & User Management, …) MediaWikiLayer Cloud Services Layer WikiBhasha UI/UX/IntegrationComponents Layer
  • 44. WikiBhasha: A Community Project WikiBhasha is available as a Bookmarklet/ Wikipedia user-script Please contribute to your Wikipedia! WikiBhasha source code available as a MediaWiki Extension http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/WikiBhasha Please enhance it!
  • 45. WikiBhasha Release Released & Open-sourced in 10/’10 Announced jointly by MSR and WMF
  • 46. Covered in 20+ languages/countries across the world WikiBhasha Release
  • 47. WikiBhasha Release ~500K Visits & ~100K Unique Visitors Visits from 50+ countries Primarily from Europe (and Eastern Europe) Many “casual visitors” who may become “contributors”!
  • 48. Community Program Being conducted in 5 demographics Allahabad &Banaras Cairo Delhi, … Objectives Interaction with Wikipedians & Language Enthusiasts To study community adoption, user experience, data creation, and ultimately, technology development…
  • 49. Back to “Digital Democracy”Communities to Research…
  • 50. Languages: Communities & Technology Research requires Data Participatory Internet provides the data needed! Digital “haves and have-nots” For many languages of the world Digital Inclusion is a necessary first step Digital Democracy is a process in which the communities may have to take active part in…

Editor's Notes

  1. Hosted site One article to one article.