Development of Indic Language Spell Checking
            Dictionary for OOo




   K G Sulochana
   G Jaganadh
   C-DAC Thiruvananthapuram




             FossConf 2008 Chennai
Talk Summery


    Introduction

    OpenOffice.org

    Hunspell

    Building Dictionaries

    Issues

    Tips for Building Dictionaries

    Evaluation of Performance

    Conclusion
                       FossConf 2008 Chennai
Introduction



Localization
Spell checking
Spell checker Development in Indian Languages




                 FossConf 2008 Chennai
OpenOffice.org


Free and Open Source Office Suite
Collaborative Development
Available with Interface in Indian Languages
Future Office Suite of India




                 FossConf 2008 Chennai
Hunspell


Spell checking library in OOo
Free and Open Source
Capable of Handling Complex Languages
Unicode Support
Compound Handling




                  FossConf 2008 Chennai
Building Dictionaries


Format of Hunspell Dictionaries
Word list .dic file
Rule Base or Affix list .aff file
How to prepare .dic file ?
How to prepare .aff file ?
Building OOo with your .dic file and .aff file


                      FossConf 2008 Chennai
Issues



Availability of Word list
Rule Generation
Sandhi Handling
Tuning Compound Handling part for Indian Languages




                    FossConf 2008 Chennai
Tips




Tips and tricks for generating word list
Tips and Tricks for rule base development




                 FossConf 2008 Chennai
Testing and Evaluation



Testing of spell checker
Performance evaluation
Quality ensuring




                   FossConf 2008 Chennai
Towards Future



Works to do in Hunspell for Indic Language Spell
Checking in OOo
Setting Up User Groups of Help Support and
Development




                  FossConf 2008 Chennai
Concluding Remarks




  FossConf 2008 Chennai
Questions ?



 FossConf 2008 Chennai
Thank You


 FossConf 2008 Chennai

Indian Language Spellchecker Development for OpenOffice.org