Your SlideShare is downloading. ×
0
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
K Search Al Khawarizmy Language Software
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

K Search Al Khawarizmy Language Software

2,144

Published on

ARABIC SEARCH ENGINE(KSearch)

ARABIC SEARCH ENGINE(KSearch)

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
2,144
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
1
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Monday 12/05/2008
  • 2.
    • Arabic NLP Research
    • Arabic Applications based on NLP components
    • Stress on software quality (targeting ‘zero defect’ S/W)
    • Cooperate with the community; e.g. research students at universities (forming partnerships)
    • Promote widespread use of affordable applications that take the special features of the Arabic language into account
    • Effectively serve the Arab region by catering for its users’ needs
    Monday 12/05/2008
  • 3.
    • 1 st Nov. 2007 – 31 st Dec. 2007:
    • 3 Developers + 1 Product Manager => Small (borrowed) room.
    • 1 st Jan. 2008 – 31 st Jan. 2008:
    • 1 Linguist => Home Office.
    • 1 st Feb. 2008 – 31 st Mar. 2008:
    • 1 Linguist => Smart Village Incubation.
    • 1 st Apr. 2008 – Present:
    • 3 Developers + 1 Linguist + 1 Business Development Manager + 1 Office Manager => Smart Village Incubation.
    Monday 12/05/2008
  • 4.
    • The number of Arab Internet Users is growing
      • 22 million users in 2006
      • 43 million expected in 2008
    • The volume of Arabic e-content is increasing (on the web and in companies’ intranets):
    • Around 100 million Arabic web pages
    • About 5 million Arabic web sites
    Monday 12/05/2008
  • 5.
    • Arabic is a highly inflected language
    • Arabic morphology has a set of unique features
    • Proper Arabic e-content processing is deficient
    • Consequently, Arab users are unable to take full advantage of Arabic e-content, compared with other languages
    • As an example, considering searching through Arabic content …
    Monday 12/05/2008
  • 6. Using : - Search for “ الحائزون على جوائز نوبل ” produces about 238 results Monday 12/05/2008
  • 7. Using : - Search for “ الحائزون على جائزة نوبل ” produces about 684 results Monday 12/05/2008
  • 8. Using : - Search for “ حاز على جائزة نوبل ” produces about 16,700 results Monday 12/05/2008
  • 9.
    • When used for Arabic search, traditional search engines produce
      • Incomprehensive results, i.e. not all inflected forms are found => a lot of useful information is missing
      • Redundant results, i.e. some results are inaccurate => they ‘bear no relation’ in form or in meaning to the search word(s)
    Monday 12/05/2008
  • 10. An Arabic Search Model that:
    • Provides morphological search  Comprehensive
    • Differentiates between meanings of Arabic words  Improves Accuracy
    • In other words…
    • Let us see the same example, using KSearch …
    Monday 12/05/2008
  • 11. Monday 12/05/2008
  • 12.
    • Arabic Morphological Search (to produce comprehensive search results).
    • Differentiation between Word Meanings (to increase accuracy of search results, i.e. reduce redundancy).
    • Search using Logical Operators ( و – أو - ليس ).
    • Adjacency (Proximity) Search.
    • Search using Wildcards (for proper nouns and Latin text) .
    • Search words are highlighted in the results pages.
    • Over 200 document formats are supported, including UNICODE encoded documents.
    • Arabic comprehensive dictionary of contemporary Arabic (approximately 78,000 entries).
    • Fast Indexing Engine (25,000 - 30,000 words/sec on a PC with AMD Athlon 3800+ CPU, IDE HDD, 1GB RAM).
    • Uses 64 bit Technology => Unlimited Index Size.
    • Comprehensive Index Management: Capability of deleting, updating and merging indexes.
    Monday 12/05/2008
  • 13. Monday 12/05/2008 Arabic ِ Morphological Analyzer Comprehensive + Contemporary Arabic Lexicon Arabic Data Source (Database, Document, etc.) Fast Indexing Engine Meta Data Repository Search Engine Search Results Arabic Lexical Semantic Analyzer
  • 14.
    • Employs KMorph , a fast Arabic morphological analyzer
    • Uses a comprehensive Arabic lexicon of contemporary words
    • KSpell Engine: Provides APIs for spelling verification and correction, e.g. may be integrated with content management systems to produce correctly spelled Arabic web content
    Monday 12/05/2008

×