SlideShare a Scribd company logo
1 of 5
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7962
A Rule-Based Stemmer for Punjabi Verbs
Prabhjot Kaur1, Preetpal Kaur Buttar2
1M. Tech. Research Scholar, Department of Computer Science and Engineering, Sant Longowal Institute of
Engineering and Technology, Longowal, Sangrur - 148106, Punjab, India
2Assistant Professor, Department of Computer Science and Engineering, Sant Longowal Institute of Engineering
and Technology, Longowal, Sangrur - 148106, Punjab, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract – This paper proposes a rule-based stemmer for
the stemming of verbs in Punjabi language. The proposed
Punjabi Verb Stemmer (PVS) uses a rule-based approach for
the stemming of Punjabi verbs. A database containing valid
root verbs occurring in thePunjabilanguagehasbeencreated.
The database stores 3,135 Punjabi root verbs. When a verb is
input to the system, it is first compared with the database of
root verbs to discover whether the input verb is a root verb or
an inflected verb. If the input verb is a root verb, no stemming
is required and the input verb is returned as the output.
Otherwise, the input verb (inflected) is sent to the suffix
stripping algorithm to get the corresponding root verb which
uses a predefined suffix list. Punjabi is a resource-scarce
language with a very few linguistic resourcesdevelopedsofar.
In case of stemming in Punjabi language, most of the work
done so far has concentrated on stemming of noun words and
proper names, no published work for the stemming of Punjabi
verbs has been reported. The proposed PVS has an overall
accuracy of 95.21%. This stemmer can contribute to many
applications of natural language processing and text mining.
Key Words: Stemming, Stemmer, Suffix Stripping, Verbs,
Punjabi
1. INTRODUCTION
Stemming is a pre-processing task in the information
retrieval systems[1], which reduces an inflected word to its
stem, root or base form [2, 3]. For example, a stemming
algorithm reduces the words “discussion”, “discussing” and
“discussed” to the stem “discuss”. Stemming is used in the
information retrieval tasks to improve system performance
by enhancing the ability to match document queries[3]. For
example, when a user enters a query word ‘applying’, s/he
may also want to retrieve the documents that contain the
word ‘apply’ and ‘applied’ as well [4]. Stemming is used to
improve the efficiency of various search engines to get the
results [5].
Various methods have been used for stemming such as
stochastic algorithm, brute force algorithm, suffix stripping
algorithm, n-gram analysis, lemmatizationalgorithm,hybrid
approaches, etc. These methods differ in terms of accuracy
and performance[3].
Ramanathan and Rao [6] proposed a lightweight stemmer
for Hindi language. Their proposed stemmer conflates the
similar words using suffix removal. The suffix lists for the
stemmer were created manually. Dasgupta and Ng [7]
proposed an algorithm for the unsupervised morphological
parsing of Bengali language by segmenting the words into
prefixes, suffixes, and stems. Pandey and Siddiqui [8]
suggested an unsupervised stemming algorithm for Hindi
language. The algorithm was based on division technique.
Zehurul et.al[9] proposed a lightweightstemmerforBengali
language based on a predefined suffixlistwhichremovesthe
suffix of a word on a longest-match basis. Suba et.al [10]
suggested two kinds of stemmers for Gujarati language – a
lightweight inflectional stemmer basedona hybridapproach
and heavyweight derivational stemmer based on a rule-
based approach. Kumar and Rana [3] proposed a stemmer
for Punjabi language based on the duo of brute force and
suffix stripping approaches.Gupta andLehal [11]proposeda
stemmer for noun and proper names occurring in Punjabi
language. The stemmer obtained a stem or radix of Punjabi
word and then searched in Punjabi noun and proper name
dictionary. Mishra and Chandra [12] suggested MAULIK: a
stemmer for Hindi language. The MAULIK algorithm was
used to stem the Hindi words by using hybrid technique.
Gupta [13] proposed a stemmer for verbs in Hindi language
based on suffix-stripping. The stemmer applied a technique
based on suffix removal rules in order to stem Hindi verbs.
Punjabi is a resource-scarce language with a very few
linguistic resources developed so far. In case of stemming in
Punjabi language, most of the work done so far has
concentrated onstemmingofnoun wordsandpropernames,
no stemming is required and the input verb is returned as
the output. In this paper, a rule-based approach has been
proposed for the stemming ofPunjabiverbs. Ourapproachis
similar to those of Ramanathan and Rao [6] for Hindi and
Zehurul et.al [9] for Bengali in which the suffix is stripped
from the input verb using a predefined list on the basis of
longest-match. PVS has an overall accuracy of 95.21%.
2. PROPOSED METHODOLOGY FOR STEMMING OF
PUNJABI VERBS
2.1 Categorization of verbs in Punjabi language
A verb is intended to explain states, activities or events
and the main components of predicates that form particular
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7963
sentences [13]. In the Punjabi language, verbs can be
categorized as one-word verbs or two-word verbs. For
example, is a one-word verb with as root and as
suffix. Similarly, is a two-word verb with
as root and as suffix.
2.2 Approach used
In this paper, a rule-based approach has been proposed
for the stemming of Punjabi verbs. A database containing
valid root verbs existing in the Punjabi language has been
created. The database stores 3,135 Punjabi root verbs.When
a verb is input to the system, it iscomparedwiththedatabase
of root verbs to check whether the input verbisarootverbor
an inflected verb. If it is a root verb, no stemming is required
and the input verb is returned as the output. Otherwise, the
input verb (inflected) is sent to the suffix-strippingalgorithm
to get the corresponding root verb. This algorithm strips the
suffix from the input verb on a longest-match basis. The
figure below presents a schematic diagram of PVS.
Fig -1: Schematic diagram of PVS
2.3 List of suffixes for Punjabi verbs
First, a total of 48 suffixes existing for Punjabi verbs were
identified.Then,thesesuffixeswerecategorizedintodifferent
lists on the basis of suffix length. These lists help to construct
the rules for PVS. The lists of suffixes that we have used in
stemmer are as below:
Table- 1: Suffix-list-1 having suffix length-6
Sr. No. Suffix Sr. No. Suffix
1. ਵਾਂਗੀਆ 4. ਾਾਵਾਂਗਾ
2. ਾਾਵਾਂਗੀ 5. ਉਂਦੀਆਂ
3. ਾਾਵਾਂਗੀ
Table- 2: Suffix-list-2 having suffix length-5
Sr. No. Suffix Sr. No. Suffix
1. ਉਣੀਆਂ 3. ਵਾਂਗਾ
2. ਵਾਂਗੇ 4. ਵਾਂਗੀ
Table- 3: Suffix-list-3 having suffix length-4
Sr. No. Suffix Sr. No. Suffix
1. ਉਣਾ 7. ਾਂਦਾ
2. ਉਣੀ 8. ਾਦਾ
3. ਉਣੇ 9. ਾਦੀ
4. ਓਗੇ 10. ਾਦੇ
5. ਾਂਦੇ 11. ਾੀਦਾ
6. ਾਂਦੀ
Table- 4: Suffix-list-4 having suffix length-3
Sr. No. Suffix Sr. No. Suffix
1. ਦੀਆਂ 8. ਾਾਾਂਗਾ
2. ਨੀਆਂ 9. ਾਾਾਂਗੀ
3. ਉਂਦਾ 10. ਾਾਾਂਗੇ
4. ਣੀਆਂ 11. ਵੇਗਾ
5. ਉਂਦੇ 12. ਾਂਦੀਆ
6. ਉਂਦੀ 13. ਵੇਗੀ
7. ਵੋਗੇ
Table- 5: Suffix-list-5 having suffix length-2
Sr. No. Suffix Sr. No. Suffix
1. ਦਾ 8. ਣਾ
2. ਦੀ 9. ਣੇ
3. ਦੇ 10. ਨਾ
4. ਇਆ 11. ਨੇ
5. ਦਦਆ 12. ਨੀ
6. ਉਣ 13. ਈਏ
7. ਣੀ 14. ਦਾਆ
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7964
Table- 6: Suffix-list-6 having suffix length-1
Sr. No. Suffix
1. ਏ
2.4 Proposed rules for PVS
After generating the suffix lists, we have created rules for
stemming based on the length of suffixes. We have made 48
rules for removal of suffix from the input verb. The most
appropriate rule is matched with the input verb, and the
suffix is stripped. The rules used for PVS are as follows:
Rule 1: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 2: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 3: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 4: if a word of Punjabi verb ends with ‘ , remove the
suffix ‘ ’ at the end of word.
Rule 5: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 6: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 7: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 8: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 9: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 10: if a word of Punjabi verb ends with ‘ ’,removethe
suffix ‘ ’ at the end of word.
Rule 11: if a word of Punjabi verb ends with ‘ ’,removethe
suffix ‘ ’ at the end of word.
Rule 12: if a word of Punjabi verb ends with ‘ ’, removethe
suffix ‘ ’ at the end of word.
Rule 13: if a word of Punjabi verb ends with ‘ ’, removethe
suffix ‘ ’ at the end of word.
Rule 14: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of the word.
Rule 15: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘’ at the end of the word.
Rule 16: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 17: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 18: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 19: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 20: if a word of Punjabi verb ends with ‘ ’,removethe
suffix ‘ ’ at the end of word.
Rule 21: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 22: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 23: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 24: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 25: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 26: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 27: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 28: if a word of Punjabi verb ends with ‘ ’,removethe
suffix ‘ ’ at the end of word.
Rule 29: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 30: if a word of Punjabi verb ends with ‘ ’,removethe
suffix ‘ ’ at the end of word.
Rule 31: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 32: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 33: if a word of Punjabi verb ends with ‘ ’, removethe
suffix ‘ ’ at the end of word.
Rule 34: if a word of Punjabi verb ends with ‘ ’, removethe
suffix ‘ ’ at the end of word.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7965
Rule 35: if a word of Punjabi verb ends with ‘ ’, removethe
suffix ‘ ’ at the end of word.
Rule 36: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 37: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 38: if a word of Punjabi verbendswith‘ ’,removethe
suffix ‘ ’ at the end of word.
Rule 39: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 40: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 41: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 42: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 43: if a word of Punjabi verb ends with ‘ ’, remove the
suffix ‘ ’ at the end of word.
Rule 44: if a word of Punjabi verb ends with ‘ਈਏ’, remove the
suffix ‘ਈਏ’ at the end of word.
Rule 45: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 46: if a word of Punjabi verb ends with ‘ ’, remove
the suffix ‘ ’ at the end of word.
Rule 47: if a word of Punjabi verb ends with ‘ ’, removethe
suffix ‘ ’ at the end of word.
Rule 48: if a word of Punjabi verb ends with ‘ਏ’, remove the
suffix ‘ਏ’ at the end of word.
2.5 Proposed Algorithm
Step 1: Read word from user input containing only verbs.
Step 2: Check whether the entered word is root word.
If the entered word is root word,thenprinttheword
as output.
Go to step 1.
Step 3: SUFF = Last six characters from the right side of the
word.
If SUFF exists in the suffix-list-1 (length-6), then:
Print the root word obtainedafterstripping
SUFF, go to step 1.
Step 4: SUFF = Last five characters from the right sideofthe
word.
If SUFF exists in the suffix-list-2 (length-5), then:
Print the root word obtainedafterstripping
SUFF, go to step 1.
Step 5: SUFF = Last four characters fromtherightsideofthe
word.
If SUFF exists in the suffix-list-3 (length-4), then:
Print the root word obtainedafterstripping
SUFF, go to step 1.
Step 6: SUFF = Last three characters from the right side of
the word.
If SUFF exists in the suffix-list-4 (length-3), then:
Print the root word obtainedafterstripping
SUFF, go to step 1.
Step 7: SUFF = Last twocharacters from the right sideofthe
word.
If SUFF exists in the suffix-list-5 (length-2), then:
Print the root word obtainedafterstripping
SUFF, go to step 1.
Step 8: SUFF = Last one character from the right side of the
word.
If SUFF exists in the suffix-list-6 (length-1), then:
Print the root word obtainedafterstripping
SUFF, go to step 1.
Step 9: Exit.
At step 1, the word is read from the user input which
should be a Punjabi verb. At step 2, the entered word is
searched in the root verb database, to find whether the
entered word is a root word or not. If the input word is not
present in the database, then we go to step 3. At this step, the
last six characters from the right side of the word are
extracted. If the extracted characters exist in the suffix-list-1
(the list having length of suffix equal to 6), then the matching
suffix is removed from the entered word. If the extracted
characters do not exist in suffix-list-1, then suffix-list-2 is
searched and so on until the last characters of the word do
not match with the suffix list. If matched, then the suffix is
removed from the word.
3. RESULTS AND DISCUSSION
Accuracy of the stemmer depends on the words in the
database (i.e. the list of root words) and the rules created to
remove suffixes. We havestored3,135wordsinthedatabase.
It decreases the probability of switching to thesecondoption
for removing suffixes[12].
To calculatethe accuracyofthesystem,weinput3,135words
to PVS for analysis. Among these 3,135 words, 2,985 words
have been correctly evaluated and 150 words are invalid
because they violate specific and general rules. The accuracy
of our stemmer is 95.21%.
4. CONCLUSION
Stemming produces linguistically standardized text that
helps in enhancing the results of information retrieval tasks
[1]. Our proposed stemmer works with Punjabi verbs which
uses a rule-based suffix-stripping technique to perform the
stemming of the Punjabi verbs. Punjabi is a resource-scarce
language with stemmers existing for nouns and proper
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7966
names only. PVS is the reportedly the first stemmer for the
stemming of Punjabi verbs with an overall accuracy of
95.21%. PVS can be enhanced by enabling it for the
stemming of Punjabi adverbs and adjectives. This stemmer
can contribute to many applications of informationretrieval
and natural language processing.
REFERENCES
[1] R. Puri, R. P. S. Bedi, V. Goyal, “Punjabi stemmer using
Punjabi WordNet database,” Indian journal of science
and technology, vol. 8 no. 27, Oct. 2015, pp. 1-5.
[2] G. Joshi, K. D. Garg, “Enhanced version of Punjabi
stemmer using synset,”AdvancedResearchinComputer
Science and Software Engineering, vol. 4 no. 5, 2014.
[3] D. Kumar, P. Rana, “Design and development of a
stemmer for Punjabi,”International Journal ofComputer
Applications, vol. 11 no. 12, Dec. 2010, pp. 18-23.
[4] V. Gupta, G. S. Lehal, “A survey of common stemming
techniques existing stemmers for Indian languages,”
Journal of Emerging Technologies in Web Intelligence,
vol. 5 no. 2, pp. 157-161.
[5] P. Thapar, “A hybrid approach used to stem Punjabi
words,” International Journal of Computer Science and
Mobile Computing, vol. 3 no. 11, 2014, pp. 1-9.
[6] A. Ramanathan, D. Rao, “A lightweight stemmer for
Hindi,” Proceedings of the 10th Conference of the
European Chapter of the Association for Computational
Linguistics, on Computational Linguistics for South
Asian Languages, 2003, pp. 43-48.
[7] S. Dasgupta, V.Ng,“Unsupervisedmorphological parsing
of Bengali,” Language Resources and Evaluation,vol.40,
2006, pp. 311-330.
[8] A. K. Pandey, T. J. Siddiqui, “An unsupervised Hindi
stemmer with heuristic improvements,” Proceedings of
the Second Workshop on Analytics for Noisy
Unstructured Text Data, 2008, pp. 99-105.
[9] M. Z. Islam, M. N. Uddin, M. Khan, “A light weight
stemmer for Bengali and its use in spelling checker,”
Proceedings of the First International Conference on
Digital Communication and Computer Applications
(DCCA07), Irbid, Jordan, 2007.
[10] K. Suba, D. Jiandani, P. Bhattacharyya, “Hybrid
inflectional stemmer and rule-based derivational
stemmer for Gujarati,” Proceedings of the 2nd
Workshop on South and Southeast Asian Natural
Language Processing (WSSANLP), IJCNLP 2011,
ChiangMai, Thailand, 2011, pp. 1-8.
[11] V. Gupta, G. S. Lehal, “Punjabi language stemmer for
nouns and proper names,” Proceedings of the 2nd
Workshop on South and Southeast Asian Natural
Language Processing (WSSANLP), IJCNLP 2011, Chiang
Mai, Thailand, 2011, pp. 35-39.
[12] U. Mishra, C. Prakash, “MAULIK: An effective stemmer
for Hindi language,” International Journal of Computer
Science and Engineering, vol. 4, 2012; pp. 711–717.
[13] V. Gupta, “Suffix stripping based verb stemming for
Hindi,” International Journal of Advanced Research in
Computer Science and SoftwareEngineering,vol.4no.1,
2014, pp. 179-181.

More Related Content

Similar to IRJET- A Rule-Based Stemmer for Punjabi Verbs

A Review on a web based Punjabi to English Machine Transliteration System
A Review on a web based Punjabi to English Machine Transliteration SystemA Review on a web based Punjabi to English Machine Transliteration System
A Review on a web based Punjabi to English Machine Transliteration SystemEditor IJCATR
 
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...butest
 
DESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZER DESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZER cscpconf
 
DESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZERDESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZERcsandit
 
Design of a rule based hindi lemmatizer
Design of a rule based hindi lemmatizerDesign of a rule based hindi lemmatizer
Design of a rule based hindi lemmatizercsandit
 
Comparison of stemming algorithms on Indonesian text processing
Comparison of stemming algorithms on Indonesian text processingComparison of stemming algorithms on Indonesian text processing
Comparison of stemming algorithms on Indonesian text processingTELKOMNIKA JOURNAL
 
Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...
Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...
Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...Waqas Tariq
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) inventionjournals
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...
A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...
A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...IRJET Journal
 
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language ModelsIRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language ModelsIRJET Journal
 
IRJET- Spelling and Grammar Checker and Template Suggestion
IRJET- Spelling and Grammar Checker and Template SuggestionIRJET- Spelling and Grammar Checker and Template Suggestion
IRJET- Spelling and Grammar Checker and Template SuggestionIRJET Journal
 
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...ijnlc
 
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...iosrjce
 
Towards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian LanguagesTowards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian LanguagesAlgoscale Technologies Inc.
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodijnlc
 
A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES
A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES
A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES ijnlc
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionaryiosrjce
 
Chinese Word Segmentation in MSR-NLP
Chinese Word Segmentation in MSR-NLPChinese Word Segmentation in MSR-NLP
Chinese Word Segmentation in MSR-NLPAndi Wu
 

Similar to IRJET- A Rule-Based Stemmer for Punjabi Verbs (20)

A Review on a web based Punjabi to English Machine Transliteration System
A Review on a web based Punjabi to English Machine Transliteration SystemA Review on a web based Punjabi to English Machine Transliteration System
A Review on a web based Punjabi to English Machine Transliteration System
 
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
 
DESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZER DESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZER
 
DESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZERDESIGN OF A RULE BASED HINDI LEMMATIZER
DESIGN OF A RULE BASED HINDI LEMMATIZER
 
Design of a rule based hindi lemmatizer
Design of a rule based hindi lemmatizerDesign of a rule based hindi lemmatizer
Design of a rule based hindi lemmatizer
 
Comparison of stemming algorithms on Indonesian text processing
Comparison of stemming algorithms on Indonesian text processingComparison of stemming algorithms on Indonesian text processing
Comparison of stemming algorithms on Indonesian text processing
 
Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...
Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...
Implementation of Enhanced Parts-of-Speech Based Rules for English to Telugu ...
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...
A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...
A Novel Method for An Intelligent Based Voice Meeting System Using Machine Le...
 
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language ModelsIRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
 
IRJET- Spelling and Grammar Checker and Template Suggestion
IRJET- Spelling and Grammar Checker and Template SuggestionIRJET- Spelling and Grammar Checker and Template Suggestion
IRJET- Spelling and Grammar Checker and Template Suggestion
 
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
 
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
 
Towards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian LanguagesTowards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian Languages
 
A Proposition Bank of Urdu
A Proposition Bank of UrduA Proposition Bank of Urdu
A Proposition Bank of Urdu
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming method
 
A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES
A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES
A COMPREHENSIVE ANALYSIS OF STEMMERS AVAILABLE FOR INDIC LANGUAGES
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionary
 
Chinese Word Segmentation in MSR-NLP
Chinese Word Segmentation in MSR-NLPChinese Word Segmentation in MSR-NLP
Chinese Word Segmentation in MSR-NLP
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 

Recently uploaded (20)

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 

IRJET- A Rule-Based Stemmer for Punjabi Verbs

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7962 A Rule-Based Stemmer for Punjabi Verbs Prabhjot Kaur1, Preetpal Kaur Buttar2 1M. Tech. Research Scholar, Department of Computer Science and Engineering, Sant Longowal Institute of Engineering and Technology, Longowal, Sangrur - 148106, Punjab, India 2Assistant Professor, Department of Computer Science and Engineering, Sant Longowal Institute of Engineering and Technology, Longowal, Sangrur - 148106, Punjab, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract – This paper proposes a rule-based stemmer for the stemming of verbs in Punjabi language. The proposed Punjabi Verb Stemmer (PVS) uses a rule-based approach for the stemming of Punjabi verbs. A database containing valid root verbs occurring in thePunjabilanguagehasbeencreated. The database stores 3,135 Punjabi root verbs. When a verb is input to the system, it is first compared with the database of root verbs to discover whether the input verb is a root verb or an inflected verb. If the input verb is a root verb, no stemming is required and the input verb is returned as the output. Otherwise, the input verb (inflected) is sent to the suffix stripping algorithm to get the corresponding root verb which uses a predefined suffix list. Punjabi is a resource-scarce language with a very few linguistic resourcesdevelopedsofar. In case of stemming in Punjabi language, most of the work done so far has concentrated on stemming of noun words and proper names, no published work for the stemming of Punjabi verbs has been reported. The proposed PVS has an overall accuracy of 95.21%. This stemmer can contribute to many applications of natural language processing and text mining. Key Words: Stemming, Stemmer, Suffix Stripping, Verbs, Punjabi 1. INTRODUCTION Stemming is a pre-processing task in the information retrieval systems[1], which reduces an inflected word to its stem, root or base form [2, 3]. For example, a stemming algorithm reduces the words “discussion”, “discussing” and “discussed” to the stem “discuss”. Stemming is used in the information retrieval tasks to improve system performance by enhancing the ability to match document queries[3]. For example, when a user enters a query word ‘applying’, s/he may also want to retrieve the documents that contain the word ‘apply’ and ‘applied’ as well [4]. Stemming is used to improve the efficiency of various search engines to get the results [5]. Various methods have been used for stemming such as stochastic algorithm, brute force algorithm, suffix stripping algorithm, n-gram analysis, lemmatizationalgorithm,hybrid approaches, etc. These methods differ in terms of accuracy and performance[3]. Ramanathan and Rao [6] proposed a lightweight stemmer for Hindi language. Their proposed stemmer conflates the similar words using suffix removal. The suffix lists for the stemmer were created manually. Dasgupta and Ng [7] proposed an algorithm for the unsupervised morphological parsing of Bengali language by segmenting the words into prefixes, suffixes, and stems. Pandey and Siddiqui [8] suggested an unsupervised stemming algorithm for Hindi language. The algorithm was based on division technique. Zehurul et.al[9] proposed a lightweightstemmerforBengali language based on a predefined suffixlistwhichremovesthe suffix of a word on a longest-match basis. Suba et.al [10] suggested two kinds of stemmers for Gujarati language – a lightweight inflectional stemmer basedona hybridapproach and heavyweight derivational stemmer based on a rule- based approach. Kumar and Rana [3] proposed a stemmer for Punjabi language based on the duo of brute force and suffix stripping approaches.Gupta andLehal [11]proposeda stemmer for noun and proper names occurring in Punjabi language. The stemmer obtained a stem or radix of Punjabi word and then searched in Punjabi noun and proper name dictionary. Mishra and Chandra [12] suggested MAULIK: a stemmer for Hindi language. The MAULIK algorithm was used to stem the Hindi words by using hybrid technique. Gupta [13] proposed a stemmer for verbs in Hindi language based on suffix-stripping. The stemmer applied a technique based on suffix removal rules in order to stem Hindi verbs. Punjabi is a resource-scarce language with a very few linguistic resources developed so far. In case of stemming in Punjabi language, most of the work done so far has concentrated onstemmingofnoun wordsandpropernames, no stemming is required and the input verb is returned as the output. In this paper, a rule-based approach has been proposed for the stemming ofPunjabiverbs. Ourapproachis similar to those of Ramanathan and Rao [6] for Hindi and Zehurul et.al [9] for Bengali in which the suffix is stripped from the input verb using a predefined list on the basis of longest-match. PVS has an overall accuracy of 95.21%. 2. PROPOSED METHODOLOGY FOR STEMMING OF PUNJABI VERBS 2.1 Categorization of verbs in Punjabi language A verb is intended to explain states, activities or events and the main components of predicates that form particular
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7963 sentences [13]. In the Punjabi language, verbs can be categorized as one-word verbs or two-word verbs. For example, is a one-word verb with as root and as suffix. Similarly, is a two-word verb with as root and as suffix. 2.2 Approach used In this paper, a rule-based approach has been proposed for the stemming of Punjabi verbs. A database containing valid root verbs existing in the Punjabi language has been created. The database stores 3,135 Punjabi root verbs.When a verb is input to the system, it iscomparedwiththedatabase of root verbs to check whether the input verbisarootverbor an inflected verb. If it is a root verb, no stemming is required and the input verb is returned as the output. Otherwise, the input verb (inflected) is sent to the suffix-strippingalgorithm to get the corresponding root verb. This algorithm strips the suffix from the input verb on a longest-match basis. The figure below presents a schematic diagram of PVS. Fig -1: Schematic diagram of PVS 2.3 List of suffixes for Punjabi verbs First, a total of 48 suffixes existing for Punjabi verbs were identified.Then,thesesuffixeswerecategorizedintodifferent lists on the basis of suffix length. These lists help to construct the rules for PVS. The lists of suffixes that we have used in stemmer are as below: Table- 1: Suffix-list-1 having suffix length-6 Sr. No. Suffix Sr. No. Suffix 1. ਵਾਂਗੀਆ 4. ਾਾਵਾਂਗਾ 2. ਾਾਵਾਂਗੀ 5. ਉਂਦੀਆਂ 3. ਾਾਵਾਂਗੀ Table- 2: Suffix-list-2 having suffix length-5 Sr. No. Suffix Sr. No. Suffix 1. ਉਣੀਆਂ 3. ਵਾਂਗਾ 2. ਵਾਂਗੇ 4. ਵਾਂਗੀ Table- 3: Suffix-list-3 having suffix length-4 Sr. No. Suffix Sr. No. Suffix 1. ਉਣਾ 7. ਾਂਦਾ 2. ਉਣੀ 8. ਾਦਾ 3. ਉਣੇ 9. ਾਦੀ 4. ਓਗੇ 10. ਾਦੇ 5. ਾਂਦੇ 11. ਾੀਦਾ 6. ਾਂਦੀ Table- 4: Suffix-list-4 having suffix length-3 Sr. No. Suffix Sr. No. Suffix 1. ਦੀਆਂ 8. ਾਾਾਂਗਾ 2. ਨੀਆਂ 9. ਾਾਾਂਗੀ 3. ਉਂਦਾ 10. ਾਾਾਂਗੇ 4. ਣੀਆਂ 11. ਵੇਗਾ 5. ਉਂਦੇ 12. ਾਂਦੀਆ 6. ਉਂਦੀ 13. ਵੇਗੀ 7. ਵੋਗੇ Table- 5: Suffix-list-5 having suffix length-2 Sr. No. Suffix Sr. No. Suffix 1. ਦਾ 8. ਣਾ 2. ਦੀ 9. ਣੇ 3. ਦੇ 10. ਨਾ 4. ਇਆ 11. ਨੇ 5. ਦਦਆ 12. ਨੀ 6. ਉਣ 13. ਈਏ 7. ਣੀ 14. ਦਾਆ
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7964 Table- 6: Suffix-list-6 having suffix length-1 Sr. No. Suffix 1. ਏ 2.4 Proposed rules for PVS After generating the suffix lists, we have created rules for stemming based on the length of suffixes. We have made 48 rules for removal of suffix from the input verb. The most appropriate rule is matched with the input verb, and the suffix is stripped. The rules used for PVS are as follows: Rule 1: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 2: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 3: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 4: if a word of Punjabi verb ends with ‘ , remove the suffix ‘ ’ at the end of word. Rule 5: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 6: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 7: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 8: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 9: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 10: if a word of Punjabi verb ends with ‘ ’,removethe suffix ‘ ’ at the end of word. Rule 11: if a word of Punjabi verb ends with ‘ ’,removethe suffix ‘ ’ at the end of word. Rule 12: if a word of Punjabi verb ends with ‘ ’, removethe suffix ‘ ’ at the end of word. Rule 13: if a word of Punjabi verb ends with ‘ ’, removethe suffix ‘ ’ at the end of word. Rule 14: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of the word. Rule 15: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘’ at the end of the word. Rule 16: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 17: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 18: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 19: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 20: if a word of Punjabi verb ends with ‘ ’,removethe suffix ‘ ’ at the end of word. Rule 21: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 22: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 23: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 24: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 25: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 26: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 27: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 28: if a word of Punjabi verb ends with ‘ ’,removethe suffix ‘ ’ at the end of word. Rule 29: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 30: if a word of Punjabi verb ends with ‘ ’,removethe suffix ‘ ’ at the end of word. Rule 31: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 32: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 33: if a word of Punjabi verb ends with ‘ ’, removethe suffix ‘ ’ at the end of word. Rule 34: if a word of Punjabi verb ends with ‘ ’, removethe suffix ‘ ’ at the end of word.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7965 Rule 35: if a word of Punjabi verb ends with ‘ ’, removethe suffix ‘ ’ at the end of word. Rule 36: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 37: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 38: if a word of Punjabi verbendswith‘ ’,removethe suffix ‘ ’ at the end of word. Rule 39: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 40: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 41: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 42: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 43: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 44: if a word of Punjabi verb ends with ‘ਈਏ’, remove the suffix ‘ਈਏ’ at the end of word. Rule 45: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 46: if a word of Punjabi verb ends with ‘ ’, remove the suffix ‘ ’ at the end of word. Rule 47: if a word of Punjabi verb ends with ‘ ’, removethe suffix ‘ ’ at the end of word. Rule 48: if a word of Punjabi verb ends with ‘ਏ’, remove the suffix ‘ਏ’ at the end of word. 2.5 Proposed Algorithm Step 1: Read word from user input containing only verbs. Step 2: Check whether the entered word is root word. If the entered word is root word,thenprinttheword as output. Go to step 1. Step 3: SUFF = Last six characters from the right side of the word. If SUFF exists in the suffix-list-1 (length-6), then: Print the root word obtainedafterstripping SUFF, go to step 1. Step 4: SUFF = Last five characters from the right sideofthe word. If SUFF exists in the suffix-list-2 (length-5), then: Print the root word obtainedafterstripping SUFF, go to step 1. Step 5: SUFF = Last four characters fromtherightsideofthe word. If SUFF exists in the suffix-list-3 (length-4), then: Print the root word obtainedafterstripping SUFF, go to step 1. Step 6: SUFF = Last three characters from the right side of the word. If SUFF exists in the suffix-list-4 (length-3), then: Print the root word obtainedafterstripping SUFF, go to step 1. Step 7: SUFF = Last twocharacters from the right sideofthe word. If SUFF exists in the suffix-list-5 (length-2), then: Print the root word obtainedafterstripping SUFF, go to step 1. Step 8: SUFF = Last one character from the right side of the word. If SUFF exists in the suffix-list-6 (length-1), then: Print the root word obtainedafterstripping SUFF, go to step 1. Step 9: Exit. At step 1, the word is read from the user input which should be a Punjabi verb. At step 2, the entered word is searched in the root verb database, to find whether the entered word is a root word or not. If the input word is not present in the database, then we go to step 3. At this step, the last six characters from the right side of the word are extracted. If the extracted characters exist in the suffix-list-1 (the list having length of suffix equal to 6), then the matching suffix is removed from the entered word. If the extracted characters do not exist in suffix-list-1, then suffix-list-2 is searched and so on until the last characters of the word do not match with the suffix list. If matched, then the suffix is removed from the word. 3. RESULTS AND DISCUSSION Accuracy of the stemmer depends on the words in the database (i.e. the list of root words) and the rules created to remove suffixes. We havestored3,135wordsinthedatabase. It decreases the probability of switching to thesecondoption for removing suffixes[12]. To calculatethe accuracyofthesystem,weinput3,135words to PVS for analysis. Among these 3,135 words, 2,985 words have been correctly evaluated and 150 words are invalid because they violate specific and general rules. The accuracy of our stemmer is 95.21%. 4. CONCLUSION Stemming produces linguistically standardized text that helps in enhancing the results of information retrieval tasks [1]. Our proposed stemmer works with Punjabi verbs which uses a rule-based suffix-stripping technique to perform the stemming of the Punjabi verbs. Punjabi is a resource-scarce language with stemmers existing for nouns and proper
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7966 names only. PVS is the reportedly the first stemmer for the stemming of Punjabi verbs with an overall accuracy of 95.21%. PVS can be enhanced by enabling it for the stemming of Punjabi adverbs and adjectives. This stemmer can contribute to many applications of informationretrieval and natural language processing. REFERENCES [1] R. Puri, R. P. S. Bedi, V. Goyal, “Punjabi stemmer using Punjabi WordNet database,” Indian journal of science and technology, vol. 8 no. 27, Oct. 2015, pp. 1-5. [2] G. Joshi, K. D. Garg, “Enhanced version of Punjabi stemmer using synset,”AdvancedResearchinComputer Science and Software Engineering, vol. 4 no. 5, 2014. [3] D. Kumar, P. Rana, “Design and development of a stemmer for Punjabi,”International Journal ofComputer Applications, vol. 11 no. 12, Dec. 2010, pp. 18-23. [4] V. Gupta, G. S. Lehal, “A survey of common stemming techniques existing stemmers for Indian languages,” Journal of Emerging Technologies in Web Intelligence, vol. 5 no. 2, pp. 157-161. [5] P. Thapar, “A hybrid approach used to stem Punjabi words,” International Journal of Computer Science and Mobile Computing, vol. 3 no. 11, 2014, pp. 1-9. [6] A. Ramanathan, D. Rao, “A lightweight stemmer for Hindi,” Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics, on Computational Linguistics for South Asian Languages, 2003, pp. 43-48. [7] S. Dasgupta, V.Ng,“Unsupervisedmorphological parsing of Bengali,” Language Resources and Evaluation,vol.40, 2006, pp. 311-330. [8] A. K. Pandey, T. J. Siddiqui, “An unsupervised Hindi stemmer with heuristic improvements,” Proceedings of the Second Workshop on Analytics for Noisy Unstructured Text Data, 2008, pp. 99-105. [9] M. Z. Islam, M. N. Uddin, M. Khan, “A light weight stemmer for Bengali and its use in spelling checker,” Proceedings of the First International Conference on Digital Communication and Computer Applications (DCCA07), Irbid, Jordan, 2007. [10] K. Suba, D. Jiandani, P. Bhattacharyya, “Hybrid inflectional stemmer and rule-based derivational stemmer for Gujarati,” Proceedings of the 2nd Workshop on South and Southeast Asian Natural Language Processing (WSSANLP), IJCNLP 2011, ChiangMai, Thailand, 2011, pp. 1-8. [11] V. Gupta, G. S. Lehal, “Punjabi language stemmer for nouns and proper names,” Proceedings of the 2nd Workshop on South and Southeast Asian Natural Language Processing (WSSANLP), IJCNLP 2011, Chiang Mai, Thailand, 2011, pp. 35-39. [12] U. Mishra, C. Prakash, “MAULIK: An effective stemmer for Hindi language,” International Journal of Computer Science and Engineering, vol. 4, 2012; pp. 711–717. [13] V. Gupta, “Suffix stripping based verb stemming for Hindi,” International Journal of Advanced Research in Computer Science and SoftwareEngineering,vol.4no.1, 2014, pp. 179-181.