SlideShare a Scribd company logo
1 of 4
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 170
CONTEXT BASED LEXICAL SIMPLIFICATION
Piyush Pandey1, Tejas Patil2, Tanvi Sutar3, Ankita Ugale4
*Dr S.F. Sayyad, Computer Dept. .AISSMS College of Engineering, Pune, Maharashtra. India
1Piyush Pandey. AISSMS College of Engineering, Pune, Maharashtra. India
2Tejas Patil., AISSMS College of Engineering, Pune, Maharashtra. India
3Tanvi Sutar, AISSMS College of Engineering, Pune, Maharashtra. India
4Ankita Ugale, AISSMS College of Engineering, Pune, Maharashtra. India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - It's tough to decipher text that contains
sophisticated and infrequently used vocabulary. Thus, it
calls for a mechanism or tool for simplification of the text
without changing its meaning. Thus, the motive of this
paper is to facilitate non-native (English) language
speakers[2], people with dyslexia and children[1] to better
understand textual information. For the accomplishment
of such a simplification, the approach that is the most
appropriate and favored is natural language processing.
This task is broadly divided into 3 significant inter-
dependent steps which perform identification of complex
words[3], generation of appropriate synonyms for the
complex words, filtering through all synonyms to find the
best substitute and finally the substitution of the
generated simpler alternative into the sentence
Key Words: Lexical simplification, BERT,
Unsupervised, Pretrained language model.
1. INTRODUCTION
The main objective of lexical simplification (LS) is to
replace complex words with simpler equivalents in
order to aid non-native speakers, small kids and
specially abled people in comprehending textual data.
Lexical simplification can help people in an effective way,
because research shows that people can understand the
text, in spite of the confusing grammatical context, if they
are familiar with the words used. Complex word
identification (CWI)[3], substitution generation (SG), and
substitute filtering and ranking are all part of the LS
framework (SFR).
Existing LS systems use handmade word databases (e.g.
Wordnet)[4] or para databases to replace complex
words with simpler ones using a set of rules. A technique
called word embedding is being heavily used now-a-days
to generate substitutes for words. These models produce
nearly ten synonyms for the given term. Current state-of-
the-art model, REC-LS [5], makes use of both word
databases and embedding models. Though the accuracy
of generated substitutes is very high, there is a drawback
in this model’s approach. Due to the little importance
given to the context, the model cannot produce
substitutes which are meaningful in the given context.
This disadvantage of the model, eventually leads to more
confusion and has a larger possibility of causing
misinterpretation.
Fig -1: Comparison of substitutes generated by Our
model, Glavas and REC-LS model
Context plays a vital role in substitute generation. Fig - 1
depicts a scenario and helps to back the claim. In the
figure, Fig -1, substitutes generated by three different
models are considered. Models under consideration are
LS-Bert, Glavas[6] and RES-LS. Top three substitutes
generated by the models for the given complex word can
be seen in the figure. Though substitutes generated by
Glavas[6] and REC-LS are quite accurate synonyms for
the given complex word, the generated synonyms do not
fit well with the sentence. They fail to convey the
sentence's correct meaning. On the contrary, the
substitutes generated by LS-Bert are not only accurate
alternatives for the given complex word considered
independently but also perfectly fits into the sentence
and does not create any ambiguity and lowers the
chances of misinterpretation.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 171
Because word complexity is context-dependent, LSBert
uses Bidirectional long-short term memory a.k.a Bi-
LSTM to recognise complicated words. The Bert Model
masks the difficult words that have been identified.
Then, both the sentences, with masked and unmasked
complex words, are concatenated and fed to the Bert
model. The Bert model then produces multiple
substitutes for the masked words. To assure
grammaticality and meaning equivalence to the original
sentence in the output, five high-quality features are
employed to rate the substitutions: the frequency of
words and their similarity, the ordering of prediction,
language models which are based on Bert model and the
paraphrase database PPDB. LS-Bert uses a recursive
approach to simplify the sentence. It considers one
complex word at a time and iterates until all complex
words in the sentence are simplified.
2. LITERATURE REVIEW
Previously, lexical simplification tasks were performed
by analyzers. [8]Practical Simplification of English
Text(PSET) was one of the models studied, and the
system was broadly divided into two components: An
analyzer component that performs the syntactic analysis
and disambiguation of text and a simplify component
that eventually takes the output of the analyzer and
performs the simplification. The analyzer has 3 main
components: lexical tagger, morphological analyzer,
parser. The simplifier consists of two components:
lexical simplifier and syntactic simplifier. This approach
to text simplification was quite primitive and had several
flaws. Syntactic simplifiers replaced a big sentence with
multiple smaller sentences for better understandability.
But it had an adverse effect on the length of the
sentences and number of words being used. When used
alone, several of the words become confusing. Thus such
a simple approach to simplification was not sufficient to
counter ambiguity issues. Due to the simplicity of the
system, it fails to incorporate the context of the sentence
in the lexical simplification process.
Supervised text-data simplification systems are utilized
in recently developed text simplification models. The
availability of parallel sentences dataset is critical to the
performance of such systems. Resources for such
parallel sentences are available but they contain a lot of
inaccurate data which eventually makes this approach
inefficient.
Rule-based[9][10][11], in which each rule contains a
difficult word and its related synonyms, is another
prominent lexical simplification strategy. In a rule-based
approach, synonyms are identified from WordNet or a
linguistic dataset that consists of predefined complex
words and their simpler counterparts. However, rule-
based systems have a key drawback in that it is hard to
provide all possible simplification rules for each word.
To avoid such a dependence on parallel corpora or word
to synonym mapping datasets, [5] Recursive Context-
Aware Lexical Simplification uses this approach where
LS systems started using word embeddings. In this
approach, the top 10 words were extracted whose
vectors were closer to that of the complex word’s vector
in terms of cosine similarity.
Following a review of existing simplification models
ranging from rule-based to embedding-based, the most
significant flaw discovered was that the system
generated a large number of substitute candidates,
making further processing, i.e. determining the best fit
substitute, extremely difficult and labor-intensive. This is
because these systems either completely ignored the
context or treated it with less significance.
Thus, to eliminate these drawbacks, the context-based
lexical simplification model is proposed. This model
leverages the advanced capacity of BERT to generate
substitutes which are accurate and in accordance with
the context of the sentence. An LS framework
incorporating complicated word identification,
substitution generations, and substitute ranking is
proposed in this research, which is based on the LSBert-
simple framework of lexical simplification. The
framework can simplify one sentence recursively.
3. LEXICAL SIMPLIFICATION PIPELINE
We detail each phase of our lexical simplification
architecture in this section, which comprises the three
steps of complex word identification (CWI), substitution
generation (SG), filtering, and substitute ranking (see
Figure 2). (SR). Our model iteratively simplifies the text
by simplifying one complicated word at a time. Each
stage will be described in full below.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 172
Fig -2: Overview of the lexical simplification framework.
3.1. Complex Word Identification
The goal of identifying complex words from a
single sentence has been explored for years, with the
purpose of determining which words in a sentence
should be simplified.
The CWI was framed as a sequence labeling task,
and a SEQ based on bi-directional long short-term
memory units (BiLSTM) was trained to predict the
binary complexity of words as marked in the dataset.
The SEQ model, in contrast to the other CWI models, has
two advantages: it takes word context into consideration
and it helps eliminate the need for substantial feature
engineering because it just uses word embeddings as
input information.
The SEQ method assigns a lexical complexity
score (p) to each word, indicating the chance that it
belongs to the complicated class. If the lexical complexity
of a word exceeds a set threshold, it is classified as a
complicated word. In Figure 2, for example, the example
"John produced these verses" is shown. The two words
"assembled" (with p = 0.55) and "verses" (with p = 0.76)
will be the challenging words to be simplified if the
complexity threshold is set to 0.5. To simplify, our model
begins with the word "verses" and the highest p value
above the predefined threshold. We'll recalculate the
complexity of each word in the sentence when we've
completed the simplification process, eliminating terms
that have been simplified. Furthermore, by conducting
named entity identification on the phrase, we eliminate
the simplification of entity words.
3.2. Substitute Generation
The goal of substitue generation (SG) is to generate
substitute candidates for the difficult word w given a
sentence S and a complex word w. Using the pre-trained
language model Bert, our model prefers to produce
substitution candidates for the complicated term. We
give a quick overview of the Bert model before
describing how we use it to perform lexical
simplification.
Bert [7] is a self-supervised technique to deep
transformer encoder training involving two training
shots on goal: masked language modeling (MLM) and the
next sentence prediction (NSP). Unlike classical language
modeling, which predicts the next word in a series based
on its history, MLM predicts missing tokens in a
sequence based on its left and right context. Bert
completes the NSP task by appending a special
categorization token, [CLS], to each sentence and
combining sentences with a special separator token,
[SEP]. The last hidden state relating to the [CLS] token is
used as the total sequence representation from which we
estimate a label for classification tasks or may be ignored
otherwise.
To conceal the difficult word w in a phrase S, we use
the special symbol "[MASK]" as a new sequence S'. If we
feed S' directly into MLM, the probability of the
vocabulary p(·|S’{ti}) corresponding to the complex
word w solely takes the context into account, regardless
of the influence of the complex word w. Bert is skilled at
dealing with sentence pairs as a result of the NSP task he
has chosen. We concatenate the original sequences S and
S' to form a sentence pair, and then input the sentence
pair (S, S') into the Bert to extract the probability
distribution of the vocabulary p (·|S, S’{w})
corresponding to the mask word. As a result, the higher
probability words in p(·|S, S’{w}) corresponding to the
mask word takes into consideration not just the complex
word itself, as well as the context of the sophisticated
word. Ultimately, after excluding semantic derivations,
we select the top ten words of p(·|S, S’{w}) as
substitution alternatives. Furthermore, because the
difficult word's contextual information is applied twice,
we randomly mask a fixed number of words in S
omitting w to suitably reduce the influence of contextual
information.
3.3. Filtering and Substitute Ranking
The replacement ranking of the lexical simplification
framework identifies which substitutions are the
simplest in the context of a complex word. C = c1, c2, …
cn, where n is the number of replacement candidates
[14].
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 173
To eliminate some complex options, our approach
employs threshold-based filtering. We utilise the Zipf
frequency [15], which is the base-10 logarithm of the
number of times it appears per billion words, to rank the
replacement word possibilities. The higher the value of a
term for a person, the more common or familiar it is. Our
model then computes various ranks depending on their
scores for each of the criteria. After obtaining all of the
rankings for each feature, our model scores each
candidate by average all of its rankings. Finally, as the
best substitution, we select the candidate with the
greatest ranking.
4. CONCLUSIONS
In this paper, we provide a text simplification system
that simplifies text based on its context. In the next three
steps, we summarize the paper's key contribution:
1. The SEQ model, which is more optimal than the
classic identification model, is used to identify the
complicated word.
2. It creates alternatives for the current complicated
word, for which it employs Bert's pre-trained model.
Bert is enhanced by two training objectives: MLM
(masked language modeling) and NSP (next
sentence prediction) are two training objectives
that improve Bert's contextual accuracy.
3. It chooses and ranks substitution choices based on
the output's grammatically and contextual fit.
In short, our system makes use of the context and
consider it as one of the most important parameters for
simplification which in turn, makes our proposed system
more accurate in all three important steps i.e., Complex
Word Identification, Substitute Generation, filtering and
substitute ranking in lexical simplification.
5. REFERENCES
[1] J. De Belder, M.-F. Moens, Text simplification for
children, In Proceedings of the 2010 SIGIR
Workshop on Accessible Search Systems (2010)
[2] G. H. Paetzold, L. Specia, Unsupervised lexical
simplification for non- native speakers., in: AAAI,
2016
[3] G. Paetzold, L. Specia, Lexical simplification with
neural ranking, in: ACL: Volume 2, Short Papers,
2017.
[4] S. Devlin, J. Tait, The use of a psycholinguistic
database in the simplification of text for aphasic
readers, Linguistic Databases 1 (1998) 161173.
[5] S. Nisioi, S. Stajner, S. P. Ponzetto, L. P. Dinu,
Exploring neural text simplification models, in: ACL,
Vol. 2, 2017.
[6] G. Glavas, S. Stajner, Simplifying lexical
simplification: do we need simplified corpora?, in:
ACL, 2015.
[7] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert:
Pre-training of deep bidirectional transformers for
language understanding, arXiv preprint
arXiv:1810.04805.
[8] John Caroll, Guido Minnen, Yvonne Canning ,
Siobhan Devlin, and John Tait- Practical
Simplification of English newspaper Text to assist
Aphasic Readers.
[9] E. Pavlick, C. Callison-Burch, Simple ppdb: A
paraphrase database for simplification, in: ACL:
Volume 2, Short Papers, 2016.
[10] M. Lesk, Automatic sense disambiguation using
machine readable dictionaries: How to tell a pine
cone from an ice cream cone, in: Proceedings of the
5th Annual International Conference on Systems
Documentation, 1986.
[11] M. Maddela, W. Xu, A word-complexity lexicon and a
neural readability ranking model for lexical
simplification, in: EMNLP, 2018.
[12] Advaith Siddharthan. 2006. Syntactic Simplification
and Text Cohesion. Research on Language and
Computation.
[13] Xingxing Zhang and Mirella Lapata. 2017. Sentence
Simplification with Deep Reinforcement Learning.
In Proceedings of the 2017 Conference on Empirical
Methods in Natural Language Processing.
[14] G. H. Paetzold, L. Specia, A survey on lexical
simplification, in: Journal of Artificial Intelligence
Research.
[15] M. Brysbaert, B. New, Moving beyond kucera and
francis: A critical evaluation of current word
frequency norms and the introduction of a new and
improved word frequency measure for american
english, Behavior Research Methods.

More Related Content

Similar to CONTEXT BASED LEXICAL SIMPLIFICATION

Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering SystemKnowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering SystemIRJET Journal
 
Free Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence TaggingFree Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence TaggingIRJET Journal
 
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical ApproachIRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical ApproachIRJET Journal
 
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word EmbeddingIRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word EmbeddingIRJET Journal
 
IRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between SentencesIRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between SentencesIRJET Journal
 
Semantic Similarity Between Sentences
Semantic Similarity Between SentencesSemantic Similarity Between Sentences
Semantic Similarity Between SentencesIRJET Journal
 
Reviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clusteringReviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clusteringIRJET Journal
 
Semantic similarity measurement- A theoretical study of various approaches
Semantic similarity measurement- A theoretical study of various approachesSemantic similarity measurement- A theoretical study of various approaches
Semantic similarity measurement- A theoretical study of various approachesIRJET Journal
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMIRJET Journal
 
Survey on Natural Language Generation
Survey on Natural Language GenerationSurvey on Natural Language Generation
Survey on Natural Language Generationijtsrd
 
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESGENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESijnlc
 
Coverage-Criteria-for-Testing-SQL-Queries
Coverage-Criteria-for-Testing-SQL-QueriesCoverage-Criteria-for-Testing-SQL-Queries
Coverage-Criteria-for-Testing-SQL-QueriesMohamed Reda
 
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
Named Entity Recognition (NER) Using Automatic Summarization of ResumesNamed Entity Recognition (NER) Using Automatic Summarization of Resumes
Named Entity Recognition (NER) Using Automatic Summarization of ResumesIRJET Journal
 
Hindi language as a graphical user interface to relational database for tran...
Hindi language as a graphical user interface to relational  database for tran...Hindi language as a graphical user interface to relational  database for tran...
Hindi language as a graphical user interface to relational database for tran...IRJET Journal
 
Automatic customer review summarization using deep learningbased hybrid senti...
Automatic customer review summarization using deep learningbased hybrid senti...Automatic customer review summarization using deep learningbased hybrid senti...
Automatic customer review summarization using deep learningbased hybrid senti...IJECEIAES
 
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...sunda2011
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES cscpconf
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIEScsandit
 
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiesEnhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiescsandit
 

Similar to CONTEXT BASED LEXICAL SIMPLIFICATION (20)

Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering SystemKnowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
 
Free Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence TaggingFree Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence Tagging
 
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical ApproachIRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
 
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word EmbeddingIRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
 
IRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between SentencesIRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between Sentences
 
Semantic Similarity Between Sentences
Semantic Similarity Between SentencesSemantic Similarity Between Sentences
Semantic Similarity Between Sentences
 
Reviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clusteringReviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clustering
 
arabic.pdf
arabic.pdfarabic.pdf
arabic.pdf
 
Semantic similarity measurement- A theoretical study of various approaches
Semantic similarity measurement- A theoretical study of various approachesSemantic similarity measurement- A theoretical study of various approaches
Semantic similarity measurement- A theoretical study of various approaches
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
 
Survey on Natural Language Generation
Survey on Natural Language GenerationSurvey on Natural Language Generation
Survey on Natural Language Generation
 
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESGENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
 
Coverage-Criteria-for-Testing-SQL-Queries
Coverage-Criteria-for-Testing-SQL-QueriesCoverage-Criteria-for-Testing-SQL-Queries
Coverage-Criteria-for-Testing-SQL-Queries
 
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
Named Entity Recognition (NER) Using Automatic Summarization of ResumesNamed Entity Recognition (NER) Using Automatic Summarization of Resumes
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
 
Hindi language as a graphical user interface to relational database for tran...
Hindi language as a graphical user interface to relational  database for tran...Hindi language as a graphical user interface to relational  database for tran...
Hindi language as a graphical user interface to relational database for tran...
 
Automatic customer review summarization using deep learningbased hybrid senti...
Automatic customer review summarization using deep learningbased hybrid senti...Automatic customer review summarization using deep learningbased hybrid senti...
Automatic customer review summarization using deep learningbased hybrid senti...
 
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
 
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiesEnhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologies
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwaitjaanualu31
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementDr. Deepak Mudgal
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxpritamlangde
 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxNANDHAKUMARA10
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxSCMS School of Architecture
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network DevicesChandrakantDivate1
 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxMustafa Ahmed
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...Amil baba
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdfKamal Acharya
 
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessorAshwiniTodkar4
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesLinux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesRashidFaridChishti
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdfAldoGarca30
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)ChandrakantDivate1
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesChandrakantDivate1
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...ppkakm
 

Recently uploaded (20)

Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesLinux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)
 
Signal Processing and Linear System Analysis
Signal Processing and Linear System AnalysisSignal Processing and Linear System Analysis
Signal Processing and Linear System Analysis
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 

CONTEXT BASED LEXICAL SIMPLIFICATION

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 170 CONTEXT BASED LEXICAL SIMPLIFICATION Piyush Pandey1, Tejas Patil2, Tanvi Sutar3, Ankita Ugale4 *Dr S.F. Sayyad, Computer Dept. .AISSMS College of Engineering, Pune, Maharashtra. India 1Piyush Pandey. AISSMS College of Engineering, Pune, Maharashtra. India 2Tejas Patil., AISSMS College of Engineering, Pune, Maharashtra. India 3Tanvi Sutar, AISSMS College of Engineering, Pune, Maharashtra. India 4Ankita Ugale, AISSMS College of Engineering, Pune, Maharashtra. India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - It's tough to decipher text that contains sophisticated and infrequently used vocabulary. Thus, it calls for a mechanism or tool for simplification of the text without changing its meaning. Thus, the motive of this paper is to facilitate non-native (English) language speakers[2], people with dyslexia and children[1] to better understand textual information. For the accomplishment of such a simplification, the approach that is the most appropriate and favored is natural language processing. This task is broadly divided into 3 significant inter- dependent steps which perform identification of complex words[3], generation of appropriate synonyms for the complex words, filtering through all synonyms to find the best substitute and finally the substitution of the generated simpler alternative into the sentence Key Words: Lexical simplification, BERT, Unsupervised, Pretrained language model. 1. INTRODUCTION The main objective of lexical simplification (LS) is to replace complex words with simpler equivalents in order to aid non-native speakers, small kids and specially abled people in comprehending textual data. Lexical simplification can help people in an effective way, because research shows that people can understand the text, in spite of the confusing grammatical context, if they are familiar with the words used. Complex word identification (CWI)[3], substitution generation (SG), and substitute filtering and ranking are all part of the LS framework (SFR). Existing LS systems use handmade word databases (e.g. Wordnet)[4] or para databases to replace complex words with simpler ones using a set of rules. A technique called word embedding is being heavily used now-a-days to generate substitutes for words. These models produce nearly ten synonyms for the given term. Current state-of- the-art model, REC-LS [5], makes use of both word databases and embedding models. Though the accuracy of generated substitutes is very high, there is a drawback in this model’s approach. Due to the little importance given to the context, the model cannot produce substitutes which are meaningful in the given context. This disadvantage of the model, eventually leads to more confusion and has a larger possibility of causing misinterpretation. Fig -1: Comparison of substitutes generated by Our model, Glavas and REC-LS model Context plays a vital role in substitute generation. Fig - 1 depicts a scenario and helps to back the claim. In the figure, Fig -1, substitutes generated by three different models are considered. Models under consideration are LS-Bert, Glavas[6] and RES-LS. Top three substitutes generated by the models for the given complex word can be seen in the figure. Though substitutes generated by Glavas[6] and REC-LS are quite accurate synonyms for the given complex word, the generated synonyms do not fit well with the sentence. They fail to convey the sentence's correct meaning. On the contrary, the substitutes generated by LS-Bert are not only accurate alternatives for the given complex word considered independently but also perfectly fits into the sentence and does not create any ambiguity and lowers the chances of misinterpretation.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 171 Because word complexity is context-dependent, LSBert uses Bidirectional long-short term memory a.k.a Bi- LSTM to recognise complicated words. The Bert Model masks the difficult words that have been identified. Then, both the sentences, with masked and unmasked complex words, are concatenated and fed to the Bert model. The Bert model then produces multiple substitutes for the masked words. To assure grammaticality and meaning equivalence to the original sentence in the output, five high-quality features are employed to rate the substitutions: the frequency of words and their similarity, the ordering of prediction, language models which are based on Bert model and the paraphrase database PPDB. LS-Bert uses a recursive approach to simplify the sentence. It considers one complex word at a time and iterates until all complex words in the sentence are simplified. 2. LITERATURE REVIEW Previously, lexical simplification tasks were performed by analyzers. [8]Practical Simplification of English Text(PSET) was one of the models studied, and the system was broadly divided into two components: An analyzer component that performs the syntactic analysis and disambiguation of text and a simplify component that eventually takes the output of the analyzer and performs the simplification. The analyzer has 3 main components: lexical tagger, morphological analyzer, parser. The simplifier consists of two components: lexical simplifier and syntactic simplifier. This approach to text simplification was quite primitive and had several flaws. Syntactic simplifiers replaced a big sentence with multiple smaller sentences for better understandability. But it had an adverse effect on the length of the sentences and number of words being used. When used alone, several of the words become confusing. Thus such a simple approach to simplification was not sufficient to counter ambiguity issues. Due to the simplicity of the system, it fails to incorporate the context of the sentence in the lexical simplification process. Supervised text-data simplification systems are utilized in recently developed text simplification models. The availability of parallel sentences dataset is critical to the performance of such systems. Resources for such parallel sentences are available but they contain a lot of inaccurate data which eventually makes this approach inefficient. Rule-based[9][10][11], in which each rule contains a difficult word and its related synonyms, is another prominent lexical simplification strategy. In a rule-based approach, synonyms are identified from WordNet or a linguistic dataset that consists of predefined complex words and their simpler counterparts. However, rule- based systems have a key drawback in that it is hard to provide all possible simplification rules for each word. To avoid such a dependence on parallel corpora or word to synonym mapping datasets, [5] Recursive Context- Aware Lexical Simplification uses this approach where LS systems started using word embeddings. In this approach, the top 10 words were extracted whose vectors were closer to that of the complex word’s vector in terms of cosine similarity. Following a review of existing simplification models ranging from rule-based to embedding-based, the most significant flaw discovered was that the system generated a large number of substitute candidates, making further processing, i.e. determining the best fit substitute, extremely difficult and labor-intensive. This is because these systems either completely ignored the context or treated it with less significance. Thus, to eliminate these drawbacks, the context-based lexical simplification model is proposed. This model leverages the advanced capacity of BERT to generate substitutes which are accurate and in accordance with the context of the sentence. An LS framework incorporating complicated word identification, substitution generations, and substitute ranking is proposed in this research, which is based on the LSBert- simple framework of lexical simplification. The framework can simplify one sentence recursively. 3. LEXICAL SIMPLIFICATION PIPELINE We detail each phase of our lexical simplification architecture in this section, which comprises the three steps of complex word identification (CWI), substitution generation (SG), filtering, and substitute ranking (see Figure 2). (SR). Our model iteratively simplifies the text by simplifying one complicated word at a time. Each stage will be described in full below.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 172 Fig -2: Overview of the lexical simplification framework. 3.1. Complex Word Identification The goal of identifying complex words from a single sentence has been explored for years, with the purpose of determining which words in a sentence should be simplified. The CWI was framed as a sequence labeling task, and a SEQ based on bi-directional long short-term memory units (BiLSTM) was trained to predict the binary complexity of words as marked in the dataset. The SEQ model, in contrast to the other CWI models, has two advantages: it takes word context into consideration and it helps eliminate the need for substantial feature engineering because it just uses word embeddings as input information. The SEQ method assigns a lexical complexity score (p) to each word, indicating the chance that it belongs to the complicated class. If the lexical complexity of a word exceeds a set threshold, it is classified as a complicated word. In Figure 2, for example, the example "John produced these verses" is shown. The two words "assembled" (with p = 0.55) and "verses" (with p = 0.76) will be the challenging words to be simplified if the complexity threshold is set to 0.5. To simplify, our model begins with the word "verses" and the highest p value above the predefined threshold. We'll recalculate the complexity of each word in the sentence when we've completed the simplification process, eliminating terms that have been simplified. Furthermore, by conducting named entity identification on the phrase, we eliminate the simplification of entity words. 3.2. Substitute Generation The goal of substitue generation (SG) is to generate substitute candidates for the difficult word w given a sentence S and a complex word w. Using the pre-trained language model Bert, our model prefers to produce substitution candidates for the complicated term. We give a quick overview of the Bert model before describing how we use it to perform lexical simplification. Bert [7] is a self-supervised technique to deep transformer encoder training involving two training shots on goal: masked language modeling (MLM) and the next sentence prediction (NSP). Unlike classical language modeling, which predicts the next word in a series based on its history, MLM predicts missing tokens in a sequence based on its left and right context. Bert completes the NSP task by appending a special categorization token, [CLS], to each sentence and combining sentences with a special separator token, [SEP]. The last hidden state relating to the [CLS] token is used as the total sequence representation from which we estimate a label for classification tasks or may be ignored otherwise. To conceal the difficult word w in a phrase S, we use the special symbol "[MASK]" as a new sequence S'. If we feed S' directly into MLM, the probability of the vocabulary p(·|S’{ti}) corresponding to the complex word w solely takes the context into account, regardless of the influence of the complex word w. Bert is skilled at dealing with sentence pairs as a result of the NSP task he has chosen. We concatenate the original sequences S and S' to form a sentence pair, and then input the sentence pair (S, S') into the Bert to extract the probability distribution of the vocabulary p (·|S, S’{w}) corresponding to the mask word. As a result, the higher probability words in p(·|S, S’{w}) corresponding to the mask word takes into consideration not just the complex word itself, as well as the context of the sophisticated word. Ultimately, after excluding semantic derivations, we select the top ten words of p(·|S, S’{w}) as substitution alternatives. Furthermore, because the difficult word's contextual information is applied twice, we randomly mask a fixed number of words in S omitting w to suitably reduce the influence of contextual information. 3.3. Filtering and Substitute Ranking The replacement ranking of the lexical simplification framework identifies which substitutions are the simplest in the context of a complex word. C = c1, c2, … cn, where n is the number of replacement candidates [14].
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 173 To eliminate some complex options, our approach employs threshold-based filtering. We utilise the Zipf frequency [15], which is the base-10 logarithm of the number of times it appears per billion words, to rank the replacement word possibilities. The higher the value of a term for a person, the more common or familiar it is. Our model then computes various ranks depending on their scores for each of the criteria. After obtaining all of the rankings for each feature, our model scores each candidate by average all of its rankings. Finally, as the best substitution, we select the candidate with the greatest ranking. 4. CONCLUSIONS In this paper, we provide a text simplification system that simplifies text based on its context. In the next three steps, we summarize the paper's key contribution: 1. The SEQ model, which is more optimal than the classic identification model, is used to identify the complicated word. 2. It creates alternatives for the current complicated word, for which it employs Bert's pre-trained model. Bert is enhanced by two training objectives: MLM (masked language modeling) and NSP (next sentence prediction) are two training objectives that improve Bert's contextual accuracy. 3. It chooses and ranks substitution choices based on the output's grammatically and contextual fit. In short, our system makes use of the context and consider it as one of the most important parameters for simplification which in turn, makes our proposed system more accurate in all three important steps i.e., Complex Word Identification, Substitute Generation, filtering and substitute ranking in lexical simplification. 5. REFERENCES [1] J. De Belder, M.-F. Moens, Text simplification for children, In Proceedings of the 2010 SIGIR Workshop on Accessible Search Systems (2010) [2] G. H. Paetzold, L. Specia, Unsupervised lexical simplification for non- native speakers., in: AAAI, 2016 [3] G. Paetzold, L. Specia, Lexical simplification with neural ranking, in: ACL: Volume 2, Short Papers, 2017. [4] S. Devlin, J. Tait, The use of a psycholinguistic database in the simplification of text for aphasic readers, Linguistic Databases 1 (1998) 161173. [5] S. Nisioi, S. Stajner, S. P. Ponzetto, L. P. Dinu, Exploring neural text simplification models, in: ACL, Vol. 2, 2017. [6] G. Glavas, S. Stajner, Simplifying lexical simplification: do we need simplified corpora?, in: ACL, 2015. [7] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding, arXiv preprint arXiv:1810.04805. [8] John Caroll, Guido Minnen, Yvonne Canning , Siobhan Devlin, and John Tait- Practical Simplification of English newspaper Text to assist Aphasic Readers. [9] E. Pavlick, C. Callison-Burch, Simple ppdb: A paraphrase database for simplification, in: ACL: Volume 2, Short Papers, 2016. [10] M. Lesk, Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone, in: Proceedings of the 5th Annual International Conference on Systems Documentation, 1986. [11] M. Maddela, W. Xu, A word-complexity lexicon and a neural readability ranking model for lexical simplification, in: EMNLP, 2018. [12] Advaith Siddharthan. 2006. Syntactic Simplification and Text Cohesion. Research on Language and Computation. [13] Xingxing Zhang and Mirella Lapata. 2017. Sentence Simplification with Deep Reinforcement Learning. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. [14] G. H. Paetzold, L. Specia, A survey on lexical simplification, in: Journal of Artificial Intelligence Research. [15] M. Brysbaert, B. New, Moving beyond kucera and francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for american english, Behavior Research Methods.