Submit Search
Upload
CONTEXT BASED LEXICAL SIMPLIFICATION
•
0 likes
•
3 views
IRJET Journal
Follow
https://www.irjet.net/archives/V9/i3/IRJET-V9I331.pdf
Read less
Read more
Engineering
Report
Share
Report
Share
1 of 4
Download now
Download to read offline
Recommended
Malayalam Word Sense Disambiguation using Machine Learning Approach
Malayalam Word Sense Disambiguation using Machine Learning Approach
IRJET Journal
WordNet Based Online Reverse Dictionary with Improved Accuracy and Parts-of-S...
WordNet Based Online Reverse Dictionary with Improved Accuracy and Parts-of-S...
IRJET Journal
Semantic Based Document Clustering Using Lexical Chains
Semantic Based Document Clustering Using Lexical Chains
IRJET Journal
IRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET Journal
A Survey on Automatic Text Summarization
A Survey on Automatic Text Summarization
IRJET Journal
Wsd final paper
Wsd final paper
Milind Gokhale
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
IRJET Journal
Enhanced TextRank using weighted word embedding for text summarization
Enhanced TextRank using weighted word embedding for text summarization
IJECEIAES
Recommended
Malayalam Word Sense Disambiguation using Machine Learning Approach
Malayalam Word Sense Disambiguation using Machine Learning Approach
IRJET Journal
WordNet Based Online Reverse Dictionary with Improved Accuracy and Parts-of-S...
WordNet Based Online Reverse Dictionary with Improved Accuracy and Parts-of-S...
IRJET Journal
Semantic Based Document Clustering Using Lexical Chains
Semantic Based Document Clustering Using Lexical Chains
IRJET Journal
IRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET Journal
A Survey on Automatic Text Summarization
A Survey on Automatic Text Summarization
IRJET Journal
Wsd final paper
Wsd final paper
Milind Gokhale
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
IRJET Journal
Enhanced TextRank using weighted word embedding for text summarization
Enhanced TextRank using weighted word embedding for text summarization
IJECEIAES
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
IRJET Journal
Free Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence Tagging
IRJET Journal
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET Journal
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET Journal
IRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between Sentences
IRJET Journal
Semantic Similarity Between Sentences
Semantic Similarity Between Sentences
IRJET Journal
Reviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clustering
IRJET Journal
arabic.pdf
arabic.pdf
ShoaibBigzad
Semantic similarity measurement- A theoretical study of various approaches
Semantic similarity measurement- A theoretical study of various approaches
IRJET Journal
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
IRJET Journal
Survey on Natural Language Generation
Survey on Natural Language Generation
ijtsrd
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
ijnlc
Coverage-Criteria-for-Testing-SQL-Queries
Coverage-Criteria-for-Testing-SQL-Queries
Mohamed Reda
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
IRJET Journal
Hindi language as a graphical user interface to relational database for tran...
Hindi language as a graphical user interface to relational database for tran...
IRJET Journal
Automatic customer review summarization using deep learningbased hybrid senti...
Automatic customer review summarization using deep learningbased hybrid senti...
IJECEIAES
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
sunda2011
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
cscpconf
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
csandit
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologies
csandit
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
More Related Content
Similar to CONTEXT BASED LEXICAL SIMPLIFICATION
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
IRJET Journal
Free Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence Tagging
IRJET Journal
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET Journal
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET Journal
IRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between Sentences
IRJET Journal
Semantic Similarity Between Sentences
Semantic Similarity Between Sentences
IRJET Journal
Reviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clustering
IRJET Journal
arabic.pdf
arabic.pdf
ShoaibBigzad
Semantic similarity measurement- A theoretical study of various approaches
Semantic similarity measurement- A theoretical study of various approaches
IRJET Journal
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
IRJET Journal
Survey on Natural Language Generation
Survey on Natural Language Generation
ijtsrd
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
ijnlc
Coverage-Criteria-for-Testing-SQL-Queries
Coverage-Criteria-for-Testing-SQL-Queries
Mohamed Reda
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
IRJET Journal
Hindi language as a graphical user interface to relational database for tran...
Hindi language as a graphical user interface to relational database for tran...
IRJET Journal
Automatic customer review summarization using deep learningbased hybrid senti...
Automatic customer review summarization using deep learningbased hybrid senti...
IJECEIAES
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
sunda2011
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
cscpconf
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
csandit
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologies
csandit
Similar to CONTEXT BASED LEXICAL SIMPLIFICATION
(20)
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Free Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence Tagging
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET- Short-Text Semantic Similarity using Glove Word Embedding
IRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between Sentences
Semantic Similarity Between Sentences
Semantic Similarity Between Sentences
Reviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clustering
arabic.pdf
arabic.pdf
Semantic similarity measurement- A theoretical study of various approaches
Semantic similarity measurement- A theoretical study of various approaches
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
Survey on Natural Language Generation
Survey on Natural Language Generation
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
Coverage-Criteria-for-Testing-SQL-Queries
Coverage-Criteria-for-Testing-SQL-Queries
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
Named Entity Recognition (NER) Using Automatic Summarization of Resumes
Hindi language as a graphical user interface to relational database for tran...
Hindi language as a graphical user interface to relational database for tran...
Automatic customer review summarization using deep learningbased hybrid senti...
Automatic customer review summarization using deep learningbased hybrid senti...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Knowledge...
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologies
More from IRJET Journal
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
IRJET Journal
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
IRJET Journal
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
IRJET Journal
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
IRJET Journal
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
IRJET Journal
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
IRJET Journal
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
IRJET Journal
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
IRJET Journal
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
IRJET Journal
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
IRJET Journal
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
IRJET Journal
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
IRJET Journal
React based fullstack edtech web application
React based fullstack edtech web application
IRJET Journal
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
IRJET Journal
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
IRJET Journal
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
IRJET Journal
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
IRJET Journal
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
IRJET Journal
More from IRJET Journal
(20)
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
React based fullstack edtech web application
React based fullstack edtech web application
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Recently uploaded
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
jaanualu31
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
Dr. Deepak Mudgal
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
pritamlangde
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
NANDHAKUMARA10
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
SCMS School of Architecture
Computer Networks Basics of Network Devices
Computer Networks Basics of Network Devices
ChandrakantDivate1
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
Mustafa Ahmed
Employee leave management system project.
Employee leave management system project.
Kamal Acharya
Online food ordering system project report.pdf
Online food ordering system project report.pdf
Kamal Acharya
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
Amil baba
Hospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor
AshwiniTodkar4
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
RashidFaridChishti
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)
ChandrakantDivate1
Signal Processing and Linear System Analysis
Signal Processing and Linear System Analysis
National Chung Hsing University
Computer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
ChandrakantDivate1
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
ppkakm
Recently uploaded
(20)
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
Computer Networks Basics of Network Devices
Computer Networks Basics of Network Devices
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
Employee leave management system project.
Employee leave management system project.
Online food ordering system project report.pdf
Online food ordering system project report.pdf
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
Hospital management system project report.pdf
Hospital management system project report.pdf
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)
Signal Processing and Linear System Analysis
Signal Processing and Linear System Analysis
Computer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
CONTEXT BASED LEXICAL SIMPLIFICATION
1.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 170 CONTEXT BASED LEXICAL SIMPLIFICATION Piyush Pandey1, Tejas Patil2, Tanvi Sutar3, Ankita Ugale4 *Dr S.F. Sayyad, Computer Dept. .AISSMS College of Engineering, Pune, Maharashtra. India 1Piyush Pandey. AISSMS College of Engineering, Pune, Maharashtra. India 2Tejas Patil., AISSMS College of Engineering, Pune, Maharashtra. India 3Tanvi Sutar, AISSMS College of Engineering, Pune, Maharashtra. India 4Ankita Ugale, AISSMS College of Engineering, Pune, Maharashtra. India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - It's tough to decipher text that contains sophisticated and infrequently used vocabulary. Thus, it calls for a mechanism or tool for simplification of the text without changing its meaning. Thus, the motive of this paper is to facilitate non-native (English) language speakers[2], people with dyslexia and children[1] to better understand textual information. For the accomplishment of such a simplification, the approach that is the most appropriate and favored is natural language processing. This task is broadly divided into 3 significant inter- dependent steps which perform identification of complex words[3], generation of appropriate synonyms for the complex words, filtering through all synonyms to find the best substitute and finally the substitution of the generated simpler alternative into the sentence Key Words: Lexical simplification, BERT, Unsupervised, Pretrained language model. 1. INTRODUCTION The main objective of lexical simplification (LS) is to replace complex words with simpler equivalents in order to aid non-native speakers, small kids and specially abled people in comprehending textual data. Lexical simplification can help people in an effective way, because research shows that people can understand the text, in spite of the confusing grammatical context, if they are familiar with the words used. Complex word identification (CWI)[3], substitution generation (SG), and substitute filtering and ranking are all part of the LS framework (SFR). Existing LS systems use handmade word databases (e.g. Wordnet)[4] or para databases to replace complex words with simpler ones using a set of rules. A technique called word embedding is being heavily used now-a-days to generate substitutes for words. These models produce nearly ten synonyms for the given term. Current state-of- the-art model, REC-LS [5], makes use of both word databases and embedding models. Though the accuracy of generated substitutes is very high, there is a drawback in this model’s approach. Due to the little importance given to the context, the model cannot produce substitutes which are meaningful in the given context. This disadvantage of the model, eventually leads to more confusion and has a larger possibility of causing misinterpretation. Fig -1: Comparison of substitutes generated by Our model, Glavas and REC-LS model Context plays a vital role in substitute generation. Fig - 1 depicts a scenario and helps to back the claim. In the figure, Fig -1, substitutes generated by three different models are considered. Models under consideration are LS-Bert, Glavas[6] and RES-LS. Top three substitutes generated by the models for the given complex word can be seen in the figure. Though substitutes generated by Glavas[6] and REC-LS are quite accurate synonyms for the given complex word, the generated synonyms do not fit well with the sentence. They fail to convey the sentence's correct meaning. On the contrary, the substitutes generated by LS-Bert are not only accurate alternatives for the given complex word considered independently but also perfectly fits into the sentence and does not create any ambiguity and lowers the chances of misinterpretation.
2.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 171 Because word complexity is context-dependent, LSBert uses Bidirectional long-short term memory a.k.a Bi- LSTM to recognise complicated words. The Bert Model masks the difficult words that have been identified. Then, both the sentences, with masked and unmasked complex words, are concatenated and fed to the Bert model. The Bert model then produces multiple substitutes for the masked words. To assure grammaticality and meaning equivalence to the original sentence in the output, five high-quality features are employed to rate the substitutions: the frequency of words and their similarity, the ordering of prediction, language models which are based on Bert model and the paraphrase database PPDB. LS-Bert uses a recursive approach to simplify the sentence. It considers one complex word at a time and iterates until all complex words in the sentence are simplified. 2. LITERATURE REVIEW Previously, lexical simplification tasks were performed by analyzers. [8]Practical Simplification of English Text(PSET) was one of the models studied, and the system was broadly divided into two components: An analyzer component that performs the syntactic analysis and disambiguation of text and a simplify component that eventually takes the output of the analyzer and performs the simplification. The analyzer has 3 main components: lexical tagger, morphological analyzer, parser. The simplifier consists of two components: lexical simplifier and syntactic simplifier. This approach to text simplification was quite primitive and had several flaws. Syntactic simplifiers replaced a big sentence with multiple smaller sentences for better understandability. But it had an adverse effect on the length of the sentences and number of words being used. When used alone, several of the words become confusing. Thus such a simple approach to simplification was not sufficient to counter ambiguity issues. Due to the simplicity of the system, it fails to incorporate the context of the sentence in the lexical simplification process. Supervised text-data simplification systems are utilized in recently developed text simplification models. The availability of parallel sentences dataset is critical to the performance of such systems. Resources for such parallel sentences are available but they contain a lot of inaccurate data which eventually makes this approach inefficient. Rule-based[9][10][11], in which each rule contains a difficult word and its related synonyms, is another prominent lexical simplification strategy. In a rule-based approach, synonyms are identified from WordNet or a linguistic dataset that consists of predefined complex words and their simpler counterparts. However, rule- based systems have a key drawback in that it is hard to provide all possible simplification rules for each word. To avoid such a dependence on parallel corpora or word to synonym mapping datasets, [5] Recursive Context- Aware Lexical Simplification uses this approach where LS systems started using word embeddings. In this approach, the top 10 words were extracted whose vectors were closer to that of the complex word’s vector in terms of cosine similarity. Following a review of existing simplification models ranging from rule-based to embedding-based, the most significant flaw discovered was that the system generated a large number of substitute candidates, making further processing, i.e. determining the best fit substitute, extremely difficult and labor-intensive. This is because these systems either completely ignored the context or treated it with less significance. Thus, to eliminate these drawbacks, the context-based lexical simplification model is proposed. This model leverages the advanced capacity of BERT to generate substitutes which are accurate and in accordance with the context of the sentence. An LS framework incorporating complicated word identification, substitution generations, and substitute ranking is proposed in this research, which is based on the LSBert- simple framework of lexical simplification. The framework can simplify one sentence recursively. 3. LEXICAL SIMPLIFICATION PIPELINE We detail each phase of our lexical simplification architecture in this section, which comprises the three steps of complex word identification (CWI), substitution generation (SG), filtering, and substitute ranking (see Figure 2). (SR). Our model iteratively simplifies the text by simplifying one complicated word at a time. Each stage will be described in full below.
3.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 172 Fig -2: Overview of the lexical simplification framework. 3.1. Complex Word Identification The goal of identifying complex words from a single sentence has been explored for years, with the purpose of determining which words in a sentence should be simplified. The CWI was framed as a sequence labeling task, and a SEQ based on bi-directional long short-term memory units (BiLSTM) was trained to predict the binary complexity of words as marked in the dataset. The SEQ model, in contrast to the other CWI models, has two advantages: it takes word context into consideration and it helps eliminate the need for substantial feature engineering because it just uses word embeddings as input information. The SEQ method assigns a lexical complexity score (p) to each word, indicating the chance that it belongs to the complicated class. If the lexical complexity of a word exceeds a set threshold, it is classified as a complicated word. In Figure 2, for example, the example "John produced these verses" is shown. The two words "assembled" (with p = 0.55) and "verses" (with p = 0.76) will be the challenging words to be simplified if the complexity threshold is set to 0.5. To simplify, our model begins with the word "verses" and the highest p value above the predefined threshold. We'll recalculate the complexity of each word in the sentence when we've completed the simplification process, eliminating terms that have been simplified. Furthermore, by conducting named entity identification on the phrase, we eliminate the simplification of entity words. 3.2. Substitute Generation The goal of substitue generation (SG) is to generate substitute candidates for the difficult word w given a sentence S and a complex word w. Using the pre-trained language model Bert, our model prefers to produce substitution candidates for the complicated term. We give a quick overview of the Bert model before describing how we use it to perform lexical simplification. Bert [7] is a self-supervised technique to deep transformer encoder training involving two training shots on goal: masked language modeling (MLM) and the next sentence prediction (NSP). Unlike classical language modeling, which predicts the next word in a series based on its history, MLM predicts missing tokens in a sequence based on its left and right context. Bert completes the NSP task by appending a special categorization token, [CLS], to each sentence and combining sentences with a special separator token, [SEP]. The last hidden state relating to the [CLS] token is used as the total sequence representation from which we estimate a label for classification tasks or may be ignored otherwise. To conceal the difficult word w in a phrase S, we use the special symbol "[MASK]" as a new sequence S'. If we feed S' directly into MLM, the probability of the vocabulary p(·|S’{ti}) corresponding to the complex word w solely takes the context into account, regardless of the influence of the complex word w. Bert is skilled at dealing with sentence pairs as a result of the NSP task he has chosen. We concatenate the original sequences S and S' to form a sentence pair, and then input the sentence pair (S, S') into the Bert to extract the probability distribution of the vocabulary p (·|S, S’{w}) corresponding to the mask word. As a result, the higher probability words in p(·|S, S’{w}) corresponding to the mask word takes into consideration not just the complex word itself, as well as the context of the sophisticated word. Ultimately, after excluding semantic derivations, we select the top ten words of p(·|S, S’{w}) as substitution alternatives. Furthermore, because the difficult word's contextual information is applied twice, we randomly mask a fixed number of words in S omitting w to suitably reduce the influence of contextual information. 3.3. Filtering and Substitute Ranking The replacement ranking of the lexical simplification framework identifies which substitutions are the simplest in the context of a complex word. C = c1, c2, … cn, where n is the number of replacement candidates [14].
4.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 03 | Mar 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 173 To eliminate some complex options, our approach employs threshold-based filtering. We utilise the Zipf frequency [15], which is the base-10 logarithm of the number of times it appears per billion words, to rank the replacement word possibilities. The higher the value of a term for a person, the more common or familiar it is. Our model then computes various ranks depending on their scores for each of the criteria. After obtaining all of the rankings for each feature, our model scores each candidate by average all of its rankings. Finally, as the best substitution, we select the candidate with the greatest ranking. 4. CONCLUSIONS In this paper, we provide a text simplification system that simplifies text based on its context. In the next three steps, we summarize the paper's key contribution: 1. The SEQ model, which is more optimal than the classic identification model, is used to identify the complicated word. 2. It creates alternatives for the current complicated word, for which it employs Bert's pre-trained model. Bert is enhanced by two training objectives: MLM (masked language modeling) and NSP (next sentence prediction) are two training objectives that improve Bert's contextual accuracy. 3. It chooses and ranks substitution choices based on the output's grammatically and contextual fit. In short, our system makes use of the context and consider it as one of the most important parameters for simplification which in turn, makes our proposed system more accurate in all three important steps i.e., Complex Word Identification, Substitute Generation, filtering and substitute ranking in lexical simplification. 5. REFERENCES [1] J. De Belder, M.-F. Moens, Text simplification for children, In Proceedings of the 2010 SIGIR Workshop on Accessible Search Systems (2010) [2] G. H. Paetzold, L. Specia, Unsupervised lexical simplification for non- native speakers., in: AAAI, 2016 [3] G. Paetzold, L. Specia, Lexical simplification with neural ranking, in: ACL: Volume 2, Short Papers, 2017. [4] S. Devlin, J. Tait, The use of a psycholinguistic database in the simplification of text for aphasic readers, Linguistic Databases 1 (1998) 161173. [5] S. Nisioi, S. Stajner, S. P. Ponzetto, L. P. Dinu, Exploring neural text simplification models, in: ACL, Vol. 2, 2017. [6] G. Glavas, S. Stajner, Simplifying lexical simplification: do we need simplified corpora?, in: ACL, 2015. [7] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding, arXiv preprint arXiv:1810.04805. [8] John Caroll, Guido Minnen, Yvonne Canning , Siobhan Devlin, and John Tait- Practical Simplification of English newspaper Text to assist Aphasic Readers. [9] E. Pavlick, C. Callison-Burch, Simple ppdb: A paraphrase database for simplification, in: ACL: Volume 2, Short Papers, 2016. [10] M. Lesk, Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone, in: Proceedings of the 5th Annual International Conference on Systems Documentation, 1986. [11] M. Maddela, W. Xu, A word-complexity lexicon and a neural readability ranking model for lexical simplification, in: EMNLP, 2018. [12] Advaith Siddharthan. 2006. Syntactic Simplification and Text Cohesion. Research on Language and Computation. [13] Xingxing Zhang and Mirella Lapata. 2017. Sentence Simplification with Deep Reinforcement Learning. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. [14] G. H. Paetzold, L. Specia, A survey on lexical simplification, in: Journal of Artificial Intelligence Research. [15] M. Brysbaert, B. New, Moving beyond kucera and francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for american english, Behavior Research Methods.
Download now