SlideShare a Scribd company logo
1 of 8
Download to read offline
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 285
AN AUTOMATIC TEXT SUMMARIZATION USING LEXICAL
COHESION AND CORRELATION OF SENTENCES
A.R.Kulkarni1
, S.S.Apte2
1
Computer Science & Engineering Department, Walchand Institute of Technology, Solapur – 413006, India
2
Head, Computer Science & Engineering Department, Walchand Institute of Technology, Solapur – 413006, India
Abstract
Due to substantial increase in the amount of information on the Internet, it has become extremely difficult to search for relevant
documents needed by the users. To solve this problem, Text summarization is used which produces the summary of documents
such that the summary contains important content of the document. This paper proposes a better approach for text summarization
using lexical chaining and correlation of sentences. Lexical chains are created using Wordnet . The score of each Lexical chain is
calculated based on keyword strength, Tf-idf & other features. The concept of using lexical chains helps to analyze the document
semantically and the concept of correlation of sentences helps to consider the relation of sentence with preceding or succeeding
sentence. This improves the quality of summary generated.
In this paper we discuss a summarization method, which combines lexical chaining with correlation of sentences in which relation
of a sentence with the preceding sentence is considered. Our experiments show that the inclusion of both these features improves
the quality of summary generated.
Keywords— Text summarization, Wordnet, Correlation of sentences, Lexical chains
--------------------------------------------------------------------***-----------------------------------------------------------------
1. INTRODUCTION
1.1 Motivation
These days, the number of Web pages on the Internet almost
doubles every year as the information is now available from
a variety of sources. It takes considerable amount of time to
find the relevant information. Automatic Text
Summarization will help the users to find the relevant
information rapidly. It generates the summary of the
document and one can read the read the summary and
decide the relevance of the document to the information
needed by the user.
1.2 Background Research:
Text summarization is the process of producing a condensed
version of original document. This condensed version
should have important content of the original document.
Research is being done since many years to generate
coherent and indicative summaries using different
techniques. According to (Jones, 1993) the text
summarization is described as two step process
i) Building a source representation from the original
document.
ii) Generating summary from the source representation
Text summarization can be broadly classified into two types:
Single document summarization and multi-document
summarization. This paper focuses on single document
summarization that generates summary of single document.
The text summarization can be categorized into extractive
and abstractive based on the nature of text representation in
the summary.
Many methods have been proposed till now on generating a
coherent summary. The earlier methods used only statistical
methods that focused on term frequency [1] for choosing
important sentences. These methods were not found to be
efficient as it did not consider all the contexts of the word
or identify semantically related terms known as cohesion.
Then came methods which used semantic representation of
the original document supported by a domain-specific
knowledge base. Now a days text summarization is
considered as a natural language processing task . Lexical
chains a simplest form of lexical cohesion was introduced
by Morns & Hirst[2].But it was found that all possible
senses of the word were not taken into account. .
Berzilay & Elhada [2] presented a better algorithm that
constructs all possible interpretations of the source text
using lexical chains. It is an efficient method for text
summarization as lexical chains identify and capture
important concepts of the document without going into
deep semantic analyses. Lexical chains are constructed
using some knowledge base that contains nouns and its
various associations.
Our Algorithm is based on the method used above. We
have used Wordnet to generate domain-specific extractive
summary using Lexical chains for the nouns in the
document. The algorithm segments the given content into
sentences & then into tokens. These tokens are tagged using
POS tagger. The Nouns are selected & for each noun in the
segment, we consider its sense using Wordnet. Then we
attempt to merge these senses into all of the existing chains
in all possible ways, hence building every possible
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 286
interpretation of the segment. Next merge chains between
segments that contain a word in the same sense in common.
The algorithm then calculates score of lexical chains,
determines the strongest chain and uses this to generate a
summary. We have also used the concept of correlation of
sentences to generate a good quality summary.. The terms
that occur in the strongest lexical chains are considered as
key terms and the score of sentences is calculated based
on the presence of key terms in it. All the sentences are
ranked based on their score and top n sentences are selected
for inclusion in the summary. Then the correlation of
sentences is checked and if any sentence has correlation
with the previous sentence, then the previous sentence
should also be included in the summary based on condition
as shown in the algorithm below
2. ARCHITECTURE OF TEXT
SUMMARIZATION
Preprocessing includes
 Segmentation
 Tokenization
 POS(part of speech tagging) at lexical level.
 Stemming.
3. LEXICAL CHAIN COMPUTING
ALGORITHM
1. Input Original document for generating summary
(.txt file).
2. Divide the document into sentences using
segmentation.
3. Each sentence is divided into tokens using
tokenizer.
4. These tokens are tagged using POS Tagger.
5. For each noun build the synsets.
6. For each sentence generate a map using 4
relations: Synonym, Hyperrnym, Hyponym,
Merynym.
7. Calculate distance of each word from other related
words.
8. Build Lexical chains using generated map.
9. Calculate each chain weight using values of
distances of each word
10. Select longest chain i.e. best chain having highest
chain weight
11. From the original document select sentences that
have words in the best chain retaining their order
of occurrence in the original document.
12. Pick top n sentences as summary based on the
percentage of original document to be used for
generating summary.
13. If the selected sentence starts with words :
although, however, moreover ,also, this, those and
that ,then they are related with the preceding
sentence.
14. If the rank of the preceding sentence is equal to or
greater than 70% of the rank of the selected
sentence, then it is included in the summary. In this
way correlation between sentences is maintained.
4. EVALUATION
Evaluation is the most important part of any research work.
It helps to compare various techniques based on evaluation
metrics.
This paper uses precision & recall [4,5,6]technique for
evaluation which is based on statistical measures. Precision
evaluates the proportion of correctness for the sentences in
the summary whereas recall is utilized to evaluate the
proportion of relevant sentences included in the summary.
4.1 Precision
Precision = {Retrieved sentences} - {Relevant sentences}
-----------------------------------------------------------
{Retrieved Sentences}
The higher the precision value, the better is the efficiency
of the system in reducing irrelevant Sentences
4.2 Recall
Recall= {Retrieved sentences}- {Relevant sentences}
______________________________
{relevant sentences}
Higher the recall value, better the efficiency of the
approach in selecting only relevant sentences.
4.3 F-Measure
The weighted harmonic mean of precision and recall is
called as F-measure
F-measure= 2 x Precision * recall
-----------------------
Precision + recall
5. EXPERIMENTAL RESULTS
Three documents are taken in news domain. The original
document, manually generated summaries and summaries
generated by the above approach are shown below. The
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 287
precision recall and F-measure are calculated for these three
documents and they are compared with other two
summarizers.
Original Document 1
Ideal Summary of Document 1
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 288
Summary of Document 1 generated by our Summarizer
Original Document 2
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 289
Ideal Summary of Document 2
Summary of Document 2 generated by our summarizer
Original document 3
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 290
Ideal summary of document 3
Generated Summary of Document 3
6. COMPARISION
This paper considers online summarizer from freesummarizer.com[7], Copernicus summarizer and our summarizer using lexical
chains of sentences for comparison. The above three documents are used as input to all the three summarizers. The precision,
recall and F-measure are used as performance measures for summary generated.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 291
Document1
Document2:
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Copernic Summarizer Online Summarizer Lexical Chain
Summarizer
Precision
Recall
F-measure
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
Copernic Summarizer Online Summarizer Lexical Chain
Summarizer
Precision
Recall
F-measure
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 292
Document3:
7. CONCLUSIONS
It is seen that for document 1 and document 3 our summarizer
performs better than Copernicus summarizer. and online
summarizer. For document 2, It performs equally as online
summarizer but less efficient than Copernicus summarizer.
Our summarizer is better as it also considers the semantic
analysis of the document & correlation of sentences for
generating the summary.
REFERENCES
[1] Canasai Kruengkari and Chuleer at Jaruskulchai,
"Generic Text Summarization Using Local and Global
Properties of Sentences", Proceedings of the IEEE/WIC
international Conference on Web Intelligence (WI’03),
2003.
[2] Morris, J. and G. Hirst. Lexical cohesion computed by
thesaural relations as an indicator of the structure of the
text. In Computational Linguistics, 18(1):pp21-45.
1991.
[3] Barzilay, Regina and Michael Elhadad. Using Lexical
Chains for Text Summarization. in Proceedings of the
Intelligent Scalable Text Summarization
Workshop.(ISTS’97), ACL Madrid, 1997.
[4] Rene Arnulfo Garcia-Hera ndez and Yulia Ledeneva,
“Word Sequence Models for Single Text
Summarization”, IEEE,44-48, 2009.
[5] Jade Goldstein, Mark Kantrowitz, Vibhu Mittal, Jaime
Carbonell, Summarizing text documents: Sentence
Selection and Evaluation Metrics, Language
Technologies Institute, Carnegie Mellon University.
[6] Khosrow Kaikhah, "Automatic Text summarization
with Neural Networks", in Proceedings of second
international Conference on intelligent systems, IEEE,
40-44, Texas, USA, June 2004.
[7] www.freesummarizer.com/summarize/
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Copernic Summarizer Online Summarizer Lexical Chain
Summarizer
Precision
Recall
F-measure

More Related Content

What's hot

Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...
Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...
Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...TELKOMNIKA JOURNAL
 
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION cscpconf
 
Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...
Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...
Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...IJECEIAES
 
IRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET-Semantic Based Document Clustering Using Lexical ChainsIRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET-Semantic Based Document Clustering Using Lexical ChainsIRJET Journal
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringeSAT Journals
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringeSAT Publishing House
 
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESGENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESijnlc
 
IRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text DocumentIRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text DocumentIRJET Journal
 
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATIONSEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATIONijnlc
 
Legal Document
Legal DocumentLegal Document
Legal Documentlegal4
 
Generation of Question and Answer from Unstructured Document using Gaussian M...
Generation of Question and Answer from Unstructured Document using Gaussian M...Generation of Question and Answer from Unstructured Document using Gaussian M...
Generation of Question and Answer from Unstructured Document using Gaussian M...IJACEE IJACEE
 
Analysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion MiningAnalysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion Miningmlaij
 
Dr31564567
Dr31564567Dr31564567
Dr31564567IJMER
 
Information extraction using discourse
Information extraction using discourseInformation extraction using discourse
Information extraction using discourseijitcs
 
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...mlaij
 

What's hot (17)

Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...
Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...
Semi-Supervised Keyphrase Extraction on Scientific Article using Fact-based S...
 
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
 
Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...
Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...
Complete agglomerative hierarchy document’s clustering based on fuzzy luhn’s ...
 
I6 mala3 sowmya
I6 mala3 sowmyaI6 mala3 sowmya
I6 mala3 sowmya
 
IRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET-Semantic Based Document Clustering Using Lexical ChainsIRJET-Semantic Based Document Clustering Using Lexical Chains
IRJET-Semantic Based Document Clustering Using Lexical Chains
 
Y24168171
Y24168171Y24168171
Y24168171
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clustering
 
Elevating forensic investigation system for file clustering
Elevating forensic investigation system for file clusteringElevating forensic investigation system for file clustering
Elevating forensic investigation system for file clustering
 
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESGENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
 
IRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text DocumentIRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text Document
 
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATIONSEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
 
Legal Document
Legal DocumentLegal Document
Legal Document
 
Generation of Question and Answer from Unstructured Document using Gaussian M...
Generation of Question and Answer from Unstructured Document using Gaussian M...Generation of Question and Answer from Unstructured Document using Gaussian M...
Generation of Question and Answer from Unstructured Document using Gaussian M...
 
Analysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion MiningAnalysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion Mining
 
Dr31564567
Dr31564567Dr31564567
Dr31564567
 
Information extraction using discourse
Information extraction using discourseInformation extraction using discourse
Information extraction using discourse
 
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
 

Viewers also liked

5 Bedroom Terra Nova Villa Community View Arabian Ranches
5 Bedroom Terra Nova Villa Community View Arabian Ranches5 Bedroom Terra Nova Villa Community View Arabian Ranches
5 Bedroom Terra Nova Villa Community View Arabian RanchesProperty Listing Search
 
Avoiding packet loss in gateway reallocation in mobile wimax networks
Avoiding packet loss in gateway reallocation in mobile wimax networksAvoiding packet loss in gateway reallocation in mobile wimax networks
Avoiding packet loss in gateway reallocation in mobile wimax networkseSAT Publishing House
 
June 1 prescription for life
June 1 prescription for lifeJune 1 prescription for life
June 1 prescription for lifeGary Thompson
 
Gestão de recursos 3º Nível- Curso Básico em Agro-Pecuário
Gestão de recursos 3º Nível- Curso Básico em Agro-PecuárioGestão de recursos 3º Nível- Curso Básico em Agro-Pecuário
Gestão de recursos 3º Nível- Curso Básico em Agro-PecuárioEpfr De Estaquinha
 
Heterogeneous data transfer and loader
Heterogeneous data transfer and loaderHeterogeneous data transfer and loader
Heterogeneous data transfer and loadereSAT Publishing House
 
You Will Be Breached
You Will Be BreachedYou Will Be Breached
You Will Be BreachedMike Saunders
 
Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...
Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...
Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...Sagi Brody
 
The Economic Impact of Hockey in Canada
The Economic Impact of Hockey in CanadaThe Economic Impact of Hockey in Canada
The Economic Impact of Hockey in CanadaAbel Sports Management
 
Nhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt nam
Nhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt namNhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt nam
Nhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt namNguyễn Thái
 
Presentation 11 19 [recovered]
Presentation 11 19 [recovered]Presentation 11 19 [recovered]
Presentation 11 19 [recovered]Alicia White
 

Viewers also liked (16)

Document Summarizer
Document SummarizerDocument Summarizer
Document Summarizer
 
5 Bedroom Terra Nova Villa Community View Arabian Ranches
5 Bedroom Terra Nova Villa Community View Arabian Ranches5 Bedroom Terra Nova Villa Community View Arabian Ranches
5 Bedroom Terra Nova Villa Community View Arabian Ranches
 
Avoiding packet loss in gateway reallocation in mobile wimax networks
Avoiding packet loss in gateway reallocation in mobile wimax networksAvoiding packet loss in gateway reallocation in mobile wimax networks
Avoiding packet loss in gateway reallocation in mobile wimax networks
 
Childadvocacy
ChildadvocacyChildadvocacy
Childadvocacy
 
June 1 prescription for life
June 1 prescription for lifeJune 1 prescription for life
June 1 prescription for life
 
праздники осени
праздники осенипраздники осени
праздники осени
 
Gestão de recursos 3º Nível- Curso Básico em Agro-Pecuário
Gestão de recursos 3º Nível- Curso Básico em Agro-PecuárioGestão de recursos 3º Nível- Curso Básico em Agro-Pecuário
Gestão de recursos 3º Nível- Curso Básico em Agro-Pecuário
 
All about relative ctr
All about relative ctrAll about relative ctr
All about relative ctr
 
Heterogeneous data transfer and loader
Heterogeneous data transfer and loaderHeterogeneous data transfer and loader
Heterogeneous data transfer and loader
 
Kebutuhan Server
Kebutuhan ServerKebutuhan Server
Kebutuhan Server
 
You Will Be Breached
You Will Be BreachedYou Will Be Breached
You Will Be Breached
 
Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...
Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...
Enabling Limitless Connectivity, Opportunity and Growth with Interconnection ...
 
The Economic Impact of Hockey in Canada
The Economic Impact of Hockey in CanadaThe Economic Impact of Hockey in Canada
The Economic Impact of Hockey in Canada
 
Nhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt nam
Nhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt namNhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt nam
Nhìn ra thế giới để tìm kiếm cơ hội làm giàu cho người việt nam
 
Smo by goigi
Smo by goigiSmo by goigi
Smo by goigi
 
Presentation 11 19 [recovered]
Presentation 11 19 [recovered]Presentation 11 19 [recovered]
Presentation 11 19 [recovered]
 

Similar to An automatic text summarization using lexical cohesion and correlation of sentences

A hybrid approach for text summarization using semantic latent Dirichlet allo...
A hybrid approach for text summarization using semantic latent Dirichlet allo...A hybrid approach for text summarization using semantic latent Dirichlet allo...
A hybrid approach for text summarization using semantic latent Dirichlet allo...IJECEIAES
 
Conceptual framework for abstractive text summarization
Conceptual framework for abstractive text summarizationConceptual framework for abstractive text summarization
Conceptual framework for abstractive text summarizationijnlc
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 
Syntactic Indexes for Text Retrieval
Syntactic Indexes for Text RetrievalSyntactic Indexes for Text Retrieval
Syntactic Indexes for Text RetrievalITIIIndustries
 
Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents ijsc
 
IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...
IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...
IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...IRJET Journal
 
IRJET- Text Highlighting – A Machine Learning Approach
IRJET- Text Highlighting – A Machine Learning ApproachIRJET- Text Highlighting – A Machine Learning Approach
IRJET- Text Highlighting – A Machine Learning ApproachIRJET Journal
 
Semantic Based Document Clustering Using Lexical Chains
Semantic Based Document Clustering Using Lexical ChainsSemantic Based Document Clustering Using Lexical Chains
Semantic Based Document Clustering Using Lexical ChainsIRJET Journal
 
8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural network8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural networkINFOGAIN PUBLICATION
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMIRJET Journal
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniquesugginaramesh
 
Automatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language ProcessingAutomatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language ProcessingIRJET Journal
 
Re-enactment of Newspaper Articles
Re-enactment of Newspaper ArticlesRe-enactment of Newspaper Articles
Re-enactment of Newspaper ArticlesEditor IJCATR
 
Automatic Text Summarization: A Critical Review
Automatic Text Summarization: A Critical ReviewAutomatic Text Summarization: A Critical Review
Automatic Text Summarization: A Critical ReviewIRJET Journal
 
Comparative Study of Abstractive Text Summarization Techniques
Comparative Study of Abstractive Text Summarization TechniquesComparative Study of Abstractive Text Summarization Techniques
Comparative Study of Abstractive Text Summarization TechniquesIRJET Journal
 
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...IRJET Journal
 
Automatic Text Summarization
Automatic Text SummarizationAutomatic Text Summarization
Automatic Text SummarizationIRJET Journal
 
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...Scaling Down Dimensions and Feature Extraction in Document Repository Classif...
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...ijdmtaiir
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsIJECEIAES
 

Similar to An automatic text summarization using lexical cohesion and correlation of sentences (20)

A hybrid approach for text summarization using semantic latent Dirichlet allo...
A hybrid approach for text summarization using semantic latent Dirichlet allo...A hybrid approach for text summarization using semantic latent Dirichlet allo...
A hybrid approach for text summarization using semantic latent Dirichlet allo...
 
Conceptual framework for abstractive text summarization
Conceptual framework for abstractive text summarizationConceptual framework for abstractive text summarization
Conceptual framework for abstractive text summarization
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
Syntactic Indexes for Text Retrieval
Syntactic Indexes for Text RetrievalSyntactic Indexes for Text Retrieval
Syntactic Indexes for Text Retrieval
 
Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents
 
IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...
IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...
IRJET- Sewage Treatment Potential of Coir Geotextiles in Conjunction with Act...
 
IRJET- Text Highlighting – A Machine Learning Approach
IRJET- Text Highlighting – A Machine Learning ApproachIRJET- Text Highlighting – A Machine Learning Approach
IRJET- Text Highlighting – A Machine Learning Approach
 
Semantic Based Document Clustering Using Lexical Chains
Semantic Based Document Clustering Using Lexical ChainsSemantic Based Document Clustering Using Lexical Chains
Semantic Based Document Clustering Using Lexical Chains
 
8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural network8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural network
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniques
 
Automatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language ProcessingAutomatic Text Summarization using Natural Language Processing
Automatic Text Summarization using Natural Language Processing
 
Re-enactment of Newspaper Articles
Re-enactment of Newspaper ArticlesRe-enactment of Newspaper Articles
Re-enactment of Newspaper Articles
 
Automatic Text Summarization: A Critical Review
Automatic Text Summarization: A Critical ReviewAutomatic Text Summarization: A Critical Review
Automatic Text Summarization: A Critical Review
 
Comparative Study of Abstractive Text Summarization Techniques
Comparative Study of Abstractive Text Summarization TechniquesComparative Study of Abstractive Text Summarization Techniques
Comparative Study of Abstractive Text Summarization Techniques
 
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
 
Automatic Text Summarization
Automatic Text SummarizationAutomatic Text Summarization
Automatic Text Summarization
 
H04564550
H04564550H04564550
H04564550
 
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...Scaling Down Dimensions and Feature Extraction in Document Repository Classif...
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documents
 

More from eSAT Publishing House

Likely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnamLikely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnameSAT Publishing House
 
Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...eSAT Publishing House
 
Hudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnamHudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnameSAT Publishing House
 
Groundwater investigation using geophysical methods a case study of pydibhim...
Groundwater investigation using geophysical methods  a case study of pydibhim...Groundwater investigation using geophysical methods  a case study of pydibhim...
Groundwater investigation using geophysical methods a case study of pydibhim...eSAT Publishing House
 
Flood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, indiaFlood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, indiaeSAT Publishing House
 
Enhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity buildingEnhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity buildingeSAT Publishing House
 
Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...eSAT Publishing House
 
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...eSAT Publishing House
 
Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...eSAT Publishing House
 
Shear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a reviewShear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a revieweSAT Publishing House
 
Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...eSAT Publishing House
 
Risk analysis and environmental hazard management
Risk analysis and environmental hazard managementRisk analysis and environmental hazard management
Risk analysis and environmental hazard managementeSAT Publishing House
 
Review study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear wallsReview study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear wallseSAT Publishing House
 
Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...eSAT Publishing House
 
Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...eSAT Publishing House
 
Coastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of indiaCoastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of indiaeSAT Publishing House
 
Can fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structuresCan fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structureseSAT Publishing House
 
Assessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildingsAssessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildingseSAT Publishing House
 
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...eSAT Publishing House
 
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...eSAT Publishing House
 

More from eSAT Publishing House (20)

Likely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnamLikely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnam
 
Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...
 
Hudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnamHudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnam
 
Groundwater investigation using geophysical methods a case study of pydibhim...
Groundwater investigation using geophysical methods  a case study of pydibhim...Groundwater investigation using geophysical methods  a case study of pydibhim...
Groundwater investigation using geophysical methods a case study of pydibhim...
 
Flood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, indiaFlood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, india
 
Enhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity buildingEnhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity building
 
Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...
 
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
 
Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...
 
Shear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a reviewShear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a review
 
Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...
 
Risk analysis and environmental hazard management
Risk analysis and environmental hazard managementRisk analysis and environmental hazard management
Risk analysis and environmental hazard management
 
Review study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear wallsReview study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear walls
 
Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...
 
Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...
 
Coastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of indiaCoastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of india
 
Can fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structuresCan fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structures
 
Assessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildingsAssessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildings
 
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
 
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
 

Recently uploaded

Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 

Recently uploaded (20)

Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 

An automatic text summarization using lexical cohesion and correlation of sentences

  • 1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 285 AN AUTOMATIC TEXT SUMMARIZATION USING LEXICAL COHESION AND CORRELATION OF SENTENCES A.R.Kulkarni1 , S.S.Apte2 1 Computer Science & Engineering Department, Walchand Institute of Technology, Solapur – 413006, India 2 Head, Computer Science & Engineering Department, Walchand Institute of Technology, Solapur – 413006, India Abstract Due to substantial increase in the amount of information on the Internet, it has become extremely difficult to search for relevant documents needed by the users. To solve this problem, Text summarization is used which produces the summary of documents such that the summary contains important content of the document. This paper proposes a better approach for text summarization using lexical chaining and correlation of sentences. Lexical chains are created using Wordnet . The score of each Lexical chain is calculated based on keyword strength, Tf-idf & other features. The concept of using lexical chains helps to analyze the document semantically and the concept of correlation of sentences helps to consider the relation of sentence with preceding or succeeding sentence. This improves the quality of summary generated. In this paper we discuss a summarization method, which combines lexical chaining with correlation of sentences in which relation of a sentence with the preceding sentence is considered. Our experiments show that the inclusion of both these features improves the quality of summary generated. Keywords— Text summarization, Wordnet, Correlation of sentences, Lexical chains --------------------------------------------------------------------***----------------------------------------------------------------- 1. INTRODUCTION 1.1 Motivation These days, the number of Web pages on the Internet almost doubles every year as the information is now available from a variety of sources. It takes considerable amount of time to find the relevant information. Automatic Text Summarization will help the users to find the relevant information rapidly. It generates the summary of the document and one can read the read the summary and decide the relevance of the document to the information needed by the user. 1.2 Background Research: Text summarization is the process of producing a condensed version of original document. This condensed version should have important content of the original document. Research is being done since many years to generate coherent and indicative summaries using different techniques. According to (Jones, 1993) the text summarization is described as two step process i) Building a source representation from the original document. ii) Generating summary from the source representation Text summarization can be broadly classified into two types: Single document summarization and multi-document summarization. This paper focuses on single document summarization that generates summary of single document. The text summarization can be categorized into extractive and abstractive based on the nature of text representation in the summary. Many methods have been proposed till now on generating a coherent summary. The earlier methods used only statistical methods that focused on term frequency [1] for choosing important sentences. These methods were not found to be efficient as it did not consider all the contexts of the word or identify semantically related terms known as cohesion. Then came methods which used semantic representation of the original document supported by a domain-specific knowledge base. Now a days text summarization is considered as a natural language processing task . Lexical chains a simplest form of lexical cohesion was introduced by Morns & Hirst[2].But it was found that all possible senses of the word were not taken into account. . Berzilay & Elhada [2] presented a better algorithm that constructs all possible interpretations of the source text using lexical chains. It is an efficient method for text summarization as lexical chains identify and capture important concepts of the document without going into deep semantic analyses. Lexical chains are constructed using some knowledge base that contains nouns and its various associations. Our Algorithm is based on the method used above. We have used Wordnet to generate domain-specific extractive summary using Lexical chains for the nouns in the document. The algorithm segments the given content into sentences & then into tokens. These tokens are tagged using POS tagger. The Nouns are selected & for each noun in the segment, we consider its sense using Wordnet. Then we attempt to merge these senses into all of the existing chains in all possible ways, hence building every possible
  • 2. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 286 interpretation of the segment. Next merge chains between segments that contain a word in the same sense in common. The algorithm then calculates score of lexical chains, determines the strongest chain and uses this to generate a summary. We have also used the concept of correlation of sentences to generate a good quality summary.. The terms that occur in the strongest lexical chains are considered as key terms and the score of sentences is calculated based on the presence of key terms in it. All the sentences are ranked based on their score and top n sentences are selected for inclusion in the summary. Then the correlation of sentences is checked and if any sentence has correlation with the previous sentence, then the previous sentence should also be included in the summary based on condition as shown in the algorithm below 2. ARCHITECTURE OF TEXT SUMMARIZATION Preprocessing includes  Segmentation  Tokenization  POS(part of speech tagging) at lexical level.  Stemming. 3. LEXICAL CHAIN COMPUTING ALGORITHM 1. Input Original document for generating summary (.txt file). 2. Divide the document into sentences using segmentation. 3. Each sentence is divided into tokens using tokenizer. 4. These tokens are tagged using POS Tagger. 5. For each noun build the synsets. 6. For each sentence generate a map using 4 relations: Synonym, Hyperrnym, Hyponym, Merynym. 7. Calculate distance of each word from other related words. 8. Build Lexical chains using generated map. 9. Calculate each chain weight using values of distances of each word 10. Select longest chain i.e. best chain having highest chain weight 11. From the original document select sentences that have words in the best chain retaining their order of occurrence in the original document. 12. Pick top n sentences as summary based on the percentage of original document to be used for generating summary. 13. If the selected sentence starts with words : although, however, moreover ,also, this, those and that ,then they are related with the preceding sentence. 14. If the rank of the preceding sentence is equal to or greater than 70% of the rank of the selected sentence, then it is included in the summary. In this way correlation between sentences is maintained. 4. EVALUATION Evaluation is the most important part of any research work. It helps to compare various techniques based on evaluation metrics. This paper uses precision & recall [4,5,6]technique for evaluation which is based on statistical measures. Precision evaluates the proportion of correctness for the sentences in the summary whereas recall is utilized to evaluate the proportion of relevant sentences included in the summary. 4.1 Precision Precision = {Retrieved sentences} - {Relevant sentences} ----------------------------------------------------------- {Retrieved Sentences} The higher the precision value, the better is the efficiency of the system in reducing irrelevant Sentences 4.2 Recall Recall= {Retrieved sentences}- {Relevant sentences} ______________________________ {relevant sentences} Higher the recall value, better the efficiency of the approach in selecting only relevant sentences. 4.3 F-Measure The weighted harmonic mean of precision and recall is called as F-measure F-measure= 2 x Precision * recall ----------------------- Precision + recall 5. EXPERIMENTAL RESULTS Three documents are taken in news domain. The original document, manually generated summaries and summaries generated by the above approach are shown below. The
  • 3. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 287 precision recall and F-measure are calculated for these three documents and they are compared with other two summarizers. Original Document 1 Ideal Summary of Document 1
  • 4. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 288 Summary of Document 1 generated by our Summarizer Original Document 2
  • 5. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 289 Ideal Summary of Document 2 Summary of Document 2 generated by our summarizer Original document 3
  • 6. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 290 Ideal summary of document 3 Generated Summary of Document 3 6. COMPARISION This paper considers online summarizer from freesummarizer.com[7], Copernicus summarizer and our summarizer using lexical chains of sentences for comparison. The above three documents are used as input to all the three summarizers. The precision, recall and F-measure are used as performance measures for summary generated.
  • 7. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 291 Document1 Document2: 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Copernic Summarizer Online Summarizer Lexical Chain Summarizer Precision Recall F-measure 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Copernic Summarizer Online Summarizer Lexical Chain Summarizer Precision Recall F-measure
  • 8. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 03 Issue: 06 | Jun-2014, Available @ http://www.ijret.org 292 Document3: 7. CONCLUSIONS It is seen that for document 1 and document 3 our summarizer performs better than Copernicus summarizer. and online summarizer. For document 2, It performs equally as online summarizer but less efficient than Copernicus summarizer. Our summarizer is better as it also considers the semantic analysis of the document & correlation of sentences for generating the summary. REFERENCES [1] Canasai Kruengkari and Chuleer at Jaruskulchai, "Generic Text Summarization Using Local and Global Properties of Sentences", Proceedings of the IEEE/WIC international Conference on Web Intelligence (WI’03), 2003. [2] Morris, J. and G. Hirst. Lexical cohesion computed by thesaural relations as an indicator of the structure of the text. In Computational Linguistics, 18(1):pp21-45. 1991. [3] Barzilay, Regina and Michael Elhadad. Using Lexical Chains for Text Summarization. in Proceedings of the Intelligent Scalable Text Summarization Workshop.(ISTS’97), ACL Madrid, 1997. [4] Rene Arnulfo Garcia-Hera ndez and Yulia Ledeneva, “Word Sequence Models for Single Text Summarization”, IEEE,44-48, 2009. [5] Jade Goldstein, Mark Kantrowitz, Vibhu Mittal, Jaime Carbonell, Summarizing text documents: Sentence Selection and Evaluation Metrics, Language Technologies Institute, Carnegie Mellon University. [6] Khosrow Kaikhah, "Automatic Text summarization with Neural Networks", in Proceedings of second international Conference on intelligent systems, IEEE, 40-44, Texas, USA, June 2004. [7] www.freesummarizer.com/summarize/ 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Copernic Summarizer Online Summarizer Lexical Chain Summarizer Precision Recall F-measure