ISSN: 2278 – 1323
International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
Volume 2, Issue 4, April 2013
All Rights Reserved © 2013 IJARCET

Abstract —With the increasing number of user queries to
search engines, it has become necessary to study the area of
Information Retrieval. Users always expect to get the most
appropriate and relevant results for a query. But when a query is
ambiguous, the user usually does not get what he or she actually wants.
This paper deals with the ambiguity of queries and helps
provide the user with relevant results. To validate the work,
we have used the evaluation measure Mean Average Precision.
Keywords— Information Retrieval, Page Rank Algorithm,
Precision, Average Precision, Mean Average Precision
I. INTRODUCTION
In recent years the number of user queries submitted to
search engines has increased greatly. People rely mostly on
search engines to answer their queries, and they expect the
most relevant results, delivered efficiently. Search engines
face many problems in meeting this expectation; one of them
is query ambiguity. When the user enters the word beat, it has
various meanings: beat can be related to a boat, it can be used
in the context of music, it can be used in the context of police, etc.
Such ambiguity in words calls for in-depth study in areas such as
Information Retrieval and NLP. In this paper we discuss
a few aspects of these areas and propose an
algorithm that provides the user with more relevant results by
disambiguating the user's queries. We also include
some evaluation measures to validate the work.
II. RANKING ALGORITHMS
Web search engines use different ranking algorithms to
provide the user with relevant results. A ranking algorithm
orders the web pages so that results are presented to the user
by priority. Different
ranking algorithms use different aspects of the user query to
provide satisfactory, relevant results. They are based
on Web Structure Mining, Web Usage Mining, or Web
Content Mining. All three mining fields play an important
role in Information Retrieval. Many ranking
algorithms have been proposed and implemented so far,
such as the Page Rank algorithm used by Google [6][8], HITS
(Hypertext Induced Topic Search) [4][5], WPR (Weighted
Page Rank) [10], etc.
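As an illustration of link-based ranking, the following is a minimal sketch of Page Rank computed by power iteration. The toy link graph, the function name `pagerank`, the damping factor of 0.85 and the iteration count are assumptions chosen for illustration; this is not the production algorithm of any search engine.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration Page Rank over a dict {page: [pages it links to]}.

    Every link target must also appear as a key of `links`.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives a small baseline share of rank.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outlinks in links.items():
            if outlinks:
                # A page passes its rank equally to the pages it links to.
                share = damping * rank[page] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical four-page web used only for illustration.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_web)
for page, score in sorted(ranks.items(), key=lambda item: item[1], reverse=True):
    print(page, round(score, 3))
```

In this toy graph page C is linked from three other pages and therefore receives the highest rank, while page D, which nothing links to, receives the lowest.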
III. WORD SENSE DISAMBIGUATION
Word Sense Disambiguation (WSD) is the task of automatically
assigning a sense to a polysemous word in a given
context. It helps find the correct meaning or sense of a
sentence or query that has multiple meanings. It is a
challenging and important area of NLP that helps
improve the performance of information retrieval,
information extraction, etc. When searching for results, it is
important to eliminate the senses of the query or word that are
not relevant to the user. Word Sense Disambiguation is the
process that provides the direction for resolving such queries [9][11].
IV. WORD SENSE DISAMBIGUATION APPROACHES
There are two main approaches to WSD [11]. The Deep
Approach tries to draw on a body of world knowledge.
The Shallow Approach does not try to understand
the text; instead it uses the surrounding words to assign a meaning
to the text. There are also four conventional approaches to WSD:
A. Dictionary and Knowledge based method
This method relies on dictionaries, thesauri, and knowledge
bases; it does not use a corpus for training. It assumes that
words used together in a text are related and that the relation
can be derived from the meanings of the words and their senses of
usage [11].
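One well-known dictionary-based technique is the simplified Lesk algorithm, which picks the sense whose dictionary gloss shares the most words with the query context. The sketch below illustrates the idea for the word beat; the glosses, the stopword list and the function name are hypothetical and are not taken from any real dictionary or from the proposed work.

```python
def simplified_lesk(context_words, sense_glosses, stopwords=frozenset()):
    """Return the sense whose gloss overlaps most with the context words."""
    context = {w.lower() for w in context_words} - stopwords
    best_sense, best_overlap = None, -1
    for sense, gloss in sense_glosses.items():
        gloss_words = set(gloss.lower().split()) - stopwords
        overlap = len(context & gloss_words)
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense


# Hypothetical glosses for three senses of "beat" (illustration only).
BEAT_GLOSSES = {
    "music": "the basic rhythmic unit of a piece of music or song",
    "heart": "a single pulsation of the heart pumping blood",
    "boat": "to sail a boat against the wind by tacking",
}
STOPWORDS = frozenset({"a", "the", "of", "or", "to", "by"})

music_sense = simplified_lesk("drum beat in a rock song".split(), BEAT_GLOSSES, STOPWORDS)
heart_sense = simplified_lesk("irregular heart beat and blood pressure".split(), BEAT_GLOSSES, STOPWORDS)
print(music_sense, heart_sense)
```

No training corpus is involved: the only knowledge source is the dictionary itself, which is what distinguishes this family of methods from the supervised ones.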
B. Supervised
This method assumes that the context of the text is enough to
disambiguate it. It uses a sense-annotated corpus to train
a classifier [11].
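To make the supervised setting concrete, the following minimal sketch trains a Naive Bayes classifier with add-one smoothing on a tiny sense-annotated corpus. The choice of Naive Bayes, the training examples and the function names are assumptions made for illustration; the text above does not prescribe a particular classifier.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(labeled_contexts):
    """Count senses and context words from (words, sense) training pairs."""
    sense_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for words, sense in labeled_contexts:
        sense_counts[sense] += 1
        word_counts[sense].update(words)
        vocabulary.update(words)
    return sense_counts, word_counts, vocabulary


def classify(words, model):
    """Return the sense with the highest smoothed log-probability."""
    sense_counts, word_counts, vocabulary = model
    total = sum(sense_counts.values())
    best_sense, best_score = None, -math.inf
    for sense, count in sense_counts.items():
        score = math.log(count / total)
        denominator = sum(word_counts[sense].values()) + len(vocabulary)
        for w in words:
            # Add-one smoothing keeps unseen words from zeroing the score.
            score += math.log((word_counts[sense][w] + 1) / denominator)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense


# Hypothetical sense-annotated contexts of the word "beat".
TRAINING = [
    (["drum", "rhythm", "song"], "music"),
    (["tempo", "song", "dance"], "music"),
    (["heart", "pulse", "blood"], "heart"),
    (["heart", "rate", "doctor"], "heart"),
]
model = train_naive_bayes(TRAINING)
print(classify(["song", "tempo"], model))
print(classify(["blood", "doctor"], model))
```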
C. Semi-Supervised or Minimally Supervised
This method is used by many word sense disambiguation
algorithms because of the lack of training sets. It uses both
annotated and unannotated data. A small set of annotated data
is used to train an initial classifier. The classifier is then
used to label part of the larger unannotated data, and the newly
labeled data is added to the training set. This process is
repeated until the whole corpus is consumed [9][11].
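The bootstrapping loop described above can be sketched as a simple self-training procedure. The word-overlap labeling rule, the `min_overlap` threshold and the toy contexts are assumptions made for illustration; real systems use a proper classifier and confidence estimate at each round.

```python
def self_train(seed, unlabeled, min_overlap=1):
    """Grow a labeled set from a few seed examples by repeated labeling.

    seed: list of (set_of_words, sense) pairs.
    unlabeled: list of word lists without a sense.
    Returns (labeled pairs, contexts that could not be labeled).
    """
    labeled = list(seed)
    remaining = [set(ctx) for ctx in unlabeled]
    while remaining:
        # Collect the words seen so far with each sense.
        sense_words = {}
        for words, sense in labeled:
            sense_words.setdefault(sense, set()).update(words)
        newly_labeled, still_unlabeled = [], []
        for ctx in remaining:
            overlaps = {s: len(ctx & w) for s, w in sense_words.items()}
            best = max(overlaps, key=overlaps.get)
            if overlaps[best] >= min_overlap:
                newly_labeled.append((ctx, best))
            else:
                still_unlabeled.append(ctx)
        if not newly_labeled:
            break  # Nothing could be labeled in this round, so stop.
        labeled.extend(newly_labeled)
        remaining = still_unlabeled
    return labeled, remaining


# Hypothetical seed and unlabeled contexts of the word "beat".
seed = [({"drum", "song"}, "music"), ({"heart", "pulse"}, "heart")]
unlabeled = [
    ["song", "dance"],
    ["pulse", "blood"],
    ["dance", "tempo"],
    ["blood", "doctor"],
    ["boat", "wind"],
]
labeled, leftover = self_train(seed, unlabeled)
print(len(labeled), leftover)
```

In this toy run the contexts containing "dance" and "blood" can only be labeled in the second round, after the first round has added those words to the known senses. The boat context shares no words with any labeled sense and is left unlabeled.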
Disambiguation of User Queries in Search Engines
Rekha Jain*1, Rupal Bhargava*2, G. N. Purohit*3
*Department of Computer Science, Banasthali University, Jaipur, India
D. Unsupervised
It uses only unannotated data. This method is the biggest
challenge for WSD researchers. It assumes that similar
senses occur in similar contexts, and thus senses can be
induced from text by clustering word occurrences using some
measure of similarity of context, a task referred to as word
sense induction or discrimination [9].
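A minimal sketch of sense induction by clustering is given below. It groups the contexts of an ambiguous word with a greedy single-pass procedure and Jaccard similarity; both choices, the threshold value and the toy contexts are assumptions for illustration, and the approach cited above uses richer vector representations of context.

```python
def jaccard(a, b):
    """Jaccard similarity between two sets of words."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def induce_senses(contexts, threshold=0.2):
    """Greedily cluster contexts; each resulting cluster stands for one sense."""
    clusters = []  # each cluster is a list of context word sets
    for ctx in contexts:
        ctx = set(ctx)
        best, best_sim = None, threshold
        for cluster in clusters:
            # Compare the context with all words seen in the cluster so far.
            cluster_vocab = set().union(*cluster)
            sim = jaccard(ctx, cluster_vocab)
            if sim >= best_sim:
                best, best_sim = cluster, sim
        if best is None:
            clusters.append([ctx])  # no similar cluster: start a new sense
        else:
            best.append(ctx)
    return clusters


# Hypothetical, unannotated contexts of the word "beat".
contexts = [
    ["drum", "rhythm", "song"],
    ["heart", "pulse", "blood"],
    ["song", "rhythm", "dance"],
    ["heart", "blood", "doctor"],
]
clusters = induce_senses(contexts)
print(len(clusters))
```

The clusters carry no sense labels; they only discriminate between usages, which is why the task is also called word sense discrimination.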
V. PROPOSED WORK
Our proposed work acts as a layer on top of Google's Page
Rank algorithm, but it can also be used with other ranking algorithms.
In our proposed work we disambiguate the user
query and calculate a dynamic page rank for the results in order to
rearrange them according to user preferences.
Figure 1: Framework for Proposed Algorithm
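The text does not give the formula for the dynamic page rank, so the following is only an illustrative sketch of how a re-ranking layer of this kind could sit on top of an existing ranking. The blending rule, the `weight` parameter, the sense keyword set and the example URLs are all assumptions made for illustration and should not be read as the proposed algorithm itself.

```python
def rerank(results, sense_keywords, weight=0.5):
    """Reorder results by a blend of base score and sense-match score.

    results: list of dicts with "url", "base_score" (0..1) and "snippet".
    sense_keywords: non-empty set of words describing the chosen sense.
    """
    def sense_match(text):
        words = set(text.lower().split())
        return len(words & sense_keywords) / len(sense_keywords)

    scored = [
        (
            (1.0 - weight) * r["base_score"] + weight * sense_match(r["snippet"]),
            r["url"],
        )
        for r in results
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [url for _, url in scored]


# Hypothetical results for the query "beat", ordered by the base ranking.
results = [
    {"url": "example.org/music-beat", "base_score": 0.9,
     "snippet": "how to count the beat in a song rhythm"},
    {"url": "example.org/heart-beat", "base_score": 0.6,
     "snippet": "normal heart beat pulse rate"},
    {"url": "example.org/boat-beat", "base_score": 0.5,
     "snippet": "sail a boat to beat against the wind"},
]
# Assume the user selected the heart sense of the query.
heart_keywords = {"heart", "pulse", "rate"}
reordered = rerank(results, heart_keywords)
print(reordered)
```

With the heart sense selected, the heart-related page moves ahead of the music page even though its base score is lower, which is the kind of rearrangement the layer is meant to produce.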
VI. EVALUATION MEASURES
Different measures are available for evaluating the
performance of Information Retrieval systems. Some of them are:
A. Precision
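Precision is the ratio of relevant retrieved documents to the
retrieved documents [3][7].
\[ \text{Precision} = \frac{|\{\text{relevant documents}\} \cap \{\text{retrieved documents}\}|}{|\{\text{retrieved documents}\}|} \tag{1} \]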
B. Recall
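Recall is the ratio of relevant retrieved documents to the
relevant documents [3][7].
\[ \text{Recall} = \frac{|\{\text{relevant documents}\} \cap \{\text{retrieved documents}\}|}{|\{\text{relevant documents}\}|} \tag{2} \]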
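As a small worked example of precision and recall, the sketch below computes both measures from sets of document identifiers. The identifiers and function names are hypothetical.

```python
def precision(retrieved, relevant):
    """Fraction of retrieved documents that are relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(retrieved) if retrieved else 0.0


def recall(retrieved, relevant):
    """Fraction of relevant documents that were retrieved."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(relevant) if relevant else 0.0


# Four documents are retrieved; three documents are relevant overall.
retrieved = ["d1", "d2", "d3", "d4"]
relevant = {"d1", "d3", "d5"}
p = precision(retrieved, relevant)  # 2 of 4 retrieved are relevant
r = recall(retrieved, relevant)     # 2 of 3 relevant were retrieved
print(p, r)
```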
C. Average Precision
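Average precision computes the average value of the precision $p(r)$,
taken as a function of recall $r$, over the interval from $r = 0$ to $r = 1$ [3].
\[ \text{AveP} = \int_{0}^{1} p(r)\,dr \tag{3} \]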
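In practice average precision is usually computed as a finite sum over the ranked result list: the precision at each rank where a relevant document appears is summed and divided by the number of relevant documents. The sketch below follows that common discrete form; the document identifiers are hypothetical.

```python
def average_precision(ranked, relevant):
    """Mean of the precision values at each rank holding a relevant document."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits, total = 0, 0.0
    for k, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / k  # precision at rank k
    return total / len(relevant)


# Relevant documents appear at ranks 1 and 3.
ap = average_precision(["d1", "d2", "d3", "d4"], {"d1", "d3"})
print(round(ap, 4))  # (1/1 + 2/3) / 2
```

Because relevant documents found lower in the list contribute smaller precision values, a ranking that places relevant pages first obtains a higher score.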
D. Mean average Precision
Mean average precision for a set of queries is the mean of
the average precision scores for each query [3].
\[ \text{MAP} = \frac{\sum_{q=1}^{Q} \text{AveP}(q)}{Q} \tag{4} \]
where $Q$ is the number of queries.
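A minimal sketch of mean average precision over several queries follows. Average precision is computed here in its usual discrete form, and the ranked lists and relevance judgments are hypothetical; they are not the experimental data of this paper.

```python
def average_precision(ranked, relevant):
    """Discrete average precision of one ranked result list."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits, total = 0, 0.0
    for k, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / k
    return total / len(relevant)


def mean_average_precision(runs):
    """runs: list of (ranked_results, relevant_set) pairs, one per query."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Two hypothetical queries with their ranked results and relevant documents.
runs = [
    (["d1", "d2", "d3"], {"d1", "d3"}),  # AP = (1/1 + 2/3) / 2 = 5/6
    (["d4", "d5"], {"d5"}),              # AP = (1/2) / 1 = 1/2
]
map_score = mean_average_precision(runs)
print(round(map_score, 4))
```

For an ambiguous word, each sense of the query can be treated as one entry in `runs`, so that a single score summarizes the ranking quality across all senses.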
VII. EXPERIMENTAL RESULTS
There are various ambiguous words; we have chosen beat,
which has various senses in the contexts of heart, boat and music.
The results of the simulation based on the Page Rank
algorithm and on our Dynamic Page Rank algorithm are shown
in Figures 2, 3, 4 and 5. Comparative graphs of Page Rank and
Dynamic Page Rank for the values of Average Precision and
Mean Average Precision are given in Figures 6 and 7.
Figure 2: Results according to Page Rank
Figure 3: Results for beat (heart)
Figure 4: Results for beat (boat)
Figure 5: Results for beat (music)
Figure 6: Comparative graph for Average Precision
Figure 7: Comparative graph for Mean Average Precision
VIII. CONCLUSION
There are numerous algorithms available for
ranking web pages, and each works from a different perspective.
The Page Rank algorithm is used by Google, which is the most popular
search engine nowadays. To resolve the ambiguity faced by
the user, we have proposed a dynamic page rank algorithm
that acts as a layer on top of the Page Rank algorithm and
provides the user with more appropriate and relevant results.
To validate the work we have included the Mean Average Precision
measure.
REFERENCES
[1] Ashutosh Kumar Singh, Ravi Kumar P, "A Comparative study of Page
Ranking Algorithms for Information Retrieval", International Journal
of Electrical and Computer Engineering 4:7 2009
[2] Daniel Jurafsky and James H. Martin, Speech and Language
Processing, Pearson Prentice Hall, 2009
[3] "Information Retrieval" available at
http://en.wikipedia.org/wiki/Information_retrieval
[4] J. Kleinberg, "Authoritative Sources in a Hyper-Linked Environment",
Journal of the ACM 46(5), pp. 604-632, 1999.
[5] J. Kleinberg, "Hubs, Authorities and Communities", ACM Computing
Surveys, 31(4), 1999.
[6] L. Page, S. Brin, R. Motwani, and T. Winograd, "The Pagerank
Citation Ranking: Bringing order to the Web", Technical Report,
Stanford Digital Libraries SIDL-WP-1999-0120, 1999.
[7] "Precision and Recall" available at
http://en.wikipedia.org/wiki/Precision_and_recall
[8] S. Brin and L. Page, "The Anatomy of a Large Scale Hypertextual Web
Search Engine", Computer Networks and ISDN Systems, Vol. 30, Issue
1-7, pp. 107-117, 1998.
[9] Schütze, H., "Automatic word sense discrimination", Computational
Linguistics, 24(1): 97–123, 1998.
[10] W. Xing and Ali Ghorbani, "Weighted PageRank Algorithm", Proc. of
the Second Annual Conference on Communication Networks and
Services Research (CNSR ’04), IEEE, 2004.
[11] "Word Sense Disambiguation", available at
http://en.wikipedia.org/wiki/Word-sense_disambiguation
AUTHORS
Rekha Jain completed her Master's Degree in
Computer Science at Kurukshetra University in
2004. She is now working as an Assistant Professor in
the "Apaji Institute of Mathematics &
Applied Computer Technology" at Banasthali
University, Rajasthan, and is pursuing a Ph.D. under
the supervision of Prof. G. N. Purohit. Her current
research interests include Web Mining, Semantic
Web and Data Mining. She has various national
and international publications and conference papers.
Rupal Bhargava is pursuing her M.Tech in
Computer Science at Banasthali Vidyapith,
Rajasthan. She is undergoing her M.Tech training
under the supervision of Mrs. Rekha Jain. Her
current research interests include Web Mining,
Semantic Web and Data Mining. She has
published various papers in conferences and
journals.
Prof. G. N. Purohit is a Professor in the
Department of Mathematics & Statistics at
Banasthali University (Rajasthan). Before joining
Banasthali University, he was Professor and Head
of the Department of Mathematics, University of
Rajasthan, Jaipur. He has been Chief Editor of a
research journal and a regular reviewer for many
journals. His present interests are in O.R., Discrete
Mathematics and Communication Networks. He
has published around 40 research papers in various
journals.

Volume 2-issue-6-2119-2124
 
Volume 2-issue-6-2139-2142
Volume 2-issue-6-2139-2142Volume 2-issue-6-2139-2142
Volume 2-issue-6-2139-2142
 
Volume 2-issue-6-2130-2138
Volume 2-issue-6-2130-2138Volume 2-issue-6-2130-2138
Volume 2-issue-6-2130-2138
 
Volume 2-issue-6-2125-2129
Volume 2-issue-6-2125-2129Volume 2-issue-6-2125-2129
Volume 2-issue-6-2125-2129
 
Volume 2-issue-6-2114-2118
Volume 2-issue-6-2114-2118Volume 2-issue-6-2114-2118
Volume 2-issue-6-2114-2118
 
Volume 2-issue-6-2108-2113
Volume 2-issue-6-2108-2113Volume 2-issue-6-2108-2113
Volume 2-issue-6-2108-2113
 
Volume 2-issue-6-2102-2107
Volume 2-issue-6-2102-2107Volume 2-issue-6-2102-2107
Volume 2-issue-6-2102-2107
 

Recently uploaded

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 

Recently uploaded (20)

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 

Ijarcet vol-2-issue-4-1339-1341

  • 1. ISSN: 2278 – 1323 International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) Volume 2, Issue 4, April 2013 1339 All Rights Reserved © 2013 IJARCET

Abstract: With the increasing number of user queries submitted to search engines, it has become necessary to study the area of Information Retrieval. Users always expect to get the most appropriate and relevant results for a query, but when the query is ambiguous they usually do not get what they actually want. This paper deals with the ambiguity of the query and helps provide the user with relevant results. To validate the work, we have used the evaluation measure Mean Average Precision.

Keywords: Information Retrieval, Page Rank Algorithm, Precision, Average Precision, Mean Average Precision

I. INTRODUCTION
In recent years the number of user queries submitted to search engines has increased greatly. People rely mostly on search engines to answer their queries, and they expect the most relevant results, delivered efficiently. Search engines face many problems in doing so; one of them is query ambiguity. For example, the word beat has several meanings: it can relate to a boat, to music, to the police, and so on. Such ambiguity in words calls for in-depth study in areas such as Information Retrieval and NLP. In this paper we discuss a few aspects of these areas and propose an algorithm that provides the user with more relevant results by disambiguating the user's queries. We also include some evaluation measures to validate the work.

II. RANKING ALGORITHMS
Web search engines use different ranking algorithms to provide the user with relevant results. Ranking algorithms rank web pages so that results are presented to the user in order of priority. Different ranking algorithms use different aspects of the user query to provide satisfactory, relevant results.
They are based on Web Structure Mining, Web Usage Mining, or Web Content Mining. All three mining fields play a major role in Information Retrieval. Many ranking algorithms have been proposed and implemented so far, such as the Page Rank algorithm used by Google [6][8], HITS (Hyperlink-Induced Topic Search) [4][5], and WPR (Weighted Page Rank) [10].

III. WORD SENSE DISAMBIGUATION
Word Sense Disambiguation (WSD) is the task of automatically assigning a sense to a polysemous word in a given context. It helps find the correct meaning, or sense, of a sentence or query that has multiple meanings. It is a challenging and important area of NLP that helps improve the performance of information retrieval, information extraction, and related tasks. When retrieving search results, it is important to eliminate the senses of the query or word that are not relevant to the user. Word Sense Disambiguation is the process that provides the direction for resolving such queries [9][11].

IV. WORD SENSE DISAMBIGUATION APPROACHES
There are two main approaches to WSD [11]. The Deep Approach presumes access to a body of world knowledge. The Shallow Approach does not try to understand the text; instead it uses the surrounding words to assign a meaning. There are also four conventional approaches to WSD:

A. Dictionary and Knowledge based method
This method relies on dictionaries, thesauri, and knowledge bases; it does not use a corpus for training. It assumes that words used together in a text are related, and that the relation can be derived from the meanings of the words and their senses of usage [11].

B. Supervised
This method assumes that the context of the text is enough to disambiguate a word. It uses a sense-annotated corpus to train on [11].

C. Semi-Supervised or Minimally Supervised
This method is used by many word sense disambiguation algorithms because of the lack of training sets. It uses both annotated and unannotated data.
A small set of annotated data is used to train an initial classifier. The initial classifier is then used to label part of the larger unannotated data, and the newly labelled data is added to the training set. This process is repeated until the whole corpus is consumed [9][11].

Disambiguation of User Queries in Search Engines
Rekha Jain*1, Rupal Bhargava*2, G.N Purohit*3
*Department of Computer Science, Banasthali University, Jaipur, India
  • 2. D. Unsupervised
This method uses only unannotated data and is the biggest challenge for WSD researchers. It assumes that similar senses occur in similar contexts, so senses can be induced from text by clustering word occurrences using some measure of context similarity, a task referred to as word sense induction or discrimination [9].

V. PROPOSED WORK
Our proposed work acts as a layer on top of Google's Page Rank algorithm, but it could equally be applied on top of other ranking algorithms. We disambiguate the user query and calculate a dynamic page rank for the results in order to rearrange them according to the user's preferences.

Figure 1: Framework for Proposed Algorithm

VI. EVALUATION MEASURES
Different measures are available for evaluating the performance of Information Retrieval. Some of them are:

A. Precision
Precision is the ratio of relevant retrieved documents to retrieved documents [3][7].
$$\text{Precision} = \frac{|\{\text{relevant documents}\} \cap \{\text{retrieved documents}\}|}{|\{\text{retrieved documents}\}|} \tag{1}$$

B. Recall
Recall is the ratio of relevant retrieved documents to relevant documents [3][7].
$$\text{Recall} = \frac{|\{\text{relevant documents}\} \cap \{\text{retrieved documents}\}|}{|\{\text{relevant documents}\}|} \tag{2}$$

C. Average Precision
Average precision computes the average value of the precision $p(r)$, taken as a function of recall $r$, over the interval from $r = 0$ to $r = 1$ [3].
$$\text{AveP} = \int_0^1 p(r)\,dr \tag{3}$$

D. Mean Average Precision
Mean average precision for a set of queries is the mean of the average precision scores for each query [3], where $Q$ is the number of queries.
$$\text{MAP} = \frac{1}{Q}\sum_{q=1}^{Q} \text{AveP}(q) \tag{4}$$

VII. EXPERIMENTAL RESULTS
There are many ambiguous words; we chose beat, which has senses in the context of heart, boat, and music. The results of the simulation with the Page Rank algorithm and with our Dynamic Page Rank algorithm are shown in Figures 2, 3, 4, and 5. Comparative graphs of Page Rank and Dynamic Page Rank for the values of Average Precision and Mean Average Precision are given in Figures 6 and 7.
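The evaluation measures of Section VI can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the paper does not say how Average Precision was computed in practice, so the sketch assumes the common finite-sum form (precision at each rank where a relevant document appears, averaged over the number of relevant documents), which approximates the integral of precision over recall. The function names and the document identifiers (`d1`, `d2`, ...) are hypothetical.

```python
from typing import Iterable, Sequence, Set, Tuple


def precision(retrieved: Sequence[str], relevant: Set[str]) -> float:
    """Fraction of retrieved documents that are relevant."""
    if not retrieved:
        return 0.0
    hits = sum(1 for doc in retrieved if doc in relevant)
    return hits / len(retrieved)


def recall(retrieved: Sequence[str], relevant: Set[str]) -> float:
    """Fraction of relevant documents that were retrieved."""
    if not relevant:
        return 0.0
    hits = len(set(retrieved) & relevant)
    return hits / len(relevant)


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Finite-sum Average Precision for one ranked result list (assumed form)."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for k, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / k  # precision at rank k
    return total / len(relevant)


def mean_average_precision(
    runs: Iterable[Tuple[Sequence[str], Set[str]]]
) -> float:
    """Mean of the per-query Average Precision scores."""
    runs = list(runs)
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical ranked results and relevance judgements for two queries.
query_1 = (["d1", "d2", "d3", "d4", "d5"], {"d1", "d3", "d5"})
query_2 = (["d2", "d1"], {"d1"})

print(precision(*query_1))
print(recall(*query_1))
print(average_precision(*query_1))
print(mean_average_precision([query_1, query_2]))
```

For the first hypothetical query, relevant documents appear at ranks 1, 3, and 5, so the Average Precision is (1/1 + 2/3 + 3/5) / 3 = 34/45. A ranking that moves relevant documents toward the top raises this value, which is why the measure suits a comparison of two orderings of the same result set.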
Figure 2: Results according to Page Rank
Figure 3: Results for beat (heart)
Figure 4: Results for beat (boat)
  • 3. Figure 5: Results for beat (music)
Figure 6: Comparative graph for Average Precision
Figure 7: Comparative graph for Mean Average Precision

VIII. CONCLUSION
Numerous algorithms are available for ranking web pages, and each works from a different perspective. The Page Rank algorithm is used by Google, currently the most popular search engine. To resolve the ambiguity faced by the user, we have proposed a dynamic page rank algorithm that acts as a layer on top of the Page Rank algorithm and provides the user with more appropriate and relevant results. To validate the work, we have included the Mean Average Precision measure.

REFERENCES
[1] Ashutosh Kumar Singh, Ravi Kumar P, "A Comparative study of Page Ranking Algorithms for Information Retrieval", International Journal of Electrical and Computer Engineering 4:7, 2009.
[2] Daniel Jurafsky and James H. Martin, Speech and Language Processing, Pearson Prentice Hall, 2009.
[3] "Information Retrieval", available at http://en.wikipedia.org/wiki/Information_retrieval
[4] J. Kleinberg, "Authoritative Sources in a Hyper-Linked Environment", Journal of the ACM 46(5), pp. 604-632, 1999.
[5] J. Kleinberg, "Hubs, Authorities and Communities", ACM Computing Surveys, 31(4), 1999.
[6] L. Page, S. Brin, R. Motwani, and T. Winograd, "The Pagerank Citation Ranking: Bringing order to the Web", Technical Report, Stanford Digital Libraries SIDL-WP-1999-0120, 1999.
[7] "Precision and Recall", available at http://en.wikipedia.org/wiki/Precision_and_recall
[8] S. Brin and L. Page, "The Anatomy of a Large Scale Hypertextual Web Search Engine", Computer Networks and ISDN Systems, Vol. 30, Issue 1-7, pp. 107-117, 1998.
[9] H. Schütze, "Automatic word sense discrimination", Computational Linguistics, 24(1), pp. 97–123, 1998.
[10] W. Xing and Ali Ghorbani, "Weighted PageRank Algorithm", Proc. of the Second Annual Conference on Communication Networks and Services Research (CNSR '04), IEEE, 2004.
[11] "Word Sense Disambiguation", available at http://en.wikipedia.org/wiki/Word-sense_disambiguation

AUTHORS
Rekha Jain completed her Master's degree in Computer Science at Kurukshetra University in 2004. She is now working as an Assistant Professor in the "Apaji Institute of Mathematics & Applied Computer Technology" at Banasthali University, Rajasthan, and is pursuing a Ph.D. under the supervision of Prof. G. N. Purohit. Her current research interests include Web Mining, Semantic Web and Data Mining. She has various national and international publications and conference papers.

Rupal Bhargava is pursuing her M.Tech in Computer Science at Banasthali Vidyapith, Rajasthan. She is carrying out her M.Tech training under the supervision of Mrs. Rekha Jain. Her current research interests include Web Mining, Semantic Web and Data Mining. She has published various papers in conferences and journals.

Prof. G. N. Purohit is a Professor in the Department of Mathematics & Statistics at Banasthali University (Rajasthan). Before joining Banasthali University, he was Professor and Head of the Department of Mathematics, University of Rajasthan, Jaipur. He has been chief editor of a research journal and a regular reviewer for many journals. His present interests are in O.R., Discrete Mathematics and Communication Networks. He has published around 40 research papers in various journals.