SlideShare a Scribd company logo
1 of 3
Download to read offline
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
__________________________________________________________________________________________
Volume: 02 Issue: 12 | Dec-2013, Available @ http://www.ijret.org 559
EFFICIENTLY SEARCHING NEAREST NEIGHBOR IN DOCUMENTS
USING KEYWORDS
Sonal S. Kasare1
, Anup Bongale2
1
Lecturer, 2
Asst. Prof., Computer Engineering, Dr. D. Y. Patil School of Engg, Maharashtra, India,
sonalkasare@gmail.com, anup.dypcoe@gmail.com
Abstract
Conservative spatial queries, such as range search and nearest neighbor reclamation, involve only conditions on objects’ numerical
properties. Today, many modern applications call for novel forms of queries that aim to find objects satisfying both a spatial
predicate, and a predicate on their associated texts. For example, instead of considering all the restaurants, a nearest neighbor query
would instead ask for the restaurant that is the closest among those whose menus contain “steak, spaghetti, brandy” all at the same
time. Currently the best solution to such queries is based on the InformationRetrieval2-tree, which, has a few deficiencies that
seriously impact its efficiency. Motivated by this, there is a development of a new access method called the spatial inverted index that
extends the conventional inverted index to cope with multidimensional data, and comes with algorithms that can answer nearest
neighbor queries with keywords in real time. As verified by experiments, the proposed techniques outperform the
InformationRetrieval2-tree in query response time significantly, often by a factor of orders of magnitude
Keywords: Information retrieval, spatial index, keyword search
-----------------------------------------------------------------------***-----------------------------------------------------------------------
1. INTRODUCTION
Keyword search in document performed with various
approaches ranked retrieval results, clustering search results &
identifying the nearest neighbor Keyword search on xml
document categorized as two different approaches one is
Keyword search on xml document which can be performed by
ranking the searched results based on match or the answer to
keyword & finding the nearest neighbor of keyword by using
GST or by Xpath Query. In paper [2] the problem of returning
clustered results for keyword search on documents the core of
the semantics is the conceptually related relationship between
keyword matches, which is based on the conceptual
relationship between nodes in trees. Then, we propose a new
clustering methodology for search results, which clusters
results according to the way they match the given query.
A spatial database manages multidimensional objects (such as
points, rectangles, etc.), and provides fast access to those
objects based on different selection criteria. The importance of
spatial databases is reflected by the convenience of modeling
entities of reality in a geometric manner. For example,
locations of restaurants, hotels, hospitals and so on are often
represented as points in a map, while larger extents such as
parks, lakes, and landscapes often as a combination of
rectangles. Many functionalities of a spatial database are
useful in various ways in specific contexts. For instance, in a
geography information system, range search can be deployed
to find all restaurants in a certain area, while nearest neighbor
retrieval can discover the restaurant closest to a given address.
Today, the widespread use of search engines has made it
realistic to write spatial queries in a brand new way.
Conventionally, queries focus on objects‟ geometric properties
only, such as whether a point is in a rectangle, or how close
two points are from each other. We have seen some modern
applications that call for the ability to select objects based on
both of their geometric coordinates and their associated texts.
For example, it would be fairly useful if a search engine can
be used to find the nearest restaurant that offers “steak,
spaghetti, and brandy” all at the same time. Note that this is
not the “globally” nearest restaurant (which would have been
returned by a traditional nearest neighbor query), but the
nearest restaurant among only those providing all the
demanded foods and drinks.
There are easy ways to support queries that combine spatial
and text features. For example, for the above query, we could
first fetch all the restaurants whose menus contain the set of
keywords {steak, spaghetti, brandy}, and then from the
retrieved restaurants, find the nearest one. Similarly, one could
also do it reversely by targeting first the spatial conditions –
browse all the restaurants in ascending order of their distances
to the query point until encountering one whose menu has all
the keywords. The major drawback of these straightforward
approaches is that they will fail to provide real time answers
on difficult inputs. A typical example is that the real nearest
neighbor lies quite far away from the query point, while all the
closer neighbors are missing at least one of the query
keywords. Spatial queries with keywords have not been
extensively explored. In the past years, the community has
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
__________________________________________________________________________________________
Volume: 02 Issue: 12 | Dec-2013, Available @ http://www.ijret.org 560
sparked enthusiasm in studying keyword search in relational
databases. It is until recently that attention was diverted to
multidimensional data.
The boom of Internet has given rise to an ever increasing
amount of text data associated with multiple dimensions
(attributes), for example, customer reviews in shopping
websites (e.g., Amazon) are always associated with attributes
like price, model, and rate. A traditional OLAP data cube can
be naturally extended to summarize and navigate structured
data together with unstructured text data. Such a cube model is
called text cube [1]. A cell in the text cube aggregates a set of
documents/items with matching attribute values in a subset of
dimensions. Keyword query, one of the most popular and
easy-to-use ways to retrieve useful information from a
collection of plain documents, is being extended to RDBMSs
to retrieve information from text-rich attributes [2], [3]. Given
a set of keywords, existing methods aim to find relevant items
or joins of items (e.g., linked by foreign keys) that contain all
or some of the keywords.
Traditional IR techniques can be used to rank documents
according to the relevance. In a large text database, however,
the number of relevant documents to a query could be large,
and a user has to spend much time reading them. If a
document is associated with attribute information, in a data
cube model (a multidimensional space induced by the
attributes), e.g., the text cube, a cell aggregates the documents
with matching values in a subset of attributes. Such a
collection of documents is associated with each cell,
corresponding to an object that can be directly recommended
to the user for the given query. This paper studies the problem
of keyword-based top-k search in text cube, i.e., given a
keyword query, find the top-k most relevant cells in a text
cube. When users want to retrieve information from a text
cube using keyword question, believe that relevant cells,
rather than relevant documents, are preferred as the answers,
because: 1) relevant cells are easy for users to browse; and 2)
relevant cells provide users insights about the relationship
between the values of relational attributes and the text data.
2. PROBLEM DEFINITION
1. Implement k nearest neighbor search algorithm using mat
lab for given data set and to find out closest point from give
query also analyze the result fetch time and accuracy.
2. Implement the Inverted index algorithm by extending point
the k nearest neighbor and forming R tree to find closest point
from given set of query and also analyze the result against
time and result accuracy.
3. LITERATURE SURVEY
In the paper „fast nearest neighbor search with keywords‟,
there are methods like spatial index, inverted index, nearest
neighbor search. The first method spatial index is used for
creating indices because there is huge amount of data need to
be stored for searching that data stored in the form of xml
documents. If the data storage created in the form of indices
then space required is less also time needed for searching the
keyword is less.
Second method is inverted index. The inverted index data
structure is a central component of a typical search engine
indexing algorithm. A goal of a search engine performance is
to optimize the speed of the query: find the documents where
word occurs. Once an index is developed, which provisions
lists of words per document; it is next inverted to develop an
inverted index. Querying the index would require sequential
iteration through each document and to each word to verify a
matching document. The time memory and processing
property to execute such a query are not always theoretically
realistic. Instead of listing the words per article in the index,
the inverted index data structure is developed which lists the
documents per word. The inverted index produced, the query
can now be determined by jumping to the word id in the
inverted index. These were effectively inverted indexes with a
small amount of supplementary explanation that required a
implausible amount of attempt to produce.
Third method is nearest neighbor search. Nearest neighbor
search (NNS), also identified as closeness search, parallel
search is an optimization problem for finding closest points
in metric spaces. In the paper „Efficient Keyword-Based
Search for Top-K Cells in Text Cube‟ methods used are
inverted-index one-scan, document sorted-scan, bottom-up
dynamic programming, and search-space ordering. In the top k
cells, there is a searching of nearest key to the query. Cubes
forms clusters of single unique group which shows its identity.
Method like inverted index used for giving index rather than
providing whole data which can be space consuming.
4. COMPARITIVE STUDY
Table -1: Comparison of Depth first search and inverted index
ParametersMethods Depth first
search
Inverted index
Space More Less
Time complexity O(n+m) O(n)
Working Slow Efficient
Methods used before is depth first search which required more
space because it needs to travel through left and right trees
respectively. Rather in inverted indexing there is less space
required as there is use of indices. Next is time complexity, for
DFS its O(n+m) ( n for edges and m for nodes). While in
inverted index there is time required is O(n) that is for single
index. Also DFS works slowly and inverted indexing is
efficient to use.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
__________________________________________________________________________________________
Volume: 02 Issue: 12 | Dec-2013, Available @ http://www.ijret.org 561
CONCLUSIONS
By studying above methods the main motto is searching a
relevant keyword with meaningful information with minimum
time with valid results. I can conclude with this there should
be efficient method to identify the keyword, grouping those
results by using any one of clustering methods. The major
drawbacks in paper [1] are if two nodes have same keyword
then how to differentiate the keyword with the searched one.
In paper [2] there can be come up with indexing the cluster
results will be more if cluster is dynamically growing, or we
can form clusters of clusters.
REFERENCES
[1]. S. Agrawal S. Chaudhuri, and G. Das. Dbxplorer: A
system for keyword-based search over relational
databases. In Proc. Of International Conference on
Data Engineering (ICDE), pages 5–16, 2002.
[2]. N. Beckmann, H. Kriegel, R. Schneider, and B.
Seeger. The R*-tree: An efficient and robust access
method for points and rectangles. In Proc. of ACM
Management of Data (SIGMOD), pages 322–331,
1990.
[3]. G. Bhalotia, A. Hulgeri, C. Nakhe, S. Chakrabarti,
and S. Sudarshan. Keyword searching and browsing
in databases using banks. In Proc. of International
Conference on Data Engineering (ICDE), pages 431–
440, 2002.
[4]. X. Cao, L. Chen, G. Cong, C. S. Jensen, Q. Qu, A.
Skovsgaard, D. Wu, and M. L. Yiu. Spatial keyword
querying. In ER, pages 16–29, 2012. X. Cao, G.
Cong, and C. S. Jensen. Retrieving top-k prestige-
based relevant spatial web objects. PVLDB,
3(1):373–384, 2010.
[5]. X. Cao, G. Cong, C. S. Jensen, and B. C. Ooi.
Collective spatial keyword querying. In Proc. of
ACM Management of Data (SIGMOD), pages 373–
384, 2011.
[6]. B. Chazelle, J. Kilian, R. Rubinfeld, and A. Tal. The
bloomier filter: an efficient data structure for static
support lookup tables. In Proc. of the Annual ACM-
SIAM Symposium on Discrete Algorithms (SODA),
pages 30–39, 2004.
[7]. Y.-Y. Chen, T. Suel, and A. Markowetz. Efficient
query processing in geographic web search engines.
In Proc. of ACM Management of Data (SIGMOD),
pages 277–288, 2006.
[8]. E. Chu, A. Baid, X. Chai, A. Doan, and J. Naughton.
Combining keyword search and forms for ad hoc
querying of databases. In Proc. of ACM Management
of Data (SIGMOD), 2009.
[9]. G. Cong, C. S. Jensen, and D. Wu. Efficient retrieval
of the top-k most relevant spatial web objects.
PVLDB, 2(1):337–348, 2009.
[10]. C. Faloutsos and S. Christodoulakis. Signature files:
An access method for documents and its analytical
performance evaluation. ACM Transactions on
Information Systems (TOIS), 2(4):267–288, 1984.
[11]. I. D. Felipe, V. Hristidis, and N. Rishe. Keyword
search on spatial databases. In Proc. of International
Conference on Data Engineering (ICDE), pages 656–
665, 2008.
[12]. R. Hariharan, B. Hore, C. Li, and S. Mehrotra.
Processing spatialkeyword (SK) queries in
geographic information retrieval (GIR) systems. In
Proc. of Scientific and Statistical Database
Management (SSDBM), 2007.
[13]. G. R. Hjaltason and H. Samet. Distance browsing in
spatial databases. ACM Transactions on Database
Systems (TODS), 24(2):265–318, 1999.
[14]. V. Hristidis and Y. Papakonstantinou. Discover:
Keyword search in relational databases. In Proc. of
Very Large Data Bases (VLDB), pages 670–681,
2002.
[15]. I. Kamel and C. Faloutsos. Hilbert R-tree: An
improved r-tree using fractals. In Proc. of Very Large
Data Bases (VLDB), pages 500–509, 1994.

More Related Content

What's hot

Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywords Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywords Papitha Velumani
 
Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywordsFast nearest neighbor search with keywords
Fast nearest neighbor search with keywordsLeMeniz Infotech
 
Context Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic WebContext Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic WebIOSR Journals
 
Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywordsFast nearest neighbor search with keywords
Fast nearest neighbor search with keywordsIEEEFINALYEARPROJECTS
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEEFINALYEARSTUDENTPROJECTS
 
Keyword Query Routing
Keyword Query RoutingKeyword Query Routing
Keyword Query RoutingSWAMI06
 
Computing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engineComputing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search enginecsandit
 
JPJ1422 Fast Nearest Neighbour Search With Keywords
JPJ1422   Fast Nearest Neighbour Search With KeywordsJPJ1422   Fast Nearest Neighbour Search With Keywords
JPJ1422 Fast Nearest Neighbour Search With Keywordschennaijp
 
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...ijdms
 
Dr31564567
Dr31564567Dr31564567
Dr31564567IJMER
 
Annotating Search Results from Web Databases
Annotating Search Results from Web Databases Annotating Search Results from Web Databases
Annotating Search Results from Web Databases Mohit Sngg
 
Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation ijmpict
 
A03202001005
A03202001005A03202001005
A03202001005theijes
 
Using Page Size for Controlling Duplicate Query Results in Semantic Web
Using Page Size for Controlling Duplicate Query Results in Semantic WebUsing Page Size for Controlling Duplicate Query Results in Semantic Web
Using Page Size for Controlling Duplicate Query Results in Semantic WebIJwest
 

What's hot (16)

Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywords Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywords
 
Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywordsFast nearest neighbor search with keywords
Fast nearest neighbor search with keywords
 
Context Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic WebContext Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic Web
 
Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywordsFast nearest neighbor search with keywords
Fast nearest neighbor search with keywords
 
G1803054653
G1803054653G1803054653
G1803054653
 
Keyword query routing
Keyword query routingKeyword query routing
Keyword query routing
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
 
Keyword Query Routing
Keyword Query RoutingKeyword Query Routing
Keyword Query Routing
 
Computing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engineComputing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engine
 
JPJ1422 Fast Nearest Neighbour Search With Keywords
JPJ1422   Fast Nearest Neighbour Search With KeywordsJPJ1422   Fast Nearest Neighbour Search With Keywords
JPJ1422 Fast Nearest Neighbour Search With Keywords
 
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
 
Dr31564567
Dr31564567Dr31564567
Dr31564567
 
Annotating Search Results from Web Databases
Annotating Search Results from Web Databases Annotating Search Results from Web Databases
Annotating Search Results from Web Databases
 
Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation
 
A03202001005
A03202001005A03202001005
A03202001005
 
Using Page Size for Controlling Duplicate Query Results in Semantic Web
Using Page Size for Controlling Duplicate Query Results in Semantic WebUsing Page Size for Controlling Duplicate Query Results in Semantic Web
Using Page Size for Controlling Duplicate Query Results in Semantic Web
 

Viewers also liked

Science-based pathway for long-term solution to fire and haze
Science-based pathway for long-term solution to fire and hazeScience-based pathway for long-term solution to fire and haze
Science-based pathway for long-term solution to fire and hazeCIFOR-ICRAF
 
Developing sustainable wood fuel cooking systems
Developing sustainable wood fuel cooking systemsDeveloping sustainable wood fuel cooking systems
Developing sustainable wood fuel cooking systemsWorld Agroforestry (ICRAF)
 
Certified Organic Shrimp: A New Approach to Mangrove PES?
Certified Organic Shrimp: A New Approach to Mangrove PES?Certified Organic Shrimp: A New Approach to Mangrove PES?
Certified Organic Shrimp: A New Approach to Mangrove PES?CIFOR-ICRAF
 
Prospects for REDD+ in swidden landscapes
Prospects for REDD+ in swidden landscapesProspects for REDD+ in swidden landscapes
Prospects for REDD+ in swidden landscapesCIFOR-ICRAF
 
Semicompatibility and fixed point theorem in fuzzy metric space using implici...
Semicompatibility and fixed point theorem in fuzzy metric space using implici...Semicompatibility and fixed point theorem in fuzzy metric space using implici...
Semicompatibility and fixed point theorem in fuzzy metric space using implici...eSAT Journals
 
Local governance, social networks and REDD+: Lessons from swidden communities...
Local governance, social networks and REDD+: Lessons from swidden communities...Local governance, social networks and REDD+: Lessons from swidden communities...
Local governance, social networks and REDD+: Lessons from swidden communities...CIFOR-ICRAF
 
REDD+ and agroforestry potential in the Democratic Republic of Congo
REDD+ and agroforestry potential in the Democratic Republic of CongoREDD+ and agroforestry potential in the Democratic Republic of Congo
REDD+ and agroforestry potential in the Democratic Republic of CongoWorld Agroforestry (ICRAF)
 
기업용 코딩교육제안서 공유
기업용 코딩교육제안서 공유기업용 코딩교육제안서 공유
기업용 코딩교육제안서 공유Tommy Park
 
Experimental studies on laterite soil stabilized with cement and aggregate
Experimental studies on laterite soil stabilized with cement and aggregateExperimental studies on laterite soil stabilized with cement and aggregate
Experimental studies on laterite soil stabilized with cement and aggregateeSAT Journals
 
The politics of swidden: A case study from Nghe An and Son La in Vietnam
The politics of swidden:  A case study from Nghe An and Son La in VietnamThe politics of swidden:  A case study from Nghe An and Son La in Vietnam
The politics of swidden: A case study from Nghe An and Son La in VietnamCIFOR-ICRAF
 
The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...
The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...
The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...CIFOR-ICRAF
 
Effect of various process parameters on friction stir welded joint
Effect of various process parameters on friction stir welded jointEffect of various process parameters on friction stir welded joint
Effect of various process parameters on friction stir welded jointeSAT Journals
 
Potential use of iron ore tailings in sandcrete block making
Potential use of iron ore tailings in sandcrete block makingPotential use of iron ore tailings in sandcrete block making
Potential use of iron ore tailings in sandcrete block makingeSAT Journals
 
The Western Ghats Sentinel landscape: a platform to coordinate research efforts
The Western Ghats Sentinel landscape: a platform to coordinate research effortsThe Western Ghats Sentinel landscape: a platform to coordinate research efforts
The Western Ghats Sentinel landscape: a platform to coordinate research effortsForest Trees Sentinel Landscapes
 

Viewers also liked (16)

Science-based pathway for long-term solution to fire and haze
Science-based pathway for long-term solution to fire and hazeScience-based pathway for long-term solution to fire and haze
Science-based pathway for long-term solution to fire and haze
 
Developing sustainable wood fuel cooking systems
Developing sustainable wood fuel cooking systemsDeveloping sustainable wood fuel cooking systems
Developing sustainable wood fuel cooking systems
 
Certified Organic Shrimp: A New Approach to Mangrove PES?
Certified Organic Shrimp: A New Approach to Mangrove PES?Certified Organic Shrimp: A New Approach to Mangrove PES?
Certified Organic Shrimp: A New Approach to Mangrove PES?
 
Prospects for REDD+ in swidden landscapes
Prospects for REDD+ in swidden landscapesProspects for REDD+ in swidden landscapes
Prospects for REDD+ in swidden landscapes
 
Semicompatibility and fixed point theorem in fuzzy metric space using implici...
Semicompatibility and fixed point theorem in fuzzy metric space using implici...Semicompatibility and fixed point theorem in fuzzy metric space using implici...
Semicompatibility and fixed point theorem in fuzzy metric space using implici...
 
Local governance, social networks and REDD+: Lessons from swidden communities...
Local governance, social networks and REDD+: Lessons from swidden communities...Local governance, social networks and REDD+: Lessons from swidden communities...
Local governance, social networks and REDD+: Lessons from swidden communities...
 
REDD+ and agroforestry potential in the Democratic Republic of Congo
REDD+ and agroforestry potential in the Democratic Republic of CongoREDD+ and agroforestry potential in the Democratic Republic of Congo
REDD+ and agroforestry potential in the Democratic Republic of Congo
 
기업용 코딩교육제안서 공유
기업용 코딩교육제안서 공유기업용 코딩교육제안서 공유
기업용 코딩교육제안서 공유
 
Experimental studies on laterite soil stabilized with cement and aggregate
Experimental studies on laterite soil stabilized with cement and aggregateExperimental studies on laterite soil stabilized with cement and aggregate
Experimental studies on laterite soil stabilized with cement and aggregate
 
The politics of swidden: A case study from Nghe An and Son La in Vietnam
The politics of swidden:  A case study from Nghe An and Son La in VietnamThe politics of swidden:  A case study from Nghe An and Son La in Vietnam
The politics of swidden: A case study from Nghe An and Son La in Vietnam
 
The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...
The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...
The equity of REDD+ benefit sharing mechanisms? A multilevel governance analy...
 
Presentacion grupo102058 558 diseño de proyectos_unad
Presentacion grupo102058 558 diseño de proyectos_unadPresentacion grupo102058 558 diseño de proyectos_unad
Presentacion grupo102058 558 diseño de proyectos_unad
 
Effect of various process parameters on friction stir welded joint
Effect of various process parameters on friction stir welded jointEffect of various process parameters on friction stir welded joint
Effect of various process parameters on friction stir welded joint
 
Potential use of iron ore tailings in sandcrete block making
Potential use of iron ore tailings in sandcrete block makingPotential use of iron ore tailings in sandcrete block making
Potential use of iron ore tailings in sandcrete block making
 
The Western Ghats Sentinel landscape: a platform to coordinate research efforts
The Western Ghats Sentinel landscape: a platform to coordinate research effortsThe Western Ghats Sentinel landscape: a platform to coordinate research efforts
The Western Ghats Sentinel landscape: a platform to coordinate research efforts
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 

Similar to Efficiently Searching Nearest Neighbor in Documents Using Keywords

A survey on location based serach using spatial inverted index method
A survey on location based serach using spatial inverted index methodA survey on location based serach using spatial inverted index method
A survey on location based serach using spatial inverted index methodeSAT Journals
 
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywords
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywordsIEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywords
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywordsIEEEFINALYEARSTUDENTPROJECTS
 
Performing Fast Spatial Query Search by Using Ultimate Code Words
Performing Fast Spatial Query Search by Using Ultimate Code WordsPerforming Fast Spatial Query Search by Using Ultimate Code Words
Performing Fast Spatial Query Search by Using Ultimate Code WordsBRNSSPublicationHubI
 
Ontological approach for improving semantic web search results
Ontological approach for improving semantic web search resultsOntological approach for improving semantic web search results
Ontological approach for improving semantic web search resultseSAT Publishing House
 
Ontological approach for improving semantic web search results
Ontological approach for improving semantic web search resultsOntological approach for improving semantic web search results
Ontological approach for improving semantic web search resultseSAT Journals
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search1crore projects
 
ROMAN URDU OPINION MINING SYSTEM (RUOMIS)
ROMAN URDU OPINION MINING SYSTEM (RUOMIS) ROMAN URDU OPINION MINING SYSTEM (RUOMIS)
ROMAN URDU OPINION MINING SYSTEM (RUOMIS) cseij
 
Hybrid geo textual index structure
Hybrid geo textual index structureHybrid geo textual index structure
Hybrid geo textual index structurecseij
 
Scalable Keyword Cover Search using Keyword NNE and Inverted Indexing
Scalable Keyword Cover Search using Keyword NNE and Inverted IndexingScalable Keyword Cover Search using Keyword NNE and Inverted Indexing
Scalable Keyword Cover Search using Keyword NNE and Inverted IndexingIRJET Journal
 
Search engine. Elasticsearch
Search engine. ElasticsearchSearch engine. Elasticsearch
Search engine. ElasticsearchSelecto
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Techniquepaperpublications3
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...IRJET Journal
 
Classification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern MiningClassification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern MiningIOSR Journals
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Editor IJARCET
 
A rough set based hybrid method to text categorization
A rough set based hybrid method to text categorizationA rough set based hybrid method to text categorization
A rough set based hybrid method to text categorizationNinad Samel
 
Cluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector MachineCluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector MachineCSCJournals
 
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywords
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywordsJAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywords
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywordsIEEEGLOBALSOFTTECHNOLOGIES
 
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYINTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYcscpconf
 
Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...ijsrd.com
 

Similar to Efficiently Searching Nearest Neighbor in Documents Using Keywords (20)

A survey on location based serach using spatial inverted index method
A survey on location based serach using spatial inverted index methodA survey on location based serach using spatial inverted index method
A survey on location based serach using spatial inverted index method
 
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywords
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywordsIEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywords
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywords
 
Performing Fast Spatial Query Search by Using Ultimate Code Words
Performing Fast Spatial Query Search by Using Ultimate Code WordsPerforming Fast Spatial Query Search by Using Ultimate Code Words
Performing Fast Spatial Query Search by Using Ultimate Code Words
 
Ontological approach for improving semantic web search results
Ontological approach for improving semantic web search resultsOntological approach for improving semantic web search results
Ontological approach for improving semantic web search results
 
Ontological approach for improving semantic web search results
Ontological approach for improving semantic web search resultsOntological approach for improving semantic web search results
Ontological approach for improving semantic web search results
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search
 
ROMAN URDU OPINION MINING SYSTEM (RUOMIS)
ROMAN URDU OPINION MINING SYSTEM (RUOMIS) ROMAN URDU OPINION MINING SYSTEM (RUOMIS)
ROMAN URDU OPINION MINING SYSTEM (RUOMIS)
 
Hybrid geo textual index structure
Hybrid geo textual index structureHybrid geo textual index structure
Hybrid geo textual index structure
 
Scalable Keyword Cover Search using Keyword NNE and Inverted Indexing
Scalable Keyword Cover Search using Keyword NNE and Inverted IndexingScalable Keyword Cover Search using Keyword NNE and Inverted Indexing
Scalable Keyword Cover Search using Keyword NNE and Inverted Indexing
 
Search engine. Elasticsearch
Search engine. ElasticsearchSearch engine. Elasticsearch
Search engine. Elasticsearch
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Technique
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...
 
Classification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern MiningClassification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern Mining
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020
 
A rough set based hybrid method to text categorization
A rough set based hybrid method to text categorizationA rough set based hybrid method to text categorization
A rough set based hybrid method to text categorization
 
Cluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector MachineCluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector Machine
 
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywords
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywordsJAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywords
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywords
 
Bl24409420
Bl24409420Bl24409420
Bl24409420
 
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYINTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
 
Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...
 

More from eSAT Journals

Mechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavementsMechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavementseSAT Journals
 
Material management in construction – a case study
Material management in construction – a case studyMaterial management in construction – a case study
Material management in construction – a case studyeSAT Journals
 
Managing drought short term strategies in semi arid regions a case study
Managing drought    short term strategies in semi arid regions  a case studyManaging drought    short term strategies in semi arid regions  a case study
Managing drought short term strategies in semi arid regions a case studyeSAT Journals
 
Life cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangaloreLife cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangaloreeSAT Journals
 
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialsLaboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialseSAT Journals
 
Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...eSAT Journals
 
Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...eSAT Journals
 
Influence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizerInfluence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizereSAT Journals
 
Geographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementGeographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementeSAT Journals
 
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...eSAT Journals
 
Factors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concreteFactors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concreteeSAT Journals
 
Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...eSAT Journals
 
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...eSAT Journals
 
Evaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabsEvaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabseSAT Journals
 
Evaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in indiaEvaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in indiaeSAT Journals
 
Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...eSAT Journals
 
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodEstimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodeSAT Journals
 
Estimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniquesEstimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniqueseSAT Journals
 
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...eSAT Journals
 
Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...eSAT Journals
 

More from eSAT Journals (20)

Mechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavementsMechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavements
 
Material management in construction – a case study
Material management in construction – a case studyMaterial management in construction – a case study
Material management in construction – a case study
 
Managing drought short term strategies in semi arid regions a case study
Managing drought    short term strategies in semi arid regions  a case studyManaging drought    short term strategies in semi arid regions  a case study
Managing drought short term strategies in semi arid regions a case study
 
Life cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangaloreLife cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangalore
 
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialsLaboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
 
Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...
 
Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...
 
Influence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizerInfluence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizer
 
Geographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementGeographical information system (gis) for water resources management
Geographical information system (gis) for water resources management
 
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
 
Factors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concreteFactors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concrete
 
Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...
 
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
 
Evaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabsEvaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabs
 
Evaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in indiaEvaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in india
 
Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...
 
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodEstimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn method
 
Estimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniquesEstimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniques
 
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...
 
Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...
 

Recently uploaded

Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 

Recently uploaded (20)

young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 

Efficiently Searching Nearest Neighbor in Documents Using Keywords

  • 1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 02 Issue: 12 | Dec-2013, Available @ http://www.ijret.org 559 EFFICIENTLY SEARCHING NEAREST NEIGHBOR IN DOCUMENTS USING KEYWORDS Sonal S. Kasare1 , Anup Bongale2 1 Lecturer, 2 Asst. Prof., Computer Engineering, Dr. D. Y. Patil School of Engg, Maharashtra, India, sonalkasare@gmail.com, anup.dypcoe@gmail.com Abstract Conservative spatial queries, such as range search and nearest neighbor reclamation, involve only conditions on objects’ numerical properties. Today, many modern applications call for novel forms of queries that aim to find objects satisfying both a spatial predicate, and a predicate on their associated texts. For example, instead of considering all the restaurants, a nearest neighbor query would instead ask for the restaurant that is the closest among those whose menus contain “steak, spaghetti, brandy” all at the same time. Currently the best solution to such queries is based on the InformationRetrieval2-tree, which, has a few deficiencies that seriously impact its efficiency. Motivated by this, there is a development of a new access method called the spatial inverted index that extends the conventional inverted index to cope with multidimensional data, and comes with algorithms that can answer nearest neighbor queries with keywords in real time. As verified by experiments, the proposed techniques outperform the InformationRetrieval2-tree in query response time significantly, often by a factor of orders of magnitude Keywords: Information retrieval, spatial index, keyword search -----------------------------------------------------------------------***----------------------------------------------------------------------- 1. INTRODUCTION Keyword search in document performed with various approaches ranked retrieval results, clustering search results & identifying the nearest neighbor Keyword search on xml document categorized as two different approaches one is Keyword search on xml document which can be performed by ranking the searched results based on match or the answer to keyword & finding the nearest neighbor of keyword by using GST or by Xpath Query. In paper [2] the problem of returning clustered results for keyword search on documents the core of the semantics is the conceptually related relationship between keyword matches, which is based on the conceptual relationship between nodes in trees. Then, we propose a new clustering methodology for search results, which clusters results according to the way they match the given query. A spatial database manages multidimensional objects (such as points, rectangles, etc.), and provides fast access to those objects based on different selection criteria. The importance of spatial databases is reflected by the convenience of modeling entities of reality in a geometric manner. For example, locations of restaurants, hotels, hospitals and so on are often represented as points in a map, while larger extents such as parks, lakes, and landscapes often as a combination of rectangles. Many functionalities of a spatial database are useful in various ways in specific contexts. For instance, in a geography information system, range search can be deployed to find all restaurants in a certain area, while nearest neighbor retrieval can discover the restaurant closest to a given address. Today, the widespread use of search engines has made it realistic to write spatial queries in a brand new way. Conventionally, queries focus on objects‟ geometric properties only, such as whether a point is in a rectangle, or how close two points are from each other. We have seen some modern applications that call for the ability to select objects based on both of their geometric coordinates and their associated texts. For example, it would be fairly useful if a search engine can be used to find the nearest restaurant that offers “steak, spaghetti, and brandy” all at the same time. Note that this is not the “globally” nearest restaurant (which would have been returned by a traditional nearest neighbor query), but the nearest restaurant among only those providing all the demanded foods and drinks. There are easy ways to support queries that combine spatial and text features. For example, for the above query, we could first fetch all the restaurants whose menus contain the set of keywords {steak, spaghetti, brandy}, and then from the retrieved restaurants, find the nearest one. Similarly, one could also do it reversely by targeting first the spatial conditions – browse all the restaurants in ascending order of their distances to the query point until encountering one whose menu has all the keywords. The major drawback of these straightforward approaches is that they will fail to provide real time answers on difficult inputs. A typical example is that the real nearest neighbor lies quite far away from the query point, while all the closer neighbors are missing at least one of the query keywords. Spatial queries with keywords have not been extensively explored. In the past years, the community has
  • 2. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 02 Issue: 12 | Dec-2013, Available @ http://www.ijret.org 560 sparked enthusiasm in studying keyword search in relational databases. It is until recently that attention was diverted to multidimensional data. The boom of Internet has given rise to an ever increasing amount of text data associated with multiple dimensions (attributes), for example, customer reviews in shopping websites (e.g., Amazon) are always associated with attributes like price, model, and rate. A traditional OLAP data cube can be naturally extended to summarize and navigate structured data together with unstructured text data. Such a cube model is called text cube [1]. A cell in the text cube aggregates a set of documents/items with matching attribute values in a subset of dimensions. Keyword query, one of the most popular and easy-to-use ways to retrieve useful information from a collection of plain documents, is being extended to RDBMSs to retrieve information from text-rich attributes [2], [3]. Given a set of keywords, existing methods aim to find relevant items or joins of items (e.g., linked by foreign keys) that contain all or some of the keywords. Traditional IR techniques can be used to rank documents according to the relevance. In a large text database, however, the number of relevant documents to a query could be large, and a user has to spend much time reading them. If a document is associated with attribute information, in a data cube model (a multidimensional space induced by the attributes), e.g., the text cube, a cell aggregates the documents with matching values in a subset of attributes. Such a collection of documents is associated with each cell, corresponding to an object that can be directly recommended to the user for the given query. This paper studies the problem of keyword-based top-k search in text cube, i.e., given a keyword query, find the top-k most relevant cells in a text cube. When users want to retrieve information from a text cube using keyword question, believe that relevant cells, rather than relevant documents, are preferred as the answers, because: 1) relevant cells are easy for users to browse; and 2) relevant cells provide users insights about the relationship between the values of relational attributes and the text data. 2. PROBLEM DEFINITION 1. Implement k nearest neighbor search algorithm using mat lab for given data set and to find out closest point from give query also analyze the result fetch time and accuracy. 2. Implement the Inverted index algorithm by extending point the k nearest neighbor and forming R tree to find closest point from given set of query and also analyze the result against time and result accuracy. 3. LITERATURE SURVEY In the paper „fast nearest neighbor search with keywords‟, there are methods like spatial index, inverted index, nearest neighbor search. The first method spatial index is used for creating indices because there is huge amount of data need to be stored for searching that data stored in the form of xml documents. If the data storage created in the form of indices then space required is less also time needed for searching the keyword is less. Second method is inverted index. The inverted index data structure is a central component of a typical search engine indexing algorithm. A goal of a search engine performance is to optimize the speed of the query: find the documents where word occurs. Once an index is developed, which provisions lists of words per document; it is next inverted to develop an inverted index. Querying the index would require sequential iteration through each document and to each word to verify a matching document. The time memory and processing property to execute such a query are not always theoretically realistic. Instead of listing the words per article in the index, the inverted index data structure is developed which lists the documents per word. The inverted index produced, the query can now be determined by jumping to the word id in the inverted index. These were effectively inverted indexes with a small amount of supplementary explanation that required a implausible amount of attempt to produce. Third method is nearest neighbor search. Nearest neighbor search (NNS), also identified as closeness search, parallel search is an optimization problem for finding closest points in metric spaces. In the paper „Efficient Keyword-Based Search for Top-K Cells in Text Cube‟ methods used are inverted-index one-scan, document sorted-scan, bottom-up dynamic programming, and search-space ordering. In the top k cells, there is a searching of nearest key to the query. Cubes forms clusters of single unique group which shows its identity. Method like inverted index used for giving index rather than providing whole data which can be space consuming. 4. COMPARITIVE STUDY Table -1: Comparison of Depth first search and inverted index ParametersMethods Depth first search Inverted index Space More Less Time complexity O(n+m) O(n) Working Slow Efficient Methods used before is depth first search which required more space because it needs to travel through left and right trees respectively. Rather in inverted indexing there is less space required as there is use of indices. Next is time complexity, for DFS its O(n+m) ( n for edges and m for nodes). While in inverted index there is time required is O(n) that is for single index. Also DFS works slowly and inverted indexing is efficient to use.
  • 3. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 02 Issue: 12 | Dec-2013, Available @ http://www.ijret.org 561 CONCLUSIONS By studying above methods the main motto is searching a relevant keyword with meaningful information with minimum time with valid results. I can conclude with this there should be efficient method to identify the keyword, grouping those results by using any one of clustering methods. The major drawbacks in paper [1] are if two nodes have same keyword then how to differentiate the keyword with the searched one. In paper [2] there can be come up with indexing the cluster results will be more if cluster is dynamically growing, or we can form clusters of clusters. REFERENCES [1]. S. Agrawal S. Chaudhuri, and G. Das. Dbxplorer: A system for keyword-based search over relational databases. In Proc. Of International Conference on Data Engineering (ICDE), pages 5–16, 2002. [2]. N. Beckmann, H. Kriegel, R. Schneider, and B. Seeger. The R*-tree: An efficient and robust access method for points and rectangles. In Proc. of ACM Management of Data (SIGMOD), pages 322–331, 1990. [3]. G. Bhalotia, A. Hulgeri, C. Nakhe, S. Chakrabarti, and S. Sudarshan. Keyword searching and browsing in databases using banks. In Proc. of International Conference on Data Engineering (ICDE), pages 431– 440, 2002. [4]. X. Cao, L. Chen, G. Cong, C. S. Jensen, Q. Qu, A. Skovsgaard, D. Wu, and M. L. Yiu. Spatial keyword querying. In ER, pages 16–29, 2012. X. Cao, G. Cong, and C. S. Jensen. Retrieving top-k prestige- based relevant spatial web objects. PVLDB, 3(1):373–384, 2010. [5]. X. Cao, G. Cong, C. S. Jensen, and B. C. Ooi. Collective spatial keyword querying. In Proc. of ACM Management of Data (SIGMOD), pages 373– 384, 2011. [6]. B. Chazelle, J. Kilian, R. Rubinfeld, and A. Tal. The bloomier filter: an efficient data structure for static support lookup tables. In Proc. of the Annual ACM- SIAM Symposium on Discrete Algorithms (SODA), pages 30–39, 2004. [7]. Y.-Y. Chen, T. Suel, and A. Markowetz. Efficient query processing in geographic web search engines. In Proc. of ACM Management of Data (SIGMOD), pages 277–288, 2006. [8]. E. Chu, A. Baid, X. Chai, A. Doan, and J. Naughton. Combining keyword search and forms for ad hoc querying of databases. In Proc. of ACM Management of Data (SIGMOD), 2009. [9]. G. Cong, C. S. Jensen, and D. Wu. Efficient retrieval of the top-k most relevant spatial web objects. PVLDB, 2(1):337–348, 2009. [10]. C. Faloutsos and S. Christodoulakis. Signature files: An access method for documents and its analytical performance evaluation. ACM Transactions on Information Systems (TOIS), 2(4):267–288, 1984. [11]. I. D. Felipe, V. Hristidis, and N. Rishe. Keyword search on spatial databases. In Proc. of International Conference on Data Engineering (ICDE), pages 656– 665, 2008. [12]. R. Hariharan, B. Hore, C. Li, and S. Mehrotra. Processing spatialkeyword (SK) queries in geographic information retrieval (GIR) systems. In Proc. of Scientific and Statistical Database Management (SSDBM), 2007. [13]. G. R. Hjaltason and H. Samet. Distance browsing in spatial databases. ACM Transactions on Database Systems (TODS), 24(2):265–318, 1999. [14]. V. Hristidis and Y. Papakonstantinou. Discover: Keyword search in relational databases. In Proc. of Very Large Data Bases (VLDB), pages 670–681, 2002. [15]. I. Kamel and C. Faloutsos. Hilbert R-tree: An improved r-tree using fractals. In Proc. of Very Large Data Bases (VLDB), pages 500–509, 1994.