Ginix: Generalized Inverted Index for Keyword Search
ABSTRACT
Keyword search has become a ubiquitous method for users to access text data
in the face of information explosion. Inverted lists are usually used to index
underlying documents to retrieve documents according to a set of keywords
efficiently. Since inverted lists are usually large, many compression
techniques have been proposed to reduce the storage space and disk I/O time.
However, these techniques usually perform decompression operations on the
fly, which increases the CPU time. This paper presents a more efficient index
structure, the Generalized Inverted IndeX (Ginix), which merges consecutive
IDs in inverted lists into intervals to save storage space. With this index
structure, more efficient algorithms can be devised to perform basic keyword
search operations, i.e., the union and the intersection operations, by taking
advantage of intervals. Specifically, these algorithms do not require
conversions from interval lists back to ID lists. As a result, keyword search
using Ginix can be more efficient than keyword search using traditional inverted indexes.
The performance of Ginix is also improved by reordering the documents in
datasets using two scalable algorithms. Experiments on the performance and
scalability of Ginix on real datasets show that Ginix not only requires less
storage space, but also improves the keyword search performance, compared
with traditional inverted indexes.
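The core idea of merging consecutive IDs into intervals can be illustrated with a minimal Python sketch. The function name to_intervals and the tuple representation (low, high) are assumptions made for illustration; the source does not specify how Ginix stores intervals.

```python
def to_intervals(ids):
    """Merge a sorted list of distinct document IDs into closed intervals (low, high).

    Illustrative only: the (low, high) tuple layout is an assumption.
    """
    intervals = []
    for doc_id in ids:
        if intervals and doc_id == intervals[-1][1] + 1:
            # doc_id directly follows the last interval, so extend it
            intervals[-1] = (intervals[-1][0], doc_id)
        else:
            # gap found, start a new single-element interval
            intervals.append((doc_id, doc_id))
    return intervals


ids = [1, 2, 3, 5, 8, 9, 10, 11]
print(to_intervals(ids))  # [(1, 3), (5, 5), (8, 11)]
```

Here eight IDs shrink to three intervals; the saving grows with the length of the consecutive runs in each inverted list.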
SYSTEM ANALYSIS
Existing System:
Beyond asking for explicit user input, earlier work focused on handling
recency queries, which are queries that are after recent events or breaking
news. The time-sensitive approach processes a recency query by computing
traditional topic similarity scores for each document, and then “boosts” the
scores of the most recent documents, to privilege recent articles over older
ones. In contrast to traditional models, which assume a uniform prior
probability of relevance for each document d in a collection, Li and Croft
define the prior to be a function of document d’s creation date. The prior
probability decreases exponentially with time, and hence recent documents are
ranked higher than older documents. Li and Croft’s strategy is designed for
queries that are after recent documents, but it does not handle other types of
time-sensitive queries, such as [Madrid bombing] or [Google IPO], which
implicitly target one or more past time periods.
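An exponentially decaying prior of this kind can be sketched in Python as follows. The density form rate * exp(-rate * age), the default rate value, and the function names are assumptions chosen for illustration; the text above states only that the prior decreases exponentially with time.

```python
import math


def recency_prior(age_days, rate=0.01):
    """Exponentially decaying prior over document age (in days).

    Assumption: uses the exponential-distribution density rate * exp(-rate * age);
    the rate value 0.01 is arbitrary.
    """
    return rate * math.exp(-rate * age_days)


def boosted_score(topic_score, age_days, rate=0.01):
    """Combine a topic similarity score with the recency prior by multiplication."""
    return topic_score * recency_prior(age_days, rate)


# Two documents with the same topic score: the newer one ranks higher.
print(boosted_score(0.8, age_days=2) > boosted_score(0.8, age_days=400))  # True
```

Because the prior only ever favors newer documents, a query about a past event gains nothing from it, which is the limitation noted above.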
Proposed System:
Many compression techniques have been proposed to reduce the storage space and disk I/O
time. However, these techniques usually perform decompression operations on the fly, which
increases the CPU time. This paper presents a more efficient index structure, the Generalized
INverted IndeX (Ginix), which merges consecutive IDs in inverted lists into intervals to save
storage space. The problem of document reordering is equivalent to making similar
documents stay close to each other. Silvestri [5] proposed a simple method that sorts web
pages in lexicographical order based on their URLs as an acceptable solution to the problem.
This method is reasonable because URLs are usually good indicators of the web page
content. The performance of Ginix is also improved by reordering the documents in datasets
using two scalable algorithms. Experiments on the performance and scalability of Ginix on real
datasets show that Ginix not only requires less storage space, but also improves the keyword
search performance, compared with traditional inverted indexes.
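The effect of URL-based reordering on interval lists can be illustrated with a small Python sketch. The URLs, the helper names reorder_by_url and to_intervals, and the sample posting list are all hypothetical; the sketch shows only the general idea of sorting by URL, not the two scalable algorithms evaluated in the paper.

```python
def to_intervals(ids):
    """Merge sorted distinct IDs into closed intervals (low, high)."""
    intervals = []
    for doc_id in ids:
        if intervals and doc_id == intervals[-1][1] + 1:
            intervals[-1] = (intervals[-1][0], doc_id)
        else:
            intervals.append((doc_id, doc_id))
    return intervals


def reorder_by_url(urls):
    """Return a mapping old_id -> new_id that assigns IDs in lexicographic URL order."""
    order = sorted(range(len(urls)), key=lambda i: urls[i])
    return {old: new for new, old in enumerate(order)}


# Hypothetical collection: the list index is the original document ID.
urls = [
    "http://a.example/sports/1",
    "http://b.example/news/1",
    "http://a.example/sports/2",
    "http://b.example/news/2",
    "http://a.example/sports/3",
]
sports_docs = [0, 2, 4]  # posting list of a keyword that appears on the sports pages

mapping = reorder_by_url(urls)
new_ids = sorted(mapping[d] for d in sports_docs)

print(to_intervals(sports_docs))  # [(0, 0), (2, 2), (4, 4)]
print(to_intervals(new_ids))      # [(0, 2)]
```

After reordering, pages from the same site receive neighboring IDs, so the same posting list collapses from three intervals to one.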
Advantages:
1. Efficient algorithms are given to support basic operations on interval lists, such as union
and intersection without decompression.
2. The problem of enhancing the performance of Ginix by document reordering is investigated,
and two scalable and effective algorithms are proposed, one based on signature sorting and
one based on a greedy heuristic for the Traveling Salesman Problem (TSP) [3].
3. Extensive experiments that evaluate the performance of Ginix are conducted. Results show
that Ginix not only reduces the index size but also improves the search performance on real
datasets.
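To make the first advantage concrete, the following Python sketch computes the intersection and union of two interval lists directly, without expanding them back into ID lists. It is a simple merge-style illustration under the assumption that each list is sorted and holds disjoint, non-adjacent closed intervals; it is not claimed to be the specific algorithm given in the paper.

```python
import heapq


def intersect(a, b):
    """Intersect two sorted lists of disjoint closed intervals (low, high)."""
    result = []
    i = j = 0
    while i < len(a) and j < len(b):
        lo = max(a[i][0], b[j][0])
        hi = min(a[i][1], b[j][1])
        if lo <= hi:
            result.append((lo, hi))
        # advance the list whose current interval ends first
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return result


def union(a, b):
    """Union of two sorted interval lists, merging overlapping or adjacent intervals."""
    result = []
    for lo, hi in heapq.merge(a, b):
        if result and lo <= result[-1][1] + 1:
            result[-1] = (result[-1][0], max(result[-1][1], hi))
        else:
            result.append((lo, hi))
    return result


a = [(1, 3), (5, 8)]
b = [(2, 6), (10, 10)]
print(intersect(a, b))  # [(2, 3), (5, 6)]
print(union(a, b))      # [(1, 8), (10, 10)]
```

Both functions visit each interval once, so their cost depends on the number of intervals rather than the number of document IDs.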
Module Description:
1. Search over Blogs
2. Time interval feedback
3. Temporal relevance feedback (time-sensitive results)
4. Overall ranking document identification
5. Blogs Growth Charts.
Search over Blogs:
Time is an important dimension of relevance for a large number of searches, such as
those over blogs and news archives. So far, research on searching over such collections has
largely focused on retrieving topically similar documents for a query. Unfortunately,
ignoring or not fully exploiting the time dimension can be detrimental for a large family of
queries for which we should consider not only the topical relevance of the documents but
also their publication time.
Time Interval Feedback:
For a time-sensitive query over a news archive, our approach automatically identifies
important time intervals for the query. These intervals are then used to adjust the document
relevance scores by boosting the scores of documents published within the important
intervals. We have implemented our system on top of Indri, a state-of-the-art search engine
that combines language models and inference networks for retrieval, as well as over Lemur.
Our system provides a web interface for searching the Newsblaster
archive, an operational news archive and summarization system, and for experimenting with
variations of our approach.
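The boosting step can be sketched in Python as below. The function name boost_scores, the multiplicative boost factor of 2.0, and the sample data are assumptions for illustration only; the sketch does not use the Indri or Lemur APIs and does not show how the important intervals are identified.

```python
from datetime import date


def boost_scores(docs, intervals, boost=2.0):
    """Boost documents published inside any important interval, then rank.

    docs: list of (doc_id, score, published_date)
    intervals: list of (start_date, end_date), both ends inclusive
    boost: hypothetical multiplicative factor
    """
    boosted = []
    for doc_id, score, published in docs:
        in_interval = any(start <= published <= end for start, end in intervals)
        boosted.append((doc_id, score * boost if in_interval else score))
    # rank by adjusted score, highest first
    return sorted(boosted, key=lambda pair: pair[1], reverse=True)


docs = [
    ("a", 1.0, date(2004, 3, 12)),
    ("b", 1.5, date(2005, 1, 1)),
]
important = [(date(2004, 3, 11), date(2004, 3, 20))]
print(boost_scores(docs, important))  # [('a', 2.0), ('b', 1.5)]
```

Document "a" starts with the lower topical score but overtakes "b" because it was published inside the important interval.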
Temporal Relevance Feedback:
We discuss several techniques to estimate the temporal relevance of a day to a query at
hand. These estimation techniques use the temporal distribution of matching articles for the
query to compute the probability that a day in the archive has a relevant document for the
query.
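One very simple estimator of this kind uses the relative frequency of matching articles per day. The Python sketch below is an assumption made for illustration: the function name day_relevance and the unsmoothed relative-frequency estimate are not taken from the source, which mentions several techniques without detailing them here.

```python
from collections import Counter
from datetime import date


def day_relevance(matching_dates):
    """Estimate, for each day, the share of matching articles published on that day.

    Assumption: plain relative frequency with no smoothing; days without
    matching articles are simply absent from the result.
    """
    counts = Counter(matching_dates)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {day: n / total for day, n in counts.items()}


dates = [date(2004, 3, 11)] * 3 + [date(2004, 8, 19)]
print(day_relevance(dates)[date(2004, 3, 11)])  # 0.75
```

Days with bursts of matching articles receive higher estimates, which is what allows important time periods to surface for a query.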
Overall ranking document identification:
We integrate temporal relevance with state-of-the-art retrieval models, including a
query likelihood model, a relevance model, a probabilistic relevance model, and a query
expansion with pseudo relevance feedback model, to naturally process time-sensitive queries.
In these models, we combine topical relevance and temporal relevance to determine the
overall relevance of a document.
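A minimal Python sketch of one way to combine the two signals is shown below. Multiplying the topical and temporal probabilities (adding their logarithms) is an assumption made for illustration, as are the function name overall_score and the floor value; the source states only that the two kinds of relevance are combined.

```python
import math


def overall_score(topical_prob, temporal_prob, floor=1e-9):
    """Log of the product of topical and temporal relevance.

    Assumption: the two signals are combined as if independent; the floor
    avoids log(0) for documents from days with no estimated relevance.
    """
    return math.log(max(topical_prob, floor)) + math.log(max(temporal_prob, floor))


# Same topical relevance, different temporal relevance.
in_burst = overall_score(0.2, 0.5)
off_burst = overall_score(0.2, 0.1)
print(in_burst > off_burst)  # True
```

With equal topical relevance, the document published on a day with higher temporal relevance receives the higher overall score.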
Blogs Growth Charts:
The scalability of Ginix was evaluated using different numbers of records in the DBLP dataset.
Search time: Since the current algorithms take advantage of the intervals, search with
Ginix is nearly 2x faster than with InvIndex.
Algorithm:
SYSTEM SPECIFICATION
Hardware Requirements:
• System : Pentium IV 2.4 GHz.
• Hard Disk : 80 GB.
• Floppy Drive : 1.44 MB.
• Monitor : 15" VGA Colour.
• Mouse : Optical Mouse.
• RAM : 512 MB.
Software Requirements:
• Operating system : Windows 7 (32-bit).
• Coding Language : ASP.NET 4.0 with C#.
• Data Base : SQL Server 2008.

IEEEFINALYEARPROJECTS
 
Local directional number pattern for face analysis face and expression recogn...
Local directional number pattern for face analysis face and expression recogn...Local directional number pattern for face analysis face and expression recogn...
Local directional number pattern for face analysis face and expression recogn...
IEEEFINALYEARPROJECTS
 
An access point based fec mechanism for video transmission over wireless la ns
An access point based fec mechanism for video transmission over wireless la nsAn access point based fec mechanism for video transmission over wireless la ns
An access point based fec mechanism for video transmission over wireless la ns
IEEEFINALYEARPROJECTS
 
Towards differential query services in cost efficient clouds
Towards differential query services in cost efficient cloudsTowards differential query services in cost efficient clouds
Towards differential query services in cost efficient clouds
IEEEFINALYEARPROJECTS
 
Spoc a secure and privacy preserving opportunistic computing framework for mo...
Spoc a secure and privacy preserving opportunistic computing framework for mo...Spoc a secure and privacy preserving opportunistic computing framework for mo...
Spoc a secure and privacy preserving opportunistic computing framework for mo...
IEEEFINALYEARPROJECTS
 
Secure and efficient data transmission for cluster based wireless sensor netw...
Secure and efficient data transmission for cluster based wireless sensor netw...Secure and efficient data transmission for cluster based wireless sensor netw...
Secure and efficient data transmission for cluster based wireless sensor netw...
IEEEFINALYEARPROJECTS
 
Privacy preserving back propagation neural network learning over arbitrarily ...
Privacy preserving back propagation neural network learning over arbitrarily ...Privacy preserving back propagation neural network learning over arbitrarily ...
Privacy preserving back propagation neural network learning over arbitrarily ...
IEEEFINALYEARPROJECTS
 
Non cooperative location privacy
Non cooperative location privacyNon cooperative location privacy
Non cooperative location privacy
IEEEFINALYEARPROJECTS
 
Harnessing the cloud for securely outsourcing large
Harnessing the cloud for securely outsourcing largeHarnessing the cloud for securely outsourcing large
Harnessing the cloud for securely outsourcing large
IEEEFINALYEARPROJECTS
 
Geo community-based broadcasting for data dissemination in mobile social netw...
Geo community-based broadcasting for data dissemination in mobile social netw...Geo community-based broadcasting for data dissemination in mobile social netw...
Geo community-based broadcasting for data dissemination in mobile social netw...
IEEEFINALYEARPROJECTS
 
Enabling data dynamic and indirect mutual trust for cloud computing storage s...
Enabling data dynamic and indirect mutual trust for cloud computing storage s...Enabling data dynamic and indirect mutual trust for cloud computing storage s...
Enabling data dynamic and indirect mutual trust for cloud computing storage s...
IEEEFINALYEARPROJECTS
 
Dynamic resource allocation using virtual machines for cloud computing enviro...
Dynamic resource allocation using virtual machines for cloud computing enviro...Dynamic resource allocation using virtual machines for cloud computing enviro...
Dynamic resource allocation using virtual machines for cloud computing enviro...
IEEEFINALYEARPROJECTS
 
A secure protocol for spontaneous wireless ad hoc networks creation
A secure protocol for spontaneous wireless ad hoc networks creationA secure protocol for spontaneous wireless ad hoc networks creation
A secure protocol for spontaneous wireless ad hoc networks creation
IEEEFINALYEARPROJECTS
 
Utility privacy tradeoff in databases an information-theoretic approach
Utility privacy tradeoff in databases an information-theoretic approachUtility privacy tradeoff in databases an information-theoretic approach
Utility privacy tradeoff in databases an information-theoretic approach
IEEEFINALYEARPROJECTS
 
Two tales of privacy in online social networks
Two tales of privacy in online social networksTwo tales of privacy in online social networks
Two tales of privacy in online social networks
IEEEFINALYEARPROJECTS
 

More from IEEEFINALYEARPROJECTS (20)

Scalable face image retrieval using attribute enhanced sparse codewords
Scalable face image retrieval using attribute enhanced sparse codewordsScalable face image retrieval using attribute enhanced sparse codewords
Scalable face image retrieval using attribute enhanced sparse codewords
 
Scalable face image retrieval using attribute enhanced sparse codewords
Scalable face image retrieval using attribute enhanced sparse codewordsScalable face image retrieval using attribute enhanced sparse codewords
Scalable face image retrieval using attribute enhanced sparse codewords
 
Reversible watermarking based on invariant image classification and dynamic h...
Reversible watermarking based on invariant image classification and dynamic h...Reversible watermarking based on invariant image classification and dynamic h...
Reversible watermarking based on invariant image classification and dynamic h...
 
Reversible data hiding with optimal value transfer
Reversible data hiding with optimal value transferReversible data hiding with optimal value transfer
Reversible data hiding with optimal value transfer
 
Query adaptive image search with hash codes
Query adaptive image search with hash codesQuery adaptive image search with hash codes
Query adaptive image search with hash codes
 
Noise reduction based on partial reference, dual-tree complex wavelet transfo...
Noise reduction based on partial reference, dual-tree complex wavelet transfo...Noise reduction based on partial reference, dual-tree complex wavelet transfo...
Noise reduction based on partial reference, dual-tree complex wavelet transfo...
 
Local directional number pattern for face analysis face and expression recogn...
Local directional number pattern for face analysis face and expression recogn...Local directional number pattern for face analysis face and expression recogn...
Local directional number pattern for face analysis face and expression recogn...
 
An access point based fec mechanism for video transmission over wireless la ns
An access point based fec mechanism for video transmission over wireless la nsAn access point based fec mechanism for video transmission over wireless la ns
An access point based fec mechanism for video transmission over wireless la ns
 
Towards differential query services in cost efficient clouds
Towards differential query services in cost efficient cloudsTowards differential query services in cost efficient clouds
Towards differential query services in cost efficient clouds
 
Spoc a secure and privacy preserving opportunistic computing framework for mo...
Spoc a secure and privacy preserving opportunistic computing framework for mo...Spoc a secure and privacy preserving opportunistic computing framework for mo...
Spoc a secure and privacy preserving opportunistic computing framework for mo...
 
Secure and efficient data transmission for cluster based wireless sensor netw...
Secure and efficient data transmission for cluster based wireless sensor netw...Secure and efficient data transmission for cluster based wireless sensor netw...
Secure and efficient data transmission for cluster based wireless sensor netw...
 
Privacy preserving back propagation neural network learning over arbitrarily ...
Privacy preserving back propagation neural network learning over arbitrarily ...Privacy preserving back propagation neural network learning over arbitrarily ...
Privacy preserving back propagation neural network learning over arbitrarily ...
 
Non cooperative location privacy
Non cooperative location privacyNon cooperative location privacy
Non cooperative location privacy
 
Harnessing the cloud for securely outsourcing large
Harnessing the cloud for securely outsourcing largeHarnessing the cloud for securely outsourcing large
Harnessing the cloud for securely outsourcing large
 
Geo community-based broadcasting for data dissemination in mobile social netw...
Geo community-based broadcasting for data dissemination in mobile social netw...Geo community-based broadcasting for data dissemination in mobile social netw...
Geo community-based broadcasting for data dissemination in mobile social netw...
 
Enabling data dynamic and indirect mutual trust for cloud computing storage s...
Enabling data dynamic and indirect mutual trust for cloud computing storage s...Enabling data dynamic and indirect mutual trust for cloud computing storage s...
Enabling data dynamic and indirect mutual trust for cloud computing storage s...
 
Dynamic resource allocation using virtual machines for cloud computing enviro...
Dynamic resource allocation using virtual machines for cloud computing enviro...Dynamic resource allocation using virtual machines for cloud computing enviro...
Dynamic resource allocation using virtual machines for cloud computing enviro...
 
A secure protocol for spontaneous wireless ad hoc networks creation
A secure protocol for spontaneous wireless ad hoc networks creationA secure protocol for spontaneous wireless ad hoc networks creation
A secure protocol for spontaneous wireless ad hoc networks creation
 
Utility privacy tradeoff in databases an information-theoretic approach
Utility privacy tradeoff in databases an information-theoretic approachUtility privacy tradeoff in databases an information-theoretic approach
Utility privacy tradeoff in databases an information-theoretic approach
 
Two tales of privacy in online social networks
Two tales of privacy in online social networksTwo tales of privacy in online social networks
Two tales of privacy in online social networks
 

Recently uploaded

leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
flufftailshop
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 

Recently uploaded (20)

leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 

Ginix generalized inverted index for keyword search

  • 1. Ginix Generalized Inverted Index for Keyword Search

    ABSTRACT

    Keyword search has become a ubiquitous method for users to access text data in the face of information explosion. Inverted lists are usually used to index the underlying documents so that documents matching a set of keywords can be retrieved efficiently. Since inverted lists are usually large, many compression techniques have been proposed to reduce the storage space and disk I/O time. However, these techniques usually perform decompression operations on the fly, which increases the CPU time. This paper presents a more efficient index structure, the Generalized Inverted IndeX (Ginix), which merges consecutive IDs in inverted lists into intervals to save storage space. With this index structure, more efficient algorithms can be devised to perform the basic keyword search operations, i.e., union and intersection, by taking advantage of intervals. Specifically, these algorithms do not require conversions from interval lists back to ID lists. As a result, keyword search using Ginix can be more efficient than keyword search using traditional inverted indices. The performance of Ginix is further improved by reordering the documents in datasets using two scalable algorithms. Experiments on the performance and scalability of Ginix on real datasets show that Ginix not only requires less storage space, but also improves keyword search performance, compared with traditional inverted indexes.

  • 2. SYSTEM ANALYSIS

    Existing System:

    Beyond asking for explicit user input, earlier work focused on handling recency queries, which are queries that are after recent events or breaking news. The time-sensitive approach processes a recency query by computing traditional topic similarity scores for each document and then “boosting” the scores of the most recent documents, to privilege recent articles over older ones. In contrast to traditional models, which assume a uniform prior probability of relevance for each document d in a collection, this approach defines the prior as a function of document d’s creation date. The prior probability decreases exponentially with time, and hence recent documents are ranked higher than older documents. Li and Croft’s strategy is designed for queries that are after recent documents, but it does not handle other types of time-sensitive queries, such as [Madrid bombing] or [Google IPO], that implicitly target one or more past time periods.
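The recency boosting described under Existing System can be illustrated with a minimal sketch. It assumes the prior takes the form rate * exp(-rate * age), with the document's age measured in days; the `rate` value, the function names, and the log-space combination with a topical likelihood are illustrative assumptions, not details given in the slides.

```python
import math
from datetime import date

def recency_prior(created: date, today: date, rate: float = 0.01) -> float:
    """Exponentially decaying prior: rate * exp(-rate * age_in_days).

    `rate` is a hypothetical decay parameter chosen for illustration.
    """
    age_days = max((today - created).days, 0)
    return rate * math.exp(-rate * age_days)

def rank_by_recency(docs, today, rate=0.01):
    """Rank documents by log(topical likelihood) + log(recency prior).

    docs: list of (doc_id, topical_likelihood, created_date) tuples,
    where topical_likelihood must be greater than zero.
    """
    scored = [
        (doc_id, math.log(likelihood) + math.log(recency_prior(created, today, rate)))
        for doc_id, likelihood, created in docs
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

With equal topical likelihoods, the more recent document ranks first, which is exactly why this strategy cannot serve queries that target a past time period.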
  • 3. Proposed System:

    Many compression techniques have been proposed to reduce the storage space and disk I/O time. However, these techniques usually perform decompression operations on the fly, which increases the CPU time. This paper presents a more efficient index structure, the Generalized INverted IndeX (Ginix), which merges consecutive IDs in inverted lists into intervals to save storage space.

    The problem of document reordering is equivalent to making similar documents stay near each other. Silvestri [5] proposed a simple method that sorts web pages in lexicographical order based on their URLs as an acceptable solution to the problem. This method is reasonable because URLs are usually good indicators of web page content.

    The performance of Ginix is also improved by reordering the documents in datasets using two scalable algorithms. Experiments on the performance and scalability of Ginix on real datasets show that Ginix not only requires less storage space, but also improves keyword search performance, compared with traditional inverted indexes.

    Advantages:

    1. Efficient algorithms are given to support basic operations on interval lists, such as union and intersection, without decompression.
    2. The problem of enhancing the performance of Ginix by document reordering is investigated, and two scalable and effective algorithms are proposed, based on signature sorting and on a greedy heuristic for the Traveling Salesman Problem (TSP) [3].
    3. Extensive experiments that evaluate the performance of Ginix are conducted. Results show that Ginix not only reduces the index size but also improves the search performance on real datasets.
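The core idea of Ginix, merging consecutive IDs into intervals and then computing union and intersection on the intervals directly, can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's actual algorithms: intervals are inclusive `(lower, upper)` pairs, both input lists are sorted and non-overlapping, and the function names are hypothetical.

```python
import heapq
from typing import Iterable, List, Tuple

Interval = Tuple[int, int]  # inclusive (lower, upper) bounds

def to_intervals(ids: Iterable[int]) -> List[Interval]:
    """Merge runs of consecutive document IDs into inclusive intervals."""
    result: List[Interval] = []
    for doc_id in sorted(set(ids)):
        if result and doc_id == result[-1][1] + 1:
            result[-1] = (result[-1][0], doc_id)  # extend the current run
        else:
            result.append((doc_id, doc_id))       # start a new run
    return result

def union(a: List[Interval], b: List[Interval]) -> List[Interval]:
    """Union of two sorted interval lists without expanding them to IDs."""
    result: List[Interval] = []
    for lo, hi in heapq.merge(a, b):
        # IDs are integers, so overlapping or adjacent intervals are merged.
        if result and lo <= result[-1][1] + 1:
            result[-1] = (result[-1][0], max(result[-1][1], hi))
        else:
            result.append((lo, hi))
    return result

def intersection(a: List[Interval], b: List[Interval]) -> List[Interval]:
    """Intersection of two sorted interval lists without expanding them to IDs."""
    result: List[Interval] = []
    i = j = 0
    while i < len(a) and j < len(b):
        lo = max(a[i][0], b[j][0])
        hi = min(a[i][1], b[j][1])
        if lo <= hi:
            result.append((lo, hi))
        # Advance whichever list's current interval ends first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return result
```

For example, the ID list 1, 2, 3, 5, 6, 9 is stored as three intervals instead of six IDs. The saving grows when similar documents receive nearby IDs, which is the motivation for the document reordering step.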
  • 4. Module Description:

    1. Search over Blogs
    2. Time Interval Feedback
    3. Temporal Relevance Feedback (time-sensitive results)
    4. Overall Ranking Document Identification
    5. Blogs Growth Charts

    Search over Blogs:

    A large number of searches are run over collections such as blogs and news archives. So far, research on searching over such collections has largely focused on retrieving topically similar documents for a query. Unfortunately, ignoring or not fully exploiting the time dimension can be detrimental for a large family of queries, for which we should consider not only the topical relevance of a document but also its temporal relevance.

    Time Interval Feedback:

    Given a time-sensitive query over a news archive, our approach automatically identifies important time intervals for the query. These intervals are then used to adjust the document relevance scores by boosting the scores of documents published within the important intervals. We have implemented our system on top of Indri, a state-of-the-art search engine that combines language models and inference networks for retrieval, as well as over Lemur. Our system provides a web interface for searching the Newsblaster archive, an operational news archive and summarization system, and for experimenting with variations of our approach.
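The time interval feedback step can be illustrated with a small sketch. The slides do not specify how important intervals are detected, so the rule used here (a day is "hot" when its number of matching articles is at least `min_ratio` times the mean daily count) and the multiplicative `factor` are hypothetical choices made only for illustration.

```python
from collections import Counter
from datetime import date, timedelta

def important_intervals(pub_dates, min_ratio=2.0):
    """Group unusually busy days into (start, end) date intervals.

    pub_dates: publication dates of the articles matching the query.
    A day is hot if its match count is >= min_ratio * mean count over
    the days that have at least one match (an assumed heuristic).
    """
    counts = Counter(pub_dates)
    if not counts:
        return []
    mean = sum(counts.values()) / len(counts)
    hot_days = sorted(d for d, c in counts.items() if c >= min_ratio * mean)
    intervals = []
    for day in hot_days:
        if intervals and day - intervals[-1][1] == timedelta(days=1):
            intervals[-1] = (intervals[-1][0], day)  # extend the interval
        else:
            intervals.append((day, day))             # start a new interval
    return intervals

def boost(score, published, intervals, factor=1.5):
    """Multiply the score of a document published inside an important interval."""
    inside = any(start <= published <= end for start, end in intervals)
    return score * factor if inside else score
```

Consecutive hot days collapse into a single interval, so a burst of coverage spanning several days is treated as one important period for the query.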
  • 5. Temporal Relevance Feedback:

    We discuss several techniques to estimate the temporal relevance of a day to a query at hand. These estimation techniques use the temporal distribution of matching articles for the query to compute the probability that a day in the archive has a relevant document for the query.

    Overall Ranking Document Identification:

    We integrate temporal relevance with state-of-the-art retrieval models, including a query likelihood model, a relevance model, a probabilistic relevance model, and a query expansion model with pseudo-relevance feedback, to naturally process time-sensitive queries. In these models, we combine topical relevance and temporal relevance to determine the overall relevance of a document.

    Blogs Growth Charts:

    The scalability of Ginix was evaluated using different numbers of records in the DBLP dataset.

    Search time:

    Because the current algorithms take advantage of the intervals, search with Ginix is nearly 2x faster than with InvIndex.
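A minimal sketch of the two steps above: estimating the temporal relevance of each day from the distribution of matching articles, and combining it with topical relevance. The additive smoothing and the log-space sum are assumptions chosen for illustration; the slides do not state which estimators or combination rule are used.

```python
import math
from collections import Counter
from datetime import date

def day_relevance(pub_dates, archive_days, smoothing=1.0):
    """Estimate P(day | query) over the archive with additive smoothing.

    pub_dates: publication dates of matching articles; each date is
    assumed to be one of archive_days, so the estimates sum to 1.
    """
    counts = Counter(pub_dates)
    total = len(pub_dates) + smoothing * len(archive_days)
    return {day: (counts[day] + smoothing) / total for day in archive_days}

def overall_score(topical_prob, published, day_probs):
    """Combine topical and temporal relevance as a sum of log probabilities."""
    return math.log(topical_prob) + math.log(day_probs[published])
```

Smoothing keeps every day's probability above zero, so a topically strong document from a quiet day is ranked lower rather than excluded.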
  • 6. Algorithm:

    SYSTEM SPECIFICATION

    Hardware Requirements:

    • System : Pentium IV 2.4 GHz
    • Hard Disk : 80 GB
    • Floppy Drive : 1.44 MB

  • 7.
    • Monitor : 15" VGA Colour
    • Mouse : Optical Mouse
    • RAM : 512 MB

    Software Requirements:

    • Operating System : Windows 7 32-bit
    • Coding Language : ASP.Net 4.0 with C#
    • Database : SQL Server 2008