SlideShare a Scribd company logo
1 of 5
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 170
On-AIR based Information Retrieval System for Semi-Structure Data
Srujan K1, Radha K2
1III.BTECH-CSE-Rudraram, GITAM UNIVERSITY Hyderabad, Telanagana, India
2Asst Professor, CSE, Rudraram, GITAM UNIVERSITY, Hyderabad, Telangana, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract – Retrieving of information has been the
fundamental process to any storage file. With advent of
technology, it is possible to store a large amount of data and
retrieving them has become tedious task for theincorporators
and users. Information retrieval systems emerged from the
demand minimize of human resources required in the finding
of needed information to accomplish a task. This paper talks
about On-AIR architecture for retrieving data from video
containing databases using Information Retrieval Systems
,Information Retrieval Systems Capabilities like search and
browse capabilities, FunctionalOverviewofIRSandAutomatic
Indexing that is performed on data items. This paper presents
the overview of an Information retrieval system, Information
Retrieval Systems Capabilities, FunctionalOverviewofIRSand
Automatic Indexing.
Key Words: On-AIR, Automatic Indexing, IRS, Precision,
Recall
1. INTRODUCTION
An Information Retrieval System is basically used to store,
maintain and retrieve the information. Even though the
information is diverse, the text aspect is the only data type
that has been undergoing full functional processing[1].New
Techniques are being invented to search the other
multimedia data types like ExCaliber’sVisual Retrieval Ware
which uses natural language processing and semantic
networks for processing the information [13].
1.1. OBJECTIVES AND PERFORMANCE DMEASURES FOR
INFORMATION RETRIEVAL SYSTEM
The main objective of IRS would be to reduce its overhead
i.e., the time taken from leading to reading the required
information. Distinguishing between a relevant and
irrelevant can be the challenging aspect of IRS which might
be due to problems like synonymsandpolysemy[4], whichis
a word with multiplemeanings.Toevaluatetheperformance
of search engines, measures like precision, recall and
response time can be taken into consideration. Precision is
the ratio between the number of relevant documents
retrieved through a query to the total number of documents
retrieved with help of a query. Recall istheratiobetween the
numbers of documents retrieved to the total number of
documents that are relevant with respect to a query.
Response time is the elapsed time gap after the submission
of a query and presentation of documents retrieved by the
software [4].
Precision =Number of Relevant documents retrieved
Total number of documents retrieved
Recall = Number of documents retrieved
Total number of documents that are retrieved
Precision can be optimized by retrieving a certainlyrelevant
document and recall by retrieving all documents in the
database. Therefore, F-square combines both precision and
recall and results in the harmonic mean of precision and
recall[2].
Example 2.1:
Assume the following:
• A database contains 100 records
regarding a topic
• 50 were recorded as a result of search.
• Of the 50 records retrieved, 45 were
relevant.
Using the designations above:
• X =The number of relevant records retrieved
• Y =The number of relevant records not retrieved,
• Z =The number of irrelevant records retrieved.
In this example X = 45,Y = 65 (100-45) and Z = 5 (50-45).
Recall = (45 / (45 + 65)) * 100% => 45/110 * 100% = 40.909%
Precision = (45 / (45 + 5)) * 100% => 45/50 * 100% = 90%
2. INFORMATION RETRIEVAL SYSTEMS
FUNCTIONALITY
Information retrieval and storage system basically contains
of four different functional processes, Item normalization,
Selective Dissemination, Document Database data search,
Index Database search[1].
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 171
Fig-1. Functional Overview of Information Retrieval
Systems
2.1. Item Normalization
This is the first step before storing in the database. All the
information in different format is converted into a single
standard format to avoid ambiguity between information
and to provide logical restructuring. The process of
transforming multiple formats into a single data structure
that can be easily processed is termedasitem normalization.
This process includes identification of process tokens,
characterization of tokens, Stemming of tokens, and then
transforming into searchable data structure.
2.2. Selective Dissemination
Item is processed with help of profile. It gives the capability
to compare newly found items with the interested
statements and results the documents matching the interest
dynamically. If there is a profile match, the item is placed
mail file with the profile after processing against every user
profile.
2.3. Document Database Search
Query is processed along with items which are prior
processed. The items are searched in Document Database
and can take longer time periods.
2.4. Index Database Search
Saving a file for any future references, this is done through
indexing process.
3. RETRIEVING DIGITAL VIDEO DATABASES USING ON-
AIR
Ontologies are used in Information retrieval system to
search for a related term and improve precision and recall.
Ontology is developed priory defining the terms and their
corresponding relations. Ontology aided information
retrieval is used to retrieve data in digital video databases
using domain ontology .It consists of mainly two processes
indexing and retrieval.
3.1. Indexing process
Indexing process is responsible for creatingsearchabledata
structure. Initially this process consists of Video collection
and Domain ontology. Video collection consists of clips of
videos of a long video assigned with keywords, set of images
and transcription of speech. Domain Ontology contains the
information for query expansion. Indexing process is
performed by registering the video clips, the ontology and
establishing the configuration values.
This process produces three outputs
 XML Configuration file: Contains the information
about the associations betweenclips,transcriptions,
keywords, resources etc.
 Inverted Index: An Inverted Index data structure is
used to store the frequencies of word occurrences
along the word itself. It also stores weights of terms
to contribute to precision and recall .Stemming is
performed and stop words are excluded.
Fig-2. Diagram representing a Inverted file structure.
Figure-2 represents how tokens are stored in inverted data
structure for the document 1 containing “I did enact Julius
Caesar: I was killed i’ the capitol; Brutus killed me” and
document 2 containing “So let it be with Caesar. The noble
Brutus hath told you Caesar was ambitious”[5]. Initially the
frequency of each term is tabulated and converted into
inverted data structure containing posting lists .The
following algorithm is a prototype of generating index file
[6]. In the following algorithm, initially the frequencies of
each word in the document is calculated and weights are
calculated using Term Frequency-Inverse Document
Frequency before stemming is performed.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 172
Fig-3. Algorithm representing Creation of index in Index
File.
 Ontology Data Structure: This helps in enhancing
processing time. All the information is represented
in a graph with the help of classes, sub-classes,
instances, relationships etc.
 Retrieval process:
Using the information obtained in inverted file and
ontology data structure, Visualizer is used to displaythe
relevant information in the form of list. The process
performed by a Visualizer is shown in Figure-4.
Fig-4.Retrieval Process
Pre-Process: In this stage, the misspelling detection is
applied and also stop words are removed on encounter.
Stemming process is applied and weightsare calculated.
Query-Expansion: The degree of similarity is computed in
Query Expansion process[7].
Retrieval: In this stage, the query is compared to video clip
contents and information of maximum importance is
returned. Vector space model is used for retrieval[9].
Video player: This software returns all the relevant videos
and the user can choose upon his need. Searching for the
information in long-duration videos is time consuming
process. Currently research is going on in Ontologies which
is used to retrieve the information from the web. Through
the digital videos we can retrieve the information efficiently
[14]. Most of the organizations are producing the massive
volumes of data in a semi-structured format only that is in
PDF formats, Reports and Tabular extraction of data. To
enable the SQL-Like query and analysis of financial tables
from annual reports in pdf format, deep learning based
technique is used. For the nearest row match table type
classification can be done. To queryandanalysethefinancial
tables in pdf documents from distinct sources we will use
text-match based approach. Similarity measure for query
framework returns the similar rows andassociatedcolumns
via different documents for given table row as a query.
Queries can be given in two ways such as Querying table and
querying by row/column. Query table permits the users to
select a table then similar tables are returned and finally
these queries can be refined using the rangeoperators.Deep
learning architecturefortableclassificationhasbeendivided
into Data collection and pre-processing. Tokenization and
Classification. A technique and prototypeimplementationis
presented for deep-learning based table processingpipeline
architecture. It provides access to similar table data across
various PDF files [15].
Example of IR Process
Consider a query specified by the user.
“How do you make money online??”
Initially, stop words “how”, ’do”, “make” are separated and
“money” and “online” are given weights according to the
ontology defined. Depending upon the ontology therelevant
data from the inverted index is retrieved and displayed.
4. INFORMATION RETRIEVAL SYSTEM CAPABILITIES
4.1. Search Capabilities:
The Searching capabilities include mapping the user
required query to the information present in database.
“Weighting” has been used to rank and procure information
in commercial systems[7].
Many functions are been defined to determine the
relationships between search statements
 Boolean Logic: Multiple concepts can be logically
related by using Boolean logic. The Boolean
operators like AND, OR, NOT are used to logically
compute intersection, union and negation
respectively.
 Proximity: Proximity is used to restrict the distance
between two search terms within a query. When a
certain proximity parameter is crossed between
two search items, they are ignored.
 Contiguous word phrase: When two or more units
are considered as single unit then they are called as
contiguous word phrase. ”United Arab Emirates” is
a contiguous word phrase.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 173
 Term Masking :
Term Masking is the process of masking a certain
portion of term and matching it with the unmasked
portion of the term. Stemming can be used as the
size is huge .It can be either fixed length masking or
variable length masking.
 Concept/Thesaurus expansion:
Thesaurus function is used to group a term along
with its term which has a similar meaning. Concept
tree is used to group similar items in the form of
tree structure [1].Figure-5 depicts concept
expansion of Orchestra, In Orchestra different kind
of instruments are used like woodwinds, brass,
percussion, strings and keyboard. Further
woodwinds, brass are subdivided into their
respective instruments [6].
Figure-5:Concept Expansion
4.2. Browsing Capabilities:
The jargon associated with browsing are ranking, zoning,
highlighting. Ranking is the process where different tokens
of the query are provided with values on their relevance.
Zoning is the process of dividing the standardizedinputinto
meaningful divisions. Highlighting is usedtograbtheuser’s
attention to particular parts.
5. INDEXING PROCESS IN INFORMATION RETRIEVAL
SYSTEMS
The indexing process consists of three basic steps: defining
the data source, transforming document contenttogenerate
a logical view, and building an index of the text on the logical
view[10] .In order, to retrieve information,thereceiveditem
must be converted into a data structurethatcanbesearched.
Indexing process can be automatic or manual. Search can be
either direct search in which information in present in a
simple document or it can be indirect i.e., with the help of
index files. Some systems instead of creating data structure,
the item is completely transformed into a different
representation which is used as searchable data structure.
5.1. Automatic Indexing
In automatic indexing process, system determines the index
terms and assigns them automatically. Automatic indexing
can be complicated when the determining the limited
number of index terms from the topic compared to manual
indexing [1]. Automatic indexing would result in twoclasses
of indexes. They are weighted and un-weighted. In weighted
systems, a value is placed on index term and its related
concept in the document. The weight is dependent on the
function which is responsible for counting the frequency of
occurring in each item. In case of un-weighted systems, the
queries are dependent on Boolean logic and occurrences in
hit file which is considered as value. Techniques of
Automatic Indexing:
5.2. Indexing by term:
When the basis of indexing is the term that belongs to the
original item, the index can be created by two techniques.
5.3. Statistical Techniques:
Statistical techniques can be models like vector models,
probabilistic model or Bayesian model[1].
All these models are based on statistical information like
frequency and their distribution. Weights are calculated
considering frequency and distributionsinthedatabase.The
weighted systems are also called as Vectorized information
systems. In vectorized information system, the weights are
stored in the form of vectors. All the queries can be
converted into vector forms and the distancebetween query
vector and information present in data base vector is
calculated. In probabilistic model, Bayesian product is
calculated with the help of Bayesian network. Bayesian
network can be considered as an acyclic graph that is
directed where the random variables are definedintheform
of nodes and the probabilistic dependency among the node
and its parents are denoted by the arcs between the nodes.
Figure-6: Two level Bayesian Network
Natural language processing is another method of defining
indexes to terms. The DR LINK systems process the items at
different levels like morphological, semantic, lexical,
syntactic and discourse. DR LINK stands for document
retrieval throughlinguisticknowledge.Eachlevel inDRLINK
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 174
uses the information of previous level to perform analysisof
the present level.
5.4. Indexing by Concept
There are numerous ways of expressing an idea outofwhich
only one can give the enhanced retrieval performance;thisis
the basis of Indexing by Concept. The Concept indexing will
determine a collection of concepts. The concepts are
determined based on the test set terms and these tests are
used as basis for indexing the items.
6. CONCLUSION
Information retrieval system has the high potential to
revolutionize the present and conventional storing and
retrieving techniques. Ten years ago, the algorithms
developed were restricted in scope allowingtheoreticiansto
limit their focus to very specific areas. But the insertion of
technology in systems like Internet has changed the way
problems were bounded.Hence,efficientalgorithmsmustbe
constructed in order to handle enormous data and minimal
user search statement informationmust beconsideredalong
with optimal usage of functional aspects of Information
Retrieval system.
REFERENCES
1. Gerald J Kowalski, Mark T Maybury: Information Storage
and Retrieval Systems, Theory and Implementation.
2. Paz-Trillo, C., Wassermann, R., and Braga, P: An
Information Retrieval Application using Ontologies (2005).
3. R. Baeza-Yates and B. Ribeiro-Neto.Modern Information
Retrieval. Addison Wesley Longman, 1999.
4. M. Mauldin. Conceptual Information Retrieval: A case
study in adaptive partial parsing. Kluwer Academic
Publishers, 1991.
5. Stanford Home Page: https://nlp.stanford.edu/IR-
book/html/htmledition/a-first-take-at-building-an-inverted-
index-1.html
6.ResearcherGate:https://www.researchgate.net/figure/CY
C-assisted-concept-enpansion-tree-An-example-of-
demonstrating-concept-expansion-is_fig1_257728125
7. Information Retrieval: search process, techniques and
strategies KiranPrakashBachchhav Librarian,
SarvajanikShikshanSanstha’sAdv.V.B.Deshpande College of
Commerce, Mulund (West), Mumbai- 400080
8.Berkeley.edu:http://people.ischool.berkeley.edu/~buckla
nd/papers/analysis/node1.html
9. M. Smith, C. Welthy, and D. McGuiness.OWL Web Ontology
Language Guide. Technical report, World Wide Web
Consortium, 2004.http://www.w3.org/TR/2004/REC-owl-
guide-20040210/
10. Indexing:https://www.springer.com
11. J.F. Sequeda, D.P. Miranker, A pay-as-you-go
methodology for ontology-based data access, IEEE Internet
Comput. 21 (2) (2017) 92–96.
12. M. Rodriguez-Muro, R. Kontchakov, M. Zakharyaschev,
Ontology-based data access: ontop of databases, in:
International Semantic Web Conference, Springer, 2013, pp.
558–573.
13. Vise, David A. (2004-12-03). "Agencies Find What
They're Looking For".The WashingtonPost.Retrieved 2010-
05-22.
14. Christian Paz-Trillo, et al,”Using ontologies to retrieve
video information”.
15. Rahul Anand, et al, “Integrating and querying similar
tables from PDF documents using deep learning”, 15 jan
2019.
BIOGRAPHIES
First Author: K. Srujan Kumar,currentlypursuingIIIB.Tech,
CSE at GITAM UNIVERSITY, HYDERABAD. My Research
areas are Machine Learning, Predictive Analytics.
Second Author: K Radha working as an Asst Professor at
GITAM University, Hyderabad. She has Completed MTech
(CSE) at JNTUH, Pursuing PhD at KL University, Vijayawada.
She has 12 years of Teaching Experience and 1 Year
Industrial Experience. Shehaspublishednumerousresearch
papers and presented at various conferences. She is a
Member of IAENG.

More Related Content

What's hot

International journal of_software_engine
International journal of_software_engineInternational journal of_software_engine
International journal of_software_engineronan messi
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET Journal
 
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENTQUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENTcsandit
 
A New Mechanism for Service Recovery Technology by using Recovering Service’s...
A New Mechanism for Service Recovery Technology by using Recovering Service’s...A New Mechanism for Service Recovery Technology by using Recovering Service’s...
A New Mechanism for Service Recovery Technology by using Recovering Service’s...ijfcstjournal
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachAIRCC Publishing Corporation
 
Query optimization in oodbms identifying subquery for query management
Query optimization in oodbms identifying subquery for query managementQuery optimization in oodbms identifying subquery for query management
Query optimization in oodbms identifying subquery for query managementijdms
 
D0373024030
D0373024030D0373024030
D0373024030theijes
 
Data Mining for XML Query-Answering Support
Data Mining for XML Query-Answering SupportData Mining for XML Query-Answering Support
Data Mining for XML Query-Answering SupportIOSR Journals
 
IRJET- Proficient Recovery Over Records using Encryption in Cloud Computing
IRJET- Proficient Recovery Over Records using Encryption in Cloud ComputingIRJET- Proficient Recovery Over Records using Encryption in Cloud Computing
IRJET- Proficient Recovery Over Records using Encryption in Cloud ComputingIRJET Journal
 
Anomalous symmetry succession for seek out
Anomalous symmetry succession for seek outAnomalous symmetry succession for seek out
Anomalous symmetry succession for seek outiaemedu
 
A BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATION
A BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATIONA BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATION
A BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATIONcscpconf
 
Automatic document clustering
Automatic document clusteringAutomatic document clustering
Automatic document clusteringIAEME Publication
 
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUPEVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUPijdms
 
IRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning AlgorithmsIRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning AlgorithmsIRJET Journal
 
Fast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data AnalysisFast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data AnalysisIRJET Journal
 
Comparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML DocumentComparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML DocumentIJAEMSJORNAL
 
New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...IJDKP
 

What's hot (20)

International journal of_software_engine
International journal of_software_engineInternational journal of_software_engine
International journal of_software_engine
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
 
F1803013034
F1803013034F1803013034
F1803013034
 
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENTQUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
 
A New Mechanism for Service Recovery Technology by using Recovering Service’s...
A New Mechanism for Service Recovery Technology by using Recovering Service’s...A New Mechanism for Service Recovery Technology by using Recovering Service’s...
A New Mechanism for Service Recovery Technology by using Recovering Service’s...
 
O1803017981
O1803017981O1803017981
O1803017981
 
Bq36404412
Bq36404412Bq36404412
Bq36404412
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis Approach
 
Query optimization in oodbms identifying subquery for query management
Query optimization in oodbms identifying subquery for query managementQuery optimization in oodbms identifying subquery for query management
Query optimization in oodbms identifying subquery for query management
 
D0373024030
D0373024030D0373024030
D0373024030
 
Data Mining for XML Query-Answering Support
Data Mining for XML Query-Answering SupportData Mining for XML Query-Answering Support
Data Mining for XML Query-Answering Support
 
IRJET- Proficient Recovery Over Records using Encryption in Cloud Computing
IRJET- Proficient Recovery Over Records using Encryption in Cloud ComputingIRJET- Proficient Recovery Over Records using Encryption in Cloud Computing
IRJET- Proficient Recovery Over Records using Encryption in Cloud Computing
 
Anomalous symmetry succession for seek out
Anomalous symmetry succession for seek outAnomalous symmetry succession for seek out
Anomalous symmetry succession for seek out
 
A BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATION
A BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATIONA BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATION
A BRIEF REVIEW ALONG WITH A NEW PROPOSED APPROACH OF DATA DE DUPLICATION
 
Automatic document clustering
Automatic document clusteringAutomatic document clustering
Automatic document clustering
 
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUPEVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
 
IRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning AlgorithmsIRJET- Intelligence Extraction using Various Machine Learning Algorithms
IRJET- Intelligence Extraction using Various Machine Learning Algorithms
 
Fast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data AnalysisFast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data Analysis
 
Comparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML DocumentComparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML Document
 
New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...New proximity estimate for incremental update of non uniformly distributed cl...
New proximity estimate for incremental update of non uniformly distributed cl...
 

Similar to IRJET- On-AIR Based Information Retrieval System for Semi-Structure Data

Multikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive GraphsMultikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive GraphsIRJET Journal
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMIRJET Journal
 
IRJET- Analysis of using Software Defined and Service Coherence Approach
IRJET- Analysis of using Software Defined and Service Coherence ApproachIRJET- Analysis of using Software Defined and Service Coherence Approach
IRJET- Analysis of using Software Defined and Service Coherence ApproachIRJET Journal
 
IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...IRJET Journal
 
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...IRJET Journal
 
IRJET- An Efficient Solitude Securing Ranked Keyword Search Technique
IRJET- An Efficient Solitude Securing Ranked Keyword Search TechniqueIRJET- An Efficient Solitude Securing Ranked Keyword Search Technique
IRJET- An Efficient Solitude Securing Ranked Keyword Search TechniqueIRJET Journal
 
BLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEM
BLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEMBLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEM
BLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEMIRJET Journal
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonIRJET Journal
 
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...IRJET Journal
 
IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)
IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)
IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)IRJET Journal
 
IRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction FrameworkIRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction FrameworkIRJET Journal
 
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...IRJET Journal
 
A Review on Software Mining: Current Trends and Methodologies
A Review on Software Mining: Current Trends and MethodologiesA Review on Software Mining: Current Trends and Methodologies
A Review on Software Mining: Current Trends and MethodologiesIJERA Editor
 
Content Migration -FileNet Image Service to P8
Content Migration -FileNet Image Service to P8Content Migration -FileNet Image Service to P8
Content Migration -FileNet Image Service to P8IRJET Journal
 
IRJET- Automatic Database Schema Generator
IRJET- Automatic Database Schema GeneratorIRJET- Automatic Database Schema Generator
IRJET- Automatic Database Schema GeneratorIRJET Journal
 
A Novel Additive Order Protocol in Cloud Storage and Avoiding the Trapdoors
A Novel Additive Order Protocol in Cloud Storage and Avoiding the TrapdoorsA Novel Additive Order Protocol in Cloud Storage and Avoiding the Trapdoors
A Novel Additive Order Protocol in Cloud Storage and Avoiding the TrapdoorsIRJET Journal
 
Reviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clusteringReviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clusteringIRJET Journal
 
Paper id 28201425
Paper id 28201425Paper id 28201425
Paper id 28201425IJRAT
 
IRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data VisualizationIRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data VisualizationIRJET Journal
 
Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...IRJET Journal
 

Similar to IRJET- On-AIR Based Information Retrieval System for Semi-Structure Data (20)

Multikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive GraphsMultikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive Graphs
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
 
IRJET- Analysis of using Software Defined and Service Coherence Approach
IRJET- Analysis of using Software Defined and Service Coherence ApproachIRJET- Analysis of using Software Defined and Service Coherence Approach
IRJET- Analysis of using Software Defined and Service Coherence Approach
 
IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...
 
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
 
IRJET- An Efficient Solitude Securing Ranked Keyword Search Technique
IRJET- An Efficient Solitude Securing Ranked Keyword Search TechniqueIRJET- An Efficient Solitude Securing Ranked Keyword Search Technique
IRJET- An Efficient Solitude Securing Ranked Keyword Search Technique
 
BLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEM
BLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEMBLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEM
BLOCKCHAIN IMPLEMENTATION IN EDUCATIONAL SYSTEM
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
 
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
 
IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)
IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)
IRJET- Highlighting Document Streams using Multi-Regional Input Output (MRIO)
 
IRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction FrameworkIRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction Framework
 
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
 
A Review on Software Mining: Current Trends and Methodologies
A Review on Software Mining: Current Trends and MethodologiesA Review on Software Mining: Current Trends and Methodologies
A Review on Software Mining: Current Trends and Methodologies
 
Content Migration -FileNet Image Service to P8
Content Migration -FileNet Image Service to P8Content Migration -FileNet Image Service to P8
Content Migration -FileNet Image Service to P8
 
IRJET- Automatic Database Schema Generator
IRJET- Automatic Database Schema GeneratorIRJET- Automatic Database Schema Generator
IRJET- Automatic Database Schema Generator
 
A Novel Additive Order Protocol in Cloud Storage and Avoiding the Trapdoors
A Novel Additive Order Protocol in Cloud Storage and Avoiding the TrapdoorsA Novel Additive Order Protocol in Cloud Storage and Avoiding the Trapdoors
A Novel Additive Order Protocol in Cloud Storage and Avoiding the Trapdoors
 
Reviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clusteringReviews on swarm intelligence algorithms for text document clustering
Reviews on swarm intelligence algorithms for text document clustering
 
Paper id 28201425
Paper id 28201425Paper id 28201425
Paper id 28201425
 
IRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data VisualizationIRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data Visualization
 
Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 

Recently uploaded (20)

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 

IRJET- On-AIR Based Information Retrieval System for Semi-Structure Data

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 170 On-AIR based Information Retrieval System for Semi-Structure Data Srujan K1, Radha K2 1III.BTECH-CSE-Rudraram, GITAM UNIVERSITY Hyderabad, Telanagana, India 2Asst Professor, CSE, Rudraram, GITAM UNIVERSITY, Hyderabad, Telangana, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract – Retrieving of information has been the fundamental process to any storage file. With advent of technology, it is possible to store a large amount of data and retrieving them has become tedious task for theincorporators and users. Information retrieval systems emerged from the demand minimize of human resources required in the finding of needed information to accomplish a task. This paper talks about On-AIR architecture for retrieving data from video containing databases using Information Retrieval Systems ,Information Retrieval Systems Capabilities like search and browse capabilities, FunctionalOverviewofIRSandAutomatic Indexing that is performed on data items. This paper presents the overview of an Information retrieval system, Information Retrieval Systems Capabilities, FunctionalOverviewofIRSand Automatic Indexing. Key Words: On-AIR, Automatic Indexing, IRS, Precision, Recall 1. INTRODUCTION An Information Retrieval System is basically used to store, maintain and retrieve the information. Even though the information is diverse, the text aspect is the only data type that has been undergoing full functional processing[1].New Techniques are being invented to search the other multimedia data types like ExCaliber’sVisual Retrieval Ware which uses natural language processing and semantic networks for processing the information [13]. 1.1. OBJECTIVES AND PERFORMANCE DMEASURES FOR INFORMATION RETRIEVAL SYSTEM The main objective of IRS would be to reduce its overhead i.e., the time taken from leading to reading the required information. Distinguishing between a relevant and irrelevant can be the challenging aspect of IRS which might be due to problems like synonymsandpolysemy[4], whichis a word with multiplemeanings.Toevaluatetheperformance of search engines, measures like precision, recall and response time can be taken into consideration. Precision is the ratio between the number of relevant documents retrieved through a query to the total number of documents retrieved with help of a query. Recall istheratiobetween the numbers of documents retrieved to the total number of documents that are relevant with respect to a query. Response time is the elapsed time gap after the submission of a query and presentation of documents retrieved by the software [4]. Precision =Number of Relevant documents retrieved Total number of documents retrieved Recall = Number of documents retrieved Total number of documents that are retrieved Precision can be optimized by retrieving a certainlyrelevant document and recall by retrieving all documents in the database. Therefore, F-square combines both precision and recall and results in the harmonic mean of precision and recall[2]. Example 2.1: Assume the following: • A database contains 100 records regarding a topic • 50 were recorded as a result of search. • Of the 50 records retrieved, 45 were relevant. Using the designations above: • X =The number of relevant records retrieved • Y =The number of relevant records not retrieved, • Z =The number of irrelevant records retrieved. In this example X = 45,Y = 65 (100-45) and Z = 5 (50-45). Recall = (45 / (45 + 65)) * 100% => 45/110 * 100% = 40.909% Precision = (45 / (45 + 5)) * 100% => 45/50 * 100% = 90% 2. INFORMATION RETRIEVAL SYSTEMS FUNCTIONALITY Information retrieval and storage system basically contains of four different functional processes, Item normalization, Selective Dissemination, Document Database data search, Index Database search[1].
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 171 Fig-1. Functional Overview of Information Retrieval Systems 2.1. Item Normalization This is the first step before storing in the database. All the information in different format is converted into a single standard format to avoid ambiguity between information and to provide logical restructuring. The process of transforming multiple formats into a single data structure that can be easily processed is termedasitem normalization. This process includes identification of process tokens, characterization of tokens, Stemming of tokens, and then transforming into searchable data structure. 2.2. Selective Dissemination Item is processed with help of profile. It gives the capability to compare newly found items with the interested statements and results the documents matching the interest dynamically. If there is a profile match, the item is placed mail file with the profile after processing against every user profile. 2.3. Document Database Search Query is processed along with items which are prior processed. The items are searched in Document Database and can take longer time periods. 2.4. Index Database Search Saving a file for any future references, this is done through indexing process. 3. RETRIEVING DIGITAL VIDEO DATABASES USING ON- AIR Ontologies are used in Information retrieval system to search for a related term and improve precision and recall. Ontology is developed priory defining the terms and their corresponding relations. Ontology aided information retrieval is used to retrieve data in digital video databases using domain ontology .It consists of mainly two processes indexing and retrieval. 3.1. Indexing process Indexing process is responsible for creatingsearchabledata structure. Initially this process consists of Video collection and Domain ontology. Video collection consists of clips of videos of a long video assigned with keywords, set of images and transcription of speech. Domain Ontology contains the information for query expansion. Indexing process is performed by registering the video clips, the ontology and establishing the configuration values. This process produces three outputs  XML Configuration file: Contains the information about the associations betweenclips,transcriptions, keywords, resources etc.  Inverted Index: An Inverted Index data structure is used to store the frequencies of word occurrences along the word itself. It also stores weights of terms to contribute to precision and recall .Stemming is performed and stop words are excluded. Fig-2. Diagram representing a Inverted file structure. Figure-2 represents how tokens are stored in inverted data structure for the document 1 containing “I did enact Julius Caesar: I was killed i’ the capitol; Brutus killed me” and document 2 containing “So let it be with Caesar. The noble Brutus hath told you Caesar was ambitious”[5]. Initially the frequency of each term is tabulated and converted into inverted data structure containing posting lists .The following algorithm is a prototype of generating index file [6]. In the following algorithm, initially the frequencies of each word in the document is calculated and weights are calculated using Term Frequency-Inverse Document Frequency before stemming is performed.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 172 Fig-3. Algorithm representing Creation of index in Index File.  Ontology Data Structure: This helps in enhancing processing time. All the information is represented in a graph with the help of classes, sub-classes, instances, relationships etc.  Retrieval process: Using the information obtained in inverted file and ontology data structure, Visualizer is used to displaythe relevant information in the form of list. The process performed by a Visualizer is shown in Figure-4. Fig-4.Retrieval Process Pre-Process: In this stage, the misspelling detection is applied and also stop words are removed on encounter. Stemming process is applied and weightsare calculated. Query-Expansion: The degree of similarity is computed in Query Expansion process[7]. Retrieval: In this stage, the query is compared to video clip contents and information of maximum importance is returned. Vector space model is used for retrieval[9]. Video player: This software returns all the relevant videos and the user can choose upon his need. Searching for the information in long-duration videos is time consuming process. Currently research is going on in Ontologies which is used to retrieve the information from the web. Through the digital videos we can retrieve the information efficiently [14]. Most of the organizations are producing the massive volumes of data in a semi-structured format only that is in PDF formats, Reports and Tabular extraction of data. To enable the SQL-Like query and analysis of financial tables from annual reports in pdf format, deep learning based technique is used. For the nearest row match table type classification can be done. To queryandanalysethefinancial tables in pdf documents from distinct sources we will use text-match based approach. Similarity measure for query framework returns the similar rows andassociatedcolumns via different documents for given table row as a query. Queries can be given in two ways such as Querying table and querying by row/column. Query table permits the users to select a table then similar tables are returned and finally these queries can be refined using the rangeoperators.Deep learning architecturefortableclassificationhasbeendivided into Data collection and pre-processing. Tokenization and Classification. A technique and prototypeimplementationis presented for deep-learning based table processingpipeline architecture. It provides access to similar table data across various PDF files [15]. Example of IR Process Consider a query specified by the user. “How do you make money online??” Initially, stop words “how”, ’do”, “make” are separated and “money” and “online” are given weights according to the ontology defined. Depending upon the ontology therelevant data from the inverted index is retrieved and displayed. 4. INFORMATION RETRIEVAL SYSTEM CAPABILITIES 4.1. Search Capabilities: The Searching capabilities include mapping the user required query to the information present in database. “Weighting” has been used to rank and procure information in commercial systems[7]. Many functions are been defined to determine the relationships between search statements  Boolean Logic: Multiple concepts can be logically related by using Boolean logic. The Boolean operators like AND, OR, NOT are used to logically compute intersection, union and negation respectively.  Proximity: Proximity is used to restrict the distance between two search terms within a query. When a certain proximity parameter is crossed between two search items, they are ignored.  Contiguous word phrase: When two or more units are considered as single unit then they are called as contiguous word phrase. ”United Arab Emirates” is a contiguous word phrase.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 173  Term Masking : Term Masking is the process of masking a certain portion of term and matching it with the unmasked portion of the term. Stemming can be used as the size is huge .It can be either fixed length masking or variable length masking.  Concept/Thesaurus expansion: Thesaurus function is used to group a term along with its term which has a similar meaning. Concept tree is used to group similar items in the form of tree structure [1].Figure-5 depicts concept expansion of Orchestra, In Orchestra different kind of instruments are used like woodwinds, brass, percussion, strings and keyboard. Further woodwinds, brass are subdivided into their respective instruments [6]. Figure-5:Concept Expansion 4.2. Browsing Capabilities: The jargon associated with browsing are ranking, zoning, highlighting. Ranking is the process where different tokens of the query are provided with values on their relevance. Zoning is the process of dividing the standardizedinputinto meaningful divisions. Highlighting is usedtograbtheuser’s attention to particular parts. 5. INDEXING PROCESS IN INFORMATION RETRIEVAL SYSTEMS The indexing process consists of three basic steps: defining the data source, transforming document contenttogenerate a logical view, and building an index of the text on the logical view[10] .In order, to retrieve information,thereceiveditem must be converted into a data structurethatcanbesearched. Indexing process can be automatic or manual. Search can be either direct search in which information in present in a simple document or it can be indirect i.e., with the help of index files. Some systems instead of creating data structure, the item is completely transformed into a different representation which is used as searchable data structure. 5.1. Automatic Indexing In automatic indexing process, system determines the index terms and assigns them automatically. Automatic indexing can be complicated when the determining the limited number of index terms from the topic compared to manual indexing [1]. Automatic indexing would result in twoclasses of indexes. They are weighted and un-weighted. In weighted systems, a value is placed on index term and its related concept in the document. The weight is dependent on the function which is responsible for counting the frequency of occurring in each item. In case of un-weighted systems, the queries are dependent on Boolean logic and occurrences in hit file which is considered as value. Techniques of Automatic Indexing: 5.2. Indexing by term: When the basis of indexing is the term that belongs to the original item, the index can be created by two techniques. 5.3. Statistical Techniques: Statistical techniques can be models like vector models, probabilistic model or Bayesian model[1]. All these models are based on statistical information like frequency and their distribution. Weights are calculated considering frequency and distributionsinthedatabase.The weighted systems are also called as Vectorized information systems. In vectorized information system, the weights are stored in the form of vectors. All the queries can be converted into vector forms and the distancebetween query vector and information present in data base vector is calculated. In probabilistic model, Bayesian product is calculated with the help of Bayesian network. Bayesian network can be considered as an acyclic graph that is directed where the random variables are definedintheform of nodes and the probabilistic dependency among the node and its parents are denoted by the arcs between the nodes. Figure-6: Two level Bayesian Network Natural language processing is another method of defining indexes to terms. The DR LINK systems process the items at different levels like morphological, semantic, lexical, syntactic and discourse. DR LINK stands for document retrieval throughlinguisticknowledge.Eachlevel inDRLINK
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 174 uses the information of previous level to perform analysisof the present level. 5.4. Indexing by Concept There are numerous ways of expressing an idea outofwhich only one can give the enhanced retrieval performance;thisis the basis of Indexing by Concept. The Concept indexing will determine a collection of concepts. The concepts are determined based on the test set terms and these tests are used as basis for indexing the items. 6. CONCLUSION Information retrieval system has the high potential to revolutionize the present and conventional storing and retrieving techniques. Ten years ago, the algorithms developed were restricted in scope allowingtheoreticiansto limit their focus to very specific areas. But the insertion of technology in systems like Internet has changed the way problems were bounded.Hence,efficientalgorithmsmustbe constructed in order to handle enormous data and minimal user search statement informationmust beconsideredalong with optimal usage of functional aspects of Information Retrieval system. REFERENCES 1. Gerald J Kowalski, Mark T Maybury: Information Storage and Retrieval Systems, Theory and Implementation. 2. Paz-Trillo, C., Wassermann, R., and Braga, P: An Information Retrieval Application using Ontologies (2005). 3. R. Baeza-Yates and B. Ribeiro-Neto.Modern Information Retrieval. Addison Wesley Longman, 1999. 4. M. Mauldin. Conceptual Information Retrieval: A case study in adaptive partial parsing. Kluwer Academic Publishers, 1991. 5. Stanford Home Page: https://nlp.stanford.edu/IR- book/html/htmledition/a-first-take-at-building-an-inverted- index-1.html 6.ResearcherGate:https://www.researchgate.net/figure/CY C-assisted-concept-enpansion-tree-An-example-of- demonstrating-concept-expansion-is_fig1_257728125 7. Information Retrieval: search process, techniques and strategies KiranPrakashBachchhav Librarian, SarvajanikShikshanSanstha’sAdv.V.B.Deshpande College of Commerce, Mulund (West), Mumbai- 400080 8.Berkeley.edu:http://people.ischool.berkeley.edu/~buckla nd/papers/analysis/node1.html 9. M. Smith, C. Welthy, and D. McGuiness.OWL Web Ontology Language Guide. Technical report, World Wide Web Consortium, 2004.http://www.w3.org/TR/2004/REC-owl- guide-20040210/ 10. Indexing:https://www.springer.com 11. J.F. Sequeda, D.P. Miranker, A pay-as-you-go methodology for ontology-based data access, IEEE Internet Comput. 21 (2) (2017) 92–96. 12. M. Rodriguez-Muro, R. Kontchakov, M. Zakharyaschev, Ontology-based data access: ontop of databases, in: International Semantic Web Conference, Springer, 2013, pp. 558–573. 13. Vise, David A. (2004-12-03). "Agencies Find What They're Looking For".The WashingtonPost.Retrieved 2010- 05-22. 14. Christian Paz-Trillo, et al,”Using ontologies to retrieve video information”. 15. Rahul Anand, et al, “Integrating and querying similar tables from PDF documents using deep learning”, 15 jan 2019. BIOGRAPHIES First Author: K. Srujan Kumar,currentlypursuingIIIB.Tech, CSE at GITAM UNIVERSITY, HYDERABAD. My Research areas are Machine Learning, Predictive Analytics. Second Author: K Radha working as an Asst Professor at GITAM University, Hyderabad. She has Completed MTech (CSE) at JNTUH, Pursuing PhD at KL University, Vijayawada. She has 12 years of Teaching Experience and 1 Year Industrial Experience. Shehaspublishednumerousresearch papers and presented at various conferences. She is a Member of IAENG.