The growing number of datasets published on the Web as linked data brings both opportunities for high data
availability of data. As the data increases challenges for querying also increases. It is very difficult to search
linked data using structured languages. Hence, we use Keyword Query searching for linked data. In this paper,
we propose different approaches for keyword query routing through which the efficiency of keyword search can
be improved greatly. By routing the keywords to the relevant data sources the processing cost of keyword search
queries can be greatly reduced. In this paper, we contrast and compare four models – Keyword level, Element
level, Set level and query expansion using semantic and linguistic analysis. These models are used for keyword
query routing in keyword search.
Existing work on keyword search relies on an element-level model (data graphs) to compute keyword query results.Elements mentioning keywords are retrieved from this model and paths between them are explored to compute Steiner graphs. KRG (keyword Element Relationship Graph) captures relationships at the keyword level.Relationships captured by a KRG are not direct edges between tuples but stand for paths between keywords.
We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. A multilevel scoring mechanism is proposed for computing the relevance of routing plans based on scores at the level of keywords, data elements,element sets, and subgraphs that connect these elements
Computing semantic similarity measure between words using web search enginecsandit
Semantic Similarity measures between words plays an important role in information retrieval,
natural language processing and in various tasks on the web. In this paper, we have proposed a
Modified Pattern Extraction Algorithm to compute the supervised semantic similarity measure
between the words by combining both page count method and web snippets method. Four
association measures are used to find semantic similarity between words in page count method
using web search engines. We use a Sequential Minimal Optimization (SMO) support vector
machines (SVM) to find the optimal combination of page counts-based similarity scores and
top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous
word-pairs and non-synonymous word-pairs. The proposed Modified Pattern Extraction
Algorithm outperforms by 89.8 percent of correlation value.
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
To Get any Project for CSE, IT ECE, EEE Contact Me @ 09666155510, 09849539085 or mail us - ieeefinalsemprojects@gmail.com-Visit Our Website: www.finalyearprojects.org
Existing work on keyword search relies on an element-level model (data graphs) to compute keyword query results.Elements mentioning keywords are retrieved from this model and paths between them are explored to compute Steiner graphs. KRG (keyword Element Relationship Graph) captures relationships at the keyword level.Relationships captured by a KRG are not direct edges between tuples but stand for paths between keywords.
We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. A multilevel scoring mechanism is proposed for computing the relevance of routing plans based on scores at the level of keywords, data elements,element sets, and subgraphs that connect these elements
Computing semantic similarity measure between words using web search enginecsandit
Semantic Similarity measures between words plays an important role in information retrieval,
natural language processing and in various tasks on the web. In this paper, we have proposed a
Modified Pattern Extraction Algorithm to compute the supervised semantic similarity measure
between the words by combining both page count method and web snippets method. Four
association measures are used to find semantic similarity between words in page count method
using web search engines. We use a Sequential Minimal Optimization (SMO) support vector
machines (SVM) to find the optimal combination of page counts-based similarity scores and
top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous
word-pairs and non-synonymous word-pairs. The proposed Modified Pattern Extraction
Algorithm outperforms by 89.8 percent of correlation value.
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
To Get any Project for CSE, IT ECE, EEE Contact Me @ 09666155510, 09849539085 or mail us - ieeefinalsemprojects@gmail.com-Visit Our Website: www.finalyearprojects.org
We are good IEEE java projects development center in Chennai and Pondicherry. We guided advanced java technologies projects of cloud computing, data mining, Secure Computing, Networking, Parallel & Distributed Systems, Mobile Computing and Service Computing (Web Service).
For More Details:
http://jpinfotech.org/final-year-ieee-projects/2014-ieee-projects/java-projects/
An Advanced IR System of Relational Keyword Search Techniquepaperpublications3
Abstract: Now these days keyword search to relational data set becomes an area of research within the data base and Information Retrieval. There is no standard process of information retrieval, which will clearly show the accurate result also it shows keyword search with ranking. Execution time is retrieving of data is more in existing system. We propose a system for increasing performance of relational keyword search systems. In the proposed system we combine schema-based and graph-based approaches and propose a Relational Keyword Search System to overcome the mentioned disadvantages of existing systems and manage the information and user access the information very efficiently. Keyword Search with the ranking requires very low execution time. Execution time of retrieving information and file length during Information retrieval can be display using chart.Keywords: Keyword Search, Datasets, Information Retrieval Query Workloads, Schema-based Systems, Graph-based Systems, ranking, relational databases.
Title: An Advanced IR System of Relational Keyword Search Technique
Author: Dhananjay A. Gholap, Gumaste S. V
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Paper Publications
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Efficiently searching nearest neighbor in documents using keywordseSAT Journals
Abstract Conservative spatial queries, such as range search and nearest neighbor reclamation, involve only conditions on objects’ numerical properties. Today, many modern applications call for novel forms of queries that aim to find objects satisfying both a spatial predicate, and a predicate on their associated texts. For example, instead of considering all the restaurants, a nearest neighbor query would instead ask for the restaurant that is the closest among those whose menus contain “steak, spaghetti, brandy” all at the same time. Currently the best solution to such queries is based on the InformationRetrieval2-tree, which, has a few deficiencies that seriously impact its efficiency. Motivated by this, there is a development of a new access method called the spatial inverted index that extends the conventional inverted index to cope with multidimensional data, and comes with algorithms that can answer nearest neighbor queries with keywords in real time. As verified by experiments, the proposed techniques outperform the InformationRetrieval2-tree in query response time significantly, often by a factor of orders of magnitude Keywords: Information retrieval, spatial index, keyword search
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...INFOGAIN PUBLICATION
The Improved Web Explorer aims at extraction and selection of the best possible hyperlinks and retrieving more accurate search results for the entered search query. The hyperlinks that are more preferable to the entered search query are evaluated by taking into account weighted values of frequencies of words in search string that are present in anchor texts and plain texts available in title and body tags of various hyperlink pages respectively to retrieve relevant hyperlinks from all available links. Then the concept of ontology is used to gain insights of words in search string by finding their hypernyms, hyponyms and synsets to reach to the source and context of the words in search string. The Explicit Semantic Similarity analysis along with Naïve Bayes method is used to find the semantic similarity between lexically different terms using Wikipedia and Google as explicit semantic analysis tools and calculating the probabilities of occurrence of words in anchor and body texts .Vector Space Model is being used to calculate Term frequency and Inverse document frequency values, and then calculate cosine similarities between the entered Search query and extracted relevant hyperlinks to get the most appropriate relevance wise ranked search results to the entered search string
The International Journal of Engineering and Science (IJES)theijes
The International Journal of Engineering & Science is aimed at providing a platform for researchers, engineers, scientists, or educators to publish their original research results, to exchange new ideas, to disseminate information in innovative designs, engineering experiences and technological skills. It is also the Journal's objective to promote engineering and technology education. All papers submitted to the Journal will be blind peer-reviewed. Only original articles will be published.
Using Page Size for Controlling Duplicate Query Results in Semantic WebIJwest
Semantic web is a web of future. The Resource Description Framework (RDF) is a language
to represent resources in the World Wide Web. When these resources are queried the problem of duplicate
query results occurs. The present techniques used hash index comparison to remove duplicate query
results. The major drawback of using the hash index to remove duplicate query results is that, if there is a
slight change in formatting or word order, then hash index is changed and query results are no more
considered as duplicate even though they have same contents. We presented an algorithm for detection and
elimination of duplicate query results from semantic web using hash index and page size comparisons.
Experimental results showed that the proposed technique removed duplicate query results from semantic
web efficiently, solved the problems of using hash index for duplicate handling and could be embedded in
existing SQL-Based query system for semantic web. Research could be carried out for certain flexibilities
in existing SQL-Based query system of semantic web to accommodate other duplicate detection techniques
as well.
Detecting Gender-bias from Energy Modeling Jobscapeyungahhh
Despite of many strides made by women in the workplace, workplace inequality persists, whether they come from geographical cultures or a result of workplace attrition. Unconscious (or implicit) bias can creep
into job listings and can actively or passively discourage certain applicants’ pool. The goal of this study is to understand the role of gender bias in the energy modeling job market.
Job listings and their word choices can significantly affect the application pool. This study investigates the word choices from job postings specifically for the energy modeling sector within the United States building construction industry, using web-scraped data of an online job site. The study explores the potential usage of gender-bias terminology through frequency analysis and Natural Language Processing Kit. The study also employs two machine learning methods (one supervise and one unsupervised) to test the application in this
context and evaluate its utilization.
WEB SEARCH ENGINE BASED SEMANTIC SIMILARITY MEASURE BETWEEN WORDS USING PATTE...cscpconf
Semantic Similarity measures plays an important role in information retrieval, natural language processing and various tasks on web such as relation extraction, community mining, document clustering, and automatic meta-data extraction. In this paper, we have proposed a Pattern Retrieval Algorithm [PRA] to compute the semantic similarity measure between the words by
combining both page count method and web snippets method. Four association measures are used to find semantic similarity between words in page count method using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machines (SVM) to find the optimal combination of page counts-based similarity scores and top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and nonsynonymous word-pairs. The proposed approach aims to improve the Correlation values,
Precision, Recall, and F-measures, compared to the existing methods. The proposed algorithm outperforms by 89.8 % of correlation value.
Topic Modeling : Clustering of Deep Webpagescsandit
The internet is comprised of massive amount of info
rmation in the form of zillions of web
pages.This information can be categorized into the
surface web and the deep web. The existing
search engines can effectively make use of surface
web information.But the deep web remains
unexploited yet. Machine learning techniques have b
een commonly employed to access deep
web content.
Under Machine Learning, topic models provide a simp
le way to analyze large volumes of
unlabeled text. A "topic" consists of a cluster of
words that frequently occur together. Using
contextual clues, topic models can connect words wi
th similar meanings and distinguish
between words with multiple meanings. Clustering is
one of the key solutions to organize the
deep web databases.In this paper, we cluster deep w
eb databases based on the relevance found
among deep web forms by employing a generative prob
abilistic model called Latent Dirichlet
Allocation(LDA) for modeling content representative
of deep web databases. This is
implemented after preprocessing the set of web page
s to extract page contents and form
contents.Further, we contrive the distribution of “
topics per document” and “words per topic”
using the technique of Gibbs sampling. Experimental
results show that the proposed method
clearly outperforms the existing clustering methods
.
Context-Based Diversification for Keyword Queries over XML Data1crore projects
IEEE PROJECTS 2015
1 crore projects is a leading Guide for ieee Projects and real time projects Works Provider.
It has been provided Lot of Guidance for Thousands of Students & made them more beneficial in all Technology Training.
Dot Net
DOTNET Project Domain list 2015
1. IEEE based on datamining and knowledge engineering
2. IEEE based on mobile computing
3. IEEE based on networking
4. IEEE based on Image processing
5. IEEE based on Multimedia
6. IEEE based on Network security
7. IEEE based on parallel and distributed systems
Java Project Domain list 2015
1. IEEE based on datamining and knowledge engineering
2. IEEE based on mobile computing
3. IEEE based on networking
4. IEEE based on Image processing
5. IEEE based on Multimedia
6. IEEE based on Network security
7. IEEE based on parallel and distributed systems
ECE IEEE Projects 2015
1. Matlab project
2. Ns2 project
3. Embedded project
4. Robotics project
Eligibility
Final Year students of
1. BSc (C.S)
2. BCA/B.E(C.S)
3. B.Tech IT
4. BE (C.S)
5. MSc (C.S)
6. MSc (IT)
7. MCA
8. MS (IT)
9. ME(ALL)
10. BE(ECE)(EEE)(E&I)
TECHNOLOGY USED AND FOR TRAINING IN
1. DOT NET
2. C sharp
3. ASP
4. VB
5. SQL SERVER
6. JAVA
7. J2EE
8. STRINGS
9. ORACLE
10. VB dotNET
11. EMBEDDED
12. MAT LAB
13. LAB VIEW
14. Multi Sim
CONTACT US
1 CRORE PROJECTS
Door No: 214/215,2nd Floor,
No. 172, Raahat Plaza, (Shopping Mall) ,Arcot Road, Vadapalani, Chennai,
Tamin Nadu, INDIA - 600 026
Email id: 1croreprojects@gmail.com
website:1croreprojects.com
Phone : +91 97518 00789 / +91 72999 51536
Nagano visitamos castillo Matsumoto, museo,templos ..ZenkojiB & M Co., Ltd.
Fuimos a Nagano visitamos el castillo Matsumoto,
museo,templos el mejor Zenkoji ,
tome unos bocetos a lápiz
Recorrido de 600 km desde Nagoya ... ..
We went to visit the Matsumoto Castle Nagano,
museum, temples Zenkoji best
take a few pencil sketches
Route 600 km from Nagoya .....
私たちは、松本城長野を訪問することを行ってきました
博物館、寺院善光寺最高
いくつかの鉛筆のスケッチを取る
ルート名古屋から600キロ.....
https://www.facebook.com/notes/b-m-galeria-arte/mirando-pr%C3%B3xima-visita-castillos-en-nagano-jap%C3%B3n-looking-next-visit-castles-in-n/751904238183646
We are good IEEE java projects development center in Chennai and Pondicherry. We guided advanced java technologies projects of cloud computing, data mining, Secure Computing, Networking, Parallel & Distributed Systems, Mobile Computing and Service Computing (Web Service).
For More Details:
http://jpinfotech.org/final-year-ieee-projects/2014-ieee-projects/java-projects/
An Advanced IR System of Relational Keyword Search Techniquepaperpublications3
Abstract: Now these days keyword search to relational data set becomes an area of research within the data base and Information Retrieval. There is no standard process of information retrieval, which will clearly show the accurate result also it shows keyword search with ranking. Execution time is retrieving of data is more in existing system. We propose a system for increasing performance of relational keyword search systems. In the proposed system we combine schema-based and graph-based approaches and propose a Relational Keyword Search System to overcome the mentioned disadvantages of existing systems and manage the information and user access the information very efficiently. Keyword Search with the ranking requires very low execution time. Execution time of retrieving information and file length during Information retrieval can be display using chart.Keywords: Keyword Search, Datasets, Information Retrieval Query Workloads, Schema-based Systems, Graph-based Systems, ranking, relational databases.
Title: An Advanced IR System of Relational Keyword Search Technique
Author: Dhananjay A. Gholap, Gumaste S. V
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Paper Publications
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Efficiently searching nearest neighbor in documents using keywordseSAT Journals
Abstract Conservative spatial queries, such as range search and nearest neighbor reclamation, involve only conditions on objects’ numerical properties. Today, many modern applications call for novel forms of queries that aim to find objects satisfying both a spatial predicate, and a predicate on their associated texts. For example, instead of considering all the restaurants, a nearest neighbor query would instead ask for the restaurant that is the closest among those whose menus contain “steak, spaghetti, brandy” all at the same time. Currently the best solution to such queries is based on the InformationRetrieval2-tree, which, has a few deficiencies that seriously impact its efficiency. Motivated by this, there is a development of a new access method called the spatial inverted index that extends the conventional inverted index to cope with multidimensional data, and comes with algorithms that can answer nearest neighbor queries with keywords in real time. As verified by experiments, the proposed techniques outperform the InformationRetrieval2-tree in query response time significantly, often by a factor of orders of magnitude Keywords: Information retrieval, spatial index, keyword search
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...INFOGAIN PUBLICATION
The Improved Web Explorer aims at extraction and selection of the best possible hyperlinks and retrieving more accurate search results for the entered search query. The hyperlinks that are more preferable to the entered search query are evaluated by taking into account weighted values of frequencies of words in search string that are present in anchor texts and plain texts available in title and body tags of various hyperlink pages respectively to retrieve relevant hyperlinks from all available links. Then the concept of ontology is used to gain insights of words in search string by finding their hypernyms, hyponyms and synsets to reach to the source and context of the words in search string. The Explicit Semantic Similarity analysis along with Naïve Bayes method is used to find the semantic similarity between lexically different terms using Wikipedia and Google as explicit semantic analysis tools and calculating the probabilities of occurrence of words in anchor and body texts .Vector Space Model is being used to calculate Term frequency and Inverse document frequency values, and then calculate cosine similarities between the entered Search query and extracted relevant hyperlinks to get the most appropriate relevance wise ranked search results to the entered search string
The International Journal of Engineering and Science (IJES)theijes
The International Journal of Engineering & Science is aimed at providing a platform for researchers, engineers, scientists, or educators to publish their original research results, to exchange new ideas, to disseminate information in innovative designs, engineering experiences and technological skills. It is also the Journal's objective to promote engineering and technology education. All papers submitted to the Journal will be blind peer-reviewed. Only original articles will be published.
Using Page Size for Controlling Duplicate Query Results in Semantic WebIJwest
Semantic web is a web of future. The Resource Description Framework (RDF) is a language
to represent resources in the World Wide Web. When these resources are queried the problem of duplicate
query results occurs. The present techniques used hash index comparison to remove duplicate query
results. The major drawback of using the hash index to remove duplicate query results is that, if there is a
slight change in formatting or word order, then hash index is changed and query results are no more
considered as duplicate even though they have same contents. We presented an algorithm for detection and
elimination of duplicate query results from semantic web using hash index and page size comparisons.
Experimental results showed that the proposed technique removed duplicate query results from semantic
web efficiently, solved the problems of using hash index for duplicate handling and could be embedded in
existing SQL-Based query system for semantic web. Research could be carried out for certain flexibilities
in existing SQL-Based query system of semantic web to accommodate other duplicate detection techniques
as well.
Detecting Gender-bias from Energy Modeling Jobscapeyungahhh
Despite of many strides made by women in the workplace, workplace inequality persists, whether they come from geographical cultures or a result of workplace attrition. Unconscious (or implicit) bias can creep
into job listings and can actively or passively discourage certain applicants’ pool. The goal of this study is to understand the role of gender bias in the energy modeling job market.
Job listings and their word choices can significantly affect the application pool. This study investigates the word choices from job postings specifically for the energy modeling sector within the United States building construction industry, using web-scraped data of an online job site. The study explores the potential usage of gender-bias terminology through frequency analysis and Natural Language Processing Kit. The study also employs two machine learning methods (one supervise and one unsupervised) to test the application in this
context and evaluate its utilization.
WEB SEARCH ENGINE BASED SEMANTIC SIMILARITY MEASURE BETWEEN WORDS USING PATTE...cscpconf
Semantic Similarity measures plays an important role in information retrieval, natural language processing and various tasks on web such as relation extraction, community mining, document clustering, and automatic meta-data extraction. In this paper, we have proposed a Pattern Retrieval Algorithm [PRA] to compute the semantic similarity measure between the words by
combining both page count method and web snippets method. Four association measures are used to find semantic similarity between words in page count method using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machines (SVM) to find the optimal combination of page counts-based similarity scores and top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and nonsynonymous word-pairs. The proposed approach aims to improve the Correlation values,
Precision, Recall, and F-measures, compared to the existing methods. The proposed algorithm outperforms by 89.8 % of correlation value.
Topic Modeling : Clustering of Deep Webpagescsandit
The internet is comprised of massive amount of info
rmation in the form of zillions of web
pages.This information can be categorized into the
surface web and the deep web. The existing
search engines can effectively make use of surface
web information.But the deep web remains
unexploited yet. Machine learning techniques have b
een commonly employed to access deep
web content.
Under Machine Learning, topic models provide a simp
le way to analyze large volumes of
unlabeled text. A "topic" consists of a cluster of
words that frequently occur together. Using
contextual clues, topic models can connect words wi
th similar meanings and distinguish
between words with multiple meanings. Clustering is
one of the key solutions to organize the
deep web databases.In this paper, we cluster deep w
eb databases based on the relevance found
among deep web forms by employing a generative prob
abilistic model called Latent Dirichlet
Allocation(LDA) for modeling content representative
of deep web databases. This is
implemented after preprocessing the set of web page
s to extract page contents and form
contents.Further, we contrive the distribution of “
topics per document” and “words per topic”
using the technique of Gibbs sampling. Experimental
results show that the proposed method
clearly outperforms the existing clustering methods
.
Context-Based Diversification for Keyword Queries over XML Data1crore projects
IEEE PROJECTS 2015
1 crore projects is a leading Guide for ieee Projects and real time projects Works Provider.
It has been provided Lot of Guidance for Thousands of Students & made them more beneficial in all Technology Training.
Dot Net
DOTNET Project Domain list 2015
1. IEEE based on datamining and knowledge engineering
2. IEEE based on mobile computing
3. IEEE based on networking
4. IEEE based on Image processing
5. IEEE based on Multimedia
6. IEEE based on Network security
7. IEEE based on parallel and distributed systems
Java Project Domain list 2015
1. IEEE based on datamining and knowledge engineering
2. IEEE based on mobile computing
3. IEEE based on networking
4. IEEE based on Image processing
5. IEEE based on Multimedia
6. IEEE based on Network security
7. IEEE based on parallel and distributed systems
ECE IEEE Projects 2015
1. Matlab project
2. Ns2 project
3. Embedded project
4. Robotics project
Eligibility
Final Year students of
1. BSc (C.S)
2. BCA/B.E(C.S)
3. B.Tech IT
4. BE (C.S)
5. MSc (C.S)
6. MSc (IT)
7. MCA
8. MS (IT)
9. ME(ALL)
10. BE(ECE)(EEE)(E&I)
TECHNOLOGY USED AND FOR TRAINING IN
1. DOT NET
2. C sharp
3. ASP
4. VB
5. SQL SERVER
6. JAVA
7. J2EE
8. STRINGS
9. ORACLE
10. VB dotNET
11. EMBEDDED
12. MAT LAB
13. LAB VIEW
14. Multi Sim
CONTACT US
1 CRORE PROJECTS
Door No: 214/215,2nd Floor,
No. 172, Raahat Plaza, (Shopping Mall) ,Arcot Road, Vadapalani, Chennai,
Tamin Nadu, INDIA - 600 026
Email id: 1croreprojects@gmail.com
website:1croreprojects.com
Phone : +91 97518 00789 / +91 72999 51536
Nagano visitamos castillo Matsumoto, museo,templos ..ZenkojiB & M Co., Ltd.
Fuimos a Nagano visitamos el castillo Matsumoto,
museo,templos el mejor Zenkoji ,
tome unos bocetos a lápiz
Recorrido de 600 km desde Nagoya ... ..
We went to visit the Matsumoto Castle Nagano,
museum, temples Zenkoji best
take a few pencil sketches
Route 600 km from Nagoya .....
私たちは、松本城長野を訪問することを行ってきました
博物館、寺院善光寺最高
いくつかの鉛筆のスケッチを取る
ルート名古屋から600キロ.....
https://www.facebook.com/notes/b-m-galeria-arte/mirando-pr%C3%B3xima-visita-castillos-en-nagano-jap%C3%B3n-looking-next-visit-castles-in-n/751904238183646
Ви переглядаєте цей каталог, де знайдете актуальні інформаційні матеріали та контакти тих установ, які цікавлять саме Вас. Збірник контактів дозвільних установ - це комплексний довідник контактів, послуг та документів, які безпосередньо можуть бути пов'язані з Вашим бізнесом. Цей проспект розроблено з врахуванням потреб підприємців у налагодженні контактів із дозвільними установами командою Центру сприяння бізнесу.
Центр сприяння бізнесу 33003, Україна м. Рівне,
вул. Гагаріна, 39 www.bizcenter.rv.ua
*Даний документ виготовлений за фінансової підтримки Європейського Союзу, в рамках Програми Транскордонного Співробітництва Польща-Білорусь-Україна 2007-2013. Відповідальність за зміст даної публікації несе виключно виконавчий комітет Рівненської міської ради і він не відображає позиції Європейського Союзу.
A Proposed Equation for Elastic Modulus of High-Strength Concrete Using Local...IJERA Editor
There several of equations to determine the modulus of elasticity by codes of practice and researchers. They
differ in the form of the equations and their parameter functions. Many codes and researches advise engineers of
the dependence of the modulus of elasticity on the aggregate type, size and shape; and, hence, it is wise to
determine the concrete properties for the specified mix from the trial batches. This paper considers the Iraqi
aggregates used in producing the HSC to develop and equation for prediction the modulus of elasticity for HSC.
Modulus of elasticity of high strength concrete using Iraqi aggregate with a wide range of 41 to 83.3 MPa has
been studied and by analyzing 69 tests from the available literature. An empirical equation has been proposed
for prediction of modulus elasticity presents the local aggregate in Iraq. The predicted values are compared with
the predictions by codes of practice like ACI 318-02, EC2-02 and a practical equation by Noguchi et al. It has
been found that there are differences in the predictions between them and the proposed equation. The ACI
overestimates the modulus of elasticity for these tests and 80% of the tests are below it, while EC2 values are
over conservative as they are below 78% of test values. The proposed equation is lie between the two codes. The
prediction by Noguchi et al. showed better results as they are very close to the proposed equation.
To Get any Project for CSE, IT ECE, EEE Contact Me @ 09666155510, 09849539085 or mail us - ieeefinalsemprojects@gmail.com-Visit Our Website: www.finalyearprojects.org
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASESIJCSEIT Journal
Keyword search in relational databases allows user to search information without knowing database
schema and using structural query language (SQL). In this paper, we address the problem of generating
and evaluating candidate networks. In candidate network generation, the overhead is caused by raising the
number of joining tuples for the size of minimal candidate network. To reduce overhead, we propose
candidate network generation algorithms to generate a minimum number of joining tuples according to the
maximum number of tuple set. We first generate a set of joining tuples, candidate networks (CNs). It is
difficult to obtain an optimal query processing plan during generating a number of joins. We also develop a
dynamic CN evaluation algorithm (D_CNEval) to generate connected tuple trees (CTTs) by reducing the
size of intermediate joining results. The performance evaluation of the proposed algorithms is conducted
on IMDB and DBLP datasets and also compared with existing algorithms.
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...cscpconf
Semantic Similarity measures between words plays an important role in information retrieval, natural language processing and in various tasks on the web. In this paper, we have proposed a Modified Pattern Extraction Algorithm to compute the supervised semantic similarity measure
between the words by combining both page count method and web snippets method. Four association measures are used to find semantic similarity between words in page count method
using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machines (SVM) to find the optimal combination of page counts-based similarity scores and
top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and non-synonymous word-pairs. The proposed Modified Pattern Extraction
Algorithm outperforms by 89.8 percent of correlation value.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
SEMANTICS GRAPH MINING FOR TOPIC DISCOVERY AND WORD ASSOCIATIONSIJDKP
Big Data creates many challenges for data mining experts, in particular in getting meanings of text data. It is beneficial for text mining to build a bridge between word embedding process and graph capacity to connect the dots and represent complex correlations between entities. In this study we examine processes of building a semantic graph model to determine word associations and discover document topics. We introduce a novel Word2Vec2Graph model that is built on top of Word2Vec word embedding model. We demonstrate how this model can be used to analyze long documents, get unexpected word associations and uncover document topics. To validate topic discovery method we transfer words to vectors and vectors to images and use CNN deep learning image classification.
Conceptual Similarity Measurement Algorithm For Domain Specific OntologyZac Darcy
This paper presents the similarity measurement algorithm for domain specific terms collected in the
ontology based data integration system. This similarity measurement algorithm can be used in ontology
mapping and query service of ontology based data integration system. In this paper, we focus on the web
query service to apply this proposed algorithm. Concepts similarity is important for web query service
because the words in user input query are not same wholly with the concepts in ontology. So, we need to
extract the possible concepts that are match or related to the input words with the help of machine readable
dictionary WordNet. Sometimes, we use the generated mapping rules in query generation procedure for
some words that cannot be confirmed the similarity of these words by WordNet. We prove the effect of this
algorithm with two degree semantic result of web mining by generating the concepts results obtained form
the input query.
Conceptual similarity measurement algorithm for domain specific ontology[Zac Darcy
This paper presents the similarity measurement algorithm for domain specific terms collected in the
ontology based data integration system. This similarity measurement algorithm can be used in ontology
mapping and query service of
ontology based data integration sy
stem. In this paper, we focus
o
n the web
query service to apply
this proposed algorithm
. Concepts similarity is important for web query service
because the words in user input query are not
same wholly with the concepts in
ontology. So, we need to
extract the possible concepts that are match or related to the input words with the help of machine readable
dictionary WordNet. Sometimes, we use the generated mapping rules in query generation procedure for
some words that canno
t be
confirmed the similarity of these words
by WordNet. We prove the effect
of this
algorithm with two degree semantic result of web minin
g by generating
the concepts results obtained form
the input query
An Efficient Annotation of Search Results Based on Feature Ranking Approach f...Computer Science Journals
With the increased number of web databases, major part of deep web is one of the bases of database. In several search engines, encoded data in the returned resultant pages from the web often comes from structured databases which are referred as Web databases (WDB).
Computing semantic similarity between two words comes with variety of approaches. This is mainly
essential for the applications such as text analysis, text understanding. In traditional system search engines are used to compute the similarity between words. In that search engines are keyword based. There is one drawback that user should know what exactly they are looking for. There are mainly two main approaches for computation namely
knowledge based and corpus based approaches. But there is one drawback that these two approaches are not suitable for computing similarity between multi-word expressions. This system provides efficient and effective approach for computing term similarity using semantic network approach. A clustering approach is used in order to improve the
accuracy of the semantic similarity. This approach is more efficient than other computing algorithms. technique
can also apply to large scale dataset to compute term similarity.
Similar to Approaches for Keyword Query Routing (20)
Forklift Classes Overview by Intella PartsIntella Parts
Discover the different forklift classes and their specific applications. Learn how to choose the right forklift for your needs to ensure safety, efficiency, and compliance in your operations.
For more technical information, visit our website https://intellaparts.com
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...Amil Baba Dawood bangali
Contact with Dawood Bhai Just call on +92322-6382012 and we'll help you. We'll solve all your problems within 12 to 24 hours and with 101% guarantee and with astrology systematic. If you want to take any personal or professional advice then also you can call us on +92322-6382012 , ONLINE LOVE PROBLEM & Other all types of Daily Life Problem's.Then CALL or WHATSAPP us on +92322-6382012 and Get all these problems solutions here by Amil Baba DAWOOD BANGALI
#vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore#blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #blackmagicforlove #blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #Amilbabainuk #amilbabainspain #amilbabaindubai #Amilbabainnorway #amilbabainkrachi #amilbabainlahore #amilbabaingujranwalan #amilbabainislamabad
Cosmetic shop management system project report.pdfKamal Acharya
Buying new cosmetic products is difficult. It can even be scary for those who have sensitive skin and are prone to skin trouble. The information needed to alleviate this problem is on the back of each product, but it's thought to interpret those ingredient lists unless you have a background in chemistry.
Instead of buying and hoping for the best, we can use data science to help us predict which products may be good fits for us. It includes various function programs to do the above mentioned tasks.
Data file handling has been effectively used in the program.
The automated cosmetic shop management system should deal with the automation of general workflow and administration process of the shop. The main processes of the system focus on customer's request where the system is able to search the most appropriate products and deliver it to the customers. It should help the employees to quickly identify the list of cosmetic product that have reached the minimum quantity and also keep a track of expired date for each cosmetic product. It should help the employees to find the rack number in which the product is placed.It is also Faster and more efficient way.
Final project report on grocery store management system..pdfKamal Acharya
In today’s fast-changing business environment, it’s extremely important to be able to respond to client needs in the most effective and timely manner. If your customers wish to see your business online and have instant access to your products or services.
Online Grocery Store is an e-commerce website, which retails various grocery products. This project allows viewing various products available enables registered users to purchase desired products instantly using Paytm, UPI payment processor (Instant Pay) and also can place order by using Cash on Delivery (Pay Later) option. This project provides an easy access to Administrators and Managers to view orders placed using Pay Later and Instant Pay options.
In order to develop an e-commerce website, a number of Technologies must be studied and understood. These include multi-tiered architecture, server and client-side scripting techniques, implementation technologies, programming language (such as PHP, HTML, CSS, JavaScript) and MySQL relational databases. This is a project with the objective to develop a basic website where a consumer is provided with a shopping cart website and also to know about the technologies used to develop such a website.
This document will discuss each of the underlying technologies to create and implement an e- commerce website.
Democratizing Fuzzing at Scale by Abhishek Aryaabh.arya
Presented at NUS: Fuzzing and Software Security Summer School 2024
This keynote talks about the democratization of fuzzing at scale, highlighting the collaboration between open source communities, academia, and industry to advance the field of fuzzing. It delves into the history of fuzzing, the development of scalable fuzzing platforms, and the empowerment of community-driven research. The talk will further discuss recent advancements leveraging AI/ML and offer insights into the future evolution of the fuzzing landscape.
Explore the innovative world of trenchless pipe repair with our comprehensive guide, "The Benefits and Techniques of Trenchless Pipe Repair." This document delves into the modern methods of repairing underground pipes without the need for extensive excavation, highlighting the numerous advantages and the latest techniques used in the industry.
Learn about the cost savings, reduced environmental impact, and minimal disruption associated with trenchless technology. Discover detailed explanations of popular techniques such as pipe bursting, cured-in-place pipe (CIPP) lining, and directional drilling. Understand how these methods can be applied to various types of infrastructure, from residential plumbing to large-scale municipal systems.
Ideal for homeowners, contractors, engineers, and anyone interested in modern plumbing solutions, this guide provides valuable insights into why trenchless pipe repair is becoming the preferred choice for pipe rehabilitation. Stay informed about the latest advancements and best practices in the field.
Quality defects in TMT Bars, Possible causes and Potential Solutions.PrashantGoswami42
Maintaining high-quality standards in the production of TMT bars is crucial for ensuring structural integrity in construction. Addressing common defects through careful monitoring, standardized processes, and advanced technology can significantly improve the quality of TMT bars. Continuous training and adherence to quality control measures will also play a pivotal role in minimizing these defects.
Student information management system project report ii.pdfKamal Acharya
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
Welcome to WIPAC Monthly the magazine brought to you by the LinkedIn Group Water Industry Process Automation & Control.
In this month's edition, along with this month's industry news to celebrate the 13 years since the group was created we have articles including
A case study of the used of Advanced Process Control at the Wastewater Treatment works at Lleida in Spain
A look back on an article on smart wastewater networks in order to see how the industry has measured up in the interim around the adoption of Digital Transformation in the Water Industry.
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Approaches for Keyword Query Routing
1. Mrs. Suwarna GothaneInt. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 10( Part -1), October 2014, pp.16-22
www.ijera.com 16 | P a g e
Approaches for Keyword Query Routing
Mrs. Suwarna Gothane, Srujana.P
M.E , Assitant professor, CMRTC
, M.Tech, CMRTC
Abstract-
The growing number of datasets published on the Web as linked data brings both opportunities for high data
availability of data. As the data increases challenges for querying also increases. It is very difficult to search
linked data using structured languages. Hence, we use Keyword Query searching for linked data. In this paper,
we propose different approaches for keyword query routing through which the efficiency of keyword search can
be improved greatly. By routing the keywords to the relevant data sources the processing cost of keyword search
queries can be greatly reduced. In this paper, we contrast and compare four models – Keyword level, Element
level, Set level and query expansion using semantic and linguistic analysis. These models are used for keyword
query routing in keyword search.
Index terms: Keyword search, Keyword query routing, Graph-structured data, linguistic and semantic analysis
I. Introduction
The web is no longer a collection of textual data
but also a web of interlinked data sources. One
project that largely contributes to this develop- ment
is Linking Open Data. Through this, a vast amount of
structured information was made publicly available.
Querying that huge amount of data in an intuitive
way is challenging.
Collectively, Linked Data comprise hundreds of
sources containing billions of RDF triples, which are
connected by millions of links. While different kinds
of links can be established, the ones frequently
published are sameAs links, which denote that two
RDF resources represent the same real-world object.
The representation of the linked data on the web is
shown in figure 1.
The linked data Web already contains valuable
data in diverse areas, such as e-government, e-
commerce, and the biosciences. Additionally, the
number of available datasets has grown solidly since
its inception. [1]
In order to search such data we use keyword
search techniques which employ keyword query
routing. To decrease the high cost incurred in
searching structured results that span multiple
sources, we propose routing of the keywords to the
relevant databases. As opposed to the source
selection problem [2], which is focusing on
computing the most relevant sources, the problem
here is to compute the most relevant combinations of
sources. The goal is to produce routing plans, which
can be used to compute results from multiple sources.
Figure 1: Example of Linked data on web
For selecting the correct routing plan, we use
graphs that are developed based on the relationships
between the keywords present in the keyword query.
This relationship is considered at the various levels
such as keyword level, element level, set level e.t.c.,
The rest of paper is organized as follows. Section
2 provides the brief outline on the existing work. The
different approaches are listed along with the some
examples explaining how the routing is considered in
the section 3 before we conclude in the section 4.
II. Related work:
Keyword Query Search can be divided into two
directions of work. They are: 1) keyword search
approaches compute the most relevant structured
results and 2) Solutions for source selection compute
the most relevant sources.
RESEARCH ARTICLE OPEN ACCESS
2. Mrs. Suwarna GothaneInt. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 10( Part -1), October 2014, pp.16-22
www.ijera.com 17 | P a g e
2.1. Keyword search
In the keyword searching, we mainly follow two
approaches. They are schema-based approaches and
schema-agnostic approaches.
Schema-based approaches are implemented on top of
off-the-shelf databases. A keyword is processed by
mapping keywords to the elements of the databases,
called keyword elements. Then, using the schema,
valid join sequences are derived and are employed to
join the computed keyword elements to form the
candidate-networks that represent the possible results
to the keyword query.
Schema-agnostic approaches operate directly on
the data. By exploring the underlying graphs the
structured results are computed in these approaches.
Keywords and elements which are connected are
represented using Steiner trees/graphs. The goal of
this approach is to find structures in the Steiner trees.
For the query “Stanley Robert Award” for
instance, a Steiner graph is the path between uni1 and
prize1 in Fig. 1. Various kinds of algorithms have
been proposed for the efficient exploration of
keyword search results over data graphs, which might
be very large. Examples are bidirectional search [3]
and dynamic programming [4].
Recently, a system called Kite extends schema-
based techniques to find candidate networks in the
multi source setting [5]. It employs schema matching
techniques to discover links between sources and
uses structure discovery techniques to find foreign-
key joins across sources. Also based on pre computed
links, Hermes [6] translates keywords to structured
queries.
2.2 Database Selection
In order to get the efficient results for keyword
search, the selection of the relevant data sources
plays a major role. The main idea is based on
modeling databases using keyword relationships. A
keyword relationship is a pair of keywords that can
be connected via a sequence of join operations. For
instance, (Stanley, Award) is a keyword relationship
as there is a path between uni1 and prize1 in Fig. 1. A
database is considered relevant if its keyword
relationship model covers all pairs of query
keywords.
M-KS considers only binary relationships
between keywords. It incurs a large number of false
positives for queries with more than two keywords.
This is the case when all query keywords are pair
wise related but there is no combined join sequence
which connects all of them.
G-KS [7] addresses this problem by considering
more complex relationships between keywords using
a Keyword Relationship Graph (KRG). Each node in
the graph corresponds to a keyword. Each edge
between two nodes corresponding to the keywords
(ki, kj) indicates that there exists at least two
connected tuples ti ↔ tj that match ki and kj.
Moreover, the distance between ti and tj are marked
on the edges.
III. Approaches
For routing the keywords to the relevant data
sources and searching the given keyword query, we
propose four different approaches. They are:
1) Keyword level model 2) Element level model, 3)
Set level model, and 5) Query expansion using
linguistic and semantic features.
We compute the keyword query result and keyword
routing plan [11] which are the two important factors
of keyword routing.
3.1 Keyword level model
In keyword level, we mainly consider the
relationship between the keywords in the keyword
query. This relationship can be represented using
Keyword Relationship Graph (KRG) [7]. It captures
relationships at the keyword level. As opposed to
keyword search solutions, relationships captured by a
KRG are not direct edges between tuples but stand
for paths between keywords.
For database selection, KRG relationships are
retrieved for all pairs of query keywords to construct
a sub graph. Based on these keyword relationships
alone, it is not possible to guarantee that such a sub
graph is also a Steiner graph (i.e., to guarantee that
the database is relevant). To address this, sub graphs
are validated by finding those that contain Steiner
graphs. This is a filtering step, which makes use of
information in the KRG as well as additional
information about which keywords are contained in
which tuples in the database. It is similar to the
exploration of Steiner graph in keyword search,
where the goal is to ensure that not only keywords
but also tuples mentioning them are connected.
However, since KRG focuses on database selection,
it only needs to know whether two keywords are
connected by some join sequences or not. This
information is stored as relationships in the KRG and
can be retrieved directly. For keyword search, paths
between data elements have to be retrieved and
explored. Retrieving and exploring paths that might
be composed of several edges are clearly more
expensive than retrieving relationships between
keywords.
Keyword search over relational databases finds
the answers of tuples in the databases which are
connected through primary/foreign keys and contain
query keywords. As there are usually large numbers
of tuples in the databases, these methods are rather
expensive to find answers by on-the-fly enumerating
the connections.
To address this problem, proposed tuple units [8]
to efficiently answer keyword queries. A tuple unit is
3. Mrs. Suwarna GothaneInt. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 10( Part -1), October 2014, pp.16-22
www.ijera.com 18 | P a g e
a set of highly relevant tuples which contain query
keywords.
Definition-1 (Tuple Units): Given a database D with
m connected tables, R1, R2, . . . ,Rm, for each tuple ti in
table Ri, let Rti denote the table with the same
primary/foreign keys as Ri, having a single tuple ti.
The joined result of table Rti and other tables Rj(j ≠ i)
based on foreign keys, denoted by R = j≠i Rj Rti,
is called a tuple set. Given two tuple sets t1 and t2, if
any tuple in t2 is contained in t1, we call that t1 covers
t2 (t2 is covered by t1). A tuple set is called a tuple
unit if it is not covered by any tuple set.
To better understand the above definition, consider
the following example.
Table 1: An example database
Example 1: Consider a publication database in
Table 1. For each tuple in a table, we join the three
tables and get the tuple sets as shown in Table 2.
Tuple set Ta1 is not a tuple unit as it is covered by
Tp1. Ta2 is a tuple unit as any tuple set does not cover
it. In this way, we can find all tuple units as shown in
Table 2. Each tuple unit can represent a meaningful
and integral information unit, and can be taken as an
answer of a keyword query. Considering a keyword
query {relational, database, keyword, search,
Hristidis}, the underlined tuple unit Tp5 (Table 2)
contains all the input keywords. We can take this
tuple unit as an answer. Note that we do not need to
on-the-fly identify structural relationships between
tuples in different tables, and the tuple-unit-based
method can improve search performance.
3.2 Element level model
Keyword search [9] relies on an element-level
model (i.e., data graphs) to compute keyword query
results. Elements mentioning keywords are retrieved
from this model and paths between them are explored
to compute Steiner graphs. To deal with the keyword
routing problem, elements can be stored along with
the sources they are contained in so that this
information can be retrieved to derive the routing
plans from the computed keyword query results.
In this model, we mainly concentrate on IR
technique of data retrieval. This technique allow
users to search unstructured information using
keyword based on scoring and ranking, and do not
need users to understand any database schemas.
We use graph-based data models to characterize
individual data models.
Definition 1 (Element-level Data Graph): An
element-level data graph g (N, ε) consists of
The set of nodes N, which is the disjoint
union of Nε NV, where the nodes Nε represent
entities and the nodes NV capture entities’
attribute values, and
The set of edges ε, subdivided by ε = εR ] εA,
where εR represents inter entity relations, εA
stands for entity-attribute assignments. We
have e (n1, n2) 2 εR iff n1; n2 2 Nε and e (n1,
n2) 2 εA iff n1 2 Nε and n2 2 NV. The set of
attribute edges εA (n) = {e (n, m) 2 εA} is
referred to as the description of the entity n.
Note that this model resembles RDF data where
entities stand for some RDF resources, data values
stand for RDF literals, and relations and attribute
Table 2: Tuple sets and Tuple units
4. Mrs. Suwarna GothaneInt. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 10( Part -1), October 2014, pp.16-22
www.ijera.com 19 | P a g e
correspond to RDF triples. While it is primarily used
to model RDF Linked Data on the web, such a graph
model is sufficiently general to capture XML and
relational data. For instance, a tuple in a relational
database can be modeled as an entity, and foreign key
relationships can be represented as inter entity
relations.
Existing keyword search solutions naturally
apply to this problem. However, the data graph and
the number of keyword elements are possibly very
large in our scenario, and thus, exploring all paths
between them in the data graphs is expensive. This is
the main drawback of this model.
3.3 Set level model
In this model we derive the summary at the level
of set of elements.
Definition 2 (Set-level Data Graph): A set-level
data graph of an element-level graph g (Nε NV; εR
εA) is a tuple g′ = (N′, ε′). Every node n′ N′
stands for a set of element level entities Nn′ Nε, i.e.,
there is mapping type: Nε → N′ that associates every
element-level entity n Nε with a set-level element
n′ N′. Every edge e′ (n′i, n′j) ε′ represents a
relation between the two sets of element-level entities
n′i and n′j. We have ε′ = {e′ (n′i, n′j) | e (ni, nj) εR,
type (ni) = n′i; type (nj) = n′j}.
This set-level graph essentially captures a part of
the Linked Data schema on the web that is
represented in RDFS, i.e., relations between classes.
Often, a schema might be incomplete or simply does
not exist for RDF data on the web. In such a case, a
pseudo schema can be obtained by computing a
structural summary such as a data guide [10].
A set-level data graph can be derived from a
given schema or a generated pseudo schema. Thus,
we assume a membership mapping type: Nε → N′
exists and use n n′ to denote that n belongs to the
set n′. An example of the set level graph is given in
Fig. 2. We consider the search space as a set of
Linked Data sources, forming a web of data.
Fig 2: set-level web data graph
We develop a set level Keyword-Element
Relationship Graph (KERG) [11].
Intuitively, a dmax-KERG represents all paths
between keywords that are connected over a
maximum distance dmax. This is to capture all dmax-
Steiner graphs that exist in the data. The fig 3 below
shows the set-level KERG with dmax = 1.
Fig 3: set-level KERG with dmax = 1
Example 1: A KERG for our running example with
dmax = 1 is illustrated in Fig. 3. For instance, there is a
keyword-element node (Robert, Person, DBPedia).
Note that the relationship {(Robert, Person,
DBPedia), (Award, Prize, DBPedia)} actually stands
for the element-level connections {(Robert, per3,
DBPedia), (Award, prize1, DBPedia)}, and {(Robert,
per4, DBPedia), (Award, prize2, DBPedia)} because
per3 and per4 mention Robert, prize1 and prize2
mention Award, per3, per4 Person, prize1, prize2
Prize, there is a path between per3 and prize1,
and a path between per4 and prize2 (see web data
graph in Fig. 1). This example illustrates that
element-level relationships, which share the same
pair of terms (Robert and Award), classes (Person
and Prize), and sources (DBPedia and DBPedia) can
be summarized to one single set-level relationship.
In order to compute the routing plan we use the
following algorithm.
------------------------------------------------------------
Algorithm1: PPRJ ComputeRoutingPlan(K, W′K)
------------------------------------------------------------
Input: The query K, the summary W′K (N′K, ε′K)
Output: The set of routing plans [RP]
JP ← a join plan that contains all {ki, kj} 2K
T ← a table where every tuple captures a join
sequence of KERG relationships e′K ε′K, the score
of each e′K, and the combined score of the join
sequence; it is initially empty;
While JP .empty() do
{ki, kj}← JP .pop();
ε′{ki, kj}← retrieve(ε′K, {ki, kj});
if T.empty() then
T ← ε′{ki, kj};
else
T ← ε′{ki, kj} T;
Compute score of tuples in T via SCORE (K, W′S
K);
5. Mrs. Suwarna GothaneInt. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 10( Part -1), October 2014, pp.16-22
www.ijera.com 20 | P a g e
[RP] ← Group T by sources to identify unique
combinations of sources;
Compute scores of routing plans in [RP] via SCORE
(K, RP);
Sort [RP] by score;
------------------------------------------------------------
3.4 Query expansion using linguistic and semantic
features
In document retrieval, many query expansion
techniques are based on information contained in the
top-ranked retrieved documents in response to the
original user query, e.g. [12], [15]. Similarly, our
approach is based on performing an initial retrieval of
resources according to the original keyword query.
Thereafter, further resources are derived by
leveraging the initially retrieved ones.
Overall, the proposed process depicted in Figure 4
is divided into three main steps. In the first step, all
words closely related to the original keyword are
extracted based on two types of features – linguistic
and semantic. In the second step, various introduced
linguistic and semantic features are weighted using
learning approaches. In the third step, we assign a
relevance score to the set of the related words. Using
this score we prune the related word set to achieve a
balance between precision and recall.
Fig 4: AQE pipeline
A. Extracting and Preprocessing of Data using
Semantic and Linguistic Features:
For the given input keyword k, we define the set of
all words related to the keyword k as Xk = {x1, x2, ...,
xn}. The set Xk is defined as the union of the two sets
LEk and SEk. LEk is constructed as the collection of
all words obtained through linguistic features and in
the same manner the semantic features also.
Linguistic features extracted from WordNet are:
• Synonyms: words having similar meanings to the
input keyword k.
• Hyponyms: words representing a specialization of
the input keyword k.
• Hyponyms: words representing a generalization of
the input keyword k.
The set SE comprises all words semantically
derived from the input keyword k using Linked Data.
These semantic features are defined as the following
semantic relations:
• sameAs: deriving resources having the same
identity as the input resource using owl:sameAs.
• seeAlso: deriving resources that provide more
information about the input resource using
rdfs:seeAlso.
• class/property equivalence: deriving classes or
properties providing related descriptions for the input
resource using owl:equivalentClass and
owl:equivalentProperty.
• superclass/-property: deriving all super
classes/properties of the input resource by following
the rdfs:subClassOf or rdfs:subPropertyOf property
paths originating from the input resource.
• subclass/-property: deriving all sub resources of the
input resource ri by following the rdfs:subClassOf or
rdfs:subPropertyOf property paths ending with the
input resource.
• broader concepts: deriving broader concepts related
to the input resource ri using the SKOS vocabulary
properties skos:broader and skos:broadMatch.
• narrower concepts: deriving narrower concepts
related to the input resource ri using skos:narrower
and skos:narrowMatch.
• related concepts: deriving related concepts to the
input resource ri using skos:closeMatch,
skos:mappingRelation and skos:exactMatch.
For each ri APk, we derive all the related
resources employing the above semantic features.
Then, for each derived resource r′, we add all the
English labels of that resource to the the set SEk.
Therefore, SEk contains the labels of all semantically
derived resource.
The set of all related words of the input keyword k is
defined as Xk = LEk SEk. After extracting the set Xk
of related words, we run the following preprocessing
methods for each xi Xk:
1) Tokenization: extraction of individual words,
ignoring punctuation and case.
2) Stop word removal: removal of common words
such as articles and prepositions.
3) Word lemmatisation: determining the lemma of
the word.
For example, as can be observed in Figure 5, the word
“Thinking machine” and “electronic brain” is derived by
synonym, sameAs and equivalent
Relations
6. Mrs. Suwarna GothaneInt. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 10( Part -1), October 2014, pp.16-22
www.ijera.com 21 | P a g e
Fig 5: Exemplary expansion graph of the word
computer using semantic features.
B. Feature Selection and Feature Weighting
In order to distinguish how effective each feature is
and to remove ineffective features, we employ a
weighting schema ws for computing the weights of
the features as ws : fi F → wi. Note that F is the
set of all features taken into account. There are
numerous feature weighting methods to assign
weight to features like information gain [13], weights
from a linear classifier [14], odds ratio, etc. Herein,
we consider two well-known weighting schemas.
1) Information Gain (IG): Information gain is often
used to decide which of the features are the most
relevant. We define the information gain (IG) of a
feature as:
2) Feature weights from linear classifiers: Linear
classifiers, such as for example SVMs, calculate
predictions by associating the weight wi to the feature
fi. Features whose wi is close to 0 have a small
influence on the predictions. Therefore, we can
assume that they are not very important for query
expansion.
C. Setting the Classifier Threshold
As a last step, we set the threshold for the classifiers
above. To do this, we compute the relevance score
value score (xi) for each word xi Xk. Naturally,
this is done by combining the feature vector Vxi = [α1,
α2, . . . , αn] and the feature weight vector W = [w1,
w2, . . . , wn] as follows:
IV. Conclusion and Future Scope
Keyword query search is a widely used approach
for retrieving linked data in an efficient manner. In
order to reduce the high cost of searching, we redirect
the keywords to the relevant data sources. Here we
use keyword routing to redirect the keywords. This is
done using different types of approaches. Here, we
discussed the four approaches of keyword query
evaluation to get the desired results. We use graph
based methods to compute the routing plans. Further,
we show that when routing is applied to an existing
keyword search system to prune sources, substantial
performance gain can be achieved.
References
[1] T. Berners-Lee, “Linked Data Design Issues,”
2009;www.w3.org/DesignIssues/LinkedData.
html
[2] B. Yu, G. Li, K.R. Sollins, and A.K.H. Tung,
“Effective Keyword-Based Selection of
Relational Databases,” Proc. ACM SIGMOD
Conf., pp. 139-150, 2007.
[3] V. Kacholia, S. Pandit, S. Chakrabarti, S.
Sudarshan, R. Desai, and H. Karambelkar,
“Bidirectional Expansion for Keyword
Search on Graph Databases,” Proc. 31st Int’l
Conf. Very Large Data Bases (VLDB), pp.
505-516, 2005.
[4] B. Ding, J.X. Yu, S. Wang, L. Qin, X. Zhang,
and X. Lin, “Finding Top-K Min-Cost
Connected Trees in Databases,” Proc. IEEE
23rd
Int’l Conf. Data Eng. (ICDE), pp. 836-
845, 2007.
[5] M. Sayyadian, H. LeKhac, A. Doan, and L.
Gravano, “Efficient Keyword Search Across
Heterogeneous Relational Databases,” Proc.
IEEE 23rd Int’l Conf. Data Eng. (ICDE), pp.
346-355, 2007.
[6] T. Tran, H. Wang, and P. Haase, “Hermes:
Data Web Search on a Pay-as-You-Go
Integration Infrastructure,” J. Web Semantics,
vol. 7, no. 3, pp. 189-203, 2009.
[7] Q.H. Vu, B.C. Ooi, D. Papadias, and A.K.H.
Tung, “A Graph Method for Keyword-Based
Selection of the Top-K Databases,” Proc.
ACM SIGMOD Conf., pp. 915-926, 2008.
[8] Jianhua Feng, Guoliang Li and Jianyong
Wang, “Finding Top-k answers in keyword
search over relational databases using tuple
units” IEEE transactions, VOL. 23 NO. 12,
December 2011.
7. Mrs. Suwarna GothaneInt. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 10( Part -1), October 2014, pp.16-22
www.ijera.com 22 | P a g e
[9] G. Li, B.C. Ooi, J. Feng, J. Wang, and L.
Zhou, “Ease: An Effective 3-in-1 Keyword
Search Method for Unstructured, Semi-
Structured and Structured Data,” Proc. ACM
SIGMOD Conf., pp. 903-914, 2008.
[10] R. Goldman and J. Widom, “DataGuides:
Enabling Query Formulation and
Optimization in Semistructured Databases,”
Proc. 23rd Int’l Conf. Very Large Data Bases
(VLDB), pp. 436-445, 1997.
[11] Thanh Tran and Lei Zhang, “Keyword Query
Routing” IEEE Transactions, VOL.26, NO.2,
February 2014.
[12] K. Collins- Thompson, Reducing the risk of
query expansion via robust constrained
optimization. In CIKM. ACM, 2009.
[13] H. Deng, G. C. Runger, and E. Tuv. Bias of
importance measures for multi-valued
attributes and solutions. In ICANN (2),
volume 6792, pages 293–300. Springer,
2011.
[14] D. Mladenic, J. Brank, M. Grobelnik, and N.
Milic-Frayling. Feature selection using linear
classifier weights: interaction with
classification models. In Proceedings of the
27th Annual International ACM SIGIR
Conference SIGIR2004. ACM, 2004.
[15] Saeedeh Shekarpour, Jens Lehmann, and
Sören Auer, “Keyword Query Expansion on
Linked Data Using Linguistic and Semantic
Features” IEEE Seventh International
Conference on Semantic Computing, 2013.