SlideShare a Scribd company logo
1 of 8
RANKING ON DATA MANIFOLD WITH SINK POINTS

ABSTRACT:

In this paper, we introduce a novel approach, Manifold Ranking with Sink Points (MRSPs), to
address diversity as well as relevance and importance in ranking. Specifically, our approach uses
a manifold ranking process over the data manifold, which can naturally find the most relevant
and important data objects. Meanwhile, by turning ranked objects into sink points on data
manifold, we can effectively prevent redundant objects from receiving a high rank.
MRSP not only shows a nice convergence property, but also has an interesting and satisfying
optimization explanation. We applied MRSP on two application tasks, update summarization and
query recommendation, where diversity is of great concern in ranking. Experimental results on
both tasks present a strong empirical performance of MRSP as compared to existing ranking
approaches.
EXISTING SYSTEM:
Ranking is an important problem in various applications, such as Information Retrieval (IR),
natural language processing, computational biology, and social sciences. Many ranking
approaches have been proposed to rank objects according to their degrees of relevance or
importance. Beyond these two goals, diversity has also been recognized as a crucial criterion in
ranking. Top ranked results are expected to convey as little redundant information as possible,
and cover as many aspects as possible.
However, existing ranking approaches either take no account of diversity, or handle it separately
with some heuristics. The redundance in top ranked results will reduce the chance to satisfy
different users. For example, given a query “zeppelin,” if the top ranked search results were all
similar articles about the “Zeppelin iPod speaker,” it would be a waste of the output space and
largely degrade users’ search experience even though the results are all bad relevant to the query.
We are able to minimize the risk that the information need of the user will not be satisfied. Many
real application tasks demand diversity in ranking. For example, in query recommendation, the
recommended queries should capture different query intents of different users. In text
summarization, candidate sentences of a summary are expected to be less redundant and cover
different aspects of information delivered by the document. In e-commerce, a list of relevant but
distinctive products is useful for users to browse and make a purchase.
Disadvantages
•

However, these methods often treat relevance and diversity separately in the ranking
algorithm, sometimes with additional heuristic procedures.

•

Therefore, relevance and importance are well balanced in manifold ranking, similar to
Personalized PageRank
PROPOSED SYSTEM:
Our ranking approaches have been proposed to rank objects according to their degrees of
relevance or importance. Beyond these two goals, diversity has also been recognized as a crucial
criterion in ranking. The issue of diversity in ranking has been widely studied recently.
Researchers from various domains have proposed many approaches to address this problem,
such as Maximum Marginal Relevance (MMR) subtopic diversity cluster-based centroids
selecting, categorization- based approach, and many other redundancy penalty approaches.
However, these methods often treat relevance and diversity separately in the ranking algorithm,
sometimes with additional heuristic procedures.
Our proposed approach MRSP has not only a nice convergence property, but also a satisfying
optimization explanation. The manifold ranking algorithm is proposed based on the following
two key assumptions: 1) nearby data are likely to have close ranking scores; and 2) data on the
same structure are likely to have close ranking scores. An intuitive description of the ranking
algorithm is described as follows: a weighted network is constructed first, where nodes represent
all the data and query points, and an edge is put between two nodes if they are “close.
ADVANTAGES:
•

Manifold ranking gives high ranks to nodes that are close to the queries on the manifold
and that have strong centrality.

•

The best of our knowledge, the challenge of addressing relevance, importance and
diversity simultaneously in a unified way is still far from being well resolved.

•

At each iteration data uses manifold ranking to find one or more most relevant points.
MODULES:
•

DATA MANIFOLDS

•

MANIFOLD RANKING

•

UPDATE SUMMARIZATION

•

QUERY RECOMMENDATION

•

QUALITATIVE COMPARISON

•

PARAMETER TUNING

MODULE DESCRIPTION:

DATA MANIFOLDS:
Ranking on data manifolds is proposed In their approach, data objects are assumed to be points
sampled from a low-dimensional manifold embedded in a high-dimensional euclidean space
(ambient space). Hereafter, object and point will not be discriminated unless otherwise specified.
Manifold ranking is then to rank the data points with respect to the intrinsic global manifold
structure given a set of query points.
MANIFOLD RANKING:
Query nodes are then initiated with a positive ranking score, while the nodes to be ranked are
assigned with a zero initial score. All the nodes then propagate their ranking scores to their
neighbors via the weighted network. The propagation process is repeated until a global stable
state is achieved, and all the nodes except the queries are ranked according to their final scores.
Manifold ranking gives high ranks to nodes that are close to the queries on the manifold (which
reflects high relevance) and that have strong centrality (which reflects high importance).
Therefore, relevance and importance are well balanced in manifold ranking, similar to
Personalized. However, diversity is not considered in manifold ranking.
UPDATE SUMMARIZATION:
Update summarization is a temporal extension of topic focused multi document summarization,
by focusing on summarizing up-to-date information contained in the new document set given a
past document set. There are mainly two kinds of approaches for update summarization; one is
abstractive summarization in which some deep natural language processing techniques are
leveraged to compress sentences or to reorganize phrases to produce a summary of the text.
Another one is extractive summarization. In the extractive approach, update summarization is
reduced to a sentence ranking problem, which composes a summary by extracting the most
representative sentences from target document set.
QUERY RECOMMENDATION:
Query recommendation aims to provide alternative queries to help users search and also improve
the usability of search engines. It has been employed as a core utility by many industrial search
engines. Most of the work on query recommendation focuses on measures of query similarity,
where query log data has been widely used in these approaches. For example, Beeferman and
Berger applied agglomerative clustering to the click-through bipartite graph to identify related
queries for recommendation proposed to combine both user click-through data and query content
information to determine query similarity.
QUALITATIVE COMPARISON:
A qualitative comparison to gain some intuition on the differences between the summaries
generated by our approach and the baseline methods. We randomly selected one topic from the
48 topics of TAC2008 data set as an example, which is about “the investigation of Jack Abramoff
and others related to lobbying activities.” We show the four reference summaries, provided by
NIST as the ground truth, and the summaries generated by our approach and other three
baselines.
PARAMETER TUNING:
There is only one parameter in the MRSP algorithm, which is a balance factor between the
influence of the intrinsic manifold structure and the prior knowledge on each sentence. the
influence of on the summarization performance. As we can see, the summarization approach
performs not so well when is small, which may be due to over emphasis on the prior knowledge.
HARDWARE & SOFTWARE REQUIREMENTS:

HARDWARE REQUIREMENT:



Processor

-

Pentium –IV



Speed

-

1.1 GHz



RAM

-

256 MB (min)



Hard Disk

- 20 GB



Floppy Drive

-

1.44 MB



Key Board

-

Standard Windows Keyboard



Mouse

- Two or Three Button Mouse



Monitor

-

SVGA

SOFTWARE REQUIREMENTS:



Operating System

:

Windows XP



Front End

:

JAVA JDK 1.7



Tools

:

Net beans



Database

:

SQL Server 2005 or MS-ACCESS
25.ranking on data manifold with sink points

More Related Content

What's hot

2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )SBGC
 
Relevance feature discovery for text mining
Relevance feature discovery for text miningRelevance feature discovery for text mining
Relevance feature discovery for text miningredpel dot com
 
Efficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribeEfficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribeIJSRD
 
A novel approach towards developing a statistical dependent and rank
A novel approach towards developing a statistical dependent and rankA novel approach towards developing a statistical dependent and rank
A novel approach towards developing a statistical dependent and rankIAEME Publication
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAcsandit
 
Review of Various Text Categorization Methods
Review of Various Text Categorization MethodsReview of Various Text Categorization Methods
Review of Various Text Categorization Methodsiosrjce
 
Stock prediction using social network
Stock prediction using social networkStock prediction using social network
Stock prediction using social networkChanon Hongsirikulkit
 
USING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWS
USING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWSUSING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWS
USING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWScsandit
 
The comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesThe comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesfawad khan
 
Query Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering ApproachQuery Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering ApproachIRJET Journal
 
Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .tsysglobalsolutions
 
A NEW TOP-K CONDITIONAL XML PREFERENCE QUERIES
A NEW TOP-K CONDITIONAL XML PREFERENCE QUERIESA NEW TOP-K CONDITIONAL XML PREFERENCE QUERIES
A NEW TOP-K CONDITIONAL XML PREFERENCE QUERIESijaia
 
Computing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engineComputing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search enginecsandit
 
Information retrival system and PageRank algorithm
Information retrival system and PageRank algorithmInformation retrival system and PageRank algorithm
Information retrival system and PageRank algorithmRupali Bhatnagar
 
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...An Improved Web Explorer using Explicit Semantic Similarity with ontology and...
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...INFOGAIN PUBLICATION
 

What's hot (19)

2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
 
Relevance feature discovery for text mining
Relevance feature discovery for text miningRelevance feature discovery for text mining
Relevance feature discovery for text mining
 
Efficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribeEfficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribe
 
A novel approach towards developing a statistical dependent and rank
A novel approach towards developing a statistical dependent and rankA novel approach towards developing a statistical dependent and rank
A novel approach towards developing a statistical dependent and rank
 
BoTLRet: A Template-based Linked Data Information Retrieval
 BoTLRet: A Template-based Linked Data Information Retrieval BoTLRet: A Template-based Linked Data Information Retrieval
BoTLRet: A Template-based Linked Data Information Retrieval
 
0
00
0
 
Phd thesis final presentation
Phd thesis   final presentationPhd thesis   final presentation
Phd thesis final presentation
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
 
Cl4201593597
Cl4201593597Cl4201593597
Cl4201593597
 
Review of Various Text Categorization Methods
Review of Various Text Categorization MethodsReview of Various Text Categorization Methods
Review of Various Text Categorization Methods
 
Stock prediction using social network
Stock prediction using social networkStock prediction using social network
Stock prediction using social network
 
USING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWS
USING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWSUSING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWS
USING NLP APPROACH FOR ANALYZING CUSTOMER REVIEWS
 
The comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesThe comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search engines
 
Query Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering ApproachQuery Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering Approach
 
Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .Ieee transactions on 2018 knowledge and data engineering topics with abstract .
Ieee transactions on 2018 knowledge and data engineering topics with abstract .
 
A NEW TOP-K CONDITIONAL XML PREFERENCE QUERIES
A NEW TOP-K CONDITIONAL XML PREFERENCE QUERIESA NEW TOP-K CONDITIONAL XML PREFERENCE QUERIES
A NEW TOP-K CONDITIONAL XML PREFERENCE QUERIES
 
Computing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engineComputing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engine
 
Information retrival system and PageRank algorithm
Information retrival system and PageRank algorithmInformation retrival system and PageRank algorithm
Information retrival system and PageRank algorithm
 
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...An Improved Web Explorer using Explicit Semantic Similarity with ontology and...
An Improved Web Explorer using Explicit Semantic Similarity with ontology and...
 

Similar to 25.ranking on data manifold with sink points

Ranking on data manifold with sink points
Ranking on data manifold with sink pointsRanking on data manifold with sink points
Ranking on data manifold with sink pointsEcway Technologies
 
Ranking on data manifold with sink points
Ranking on data manifold with sink pointsRanking on data manifold with sink points
Ranking on data manifold with sink pointsEcway Technologies
 
Dotnet ranking on data manifold with sink points
Dotnet  ranking on data manifold with sink pointsDotnet  ranking on data manifold with sink points
Dotnet ranking on data manifold with sink pointsEcway Technologies
 
Java ranking on data manifold with sink points
Java  ranking on data manifold with sink pointsJava  ranking on data manifold with sink points
Java ranking on data manifold with sink pointsEcway Technologies
 
Keyword Query Routing
Keyword Query RoutingKeyword Query Routing
Keyword Query RoutingSWAMI06
 
an empirical performance evaluation of relational keyword search techniques
an empirical performance evaluation of relational keyword search techniquesan empirical performance evaluation of relational keyword search techniques
an empirical performance evaluation of relational keyword search techniquesswathi78
 
Efficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity rankingEfficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity rankingShakas Technologies
 
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...Editor IJCATR
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEEFINALYEARSTUDENTPROJECTS
 
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routingIEEEMEMTECHSTUDENTSPROJECTS
 
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...IJCI JOURNAL
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...IOSR Journals
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rankewhuang3
 
JPJ1423 Keyword Query Routing
JPJ1423   Keyword Query RoutingJPJ1423   Keyword Query Routing
JPJ1423 Keyword Query Routingchennaijp
 
An empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsAn empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsBrowse Jobs
 
keyword query routing
keyword query routingkeyword query routing
keyword query routingswathi78
 
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSBSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSijp2p
 
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSBSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSijp2p
 
Bsi bloom filter based semantic indexing
Bsi bloom filter based semantic indexingBsi bloom filter based semantic indexing
Bsi bloom filter based semantic indexingijp2p
 
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSBSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSijp2p
 

Similar to 25.ranking on data manifold with sink points (20)

Ranking on data manifold with sink points
Ranking on data manifold with sink pointsRanking on data manifold with sink points
Ranking on data manifold with sink points
 
Ranking on data manifold with sink points
Ranking on data manifold with sink pointsRanking on data manifold with sink points
Ranking on data manifold with sink points
 
Dotnet ranking on data manifold with sink points
Dotnet  ranking on data manifold with sink pointsDotnet  ranking on data manifold with sink points
Dotnet ranking on data manifold with sink points
 
Java ranking on data manifold with sink points
Java  ranking on data manifold with sink pointsJava  ranking on data manifold with sink points
Java ranking on data manifold with sink points
 
Keyword Query Routing
Keyword Query RoutingKeyword Query Routing
Keyword Query Routing
 
an empirical performance evaluation of relational keyword search techniques
an empirical performance evaluation of relational keyword search techniquesan empirical performance evaluation of relational keyword search techniques
an empirical performance evaluation of relational keyword search techniques
 
Efficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity rankingEfficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity ranking
 
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
 
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
 
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
Scalable Action Mining Hybrid Method for Enhanced User Emotions in Education ...
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rank
 
JPJ1423 Keyword Query Routing
JPJ1423   Keyword Query RoutingJPJ1423   Keyword Query Routing
JPJ1423 Keyword Query Routing
 
An empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsAn empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systems
 
keyword query routing
keyword query routingkeyword query routing
keyword query routing
 
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSBSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
 
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSBSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
 
Bsi bloom filter based semantic indexing
Bsi bloom filter based semantic indexingBsi bloom filter based semantic indexing
Bsi bloom filter based semantic indexing
 
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKSBSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
BSI: BLOOM FILTER-BASED SEMANTIC INDEXING FOR UNSTRUCTURED P2P NETWORKS
 

Recently uploaded

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 

Recently uploaded (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 

25.ranking on data manifold with sink points

  • 1. RANKING ON DATA MANIFOLD WITH SINK POINTS ABSTRACT: In this paper, we introduce a novel approach, Manifold Ranking with Sink Points (MRSPs), to address diversity as well as relevance and importance in ranking. Specifically, our approach uses a manifold ranking process over the data manifold, which can naturally find the most relevant and important data objects. Meanwhile, by turning ranked objects into sink points on data manifold, we can effectively prevent redundant objects from receiving a high rank. MRSP not only shows a nice convergence property, but also has an interesting and satisfying optimization explanation. We applied MRSP on two application tasks, update summarization and query recommendation, where diversity is of great concern in ranking. Experimental results on both tasks present a strong empirical performance of MRSP as compared to existing ranking approaches.
  • 2.
  • 3. EXISTING SYSTEM: Ranking is an important problem in various applications, such as Information Retrieval (IR), natural language processing, computational biology, and social sciences. Many ranking approaches have been proposed to rank objects according to their degrees of relevance or importance. Beyond these two goals, diversity has also been recognized as a crucial criterion in ranking. Top ranked results are expected to convey as little redundant information as possible, and cover as many aspects as possible. However, existing ranking approaches either take no account of diversity, or handle it separately with some heuristics. The redundance in top ranked results will reduce the chance to satisfy different users. For example, given a query “zeppelin,” if the top ranked search results were all similar articles about the “Zeppelin iPod speaker,” it would be a waste of the output space and largely degrade users’ search experience even though the results are all bad relevant to the query. We are able to minimize the risk that the information need of the user will not be satisfied. Many real application tasks demand diversity in ranking. For example, in query recommendation, the recommended queries should capture different query intents of different users. In text summarization, candidate sentences of a summary are expected to be less redundant and cover different aspects of information delivered by the document. In e-commerce, a list of relevant but distinctive products is useful for users to browse and make a purchase. Disadvantages • However, these methods often treat relevance and diversity separately in the ranking algorithm, sometimes with additional heuristic procedures. • Therefore, relevance and importance are well balanced in manifold ranking, similar to Personalized PageRank
  • 4. PROPOSED SYSTEM: Our ranking approaches have been proposed to rank objects according to their degrees of relevance or importance. Beyond these two goals, diversity has also been recognized as a crucial criterion in ranking. The issue of diversity in ranking has been widely studied recently. Researchers from various domains have proposed many approaches to address this problem, such as Maximum Marginal Relevance (MMR) subtopic diversity cluster-based centroids selecting, categorization- based approach, and many other redundancy penalty approaches. However, these methods often treat relevance and diversity separately in the ranking algorithm, sometimes with additional heuristic procedures. Our proposed approach MRSP has not only a nice convergence property, but also a satisfying optimization explanation. The manifold ranking algorithm is proposed based on the following two key assumptions: 1) nearby data are likely to have close ranking scores; and 2) data on the same structure are likely to have close ranking scores. An intuitive description of the ranking algorithm is described as follows: a weighted network is constructed first, where nodes represent all the data and query points, and an edge is put between two nodes if they are “close. ADVANTAGES: • Manifold ranking gives high ranks to nodes that are close to the queries on the manifold and that have strong centrality. • The best of our knowledge, the challenge of addressing relevance, importance and diversity simultaneously in a unified way is still far from being well resolved. • At each iteration data uses manifold ranking to find one or more most relevant points.
  • 5. MODULES: • DATA MANIFOLDS • MANIFOLD RANKING • UPDATE SUMMARIZATION • QUERY RECOMMENDATION • QUALITATIVE COMPARISON • PARAMETER TUNING MODULE DESCRIPTION: DATA MANIFOLDS: Ranking on data manifolds is proposed In their approach, data objects are assumed to be points sampled from a low-dimensional manifold embedded in a high-dimensional euclidean space (ambient space). Hereafter, object and point will not be discriminated unless otherwise specified. Manifold ranking is then to rank the data points with respect to the intrinsic global manifold structure given a set of query points. MANIFOLD RANKING: Query nodes are then initiated with a positive ranking score, while the nodes to be ranked are assigned with a zero initial score. All the nodes then propagate their ranking scores to their neighbors via the weighted network. The propagation process is repeated until a global stable state is achieved, and all the nodes except the queries are ranked according to their final scores. Manifold ranking gives high ranks to nodes that are close to the queries on the manifold (which reflects high relevance) and that have strong centrality (which reflects high importance). Therefore, relevance and importance are well balanced in manifold ranking, similar to Personalized. However, diversity is not considered in manifold ranking.
  • 6. UPDATE SUMMARIZATION: Update summarization is a temporal extension of topic focused multi document summarization, by focusing on summarizing up-to-date information contained in the new document set given a past document set. There are mainly two kinds of approaches for update summarization; one is abstractive summarization in which some deep natural language processing techniques are leveraged to compress sentences or to reorganize phrases to produce a summary of the text. Another one is extractive summarization. In the extractive approach, update summarization is reduced to a sentence ranking problem, which composes a summary by extracting the most representative sentences from target document set. QUERY RECOMMENDATION: Query recommendation aims to provide alternative queries to help users search and also improve the usability of search engines. It has been employed as a core utility by many industrial search engines. Most of the work on query recommendation focuses on measures of query similarity, where query log data has been widely used in these approaches. For example, Beeferman and Berger applied agglomerative clustering to the click-through bipartite graph to identify related queries for recommendation proposed to combine both user click-through data and query content information to determine query similarity. QUALITATIVE COMPARISON: A qualitative comparison to gain some intuition on the differences between the summaries generated by our approach and the baseline methods. We randomly selected one topic from the 48 topics of TAC2008 data set as an example, which is about “the investigation of Jack Abramoff and others related to lobbying activities.” We show the four reference summaries, provided by NIST as the ground truth, and the summaries generated by our approach and other three baselines.
  • 7. PARAMETER TUNING: There is only one parameter in the MRSP algorithm, which is a balance factor between the influence of the intrinsic manifold structure and the prior knowledge on each sentence. the influence of on the summarization performance. As we can see, the summarization approach performs not so well when is small, which may be due to over emphasis on the prior knowledge. HARDWARE & SOFTWARE REQUIREMENTS: HARDWARE REQUIREMENT:  Processor - Pentium –IV  Speed - 1.1 GHz  RAM - 256 MB (min)  Hard Disk - 20 GB  Floppy Drive - 1.44 MB  Key Board - Standard Windows Keyboard  Mouse - Two or Three Button Mouse  Monitor - SVGA SOFTWARE REQUIREMENTS:  Operating System : Windows XP  Front End : JAVA JDK 1.7  Tools : Net beans  Database : SQL Server 2005 or MS-ACCESS