SlideShare a Scribd company logo
(DR. BABASAHEB AMBEDKAR TECHNOLOGICAL UNIVERSITY, LONERE(2022-23)
RAJIV GANDHI COLLEGE OF ENGINEERING, RESEARCH & TECHNOLOGY, CHANDRAPUR
SEMINAR REPORT
ON
“Web Clustering Engines”
Submitted
By
Prajwal Dilip Kamble
Roll No: CSEA347
SEMISTER- III SECOND YEAR
BTCOS347 Seminar –I
Seminar Incharge
Prof. R. V. Lichode
Guided By:
Prof.Ravi Chibule
Dr. Nitin Janwe
HOD, CSE & IT
RAJIV GANDHI COLLEGE OF ENGINEERING, RESEARCH & TECHNOLOGY,
CHANDRAPUR
(DR. BABASAHEB AMBEDKAR TECHNOLOGICAL UNIVERSITY, LONERE)
(2022-23)
CERTIFICATE
This is to certify that, Mr. Prajwal Dilip Kamble , Roll
No. CSEA347 studying in B. E. Third semester
Computer Science and Engineering in the session
2022-2023, has successfully completed seminar-I on
“Web Clustering Engine” satisfactorily during the
academic session 2022-2023 from CSE, RCERT,
Chandrapur.
Seminar Incharge
Prof. R. V. Lichode
Guided By:
Prof. Ravi Chibule
Dr. Nitin Janwe
HOD, CSE & IT
Web Clustering Engines
CONTENTS:
• Abstract
• Introduction
• Search engine
• Why web clustering engine
• Main Advantages of cluster hierarchy
• Issues in implementation of cluster
• Architecture
• How to represent Feature/text ?
• Data Centric clustering algorithm
• Conclusion
Abstract
 Web clustering Engines are emerging trend in the field of information
retrieval. They organize search results by topic, thus offering a
complementary view to the flat ranked list returned by the conventional
search engines.
 The search results returned by traditional search engines on different
subtopics or meanings of a query will be mixed together in the list so that the
user may have to sift through a large number of irrelevant items to locate
those of interest. The Web clustering engines categorize the search results
into different hierarchical groups/clusters and display those cluster labels.
Introduction
 Web Clustering Engines organize search results by topic, thus offering a
complementary view to the flat ranked list returned by the conventional
search engines.
Search Engine
 A search engine is a software system designed to carry out web searches. They search the
World Wide Web in a systematic way for particular information specified in a textual web search
query. The search results are generally presented in a line of results, often referred to as search
engine results pages.
 A search engine is a software program that helps people find the information
they are looking for online using keywords or phrases.
Search engine
• Search engine is a website
• Help user to find information on world wide Web.
• Archie is a first search engine
• 1970 Archie stablish is a world first search engine
• The most used search engine is a Google
• Google is invented is 1997 , google is a one of the most
famous search engine.
Why web clustering engine?
 Conventional engines are not much efficient in ambiguous queries
 The search results returned by conventional search engines on query will be
mixed together in the list, irrelevant item occurs.
 In this context of search result come into picture!
Main advantages of cluster hierarchy
 It makes for shortcuts to the items that relate to the same meaning
 It allows better topic understanding
 It favors systematic exploration of search results.
Issues in implementation of clusters
 Short input description
 Meaningful labels
 Selection of similar measure
 Grouping of objects into clusters
 Computation efficiency
Architecture
• Practical implementations of Web search clustering engines will usually consist of four
general components: search results acquisition, input preprocessing, cluster
construction, and visualization of clustered results
Search result acquisition
 The task of search result acquisition is to provide input for the rest of system.
 Based on the query, the acquisition component must deliver 50 to 500 results,
each of which should contain
 -Contextual snippet
 Title URL pointing to the full text being referred to
 The source of search results can be any public search engines such as ggoogle
yahoo etc.
 The most elegant way of searching results from search engines is by using
application programming interfaces (APIs) these engines provide.
Conclusion
 Web clustering engines organize search results by topic thus offering a
complimentary view to the flat-ranked list returned by conventional search
engines.
 Due to lack of efficient methods of performance evaluation of clustering
engines they are not seeking the attention of the people.
THANK YOU

More Related Content

What's hot

Intro to web scraping with Python
Intro to web scraping with PythonIntro to web scraping with Python
Intro to web scraping with Python
Maris Lemba
 
Graph theory narsingh deo
Graph theory narsingh deoGraph theory narsingh deo
Graph theory narsingh deo
Umang Gupta
 
Automatic indexing
Automatic indexingAutomatic indexing
Automatic indexing
dhatchayaninandu
 
Web search Technologies
Web search TechnologiesWeb search Technologies
Web search Technologies
Abdul Sami Kharal
 
Database indexing techniques
Database indexing techniquesDatabase indexing techniques
Database indexing techniques
ahmadmughal0312
 
Parallel and Distributed Information Retrieval System
Parallel and Distributed Information Retrieval SystemParallel and Distributed Information Retrieval System
Parallel and Distributed Information Retrieval System
vimalsura
 
Solution manual of assembly language programming and organization of the ibm ...
Solution manual of assembly language programming and organization of the ibm ...Solution manual of assembly language programming and organization of the ibm ...
Solution manual of assembly language programming and organization of the ibm ...
Tayeen Ahmed
 
Web Content Mining
Web Content MiningWeb Content Mining
Web Content Mining
Daminda Herath
 
The search engine index
The search engine indexThe search engine index
The search engine index
CJ Jenkins
 
Extracting keywords from texts - Sanda Martincic Ipsic
Extracting keywords from texts - Sanda Martincic IpsicExtracting keywords from texts - Sanda Martincic Ipsic
Extracting keywords from texts - Sanda Martincic Ipsic
Institute of Contemporary Sciences
 
HITS algorithm : NOTES
HITS algorithm : NOTESHITS algorithm : NOTES
HITS algorithm : NOTES
Subhajit Sahu
 
Physical Design of IoT.pdf
Physical Design of IoT.pdfPhysical Design of IoT.pdf
Physical Design of IoT.pdf
JoshuaKimmich1
 
Mapreduce in Search
Mapreduce in SearchMapreduce in Search
Mapreduce in Search
Amund Tveit
 
DCDR Unit-7 Mathematical Preliminaries for Lossy Coding
DCDR Unit-7 Mathematical Preliminaries for Lossy CodingDCDR Unit-7 Mathematical Preliminaries for Lossy Coding
DCDR Unit-7 Mathematical Preliminaries for Lossy Coding
Gyanmanjari Institute Of Technology
 
CS2303 theory of computation Toc answer key november december 2014
CS2303 theory of computation Toc answer key november december 2014CS2303 theory of computation Toc answer key november december 2014
CS2303 theory of computation Toc answer key november december 2014
appasami
 
Structured Knowledge Representation
Structured Knowledge RepresentationStructured Knowledge Representation
Structured Knowledge Representation
Sagacious IT Solution
 
Information retrieval system!
Information retrieval system!Information retrieval system!
Information retrieval system!
Jane Garay
 
Artificial Neural Networks 1
Artificial Neural Networks 1Artificial Neural Networks 1
Artificial Neural Networks 1
swapnac12
 
The Chase in Database Theory
The Chase in Database TheoryThe Chase in Database Theory
The Chase in Database Theory
Jan Hidders
 
Web indexing finale
Web indexing finaleWeb indexing finale
Web indexing finale
Ajit More
 

What's hot (20)

Intro to web scraping with Python
Intro to web scraping with PythonIntro to web scraping with Python
Intro to web scraping with Python
 
Graph theory narsingh deo
Graph theory narsingh deoGraph theory narsingh deo
Graph theory narsingh deo
 
Automatic indexing
Automatic indexingAutomatic indexing
Automatic indexing
 
Web search Technologies
Web search TechnologiesWeb search Technologies
Web search Technologies
 
Database indexing techniques
Database indexing techniquesDatabase indexing techniques
Database indexing techniques
 
Parallel and Distributed Information Retrieval System
Parallel and Distributed Information Retrieval SystemParallel and Distributed Information Retrieval System
Parallel and Distributed Information Retrieval System
 
Solution manual of assembly language programming and organization of the ibm ...
Solution manual of assembly language programming and organization of the ibm ...Solution manual of assembly language programming and organization of the ibm ...
Solution manual of assembly language programming and organization of the ibm ...
 
Web Content Mining
Web Content MiningWeb Content Mining
Web Content Mining
 
The search engine index
The search engine indexThe search engine index
The search engine index
 
Extracting keywords from texts - Sanda Martincic Ipsic
Extracting keywords from texts - Sanda Martincic IpsicExtracting keywords from texts - Sanda Martincic Ipsic
Extracting keywords from texts - Sanda Martincic Ipsic
 
HITS algorithm : NOTES
HITS algorithm : NOTESHITS algorithm : NOTES
HITS algorithm : NOTES
 
Physical Design of IoT.pdf
Physical Design of IoT.pdfPhysical Design of IoT.pdf
Physical Design of IoT.pdf
 
Mapreduce in Search
Mapreduce in SearchMapreduce in Search
Mapreduce in Search
 
DCDR Unit-7 Mathematical Preliminaries for Lossy Coding
DCDR Unit-7 Mathematical Preliminaries for Lossy CodingDCDR Unit-7 Mathematical Preliminaries for Lossy Coding
DCDR Unit-7 Mathematical Preliminaries for Lossy Coding
 
CS2303 theory of computation Toc answer key november december 2014
CS2303 theory of computation Toc answer key november december 2014CS2303 theory of computation Toc answer key november december 2014
CS2303 theory of computation Toc answer key november december 2014
 
Structured Knowledge Representation
Structured Knowledge RepresentationStructured Knowledge Representation
Structured Knowledge Representation
 
Information retrieval system!
Information retrieval system!Information retrieval system!
Information retrieval system!
 
Artificial Neural Networks 1
Artificial Neural Networks 1Artificial Neural Networks 1
Artificial Neural Networks 1
 
The Chase in Database Theory
The Chase in Database TheoryThe Chase in Database Theory
The Chase in Database Theory
 
Web indexing finale
Web indexing finaleWeb indexing finale
Web indexing finale
 

Similar to PPT Web Clustering Engine.pptx

Rutuja SEO.pdf
Rutuja SEO.pdfRutuja SEO.pdf
Rutuja SEO.pdf
PRATIKSHABHOYAR6
 
Search Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar ReportSearch Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar Report
Nandu B Rajan
 
Search Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar ReportSearch Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar Report
Nandu B Rajan
 
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web CrawlerIRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET Journal
 
Calculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link AnalysisCalculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link Analysis
IRJET Journal
 
G017254554
G017254554G017254554
G017254554
IOSR Journals
 
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document RetrievalAn Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
iosrjce
 
Teregowda.ppt
Teregowda.pptTeregowda.ppt
Teregowda.ppt
aozcan1
 
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET Journal
 
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
IEEEFINALYEARSTUDENTPROJECT
 
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
IEEEMEMTECHSTUDENTSPROJECTS
 
IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...
IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...
IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...
IEEEFINALYEARSTUDENTPROJECTS
 
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET Journal
 
CloWSer
CloWSerCloWSer
50120140503003 2
50120140503003 250120140503003 2
50120140503003 2
IAEME Publication
 
Implementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEMImplementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEM
rtpaem
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
IRJET Journal
 
Ranking algorithms
Ranking algorithmsRanking algorithms
Ranking algorithms
Ankit Raj
 
A Survey On Search Engines
A Survey On Search EnginesA Survey On Search Engines
A Survey On Search Engines
Andrew Parish
 
How Internet Search Engines Work
How Internet Search Engines WorkHow Internet Search Engines Work
How Internet Search Engines Work
Mukalele Rogers
 

Similar to PPT Web Clustering Engine.pptx (20)

Rutuja SEO.pdf
Rutuja SEO.pdfRutuja SEO.pdf
Rutuja SEO.pdf
 
Search Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar ReportSearch Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar Report
 
Search Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar ReportSearch Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar Report
 
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web CrawlerIRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
 
Calculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link AnalysisCalculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link Analysis
 
G017254554
G017254554G017254554
G017254554
 
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document RetrievalAn Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
 
Teregowda.ppt
Teregowda.pptTeregowda.ppt
Teregowda.ppt
 
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
 
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
 
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
 
IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...
IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...
IEEE 2014 JAVA DATA MINING PROJECTS Web image re ranking using query-specific...
 
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
 
CloWSer
CloWSerCloWSer
CloWSer
 
50120140503003 2
50120140503003 250120140503003 2
50120140503003 2
 
Implementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEMImplementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEM
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
 
Ranking algorithms
Ranking algorithmsRanking algorithms
Ranking algorithms
 
A Survey On Search Engines
A Survey On Search EnginesA Survey On Search Engines
A Survey On Search Engines
 
How Internet Search Engines Work
How Internet Search Engines WorkHow Internet Search Engines Work
How Internet Search Engines Work
 

Recently uploaded

2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
Łukasz Chruściel
 
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s EcosystemUI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
Peter Muessig
 
socradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdfsocradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdf
SOCRadar
 
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdfRevolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
Undress Baby
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
Remote DBA Services
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
Remote DBA Services
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Neo4j
 
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise EditionWhy Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Envertis Software Solutions
 
SWEBOK and Education at FUSE Okinawa 2024
SWEBOK and Education at FUSE Okinawa 2024SWEBOK and Education at FUSE Okinawa 2024
SWEBOK and Education at FUSE Okinawa 2024
Hironori Washizaki
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
Rakesh Kumar R
 
DDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systemsDDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systems
Gerardo Pardo-Castellote
 
Energy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina JonuziEnergy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina Jonuzi
Green Software Development
 
SMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API ServiceSMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API Service
Yara Milbes
 
E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
Hornet Dynamics
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
mz5nrf0n
 
What is Augmented Reality Image Tracking
What is Augmented Reality Image TrackingWhat is Augmented Reality Image Tracking
What is Augmented Reality Image Tracking
pavan998932
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
Peter Muessig
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
Philip Schwarz
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
TheSMSPoint
 

Recently uploaded (20)

2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
 
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s EcosystemUI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
 
socradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdfsocradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdf
 
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdfRevolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
 
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise EditionWhy Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
 
SWEBOK and Education at FUSE Okinawa 2024
SWEBOK and Education at FUSE Okinawa 2024SWEBOK and Education at FUSE Okinawa 2024
SWEBOK and Education at FUSE Okinawa 2024
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
 
DDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systemsDDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systems
 
Energy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina JonuziEnergy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina Jonuzi
 
SMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API ServiceSMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API Service
 
E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
 
What is Augmented Reality Image Tracking
What is Augmented Reality Image TrackingWhat is Augmented Reality Image Tracking
What is Augmented Reality Image Tracking
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
 

PPT Web Clustering Engine.pptx

  • 1. (DR. BABASAHEB AMBEDKAR TECHNOLOGICAL UNIVERSITY, LONERE(2022-23) RAJIV GANDHI COLLEGE OF ENGINEERING, RESEARCH & TECHNOLOGY, CHANDRAPUR SEMINAR REPORT ON “Web Clustering Engines” Submitted By Prajwal Dilip Kamble Roll No: CSEA347 SEMISTER- III SECOND YEAR BTCOS347 Seminar –I Seminar Incharge Prof. R. V. Lichode Guided By: Prof.Ravi Chibule Dr. Nitin Janwe HOD, CSE & IT
  • 2. RAJIV GANDHI COLLEGE OF ENGINEERING, RESEARCH & TECHNOLOGY, CHANDRAPUR (DR. BABASAHEB AMBEDKAR TECHNOLOGICAL UNIVERSITY, LONERE) (2022-23) CERTIFICATE This is to certify that, Mr. Prajwal Dilip Kamble , Roll No. CSEA347 studying in B. E. Third semester Computer Science and Engineering in the session 2022-2023, has successfully completed seminar-I on “Web Clustering Engine” satisfactorily during the academic session 2022-2023 from CSE, RCERT, Chandrapur. Seminar Incharge Prof. R. V. Lichode Guided By: Prof. Ravi Chibule Dr. Nitin Janwe HOD, CSE & IT
  • 4. CONTENTS: • Abstract • Introduction • Search engine • Why web clustering engine • Main Advantages of cluster hierarchy • Issues in implementation of cluster • Architecture • How to represent Feature/text ? • Data Centric clustering algorithm • Conclusion
  • 5. Abstract  Web clustering Engines are emerging trend in the field of information retrieval. They organize search results by topic, thus offering a complementary view to the flat ranked list returned by the conventional search engines.  The search results returned by traditional search engines on different subtopics or meanings of a query will be mixed together in the list so that the user may have to sift through a large number of irrelevant items to locate those of interest. The Web clustering engines categorize the search results into different hierarchical groups/clusters and display those cluster labels.
  • 6. Introduction  Web Clustering Engines organize search results by topic, thus offering a complementary view to the flat ranked list returned by the conventional search engines.
  • 7. Search Engine  A search engine is a software system designed to carry out web searches. They search the World Wide Web in a systematic way for particular information specified in a textual web search query. The search results are generally presented in a line of results, often referred to as search engine results pages.  A search engine is a software program that helps people find the information they are looking for online using keywords or phrases.
  • 8. Search engine • Search engine is a website • Help user to find information on world wide Web. • Archie is a first search engine • 1970 Archie stablish is a world first search engine • The most used search engine is a Google • Google is invented is 1997 , google is a one of the most famous search engine.
  • 9. Why web clustering engine?  Conventional engines are not much efficient in ambiguous queries  The search results returned by conventional search engines on query will be mixed together in the list, irrelevant item occurs.  In this context of search result come into picture!
  • 10. Main advantages of cluster hierarchy  It makes for shortcuts to the items that relate to the same meaning  It allows better topic understanding  It favors systematic exploration of search results.
  • 11. Issues in implementation of clusters  Short input description  Meaningful labels  Selection of similar measure  Grouping of objects into clusters  Computation efficiency
  • 12. Architecture • Practical implementations of Web search clustering engines will usually consist of four general components: search results acquisition, input preprocessing, cluster construction, and visualization of clustered results
  • 13. Search result acquisition  The task of search result acquisition is to provide input for the rest of system.  Based on the query, the acquisition component must deliver 50 to 500 results, each of which should contain  -Contextual snippet  Title URL pointing to the full text being referred to  The source of search results can be any public search engines such as ggoogle yahoo etc.  The most elegant way of searching results from search engines is by using application programming interfaces (APIs) these engines provide.
  • 14. Conclusion  Web clustering engines organize search results by topic thus offering a complimentary view to the flat-ranked list returned by conventional search engines.  Due to lack of efficient methods of performance evaluation of clustering engines they are not seeking the attention of the people.