SlideShare a Scribd company logo
In Situ Evaluation of Entity Ranking 
and Opinion Summarization 
using 
www.findilike.com 
Kavita Ganesan & ChengXiang Zhai 
University of Illinois @ Urbana Champaign
What is findilike? 
• Preference – driven search engine 
– Currently works in hotels domain 
– Finds & ranks hotels based on user preferences: 
Structured: price, distance 
Unstructured: “friendly service”, “clean”, “good views” 
(Based on existing user reviews)  UNIQUE 
• Beyond search: Support for analysis of hotels 
– Opinion summaries 
– Tag cloud visualization of reviews
…What is findilike? 
• Developed as part of PhD. Work – new system 
(Opinion-Driven Decision Support System, UIUC, 2013) 
• Tracked ~1000 unique users from Jan - Aug ‘13 
– Working on speed & reaching out to more users
Evaluating Review Summarization 
Mini Test-bed 
• Base code to extend 
• Set of sample sentences 
• Gold standard summary for those sentences 
• ROUGE toolkit to evaluate the results 
• Data set based on - Ganesan et. al 2010
Evaluating Entity Ranking 
Mini Test-bed 
• Base code to extend 
• Terrier Index of hotel reviews 
• Gold standard ranking of hotels 
• Code to generate nDCG scores. 
• Raw unindexed data set for reference
Building a new ranking model 
Extend Weighting 
Model
DEMO
2 Components that can be evaluated 
through natural user interaction 
1 
Ranking entities based on 
unstructured user preferences 
Opinion-Based Entity Ranking 
(Ganesan & Zhai 2012) 
Summarization of reviews 
Generating short phrases 
summarizing key opinions 
(Ganesan et. al 2010, 2012) 
2
Evaluation of entity ranking 
• Retrieval 
– Interleave results 
Balanced 
interleaving 
(T. Joachims, 2002) 
Base 
DirichletLM 
A click indicates preference… 
Base
Snapshot of pairwise comparison 
results for entity ranking 
# Queries 
B is better 
Algorithms 
DirichletLM, 
Base, PL2 
# Queries 
A is Better 
A B CA > CB 
(A Better) 
CB > CA 
(B Better) 
CA = CB > 0 
(Tie) 
CA = CB = 0 Total 
DLM Base 30 35 2 5 72 
PL2 Base 10 28 3 7 48 
… … … … … … …
Snapshot of pairwise comparison 
results for entity ranking 
A B CA > CB 
(A Better) 
CB > CA 
(B Better) 
Base model 
better, but DLM 
not too far behind 
Base model 
better CA = CB & > 0 
PL2 not 
(Tie) 
CA = CB = 0 Total 
too good 
DLM Base 30 35 2 5 72 
PL2 Base 10 28 3 7 48 
… … … … … … …
Evaluation of review summarization 
Randomly mix top N 
phrases from two 
algorithms 
ALGO1 
ALGO2 Monitor click-through 
More clicks on phrases from Algo1 vs. Algo2  
Algo1 better 
on per 
entity basis
How to submit a new algorithm? 
Submit code 
Performance 
report 
A B CA > CB 
(A Better) 
… … … … CB > CA 
(B Better) 
DLM Base 30 35 
PL2 Base 10 28 
Online Performance 
Test on mini test 
bed 
Sample Code 
Test Data & Gold 
Standard 
Evaluator 
(nDCG, ROUGE) 
Mini Testbed 
Local performance 
Write Java 
based code 
Extend 
existing code 
Implementation
More information about evaluation… 
eval.findilike.com
Thanks! Questions? 
Links 
• Evaluation: http://eval.findilike.com 
• System: http://hotels.findilike.com/ 
• Related Papers: kavita-ganesan.com
References 
• Ganesan, K. A., C. X. Zhai, and E. Viegas, Micropinion Generation: An Unsupervised 
Approach to Generating Ultra-Concise Summaries of Opinions, Proceedings of the 
21st International Conference on World Wide Web 2012 (WWW '12), 2012. 
• Ganesan, K. A., and C. X. Zhai, Opinion-Based Entity Ranking, Information Retrieval, 
vol. 15, issue 2, 2012 
• Ganesan, K. A., C. X. Zhai, and J. Han, Opinosis: A Graph Based Approach to 
Abstractive Summarization of Highly Redundant Opinions, Proceedings of the 23rd 
International Conference on Computational Linguistics (COLING '10), 2010. 
• T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of 
the eighth ACM SIGKDD international conference on Knowledge discovery and 
data mining, KDD ’02, NY, 2002.

More Related Content

Viewers also liked

Listening exercise ted 2
Listening exercise ted 2Listening exercise ted 2
Listening exercise ted 2Nini Paz
 
Presente continúo2
Presente continúo2Presente continúo2
Presente continúo2Nini Paz
 
Supermarket
SupermarketSupermarket
SupermarketNini Paz
 
Trabajo de realidad nacional
Trabajo de realidad nacionalTrabajo de realidad nacional
Trabajo de realidad nacionalJuan Wong
 
Listening 3 activity
Listening 3 activityListening 3 activity
Listening 3 activityNini Paz
 
Documental
DocumentalDocumental
Documental
Héctor Snchz
 
Listening exercise ted 2
Listening exercise ted 2Listening exercise ted 2
Listening exercise ted 2Nini Paz
 
BLOQUES DE REALIDAD NACIONAL
BLOQUES DE REALIDAD NACIONALBLOQUES DE REALIDAD NACIONAL
BLOQUES DE REALIDAD NACIONALJuan Wong
 
Blubag brochure
Blubag brochureBlubag brochure
Blubag brochurefooserv
 

Viewers also liked (19)

Listening exercise ted 2
Listening exercise ted 2Listening exercise ted 2
Listening exercise ted 2
 
Presente continúo2
Presente continúo2Presente continúo2
Presente continúo2
 
Supermarket
SupermarketSupermarket
Supermarket
 
Trabajo de realidad nacional
Trabajo de realidad nacionalTrabajo de realidad nacional
Trabajo de realidad nacional
 
El colegio (mesitas)
El colegio (mesitas)El colegio (mesitas)
El colegio (mesitas)
 
Listening 3 activity
Listening 3 activityListening 3 activity
Listening 3 activity
 
Mars
MarsMars
Mars
 
Giffords
GiffordsGiffords
Giffords
 
Feelings
FeelingsFeelings
Feelings
 
Flessas c.a 2011
Flessas c.a 2011Flessas c.a 2011
Flessas c.a 2011
 
Hansen
HansenHansen
Hansen
 
Documental
DocumentalDocumental
Documental
 
Listening exercise ted 2
Listening exercise ted 2Listening exercise ted 2
Listening exercise ted 2
 
Clothes
ClothesClothes
Clothes
 
BLOQUES DE REALIDAD NACIONAL
BLOQUES DE REALIDAD NACIONALBLOQUES DE REALIDAD NACIONAL
BLOQUES DE REALIDAD NACIONAL
 
Blubag brochure
Blubag brochureBlubag brochure
Blubag brochure
 
Soria antigua
Soria antiguaSoria antigua
Soria antigua
 
Mars
MarsMars
Mars
 
Soria antigua
Soria antiguaSoria antigua
Soria antigua
 

Similar to In situ evaluation of entity retrieval and opinion summarization

Opinion-Based Entity Ranking
Opinion-Based Entity RankingOpinion-Based Entity Ranking
Opinion-Based Entity Ranking
Kavita Ganesan
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptx
Jadna Almeida
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptx
Jadna Almeida
 
AI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementAI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge Management
Trey Grainger
 
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
YONG ZHENG
 
Fully Automated QA System For Large Scale Search And Recommendation Engines U...
Fully Automated QA System For Large Scale Search And Recommendation Engines U...Fully Automated QA System For Large Scale Search And Recommendation Engines U...
Fully Automated QA System For Large Scale Search And Recommendation Engines U...
Spark Summit
 
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Jin Young Kim
 
Online feedback correlation using clustering
Online feedback correlation using clusteringOnline feedback correlation using clustering
Online feedback correlation using clustering
awesomesos
 
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Sease
 
Internet 信息检索中的数学
Internet 信息检索中的数学Internet 信息检索中的数学
Internet 信息检索中的数学
Xu jiakon
 
PASS Summit 2010 Keynote David DeWitt
PASS Summit 2010 Keynote David DeWittPASS Summit 2010 Keynote David DeWitt
PASS Summit 2010 Keynote David DeWitt
GraySystemsLab
 
Search quality in practice
Search quality in practiceSearch quality in practice
Search quality in practice
Alexander Sibiryakov
 
Summarization and opinion detection of product reviews (1)
Summarization and opinion detection of product reviews (1)Summarization and opinion detection of product reviews (1)
Summarization and opinion detection of product reviews (1)Lokesh Mittal
 
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
YONG ZHENG
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Alan Said
 
Understanding Queries through Entities
Understanding Queries through EntitiesUnderstanding Queries through Entities
Understanding Queries through Entities
Peter Mika
 
Feature Based Opinion Mining from Amazon Reviews
Feature Based Opinion Mining from Amazon ReviewsFeature Based Opinion Mining from Amazon Reviews
Feature Based Opinion Mining from Amazon Reviews
Ravi Kiran Holur Vijay
 
Capstone Project Slides- Yelper
Capstone Project Slides- YelperCapstone Project Slides- Yelper
Capstone Project Slides- YelperChuan Sun
 
presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdf
caa28steve
 

Similar to In situ evaluation of entity retrieval and opinion summarization (20)

Opinion-Based Entity Ranking
Opinion-Based Entity RankingOpinion-Based Entity Ranking
Opinion-Based Entity Ranking
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptx
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptx
 
AI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementAI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge Management
 
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
 
Fully Automated QA System For Large Scale Search And Recommendation Engines U...
Fully Automated QA System For Large Scale Search And Recommendation Engines U...Fully Automated QA System For Large Scale Search And Recommendation Engines U...
Fully Automated QA System For Large Scale Search And Recommendation Engines U...
 
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
 
Online feedback correlation using clustering
Online feedback correlation using clusteringOnline feedback correlation using clustering
Online feedback correlation using clustering
 
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
 
Mazhiming
MazhimingMazhiming
Mazhiming
 
Internet 信息检索中的数学
Internet 信息检索中的数学Internet 信息检索中的数学
Internet 信息检索中的数学
 
PASS Summit 2010 Keynote David DeWitt
PASS Summit 2010 Keynote David DeWittPASS Summit 2010 Keynote David DeWitt
PASS Summit 2010 Keynote David DeWitt
 
Search quality in practice
Search quality in practiceSearch quality in practice
Search quality in practice
 
Summarization and opinion detection of product reviews (1)
Summarization and opinion detection of product reviews (1)Summarization and opinion detection of product reviews (1)
Summarization and opinion detection of product reviews (1)
 
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
 
Understanding Queries through Entities
Understanding Queries through EntitiesUnderstanding Queries through Entities
Understanding Queries through Entities
 
Feature Based Opinion Mining from Amazon Reviews
Feature Based Opinion Mining from Amazon ReviewsFeature Based Opinion Mining from Amazon Reviews
Feature Based Opinion Mining from Amazon Reviews
 
Capstone Project Slides- Yelper
Capstone Project Slides- YelperCapstone Project Slides- Yelper
Capstone Project Slides- Yelper
 
presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdf
 

More from Kavita Ganesan

Comparison between cbow, skip gram and skip-gram with subword information (1)
Comparison between cbow, skip gram and skip-gram with subword information (1)Comparison between cbow, skip gram and skip-gram with subword information (1)
Comparison between cbow, skip gram and skip-gram with subword information (1)
Kavita Ganesan
 
Comparison between cbow, skip gram and skip-gram with subword information
Comparison between cbow, skip gram and skip-gram with subword informationComparison between cbow, skip gram and skip-gram with subword information
Comparison between cbow, skip gram and skip-gram with subword information
Kavita Ganesan
 
Introduction to Java Strings, By Kavita Ganesan
Introduction to Java Strings, By Kavita GanesanIntroduction to Java Strings, By Kavita Ganesan
Introduction to Java Strings, By Kavita Ganesan
Kavita Ganesan
 
Statistical Methods for Integration and Analysis of Online Opinionated Text...
Statistical Methods for Integration and Analysis of Online Opinionated Text...Statistical Methods for Integration and Analysis of Online Opinionated Text...
Statistical Methods for Integration and Analysis of Online Opinionated Text...
Kavita Ganesan
 
Segmentation of Clinical Texts
Segmentation of Clinical TextsSegmentation of Clinical Texts
Segmentation of Clinical Texts
Kavita Ganesan
 
Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit
Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit
Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit
Kavita Ganesan
 
Very Small Tutorial on Terrier 3.0 Retrieval Toolkit
Very Small Tutorial on Terrier 3.0 Retrieval ToolkitVery Small Tutorial on Terrier 3.0 Retrieval Toolkit
Very Small Tutorial on Terrier 3.0 Retrieval Toolkit
Kavita Ganesan
 
Opinion Driven Decision Support System
Opinion Driven Decision Support SystemOpinion Driven Decision Support System
Opinion Driven Decision Support System
Kavita Ganesan
 
Micropinion Generation
Micropinion GenerationMicropinion Generation
Micropinion Generation
Kavita Ganesan
 
Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)
Kavita Ganesan
 

More from Kavita Ganesan (10)

Comparison between cbow, skip gram and skip-gram with subword information (1)
Comparison between cbow, skip gram and skip-gram with subword information (1)Comparison between cbow, skip gram and skip-gram with subword information (1)
Comparison between cbow, skip gram and skip-gram with subword information (1)
 
Comparison between cbow, skip gram and skip-gram with subword information
Comparison between cbow, skip gram and skip-gram with subword informationComparison between cbow, skip gram and skip-gram with subword information
Comparison between cbow, skip gram and skip-gram with subword information
 
Introduction to Java Strings, By Kavita Ganesan
Introduction to Java Strings, By Kavita GanesanIntroduction to Java Strings, By Kavita Ganesan
Introduction to Java Strings, By Kavita Ganesan
 
Statistical Methods for Integration and Analysis of Online Opinionated Text...
Statistical Methods for Integration and Analysis of Online Opinionated Text...Statistical Methods for Integration and Analysis of Online Opinionated Text...
Statistical Methods for Integration and Analysis of Online Opinionated Text...
 
Segmentation of Clinical Texts
Segmentation of Clinical TextsSegmentation of Clinical Texts
Segmentation of Clinical Texts
 
Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit
Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit
Enabling Opinion-Driven Decision Making - Sentiment Analysis Innovation Summit
 
Very Small Tutorial on Terrier 3.0 Retrieval Toolkit
Very Small Tutorial on Terrier 3.0 Retrieval ToolkitVery Small Tutorial on Terrier 3.0 Retrieval Toolkit
Very Small Tutorial on Terrier 3.0 Retrieval Toolkit
 
Opinion Driven Decision Support System
Opinion Driven Decision Support SystemOpinion Driven Decision Support System
Opinion Driven Decision Support System
 
Micropinion Generation
Micropinion GenerationMicropinion Generation
Micropinion Generation
 
Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)
 

Recently uploaded

Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
Pixlogix Infotech
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 

Recently uploaded (20)

Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 

In situ evaluation of entity retrieval and opinion summarization

  • 1. In Situ Evaluation of Entity Ranking and Opinion Summarization using www.findilike.com Kavita Ganesan & ChengXiang Zhai University of Illinois @ Urbana Champaign
  • 2. What is findilike? • Preference – driven search engine – Currently works in hotels domain – Finds & ranks hotels based on user preferences: Structured: price, distance Unstructured: “friendly service”, “clean”, “good views” (Based on existing user reviews)  UNIQUE • Beyond search: Support for analysis of hotels – Opinion summaries – Tag cloud visualization of reviews
  • 3. …What is findilike? • Developed as part of PhD. Work – new system (Opinion-Driven Decision Support System, UIUC, 2013) • Tracked ~1000 unique users from Jan - Aug ‘13 – Working on speed & reaching out to more users
  • 4. Evaluating Review Summarization Mini Test-bed • Base code to extend • Set of sample sentences • Gold standard summary for those sentences • ROUGE toolkit to evaluate the results • Data set based on - Ganesan et. al 2010
  • 5. Evaluating Entity Ranking Mini Test-bed • Base code to extend • Terrier Index of hotel reviews • Gold standard ranking of hotels • Code to generate nDCG scores. • Raw unindexed data set for reference
  • 6. Building a new ranking model Extend Weighting Model
  • 8. 2 Components that can be evaluated through natural user interaction 1 Ranking entities based on unstructured user preferences Opinion-Based Entity Ranking (Ganesan & Zhai 2012) Summarization of reviews Generating short phrases summarizing key opinions (Ganesan et. al 2010, 2012) 2
  • 9. Evaluation of entity ranking • Retrieval – Interleave results Balanced interleaving (T. Joachims, 2002) Base DirichletLM A click indicates preference… Base
  • 10. Snapshot of pairwise comparison results for entity ranking # Queries B is better Algorithms DirichletLM, Base, PL2 # Queries A is Better A B CA > CB (A Better) CB > CA (B Better) CA = CB > 0 (Tie) CA = CB = 0 Total DLM Base 30 35 2 5 72 PL2 Base 10 28 3 7 48 … … … … … … …
  • 11. Snapshot of pairwise comparison results for entity ranking A B CA > CB (A Better) CB > CA (B Better) Base model better, but DLM not too far behind Base model better CA = CB & > 0 PL2 not (Tie) CA = CB = 0 Total too good DLM Base 30 35 2 5 72 PL2 Base 10 28 3 7 48 … … … … … … …
  • 12. Evaluation of review summarization Randomly mix top N phrases from two algorithms ALGO1 ALGO2 Monitor click-through More clicks on phrases from Algo1 vs. Algo2  Algo1 better on per entity basis
  • 13. How to submit a new algorithm? Submit code Performance report A B CA > CB (A Better) … … … … CB > CA (B Better) DLM Base 30 35 PL2 Base 10 28 Online Performance Test on mini test bed Sample Code Test Data & Gold Standard Evaluator (nDCG, ROUGE) Mini Testbed Local performance Write Java based code Extend existing code Implementation
  • 14. More information about evaluation… eval.findilike.com
  • 15. Thanks! Questions? Links • Evaluation: http://eval.findilike.com • System: http://hotels.findilike.com/ • Related Papers: kavita-ganesan.com
  • 16. References • Ganesan, K. A., C. X. Zhai, and E. Viegas, Micropinion Generation: An Unsupervised Approach to Generating Ultra-Concise Summaries of Opinions, Proceedings of the 21st International Conference on World Wide Web 2012 (WWW '12), 2012. • Ganesan, K. A., and C. X. Zhai, Opinion-Based Entity Ranking, Information Retrieval, vol. 15, issue 2, 2012 • Ganesan, K. A., C. X. Zhai, and J. Han, Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions, Proceedings of the 23rd International Conference on Computational Linguistics (COLING '10), 2010. • T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’02, NY, 2002.