SlideShare a Scribd company logo
1 of 16
ES-Rank: Evolutionary Strategy
Learning to Rank Approach
Osman AliOsman Ali SadekSadek Ibrahim, Dario LandaIbrahim, Dario Landa--SilvaSilva
ASAP Research GroupASAP Research Group
The UniversityThe University oof Nottingham, UKf Nottingham, UK
April,April, 20172017
Outline of the PresentationOutline of the Presentation
Background
Information Retrieval System Architecture
Learning to Rank Architecture
Problem Statement
The Proposed ES-Rank
2
High-Level Overview
Detailed steps
Experimental Studies
Experimental Study Settings.
Experimental Results.
Conclusion
Information Retrieval (IR) System ArchitectureInformation Retrieval (IR) System Architecture
3Figure 1: Typical IR System Architecture
Typical Structure of a LTR DatasetTypical Structure of a LTR Dataset
4
In general, there are three categories of LTR methods:
point-wise, pair-wise and list-wise. Hybrid approaches
have also been proposed.
Learning to RankLearning to Rank ArchitectureArchitecture
5
Some of the limitations of existing Learning to Rank techniques:
Not considering the whole training dataset in each
learning iteration. Performance vs Computation Effort.
Problem StatementProblem Statement
6
Computational runtime and memory size requirements
for some of the big datasets. For example, rt-rank and
Ranklib packages.
The large variations in the characteristics of big datasets
demands their full consideration for more effective
learning.
Start IR System Using
Various Query-
Document Features
Document
Collection
OR
Users search the IR
system for Information
Proposed Approach (highProposed Approach (high--level description)level description)
7
Gathering Relevance
Feedback by IR System from
Users to Create the IR
Query-Document Pairs
Use Query-Document Pairs Training Set for
Evolving a Ranking Model using ES-Rank
OR
WWW
The Proposed Approach (ESThe Proposed Approach (ES--Rank)Rank)
Start
Initialize Chromosomes for
Parent and Offspring
Previous mutation
good and stopping
Mutate chromosome on
R genes using previous
NO
Yes
Stopping
criterion
reached?
End
YES
8
Mutate chromosome on
R random genes with
Gaussian*exp(Cauchy)
random number
good and stopping
criterion not
reached?
R genes using previous
mutation step
Mutated chromosome
better than current
chromosome and
stop criterion not
reached?
Mutated chromosome
becomes the current
chromosome
NO
Yes
NO
Initialization
Stopping Criterion
Random Mutation with
The Proposed Approach (ESThe Proposed Approach (ES--Rank)Rank)
9
Objective Function
Random Mutation with
adaptive procedure
according to pervious
iteration procedure.
Experimental Study SettingsExperimental Study Settings
• Each dataset includes the training, validation and test sets.
• ES-Rank and the other compared methods are applied to the training set in
order to evolve a linear ranking function.
• The performance of the linear ranking function is assessed using the test set
to obtain the predictive performance of the learning algorithm.to obtain the predictive performance of the learning algorithm.
• The comparison used three well-known LETOR datasets and two evaluation
fitness metrics.
10
LETOR Datasets used in experiments
Dataset Queries
Query-URL
Pairs
Features Relevance Labels No. Folds
MQ2007 1692 69623 46 {0,1,2} 5
MQ2008 784 15211 46 {0,1,2} 5
MSLR-
WEB10K
10000 1200192 136 {0,1,2,3,4} 5
Experimental Study SettingsExperimental Study Settings
• The experimental study conducted was a comparison between ES-Rank and
14 other Evolutionary and Machine Learning techniques.
• The default parameter settings suggested for the 14 Evolutionary and
Machine Learning techniques were used as described in well-known packagesMachine Learning techniques were used as described in well-known packages
(Ranklib, rt-Rank, LAGEP, SVM-Rank, Sofia-ml).
• ES-Rank uses (1+1)-Evolutionary Strategy with Gaussian*exp(Cauchy) Random
Number as the mutation step. The mutation has a random probability
(Random Walk) that adapts according to the performance in the previous
iteration. The number of evolving iterations was set to 1300 in these
experiments. 11
Experimental ResultsExperimental Results
[Mean Average Precision (MAP) Results][Mean Average Precision (MAP) Results]
ES-Rank performs the best in 7 of the 15 data folds
.
9
Experimental ResultsExperimental Results
[Normalized Discounted Cumulative Gain (NDCG) Results][Normalized Discounted Cumulative Gain (NDCG) Results]
ES-Rank performs the best in 4 of the 15 data folds.
.
13
Experimental Results (Time Evaluation)Experimental Results (Time Evaluation)
Algorithm MSLR-WEB10K MQ2008 MQ2007 AvgTime (s)
RankBoost 3720 15 74 1269.667
RankSVM 32409 19 23 10817
ListNET 18005 45 95 6048.333
AdaRank 3600 11 20 1210.333
MART 1200 8 11 406.3333
Coordinate Ascent 25200 37 240 8492.333
14
Coordinate Ascent 25200 37 240 8492.333
LambdaMART 3720 9 11 1246.667
RankNET 10800 33 96 3643
Random Forest 3660 27 55 1247.333
Linear Regression 157 2 3 54
RankGP 26020 375 390 8928.333
Combined Ranking and
Regression
10803 42 51 3632
LambdaRank 18015 46 142 6067.667
IGBRT 36750 274 253 12425.67
ES-Rank 1800 35 51 628.6667
Experimental Results (Statistical Summary)Experimental Results (Statistical Summary)
Algorithms mean sd se(mean) cv
AdaRank 0.4605 0.0758 0.0138 0.1646
Combined.Regression.Ranking 0.4318 0.0406 0.0074 0.0941
Coordinate.Ascent 0.4776 0.0638 0.0116 0.1330
ES-Rank 0.4779 0.0596 0.0109 0.1247
IGBRT 0.4565 0.0563 0.0145 0.1234
LambdaMART 0.4774 0.0634 0.0116 0.1328
15
LambdaMART 0.4774 0.0634 0.0116 0.1328
LambdaRank 0.3247 0.0939 0.0171 0.2892
Linear.Regression 0.4425 0.0527 0.0096 0.1192
ListNET 0.4099 0.1034 0.0189 0.2522
MART 0.4750 0.0624 0.0114 0.1313
Random.Forest 0.4771 0.0661 0.0121 0.1384
RankBoost 0.4621 0.0770 0.0141 0.1666
RankGP 0.4197 0.0372 0.0068 0.0887
RankNET 0.4143 0.1060 0.0194 0.2559
RankSVM 0.4043 0.0707 0.0129 0.1748
ConclusionConclusion
There are issues in improving IR using learning to rank approaches,
particularly on large document collections.
The proposes ES-Rank performs well against fourteen other techniques in
terms of NDCG@10 and MAP.
16
terms of NDCG@10 and MAP.
The proposed ES-Rank considers the whole instance (Query-Document Pairs)
in each learning iteration which helps to improve the performance of the
evolved ranking model.
The computational run-time of ES-Rank is reasonable given its performance in
terms of accuracy (third fastest after Linear Regression and MART).

More Related Content

Similar to ES-Rank: Evolutionary Strategy Learning to Rank Approach

MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkSigOpt
 
B2 2006 sizing_benchmarking
B2 2006 sizing_benchmarkingB2 2006 sizing_benchmarking
B2 2006 sizing_benchmarkingSteve Feldman
 
B2 2006 sizing_benchmarking (1)
B2 2006 sizing_benchmarking (1)B2 2006 sizing_benchmarking (1)
B2 2006 sizing_benchmarking (1)Steve Feldman
 
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET Journal
 
Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionDr. Amarjeet Singh
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...IRJET Journal
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsScott Clark
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsSigOpt
 
Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003
Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003
Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003JULIO GONZALEZ SANZ
 
AlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in PythonAlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in PythonMark Conway
 
E132833
E132833E132833
E132833irjes
 
Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark MLAhmet Bulut
 
Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...
Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...
Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...Peter Laurinec
 
Biomedical Signal and Image Analytics using MATLAB
Biomedical Signal and Image Analytics using MATLABBiomedical Signal and Image Analytics using MATLAB
Biomedical Signal and Image Analytics using MATLABCodeOps Technologies LLP
 

Similar to ES-Rank: Evolutionary Strategy Learning to Rank Approach (20)

MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott Clark
 
Ajila (1)
Ajila (1)Ajila (1)
Ajila (1)
 
B2 2006 sizing_benchmarking
B2 2006 sizing_benchmarkingB2 2006 sizing_benchmarking
B2 2006 sizing_benchmarking
 
B2 2006 sizing_benchmarking (1)
B2 2006 sizing_benchmarking (1)B2 2006 sizing_benchmarking (1)
B2 2006 sizing_benchmarking (1)
 
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
 
Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern Recognition
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning Models
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning Models
 
Chapter15
Chapter15Chapter15
Chapter15
 
Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003
Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003
Cmmi hm experiences with leveraging six sigma to implement cmmi l4 yl5 2003
 
AlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in PythonAlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in Python
 
AlphaPy
AlphaPyAlphaPy
AlphaPy
 
Query processing
Query processingQuery processing
Query processing
 
PraveenResumeNewL
PraveenResumeNewLPraveenResumeNewL
PraveenResumeNewL
 
Vldb14
Vldb14Vldb14
Vldb14
 
E132833
E132833E132833
E132833
 
Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark ML
 
Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...
Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...
Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load For...
 
Biomedical Signal and Image Analytics using MATLAB
Biomedical Signal and Image Analytics using MATLABBiomedical Signal and Image Analytics using MATLAB
Biomedical Signal and Image Analytics using MATLAB
 

Recently uploaded

Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxabhijeetpadhi001
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 

Recently uploaded (20)

Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 

ES-Rank: Evolutionary Strategy Learning to Rank Approach

  • 1. ES-Rank: Evolutionary Strategy Learning to Rank Approach Osman AliOsman Ali SadekSadek Ibrahim, Dario LandaIbrahim, Dario Landa--SilvaSilva ASAP Research GroupASAP Research Group The UniversityThe University oof Nottingham, UKf Nottingham, UK April,April, 20172017
  • 2. Outline of the PresentationOutline of the Presentation Background Information Retrieval System Architecture Learning to Rank Architecture Problem Statement The Proposed ES-Rank 2 High-Level Overview Detailed steps Experimental Studies Experimental Study Settings. Experimental Results. Conclusion
  • 3. Information Retrieval (IR) System ArchitectureInformation Retrieval (IR) System Architecture 3Figure 1: Typical IR System Architecture
  • 4. Typical Structure of a LTR DatasetTypical Structure of a LTR Dataset 4 In general, there are three categories of LTR methods: point-wise, pair-wise and list-wise. Hybrid approaches have also been proposed.
  • 5. Learning to RankLearning to Rank ArchitectureArchitecture 5
  • 6. Some of the limitations of existing Learning to Rank techniques: Not considering the whole training dataset in each learning iteration. Performance vs Computation Effort. Problem StatementProblem Statement 6 Computational runtime and memory size requirements for some of the big datasets. For example, rt-rank and Ranklib packages. The large variations in the characteristics of big datasets demands their full consideration for more effective learning.
  • 7. Start IR System Using Various Query- Document Features Document Collection OR Users search the IR system for Information Proposed Approach (highProposed Approach (high--level description)level description) 7 Gathering Relevance Feedback by IR System from Users to Create the IR Query-Document Pairs Use Query-Document Pairs Training Set for Evolving a Ranking Model using ES-Rank OR WWW
  • 8. The Proposed Approach (ESThe Proposed Approach (ES--Rank)Rank) Start Initialize Chromosomes for Parent and Offspring Previous mutation good and stopping Mutate chromosome on R genes using previous NO Yes Stopping criterion reached? End YES 8 Mutate chromosome on R random genes with Gaussian*exp(Cauchy) random number good and stopping criterion not reached? R genes using previous mutation step Mutated chromosome better than current chromosome and stop criterion not reached? Mutated chromosome becomes the current chromosome NO Yes NO
  • 9. Initialization Stopping Criterion Random Mutation with The Proposed Approach (ESThe Proposed Approach (ES--Rank)Rank) 9 Objective Function Random Mutation with adaptive procedure according to pervious iteration procedure.
  • 10. Experimental Study SettingsExperimental Study Settings • Each dataset includes the training, validation and test sets. • ES-Rank and the other compared methods are applied to the training set in order to evolve a linear ranking function. • The performance of the linear ranking function is assessed using the test set to obtain the predictive performance of the learning algorithm.to obtain the predictive performance of the learning algorithm. • The comparison used three well-known LETOR datasets and two evaluation fitness metrics. 10 LETOR Datasets used in experiments Dataset Queries Query-URL Pairs Features Relevance Labels No. Folds MQ2007 1692 69623 46 {0,1,2} 5 MQ2008 784 15211 46 {0,1,2} 5 MSLR- WEB10K 10000 1200192 136 {0,1,2,3,4} 5
  • 11. Experimental Study SettingsExperimental Study Settings • The experimental study conducted was a comparison between ES-Rank and 14 other Evolutionary and Machine Learning techniques. • The default parameter settings suggested for the 14 Evolutionary and Machine Learning techniques were used as described in well-known packagesMachine Learning techniques were used as described in well-known packages (Ranklib, rt-Rank, LAGEP, SVM-Rank, Sofia-ml). • ES-Rank uses (1+1)-Evolutionary Strategy with Gaussian*exp(Cauchy) Random Number as the mutation step. The mutation has a random probability (Random Walk) that adapts according to the performance in the previous iteration. The number of evolving iterations was set to 1300 in these experiments. 11
  • 12. Experimental ResultsExperimental Results [Mean Average Precision (MAP) Results][Mean Average Precision (MAP) Results] ES-Rank performs the best in 7 of the 15 data folds . 9
  • 13. Experimental ResultsExperimental Results [Normalized Discounted Cumulative Gain (NDCG) Results][Normalized Discounted Cumulative Gain (NDCG) Results] ES-Rank performs the best in 4 of the 15 data folds. . 13
  • 14. Experimental Results (Time Evaluation)Experimental Results (Time Evaluation) Algorithm MSLR-WEB10K MQ2008 MQ2007 AvgTime (s) RankBoost 3720 15 74 1269.667 RankSVM 32409 19 23 10817 ListNET 18005 45 95 6048.333 AdaRank 3600 11 20 1210.333 MART 1200 8 11 406.3333 Coordinate Ascent 25200 37 240 8492.333 14 Coordinate Ascent 25200 37 240 8492.333 LambdaMART 3720 9 11 1246.667 RankNET 10800 33 96 3643 Random Forest 3660 27 55 1247.333 Linear Regression 157 2 3 54 RankGP 26020 375 390 8928.333 Combined Ranking and Regression 10803 42 51 3632 LambdaRank 18015 46 142 6067.667 IGBRT 36750 274 253 12425.67 ES-Rank 1800 35 51 628.6667
  • 15. Experimental Results (Statistical Summary)Experimental Results (Statistical Summary) Algorithms mean sd se(mean) cv AdaRank 0.4605 0.0758 0.0138 0.1646 Combined.Regression.Ranking 0.4318 0.0406 0.0074 0.0941 Coordinate.Ascent 0.4776 0.0638 0.0116 0.1330 ES-Rank 0.4779 0.0596 0.0109 0.1247 IGBRT 0.4565 0.0563 0.0145 0.1234 LambdaMART 0.4774 0.0634 0.0116 0.1328 15 LambdaMART 0.4774 0.0634 0.0116 0.1328 LambdaRank 0.3247 0.0939 0.0171 0.2892 Linear.Regression 0.4425 0.0527 0.0096 0.1192 ListNET 0.4099 0.1034 0.0189 0.2522 MART 0.4750 0.0624 0.0114 0.1313 Random.Forest 0.4771 0.0661 0.0121 0.1384 RankBoost 0.4621 0.0770 0.0141 0.1666 RankGP 0.4197 0.0372 0.0068 0.0887 RankNET 0.4143 0.1060 0.0194 0.2559 RankSVM 0.4043 0.0707 0.0129 0.1748
  • 16. ConclusionConclusion There are issues in improving IR using learning to rank approaches, particularly on large document collections. The proposes ES-Rank performs well against fourteen other techniques in terms of NDCG@10 and MAP. 16 terms of NDCG@10 and MAP. The proposed ES-Rank considers the whole instance (Query-Document Pairs) in each learning iteration which helps to improve the performance of the evolved ranking model. The computational run-time of ES-Rank is reasonable given its performance in terms of accuracy (third fastest after Linear Regression and MART).