Boosting Documents in Solr by Recency, Popularity and Personal Preferences - By Timothy Potter

•Download as PPT, PDF•

39 likes•26,208 views

See conference video - http://www.lucidimagination.com/devzone/events/conferences/revolution/2011 Attendees with come away from this presentation with a good understanding and access to source code for boosting and/or filtering documents by recency, popularity, and personal preferences. My solution improves upon the common “recipe” based solution for boosting by document age. The framework also supports boosting documents by a popularity score, which is calculated and managed outside the index. I will present a few different ways to calculate popularity in a scalable manner. Lastly, my solution supports the concept of a personal document collection, where each user is only interested in a subset of the total number of documents in the index.

Technology

Boosting Documents in Solr by Recency, Popularity, and User Preferences Timothy Potter [email_address] , May 25, 2011

What I Will Cover ,[object Object],[object Object],[object Object]

My Background ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Boost documents by age ,[object Object],[object Object],[object Object]

Solr: Indexing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

FunctionQuery Basics ,[object Object],[object Object],[object Object],constant literal fieldvalue ord rord sum sub product pow abs log sqrt map scale query linear recip max min ms sqedist - Squared Euclidean Dist hsin, ghhsin - Haversine Formula geohash - Convert to geohash strdist

Solr: Query Time Boost ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Tips and Tricks ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

[object Object],[object Object],[object Object],Boost by Popularity

Solr: ExternalFileField ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Popularity Boost: Nuts & Bolts Logs Solr Server User activity logged View Counting Job solr-home/data/ external_popularity a=1.114 b=1.05 c=1.111 … commit

Popularity Tips & Tricks ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Filtering By User Preferences ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Preferences Component ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Preferences Filter ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Preferences Filter in Action User Preferences Db Solr Server LRU Cache Preferences Component Update Preferences Query with pref.id=123 and pref.mod = TS pref.id & pref.mod If cached mod == pref.mod read from cache SQL to compute excluded categories sources and types

Wrap Up ,[object Object],[object Object],[object Object]

Contact ,[object Object],[object Object],[object Object],[object Object]

What's hot

The Killer Feature Store: Orchestrating Spark ML Pipelines and MLflow for Pro...Databricks

Introduction of Deep Reinforcement LearningNAVER Engineering

Introduction to MLflowDatabricks

Solr Query ParsingErik Hatcher

RLCode와 A3C 쉽고 깊게 이해하기Woong won Lee

강화학습의 개요Dongmin Lee

Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, FlipkartLucidworks

Dense Retrieval with Apache Solr Neural Search.pdfSease

Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit

Drifting Away: Testing ML Models in ProductionDatabricks

elasticsearch_적용 및 활용_정리Junyi Song

Starring sakila my sql university 2009David Paz

A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks

OptimizersIl Gu Yi

Orion Context Broker 20211209Fermin Galan

백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스NAVER D2

NiFi 시작하기Byunghwa Yoon

파이썬과 케라스로 배우는 강화학습 저자특강Woong won Lee

Spark DataFrames and ML PipelinesDatabricks

Parquet and AVROairisData

What's hot (20)

The Killer Feature Store: Orchestrating Spark ML Pipelines and MLflow for Pro...

Introduction of Deep Reinforcement Learning

Introduction to MLflow

Solr Query Parsing

RLCode와 A3C 쉽고 깊게 이해하기

강화학습의 개요

Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, Flipkart

Dense Retrieval with Apache Solr Neural Search.pdf

Presto: Optimizing Performance of SQL-on-Anything Engine

Drifting Away: Testing ML Models in Production

elasticsearch_적용 및 활용_정리

Starring sakila my sql university 2009

A Thorough Comparison of Delta Lake, Iceberg and Hudi

Optimizers

Orion Context Broker 20211209

백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스

NiFi 시작하기

파이썬과 케라스로 배우는 강화학습 저자특강

Spark DataFrames and ML Pipelines

Parquet and AVRO

Viewers also liked

Implementing Click-through Relevance Ranking in Solr and LucidWorks EnterpriseLucidworks (Archived)

Semantic & Multilingual Strategies in Lucene/SolrTrey Grainger

Implementing Click-through Relevance Ranking in Solr and LucidWorks EnterpriseLucidworks (Archived)

Click-through relevance ranking in solr & lucid works enterprise - By Andrz...lucenerevolution

네이버 지식쇼핑과 아마존의 검색결과 페이지네비게이션 유형분석상욱 송

Apache Solr 4 Part 1 - Introduction, Features, Recency Ranking and Popularity...Ramzi Alqrainy

Crowdsourced query augmentation through the semantic discovery of domain spec...Trey Grainger

Query Parsing - Tips and TricksErik Hatcher

Twitter Search Architecture Ramez Al-Fayez

第16回Lucene/Solr勉強会 – ランキングチューニングと定量評価 #SolrJPYahoo!デベロッパーネットワーク

Where Search Meets Machine Learning: Presented by Diana Hu & Joaquin Delgado,...Lucidworks

Building a Real-time Solr-powered Recommendation Enginelucenerevolution

Reflected intelligence evolving self-learning data systemsTrey Grainger

Language support and linguistics in lucene solr & its eco systemlucenerevolution

Hierarchical data models in Relational Databasesnavicorevn

SearchLeeds, Tom Anthony 'The next trilion searches: Intelligent personal ass...Branded3

South Big Data Hub: Text Data Analysis PanelTrey Grainger

The Semantic Knowledge GraphTrey Grainger

Reflected Intelligence: Lucene/Solr as a self-learning data systemTrey Grainger

The Apache Solr Smart Data EcosystemTrey Grainger

Viewers also liked (20)

Implementing Click-through Relevance Ranking in Solr and LucidWorks Enterprise

Semantic & Multilingual Strategies in Lucene/Solr

Implementing Click-through Relevance Ranking in Solr and LucidWorks Enterprise

Click-through relevance ranking in solr & lucid works enterprise - By Andrz...

네이버 지식쇼핑과 아마존의 검색결과 페이지네비게이션 유형분석

Apache Solr 4 Part 1 - Introduction, Features, Recency Ranking and Popularity...

Crowdsourced query augmentation through the semantic discovery of domain spec...

Query Parsing - Tips and Tricks

Twitter Search Architecture

第16回Lucene/Solr勉強会 – ランキングチューニングと定量評価 #SolrJP

Where Search Meets Machine Learning: Presented by Diana Hu & Joaquin Delgado,...

Building a Real-time Solr-powered Recommendation Engine

Reflected intelligence evolving self-learning data systems

Language support and linguistics in lucene solr & its eco system

Hierarchical data models in Relational Databases

SearchLeeds, Tom Anthony 'The next trilion searches: Intelligent personal ass...

South Big Data Hub: Text Data Analysis Panel

The Semantic Knowledge Graph

Reflected Intelligence: Lucene/Solr as a self-learning data system

The Apache Solr Smart Data Ecosystem

Recently uploaded (20)

WSO2's API Vision: Unifying Control, Empowering Developers

Understanding the FAA Part 107 License ..

AI in Action: Real World Use Cases by Anitaraj

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Apidays New York 2024 - The value of a flexible API Management solution for O...

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam

[BuildWithAI] Introduction to Gemini.pdf

Introduction to Multilingual Retrieval Augmented Generation (RAG)

Exploring Multimodal Embeddings with Milvus

presentation ICT roal in 21st century education

CNIC Information System with Pakdata Cf In Pakistan

Why Teams call analytics are critical to your entire business

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...

Strategies for Landing an Oracle DBA Job as a Fresher

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf