RediSearch (SV Meetup)

•Download as PPTX, PDF•

1 like•378 views

A deep dive on RediSearch, the search engine built as a Redis Module. Originally given at the Silicon Valley Redis / Silicon Valley Big Data joint meetup

Data & Analytics

RediSearch Deep Dive
KYLEDAVIS,REDISLABS

RediSearch can do three things.
Search Aggregate
Autocomplete
Query

• Create a schema using four types
–Text
–Numeric
–Tag
–Geospatial
• Add Documents in Real Time
–Directly
–From Hash
–Index only
• Search & Aggregate
• Delete documents as needed
• Drop the whole index
Data Lifecycle in RediSearch / Search and Aggregate

• Goals
–Intentionally not SQL
–But familiar
–Exposable to end-users
• Simple
–No knowledge of data/structure needed
• Powerful
–With knowledge, zero in on data
Query Language

AND / OR / NOT / Exact Phrase / Geospatial /
Tags / Prefix / Number Ranges / Optional Terms
& more
Query Syntax
And combine them all into one query:
(chev*|ford) -explorer ~truck @year:[2001
2011] @location:[74 40 100 km] @condition:{
good | verygood }

• Stop words:
–”a fox in the woods” -> “fox woods”
• Stemming:
–Query “going” -> find “going” ”go” “gone”
–Arabic, Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese,
Romanian, Russian, Spanish, Swedish, Tamil, Turkish, Chinese
• Slop:
–Query: “glass pitcher”, slop 2 -> “glass gallon beer pitcher”
• With or without content:
–Query: “To be or not to be” -> Hamlet (without the whole play)
• Matched text highlight/summary:
–Query – “To be or not to be” -> Hamlet. <b>To be, or not to be</b>- that is the question
• Synonyms
–Query “Bob” -> Find documents with “Robert”
Full-text Search

• Each field can have a weight which influences the rank in the returned result
• Each document can have a score to influence rank
• Built-in Scoring Functions
–Default: TF-IDF / term frequency–inverse document frequency
• Variant: DOCNORM
• Variant: BM25
–DISMAX (Solr’s default)
–DOCSCORE
–HAMMING for binary payloads
• Fields can be independently sortable, which trumps any in-built scorcer
Scoring, Weights, and Sorting

Aggregations
• Processes and transforms
• Same query language as search
• Can group, sort and apply transformations
• Follows pipeline of composable actions:
Filter Group Apply Sort Apply
Reduce
Reduce

Grouping & Applications
• Reducers:
–COUNT
–COUNT_DISTINCT
–COUNT_DISTINCTISH
–SUM
–MIN
–MAX
–AVG
–STDDEV
–QUANTILE
–TOLIST
–FIRST_VALUE
–RANDOM_SAMPLE
• Manipulate
–Strings
• substr(upper('hello world'),0,3) ->
“HEL”
–Numbers w/ Arithmetic
• sqrt(log(foo) * floor(@bar/baz)) +
(3^@qaz % 6)
–Timestamp to Calendar
• timefmt(@mytimestamp, "%b %d %Y
%H:%M:%S”) -> Feb 24 2018 00:05:48

• In the module, but separate storage
• Radix tree-based, optimized for real-
time, as-you-type completions
• Simple API
–Add a suggestion (FT.SUGADD)
–Get a suggestion (FT.SUGGET)
–Delete a suggestion(FT.SUGDEL)
• Specify or increment “score” of each
item to create custom sortings
Autocomplete/Suggestions

Similar to RediSearch (SV Meetup)

Why MongoDB over other Databases - HabilelabsHabilelabs

Azure Redis Cache - Cache on Steroids!François Boucher

SolrClaudio Devecchi

Drill dchug-29 nov2012MapR Technologies

Solr Masterclass Bangkok, June 2014Alexandre Rafalovitch

Redshift Chartio Event PresentationChartio

SolrCloud on HadoopAlex Moundalexis

Apache Solr crash courseTommaso Teofili

Big data elasticsearch practicalJWORKS powered by Ordina

IRGirish Khanzode

Effective and Efficient Entity Search in RDF dataRoi Blanco

Mba admission in indiaEdhole.com

Jose portillo dev con presentation 1138Jose Portillo

Drill at the Chug 9-19-12Ted Dunning

REDIS327Rajan Bhatt

Drill Bay Area HUG 2012-09-19jasonfrantz

Sep 2012 HUG: Apache Drill for Interactive Analysis Yahoo Developer Network

Oracle by Muhammad IqbalYOUTH MEDIA AGENCY

Important work-arounds for making ASS multi-lingualAxel Faust

How Solr Search WorksAtlogys Technical Consulting

Similar to RediSearch (SV Meetup) (20)

Why MongoDB over other Databases - Habilelabs

Azure Redis Cache - Cache on Steroids!

Solr

Drill dchug-29 nov2012

Solr Masterclass Bangkok, June 2014

Redshift Chartio Event Presentation

SolrCloud on Hadoop

Apache Solr crash course

Big data elasticsearch practical

Effective and Efficient Entity Search in RDF data

Mba admission in india

Jose portillo dev con presentation 1138

Drill at the Chug 9-19-12

REDIS327

Drill Bay Area HUG 2012-09-19

Sep 2012 HUG: Apache Drill for Interactive Analysis

Oracle by Muhammad Iqbal

Important work-arounds for making ASS multi-lingual

How Solr Search Works

Recently uploaded

Sequential and reinforcement learning for demand side management by Margaux B...Paris Women in Machine Learning and Data Science

Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...gajnagarg

Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Riyadh +966572737505 get cytotec

如何办理英国诺森比亚大学毕业证（NU毕业证书）成绩单原件一模一样wsppdmt

怎样办理伦敦大学毕业证（UoL毕业证书）成绩单学校原版复制vexqp

怎样办理旧金山城市学院毕业证（CCSF毕业证书）成绩单学校原版复制vexqp

Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131

In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabiaahmedjiabur940

Gartner's Data Analytics Maturity Model.pptxchadhar227

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795

一比一原版(曼大毕业证书）曼尼托巴大学毕业证成绩单留信学历认证一手价格q6pzkpark

怎样办理伦敦大学城市学院毕业证（CITY毕业证书）成绩单学校原版复制vexqp

Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg

Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATIONLakpaYanziSherpa

Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjurptikerjasaptiker

Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14

Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg

怎样办理圣路易斯大学毕业证（SLU毕业证书）成绩单学校原版复制vexqp

Recently uploaded (20)

Sequential and reinforcement learning for demand side management by Margaux B...

Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

如何办理英国诺森比亚大学毕业证（NU毕业证书）成绩单原件一模一样

怎样办理伦敦大学毕业证（UoL毕业证书）成绩单学校原版复制

怎样办理旧金山城市学院毕业证（CCSF毕业证书）成绩单学校原版复制

Dubai Call Girls Peeing O525547819 Call Girls Dubai

In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia

Gartner's Data Analytics Maturity Model.pptx

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed

一比一原版(曼大毕业证书）曼尼托巴大学毕业证成绩单留信学历认证一手价格

怎样办理伦敦大学城市学院毕业证（CITY毕业证书）成绩单学校原版复制

Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...

Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION

Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur

Lecture_2_Deep_Learning_Overview-newone1

Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...

怎样办理圣路易斯大学毕业证（SLU毕业证书）成绩单学校原版复制

RediSearch (SV Meetup)

1. RediSearch Deep Dive KYLEDAVIS,REDISLABS

2. Beyond Key/Value with Redis

3. RediSearch can do three things. Search Aggregate Autocomplete Query

4. • Create a schema using four types –Text –Numeric –Tag –Geospatial • Add Documents in Real Time –Directly –From Hash –Index only • Search & Aggregate • Delete documents as needed • Drop the whole index Data Lifecycle in RediSearch / Search and Aggregate

5. Search & Query

6. • Goals –Intentionally not SQL –But familiar –Exposable to end-users • Simple –No knowledge of data/structure needed • Powerful –With knowledge, zero in on data Query Language

7. AND / OR / NOT / Exact Phrase / Geospatial / Tags / Prefix / Number Ranges / Optional Terms & more Query Syntax And combine them all into one query: (chev*|ford) -explorer ~truck @year:[2001 2011] @location:[74 40 100 km] @condition:{ good | verygood }

8. • Stop words: –”a fox in the woods” -> “fox woods” • Stemming: –Query “going” -> find “going” ”go” “gone” –Arabic, Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish, Tamil, Turkish, Chinese • Slop: –Query: “glass pitcher”, slop 2 -> “glass gallon beer pitcher” • With or without content: –Query: “To be or not to be” -> Hamlet (without the whole play) • Matched text highlight/summary: –Query – “To be or not to be” -> Hamlet. <b>To be, or not to be</b>- that is the question • Synonyms –Query “Bob” -> Find documents with “Robert” Full-text Search

9. Search & Query Demo

10. • Each field can have a weight which influences the rank in the returned result • Each document can have a score to influence rank • Built-in Scoring Functions –Default: TF-IDF / term frequency–inverse document frequency • Variant: DOCNORM • Variant: BM25 –DISMAX (Solr’s default) –DOCSCORE –HAMMING for binary payloads • Fields can be independently sortable, which trumps any in-built scorcer Scoring, Weights, and Sorting

11. Aggregations

12. Aggregations • Processes and transforms • Same query language as search • Can group, sort and apply transformations • Follows pipeline of composable actions: Filter Group Apply Sort Apply Reduce Reduce

13. Grouping & Applications • Reducers: –COUNT –COUNT_DISTINCT –COUNT_DISTINCTISH –SUM –MIN –MAX –AVG –STDDEV –QUANTILE –TOLIST –FIRST_VALUE –RANDOM_SAMPLE • Manipulate –Strings • substr(upper('hello world'),0,3) -> “HEL” –Numbers w/ Arithmetic • sqrt(log(foo) * floor(@bar/baz)) + (3^@qaz % 6) –Timestamp to Calendar • timefmt(@mytimestamp, "%b %d %Y %H:%M:%S”) -> Feb 24 2018 00:05:48

14. Aggregations Demo

15. Autocomplete

16. • In the module, but separate storage • Radix tree-based, optimized for real- time, as-you-type completions • Simple API –Add a suggestion (FT.SUGADD) –Get a suggestion (FT.SUGGET) –Delete a suggestion(FT.SUGDEL) • Specify or increment “score” of each item to create custom sortings Autocomplete/Suggestions

17. Autocomplete Demo

18. Questions?

Editor's Notes

The query will find a chevy or a ford not an explorer optionally a truck from 2001 to 2011 100 km from that locaiton (NYC) with the condition tags of good or very good
Stop words – remove insignifigant words Stemming – query ”going” matches “go” and “gone” Slop – intervening words Highlighting – retrieve the fragment of the full text surrounding the found words
redisearch.search('@location:[-113.506324209616 53.44783933862182 2 km]',['limit','0','85']); redisearch.search(‘deck @location:[-113.506324209616 53.44783933862182 2 km]',['limit','0','85']); redisearch.search('deck @location:[-113.506324209616 53.44783933862182 2 km]',['return',3,'address','description','construction_value','limit','0','85']); redisearch.search('deck @location:[-113.506324209616 53.44783933862182 2 km] @construction_value:[1 3000]',['return',3,'address','description','construction_value','limit','0','85']); redisearch.search('deck @location:[-113.506324209616 53.44783933862182 2 km] @construction_value:[1 3000]',['return',4,'neighbourhood','address','description','construction_value','limit','0','85']); redisearch.search('deck @location:[-113.506324209616 53.44783933862182 2 km] @construction_value:[1 3000] @neighbourhood:{ TWIN BROOKS | SKYRATTLER }',['return',4,'neighbourhood','address','description','construction_value','limit','0','85']); redisearch.search('deck @location:[-113.506324209616 53.44783933862182 2 km] @construction_value:[1 3000] @neighbourhood:{ TWIN BROOKS | SKYRATTLER }',['return',4,'neighbourhood','address','description','construction_value','SORTBY', 'construction_value', 'limit','0','85']);
redisearch.aggregate('deck @location:[-113.506324209616 53.44783933862182 2 km] @construction_value:[1 3000] @neighbourhood:{ TWIN\ BROOKS | SKYRATTLER }',['GROUPBY',1,'@neighbourhood','REDUCE','SUM',1,'@construction_value','as','total_construction_value']);
Autocomplete is based on a Radix tree (illustrated) Separate from the search indexes – completely optional and customizable based on behaviour or data Scoring goes beyond simple alpha autocomplete Has payloads to be able to have richer autocompletes Three basic commands (SUGADD, SUGGET, SUGDEL) yields a very easy implementation

RediSearch (SV Meetup)

Recommended

Recommended

More Related Content

Similar to RediSearch (SV Meetup)

Similar to RediSearch (SV Meetup) (20)

Recently uploaded

Recently uploaded (20)

RediSearch (SV Meetup)

Editor's Notes