SlideShare a Scribd company logo
1 of 28
Dr.Tony Russell-Rose FBCS CITP CEng
1. randomized controlled trial.pt.
2. controlled clinical trial.pt.
3. randomized.ab.
4. placebo.ab.
5. clinical trials as topic.sh.
6. randomly.ab.
7. trial.ti.
8. 1 or 2 or 3 or 4 or 5 or 6 or 7
9. (animals not (humans and animals)).sh.
10. 8 not 9
11. exp Child/
12. ADOLESCENT/
13. exp infant/
14. child hospitalized/
15. adolescent hospitalized/
16. (child$ or infant$ or toddler$ or adolescen$
or teenage$).tw.
Think outside the search box – www.2dsearch.com 2
17. or/11-16
18. Child Nutrition Sciences/
19. exp Dietary Proteins/
20. Dietary Supplements/
21. Dietetics/
22. or/18-21
23. exp Infant, Newborn/
24. exp Overweight/
25. exp Eating Disorders/
26. Athletes/
27. exp Sports/
28. exp Pregnancy/
29. expViruses/
30. (newborn$ or obes$ or "eating disorder$" or
pregnan$ or childbirth or virus$ or influenza).tw.
31. or/23-30
32. 10 and 17 and 22
33. 32 not 31
Think outside the search box – www.2dsearch.com 3
Java AND (Design OR develop OR code OR Program) AND
("* Engineer" OR MTS OR "* Develop*" OR Scientist OR
technologist) AND (J2EE OR Struts OR Spring) AND
(Algorithm OR "Data Structure" OR PS OR “Problem Solving”)
1 A01N0025-004/CPC
2 RODENT OR RAT OR RATS OR
MOUSE OR MICE
3 BAIT OR POISON
4 2 AND 3
5 1 OR 4
6 AVERSIVE OR ADVERSIVE OR
DETER? OR REPEL?
7 NONTARGET OR (NON WITH
TARGET) OR HUMAN OR DOMESTIC OR
PET OR DOG OR CAT
8 6 AND 7
9 8 AND 5
10 BITREX OR DENATONIUM OR
BITREXENE OR BITTERANT OR BITTER
11 10 AND 5
12 9 OR 11
Think outside the search box – www.2dsearch.com 4
(
(
(~Magnolia OR ~Magnolias) AND
('~Paul ~Anderson' OR '~Paul ~Thomas' OR paul* OR thom*
OR anders*)
) OR
'jeremy blackman*' OR jeremy OR Jerem* OR
(
(~incredible OR ~coincidences) AND
(role* OR chance* OR life* OR investigat* OR disturb*)
) OR
(
(~C NOT
('~John C' OR
(
(
('~Mister C' AND ~Shamen) OR
('~Tom C' AND ~Scientology)
)
)
)
)
) OR ~XCs OR ~CV OR ~CVs OR
(
(
(~CKR OR ~CKRs) NOT ('CKR 123' OR lawyer)
)
)
)
(topic:"d5f485b9-aab0-4d95-a83b-28b2decd900c" AND (topic:"ba942587-854e-407d-81dc-812ee29624bd" OR topic:"805a2281-8f04-42b7-a451-e2b7daac4db1" OR topic:"1fc2a4f5-
3f23-4c80-943c-42f789d11644" OR topic:"91e4df78-db55-4d63-be65-2fad1afa77a7" OR organization:"Argos"OR organization:"Sainsbury"OR organization:"Marks and Spencer" OR
organization:"Morrisons" OR organization:"Signet" OR organization:"Rapha"OR organization:"Vodafone" OR organization:"Evans Cycle" OR organization:"The Body Shop" OR
organization:"Homebase" OR organization:"Topps Tiles" OR organization:"Carphone Warehouse"OR organization:"B&Q" OR organization:"Tesco" OR organization:"Lidl"OR
organization:"Aldi"OR organization:"Waitrose"OR organization:"John Lewis" OR organization:"Dixons" OR organization:"Alliance Boots" OR organization:"AS Watson" OR
organization:"Halfords"OR organization:"Vision Express" OR organization:"Pets at Home" OR organization:"Asda" OR organization:"Pret a Manger" OR organization:"Greggs" OR
organization:"Majestic Wine" OR organization:"Ocado" OR organization:"Selfridges" OR organization:"Nike" OR organization:"Adidas" OR organization:"Walmart" OR
organization:"Woolworths"OR organization:"Debenhams" OR organization:"Halfords"OR organization:"Punch Taverns" OR organization:"Monarch Group" OR organization:"Whitbread" OR
organization:"Carnival Cruise" OR organization:"KFC" OR organization:"Yum!" OR organization:"Costa Coffee" OR organization:"Pizza Hut" OR organization:"Greene King" OR
organization:"Carluccio's" OR organization:"Gondola"OR organization:"Starbucks"OR organization:"Sodexo" OR organization:"SSP" OR organization:"Nando's" OR organization:"Avis"OR
organization:"Paddy Power" OR organization:"Easyjet" OR organization:"Emaar" OR organization:"Nuffield Health" OR organization:"Safestore" OR organization:"TUI" OR
organization:"Merlin Entertainment" OR organization:"Virgin Atlantic" OR organization:"IHG" OR organization:"Virgin Active" OR organization:"Thomas Cook" OR organization:"Odeon
Hilton" OR organization:"Pizza Express" OR organization:"Zizzi's" OR organization:"Gemfields"OR organization:"Orlebar Brown" OR organization:"Rebecca Trowell" OR organization:"Spring
Studios" OR organization:"Loewe" OR organization:"Jimmy Choo" OR organization:"Burberry" OR organization:"Conde Nast" OR organization:"Zadig & Voltaire" OR organization:"Prada"OR
organization:"Fendi"OR organization:"French Connection" OR organization:"Lyle & Scott" OR organization:"LK Bennett" OR organization:"Alexander Wang" OR organization:"Sandro"OR
organization:"Isabel Marant" OR organization:"Rag & Bone" OR organization:"J Crew" OR organization:"Maje" OR organization:"Ralph Lauren" OR organization:"Tommy Hilfiger" OR
organization:"Michael Kors" OR organization:"Kenzo"OR organization:"Pringle of Scotland" OR organization:"Heidi Klein" OR organization:"Louis Vuitton" OR organization:"Lanvin" OR
organization:"Loewe" OR organization:"Chanel" OR organization:"Valentino" OR organization:"Celine" OR organization:"Dior" OR organization:"Alexander McQueen" OR
organization:"Victoria Beckham"OR organization:"Sonia Rykel"OR organization:"Bally" OR organization:"Notonthehighstreet" OR organization:"Rupert Sanderson" OR
organization:"Dunhill" OR organization:"De Beers" OR organization:"Swarovski"OR organization:"Hermes" OR organization:"Cartier" OR organization:"David Yurman" OR
organization:"Mulberry" OR organization:"Tiffany & Co" OR organization:"Boucheron" OR organization:"Lancel" OR organization:"Marc Jacobs" OR organization:"Longchamp" OR
organization:"Kate Spade" OR organization:"Salvatore Ferragamo" OR organization:"Asos"OR organization:"Net a porter" OR organization:"Astley Clarke" OR organization:"Fenwick" OR
organization:"Yoox" OR organization:"Harrods" OR organization:"Selfridges" OR organization:"MyTheresa" OR organization:"Lane Crawford" OR organization:"Liberty of London" OR
organization:"Gucci"OR organization:"Armarni"OR organization:"Zara"OR organization:"Topshop"OR organization:"Arcadia" OR organization:"Balenciaga" OR organization:"Stella
McCartney" OR organization:"Brioni"OR organization:"Bottega Veneta" OR organization:"Christopher Kane" OR organization:"Anya Hindmarch" OR organization:"Business of Fashion" OR
organization:"3i" OR organization:"Active Private Equity" OR organization:"Advent International" OR organization:"Advent" OR organization:"ApolloGlobal Management" OR
organization:"Apollo"OR organization:"Apax" OR organization:"Apax" OR organization:"Augmentum Capital"OR organization:"Bain" OR organization:"Endless" OR organization:"BC
Partners" OR organization:"Better Capital" OR organization:"HuttonCollins" OR organization:"Hutton Collins" OR organization:"Bridgepoint" OR organization:"Business Growth Fund" OR
organization:"CDR" OR organization:"Darwin" OR organization:"Duke Street Capital" OR organization:"Electra Partners" OR organization:"Exponent" OR organization:"European Capital" OR
organization:"General Atlantic" OR organization:"Hilco" OR organization:"Inflexion" OR organization:"KKR"OR organization:"LGV"OR organization:"Lion" OR organization:"Magenta
Partners" OR organization:"Montagu" OR organization:"Octopus"OR organization:"Octopus Capital" OR organization:"Piper Private Equity" OR organization:"Piper" OR organization:"Sun
Capital" OR organization:"TA Associates" OR organization:"Warburg Pincus" OR organization:"Charterhouse" OR organization:"Primary Capital" OR organization:"Bowmark"OR
organization:"Beringea" OR organization:"Business Growth Fund" OR organization:"Bridgepoint" OR organization:"Caird Capital" OR organization:"CapVest"OR organization:"Cerberus
Capital" OR organization:"CBPE" OR organization:"CCMP" OR organization:"CDR" OR organization:"Change Capital" OR organization:"Cinven" OR organization:"CVC" OR organization:"Duke
Street" OR organization:"EC1" OR organization:"Electra" OR organization:"Encore" OR organization:"EQT" OR organization:"Equistone" OR organization:"Graphite Capital" OR
organization:"Caledonian"OR organization:"Kings Park Capital" OR organization:"L Capital" OR organization:"LDC"
Think outside the search box – www.2dsearch.com 5
 Keyword search is not good enough
 ‘Advanced’ search is anything but
 Patent, healthcare, recruitment, media…
 Professional search needs to be:
▪ Comprehensive
▪ Repeatable
▪ Transparent
Think outside the search box – www.2dsearch.com 7
Think outside the search box – www.2dsearch.com 8
 Ineffective for communicating structure
 Visual cues needed
 Poor scalability
 No abstraction
 Error-prone
 Syntactic + semantic errors
 Salvador-Oliván, José Antonio, Gonzalo Marco-Cuenca, and Rosario Arquero-Avilés. "Errors in search strategies used in
systematic reviews and their effects on information retrieval." Journal of the Medical Library Association: JMLA 107.2 (2019): 210.
Think outside the search box – www.2dsearch.com 9
 “Errors in the search strategy of a systematic review may undermine the
integrity of the evidence base used in the review”
 Sampson M, McGowan J. Errors in search strategies were identified by type and frequency. J Clin Epidemiol
2006 Oct;59(10):1057-1063
 63 MEDLINE search strategies examined
 90.5% contained ≥1 errors
 82.5% contained errors that could potentially lower recall of relevant studies
 The most common search errors were
▪ missed MeSH terms (44.4%)
▪ unwarranted explosion of MeSH terms (38.1%)
▪ irrelevant MeSH or free text terms (28.6%)
▪ Missed spelling variants, combining MeSH and free text terms in the same line, and failure
to tailor the search strategy for other databases (20.6%)
▪ Logical operator error occurred in 19.0% of searches
Think outside the search box – www.2dsearch.com 10
 What if we could visualize search strategies?
 Use metaphors the user already understands
 Separate keyword selection from query
manipulation
 Provide a ‘scratchpad’ & ‘debugging aids’
 Provide instant feedback
 Support abstraction (levels of composition)
 Provide query suggestions
Think outside the search box – www.2dsearch.com 11
 Concepts are objects on a 2d canvas
 Relationships via direct manipulation
 Syntactically correct expressions
 Transparent semantics
 Arbitrary levels of abstraction
 ‘Lego’ analogy
 Optimisation thru exploration
▪ Execute, enable/disable
▪ Hit counts
Think outside the search box – www.2dsearch.com 12
 Cut, copy, paste
 Single or multiple select
 Lasso
 Undo / redo
 Mapping metaphor
 Conceptual zoom
 Overview+detail
Think outside the search box – www.2dsearch.com 13
 Integrations
 Google, Bing, PubMed, GS,
TRIP, Epistemonikos
 Universal representation
 Auto-translation
 Error-checking
 Interoperability with
legacy formats
 Boolean string I/O
Think outside the search box – www.2dsearch.com 14
 Term selection is common source of inefficiency
& error
 Apply query suggestions?
 (“business analyst” or “systems analyst” or “system
analyst” or “data analyst” or “requirements analyst” or
“functional analyst”) and crystal and report* and analy*
and data near analy* and not inventory and not retail
and not (ecommerce or “e-commerce” or b2b or b2c)
Think outside the search box – www.2dsearch.com 15
Think outside the search box – www.2dsearch.com 16
 Manually curated resources
 Controlled vocabularies, thesauri, etc.
▪ Ontological relations
▪ Terms in abstracts & definitions
 Unsupervised learning
 Pre-trained models
 Bespoke models
 Other techniques
 Terms extracted from search result snippets
 Search result clustering + topic labels
Think outside the search box – www.2dsearch.com 17
 DBPEDIA
 4.58 million entities fromWikipedia, of which 4.22 million are classified in a
consistent ontology
 WEBISA
 11.7 million hypernymy relations extracted from CommonCrawl web corpus
 MeSH
 Controlled vocabulary of 25,186 subject headings for indexing documents in
life sciences
 BNF
 Pharmaceutical reference for medicines available on the NHS
 RxNorm
 Terminology on medications available on the US market
 DrugBank
 Information on ~12,000 drug entries and drug targets
Think outside the search box – www.2dsearch.com 18
 Word2vec (shallow neural network)
 Pre-trained on Google news corpus
 GloVe (unsupervised learning)
 Pre-trained on Wikipedia articles
 FastText (unsupervised learning of character
ngrams)
 Pre-trained on Wikipedia
 Word2vec
 Bespoke models trained on PubMed articles
Think outside the search box – www.2dsearch.com 19
Think outside the search box – www.2dsearch.com 20
T.G. Russell-Rose and Phil Gooch ,"2dSearch: a
visual approach to search strategy
formulation", Proceedings of DESIRES: Design
of Experimental Search & Information
REtrieval Systems, Bertinoro, Italy, 28-31
August 2018
1. Acquire test data from published search strategies
 CLEF 2017 eHealth Lab
 SIGN search filters
 Boolean Search Strings Repository
2. Extract term equivalence sets
 child or infant or toddler or
adolescent or teenage
3. Generate terms using each method
 Calculate precision, recall, F-measure
Think outside the search box – www.2dsearch.com 21
CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571)
P R F P R F P R F
DBpedia 0.026 0.046 0.033 0.024 0.034 0.028 0.019 0.043 0.026
WebISA 0.013 0.010 0.011 0.014 0.009 0.011 0.005 0.004 0.004
MeSH 0.065 0.017 0.027 0.148 0.015 0.027 n/a n/a n/a
RxNorm 0.000 0.000 0.000 0.000 0.000 0.000 n/a n/a n/a
BNF 0.002 0.001 0.001 0.000 0.000 0.000 n/a n/a n/a
DrugBank 0.000 0.000 0.000 0.000 0.000 0.000 n/a n/a n/a
Think outside the search box – www.2dsearch.com 22
CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571)
P R F P R F P R F
NCF regex 0.026 0.030 0.028 0.010 0.011 0.010 0.012 0.023 0.016
Nltk-np 0.017 0.017 0.017 0.011 0.011 0.011 0.018 0.030 0.023
rake 0.005 0.004 0.004 0.004 0.004 0.004 0.003 0.005 0.004
sgrank 0.024 0.023 0.023 0.013 0.014 0.013 0.008 0.011 0.009
Gensim
textrank
0.027 0.017 0.021 0.010 0.009 0.009 0.016 0.014 0.015
Textacy
textrank
0.030 0.033 0.031 0.012 0.014 0.013 0.020 0.024 0.022
Think outside the search box – www.2dsearch.com 23
CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571)
P R F P R F P R F
Word2vec+Google News 0.033 0.037 0.035 0.027 0.025 0.028 0.041 0.035 0.038
GloVe+Wikipedia 0.044 0.047 0.045 0.026 0.030 0.028 0.057 0.047 0.051
FastText+Wikipedia 0.024 0.038 0.029 0.019 0.016 0.017 0.024 0.018 0.021
Word2vec+PubMed (win2) 0.057 0.062 0.059 0.026 0.028 0.027 n/a n/a n/a
Word2vec+PubMed (win30) 0.069 0.073 0.071 0.028 0.033 0.030 n/a n/a n/a
Bespoke word2vec +PubMed, unigrams 0.071 0.075 0.073 0.038 0.040 0.039 n/a n/a n/a
Bespoke word2vec +PubMed, trigrams 0.069 0.072 0.072 0.042 0.040 0.041 n/a n/a n/a
Think outside the search box – www.2dsearch.com 24
Recruitment (n=571)
P R F
DBpedia (alone) 0.019 0.043 0.026
GloVe+Wikipedia (alone) 0.057 0.047 0.051
Aggregated 0.030 0.081 0.044
Think outside the search box – www.2dsearch.com 25
CLEF 2017 (n=898) SIGN (n=355)
P R F P R F
MeSH (alone) 0.065 0.017 0.027 0.148 0.015 0.027
Bespoke PubMed trigram (alone) 0.071 0.075 0.073 0.042 0.040 0.041
Aggregated 0.082 0.081 0.081 0.075 0.035 0.048
CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571)
P R F P R F P R F
Curated ontology 0.065 0.017 0.027 0.148 0.015 0.027 0.019 0.043 0.026
Word embedding 0.071 0.075 0.073 0.042 0.040 0.041 0.057 0.047 0.051
Simple aggregation 0.082 0.081 0.081 0.073 0.074 0.035 0.030 0.081 0.044
Loose pipelining 0.083 0.081 0.082 0.075 0.035 0.048 0.061 0.069 0.065
Strict pipelining 0.100 0.076 0.086 0.135 0.032 0.052 0.065 0.068 0.066
Think outside the search box – www.2dsearch.com 26
 Curated resources used for higher order ngrams
(bigrams and above), ML methods for unigrams
 Manually curated resources
 Outperformed baseline for healthcare but not for recruitment
 Terms extracted from abstracts and definitions not effective
 Machine learning methods
 Outperformed baseline for all data sets
 Outperformed manually-curated resources
 Bespoke Pubmed model performed best
 Combinations: exploit ngram order
 Simple aggregation produced highest recall
 Strict pipelining produced highest P and highest F
Think outside the search box – www.2dsearch.com 27
 User experience
 Visual methods vs. traditional query builders
 Quality of query suggestions
 Data science
 Semi-automated strategy generation (e.g. from job specs)
 Investigation of ConceptNet, Wikidata, etc.
 Combining set retrieval with ranked retrieval
 Contextual language modelling (BERT, ELMo, etc.)
 Try it out: https://www.2dsearch.com
Think outside the search box – www.2dsearch.com 28
Dr.Tony Russell-Rose, FBCS CITP CEng
Founder: 2Dsearch, Director: UXLabs
RAEVisiting Professor, Essex University
Senior Lecturer, Goldsmiths
 Web: http://www.2dsearch.com
 Email: info@2dsearch.com
 Blog: http://isquared.wordpress.com
 LinkedIn: http://uk.linkedin.com/in/tonyrussellrose
 Twitter: @tonygrr, @2d_search
30Think outside the search box – www.2dsearch.com

More Related Content

Similar to NLP techniques for automated query suggestions

Fishing for pearls Babraham PhD
Fishing for pearls Babraham PhDFishing for pearls Babraham PhD
Fishing for pearls Babraham PhDIsla Kuhn
 
CrossRef Plagiarism Checking
CrossRef Plagiarism CheckingCrossRef Plagiarism Checking
CrossRef Plagiarism CheckingCarol Anne Meyer
 
Medical Self-Care issue # 1, 1976
Medical Self-Care issue # 1, 1976Medical Self-Care issue # 1, 1976
Medical Self-Care issue # 1, 1976Gilles Frydman
 
Systematic reviewing oct2011
Systematic reviewing oct2011Systematic reviewing oct2011
Systematic reviewing oct2011Isla Kuhn
 
How to use Internet resources in Pediatric Practice
How to use Internet resources in Pediatric PracticeHow to use Internet resources in Pediatric Practice
How to use Internet resources in Pediatric PracticeYasser Sami Abdel Dayem Amer
 
100-cases-in-paediatrics.pdf
100-cases-in-paediatrics.pdf100-cases-in-paediatrics.pdf
100-cases-in-paediatrics.pdfjluisnu
 
Biol161 01
Biol161 01Biol161 01
Biol161 01gfb1
 
Kyle Jensen's MIT Ph.D. Thesis Proposal
Kyle Jensen's MIT Ph.D. Thesis ProposalKyle Jensen's MIT Ph.D. Thesis Proposal
Kyle Jensen's MIT Ph.D. Thesis ProposalKyle Jensen
 
Werner's Syndrome Keara and Tianna BIO 408 505
Werner's Syndrome Keara and Tianna BIO 408 505Werner's Syndrome Keara and Tianna BIO 408 505
Werner's Syndrome Keara and Tianna BIO 408 505Keara Coffield
 
Nature nurture
Nature nurtureNature nurture
Nature nurtureirenek
 
YourGenes,YourChoicesby Catherine BakerExplorin.docx
YourGenes,YourChoicesby Catherine BakerExplorin.docxYourGenes,YourChoicesby Catherine BakerExplorin.docx
YourGenes,YourChoicesby Catherine BakerExplorin.docxdanielfoster65629
 
12.08.08: Common Musculoskeletal Problems
12.08.08: Common Musculoskeletal Problems12.08.08: Common Musculoskeletal Problems
12.08.08: Common Musculoskeletal ProblemsOpen.Michigan
 
Preventative Health and Screening in General Practice: the 4 Step guide to me...
Preventative Health and Screening in General Practice: the 4 Step guide to me...Preventative Health and Screening in General Practice: the 4 Step guide to me...
Preventative Health and Screening in General Practice: the 4 Step guide to me...SOgnenis
 
ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014
ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014
ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014Louis Cady, MD
 
Ideals and Norms in Scholarship
Ideals and Norms in ScholarshipIdeals and Norms in Scholarship
Ideals and Norms in ScholarshipPaul Groth
 

Similar to NLP techniques for automated query suggestions (20)

Fishing for pearls Babraham PhD
Fishing for pearls Babraham PhDFishing for pearls Babraham PhD
Fishing for pearls Babraham PhD
 
Telling Stories with Data
Telling Stories with DataTelling Stories with Data
Telling Stories with Data
 
CrossRef Plagiarism Checking
CrossRef Plagiarism CheckingCrossRef Plagiarism Checking
CrossRef Plagiarism Checking
 
Medical Self-Care issue # 1, 1976
Medical Self-Care issue # 1, 1976Medical Self-Care issue # 1, 1976
Medical Self-Care issue # 1, 1976
 
Systematic reviewing oct2011
Systematic reviewing oct2011Systematic reviewing oct2011
Systematic reviewing oct2011
 
How to use Internet resources in Pediatric Practice
How to use Internet resources in Pediatric PracticeHow to use Internet resources in Pediatric Practice
How to use Internet resources in Pediatric Practice
 
The BRCA Challenge - John Burn
The BRCA Challenge - John BurnThe BRCA Challenge - John Burn
The BRCA Challenge - John Burn
 
Obstetrics and gynecology
Obstetrics and gynecologyObstetrics and gynecology
Obstetrics and gynecology
 
100-cases-in-paediatrics.pdf
100-cases-in-paediatrics.pdf100-cases-in-paediatrics.pdf
100-cases-in-paediatrics.pdf
 
Biol161 01
Biol161 01Biol161 01
Biol161 01
 
Kyle Jensen's MIT Ph.D. Thesis Proposal
Kyle Jensen's MIT Ph.D. Thesis ProposalKyle Jensen's MIT Ph.D. Thesis Proposal
Kyle Jensen's MIT Ph.D. Thesis Proposal
 
Werner's Syndrome Keara and Tianna BIO 408 505
Werner's Syndrome Keara and Tianna BIO 408 505Werner's Syndrome Keara and Tianna BIO 408 505
Werner's Syndrome Keara and Tianna BIO 408 505
 
Nature nurture
Nature nurtureNature nurture
Nature nurture
 
YourGenes,YourChoicesby Catherine BakerExplorin.docx
YourGenes,YourChoicesby Catherine BakerExplorin.docxYourGenes,YourChoicesby Catherine BakerExplorin.docx
YourGenes,YourChoicesby Catherine BakerExplorin.docx
 
12.08.08: Common Musculoskeletal Problems
12.08.08: Common Musculoskeletal Problems12.08.08: Common Musculoskeletal Problems
12.08.08: Common Musculoskeletal Problems
 
Preventative Health and Screening in General Practice: the 4 Step guide to me...
Preventative Health and Screening in General Practice: the 4 Step guide to me...Preventative Health and Screening in General Practice: the 4 Step guide to me...
Preventative Health and Screening in General Practice: the 4 Step guide to me...
 
ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014
ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014
ADHD - Diagnoses, Epidemiology & Precision Treatment - IMMH 2014
 
دوا ذكا
دوا ذكادوا ذكا
دوا ذكا
 
Archimedes Essay
Archimedes EssayArchimedes Essay
Archimedes Essay
 
Ideals and Norms in Scholarship
Ideals and Norms in ScholarshipIdeals and Norms in Scholarship
Ideals and Norms in Scholarship
 

More from Tony Russell-Rose

Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingTony Russell-Rose
 
Designing the Search Experience: The Language of Discovery
Designing the Search Experience: The Language of DiscoveryDesigning the Search Experience: The Language of Discovery
Designing the Search Experience: The Language of DiscoveryTony Russell-Rose
 
User requirements for complex search strategies
User requirements for complex search strategiesUser requirements for complex search strategies
User requirements for complex search strategiesTony Russell-Rose
 
Sentiment analysis in healthcare
Sentiment analysis in healthcareSentiment analysis in healthcare
Sentiment analysis in healthcareTony Russell-Rose
 
A Model of Consumer Search Behaviour
A Model of Consumer Search BehaviourA Model of Consumer Search Behaviour
A Model of Consumer Search BehaviourTony Russell-Rose
 
A taxonomy of search strategies and their design implications
A taxonomy of search strategies and their design implicationsA taxonomy of search strategies and their design implications
A taxonomy of search strategies and their design implicationsTony Russell-Rose
 
From search to discovery: Information search strategies and design solutions
From search to discovery: Information search strategies and design solutionsFrom search to discovery: Information search strategies and design solutions
From search to discovery: Information search strategies and design solutionsTony Russell-Rose
 
The Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalThe Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalTony Russell-Rose
 
Text Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowText Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowTony Russell-Rose
 
UI Design Patterns for Search & Information Discovery
UI Design Patterns for Search & Information DiscoveryUI Design Patterns for Search & Information Discovery
UI Design Patterns for Search & Information DiscoveryTony Russell-Rose
 

More from Tony Russell-Rose (13)

Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Designing the Search Experience: The Language of Discovery
Designing the Search Experience: The Language of DiscoveryDesigning the Search Experience: The Language of Discovery
Designing the Search Experience: The Language of Discovery
 
User requirements for complex search strategies
User requirements for complex search strategiesUser requirements for complex search strategies
User requirements for complex search strategies
 
Sentiment analysis in healthcare
Sentiment analysis in healthcareSentiment analysis in healthcare
Sentiment analysis in healthcare
 
A Model of Consumer Search Behaviour
A Model of Consumer Search BehaviourA Model of Consumer Search Behaviour
A Model of Consumer Search Behaviour
 
A Taxonomy of Site Search
A Taxonomy of Site SearchA Taxonomy of Site Search
A Taxonomy of Site Search
 
Patterns of Personalization
Patterns of PersonalizationPatterns of Personalization
Patterns of Personalization
 
A taxonomy of search strategies and their design implications
A taxonomy of search strategies and their design implicationsA taxonomy of search strategies and their design implications
A taxonomy of search strategies and their design implications
 
From search to discovery: Information search strategies and design solutions
From search to discovery: Information search strategies and design solutionsFrom search to discovery: Information search strategies and design solutions
From search to discovery: Information search strategies and design solutions
 
The Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalThe Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information Retrieval
 
From Search to Discovery
From Search to DiscoveryFrom Search to Discovery
From Search to Discovery
 
Text Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowText Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and Tomorrow
 
UI Design Patterns for Search & Information Discovery
UI Design Patterns for Search & Information DiscoveryUI Design Patterns for Search & Information Discovery
UI Design Patterns for Search & Information Discovery
 

Recently uploaded

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 

Recently uploaded (20)

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 

NLP techniques for automated query suggestions

  • 2. 1. randomized controlled trial.pt. 2. controlled clinical trial.pt. 3. randomized.ab. 4. placebo.ab. 5. clinical trials as topic.sh. 6. randomly.ab. 7. trial.ti. 8. 1 or 2 or 3 or 4 or 5 or 6 or 7 9. (animals not (humans and animals)).sh. 10. 8 not 9 11. exp Child/ 12. ADOLESCENT/ 13. exp infant/ 14. child hospitalized/ 15. adolescent hospitalized/ 16. (child$ or infant$ or toddler$ or adolescen$ or teenage$).tw. Think outside the search box – www.2dsearch.com 2 17. or/11-16 18. Child Nutrition Sciences/ 19. exp Dietary Proteins/ 20. Dietary Supplements/ 21. Dietetics/ 22. or/18-21 23. exp Infant, Newborn/ 24. exp Overweight/ 25. exp Eating Disorders/ 26. Athletes/ 27. exp Sports/ 28. exp Pregnancy/ 29. expViruses/ 30. (newborn$ or obes$ or "eating disorder$" or pregnan$ or childbirth or virus$ or influenza).tw. 31. or/23-30 32. 10 and 17 and 22 33. 32 not 31
  • 3. Think outside the search box – www.2dsearch.com 3 Java AND (Design OR develop OR code OR Program) AND ("* Engineer" OR MTS OR "* Develop*" OR Scientist OR technologist) AND (J2EE OR Struts OR Spring) AND (Algorithm OR "Data Structure" OR PS OR “Problem Solving”)
  • 4. 1 A01N0025-004/CPC 2 RODENT OR RAT OR RATS OR MOUSE OR MICE 3 BAIT OR POISON 4 2 AND 3 5 1 OR 4 6 AVERSIVE OR ADVERSIVE OR DETER? OR REPEL? 7 NONTARGET OR (NON WITH TARGET) OR HUMAN OR DOMESTIC OR PET OR DOG OR CAT 8 6 AND 7 9 8 AND 5 10 BITREX OR DENATONIUM OR BITREXENE OR BITTERANT OR BITTER 11 10 AND 5 12 9 OR 11 Think outside the search box – www.2dsearch.com 4 ( ( (~Magnolia OR ~Magnolias) AND ('~Paul ~Anderson' OR '~Paul ~Thomas' OR paul* OR thom* OR anders*) ) OR 'jeremy blackman*' OR jeremy OR Jerem* OR ( (~incredible OR ~coincidences) AND (role* OR chance* OR life* OR investigat* OR disturb*) ) OR ( (~C NOT ('~John C' OR ( ( ('~Mister C' AND ~Shamen) OR ('~Tom C' AND ~Scientology) ) ) ) ) ) OR ~XCs OR ~CV OR ~CVs OR ( ( (~CKR OR ~CKRs) NOT ('CKR 123' OR lawyer) ) ) )
  • 5. (topic:"d5f485b9-aab0-4d95-a83b-28b2decd900c" AND (topic:"ba942587-854e-407d-81dc-812ee29624bd" OR topic:"805a2281-8f04-42b7-a451-e2b7daac4db1" OR topic:"1fc2a4f5- 3f23-4c80-943c-42f789d11644" OR topic:"91e4df78-db55-4d63-be65-2fad1afa77a7" OR organization:"Argos"OR organization:"Sainsbury"OR organization:"Marks and Spencer" OR organization:"Morrisons" OR organization:"Signet" OR organization:"Rapha"OR organization:"Vodafone" OR organization:"Evans Cycle" OR organization:"The Body Shop" OR organization:"Homebase" OR organization:"Topps Tiles" OR organization:"Carphone Warehouse"OR organization:"B&Q" OR organization:"Tesco" OR organization:"Lidl"OR organization:"Aldi"OR organization:"Waitrose"OR organization:"John Lewis" OR organization:"Dixons" OR organization:"Alliance Boots" OR organization:"AS Watson" OR organization:"Halfords"OR organization:"Vision Express" OR organization:"Pets at Home" OR organization:"Asda" OR organization:"Pret a Manger" OR organization:"Greggs" OR organization:"Majestic Wine" OR organization:"Ocado" OR organization:"Selfridges" OR organization:"Nike" OR organization:"Adidas" OR organization:"Walmart" OR organization:"Woolworths"OR organization:"Debenhams" OR organization:"Halfords"OR organization:"Punch Taverns" OR organization:"Monarch Group" OR organization:"Whitbread" OR organization:"Carnival Cruise" OR organization:"KFC" OR organization:"Yum!" OR organization:"Costa Coffee" OR organization:"Pizza Hut" OR organization:"Greene King" OR organization:"Carluccio's" OR organization:"Gondola"OR organization:"Starbucks"OR organization:"Sodexo" OR organization:"SSP" OR organization:"Nando's" OR organization:"Avis"OR organization:"Paddy Power" OR organization:"Easyjet" OR organization:"Emaar" OR organization:"Nuffield Health" OR organization:"Safestore" OR organization:"TUI" OR organization:"Merlin Entertainment" OR organization:"Virgin Atlantic" OR organization:"IHG" OR organization:"Virgin Active" OR organization:"Thomas Cook" OR organization:"Odeon Hilton" OR organization:"Pizza Express" OR organization:"Zizzi's" OR organization:"Gemfields"OR organization:"Orlebar Brown" OR organization:"Rebecca Trowell" OR organization:"Spring Studios" OR organization:"Loewe" OR organization:"Jimmy Choo" OR organization:"Burberry" OR organization:"Conde Nast" OR organization:"Zadig & Voltaire" OR organization:"Prada"OR organization:"Fendi"OR organization:"French Connection" OR organization:"Lyle & Scott" OR organization:"LK Bennett" OR organization:"Alexander Wang" OR organization:"Sandro"OR organization:"Isabel Marant" OR organization:"Rag & Bone" OR organization:"J Crew" OR organization:"Maje" OR organization:"Ralph Lauren" OR organization:"Tommy Hilfiger" OR organization:"Michael Kors" OR organization:"Kenzo"OR organization:"Pringle of Scotland" OR organization:"Heidi Klein" OR organization:"Louis Vuitton" OR organization:"Lanvin" OR organization:"Loewe" OR organization:"Chanel" OR organization:"Valentino" OR organization:"Celine" OR organization:"Dior" OR organization:"Alexander McQueen" OR organization:"Victoria Beckham"OR organization:"Sonia Rykel"OR organization:"Bally" OR organization:"Notonthehighstreet" OR organization:"Rupert Sanderson" OR organization:"Dunhill" OR organization:"De Beers" OR organization:"Swarovski"OR organization:"Hermes" OR organization:"Cartier" OR organization:"David Yurman" OR organization:"Mulberry" OR organization:"Tiffany & Co" OR organization:"Boucheron" OR organization:"Lancel" OR organization:"Marc Jacobs" OR organization:"Longchamp" OR organization:"Kate Spade" OR organization:"Salvatore Ferragamo" OR organization:"Asos"OR organization:"Net a porter" OR organization:"Astley Clarke" OR organization:"Fenwick" OR organization:"Yoox" OR organization:"Harrods" OR organization:"Selfridges" OR organization:"MyTheresa" OR organization:"Lane Crawford" OR organization:"Liberty of London" OR organization:"Gucci"OR organization:"Armarni"OR organization:"Zara"OR organization:"Topshop"OR organization:"Arcadia" OR organization:"Balenciaga" OR organization:"Stella McCartney" OR organization:"Brioni"OR organization:"Bottega Veneta" OR organization:"Christopher Kane" OR organization:"Anya Hindmarch" OR organization:"Business of Fashion" OR organization:"3i" OR organization:"Active Private Equity" OR organization:"Advent International" OR organization:"Advent" OR organization:"ApolloGlobal Management" OR organization:"Apollo"OR organization:"Apax" OR organization:"Apax" OR organization:"Augmentum Capital"OR organization:"Bain" OR organization:"Endless" OR organization:"BC Partners" OR organization:"Better Capital" OR organization:"HuttonCollins" OR organization:"Hutton Collins" OR organization:"Bridgepoint" OR organization:"Business Growth Fund" OR organization:"CDR" OR organization:"Darwin" OR organization:"Duke Street Capital" OR organization:"Electra Partners" OR organization:"Exponent" OR organization:"European Capital" OR organization:"General Atlantic" OR organization:"Hilco" OR organization:"Inflexion" OR organization:"KKR"OR organization:"LGV"OR organization:"Lion" OR organization:"Magenta Partners" OR organization:"Montagu" OR organization:"Octopus"OR organization:"Octopus Capital" OR organization:"Piper Private Equity" OR organization:"Piper" OR organization:"Sun Capital" OR organization:"TA Associates" OR organization:"Warburg Pincus" OR organization:"Charterhouse" OR organization:"Primary Capital" OR organization:"Bowmark"OR organization:"Beringea" OR organization:"Business Growth Fund" OR organization:"Bridgepoint" OR organization:"Caird Capital" OR organization:"CapVest"OR organization:"Cerberus Capital" OR organization:"CBPE" OR organization:"CCMP" OR organization:"CDR" OR organization:"Change Capital" OR organization:"Cinven" OR organization:"CVC" OR organization:"Duke Street" OR organization:"EC1" OR organization:"Electra" OR organization:"Encore" OR organization:"EQT" OR organization:"Equistone" OR organization:"Graphite Capital" OR organization:"Caledonian"OR organization:"Kings Park Capital" OR organization:"L Capital" OR organization:"LDC" Think outside the search box – www.2dsearch.com 5
  • 6.  Keyword search is not good enough  ‘Advanced’ search is anything but  Patent, healthcare, recruitment, media…  Professional search needs to be: ▪ Comprehensive ▪ Repeatable ▪ Transparent Think outside the search box – www.2dsearch.com 7
  • 7. Think outside the search box – www.2dsearch.com 8
  • 8.  Ineffective for communicating structure  Visual cues needed  Poor scalability  No abstraction  Error-prone  Syntactic + semantic errors  Salvador-Oliván, José Antonio, Gonzalo Marco-Cuenca, and Rosario Arquero-Avilés. "Errors in search strategies used in systematic reviews and their effects on information retrieval." Journal of the Medical Library Association: JMLA 107.2 (2019): 210. Think outside the search box – www.2dsearch.com 9
  • 9.  “Errors in the search strategy of a systematic review may undermine the integrity of the evidence base used in the review”  Sampson M, McGowan J. Errors in search strategies were identified by type and frequency. J Clin Epidemiol 2006 Oct;59(10):1057-1063  63 MEDLINE search strategies examined  90.5% contained ≥1 errors  82.5% contained errors that could potentially lower recall of relevant studies  The most common search errors were ▪ missed MeSH terms (44.4%) ▪ unwarranted explosion of MeSH terms (38.1%) ▪ irrelevant MeSH or free text terms (28.6%) ▪ Missed spelling variants, combining MeSH and free text terms in the same line, and failure to tailor the search strategy for other databases (20.6%) ▪ Logical operator error occurred in 19.0% of searches Think outside the search box – www.2dsearch.com 10
  • 10.  What if we could visualize search strategies?  Use metaphors the user already understands  Separate keyword selection from query manipulation  Provide a ‘scratchpad’ & ‘debugging aids’  Provide instant feedback  Support abstraction (levels of composition)  Provide query suggestions Think outside the search box – www.2dsearch.com 11
  • 11.  Concepts are objects on a 2d canvas  Relationships via direct manipulation  Syntactically correct expressions  Transparent semantics  Arbitrary levels of abstraction  ‘Lego’ analogy  Optimisation thru exploration ▪ Execute, enable/disable ▪ Hit counts Think outside the search box – www.2dsearch.com 12
  • 12.  Cut, copy, paste  Single or multiple select  Lasso  Undo / redo  Mapping metaphor  Conceptual zoom  Overview+detail Think outside the search box – www.2dsearch.com 13
  • 13.  Integrations  Google, Bing, PubMed, GS, TRIP, Epistemonikos  Universal representation  Auto-translation  Error-checking  Interoperability with legacy formats  Boolean string I/O Think outside the search box – www.2dsearch.com 14
  • 14.  Term selection is common source of inefficiency & error  Apply query suggestions?  (“business analyst” or “systems analyst” or “system analyst” or “data analyst” or “requirements analyst” or “functional analyst”) and crystal and report* and analy* and data near analy* and not inventory and not retail and not (ecommerce or “e-commerce” or b2b or b2c) Think outside the search box – www.2dsearch.com 15
  • 15. Think outside the search box – www.2dsearch.com 16
  • 16.  Manually curated resources  Controlled vocabularies, thesauri, etc. ▪ Ontological relations ▪ Terms in abstracts & definitions  Unsupervised learning  Pre-trained models  Bespoke models  Other techniques  Terms extracted from search result snippets  Search result clustering + topic labels Think outside the search box – www.2dsearch.com 17
  • 17.  DBPEDIA  4.58 million entities fromWikipedia, of which 4.22 million are classified in a consistent ontology  WEBISA  11.7 million hypernymy relations extracted from CommonCrawl web corpus  MeSH  Controlled vocabulary of 25,186 subject headings for indexing documents in life sciences  BNF  Pharmaceutical reference for medicines available on the NHS  RxNorm  Terminology on medications available on the US market  DrugBank  Information on ~12,000 drug entries and drug targets Think outside the search box – www.2dsearch.com 18
  • 18.  Word2vec (shallow neural network)  Pre-trained on Google news corpus  GloVe (unsupervised learning)  Pre-trained on Wikipedia articles  FastText (unsupervised learning of character ngrams)  Pre-trained on Wikipedia  Word2vec  Bespoke models trained on PubMed articles Think outside the search box – www.2dsearch.com 19
  • 19. Think outside the search box – www.2dsearch.com 20 T.G. Russell-Rose and Phil Gooch ,"2dSearch: a visual approach to search strategy formulation", Proceedings of DESIRES: Design of Experimental Search & Information REtrieval Systems, Bertinoro, Italy, 28-31 August 2018
  • 20. 1. Acquire test data from published search strategies  CLEF 2017 eHealth Lab  SIGN search filters  Boolean Search Strings Repository 2. Extract term equivalence sets  child or infant or toddler or adolescent or teenage 3. Generate terms using each method  Calculate precision, recall, F-measure Think outside the search box – www.2dsearch.com 21
  • 21. CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571) P R F P R F P R F DBpedia 0.026 0.046 0.033 0.024 0.034 0.028 0.019 0.043 0.026 WebISA 0.013 0.010 0.011 0.014 0.009 0.011 0.005 0.004 0.004 MeSH 0.065 0.017 0.027 0.148 0.015 0.027 n/a n/a n/a RxNorm 0.000 0.000 0.000 0.000 0.000 0.000 n/a n/a n/a BNF 0.002 0.001 0.001 0.000 0.000 0.000 n/a n/a n/a DrugBank 0.000 0.000 0.000 0.000 0.000 0.000 n/a n/a n/a Think outside the search box – www.2dsearch.com 22
  • 22. CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571) P R F P R F P R F NCF regex 0.026 0.030 0.028 0.010 0.011 0.010 0.012 0.023 0.016 Nltk-np 0.017 0.017 0.017 0.011 0.011 0.011 0.018 0.030 0.023 rake 0.005 0.004 0.004 0.004 0.004 0.004 0.003 0.005 0.004 sgrank 0.024 0.023 0.023 0.013 0.014 0.013 0.008 0.011 0.009 Gensim textrank 0.027 0.017 0.021 0.010 0.009 0.009 0.016 0.014 0.015 Textacy textrank 0.030 0.033 0.031 0.012 0.014 0.013 0.020 0.024 0.022 Think outside the search box – www.2dsearch.com 23
  • 23. CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571) P R F P R F P R F Word2vec+Google News 0.033 0.037 0.035 0.027 0.025 0.028 0.041 0.035 0.038 GloVe+Wikipedia 0.044 0.047 0.045 0.026 0.030 0.028 0.057 0.047 0.051 FastText+Wikipedia 0.024 0.038 0.029 0.019 0.016 0.017 0.024 0.018 0.021 Word2vec+PubMed (win2) 0.057 0.062 0.059 0.026 0.028 0.027 n/a n/a n/a Word2vec+PubMed (win30) 0.069 0.073 0.071 0.028 0.033 0.030 n/a n/a n/a Bespoke word2vec +PubMed, unigrams 0.071 0.075 0.073 0.038 0.040 0.039 n/a n/a n/a Bespoke word2vec +PubMed, trigrams 0.069 0.072 0.072 0.042 0.040 0.041 n/a n/a n/a Think outside the search box – www.2dsearch.com 24
  • 24. Recruitment (n=571) P R F DBpedia (alone) 0.019 0.043 0.026 GloVe+Wikipedia (alone) 0.057 0.047 0.051 Aggregated 0.030 0.081 0.044 Think outside the search box – www.2dsearch.com 25 CLEF 2017 (n=898) SIGN (n=355) P R F P R F MeSH (alone) 0.065 0.017 0.027 0.148 0.015 0.027 Bespoke PubMed trigram (alone) 0.071 0.075 0.073 0.042 0.040 0.041 Aggregated 0.082 0.081 0.081 0.075 0.035 0.048
  • 25. CLEF 2017 (n=898) SIGN (n=355) Recruitment (n=571) P R F P R F P R F Curated ontology 0.065 0.017 0.027 0.148 0.015 0.027 0.019 0.043 0.026 Word embedding 0.071 0.075 0.073 0.042 0.040 0.041 0.057 0.047 0.051 Simple aggregation 0.082 0.081 0.081 0.073 0.074 0.035 0.030 0.081 0.044 Loose pipelining 0.083 0.081 0.082 0.075 0.035 0.048 0.061 0.069 0.065 Strict pipelining 0.100 0.076 0.086 0.135 0.032 0.052 0.065 0.068 0.066 Think outside the search box – www.2dsearch.com 26  Curated resources used for higher order ngrams (bigrams and above), ML methods for unigrams
  • 26.  Manually curated resources  Outperformed baseline for healthcare but not for recruitment  Terms extracted from abstracts and definitions not effective  Machine learning methods  Outperformed baseline for all data sets  Outperformed manually-curated resources  Bespoke Pubmed model performed best  Combinations: exploit ngram order  Simple aggregation produced highest recall  Strict pipelining produced highest P and highest F Think outside the search box – www.2dsearch.com 27
  • 27.  User experience  Visual methods vs. traditional query builders  Quality of query suggestions  Data science  Semi-automated strategy generation (e.g. from job specs)  Investigation of ConceptNet, Wikidata, etc.  Combining set retrieval with ranked retrieval  Contextual language modelling (BERT, ELMo, etc.)  Try it out: https://www.2dsearch.com Think outside the search box – www.2dsearch.com 28
  • 28. Dr.Tony Russell-Rose, FBCS CITP CEng Founder: 2Dsearch, Director: UXLabs RAEVisiting Professor, Essex University Senior Lecturer, Goldsmiths  Web: http://www.2dsearch.com  Email: info@2dsearch.com  Blog: http://isquared.wordpress.com  LinkedIn: http://uk.linkedin.com/in/tonyrussellrose  Twitter: @tonygrr, @2d_search 30Think outside the search box – www.2dsearch.com