A Knowledge Base
of Entity-Oriented Search Intents
Darío Garigliotti

February 2, 2018
A brief introduction to...
- Entities

- Entity types

- RDF tuples

- Knowledge bases

- Search intents
Recalling from the past
Entities
- An entity is an individual or thing, uniquely
identified

- For example:
Henrik Ibsen
Stavanger
- Also:
Pythagorean Theorem
UEFA Champions League
Entity types
- A typical property of an entity is its type(s)

- An entity type is a semantic class grouping
multiple entities

(Henrik Ibsen, is a, writer)
(Henrik Ibsen, is a, Norwegian writer)
(Henrik Ibsen, is a, person)
Tuples
- We describe entity properties using triples 

- Attributes
(Henrik Ibsen, birthdate, 20 March 1828)
- Types
(Henrik Ibsen, is a, writer)
- Relations
(Henrik Ibsen, work, A Doll’s House)
- RDF (Resource Description Framework) 

- A way to represent structured knowledge
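A minimal sketch of how the example triples above could be held in memory, assuming a plain (subject, predicate, object) layout. The names `Triple`, `TRIPLES` and `objects_of` are illustrative only; they are not part of RDF or of any library.

```python
from typing import List, NamedTuple


class Triple(NamedTuple):
    """One (subject, predicate, object) statement about an entity."""
    subject: str
    predicate: str
    obj: str


# The three example statements from the slide: an attribute, a type, a relation.
TRIPLES = [
    Triple("Henrik Ibsen", "birthdate", "20 March 1828"),
    Triple("Henrik Ibsen", "is a", "writer"),
    Triple("Henrik Ibsen", "work", "A Doll's House"),
]


def objects_of(triples: List[Triple], subject: str, predicate: str) -> List[str]:
    """Return every object stated for the given subject and predicate."""
    return [t.obj for t in triples
            if t.subject == subject and t.predicate == predicate]
```

For instance, `objects_of(TRIPLES, "Henrik Ibsen", "is a")` collects the types stated for the entity.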
Knowledge bases
- A knowledge base (KB) is a set of tuples

- There are many knowledge bases

- Domain-specific, e.g. GeoNames, DOI, BBCMusic
- Cross-domain, e.g. DBpedia, YAGO, Freebase,
Google Knowledge Graph
- Yes, there are many
Cloud of (linked) KBs
Search intents and refiners
- Intent: the underlying user need in a search
query

- For example, the intent of booking a hotel room
- Entity-oriented queries

- Refiner: a way to express an intent in an
entity-oriented query

- For example, for booking a hotel room:
"booking", "book", "reservation", "rooms"
Towards an understanding
of search intents
- A large proportion of search queries are
entity-oriented

- Interest in understanding what those queries ask
for, and how they can be fulfilled
A KB of entity-oriented
search intents
1. Intents searched for a type of entities

paris map, sydney map => [city] map
- (intent ID, searchedForType, entity type, conf.)
2. Categories assigned to refiners

messi instagram => Website
scandic rooms => Service
henrik ibsen child => Property
- (intent ID, ofCategory, intent category, conf.)
3. Multiple refiners expressing an intent

"booking", "book", "make a reservation", "rooms"
- (intent ID, expressedBy, refiner, conf.)
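The quadruple shape (intent ID, predicate, object, confidence) can be sketched as a small record type. This is only an illustration under assumptions: the names `Quad` and `make_quad`, the intent ID `City_Map`, and the confidence values are hypothetical; only the three predicate names come from the slide.

```python
from typing import NamedTuple

# The three predicates named on the slide.
PREDICATES = {"searchedForType", "ofCategory", "expressedBy"}


class Quad(NamedTuple):
    """One KB fact: (intent ID, predicate, object, confidence)."""
    intent_id: str
    predicate: str
    obj: str
    confidence: float


def make_quad(intent_id: str, predicate: str, obj: str, confidence: float) -> Quad:
    """Build a quadruple, rejecting unknown predicates and out-of-range scores."""
    if predicate not in PREDICATES:
        raise ValueError(f"unknown predicate: {predicate!r}")
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence is assumed to lie in [0, 1]")
    return Quad(intent_id, predicate, obj, confidence)


# Hypothetical facts for the "[city] map" example; the scores are made up.
facts = [
    make_quad("City_Map", "searchedForType", "[city]", 0.9),
    make_quad("City_Map", "expressedBy", "map", 0.9),
]
```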
Our pipeline approach
1. Refiners acquisition

- Entities:
clarion hotel
casa 400
...
- Queries for each entity:
clarion hotel airport
clarion hotel spa
clarion hotel booking
casa 400 rooms
casa 400 address
casa 400 deals
...
- Queries sharing a refiner across entities of a type:
clarion hotel airport, casa 400 airport, scandic airport => [hotel] airport
- Type-level refiners:
[hotel] airport
[hotel] spa
[hotel] booking
...
2. Refiners categorization

[hotel] airport: Service
[hotel] address: Property
[hotel] expedia: Website
...
3. Intents discovery

taxi, arrive => Hotel_Arriving
booking, make a reservation => Hotel_Booking
address => Hotel_Address
4. KB construction

Intent profile (all quadruples sharing an intent ID):

Intent ID      Predicate        Object                Confidence
Hotel_Booking  searchedForType  [hotel]               c1
Hotel_Booking  ofCategory       Service               c2
Hotel_Booking  expressedBy      "booking"             c3
Hotel_Booking  expressedBy      "make a reservation"  c4
Hotel_Booking  expressedBy      "rooms"               c5
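The refiners acquisition stage can be illustrated with a rough sketch: strip the entity mention from each query and replace it with its type, so that "clarion hotel airport" and "scandic airport" both yield "[hotel] airport". This is not the authors' implementation. It assumes the entity appears as a prefix of the query and that each entity has one known type; `ENTITY_TYPE`, `QUERIES` and `type_level_refiners` are hypothetical names.

```python
from collections import Counter
from typing import Dict, List

# Assumed entity-to-type lookup; in practice this would come from a KB.
ENTITY_TYPE = {"clarion hotel": "hotel", "casa 400": "hotel", "scandic": "hotel"}

# Example queries taken from the slides.
QUERIES = [
    "clarion hotel airport", "clarion hotel spa", "clarion hotel booking",
    "casa 400 rooms", "casa 400 address", "casa 400 deals",
    "casa 400 airport", "scandic airport",
]


def type_level_refiners(queries: List[str], entity_type: Dict[str, str]) -> Counter:
    """Count how many queries support each "[type] refiner" pattern."""
    counts: Counter = Counter()
    # Try longer entity names first so the most specific mention wins.
    entities = sorted(entity_type, key=len, reverse=True)
    for query in queries:
        for entity in entities:
            prefix = entity + " "
            if query.startswith(prefix):
                refiner = query[len(prefix):]
                counts[f"[{entity_type[entity]}] {refiner}"] += 1
                break  # one entity mention per query is assumed
    return counts


patterns = type_level_refiners(QUERIES, ENTITY_TYPE)
```

Here three of the example queries collapse into the single pattern "[hotel] airport"; the count is one simple way to judge how widely a refiner is shared across entities of a type.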
Evaluation
- Component-level evaluation

- Cross-validation using the human annotations of intent
categories and refiner clusters, for a representative
sample of 50 types
- End-to-end evaluation

- Human judgments about KB facts, for a sample of
additional types, defined w.r.t. confidence intervals
Knowledge base
construction
- Application of the pipeline to extract all
quadruples from 581 unseen types

- 155K quadruples, 31K intent profiles
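Since the KB is a set of quadruples, an intent profile can be read as the group of quadruples sharing one intent ID. The sketch below shows that grouping under assumptions: `QUADS` and `intent_profiles` are hypothetical names, and the confidence values are made up (the slides only give placeholders c1 to c5).

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Quad = Tuple[str, str, str, float]  # (intent ID, predicate, object, confidence)

# Illustrative quadruples; the confidence scores are invented for the example.
QUADS: List[Quad] = [
    ("Hotel_Booking", "searchedForType", "[hotel]", 0.93),
    ("Hotel_Booking", "ofCategory", "Service", 0.91),
    ("Hotel_Booking", "expressedBy", "booking", 0.95),
    ("Hotel_Booking", "expressedBy", "make a reservation", 0.90),
    ("Hotel_Booking", "expressedBy", "rooms", 0.87),
    ("Hotel_Address", "searchedForType", "[hotel]", 0.92),
    ("Hotel_Address", "expressedBy", "address", 0.94),
]


def intent_profiles(quads: List[Quad]) -> Dict[str, List[Tuple[str, str, float]]]:
    """Group quadruples by intent ID; each group is one intent profile."""
    profiles = defaultdict(list)
    for intent_id, predicate, obj, confidence in quads:
        profiles[intent_id].append((predicate, obj, confidence))
    return dict(profiles)


profiles = intent_profiles(QUADS)
```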

Excerpt of the KB, for intent ID
<aviation.airline-65-customer_service>
Results
[Figure: proportion of triples (0% to 100%) judged Correct, Incorrect due to OFCATEGORY, or Incorrect due to EXPRESSEDBY, for each confidence interval according to the splitting percentiles: [0, 0.8652), [0.8652, 0.8837), [0.8837, 0.9043), [0.9043, 0.9319), [0.9319, 1]]
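Assigning a fact to one of the five confidence intervals listed above is a simple lookup against the four cut points. The sketch below uses the boundaries from the slide; the function name `confidence_interval` is hypothetical, and how the boundaries were derived from percentiles is not reproduced here.

```python
from bisect import bisect_right

# Cut points between the five intervals, as given on the slide.
BOUNDS = [0.8652, 0.8837, 0.9043, 0.9319]
LABELS = [
    "[0, 0.8652)",
    "[0.8652, 0.8837)",
    "[0.8837, 0.9043)",
    "[0.9043, 0.9319)",
    "[0.9319, 1]",
]


def confidence_interval(confidence: float) -> str:
    """Return the label of the interval that contains the confidence score."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must lie in [0, 1]")
    # bisect_right places a score equal to a cut point in the interval
    # that starts at that cut point, matching the left-closed intervals.
    return LABELS[bisect_right(BOUNDS, confidence)]
```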
Application scenarios
- Leveraging knowledge with levels of confidence

- Identification of search intents in unseen queries

- Design and functionality of entity cards
Thank you!

