The Daedalus: A search-engine for
visualizing semantic relationships
Brent Kievit-Kylar, Sean Connolly, & Colin Allen
Indiana University, Bloomington
Background
Search engines make predictions. Given the set of words a user
enters, a search engine predicts what information that user is
seeking. The predictions are generally presented as a list, with
the highest-ranked prediction first. But what determines this
ranking?
Internet search engines turn each webpage into a generic
“bag” of words. The “words in the bag” carry no semantic or
grammatical meaning. Each is given a score based on how
many times it is repeated in the document and on its connection
to all the other words.
Semantic Linking
Search engines can’t always tell when different words mean the
same thing. In the “chemist” example below, the search engine
reveals it doesn’t know that “qc” and “quality control” mean the
same thing. It may never figure this out on its own. But to a
human with expertise in a specific domain, the matter could be
trivial.
Semantic linking could help with corpora compiled over many
years or across different languages. Aristotle separated “plot”
from “story”: “story” is the elements and events of a narrative
and “plot” is their ordering. Russian Formalists used the words
“fabula” and “syuzhet” to express a similar conceptualization of
narrative. Researchers might want to find both. The tool lets
a researcher build domain-specific knowledge into generic IT.
Future Work
We believe Daedalus (the “data list”) can perhaps help most
in the querying and cataloging of archives held at
universities. We believe the tool can help researchers dive
deeper into texts with technology and see as-yet-unseen
connections across and within texts, and that it can give users
greater control over digital research tools.
“Words are known by the company they keep.” (Firth 1957)
Semantic Override
Do you know the search-weighting protocols of your data
search tools? The way a tool is built affects its efficacy and
limits its potential uses. Daedalus allows the re-weighting of
terms so users may “take over” the search and override the
strength of the weightings of the word-symbol relationships.
Re-weighting the relationships of key words also refreshes the
search with the new weights and generates a new search results
page.
Figure: A visual representation of the weighted “bag of words”
for the non-grammatical query “potter’s patronus animal” (drawn
from a real web query by our Daedalus tool, 9/29/12).
Figure label: Reweight key terms
Figure: As part of the InPho project, the Daedalus represents
each article of the Stanford Encyclopedia of Philosophy as a
meta-object, showing the introduction in one domain and the
rest of the article as another. Re-weighting and linking
generate new search results.
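The three ideas on this poster (bag-of-words scoring, semantic linking, and user re-weighting) can be illustrated with a minimal Python sketch. This is not the Daedalus implementation, which the poster does not describe in detail. It assumes plain term counts as the score, and the function names, the `links` map, and the `weights` map are all hypothetical.

```python
import re
from collections import Counter


def normalize(text, links):
    """Lowercase the text and rewrite each linked phrase to its target term.

    `links` is a hypothetical user-supplied map such as
    {"quality control": "qc"}, standing in for semantic linking.
    """
    text = text.lower()
    for phrase, target in links.items():
        text = re.sub(r"\b" + re.escape(phrase) + r"\b", target, text)
    return text


def bag_of_words(text, links=None):
    """Count word occurrences, discarding word order and grammar."""
    return Counter(re.findall(r"[a-z0-9']+", normalize(text, links or {})))


def score(query, document, links=None, weights=None):
    """Sum the weighted document counts of each distinct query term.

    `weights` is a hypothetical user override map; terms that are not
    listed keep the default weight of 1.0 (an assumption of this sketch).
    """
    weights = weights or {}
    doc_bag = bag_of_words(document, links)
    return sum(
        weights.get(term, 1.0) * doc_bag[term]
        for term in bag_of_words(query, links)
    )


def rank(query, documents, links=None, weights=None):
    """Return document ids ordered from highest to lowest score."""
    return sorted(
        documents,
        key=lambda doc_id: score(query, documents[doc_id], links, weights),
        reverse=True,
    )


if __name__ == "__main__":
    docs = {
        "a": "The chemist runs quality control on every batch.",
        "b": "The chemist published a paper on batch reactors.",
    }
    links = {"quality control": "qc"}
    print(score("chemist qc", docs["a"]))                       # no linking
    print(score("chemist qc", docs["a"], links))                # with linking
    print(score("chemist qc", docs["a"], links, {"qc": 3.0}))   # with override
    print(rank("chemist qc", docs, links))
```

Without the link, the query term “qc” never matches “quality control” in document "a", so both documents score the same. Adding the link lets the match count, and raising the weight of “qc” pushes that document further up the ranking. A real system would also account for how the words relate to one another, which this sketch leaves out.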

Return to the Materials Digital Humanities Conference 2013
