Why is Semantic Search so Hard? And What Truevert Does About It. Powered by www.truevert.com, www.orcatec.com
Semantic search harnesses the meaning of words to improve the quality of search results
Using meaning is difficult
Language is dynamic. Jabberwocky Effect: making up new words (“blog”). Humpty Dumpty Syndrome: using old words in new ways (“Twitter”).
Words are ambiguous: “strike”, “bank”.
How ambiguous? Look it up! Number of definitions per word: The (37) companies (14) have (39) agreed (17) to (54) a (62) brief (20) delay (8) in (84) implementing (8) their (7) agreement (9). That gives 7,788,584,618,680,320 possible interpretations. Each word disambiguates the others.
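The figure on this slide is the product of the per-word definition counts. A minimal sketch of that arithmetic follows; pairing each count with a word in sentence order is an assumption based on the slide listing twelve numbers for twelve words.

```python
from math import prod

# (word, number of dictionary definitions) as listed on the slide.
# Assumption: the counts appear in the same order as the words.
definition_counts = [
    ("The", 37), ("companies", 14), ("have", 39), ("agreed", 17),
    ("to", 54), ("a", 62), ("brief", 20), ("delay", 8),
    ("in", 84), ("implementing", 8), ("their", 7), ("agreement", 9),
]

# If every sense of every word could combine with every sense of the
# others, the number of readings is the product of the counts.
interpretations = prod(count for _, count in definition_counts)

print(f"{interpretations:,}")  # 7,788,584,618,680,320
```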
Isn’t the Semantic Web supposed to fix these problems? The Semantic Web was intended to support machine-to-machine communication to manage the day-to-day mechanisms of trade, bureaucracy, and daily life (Berners-Lee, 1999).
Semantic Web and the Web Ontology Language (OWL): line up the information in web pages with predefined categories.
Ontology: a set of concepts, categories, and relations. Ontologies cast meaning into categories. [Diagram: Recreation, Sports, Baseball, Basketball, Cricket, Player, Batter, Gloves, Basketballs, Baseballs, and Wicket, linked by “is a” and “uses” relations.]
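One common way to represent an ontology like this is as (subject, relation, object) triples. The sketch below is illustrative only: the exact edges of the slide's diagram are not recoverable from the text, so the triples and the helper name `ancestors` are assumptions.

```python
# Hypothetical edges: illustrative guesses, not the slide's actual diagram.
TRIPLES = [
    ("Sports", "is_a", "Recreation"),
    ("Baseball", "is_a", "Sports"),
    ("Basketball", "is_a", "Sports"),
    ("Cricket", "is_a", "Sports"),
    ("Batter", "is_a", "Player"),
    ("Baseball", "uses", "Gloves"),
    ("Baseball", "uses", "Baseballs"),
    ("Basketball", "uses", "Basketballs"),
    ("Cricket", "uses", "Wicket"),
]

def ancestors(concept, triples=TRIPLES):
    """Follow "is_a" links upward and return every category reached."""
    found = []
    frontier = [concept]
    while frontier:
        current = frontier.pop()
        for subj, rel, obj in triples:
            if subj == current and rel == "is_a" and obj not in found:
                found.append(obj)
                frontier.append(obj)
    return found

print(ancestors("Basketball"))  # ['Sports', 'Recreation']
```

The structure can only answer questions along the relations someone defined in advance; a category nobody anticipated has no place in it.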
Ontologies limit thinking to known tracks.
People are creative. For example, 20-25% of the searches on Google on any given day have never been seen before.
What categories matter to you? Take “basketball”: bouncy things; round things; things to dribble; things that my brother hates; things with a pebbly surface; things that Barack Obama likes; things that float. There is an infinite number of ways to categorize.
What’s Truevert’s solution?
“The meaning of a word is its use in the language.” Ludwig Wittgenstein, Philosophical Investigations, § 43.
Truevert learns the meaning of words in the same way that people do, from the context in which they are used. Truevert works in any language.
Gabbro is a dark, coarse-grained, igneous rock formed underground. It is chemically equivalent to basalt.  Gabbro is rarely used as a building stone.   Do you know the meaning of the word “Gabbro?”
Legal vertical: “Blah blah blah court blah blah blah lawyer blah blah blah blah bailiff blah blah blah blah blah.” Sport vertical: “Blah blah court blah blah blah basketball blah blah blah blah blah blah freethrow blah blah blah blah.” The computer creates a model of word use patterns from the documents in its vertical.
The model identifies characteristic word patterns for each vertical: Court & (lawyer or bailiff or jury or attorney or …) = legal; Court & (basketball or hoops or freethrow or …) = sports.
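The pattern idea can be sketched in a few lines. This is not Truevert's implementation, only a toy illustration under stated assumptions: the sample documents are invented, and `characteristic_words` and `classify` are hypothetical helper names. A word counts as characteristic of a vertical if it appears in that vertical's documents and in no other vertical's.

```python
# Invented sample documents for two verticals (illustration only).
VERTICAL_DOCS = {
    "legal": [
        "the court heard the lawyer",
        "the bailiff cleared the court",
        "the jury left the court",
        "the attorney addressed the court",
    ],
    "sports": [
        "the basketball bounced off the court",
        "the freethrow hushed the court",
        "hoops fans filled the court",
    ],
}

def characteristic_words(vertical_docs):
    """Map each vertical to the words that occur only in its documents."""
    vocab = {
        name: {word for doc in docs for word in doc.lower().split()}
        for name, docs in vertical_docs.items()
    }
    result = {}
    for name, words in vocab.items():
        others = set().union(*(ws for other, ws in vocab.items() if other != name))
        result[name] = words - others
    return result

def classify(text, patterns, target="court"):
    """Return the vertical whose characteristic words co-occur with `target`.

    Returns None when `target` is absent or no characteristic word matches.
    """
    words = set(text.lower().split())
    if target not in words:
        return None
    scores = {name: len(words & chars) for name, chars in patterns.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None

patterns = characteristic_words(VERTICAL_DOCS)
print(classify("the lawyer addressed the court", patterns))  # legal
print(classify("a freethrow from mid court", patterns))      # sports
```

Shared words such as “the” and “court” drop out automatically because they occur in both verticals, which mirrors the slide's point that the surrounding words, not the ambiguous word itself, carry the distinction.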
Word use patterns are meaning
Follow your own path. Truevert delivers results tuned to your interests.
Truevert’s patterns let YOU find the results that YOU are looking for
Green Vertical Semantic Search Results
Truevert is a project of OrcaTec LLC. Headquartered in Ojai, CA, OrcaTec is a leading provider of information discovery software, including intelligent semantic search, near-duplicate clustering, language identification, email threading, and interesting phrase finding. OrcaTec-developed software was nominated by the Jet Propulsion Laboratory as NASA Software of the Year 2008. OrcaTec software has been used in electronic discovery and advertising applications as well as knowledge management. Core OrcaTec software is patent pending.
Contact Truevert: www.truevert.com, 805-918-4612

