SlideShare a Scribd company logo
Text Analytics World
San Francisco – March 31, 2015
4:15-4:45pm
Speaker: Bryan Bell, Executive Vice President, Expert System USA
 What is in Your Business Requirement: Searching or Finding? Enterprise Search
 Product Demonstration: The Google Search Appliance (GSA) integrated with a
semantic technology platform.
1. Internal and external information comes at us faster than we can keep up with.
2. Business expectations for deploying solutions, using enterprise search and content navigation systems
to capture the hidden value of strategic information.
3. CONTEXT: Exploiting deep linguistic analysis, combined with semantics offers the ability to create
contextually correct metadata.
4. Dynamically enrich content with contextually relevant metadata and deploy as the heart of a
knowledge management applications and the Google Search Appliance.
1. Internal and external information comes at us faster than we can keep up with.
80 – 90% is unstructured text.
Zettabyte
1,000,000,000,000,000,000, 000 bytes
4
 The Google crawler visits 20 billion web sites a day.
 The search engine has located more than 30 trillion unique URLs.
 Processes 100 billion searches every month.
• 3.3 billion searches per day.
• Over 38,000 thousand searches per second.
• A single Google query uses 1,000 computers to retrieve an answer.
• This volume combined with the PageRank algorithm…
PR(A) = (1-d) + d (PR(T1)/C(T1) + PR(Tn)/C(Tn)) …. is why Google is so good on the internet.
• 16% to 20% of queries that get asked every day have never been asked before.
Amit Singhal,
Senior Vice President of development, Google Search
August 2012
The Internet
2. Deploying internal enterprise search
engine / content navigation system to
capture and share the hidden value of the
information that is available to the company.
The intranet / corporate portal
2. Deploying internal enterprise search
engine / content navigation system to
capture and share the hidden value of the
information that is available to the company.
The intranet / corporate portal
“Our search stinks!
I want it to work
like Google.”
9
Zettabyte
1,000,000,000,000,000,000,000 bytes
Good news:
PR(A) = (1-d) + d (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn))
Don’t have 3.3 billion searches per day.
Don’t have 38,000 thousand searches per second.
Don’t have 1,000 computers to retrieve an answer.
10
Zettabyte
1,000,000,000,000,
000,000,000 bytes
Key words
No metadata
Poor metadata
Inconsistent
11
Zettabyte
1,000,000,000,000,
000,000,000 bytes
Key words
No metadata
Poor metadata
Inconsistent
=
POOR
CONTENT
FINDABILITY
12
stock
People are able to disambiguate “on the fly”, but machines cannot.
Key words vs. Context
Language ambiguity
13
People are able to disambiguate “on the fly”, but machines cannot.
stock
apple
Key words vs. Context
Language ambiguity
14
stock
apple
Apple
People are able to disambiguate “on the fly”, but machines cannot.
Key words vs. Context
Language ambiguity
15
stock
apple
Apple
“I bought 10,000 shares of stock in Apple.”
“I have 10,000 apples in stock.”
People are able to disambiguate “on the fly”, but machines cannot.
Context is King
3. Exploiting deep linguistic analysis,
combined with semantics.
4. Dynamically enrich content with contextually
relevant metadata.
How is word context established?
Morphological
analysis word forms dog, dog-catcher, doggy bag
Grammatical analysis parts of speech "There are 40 rows in the table." (noun)
"She rows 5 times a week." (verb)
Logical analysis
word
relationships
"The car I bought, to replace my Chrysler,
stinks."
Semantic analysis word context "I bought 10,000 shares of stock in Apple."
"I have 10,000 apples in stock."
"I used chicken broth for my soup stock."
Deep linguistic analysis of words to achieve word disambiguation.
How is word context established
and deployed with the GSA?
www.intelligenceapi.com
20
Linguistic and semantic analysis engine
27
Case Study: GSA – Google Search Appliance
What is in Your Business Requirement? Searching or Finding.
Contacts
Thank you
Bryan Bell
bbell@expertsystem.com
@bellbryan
+1.847.508.7938
www.expertsystem.com

More Related Content

What's hot

Data scienceppt
Data sciencepptData scienceppt
Data scienceppt
Jayabalan Sekar
 
Key Failure Factors of Building a Data Scientist Team
Key Failure Factors of Building a Data Scientist TeamKey Failure Factors of Building a Data Scientist Team
Key Failure Factors of Building a Data Scientist Team
DSP智庫驅動
 
متن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌دادهمتن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌داده
جشنوارهٔ روز آزادی نرم‌افزار تهران
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public Cloud
IMC Institute
 
Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016
Mark Tabladillo
 
Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608
Mark Tabladillo
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace
Mohamadreza Mohtat
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
Ravi Teja
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
Bernard Marr
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makers
Ruhollah Farchtchi
 
How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610
Mark Tabladillo
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13
Rei Lynn Hayashi
 
Glue Conference
Glue ConferenceGlue Conference
Glue Conference
Assist
 
Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark) Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark)
Matt Turck
 
5 important trends in big data cloud & big data services
5 important trends in big data cloud & big data services5 important trends in big data cloud & big data services
5 important trends in big data cloud & big data services
johnmerchant
 
presentation
presentationpresentation
presentation
Kelly (Kaili) Li
 
Foresight conversation
Foresight conversationForesight conversation
Foresight conversation
suresh sood
 
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Dj Das
 
Big data big rewards
Big data big rewards Big data big rewards
Big data big rewards
Zulkifflee Sofee
 
Linked Open Government Data and the Semantic Web
Linked Open Government Data and the Semantic WebLinked Open Government Data and the Semantic Web
Linked Open Government Data and the Semantic Web
James Hendler
 

What's hot (20)

Data scienceppt
Data sciencepptData scienceppt
Data scienceppt
 
Key Failure Factors of Building a Data Scientist Team
Key Failure Factors of Building a Data Scientist TeamKey Failure Factors of Building a Data Scientist Team
Key Failure Factors of Building a Data Scientist Team
 
متن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌دادهمتن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌داده
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public Cloud
 
Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016
 
Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makers
 
How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13
 
Glue Conference
Glue ConferenceGlue Conference
Glue Conference
 
Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark) Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark)
 
5 important trends in big data cloud & big data services
5 important trends in big data cloud & big data services5 important trends in big data cloud & big data services
5 important trends in big data cloud & big data services
 
presentation
presentationpresentation
presentation
 
Foresight conversation
Foresight conversationForesight conversation
Foresight conversation
 
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
 
Big data big rewards
Big data big rewards Big data big rewards
Big data big rewards
 
Linked Open Government Data and the Semantic Web
Linked Open Government Data and the Semantic WebLinked Open Government Data and the Semantic Web
Linked Open Government Data and the Semantic Web
 

Similar to Text Analytics World - Expert System USA

Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
ankur881120
 
Business intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lakeBusiness intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lake
Data Science Thailand
 
Data Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch SeminarData Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch Seminar
SpazioDati
 
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
Bearish SEO: Defining the User Experience for Google’s Panda Search LandscapeBearish SEO: Defining the User Experience for Google’s Panda Search Landscape
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
Marianne Sweeny
 
Test
TestTest
Google
GoogleGoogle
Google
OUM SAOKOSAL
 
Intelligent search | Semantic Search
Intelligent search | Semantic SearchIntelligent search | Semantic Search
Intelligent search | Semantic Search
Repustate
 
Google Research Paper
Google Research PaperGoogle Research Paper
Google Research Paper
didip
 
Broad Data
Broad DataBroad Data
Broad Data
James Hendler
 
Interactive and Conversational Search with Google Cloud and Elasticsearch
Interactive and Conversational Search with Google Cloud and ElasticsearchInteractive and Conversational Search with Google Cloud and Elasticsearch
Interactive and Conversational Search with Google Cloud and Elasticsearch
Inexture Solutions
 
Orbyfy Overview - Solutions_vF_x.pdf
Orbyfy Overview - Solutions_vF_x.pdfOrbyfy Overview - Solutions_vF_x.pdf
Orbyfy Overview - Solutions_vF_x.pdf
Orbyfy
 
The Future State of Document Management, Taxonomies and Metadata in the Cloud
The Future State of Document Management, Taxonomies and Metadata in the CloudThe Future State of Document Management, Taxonomies and Metadata in the Cloud
The Future State of Document Management, Taxonomies and Metadata in the Cloud
BIWUG
 
The anatomy of google
The anatomy of googleThe anatomy of google
The anatomy of google
maelmardi
 
Search V Next Final
Search V Next FinalSearch V Next Final
Search V Next Final
Marianne Sweeny
 
Semantic Web Science
Semantic Web ScienceSemantic Web Science
Semantic Web Science
James Hendler
 
Career_Jobs_in_Data_Science.pptx
Career_Jobs_in_Data_Science.pptxCareer_Jobs_in_Data_Science.pptx
Career_Jobs_in_Data_Science.pptx
HarpreetSharma14
 
ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010
steverz
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Search
masiclat
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
Semantic Web Company
 
The Searchmaster's Toolbox - David Hawking, Funnelback Search
The Searchmaster's Toolbox - David Hawking, Funnelback SearchThe Searchmaster's Toolbox - David Hawking, Funnelback Search
The Searchmaster's Toolbox - David Hawking, Funnelback Search
Squiz
 

Similar to Text Analytics World - Expert System USA (20)

Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
 
Business intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lakeBusiness intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lake
 
Data Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch SeminarData Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch Seminar
 
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
Bearish SEO: Defining the User Experience for Google’s Panda Search LandscapeBearish SEO: Defining the User Experience for Google’s Panda Search Landscape
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
 
Test
TestTest
Test
 
Google
GoogleGoogle
Google
 
Intelligent search | Semantic Search
Intelligent search | Semantic SearchIntelligent search | Semantic Search
Intelligent search | Semantic Search
 
Google Research Paper
Google Research PaperGoogle Research Paper
Google Research Paper
 
Broad Data
Broad DataBroad Data
Broad Data
 
Interactive and Conversational Search with Google Cloud and Elasticsearch
Interactive and Conversational Search with Google Cloud and ElasticsearchInteractive and Conversational Search with Google Cloud and Elasticsearch
Interactive and Conversational Search with Google Cloud and Elasticsearch
 
Orbyfy Overview - Solutions_vF_x.pdf
Orbyfy Overview - Solutions_vF_x.pdfOrbyfy Overview - Solutions_vF_x.pdf
Orbyfy Overview - Solutions_vF_x.pdf
 
The Future State of Document Management, Taxonomies and Metadata in the Cloud
The Future State of Document Management, Taxonomies and Metadata in the CloudThe Future State of Document Management, Taxonomies and Metadata in the Cloud
The Future State of Document Management, Taxonomies and Metadata in the Cloud
 
The anatomy of google
The anatomy of googleThe anatomy of google
The anatomy of google
 
Search V Next Final
Search V Next FinalSearch V Next Final
Search V Next Final
 
Semantic Web Science
Semantic Web ScienceSemantic Web Science
Semantic Web Science
 
Career_Jobs_in_Data_Science.pptx
Career_Jobs_in_Data_Science.pptxCareer_Jobs_in_Data_Science.pptx
Career_Jobs_in_Data_Science.pptx
 
ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Search
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
 
The Searchmaster's Toolbox - David Hawking, Funnelback Search
The Searchmaster's Toolbox - David Hawking, Funnelback SearchThe Searchmaster's Toolbox - David Hawking, Funnelback Search
The Searchmaster's Toolbox - David Hawking, Funnelback Search
 

Recently uploaded

UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
Peter Muessig
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
Neo4j
 
E-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet DynamicsE-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet Dynamics
Hornet Dynamics
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
Aftab Hussain
 
socradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdfsocradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdf
SOCRadar
 
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
Google
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Neo4j
 
DDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systemsDDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systems
Gerardo Pardo-Castellote
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
Remote DBA Services
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
Octavian Nadolu
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
Shane Coughlan
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
Philip Schwarz
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
Remote DBA Services
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
Łukasz Chruściel
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
mz5nrf0n
 
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
kalichargn70th171
 
E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
Hornet Dynamics
 

Recently uploaded (20)

UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
 
E-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet DynamicsE-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet Dynamics
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
 
socradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdfsocradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdf
 
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
 
DDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systemsDDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systems
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
 
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
 
E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
 

Text Analytics World - Expert System USA

  • 1. Text Analytics World San Francisco – March 31, 2015 4:15-4:45pm Speaker: Bryan Bell, Executive Vice President, Expert System USA  What is in Your Business Requirement: Searching or Finding? Enterprise Search  Product Demonstration: The Google Search Appliance (GSA) integrated with a semantic technology platform. 1. Internal and external information comes at us faster than we can keep up with. 2. Business expectations for deploying solutions, using enterprise search and content navigation systems to capture the hidden value of strategic information. 3. CONTEXT: Exploiting deep linguistic analysis, combined with semantics offers the ability to create contextually correct metadata. 4. Dynamically enrich content with contextually relevant metadata and deploy as the heart of a knowledge management applications and the Google Search Appliance.
  • 2. 1. Internal and external information comes at us faster than we can keep up with. 80 – 90% is unstructured text.
  • 4. 4  The Google crawler visits 20 billion web sites a day.  The search engine has located more than 30 trillion unique URLs.  Processes 100 billion searches every month. • 3.3 billion searches per day. • Over 38,000 thousand searches per second. • A single Google query uses 1,000 computers to retrieve an answer. • This volume combined with the PageRank algorithm… PR(A) = (1-d) + d (PR(T1)/C(T1) + PR(Tn)/C(Tn)) …. is why Google is so good on the internet. • 16% to 20% of queries that get asked every day have never been asked before. Amit Singhal, Senior Vice President of development, Google Search August 2012 The Internet
  • 5. 2. Deploying internal enterprise search engine / content navigation system to capture and share the hidden value of the information that is available to the company. The intranet / corporate portal
  • 6. 2. Deploying internal enterprise search engine / content navigation system to capture and share the hidden value of the information that is available to the company. The intranet / corporate portal
  • 7. “Our search stinks! I want it to work like Google.”
  • 8.
  • 9. 9 Zettabyte 1,000,000,000,000,000,000,000 bytes Good news: PR(A) = (1-d) + d (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn)) Don’t have 3.3 billion searches per day. Don’t have 38,000 thousand searches per second. Don’t have 1,000 computers to retrieve an answer.
  • 11. 11 Zettabyte 1,000,000,000,000, 000,000,000 bytes Key words No metadata Poor metadata Inconsistent = POOR CONTENT FINDABILITY
  • 12. 12 stock People are able to disambiguate “on the fly”, but machines cannot. Key words vs. Context Language ambiguity
  • 13. 13 People are able to disambiguate “on the fly”, but machines cannot. stock apple Key words vs. Context Language ambiguity
  • 14. 14 stock apple Apple People are able to disambiguate “on the fly”, but machines cannot. Key words vs. Context Language ambiguity
  • 15. 15 stock apple Apple “I bought 10,000 shares of stock in Apple.” “I have 10,000 apples in stock.” People are able to disambiguate “on the fly”, but machines cannot. Context is King
  • 16. 3. Exploiting deep linguistic analysis, combined with semantics. 4. Dynamically enrich content with contextually relevant metadata. How is word context established?
  • 17. Morphological analysis word forms dog, dog-catcher, doggy bag Grammatical analysis parts of speech "There are 40 rows in the table." (noun) "She rows 5 times a week." (verb) Logical analysis word relationships "The car I bought, to replace my Chrysler, stinks." Semantic analysis word context "I bought 10,000 shares of stock in Apple." "I have 10,000 apples in stock." "I used chicken broth for my soup stock." Deep linguistic analysis of words to achieve word disambiguation. How is word context established and deployed with the GSA?
  • 19. 20 Linguistic and semantic analysis engine
  • 20. 27 Case Study: GSA – Google Search Appliance What is in Your Business Requirement? Searching or Finding.