SlideShare a Scribd company logo
1 of 38
Enabling Exploration through Text Analytics Daniel Tunkelang Chief Scientist, Endeca
overview ,[object Object],[object Object],[object Object],[object Object]
real-world information seeking examples ,[object Object],[object Object],[object Object],[object Object],[object Object]
example 1: looking for health information ,[object Object],[object Object],[object Object]
google: the default option for most
in government we trust: fda.gov
maybe the private sector knows best: webmd powered by
success – and a sticky site powered by
example 2: looking for work-related information ,[object Object],[object Object],[object Object]
let’s try google again
google: the gateway to wikipedia?
the library of congress (loc.gov)
triangle research libraries: next-gen catalog powered by
faceted search enables query refinement powered by
take-away #1 ,[object Object],[object Object]
text analytics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
newssift: text analytics enabling exploration powered by categorization named entity detection term extraction sentiment analysis
exploring the news about facebook powered by
facebook: the good powered by Social Utility Iphone Application
facebook: the bad powered by Criminal Behavior Litigation And Settlement
take-away #2 ,[object Object],[object Object]
text analytics is here and now ? ? ?
lots of off-the-shelf options and more!
caveats ,[object Object],[object Object],[object Object],[object Object]
problems with entity extraction ,[object Object],[object Object],[object Object],Arrest (1) Asia (1) ALTOONA, PA (1) Abe Lincoln (1) Bob Dole (1) Boston Tea Party (1) Abraham Lincoln (1) Budweiser (1) Australia (1) Adlai Stephenson (1) Boston Tea Party (1) Austin, Texas (1) Abraham Lincoln (1) Boston Globe (1) Austin (1) Abe Weiss (1) Bocuse d’Or World Cuisine Contest (1) Atlanta (2) Abe Lincoln (1) Bob Dole (1) Asia (1) Abbie Hoffman (1) Bloomberg LP (3) Arrest (1) Aaron Sorkin (1) BioDiversity Research Institute (1) Arlington, Va. (2) ARYE BARAK (1) Big Apple Companies (1) Arkansas (7) ANTONIN SCALIA (1) Bear Stearns (2) Arizona (11) ANTHONY MWANGI (1) Bad News Bears (1) Argentina (1) ANDREW LLOYD WEBBER (1) Australian Liberal Party (1) Appalachia (1) ANDERS ERICSSON (1) Arianna Huffington (1) Americas (17) AMY WINEHOUSE (1) Arctic National Wildlife Refuge (1) Allegheny (1) AMANDA MARCOTTE (1) Apple (1) Alaska (3) ALI HASSAN AL (1) American Airlines Inc. (1) Akihabara (1) ALEX TREBEK (1) Amazon.com Inc. (1) Africa (5) AL GORE (1) Air Force (1) Afghanistan (7) ABDULRAHMAN ABDULLAH (1) ABC News Inc. (1) ALTOONA, PA (1) ABDUL-KARIM KHALAF (1) Organization Location Person
look for ways to cheat! recall precision
division of labor people supply vocabulary machine annotates documents http://www.precolumbianwomen.com/images/inca-labor.10.gif
example: ACM digital library ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
solution ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
example: a search for boeing powered by
it’s a HITS!
if you prefer sports to computer science ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
roger clemens, then and now powered by
pivoting to a different view powered by
take-away #3 ,[object Object],[object Object],[object Object]
looking forward ,[object Object],[object Object],[object Object],[object Object]
in closing ,[object Object],[object Object],[object Object]
thank you…and come to SIGIR! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

Plagiarism work sheet
Plagiarism work sheetPlagiarism work sheet
Plagiarism work sheet
Vjames12
 
Search Strings
Search StringsSearch Strings
Search Strings
Erin Sees
 
Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712
Ms. D
 
Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)
Swilley Library
 

What's hot (19)

Sociology 462
Sociology 462Sociology 462
Sociology 462
 
Electronic Research: Sources and Strategies
Electronic Research: Sources and StrategiesElectronic Research: Sources and Strategies
Electronic Research: Sources and Strategies
 
ENG 101
ENG 101ENG 101
ENG 101
 
Doing Literature Review
Doing Literature ReviewDoing Literature Review
Doing Literature Review
 
Who's citing whom?
Who's citing whom?Who's citing whom?
Who's citing whom?
 
10 easy ways to increase your citation count a checklist
10 easy ways to increase your citation count  a checklist10 easy ways to increase your citation count  a checklist
10 easy ways to increase your citation count a checklist
 
Subject Searching
Subject Searching Subject Searching
Subject Searching
 
What google scholar can do for you
What google scholar can do for youWhat google scholar can do for you
What google scholar can do for you
 
Law1 ppl journal articles
Law1 ppl journal articlesLaw1 ppl journal articles
Law1 ppl journal articles
 
Humanities international complete
Humanities international completeHumanities international complete
Humanities international complete
 
Plagiarism work sheet
Plagiarism work sheetPlagiarism work sheet
Plagiarism work sheet
 
Reflection on web2.0
Reflection on web2.0Reflection on web2.0
Reflection on web2.0
 
How people search the library from a single search box
How people search the library from a single search boxHow people search the library from a single search box
How people search the library from a single search box
 
Search Strings
Search StringsSearch Strings
Search Strings
 
Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712
 
Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)
 
Finding newspaper articles in factiva ppl2015
Finding newspaper articles in factiva ppl2015Finding newspaper articles in factiva ppl2015
Finding newspaper articles in factiva ppl2015
 
Searching databaseswelshci
Searching databaseswelshciSearching databaseswelshci
Searching databaseswelshci
 
National latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinarNational latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinar
 

Viewers also liked

Viewers also liked (10)

The Future of Text Analytics
The Future of Text AnalyticsThe Future of Text Analytics
The Future of Text Analytics
 
Predictive Text Analytics
Predictive Text AnalyticsPredictive Text Analytics
Predictive Text Analytics
 
singley+mackie Capabilities Deck
singley+mackie Capabilities Decksingley+mackie Capabilities Deck
singley+mackie Capabilities Deck
 
Text Mining Analytics 101
Text Mining Analytics 101Text Mining Analytics 101
Text Mining Analytics 101
 
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
 
Log Data Mining
Log Data MiningLog Data Mining
Log Data Mining
 
Text Analytics for Dummies 2010
Text Analytics for Dummies 2010Text Analytics for Dummies 2010
Text Analytics for Dummies 2010
 
Elements of Text Mining Part - I
Elements of Text Mining Part - IElements of Text Mining Part - I
Elements of Text Mining Part - I
 
Log Mining: Beyond Log Analysis
Log Mining: Beyond Log AnalysisLog Mining: Beyond Log Analysis
Log Mining: Beyond Log Analysis
 
Data Science - Part XI - Text Analytics
Data Science - Part XI - Text AnalyticsData Science - Part XI - Text Analytics
Data Science - Part XI - Text Analytics
 

Similar to Enabling Exploration Through Text Analytics

Search Engine Strategies
Search  Engine  StrategiesSearch  Engine  Strategies
Search Engine Strategies
jsotir
 
Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
Hala Nur
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
lynroe
 
Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)
sbishoptcl
 

Similar to Enabling Exploration Through Text Analytics (20)

Database Basics
Database BasicsDatabase Basics
Database Basics
 
Workshop on Systematic Searching (Oslo)
Workshop on Systematic Searching (Oslo)Workshop on Systematic Searching (Oslo)
Workshop on Systematic Searching (Oslo)
 
Internet searching
Internet searchingInternet searching
Internet searching
 
Reproducibility Analytics Lab
Reproducibility Analytics Lab Reproducibility Analytics Lab
Reproducibility Analytics Lab
 
June 1st Library Presentation for CCTS Summer Fellowship
June 1st Library Presentation for CCTS Summer FellowshipJune 1st Library Presentation for CCTS Summer Fellowship
June 1st Library Presentation for CCTS Summer Fellowship
 
Google for Life Science Researchers
Google for Life Science ResearchersGoogle for Life Science Researchers
Google for Life Science Researchers
 
Libguide powerpoint
Libguide powerpointLibguide powerpoint
Libguide powerpoint
 
Introductory Literature Searching Session
Introductory Literature Searching SessionIntroductory Literature Searching Session
Introductory Literature Searching Session
 
Search Engine Strategies
Search  Engine  StrategiesSearch  Engine  Strategies
Search Engine Strategies
 
Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
 
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & EvaluationFSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
 
Databasics
DatabasicsDatabasics
Databasics
 
Hinari basic course_module_2_workbook_2014_07
Hinari basic course_module_2_workbook_2014_07Hinari basic course_module_2_workbook_2014_07
Hinari basic course_module_2_workbook_2014_07
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Big 6 Research Skills
Big 6 Research SkillsBig 6 Research Skills
Big 6 Research Skills
 
Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)
 
TSEM Spring 2017 Fath Class1
TSEM Spring 2017 Fath Class1TSEM Spring 2017 Fath Class1
TSEM Spring 2017 Fath Class1
 
TSEM Fall 2016 Fath Class1
TSEM Fall 2016 Fath Class1TSEM Fall 2016 Fath Class1
TSEM Fall 2016 Fath Class1
 
Usability Testing a Public ERM: Worth the Effort?
Usability Testing a Public ERM: Worth the Effort?Usability Testing a Public ERM: Worth the Effort?
Usability Testing a Public ERM: Worth the Effort?
 
A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)
 

More from Daniel Tunkelang

Enterprise Intelligence
Enterprise IntelligenceEnterprise Intelligence
Enterprise Intelligence
Daniel Tunkelang
 
My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine Learning
Daniel Tunkelang
 
Web science - How is it different?
Web science - How is it different?Web science - How is it different?
Web science - How is it different?
Daniel Tunkelang
 
Find and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedInFind and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedIn
Daniel Tunkelang
 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal Journey
Daniel Tunkelang
 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?
Daniel Tunkelang
 
Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem
Daniel Tunkelang
 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The People
Daniel Tunkelang
 

More from Daniel Tunkelang (20)

Query Understanding and Ecommerce
Query Understanding and EcommerceQuery Understanding and Ecommerce
Query Understanding and Ecommerce
 
Semantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce QueriesSemantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce Queries
 
Helping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query UnderstandingHelping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query Understanding
 
MMM, Search!
MMM, Search!MMM, Search!
MMM, Search!
 
Enterprise Intelligence
Enterprise IntelligenceEnterprise Intelligence
Enterprise Intelligence
 
Query Understanding: A Manifesto
Query Understanding: A ManifestoQuery Understanding: A Manifesto
Query Understanding: A Manifesto
 
Where should you put your data scientists?
Where should you put your data scientists?Where should you put your data scientists?
Where should you put your data scientists?
 
Data Science: A Mindset for Productivity
Data Science: A Mindset for ProductivityData Science: A Mindset for Productivity
Data Science: A Mindset for Productivity
 
My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine Learning
 
Web science - How is it different?
Web science - How is it different?Web science - How is it different?
Web science - How is it different?
 
Better Search Through Query Understanding
Better Search Through Query UnderstandingBetter Search Through Query Understanding
Better Search Through Query Understanding
 
Social Search in a Professional Context
Social Search in a Professional ContextSocial Search in a Professional Context
Social Search in a Professional Context
 
Find and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedInFind and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedIn
 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal Journey
 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?
 
Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
 
Information, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of NeedsInformation, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of Needs
 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The People
 
Content, Connections, and Context
Content, Connections, and ContextContent, Connections, and Context
Content, Connections, and Context
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 

Enabling Exploration Through Text Analytics