SlideShare a Scribd company logo
1 of 9
Download to read offline
How to make Emora talk
about Sports intelligently
Emory NLP Weekly
9/2/2020
Sonny Xu
Summary
1. Previous approach
2. Sports Database
3. Improvement: how to make an intelligent component
a. Text-to-SQL integration
b. Data analysis and prediction
c. Opinions
4. Expected challenges
5. Expected result
Previous approach
● Template based dialog organized by different states
● Everything is based on rule
○ The way to generate response
○ The way of transition between two states
● While it’s relatively easy to implement, we are not able to talk about
something hasn’t been prepared.
Database Tables in NBA database
Table nbagames description
● Multi tables
● Lots of abbreviations in
table/column names
● Primary keys:
○ team name
○ game id (combination of date and
team name)
○ player name
New approach
● Three main steps
● At the beginning,
we will focus on
second layer
Overall Logic Flow
Text-to-SQL integration
Spider
● Zero shot
● Multi tables in multi domains
● Questions are complex
WikiSQL
● Cannot be executed against unseen
tables
● Single table in multi domains
● Questions are easier
Challenge
● The database in spider dataset is well
defined, especially for the table name
and column name. However, for sports
database, names are more complex.
● Current research is limited to Exact Set
Match without Values
● Understanding sentences that requires
additional information
○ Which team won the match between Heat
and Bucks yesterday?
Table nbagames description
Prediction
● Will be implemented as a collection
● For most cases, simple linear regression will be sufficient, such as match prediction,
player performance evaluation
○ Why do we need our own prediction while not using third-party resources?
Expected Result
U: How many points did Lebron James get in the match yesterday?
E: He got 30 points. I think he did a great job.
U: Why do you think so?
E: He had a high success rate for both 2-pointers and 3-pointers.
U: I agree. Do you think Los Angeles Lakers will win again in the next match?
E: Yes, because James was playing really well for the past several matches.

More Related Content

More from Jinho Choi

More from Jinho Choi (20)

CKY Parsing
CKY ParsingCKY Parsing
CKY Parsing
 
CS329 - WordNet Similarities
CS329 - WordNet SimilaritiesCS329 - WordNet Similarities
CS329 - WordNet Similarities
 
CS329 - Lexical Relations
CS329 - Lexical RelationsCS329 - Lexical Relations
CS329 - Lexical Relations
 
Automatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementAutomatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue Management
 
Attention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingAttention is All You Need for AMR Parsing
Attention is All You Need for AMR Parsing
 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to Dialogue
 
Real-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingReal-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue Understanding
 
Topological Sort
Topological SortTopological Sort
Topological Sort
 
Tries - Put
Tries - PutTries - Put
Tries - Put
 
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseMulti-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
 
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsBuilding Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
 
Text-to-SQL with Data-Driven Templates
Text-to-SQL with Data-Driven TemplatesText-to-SQL with Data-Driven Templates
Text-to-SQL with Data-Driven Templates
 
Resume Classification with Term Attention Embeddings
Resume Classification with Term Attention EmbeddingsResume Classification with Term Attention Embeddings
Resume Classification with Term Attention Embeddings
 
[DSA-Java] Heap Sort
[DSA-Java] Heap Sort[DSA-Java] Heap Sort
[DSA-Java] Heap Sort
 
Multilingual Propbank Annotation Tools: Cornerstone and Jubilee
Multilingual Propbank Annotation Tools: Cornerstone and JubileeMultilingual Propbank Annotation Tools: Cornerstone and Jubilee
Multilingual Propbank Annotation Tools: Cornerstone and Jubilee
 
Hierarchical Entity Extraction and Ranking with Unsupervised Graph Convolutions
Hierarchical Entity Extraction and Ranking with Unsupervised Graph ConvolutionsHierarchical Entity Extraction and Ranking with Unsupervised Graph Convolutions
Hierarchical Entity Extraction and Ranking with Unsupervised Graph Convolutions
 
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue
Transformers to Learn Hierarchical Contexts in Multiparty DialogueTransformers to Learn Hierarchical Contexts in Multiparty Dialogue
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue
 
Analysis of a State Machine-based Interactive Dialogue Management System
Analysis of a State Machine-based Interactive Dialogue Management SystemAnalysis of a State Machine-based Interactive Dialogue Management System
Analysis of a State Machine-based Interactive Dialogue Management System
 
Level-based Resume Classification on Nursing Positions
Level-based Resume Classification on Nursing PositionsLevel-based Resume Classification on Nursing Positions
Level-based Resume Classification on Nursing Positions
 
NLP Rankings: Publication-based Ranking System and Platform for NLP Research
NLP Rankings: Publication-based Ranking System and Platform for NLP ResearchNLP Rankings: Publication-based Ranking System and Platform for NLP Research
NLP Rankings: Publication-based Ranking System and Platform for NLP Research
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

How to make Emora talk about Sports Intelligently

  • 1. How to make Emora talk about Sports intelligently Emory NLP Weekly 9/2/2020 Sonny Xu
  • 2. Summary 1. Previous approach 2. Sports Database 3. Improvement: how to make an intelligent component a. Text-to-SQL integration b. Data analysis and prediction c. Opinions 4. Expected challenges 5. Expected result
  • 3. Previous approach ● Template based dialog organized by different states ● Everything is based on rule ○ The way to generate response ○ The way of transition between two states ● While it’s relatively easy to implement, we are not able to talk about something hasn’t been prepared.
  • 4. Database Tables in NBA database Table nbagames description ● Multi tables ● Lots of abbreviations in table/column names ● Primary keys: ○ team name ○ game id (combination of date and team name) ○ player name
  • 5. New approach ● Three main steps ● At the beginning, we will focus on second layer Overall Logic Flow
  • 6. Text-to-SQL integration Spider ● Zero shot ● Multi tables in multi domains ● Questions are complex WikiSQL ● Cannot be executed against unseen tables ● Single table in multi domains ● Questions are easier
  • 7. Challenge ● The database in spider dataset is well defined, especially for the table name and column name. However, for sports database, names are more complex. ● Current research is limited to Exact Set Match without Values ● Understanding sentences that requires additional information ○ Which team won the match between Heat and Bucks yesterday? Table nbagames description
  • 8. Prediction ● Will be implemented as a collection ● For most cases, simple linear regression will be sufficient, such as match prediction, player performance evaluation ○ Why do we need our own prediction while not using third-party resources?
  • 9. Expected Result U: How many points did Lebron James get in the match yesterday? E: He got 30 points. I think he did a great job. U: Why do you think so? E: He had a high success rate for both 2-pointers and 3-pointers. U: I agree. Do you think Los Angeles Lakers will win again in the next match? E: Yes, because James was playing really well for the past several matches.