SlideShare a Scribd company logo
1 of 1
Download to read offline
A Spoken Dialogue System for
   Electronic Program Guide Information Access
                                                                       Seokhwan Kim, Cheongjae Lee, Sangkeun Jung, and Gary Geunbae Lee
                                                                       Pohang University of Science and Technology (POSTECH), South Korea

                          ABSTRACT                                                          AUTOMATIC SPEECH                                                                                   SPOKEN LANGUAGE                                                    EPG DATABASE MANAGER
  In this paper, we present POSTECH Spoken Dialogue System                                    RECOGNIZER                                                                                        UNDERSTANDING                                                  The main purpose of the EPG database manager is to build
 for Electronic Program Guide Information Access (POSSDS-                                                                                                                                                                                                     a content database for the other modules in POSSDS-EPG
 EPG). POSSDS-EPG consists of automatic speech recognizer,                    To build the language model, the candidate utterances that                                           The SLU module of POSSDS-EPG was constructed by a
                                                                                                                                                                                                                                                              with minimal human effort.
 spoken language understanding, dialogue manager, system                     have high probability of being spoken by users are required. We                                      concept spotting approach which aims to extract only the
                                                                                                                                                                                                                                                               We chose an EPG website (http://www.epg.co.kr) dealing
 utterance generator, text-to-speech synthesizer, and EPG                    generate the candidate utterances automatically by using the                                         essential information for predefined meaning representation
                                                                                                                                                                                                                                                              with the information on Korean TV programs. The EPG
 database manager. Each module is designed and implemented                   dialogue examples in the existing example database and the                                           slots. The semantic frame is made up of these slots including
                                                                                                                                                                                                                                                              database manager builds a contents database from the
 to make an effective and practical spoken dialogue system. In               retrieved result from the up-to-date EPG database.                                                   dialogue act, main action, and component slots for the EPG
                                                                                                                                                                                                                                                              information on the website.
 particular, in order to reflect the up-to-date EPG information                                                                                                                   domain.
                                                                               An Existing Utterance                                                                               We regarded the SLU problem as a classification problem,
 which is updated frequently and periodically, we applied a web-                                                                                                                                                                                                     WEB PAGES
                                                                               I want to watch drama Hae-Sin around .                                                             which can be solved by statistical machine learning frame-
 mining technology to the EPG database manager, which builds                   [genre = drama], [program_name = Hae-Sin], [time = 9 pm]
 the content database based on automatically extracted                                                                                                                            works. To build a statistical model for the SLU problem, we
                                                                               Retrieved Results                                                                                                                                                                       Contents       Contents

 information from popular EPG websites. The automatically                      [genre = movie], [program_name = Monster], [time = 11 pm]                                          should prepare the training corpus containing utterances that                        Filtering       Tables


 generated content database is used by other modules in the                    [genre = sports], [program_name = Basketball], [time = 7 pm]                                       have high probability of being spoken by users. We can easily
                                                                               Candidate Utterances                                                                               create a training corpus by reusing the candidate utterances that                                 Information    Extracted
 system for building their own resources. Evaluations show that                                                                                                                                                                                                                      Extraction   Information
                                                                               I want to watch movie Monster around .                                                             are used for building the language model in the speech
 our system performs EPG access task in high performance and
                                                                               I want to watch sports Basketball around .                                                         recognizer.
 can be managed with low cost.                                                                                                                                                                                                                                                                    Building
                                                                                                                                                                                                                                                                                                                EPG DB
                                                                                                                                                                                                                                                                                                    DB




      POSSDS-EPG: POSTECH                                                                   DIALOGUE MANAGER                                                                                   SYSTEM UTTERANCE                                                                    EVALUATIONS
    SPOKEN DIALOGUE SYSTEM                                                                                                                                                                        GENERATOR                                                                             Manually      Automatically Man
                                                                              To develop an effective and practical spoken dialogue system,                                                                                                                        Evaluation
        FOR EPG DOMAIN                                                       we proposed the situation-based dialogue management method                                                                                                                                TCR
                                                                                                                                                                                                                                                                                     Managed System
                                                                                                                                                                                                                                                                                          0.76
                                                                                                                                                                                                                                                                                                         aged System
                                                                                                                                                                                                                                                                                                             0.72
                                                                                                                                                                                    The system utterance generator generates the literal sys-tem
                                                                             using dialogue examples. For the system utterance generation,                                                                                                                             STR                0.65               0.62
 POSSDS-EPG consists of a set of appropriate modules that are                                                                                                                      utterances based on the system action tag and the utterance
                                                                             we automatically construct and index a dialogue example                                                                                                                                  MRA                 0.85               0.85
designed to be connected to each other according to the order. The                                                                                                                 generating template. Each system action tag has at least one
                                                                             database from the dialogue corpus. The dialogue manager                                                                                                                             User Satisfaction        0.75               0.73
overall system aims to output the synthesized spoken response                                                                                                                      utterance generating template which is constructed manually.                 TCR: User Perception of Task Completion Rate
                                                                             retrieves the best dialogue example for the current dialogue
corresponding to an input utterance spoken by the user..                                                                                                                           The system utterance generating task is advanced by filling                  STR: Success Turn Rate
                                                                             situation, which includes a current user utterance, semantic
                                                                                                                                                                                   slots in the template with proper values, such as retrieving                 MRA: Mean Recognition Accuracy
                                                                             frame and discourse history. From the retrieved result, the                                                                                                                        User Satisfaction = aTCR + bSTR + rMRA
                                                                                                                                                                                   results from the EPG database, slot values in the semantic
                                                                             dialogue manager determines the system action tag from the
  User Utterance           ASR                   Language                                                                                                                          frame, and constituents in the discourse history.
                                                 MODEL                       pre-defined tag set.
                                                                                                                                                                                                                                                                          IMPLEMENTATION
                                          NLU
                           SLU
                                         MODEL                 WEB
                                                                                                                                                                Dialogue              System Action Tag    Inform_Channel
                              Semantic
         Meta-Rules           Frame                                                         User’s Utterance                                                    Corpus
                                                                                                                                                                                                           [program_name]은 [channel]에서 합니다.
          For DM
                         Dialogue                                                                                                                                     Automatic       Utterance Template   ( [program_name] eun [channel] e-seo hap-ni-da )
                         Manager                                                                               User         Semantic          Discourse               Indexing                             [program_name] is broadcasted on [channel].
         Dialogue                                                                                            Intention       Frame             History
        Example DB            System                                             System         Domain                                                                                   Slot Values       [program_name = 해신, channel = KBS]
                              Action                                                            Expert
                                                            EPG DB              Responses                                                                       Dialogue
                                                                                                                 Query Generation
         Meta-Rules   System Response      EPG DB
                                                            Manager
                                                                                                                                                               Example DB                                  해신은 KBS에서 합니다.
          For SRG        Generator                                                                                                                                                     System Utterance    ( Hae-Sin eun KBS e-seo hap-ni-da )
                                                                                                                   Utterance Similarity                   Retrieval                                        Hae-Sin is broadcasted on KBS.
                                                                                                                    Lexico-semantic Similarity
                                                                                                                    Discourse history Similarity
                                                                                             Best Dialogue                                                     Dialogue
                           TTS                      System Utterance                                                                                           Examples
                                                                                               Example                   Tie-breaking



           Overview of POSSDS-EPG System Architecture

More Related Content

Similar to A spoken dialog system for electronic program guide information access

VIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdfVIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdfFreeFire293813
 
SPEECH BASED EMOTION RECOGNITION USING VOICE
SPEECH BASED  EMOTION RECOGNITION USING VOICESPEECH BASED  EMOTION RECOGNITION USING VOICE
SPEECH BASED EMOTION RECOGNITION USING VOICEVamshidharSingh
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech RecognitionThejus Joby
 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IRJET Journal
 
IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...
IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...
IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...IRJET Journal
 
silent sound technology pdf
silent sound technology pdfsilent sound technology pdf
silent sound technology pdfrahul mishra
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing SystemIRJET Journal
 
IRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine LearningIRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine LearningIRJET Journal
 
dic-160603172047.pdf
dic-160603172047.pdfdic-160603172047.pdf
dic-160603172047.pdfAkhilJoseph63
 
OCR speech using Labview
OCR speech using LabviewOCR speech using Labview
OCR speech using LabviewBharat Thakur
 
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET Journal
 
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLEMULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLEIRJET Journal
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET Journal
 
Natural language processing (Python)
Natural language processing (Python)Natural language processing (Python)
Natural language processing (Python)Sumit Raj
 

Similar to A spoken dialog system for electronic program guide information access (20)

VIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdfVIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdf
 
SPEECH BASED EMOTION RECOGNITION USING VOICE
SPEECH BASED  EMOTION RECOGNITION USING VOICESPEECH BASED  EMOTION RECOGNITION USING VOICE
SPEECH BASED EMOTION RECOGNITION USING VOICE
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
team10.ppt.pptx
team10.ppt.pptxteam10.ppt.pptx
team10.ppt.pptx
 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
 
IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...
IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...
IRJET- Designing and Creating Punjabi Speech Synthesis System using Hidden Ma...
 
silent sound technology pdf
silent sound technology pdfsilent sound technology pdf
silent sound technology pdf
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
 
IRJET- Vocal Code
IRJET- Vocal CodeIRJET- Vocal Code
IRJET- Vocal Code
 
Software
SoftwareSoftware
Software
 
IRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine LearningIRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine Learning
 
Surface realization
Surface realizationSurface realization
Surface realization
 
43
4343
43
 
Speech Dubbing Software
Speech Dubbing SoftwareSpeech Dubbing Software
Speech Dubbing Software
 
dic-160603172047.pdf
dic-160603172047.pdfdic-160603172047.pdf
dic-160603172047.pdf
 
OCR speech using Labview
OCR speech using LabviewOCR speech using Labview
OCR speech using Labview
 
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation System
 
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLEMULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
 
Natural language processing (Python)
Natural language processing (Python)Natural language processing (Python)
Natural language processing (Python)
 

More from Seokhwan Kim

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)Seokhwan Kim
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Seokhwan Kim
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingSeokhwan Kim
 
The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)Seokhwan Kim
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionSeokhwan Kim
 
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Seokhwan Kim
 
The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)Seokhwan Kim
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Seokhwan Kim
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Seokhwan Kim
 
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...Seokhwan Kim
 
Sequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSeokhwan Kim
 
Wikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingWikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingSeokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...Seokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...Seokhwan Kim
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionSeokhwan Kim
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...Seokhwan Kim
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...Seokhwan Kim
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionSeokhwan Kim
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템Seokhwan Kim
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionSeokhwan Kim
 

More from Seokhwan Kim (20)

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
 
The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
 
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
 
The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
 
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
 
Sequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog States
 
Wikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingWikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic Tracking
 
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognition
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation Detection
 

Recently uploaded

Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 

Recently uploaded (20)

Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 

A spoken dialog system for electronic program guide information access

  • 1. A Spoken Dialogue System for Electronic Program Guide Information Access Seokhwan Kim, Cheongjae Lee, Sangkeun Jung, and Gary Geunbae Lee Pohang University of Science and Technology (POSTECH), South Korea ABSTRACT AUTOMATIC SPEECH SPOKEN LANGUAGE EPG DATABASE MANAGER In this paper, we present POSTECH Spoken Dialogue System RECOGNIZER UNDERSTANDING The main purpose of the EPG database manager is to build for Electronic Program Guide Information Access (POSSDS- a content database for the other modules in POSSDS-EPG EPG). POSSDS-EPG consists of automatic speech recognizer, To build the language model, the candidate utterances that The SLU module of POSSDS-EPG was constructed by a with minimal human effort. spoken language understanding, dialogue manager, system have high probability of being spoken by users are required. We concept spotting approach which aims to extract only the We chose an EPG website (http://www.epg.co.kr) dealing utterance generator, text-to-speech synthesizer, and EPG generate the candidate utterances automatically by using the essential information for predefined meaning representation with the information on Korean TV programs. The EPG database manager. Each module is designed and implemented dialogue examples in the existing example database and the slots. The semantic frame is made up of these slots including database manager builds a contents database from the to make an effective and practical spoken dialogue system. In retrieved result from the up-to-date EPG database. dialogue act, main action, and component slots for the EPG information on the website. particular, in order to reflect the up-to-date EPG information domain. An Existing Utterance We regarded the SLU problem as a classification problem, which is updated frequently and periodically, we applied a web- WEB PAGES I want to watch drama Hae-Sin around . which can be solved by statistical machine learning frame- mining technology to the EPG database manager, which builds [genre = drama], [program_name = Hae-Sin], [time = 9 pm] the content database based on automatically extracted works. To build a statistical model for the SLU problem, we Retrieved Results Contents Contents information from popular EPG websites. The automatically [genre = movie], [program_name = Monster], [time = 11 pm] should prepare the training corpus containing utterances that Filtering Tables generated content database is used by other modules in the [genre = sports], [program_name = Basketball], [time = 7 pm] have high probability of being spoken by users. We can easily Candidate Utterances create a training corpus by reusing the candidate utterances that Information Extracted system for building their own resources. Evaluations show that Extraction Information I want to watch movie Monster around . are used for building the language model in the speech our system performs EPG access task in high performance and I want to watch sports Basketball around . recognizer. can be managed with low cost. Building EPG DB DB POSSDS-EPG: POSTECH DIALOGUE MANAGER SYSTEM UTTERANCE EVALUATIONS SPOKEN DIALOGUE SYSTEM GENERATOR Manually Automatically Man To develop an effective and practical spoken dialogue system, Evaluation FOR EPG DOMAIN we proposed the situation-based dialogue management method TCR Managed System 0.76 aged System 0.72 The system utterance generator generates the literal sys-tem using dialogue examples. For the system utterance generation, STR 0.65 0.62 POSSDS-EPG consists of a set of appropriate modules that are utterances based on the system action tag and the utterance we automatically construct and index a dialogue example MRA 0.85 0.85 designed to be connected to each other according to the order. The generating template. Each system action tag has at least one database from the dialogue corpus. The dialogue manager User Satisfaction 0.75 0.73 overall system aims to output the synthesized spoken response utterance generating template which is constructed manually. TCR: User Perception of Task Completion Rate retrieves the best dialogue example for the current dialogue corresponding to an input utterance spoken by the user.. The system utterance generating task is advanced by filling STR: Success Turn Rate situation, which includes a current user utterance, semantic slots in the template with proper values, such as retrieving MRA: Mean Recognition Accuracy frame and discourse history. From the retrieved result, the User Satisfaction = aTCR + bSTR + rMRA results from the EPG database, slot values in the semantic dialogue manager determines the system action tag from the User Utterance ASR Language frame, and constituents in the discourse history. MODEL pre-defined tag set. IMPLEMENTATION NLU SLU MODEL WEB Dialogue System Action Tag Inform_Channel Semantic Meta-Rules Frame User’s Utterance Corpus [program_name]은 [channel]에서 합니다. For DM Dialogue Automatic Utterance Template ( [program_name] eun [channel] e-seo hap-ni-da ) Manager User Semantic Discourse Indexing [program_name] is broadcasted on [channel]. Dialogue Intention Frame History Example DB System System Domain Slot Values [program_name = 해신, channel = KBS] Action Expert EPG DB Responses Dialogue Query Generation Meta-Rules System Response EPG DB Manager Example DB 해신은 KBS에서 합니다. For SRG Generator System Utterance ( Hae-Sin eun KBS e-seo hap-ni-da ) Utterance Similarity Retrieval Hae-Sin is broadcasted on KBS.  Lexico-semantic Similarity  Discourse history Similarity Best Dialogue Dialogue TTS System Utterance Examples Example Tie-breaking Overview of POSSDS-EPG System Architecture