SlideShare a Scribd company logo
WIKIPEDIA-BASED KERNELS
FOR DIALOGUE TOPIC TRACKING
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Human Language Technology Department
Institute for Infocomm Research (I2R)
6th May 2014
ICASSP, Florence, Italy
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 2
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 3
Motivation
• Spoken Dialogue Systems
– Next-generation User Interface
– The most natural way for human-human communication
• Single-task dialogues
– Most previous work focuses on single target task
• Eg. Flight Reservation, Bus Information Guide, Restaurant Booking
– Cause limitations in practical uses
• Multi-task dialogues
– [Lin et al. 1999, Ikeda et al. 2008, Celikyilmaz et al. 2011]
– Selecting the most probable system at each turn
– Each system is independently built and operated from others
Pg 4
Related Work
• Text categorization-based Dialogue Topic Identification
– [Nakata et al., 2002; Lagus&Kuusisto, 2002; Adams&Martell, 2008]
– Differences from written texts
• Determinations of topics
– User’s intentions
– System’s decisions
• Available features
– Unable to see the future turns
• Knowledge-based Dialogue Topic Suggestion
– External Knowledge Sources
• Eg. Domain Models, Heuristics, Agendas
• [Roy&Subramaniam, 2006; Young et al., 2007; Bohus&Rudnicky, 2003; Lee et al. 2008
– Limited flexibility
• To handle user-initiative cases
– High cost
• To build a sufficient amount of resources
Pg 5
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 6
Dialogue Topic Tracking
• Subtasks
– Dialogue Segmentation
• Segmenting a session into topically coherent sub-dialogues
– Topic Transition Identification
• Identifying the next topic category at each time of topic transition
Pg 7
Dialogue Topic Tracking
• Example
Pg 8
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 9
Wikipedia-based Kernel Method
• Vector Space Model
– The simplest approach to represent features for supervised machine
learning methods
– An instance for each turn  A weighted term vector
– Lack of semantic or domain-specific aspects
• Each word is considered as an independent and identical unit
Pg 10
Wikipedia-based Kernel Method
• Wikipedia for Dialogue Topic Tracking
– As an external knowledge source
– Without significant effort for building resources
– Previous work
• [Breuing et al., 2011; Wilcock, 2012]
• Focusing only on a single type of information from Wikipedia
• Wikipedia-based Kernel Method
– Aiming at incorporating various knowledge from Wikipedia
– To map the data into a higher dimensional feature space
• Vector Space Extension
• Vector Transformation
Pg 11
Wikipedia-based Kernel Method
• Vector Extension
Pg 12
Term Vector Concept Vector
…
U: ---------------------------
S: ---------------------------
U: ---------------------------
S: ---------------------------
U: ---------------------------
S: ---------------------------
…
x
β1
β2
Β|D|
d1
d2
d|D|
⁞
Wikipedia-based Kernel Method
• Vector Transformation
– Each extended vector is transformed into a new space
– Transformation Matrix S
– s(di, dj) is the relatedness between di and dj
– Update of Concept Vector Values
Pg 13
Wikipedia-based Kernel Method
Pg 14
Measures of Contextual Relatedness
• How to compute s(di, dj)?
• Category Relatedness
– Based on hierarchical structures of Wikipedia categories
• depth(d): the length of the path from the root node to d
• lcs(di, dj): the least common subsume of the two articles in the hierarchy
Pg 15
Measures of Contextual Relatedness
• Category Overlap Score
– Based on the ratio of common categories of two concepts
– By Jaccard’s coefficient
• Contents Similarity
– Based on the cosine similarity between term vectors from the body texts
Pg 16
Measures of Contextual Relatedness
• Co-occurrence Frequency
– To represent the discourse relatedness obtained from Wikipedia
– Assumption
• The more frequently the mentions about two concepts co-occurred
• The more similar aspects both concepts take in dialogue flows
– By normalized point-wise mutual information
Pg 17
Measures of Contextual Relatedness
• Geographical Closeness
– Domain-specific Measure
– Based on the geographic coordinate information of spatial concepts
• Final Score
Pg 18
Measures of Contextual Relatedness
Pg 19
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 20
Evaluation
• Dataset
– Dialogue Corpus on Singapore Tour Guide
• Real human-human mixed initiative conversations
• Between guides and tourists
• Stats
– 35 dialogue sessions
– 21 hours
– 19,651 utterances
• Topics
– 1,642 topic segments
– 9 topic categories
» Opening, Closing, Itinerary, Accommodation, Attraction, Food,
Transportation, Shopping, Other
– Wikipedia Collection
• 3,115 articles related to Singapore
• Collected from Wikipedia database dump as of Feb 2013
Pg 21
Evaluation
• Models
– Training Instances
• 8,318 instances for user-turn-level segmentation
• 1,607 instances for dialogue-segment-level topic prediction
– Support Vector Machine (SVM) Models
• BOW: Baseline only with term vector space
• WK0: Extended vector without transformation
• WK1: s(di, dj) = s1(di, dj)
• WK2: s(di, dj) = s1(di, dj) + s2(di, dj)
• WK3: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj)
• WK4: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj)
• WK5: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj) + s5(di, dj)
• Metrics
– Five-fold Cross-validation
– Segmentation: P/R/F
– Topic Prediction: Accuracy
Pg 22
Evaluation
• Comparison of dialogue topic tracking performances
Pg 23
Evaluation
• Distributions of errors on the cascaded results with WK5
– 71.4% of errors result from segmentation
– 60.0% of errors occurred for system-initiative cases
Pg 24
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 25
Conclusions
• Summary
– Wikipedia-based Kernel Method for Dialogue Topic Tracking
– To incorporate various types of information from Wikipedia
– Experimental results show the merits of our proposed approach in
mixed-initiative dialogues
• Ongoing Work
– Using more various types of knowledge from Wikipedia
– To be presented at ACL 2014
• A Composite Kernel Approach for Dialog Topic Tracking with Structured
Domain Knowledge from Wikipedia
Pg 26
Thank You
Pg 27
Contact: kims@i2r.a-star.edu.sg
References
• B. Lin, H. Wang, and L. Lee, “A distributed architecture for cooperative spoken dialogue agents with coherent dialogue state and history,” in Proceedings
of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1999.
• S. Ikeda, K. Komatani, T. Ogata, H. G. Okuno, and H. G. Okuno, “Extensibility verification of robust domain selection against out-of-grammar utterances
in multidomain spoken dialogue system.,” in Proceedings of the 9th Annual Conference of the International Speech Communicatiuon Association
(INTERSPEECH), 2008, pp. 487–490.
• A. Celikyilmaz, D. Hakkani-T¨ur, and G. T¨ur, “Approximate inference for domain detection in spoken language understanding.,” in Proceedings of the
12th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2011, pp. 713–716.
• T. Nakata, S. Ando, and A. Okumura, “Topic detection based on dialogue history,” in Proceedings of the 19th international conference on Computational
linguistics (COLING), 2002, pp. 1–7.
• K. Lagus and J. Kuusisto, “Topic identification in natural language dialogues using neural networks,” in Proceedings of the 3rd SIGdial workshop on
Discourse and dialogue, 2002, pp. 95–102.
• P. H. Adams and C. H. Martell, “Topic detection and extraction in chat,” in Proceedings of the 2008 IEEE International Conference on Semantic
Computing, 2008, pp. 581–588.
• S. Roy and L. V. Subramaniam, “Automatic generation of domain models for call centers from noisy transcriptions,” in Proceedings of the 21st
International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, 2006, pp. 737–744.
• S. Young, J. Schatzmann, K. Weilhammer, and H. Ye, “The hidden information state approach to dialog management,” in Proceedings of the
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2007.
• D. Bohus and A. Rudnicky, “Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda,” in Proceedings of the
European Conference on Speech, Communication and Technology, 2003, pp. 597–600.
• C. Lee, S. Jung, and G. G. Lee, “Robust dialog management with n-best hypotheses using dialog examples and agenda.,” in Proceedings of the 46th
Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2008, pp. 630–637.
• G. Salton, A. Wong, and C.S. Yang, “A vector space model for automatic indexing,” Communications of the ACM, vol. 18, no. 11, pp. 613–620, 1975.
• G.Wilcock, “Wikitalk: a spoken wikipedia-based opendomain knowledge access system,” in Proceedings of the Workshop on Question Answering for
Complex Domains, 2012, p. 5770.
• A. Breuing, U. Waltinger, and I. Wachsmuth, “Harvesting wikipedia knowledge to identify topics in ongoing natural language dialogs,” in Proceedings of
the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011, pp. 445–450.
• P. Wang and C. Domeniconi, “Building semantic kernels for text classification using wikipedia,” in Proceedings of the 14th ACM SIGKDD international
conference on Knowledge discovery and data mining, 2008, pp. 713–721.
• Z. Wu and M. Palmer, “Verbs semantics and lexical selection,” in Proceedings of the 32nd annual meeting on Association for Computational Linguistics,
1994, pp.133–138.
• C. C. Chang and C. J. Lin, “Libsvm: a library for support vector machines,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 2, no.
3, pp. 27, 2011.

More Related Content

Similar to Wikipedia-based Kernels for Dialogue Topic Tracking

Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
Ashis Kumar Chanda
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
Ashis Chanda
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
OCLC
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Lynn Connaway
 
Towards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial FindingsTowards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial Findings
alc28
 
Rizal Conference
Rizal ConferenceRizal Conference
Rizal Conferencerajenmunoo
 
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
tobold
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
Sebastian Dennerlein
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generation
Elena Simperl
 
Experiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked DataExperiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked Data
Oscar Corcho
 
Social media as a tool for terminological research
Social media as a tool for terminological researchSocial media as a tool for terminological research
Social media as a tool for terminological research
TERMCAT
 
Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...
cafuchs
 
Web 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature ClassroomWeb 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature ClassroomClaudia Warth
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
Libcorpio
 
Wusteman Ticer09
Wusteman Ticer09Wusteman Ticer09
Wusteman Ticer09
Judith Wusteman
 
Web 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language TeachingWeb 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language TeachingClaudia Warth
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Electronic Resources & Libraries
 
Standards for Smart Learning Environments
Standards for Smart Learning EnvironmentsStandards for Smart Learning Environments
Standards for Smart Learning Environments
Tore Hoel
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
TimelessFuture
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
Carole Goble
 

Similar to Wikipedia-based Kernels for Dialogue Topic Tracking (20)

Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
 
Towards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial FindingsTowards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial Findings
 
Rizal Conference
Rizal ConferenceRizal Conference
Rizal Conference
 
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generation
 
Experiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked DataExperiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked Data
 
Social media as a tool for terminological research
Social media as a tool for terminological researchSocial media as a tool for terminological research
Social media as a tool for terminological research
 
Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...
 
Web 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature ClassroomWeb 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature Classroom
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
Wusteman Ticer09
Wusteman Ticer09Wusteman Ticer09
Wusteman Ticer09
 
Web 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language TeachingWeb 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language Teaching
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
 
Standards for Smart Learning Environments
Standards for Smart Learning EnvironmentsStandards for Smart Learning Environments
Standards for Smart Learning Environments
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 

More from Seokhwan Kim

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
Seokhwan Kim
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Seokhwan Kim
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
Seokhwan Kim
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
Seokhwan Kim
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Seokhwan Kim
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Seokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...Seokhwan Kim
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionSeokhwan Kim
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...Seokhwan Kim
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessSeokhwan Kim
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...Seokhwan Kim
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionSeokhwan Kim
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템Seokhwan Kim
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionSeokhwan Kim
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...Seokhwan Kim
 

More from Seokhwan Kim (15)

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognition
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information access
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation Detection
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
 

Recently uploaded

top nidhi software solution freedownload
top nidhi software solution freedownloadtop nidhi software solution freedownload
top nidhi software solution freedownload
vrstrong314
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 
RISE with SAP and Journey to the Intelligent Enterprise
RISE with SAP and Journey to the Intelligent EnterpriseRISE with SAP and Journey to the Intelligent Enterprise
RISE with SAP and Journey to the Intelligent Enterprise
Srikant77
 
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
rickgrimesss22
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
Globus
 
How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
Globus
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
Philip Schwarz
 
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamOpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
takuyayamamoto1800
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Natan Silnitsky
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
Ortus Solutions, Corp
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
Google
 
Prosigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology SolutionsProsigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology Solutions
Prosigns
 
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdfEnhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Jay Das
 
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, BetterWebinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
XfilesPro
 
Corporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMSCorporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMS
Tendenci - The Open Source AMS (Association Management Software)
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
Paco van Beckhoven
 

Recently uploaded (20)

top nidhi software solution freedownload
top nidhi software solution freedownloadtop nidhi software solution freedownload
top nidhi software solution freedownload
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 
RISE with SAP and Journey to the Intelligent Enterprise
RISE with SAP and Journey to the Intelligent EnterpriseRISE with SAP and Journey to the Intelligent Enterprise
RISE with SAP and Journey to the Intelligent Enterprise
 
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
 
How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
 
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamOpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
 
Prosigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology SolutionsProsigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology Solutions
 
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdfEnhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
 
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, BetterWebinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
 
Corporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMSCorporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMS
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
 

Wikipedia-based Kernels for Dialogue Topic Tracking

  • 1. WIKIPEDIA-BASED KERNELS FOR DIALOGUE TOPIC TRACKING Seokhwan Kim, Rafael E. Banchs, Haizhou Li Human Language Technology Department Institute for Infocomm Research (I2R) 6th May 2014 ICASSP, Florence, Italy
  • 2. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 2
  • 3. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 3
  • 4. Motivation • Spoken Dialogue Systems – Next-generation User Interface – The most natural way for human-human communication • Single-task dialogues – Most previous work focuses on single target task • Eg. Flight Reservation, Bus Information Guide, Restaurant Booking – Cause limitations in practical uses • Multi-task dialogues – [Lin et al. 1999, Ikeda et al. 2008, Celikyilmaz et al. 2011] – Selecting the most probable system at each turn – Each system is independently built and operated from others Pg 4
  • 5. Related Work • Text categorization-based Dialogue Topic Identification – [Nakata et al., 2002; Lagus&Kuusisto, 2002; Adams&Martell, 2008] – Differences from written texts • Determinations of topics – User’s intentions – System’s decisions • Available features – Unable to see the future turns • Knowledge-based Dialogue Topic Suggestion – External Knowledge Sources • Eg. Domain Models, Heuristics, Agendas • [Roy&Subramaniam, 2006; Young et al., 2007; Bohus&Rudnicky, 2003; Lee et al. 2008 – Limited flexibility • To handle user-initiative cases – High cost • To build a sufficient amount of resources Pg 5
  • 6. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 6
  • 7. Dialogue Topic Tracking • Subtasks – Dialogue Segmentation • Segmenting a session into topically coherent sub-dialogues – Topic Transition Identification • Identifying the next topic category at each time of topic transition Pg 7
  • 9. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 9
  • 10. Wikipedia-based Kernel Method • Vector Space Model – The simplest approach to represent features for supervised machine learning methods – An instance for each turn  A weighted term vector – Lack of semantic or domain-specific aspects • Each word is considered as an independent and identical unit Pg 10
  • 11. Wikipedia-based Kernel Method • Wikipedia for Dialogue Topic Tracking – As an external knowledge source – Without significant effort for building resources – Previous work • [Breuing et al., 2011; Wilcock, 2012] • Focusing only on a single type of information from Wikipedia • Wikipedia-based Kernel Method – Aiming at incorporating various knowledge from Wikipedia – To map the data into a higher dimensional feature space • Vector Space Extension • Vector Transformation Pg 11
  • 12. Wikipedia-based Kernel Method • Vector Extension Pg 12 Term Vector Concept Vector … U: --------------------------- S: --------------------------- U: --------------------------- S: --------------------------- U: --------------------------- S: --------------------------- … x β1 β2 Β|D| d1 d2 d|D| ⁞
  • 13. Wikipedia-based Kernel Method • Vector Transformation – Each extended vector is transformed into a new space – Transformation Matrix S – s(di, dj) is the relatedness between di and dj – Update of Concept Vector Values Pg 13
  • 15. Measures of Contextual Relatedness • How to compute s(di, dj)? • Category Relatedness – Based on hierarchical structures of Wikipedia categories • depth(d): the length of the path from the root node to d • lcs(di, dj): the least common subsume of the two articles in the hierarchy Pg 15
  • 16. Measures of Contextual Relatedness • Category Overlap Score – Based on the ratio of common categories of two concepts – By Jaccard’s coefficient • Contents Similarity – Based on the cosine similarity between term vectors from the body texts Pg 16
  • 17. Measures of Contextual Relatedness • Co-occurrence Frequency – To represent the discourse relatedness obtained from Wikipedia – Assumption • The more frequently the mentions about two concepts co-occurred • The more similar aspects both concepts take in dialogue flows – By normalized point-wise mutual information Pg 17
  • 18. Measures of Contextual Relatedness • Geographical Closeness – Domain-specific Measure – Based on the geographic coordinate information of spatial concepts • Final Score Pg 18
  • 19. Measures of Contextual Relatedness Pg 19
  • 20. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 20
  • 21. Evaluation • Dataset – Dialogue Corpus on Singapore Tour Guide • Real human-human mixed initiative conversations • Between guides and tourists • Stats – 35 dialogue sessions – 21 hours – 19,651 utterances • Topics – 1,642 topic segments – 9 topic categories » Opening, Closing, Itinerary, Accommodation, Attraction, Food, Transportation, Shopping, Other – Wikipedia Collection • 3,115 articles related to Singapore • Collected from Wikipedia database dump as of Feb 2013 Pg 21
  • 22. Evaluation • Models – Training Instances • 8,318 instances for user-turn-level segmentation • 1,607 instances for dialogue-segment-level topic prediction – Support Vector Machine (SVM) Models • BOW: Baseline only with term vector space • WK0: Extended vector without transformation • WK1: s(di, dj) = s1(di, dj) • WK2: s(di, dj) = s1(di, dj) + s2(di, dj) • WK3: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) • WK4: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj) • WK5: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj) + s5(di, dj) • Metrics – Five-fold Cross-validation – Segmentation: P/R/F – Topic Prediction: Accuracy Pg 22
  • 23. Evaluation • Comparison of dialogue topic tracking performances Pg 23
  • 24. Evaluation • Distributions of errors on the cascaded results with WK5 – 71.4% of errors result from segmentation – 60.0% of errors occurred for system-initiative cases Pg 24
  • 25. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 25
  • 26. Conclusions • Summary – Wikipedia-based Kernel Method for Dialogue Topic Tracking – To incorporate various types of information from Wikipedia – Experimental results show the merits of our proposed approach in mixed-initiative dialogues • Ongoing Work – Using more various types of knowledge from Wikipedia – To be presented at ACL 2014 • A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia Pg 26
  • 27. Thank You Pg 27 Contact: kims@i2r.a-star.edu.sg
  • 28. References • B. Lin, H. Wang, and L. Lee, “A distributed architecture for cooperative spoken dialogue agents with coherent dialogue state and history,” in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1999. • S. Ikeda, K. Komatani, T. Ogata, H. G. Okuno, and H. G. Okuno, “Extensibility verification of robust domain selection against out-of-grammar utterances in multidomain spoken dialogue system.,” in Proceedings of the 9th Annual Conference of the International Speech Communicatiuon Association (INTERSPEECH), 2008, pp. 487–490. • A. Celikyilmaz, D. Hakkani-T¨ur, and G. T¨ur, “Approximate inference for domain detection in spoken language understanding.,” in Proceedings of the 12th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2011, pp. 713–716. • T. Nakata, S. Ando, and A. Okumura, “Topic detection based on dialogue history,” in Proceedings of the 19th international conference on Computational linguistics (COLING), 2002, pp. 1–7. • K. Lagus and J. Kuusisto, “Topic identification in natural language dialogues using neural networks,” in Proceedings of the 3rd SIGdial workshop on Discourse and dialogue, 2002, pp. 95–102. • P. H. Adams and C. H. Martell, “Topic detection and extraction in chat,” in Proceedings of the 2008 IEEE International Conference on Semantic Computing, 2008, pp. 581–588. • S. Roy and L. V. Subramaniam, “Automatic generation of domain models for call centers from noisy transcriptions,” in Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, 2006, pp. 737–744. • S. Young, J. Schatzmann, K. Weilhammer, and H. Ye, “The hidden information state approach to dialog management,” in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2007. • D. Bohus and A. Rudnicky, “Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda,” in Proceedings of the European Conference on Speech, Communication and Technology, 2003, pp. 597–600. • C. Lee, S. Jung, and G. G. Lee, “Robust dialog management with n-best hypotheses using dialog examples and agenda.,” in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2008, pp. 630–637. • G. Salton, A. Wong, and C.S. Yang, “A vector space model for automatic indexing,” Communications of the ACM, vol. 18, no. 11, pp. 613–620, 1975. • G.Wilcock, “Wikitalk: a spoken wikipedia-based opendomain knowledge access system,” in Proceedings of the Workshop on Question Answering for Complex Domains, 2012, p. 5770. • A. Breuing, U. Waltinger, and I. Wachsmuth, “Harvesting wikipedia knowledge to identify topics in ongoing natural language dialogs,” in Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011, pp. 445–450. • P. Wang and C. Domeniconi, “Building semantic kernels for text classification using wikipedia,” in Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, 2008, pp. 713–721. • Z. Wu and M. Palmer, “Verbs semantics and lexical selection,” in Proceedings of the 32nd annual meeting on Association for Computational Linguistics, 1994, pp.133–138. • C. C. Chang and C. J. Lin, “Libsvm: a library for support vector machines,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 2, no. 3, pp. 27, 2011.