NTCIR-12 Pilot Task:
Short Text Conversation (STC)
Lifeng Shang, Zhengdong Lu, Hang Li (Huawei Noah’s Ark Lab, Hong Kong)
Tetsuya Sakai (Waseda University, Japan)
http://ntcir12.noahlab.com.hk/stc.htm
Twitter: @ntcirstc
February 27, 2015@NTCIR-12 Kickoff
Call for Task Participation
Microblogs: Twitter, Weibo... Over 40 million users
What is STC? (1)
POST: “Dr. Hang Li’s Learning to
Rank for IR and NLP second
edition released! Follow!”
What is STC? (2)
POST: “Dr. Hang Li’s Learning to
Rank for IR and NLP second
edition released! Follow!”
COMMENT by Hang Li: “Thanks
ZhiYuan! I’ve added detailed
explanations of the
LambdaMART algorithms etc.”
Coherent AND useful
What is STC? (3)
POST: “How’s the hair?”
What is STC? (4)
POST: “How’s the hair?”
COMMENT by Tetsuya Sakai:
“I don’t have any.”
Coherent but NOT useful
Objectives
• The ultimate objective
Build an open-domain system that can interact
naturally with humans
• The objective for NTCIR-12/13
Build an IR system that effectively
reuses past comments to respond to a post.
Coherence: the post-comment pair makes sense as a consecutive short
text exchange between two people.
Usefulness: the comment contains information or an opinion that might be
useful to the author of the post.
STC research questions
[Figure: a post-comment repository holding past posts, each with its comments, which is searched and reused to respond to a new post]
Given a new post, can a coherent and
useful comment be returned by
searching a post-comment repository?
What are the challenges and
limitations of this IR-based STC
approach? [Ji14]
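The search-and-reuse idea can be sketched as a toy baseline: match the new post against the posts in the repository and return the comments attached to the best matches. This is only an illustration under stated assumptions, not the method of [Ji14]. The function names, the whitespace tokeniser, and the Jaccard similarity are all hypothetical choices; Chinese text would need a proper word segmenter.

```python
from typing import List, Set, Tuple


def tokens(text: str) -> Set[str]:
    """Lower-case whitespace tokenisation (an assumption; unsuitable for Chinese)."""
    return set(text.lower().split())


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Token-set overlap between two texts, in [0, 1]."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def retrieve_comments(new_post: str,
                      repository: List[Tuple[str, str]],
                      k: int = 3) -> List[str]:
    """Return up to k past comments whose original posts best match new_post."""
    query = tokens(new_post)
    scored = [(jaccard(query, tokens(post)), comment)
              for post, comment in repository]
    # Stable sort: ties keep their repository order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Drop comments whose posts share no token with the new post.
    return [comment for score, comment in scored[:k] if score > 0]


# Hypothetical repository of (post, comment) pairs.
repository = [
    ("How is the weather today", "Sunny and warm"),
    ("New book on learning to rank released", "Congratulations, ordering a copy"),
    ("Anyone watching the game tonight", "Yes, go team"),
]
top = retrieve_comments("learning to rank book", repository, k=1)
```

Matching on the post alone ignores whether the comment itself fits the new post, which is one of the limitations the second research question asks about.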
Data and language scope
We also provide English machine
translations of the Chinese posts
and comments.
Number of test posts will be determined using topic set size design
[Sakai14CIKM,Sakai14EVIA]
Task design and evaluation measures
• Ad hoc IR design: given a “new” post, retrieve coherent and useful
comments from repository.
• Pooling and graded relevance assessments
L2: coherent and useful
L1: coherent but not useful
L0: not coherent (and therefore not useful either)
• Evaluation measures (basically one good comment is enough):
G@1 (normalised gain at rank 1)
ERR (expected reciprocal rank)
P+ (Similar to Q-measure, suitable for navigational intents)
[Sakai14PROMISE]
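As a rough sketch of how the three measures behave, the code below scores one ranked list of comments whose relevance levels are given as 0, 1 or 2 (L0, L1, L2). Several details are assumptions rather than task settings: the gain values are linear (L1 = 1, L2 = 2), ERR uses the common stop probability (2^level - 1) / 2^max_level, and P+ uses the blended-ratio form with beta = 1. The function names are illustrative.

```python
from typing import List

# Assumed linear gain values per relevance level (not stated on the slide).
GAIN = {0: 0.0, 1: 1.0, 2: 2.0}
MAX_LEVEL = 2


def ng_at_1(run: List[int], judged: List[int]) -> float:
    """Gain of the top-ranked comment divided by the best gain available."""
    best = max((GAIN[level] for level in judged), default=0.0)
    if not run or best == 0.0:
        return 0.0
    return GAIN[run[0]] / best


def err(run: List[int]) -> float:
    """Expected reciprocal rank under a cascade user model."""
    score, p_continue = 0.0, 1.0
    for rank, level in enumerate(run, start=1):
        p_stop = (2 ** level - 1) / 2 ** MAX_LEVEL
        score += p_continue * p_stop / rank
        p_continue *= 1.0 - p_stop
    return score


def p_plus(run: List[int], judged: List[int]) -> float:
    """P+: blended ratio averaged over relevant ranks down to the preferred rank."""
    best_in_run = max(run, default=0)
    if best_in_run == 0:
        return 0.0  # no coherent comment retrieved
    # Preferred rank: the highest-level comment in the run nearest to the top.
    r_p = run.index(best_in_run) + 1
    ideal = sorted((GAIN[level] for level in judged), reverse=True)
    count, cg, cg_ideal, total = 0, 0.0, 0.0, 0.0
    for rank in range(1, r_p + 1):
        level = run[rank - 1]
        cg += GAIN[level]
        if rank <= len(ideal):
            cg_ideal += ideal[rank - 1]
        if level > 0:
            count += 1
            total += (count + cg) / (rank + cg_ideal)
    return total / count


# Hypothetical example: levels of the returned comments, and of all judged comments.
run = [1, 2, 0]
judged = [2, 2, 1, 0, 0]
scores = (ng_at_1(run, judged), err(run), p_plus(run, judged))
```

With these assumed settings the example run scores 0.5 for G@1, 0.53125 for ERR and 0.75 for P+; all three reach their maximum as soon as an L2 comment is placed at rank 1, which matches the "one good comment is enough" view.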
Plans for STC-2@NTCIR-13
• Follow the INTENT-2 “revived run” model [Sakai13INTENT]
• STC-1 participants will keep their systems in the fridge
• When they come back at STC-2, they use both STC-1 and STC-2
systems to handle the STC-2 posts
• Compare STC-1 and STC-2 systems on the STC-2 test collection
[Figure: STC-1 systems produce STC-1 runs on the STC-1 new posts; on the STC-2 new posts, the STC-1 systems produce revived runs and the STC-2 systems produce STC-2 new runs]
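One simple way to picture the comparison on the STC-2 test collection is a per-post difference between a new run and the revived run of the same team. The sketch below assumes each run has already been scored per test post with some evaluation measure; the function name and the post IDs are hypothetical, and a real comparison would also call for a significance test.

```python
from statistics import mean
from typing import Dict


def mean_improvement(revived: Dict[str, float], new: Dict[str, float]) -> float:
    """Mean per-post score difference (new run minus revived run) over shared posts."""
    shared = sorted(set(revived) & set(new))
    if not shared:
        raise ValueError("the two runs share no test posts")
    return mean(new[post] - revived[post] for post in shared)


# Hypothetical per-post scores on the STC-2 test posts.
revived_run = {"post-1": 0.5, "post-2": 0.0, "post-3": 1.0}
new_run = {"post-1": 1.0, "post-2": 0.5, "post-3": 1.0}
delta = mean_improvement(revived_run, new_run)
```

A positive value means the STC-2 system did better on average than the revived STC-1 system on the same posts.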
Schedule
Feb 27, 2015 NTCIR-12 kickoff
Oct 31, 2015 NTCIR-12 task registration deadline
Nov 2, 2015 STC test topics released
Nov 30, 2015 STC run submission deadline
Dec 2015-Jan 2016 STC relevance assessments + evaluation
Feb 1, 2016 STC results sent to participants + STC draft overview released
Mar 1, 2016 NTCIR-12 participants’ draft papers due / Task organisers’ feedback
May 1, 2016 NTCIR-12 all camera ready papers due
Jun 7-10, 2016 NTCIR-12 conference
We will give you training data as soon as you register!
The sooner the better!
Join us!
http://ntcir12.noahlab.com.hk/stc.htm
Twitter: @ntcirstc
Prospective participants and budget
Huawei will cover the relevance assessment cost.
No seed funding from NTCIR required.
Related tasks
• TREC Microblog (2011-) [Lin13]
Data: Twitter, NOT distributed to participants
Tweets2011: only IDs distributed, data downloaded individually
Tweets2013: Evaluation as a Service (access through APIs)
Ad hoc search etc. Evaluation based on binary relevance
• NTCIR Community Question Answering (2010) [Ishikawa10]
Data: Japanese Yahoo! Answers (Chiebukuro)
Given a Q and its responses, rank the responses (which is the best
answer?). Evaluation using G@1 etc.
References
[Ishikawa10] Ishikawa, D., Sakai, T. and Kando, N.: Overview of the NTCIR-8 Community QA Pilot Task
(Part I): The Test Collection and the Task, Proceedings of NTCIR-8, pp.421-432, 2010.
[Ji14] Ji, Z., Lu, Z. and Li, H.: An Information Retrieval Approach to Short Text Conversation, 2014.
http://arxiv.org/abs/1408.6988
[Lin13] Lin, J. and Efron, M.: Overview of the TREC-2013 Microblog Track, Proceedings of TREC 2013,
2013.
[Sakai13INTENT] Sakai, T. et al.: Overview of the NTCIR-10 INTENT-2 Task, Proceedings of NTCIR-10,
pp.94-123, 2013.
[Sakai14CIKM] Sakai, T.: Designing Test Collections for Comparing Many Systems, Proceedings of
ACM CIKM 2014, pp.61-70, 2014.
http://www.f.waseda.jp/tetsuya/CIKM2014/ir0030-sakai.pdf
[Sakai14EVIA] Sakai, T.: Topic Set Size Design with Variance Estimates from Two-Way ANOVA,
Proceedings of EVIA 2014, pp.1-8, 2014.
[Sakai14PROMISE] Sakai, T.: Metrics, Statistics, Tests, PROMISE Winter School 2013: Bridging
between Information Retrieval and Databases (LNCS 8173), 2014.
http://research.microsoft.com/en-us/people/tesakai/metrics.pdf
