SlideShare a Scribd company logo
1 of 17
Download to read offline
Overview	
  of	
  SIGHAN	
  2015	
  Bake-­‐off	
  
for	
  Chinese	
  Spelling	
  Check
Yuen-­‐Hsien	
  Tseng	
  (曾元顯),	
  NaGonal	
  Taiwan	
  Normal	
  Univ.	
  
Lung-­‐Hao	
  Lee	
  (李龍豪),	
  NaGonal	
  Taiwan	
  Normal	
  Univ.	
  
Li-­‐Ping	
  Chang	
  (張莉萍),	
  NaGonal	
  Taiwan	
  Normal	
  Univ.	
  
Hsin-­‐Hsi	
  Chen	
  (陳信希),	
  NaGonal	
  Taiwan	
  Univ.	
  
	
  
IntroducGon
•  Chinese	
  spelling	
  checkers	
  are	
  difficult	
  
– No	
  word	
  delimiters	
  exist	
  among	
  Chinese	
  words	
  
– A	
   Chinese	
   word	
   can	
   contain	
   only	
   a	
   single	
  
character	
  or	
  mulGple	
  characters	
  
– More	
  than	
  13	
  thousand	
  characters	
  	
  	
  	
  
•  The	
  spelling	
  checker	
  is	
  expected	
  to	
  idenGfy	
  all	
  
possible	
   spelling	
   errors,	
   highlight	
   their	
  
locaGons	
  and	
  suggest	
  possible	
  correcGons	
  	
  	
  	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 2
Chinese	
  Spelling	
  Check	
  EvaluaGons
•  The	
  1st	
  Chinese	
  Spelling	
  Check	
  Bake-­‐off	
  
– NaGve	
  Chinese	
  speakers	
  
– SIGHAN-­‐2013	
  workshop	
  @	
  Nagoya,	
  Japan	
  	
  
•  The	
  2nd	
  Chinese	
  Spelling	
  Check	
  Bake-­‐off	
  
– Chinese	
  as	
  a	
  foreign	
  language	
  learners	
  
– CIPS-­‐SIGHAN	
  joint	
  CLP-­‐2014	
  conference	
  @	
  Wuhan	
  
•  The	
  3rd	
  Chinese	
  Spelling	
  Check	
  Bake-­‐off	
  
– Chinese	
  as	
  a	
  foreign	
  language	
  learners	
  
– SIGHAN-­‐2015	
  workshop	
  @	
  Beijing,	
  China	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 3
Task	
  DescripGon
•  The	
  input	
  instance	
  is	
  given	
  a	
  unique	
  passage	
  number	
  PID	
  
•  Each	
  character	
  or	
  punctuaGon	
  mark	
  occupies	
  1	
  spot	
  for	
  
counGng	
  locaGon	
  
•  If	
   the	
   passage	
   contains	
   no	
   spelling	
   errors,	
   the	
   checker	
  
should	
  return	
  “PID,	
  0”	
  
•  If	
  an	
  input	
  passage	
  contains	
  at	
  least	
  one	
  spelling	
  error,	
  
the	
  output	
  format	
  is	
  “PID,	
  [,	
  locaGon,	
  correcGon]+”
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 4
TesGng	
  Examples	
  
•  Example	
  1	
  
– Input:	
  (pid=A2-­‐0047-­‐1)	
  我真的洗碗我可以去看你	
  
– Output:	
  A2-­‐0047-­‐1,	
  4,	
  希,	
  5,	
  望	
  
•  Example	
  2	
  
– Input:	
  (pid=B2-­‐1670-­‐2)	
  在日本,大學生打工的情
況是相當普偏的。	
  
– Output:	
  B2-­‐1670-­‐2,	
  17,	
  遍	
  
•  Example	
  3	
  
– Input:	
  (pid=B2-­‐1903-­‐7)	
  我也是你的朋友,我會永
遠在你身邊。	
  
– Output:	
  B2-­‐1903-­‐7,	
  0	
  	
  	
  	
  CORRECT	
  	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 5
Data	
  PreparaGon
•  The	
  essay	
  secGon	
  of	
  the	
  computer-­‐based	
  Test	
  
of	
  Chinese	
  as	
  a	
  Foreign	
  Language	
  (TOCFL)	
  
•  The	
   spelling	
   errors	
   were	
   manually	
   annotated	
  
by	
  trained	
  naGve	
  Chinese	
  speakers,	
  who	
  also	
  
provided	
   correcGons	
   corresponding	
   to	
   each	
  
error.	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 6
Training	
  Set
•  This	
  set	
  included	
  970	
  
selected	
   essays	
   with	
  
a	
   total	
   of	
   3,143	
  
spelling	
  errors.	
  
•  Each	
   essay	
   is	
   shown	
  
in	
   terms	
   of	
   SGML	
  
format	
  	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 7
Dryrun	
  Set
•  A	
   total	
   of	
   39	
   passages	
   were	
   given	
   to	
  
parGcipants	
  to	
  familiarize	
  themselves	
  with	
  the	
  
final	
  tesGng	
  process.	
  	
  
•  The	
   purpose	
   is	
   to	
   validate	
   the	
   submiked	
  
output	
  format	
  only,	
  and	
  no	
  dryrun	
  outcomes	
  
were	
  considered	
  in	
  the	
  official	
  evaluaGon
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 8
Test	
  Set
•  This	
  set	
  consists	
  of	
  1,100	
  tesGng	
  passages.	
  Half	
  
of	
  these	
  passages	
  contained	
  no	
  spelling	
  errors,	
  
while	
   the	
   other	
   half	
   included	
   at	
   least	
   one	
  
spelling	
  error	
  
•  Open	
  test	
  policy:	
  employing	
  any	
  linguisGc	
  and	
  
computaGonal	
  resources	
  to	
  detect	
  and	
  correct	
  
spelling	
  errors	
  are	
  allowed.	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 9
Performance	
  Metrics
•  Correctness	
  is	
  determined	
  at	
  two	
  levels	
  
– DetecGon-­‐level	
  	
  
– CorrecGon-­‐level	
  
	
  
•  Metrics	
  
– False	
  posiGve	
  rate	
  (FPR)	
  =	
  FP	
  /	
  (FP+TP)	
  
– Accuracy	
  =	
  (TP+TN)	
  /	
  (TP+FP+TN+FN)	
  
– Precision	
  =	
  TP	
  /	
  (TP+FP)	
  
– Recall	
  =	
  TP	
  /	
  (TP+FN)	
  
– F1	
  =	
  2	
  *	
  Precision	
  *	
  Recall	
  /	
  (Precision+Recall)	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 10
EvaluaGon	
  Examples	
  
•  System	
  Results:	
  “A2-­‐0092-­‐2,	
  5,	
  玩”,	
  “A2-­‐0243-­‐	
  1,	
  3,	
  件,	
  4,	
  
康”,	
  “B2-­‐1923-­‐2,	
  8,	
  誤,	
  41,	
  情”,	
  “B2-­‐	
  2731-­‐1,	
  0”,	
  and	
  
“B2-­‐3754-­‐3,	
  11,	
  觀”	
  	
  
•  Gold	
  Standard:	
  “A2-­‐0092-­‐2,	
  0”,	
  “A2-­‐0243-­‐1,	
  3,	
  健,	
  4,	
  康”,	
  
“B2-­‐1923-­‐2,	
  8,	
  誤,	
  41,	
  情”,	
  “B2-­‐2731-­‐1,	
  0”,	
  and	
  
“B2-­‐3754-­‐3,	
  10,	
  觀”,	
  	
  
•  FPR	
  =	
  0.5	
  
•  DetecGon-­‐level	
  	
  Acc.	
  =	
  0.6,	
  Pre.=0.5,	
  Rec.=0.67,	
  	
  F1=0.57	
  
•  CorrecGon-­‐level	
  Acc.	
  =	
  0.4,	
  Pre.=0.25,	
  Rec.=0.33,	
  	
  F1=0.28	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 11
9	
  ParGcipants	
  &	
  15	
  Runs
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 12
TesGng	
  Results
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 13
A	
  Summary	
  of	
  Developed	
  Systems
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 14
Conclusions	
  and	
  Future	
  Work	
  
•  All	
   submissions	
   contribute	
   to	
   the	
   knowledge	
   in	
  
search	
  for	
  an	
  effecGve	
  Chinese	
  spell	
  checkers	
  
•  The	
   individual	
   reports	
   in	
   the	
   Bake-­‐off	
  
proceedings	
   provide	
   useful	
   insight	
   into	
   Chinese	
  
language	
  processing	
  	
  
	
  
•  The	
  future	
  direcGon	
  focuses	
  on	
  the	
  development	
  
of	
  Chinese	
  grammaGcal	
  error	
  correcGon	
  	
  
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 15
Acknowledgments
•  NaGonal	
  Taiwan	
  Normal	
  University	
  
•  Ministry	
  of	
  EducaGon,	
  Taiwan	
  
– Aim	
  for	
  the	
  Top	
  University	
  Project	
  
– Center	
  of	
  Learning	
  Technology	
  for	
  Chinese	
  
•  Ministry	
  of	
  Science	
  and	
  Technology,	
  Taiwan	
  
– InternaGonal	
  Research-­‐Intensive	
  Center	
  of	
  
Excellence	
  Program	
  
– Grant	
  no.:	
  MOST	
  104-­‐2911-­‐I-­‐003-­‐301
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 16
THANK	
  YOU
•  All	
   data	
   sets	
   with	
   gold	
   standards	
   and	
  
evaluaGon	
   tool	
   are	
   publicly	
   available	
   for	
  
research	
  purposes	
  at	
  
	
  	
  	
  
	
  hkp://ir.itc.ntnu.edu.tw/lre/sighan8csc.html
SIGHAN	
  2015	
  @	
  Beijing,	
  China
 17

More Related Content

Similar to Yuen-Hsien Tseng - 2015 - Overview of SIGHAN 2015 Bake-off for Chinese Spelling Check

Python vsr final r
Python vsr   final rPython vsr   final r
Python vsr final rChris Lukens
 
2 anton muzhailo - formal test process improvement. how to invest to the te...
2   anton muzhailo - formal test process improvement. how to invest to the te...2   anton muzhailo - formal test process improvement. how to invest to the te...
2 anton muzhailo - formal test process improvement. how to invest to the te...Ievgenii Katsan
 
MT and Post Editing in master's level translation education
MT and Post Editing in master's level translation education MT and Post Editing in master's level translation education
MT and Post Editing in master's level translation education Jakub Absolon
 
Seq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese LanguageSeq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese LanguageJinho Choi
 
Seq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese LanguageSeq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese LanguageJinho Choi
 
Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...
Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...
Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...CIEE
 
qep-first-year-impact-report
qep-first-year-impact-reportqep-first-year-impact-report
qep-first-year-impact-reportKerri A. Mercer
 
E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020Colleen Hodgins
 
Thesis presentation on Computer-Assisted Language Testing in Iran
Thesis presentation on Computer-Assisted Language Testing in IranThesis presentation on Computer-Assisted Language Testing in Iran
Thesis presentation on Computer-Assisted Language Testing in IranSedighehHajizadeh
 
OBE-Implementation Plan 17-8-21.pptx
OBE-Implementation Plan 17-8-21.pptxOBE-Implementation Plan 17-8-21.pptx
OBE-Implementation Plan 17-8-21.pptxsyedmoazzamnizami
 
pansupa lewis translator interpreter cv
pansupa lewis translator interpreter cvpansupa lewis translator interpreter cv
pansupa lewis translator interpreter cvPansupa (Pixie) Lewis
 
LKRU14 and LKAPAC: Five Numbers
LKRU14 and LKAPAC: Five NumbersLKRU14 and LKAPAC: Five Numbers
LKRU14 and LKAPAC: Five Numbersazheglov
 
What Is CELPIP_.pptx
What Is CELPIP_.pptxWhat Is CELPIP_.pptx
What Is CELPIP_.pptxCoursementor2
 
Using the test process improvement models. Case study based on TPI Next model...
Using the test process improvement models. Case study based on TPI Next model...Using the test process improvement models. Case study based on TPI Next model...
Using the test process improvement models. Case study based on TPI Next model...Sigma Software
 
CELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARHCELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARHJjaspreetsingh
 
CELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARHCELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARHJjaspreetsingh
 
Agile vs. DevOps for Continuous Testing: How to Optimize Your Pipeline
Agile vs. DevOps for Continuous Testing: How to Optimize Your PipelineAgile vs. DevOps for Continuous Testing: How to Optimize Your Pipeline
Agile vs. DevOps for Continuous Testing: How to Optimize Your PipelinePerfecto by Perforce
 
Whose test is it anyway - LW&CN v0.3
Whose test is it anyway - LW&CN v0.3Whose test is it anyway - LW&CN v0.3
Whose test is it anyway - LW&CN v0.3Clinton Nobrega AMBCS
 
E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020Colleen Hodgins
 

Similar to Yuen-Hsien Tseng - 2015 - Overview of SIGHAN 2015 Bake-off for Chinese Spelling Check (20)

Python vsr final r
Python vsr   final rPython vsr   final r
Python vsr final r
 
2 anton muzhailo - formal test process improvement. how to invest to the te...
2   anton muzhailo - formal test process improvement. how to invest to the te...2   anton muzhailo - formal test process improvement. how to invest to the te...
2 anton muzhailo - formal test process improvement. how to invest to the te...
 
MT and Post Editing in master's level translation education
MT and Post Editing in master's level translation education MT and Post Editing in master's level translation education
MT and Post Editing in master's level translation education
 
Seq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese LanguageSeq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese Language
 
Seq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese LanguageSeq2seq Model to Tokenize the Chinese Language
Seq2seq Model to Tokenize the Chinese Language
 
Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...
Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...
Measuring Linguistic Gains in Immersion Settings: Empirical Studies from Fren...
 
qep-first-year-impact-report
qep-first-year-impact-reportqep-first-year-impact-report
qep-first-year-impact-report
 
E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020
 
Thesis presentation on Computer-Assisted Language Testing in Iran
Thesis presentation on Computer-Assisted Language Testing in IranThesis presentation on Computer-Assisted Language Testing in Iran
Thesis presentation on Computer-Assisted Language Testing in Iran
 
OBE-Implementation Plan 17-8-21.pptx
OBE-Implementation Plan 17-8-21.pptxOBE-Implementation Plan 17-8-21.pptx
OBE-Implementation Plan 17-8-21.pptx
 
pansupa lewis translator interpreter cv
pansupa lewis translator interpreter cvpansupa lewis translator interpreter cv
pansupa lewis translator interpreter cv
 
LKRU14 and LKAPAC: Five Numbers
LKRU14 and LKAPAC: Five NumbersLKRU14 and LKAPAC: Five Numbers
LKRU14 and LKAPAC: Five Numbers
 
What Is CELPIP_.pptx
What Is CELPIP_.pptxWhat Is CELPIP_.pptx
What Is CELPIP_.pptx
 
Using the test process improvement models. Case study based on TPI Next model...
Using the test process improvement models. Case study based on TPI Next model...Using the test process improvement models. Case study based on TPI Next model...
Using the test process improvement models. Case study based on TPI Next model...
 
CELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARHCELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARH
 
CELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARHCELPIP COACHING IN CHANDIGARH
CELPIP COACHING IN CHANDIGARH
 
Agile vs. DevOps for Continuous Testing: How to Optimize Your Pipeline
Agile vs. DevOps for Continuous Testing: How to Optimize Your PipelineAgile vs. DevOps for Continuous Testing: How to Optimize Your Pipeline
Agile vs. DevOps for Continuous Testing: How to Optimize Your Pipeline
 
Basic informatiion for pharmacy faculty members in India
Basic informatiion for pharmacy faculty members in IndiaBasic informatiion for pharmacy faculty members in India
Basic informatiion for pharmacy faculty members in India
 
Whose test is it anyway - LW&CN v0.3
Whose test is it anyway - LW&CN v0.3Whose test is it anyway - LW&CN v0.3
Whose test is it anyway - LW&CN v0.3
 
E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020E portfolio pedagogy pharmacy2021-september2020
E portfolio pedagogy pharmacy2021-september2020
 

More from Association for Computational Linguistics

Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Association for Computational Linguistics
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Association for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Association for Computational Linguistics
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Association for Computational Linguistics
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 

More from Association for Computational Linguistics (20)

Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal TextMuis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
 
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
 
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour AnalysisCastro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text SimplificationElior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 

Recently uploaded

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 

Recently uploaded (20)

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 

Yuen-Hsien Tseng - 2015 - Overview of SIGHAN 2015 Bake-off for Chinese Spelling Check

  • 1. Overview  of  SIGHAN  2015  Bake-­‐off   for  Chinese  Spelling  Check Yuen-­‐Hsien  Tseng  (曾元顯),  NaGonal  Taiwan  Normal  Univ.   Lung-­‐Hao  Lee  (李龍豪),  NaGonal  Taiwan  Normal  Univ.   Li-­‐Ping  Chang  (張莉萍),  NaGonal  Taiwan  Normal  Univ.   Hsin-­‐Hsi  Chen  (陳信希),  NaGonal  Taiwan  Univ.    
  • 2. IntroducGon •  Chinese  spelling  checkers  are  difficult   – No  word  delimiters  exist  among  Chinese  words   – A   Chinese   word   can   contain   only   a   single   character  or  mulGple  characters   – More  than  13  thousand  characters         •  The  spelling  checker  is  expected  to  idenGfy  all   possible   spelling   errors,   highlight   their   locaGons  and  suggest  possible  correcGons         SIGHAN  2015  @  Beijing,  China 2
  • 3. Chinese  Spelling  Check  EvaluaGons •  The  1st  Chinese  Spelling  Check  Bake-­‐off   – NaGve  Chinese  speakers   – SIGHAN-­‐2013  workshop  @  Nagoya,  Japan     •  The  2nd  Chinese  Spelling  Check  Bake-­‐off   – Chinese  as  a  foreign  language  learners   – CIPS-­‐SIGHAN  joint  CLP-­‐2014  conference  @  Wuhan   •  The  3rd  Chinese  Spelling  Check  Bake-­‐off   – Chinese  as  a  foreign  language  learners   – SIGHAN-­‐2015  workshop  @  Beijing,  China   SIGHAN  2015  @  Beijing,  China 3
  • 4. Task  DescripGon •  The  input  instance  is  given  a  unique  passage  number  PID   •  Each  character  or  punctuaGon  mark  occupies  1  spot  for   counGng  locaGon   •  If   the   passage   contains   no   spelling   errors,   the   checker   should  return  “PID,  0”   •  If  an  input  passage  contains  at  least  one  spelling  error,   the  output  format  is  “PID,  [,  locaGon,  correcGon]+” SIGHAN  2015  @  Beijing,  China 4
  • 5. TesGng  Examples   •  Example  1   – Input:  (pid=A2-­‐0047-­‐1)  我真的洗碗我可以去看你   – Output:  A2-­‐0047-­‐1,  4,  希,  5,  望   •  Example  2   – Input:  (pid=B2-­‐1670-­‐2)  在日本,大學生打工的情 況是相當普偏的。   – Output:  B2-­‐1670-­‐2,  17,  遍   •  Example  3   – Input:  (pid=B2-­‐1903-­‐7)  我也是你的朋友,我會永 遠在你身邊。   – Output:  B2-­‐1903-­‐7,  0        CORRECT     SIGHAN  2015  @  Beijing,  China 5
  • 6. Data  PreparaGon •  The  essay  secGon  of  the  computer-­‐based  Test   of  Chinese  as  a  Foreign  Language  (TOCFL)   •  The   spelling   errors   were   manually   annotated   by  trained  naGve  Chinese  speakers,  who  also   provided   correcGons   corresponding   to   each   error.   SIGHAN  2015  @  Beijing,  China 6
  • 7. Training  Set •  This  set  included  970   selected   essays   with   a   total   of   3,143   spelling  errors.   •  Each   essay   is   shown   in   terms   of   SGML   format     SIGHAN  2015  @  Beijing,  China 7
  • 8. Dryrun  Set •  A   total   of   39   passages   were   given   to   parGcipants  to  familiarize  themselves  with  the   final  tesGng  process.     •  The   purpose   is   to   validate   the   submiked   output  format  only,  and  no  dryrun  outcomes   were  considered  in  the  official  evaluaGon SIGHAN  2015  @  Beijing,  China 8
  • 9. Test  Set •  This  set  consists  of  1,100  tesGng  passages.  Half   of  these  passages  contained  no  spelling  errors,   while   the   other   half   included   at   least   one   spelling  error   •  Open  test  policy:  employing  any  linguisGc  and   computaGonal  resources  to  detect  and  correct   spelling  errors  are  allowed.   SIGHAN  2015  @  Beijing,  China 9
  • 10. Performance  Metrics •  Correctness  is  determined  at  two  levels   – DetecGon-­‐level     – CorrecGon-­‐level     •  Metrics   – False  posiGve  rate  (FPR)  =  FP  /  (FP+TP)   – Accuracy  =  (TP+TN)  /  (TP+FP+TN+FN)   – Precision  =  TP  /  (TP+FP)   – Recall  =  TP  /  (TP+FN)   – F1  =  2  *  Precision  *  Recall  /  (Precision+Recall)   SIGHAN  2015  @  Beijing,  China 10
  • 11. EvaluaGon  Examples   •  System  Results:  “A2-­‐0092-­‐2,  5,  玩”,  “A2-­‐0243-­‐  1,  3,  件,  4,   康”,  “B2-­‐1923-­‐2,  8,  誤,  41,  情”,  “B2-­‐  2731-­‐1,  0”,  and   “B2-­‐3754-­‐3,  11,  觀”     •  Gold  Standard:  “A2-­‐0092-­‐2,  0”,  “A2-­‐0243-­‐1,  3,  健,  4,  康”,   “B2-­‐1923-­‐2,  8,  誤,  41,  情”,  “B2-­‐2731-­‐1,  0”,  and   “B2-­‐3754-­‐3,  10,  觀”,     •  FPR  =  0.5   •  DetecGon-­‐level    Acc.  =  0.6,  Pre.=0.5,  Rec.=0.67,    F1=0.57   •  CorrecGon-­‐level  Acc.  =  0.4,  Pre.=0.25,  Rec.=0.33,    F1=0.28   SIGHAN  2015  @  Beijing,  China 11
  • 12. 9  ParGcipants  &  15  Runs SIGHAN  2015  @  Beijing,  China 12
  • 13. TesGng  Results SIGHAN  2015  @  Beijing,  China 13
  • 14. A  Summary  of  Developed  Systems SIGHAN  2015  @  Beijing,  China 14
  • 15. Conclusions  and  Future  Work   •  All   submissions   contribute   to   the   knowledge   in   search  for  an  effecGve  Chinese  spell  checkers   •  The   individual   reports   in   the   Bake-­‐off   proceedings   provide   useful   insight   into   Chinese   language  processing       •  The  future  direcGon  focuses  on  the  development   of  Chinese  grammaGcal  error  correcGon     SIGHAN  2015  @  Beijing,  China 15
  • 16. Acknowledgments •  NaGonal  Taiwan  Normal  University   •  Ministry  of  EducaGon,  Taiwan   – Aim  for  the  Top  University  Project   – Center  of  Learning  Technology  for  Chinese   •  Ministry  of  Science  and  Technology,  Taiwan   – InternaGonal  Research-­‐Intensive  Center  of   Excellence  Program   – Grant  no.:  MOST  104-­‐2911-­‐I-­‐003-­‐301 SIGHAN  2015  @  Beijing,  China 16
  • 17. THANK  YOU •  All   data   sets   with   gold   standards   and   evaluaGon   tool   are   publicly   available   for   research  purposes  at          hkp://ir.itc.ntnu.edu.tw/lre/sighan8csc.html SIGHAN  2015  @  Beijing,  China 17