SlideShare a Scribd company logo
NAIST	
  at	
  the	
  HOO	
  2012	
  Shared	
  Task	
  
Keisuke	
  Sakaguchi,	
  Yuta	
  Hayashibe,	
  Shuhei	
  Kondo,	
  Lis	
  Kanashiro,	
  Tomoya	
  Mizumoto,	
  Mamoru	
  Komachi,	
  Yuji	
  Matsumoto	
  
ComputaFonal	
  LinguisFcs	
  Lab.	
  Graduate	
  School	
  of	
  InformaFon	
  Science,	
  Nara	
  InsFtute	
  of	
  Science	
  and	
  Technology	
  (NAIST),	
  Japan	
  	
  	
Configurations of the system!
Input →                                → Output !Spelling correction!
Determiner correction!
Preposition correction!
System Architecture for Spelling Error Correction!
System Architecture for Preposition Error Correction!
System Architecture for Determiner Error Correction!
Experiment and Result!
  Spelling correction for unknown words !
  Open source spelling checker: GNU Aspell!
  Ranked by Google Web 1T 5-gram language model!
  Preliminary experiment: 52.4, 72.2, and 60.7% (Precision, Recall, F-score) 	
  12 target prepositions (Chodorow et al., 2010): !
of, in, for, to, by, at, on, from, as, about (covering 91%) !
  Replacement and insertion errors: a single model for detection and correction!
  Deletion errors: focus whether direct objects of verbs need prepositions !
  Syntactic & semantic features described in (Tetreault et al., 2010)!
  Classifier: Maximum Entropy modeling!
  Trained on 2 types of corpus (FCE): !
  “Gold” (Corrected except for Prep errors)!
  “Original” (FCE plain texts)!
  3 target determiners: a, an, the (“an” was normalized to “a” in training & test)!
  check determiners of the left boundary of a noun phrase!
  2 parser models: “Normal” (trained on the normal treebank) vs. !
“mixed” (trained on the treebank & its modified version in which articles at the
left boundary of NPs were removed.)!
  Feature vector representation for each NP using syntax-based feature
templates inspired by (De Felice, 2008)!
  Classifier: Passive aggressive algorithm!
  Training corpus: the CLC FCE dataset and the BNC data with feature
augmentation approach of (Daumé III, 2007)!
References!
Summary!
Future Work!
  Correcting spelling errors of existing words (e.g. *the  then)!
  Getting rich knowledge about verbs from VerbNet and FrameNet!
  Adding target determiners (this, my, etc.)!
 Martin Chodorow, Michael Gamon, and Joel Tetreault. 2010. The Utility of Article and Preposition Error Correction Systems for English
Language Learners: Feedback and Assessment. Language Testing, 27(3):419– 436.!
 Joel Tetreault, Jennifer Foster, and Martin Chodorow. 2010. Using Parse Features for Preposition Selection and Error Detection. In Proceedings
of the 47th Annual Meeting of the Association for Computational Linguistics Short Papers, pages 353–358, Uppsala, Sweden.!
 Rachele De Felice. 2008. Automatic Error Detection in Non-native English. Ph.D. thesis University of Oxford. !
 Hal Daumé III. 2007. Frustratingly Easy Domain Adaptation. In Proceedings of the 45th Annual Meeting of the Association of Computational
Linguistics, pages 256–263, Prague, Czech Republic. !
http://cl.naist.jp/en/ {keisuke-sa, yuta-h, shuhei-k, lis-k, tomoya-m, komachi, matsu}@is.naist.jp 	
  Combined after revision!
 Spelling: Spelling correction improved preposition error correction!
 Prepositions: Performed better when trained with the “original” set!
 Determiners: The "mixed" model improved performance!
  8 different configurations (Runs) 
  Preposition!
  Determiner!
*We	
  re-­‐evaluated	
  the	
  Run2	
  because	
  we	
  submiTed	
  the	
  Run2	
  with	
  the	
  same	
  condiFon	
  as	
  Run0.

More Related Content

What's hot

University-Toronto-Program-The-FundamentalsV2
University-Toronto-Program-The-FundamentalsV2University-Toronto-Program-The-FundamentalsV2
University-Toronto-Program-The-FundamentalsV2Majid Hameed
 
Word sense disambiguation a survey
Word sense disambiguation  a surveyWord sense disambiguation  a survey
Word sense disambiguation a survey
ijctcm
 
Report
ReportReport
Reportbutest
 
A survey on phrase structure learning methods for text classification
A survey on phrase structure learning methods for text classificationA survey on phrase structure learning methods for text classification
A survey on phrase structure learning methods for text classification
ijnlc
 
Learning to Extract Relations for Protein Annotation
Learning to Extract Relations for Protein AnnotationLearning to Extract Relations for Protein Annotation
Learning to Extract Relations for Protein Annotation
Jee-Hyub Kim
 
Interactive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector EmbeddingsInteractive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector Embeddings
gleicher
 
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
ijnlc
 
Taking into account communities of practice’s specific vocabularies in inform...
Taking into account communities of practice’s specific vocabularies in inform...Taking into account communities of practice’s specific vocabularies in inform...
Taking into account communities of practice’s specific vocabularies in inform...
inscit2006
 
Statistical and Empirical Approaches to Spoken Dialog Systems
Statistical and Empirical Approaches to Spoken Dialog SystemsStatistical and Empirical Approaches to Spoken Dialog Systems
Statistical and Empirical Approaches to Spoken Dialog Systemsbutest
 
Assistive Technology
Assistive TechnologyAssistive Technology
Assistive Technology
jpuglia
 
Software engineering ontology and software testing
Software engineering ontology and software testingSoftware engineering ontology and software testing
Software engineering ontology and software testing
Khushbu Patel
 
Coordinated NPs agree with singular or plural in there-constructions?: A comp...
Coordinated NPs agree with singular or plural in there-constructions?: A comp...Coordinated NPs agree with singular or plural in there-constructions?: A comp...
Coordinated NPs agree with singular or plural in there-constructions?: A comp...
Yu Tamura
 

What's hot (13)

University-Toronto-Program-The-FundamentalsV2
University-Toronto-Program-The-FundamentalsV2University-Toronto-Program-The-FundamentalsV2
University-Toronto-Program-The-FundamentalsV2
 
Word sense disambiguation a survey
Word sense disambiguation  a surveyWord sense disambiguation  a survey
Word sense disambiguation a survey
 
Report
ReportReport
Report
 
A survey on phrase structure learning methods for text classification
A survey on phrase structure learning methods for text classificationA survey on phrase structure learning methods for text classification
A survey on phrase structure learning methods for text classification
 
Learning to Extract Relations for Protein Annotation
Learning to Extract Relations for Protein AnnotationLearning to Extract Relations for Protein Annotation
Learning to Extract Relations for Protein Annotation
 
Interactive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector EmbeddingsInteractive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector Embeddings
 
get_certificate
get_certificateget_certificate
get_certificate
 
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
 
Taking into account communities of practice’s specific vocabularies in inform...
Taking into account communities of practice’s specific vocabularies in inform...Taking into account communities of practice’s specific vocabularies in inform...
Taking into account communities of practice’s specific vocabularies in inform...
 
Statistical and Empirical Approaches to Spoken Dialog Systems
Statistical and Empirical Approaches to Spoken Dialog SystemsStatistical and Empirical Approaches to Spoken Dialog Systems
Statistical and Empirical Approaches to Spoken Dialog Systems
 
Assistive Technology
Assistive TechnologyAssistive Technology
Assistive Technology
 
Software engineering ontology and software testing
Software engineering ontology and software testingSoftware engineering ontology and software testing
Software engineering ontology and software testing
 
Coordinated NPs agree with singular or plural in there-constructions?: A comp...
Coordinated NPs agree with singular or plural in there-constructions?: A comp...Coordinated NPs agree with singular or plural in there-constructions?: A comp...
Coordinated NPs agree with singular or plural in there-constructions?: A comp...
 

Similar to BEA12_sakaguchi

Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
Rajat Jain
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for TranslationRIILP
 
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATAIDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
ijistjournal
 
Identifying the semantic relations on
Identifying the semantic relations onIdentifying the semantic relations on
Identifying the semantic relations on
ijistjournal
 
download
downloaddownload
downloadbutest
 
download
downloaddownload
downloadbutest
 
A COMPARATIVE STUDY OF FEATURE SELECTION METHODS
A COMPARATIVE STUDY OF FEATURE SELECTION METHODSA COMPARATIVE STUDY OF FEATURE SELECTION METHODS
A COMPARATIVE STUDY OF FEATURE SELECTION METHODS
kevig
 
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Tadahiro Taniguchi
 
Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...
Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...
Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...
Grammarly
 
AsiaCALL 2017 presentation
AsiaCALL 2017 presentationAsiaCALL 2017 presentation
AsiaCALL 2017 presentation
Takeshi Sato
 
Doc format.
Doc format.Doc format.
Doc format.butest
 
Language Grid
Language GridLanguage Grid
Language Gridlindh
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
Journal For Research
 
A prior case study of natural language processing on different domain
A prior case study of natural language processing  on different domain A prior case study of natural language processing  on different domain
A prior case study of natural language processing on different domain
IJECEIAES
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
kevig
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
kevig
 
USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...
USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...
USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...
cseij
 
Word Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented LanguagesWord Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented Languages
hs0041
 

Similar to BEA12_sakaguchi (20)

Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation
 
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATAIDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
 
Identifying the semantic relations on
Identifying the semantic relations onIdentifying the semantic relations on
Identifying the semantic relations on
 
download
downloaddownload
download
 
download
downloaddownload
download
 
C14-1028
C14-1028C14-1028
C14-1028
 
A COMPARATIVE STUDY OF FEATURE SELECTION METHODS
A COMPARATIVE STUDY OF FEATURE SELECTION METHODSA COMPARATIVE STUDY OF FEATURE SELECTION METHODS
A COMPARATIVE STUDY OF FEATURE SELECTION METHODS
 
UWB semeval2016-task5
UWB semeval2016-task5UWB semeval2016-task5
UWB semeval2016-task5
 
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
 
Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...
Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...
Grammarly AI-NLP Club #4 - Understanding and assessing language with neural n...
 
AsiaCALL 2017 presentation
AsiaCALL 2017 presentationAsiaCALL 2017 presentation
AsiaCALL 2017 presentation
 
Doc format.
Doc format.Doc format.
Doc format.
 
Language Grid
Language GridLanguage Grid
Language Grid
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
 
A prior case study of natural language processing on different domain
A prior case study of natural language processing  on different domain A prior case study of natural language processing  on different domain
A prior case study of natural language processing on different domain
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
 
USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...
USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...
USING TF-ISF WITH LOCAL CONTEXT TO GENERATE AN OWL DOCUMENT REPRESENTATION FO...
 
Word Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented LanguagesWord Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented Languages
 

More from Keisuke Sakaguchi

EMNLP 2021 proScript (summary slides)
EMNLP 2021 proScript (summary slides)EMNLP 2021 proScript (summary slides)
EMNLP 2021 proScript (summary slides)
Keisuke Sakaguchi
 
EMNLP 2021 proScript
EMNLP 2021 proScriptEMNLP 2021 proScript
EMNLP 2021 proScript
Keisuke Sakaguchi
 
Acl18 sakaguchi
Acl18 sakaguchiAcl18 sakaguchi
Acl18 sakaguchi
Keisuke Sakaguchi
 
Ijcnlp17 sakaguchi
Ijcnlp17 sakaguchiIjcnlp17 sakaguchi
Ijcnlp17 sakaguchi
Keisuke Sakaguchi
 
ACL17_Sakaguchi
ACL17_SakaguchiACL17_Sakaguchi
ACL17_Sakaguchi
Keisuke Sakaguchi
 
TACL16_Sakaguchi
TACL16_SakaguchiTACL16_Sakaguchi
TACL16_Sakaguchi
Keisuke Sakaguchi
 
ACL13_sakaguchi
ACL13_sakaguchiACL13_sakaguchi
ACL13_sakaguchi
Keisuke Sakaguchi
 
WMT14_sakaguchi
WMT14_sakaguchiWMT14_sakaguchi
WMT14_sakaguchi
Keisuke Sakaguchi
 
COLING12_sakaguchi
COLING12_sakaguchiCOLING12_sakaguchi
COLING12_sakaguchi
Keisuke Sakaguchi
 

More from Keisuke Sakaguchi (10)

EMNLP 2021 proScript (summary slides)
EMNLP 2021 proScript (summary slides)EMNLP 2021 proScript (summary slides)
EMNLP 2021 proScript (summary slides)
 
EMNLP 2021 proScript
EMNLP 2021 proScriptEMNLP 2021 proScript
EMNLP 2021 proScript
 
Acl18 sakaguchi
Acl18 sakaguchiAcl18 sakaguchi
Acl18 sakaguchi
 
Ijcnlp17 sakaguchi
Ijcnlp17 sakaguchiIjcnlp17 sakaguchi
Ijcnlp17 sakaguchi
 
ACL17_Sakaguchi
ACL17_SakaguchiACL17_Sakaguchi
ACL17_Sakaguchi
 
TACL16_Sakaguchi
TACL16_SakaguchiTACL16_Sakaguchi
TACL16_Sakaguchi
 
NAACL15_sakaguchi
NAACL15_sakaguchiNAACL15_sakaguchi
NAACL15_sakaguchi
 
ACL13_sakaguchi
ACL13_sakaguchiACL13_sakaguchi
ACL13_sakaguchi
 
WMT14_sakaguchi
WMT14_sakaguchiWMT14_sakaguchi
WMT14_sakaguchi
 
COLING12_sakaguchi
COLING12_sakaguchiCOLING12_sakaguchi
COLING12_sakaguchi
 

Recently uploaded

Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Sebastiano Panichella
 
Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 Presentation
Access Innovations, Inc.
 
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Sebastiano Panichella
 
Getting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerGetting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control Tower
Vladimir Samoylov
 
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
OECD Directorate for Financial and Enterprise Affairs
 
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptxsomanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
Howard Spence
 
International Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software TestingInternational Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software Testing
Sebastiano Panichella
 
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Orkestra
 
Obesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditionsObesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditions
Faculty of Medicine And Health Sciences
 
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
0x01 - Newton's Third Law:  Static vs. Dynamic Abusers0x01 - Newton's Third Law:  Static vs. Dynamic Abusers
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
OWASP Beja
 
Bitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXOBitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXO
Matjaž Lipuš
 
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdfBonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
khadija278284
 
Acorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesAcorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutes
IP ServerOne
 

Recently uploaded (13)

Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...
 
Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 Presentation
 
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
 
Getting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerGetting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control Tower
 
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
 
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptxsomanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
 
International Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software TestingInternational Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software Testing
 
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
 
Obesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditionsObesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditions
 
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
0x01 - Newton's Third Law:  Static vs. Dynamic Abusers0x01 - Newton's Third Law:  Static vs. Dynamic Abusers
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
 
Bitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXOBitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXO
 
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdfBonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
 
Acorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesAcorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutes
 

BEA12_sakaguchi

  • 1. NAIST  at  the  HOO  2012  Shared  Task   Keisuke  Sakaguchi,  Yuta  Hayashibe,  Shuhei  Kondo,  Lis  Kanashiro,  Tomoya  Mizumoto,  Mamoru  Komachi,  Yuji  Matsumoto   ComputaFonal  LinguisFcs  Lab.  Graduate  School  of  InformaFon  Science,  Nara  InsFtute  of  Science  and  Technology  (NAIST),  Japan     Configurations of the system! Input →                                → Output !Spelling correction! Determiner correction! Preposition correction! System Architecture for Spelling Error Correction! System Architecture for Preposition Error Correction! System Architecture for Determiner Error Correction! Experiment and Result!   Spelling correction for unknown words !   Open source spelling checker: GNU Aspell!   Ranked by Google Web 1T 5-gram language model!   Preliminary experiment: 52.4, 72.2, and 60.7% (Precision, Recall, F-score)   12 target prepositions (Chodorow et al., 2010): ! of, in, for, to, by, at, on, from, as, about (covering 91%) !   Replacement and insertion errors: a single model for detection and correction!   Deletion errors: focus whether direct objects of verbs need prepositions !   Syntactic & semantic features described in (Tetreault et al., 2010)!   Classifier: Maximum Entropy modeling!   Trained on 2 types of corpus (FCE): !   “Gold” (Corrected except for Prep errors)!   “Original” (FCE plain texts)!   3 target determiners: a, an, the (“an” was normalized to “a” in training & test)!   check determiners of the left boundary of a noun phrase!   2 parser models: “Normal” (trained on the normal treebank) vs. ! “mixed” (trained on the treebank & its modified version in which articles at the left boundary of NPs were removed.)!   Feature vector representation for each NP using syntax-based feature templates inspired by (De Felice, 2008)!   Classifier: Passive aggressive algorithm!   Training corpus: the CLC FCE dataset and the BNC data with feature augmentation approach of (Daumé III, 2007)! References! Summary! Future Work!   Correcting spelling errors of existing words (e.g. *the  then)!   Getting rich knowledge about verbs from VerbNet and FrameNet!   Adding target determiners (this, my, etc.)!  Martin Chodorow, Michael Gamon, and Joel Tetreault. 2010. The Utility of Article and Preposition Error Correction Systems for English Language Learners: Feedback and Assessment. Language Testing, 27(3):419– 436.!  Joel Tetreault, Jennifer Foster, and Martin Chodorow. 2010. Using Parse Features for Preposition Selection and Error Detection. In Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics Short Papers, pages 353–358, Uppsala, Sweden.!  Rachele De Felice. 2008. Automatic Error Detection in Non-native English. Ph.D. thesis University of Oxford. !  Hal Daumé III. 2007. Frustratingly Easy Domain Adaptation. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 256–263, Prague, Czech Republic. ! http://cl.naist.jp/en/ {keisuke-sa, yuta-h, shuhei-k, lis-k, tomoya-m, komachi, matsu}@is.naist.jp   Combined after revision!  Spelling: Spelling correction improved preposition error correction!  Prepositions: Performed better when trained with the “original” set!  Determiners: The "mixed" model improved performance!   8 different configurations (Runs)    Preposition!   Determiner! *We  re-­‐evaluated  the  Run2  because  we  submiTed  the  Run2  with  the  same  condiFon  as  Run0.