Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Programming for linguistics: Evaluating Parser Output

655 views

Published on

Published in: Self Improvement
  • Be the first to comment

Programming for linguistics: Evaluating Parser Output

  1. 1. Programming for Linguistics December 16, 2013 Evaluating parser output
  2. 2. Outline of topics 1. Preprocessing (10 slides) 2. Processing (6 slides) 3. Evaluation (1 slide) 4. Results (1 slide) 5. Improvements (1 slide)
  3. 3. Programming for Linguistics 1 Preprocessing Penn Treebank trees as NLTK Tree objects
  4. 4. Programming for Linguistics 1 Preprocessing
  5. 5. Programming for Linguistics 1 Preprocessing
  6. 6. Programming for Linguistics 1 Preprocessing
  7. 7. Programming for Linguistics 1 Preprocessing
  8. 8. Programming for Linguistics 1 Preprocessing
  9. 9. Programming for Linguistics 1 Preprocessing
  10. 10. Programming for Linguistics 1 Preprocessing
  11. 11. Programming for Linguistics 2 Processing D. Klein and C. Manning 2002. A generative constituent-context model for improved grammar induction. In Proceedings of the ACL.
  12. 12. Programming for Linguistics 2 Processing Does this work for original trees and processed trees?
  13. 13. Programming for Linguistics 2 Processing Spans represented as label-span tuples: (‘label’, start, end)
  14. 14. Programming for Linguistics 2 Processing
  15. 15. Programming for Linguistics 2 Processing
  16. 16. Programming for Linguistics 2 Processing
  17. 17. Programming for Linguistics 2 Processing Now what can we do with this information?
  18. 18. Programming for Linguistics 4 Evaluation
  19. 19. Programming for Linguistics 4 Evaluation
  20. 20. Programming for Linguistics 5 Results Evaluation after removing all complex labels and empty categories. GL = gold labels AL = auto labels M = matches
  21. 21. Programming for Linguistics 5 Improvements ● Cross brackets ● Remove only labels/categories ● Consider context ● Constituent types ● Depends on task… Thanks!

×