SlideShare a Scribd company logo
1 of 39
Download to read offline
Transition-Based
Dependency Parsing
SaarbrΓΌcken, December 23rd 2011
David Przybilla – davida@coli.uni-saarland.de
Outline
1. MaltParser
2. Transition Based Parsing
a. Example
b. Oracle
3. Integrating Graph and Transition Based
4. Non –Projective Dependency Parsing
MaltParser
● Different Languages ( No tuning for an Specific Lang)
● Language independent: accurate parsing for a wide
variety of languages
● Accuracy between 80% and 90%
● Deterministic
Treebank MaltParser Dependency Parser
( Transition Based )
input output
Transition Based Parsing
Stack Buffer
π‘Šπ‘–
π‘Šπ‘–+1
π‘Šπ‘–+2
π‘Šπ‘˜
π‘Šπ‘₯
..
..
● Shift
● Left-arc
● Right-arc
● Reduction
Transitions
Actions
Example
John hit the ball
Stack Buffer
John
hit
the
ball
Example
John hit the ball
Stack Buffer
John
hit
the
ball
Transition=Shift
Example
John hit the ball
Stack Buffer
John
hit
the
ball
Transition=left Arc
Subj
Only if β„Ž(π‘—π‘œβ„Žπ‘›) = 0
Example
John hit the ball
Stack Buffer
hit
ball
Transition=Shift
Subj
the
Example
John hit the ball
Stack Buffer
hit
the
ball
Transition=Shift
Subj
Example
John hit the ball
Stack Buffer
hit
the
ball
Transition=left Arc
Subj Det
Only if β„Ž(π‘‘β„Žπ‘’) = 0
Example
John hit the ball
Stack Buffer
hit
ball
Transition=Right Arc
Subj Det
Obj
Only if β„Ž(π‘π‘Žπ‘™π‘™) = 0
Example
John hit the ball
Stack Buffer
hit
ball
Subj Det
Obj
Buffer is Empty= Terminal Configuration
Transition Based Parsing
Stack Buffer
π‘Šπ‘–
π‘Šπ‘–+1
π‘Šπ‘–+2
π‘Šπ‘˜
π‘Šπ‘₯
..
..
Reduction
Stack
π‘Šπ‘₯
..
..
Buffer
π‘Šπ‘–
π‘Šπ‘–+1
π‘Šπ‘–+2
π‘Šπ‘˜
𝑆𝑒𝑛𝑑𝑒𝑛𝑐𝑒: π‘Šπ‘₯ π‘Šπ‘˜β€¦π‘Šπ‘– π‘Šπ‘–+1 … .
Only if β„Ž(π‘Šπ‘˜) β‰  0
Oracle
● Greedy Algorithm, choose a local optimal hoping it will lead
to the global optimal
● It makes Transition Based Algorithm Deterministic.
● Originally there might be more than one possible transition from
one configuration to another
● Construct the Optimal Transition sequence for the
Input Sentence
● How to Build the Oracle? Build a Classifier
Classifier
The Classifier
Classes:
● Shift
● Left-arc
● Right-arc
● Reduction
Feature Vector (Features)
● POS of words in the Buffer
and Stack
● Words themselves
● The First Word in the Stack
● The L World in the Buffer
● The current arcs in the
Graph
Results of the MaltParser
● Evaluation Metrics:
● ASU (Unlabeled Attachment Score): Proportion of Tokens assigned
the correct head
● ASL(Labeled Attachment Score): Proportion of tokens assigned with
the correct head and the correct dependency type
Results of the MaltParser
More flexible Word order
Rich Morphology
More Inflexible Word order,
β€˜poor’ Morphology
English
Chinese
Czech
Turkish
Danish
Dutch
Italian
Swedish
German
Goal ->
Evaluate if Maltparser can do reasonably accurate parsing for a
wide variety of languages
Results of the MaltParser
Results of the MaltParser
● Results:
● Above 80% unlabeled dependency Accuracy (ASU) for all languages
● morphological richness and word order are the cause of variation
across languages
In General lower accuracy for languages like Czech and Turkish.
– There are more non-projective structures in those languages
● It is difficult to do Cross-Language Comparison:
– Big difference in the amount of annotated data
– existence of accurate POS Taggers..
State of the art for Italian, Swedish, Danish, Turkish
Graph Based vs Transition Based
Graph Based
● Search for Optimal Graph
(Highest Scoring Graph)
● Globally Trained(Global
Optimal)
● Limited History of Parsing
Desitions
● Less rich feature
representation
Transition Based
● Search for Optimal Graph by finding
the best transition between two
states. (Local Optimal Desitions)
● Locally Trained (configurations)
● Rich History of Parsing Desitions
● More rich feature but Error
Propagation (Greedy Alg.)
Graph Based vs Transition Based
Graph Based (MST)
● Better for Long
Dependencies
● More accurate for
dependents that are :
● Verbs
● Adjectives
● Adverbs
Transition Based(Malt)
● Better for Short dependencies
● More accurate for dependents
that are:
● Nouns
● Pronouns
Integrate Both Approaches
Integrating Graph and Transition Based
Treebank T Malt Parser Transition Based
Parser
Parsed T
● Integrate both approaches at learning time.
MST Parser
● Base MSTParser guided by Malt
Treebank T MST Parser Transition Based
Parser
Malt Parser
● Base MALTParser guided by MLT
Parsed T
Features used in the Integration
● MSTParser guided by
Malt
● Is arc (𝑖, 𝑗,βˆ—) in 𝐺 π‘šπ‘Žπ‘™π‘‘
● Is arc (𝑖, 𝑗, 𝑙) in 𝐺 π‘šπ‘Žπ‘™π‘‘
● Is arc 𝑖, 𝑗,βˆ— π‘›π‘œπ‘‘ in 𝐺 π‘šπ‘Žπ‘™π‘‘
● Identity of 𝑙’ such that
𝑖, 𝑗, 𝑙′ is in 𝐺 π‘šπ‘Žπ‘™π‘‘
● ..
MaltParser guided by MST
● Is arc (𝑆0
, 𝐡0
,βˆ—) in 𝐺 π‘šπ‘ π‘‘
● Is arc (𝐡0, 𝑆0,βˆ—) in 𝐺 π‘šπ‘ π‘‘
● Head direction of 𝐡0 in 𝐺 π‘šπ‘ π‘‘
(left,right,root..)
● Identity of 𝑙’ such that βˆ—, 𝐡0, 𝑙′
is in 𝐺 π‘šπ‘ π‘‘
𝑆0
=fist element of the Stack, 𝐡0
=First element of the Buffer
Results of Integration
Asl(Correct head And Correct Label)
Results of Integration
Asl(Correct head And Correct Label)
Results of Integration
Asl(Correct head And Correct Label)
Results of Integration
● Graph-based models predict better long arcs
● Each model learn streghts from the others
● The integration actually improves accuracy
● Trying to do more chaining of systems do not
gain better accuracy
Non-Projectivity
● Some Sentences have long distance dependencies which
cannot be parsed with this algorithm
● Cause it only consider relations between neighbors words
● 25% or more of the sentences in some languages are non-
projective
● Useful for some languages with less constraints on word
order
● Harder Problem, There could be relations over unbounded
distances.
Non-Projectivity
A dependency Tree 𝑇 is Projective:
if for every π΄π‘Ÿπ‘ (π‘Šπ‘–, π‘Šπ‘—, π‘Ÿπ‘’π‘™) there is a path from π‘Šπ‘– to π‘Šπ‘˜ , if π‘Šπ‘˜
is between π‘Šπ‘– and π‘Šπ‘—
From β€˜Scheduled’ π‘Š2 there is an arc to π‘Š5 however there is no
way to get to π‘Š4, π‘Š3 from π‘Š2
Non-Projectivity
● Why the previous transition algorithm would not be able to
generate this tree?
Stack Buffer
is
hearing
On
…
…
β€˜is’ can never be reduced
β€˜hearing’ and β€˜on’ will never
get an arc
Handling Non-Projectivity
● Add a new Transition – ’’Swap’’
Stack Buffer
π‘Šπ‘˜
π‘Šπ‘–
π‘Šπ‘–+1
Stack Buffer
π‘Šπ‘˜
π‘Šπ‘–π‘Šπ‘–+1
swap
● Re-Order the initial Input Sentance
Non-Projectivity
Stack Buffer
is
hearing
On
…
…
Stack Buffer
is
..
Hearing
On
…
swap
Non-Projective Dependency Parsing
● Useful for some languages with less constraints on word
order
Theoretically
● Best case 𝑂(𝑁), , that is: no swaps
● Worst Case 𝑂(𝑁2
),
Results
Non-Projective Dependency Parsing
Running Time
● Test on 5 languages( Danish, Arabic, Czech, Slovene, Turkish)
● In practice the running time is 𝑂 𝑁 .
Parsing Accuracy
● Criteria
● Attachment Score: Percentage of tokens with correct head and
dependency label
● Exact match: completely correct labeled dependency tree
Results
Non-Projective Dependency Parsing
● Systems Compared
● 𝑺 𝒖= allowing Non Projective
● 𝑺 𝒑 =Just Projective
● 𝑺 𝒑𝒑=Handling non-Projectivity as a pos-processing
● AS: Percentage of tokens with correct head and dependency label
● EM: completely correct labeled dependency tree
Results
Non-Projective Dependency Parsing
● AS
● Performance of 𝑆 𝑒 is better for for:
– Czech and Slovene οƒ  more non-porjective arcs in this languages.
● In AS 𝑆 𝑒 is lower than 𝑆 𝑝, however the drop is not really significant
● For Arabic the results are not meaningful since there are only 11 non-
projective arcs in the whole set
● ME
● 𝑆 𝑒 outperforms all other parsers.
● The positive effect of 𝑆 𝑒 is dependent on the non-projectivity arcs in the
language
References
● Joakim Nivre, Jens Nilsson, Johan Hall, Atanas Chanev, GΓΌlsen Eryigit, Sandra
KΓΌbler, Svetoslav Marinov, and Erwin Marsi. Maltparser: a language-
independent system for data-driven dependency parsing. Natural Language
Engineering, 13(1):1–41, 2007.
● Joakim Nivre and Ryan McDonald. Integrating graph-based and transition-
based dependency parsers. In Proceedings of ACL-08: HLT, pages 950–958,
Columbus, Ohio, June 2008.
● Joakim Nivre. Non-projective dependency parsing in expected linear time.
In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL
and the 4th International Joint Conference on Natural Language Processing of
the AFNLP, pages 351–359, Suntec, Singapore, 2009.
● Sandra KΓΌbler, Ryan McDonald, Joakim Nivre. Dependency Parsing, Morgan &
Claypool Publishers, 2009

More Related Content

What's hot

Context free grammars
Context free grammarsContext free grammars
Context free grammarsRonak Thakkar
Β 
Regular Expression
Regular ExpressionRegular Expression
Regular Expressionvaluebound
Β 
Feature Selection in Machine Learning
Feature Selection in Machine LearningFeature Selection in Machine Learning
Feature Selection in Machine LearningUpekha Vandebona
Β 
NLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for EnglishNLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for EnglishHemantha Kulathilake
Β 
Intermediate code generation (Compiler Design)
Intermediate code generation (Compiler Design)   Intermediate code generation (Compiler Design)
Intermediate code generation (Compiler Design) Tasif Tanzim
Β 
Grep - A powerful search utility
Grep - A powerful search utilityGrep - A powerful search utility
Grep - A powerful search utilityNirajan Pant
Β 
Error Detection & Recovery
Error Detection & RecoveryError Detection & Recovery
Error Detection & RecoveryAkhil Kaushik
Β 
Greedy Algorithm
Greedy AlgorithmGreedy Algorithm
Greedy AlgorithmWaqar Akram
Β 
Introduction to compiler
Introduction to compilerIntroduction to compiler
Introduction to compilerAbha Damani
Β 
Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)Abdullah al Mamun
Β 
Programming Language Selection
Programming Language SelectionProgramming Language Selection
Programming Language SelectionDhananjay Nene
Β 
Bootstrapping in Compiler
Bootstrapping in CompilerBootstrapping in Compiler
Bootstrapping in CompilerAkhil Kaushik
Β 
Lecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation MaximizationLecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation Maximizationbutest
Β 
CS571: Dependency Parsing
CS571: Dependency ParsingCS571: Dependency Parsing
CS571: Dependency ParsingJinho Choi
Β 
AI 7 | Constraint Satisfaction Problem
AI 7 | Constraint Satisfaction ProblemAI 7 | Constraint Satisfaction Problem
AI 7 | Constraint Satisfaction ProblemMohammad Imam Hossain
Β 
Hadoop Overview & Architecture
Hadoop Overview & Architecture  Hadoop Overview & Architecture
Hadoop Overview & Architecture EMC
Β 
CONTEXT FREE GRAMMAR
CONTEXT FREE GRAMMAR CONTEXT FREE GRAMMAR
CONTEXT FREE GRAMMAR Zahid Parvez
Β 

What's hot (20)

Context free grammars
Context free grammarsContext free grammars
Context free grammars
Β 
Gradient descent method
Gradient descent methodGradient descent method
Gradient descent method
Β 
Regular Expression
Regular ExpressionRegular Expression
Regular Expression
Β 
Feature Selection in Machine Learning
Feature Selection in Machine LearningFeature Selection in Machine Learning
Feature Selection in Machine Learning
Β 
NLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for EnglishNLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for English
Β 
Intermediate code generation (Compiler Design)
Intermediate code generation (Compiler Design)   Intermediate code generation (Compiler Design)
Intermediate code generation (Compiler Design)
Β 
Grep - A powerful search utility
Grep - A powerful search utilityGrep - A powerful search utility
Grep - A powerful search utility
Β 
Unit iv
Unit ivUnit iv
Unit iv
Β 
Error Detection & Recovery
Error Detection & RecoveryError Detection & Recovery
Error Detection & Recovery
Β 
Greedy Algorithm
Greedy AlgorithmGreedy Algorithm
Greedy Algorithm
Β 
Introduction to compiler
Introduction to compilerIntroduction to compiler
Introduction to compiler
Β 
Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)
Β 
Branch and bound
Branch and boundBranch and bound
Branch and bound
Β 
Programming Language Selection
Programming Language SelectionProgramming Language Selection
Programming Language Selection
Β 
Bootstrapping in Compiler
Bootstrapping in CompilerBootstrapping in Compiler
Bootstrapping in Compiler
Β 
Lecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation MaximizationLecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation Maximization
Β 
CS571: Dependency Parsing
CS571: Dependency ParsingCS571: Dependency Parsing
CS571: Dependency Parsing
Β 
AI 7 | Constraint Satisfaction Problem
AI 7 | Constraint Satisfaction ProblemAI 7 | Constraint Satisfaction Problem
AI 7 | Constraint Satisfaction Problem
Β 
Hadoop Overview & Architecture
Hadoop Overview & Architecture  Hadoop Overview & Architecture
Hadoop Overview & Architecture
Β 
CONTEXT FREE GRAMMAR
CONTEXT FREE GRAMMAR CONTEXT FREE GRAMMAR
CONTEXT FREE GRAMMAR
Β 

Similar to Transition Based Dependency Parsing

Word Space Models and Random Indexing
Word Space Models and Random IndexingWord Space Models and Random Indexing
Word Space Models and Random IndexingDileepa Jayakody
Β 
Word Space Models & Random indexing
Word Space Models & Random indexingWord Space Models & Random indexing
Word Space Models & Random indexingDileepa Jayakody
Β 
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...Jungo Kasai
Β 
Networks and Natural Language Processing
Networks and Natural Language ProcessingNetworks and Natural Language Processing
Networks and Natural Language ProcessingAhmed Magdy Ezzeldin, MSc.
Β 
introtonlp-190218095523 (1).pdf
introtonlp-190218095523 (1).pdfintrotonlp-190218095523 (1).pdf
introtonlp-190218095523 (1).pdfAdityaMishra178868
Β 
PL Lecture 01 - preliminaries
PL Lecture 01 - preliminariesPL Lecture 01 - preliminaries
PL Lecture 01 - preliminariesSchwannden Kuo
Β 
Intro to nlp
Intro to nlpIntro to nlp
Intro to nlpankit_ppt
Β 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools Lifeng (Aaron) Han
Β 
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...Lifeng (Aaron) Han
Β 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Lifeng (Aaron) Han
Β 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Abdullah al Mamun
Β 
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...Lviv Data Science Summer School
Β 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueJinho Choi
Β 
Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaMachine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaYohanes Nuwara
Β 
N20190729
N20190729N20190729
N20190729TMU, Japan
Β 
Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?Hernan Wilkinson
Β 
Cassandra: Not Just NoSQL, It's MoSQL
Cassandra: Not Just NoSQL, It's MoSQLCassandra: Not Just NoSQL, It's MoSQL
Cassandra: Not Just NoSQL, It's MoSQLEric Evans
Β 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmDhruvKushwaha12
Β 
2211 APSIPA
2211 APSIPA2211 APSIPA
2211 APSIPAWarNik Chow
Β 

Similar to Transition Based Dependency Parsing (20)

Word Space Models and Random Indexing
Word Space Models and Random IndexingWord Space Models and Random Indexing
Word Space Models and Random Indexing
Β 
Word Space Models & Random indexing
Word Space Models & Random indexingWord Space Models & Random indexing
Word Space Models & Random indexing
Β 
Esa act
Esa actEsa act
Esa act
Β 
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Transl...
Β 
Networks and Natural Language Processing
Networks and Natural Language ProcessingNetworks and Natural Language Processing
Networks and Natural Language Processing
Β 
introtonlp-190218095523 (1).pdf
introtonlp-190218095523 (1).pdfintrotonlp-190218095523 (1).pdf
introtonlp-190218095523 (1).pdf
Β 
PL Lecture 01 - preliminaries
PL Lecture 01 - preliminariesPL Lecture 01 - preliminaries
PL Lecture 01 - preliminaries
Β 
Intro to nlp
Intro to nlpIntro to nlp
Intro to nlp
Β 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
Β 
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
Β 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
Β 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
Β 
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Β 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to Dialogue
Β 
Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaMachine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
Β 
N20190729
N20190729N20190729
N20190729
Β 
Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?
Β 
Cassandra: Not Just NoSQL, It's MoSQL
Cassandra: Not Just NoSQL, It's MoSQLCassandra: Not Just NoSQL, It's MoSQL
Cassandra: Not Just NoSQL, It's MoSQL
Β 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Β 
2211 APSIPA
2211 APSIPA2211 APSIPA
2211 APSIPA
Β 

More from David Przybilla

Reproducible datascience [with Terraform]
Reproducible datascience [with Terraform]Reproducible datascience [with Terraform]
Reproducible datascience [with Terraform]David Przybilla
Β 
Python in the land of serverless
Python in the land of serverlessPython in the land of serverless
Python in the land of serverlessDavid Przybilla
Β 
Apache Spark - Introduccion a RDDs
Apache Spark - Introduccion a RDDsApache Spark - Introduccion a RDDs
Apache Spark - Introduccion a RDDsDavid Przybilla
Β 
Procesamiento de Lenguaje Natural
Procesamiento de Lenguaje NaturalProcesamiento de Lenguaje Natural
Procesamiento de Lenguaje NaturalDavid Przybilla
Β 
Label propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLPLabel propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLPDavid Przybilla
Β 
Automatic generation of domain models for call centers
Automatic generation of domain models for call centersAutomatic generation of domain models for call centers
Automatic generation of domain models for call centersDavid Przybilla
Β 

More from David Przybilla (7)

Reproducible datascience [with Terraform]
Reproducible datascience [with Terraform]Reproducible datascience [with Terraform]
Reproducible datascience [with Terraform]
Β 
Python in the land of serverless
Python in the land of serverlessPython in the land of serverless
Python in the land of serverless
Β 
Terraforming
Terraforming Terraforming
Terraforming
Β 
Apache Spark - Introduccion a RDDs
Apache Spark - Introduccion a RDDsApache Spark - Introduccion a RDDs
Apache Spark - Introduccion a RDDs
Β 
Procesamiento de Lenguaje Natural
Procesamiento de Lenguaje NaturalProcesamiento de Lenguaje Natural
Procesamiento de Lenguaje Natural
Β 
Label propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLPLabel propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLP
Β 
Automatic generation of domain models for call centers
Automatic generation of domain models for call centersAutomatic generation of domain models for call centers
Automatic generation of domain models for call centers
Β 

Recently uploaded

SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
Β 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
Β 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
Β 
Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...soniya singh
Β 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz ChruΕ›ciel
Β 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
Β 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
Β 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
Β 
(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...
(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...
(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...gurkirankumar98700
Β 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
Β 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
Β 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
Β 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
Β 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
Β 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
Β 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
Β 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
Β 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
Β 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
Β 

Recently uploaded (20)

SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
Β 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
Β 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
Β 
Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➑️ 8264348440 πŸ’‹πŸ“ž Independent Escort S...
Β 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
Β 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
Β 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Β 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Β 
(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...
(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...
(Genuine) Escort Service Lucknow | Starting β‚Ή,5K To @25k with A/C πŸ§‘πŸ½β€β€οΈβ€πŸ§‘πŸ» 89...
Β 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
Β 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Β 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
Β 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
Β 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
Β 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
Β 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
Β 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
Β 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Β 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
Β 
Hot Sexy call girls in Patel NagarπŸ” 9953056974 πŸ” escort Service
Hot Sexy call girls in Patel NagarπŸ” 9953056974 πŸ” escort ServiceHot Sexy call girls in Patel NagarπŸ” 9953056974 πŸ” escort Service
Hot Sexy call girls in Patel NagarπŸ” 9953056974 πŸ” escort Service
Β 

Transition Based Dependency Parsing

  • 1. Transition-Based Dependency Parsing SaarbrΓΌcken, December 23rd 2011 David Przybilla – davida@coli.uni-saarland.de
  • 2. Outline 1. MaltParser 2. Transition Based Parsing a. Example b. Oracle 3. Integrating Graph and Transition Based 4. Non –Projective Dependency Parsing
  • 3. MaltParser ● Different Languages ( No tuning for an Specific Lang) ● Language independent: accurate parsing for a wide variety of languages ● Accuracy between 80% and 90% ● Deterministic Treebank MaltParser Dependency Parser ( Transition Based ) input output
  • 4. Transition Based Parsing Stack Buffer π‘Šπ‘– π‘Šπ‘–+1 π‘Šπ‘–+2 π‘Šπ‘˜ π‘Šπ‘₯ .. .. ● Shift ● Left-arc ● Right-arc ● Reduction Transitions Actions
  • 5.
  • 6. Example John hit the ball Stack Buffer John hit the ball
  • 7. Example John hit the ball Stack Buffer John hit the ball Transition=Shift
  • 8. Example John hit the ball Stack Buffer John hit the ball Transition=left Arc Subj Only if β„Ž(π‘—π‘œβ„Žπ‘›) = 0
  • 9. Example John hit the ball Stack Buffer hit ball Transition=Shift Subj the
  • 10. Example John hit the ball Stack Buffer hit the ball Transition=Shift Subj
  • 11. Example John hit the ball Stack Buffer hit the ball Transition=left Arc Subj Det Only if β„Ž(π‘‘β„Žπ‘’) = 0
  • 12. Example John hit the ball Stack Buffer hit ball Transition=Right Arc Subj Det Obj Only if β„Ž(π‘π‘Žπ‘™π‘™) = 0
  • 13. Example John hit the ball Stack Buffer hit ball Subj Det Obj Buffer is Empty= Terminal Configuration
  • 14. Transition Based Parsing Stack Buffer π‘Šπ‘– π‘Šπ‘–+1 π‘Šπ‘–+2 π‘Šπ‘˜ π‘Šπ‘₯ .. .. Reduction Stack π‘Šπ‘₯ .. .. Buffer π‘Šπ‘– π‘Šπ‘–+1 π‘Šπ‘–+2 π‘Šπ‘˜ 𝑆𝑒𝑛𝑑𝑒𝑛𝑐𝑒: π‘Šπ‘₯ π‘Šπ‘˜β€¦π‘Šπ‘– π‘Šπ‘–+1 … . Only if β„Ž(π‘Šπ‘˜) β‰  0
  • 15. Oracle ● Greedy Algorithm, choose a local optimal hoping it will lead to the global optimal ● It makes Transition Based Algorithm Deterministic. ● Originally there might be more than one possible transition from one configuration to another ● Construct the Optimal Transition sequence for the Input Sentence ● How to Build the Oracle? Build a Classifier
  • 16. Classifier The Classifier Classes: ● Shift ● Left-arc ● Right-arc ● Reduction Feature Vector (Features) ● POS of words in the Buffer and Stack ● Words themselves ● The First Word in the Stack ● The L World in the Buffer ● The current arcs in the Graph
  • 17. Results of the MaltParser ● Evaluation Metrics: ● ASU (Unlabeled Attachment Score): Proportion of Tokens assigned the correct head ● ASL(Labeled Attachment Score): Proportion of tokens assigned with the correct head and the correct dependency type
  • 18. Results of the MaltParser More flexible Word order Rich Morphology More Inflexible Word order, β€˜poor’ Morphology English Chinese Czech Turkish Danish Dutch Italian Swedish German Goal -> Evaluate if Maltparser can do reasonably accurate parsing for a wide variety of languages
  • 19. Results of the MaltParser
  • 20. Results of the MaltParser ● Results: ● Above 80% unlabeled dependency Accuracy (ASU) for all languages ● morphological richness and word order are the cause of variation across languages In General lower accuracy for languages like Czech and Turkish. – There are more non-projective structures in those languages ● It is difficult to do Cross-Language Comparison: – Big difference in the amount of annotated data – existence of accurate POS Taggers.. State of the art for Italian, Swedish, Danish, Turkish
  • 21. Graph Based vs Transition Based Graph Based ● Search for Optimal Graph (Highest Scoring Graph) ● Globally Trained(Global Optimal) ● Limited History of Parsing Desitions ● Less rich feature representation Transition Based ● Search for Optimal Graph by finding the best transition between two states. (Local Optimal Desitions) ● Locally Trained (configurations) ● Rich History of Parsing Desitions ● More rich feature but Error Propagation (Greedy Alg.)
  • 22. Graph Based vs Transition Based Graph Based (MST) ● Better for Long Dependencies ● More accurate for dependents that are : ● Verbs ● Adjectives ● Adverbs Transition Based(Malt) ● Better for Short dependencies ● More accurate for dependents that are: ● Nouns ● Pronouns Integrate Both Approaches
  • 23. Integrating Graph and Transition Based Treebank T Malt Parser Transition Based Parser Parsed T ● Integrate both approaches at learning time. MST Parser ● Base MSTParser guided by Malt Treebank T MST Parser Transition Based Parser Malt Parser ● Base MALTParser guided by MLT Parsed T
  • 24. Features used in the Integration ● MSTParser guided by Malt ● Is arc (𝑖, 𝑗,βˆ—) in 𝐺 π‘šπ‘Žπ‘™π‘‘ ● Is arc (𝑖, 𝑗, 𝑙) in 𝐺 π‘šπ‘Žπ‘™π‘‘ ● Is arc 𝑖, 𝑗,βˆ— π‘›π‘œπ‘‘ in 𝐺 π‘šπ‘Žπ‘™π‘‘ ● Identity of 𝑙’ such that 𝑖, 𝑗, 𝑙′ is in 𝐺 π‘šπ‘Žπ‘™π‘‘ ● .. MaltParser guided by MST ● Is arc (𝑆0 , 𝐡0 ,βˆ—) in 𝐺 π‘šπ‘ π‘‘ ● Is arc (𝐡0, 𝑆0,βˆ—) in 𝐺 π‘šπ‘ π‘‘ ● Head direction of 𝐡0 in 𝐺 π‘šπ‘ π‘‘ (left,right,root..) ● Identity of 𝑙’ such that βˆ—, 𝐡0, 𝑙′ is in 𝐺 π‘šπ‘ π‘‘ 𝑆0 =fist element of the Stack, 𝐡0 =First element of the Buffer
  • 25. Results of Integration Asl(Correct head And Correct Label)
  • 26. Results of Integration Asl(Correct head And Correct Label)
  • 27. Results of Integration Asl(Correct head And Correct Label)
  • 28. Results of Integration ● Graph-based models predict better long arcs ● Each model learn streghts from the others ● The integration actually improves accuracy ● Trying to do more chaining of systems do not gain better accuracy
  • 29. Non-Projectivity ● Some Sentences have long distance dependencies which cannot be parsed with this algorithm ● Cause it only consider relations between neighbors words ● 25% or more of the sentences in some languages are non- projective ● Useful for some languages with less constraints on word order ● Harder Problem, There could be relations over unbounded distances.
  • 30. Non-Projectivity A dependency Tree 𝑇 is Projective: if for every π΄π‘Ÿπ‘ (π‘Šπ‘–, π‘Šπ‘—, π‘Ÿπ‘’π‘™) there is a path from π‘Šπ‘– to π‘Šπ‘˜ , if π‘Šπ‘˜ is between π‘Šπ‘– and π‘Šπ‘— From β€˜Scheduled’ π‘Š2 there is an arc to π‘Š5 however there is no way to get to π‘Š4, π‘Š3 from π‘Š2
  • 31. Non-Projectivity ● Why the previous transition algorithm would not be able to generate this tree? Stack Buffer is hearing On … … β€˜is’ can never be reduced β€˜hearing’ and β€˜on’ will never get an arc
  • 32. Handling Non-Projectivity ● Add a new Transition – ’’Swap’’ Stack Buffer π‘Šπ‘˜ π‘Šπ‘– π‘Šπ‘–+1 Stack Buffer π‘Šπ‘˜ π‘Šπ‘–π‘Šπ‘–+1 swap ● Re-Order the initial Input Sentance
  • 34. Non-Projective Dependency Parsing ● Useful for some languages with less constraints on word order Theoretically ● Best case 𝑂(𝑁), , that is: no swaps ● Worst Case 𝑂(𝑁2 ),
  • 35. Results Non-Projective Dependency Parsing Running Time ● Test on 5 languages( Danish, Arabic, Czech, Slovene, Turkish) ● In practice the running time is 𝑂 𝑁 . Parsing Accuracy ● Criteria ● Attachment Score: Percentage of tokens with correct head and dependency label ● Exact match: completely correct labeled dependency tree
  • 36. Results Non-Projective Dependency Parsing ● Systems Compared ● 𝑺 𝒖= allowing Non Projective ● 𝑺 𝒑 =Just Projective ● 𝑺 𝒑𝒑=Handling non-Projectivity as a pos-processing ● AS: Percentage of tokens with correct head and dependency label ● EM: completely correct labeled dependency tree
  • 37. Results Non-Projective Dependency Parsing ● AS ● Performance of 𝑆 𝑒 is better for for: – Czech and Slovene οƒ  more non-porjective arcs in this languages. ● In AS 𝑆 𝑒 is lower than 𝑆 𝑝, however the drop is not really significant ● For Arabic the results are not meaningful since there are only 11 non- projective arcs in the whole set ● ME ● 𝑆 𝑒 outperforms all other parsers. ● The positive effect of 𝑆 𝑒 is dependent on the non-projectivity arcs in the language
  • 38.
  • 39. References ● Joakim Nivre, Jens Nilsson, Johan Hall, Atanas Chanev, GΓΌlsen Eryigit, Sandra KΓΌbler, Svetoslav Marinov, and Erwin Marsi. Maltparser: a language- independent system for data-driven dependency parsing. Natural Language Engineering, 13(1):1–41, 2007. ● Joakim Nivre and Ryan McDonald. Integrating graph-based and transition- based dependency parsers. In Proceedings of ACL-08: HLT, pages 950–958, Columbus, Ohio, June 2008. ● Joakim Nivre. Non-projective dependency parsing in expected linear time. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pages 351–359, Suntec, Singapore, 2009. ● Sandra KΓΌbler, Ryan McDonald, Joakim Nivre. Dependency Parsing, Morgan & Claypool Publishers, 2009