SlideShare a Scribd company logo
an incremental algorithm for
transition-based ccg parsing
B. R. Ambati, T. Deoskar, M. Johnson and M. Steedman
Miyazawa Akira
January 22, 2016
The Graduate University For Advanced Studies / National Institute of Informatics
incrememtal parser
What does “Incremental” mean?
An incremental parser computes the relationship between words
as soon as it receives them from the input.
Why is incrementality important?
∙ Statistical Machine Translation
∙ Automatic Speech Recognition
1
baseline algorithm
Their baseline algorithm NonInc is based on Zhang and Clark (2011).
It has four actions.
∙ Shift
Push a word from the input buffer to the stack and assign it with a
CCG category.
∙ Reduce Left
Pop the top two nodes from the stack, combine them into a new
node and push it back onto the stack with a new category. The
right node is the head and the left node is reduced.
∙ Reduce Right
This is similar to the action above except that the left node is the
head and the right node is reduced.
∙ Unary
Change the category of the top node on the stack.
2
noninc algorithm
0
Input buffer
John likes mangoes from India madly
Stack
Dependency graph
3
noninc algorithm
1 Shift
Input buffer
likes mangoes from India madly
Stack
NPJohn
Dependency graph
3
noninc algorithm
2 Shift
Input buffer
mangoes from India madly
Stack
NPJohn (SNP)/NPlikes
Dependency graph
3
noninc algorithm
3 Shift
Input buffer
from India madly
Stack
NPJohn (SNP)/NPlikes NPmangoes
Dependency graph
3
noninc algorithm
4 Shift
Input buffer
India madly
Stack
NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom
Dependency graph
3
noninc algorithm
5 Shift
Input buffer
madly
Stack
NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom NPIndia
Dependency graph
3
noninc algorithm
6 Reduce Right
Input buffer
madly
Stack
NPJohn (SNP)/NPlikes NPmangoes NPNPfrom
Dependency graph
from
India
3
noninc algorithm
7 Reduce Right
Input buffer
madly
Stack
NPJohn (SNP)/NPlikes NPmangoes
Dependency graph
from
India
mangoes
3
noninc algorithm
8 Reduce Right
Input buffer
madly
Stack
NPJohn SNPlikes
Dependency graph
from
India
mangoes
likes
3
noninc algorithm
9 Shift
Input buffer
Stack
NPJohn SNPlikes (SNP)(SNP)madly
Dependency graph
from
India
mangoes
likes
3
noninc algorithm
10 Reduce Right
Input buffer
Stack
NPJohn SNPlikes
Dependency graph
from
India
mangoes
likes
madly
3
noninc algorithm
11 Reduce Left
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
3
noninc algorithm
Finish
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
3
problem in noninc
The algorithm above is not incremental. The dependncy graph starts
to grow only after almost all the words are pushed to the stack.
To solve this problem, they introduce a revealing technique
(Pareschi and Steedman (1987)).
4
revinc algorithm
0
Input buffer
John likes mangoes from India madly
Stack
Dependency graph
5
revinc algorithm
1 Shift
Input buffer
likes mangoes from India madly
Stack
NPJohn
Dependency graph
5
revinc algorithm
2 Shift
Input buffer
mangoes from India madly
Stack
NPJohn (SNP)/NPlikes
Dependency graph
5
revinc algorithm
3-1 Type-Raising
Input buffer
mangoes from India madly
Stack
S/(SNP)John (SNP)/NPlikes
Dependency graph
5
revinc algorithm
3-2 Reduce Left
Input buffer
mangoes from India madly
Stack
S/NPlikes
Dependency graph
likes
John
5
revinc algorithm
4 Shift
Input buffer
from India madly
Stack
S/NPlikes NPmangoes
Dependency graph
likes
John
5
revinc algorithm
5 Reduce Right
Input buffer
from India madly
Stack
Slikes
Dependency graph
mangoes
likes
John
5
revinc algorithm
6 Shift
Input buffer
India madly
Stack
Slikes (NPNP)/NPfrom
Dependency graph
mangoes
likes
John
5
revinc algorithm
7 Shift
Input buffer
madly
Stack
Slikes (NPNP)/NPfrom NPIndia
Dependency graph
mangoes
likes
John
5
revinc algorithm
8 Reduce Right
Input buffer
madly
Stack
Slikes NPNPfrom
Dependency graph
from
India
mangoes
likes
John
5
revinc algorithm
9 Right Reveal
Input buffer
madly
Stack
Slikes
Dependency graph
from
India
mangoes
likes
John
Slikes NPNPfrom
R>
S/NPlikes NPmangoes
<
NP
>
S
5
revinc algorithm
10 Shift
Input buffer
Stack
Slikes (SNP)(SNP)madly
Dependency graph
from
India
mangoes
likes
John
5
revinc algorithm
11 Left Reveal
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
Slikes (SNP)(SNP)madly
R<
NPJohn SNPlikes
<
SNP
<
S
5
revinc algorithm
Finish
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
5
revealing actions
1. Left Reveal (LRev)
Pop the top two nodes in the stack (left, right). Identify the left
node’s child with a subject dependency. Abstract over this child
node and split the category of left node into two categories.
Combine the nodes using CCG combinators accodingly.
Slikes (SNP)(SNP)madly
R<
NPJohn SNPlikes
<
SNP
<
S
6
revealing actions
2. Right Reveal (RRev)
Pop the top two nodes in the stack (left, right). Check the right
periphery of the left node in the dependency graph, extract all the
nodes with compatible CCG categories and identify all the possible
nodes that the right node can combine with. Abstract over this
node, split the category into two categories accordingly and
combine the nodes using CCG combinators.
Slikes NPNPfrom
R>
S/NPlikes NPmangoes
<
NP
>
S
7
data and settings
Data: CCGbank
∙ training: sections 02–21
∙ development: section 00
∙ testing: section 23
POS tagger:
C&C POS tagger
Supertagger:
C&C supertagger
8
features
∙ NonInc
Features based on the top four nodes in the stack and the next four words in the input buffer.
feature templates
1 S0wp, S0c, S0pc, S0wc, S1wp, S1c, S1pc, S1wc, S2pc, S2wc, S3pc, S3wc,
2 Q0wp, Q1wp, Q2wp, Q3wp,
3 S0Lpc, S0Lwc, S0Rpc, S0Rwc, S0Upc, S0Uwc, S1Lpc, S1Lwc, S1Rpc, S1Rwc, S1Upc,
S1Uwc,
4 S0wcS1wc, S0cS1w, S0wS1c, S0cS1c, S0wcQ0wp, S0cQ0wp, S0wcQ0p, S0cQ0p,
S1wcQ0wp, S1cQ0wp, S1wcQ0p, S1cQ0p,
5 S0wcS1cQ0p, S0cS1wcQ0p, S0cS1cQ0wp, S0cS1cQ0p, S0pS1pQ0p, S0wcQ0pQ1p,
S0cQ0wpQ1p, S0cQ0pQ1wp, S0cQ0pQ1p, S0pQ0pQ1p, S0wcS1cS2c, S0cS1wcS2c,
S0cS1cS2wc, S0cS1cS2, S0pS1pS2,
6 S0cS0HcS0Lc, S0cS0HcS0Rc, S1cS1HcS1Rc, S0cS0RcQ0p, S0cS0RcQ0w, S0cS0LcS1c,
S0cS0LcS1w, S0cS1cS1Rc, S0wS1cS1Rc,
∙ RevInc
↑ + B1c and B1cS0c, where B1 is the bottom most node in the right periphery.
9
measures of incrementality
Measures of incrementality:
∙ Connectedness
the average number of nodes in the stack before shifting
∙ Waiting time
the number of nodes that need to be shifted to the stack before a
dependency between any two nodes in the stack is resolved
Table 1: Connectedness and waiting time.
Algorithm Connectedness Waiting Time
NonInc 4.62 2.98
RevInc 2.15 0.69
10
results and analysis
Table 2: Performance on the development data1
.
Algorithm UP UR UF LP LR LF Cat Acc.
NonInc (beam=1) 92.57 82.60 87.30 85.12 75.96 80.28 91.10
RevInc (beam=1) 91.62 85.94 88.69 83.42 78.25 80.75 90.87
NonInc (beam=16) 92.71 89.66 91.16 85.78 82.96 84.35 92.51
Z&C (beam=16) - - - 87.15 82.95 85.00 92.77
∙ NonInc gets higher precision because it can use more context
while making a decision.
∙ RevInc achieves higher recall because information on nodes is
available even after they are reduced.
1‘U’ stands for unlabaled and ‘L’ stands for labeles. ‘P’, ‘R’ and ‘F’ are precision, recall
and F-score respectively.
11
results and analysis
Table 3: Label-wise F-score of RevInc and NonInc parsers.
Category RevInc NonInc
(NPNP)/NP 81.36 83.21
(NPNP)/NP 78.66 82.94
((SNP)(SNP))/NP 65.09 66.98
((SNP)(SNP))/NP 62.69 65.89
(S[dcl]NP)/NP 78.96 78.29
(S[dcl]NP)/NP 76.71 75.22
(SNP)(SNP) 80.49 76.90
∙ NonInc performs better in labels corresponding to PP due to the
availability of more context.
∙ RevInc has advantage in the case of verbal arguments and verbal
modifiers as the effect of “reveal” actions.
12
parsing speed
Parsing speed:
∙ NonInc parses 110 sentences/sec.
∙ RevInc parses 125 sentences/sec.
Significant amount of parsing time is spent on the feature extraction
step. But in RevInc, usually only two nodes have their feature
extracted because connectedness = 2.15, while all four nodes have to
be processed in NonInc (connectedness = 4.62).
Complex actions, LRev and RRev are rarely used (5%).
13
results and analysis
Table 4: Performance on the test data.
Algorithm UP UR UF LP LR LF Cat Acc.
NonInc (beam=1) 92.45 82.16 87.00 85.59 76.06 80.55 91.39
RevInc (beam=1) 91.83 86.35 89.00 84.02 79.00 81.43 91.17
NonInc (beam=16) 92.68 89.57 91.10 86.20 83.32 84.74 92.70
Z&C (beam=16) - - - 87.43 83.61 85.48 93.12
Hassan et al. 09 - - 86.31 - - - -
F-scores are improved compared to NonInc in both unlabaled and
labeled cases.
14
future plan
∙ Use information about lexical category probabilities (Auli and
Lopez (2011)).
∙ Explore the limited use of a beam to handle lexical ambiguity.
∙ Use a dynamic oracle strategy (Xu et al. (2014)).
∙ Apply the method to SMT and ASR.
15
conclusion
∙ They designed and implemented an incremental CCG parser by
introducing a technique called revealing.
∙ The parser got high scores in two measures of incrementality:
connectedness and waiting time.
∙ It performs better in parsing in the view of recall and F-score.
(labeled: +0.88%, unlabeld: +2.0%)
∙ It parses sentences faster.
16
references
Ambati, R. B., Deoskar, T., Johnson, M., and Steedman, M. (2015). An
incremental algorithm for transition-based CCG parsing. In
Proceedings of the 2015 Conference of NAACL, pages 53–63.
Auli, M. and Lopez, A. (2011). A comparison of loopy belief
propagation and dual decomposition for integrated CCG
supertagging and parsing. In Proceedings of the 49th Annual
Meeting of ACL, pages 470–480.
Pareschi, R. and Steedman, M. (1987). A lazy way to chart-parse with
categorial grammars. In Proceedings of the 25th Annual Meeting of
ACL.
Xu, W., Clark, S., and Zhang, Y. (2014). Shift-reduce CCG parsing with a
dependency model. In Proceedings of the 52nd Annual Meeting of
ACL, pages 218–227.
Zhang, Y. and Clark, S. (2011). Shift-reduce CCG parsing. In
Proceedings of the 49th Annual Meeting of ACL, pages 683–692.
17

More Related Content

Similar to An incremental algorithm for transition-based CCG parsing

Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
IJECEIAES
 
Concepts of Temporal CNN, Recurrent Neural Network, Attention
Concepts of Temporal CNN, Recurrent Neural Network, AttentionConcepts of Temporal CNN, Recurrent Neural Network, Attention
Concepts of Temporal CNN, Recurrent Neural Network, Attention
SaumyaMundra3
 
Adaptive modified backpropagation algorithm based on differential errors
Adaptive modified backpropagation algorithm based on differential errorsAdaptive modified backpropagation algorithm based on differential errors
Adaptive modified backpropagation algorithm based on differential errors
IJCSEA Journal
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
IRJET Journal
 
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm OptimizationReduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
ijeei-iaes
 
Performance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal ApproximationPerformance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal Approximation
iosrjce
 
Artificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part IArtificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part I
Ramez Abdalla, M.Sc
 
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
IRJET Journal
 
H017376369
H017376369H017376369
H017376369
IOSR Journals
 
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
iosrjce
 
L010628894
L010628894L010628894
L010628894
IOSR Journals
 
A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...
Aboul Ella Hassanien
 
DL for molecules
DL for moleculesDL for molecules
DL for molecules
Dai-Hai Nguyen
 
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docxEE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
gidmanmary
 
Gene's law
Gene's lawGene's law
Gene's law
Hoopeer Hoopeer
 
An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...
IJMIT JOURNAL
 
International Journal of Managing Information Technology (IJMIT)
International Journal of Managing Information Technology (IJMIT)International Journal of Managing Information Technology (IJMIT)
International Journal of Managing Information Technology (IJMIT)
IJMIT JOURNAL
 
An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...
IJMIT JOURNAL
 
Large Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate DescentLarge Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate Descent
Shaleen Kumar Gupta
 
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTIONA COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
IJCSEA Journal
 

Similar to An incremental algorithm for transition-based CCG parsing (20)

Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
 
Concepts of Temporal CNN, Recurrent Neural Network, Attention
Concepts of Temporal CNN, Recurrent Neural Network, AttentionConcepts of Temporal CNN, Recurrent Neural Network, Attention
Concepts of Temporal CNN, Recurrent Neural Network, Attention
 
Adaptive modified backpropagation algorithm based on differential errors
Adaptive modified backpropagation algorithm based on differential errorsAdaptive modified backpropagation algorithm based on differential errors
Adaptive modified backpropagation algorithm based on differential errors
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
 
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm OptimizationReduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
 
Performance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal ApproximationPerformance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal Approximation
 
Artificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part IArtificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part I
 
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
 
H017376369
H017376369H017376369
H017376369
 
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
 
L010628894
L010628894L010628894
L010628894
 
A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...
 
DL for molecules
DL for moleculesDL for molecules
DL for molecules
 
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docxEE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
 
Gene's law
Gene's lawGene's law
Gene's law
 
An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...
 
International Journal of Managing Information Technology (IJMIT)
International Journal of Managing Information Technology (IJMIT)International Journal of Managing Information Technology (IJMIT)
International Journal of Managing Information Technology (IJMIT)
 
An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...
 
Large Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate DescentLarge Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate Descent
 
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTIONA COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
 

More from Akira Miyazawa

LuaTeX-jaとbeamerで言語学関連のスライドを作る
LuaTeX-jaとbeamerで言語学関連のスライドを作るLuaTeX-jaとbeamerで言語学関連のスライドを作る
LuaTeX-jaとbeamerで言語学関連のスライドを作る
Akira Miyazawa
 
メタファーの自動生成に向けた客観的評価指標の検討
メタファーの自動生成に向けた客観的評価指標の検討メタファーの自動生成に向けた客観的評価指標の検討
メタファーの自動生成に向けた客観的評価指標の検討
Akira Miyazawa
 
PRML 第14章
PRML 第14章PRML 第14章
PRML 第14章
Akira Miyazawa
 
スペクトラル・クラスタリング
スペクトラル・クラスタリングスペクトラル・クラスタリング
スペクトラル・クラスタリング
Akira Miyazawa
 
最急降下法
最急降下法最急降下法
最急降下法
Akira Miyazawa
 
PRML 第4章
PRML 第4章PRML 第4章
PRML 第4章
Akira Miyazawa
 

More from Akira Miyazawa (6)

LuaTeX-jaとbeamerで言語学関連のスライドを作る
LuaTeX-jaとbeamerで言語学関連のスライドを作るLuaTeX-jaとbeamerで言語学関連のスライドを作る
LuaTeX-jaとbeamerで言語学関連のスライドを作る
 
メタファーの自動生成に向けた客観的評価指標の検討
メタファーの自動生成に向けた客観的評価指標の検討メタファーの自動生成に向けた客観的評価指標の検討
メタファーの自動生成に向けた客観的評価指標の検討
 
PRML 第14章
PRML 第14章PRML 第14章
PRML 第14章
 
スペクトラル・クラスタリング
スペクトラル・クラスタリングスペクトラル・クラスタリング
スペクトラル・クラスタリング
 
最急降下法
最急降下法最急降下法
最急降下法
 
PRML 第4章
PRML 第4章PRML 第4章
PRML 第4章
 

Recently uploaded

TIME TABLE MANAGEMENT SYSTEM testing.pptx
TIME TABLE MANAGEMENT SYSTEM testing.pptxTIME TABLE MANAGEMENT SYSTEM testing.pptx
TIME TABLE MANAGEMENT SYSTEM testing.pptx
CVCSOfficial
 
Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
Prakhyath Rai
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
um7474492
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
upoux
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
PreethaV16
 
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
nedcocy
 
SCALING OF MOS CIRCUITS m .pptx
SCALING OF MOS CIRCUITS m                 .pptxSCALING OF MOS CIRCUITS m                 .pptx
SCALING OF MOS CIRCUITS m .pptx
harshapolam10
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
ijaia
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
upoux
 
Applications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdfApplications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdf
Atif Razi
 
Height and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdfHeight and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdf
q30122000
 
Generative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdfGenerative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdf
mahaffeycheryld
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
PriyankaKilaniya
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
Kamal Acharya
 
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
ydzowc
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
Roger Rozario
 
Mechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineeringMechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineering
sachin chaurasia
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 

Recently uploaded (20)

TIME TABLE MANAGEMENT SYSTEM testing.pptx
TIME TABLE MANAGEMENT SYSTEM testing.pptxTIME TABLE MANAGEMENT SYSTEM testing.pptx
TIME TABLE MANAGEMENT SYSTEM testing.pptx
 
Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
 
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
 
SCALING OF MOS CIRCUITS m .pptx
SCALING OF MOS CIRCUITS m                 .pptxSCALING OF MOS CIRCUITS m                 .pptx
SCALING OF MOS CIRCUITS m .pptx
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
 
Applications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdfApplications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdf
 
Height and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdfHeight and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdf
 
Generative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdfGenerative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdf
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
 
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
 
Mechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineeringMechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineering
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 

An incremental algorithm for transition-based CCG parsing

  • 1. an incremental algorithm for transition-based ccg parsing B. R. Ambati, T. Deoskar, M. Johnson and M. Steedman Miyazawa Akira January 22, 2016 The Graduate University For Advanced Studies / National Institute of Informatics
  • 2. incrememtal parser What does “Incremental” mean? An incremental parser computes the relationship between words as soon as it receives them from the input. Why is incrementality important? ∙ Statistical Machine Translation ∙ Automatic Speech Recognition 1
  • 3. baseline algorithm Their baseline algorithm NonInc is based on Zhang and Clark (2011). It has four actions. ∙ Shift Push a word from the input buffer to the stack and assign it with a CCG category. ∙ Reduce Left Pop the top two nodes from the stack, combine them into a new node and push it back onto the stack with a new category. The right node is the head and the left node is reduced. ∙ Reduce Right This is similar to the action above except that the left node is the head and the right node is reduced. ∙ Unary Change the category of the top node on the stack. 2
  • 4. noninc algorithm 0 Input buffer John likes mangoes from India madly Stack Dependency graph 3
  • 5. noninc algorithm 1 Shift Input buffer likes mangoes from India madly Stack NPJohn Dependency graph 3
  • 6. noninc algorithm 2 Shift Input buffer mangoes from India madly Stack NPJohn (SNP)/NPlikes Dependency graph 3
  • 7. noninc algorithm 3 Shift Input buffer from India madly Stack NPJohn (SNP)/NPlikes NPmangoes Dependency graph 3
  • 8. noninc algorithm 4 Shift Input buffer India madly Stack NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom Dependency graph 3
  • 9. noninc algorithm 5 Shift Input buffer madly Stack NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom NPIndia Dependency graph 3
  • 10. noninc algorithm 6 Reduce Right Input buffer madly Stack NPJohn (SNP)/NPlikes NPmangoes NPNPfrom Dependency graph from India 3
  • 11. noninc algorithm 7 Reduce Right Input buffer madly Stack NPJohn (SNP)/NPlikes NPmangoes Dependency graph from India mangoes 3
  • 12. noninc algorithm 8 Reduce Right Input buffer madly Stack NPJohn SNPlikes Dependency graph from India mangoes likes 3
  • 13. noninc algorithm 9 Shift Input buffer Stack NPJohn SNPlikes (SNP)(SNP)madly Dependency graph from India mangoes likes 3
  • 14. noninc algorithm 10 Reduce Right Input buffer Stack NPJohn SNPlikes Dependency graph from India mangoes likes madly 3
  • 15. noninc algorithm 11 Reduce Left Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn 3
  • 16. noninc algorithm Finish Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn 3
  • 17. problem in noninc The algorithm above is not incremental. The dependncy graph starts to grow only after almost all the words are pushed to the stack. To solve this problem, they introduce a revealing technique (Pareschi and Steedman (1987)). 4
  • 18. revinc algorithm 0 Input buffer John likes mangoes from India madly Stack Dependency graph 5
  • 19. revinc algorithm 1 Shift Input buffer likes mangoes from India madly Stack NPJohn Dependency graph 5
  • 20. revinc algorithm 2 Shift Input buffer mangoes from India madly Stack NPJohn (SNP)/NPlikes Dependency graph 5
  • 21. revinc algorithm 3-1 Type-Raising Input buffer mangoes from India madly Stack S/(SNP)John (SNP)/NPlikes Dependency graph 5
  • 22. revinc algorithm 3-2 Reduce Left Input buffer mangoes from India madly Stack S/NPlikes Dependency graph likes John 5
  • 23. revinc algorithm 4 Shift Input buffer from India madly Stack S/NPlikes NPmangoes Dependency graph likes John 5
  • 24. revinc algorithm 5 Reduce Right Input buffer from India madly Stack Slikes Dependency graph mangoes likes John 5
  • 25. revinc algorithm 6 Shift Input buffer India madly Stack Slikes (NPNP)/NPfrom Dependency graph mangoes likes John 5
  • 26. revinc algorithm 7 Shift Input buffer madly Stack Slikes (NPNP)/NPfrom NPIndia Dependency graph mangoes likes John 5
  • 27. revinc algorithm 8 Reduce Right Input buffer madly Stack Slikes NPNPfrom Dependency graph from India mangoes likes John 5
  • 28. revinc algorithm 9 Right Reveal Input buffer madly Stack Slikes Dependency graph from India mangoes likes John Slikes NPNPfrom R> S/NPlikes NPmangoes < NP > S 5
  • 29. revinc algorithm 10 Shift Input buffer Stack Slikes (SNP)(SNP)madly Dependency graph from India mangoes likes John 5
  • 30. revinc algorithm 11 Left Reveal Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn Slikes (SNP)(SNP)madly R< NPJohn SNPlikes < SNP < S 5
  • 31. revinc algorithm Finish Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn 5
  • 32. revealing actions 1. Left Reveal (LRev) Pop the top two nodes in the stack (left, right). Identify the left node’s child with a subject dependency. Abstract over this child node and split the category of left node into two categories. Combine the nodes using CCG combinators accodingly. Slikes (SNP)(SNP)madly R< NPJohn SNPlikes < SNP < S 6
  • 33. revealing actions 2. Right Reveal (RRev) Pop the top two nodes in the stack (left, right). Check the right periphery of the left node in the dependency graph, extract all the nodes with compatible CCG categories and identify all the possible nodes that the right node can combine with. Abstract over this node, split the category into two categories accordingly and combine the nodes using CCG combinators. Slikes NPNPfrom R> S/NPlikes NPmangoes < NP > S 7
  • 34. data and settings Data: CCGbank ∙ training: sections 02–21 ∙ development: section 00 ∙ testing: section 23 POS tagger: C&C POS tagger Supertagger: C&C supertagger 8
  • 35. features ∙ NonInc Features based on the top four nodes in the stack and the next four words in the input buffer. feature templates 1 S0wp, S0c, S0pc, S0wc, S1wp, S1c, S1pc, S1wc, S2pc, S2wc, S3pc, S3wc, 2 Q0wp, Q1wp, Q2wp, Q3wp, 3 S0Lpc, S0Lwc, S0Rpc, S0Rwc, S0Upc, S0Uwc, S1Lpc, S1Lwc, S1Rpc, S1Rwc, S1Upc, S1Uwc, 4 S0wcS1wc, S0cS1w, S0wS1c, S0cS1c, S0wcQ0wp, S0cQ0wp, S0wcQ0p, S0cQ0p, S1wcQ0wp, S1cQ0wp, S1wcQ0p, S1cQ0p, 5 S0wcS1cQ0p, S0cS1wcQ0p, S0cS1cQ0wp, S0cS1cQ0p, S0pS1pQ0p, S0wcQ0pQ1p, S0cQ0wpQ1p, S0cQ0pQ1wp, S0cQ0pQ1p, S0pQ0pQ1p, S0wcS1cS2c, S0cS1wcS2c, S0cS1cS2wc, S0cS1cS2, S0pS1pS2, 6 S0cS0HcS0Lc, S0cS0HcS0Rc, S1cS1HcS1Rc, S0cS0RcQ0p, S0cS0RcQ0w, S0cS0LcS1c, S0cS0LcS1w, S0cS1cS1Rc, S0wS1cS1Rc, ∙ RevInc ↑ + B1c and B1cS0c, where B1 is the bottom most node in the right periphery. 9
  • 36. measures of incrementality Measures of incrementality: ∙ Connectedness the average number of nodes in the stack before shifting ∙ Waiting time the number of nodes that need to be shifted to the stack before a dependency between any two nodes in the stack is resolved Table 1: Connectedness and waiting time. Algorithm Connectedness Waiting Time NonInc 4.62 2.98 RevInc 2.15 0.69 10
  • 37. results and analysis Table 2: Performance on the development data1 . Algorithm UP UR UF LP LR LF Cat Acc. NonInc (beam=1) 92.57 82.60 87.30 85.12 75.96 80.28 91.10 RevInc (beam=1) 91.62 85.94 88.69 83.42 78.25 80.75 90.87 NonInc (beam=16) 92.71 89.66 91.16 85.78 82.96 84.35 92.51 Z&C (beam=16) - - - 87.15 82.95 85.00 92.77 ∙ NonInc gets higher precision because it can use more context while making a decision. ∙ RevInc achieves higher recall because information on nodes is available even after they are reduced. 1‘U’ stands for unlabaled and ‘L’ stands for labeles. ‘P’, ‘R’ and ‘F’ are precision, recall and F-score respectively. 11
  • 38. results and analysis Table 3: Label-wise F-score of RevInc and NonInc parsers. Category RevInc NonInc (NPNP)/NP 81.36 83.21 (NPNP)/NP 78.66 82.94 ((SNP)(SNP))/NP 65.09 66.98 ((SNP)(SNP))/NP 62.69 65.89 (S[dcl]NP)/NP 78.96 78.29 (S[dcl]NP)/NP 76.71 75.22 (SNP)(SNP) 80.49 76.90 ∙ NonInc performs better in labels corresponding to PP due to the availability of more context. ∙ RevInc has advantage in the case of verbal arguments and verbal modifiers as the effect of “reveal” actions. 12
  • 39. parsing speed Parsing speed: ∙ NonInc parses 110 sentences/sec. ∙ RevInc parses 125 sentences/sec. Significant amount of parsing time is spent on the feature extraction step. But in RevInc, usually only two nodes have their feature extracted because connectedness = 2.15, while all four nodes have to be processed in NonInc (connectedness = 4.62). Complex actions, LRev and RRev are rarely used (5%). 13
  • 40. results and analysis Table 4: Performance on the test data. Algorithm UP UR UF LP LR LF Cat Acc. NonInc (beam=1) 92.45 82.16 87.00 85.59 76.06 80.55 91.39 RevInc (beam=1) 91.83 86.35 89.00 84.02 79.00 81.43 91.17 NonInc (beam=16) 92.68 89.57 91.10 86.20 83.32 84.74 92.70 Z&C (beam=16) - - - 87.43 83.61 85.48 93.12 Hassan et al. 09 - - 86.31 - - - - F-scores are improved compared to NonInc in both unlabaled and labeled cases. 14
  • 41. future plan ∙ Use information about lexical category probabilities (Auli and Lopez (2011)). ∙ Explore the limited use of a beam to handle lexical ambiguity. ∙ Use a dynamic oracle strategy (Xu et al. (2014)). ∙ Apply the method to SMT and ASR. 15
  • 42. conclusion ∙ They designed and implemented an incremental CCG parser by introducing a technique called revealing. ∙ The parser got high scores in two measures of incrementality: connectedness and waiting time. ∙ It performs better in parsing in the view of recall and F-score. (labeled: +0.88%, unlabeld: +2.0%) ∙ It parses sentences faster. 16
  • 43. references Ambati, R. B., Deoskar, T., Johnson, M., and Steedman, M. (2015). An incremental algorithm for transition-based CCG parsing. In Proceedings of the 2015 Conference of NAACL, pages 53–63. Auli, M. and Lopez, A. (2011). A comparison of loopy belief propagation and dual decomposition for integrated CCG supertagging and parsing. In Proceedings of the 49th Annual Meeting of ACL, pages 470–480. Pareschi, R. and Steedman, M. (1987). A lazy way to chart-parse with categorial grammars. In Proceedings of the 25th Annual Meeting of ACL. Xu, W., Clark, S., and Zhang, Y. (2014). Shift-reduce CCG parsing with a dependency model. In Proceedings of the 52nd Annual Meeting of ACL, pages 218–227. Zhang, Y. and Clark, S. (2011). Shift-reduce CCG parsing. In Proceedings of the 49th Annual Meeting of ACL, pages 683–692. 17