SlideShare a Scribd company logo
1 of 43
Download to read offline
an incremental algorithm for
transition-based ccg parsing
B. R. Ambati, T. Deoskar, M. Johnson and M. Steedman
Miyazawa Akira
January 22, 2016
The Graduate University For Advanced Studies / National Institute of Informatics
incrememtal parser
What does “Incremental” mean?
An incremental parser computes the relationship between words
as soon as it receives them from the input.
Why is incrementality important?
∙ Statistical Machine Translation
∙ Automatic Speech Recognition
1
baseline algorithm
Their baseline algorithm NonInc is based on Zhang and Clark (2011).
It has four actions.
∙ Shift
Push a word from the input buffer to the stack and assign it with a
CCG category.
∙ Reduce Left
Pop the top two nodes from the stack, combine them into a new
node and push it back onto the stack with a new category. The
right node is the head and the left node is reduced.
∙ Reduce Right
This is similar to the action above except that the left node is the
head and the right node is reduced.
∙ Unary
Change the category of the top node on the stack.
2
noninc algorithm
0
Input buffer
John likes mangoes from India madly
Stack
Dependency graph
3
noninc algorithm
1 Shift
Input buffer
likes mangoes from India madly
Stack
NPJohn
Dependency graph
3
noninc algorithm
2 Shift
Input buffer
mangoes from India madly
Stack
NPJohn (SNP)/NPlikes
Dependency graph
3
noninc algorithm
3 Shift
Input buffer
from India madly
Stack
NPJohn (SNP)/NPlikes NPmangoes
Dependency graph
3
noninc algorithm
4 Shift
Input buffer
India madly
Stack
NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom
Dependency graph
3
noninc algorithm
5 Shift
Input buffer
madly
Stack
NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom NPIndia
Dependency graph
3
noninc algorithm
6 Reduce Right
Input buffer
madly
Stack
NPJohn (SNP)/NPlikes NPmangoes NPNPfrom
Dependency graph
from
India
3
noninc algorithm
7 Reduce Right
Input buffer
madly
Stack
NPJohn (SNP)/NPlikes NPmangoes
Dependency graph
from
India
mangoes
3
noninc algorithm
8 Reduce Right
Input buffer
madly
Stack
NPJohn SNPlikes
Dependency graph
from
India
mangoes
likes
3
noninc algorithm
9 Shift
Input buffer
Stack
NPJohn SNPlikes (SNP)(SNP)madly
Dependency graph
from
India
mangoes
likes
3
noninc algorithm
10 Reduce Right
Input buffer
Stack
NPJohn SNPlikes
Dependency graph
from
India
mangoes
likes
madly
3
noninc algorithm
11 Reduce Left
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
3
noninc algorithm
Finish
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
3
problem in noninc
The algorithm above is not incremental. The dependncy graph starts
to grow only after almost all the words are pushed to the stack.
To solve this problem, they introduce a revealing technique
(Pareschi and Steedman (1987)).
4
revinc algorithm
0
Input buffer
John likes mangoes from India madly
Stack
Dependency graph
5
revinc algorithm
1 Shift
Input buffer
likes mangoes from India madly
Stack
NPJohn
Dependency graph
5
revinc algorithm
2 Shift
Input buffer
mangoes from India madly
Stack
NPJohn (SNP)/NPlikes
Dependency graph
5
revinc algorithm
3-1 Type-Raising
Input buffer
mangoes from India madly
Stack
S/(SNP)John (SNP)/NPlikes
Dependency graph
5
revinc algorithm
3-2 Reduce Left
Input buffer
mangoes from India madly
Stack
S/NPlikes
Dependency graph
likes
John
5
revinc algorithm
4 Shift
Input buffer
from India madly
Stack
S/NPlikes NPmangoes
Dependency graph
likes
John
5
revinc algorithm
5 Reduce Right
Input buffer
from India madly
Stack
Slikes
Dependency graph
mangoes
likes
John
5
revinc algorithm
6 Shift
Input buffer
India madly
Stack
Slikes (NPNP)/NPfrom
Dependency graph
mangoes
likes
John
5
revinc algorithm
7 Shift
Input buffer
madly
Stack
Slikes (NPNP)/NPfrom NPIndia
Dependency graph
mangoes
likes
John
5
revinc algorithm
8 Reduce Right
Input buffer
madly
Stack
Slikes NPNPfrom
Dependency graph
from
India
mangoes
likes
John
5
revinc algorithm
9 Right Reveal
Input buffer
madly
Stack
Slikes
Dependency graph
from
India
mangoes
likes
John
Slikes NPNPfrom
R>
S/NPlikes NPmangoes
<
NP
>
S
5
revinc algorithm
10 Shift
Input buffer
Stack
Slikes (SNP)(SNP)madly
Dependency graph
from
India
mangoes
likes
John
5
revinc algorithm
11 Left Reveal
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
Slikes (SNP)(SNP)madly
R<
NPJohn SNPlikes
<
SNP
<
S
5
revinc algorithm
Finish
Input buffer
Stack
Slikes
Dependency graph
from
India
mangoes
likes
madlyJohn
5
revealing actions
1. Left Reveal (LRev)
Pop the top two nodes in the stack (left, right). Identify the left
node’s child with a subject dependency. Abstract over this child
node and split the category of left node into two categories.
Combine the nodes using CCG combinators accodingly.
Slikes (SNP)(SNP)madly
R<
NPJohn SNPlikes
<
SNP
<
S
6
revealing actions
2. Right Reveal (RRev)
Pop the top two nodes in the stack (left, right). Check the right
periphery of the left node in the dependency graph, extract all the
nodes with compatible CCG categories and identify all the possible
nodes that the right node can combine with. Abstract over this
node, split the category into two categories accordingly and
combine the nodes using CCG combinators.
Slikes NPNPfrom
R>
S/NPlikes NPmangoes
<
NP
>
S
7
data and settings
Data: CCGbank
∙ training: sections 02–21
∙ development: section 00
∙ testing: section 23
POS tagger:
C&C POS tagger
Supertagger:
C&C supertagger
8
features
∙ NonInc
Features based on the top four nodes in the stack and the next four words in the input buffer.
feature templates
1 S0wp, S0c, S0pc, S0wc, S1wp, S1c, S1pc, S1wc, S2pc, S2wc, S3pc, S3wc,
2 Q0wp, Q1wp, Q2wp, Q3wp,
3 S0Lpc, S0Lwc, S0Rpc, S0Rwc, S0Upc, S0Uwc, S1Lpc, S1Lwc, S1Rpc, S1Rwc, S1Upc,
S1Uwc,
4 S0wcS1wc, S0cS1w, S0wS1c, S0cS1c, S0wcQ0wp, S0cQ0wp, S0wcQ0p, S0cQ0p,
S1wcQ0wp, S1cQ0wp, S1wcQ0p, S1cQ0p,
5 S0wcS1cQ0p, S0cS1wcQ0p, S0cS1cQ0wp, S0cS1cQ0p, S0pS1pQ0p, S0wcQ0pQ1p,
S0cQ0wpQ1p, S0cQ0pQ1wp, S0cQ0pQ1p, S0pQ0pQ1p, S0wcS1cS2c, S0cS1wcS2c,
S0cS1cS2wc, S0cS1cS2, S0pS1pS2,
6 S0cS0HcS0Lc, S0cS0HcS0Rc, S1cS1HcS1Rc, S0cS0RcQ0p, S0cS0RcQ0w, S0cS0LcS1c,
S0cS0LcS1w, S0cS1cS1Rc, S0wS1cS1Rc,
∙ RevInc
↑ + B1c and B1cS0c, where B1 is the bottom most node in the right periphery.
9
measures of incrementality
Measures of incrementality:
∙ Connectedness
the average number of nodes in the stack before shifting
∙ Waiting time
the number of nodes that need to be shifted to the stack before a
dependency between any two nodes in the stack is resolved
Table 1: Connectedness and waiting time.
Algorithm Connectedness Waiting Time
NonInc 4.62 2.98
RevInc 2.15 0.69
10
results and analysis
Table 2: Performance on the development data1
.
Algorithm UP UR UF LP LR LF Cat Acc.
NonInc (beam=1) 92.57 82.60 87.30 85.12 75.96 80.28 91.10
RevInc (beam=1) 91.62 85.94 88.69 83.42 78.25 80.75 90.87
NonInc (beam=16) 92.71 89.66 91.16 85.78 82.96 84.35 92.51
Z&C (beam=16) - - - 87.15 82.95 85.00 92.77
∙ NonInc gets higher precision because it can use more context
while making a decision.
∙ RevInc achieves higher recall because information on nodes is
available even after they are reduced.
1‘U’ stands for unlabaled and ‘L’ stands for labeles. ‘P’, ‘R’ and ‘F’ are precision, recall
and F-score respectively.
11
results and analysis
Table 3: Label-wise F-score of RevInc and NonInc parsers.
Category RevInc NonInc
(NPNP)/NP 81.36 83.21
(NPNP)/NP 78.66 82.94
((SNP)(SNP))/NP 65.09 66.98
((SNP)(SNP))/NP 62.69 65.89
(S[dcl]NP)/NP 78.96 78.29
(S[dcl]NP)/NP 76.71 75.22
(SNP)(SNP) 80.49 76.90
∙ NonInc performs better in labels corresponding to PP due to the
availability of more context.
∙ RevInc has advantage in the case of verbal arguments and verbal
modifiers as the effect of “reveal” actions.
12
parsing speed
Parsing speed:
∙ NonInc parses 110 sentences/sec.
∙ RevInc parses 125 sentences/sec.
Significant amount of parsing time is spent on the feature extraction
step. But in RevInc, usually only two nodes have their feature
extracted because connectedness = 2.15, while all four nodes have to
be processed in NonInc (connectedness = 4.62).
Complex actions, LRev and RRev are rarely used (5%).
13
results and analysis
Table 4: Performance on the test data.
Algorithm UP UR UF LP LR LF Cat Acc.
NonInc (beam=1) 92.45 82.16 87.00 85.59 76.06 80.55 91.39
RevInc (beam=1) 91.83 86.35 89.00 84.02 79.00 81.43 91.17
NonInc (beam=16) 92.68 89.57 91.10 86.20 83.32 84.74 92.70
Z&C (beam=16) - - - 87.43 83.61 85.48 93.12
Hassan et al. 09 - - 86.31 - - - -
F-scores are improved compared to NonInc in both unlabaled and
labeled cases.
14
future plan
∙ Use information about lexical category probabilities (Auli and
Lopez (2011)).
∙ Explore the limited use of a beam to handle lexical ambiguity.
∙ Use a dynamic oracle strategy (Xu et al. (2014)).
∙ Apply the method to SMT and ASR.
15
conclusion
∙ They designed and implemented an incremental CCG parser by
introducing a technique called revealing.
∙ The parser got high scores in two measures of incrementality:
connectedness and waiting time.
∙ It performs better in parsing in the view of recall and F-score.
(labeled: +0.88%, unlabeld: +2.0%)
∙ It parses sentences faster.
16
references
Ambati, R. B., Deoskar, T., Johnson, M., and Steedman, M. (2015). An
incremental algorithm for transition-based CCG parsing. In
Proceedings of the 2015 Conference of NAACL, pages 53–63.
Auli, M. and Lopez, A. (2011). A comparison of loopy belief
propagation and dual decomposition for integrated CCG
supertagging and parsing. In Proceedings of the 49th Annual
Meeting of ACL, pages 470–480.
Pareschi, R. and Steedman, M. (1987). A lazy way to chart-parse with
categorial grammars. In Proceedings of the 25th Annual Meeting of
ACL.
Xu, W., Clark, S., and Zhang, Y. (2014). Shift-reduce CCG parsing with a
dependency model. In Proceedings of the 52nd Annual Meeting of
ACL, pages 218–227.
Zhang, Y. and Clark, S. (2011). Shift-reduce CCG parsing. In
Proceedings of the 49th Annual Meeting of ACL, pages 683–692.
17

More Related Content

Similar to An incremental algorithm for transition-based CCG parsing

EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docxEE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
gidmanmary
 

Similar to An incremental algorithm for transition-based CCG parsing (20)

Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
Diagnosis of Faulty Sensors in Antenna Array using Hybrid Differential Evolut...
 
Concepts of Temporal CNN, Recurrent Neural Network, Attention
Concepts of Temporal CNN, Recurrent Neural Network, AttentionConcepts of Temporal CNN, Recurrent Neural Network, Attention
Concepts of Temporal CNN, Recurrent Neural Network, Attention
 
Adaptive modified backpropagation algorithm based on differential errors
Adaptive modified backpropagation algorithm based on differential errorsAdaptive modified backpropagation algorithm based on differential errors
Adaptive modified backpropagation algorithm based on differential errors
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
 
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm OptimizationReduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
 
Performance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal ApproximationPerformance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal Approximation
 
Artificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part IArtificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part I
 
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
 
H017376369
H017376369H017376369
H017376369
 
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...
 
L010628894
L010628894L010628894
L010628894
 
A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...A hybrid sine cosine optimization algorithm for solving global optimization p...
A hybrid sine cosine optimization algorithm for solving global optimization p...
 
DL for molecules
DL for moleculesDL for molecules
DL for molecules
 
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docxEE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
EE 567ProjectDue Tuesday, December 3, 2019 at 640 p.m..docx
 
Gene's law
Gene's lawGene's law
Gene's law
 
An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...
 
International Journal of Managing Information Technology (IJMIT)
International Journal of Managing Information Technology (IJMIT)International Journal of Managing Information Technology (IJMIT)
International Journal of Managing Information Technology (IJMIT)
 
An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...An improved spfa algorithm for single source shortest path problem using forw...
An improved spfa algorithm for single source shortest path problem using forw...
 
Large Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate DescentLarge Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate Descent
 
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTIONA COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
 

More from Akira Miyazawa (6)

LuaTeX-jaとbeamerで言語学関連のスライドを作る
LuaTeX-jaとbeamerで言語学関連のスライドを作るLuaTeX-jaとbeamerで言語学関連のスライドを作る
LuaTeX-jaとbeamerで言語学関連のスライドを作る
 
メタファーの自動生成に向けた客観的評価指標の検討
メタファーの自動生成に向けた客観的評価指標の検討メタファーの自動生成に向けた客観的評価指標の検討
メタファーの自動生成に向けた客観的評価指標の検討
 
PRML 第14章
PRML 第14章PRML 第14章
PRML 第14章
 
スペクトラル・クラスタリング
スペクトラル・クラスタリングスペクトラル・クラスタリング
スペクトラル・クラスタリング
 
最急降下法
最急降下法最急降下法
最急降下法
 
PRML 第4章
PRML 第4章PRML 第4章
PRML 第4章
 

Recently uploaded

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Dr.Costas Sachpazis
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 

Recently uploaded (20)

Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 

An incremental algorithm for transition-based CCG parsing

  • 1. an incremental algorithm for transition-based ccg parsing B. R. Ambati, T. Deoskar, M. Johnson and M. Steedman Miyazawa Akira January 22, 2016 The Graduate University For Advanced Studies / National Institute of Informatics
  • 2. incrememtal parser What does “Incremental” mean? An incremental parser computes the relationship between words as soon as it receives them from the input. Why is incrementality important? ∙ Statistical Machine Translation ∙ Automatic Speech Recognition 1
  • 3. baseline algorithm Their baseline algorithm NonInc is based on Zhang and Clark (2011). It has four actions. ∙ Shift Push a word from the input buffer to the stack and assign it with a CCG category. ∙ Reduce Left Pop the top two nodes from the stack, combine them into a new node and push it back onto the stack with a new category. The right node is the head and the left node is reduced. ∙ Reduce Right This is similar to the action above except that the left node is the head and the right node is reduced. ∙ Unary Change the category of the top node on the stack. 2
  • 4. noninc algorithm 0 Input buffer John likes mangoes from India madly Stack Dependency graph 3
  • 5. noninc algorithm 1 Shift Input buffer likes mangoes from India madly Stack NPJohn Dependency graph 3
  • 6. noninc algorithm 2 Shift Input buffer mangoes from India madly Stack NPJohn (SNP)/NPlikes Dependency graph 3
  • 7. noninc algorithm 3 Shift Input buffer from India madly Stack NPJohn (SNP)/NPlikes NPmangoes Dependency graph 3
  • 8. noninc algorithm 4 Shift Input buffer India madly Stack NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom Dependency graph 3
  • 9. noninc algorithm 5 Shift Input buffer madly Stack NPJohn (SNP)/NPlikes NPmangoes (NPNP)/NPfrom NPIndia Dependency graph 3
  • 10. noninc algorithm 6 Reduce Right Input buffer madly Stack NPJohn (SNP)/NPlikes NPmangoes NPNPfrom Dependency graph from India 3
  • 11. noninc algorithm 7 Reduce Right Input buffer madly Stack NPJohn (SNP)/NPlikes NPmangoes Dependency graph from India mangoes 3
  • 12. noninc algorithm 8 Reduce Right Input buffer madly Stack NPJohn SNPlikes Dependency graph from India mangoes likes 3
  • 13. noninc algorithm 9 Shift Input buffer Stack NPJohn SNPlikes (SNP)(SNP)madly Dependency graph from India mangoes likes 3
  • 14. noninc algorithm 10 Reduce Right Input buffer Stack NPJohn SNPlikes Dependency graph from India mangoes likes madly 3
  • 15. noninc algorithm 11 Reduce Left Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn 3
  • 16. noninc algorithm Finish Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn 3
  • 17. problem in noninc The algorithm above is not incremental. The dependncy graph starts to grow only after almost all the words are pushed to the stack. To solve this problem, they introduce a revealing technique (Pareschi and Steedman (1987)). 4
  • 18. revinc algorithm 0 Input buffer John likes mangoes from India madly Stack Dependency graph 5
  • 19. revinc algorithm 1 Shift Input buffer likes mangoes from India madly Stack NPJohn Dependency graph 5
  • 20. revinc algorithm 2 Shift Input buffer mangoes from India madly Stack NPJohn (SNP)/NPlikes Dependency graph 5
  • 21. revinc algorithm 3-1 Type-Raising Input buffer mangoes from India madly Stack S/(SNP)John (SNP)/NPlikes Dependency graph 5
  • 22. revinc algorithm 3-2 Reduce Left Input buffer mangoes from India madly Stack S/NPlikes Dependency graph likes John 5
  • 23. revinc algorithm 4 Shift Input buffer from India madly Stack S/NPlikes NPmangoes Dependency graph likes John 5
  • 24. revinc algorithm 5 Reduce Right Input buffer from India madly Stack Slikes Dependency graph mangoes likes John 5
  • 25. revinc algorithm 6 Shift Input buffer India madly Stack Slikes (NPNP)/NPfrom Dependency graph mangoes likes John 5
  • 26. revinc algorithm 7 Shift Input buffer madly Stack Slikes (NPNP)/NPfrom NPIndia Dependency graph mangoes likes John 5
  • 27. revinc algorithm 8 Reduce Right Input buffer madly Stack Slikes NPNPfrom Dependency graph from India mangoes likes John 5
  • 28. revinc algorithm 9 Right Reveal Input buffer madly Stack Slikes Dependency graph from India mangoes likes John Slikes NPNPfrom R> S/NPlikes NPmangoes < NP > S 5
  • 29. revinc algorithm 10 Shift Input buffer Stack Slikes (SNP)(SNP)madly Dependency graph from India mangoes likes John 5
  • 30. revinc algorithm 11 Left Reveal Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn Slikes (SNP)(SNP)madly R< NPJohn SNPlikes < SNP < S 5
  • 31. revinc algorithm Finish Input buffer Stack Slikes Dependency graph from India mangoes likes madlyJohn 5
  • 32. revealing actions 1. Left Reveal (LRev) Pop the top two nodes in the stack (left, right). Identify the left node’s child with a subject dependency. Abstract over this child node and split the category of left node into two categories. Combine the nodes using CCG combinators accodingly. Slikes (SNP)(SNP)madly R< NPJohn SNPlikes < SNP < S 6
  • 33. revealing actions 2. Right Reveal (RRev) Pop the top two nodes in the stack (left, right). Check the right periphery of the left node in the dependency graph, extract all the nodes with compatible CCG categories and identify all the possible nodes that the right node can combine with. Abstract over this node, split the category into two categories accordingly and combine the nodes using CCG combinators. Slikes NPNPfrom R> S/NPlikes NPmangoes < NP > S 7
  • 34. data and settings Data: CCGbank ∙ training: sections 02–21 ∙ development: section 00 ∙ testing: section 23 POS tagger: C&C POS tagger Supertagger: C&C supertagger 8
  • 35. features ∙ NonInc Features based on the top four nodes in the stack and the next four words in the input buffer. feature templates 1 S0wp, S0c, S0pc, S0wc, S1wp, S1c, S1pc, S1wc, S2pc, S2wc, S3pc, S3wc, 2 Q0wp, Q1wp, Q2wp, Q3wp, 3 S0Lpc, S0Lwc, S0Rpc, S0Rwc, S0Upc, S0Uwc, S1Lpc, S1Lwc, S1Rpc, S1Rwc, S1Upc, S1Uwc, 4 S0wcS1wc, S0cS1w, S0wS1c, S0cS1c, S0wcQ0wp, S0cQ0wp, S0wcQ0p, S0cQ0p, S1wcQ0wp, S1cQ0wp, S1wcQ0p, S1cQ0p, 5 S0wcS1cQ0p, S0cS1wcQ0p, S0cS1cQ0wp, S0cS1cQ0p, S0pS1pQ0p, S0wcQ0pQ1p, S0cQ0wpQ1p, S0cQ0pQ1wp, S0cQ0pQ1p, S0pQ0pQ1p, S0wcS1cS2c, S0cS1wcS2c, S0cS1cS2wc, S0cS1cS2, S0pS1pS2, 6 S0cS0HcS0Lc, S0cS0HcS0Rc, S1cS1HcS1Rc, S0cS0RcQ0p, S0cS0RcQ0w, S0cS0LcS1c, S0cS0LcS1w, S0cS1cS1Rc, S0wS1cS1Rc, ∙ RevInc ↑ + B1c and B1cS0c, where B1 is the bottom most node in the right periphery. 9
  • 36. measures of incrementality Measures of incrementality: ∙ Connectedness the average number of nodes in the stack before shifting ∙ Waiting time the number of nodes that need to be shifted to the stack before a dependency between any two nodes in the stack is resolved Table 1: Connectedness and waiting time. Algorithm Connectedness Waiting Time NonInc 4.62 2.98 RevInc 2.15 0.69 10
  • 37. results and analysis Table 2: Performance on the development data1 . Algorithm UP UR UF LP LR LF Cat Acc. NonInc (beam=1) 92.57 82.60 87.30 85.12 75.96 80.28 91.10 RevInc (beam=1) 91.62 85.94 88.69 83.42 78.25 80.75 90.87 NonInc (beam=16) 92.71 89.66 91.16 85.78 82.96 84.35 92.51 Z&C (beam=16) - - - 87.15 82.95 85.00 92.77 ∙ NonInc gets higher precision because it can use more context while making a decision. ∙ RevInc achieves higher recall because information on nodes is available even after they are reduced. 1‘U’ stands for unlabaled and ‘L’ stands for labeles. ‘P’, ‘R’ and ‘F’ are precision, recall and F-score respectively. 11
  • 38. results and analysis Table 3: Label-wise F-score of RevInc and NonInc parsers. Category RevInc NonInc (NPNP)/NP 81.36 83.21 (NPNP)/NP 78.66 82.94 ((SNP)(SNP))/NP 65.09 66.98 ((SNP)(SNP))/NP 62.69 65.89 (S[dcl]NP)/NP 78.96 78.29 (S[dcl]NP)/NP 76.71 75.22 (SNP)(SNP) 80.49 76.90 ∙ NonInc performs better in labels corresponding to PP due to the availability of more context. ∙ RevInc has advantage in the case of verbal arguments and verbal modifiers as the effect of “reveal” actions. 12
  • 39. parsing speed Parsing speed: ∙ NonInc parses 110 sentences/sec. ∙ RevInc parses 125 sentences/sec. Significant amount of parsing time is spent on the feature extraction step. But in RevInc, usually only two nodes have their feature extracted because connectedness = 2.15, while all four nodes have to be processed in NonInc (connectedness = 4.62). Complex actions, LRev and RRev are rarely used (5%). 13
  • 40. results and analysis Table 4: Performance on the test data. Algorithm UP UR UF LP LR LF Cat Acc. NonInc (beam=1) 92.45 82.16 87.00 85.59 76.06 80.55 91.39 RevInc (beam=1) 91.83 86.35 89.00 84.02 79.00 81.43 91.17 NonInc (beam=16) 92.68 89.57 91.10 86.20 83.32 84.74 92.70 Z&C (beam=16) - - - 87.43 83.61 85.48 93.12 Hassan et al. 09 - - 86.31 - - - - F-scores are improved compared to NonInc in both unlabaled and labeled cases. 14
  • 41. future plan ∙ Use information about lexical category probabilities (Auli and Lopez (2011)). ∙ Explore the limited use of a beam to handle lexical ambiguity. ∙ Use a dynamic oracle strategy (Xu et al. (2014)). ∙ Apply the method to SMT and ASR. 15
  • 42. conclusion ∙ They designed and implemented an incremental CCG parser by introducing a technique called revealing. ∙ The parser got high scores in two measures of incrementality: connectedness and waiting time. ∙ It performs better in parsing in the view of recall and F-score. (labeled: +0.88%, unlabeld: +2.0%) ∙ It parses sentences faster. 16
  • 43. references Ambati, R. B., Deoskar, T., Johnson, M., and Steedman, M. (2015). An incremental algorithm for transition-based CCG parsing. In Proceedings of the 2015 Conference of NAACL, pages 53–63. Auli, M. and Lopez, A. (2011). A comparison of loopy belief propagation and dual decomposition for integrated CCG supertagging and parsing. In Proceedings of the 49th Annual Meeting of ACL, pages 470–480. Pareschi, R. and Steedman, M. (1987). A lazy way to chart-parse with categorial grammars. In Proceedings of the 25th Annual Meeting of ACL. Xu, W., Clark, S., and Zhang, Y. (2014). Shift-reduce CCG parsing with a dependency model. In Proceedings of the 52nd Annual Meeting of ACL, pages 218–227. Zhang, Y. and Clark, S. (2011). Shift-reduce CCG parsing. In Proceedings of the 49th Annual Meeting of ACL, pages 683–692. 17