0% found this document useful (0 votes)

50 views

Dependency Parsing 2: CMSC 723 / LING 723 / INST 725

- Transition-based dependency parsing uses a stack and buffer to build dependency trees incrementally through transition operations like shift and reduce. It can be framed as a structured prediction problem. - Two main approaches are transition-based parsing, which predicts transitions, and graph-based parsing, which scores dependency graphs directly. - Transition-based parsers use a classifier to predict the next transition as an oracle. They are trained on examples generated by simulating gold trees. Features typically represent the top of the stack and buffer.

Uploaded by

Ritik Bh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views

Dependency Parsing 2: CMSC 723 / LING 723 / INST 725

Uploaded by

Ritik Bh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

Dependency

Parsing 2
CMSC 723 / LING 723 / INST 725

Marine Carpuat

Fig credits: Joakim Nivre, Dan

Jurafsky & James Martin
Dependency Parsing
• Formalizing dependency trees

• Transition-based dependency parsing

• Shift-reduce parsing
• Transition system
• Oracle
• Learning/predicting parsing actions
Data-driven dependency parsing
Goal: learn a good predictor of dependency graphs
Input: sentence
Output: dependency graph/tree G = (V,A)

Can be framed as a structured prediction task

- very large output space
- with interdependent labels

2 dominant approaches: transition-based parsing and graph-based

parsing
Transition-based dependency parsing
• Builds on shift-reduce parsing
[Aho & Ullman, 1927]

• Configuration
• Stack
• Input buffer of words
• Set of dependency relations

• Goal of parsing
• find a final configuration where
• all words accounted for
• Relations form dependency tree
Transition operators
• Transitions: produce a new • Start state
configuration given current • Stack initialized with ROOT node
configuration • Input buffer initialized with words
in sentence
• Dependency relation set = empty
• Parsing is the task of
• Finding a sequence of transitions
• End state
• That leads from start state to
desired goal state • Stack and word lists are empty
• Set of dependency relations = final
parse
Arc Standard Transition System
• Defines 3 transition operators [Covington, 2001; Nivre 2003]
• LEFT-ARC:
• create head-dependent rel. between word at top of stack and 2nd word
(under top)
• remove 2nd word from stack
• RIGHT-ARC:
• Create head-dependent rel. between word on 2nd word on stack and word on
top
• Remove word at top of stack
• SHIFT
• Remove word at head of input buffer
• Push it on the stack
Arc standard transition systems
• Preconditions
• ROOT cannot have incoming arcs
• LEFT-ARC cannot be applied when ROOT is the 2nd element in stack
• LEFT-ARC and RIGHT-ARC require 2 elements in stack to be applied
Transition-based Dependency Parser
• Assume an oracle

• Parsing complexity
• Linear in sentence
length!

• Greedy algorithm
• Unlike Viterbi for POS
tagging
Transition-Based Parsing Illustrated
Where to we get an oracle?
• Multiclass classification problem
• Input: current parsing state (e.g., current and previous configurations)
• Output: one transition among all possible transitions
• Q: size of output space?

• Supervised classifiers can be used

• E.g., perceptron
• Open questions
• What are good features for this task?
• Where do we get training examples?
Generating Training Examples
• What we have in a treebank • What we need to train an oracle
• Pairs of configurations and
predicted parsing action
Generating training examples
• Approach: simulate parsing to generate reference tree

• Given
• A current config with stack S, dependency relations Rc
• A reference parse (V,Rp)
• Do
Let’s try it out
Features
• Configuration consist of stack, buffer, current set of relations

• Typical features
• Features focus on top level of stack
• Use word forms, POS, and their location in stack and buffer
Features example
• Given configuration • Example of useful features
Features example
Research highlight:
Dependency parsing with stack-LSTMs
• From Dyer et al. 2015: http://www.aclweb.org/anthology/P15-1033

• Idea
• Instead of hand-crafted feature
• Predict next transition using recurrent neural networks to learn
representation of stack, buffer, sequence of transitions
Research highlight:
Dependency parsing with stack-LSTMs
Research highlight:
Dependency parsing with stack-LSTMs
Alternate Transition Systems
Note: A different way of writing arc-standard
transition system
A weakness of arc-standard parsing

Right dependents cannot be attached to their head

until all their dependents have been attached
Arc Eager Parsing
• LEFT-ARC:
• Create head-dependent rel. between word at front of buffer and word at top of
stack
• pop the stack
• RIGHT-ARC:
• Create head-dependent rel. between word on top of stack and word at front of
buffer
• Shift buffer head to stack
• SHIFT
• Remove word at head of input buffer
• Push it on the stack
• REDUCE
• Pop the stack
Arc Eager Parsing Example
Trees & Forests
• A dependency forest (here) is a dependency graph satisfying
• Root
• Single-Head
• Acyclicity
• but not Connectedness
Properties of this transition-based
parsing algorithm

- Correctness
- For every complete transition sequence, the resulting graph is a projective
dependency forest (soundness)
- For every projective dependency forest G, there is a transition sequence that
generates G (completeness)

- Trick: forest can be turned into tree by adding links to ROOT0

Dealing with
non-projectivity
Projectivity
• Arc from head to dependent is projective
• If there is a path from head to every word between head and
dependent

• Dependency tree is projective

• If all arcs are projective
• Or equivalently, if it can be drawn with no crossing edges

• Projective trees make computation easier

• But most theoretical frameworks do not assume projectivity
• Need to capture long-distance dependencies, free word order
Arc-standard parsing can’t produce non-
projective trees
How frequent are non-projective structures?
• Statistics from CoNLL shared task
• NPD = non projective dependencies
• NPS = non projective sentences
How to deal with non-projectivity?
(1) change the transition system

• Add new transitions

• That apply to 2nd word of the stack
• Top word of stack is treated as context

[Attardi 2006]
How to deal with non-projectivity?
(2) pseudo-projective parsing
Solution:
• “projectivize” a non-projective tree by creating
new projective arcs
• That can be transformed back into non-projective
arcs in a post-processing step
How to deal with non-projectivity?
(2) pseudo-projective parsing
Solution:
• “projectivize” a non-projective tree by creating
new projective arcs
• That can be transformed back into non-projective
arcs in a post-processing step
Graph-based parsing
Graph concepts refresher
Directed Spanning Trees
Maximum Spanning Tree
• Assume we have an arc factored model
i.e. weight of graph can be factored as sum or product of weights of its arcs

• Chu-Liu-Edmonds algorithm can find the maximum spanning tree for

us!
• Greedy recursive algorithm
• Naïve implementation: O(n^3)
Chu-Liu-Edmonds illustrated
Chu-Liu-Edmonds illustrated
Chu-Liu-Edmonds illustrated
Chu-Liu-Edmonds illustrated
Chu-Liu-Edmonds illustrated
Arc weights as linear classifiers
Example of classifier features
How to score a graph G
using features?
Arc-factored model By definition of arc weights
assumption as linear classifiers
How can we learn
the classifier from data?
Dependency Parsing: what you should know
• Formalizing dependency trees

• Transition-based dependency parsing

• Shift-reduce parsing
• Transition system: arc standard, arc eager
• Oracle
• Learning/predicting parsing actions

• Graph-based dependency parsing

• A flexible framework that allows many extensions

• RNNs vs feature engineering, non-projectivity
Extension: dynamic oracle
Problem with standard classifier-based oracle:
- It is “static”
- ie tied to optimal config sequence that produces gold tree
- What if there are multiple sequences for a single gold tree?
- How can we recover if the parser deviates from gold sequence?

One solution: “dynamic oracle” [Goldberg & Nivre 2012]

See also Locally Optimal Learning to Search [Chang et al. ICML 2015]
Extension: dynamic oracle
Problem with standard

See [Goldberg & Nivre 2012] for details

17-Transition Based Dependency Parsing-13-09-2024
No ratings yet
17-Transition Based Dependency Parsing-13-09-2024
25 pages
18-Graph Based Dependency Parsing-19-09-2024
No ratings yet
18-Graph Based Dependency Parsing-19-09-2024
19 pages
Dependency Parsing: Pawan Goyal
No ratings yet
Dependency Parsing: Pawan Goyal
38 pages
A Fast and Accurate Dependency Parser Using Neural Networks
No ratings yet
A Fast and Accurate Dependency Parser Using Neural Networks
11 pages
Dependency Parsing
No ratings yet
Dependency Parsing
96 pages
cs224n 2019 Notes04 Dependencyparsing
No ratings yet
cs224n 2019 Notes04 Dependencyparsing
5 pages
CS224n: Natural Language Processing With Deep Learning: Lecture Notes: Part IV Dependency Parsing Winter 2019
No ratings yet
CS224n: Natural Language Processing With Deep Learning: Lecture Notes: Part IV Dependency Parsing Winter 2019
5 pages
Dependency Parsing
No ratings yet
Dependency Parsing
21 pages
Dependency Parsing Ppt
No ratings yet
Dependency Parsing Ppt
34 pages
6752-NLP
No ratings yet
6752-NLP
14 pages
Parsing Dependency
No ratings yet
Parsing Dependency
26 pages
Dependency Parsing And Algorithms With Images
No ratings yet
Dependency Parsing And Algorithms With Images
13 pages
Lecture 08
No ratings yet
Lecture 08
69 pages
Imitation Learning: Modeling & Learning Sequence of Decisions
No ratings yet
Imitation Learning: Modeling & Learning Sequence of Decisions
53 pages
Lecture08 Dependency Parsing
No ratings yet
Lecture08 Dependency Parsing
70 pages
Dependency Parsing - Part II: Pawan Goyal
No ratings yet
Dependency Parsing - Part II: Pawan Goyal
56 pages
2407.17406v1
No ratings yet
2407.17406v1
14 pages
dependency grammar
No ratings yet
dependency grammar
10 pages
Dependency parsing
No ratings yet
Dependency parsing
32 pages
Mcdonald 06
No ratings yet
Mcdonald 06
8 pages
Deep Biaffine Attention For Neural Dependency Parsing
No ratings yet
Deep Biaffine Attention For Neural Dependency Parsing
8 pages
p742 Goldberg 2
No ratings yet
p742 Goldberg 2
9 pages
AMR Parsing As Sequence-to-Graph Transduction
No ratings yet
AMR Parsing As Sequence-to-Graph Transduction
15 pages
NLP Unit 2
No ratings yet
NLP Unit 2
20 pages
Machine 22
No ratings yet
Machine 22
5 pages
What Is Parsing
No ratings yet
What Is Parsing
47 pages
NPTEL NLP Assignment 6
No ratings yet
NPTEL NLP Assignment 6
5 pages
Dependency Parsing
100% (11)
Dependency Parsing
127 pages
Kiperwasser 16
No ratings yet
Kiperwasser 16
16 pages
Natural Language Processing With Deep Learning CS224N/Ling284
No ratings yet
Natural Language Processing With Deep Learning CS224N/Ling284
45 pages
Natural Language Processing With Deep Learning CS224N/Ling284
No ratings yet
Natural Language Processing With Deep Learning CS224N/Ling284
47 pages
Assignment 6 (COPY)
No ratings yet
Assignment 6 (COPY)
6 pages
A Survey of Unsupervised Dependency Parsing
No ratings yet
A Survey of Unsupervised Dependency Parsing
12 pages
cs224n 2023 Lecture04 Dep Parsing
No ratings yet
cs224n 2023 Lecture04 Dep Parsing
45 pages
Online Large-Margin Training of Dependency Parsers: Ryan Mcdonald Koby Crammer Fernando Pereira
No ratings yet
Online Large-Margin Training of Dependency Parsers: Ryan Mcdonald Koby Crammer Fernando Pereira
8 pages
Efficient Third-Order Dependency Parsers Terry Koo and Michael Collins
No ratings yet
Efficient Third-Order Dependency Parsers Terry Koo and Michael Collins
11 pages
Week 6
No ratings yet
Week 6
78 pages
Syntax Directed Translation
No ratings yet
Syntax Directed Translation
23 pages
Unit 4 Syntax-Directed Translation & Intermediate Code Generation
No ratings yet
Unit 4 Syntax-Directed Translation & Intermediate Code Generation
47 pages
DP Final
No ratings yet
DP Final
10 pages
A3 Handout
No ratings yet
A3 Handout
8 pages
Sanskrit Dependency Parsing
No ratings yet
Sanskrit Dependency Parsing
20 pages
2021.tacl-1.8
No ratings yet
2021.tacl-1.8
19 pages
collobert11a
No ratings yet
collobert11a
9 pages
Parsing
No ratings yet
Parsing
10 pages
Lecture 7 - Parser Evaluation, Lexicalized PCFG, Dependency Parsing Introduction - Evaluasi Parser Dan Dependency Parsing
100% (1)
Lecture 7 - Parser Evaluation, Lexicalized PCFG, Dependency Parsing Introduction - Evaluasi Parser Dan Dependency Parsing
27 pages
CD Merged
No ratings yet
CD Merged
153 pages
Syntactic Analysis
No ratings yet
Syntactic Analysis
66 pages
Dependency Parsing Using Neural Network Classifier
No ratings yet
Dependency Parsing Using Neural Network Classifier
4 pages
Telugu Dependency Parsing Using Di - 2017 - Journal of King Saud University - Co
No ratings yet
Telugu Dependency Parsing Using Di - 2017 - Journal of King Saud University - Co
7 pages
NLP Assignment-6 Solution
No ratings yet
NLP Assignment-6 Solution
5 pages
CS 224n Assignment #3: Dependency Parsing: 1. Machine Learning & Neural Networks (8 Points)
No ratings yet
CS 224n Assignment #3: Dependency Parsing: 1. Machine Learning & Neural Networks (8 Points)
7 pages
Dependancy Parsing
No ratings yet
Dependancy Parsing
20 pages
Incrementality in Deterministic Dependency Parsing
No ratings yet
Incrementality in Deterministic Dependency Parsing
8 pages
Bottom Up Parsing and Transition Net Grammar
No ratings yet
Bottom Up Parsing and Transition Net Grammar
7 pages
Syntax Directed Translation
No ratings yet
Syntax Directed Translation
21 pages
Chapter 4
No ratings yet
Chapter 4
53 pages
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
Data Structures and Algorithms with Python
From Everand
Data Structures and Algorithms with Python
Aadinath Pothuvaal
No ratings yet
Radial Basis Networks: Fundamentals and Applications for The Activation Functions of Artificial Neural Networks
From Everand
Radial Basis Networks: Fundamentals and Applications for The Activation Functions of Artificial Neural Networks
Fouad Sabry
No ratings yet
Unit 8 - Week 6: Assignment 6
No ratings yet
Unit 8 - Week 6: Assignment 6
5 pages
Chapter 4
No ratings yet
Chapter 4
8 pages
DTMF Detection (10011P0405)
No ratings yet
DTMF Detection (10011P0405)
10 pages
L17-Perceptron
No ratings yet
L17-Perceptron
21 pages
Matrices - Recursive Matrix Multiplication Strassen Algorithm - Mathematics Stack Exchange
No ratings yet
Matrices - Recursive Matrix Multiplication Strassen Algorithm - Mathematics Stack Exchange
4 pages
Lecturer 18 Signals & Systems
No ratings yet
Lecturer 18 Signals & Systems
25 pages
2011 Pre-Calc Slides Section 7.2
No ratings yet
2011 Pre-Calc Slides Section 7.2
19 pages
Finite Difference Method: Applied To A Two-Dimensional, Steady-State, Heat Transfer Problem
No ratings yet
Finite Difference Method: Applied To A Two-Dimensional, Steady-State, Heat Transfer Problem
17 pages
Extended Kalman Filter PDF
No ratings yet
Extended Kalman Filter PDF
2 pages
Midterm
No ratings yet
Midterm
4 pages
Spcc Easy Solutions Spcc Easy Solution Book
No ratings yet
Spcc Easy Solutions Spcc Easy Solution Book
71 pages
Img 20221227 0002
No ratings yet
Img 20221227 0002
1 page
Artificial Intelligence Chapter 3: Solving Problems by Searching
No ratings yet
Artificial Intelligence Chapter 3: Solving Problems by Searching
32 pages
Math302 Mid Summer - July 2021 - KCC
No ratings yet
Math302 Mid Summer - July 2021 - KCC
5 pages
Sample Final Exam
No ratings yet
Sample Final Exam
9 pages
CMP3008 LN2 FiniteAutomata
No ratings yet
CMP3008 LN2 FiniteAutomata
35 pages
Python Programming Jagesh Soni
No ratings yet
Python Programming Jagesh Soni
15 pages
ELT-43007 Matlab Ex3
No ratings yet
ELT-43007 Matlab Ex3
4 pages
2023-24 AIML ML Mid-Semester Make-Up Answer-Keys
No ratings yet
2023-24 AIML ML Mid-Semester Make-Up Answer-Keys
6 pages
Lect 8 Simplex Method - 1
No ratings yet
Lect 8 Simplex Method - 1
32 pages
Word
No ratings yet
Word
6 pages
EE247 - Lecture 2 Filters: - Material Covered Today: - Nomenclature - Filter Specifications - Filter Types
No ratings yet
EE247 - Lecture 2 Filters: - Material Covered Today: - Nomenclature - Filter Specifications - Filter Types
31 pages
ect204 final (1)
No ratings yet
ect204 final (1)
1 page
Standard Methods of Solution
No ratings yet
Standard Methods of Solution
4 pages
GENG 300 Numerical Methods: Qatar University
No ratings yet
GENG 300 Numerical Methods: Qatar University
2 pages
ELE539A: Optimization of Communication Systems Princeton University, Spring 2007 Basic Information
No ratings yet
ELE539A: Optimization of Communication Systems Princeton University, Spring 2007 Basic Information
3 pages
Assignment 2: Write Clearly Your Name, Student Number and Lab Number On The Front Page of Your Assignment
No ratings yet
Assignment 2: Write Clearly Your Name, Student Number and Lab Number On The Front Page of Your Assignment
5 pages
Fred Harris Multirate DSP Part 1 - Virginia Tech Tutorial 2011
No ratings yet
Fred Harris Multirate DSP Part 1 - Virginia Tech Tutorial 2011
52 pages
Image Compression (Chapter 8) : CS474/674 - Prof. Bebis
No ratings yet
Image Compression (Chapter 8) : CS474/674 - Prof. Bebis
128 pages
Giuaki
No ratings yet
Giuaki
7 pages

Dependency Parsing 2: CMSC 723 / LING 723 / INST 725

Uploaded by

Dependency Parsing 2: CMSC 723 / LING 723 / INST 725

Uploaded by

Dependency

Fig credits: Joakim Nivre, Dan

• Transition-based dependency parsing

Can be framed as a structured prediction task

2 dominant approaches: transition-based parsing and graph-based

• Supervised classifiers can be used

Right dependents cannot be attached to their head

- Trick: forest can be turned into tree by adding links to ROOT0

• Dependency tree is projective

• Projective trees make computation easier

• Add new transitions

• Chu-Liu-Edmonds algorithm can find the maximum spanning tree for

• Transition-based dependency parsing

• Graph-based dependency parsing

• A flexible framework that allows many extensions

One solution: “dynamic oracle” [Goldberg & Nivre 2012]

See [Goldberg & Nivre 2012] for details

You might also like