21CSE356T: Natural Language
Processing
Unit 2
Prepared by: Dr. Pritam Khan
Introduction to Parsing in NLP
• Definition: Parsing analyzes sentence structure
based on grammar rules.
• Importance: Essential for NLP applications
like machine translation and speech
recognition.
Context-Free Grammars (CFGs)
• Definition: Formal grammar with production rules to describe
language syntax.
Purpose of context-free grammar:
• To list all strings in a language using a set of rules (production
rules).
• It extends the capabilities of regular expressions and finite automata.
Components:
• - Terminal symbols (Σ): Actual words.
• - Non-terminal symbols (N): Abstract categories.
• - Production rules (P): Define how symbols may be rewritten.
• - Start symbol (S): Root of parse tree.
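These four components can be represented directly in code. A minimal sketch with plain Python types (the rule set and helper function are illustrative, not from any particular library):

```python
# A CFG as a 4-tuple (N, Sigma, P, S), represented with plain Python types.
NONTERMINALS = {"S", "NP", "VP", "Det", "N", "V"}   # N: abstract categories
TERMINALS = {"the", "cat", "dog", "chases"}         # Sigma: actual words
START = "S"                                         # S: root of the parse tree
PRODUCTIONS = {                                     # P: symbol replacements
    "S":   [["NP", "VP"]],
    "NP":  [["Det", "N"]],
    "VP":  [["V", "NP"]],
    "Det": [["the"]],
    "N":   [["cat"], ["dog"]],
    "V":   [["chases"]],
}

def is_cfg_well_formed(productions, nonterminals, terminals):
    """Every LHS is a non-terminal and every RHS symbol is known."""
    for lhs, rhss in productions.items():
        if lhs not in nonterminals:
            return False
        for rhs in rhss:
            if any(sym not in nonterminals | terminals for sym in rhs):
                return False
    return True

print(is_cfg_well_formed(PRODUCTIONS, NONTERMINALS, TERMINALS))  # True
```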
Grammar Rules for English
• Phrase Structure Rules:
• - Sentence (S) → NP VP
• - Noun Phrase (NP) → Det N | N
• - Verb Phrase (VP) → V NP | V
• - Prepositional Phrase (PP) → P NP
• Example: 'The cat chases the dog.'
CFG Example
This tree represents the syntactic structure of
the sentence "The cat sat on the mat":
• S (Sentence) is split into NP (Noun Phrase) and VP (Verb Phrase).
• The NP consists of Det (Determiner) "The" and N (Noun) "cat".
• The VP consists of the V (Verb) "sat" and a PP (Prepositional Phrase).
• The PP consists of P (Preposition) "on" and another NP, which includes Det "the" and N "mat".
Top-Down Parsing
• Definition: Starts from the start symbol and rewrites it until it derives the input string.
• Methods:
• - Recursive Descent Parsing: Uses recursive
function calls.
• - Backtracking: Tries different rules.
• Pros: Easy to implement.
• Cons: Inefficient due to backtracking.
Top-Down Parsing
• In this kind of parsing, the parser starts constructing the parse tree from the start symbol and
then tries to transform the start symbol to the input.
• The most common form of top-down parsing uses recursive procedure to process the input.
• Top-down parsing starts its search from the root node and works downwards towards the leaf
node.
• The root is expanded using the grammar rules, with ’S’ as the start (non-terminal) symbol.
• Each non-terminal symbol in the resulting sub-tree is then expanded using the appropriate
grammar rules.
S → NP VP | VP
NP → Det Noun | Det Noun PP | Pronoun | Noun
VP → Verb | Verb NP
PP → Prep NP
Det → This | That | the
Noun → Student | …
Verb → plays | paint | …
Preposition → from | with | on | to
Pronoun → She | He | they
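A grammar like the one above can drive a simple recursive-descent (top-down) parser. A minimal sketch with backtracking, trimmed to the rules needed for "paint the door" (all names are illustrative):

```python
# Recursive-descent parsing with backtracking over a toy grammar.
GRAMMAR = {
    "S":  [["NP", "VP"], ["VP"]],
    "NP": [["Det", "Noun"], ["Pronoun"]],
    "VP": [["Verb", "NP"], ["Verb"]],
    "Det":     [["the"], ["this"], ["that"]],
    "Noun":    [["door"], ["student"]],
    "Verb":    [["paint"], ["plays"]],
    "Pronoun": [["she"], ["he"], ["they"]],
}

def parse(symbol, words, pos):
    """Try to derive words[pos:] from `symbol`; yield every end position."""
    if symbol not in GRAMMAR:              # terminal: must match the next word
        if pos < len(words) and words[pos] == symbol:
            yield pos + 1
        return
    for rhs in GRAMMAR[symbol]:            # try each production in turn
        positions = [pos]
        for sym in rhs:                    # thread positions through the RHS
            positions = [end for p in positions for end in parse(sym, words, p)]
        yield from positions

def accepts(sentence):
    words = sentence.lower().split()
    return len(words) in parse("S", words, 0)

print(accepts("paint the door"))   # True
print(accepts("door the paint"))   # False
```

Note that this sketch inherits top-down parsing's weakness: a left-recursive rule such as NP → NP PP would make `parse` recurse forever.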
Example 1: Paint the door
Level 1: start with S.
Level 2: expand S using each rule: S → NP VP and S → VP.
Level 3: expand the children in every way: NP → Det Noun, NP → Det Noun PP, NP → Pronoun; VP → Verb, VP → Verb NP.
Level 4: the expansion that matches the input is S → VP, VP → Verb NP, with Verb = "paint", Det = "the", Noun = "door":
[S [VP [Verb paint] [NP [Det the] [Noun door]]]]
Example 2: Deepika reads the book
"Deepika reads the book" parses as:
[S [NP [Noun Deepika]] [VP [Verb reads] [NP [Det the] [Noun book]]]]
Example 3: Does this flight include a meal?
"Does this flight include a meal?" parses as:
[S [Aux Does] [NP [Det this] [Noun flight]] [VP [Verb include] [NP [Det a] [Noun meal]]]]
Advantages and Disadvantages of Top-Down
Parsing
• Advantages
• Every tree it builds is rooted in S, so it never wastes time exploring subtrees that could not form a sentence.
• Disadvantages
• It can generate trees that do not match the input, since it expands the start symbol of the grammar before looking at the words.
• Time consuming, because backtracking makes it check each and every rule.
Bottom-Up Parsing
• Definition: Starts with input, builds up to start
symbol.
• Methods:
• - Shift-Reduce Parsing: Uses stack.
• - LR Parsing: Used in compilers.
• Pros: More efficient than top-down parsing.
• Cons: More complex implementation.
Bottom-Up Parsing
• In this kind of parsing, the parser starts with the input
symbol and tries to construct the parse tree in an upward
direction towards the root.
• At each step the parser checks the rules in the grammar
where the RHS matches the portion of the parse tree
constructed so far.
• It then reduces it using the LHS of the production.
• The parse tree is considered to be successful if the parser
reduces the tree to the start symbol of the grammar.
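The match-and-reduce procedure described above can be sketched as a greedy shift-reduce recognizer over a toy grammar (the lexicon and rules are illustrative; real shift-reduce parsers use parse tables and lookahead to decide between shifting and reducing):

```python
# Greedy shift-reduce recognition: shift words, reduce when an RHS matches the stack top.
LEXICON = {"paint": "Verb", "the": "Det", "door": "Noun"}
RULES = [                      # (RHS, LHS) pairs
    (("Det", "Noun"), "NP"),
    (("Verb", "NP"), "VP"),
    (("VP",), "S"),
]

def shift_reduce(sentence):
    stack, trace = [], []
    for word in sentence.lower().split():
        stack.append(LEXICON[word])             # SHIFT: push the word's category
        trace.append(f"shift  -> {stack}")
        reduced = True
        while reduced:                          # REDUCE while any RHS matches
            reduced = False
            for rhs, lhs in RULES:
                if tuple(stack[-len(rhs):]) == rhs:
                    del stack[-len(rhs):]       # pop the RHS ...
                    stack.append(lhs)           # ... and push the LHS
                    trace.append(f"reduce -> {stack}")
                    reduced = True
    return stack, trace

stack, trace = shift_reduce("paint the door")
print(stack)   # ['S'] : reduced all the way to the start symbol
```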
Example 1: Paint the door
The reductions proceed upward from the words:
paint the door
→ Verb Det Noun   (lexical rules)
→ Verb NP         (NP → Det Noun)
→ VP              (VP → Verb NP)
→ S               (S → VP)
Example 2: Deepika reads the book
Deepika reads the book
→ Noun Verb Det Noun   (lexical rules)
→ NP Verb NP           (NP → Noun, NP → Det Noun)
→ NP VP                (VP → Verb NP)
→ S                    (S → NP VP)
Example 3: Does this flight include a meal?
Does this flight include a meal?
→ Aux Det Noun Verb Det Noun   (lexical rules)
→ Aux NP Verb NP               (NP → Det Noun)
→ Aux NP VP                    (VP → Verb NP)
→ S                            (S → Aux NP VP)
Advantages and Disadvantages of
Bottom-Up Parsing
• Advantages
• It never wastes time in exploring a tree that
does not match the input.
• Disadvantages
• It wastes time generating subtrees that have no chance of leading to an S-rooted tree.
Disadvantages of parsing
• Left Recursion Leading to Infinite Loops:
• Top-down parsers cannot handle left-recursive grammars
directly, as they result in non-terminating recursive calls.
• Ambiguity:
• Ambiguous grammars can lead to multiple valid parse trees
for a single input, complicating the parsing process and
making it difficult to determine the intended structure and
meaning.
• Addressing these disadvantages involves transforming
grammars to remove left recursion and refine them to
resolve ambiguities, ensuring that parsers can operate
efficiently and accurately.
Ambiguity in Parsing
• Definition: Sentence has multiple valid parse trees.
• Example: 'I saw the man with the telescope.'
• - Interpretation 1: I used a telescope.
• - Interpretation 2: The man had a telescope.
• Solutions:
• - Probabilistic Context-Free Grammars (PCFGs)
• - Semantic analysis.
• Visual: Parse trees for ambiguity.
Cocke-Kasami-Younger (CKY)
Parsing Algorithm
The Cocke-Kasami-Younger (CKY) algorithm is a bottom-up parsing
algorithm used for parsing context-free grammars (CFGs) in Chomsky
Normal Form (CNF). It is particularly useful for parsing sentences
efficiently in NLP and is a foundational technique in probabilistic parsing.
Cocke-Kasami-Younger (CKY)
Parsing
• Definition: A dynamic programming method for
CFGs in Chomsky Normal Form.
• Steps:
• 1. Convert CFG to CNF.
• 2. Fill table bottom-up.
• 3. Identify valid parse tree.
• Pros: Efficient for large parsing tasks.
• Cons: Needs CNF conversion.
Steps of the CKY Algorithm
Step 1: Convert the Grammar to Chomsky Normal
Form (CNF)
A CNF grammar has rules of the following forms; every rule must take one of them:
1. A→BC (where A,B,C are non-terminals)
2. A→a (where A is a non-terminal, and a is a terminal)
Example:
Standard CFG: Converted to CNF:
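One core part of CNF conversion is binarizing rules whose right-hand side is longer than two symbols, by introducing fresh non-terminals. A minimal sketch (the rule and the helper names like X1 are illustrative; full CNF conversion also removes unit and empty productions):

```python
# Binarize long productions: A -> B C D  becomes  A -> B X1, X1 -> C D.
def binarize(rules):
    out, counter = [], 0
    for lhs, rhs in rules:
        while len(rhs) > 2:
            counter += 1
            new = f"X{counter}"                 # fresh helper non-terminal
            out.append((lhs, [rhs[0], new]))    # A -> B X1
            lhs, rhs = new, rhs[1:]             # continue with X1 -> C D ...
        out.append((lhs, rhs))
    return out

print(binarize([("S", ["NP", "VP", "PP"])]))
# [('S', ['NP', 'X1']), ('X1', ['VP', 'PP'])]
```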
Step 2: Initialize the CKY Parsing Table
For an input sentence:
"The dog chases"
Create a table where rows and columns represent substrings of the input.
Step 3: Fill the CKY Table Bottom-Up
1.Fill the diagonal with terminal rules (matching words to CNF rules).
2.Build higher levels using binary productions.
3.Check if the start symbol (S) appears in the top-right cell → If yes, the
sentence is grammatically valid.
Example CKY parse table for "The dog chases" (cell [i, j] holds the non-terminals spanning words i through j):

            The     dog     chases
The         Det     NP      S
dog                 N
chases                      V, VP

• Diagonal: The → Det, dog → N, chases → V (with VP → chases added when the unit rule VP → V is converted to CNF).
• [The, dog]: Det + N reduces to NP.
• [The, chases]: NP + VP reduces to S, so the sentence is grammatically valid.
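The table-filling steps above can be implemented directly with dynamic programming. A minimal CKY recognizer for "the dog chases" (the CNF grammar below, including the lexical VP for "chases", is illustrative):

```python
# CKY recognition over a CNF grammar.
UNARY = {            # terminal rules A -> a
    "the": {"Det"}, "dog": {"N"}, "chases": {"V", "VP"},
}
BINARY = {           # binary rules A -> B C, keyed by (B, C)
    ("Det", "N"): {"NP"},
    ("NP", "VP"): {"S"},
    ("V", "NP"): {"VP"},
}

def cky(words, start="S"):
    n = len(words)
    # table[i][j] = set of non-terminals that derive words[i:j]
    table = [[set() for _ in range(n + 1)] for _ in range(n)]
    for i, w in enumerate(words):                       # 1. diagonal: lexical rules
        table[i][i + 1] = set(UNARY.get(w, ()))
    for span in range(2, n + 1):                        # 2. longer spans, bottom-up
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):                   # every split point
                for b in table[i][k]:
                    for c in table[k][j]:
                        table[i][j] |= BINARY.get((b, c), set())
    return start in table[0][n]                         # 3. S in the top cell?

print(cky("the dog chases".split()))      # True
print(cky("dog the chases".split()))      # False
```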
Dependency Parsing
• Definition: Focuses on word relationships instead
of phrase structures.
• Example: 'The dog chased the ball.'
• - 'chased' is the root verb.
• - 'dog' (subject) depends on 'chased.'
• - 'ball' (object) depends on 'chased.'
• Applications: Syntax-based machine translation.
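The relations in the example can be encoded as head-pointing arcs, the representation used in dependency treebanks such as Universal Dependencies (the exact relation labels below are illustrative):

```python
# Dependency parse of "The dog chased the ball" as (word, head_index, relation) triples.
# Index 0 is a virtual ROOT; every word points at its head.
sentence = ["ROOT", "The", "dog", "chased", "the", "ball"]
arcs = [
    ("The",    2, "det"),    # 'The'    depends on 'dog'
    ("dog",    3, "nsubj"),  # 'dog'    is the subject of 'chased'
    ("chased", 0, "root"),   # 'chased' is the root verb
    ("the",    5, "det"),    # 'the'    depends on 'ball'
    ("ball",   3, "obj"),    # 'ball'   is the object of 'chased'
]

for word, head, rel in arcs:
    print(f"{word:7s} --{rel}--> {sentence[head]}")
```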
Earley Parsing
• Definition: Top-down parsing handling left-
recursion efficiently.
• Steps:
• 1. Prediction: Expand non-terminals.
• 2. Scanning: Match terminals.
• 3. Completion: Move to next state.
• Pros: Handles any CFG.
• Cons: Slower than some parsers.
Earley Parsing
A state in Earley Parsing is represented as: A→α∙β,[i]
•A: Non-terminal being expanded.
•α: Already parsed portion.
•β: Remaining to be parsed.
•∙: Position in the rule.
•[i]: Position in the input where this state started.
Three Main Operations
1. Prediction: expand a non-terminal. If β begins with a non-terminal B, add all rules of B.
2. Scanning: match terminals with the input. If β begins with a terminal, match it against the current input symbol.
3. Completion: complete a rule and advance the parser. If β is empty, find and advance the states that predicted this rule.
Steps of Earley Parsing
1. Initialize with the start state: S’ → ∙S, [0]
2. For each input position k:
   a. Prediction: add rules for non-terminals.
   b. Scanning: move the dot past terminals if they match.
   c. Completion: move the dot in the states that awaited this completion.
3. Final state: the parse is successful if S’ → S∙, [0] appears at the end of the input.
Example Parsing "John eats"
Grammar:
Parsing Steps:
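As a sketch, an Earley recognizer for "John eats" can be built from the three operations above. The grammar here (S → NP VP, NP → Noun, VP → Verb, Noun → John, Verb → eats) is an assumption for illustration:

```python
# Earley recognition: a state is (lhs, rhs, dot, origin); chart[k] holds states after k words.
GRAMMAR = {
    "S":    [["NP", "VP"]],
    "NP":   [["Noun"]],
    "VP":   [["Verb"]],
    "Noun": [["John"]],
    "Verb": [["eats"]],
}

def earley(words, start="S"):
    chart = [set() for _ in range(len(words) + 1)]
    chart[0].add(("S'", (start,), 0, 0))                   # augmented start state
    for k in range(len(words) + 1):
        agenda = list(chart[k])
        while agenda:
            lhs, rhs, dot, origin = agenda.pop()
            if dot < len(rhs):
                nxt = rhs[dot]
                if nxt in GRAMMAR:                         # PREDICTION
                    for prod in GRAMMAR[nxt]:
                        new = (nxt, tuple(prod), 0, k)
                        if new not in chart[k]:
                            chart[k].add(new); agenda.append(new)
                elif k < len(words) and words[k] == nxt:   # SCANNING
                    chart[k + 1].add((lhs, rhs, dot + 1, origin))
            else:                                          # COMPLETION
                for plhs, prhs, pdot, porig in list(chart[origin]):
                    if pdot < len(prhs) and prhs[pdot] == lhs:
                        new = (plhs, prhs, pdot + 1, porig)
                        if new not in chart[k]:
                            chart[k].add(new); agenda.append(new)
    return ("S'", (start,), 1, 0) in chart[len(words)]     # S' -> S., [0] at the end?

print(earley(["John", "eats"]))    # True
print(earley(["eats", "John"]))    # False
```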
Probabilistic Context-Free
Grammars (PCFGs)
• Definition: CFG with probabilities assigned to rules.
• Example:
• - S → NP VP (0.9)
• - NP → Det N (0.6)
• - VP → V NP (0.7)
• Usage: Resolves ambiguity using probabilities.
• Applications: Speech recognition, machine translation.
Consider a simple grammar for parsing the sentence "The cat sleeps":
Two Possible Parse Trees with Probabilities
Parse Tree 1: Using NP→Det N
Probability:
P(S)=1.0×0.5×1.0×0.7×1.0=0.35
Parse Tree 2: Using NP→N
Probability:
P(S)=1.0×0.5×0.7×1.0=0.35
Total Probability Distribution
The two trees have probabilities 0.35 and 0.35, a total of 0.7; dividing each by this total normalizes them into a probability distribution over parse trees:

Parse Tree     Probability   Normalized
NP → Det N     0.35          0.5
NP → N         0.35          0.5

If there were more trees, they would also be included in the distribution.
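The tree probabilities above are products of rule probabilities, which can be computed directly. A sketch (rule probabilities taken from the worked example where given; the remaining values, such as VP → sleeps, are assumed):

```python
# Probability of a parse tree = product of the probabilities of the rules it uses.
from math import prod

P = {                                 # rule probabilities (partly assumed)
    ("S", "NP VP"): 1.0,
    ("NP", "Det N"): 0.5,
    ("NP", "N"): 0.5,
    ("Det", "the"): 1.0,
    ("N", "cat"): 0.7,
    ("VP", "sleeps"): 1.0,
}

tree1 = [("S", "NP VP"), ("NP", "Det N"), ("Det", "the"), ("N", "cat"), ("VP", "sleeps")]
tree2 = [("S", "NP VP"), ("NP", "N"), ("N", "cat"), ("VP", "sleeps")]

p1 = prod(P[rule] for rule in tree1)      # 1.0 * 0.5 * 1.0 * 0.7 * 1.0 = 0.35
p2 = prod(P[rule] for rule in tree2)      # 1.0 * 0.5 * 0.7 * 1.0       = 0.35
total = p1 + p2
print(p1 / total, p2 / total)             # normalized: 0.5 each
```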