3 - Deep Learning

The document provides an overview of natural language processing and deep learning. It discusses neural networks, text processing for deep learning models, and various applications of sequence modeling for NLP tasks like sentiment analysis, search queries, machine translation, chatbots, and text summarization.

Uploaded by Ansruta Mohanty

Deep Learning and NLP

Natural Language Processing


Session 4

Madhuri Prabhala
Overview of the class

❑ Neural networks
o Weights and biases
o Forward propagation
o Backward propagation
o Activation function
o Gradient descent

❑ Use cases in text data

❑ Text processing for input into deep learning architectures


Understanding Neural network models

[Diagram: a model takes monthly income as input and answers "Offer home loan or not?". The applicant's salary is compared against Rs. 45,000: low salary → home loan not approved (No), high salary → approved (Yes).]
Steps in the model

1. Take the salary as an input, similar to a biological neuron taking in inputs and giving a reaction.

2. Check if it is greater than Rs. 45,000.

3. If it is, then output "Loan approved".

[Diagram: Monthly income (X) → Is salary > 45,000? → Is home loan approved? (Y)]
Steps in the model

[Diagram: spouse's salary (X1), applicant's salary (X2) and father's salary (X3) feed into the decision "Home loan approved?" (Y). The threshold of 45,000 is based on total household income.]
Steps in the model

The threshold of 45,000 is based on total household income, so the decision rule is:

X1 + X2 + X3 > Threshold

=> X1 + X2 + X3 – Threshold > 0

=> X1 + X2 + X3 + Bias > 0, where Bias = –Threshold

If X1 + X2 + X3 + Bias > 0, then the output should be 1 (Approved)
If X1 + X2 + X3 + Bias < 0, then the output should be 0 (Not approved)
The step function

Z = X1 + X2 + X3 + bias

Step function:

Output = 1 (for Z > 0)
Output = 0 (for Z < 0)

Here, the step function is the "Activation function".

If the Activation function is a step function, the model is called a Perceptron.
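The step activation can be sketched in Python (a minimal illustration; the function name is my own):

```python
def step(z):
    # Step activation: output 1 when Z > 0, otherwise 0
    return 1 if z > 0 else 0

print(step(2.5), step(-3.0))  # 1 0
```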


Weights in the Perceptron

[Diagram: spouse's salary (10,000), applicant's salary (15,000) and father's salary (7,000) each enter with weight 1; bias = –45,000; output Y: home loan approved?]

(10,000 × 1) + (15,000 × 1) + (7,000 × 1) = 32,000

So, what is the value of Z?

Z = X1 + X2 + X3 + bias = 32,000 – 45,000 = –13,000

So, should the loan be approved?

Answer: No (the step function returns 0 for values < 0)
Weights in the Perceptron

[Diagram: the same three inputs, now with weights 2, 3 and 1 respectively; bias = –45,000.]

(10,000 × 2) + (15,000 × 3) + (7,000 × 1) = 72,000

So, what is the value of Z?

Z = (2 × X1) + (3 × X2) + (1 × X3) + bias = 72,000 – 45,000 = 27,000

So, should the loan be approved?

Answer: Yes (the step function returns 1 for values > 0)
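The two weight settings above can be sketched as a weighted perceptron (a minimal illustration; the function name is my own):

```python
def weighted_perceptron(inputs, weights, bias):
    # Z = w1*X1 + w2*X2 + w3*X3 + bias, followed by a step activation
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if z > 0 else 0

incomes = [10000, 15000, 7000]
print(weighted_perceptron(incomes, [1, 1, 1], -45000))  # 0: Z = -13,000, not approved
print(weighted_perceptron(incomes, [2, 3, 1], -45000))  # 1: Z =  27,000, approved
```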
Deep Learning Models
❑ Typically used in the case of unstructured data. Use of training data to improve prediction accuracy.
❑ Multi-layer perceptron
o Number of hidden layers
o Number of neurons in the hidden layers
o Input layers
o Number of neurons in the output layers

❑ Use of newly created features to create more features and other hidden layers

❑ TensorFlow Playground:

http://playground.tensorflow.org/#activation=tanh&batchSize=10&dataset=circle&regDataset=reg-plane&learningRate=0.03&regularizationRate=0&noise=0&networkShape=4,2&seed=0.68420&showTestData=false&discretize=false&percTrainData=50&x=true&y=true&xTimesY=false&xSquared=false&ySquared=false&cosX=false&sinX=false&cosY=false&sinY=false&collectStats=false&problem=classification&initZero=false&hideText=false

❑ Forward and Backward propagation


❑ Epoch (1 cycle of Forward and backward propagation)
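An epoch, one cycle of forward and backward propagation followed by a weight update, can be sketched for a single sigmoid neuron (a toy illustration with invented numbers, not the playground's exact setup):

```python
import math

# Toy data: one input x and one target y (values invented for illustration)
x, y = 2.0, 1.0
w, b = 0.1, 0.0   # initial weight and bias
lr = 0.5          # learning rate

for epoch in range(20):
    # Forward propagation
    z = w * x + b
    a = 1 / (1 + math.exp(-z))   # sigmoid activation
    loss = (a - y) ** 2          # squared error

    # Backward propagation (chain rule)
    grad = 2 * (a - y) * a * (1 - a)
    w -= lr * grad * x
    b -= lr * grad

print(round(loss, 4))  # loss shrinks toward 0 as epochs accumulate
```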
Text data – Sequential models
Sequential data – Text data

job data the scientist a is of excellent (scrambled)

the job of a data scientist is excellent (original order)

Text data has a sequence.

If the sequence is disturbed, it stops making sense.

Therefore, Text data falls under the category of sequence data.


Sequence modeling in text – Use case – Sentiment classification

o The job of a data scientist is excellent

o I truly dislike my job

Sentiment Classification
Sequence modeling in text – Use case – Search queries

"Is Amazon a better e-commerce site compared to Flipkart?" → Sequence Models

Search engine queries


Sequence modeling in text – Use case – Machine Translation

"Oppenheimer is a great movie" → Sequence Models → "Oppenheimer es una gran película."

Language translation
Sequence modeling in text – Use case – Chat bots

"Please suggest a suitable insurance policy." → Sequence Models → "Sure, I will help you choose a suitable policy."

Dialogue systems
Sequence modeling in text – Use case – Text summarization

"A good data scientist needs to have five important skills. Anyone who wants to start a career as a data scientist must gain these skills. They are: 1… 2… 3… 4… 5…" → Sequence Models → "The five skills required to start a career as a data scientist are: …"

Text summarization
NLP for Deep Learning
Preparing text for deep learning models

1. Text pre-processing

2. Convert the text into arrays

3. Feed the arrays into the deep learning models


Text pre-processing for Deep learning

Text cleaning → Remove text noise: unwanted or useless information in the text
• URLs, punctuation marks, numbers, special characters
• Slang – bro, dope, etc.
• Spelling mistakes – cntrl, defntly
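The cleaning step above can be sketched with regular expressions (a minimal example; the function name and patterns are my own):

```python
import re

def clean_text(text):
    # Remove URLs, numbers, punctuation and extra spaces; lower-case the rest
    text = re.sub(r"http\S+", " ", text)       # URLs
    text = re.sub(r"[^a-zA-Z\s]", " ", text)   # punctuation, numbers, special characters
    return re.sub(r"\s+", " ", text).strip().lower()

print(clean_text("Check https://example.com NOW!! 100% legit..."))
# -> "check now legit"
```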

Text pre-processing

Text representation → Encoding: the mapping between text characters and computer memory
• ASCII (English), West European (Latin), Big5 (Chinese)
Text encoding

❑ ASCII codes are numerical representations of all the characters

❑ Another encoding can have some other characteristics to represent the English characters

❑ It is important to have a standard encoding for all kinds of text before any modeling or analysis on text

❑ UTF-8 – universally accepted encoding for most languages
o All text data should be available in UTF-8 to avoid any discrepancy
o It is preferred to convert all text to lower-case; e.g. "Pen" and "pen" are treated differently by the computer

English   ASCII Code
a         097
b         098
c         099
d         100
e         101
f         102
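The code points in the table can be checked directly in Python:

```python
# ASCII code points match the table: 'a' -> 97, 'b' -> 98, ...
print(ord("a"), ord("b"))      # 97 98

# UTF-8 encodes English characters with the same bytes as ASCII
print("abc".encode("utf-8"))   # b'abc'

# Lower-casing so that "Pen" and "pen" map to the same token
print("Pen".lower() == "pen")  # True
```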
Representing text data numerically

❑ Machine learning algorithms do not accept raw text as input.

❑ Therefore, text needs to be converted to numbers

❑ Two ways of converting text to numbers are:


o One-hot encoding
o Word embeddings
Text representation – One hot encoding
o The length of the vector is fixed
o It is equal to the number of unique words in the vocabulary.
o In actual data, the size of the vocabulary, and therefore the
size of the vectors is huge.
Example: "I have a rose garden". Each word gets a vector with a single 1 at its own vocabulary position:

I      1 0 0 0 0
have   0 1 0 0 0
a      0 0 1 0 0
rose   0 0 0 1 0
garden 0 0 0 0 1
Text representation – One hot encoding – Steps

o Clean the text

o Create tokens from the text

o From the created tokens, prepare a vocabulary

o Prepare the one-hot encoders
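The four steps above can be sketched in plain Python (a minimal illustration; helper names are my own):

```python
sentences = ["The school is nearby.", "The tennis class is fun.", "Give me the book."]

# 1. Clean and 2. tokenize
tokens = [s.lower().replace(".", "").split() for s in sentences]

# 3. Build the vocabulary from the tokens (first-occurrence order)
vocab = []
for sent in tokens:
    for word in sent:
        if word not in vocab:
            vocab.append(word)

# 4. One-hot encode: a 1 at the word's vocabulary index, 0 elsewhere
def one_hot(word):
    vec = [0] * len(vocab)
    vec[vocab.index(word)] = 1
    return vec

print(vocab)              # ['the', 'school', 'is', 'nearby', 'tennis', 'class', 'fun', 'give', 'me', 'book']
print(one_hot("tennis"))  # [0, 0, 0, 0, 1, 0, 0, 0, 0, 0]
```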


Steps in One hot encoding

Original text → Cleaned text → Tokens

The school is nearby. → the school is nearby → the, school, is, nearby
The tennis class is fun. → the tennis class is fun → the, tennis, class, is, fun
Give me the book. → give me the book → give, me, the, book

Vocabulary (unique tokens): the, school, is, nearby, tennis, class, fun, give, me, book
Steps in one hot encoding – Creating the one hot vector

The size of the vector is equal to the size of the vocabulary.

Vocabulary: the, school, is, nearby, tennis, class, fun, give, me, book

In this case the size of the vocabulary is 10, so the size of each one-hot vector is 10. With the words listed alphabetically:

book   1 0 0 0 0 0 0 0 0 0
class  0 1 0 0 0 0 0 0 0 0
fun    0 0 1 0 0 0 0 0 0 0
give   0 0 0 1 0 0 0 0 0 0
…
tennis 0 0 0 0 0 0 0 0 1 0
the    0 0 0 0 0 0 0 0 0 1
Limitation of one hot encoding

❑ Look at these sentences:

o The shop is nearby.
o The college is nearby.

The ______ is nearby. The context is identical.

❑ Assuming a vector size of 25:

shop    0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
college 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0

o Is this identical context reflected in the one-hot encoding?
o No. One-hot encoding does not capture the context.
o This challenge is overcome by word embeddings.
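The limitation can be verified numerically: any two distinct one-hot vectors have a dot product of zero, so identical contexts produce no similarity (the vectors below are hypothetical):

```python
# Hypothetical one-hot vectors for "shop" and "college" in a 6-word vocabulary
shop    = [0, 0, 1, 0, 0, 0]
college = [0, 0, 0, 0, 1, 0]

# The dot product (and hence cosine similarity) of two distinct one-hot vectors is 0
similarity = sum(a * b for a, b in zip(shop, college))
print(similarity)  # 0 -> one-hot encoding sees no relationship between the words
```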
Text representation – Word embeddings

❑ Another approach to numerical representation of text

❑ Embeddings do not depend on the vocabulary size

o Even with a vocabulary size of 40,000, the embedding vectors can have just 300 to 400 dimensions

❑ They capture the context around the word

o Word embeddings represent the position of the word in each context

❑ They are obtained by training a special neural network architecture


Obtaining Word embeddings

vector of mango ≈ vector of guava
vector of cheetah ≈ vector of leopard

[Diagram: in the embedding space, Mango lies close to Guava, and Cheetah lies close to Leopard.]


Obtaining Word embeddings - Context

Rooster – Male + Female = ?? → Hen

=> The gender relationship is preserved by the word embeddings.

[Diagram: the direction from Male to Female matches the direction from Rooster to Hen.]
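The analogy can be illustrated with toy vectors (the numbers below are invented for illustration and are not real embedding values):

```python
# Toy 2-d "embeddings" (invented; real embeddings have hundreds of dimensions)
vectors = {
    "rooster": [0.9, 0.7],   # dim 1 ~ "bird-ness", dim 2 ~ "male-ness" (assumed)
    "hen":     [0.9, 0.1],
    "male":    [0.1, 0.8],
    "female":  [0.1, 0.2],
}

# rooster - male + female should land near hen
result = [round(r - m + f, 2) for r, m, f in
          zip(vectors["rooster"], vectors["male"], vectors["female"])]
print(result)  # [0.9, 0.1] -> matches the vector for "hen"
```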
Obtaining Word embeddings - Approaches

Obtaining word embeddings

Training word embedding representations from scratch:
✓ A huge text corpus is fed to a NN architecture
✓ The network is trained to give out word embeddings
✓ The bigger the text corpus, the better the word embeddings
✓ E.g., Wikipedia, thousands of news articles, etc.

Pre-trained word embeddings:
✓ word2vec (Google)
✓ GloVe (Stanford)
✓ Can be downloaded from the internet
