Deep Learning and its Application in Natural Language Processing (DL&NLP)
DA345

Suggested reading materials

by
Soumitra Samanta

February 24, 2025
Contents
1 Motivation, Course overview, Syllabus, Prerequisites and Resources
  1.1 Class schedule
  1.2 Teaching Assistant (TA)
  1.3 Prerequisite(s)
  1.4 Course url
  1.5 Credit: 4 (four), approximately 60 credit hours
  1.6 Tentative syllabus
  1.7 Related books
  1.8 Evaluation
  1.9 Assignments
  1.10 Project
  1.11 Academic ethics
  1.12 DL & NLP related tools
  1.13 NLP datasets repository
  1.14 DL & NLP related top tier conference
  1.15 DL & NLP related top journals
  1.16 For recent updates on ML you can follow the arXiv
  1.17 Suggested reading
2 Introduction to Artificial neural network
  2.1 Suggested reading

3 Perceptron learning algorithm
  3.1 Suggested reading
  3.2 Assignment

4 Introduction to different activation functions
  4.1 Suggested reading
5 Introduction to loss function and gradients
  5.1 Suggested reading

6 Introduction to backpropagation
  6.1 Suggested reading

7 Introduction to parameter initialisation and update rules
  7.1 Suggested reading
  7.2 Assignment-2

8 Convolutional Networks-1
  8.1 Suggested reading

9 Convolutional Networks-2
  9.1 Suggested reading

10 Convolutional Networks-3
  10.1 Suggested reading
  10.2 Assignment-3

11 Introduction to NLP, language model: N-gram
  11.1 Suggested reading
  11.2 Homework

12 Word embeddings: vector semantics, neural word embedding
  12.1 Suggested reading
  12.2 Homework
Lecture 1
Motivation, Course overview, Syllabus, Prerequisites and Resources
1.1 Class schedule:
• Tuesday: 10:30 AM - 12:00 PM (IH402)

• Wednesday: 12:00 PM - 1:30 PM (IH402)

• Friday: 12:00 PM - 1:30 PM (IH402)
1.2 Teaching Assistant (TA):

We have a TA in this course:

• TA: Suvajit Patra (2nd yr. PhD student) (IH413)

• Email: [email protected]
1.3 Prerequisite(s)
• Introduction to Machine Learning
1.5 Credit: 4 (four), approximately 60 credit hours
1.6 Tentative syllabus
Here is the tentative syllabus:
• Artificial neural network (ANN): Modelling single neuron activity, different types of activation functions (sigmoid, tanh, ReLU, ELU, etc.), how to connect multiple neurons to form a network, Multi-layer perceptron

• Optimization: Backpropagation, different loss functions, gradient descent, stochastic gradient descent and different update rules (AdaGrad, RMSProp, Adam, etc.) for network parameters, regularization, dropout, batch normalisation, etc.

• Deep learning toolbox: Explore a deep learning toolbox like PyTorch (my personal choice)/TensorFlow and their autograd functionalities (see the short sketch after this list)

• Convolutional neural network (CNN): Concept of kernel and convolution, some pooling operations (max, average, etc.), some standard CNN architectures like LeNet, AlexNet, VggNet, ResNet, etc., and the concept of transfer learning
• Neural language model:

  – Introduction to NLP

  – Text preprocessing: tokenisation, stop words, stemming, lemmatisation, etc.

  – Vector representations of text: Bag of Words, TF-IDF, word embeddings, Word2Vec, GloVe, etc.

  – Sequence modelling: Recurrent neural network (RNN), Self-Attention network, etc.

  – Transformers: Attention, BERT and its different variants, Encoder-Decoder models

  – Large language model (LLM): GPT and its different variants, pre-trained language models, transfer learning

  – Applications: text classification, sentiment analysis, Named Entity Recognition (NER), machine translation, text summarization, text generation, etc.
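To preview the autograd functionality mentioned in the toolbox bullet above, here is a tiny PyTorch sketch; the single-neuron example and its numbers are illustrative, not from the slides:

    import torch

    # Autograd demo: compute d(loss)/dw for a single sigmoid neuron.
    w = torch.tensor([1.0, -2.0], requires_grad=True)  # trainable weights
    x = torch.tensor([0.5, 3.0])                       # one input example
    y = torch.tensor(1.0)                              # its target label

    y_hat = torch.sigmoid(w @ x)   # forward pass
    loss = (y_hat - y) ** 2        # squared error
    loss.backward()                # backward pass: autograd does the chain rule
    print(w.grad)                  # d(loss)/dw, computed automatically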
1.7 Related books
We will follow multiple books for different topics. Here are some suggested books we will follow in this course:
• Charu C. Aggarwal. Neural Networks and Deep Learning: A Textbook, Springer Cham, 2nd edition, 2023.

• Simon Haykin. Neural Networks and Learning Machines, Pearson, 3rd edition, 2009.

• Eugene Charniak. Introduction to Deep Learning, MIT Press, 2018.

• Michael Nielsen. Neural Networks and Deep Learning, online.

• Ovidiu Calin. Deep Learning Architectures: A Mathematical Approach, Springer Cham, 1st edition, 2020.

• Dan Jurafsky and James H. Martin. Speech and Language Processing, draft, 3rd edition, 2024. [online]

• Delip Rao and Brian McMahan. Natural Language Processing with PyTorch, O'Reilly Media, Inc., 2019.

• Lewis Tunstall, Leandro von Werra, and Thomas Wolf. Natural Language Processing with Transformers, O'Reilly Media, Inc., 2022. [online, code only]

• Yoav Goldberg. A Primer on Neural Network Models for Natural Language Processing. [online]
1.8 Evaluation:
The approximate weightage of the different components in the evaluation is as follows:

  Midterm Exam: 10%
  Final Exam: 40%
  Assignments and Class tests/Quizzes: 20%
  Project: 25%
  Class attendance: 5%
1.9 Assignments:
There will be some programming assignments. For the programming assignments, we will use the Python programming language. The assignment submission deadlines are strict, and we will consider 11:59 PM as our day end.
1.10 Project:
• Can be done in a group (max two students)

• Be careful about your project partner!

• If your partner is auditing the course, then you may be in trouble!

• Define your own project

• Submit a one-page project proposal within a fixed time (the first four weeks)

• Finish the work within the timeline

• Report submission

• Submission deadline: seven days before the final exam date; this deadline is strict, and you can adjust your assignment buffer days here

• We will consider 11:59 PM as our day end
1.11 Academic ethics:
We will follow some academic ethics:
• Your grade should reflect your own work.

• Copying or paraphrasing someone's work (code included), or permitting your own work to be copied or paraphrased, even if only in part, is strictly forbidden, and will result in an automatic grade of zero for the entire assignment or exam in which the copying or paraphrasing was done.
1.12 DL & NLP related tools
• Machine Learning in Python - https://scikit-learn.org/stable/

• ML on GPU - https://rapids.ai/

• PyTorch - https://pytorch.org/

• Natural Language Toolkit - https://www.nltk.org/

• NLP for Indian languages - https://github.com/AI4Bharat/indicnlp_catalog

• Bangla NLP - https://github.com/sagorbrur/bnlp

• ···
1.13 NLP datasets repository

You can find some datasets to evaluate your NLP models here:
• https://github.com/niderhoff/nlp-datasets

• https://github.com/sebastianruder/NLP-progress

• https://www.nltk.org/nltk_data/

• https://universaldependencies.org/

• Movie subtitles: https://opus.nlpl.eu/OpenSubtitles-v2018.php

• I am not sure whether the data can be downloaded, but you can try these sources for your application:

  – Related to Bengali literature: https://nltr.itewb.gov.in/
  – https://nltr.itewb.gov.in/downloads.php
  – https://rabindra-rachanabali.nltr.org/node/1
  – https://nazrul-rachanabali.nltr.org/
  – https://bankim-rachanabali.nltr.org/
  – https://sarat-rachanabali.nltr.org/
  – https://advaitaashrama.org/cw/content.php
1.14 DL & NLP related top tier conference

• International Conference on Machine Learning (ICML) - https://icml.cc/
• Neural Information Processing Systems (NeurIPS) - https://neurips.cc/

• International Conference on Learning Representations (ICLR) - https://iclr.cc/

• Association for the Advancement of Artificial Intelligence (AAAI) - https://www.aaai.org/

• Computer Vision Foundation (CVF) - https://openaccess.thecvf.com/menu

• Association for Computational Linguistics (ACL) [every year] - papers: https://aclanthology.org/venues/acl/

• Empirical Methods in Natural Language Processing (EMNLP) [every year] - papers: https://aclanthology.org/venues/emnlp/

• North American Chapter of the Association for Computational Linguistics (NAACL) [every year] - papers: https://aclanthology.org/venues/naacl/

• European Chapter of the Association for Computational Linguistics (EACL) [every year] - papers: https://aclanthology.org/venues/eacl/

• International Conference on Computational Linguistics (COLING) [alternate years (even)] - papers: https://aclanthology.org/venues/coling/

• ···
1.15 DL & NLP related top journals
• Journal of Computational Linguistics (JCL) - https://direct.mit.edu/coli/
• Transactions of the Association for Computational Linguistics (TACL) - https://transacl.org/index.php/tacl/index
• Information Retrieval Journal - https://link.springer.com/journal/10791
• ···
1.16 For recent updates on ML you can follow the arXiv
You can go to the Computer Science (CS) section on arXiv, where you will find different branches of CS (like ML, CL, AI, IR, etc.).
• ML - https://arxiv.org/list/cs.LG/recent

• CL - https://arxiv.org/list/cs.CL/recent

• AI - https://arxiv.org/list/cs.AI/recent

• IR - https://arxiv.org/list/cs.IR/recent

• ···
1.17 Suggested reading
Please go through the class slides.
Lecture 2
Introduction to Artificial neural network
2.1 Suggested reading
Please go through Chapter 1, up to Section 1.2.1.6, of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!) or Chapter 1 of Simon Haykin's book [7] (you can find it in our library, or you may find it online here, but I am not sure!).
Lecture 3
Perceptron learning algorithm
3.1 Suggested reading
The perceptron learning algorithm was proposed by Frank Rosenblatt in 1958 [12]. An online version of the original paper can be found here. He also wrote a detailed technical report [13] on the perceptron.

You can find the perceptron learning algorithm in any Machine Learning or Pattern Recognition book. Here are some references:

You can find the algorithm in Shalev-Shwartz et al.'s book [14], Chapter 9, Section 9.1.2 (Perceptron for Half-spaces); for the convergence proof, please go through Theorem 9.1 in [14].

In Mohri et al.'s book [10], see Chapter 8, Section 8.3.1 (Perceptron algorithm); for the convergence proof, see Theorem 8.8 in [10].

For a brief history and a picture of the original perceptron setup, see Bishop's book [3], Chapter 4, Section 4.1.7.
Perceptron Algorithm:

Input: N training examples (x_i, y_i) with labels y_i ∈ {−1, +1}, an initial weight vector w_0, and the number of epochs T.
Output: Final weight vector w.
1: w ← w_0
2: for t ← 1 to T do
3:     for i ← 1 to N do
4:         if y_i (w^T x_i) < 0 then
5:             w ← w + y_i x_i
6:         end if
7:     end for
8: end for
9: return w

3.2 Assignment

Implement the perceptron learning algorithm for the two-class synthetic data we discussed in the class, with the following settings:

• Consider a two-class classification problem and generate the dataset (100 points uniformly from each class) using the script from here: https://xlms.rkmvu.ac.in/pluginfile.php/4571/mod_assign/introattachment/0/gui_inputs.py?forcedownload=1
• Implement the perceptron learning algorithm discussed in the class with the following three initialisations (a minimal implementation sketch is given after the submission details below):

  – Randomly
  – With help from your dataset
  – With zeros

• Plot the results (your linear separators) with the data points for the above three cases.
Submission deadline: 15-01-2025 (11:59 PM)
Submission file format: your_ID_full_name_perceptron_2d_data_version_no.ipynb
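To make the pseudocode in Section 3.1 concrete, here is a minimal NumPy sketch of the perceptron loop. The synthetic blobs below are a stand-in for the data produced by gui_inputs.py, and the update test uses <= 0 (rather than < 0) so that the all-zeros initialisation also triggers updates:

    import numpy as np

    def perceptron(X, y, w0, T):
        """X: (N, d) inputs, y: labels in {-1, +1}, w0: (d,) initial weights."""
        w = w0.astype(float).copy()
        for _ in range(T):                 # epochs
            for xi, yi in zip(X, y):
                if yi * (w @ xi) <= 0:     # misclassified (or on the boundary)
                    w = w + yi * xi        # perceptron update
        return w

    # Stand-in data: two linearly separable 2D blobs, 100 points each.
    rng = np.random.default_rng(0)
    X = np.vstack([rng.uniform(0, 1, (100, 2)), rng.uniform(2, 3, (100, 2))])
    X = np.hstack([X, np.ones((200, 1))])  # constant-1 column for the bias
    y = np.array([-1] * 100 + [+1] * 100)

    w = perceptron(X, y, w0=np.zeros(3), T=10)
    print("learned separator:", w)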
Lecture 4
Introduction to different activation functions
4.1 Suggested reading
Please go through the class slides.
Please go through Chapter 2 of Ovidiu Calin's book [4] (you can find it in our library). Also, you can go through Chapter 4, Section 4.4 of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!).
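As a quick reference for the activation functions named in the syllabus (sigmoid, tanh, ReLU, ELU), here is a small NumPy sketch; the ELU default α = 1.0 is the usual convention, assumed here:

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))   # squashes to (0, 1)

    def tanh(x):
        return np.tanh(x)                 # squashes to (-1, 1)

    def relu(x):
        return np.maximum(0.0, x)         # zero for negative inputs

    def elu(x, alpha=1.0):
        return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))  # smooth ReLU variant

    x = np.linspace(-3.0, 3.0, 7)
    for f in (sigmoid, tanh, relu, elu):
        print(f.__name__, np.round(f(x), 3))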
Lecture 5
Introduction to loss function and gradients
5.1 Suggested reading
Please go through Chapter 2 of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!) or Chapter 5 of Simon J.D. Prince's book [11] (you may find it online here, but I am not sure!).
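Alongside the reading, a small sketch of two representative losses and their gradients with respect to the prediction; the choice of squared error and softmax cross-entropy as examples is mine, not the books':

    import numpy as np

    def mse(y_hat, y):
        loss = 0.5 * (y_hat - y) ** 2
        grad = y_hat - y                 # d(loss)/d(y_hat)
        return loss, grad

    def softmax_cross_entropy(scores, target):
        """scores: (K,) raw logits; target: index of the true class."""
        p = np.exp(scores - scores.max())
        p = p / p.sum()                  # softmax probabilities
        loss = -np.log(p[target])
        grad = p.copy()
        grad[target] -= 1.0              # d(loss)/d(scores) = p - one_hot(target)
        return loss, grad

    print(mse(0.8, 1.0))
    print(softmax_cross_entropy(np.array([2.0, 0.5, -1.0]), target=0))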
Lecture 6
Introduction to backpropagation
6.1 Suggested reading
Please go through Chapter 2, Section 2.4 of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!).
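To complement the reading, here is a hand-derived backward pass for a tiny two-layer network (sigmoid hidden layer, squared error loss); the shapes and values are illustrative:

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=3)              # input
    y = 1.0                             # target
    W1 = rng.normal(size=(4, 3))        # first-layer weights
    w2 = rng.normal(size=4)             # second-layer weights

    # Forward pass
    z = W1 @ x
    h = 1.0 / (1.0 + np.exp(-z))        # sigmoid hidden activations
    y_hat = w2 @ h
    loss = 0.5 * (y_hat - y) ** 2

    # Backward pass (chain rule, layer by layer)
    d_yhat = y_hat - y                  # dL/dy_hat
    d_w2 = d_yhat * h                   # dL/dw2
    d_h = d_yhat * w2                   # dL/dh
    d_z = d_h * h * (1.0 - h)           # through the sigmoid derivative
    d_W1 = np.outer(d_z, x)             # dL/dW1
    print(d_w2, d_W1, sep="\n")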
Lecture 7
Introduction to parameter initialisation and update rules
7.1 Suggested reading
Please go through the class slides.

Please go through Chapter 2, Section 2.7 and Chapter 4, Section 4.5 of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!).
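As a companion to the update rules discussed in the class, here is a short sketch of plain SGD and Adam minimising f(w) = ||w||^2; the hyper-parameter values are common defaults, assumed here:

    import numpy as np

    def sgd_step(w, grad, lr=0.1):
        return w - lr * grad                    # vanilla gradient step

    def adam_step(w, grad, m, v, t, lr=0.01, b1=0.9, b2=0.999, eps=1e-8):
        m = b1 * m + (1 - b1) * grad            # first-moment estimate
        v = b2 * v + (1 - b2) * grad ** 2       # second-moment estimate
        m_hat = m / (1 - b1 ** t)               # bias corrections
        v_hat = v / (1 - b2 ** t)
        return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v

    w = np.array([1.0, -2.0])
    m = np.zeros_like(w)
    v = np.zeros_like(w)
    for t in range(1, 201):
        grad = 2.0 * w                          # gradient of ||w||^2
        w, m, v = adam_step(w, grad, m, v, t)
    print(w)                                    # approaches the minimum at 0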
Project proposal submission deadline: 22-02-2025 (11:59 PM)
7.2 Assignment-2
Implement a simple two-layer neural network to classify handwritten digits in the MNIST dataset with the following settings:

• Please follow the notebook and fill in the blanks (TODO) in first_nn_exc.py

• Consider the different initialisation strategies discussed in the class.

• Implement the different update rules discussed in the class.

• Search for the optimum hyper-parameters (learning rate, number of hidden layers) through a grid search.

Submission deadline: 31-01-2025 (11:59 PM)
Submission file format: your_ID_full_name_2lr_net_mnist_data_version_no.ipynb
Lecture 8
Convolutional Networks-1
8.1 Suggested reading
Please go through the class slides.
Please go through Chapter 2, Section 2.7 and Chapter 4, Section 4.5 of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!) OR you can check Chapter 9 of Ian Goodfellow et al.'s book [6]. Chapter 9 is freely downloadable from here.
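To make the kernel-and-convolution idea concrete, here is a naive "valid" 2D convolution in NumPy (strictly, cross-correlation, as implemented in most deep learning libraries); the example filter is illustrative:

    import numpy as np

    def conv2d(image, kernel):
        """Naive 'valid' 2D cross-correlation of a single-channel image."""
        H, W = image.shape
        kH, kW = kernel.shape
        out = np.zeros((H - kH + 1, W - kW + 1))
        for i in range(out.shape[0]):
            for j in range(out.shape[1]):
                # dot product of the kernel with one image patch
                out[i, j] = np.sum(image[i:i + kH, j:j + kW] * kernel)
        return out

    image = np.arange(25, dtype=float).reshape(5, 5)
    kernel = np.array([[1.0, -1.0]])      # horizontal difference filter
    print(conv2d(image, kernel))          # constant -1s on this ramp image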
Lecture 9
Convolutional Networks-2
9.1 Suggested reading
Please go through the class slides.
Please go through Chapter 2, Section 2.7 and Chapter 4, Section 4.5 of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!) OR you can check Chapter 9 of Ian Goodfellow et al.'s book [6]. Chapter 9 is freely downloadable from here. You can also check Christopher M. Bishop and Hugh Bishop's book [2]; Chapter 10 is freely readable online from here.
Lecture 10
Convolutional Networks-3
10.1 Suggested reading
Please go through the class slides.
Please go through Chapter 2, Section 2.7 and Chapter 4, Section 4.5 of Charu Aggarwal's book [1] (you can find it in our library, or you may find it online here, but I am not sure!) OR you can check Chapter 9 of Ian Goodfellow et al.'s book [6]. Chapter 9 is freely downloadable from here. You can also check Christopher M. Bishop and Hugh Bishop's book [2]; Chapter 10 is freely readable online from here.

For backpropagation, you can check this.
10.2 Assignment-3
Implement a CNN to classify different objects in image data. The exact problem will be given by the TA, and it will be evaluated on the spot.

Submission deadline: 21-02-2025 (11:59 PM)
Submission file format: your_ID_full_name_cnn_version_no.ipynb
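As a starting point for this kind of assignment, here is a minimal PyTorch CNN sketch; the layer sizes, the 32x32 RGB input, and the 10-class output are assumptions, not the TA's specification:

    import torch
    import torch.nn as nn

    class SmallCNN(nn.Module):
        def __init__(self, num_classes=10):
            super().__init__()
            self.features = nn.Sequential(
                nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),                  # 32x32 -> 16x16
                nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),                  # 16x16 -> 8x8
            )
            self.classifier = nn.Linear(32 * 8 * 8, num_classes)

        def forward(self, x):
            x = self.features(x)
            return self.classifier(x.flatten(1))  # logits, one per class

    model = SmallCNN()
    out = model(torch.randn(4, 3, 32, 32))        # a dummy batch of 4 images
    print(out.shape)                              # torch.Size([4, 10])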
Lecture 11
Introduction to NLP, language model: N-gram
11.1 Suggested reading
Please go through the class slides.

For the N-gram language model, you can go through Jurafsky and Martin's book [8], Chapter 3 (N-gram Language Models) [online]. For further interest, you can look into the papers referred to in Chapter 3.
11.2 Homework
Implement an N-gram model on the toy corpus discussed in the class.
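To get started, here is a tiny bigram (N = 2) counting sketch with maximum-likelihood estimates; the three-sentence corpus below is a stand-in for the toy corpus from the class:

    from collections import Counter, defaultdict

    corpus = ["<s> I am Sam </s>", "<s> Sam I am </s>", "<s> I like NLP </s>"]

    bigram_counts = defaultdict(Counter)
    for sentence in corpus:
        tokens = sentence.split()
        for w1, w2 in zip(tokens, tokens[1:]):
            bigram_counts[w1][w2] += 1    # count(w1 followed by w2)

    def p(w2, w1):
        """Maximum-likelihood estimate of P(w2 | w1)."""
        total = sum(bigram_counts[w1].values())
        return bigram_counts[w1][w2] / total if total else 0.0

    print(p("am", "I"))   # 2/3: "I" is followed by "am" twice and "like" once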
For the Bag-of-Words model, you can go through Jacob Eisenstein's book [5], Chapter 4 (Linguistic applications of classification) [online]. For further interest, I encourage you to go through the paper by Pang et al. titled Thumbs up?: sentiment classification using machine learning techniques and the paper by Zellig S. Harris titled Distributional Structure.
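For a hands-on feel of the Bag-of-Words representation, a quick sketch with scikit-learn (listed in Section 1.12); the two example sentences are illustrative:

    from sklearn.feature_extraction.text import CountVectorizer

    docs = ["the movie was great", "the movie was terrible"]
    vectorizer = CountVectorizer()
    X = vectorizer.fit_transform(docs)         # document-term count matrix
    print(vectorizer.get_feature_names_out())  # the learned vocabulary
    print(X.toarray())                         # one count vector per document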
Lecture 12
Word embeddings: vector semantics, neural word embedding
12.1 Suggested reading
First go through word representation in vectorised form in Jurafsky and Martin's book [8], Chapter 6 (Vector Semantics and Embeddings) [online]. For word2vec, please go through the original paper titled Efficient estimation of word representations in vector space [9]. A good documentation of the word2vec parameters is the paper titled word2vec Parameter Learning Explained. An online demo: https://ronxin.github.io/wevi/. A word2vec test demo notebook is here: ss_word2vec_demo.ipynb
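To complement the demo notebook, here is a one-step skip-gram forward pass (softmax version) in NumPy; the vocabulary size, dimensions, and word indices are toy assumptions:

    import numpy as np

    V, d = 5, 3                         # vocabulary size, embedding dimension
    rng = np.random.default_rng(0)
    W_in = rng.normal(scale=0.1, size=(V, d))    # input (centre) embeddings
    W_out = rng.normal(scale=0.1, size=(V, d))   # output (context) embeddings

    centre, context = 2, 4              # indices of a (centre, context) pair
    h = W_in[centre]                    # hidden layer = centre word embedding
    scores = W_out @ h                  # one score per vocabulary word
    p = np.exp(scores) / np.exp(scores).sum()    # softmax over the vocabulary
    loss = -np.log(p[context])          # cross-entropy for the true context word
    print(loss)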
12.2 Homework
Derive the gradient of cross-entropy loss with respect to all the parameters
in the word2vec model discussed in the class.
Bibliography
[1] Charu C. Aggarwal. Neural Networks and Deep Learning: A Textbook. Springer Cham, 2nd edition, 2023.
[2] Christopher Bishop and Hugh Bishop. Deep Learning: Foundations and Concepts. Springer Cham, 1st edition, 2023.
[3] Christopher M. Bishop. Pattern Recognition and Machine Learning. Springer, 1st edition, 2006.
[4] Ovidiu Calin. Deep Learning Architectures: A Mathematical Approach. Springer Cham, 1st edition, 2020.
[5] Jacob Eisenstein. Introduction to Natural Language Processing. MIT Press, 1st edition, 2019.
[6] Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 1st edition, 2016.
[7] Simon Haykin. Neural Networks and Learning Machines. Pearson, 3rd edition, 2009.
[8] Dan Jurafsky and James H. Martin. Speech and Language Processing. Draft, 3rd edition, 2023.
[9] Tomas Mikolov, Kai Chen, Greg S. Corrado, and Jeffrey Dean. Efficient estimation of word representations in vector space, 2013.

[10] Mehryar Mohri, Afshin Rostamizadeh, and Ameet Talwalkar. Foundations of Machine Learning. MIT Press, 2nd edition, 2018.
[11] Simon J.D. Prince. Understanding Deep Learning. MIT Press, 1st edi-
tion, 2023.
[12] Frank Rosenblatt. The perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review, 65(6):386-408, 1958.
[13] Frank Rosenblatt. Principles of neurodynamics: Perceptrons and the theory of brain mechanisms. Technical report, Cornell Aeronautical Laboratory, 1961.
[14] Shai Shalev-Shwartz and Shai Ben-David. Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press, 1st edition, 2014.