Lecture 1: Introduction to deep learning
Practical deep learning
About this course
• Introduction to deep learning
• basics of ML assumed
• mostly high-school math
• much of the theory and many details skipped
• 1st day: lectures + small-scale exercises using notebooks.csc.fi
• 2nd day: experiments using GPUs at Puhti-AI
• Slides at: https://tinyurl.com/yyej6rxl
• Other materials at GitHub: https://github.com/csc-training/intro-to-dl
• Gitter chat at: https://gitter.im/csc_training/intro-to-dl
• Focus on text and image classification, no fancy stuff
• Using Python, TensorFlow 2 / Keras, and PyTorch
Further resources
• This course is largely “inspired by”: “Deep Learning with Python” by François Chollet
• Recommended textbook: “Deep learning” by Goodfellow, Bengio, Courville
• Lots of further material available online, e.g.:
  • http://cs231n.stanford.edu/
  • http://course.fast.ai/
  • https://developers.google.com/machine-learning/crash-course/
  • www.nvidia.com/dlilabs
  • http://introtodeeplearning.com/
  • https://github.com/oxford-cs-deepnlp-2017/lectures
  • https://jalammar.github.io/
• Academic courses
What is artificial intelligence?
Artificial intelligence is the ability of a computer to perform
tasks commonly associated with intelligent beings.
What is machine learning?
Machine learning is the study of algorithms that learn from examples and experience, instead of relying on hard-coded rules, and can make predictions on new data.
What is deep learning?
Deep learning is a subfield of machine learning focusing on
learning data representations as successive layers of
increasingly meaningful representations.
Image from https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep-learning-ai/
“Traditional” machine learning:
[figure: handcrafted features → learned classifier → “cat”]
Deep, “end-to-end” learning:
[figure: learned low-level features → learned mid-level features → learned high-level features → learned classifier → “cat”]
From: Wang & Raj: On the Origin of Deep Learning (2017)
Main types of machine learning
• Supervised learning (e.g., learning to label images as “cat” or “dog”)
• Unsupervised learning
• Self-supervised learning
• Reinforcement learning
Image from https://arxiv.org/abs/1710.10196
Animation from https://yanpanlau.github.io/2016/07/10/FlappyBird-Keras.html
Fundamentals of machine learning
Data
• Humans learn by observation and unsupervised learning
  • model of the world / common-sense reasoning
• Machine learning needs lots of (labeled) data to compensate
Data
• Tensors: generalization of matrices to n dimensions (n is also called the rank, order, or degree)
  • 1D tensor: vector
  • 2D tensor: matrix
  • 3D, 4D, 5D tensors
  • numpy.ndarray(shape, dtype) (see the sketch below)
• Training – validation – test split (+ adversarial test)
• Minibatches
  • small sets of input data used at a time
  • usually processed independently
Image from: https://arxiv.org/abs/1707.08945
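To make the terms concrete, a minimal numpy sketch (the random numbers and array sizes are made-up placeholders for real data):

```python
import numpy as np

# 1D tensor (vector) and 2D tensor (matrix)
v = np.zeros((10,), dtype=np.float32)         # shape (10,)
m = np.zeros((10, 20), dtype=np.float32)      # shape (10, 20)

# 4D tensor: 1000 grayscale 28x28 images with one channel
data = np.random.rand(1000, 28, 28, 1).astype(np.float32)
print(data.ndim, data.shape, data.dtype)      # 4 (1000, 28, 28, 1) float32

# training - validation - test split
train, val, test = data[:700], data[700:850], data[850:]

# minibatches: small sets of samples, processed one set at a time
batch_size = 32
for i in range(0, len(train), batch_size):
    batch = train[i:i + batch_size]           # shape (32, 28, 28, 1)
```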
Model – learning/training – inference
http://playground.tensorflow.org/
• parameters 𝜃 (learned during training) and hyperparameters (set beforehand)
Optimization
• Mathematical optimization: “the selection of a best element (with regard to some criterion) from some set of available alternatives” (Wikipedia)
• Main types: finite-step, iterative, heuristic
• Learning as an optimization problem
  • cost function: loss + regularization
Image by Rebecca Wilson (originally posted to Flickr as Vicariously) [CC BY 2.0], via Wikimedia Commons
Optimization
Image from: Li et al. “Visualizing the Loss Landscape of Neural Nets”, arXiv:1712.09913
Gradient descent
• Derivative and minima/maxima of functions
• Gradient: the vector of partial derivatives of a multivariable function
• Gradient descent: 𝜃 ← 𝜃 − 𝜂∇𝜃J(𝜃), where 𝜂 is the learning rate
• (Mini-batch) stochastic gradient descent and its variants (see the sketch below)
Image from: https://towardsdatascience.com/gradient-descent-algorithm-and-its-variants-10f652806a3
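To make the update rule concrete, a minimal numpy sketch of mini-batch SGD on a toy linear-regression problem (all constants and names are made up for the example):

```python
import numpy as np

# Toy data: y = 3x + 2 plus noise
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=100)
y = 3 * x + 2 + 0.1 * rng.normal(size=100)

w, b = 0.0, 0.0          # parameters theta
lr = 0.1                 # learning rate eta
batch_size = 10

for epoch in range(100):
    idx = rng.permutation(len(x))            # shuffle for stochasticity
    for i in range(0, len(x), batch_size):
        xb, yb = x[idx[i:i+batch_size]], y[idx[i:i+batch_size]]
        err = (w * xb + b) - yb              # prediction error
        grad_w = 2 * np.mean(err * xb)       # dL/dw for MSE loss
        grad_b = 2 * np.mean(err)            # dL/db
        w -= lr * grad_w                     # theta <- theta - eta * gradient
        b -= lr * grad_b

print(w, b)  # should approach 3 and 2
```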
Over- and underfitting, generalization, regularization
• Models with lots of parameters can easily overfit to the training data
• Generalization: the quality of an ML model is measured on new, unseen samples
• Regularization: any method* to prevent overfitting
  • simplicity, sparsity, dropout, early stopping (see the sketch below)
  • *) other than adding more data
By Chabacano [GFDL or CC BY-SA 4.0], from Wikimedia Commons
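As an illustration of dropout and early stopping in Keras, a minimal sketch (x_train, y_train, x_val, y_val are assumed placeholders; the input size 100 is arbitrary):

```python
from tensorflow import keras
from tensorflow.keras import layers

# a small classifier with a dropout layer for regularization
model = keras.Sequential([
    layers.Dense(64, activation="relu", input_shape=(100,)),
    layers.Dropout(0.5),                 # randomly zero 50% of activations in training
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="rmsprop", loss="binary_crossentropy")

# early stopping: end training when validation loss stops improving
early_stop = keras.callbacks.EarlyStopping(monitor="val_loss", patience=5)

# x_train, y_train, x_val, y_val are assumed to exist:
# model.fit(x_train, y_train, validation_data=(x_val, y_val),
#           epochs=100, callbacks=[early_stop])
```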
Deep learning
Anatomy of a deep neural network
• Layers
• Input data and targets
• Loss function
• Optimizer
Layers
• Data processing modules
• Many different kinds exist
  • densely connected
  • convolutional
  • recurrent
  • pooling, flattening, merging, normalization, etc.
• Input: one or more tensors; output: one or more tensors
• Usually have a state, encoded as weights
  • learned, initially random
• When combined, layers form a network or a model (see the sketch below)
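For illustration, a minimal tf.keras sketch instantiating a few common layer kinds (the layer sizes and the input size 100 are arbitrary):

```python
import numpy as np
from tensorflow.keras import layers

dense = layers.Dense(64, activation="relu")   # densely connected
conv = layers.Conv2D(32, (3, 3))              # convolutional
lstm = layers.LSTM(32)                        # recurrent
pool = layers.MaxPooling2D((2, 2))            # pooling
flat = layers.Flatten()                       # flattening

# input tensor in, output tensor out; the weights are created on first call
out = dense(np.zeros((1, 100), dtype="float32"))
print(out.shape)                              # (1, 64)
print([w.shape for w in dense.get_weights()]) # [(100, 64), (64,)], kernel randomly initialized
```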
Input data and targets
• The network maps the input data X to predictions Y′
• During training, the predictions Y′ are compared to the true targets Y using the loss function
[figure: example images with targets “cat” and “dog”]
Loss function
• The quantity to be minimized (optimized) during training
  • the only thing the network cares about
  • there might also be other metrics you care about
• Common tasks have “standard” loss functions (see the sketch below):
  • mean squared error for regression
  • binary cross-entropy for two-class classification
  • categorical cross-entropy for multi-class classification
  • etc.
• https://lossfunctions.tumblr.com/
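For concreteness, a minimal numpy sketch of these standard losses (the formulas are the usual textbook definitions; the inputs are made-up examples):

```python
import numpy as np

def mean_squared_error(y_true, y_pred):
    return np.mean((y_true - y_pred) ** 2)

def binary_crossentropy(y_true, y_pred, eps=1e-7):
    # y_pred holds probabilities of the positive class
    y_pred = np.clip(y_pred, eps, 1 - eps)    # avoid log(0)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

def categorical_crossentropy(y_true, y_pred, eps=1e-7):
    # y_true is one-hot; each row of y_pred is a probability distribution
    y_pred = np.clip(y_pred, eps, 1.0)
    return -np.mean(np.sum(y_true * np.log(y_pred), axis=-1))

print(binary_crossentropy(np.array([1., 0.]), np.array([0.9, 0.2])))
```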
Optimizer
• How to update the weights based on the loss function
• Learning rate (+ scheduling)
• Stochastic gradient descent, momentum, and their variants
• RMSProp is usually a good first choice
• More info: http://ruder.io/optimizing-gradient-descent/
Animation from: https://imgur.com/s25RsOr
Anatomy of a deep neural network
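Putting the four ingredients together, a minimal tf.keras sketch (random placeholders stand in for a real dataset; all sizes are arbitrary):

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# input data and targets (random placeholders for a real dataset)
x_train = np.random.rand(1000, 20).astype("float32")
y_train = np.random.randint(0, 2, size=(1000,))

# layers, combined into a model
model = keras.Sequential([
    layers.Dense(32, activation="relu", input_shape=(20,)),
    layers.Dense(1, activation="sigmoid"),
])

# loss function and optimizer
model.compile(optimizer=keras.optimizers.RMSprop(learning_rate=1e-3),
              loss="binary_crossentropy",
              metrics=["accuracy"])

# training: minibatch gradient descent on the loss
model.fit(x_train, y_train, batch_size=32, epochs=5, validation_split=0.2)
```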
Deep learning frameworks
• Actually tools for defining static or dynamic general-purpose computational graphs
• Automatic differentiation (see the sketch below)
• Seamless CPU / GPU usage
  • multi-GPU, distributed
• Python/numpy or R interfaces
  • instead of C, C++, CUDA or HIP
• Open source
[figure: example computational graph with + and ✕ nodes over inputs x, y, 5]
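A minimal TensorFlow 2 sketch of automatic differentiation on a tiny computational graph (the expression is made up to echo the figure's x, y, 5 inputs):

```python
import tensorflow as tf

x = tf.Variable(2.0)
y = tf.Variable(3.0)

with tf.GradientTape() as tape:
    f = (x + y) * (y + 5.0)        # a small computational graph

dx, dy = tape.gradient(f, [x, y])  # gradients computed automatically
print(dx.numpy(), dy.numpy())      # df/dx = y+5 = 8, df/dy = (y+5)+(x+y) = 13
```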
Deep learning frameworks
[figure: the deep learning software stack, top to bottom]
  • high-level APIs: Lasagne, Keras, TF Estimator, torch.nn, Gluon
  • frameworks: Theano, TensorFlow, CNTK, PyTorch, MXNet, Caffe
  • low-level libraries: CUDA, cuDNN; MKL, MKL-DNN; HIP, MIOpen
  • hardware: GPUs, CPUs
• Keras is a high-level neural networks API
  • we will use TensorFlow as the compute backend
  • included in TensorFlow 2 as tf.keras
  • https://keras.io/ , https://www.tensorflow.org/guide/keras
• PyTorch is (see the sketch below):
  • a GPU-based tensor library
  • an efficient library for dynamic neural networks
  • https://pytorch.org/
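A minimal PyTorch sketch touching all three points (the layer sizes are arbitrary):

```python
import torch
from torch import nn

# seamless CPU / GPU usage: pick the device at runtime
device = "cuda" if torch.cuda.is_available() else "cpu"

# a GPU-capable tensor with automatic differentiation enabled
x = torch.randn(32, 20, device=device, requires_grad=True)

# a small network defined with torch.nn
model = nn.Sequential(
    nn.Linear(20, 32),
    nn.ReLU(),
    nn.Linear(32, 1),
).to(device)

out = model(x)
out.sum().backward()         # gradients via autograd
print(x.grad.shape)          # torch.Size([32, 20])
```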