L01 - Introduction-to-ML
Roger Grosse
This series of readings forms the lecture notes for the course CSC421,
“Neural Networks and Deep Learning,” for undergraduates at the University
of Toronto. I’m aiming for it also to function as a stand-alone mini-textbook
for self-directed learners and for students at other universities. These notes
are aimed at students who have some background in basic calculus, probabil-
ity theory, and linear algebra, but possibly no prior background in machine
learning.
1 Motivation
1.1 Why machine learning?
Think about some of the things we do effortlessly on a day-to-day basis:
visually recognize people, places and things, pick up objects, understand
spoken language, and so on. How would you program a machine to do these
things? Unfortunately, it’s hard to give a step-by-step program, since we
have very little introspective awareness of the workings of our minds. How
do you recognize your best friend? Exactly which facial features do you
pick up on? AI researchers tried for decades to come up with computational
procedures for these sorts of tasks, and it proved frustratingly difficult.
Machine learning takes a different approach: collect lots of data, and
have an algorithm automatically figure out a good behavior from the data.
If you’re trying to write a program to distinguish different categories of
objects (tree, dog, etc.), you might first collect a dataset of images of each
kind of object, and then use a machine learning algorithm to train a model
(such as a neural network) to classify an image as one category or another.
Maybe it will learn to see in a way analogous to the human visual system,
or maybe it will come up with a different approach altogether. Either way,
the whole process can be much easier than specifying everything by hand.
Aside from being easier, there are lots of other reasons we might want
to use machine learning to solve a given problem:
• We may want an algorithm to behave autonomously for privacy or
fairness reasons, such as with ranking search results or targeting ads.
Here are just a few important applications where machine learning al-
gorithms are regularly deployed:
• Detecting credit card fraud
• Determining when to apply a C-section
• Transcribing human speech
• Recognizing faces
• Robots learning complex behaviors
• There are powerful software packages like Caffe, Theano, Torch, and
TensorFlow, which allow us to quickly implement sophisticated learn-
ing algorithms.
• Many of the important algorithms are much simpler to explain, com-
pared with other subfields of machine learning. This makes it possible
for undergraduates to quickly get up to speed on state-of-the-art tech-
niques in the field.
This class is very unusual among undergrad classes, in that it covers
modern research techniques, i.e. algorithms introduced in the last 5 years.
It’s pretty amazing that with less than a page of code, we can build learning
algorithms more powerful than the best ones researchers had come up with
as of 5 years ago.
In fact, these software packages make neural nets deceptively easy. One
might wonder, if you can implement a neural net in TensorFlow using a
handful of lines of code, why do we need a whole class on the subject?
The answer is that the algorithms generally won’t work perfectly the first
time. Diagnosing and fixing the problems requires careful detective work
and a sophisticated understanding of what’s going on beneath the hood.
In this class, we’ll work from the bottom up: we’ll derive the algorithms
mathematically, implement them from scratch, and only then look at the
out-of-the-box implementations. This will help us build up the depth of
understanding we need to reason about how an algorithm is behaving.
2.1 Supervised learning
The majority of this course will focus on supervised learning. This is the
best-understood type of machine learning, because (compared with unsupervised and reinforcement learning) it is much easier to give supervised learning problems a mathematically precise formulation that matches what one is trying to achieve. In general, one defines a task, where the algorithm's
goal is to train a model which takes an input (such as an image) and
predicts a target (such as the object category). One collects a dataset
consisting of pairs of inputs and labels (i.e. true values of the target). A
subset of the data, called the training set, is used to train the model, and
a separate subset, called the test set, is used to measure the algorithm’s
performance. There are a lot of highly effective and broadly applicable su-
pervised learning algorithms, many of which will be covered in this course.
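To make this setup concrete, here is a minimal sketch in Python/NumPy; the random data and the trivial "most common label" model are hypothetical stand-ins for a real dataset and learning algorithm:

    import numpy as np

    # Hypothetical dataset: 1000 inputs of dimension 256 (e.g. flattened 16 x 16
    # images), each paired with an integer label (e.g. a digit class 0-9).
    rng = np.random.default_rng(0)
    inputs = rng.normal(size=(1000, 256))
    labels = rng.integers(0, 10, size=1000)

    # Training set: used to fit the model. Test set: used only to measure performance.
    train_x, test_x = inputs[:800], inputs[800:]
    train_y, test_y = labels[:800], labels[800:]

    def train(train_x, train_y):
        # Placeholder "model": always predict the most common training label.
        # A real learning algorithm (e.g. a neural net) would go here.
        most_common = np.bincount(train_y).argmax()
        return lambda x: np.full(len(x), most_common)

    model = train(train_x, train_y)
    predictions = model(test_x)

    # Performance on held-out data: the fraction of test labels predicted correctly.
    accuracy = np.mean(predictions == test_y)
    print(f"test accuracy: {accuracy:.2f}")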
For several decades, image classification has been perhaps the pro-
totypical application of neural networks. In the late 1980s, the US Postal
Service was interested in automatically reading handwritten zip codes, so
they collected 9,298 examples of handwritten digits (0-9), given as 16 × 16
images, and labeled each one; the task is to predict the digit class from
the image. This dataset is now known as the USPS Dataset1 . In the ter-
minology of supervised learning, we say that the input is the image, and
the target is the digit class. By the late 1990s, neural networks were good
enough at this task that they became regularly used to sort letters.
In the 1990s, researchers collected a similar but larger handwritten digit
dataset called MNIST2 ; for decades, MNIST has served as the “fruit fly” of
neural network research. I.e., even though handwritten digit classification
is now considered too easy a problem to be of practical interest, MNIST
has been used for almost two decades to benchmark neural net learning
algorithms. Amazingly, this classic dataset continues to yield algorithmic
insights which generalize to challenging problems of more practical interest.
A more challenging task is to classify full-size images into object cat-
egories, a task known as object recognition. The ImageNet dataset3
consists of 14 million images of nearly 22,000 distinct object categories. A
(still rather large) subset of this dataset, containing 1.2 million images in
1000 object categories, is currently one of the most important benchmarks
for computer vision algorithms; this task is known as the ImageNet Large
Scale Visual Recognition Challenge (ILSVRC). Since 2012, all of the best-
performing algorithms have been neural networks. Recently, progress on
the ILSVRC has been extremely rapid, with the error rate4 dropping from
25.7% to 5.7% over the span of a few years!
All of the above examples concerned image classification, where the goal
is to predict a discrete category for each image. A closely related task is object detection, where the goal is to identify all of the objects present in an image, as well as their locations. I.e., the input is an image, and the
target is a listing of object categories together with their bounding boxes.
1 http://statweb.stanford.edu/~tibs/ElemStatLearn/data.html
2 http://yann.lecun.com/exdb/mnist/
3 http://www.image-net.org/
4 In particular, the top-5 error rate; the algorithm predicts 5 object categories, and gets it right if any of the 5 is correct.
Other variants include localization, where one is given a list of object
categories and has to predict their locations, and semantic segmentation,
where one tries to label each pixel of an image as belonging to an object
category. There are a huge variety of different supervised learning problems
related to image understanding, depending on exactly what one is hoping
to achieve. The variety of tasks can be bewildering, but fortunately we can
approach most of them using very similar principles.
Neural nets have been applied in lots of areas other than vision. Another
important problem domain is language. Consider, for example, the problem
of machine translation. The task is to translate a sentence from one
language (e.g. French) to another language (e.g. English). One has available
a large corpus of French sentences coupled with their English translations; a
good example is the proceedings of the Canadian Parliament. Observe that
this task is more complex than image classification, in that the target is an
entire sentence. Observe also that there generally won’t be a unique best
translation, so it may be preferable for the algorithm to return a probability
distribution over possible translations, rather than a single translation. This
ambiguity also makes evaluation difficult, since one needs to distinguish
almost-correct translations from completely incorrect ones.
The general category of supervised learning problem where the inputs
and targets are both sequences is known as sequence-to-sequence learn-
ing. The sequences need not be of the same type. An important example
is speech recognition, where one is given a speech waveform and wants
to produce a transcription of what was said. Neural networks led to dra-
matic advances in speech recognition around 2010, and form the basis of
all of the modern systems. Caption generation is a task which combines
vision and language understanding; here the task is to take an image and
return a textual description of the image. The most successful approaches
are based on neural nets. Caption generation is far from a solved problem,
and the systems can be fun to experiment with, not least because of their
entertaining errors.5
5 or adversary; this adversarial setting is beyond the scope of this class.
However, single-player games can be formulated as reinforcement learning
problems. For instance, we will look at the example of training an agent
to play classic Atari games. The agent observes the pixels on the screen,
has a set of actions corresponding to the controller buttons, and receives
rewards corresponding to the score of the game. Neural net algorithms have
outperformed humans on many games, in the sense of being able to achieve
a high score in a short period of time.
[Figure: a single unit with inputs x_1, x_2, x_3, weights w_1, w_2, w_3, a bias b, and a nonlinearity, computing the output y = \phi\left( b + \sum_i x_i w_i \right).]
The scalar value b, called a bias, determines the neuron's activation in the absence of inputs. The pre-activation z = b + \sum_i w_i x_i is passed through a nonlinearity φ (also called an activation function) to compute the activation a = φ(z).
Examples of nonlinearities include the logistic sigmoid

    \phi(z) = \frac{1}{1 + e^{-z}}

and linear rectification

    \phi(z) = \begin{cases} z & \text{if } z > 0 \\ 0 & \text{if } z \le 0. \end{cases}
That’s it. That’s all that our idealized neurons do. Note that the whole
idea of a continuous-valued activation is biologically unrealistic, since a real
neuron’s action potentials are an all-or-nothing phenomenon: either they
happen or they don’t, and they do not vary in strength. The continuous-
valued activation is sometimes thought of as representing a “firing rate,” but
mostly we just ignore the whole issue and don’t even think about the rela-
tionships with biology. From now on, we’ll refer to these idealized neurons
using the more scientifically neutral term units, rather than neurons.
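As a minimal sketch of the computation described above (in NumPy, with purely illustrative numbers), a single unit does the following:

    import numpy as np

    def logistic(z):
        # Logistic sigmoid: squashes the pre-activation into (0, 1).
        return 1.0 / (1.0 + np.exp(-z))

    def relu(z):
        # Linear rectification: passes positive values through, zeros out the rest.
        return np.maximum(0.0, z)

    x = np.array([0.5, -1.0, 2.0])    # inputs x_1, x_2, x_3
    w = np.array([0.1, 0.4, -0.3])    # weights w_1, w_2, w_3
    b = 0.2                           # bias

    z = b + np.dot(w, x)              # pre-activation z = b + sum_i w_i x_i
    a = logistic(z)                   # activation a = phi(z); relu(z) works the same way
    print(z, a)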
If the relationship with biology seems strained, it gets even worse when
we talk about learning, i.e. adapting the weights of the neurons. Most
modern neural networks are trained using a procedure called backprop-
agation, where each neuron propagates error signals backwards through
its incoming connections. Nothing analogous has been observed in actual
biological neurons. There have been some creative proposals for how bio-
logical neurons might implement something like backpropagation, but for
the most part we just ignore the issue of whether our neural nets are bio-
logically realistic, and simply try to get the best performance we can out of
the tools we have. (There is a separate field called theoretical neuroscience,
which builds much more accurate models of neurons, towards the goal of
understanding better how the brain works. This field has produced lots of
interesting insights, and has achieved accurate quantitative models of some
neural systems, but so far there doesn’t appear to be much practical benefit
to using more realistic neuronal models in machine learning systems.)
However, neural networks do share one important commonality with the
brain: they consist of a very large number of computational units, each of
which performs a rather simple set of operations, but which in aggregate
produce very sophisticated and complex behaviors. Most of the models
we’ll discuss in this course are simply large collections of units, each of
which computes a linear function followed by a nonlinearity.
Another analogy with the brain is worth pointing out: the brain is or-
ganized into hierarchies of processing, where different brain regions encode
information at different levels of abstraction. Information processing starts
at the retina of the eye, where neurons compute simple center-surround
functions of their inputs. Signals are passed to the primary visual cor-
tex, where (to vastly oversimplify things) cells detect simple image features
such as edges. Information is passed through several additional “layers” of
processing, each one taking place in a different brain region, until the in-
formation reaches areas of the cortex which encode things at a high level of
abstraction. For instance, individual neurons in the infero-temporal cortex
have been shown (again, vastly oversimplifying) to encode the identities of
objects.
In summary, visual information is processed in a series of layers of in-
creasing abstraction. This inspired machine learning researchers to build
neural networks which are many layers deep, in hopes that they would
learn analogous representations where higher layers represent increasingly
abstract features. In the last 5 years or so, very deep networks have indeed
been found to achieve startlingly good performance on a wide variety of
problems in vision and other application areas; for this reason, the research
area of neural networks is often referred to as deep learning. There is
some circumstantial evidence that deep networks learn hierarchical repre-
sentations, but this is notoriously difficult to analyze rigorously.
4 Software
There are a lot of software tools that make it easy to build powerful and
sophisticated neural nets. In this course, we will use the programming lan-
guage Python, a friendly but powerful high-level language which is widely used both in introductory programming courses and in a wide variety of production systems. Because Python is an interpreted language, executing a
line of Python code is very slow, perhaps hundreds of times slower than the
C equivalent. Therefore, we never write algorithms directly using for-loops
in Python. Instead, we vectorize the algorithms by expressing them in
terms of operations on matrices and vectors; those operations are imple-
mented in an efficient low-level language such as C or Fortran. This allows
a large number of computational operations to be performed with minimal
interpreter overhead. In this course, we will use the NumPy library, which
provides an efficient and easy-to-use array abstraction in Python.
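As a simple illustration of vectorization (a sketch, not code from the course materials), compare an explicit Python loop with the equivalent single NumPy operation:

    import numpy as np

    x = np.random.randn(1_000_000)
    w = np.random.randn(1_000_000)

    # Slow: an explicit Python for-loop pays interpreter overhead on every iteration.
    total = 0.0
    for i in range(len(x)):
        total += w[i] * x[i]

    # Fast: the same dot product as one vectorized call, executed in compiled code.
    total_vectorized = np.dot(w, x)

    # The two results agree up to floating-point rounding.
    print(abs(total - total_vectorized))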
Ten years ago, most neural networks were implemented directly on top
of a linear algebra framework like NumPy, or perhaps a lower level pro-
gramming language when efficiency was especially critical. More recently,
a variety of powerful neural net frameworks have been developed, including
Torch, Caffe, Theano, TensorFlow, and PyTorch. These frameworks
make it easy to quickly implement a sophisticated neural net model. Here
are some of the features provided by some or all of these frameworks (we’ll
use TensorFlow as an example):
• Automatic differentiation. If one implements a neural net directly
on top of NumPy, much of the implementational work involves writing
procedures to compute derivatives. TensorFlow automatically con-
structs routines for computing derivatives which are generally at least
as efficient as the ones we would have written by hand.
• GPU support. While NumPy is much faster than raw Python, it’s
not nearly fast enough for modern neural nets. Because neural nets
consist of a large collection of simple processing units, they natu-
rally lend themselves to parallel computation. Graphics processing
units (GPUs) are a particular parallel architecture which has been
especially powerful in training neural nets. It can be a huge pain to
write GPU routines at a low level, but TensorFlow provides an easy
interface so that the same code can run on either a CPU or a GPU.
For this course, we’ll use two neural net frameworks. The first is Au-
tograd, a lightweight automatic differentiation library. It is simple enough
that you will be able to understand how it is implemented; while it is miss-
ing many of the key features of PyTorch or TensorFlow, it provides a useful
mental model for reasoning about those frameworks.
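For example, computing a gradient with Autograd can be as short as the following sketch (the particular function being differentiated is just an arbitrary example):

    import autograd.numpy as np   # NumPy, thinly wrapped so Autograd can trace it
    from autograd import grad

    def f(w):
        # An arbitrary scalar-valued function of a vector w.
        return np.sum(np.tanh(w) ** 2)

    df = grad(f)                  # a new function that evaluates df/dw
    w = np.array([0.5, -1.0, 2.0])
    print(df(w))                  # the gradient at w, with no hand-written derivatives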
For roughly the second half of the course, we will use PyTorch, a pow-
erful and widely used neural net framework. It’s not quite as popular as
TensorFlow, but we think it is easier to learn. But once you are done
with this course, you should find it pretty easy to pick up any of the other
frameworks.
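To give a flavor of PyTorch, here is a minimal sketch of the single-unit computation from earlier, with automatic differentiation thrown in (the numbers are again just illustrative):

    import torch

    x = torch.tensor([0.5, -1.0, 2.0])                      # inputs
    w = torch.tensor([0.1, 0.4, -0.3], requires_grad=True)  # weights to differentiate with respect to
    b = torch.tensor(0.2, requires_grad=True)               # bias

    a = torch.sigmoid(b + torch.dot(w, x))   # a unit's activation, as before
    a.backward()                             # automatic differentiation
    print(w.grad, b.grad)                    # gradients of a with respect to w and b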