
ANN Notes

An Artificial Neural Network (ANN) is a computer system designed to mimic the way the human
brain processes information. Just like our brains have billions of neurons that work together to
help us think and learn, an ANN has "artificial neurons" (also called nodes or units) that help it
recognize patterns and make decisions based on data.

Key Concepts of an ANN:


1. Neuron: In an ANN, a neuron is the basic building block. Each neuron receives information,
processes it, and then passes it on. In an ANN diagram, these neurons are often represented by
circles.

2. Layers:
- Input Layer: This is where data first enters the network. Each neuron in this layer represents
one feature of the data (like the pixel values in an image or scores in a survey).
- Hidden Layers: These are in the middle and do the actual processing. The neurons in these
layers detect patterns or features by performing calculations on the inputs they receive.
- Output Layer: This is the final layer, and it produces the result, like identifying if an image is
of a cat or a dog. Each neuron in the output layer represents a possible answer.

3. Weights and Biases:
- Weights: Each connection between neurons has a weight, which determines how much
influence one neuron has on the next. Adjusting weights is how an ANN "learns."
- Bias: Biases help fine-tune the network's outputs, allowing it to make better predictions.

4. Activation Functions: These are formulas applied to each neuron's output to help decide if
the neuron should "fire" (send its result to the next layer) or stay inactive. Common activation
functions include ReLU (Rectified Linear Unit), which only passes positive values, and Sigmoid,
which gives a result between 0 and 1.

5. Learning:
- ANNs learn by adjusting their weights and biases through a process called training. Training usually relies on backpropagation, in which the network's errors are propagated backward to correct it. By showing the network thousands or even millions of examples and updating the weights with an algorithm called gradient descent, it gradually improves (a small numerical sketch follows this list).
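To make weights, biases, activations, and gradient descent concrete, here is a tiny NumPy sketch of a single neuron and one weight update; all of the numbers are made up purely for illustration:

```python
import numpy as np

# A single neuron: weighted sum of its inputs plus a bias, then an activation.
x = np.array([0.5, -1.0, 2.0])  # three input features (illustrative values)
w = np.array([0.4, 0.1, -0.3])  # one weight per input connection
b = 0.2                         # bias term

z = np.dot(w, x) + b                # pre-activation: w . x + b = -0.3
relu_out = max(0.0, z)              # ReLU passes only positive values -> 0.0
sigmoid_out = 1 / (1 + np.exp(-z))  # Sigmoid squashes z into (0, 1)

# One gradient-descent step: move the weights against the loss gradient.
learning_rate = 0.01
grad_w = np.array([0.05, -0.02, 0.10])  # hypothetical gradient of the loss
w = w - learning_rate * grad_w
```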

Example:
Imagine teaching a neural network to identify cats in pictures. During training, you would show it
thousands of labeled images (some with cats, some without). The network will adjust its weights
and biases to minimize errors, learning which features (like pointy ears or fur patterns) signal the
presence of a cat.

Why ANNs are Useful


ANNs are great at recognizing patterns in complex data, like images, sounds, and text. They’re
widely used in technologies we use daily, such as speech recognition (like Siri or Google
Assistant), self-driving cars, and medical diagnosis tools.

To implement an Artificial Neural Network (ANN) in TensorFlow without using Keras, we’ll use
TensorFlow’s lower-level API to manually define the architecture, forward pass, and training
loop. Here’s how you can do it using the MNIST dataset:

The MNIST dataset (Modified National Institute of Standards and Technology) is a well-known
dataset in machine learning and computer vision, often used as a benchmark for evaluating
image classification algorithms. Here’s an overview of what it contains and how it's used:

Overview of MNIST Dataset

1. Contents:
○ The dataset consists of 70,000 images of hand-written digits (0-9).
○ It’s split into 60,000 training images and 10,000 test images.
○ Each image is 28x28 pixels in grayscale, making each image a 784-pixel feature
vector (28 × 28 = 784).
2. Classes:
○ There are 10 classes, one for each digit (0 through 9).
○ Each image is labeled with the corresponding digit it represents, which is the
ground truth label.
3. Why MNIST is Popular:
○ Simplicity: Since it’s grayscale and low resolution, MNIST is computationally
inexpensive, allowing for quick experimentation.
○ Consistency: It’s used as a standardized benchmark across many models,
making it easier to compare results.
○ Starter Dataset: MNIST is often considered a “Hello World” dataset for image
recognition and deep learning, serving as an introductory task for new learners in
machine learning.
4. Applications in Learning:
○ Classification Models: Commonly used to teach models like logistic regression,
support vector machines, neural networks, and convolutional neural networks
(CNNs).
○ Image Preprocessing: MNIST helps in practicing techniques like normalization,
scaling, and reshaping images for neural networks (a short example follows this list).
○ Evaluation Metrics: Used to illustrate model performance metrics, including
accuracy, precision, recall, and confusion matrices.
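
As a quick illustration of that preprocessing (using the common `tf.keras.datasets` loader), each 28x28 image can be flattened into a 784-element vector and its pixel values scaled from [0, 255] down to [0, 1]:

```python
import tensorflow as tf

# Load MNIST: 60,000 training and 10,000 test images, each 28x28 grayscale.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
print(x_train.shape)  # (60000, 28, 28), pixel values 0-255

# Flatten each image to a 784-element vector and scale pixels to [0, 1].
x_train = x_train.reshape(-1, 784).astype("float32") / 255.0
x_test = x_test.reshape(-1, 784).astype("float32") / 255.0
print(x_train.shape)  # (60000, 784)
```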

Steps:

1. Load and preprocess the data
2. Define the model architecture and weights
3. Implement the forward pass (model prediction)
4. Define the loss function and optimizer
5. Train the model using a custom training loop

Here’s a complete code example:


Explanation of the Code

- Class Definition for the Model: We define an `ANNModel` class to hold the weights, biases, and the forward-pass method.
- Forward Pass: This method computes the network's output for a given input by passing the data through two hidden layers with ReLU activations, followed by an output layer whose softmax is applied during evaluation.
- Loss Function: We use sparse categorical cross-entropy, which is suitable for integer labels.
- Training Step: Each training step computes gradients and applies them to update the model's weights.
- Custom Training Loop: For each epoch, we loop through batches of data and perform a training step on each batch.
- Evaluation: Computes model accuracy on the test set by comparing predicted labels to the true labels.
This code runs in TensorFlow without the Keras layer API, and it provides more control over each part of the ANN, making it suitable for lower-level neural network experimentation.

Key TensorFlow Concepts Explained

Tensors

Tensors are the core data structures in TensorFlow. They are multidimensional arrays, similar to
NumPy arrays, but optimized for performance in machine learning tasks.
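
For instance, a few tensors of different ranks (a toy illustration):

```python
import tensorflow as tf

scalar = tf.constant(3.0)                       # rank-0 tensor
vector = tf.constant([1.0, 2.0, 3.0])           # rank-1, shape (3,)
matrix = tf.constant([[1.0, 2.0], [3.0, 4.0]])  # rank-2, shape (2, 2)
print(matrix.shape, matrix.dtype)  # (2, 2) <dtype: 'float32'>
```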

Variables

Variables in TensorFlow are mutable tensors that are used to store model parameters (weights
and biases). They can be updated during training.
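
A minimal illustration of a variable being updated in place:

```python
import tensorflow as tf

b = tf.Variable(tf.zeros([3]))              # mutable; suitable for parameters
b.assign_add(tf.constant([0.1, 0.2, 0.3]))  # in-place update, as in training
print(b.numpy())  # [0.1 0.2 0.3]
```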

Operations

Operations are functions that manipulate tensors. Common operations include matrix
multiplication, addition, and activation functions.
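
For example, a matrix multiply, an add, and a ReLU chained together:

```python
import tensorflow as tf

x = tf.constant([[1.0, 2.0]])          # 1x2 input
W = tf.constant([[0.5], [-1.0]])       # 2x1 weights
y = tf.nn.relu(tf.matmul(x, W) + 1.0)  # (1*0.5 + 2*(-1.0)) + 1.0 = -0.5
print(y.numpy())  # [[0.]] -- ReLU clips the negative value to zero
```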

Gradient Tape

The `tf.GradientTape` context manager records operations for automatic differentiation.
This allows TensorFlow to compute gradients, which are essential for optimizing the model
during training.
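
A toy example of recording an operation and computing its gradient:

```python
import tensorflow as tf

w = tf.Variable(3.0)
with tf.GradientTape() as tape:
    loss = w * w  # a toy loss: w^2

grad = tape.gradient(loss, w)  # d(w^2)/dw = 2w
print(grad.numpy())  # 6.0
```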

Activation Functions

Activation functions introduce non-linearity into the model, allowing it to learn complex patterns.
ReLU (Rectified Linear Unit) is commonly used due to its simplicity and effectiveness.
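
For instance, applied element-wise to the same values:

```python
import tensorflow as tf

z = tf.constant([-2.0, 0.0, 3.0])
print(tf.nn.relu(z).numpy())  # [0. 0. 3.] -- negatives clipped to zero
print(tf.sigmoid(z).numpy())  # each value squashed into (0, 1)
```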

Loss Functions

Loss functions measure how well the model's predictions match the actual labels. They are
crucial for guiding the optimization process.
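
A small illustration using the sparse categorical cross-entropy from the example above (integer label, raw logits):

```python
import tensorflow as tf

logits = tf.constant([[2.0, 0.5, -1.0]])  # one sample, three classes
labels = tf.constant([0])                 # true class index
loss = tf.nn.sparse_softmax_cross_entropy_with_logits(labels=labels,
                                                      logits=logits)
print(loss.numpy())  # small, since class 0 already has the largest logit
```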

Optimizers

Optimizers update the model's parameters based on the computed gradients. Adam is a popular
choice due to its adaptive learning rate, which helps achieve faster convergence.
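
A single Adam update on a toy parameter, for illustration:

```python
import tensorflow as tf

w = tf.Variable(3.0)
optimizer = tf.optimizers.Adam(learning_rate=0.1)
with tf.GradientTape() as tape:
    loss = w * w
grads = tape.gradient(loss, [w])
optimizer.apply_gradients(zip(grads, [w]))
print(w.numpy())  # nudged from 3.0 toward the minimum at 0
```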
