AI System Semiconductor Design
Lecture 9 (In-person): Shallow Neural Network
4/2/2025
Lecturer: Taewook Kang
Acknowledgments
Lecture material adapted from
Prof. Woowhan Jung, DSLAB, Hanyang Univ.
REVIEW: SHALLOW NEURAL NETWORK
Shallow Neural Network
Parameters: $\boldsymbol{\theta} = \{\boldsymbol{W}^{[1]}, \boldsymbol{b}^{[1]}, \boldsymbol{w}^{[2]}, b^{[2]}\}$

[Diagram: inputs $x_1, x_2, x_3$ feed a hidden layer of tanh units, whose outputs feed a single sigmoid unit producing $\hat{y}$.]

Loss: $L(\hat{y}, y)$
Cost: $J(\boldsymbol{\theta}) = \frac{1}{m}\sum_{q=1}^{m} L\bigl(\hat{y}^{(q)}, y^{(q)}\bigr)$

Gradient descent:
$\boldsymbol{\theta} \coloneqq \boldsymbol{\theta} - \eta \nabla_{\boldsymbol{\theta}} J(\boldsymbol{\theta}) = \boldsymbol{\theta} - \eta \frac{1}{m}\sum_{q=1}^{m} \nabla_{\boldsymbol{\theta}} L\bigl(\hat{y}^{(q)}, y^{(q)}\bigr)$

So, we will compute $\nabla_{\boldsymbol{\theta}} L\bigl(\hat{y}^{(q)}, y^{(q)}\bigr)$ for $1 \le q \le m$.

Forward pass:
$\boldsymbol{z}^{[1]} = \boldsymbol{W}^{[1]}\boldsymbol{x} + \boldsymbol{b}^{[1]}$
$\boldsymbol{a}^{[1]} = \tanh\bigl(\boldsymbol{z}^{[1]}\bigr)$ (element-wise)
$z^{[2]} = \boldsymbol{w}^{[2]T}\boldsymbol{a}^{[1]} + b^{[2]}$
$\hat{y} = a^{[2]} = \sigma\bigl(z^{[2]}\bigr)$

Partial derivatives to compute:
$\dfrac{\partial L(a^{[2]}, y)}{\partial b^{[2]}}$
$\dfrac{\partial L(a^{[2]}, y)}{\partial w_i^{[2]}}$ for $1 \le i \le h$
$\dfrac{\partial L(a^{[2]}, y)}{\partial b_i^{[1]}}$ for $1 \le i \le h$
$\dfrac{\partial L(a^{[2]}, y)}{\partial W_{ij}^{[1]}}$ for $1 \le i \le h$, $1 \le j \le n$
Partial Derivatives: 2nd layer
Parameters: $\boldsymbol{W}^{[1]}, \boldsymbol{b}^{[1]}, \boldsymbol{w}^{[2]}, b^{[2]}$
Forward pass: $\boldsymbol{z}^{[1]} = \boldsymbol{W}^{[1]}\boldsymbol{x} + \boldsymbol{b}^{[1]}$, $\boldsymbol{a}^{[1]} = \tanh(\boldsymbol{z}^{[1]})$, $z^{[2]} = \boldsymbol{w}^{[2]T}\boldsymbol{a}^{[1]} + b^{[2]}$, $a^{[2]} = \sigma(z^{[2]})$, loss $L(a^{[2]}, y)$
Binary cross entropy: $L(a^{[2]}, y) = -y \log a^{[2]} - (1-y)\log\bigl(1 - a^{[2]}\bigr)$

▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial b^{[2]}} = \dfrac{\partial L(a^{[2]}, y)}{\partial a^{[2]}} \dfrac{\partial a^{[2]}}{\partial z^{[2]}} \dfrac{\partial z^{[2]}}{\partial b^{[2]}}$

Using $\dfrac{d\sigma(x)}{dx} = \sigma(x)\bigl(1 - \sigma(x)\bigr)$:

$= \left(\dfrac{-y}{a^{[2]}} + \dfrac{1-y}{1-a^{[2]}}\right) \sigma\bigl(z^{[2]}\bigr)\bigl(1 - \sigma(z^{[2]})\bigr) \cdot 1$
$= \left(\dfrac{-y}{a^{[2]}} + \dfrac{1-y}{1-a^{[2]}}\right) a^{[2]}\bigl(1 - a^{[2]}\bigr)$
$= -y\bigl(1 - a^{[2]}\bigr) + a^{[2]}(1 - y) = a^{[2]} - y$

Same as in logistic regression.
Partial Derivatives: 2nd layer (cont.)

▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial w_i^{[2]}} = \dfrac{\partial L(a^{[2]}, y)}{\partial a^{[2]}} \dfrac{\partial a^{[2]}}{\partial z^{[2]}} \dfrac{\partial z^{[2]}}{\partial w_i^{[2]}}$

$= \bigl(a^{[2]} - y\bigr) \dfrac{\partial z^{[2]}}{\partial w_i^{[2]}}$
$= \bigl(a^{[2]} - y\bigr)\, a_i^{[1]}$
Partial Derivatives: 1st layer

▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial b_i^{[1]}} = \dfrac{\partial L(a^{[2]}, y)}{\partial a^{[2]}} \dfrac{\partial a^{[2]}}{\partial z^{[2]}} \dfrac{\partial z^{[2]}}{\partial a_i^{[1]}} \dfrac{\partial a_i^{[1]}}{\partial z_i^{[1]}} \dfrac{\partial z_i^{[1]}}{\partial b_i^{[1]}}$

Using $\dfrac{d\tanh(x)}{dx} = 1 - \tanh^2(x)$:

$= \bigl(a^{[2]} - y\bigr) \dfrac{\partial z^{[2]}}{\partial a_i^{[1]}} \dfrac{\partial a_i^{[1]}}{\partial z_i^{[1]}} \dfrac{\partial z_i^{[1]}}{\partial b_i^{[1]}}$
$= \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - \tanh^2 z_i^{[1]}\bigr) \cdot 1$
$= \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)$
Partial Derivatives: 1st layer (cont.)

▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial W_{ij}^{[1]}} = \dfrac{\partial L(a^{[2]}, y)}{\partial a^{[2]}} \dfrac{\partial a^{[2]}}{\partial z^{[2]}} \dfrac{\partial z^{[2]}}{\partial a_i^{[1]}} \dfrac{\partial a_i^{[1]}}{\partial z_i^{[1]}} \dfrac{\partial z_i^{[1]}}{\partial W_{ij}^{[1]}}$
$= \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr) \dfrac{\partial z_i^{[1]}}{\partial W_{ij}^{[1]}}$

Note: $z_i^{[1]} = \boldsymbol{w}_i^{[1]T}\boldsymbol{x} + b_i^{[1]} = \sum_{j=1}^{n} W_{ij}^{[1]} x_j + b_i^{[1]}$, so $\dfrac{\partial z_i^{[1]}}{\partial W_{ij}^{[1]}} = x_j$.

$\Rightarrow \dfrac{\partial L(a^{[2]}, y)}{\partial W_{ij}^{[1]}} = \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)\, x_j$
Partial derivatives
▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial b^{[2]}} = a^{[2]} - y$
▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial w_i^{[2]}} = \bigl(a^{[2]} - y\bigr)\, a_i^{[1]}$
▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial b_i^{[1]}} = \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)$
▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial W_{ij}^{[1]}} = \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)\, x_j$
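As a sanity check on the first formula, $\partial L / \partial b^{[2]} = a^{[2]} - y$ can be verified numerically with a central finite difference. A minimal sketch, not from the lecture, using arbitrary toy values for a1, w2, b2, and y:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bce(a2, y):
    # binary cross entropy L(a2, y)
    return -y * np.log(a2) - (1 - y) * np.log(1 - a2)

# arbitrary toy values for one sample
a1 = np.array([0.3, -0.7])   # hidden activations a[1]
w2 = np.array([0.5, -1.2])   # second-layer weights w[2]
b2, y = 0.1, 1.0

a2 = sigmoid(w2 @ a1 + b2)
eps = 1e-6
numeric = (bce(sigmoid(w2 @ a1 + b2 + eps), y)
           - bce(sigmoid(w2 @ a1 + b2 - eps), y)) / (2 * eps)
print(numeric, a2 - y)       # the two values should match closely
```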
Partial Derivatives and Gradients
Vectorization: collecting the per-component partial derivatives into vectors and matrices gives

▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial b^{[2]}} = a^{[2]} - y$ (scalar, unchanged)
▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial w_i^{[2]}} = \bigl(a^{[2]} - y\bigr) a_i^{[1]}$  ⟹  $\nabla_{\boldsymbol{w}^{[2]}} L\bigl(a^{[2]}, y\bigr) = \bigl(a^{[2]} - y\bigr)\, \boldsymbol{a}^{[1]}$
▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial b_i^{[1]}} = \bigl(a^{[2]} - y\bigr) w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)$  ⟹  $\nabla_{\boldsymbol{b}^{[1]}} L\bigl(a^{[2]}, y\bigr) = \bigl(a^{[2]} - y\bigr)\, \boldsymbol{w}^{[2]} \odot \boldsymbol{e}^{[1]}$
▪ $\dfrac{\partial L(a^{[2]}, y)}{\partial W_{ij}^{[1]}} = \bigl(a^{[2]} - y\bigr) w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr) x_j$  ⟹  $\nabla_{\boldsymbol{W}^{[1]}} L\bigl(a^{[2]}, y\bigr) = \Bigl(\bigl(a^{[2]} - y\bigr)\, \boldsymbol{w}^{[2]} \odot \boldsymbol{e}^{[1]}\Bigr) \otimes \boldsymbol{x}$ (outer product)

where $\boldsymbol{e}^{[1]} = \boldsymbol{1} - \boldsymbol{a}^{[1]} \odot \boldsymbol{a}^{[1]}$
⊙: element-wise product (a.k.a. Hadamard product)
⊗: outer product
REVIEW: LOGISTIC REGRESSION ASSIGNMENT – XOR DATA
Logistic regression: Boolean operators
▪ Training logistic regression models for Boolean operators
▪ Requirements
▪ AND, OR, XOR
▪ You need to build a dataset for each operator
▪ The model may not work for one of the operators
▪ Use numpy arrays
▪ Initialization with lists: x, y
▪ Random initialization: w, b
▪ Use numpy operations
▪ Inner product
▪ Addition
Lecture 5 Assignment
▪ Logistic regression on Boolean data
▪ Report required contents
1. Add source code for model & training
2. For each operator (AND, OR, XOR):
(1) Cost-vs-epoch plot
▪ Use different learning rates (at least 3 learning rates)
(2) Show predicted results for all input combinations
3. Explain whether the logistic regression model works well for the AND, OR, and XOR data
▪ For one operator, logistic regression won't work. Which one?
▪ Due: 2025/3/27 (Thu) 11:59 PM (after in-person Lecture 7)
▪ So you have two more classes (including today) in which to ask questions before finishing.
▪ Submit to iCampus
PYTHON PRACTICE: XOR CLASSIFICATION WITH SNN
Slicing a numpy array
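The slide's live code demo did not survive extraction; a minimal sketch of the basic numpy slicing patterns it likely covers (all values are illustrative):

```python
import numpy as np

x = np.arange(10)          # [0 1 2 3 4 5 6 7 8 9]
print(x[2:5])              # elements 2..4     -> [2 3 4]
print(x[:3])               # first three       -> [0 1 2]
print(x[::2])              # every other entry -> [0 2 4 6 8]

A = np.arange(12).reshape(3, 4)
print(A[1, :])             # second row    -> [4 5 6 7]
print(A[:, 2])             # third column  -> [2 6 10]
print(A[0:2, 1:3])         # 2x2 sub-block
```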
Slicing a numpy array with condition
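Again the demo code is missing; a minimal sketch of boolean-mask (conditional) slicing, with illustrative values:

```python
import numpy as np

x = np.array([-2.0, -0.5, 0.0, 1.5, 3.0])
mask = x > 0               # boolean array [False False False True True]
print(x[mask])             # -> [1.5 3.]
print(x[x <= 0])           # condition written inline -> [-2. -0.5 0.]

# Handy for selecting the samples of one class, e.g. points labeled 1:
y = np.array([0, 1, 0, 1, 1])
print(x[y == 1])           # -> [-0.5 1.5 3.]
```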
Outer product
$\dfrac{\partial L(a^{[2]}, y)}{\partial W_{ij}^{[1]}} = \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)\, x_j$
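In numpy the whole matrix of these entries is one call to np.outer, since np.outer(u, v)[i, j] == u[i] * v[j]. A small sketch with placeholder vectors u and v:

```python
import numpy as np

# u plays the role of (a[2]-y) * w[2] * (1 - a[1]**2), shape (h,)
# v plays the role of the input x,                     shape (n,)
u = np.array([1.0, 2.0, 3.0])
v = np.array([10.0, 20.0])
print(np.outer(u, v))      # shape (h, n); entry [i, j] is u[i] * v[j]
# [[10. 20.]
#  [20. 40.]
#  [30. 60.]]
```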
Data preparation
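The slide's code is not in the extracted text; a minimal sketch of preparing the XOR dataset (the array names X and Y are assumptions):

```python
import numpy as np

# XOR truth table: inputs (x1, x2) and labels y
X = np.array([[0, 0],
              [0, 1],
              [1, 0],
              [1, 1]], dtype=float)      # shape (4, 2): one sample per row
Y = np.array([0, 1, 1, 0], dtype=float)  # shape (4,)
```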
Data Plot - Two Different Ways
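The original plot code is missing; two plausible ways to scatter-plot the XOR data, assuming the X and Y arrays from the sketch above and matplotlib. Whether these match the slide's two ways is an assumption:

```python
import matplotlib.pyplot as plt

# Way 1: slice each class out with a boolean mask and plot separately
plt.scatter(X[Y == 0, 0], X[Y == 0, 1], marker='o', label='y = 0')
plt.scatter(X[Y == 1, 0], X[Y == 1, 1], marker='x', label='y = 1')
plt.legend()
plt.show()

# Way 2: one scatter call, with colors chosen by the label array
plt.scatter(X[:, 0], X[:, 1], c=Y)
plt.show()
```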
Model
$\boldsymbol{z}^{[1]} = \boldsymbol{W}^{[1]}\boldsymbol{x} + \boldsymbol{b}^{[1]}$
$\boldsymbol{a}^{[1]} = \tanh\bigl(\boldsymbol{z}^{[1]}\bigr)$
$z^{[2]} = \boldsymbol{w}^{[2]T}\boldsymbol{a}^{[1]} + b^{[2]}$
$\hat{y} = a^{[2]} = \sigma\bigl(z^{[2]}\bigr)$
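The slide's model code is not in the extracted text; a minimal sketch of the forward pass above. The hidden size h = 3 and all variable names are assumptions (for XOR the input dimension n is 2):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

h, n = 3, 2                      # hidden neurons (assumed), input dimension
rng = np.random.default_rng(0)
W1 = rng.normal(size=(h, n))     # W[1]
b1 = rng.normal(size=h)          # b[1]
w2 = rng.normal(size=h)          # w[2]
b2 = rng.normal()                # b[2]

def forward(x):
    z1 = W1 @ x + b1             # z[1] = W[1] x + b[1]
    a1 = np.tanh(z1)             # a[1] = tanh(z[1]), element-wise
    z2 = w2 @ a1 + b2            # z[2] = w[2]^T a[1] + b[2]
    a2 = sigmoid(z2)             # y_hat = a[2] = sigma(z[2])
    return a1, a2
```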
Train (with element-wise operations)
$\dfrac{\partial L(a^{[2]}, y)}{\partial b^{[2]}} = a^{[2]} - y$
$\dfrac{\partial L(a^{[2]}, y)}{\partial w_i^{[2]}} = \bigl(a^{[2]} - y\bigr)\, a_i^{[1]}$
$\dfrac{\partial L(a^{[2]}, y)}{\partial b_i^{[1]}} = \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)$
$\dfrac{\partial L(a^{[2]}, y)}{\partial W_{ij}^{[1]}} = \bigl(a^{[2]} - y\bigr)\, w_i^{[2]} \bigl(1 - (a_i^{[1]})^2\bigr)\, x_j$
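The original training code is also missing; a minimal per-sample gradient-descent sketch built on the model and data sketches above, translating the four formulas one index at a time. The learning rate and epoch count are assumed values:

```python
eta, n_epoch = 0.5, 5000              # learning rate and epochs (assumed)

for epoch in range(n_epoch):
    for q in range(X.shape[0]):       # per-sample (stochastic) updates
        x, y = X[q], Y[q]
        a1, a2 = forward(x)
        err = a2 - y                  # common factor a[2] - y
        # element-wise gradients, directly from the formulas above
        db2 = err
        dw2 = np.array([err * a1[i] for i in range(h)])
        db1 = np.array([err * w2[i] * (1.0 - a1[i] ** 2) for i in range(h)])
        dW1 = np.array([[db1[i] * x[j] for j in range(n)] for i in range(h)])
        # gradient descent step
        W1 -= eta * dW1
        b1 -= eta * db1
        w2 -= eta * dw2
        b2 -= eta * db2
```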
Test Results
Decision Boundary
▪ What does your decision boundary look like?
▪ Does it make sense? Is it good enough?
▪ If it doesn't look good enough, how can we improve it?
▪ Hint: increase Nepoch or increase the number of neurons
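One common way to inspect the boundary (not necessarily the slide's exact code) is to evaluate the trained model on a grid and draw the contour at $\hat{y} = 0.5$; a sketch reusing forward(), X, and Y from the sketches above:

```python
import numpy as np
import matplotlib.pyplot as plt

xs = np.linspace(-0.5, 1.5, 200)
ys = np.linspace(-0.5, 1.5, 200)
XX, YY = np.meshgrid(xs, ys)
ZZ = np.array([forward(np.array([u, v]))[1]           # predicted y_hat
               for u, v in zip(XX.ravel(), YY.ravel())]).reshape(XX.shape)

plt.contourf(XX, YY, ZZ, levels=[0.0, 0.5, 1.0], alpha=0.3)
plt.scatter(X[Y == 0, 0], X[Y == 0, 1], marker='o', label='y = 0')
plt.scatter(X[Y == 1, 0], X[Y == 1, 1], marker='x', label='y = 1')
plt.legend()
plt.show()
```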
Train (with Vector Operations) – Assignment
Complete this code (the missing lines are marked with "?" in the provided code):

$\dfrac{\partial L(a^{[2]}, y)}{\partial b^{[2]}} = a^{[2]} - y$
$\nabla_{\boldsymbol{w}^{[2]}} L\bigl(a^{[2]}, y\bigr) = \bigl(a^{[2]} - y\bigr)\, \boldsymbol{a}^{[1]}$
$\nabla_{\boldsymbol{b}^{[1]}} L\bigl(a^{[2]}, y\bigr) = \bigl(a^{[2]} - y\bigr)\, \boldsymbol{w}^{[2]} \odot \boldsymbol{e}^{[1]}$ ?
$\nabla_{\boldsymbol{W}^{[1]}} L\bigl(a^{[2]}, y\bigr) = \Bigl(\bigl(a^{[2]} - y\bigr)\, \boldsymbol{w}^{[2]} \odot \boldsymbol{e}^{[1]}\Bigr) \otimes \boldsymbol{x} = \nabla_{\boldsymbol{b}^{[1]}} L\bigl(a^{[2]}, y\bigr) \otimes \boldsymbol{x}$ ?

where $\boldsymbol{e}^{[1]} = \boldsymbol{1} - \boldsymbol{a}^{[1]} \odot \boldsymbol{a}^{[1]}$
LECTURE 9 ASSIGNMENT
Modeling XOR with a Shallow Neural Network
▪ Train a shallow neural network that acts like an XOR operator – we already did this for the element-wise model.
▪ Goal: use vector operations
▪ Complete the code in the "Train (with Vector Operations) – Assignment" slide
▪ There are a total of 3 question-mark blocks. You need to fill in those lines.
▪ Report requirements
▪ Source code
▪ Model part (vector operations, including initialization and the forward pass)
▪ Training part
▪ Test result – model.predict part
▪ Decision boundary plot – add a few sentences of analysis.
▪ Is it good enough?
▪ How many neurons did you use? How many Nepoch did you use?
▪ Due: 4/11 Tue 8:59 AM