ANN on FPGA (IEEE)
Figure 3 shows a schematic diagram of the processing flow for the hidden node. Each node in the network is built from two XC3042 FPGAs and a 1K by 8 EPROM. The first FPGA carries the input latches and multipliers. The second carries a 20-bit fast adder/accumulator circuit and the scaling logic. These two FPGAs compute the weighted sum of five inputs (four in the output layer) and the bias value and then scale the result. The EPROM holds the activation function for the node. The system clock is 4 MHz, which is the maximum speed that could be achieved due to the use of slower EPROMs and two FPGAs per node. The entire network is driven by a microprogrammed controller. The controller generates the proper sequence of signals to control the timing for both layers. Computations for nodes of the same layer are done in parallel.

III. DETAILS OF THE ARCHITECTURE

Multiplication

Since the weights and biases are constants (predetermined), multiplication of any number with them can be done in a look-up table fashion. The CLBs (Configurable Logic Blocks) of the FPGA are programmed to realize these look-up tables. Multiplying an 8-bit number by an 8-bit constant produces a sixteen-bit product. The 8x8 multiplication is broken into two 8x4 multiplications and one addition. The most significant partial product (8x4) is shifted by four bits before adding it to the least significant partial product. The shift is realized by physically shifting (routing) the most significant bits. One CLB is used to generate two bits of the product. The twelve bits of the partial product are generated by using six CLBs.

Summation

The next task performed by the nodes is to combine the partial products into a single 20-bit sum. The 20 bits are selected so that no overflow can happen. A 20-bit fast carry look-ahead adder is designed to carry out the summations. Each node in the hidden layer adds up ten 16-bit partial products and a bias into a 20-bit positive-edge-triggered accumulator to produce a single sum. The output nodes perform the same but for eight partial products.

Scaling and Activation Function

The final task performed by the nodes of the hidden and output layers is the scaling and the application of the activation (sigmoid) function. The final result of the addition of all the partial products and the bias value is stored in a 20-bit accumulator. An investigation of the behavior of the sigmoid function

    f(u) = 1 / (1 + e^(-u))

shows that it saturates to approximately 1.0 when u >= +7 and saturates to approximately 0.0 when u <= -8. Accordingly, the 20 bits are scaled to 9 bits.

Control Unit

A separate microprogrammed controller drives the entire circuit. Two 4-bit asynchronous counters are cascaded to generate the addresses for the control memory. The counters are driven by the 4 MHz system clock. The asynchronous clear inputs of the counters are connected to a push-button switch. The enable input of the least significant counter is connected to the Q output of a JK flip-flop whose Preset and Clear inputs are driven by two control signals produced by the control memory. See Figure 4.

IV. RESULTS

The network is simulated by software, and the same input patterns are applied to both the software and the hardware network. In both cases the outputs are calculated. Table I shows the outputs y1 and y2 of both networks. See Figure 1. As shown in this table, the hardware network (Hard W.) performs correctly. The hardware implementation computes 4 million interconnections (approx. 70,000 decisions) per second. This speed allows the implementation of the network in real-time applications.

V. DISCUSSION AND CONCLUSION

We have presented a successful hardware implementation of a simple artificial neural network. The implementation can be expanded to realize more complex networks. Reconfigurability and adaptability are the main features of the hardware. For a new application, only the weights, biases and scaling parameters need to be reconfigured on the CLBs, without changing the basic design. It is easily expandable just by adding more nodes with the same design.

Xilinx FPGAs and other similar FPGAs are found to be feasible and efficient tools for the design of neural nets. They offer acceptable densities without the cost and lengthy design cycles of full custom circuits. Their reconfigurability and desktop programmability allow design changes to be made at the user's terminal, thereby avoiding fabrication cycle times and non-recurring engineering charges. Although (due to our limited funds) the use of two XC3042 FPGAs (50 MHz) and a 1K x 8 EPROM (450 ns) per node makes the network bulky, we found that its size and speed can be greatly improved by using higher density FPGAs. The FPGA XC3090 can easily accommodate the circuits in the two XC3042s used in this study. It will also significantly reduce the size as well as increase the speed by eliminating the 55 ns (approx.) delay between the I/O pins of the two FPGAs. RAMs can be
[Figure 3. Schematic of the processing flow for the hidden node: control unit, 20-bit accumulator, 20-bit sum, and a 1K x 8 EPROM holding the activation function.]
Figure 4. The Control Circuit.
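The constant-coefficient multiplication described in Section III can be sketched in software. This is a minimal Python model of the scheme, not the actual CLB configuration: the example weight is an assumption, and the 16-entry tables stand in for the look-up tables programmed into the CLBs.

```python
# Model of the paper's look-up-table multiplication: an 8-bit input times an
# 8-bit constant weight, split into two 8x4 multiplies with precomputed tables.

WEIGHT = 0x5A  # hypothetical example weight; the real weights are predetermined

# One 16-entry table per 4-bit nibble of the input (each entry is a 12-bit
# partial product, generated in hardware by six CLBs, two output bits each).
low_table = [WEIGHT * n for n in range(16)]   # for input bits 3..0
high_table = [WEIGHT * n for n in range(16)]  # for input bits 7..4

def kcm_multiply(x):
    """Multiply 8-bit x by the constant weight using two 8x4 lookups."""
    lo = low_table[x & 0xF]          # least significant partial product
    hi = high_table[(x >> 4) & 0xF]  # most significant partial product
    return (hi << 4) + lo            # 4-bit shift (routing), then one addition

# Cross-check against direct multiplication for every 8-bit input.
assert all(kcm_multiply(x) == WEIGHT * x for x in range(256))
```

The shift by four appears only as routing, which is why the hardware needs just the two tables and a single adder per multiplier.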
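The Summation subsection claims that 20 bits suffice for the accumulator with no overflow. A quick worked check, assuming unsigned values and a bias of at most 16 bits (the paper does not state the bias width explicitly):

```python
# Worst case for a hidden node: ten partial products of at most 16 bits each,
# plus a bias assumed here to be at most 16 bits as well.
MAX_16BIT = 2**16 - 1
worst_case_sum = 10 * MAX_16BIT + MAX_16BIT  # = 720,885
assert worst_case_sum < 2**20                # 1,048,576 -> fits in 20 bits
```

The output nodes sum only eight partial products, so they have even more headroom.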
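The scaling and activation stage can also be modeled in software. This sketch is our interpretation, not the exact EPROM contents: the weighted sum is clipped to the unsaturated range [-8, +8), reduced to the paper's 9-bit index, and used to address a sigmoid table with 8-bit entries (the paper's EPROM is 1K x 8; the exact address mapping is an assumption here).

```python
import math

INDEX_BITS = 9           # the scaled address width from the paper
ENTRIES = 2**INDEX_BITS  # 512 entries of 8 bits each

def build_sigmoid_table():
    """Precompute 8-bit approximations of f(u) = 1/(1 + e^(-u))."""
    table = []
    for i in range(ENTRIES):
        u = -8.0 + 16.0 * i / ENTRIES  # map index back to u in [-8, +8)
        table.append(round(255 * (1.0 / (1.0 + math.exp(-u)))))
    return table

TABLE = build_sigmoid_table()

def activate(u):
    """Clip u, scale it to a 9-bit index, and look up the 8-bit output."""
    u = max(-8.0, min(u, 8.0 - 16.0 / ENTRIES))
    index = int((u + 8.0) * ENTRIES / 16.0)
    return TABLE[index]

assert activate(8.0) == 255  # saturated to ~1.0 (u >= +7)
assert activate(-8.0) == 0   # saturated to ~0.0 (u <= -8)
```

The saturation behavior is what makes the 20-to-9-bit scaling lossless in practice: outside [-8, +8) the 8-bit output is already pinned at 0 or 255.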
1256
1257