Download full Computer Vision Using Deep Learning Neural Network Architectures with Python and Keras 1st Edition Vaibhav Verdhan ebook all chapters
Download full Computer Vision Using Deep Learning Neural Network Architectures with Python and Keras 1st Edition Vaibhav Verdhan ebook all chapters
com
https://textbookfull.com/product/computer-vision-using-deep-
learning-neural-network-architectures-with-python-and-
keras-1st-edition-vaibhav-verdhan/
OR CLICK BUTTON
DOWNLOAD NOW
https://textbookfull.com/product/supervised-learning-with-python-
concepts-and-practical-implementation-using-python-vaibhav-verdhan/
textboxfull.com
https://textbookfull.com/product/supervised-learning-with-python-
concepts-and-practical-implementation-using-python-1st-edition-
vaibhav-verdhan/
textboxfull.com
https://textbookfull.com/product/deep-learning-with-python-develop-
deep-learning-models-on-theano-and-tensorflow-using-keras-jason-
brownlee/
textboxfull.com
Beginning Anomaly Detection Using Python-Based Deep
Learning: With Keras and PyTorch Sridhar Alla
https://textbookfull.com/product/beginning-anomaly-detection-using-
python-based-deep-learning-with-keras-and-pytorch-sridhar-alla/
textboxfull.com
https://textbookfull.com/product/number-systems-for-deep-neural-
network-architectures-1st-edition-alsuhli/
textboxfull.com
https://textbookfull.com/product/reinforcement-learning-with-open-ai-
tensorflow-and-keras-using-python-1st-edition-abhishek-nandy/
textboxfull.com
Computer
Vision Using
Deep Learning
Neural Network Architectures with
Python and Keras
—
Vaibhav Verdhan
Computer Vision
Using Deep Learning
Neural Network Architectures
with Python and Keras
Vaibhav Verdhan
Computer Vision Using Deep Learning: Neural Network Architectures
with Python and Keras
Vaibhav Verdhan
Limerick, Ireland
v
Table of Contents
vi
Table of Contents
vii
Table of Contents
viii
Table of Contents
ix
Table of Contents
References����������������������������������������������������������������������������������������297
Major activation functions and layers used in CNN������������������������������������������297
Google Colab�����������������������������������������������������������������������������������������������������298
Index�������������������������������������������������������������������������������������������������303
x
About the Author
Vaibhav Verdhan is a seasoned data science
professional with rich experience spanning
across geographies and domains. He is
a hands-on technical expert and has led
multiple engagements in machine learning
and artificial intelligence. He is a leading
industry expert, is a regular speaker at
conferences and meetups, and mentors
students and professionals. Currently, he
resides in Ireland and is working as a Principal
Data Scientist.
xi
About the Technical Reviewer
Vishwesh Ravi Shrimali graduated from BITS Pilani in 2018, where he
studied mechanical engineering. Since then, he has worked with Big
Vision LLC on Deep Learning and computer vision and was involved in
creating official OpenCV AI courses. Currently, he is working at Mercedes-
Benz Research and Development India Pvt. Ltd. He has a keen interest
in programming and AI and has applied that interest in mechanical
engineering projects. He has also written multiple blogs on OpenCV and
Deep Learning on Learn OpenCV, a leading blog on computer vision. He
has also coauthored Machine learning for OpenCV 4 (second edition) by
Packt. When he is not writing blogs or working on projects, he likes to go
on long walks or play his acoustic guitar.
xiii
Acknowledgments
I would like to express my thanks to the following people. It is the results of
their hard work and passion that are advancing this field:
Ross Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
Shaoqing Ren
Kaiming He
Jian Sun
Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
Sergey Ioffe
Jonathon Shlens
Xiangyu Zhang
Omkar M. Parkhi
Andrea Vedaldi
Andrew Zisserman
Yaniv Taigman
Ming Yang
xv
Acknowledgments
Marc’Aurelio Ranzato
Lior Wolf
Yann LeCun
Leon Bottou
Yoshua Bengio
Patrick Haffner
Sefik Ilkin Serengil
xvi
Introduction
Innovation distinguishes between a leader and a follower.
—Steve Jobs
How good is your driving? Will you drive better than an autonomous
driving system? Or do you think an algorithm will perform better than a
specialist in classifying medical images? It can be a tricky question. But
artificial intelligence has outperformed doctors in detecting lung cancer
and breast cancer by analyzing images! Ouch!
Nature has been very kind to grant us powers of sight, taste, smell,
touch, and hearing. Out of these senses, the power of sight allows us to
appreciate the beauty of the world, enjoy the colors, recognize the faces
of our family and loved ones, and above all relish this beautiful world and
life. With time, humans amplified the power of the brain and made path-
breaking inventions and discoveries. The wheel or airplane, printing press
or clock, light bulb or personal computers – innovations have changed the
way we live, work, travel, decide, and progress. These innovations make life
simpler, easier, and far enjoyable and safe.
Data science and Deep Learning are allowing us to further enhance
the innovative buckets. Using Deep Learning, we are able to replicate
the power of vision given by nature. The computers are being trained to
perform the same tasks done by a human being. It can be detection of
colors or shape or size, classifying between a cat or a dog or a horse, or
driving on a road – the use cases are many. The solutions are applicable
for all the sectors like retail, manufacturing, BFSI, agriculture, security,
transport, pharmaceuticals, and so on.
This book is an attempt to explain the concepts of Deep Learning
and Neural Network for computer vision problems. We are examining
xvii
Introduction
xviii
Foreword
Computer Vision, not too long ago the exclusive purview of science fiction,
is quickly becoming commonplace across industries, if not in society at
large. The progress in the field to emulate human vision, that most prized
of human senses, is nothing but astonishing. It was only 1957 when Russell
Kirsch scanned the world’s first photograph, a black and white image of
his boy1. By the late 1980s, the work of Sirovich and Kirby2 helped establish
face recognition as a viable technology for biometric applications.
Facebook made the technology ubiquitous, notwithstanding privacy
concerns and legal challenges action3, when in 2010, it incorporated face
recognition in its social media platform.
The capabilities of Deep Learning vision systems to interpret and
extract information from images permeates all aspects of society. Only the
most skeptical among us doubt a not too distant future with self-driving
cars outnumbering those driven by their human counterparts or computer-
aided diagnosis (CADx) of medical images becoming an ordinary
service supplied by medical providers. Computer vision applications
1
he Associated Press “Computer scientist, pixel inventor Russell Kirsch dead
T
at 91”, The Associated Press, AP News, August 13, 2020, https://apnews.com/
article/technology-oregon-science-portland-us-news-db92e0b593f5156da9
70c0a1e9f90944. Accessed 9 December 2020.
2
Low-Dimensional Procedure for the Characterization of Human Faces, L. Sirovich
and M Kirby, March 1987, Journal of the Optical Society of America. A, Optics and
image science 4(3):519-24
3
Natasha Singer and Mike Isaac “Facebook to Pay $550 Million to Settle Facial
Recognition Suit” The New York Times, Jan 29, 2020 , https://www.nytimes.
com/2020/01/29/technology/facebook-privacy-lawsuit-earnings.html.
Accessed 9 December 2020.
xix
Foreword
already control access to our mobile devices and can outperform human
inspectors in the tedious but critical task of inspecting for defects in all
types of manufacturing processes. That is how I met Vaibhav, or V, as he is
known to his friends and colleagues. Collaborating on methods to improve
existing computer vision systems to ensure defect-free products critical for
human vision. Not lost is an appreciation of the circular history. We teach
computers how to see; they help manufacture products vital to improve
and care for human vision.
In this book, V takes a practical and convenient approach to the
subject. The abundant use of case studies is facilitated by ready-to-use
Python code and links to datasets and other tools. The practitioner’s
learning experience is enhanced by access to the resources needed to work
in a step-by-step fashion through each case study. The book organizes the
subject into three parts. In chapters 1 through 4, V describes the nature
of Neural Networks and demystifies how they learn. Along the way, he
points out different architectures and their historical significance. The
practitioner gets to experience, with all required resources in hand, the
elegant simplicity of LeNet, the improved efficiency of AlexNet, and
the popular VGG Net. In chapters 5 through 7, the practitioner applies
simple yet powerful computer vision applications such as training
systems to detect objects and recognize human faces. When progressing
into performing video analytics, we encounter the nagging problem of
vanishing and exploding gradients and how to overcome it using skip
connections in the ResNet architecture. Finally, in chapter 8, we review
the complete model development process, starting with a correctly
defined business problem and systematically advancing until the model is
deployed and maintain in a production environment.
xx
Foreword
xxi
CHAPTER 1
Introduction to
Computer Vision and
Deep Learning
Vision is the best gift to mankind by God.
Right from our birth, vision allows us to develop a conscious mind. Colors,
shapes, objects, and faces are all building blocks for our world. This gift of
nature is quite central to our senses.
Computer vision is one of the capabilities which allows the machines
to replicate this power. And using Deep Learning, we are enhancing our
command and making advancements in this field.
This book will examine the concepts of computer vision in the light
of Deep Learning. We will study the basic building blocks of Neural
Networks, develop pragmatic use cases by taking a case study–based
approach, and compare and contrast the performance of various solutions.
We will discuss the best practices, share the tips and insights followed
in the industry, make you aware of the common pitfalls, and develop a
thought process to design Neural Networks.
2
Chapter 1 Introduction to Computer Vision and Deep Learning
Info The easiest way to get RGB for any color is to open it in Paint
in Windows operating system. Hover over any color and get the
respective RGB value. In Mac OS, you can use Digital Colour Meter.
Deep Learning allows us to develop use cases which are much more
complex to be resolved using traditional Image Processing techniques. For
example, detecting a face can be done using OpenCV too, but to be able to
recognize one will require Deep Learning.
During the process of developing computer vision solutions using
Deep Learning, we prepare our image dataset first. During preparation, we
might have to perform grayscaling of the images, detect contours, crop the
images, and then feed them to the Neural Network.
OpenCV is the most famous library for such tasks. As a first step, let’s
develop some building blocks of these Image Processing methods. We will
create three solutions using OpenCV.
3
Chapter 1 Introduction to Computer Vision and Deep Learning
The images used for the solutions are the commonly available
ones. You are advised to examine the code and follow the step-by-step
implementation done. We will detect shape, colors, and a face in an image.
Let’s dive into the exciting world of images!
import numpy as np
import cv2
image = cv2.imread('Color.png')
4
Chapter 1 Introduction to Computer Vision and Deep Learning
Figure 1-1. Raw image to be used for color detection. The image
shown has four different colors, and the OpenCV solution will detect
them individually
lower_range = np.array([110,50,50])
upper_range = np.array([130,255,255])
5
Chapter 1 Introduction to Computer Vision and Deep Learning
As visible, the blue color is highlighted in white, while the rest of the
image is in black. By changing the ranges in step 5, you can detect different
colors of your choice.
With color done, it is time to detect a shape in an image; let’s do it!
6
Chapter 1 Introduction to Computer Vision and Deep Learning
import numpy as np
import cv2
shape_image = cv2.imread('shape.png')
Figure 1-3. Raw input image for detecting the three shapes of circle,
triangle, and rectangle
contours,h = cv2.findContours(thresh,1,2)
7
Chapter 1 Introduction to Computer Vision and Deep Learning
Figure 1-4. The output of the color detection system. A circle is shown
in yellow, square is shown in red, and triangle is shown in green
8
Chapter 1 Introduction to Computer Vision and Deep Learning
You can now detect shapes in any image. We have detected a circle,
a triangle, and a square. A good challenge will be to detect a pentagon or
hexagon; are you game?
Let’s do something more fun now!
9
Chapter 1 Introduction to Computer Vision and Deep Learning
import numpy as np
import cv2
face_cascade = cv2.CascadeClassifier(
'haarcascade_frontalface_default.xml')
eye_cascade = cv2.CascadeClassifier(
'haarcascade_eye.xml')
img = cv2.imread('Vaibhav.jpg')
Figure 1-5. Raw input image of a face which is used for the face
detection using the Haar-cascade solution
10
Random documents with unrelated
content Scribd suggests to you:
The Project Gutenberg eBook of The Yale Literary
Magazine (Vol. LXXXVIII, No. 9, June 1923)
This ebook is for the use of anyone anywhere in the United
States and most other parts of the world at no cost and with
almost no restrictions whatsoever. You may copy it, give it away
or re-use it under the terms of the Project Gutenberg License
included with this ebook or online at www.gutenberg.org. If you
are not located in the United States, you will have to check the
laws of the country where you are located before using this
eBook.
Author: Various
Language: English
The
Yale Literary Magazine
Conducted by the
Students of Yale University.
June, 1923.
New Haven: Published by the Editors.
Printed at the Van Dyck Press, 121-123 Olive St., New Haven.
ESTABLISHED 1818
Contents
JUNE, 1923
EDITORS
BUSINESS MANAGERS
MORRIS TYLER.
Welcome to our website – the ideal destination for book lovers and
knowledge seekers. With a mission to inspire endlessly, we offer a
vast collection of books, ranging from classic literary works to
specialized publications, self-development books, and children's
literature. Each book is a new journey of discovery, expanding
knowledge and enriching the soul of the reade
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
textbookfull.com