0% found this document useful (0 votes)
57 views

Paper 34-Birds Identification System Using Deep Learning

This document summarizes a research paper that investigates using deep learning for bird identification. The paper uses a database of 4340 bird images collected in Jordan. It applies convolutional neural networks (CNN), specifically VGG-19, to extract features from the images. Principal component analysis is used to reduce the dimensionality of the features. Various classifiers - KNN, decision tree, random forest, and artificial neural networks - are tested on the features. The artificial neural networks classifier achieved the highest accuracy of 70.99% for bird identification.

Uploaded by

Hui Shan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
57 views

Paper 34-Birds Identification System Using Deep Learning

This document summarizes a research paper that investigates using deep learning for bird identification. The paper uses a database of 4340 bird images collected in Jordan. It applies convolutional neural networks (CNN), specifically VGG-19, to extract features from the images. Principal component analysis is used to reduce the dimensionality of the features. Various classifiers - KNN, decision tree, random forest, and artificial neural networks - are tested on the features. The artificial neural networks classifier achieved the highest accuracy of 70.99% for bird identification.

Uploaded by

Hui Shan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/351361220

Birds Identification System using Deep Learning

Article  in  International Journal of Advanced Computer Science and Applications · May 2021

CITATIONS READS
3 4,429

2 authors:

Suleyman Al-Showarah Sohyb Qbailat


Mu’tah University Mu’tah University
14 PUBLICATIONS   42 CITATIONS    3 PUBLICATIONS   5 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Effects of user age on smartphones measured with an Eye-Tracker View project

Gestures on smartphones for user classification and identification View project

All content following this page was uploaded by Suleyman Al-Showarah on 03 June 2021.

The user has requested enhancement of the downloaded file.


(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

Birds Identification System using Deep Learning


Suleyman A. Al-Showarah1, Sohyb T. Al-qbailat2
Faculty of Information Technology
Mutah University, Karak
Jordan

Abstract—Identifying birds is one of challenging role for bird algorithms and methods [1][3][4][7][14], but this study differs
watchers due to the similarity of the birds’ forms/image from others in using the following operations: combine
background and the lack of experience for watchers. So, it needs between the fc6/fc7, max between fc6/fc7, min between
a computer system based images to help birdwatchers in order to fc6/fc7, and the average for fc6/fc7 based on VGG-19. Hence,
identify birds. This study aims at investigating the use of deep the field of birdwatching needs more investigations to develop
learning for birds’ identification using convolutional neural systems with new technique that help to identify birds.
network for extracting features from images. The investigation
was performed on database contained 4340 images that collected As the database of images were collected from Jordan, and
by the paper author from Jordan. The Principal Component the statistics number of birds in Jordan as stated in [13] are
Analysis (was applied on layer 6 and 7, as well as on the 434 species belonging to 66 families.
statistical operations of merging the two layers like: average,
minimum, maximum and combine of both layers. The datasets This study aims at investigating the use of deep learning
were investigated by the following classifiers: Artificial neural for birds’ identification using VGG-19 for extracting features
networks, K-Nearest Neighbor, Random Forest, Naïve Bayes and from images. In order to achieve this aim, the investigation for
Decision Tree. Whereas, the metrics used in each classifier are: the performance of different classifiers were performed on the
accuracy, precision, recall, and F-Measure. The results of following classifiers: (KNN, Decision Tree, Random Forest,
investigation include and not limited to the following, the PCA and ANN) on the collected reliable database of birds images
used on the deep features does not only reduce the that available in Jordan.
dimensionality, and therefore, the training/testing time is
reduced significantly, but also allows for increasing the VGG-19 considered as one of the most important models
identification accuracy, particularly when using the Artificial of Convolutional Neural Networks (CNN). Therefore, CNN is
Neural Networks classifier. Based on the results of classifiers; considered as the strongest technique for deep learning used in
Artificial neural networks showed high classification accuracy image identification [9].
(70.9908), precision (0.718), recall (0.71) and F-Measure (0.708)
The main reason of using VGG-19 is to provide high
compared to other classifiers.
precision by finding features with distinctive details in the
Keywords—Birds identification; deep learning convolutional image like the difference in lighting conditions and other
neural networks (CNN); VGG-19; principal component analysis objects surrounding the birds [3]. Moreover, PCA could be
(PCA) employed as dimensionality reduction tools with these
features that would help to reduce number of features that will
I. INTRODUCTION make the training time less.
Many people are interested in observing and studying The motivation to conduct this study represented by:
wildlife, especially in birdwatching. The role of birdwatching 1) The shortage in the field of identifying birds based on
is to preserve the nature by observing bird’s behavior and images. 2) To the best of our knowledge, we have not come
migration pattern. The challenge for bird watchers in across to any study conducted using VGG-19 for identifying
identifying birds based images remains difficult due to the birds. 3) There is shortage in database available in the world
similarity of the birds’ forms/ image background and the lack except these two databases that available in [1] [18]. This case
of experience in this field for watchers [1]. is applicable to Jordan, as there is no database of images for
As mentioned in [17] that birds Voice or Videos were used birds, and there is no program was developed to identify birds.
in earlier technique to predict it species, but this technique Based on the extracted features using VGG-19, the
have many challenges to give an accurate result due to other contribution of this study can provide a research fields with a
background of birds/animal voices. So, images can be best comparison between the results of different aforementioned
choice to be used to identify birds’ species. To implement this classifiers.
technique, the images for all birds’ species need to be trained
to generate a model. Then deep learning algorithm will This study organized into six sections. Section II
convert uploaded image into gray scale format and apply that introduces the overview of previous studies on all related
image on train model to predict best match species name for subjects. Section III describes the used database. Section IV
the uploaded image. discusses the model design and the methodology for the
experiment. Then Section V discusses the results of the
Also, during the previous years, artificial intelligence is experimental, and finally, Section VI presents paper
used in the field of bird watching based images using different conclusion.

251 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

II. RELATED WORK available in [1]. In their study, their database consisted of 300-
Machine learning (ML) represents a set of techniques that 400 different images consists of number of bird species. In
allow systems to discover the required representations to their study, the algorithm used to extract image features is
features detection or classification from the raw data. The AlexNet and then classified by using a SVM classifier. The
performance of works in the classification system depends on results of accuracy is 85%.
the quality of the features. As such of this study can be The researchers in [11] used multiple pre-CNN networks
categorize under the field of ML; this is to make a search in algorithms like: (AlexNet, VGG19 and GoogleNet) on birds
this area for the studies that belong to birds’ identification. dataset that is called (Caltech-UCSD Birds-200-2011). Based
In the literature review, there are number of studies on approach of combining between the aforementioned
conducted in field of identifying birds. But they were algorithms together, the results showed that this approach
conducted in different algorithms and methods, as follows: improved the accuracy that reached to 81.91%, when applied
on Caltech-UCSD Birds-200-2011 dataset compared to other
There are number of studies conducted for identifying datasets used in the same study.
birds based audio/ video like [4][11][6][10]. While other
studies conducted to identify birds based images using AI Another study conducted by [4] in field of database birds
algorithms [1][3][14], but not in what was conducted in this based images and birds identification system. Their study
study. This study used different operations like: MAX, MIN, aimed to classification the birds during flight from video clips.
AVERAGE, and Combine between the layers fc6/fc7 based They approximately collected 952 clips and extracted about
on VGG-19 algorithm. 16,1907 frame photos of 13 birds’ species. In order to improve
the accuracy, the researchers used the two features:
In field of birds database-based images and birds appearance and motion features. Then, they compared their
identification system, the researchers in [19] conducted study proposed method with the classifiers (VGG, MobileNet). The
on data collected mostly from North American of 200 bird proposed method achieved a 90% correct classification rate
species, where they called it: (Caltech-UCSD Birds 200 when using Random forest classifier.
(CUB-200)). They conducted their study based on two simple
features: image sizes and color histograms. In the case of In field of birds’ identification system, the researchers in
image sizes, they represented each image by its width and [3] applied different methods like: 1) softmax regression using
height in pixels. But in case the color histograms, they used 10 manually features on the Caltech-UCSD-Birds-200 dataset
bins per channel, where an applied Principal Component [19]. 2) A multi-class SVM was applied on HOG and RGB on
Analysis was applied. Their results showed how the features extracted from images. 3) A CNN was applied using
performance of the NN classifier degrades as the number of transfer learning algorithm to classify birds. The results of
classes in the dataset is increased, as in [18]. The performance comparing the three methods 46% when using CNN.
of the image size features are close to chance at 0.6% for the In the next section, the database content, number of
200 classes, while the color histogram features increase the images, source of images, and the challenges to classify
performance to 1.7%. Another example of studies that images are explained.
conducted in field of database for birds based images and
birds’ identification system, the researchers in [18] increased III. DATABASE DESIGN
the number of images to 11788 images; as it was 6033 in [19]. The database of birds images were collected from Jordan,
Where they used RGB color histograms and histograms of and it consists of 4340 images of 434 bird species. The
vector-quantized SIFT descriptors with a linear SVM. The database images were obtained from scientific sources and
results obtained of their study for the classification accuracy is were approved by Jordanian Bird Watching Association based
17.3%. on their scientific names [13].
Also, in the field of birds’ identification system, the The images have different backgrounds, where some of
researchers in [14] proposed a new feature to distinguish the them were taken in shadow condition, lightening background,
types of birds. In their study, they used the ratio of the and some of them have other objects in the images as
distance from the eye to the beak root, and the beak width. background. This has added a huge challenge to the
This feature was integrated in the decision tree, and then in researchers to extract features, and to provide high accuracy.
SVM. This proposal was applied to the database that called
(CUB-200-2011 dataset) that mentioned in [18]. The results IV. PROPOSED METHOD
achieved for correct classification rate is 84%. This section presents the procedures that used for the
Another study conducted on birds-identification. Their proposed method in identifying birds using VGG-19. Fig. 1
database was collected in India by the researchers that shows the proposed model.

252 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

Fig. 1. Proposed Model.

The following steps explains the proposed model of this V. EXPERIMENTAL RESULTS AND DISCUSSIONS
study, as follows:
This section presents the performance evaluation results
Step 1): The feature vectors will be extracted form images for the study dataset, which includ the accuracy , F-measure,
automatically using MATLAB for Pretrained VGG-19 to recall, precision and training time for each classifier as
build dataset that includes (feature factors: fc6 and fc7). Each follows:- 1KNN, 3KNN, 5KNN, ANN, Naïve Bayes, Random
dataset (e.g. fc6) contains 4096 columns (representing feature Forest and Decision Tree.
vectors) and 4340 rows (representing the number of samples
(images). The results of this study are displayed as follows:

Step 2): The statistical operations like: (min, max, average, A. Results of both Orginal/Pure fc6/fc7 Datasets Separately
and combined them together) were performed on the Table I shows the results of both orginal of fc6 and fc7
original/pure of fc6 and fc7 layers, this is to obtain new datasets. Naive Bayes has achieved the highest accuracy
dataset to be used in the next stage (step 3) of using classifiers. results for fc6 and fc7 which are (59.002) and (56.106).. While
Explanation on statistical operations, as follows: for the time spend to conduct the test and training dataset,
Decision Tree has spend large time (1406.69s), but KNNs
• Max: used to find the largest value between the two spend less time (0s) compared to other classifers. This is
values in fc6 and fc7 and put value in a new group. because it has no training model; where the test example is
• Min: used to find the less value between the two values compared directly to other examples in the training set, and
in fc6 and fc7 and put value in a new group. that why it is slow in testing, particularly when used a large
number of examples in the training [8][16]. This results match
• Average: used to find average the two values in fc6 and with the results in [5] [12].
fc7 and put value in a new group.
B. Results of the Statitsical Operations on fc6 and fc7
• Combined them together: used to combine the first Datasets
group (4096) next to the second group (4096). This is The section show the results of three dataset by applying
to have a new group that contains 8192 features in this statistical operations(avgerage, maximum, minmum) between
study. the fc6 and fc7 layers.
Step 3): A PCA will be applied on the original/pure of fc6, Table II shows results of the statitsical operations on
fc7, the dataset that obtained from the previous stage (step 2); fc6/fc7 datasets , where Naive Bayes has achieved the highest
this is to produce a new datasets. accuracy results for AVERAGE, MAX, and MIN, which are
The data obtained using the pre-trained VGG-19, is very (57.30), (60.99) , and (57.60) respectively. Despite of the
large (4096), therefore, the PCA was implemented to reduce Naive Bayes have scored acceptable accuracy, F-measure,
the number of features. In PCA, there were set of percentages recall, and precision that outperformed all classifiers, but also
used to show the variance of the data in the results, which are: it was achieved with acceptable training time. This result dis-
95%, 97% and 99% variance of the data (the 4096 features). match with other studies [2] [15].
Step 4): The results were performed based on applying set C. Results of Combine between (Original fc6/ fc7) Dataset
of classifiers on the datasets that obtained from (step 2 and A new dataset was obtained called combine by combining
step 3). of fc6 (4096)and fc7 (4096), which contained 8192 feature
vector, and accordingly will obtained the results:

253 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

Table III shows the birds identification results where In Table IV, the classifer ANN was not used in the
Naive Bayes has achieved the highest accuracy results in previous Tables I to III. This can be explained as follows:
combine (59.4009) of accuracy. The second high result of ANN is the best classifier to be used for deep features, if and
accuracy is 1KNN that has achieved accuracy of 50.2074. only if it is provided with a smaller number of deep features,
While for the time spend to conduct the test and training otherwise, i.e. if it is applied on the original/pure deep
dataset, Decision Tree spend large time (|2484.01s), but KNNs features, which obtained from the VGG-19 layer 6 or 7 or any
spend less time (0s) compared with other classifiers. merging of them both, the training time would be
unacceptably long [2] [15][11].
D. Results of Both Original/pure fc6/fc7 after Applying PCA
Tables IV to V shows the idntification results for each
classifier after applying PCA (95%,97%,and 99%).

TABLE I. IDENTIFICATION RESULT OF BOTH ORIGINAL/PURE FC6/FC7 DATASETS

Classifiers Accuracy Precision Recall F-measure Training Time (seconds)


fc6 results
1KNN 47.0507 0.542 0.471 0.479 0
3KNN 41.7512 0.506 0.418 0.421 0
5KNN 44.3318 0.535 0.443 0.451 0
Naïve Bayes 59.0092 0.642 0.59 0.601 9.15
Random forest 14.447 0.227 0.144 0.153 35.93
Decision Tree 12.8802 0.133 0.129 0.127 1438.85
fc7 results
1KNN 50.8065 0.552 0.508 0.511 0
3KNN 46.1751 0.544 0.462 0.463 0
5KNN 47.4885 0.556 0.475 0.48 0.02
Naïve bayes 56.106 0.609 0.561 0.571 8.23
Random forest 21.2442 0.295 0.212 0.22 60.53
Decision Tree 17.5115 0.185 0.175 0.174 1406.69

TABLE II. IDENTIFICATION RESULTS OF AVERAGE, MAXIMUM AND MINIMUM (FC6⊕FC7)

Classifiers Accuracy Precision Recall f-measure Training Time (sec)


Avg results
1KNN 47.0276 0.536 0.47 0.478 0
3KNN 41.5668 0.497 0.416 0.418 0
5KNN 43.8479 0.523 0.438 0.444 0
Naïve Bayes 57.3041 0.624 0.573 0.584 7.11
Random forest 14.424 0.22 0.144 0.148 44.96
Decision tree 13.3641 0.139 0.134 0.131 1278.55
Max results
KNN 49.6313 0.577 0.496 0.505 0
3KNN 44.9309 0.555 0.449 0.456 0.11
5KNN 47.5806 0.583 0.476 0.487 0
Naïve Bayes 60.9908 0.67 0.61 0.622 7.08
Random forest 16.8433 0.265 0.168 0.176 31.28
Decision tree 14.9078 0.154 0.149 0.148 1467.56
Min results
1KNN 44.9309 0.513 0.449 0.456 0
3KNN 39.2627 0.491 0.393 0.396 0.02
5KNN 40.1152 0.494 0.401 0.408 0
Naïve Bayes 57.6037 0.632 0.576 0.586 7.19
Random forest 12.9493 0.204 0.129 0.133 53.8
Decision tree 11.0829 0.118 0.111 0.11 1198.16

254 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

TABLE III. IDENTIFICATION RESULTS OF COMBINE BETWEEN (ORIGINAL FC6/ FC7) DATASET

Classifiers Accuracy Precision Recall f-measure Training Time (sec)


Combine results
KNN 50.2074 0.555 0.502 0.506 0.01
3KNN 45.2995 0.539 0.453 0.455 0.74
5KNN 47.6728 0.563 0.477 0.482 0
Naïve Bayes 59.4009 0.64 0.594 0.603 14.68
Random forest 18.1797 0.256 0.182 0.185 47.38
Decision tree 16.129 0.166 0.161 0.159 2484.01

TABLE IV. IDENTIFICATION RESULTS OF ORIGINAL/PURE FC6 AFTER APPLYING PCA (95%,97%,99%)

Classifiers Accuracy Precision Recall F-measure Training Time (sec)

fc6 (PCA 95%) results


ANN 68.8018 0.695 0.688 0.685 23378.32
1KNN 27.6959 0.52 0.277 0.317 0
3KNN 15.2074 0.391 0.152 0.185 0
5KNN 15.2765 0.416 0.153 0.19 0
Naïve Bayes 52.0737 0.631 0.521 0.549 0.41
Random forest 6.7281 0.13 0.067 0.067 20.99
Decision tree 14.5392 0.153 0.145 0.144 107.12
fc6 (PCA 97%) results
ANN 70 0.658 0.65 0.648 19022.88
KNN 19.2857 0.49 0.193 0.237 0
3KNN 8.1797 0.278 0.082 0.104 0
5KNN 8.1567 0.292 0.082 0.107 0
Naïve Bayes 48.318 0.622 0.483 0.52 1.11
Random forest 3.8018 0.085 0.038 0.038 27.03
Decision tree 14.1014 0.154 0.141 0.141 188.18
fc6 (PCA 99%) results
ANN 62.3733 0.642 0.624 0.623 48850.24
KNN 8.6636 0.325 0.087 0.113 0
3KNN 1.8433 0.072 0.018 0.022 0
5KNN 1.9355 0.079 0.019 0.023 0
Naïve bayes 37.9032 0.581 0.379 0.428 1.44
Random forest 2.0507 0.04 0.021 0.02 28.25
Decision tree 13.1567 0.143 0.132 0.132 471.77

Applying PCA has influnced on the training time for fc6 Table V shows the birds identification results for fc7
that made it less for all classifers in Table IV-after applying where the highest accuracy resultant from applying PCA of
PCA compared to the training times in Tables I to III, before (95%,97% and 99%) are in favors of ANN with (65.2995,
applying PCA, especially for Random Forest and Naïve 65.2995 and 67.9493), respectively.
Bayes. The highest accuracy resultant from applying PCA of
(95%, 97% and 99%) is in favor of ANN with (68.8018, 70 The second high accuracy resultant from applying PCA of
and 62.3733%), respectively, which can be attributed to the all percentage of (95%, 97% and 99%) is Naïve Bayes, has
reduced feature vector. achieved accuracy of (58.3641, 56.9585 and 56.3825%),
respectively.
So, it is worth mentioning that the ANN classifier was not
used with other sets except those obtained after applying the E. Results of the Statistical Operations on (fc6 and fc7) after
PCA, this is because of its unacceptable training time. This Applying PCA
results matches with previous studies that stated the training This section presents the identification results of the
time for ANN spend large compared with other classifers statistical operations on each of (average, maximum and
[2][15]. minimum) between the fc6 and fc7 after applying PCA

255 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

(95%,97%,99%), as well as the results of training time for While for the time spend to conduct the test and training
each classifier, as follws: dataset, ANN spend large time 54151.88s.
Table VI shows the birds identification results in (average Table VIII shows the birds identification results in
between (fc6 and fc7)) where the highest accuracy resultant (minimum between (fc6 and fc7)) where the highest accuracy
from applying PCA of (95%, 97% and 99%) are in favors of resultant from applying PCA of (95%) are in favors of ANN
ANN with (69.5622, 69.9078 and 65.5069) respectively. The with (70.8295). It is noted that the result of the ANN is
second-high accuracy resultant from applying PCA of all appeared only for the PCA (95%), but not for the percentage
percentage of (95%, 97% and 99%) is Naïve Bayes that has (97%, and 99%). This is because the large number of features
achieved accuracy of (53.3871, 49.7926 and 39.8157%) for each of PCA (97% and 99%) that reached to (1205 and
respectively. While the time spend to conduct the test and 1910) features respectively. Also, due to its unacceptable
training dataset, ANN spend large time 58379.22s , where that training time (that takes days to provide the results. While
PCA 95 spend less time compared to PCA 97and PCA99. Naïve Bayes achieved accuracy resultant from applying PCA
of all percentage of (95%, 97% and 99%), they are as follows
Table VII shows the birds identification results in (48.7327, 44.1014 and 35%), respectively. While for the time
(maximum between (fc6 and fc7)) where the highest accuracy spend to conduct the test and training dataset, ANN spend
resultant from applying PCA of (95%) are in favors of ANN large time 42677.02s.
with (66.9816) . It is noted that the results of the ANN is
appeared only for PCA (95%), but not for the percentage of F. Results of Combining Feature Vector after Applying PCA
(97%, and 99%). This is because the large number of features This section shows the results of combining between fc6
for each of PCA (97% and 99%) that reached to (1428, and (4096) and fc7 (4096) that reached 8192, but this number of
2117) features, respectively. Therefore there will not be results features have been reduced after appling PCA (95%, 97%,
when using ANN, due to its unacceptable training time (that 99%) that become (250, 440 and 1080) features, respectively.
takes days to provide the results. The results of combine, as follows:

TABLE V. IDENTIFICATION RESULTS OF ORIGINAL/PURE FC7 AFTER APPLYING PCA (95%,97%,99%)

Classifiers Accuracy Precision Recall F measure Training Time (sec)

fc7 (PCA 95%) results


ANN 64.977 0.658 0.65 0.648 12295.32
KNN 41.4055 0.509 0.414 0.427 0
3KNN 34.9078 0.502 0.349 0.365 0
5KNN 36.7051 0.52 0.367 0.386 0
Naïve bayes 58.3641 0.643 0.584 0.598 0.06
Random forest 15.8986 0.24 0.159 0.167 15.5
Decision tree 17.0737 0.177 0.171 0.169 40.36
fc7 (PCA 97%) results
ANN 65.2995 0.66 0.653 0.651 15658
KNN 38.6175 0.532 0.386 0.409 0.01
3KNN 29.7926 0.507 0.298 0.326 0
5KNN 30.4147 0.52 0.304 0.337 0
Naïve bayes 56.9585 0.646 0.57 0.588 0.11
Random forest 12.2811 0.211 0.123 0.13 17.32
Decision tree 16.4977 0.173 0.165 0.164 71.95
fc7 (PCA 99%) results
ANN 67.9493 0.686 0.679 0.676 23197.76
KNN 27.3272 0.565 0.273 0.324 0.01
3KNN 15.2995 0.45 0.153 0.195 0
5KNN 15.4147 0.464 0.154 0.198 0
Naïve bayes 56.3825 0.678 0.564 0.592 0.53
Random forest 4.3779 0.088 0.044 0.044 22.7
Decision tree 14.7926 0.151 0.148 0.146 137.95

256 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

TABLE VI. IDENTIFICATION RESULTS OF AVERAGE BETWEEN (FC6 AND FC7) AFTER APPLYING OF PCA (95%,97%,99%)

Classifiers Accuracy Precision Recall F measure Training Time (sec)


AVG (PCA95%) results
ANN 69.5622 0.703 0.696 0.693 16452.89
KNN 29.1705 0.523 0.292 0.331 0
3KNN 16.6359 0.418 0.166 0.202 0.02
5KNN 16.659 0.429 0.167 0.205 0
Naïve bayes 53.3871 0.635 0.534 0.56 1.35
Random forest 5.7143 0.111 0.057 0.057 15.99
Decision tree 15.2304 0.165 0.152 0.152 85.88
AVG (PCA 97%) results
ANN 69.9078 0.707 0.699 0.696 24498.83
KNN 20.6221 0.503 0.206 0.254 0
3KNN 8.5484 0.282 0.085 0.109 0
5KNN 8.8249 0.316 0.088 0.115 0
Naïve Bayes 49.7926 0.631 0.498 0.53 0.48
Random forest 3.9862 0.082 0.04 0.039 22.49
Decision tree 14.0553 0.149 0.141 0.14 138.67
AVG (PCA 99%) results
ANN 65.5069 0.666 0.655 0.652 58379.22
KNN 9.4009 0.34 0.094 0.121 0
3KNN 2.1889 0.089 0.022 0.027 0
5KNN 2.2811 0.094 0.023 0.028 0
Naïve bayes 39.8157 0.607 0.398 0.45 2.2
Random forest 2.0968 0.04 0.021 0.019 26.89
Decision tree 13.9171 0.149 0.139 0.139 314.45

TABLE VII. IDENTIFICATION RESULTS OF MAXIMUM BETWEEN (FC6 AND FC7) AFTER APPLYING OF PCA (95%,97%,99%)

Classifiers Accuracy Precision Recall F measure Training Time (sec)


MAX (PCA95%) results
ANN 66.9816 0.68 0.67 0.668 54151.88
KNN 13.2488 0.432 0.132 0.172 0
3KNN 4.7235 0.169 0.047 0.059 0
5KNN 4.447 0.161 0.044 0.055 0
Naïve bayes 46.5207 0.658 0.465 0.517 1.75
Random forest 2.9032 0.071 0.029 0.032 30.5
Decision tree 15.0922 0.16 0.151 0.15 423.62
MAX (PCA 97%) results
ANN - - - - -
KNN 7.8341 0.32 0.078 0.106 0
3KNN 2.0507 0.07 0.021 0.022 0
5KNN 2.3272 0.073 0.023 0.026 0
Naïve bayes 41.682 0.648 0.417 0.478 2.46
Random forest 2.6959 0.07 0.027 0.028 33
Decision tree 14.8618 0.158 0.149 0.148 556.78
MAX (PCA 99%) results
ANN - - - - -
KNN 3.2258 0.132 0.032 0.043 0
3KNN 0.9908 0.026 0.01 0.008 0
5KNN 1.0599 0.026 0.011 0.009 0
Naïve bayes 32.1889 0.585 0.322 0.38 3.52
Random forest 1.7512 0.043 0.018 0.02 35.19
Decision tree 14.6774 0.156 0.147 0.146 1517.2

257 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

Table IX shows the birds identification results in (combine 2) Some others of previous studies conducted on dataset
between fc6 and fc7) where the highest accuracy resultant containing a large number of images in training dataset
from applying PCA of (95%,97% and 99%) are in favors of (examples) like in [4], [3], [14] that used (161907), (11788),
ANN with (69.5392, 70.9908 and 67.9263), respectively. The (11788) examples recpectively compared to this study which
second high accuracy resultant from applying PCA of all
contained a few images (4340 examples). Few number of
percentage of (95%, 97% and 99%) is Naïve Bayes that has
achieved accuracy of (57.235, 54.1475 and 43.7558%), images (examples) for each bird usually leads to low accuracy
respectively. compared to the large examples, but in constant it was not.
This leads to make more covident in the results of this study.
While for the time spend to conduct the test and training 3) There were studies conducted for identifying birds
dataset, ANN spend large time (56279.29s). Comparison using different algorithms and methods based audio/ video
between the proposal work and previous researchers’ works.
like [4][11][6][10], while other studies conducted to identify
Table X compares the results of the proposed approach birds based images using AI algorithms [1][3][17]. This is less
with three similar approaches for birds identification. in what was conducted in this study that used deep-learning
Table X has approved that the output of our proposal can algorithms and different statistical operations like: MAX,
be considered as one of the interesting study comapred to the MIN, AVERAGE, and combine between the layers fc6/fc7
previous researchs, for several reasons: based on VGG-19 algorithm.
4) This study conducted on different methods like:
1) Some of previous studies were conducted on small
combine between the fc6/fc7, max of fc6/fc7, min of fc6/fc7,
dataset birds (categories) like in [4], [7] that used (13), (16)
and the average for fc6/fc7 based on VGG-19.
categories recpectively, compared to this study that used
(434).
TABLE VIII. IDENTIFICATION RESULTS OF MINIMUM BETWEEN (FC6 AND FC7) AFTER APPLYING OF PCA (95%,97%,99%)

Classifiers Accuracy Precision Recall F measure Training Time (sec)


MIN (PCA 95%) results
ANN 70.8295 0.715 0.708 0.993 42677.02
KNN 17.6037 0.515 0.176 0.223 0
3KNN 6.106 0.234 0.061 0.078 0
5KNN 6.1982 0.238 0.062 0.078 0
Naïve bayes 48.7327 0.661 0.487 0.539 1.2
Random forest 3.5023 0.093 0.035 0.038 33.74
Decision tree 13.6636 0.153 0.142 0.142 2829.83
MIN (PCA 97%) results
ANN - - - - -
KNN 9.5853 0.371 0.096 0.129 0
3KNN 2.5115 0.079 0.025 0.027 0
5KNN 2.6267 0.096 0.026 0.029 0
Naïve bayes 44.1014 0.652 0.441 0.501 2.5
Random forest 2.5115 0.057 0.025 0.025 29.52
Decision tree 13.6636 0.147 0.137 0.136 1007.75
MIN (PCA 99%) results
ANN - - - - -
KNN 3.871 0.176 0.039 0.051 0.01
3KNN 0.8756 0.024 0.009 0.007 0
5KNN 0.7834 0.019 0.008 0.007 0
Naïve bayes 35 0.615 0.35 0.414 3.38
Random forest 2.0507 0.046 0.021 0.021 31.93
Decision tree 13.341 0.144 0.133 0.134 547.58

258 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

TABLE IX. IDENTIFICATION RESULTS OF COMBINE ON (FC6 AND FC7) AFTER APPLYING PCA (95%, 97%, 99%)

Classifiers Accuracy Precision Recall F measure Training Time (sec)


Combine (PCA 95%) results
ANN 69.5392 0.703 0.695 0.693 20103.19
KNN 35 0.544 0.35 0.382 0
3KNN 24.3088 0.459 0.243 0.275 0
5KNN 24.1705 0.487 0.242 0.28 0
Naïve bayes 57.235 0.653 0.572 0.592 0.89
Random forest 7.3041 0.166 0.073 0.08 167.7
Decision tree 16.0599 0.167 0.161 0.159 96.93
Combine (PCA 97%) results
ANN 70.9908 0.718 0.71 0.708 24033.56
KNN 27.1659 0.547 0.272 0.319 0
3KNN 13.7558 0.397 0.138 0.172 0
5KNN 14.5161 0.436 0.145 0.186 0
Naïve bayes 54.1475 0.654 0.541 0.568 0.66
Random forest 5.1152 0.115 0.051 0.054 26.12
Decision tree 15.4839 0.161 0.155 0.153 128.03
Combine (PCA 99%) results
ANN 67.9263 0.685 0.679 0.675 56279.29
KNN 10.8065 0.39 0.108 0.142 0.02
3KNN 2.9493 0.106 0.029 0.035 0
5KNN 2.9493 0.112 0.029 0.036 0
Naïve bayes 43.7558 0.647 0.438 0.49 3.59
Random forest 2.3733 0.038 0.024 0.022 113.17
Decision tree 14.8618 0.153 0.149 0.147 403.92

TABLE X. COMPARISON BETWEEN PROPOSAL OF THIS STUDY AND RELATED WORKS

Method Dataset (name) # of example # of category Result


[4] CNN+RandomForest Frames of Videos 161907 13 ACC=90%
[3] Regularized Softmax Reg w/ Broad Classes CUB200-2011 11788 200 ACC=70%
[14] HSV+SVM CUB200-2011 11788 200 ACC=83.87%
CVIP 2018 Bird Species
[7] Mask R-CNN + Ensemble Model 150 16 Precision= 56.58
challenge
proposal of this
(Combine orginal (fc6+fc7) after PCA)+ANN JOP(new datset) 4340 434 ACC=71%
study
VI. CONCLUSION Since the size of the deep feature vector obtained from the
This study aims at investigating the use of deep learning VGG19’s layers (6 or 7) is very large (4096), we opt for
for birds’ identification system using VGG-19 for extracting Principal Component Analysis (PCA) and to do the
features from images. VGG-19 is one of the pre-trained dimensionality reduction. Moreover, it was created more
convolutional neural network (CNN) networks that used for feature vectors called statistical operations to generate more
image identification which was used in this paper to extract datasets from (fc6 and fc7) using average, minimum,
the features from birds’ images. maximum and combine of both layers.
Database of this study is contained 4340 images of 434 The created datasets (i.e. with PCA and without PCA), as
bird species obtained from scientific sources and where well as the datasets that created from statistical operations are
approval by Jordanian Bird Watching Association based on used as input for classification using various machine learning
scientific name. classifiers including Artificial neural networks (ANN), K-
Nearest Neighbor (KNN), Random Forest, Naïve Bayes and
In this study, the two layers in the structure of VGG19 to Decision Tree.
get the features were used layer 6 (called fc6) and layer 7
(called fc7); each layer consists of 4096 features. The results of investigation in this study include and not
limited to the following, the PCA used on the deep features
does not only reduce the dimensionality, and therefore, the

259 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 4, 2021

training/testing time is reduced significantly, but also allows [9] Hijazi, Samer, Rishi Kumar, and Chris Rowen. 2015. “Using
for increasing the identification accuracy, particularly when Convolutional Neural Networks for Image Recognition.” . IP Group,
Cadence. Retrieved from https://ip.cadence.com/uploads/901/cnn_wp-
using the ANN classifier. Based on the results of classifiers; pdf.
ANN showed high classification accuracy (70.9908), precision [10] Incze, A., Jancsó, H. B., Szilágyi, Z., Farkas, A., & Sulyok, C. 2018.
(0.718), recall (0.71) and F-Measure (0.708) compared to Bird sound recognition using a convolutional neural network. In 2018
other classifiers. IEEE 16th International Symposium on Intelligent Systems and
Informatics (SISY) :295-300 IEEE.
It is recommended to conduct more investigation to [11] Korzh, Oxana, Mikel Joaristi, and Edoardo Serra B. 2018.
improve accuracy results and to reduce training time using “Convolutional Neural Network Ensemble Fine-Tuning for Extended
different algorthms. Transfer.” In International Conference on Big Data, 110–23. Retrieved
from http://dx.doi.org/10.1007/978-3-319-94301-5_9.
REFERENCES
[12] Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E Hinton. 2012.
[1] Tayal, Madhuri, Atharva Mangrulkar, Purvashree Waldey, and Chitra “ImageNet Classification with Deep Convolutional Neural Networks.”
Dangra. 2018. “Bird Identification by Image Recognition.” Helix 8(6): 25: 1–9. Technologies.
4349–4352.
[13] The Royal Society For The Conservation Of Nature 2017.
[2] Albustanji, Abeer. 2019. “Veiled-Face Recognition Using Deep “Birdwatching in Jordan”. Retrieved from https://migratorysoaringbirds.
Learning.” Mutah University. birdlife.org/sites/default/files/jordan_birding_brochure.pdf.
[3] Alter, Anne L, and Karen M Wang. 2017. “An Exploration of Computer [14] Qiao, Baowen, Zuofeng Zhou, Hongtao Yang, and Jianzhong Cao. 2017.
Vision Techniques for Bird Species Classification.”. “Bird Species Recognition Based on SVM Classifier and Decision
[4] Atanbori, John et al. 2018. “Classification of Bird Species from Video Tree.” In 2017 First International Conference on Electronics
Using Appearance and Motion Features” Ecological Informatics 48: 12– Instrumentation & Information Systems 1–4.
23. [15] Sarayrah, Bayan mahmoud. 2019. “Finger Knuckle Print Recognition
[5] Brownlee, Jason. 2016. “How To Use Classification Machine Learning Using Deep Learning.” Mutah University.
Algorithms in Weka.” Retrieved from https://machinelearningmastery. [16] S. Al-Showarah et. al. 2020. “The Effect of Age and Screen Sizes on the
com/use-classification-machine-learning-algorithms-weka/. Usability of Smartphones Based on Handwriting of English Words on
[6] Cai, J., Ee, D., Pham, B., Roe, P., & Zhang, J. (2007, December). Sensor the Touchscreen”, Mu’tah Lil-Buhuth wad-Dirasat, Natural and Applied
network for the monitoring of ecosystem: Bird species recognition. In Sciences series, Vol. 35, No. 1, 2020. ISSN: 1022-6812.
2007 3rd international conference on intelligent sensors, sensor [17] Triveni, G., Malleswari, G. N., Sree, K. N. S., & Ramya, M (2020). Bird
networks and information 293-298. IEEE. Species Identification using Deep Fuzzy Neural Network Int. J. Res.
[7] Kumar, A., & Das, S. D. 2018. “Bird Species Classification Using Appl. Sci. Eng. Technol.(IJRASET), 8: 1214-1219.
Transfer Learning with Multistage Training”. In Workshop on Computer [18] C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie 2011. “The
Vision Applications 28-38. Springer, Singapore. Caltech-UCSD Birds-200-2011 Dataset.” Technical Report CNS-TR-
[8] Hassanat, A. (2018). “Furthest-pair-based binary search tree for 2011-001, California Institute of Technology.
speeding big data classification using k-nearest neighbors”. Big Data, [19] Welinder, Peter et al. 2010. “Caltech-UCSD Birds 200.” Technical
6(3): 225-235. ReportCNS-TR-2010-001, California Institute of Technology, 2010.

260 | P a g e
www.ijacsa.thesai.org
View publication stats

You might also like