A Sea Creatures Classification Method Using CNN
A Sea Creatures Classification Method Using CNN
Abstract: Recently, Autonomous Underwater Vehicles (AUVs) provide high resolution seafloor images to the
researchers. The provided images are classified by the researchers one by one, and this task increases the burden on the
researchers. In this research, we propose an automatic classification method for the sea creatures usingٻconvolutional
neural networks to provide theٻrequested information, such as crab images, to researcher. The proposed method is
comprised image enhancement process, segmentation process and classification process. In the image enhancement
process, which is used Retinex model, the visibility of seafloor images is improved. For the candidate’s area selection,
the Saliency map is employed to extract the represented areas in seafloor images. In the classification process, the
selected areas are recognized based on the biological group. The total candidate detection rate was 64%, and the total
recognition accuracy was 67% by the evaluation.
Authorized licensed use limited to: University of Tasmania. Downloaded on March 12,2021 at 23:25:32 UTC from IEEE Xplore. Restrictions apply.
In the Retinex model, an image comprises an L is estimated by the a Gaussian filters, as follows;
illumination factor and a reflectance factor. The ଵ ି൫௫ మ ା௬ మ ൯
ܮఒ ሺݔǡ ݕሻ ൌ ܫఒ ሺݔǡ ݕሻ כ ή ቀ ቁ, (2)
illumination factor, which is from light source, causes ଶగఙ మ ଶఙ మ
low visibility because illumination contains light The symbol “ ”כrepresents convolution and σ is the
attenuation. The reflectance is the uninfluenced color of standard deviation of the distribution. The image
object by illumination. The Retinex model is; enhancement process using Retinex model is shown in
Fig.4, and the processed results are shown in Fig. 5.
ܴఒ ሺݔǡ ݕሻ ൌ ߙሼܫఒ ሺݔǡ ݕሻ െ ܮఒ ሺݔǡ ݕሻሽ ߚ. (1) For the image enhancement, the used parameters are
where I represents input image, λ is color plane, (x, y) is σ: 128, α: 384, β: 128. The parameters are selected
manually based on image size and approximate
the image coordinates, L is the estimated illumination, α
photographed altitude. The enhanced images show
is scaling gain parameter, β is scaling offset parameter,
improved color and contrast.
and R is the output of process.
Fig. 5 Processed results by image enhancement (Used parameters- σ: 128, α: 384, β: 128)
421
Authorized licensed use limited to: University of Tasmania. Downloaded on March 12,2021 at 23:25:32 UTC from IEEE Xplore. Restrictions apply.
2018 18th International Conference on Control, Automation and Systems (ICCAS 2018)
Oct. 17~20, 2018; YongPyong Resort, PyeongChang, GangWon, Korea
End
422
Authorized licensed use limited to: University of Tasmania. Downloaded on March 12,2021 at 23:25:32 UTC from IEEE Xplore. Restrictions apply.
Fig. 7 Classification process result
423
Authorized licensed use limited to: University of Tasmania. Downloaded on March 12,2021 at 23:25:32 UTC from IEEE Xplore. Restrictions apply.