Apple Detection in Natural Tree Canopies From
Apple Detection in Natural Tree Canopies From
images
Abstract
In this work we develop a real time system that recognizes occluded green apples within a tree
canopy using infra-red and color images in order to achieve automated harvesting. Infra-red provides
clues regarding the physical structure and location of the apples based on their temperature (leaves
accumulate less heat and radiate faster than apples), while color images provide evidence of circular
shape. Initially the optimal registration parameters are obtained using maximization of mutual
information. Haar features are then applied separately to color and infra-red images through a process
called Boosting, to detect apples from the background. A contribution reported in this work, is the
voting scheme added to the output of the RGB Haar detector which reduces false alarms without
affecting the recognition rate. The resulting classifiers alone can partially recognize the on-trees
apples however when combined together the recognition accuracy is increased.
Keywords: mutual information, multi-modal registration, sensor fusion, Haar detector, apple
detection
Introduction
In the last few years, object recognition algorithms are focusing on the efficient detection of objects
in natural scenes. A system is developed to recognize in real-time partially occluded apples regardless
of position, scale, shadow pattern and illumination within a tree canopy.
The work is motivated by the fact that labor for orchard tasks constitutes the largest expense
(Jiménez et al., 2000), and hence there is a need to develop autonomous robotic fruit picking
systems. Here we address the first step in such a system by tackling the problem of on tree green
apple detection using real-time machine vision algorithms. The complexity of the task involves
the successful discrimination of ‘green’ apples within scenes of ‘green leaves’, shadow patterns,
branches and other objects found in natural tree canopies. Color and edges are features highly
dependent on illumination while texture is highly sensitive to the proximity (scale) of the object.
An excellent review regarding apple recognition systems was presented in (Jiménez et al., 2000b).
The concept of background modeling using Gaussian mixture color distributions in RGB images
was used in Tabb et al., (2006). This algorithm detected 85 to 96 percent of both red and yellow
apples assuming a uniform background in an artificial environment. Color distribution models for
fruit, leaf and background classes were used in Annamalai et al., (2003) in a citrus fruit counting
algorithm. In Stanjnko et al., (2004) pixel thermal values were mapped to RGB values and detected
using the normalized difference index. However the efficiency of the algorithm was affected by
the apple’s position on the tree and degree of sunlight. In Sapina (2001), textural features extracted
from the gray level co-occurrence matrix were used to discriminate between warm objects and their
background in thermal images. On the same vein, a threshold selection approach was proposed by
Fernandez et al. (1993) based on apple’s texture features in grayscale images. The authors assume
that all the apples have a bright spot (due to their exposure to sunlight) and the apple region is
Figure
Figure 1. Flowchart of 1. Flowchart
multimodal of procedure.
apple detection multimodal apple
detection procedure
Table 1. Registration parameteres RMS error using the five 5 similarity indices.
Measure RMS
Ds Dq Dt x(%) Dty(%)
Figure 3. Eleven Haar features: edge, line, diagonal, center surround and rotated features.
x y
I ( x , y ) = ∑∑ N ( x' , y' ) Figure 3. Eleven
(3) Haar features: edge, line, diagonal, center (3)
surround and rota
x' =0 y' =0
⎪⎩0 , otherwise
, ei ,2 = ei ,1
Table 2. Dataset used to train the classifiers.
The majority voting scheme was used to accept or reject the hypothesis about whether the detected
sub-windows were or were not apples. In addition, two rules were implemented to accept a
hypothesis: (a) the detected window does not include sub-windows, (b) the detected sub-window
size is smaller than k*median (W,H), where we used k=1.5.
The following subsections describe the performance of the multimodal apple detection system
using first, the RGB and IR Haar detectors independently.
Classfiers
D1
Decision
Rule
D2
Label
w ∑ Max
D3
Table 3. Detection rate using the color Haar detector with and without the voting.
Hits Missed FA
Figure.
Table 4. Hit rate and false 6. Nine
alarms applesofdetected
per stage the Haarbydetector.
the IR Haar detector
17 274/504 0.456 80
18 263/504 241/504 61
19 245/504 259/504 51
20 231/504 273/504 47
Conclusions
We presented an algorithm for apple detection in natural scenes using a multimodal approach.
Initially the optimal registration parameters are obtained using maximization of mutual information
and are stored for later use. Then, Haar features in color and infra-red images are obtained through
an Adaboost algorithm. Later, a voting scheme is used to improve the detection results. Finally, the
detection results are fused after applying the best transformation found in the first step. A contribution
reported in this work, is the voting scheme added to the output of the RGB Haar detector which
drops the false alarms with little effect on the recognition rate. The resulting classifiers alone can
partially recognize the on-trees apples however when combined together the recognition accuracy
is increased. Although the algorithm did not detect all apples and contains false alarms, the main
concern is its implementation in a robotic fruit picking scenario. In this case, the performance of
the algorithm seems to be a sufficient for prepositioning a robot picking arm. Since images will
be acquired from cameras mounted on the robotic arm which can be oriented to take close up
pictures, gradually all the apples in the tree can be found and false alarms can be identified as the
robot arm explores the canopy. In spite of the relatively low recognition accuracy, this is the first
system, to our knowledge, that can deal with ‘green’ apple detection, that are partially occluded
with shadow patterns, from a tree canopy of ‘green’ leaves, branches, and sky background. Future
work will include increasing the robustness of the Haar classifiers by increasing the sample set,
and incorporate morphologic information to the voting scheme.
Acknowledgments
This research was supported by Research Grant No US-3715-05 from BARD, The United States
- Israel Binational Agricultural Research and Development Fund, and by the Paul Ivanier Center
for Robotics Research and Production Management, Ben-Gurion University of the Negev.
Annamalai, P. and Lee, W.S., 2003. Citrus Yield Mapping System Using Machine Vision. ASAE Annual
International Meeting. Paper number 031002.
Fernandez-Maloigne, C., Laugier, D. and Boscolo, C., 1993. Detection of apples with texture analyses for an
apple picker robot. Intelligent Vehicles ‘93 Symposium. 1993: 323-328.
Freund Y. and Shapire, R.E., 1996. Experiments with a new boosting algorithm. In Machine Learning: Proc.
of the 13th Int. Conf., pp. 148-156.
Jiménez, A.R., Ceres, R. and Pons, J.L., 2000a. A vision system based on a laser range-finder applied to robotic
fruit harvesting. Machine Vision and Applications 11(6): 321-329.
Jiménez, A.R., Ceres, R., and Pons, J.L., 2000b. A survey of computer vision methods for locating fruit on
trees. Trans. ASAE 43(6): 1911-1920.
Lienhart, R. and Maydt, J., 2002. An Extended Set of Haar-like features for Rapid Object Detection. In Proc.
of the IEEE Conf. on Image Processing (ICIP ’02), pp. 155-162.
Sapina, R., 2001. Computing textural features based on co-occurrence matrix for infrared images. Proceedings
of 2nd International Symposium on Image and Signal Processing and Analysis, 2001. ISPA 2001. 373-376.
Stanjnko, D., Lakota, M. and Hocevar, M., 2004. Estimation of number and diameter of apple fruits in an
orchard during the growing season by thermal imaging. Computers and Electronics in Agr. 42: 31-42.
Tabb, A.L. Peterson, D.L., and Park, J., 2006. Segmentation of Apple Fruit from Video via Background
Modeling. ASABE Annual International Meeting, 2006. Paper number 063060.
Viola, P. and Jones, M.J., 2004. Robust real-time face detection. International Journal of Computer Vision,
57(2):137-154.
Viola, P. and Wells, W.M. III, 1995. Alignment by maximization of mutual information. In Proc. 5th Int. Conf.
Computer Vision, June 1995, pp. 16-23.
Zhao, J., Tow, J. and Katupitiya, J., 2005. On-tree fruit recognition using texture properties and color data.
IEEE/RSJ Int. Conf. Intell. Robots and Systems, pp. 263-268.