Fall Detection2018
Fall Detection2018
Abstract—Automatic human fall detection is a challenging task of the data collected by each sensor, so a plenty of sensors
of healthcare in smart homes, and video cameras have been proved should be installed, which is costly and difficult. Also, the data
to be efficient in addressing this problem. Although existing has high noise to signal ratio. Video cameras can be
methods perform relatively well, they are all built upon “hand- conveniently mounted in various places of the residence, and
crafted” features, thus constraining the performance of the model they collect data with rich information content that can be used
to some presumed conditions and scenarios, and making it for several tasks. The downside, however, is that cameras are
vulnerable to any deviation from the assumed settings. In this considered intrusive and introduce privacy issues if video
paper, we propose a deep-learning-based approach for human fall content is captured and stored. Nowadays, inexpensive depth
detection, using long short-term memory neural network. Our
cameras, such as the Microsoft Kinect, can address some of the
model is not restricted to any specific circumstances, and
privacy issues and under proper implementation conditions
performance evaluations show that it outperforms all the existing
methods. could be a promising and feasible option for human fall
detection in the context of smart home.
Keywords—human fall detection; deep learning; long short-term Limited research has been done so far on depth-camera-
memory (LSTM); smart home; depth camera based fall detection. In [3], Marzahl et al. apply a manual pre-
I. INTRODUCTION segmentation to exclude areas where falling is not possible to
happen, e.g., cupboards. Then, a decision tree detects any person
The concept of a "smart home" is a major step towards lying on the floor by exploiting spatial characteristics of
wellness and improved quality of life and a hot interdisciplinary segmented objects. Out of the 55 fall samples collected in a
research topic bringing together artificial intelligence, cloud laboratory setting, 93% of them have been classified correctly.
computing, communications and networks, psychology and In [4], Mastorakis and Markis use the 3D bounding box of the
healthcare [1]. Monitoring the well-being of the residents is an subject, extracted by OpenNI framework [5], and employ
expected service to be provided by a smart home. Although thresholding on the height, width, and depth of the bounding box
older adults, patients newly released from hospital and as well as their first derivatives to detect falling incidents. All
chronically ill people are more vulnerable to falling, everyone is falls have been detected with no false alarms for a dataset of 48
prone to it. It can be specifically dangerous for people living fall samples and 112 non-fall samples, collected in a laboratory
alone, as the incident might take hours or even days to be setting. Bian et al. in [6] extract the key joints of the human body
discovered with the person remaining injured or unconscious, using their own proposed algorithm, and apply support vector
while it is very important, if not vital, for the fallen person to be machine (SVM) on the 3D trajectory of the head joint to detect
immediately taken care of and possibly transported to the fall incidents. In evaluation on a dataset of 380 samples,
hospital. Hence, there is a need for a monitoring system, in a collected in a laboratory setting and containing equal number of
smart residence, which can automatically detect fall incidents fall and non-fall samples, all falls have been detected and 9 false
and send alarms to the emergency medical center and/or family alarms have been generated. In [7], Rougier et al. detect the
and friends. ground plane of the room, segment the person from the
Devices used for fall detection are divided into three background, localize and track the 3D centroid of the person,
categories: wearable devices, ambient sensors, and video and detect falls by thresholding the centroid height relative to
cameras, each having its own pros and cons, which makes them the ground and the centroid velocity. Only one fall has not been
complementary to each other. Wearable devices are relatively detected out of 25 fall samples, and no false alarms have been
inexpensive and can directly measure kinematic quantities. reported on 54 non-fall samples. All the samples have been
However, they cannot recognize complex physical motions and collected in a laboratory setting. In [8], Planinc and Kampel
can be considered intrusive [2]. Ambient sensors, on the other employ the skeletal data extracted by Microsoft Software
hand, are versatile in application and can be installed anywhere Development Kit (SDK) for Kinect to calculate the orientation
or embedded in any object without interfering with the resident’s of the person’s major axis and the height of the spine relative to
daily life. Nevertheless, limited information can be obtained out the ground. These features are then used for fall detection. The
Output Layer
(Classes)
Transfer Learning
General Features
Input Layer
(Sequence of Body
Skeletons)
(1) Multiclass LSTM (2) Two-class LSTM
Fig. 4. The ROC curves and their corresponding AUCs for different Fig. 5. The predision-recall curve for the optimal parameters
numbers of the hidden units in both layers
layers up to three. As shown in Fig. 2, the performance of the optimized model, showing the precision and recall values at the
two-layer LSTM was good enough and adding the third layer threshold of 0.5. Table 2 compares the performance of our
did not improve it much, so we did not go above three layers, model with that of the best depth-map-based fall detection
and set the number of the layers at two. The dropout ratio and methods, which are all built using hand-crafted features. With
the number of the hidden units were selected through random 93% precision and 96% recall, our deep model outperforms
search. According to Fig. 3, the performance of the model is the Rougier’s [7] and Plannic’s [8] with considerable margins.
best at the dropout ratio of 0.5. Fig. 4 shows that the best
performance is obtained with 20 hidden units for both layers, V. CONCLUSION
though the combination of 70 units for the first layer and 30 units In this work, we proposed a deep learning model for human
for the second layer is also close to optimum. fall detection in videos captured by depth cameras, with
potential application in smart homes. Our approach takes depth
In order to compare our proposed model with the existing map sequences and determines if a falling incident has
methods, we had to use a metric other than AUC, since the other happened, so that an alarm can be generated and sent to
papers do not provide AUC for their methods. This metric, family/friends or medical service staff. This is the first deep-
which is not as comprehensive as AUC in evaluating the learning-based method in the field of human fall detection. It has
performance, consists of precision and recall values at the fixed area under the ROC curve of 0.99, and it outperforms all the
threshold of 0.5. Fig. 5 presents the precision-recall curve of our existing fall detection algorithms, which are based on hand-
crafted features.