Unsupervised Lifelong Learning for Robots
Unsupervised Lifelong Learning for Robots
Here 0 ≤ η ≤ 1 is the mixing proportion between the local information gain, which can be measure by computing the
and the global strategies. perplexity score of the observations in the context of the data
observed thus far. We assume that robot can make observations
IV. E XPERIMENT at potential next steps in the exploration path. The problem of
getting stuck in a local optima can be addressed by adding a
We simulated our proposed topic model to run in realtime repulsive potential from previously visited locations [4]. We
at 5 frames per second, on an intel core i7 CPU. The original hypothesize that such exploration can be used for collecting
data was collected at 0.2 frames per second. We used a param- observation data that can be used to study patchy and transient
eters values α = 0.1, β = 10.0, γ = 1e − 6, η = 0.5 for our phenomena, such as animal mating aggregations underwater,
experiment.Our preliminary experiments have suggested that or feeding events, which are otherwise hard to study using data
topic modeling based techniques can be used to characterize collected over pre-determined trajectories.
previously unknown benthic scenes. The dataset contains over
2000 images. Results shown in Fig. 1 demonstrate that we
can characterize various the substrate types in a completely VI. C ONCLUSION
unsupervised manner and in realtime. These results are from We have presented here a realtime technique for auto-
the data collected by SeaBED vehicle [8], on an expedition to matically characterizing the scene as observed by a robot.
explore Hannibal seamount in Panama [6]. We have presented some preliminary results hinting towards
enabling the ability of the marine robot to continuously learn
V. D ISCUSSION
various benthic substrate types in-situ. We believe that such
The proposed life long unsupervised learning approach can capability can allow future exploration robots to adaptively
be utilized for lifelong exploration of many different types. target their data collection towards the more interesting regions
in the world that are represented by locations with high model
Having the ability to characterize the substrate type can perplexity, or rare topic labels, such as demonstrated by our
be used for substrate specific data collection. The exploration prior work [4]. The unsupervised and realtime nature of the
mission an be defined to either target a specific substrate type approach also makes it an ideal tool for post-mission analysis
of interest, or to to spread the data collection efforts equally of large observation datasaets.
amongst various substrate types.
The proposed topic modeling can also be used to perform ACKNOWLEDGMENT
exploration that aims to maximize information gain in semantic
space [4]. At each step in time, we can evaluate the explo- This work was supported by The Investment in Science
ration utility of the neighboring locations in terms of their Fund at WHOI.
R EFERENCES Y. Girdhar, R. C. Holleman, J. Churchill, H. Singh, and D. K. Ralston. A
crab swarm at an ecological hotspot: patchiness and population density
[1] A. Banerjee and S. Basu. Topic Models over Text Streams: A Study from AUV observations at a coastal, tropical seamount. PeerJ, 4:e1770,
of Batch and Online Unsupervised Learning. In SIAM International apr 2016.
Conference on Data Mining, page 6, 2007.
[7] A. Ranganathan, S. Matsumoto, and D. Ilstrup. Towards illumination
[2] K. R. Canini, L. Shi, and T. L. Griffiths. Online Inference of Topics
invariance for visual localization. In 2013 IEEE International Confer-
with Latent Dirichlet Allocation. Proceedings of the International
ence on Robotics and Automation, pages 3791–3798. IEEE, may 2013.
Conference on Artificial Intelligence and Statistics, 5(1999):65–72,
2009. [8] H. Singh, C. Roman, O. Pizarro, R. Eustice, and A. Can. Towards
[3] Y. Girdhar, W. Cho, M. Campbell, J. Pineda, E. Clarke, and H. Singh. High-resolution Imaging from Underwater Vehicles. The International
Anomaly Detection in Unstructured Environments using Bayesian Non- Journal of Robotics Research, 26(1):55–74, jan 2007.
parametric Scene Modeling. In IEEE International Conference on [9] D. M. Steinberg, O. Pizarro, and S. B. Williams. Hierarchical Bayesian
Robotics and Automation (ICRA), 2016. models for unsupervised scene understanding. Computer Vision and
[4] Y. Girdhar and G. Dudek. Modeling curiosity in a mobile robot for long- Image Understanding, 131:128–144, feb 2015.
term autonomous exploration and monitoring. Autonomous Robots, sep [10] Y. W. Teh and M. I. Jordan. Hierarchical Bayesian Nonparametric
2015. Models with Applications. Bayesian nonparametrics, pages 158—-207,
[5] C. McManus, B. Upcroft, and P. Newman. Learning place-dependant 2010.
features for long-term vision-based localisation. Autonomous Robots, [11] Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei. Hierarchical
39(3):363–387, oct 2015. Dirichlet Processes. Journal of the American Statistical Association,
[6] J. Pineda, W. Cho, V. Starczak, A. F. Govindarajan, H. M. Guzman, 101(476):1566–1581, dec 2006.