3-D Object Pose Determination Using Computer Vision
3-D Object Pose Determination Using Computer Vision
00
Printed in Great Britain. All rights reserved Copyright © 1990 Pergamon Press plc
Paul M. Griffin
School of Industrialand Systems Engineering
Georgia Instituteof Technology
Atlanta, Georgia 30332--0205
215
216 Proceedings of the 12th Annual Conference on Computers & Industrial Engineering
algorithm is not dependent on how the range information Superquadrics have some very useful properties
was obtained. for computer vision. First, superquadrics have a well
defined inside--outside function. Given a point in space,
OBJECT REPRESENTATION the inside--~utside function were the point lies relative the
the surface of the superquadric. For the superellipse:
One of the goals of this research is to develop a
pose determination methodology which can be used for
rr, 1~/~ r~ 1~/6~ ~ / "
objects of general shape. In order to achieve this goal, it
is important to use a general object representation. Many
different object representations for computer vision have
= [LL J +L J
=
be the only classification discussed in this paper. A
discussion of the other representations may be found in
[1]. The surface of the superellipsoid is defined by the n07,¢) 1/aB co s ;~'-'E1( Ti)sin ~'-6 :g(W)
spherical product of the sine--codne and secant--tangent
curves. The 3--D vector form is: s/~s sin~-6z(~)
• (~,~) =
aS cos6S(e)cos6~(w)
a~ cos61(Tl)sin6g(W) .
l Barr [1,2] has developed a set of angle preserving
transformations for superquadrics. The transformations
include quadratic bending, linear tapering and cavity
deformation. These transformations increase the modeling
a3 sinEl(~) capability of superquadrics. For details, the reader is
referred to Barr [1,2]. Figure 3 shows example super--
quadrics with these transformations applied.
--~'/2 ~ ~] ~ ~'/2 While superquadrics are quite descriptive in
themselves, they are not general enough to describe 3--D
objects with arbitrary shapes. Pentland [16] argues that
The variables al, a~ and a3 are the scale parameters
superquadrics can be put together like lumps of clay to
which determine the length along the z, 11 and z axes
define complex objects. We use a relational graph in
respectively. The variables 61 and 6~ are called the
order to achieve this.
shape parameters. The shape parameters define the
A relational graph G is a four--tuple (V,E,Lv,Le),
roundness of the curve. Figure 2 shows the 2--D version
where V is a n o n - e m p t y vertex set, E is an edge set, and
of the superellipsoid curve for varying values of e.
Lv and L e are the vertex and edge labels respectively.
Each element of V is a superquadric primitive. If there is
a relationship between superquadrics then this is expressed
in E. The vertex label, Lv, is a record which specifies the
type of superquadric, the shape and scale parameters, and
the absolute positional parameters. The edge label, Le,
describes the relationship between superquadric primitives
in terms of Boolean combinations (e.g. or, and, differ-
ence operations).
O = Tr.O'
qq
el
Figure 3. 3--D Superquadric Examples
where:
In order to determine the minimum enclosing
1 spheres, we used an extension of the E]singa--Hearn a l g o -
R I R.Tt rithm [6]. The E]singa--Hearn algorithm finds the mini--
m u m enclosing circle for the 2--D problem by choosing
T~--
J
I. . . . , points which define increasingly larger circles. The
0 0 0 1 1 algorithm has a worst-case complexity of O(n2) where n
is the number of points. We have modified the Elsinga--
Hearn algorithm to work in 3--D. The modified version
c0zC0y cO:sOysO~-~OzcOxcO~sOycOx+sO:sOx has the same worst case complexity. Details may be
found in [i0].
R= s0zc0y *OzaOysOx+cOzcOxsOzaOycOx-cOzlOx In order to determine the object pose, the follow--
ing steps are taken:
-sOy coy s 0x c0yC0x
1. Determine the minimum enclosing spheres for
both the database and observed objects.
T=[T~ Ty T~I,
2. Align the spheres at their centers.
where c a = co.(a) and sa = sin(a).
One way in which the rotational and translations] 3. Make an initialassignment of 0x, 0y and 0z
offlmts can be determined is by a nonlinear least squares by using the matrix of central moments [13].
estimation technique which utilizes the inside--outside
function of the superquadric. Messimer [15] presents a 4. Use a nonlinear least squares estimation
two-ttage method using nonlinear least squares e s t i m a - technique to compute the final estimates of the
tion for 2--D parts. This method can easily be extended rotational offsets.
to 3--D parts. However, for the 3--D case, there are six
parameters which must be solved for instead of one. To The nonlinear least squares estimation technique
overcome this problem, we make use of an enclosing mentioned in step 4 is an extended version of the 2--D
object. method presented in Messimer [15]. Details of the
In order to define the translational offsets, some approach may be found in [10].
reference point for the object must be defined. W e define A couple of points should be mentioned about the
this reference point as the center of the minimum enclos--- approach. First, the slowest part of the approach occurs
ing sphere which contains the object. The minimum at step 4. It is possible to speed up this step, however, by
enclosing sphere is formulated as a minimax problem. For subMmpling the pointsprovided in the rangedata. In
a set of points P, where each Pi E P is defined by position fact, when the iterations are first started, a small number
vector <zi,~i,zi>, we want to minimize the function: of points can be used, and as the solution starts to
converge, more points can be added for refinement.
Another possibility is that for most applications, the
/(a,b,c) = max[(zi--,)~+(yi--b)~
+(zi--c)~II~. object will be in one of only a few stable positions. This
i
is due to the fact that for most applications, the object is
sitting of a fiat surface. If the object is in a stable
For the observed object, the point set P is given by the position, then only one rotational parameter need be
range data. For the database object, however, this point solved for. If the set of stable positions is fairly small,
set must be specified. These can either be specified then the process can be repeated for each element in the
explicitly by the user when the object is being entereu m set. Also, the initialrotational offset can determine what
the database, or can be determined by defining a step size the stable position is. Having to solve for only one rota-
for varying values of r] and w. tional parameter greatly speeds up the process.
218 Proceedings of the 12th Annual Conference on Computers & IndustrialEngineering
where r is the radius defined in the a~umption made 3. C.A. Chang, 3. Goldman and J.M. Pan, Part posi-
about the bound on each range point. If this bound is tioning with feature masks for computer vision systems,
violated then not there is one of two po~ibilities. Either lIE Transactions, 19, 182--189 (1987).
not enough range data k available for pose determination
in which c u e more data must be collected, or the 4. R.T. Chin and C.R. Dyer, Model--basod recognition in
observed object differs for the database object. robot vision, Computing Surrey, 18, 68--108 (1986).
One final point needs to be discussed. As
mentioned earlier, Pentlsnd [16] argues that the object 5. M. Dhome, M. Richetin, J.T. Lapreste, and G. Rives,
Ihould be represented at different levels of granularity. Determination of the attitude of 3 - D objects from a
Both Pentland [16] and Sollna and Bajcsy [17] have single perspective view, IEEE Transactions on Pattern
developed a technique to determine the shape and scale Anal#sis and Machine Intelligence, 11, 1265--1278 (1989).
parameters for a superquadric. In this case the object is
not described by superquadrlc primitives,but by a single 6. 3. Elzinga and D.W. Hearn, Geometric solutions for
superquadric. The method presented will stillwork in some minimax location problems, Transportation Science,
this case. In fact,any levelof granularityis acceptable as 6, 379--394 (1972).
long as the object is represented by some set of super--
quadrlcs. 7. M. Gardner, The superellipse: a curve that lles
between the ellipse and the rectangle, Scientific American
CONCLUSIONS 213, 224--234 (1985).
This paper has presented a method to determine 8. S.J. Gordon and W.P. Seering, Real--time part p o s i -
the object pose of 3 - D objects. The method require8 no tion sensing, IEEE Transactions on Pattern Analysis and
preproceuing steps such as the determination of edges or Machine Intelligence, 10, 374--388 (1988).
other higher moments. The method is also applicable to
3 - D objects of general shapes due to the superquadric-- 9. P.M. Griffin, Correspondence of 2--D projections by
based repre~ntation used. Finally, by the use of an bipartite matching, Pattern Recognition Letters, 9, 361--
enclosing object, the method reduces the number of 308 (1989).
parameters which must be solved for the nonlinear least
~uares estimation. I0. P.M. Griffin, Determination of object pose using
range data, Working Paper, School of Industrial and
Systems Engineering, Georgia Institute of Technology,
Atlanta, G A (1990).