|
This part is currently being updated.
Acquisition and initialisation of models
The first part of the problem is the model-driven segmentation
in Laplacian Eigenspace. The input abstraction layer is voxels and the
neighbourhood relationship of the voxels is used to compute the Laplacian
of the adjacency graph. The nodes are then mapped to 6-D Laplacian
eigenspace using the eigenvectors corresponding to the smallest non-zero
eigenvalues of the Laplacian matrix. We show that this transformation maps
segments whose lengths are greater than their thicknesses to 1-D curves in
eigenspace. We can then fit splines to these 1-D curves and segment them at
their joints. The two images on the left correspond to the 6-D eigenspace
and we have segmented one segment by fitting a spline.
The second part of the project is to acquire a set of key frames where the
voxels have been segmented and registered using a prior model and a
probabilistic registration method. This set of key frames can be used to
estimate the human body model in two steps: estimate a skeleton based human
body model and joint locations using human body statistics and computed
skeleton, and then fit a super-quadric model using the segmented voxels.
The images (from left to right) denote the voxels (unsegmented), voxels
(segmented), computed skeleton curve and estimated super-quadric skeleton
model. Five frames were used to estimate the model.
Tracking using multiple cues
Tracking of a complex articulated object such as a human being is a
difficult task and we need to use human models to obtain robust and
accurate results.
In our approach we use multiple cameras and a human shape model to track
the human motion. We use both the motion and tructural cues that we can
obtain from the synchronized video in a predictor-corrector framework
(Iterated Extended Kalman Filter). Motion cues, though reliable and
and robust, by themselves are not sufficient because the error in the
estimation tends to accumulate over time and we eventually lose track.
Structural information such as edges and rough silhoettes do not suffer
from that problem, but they are difficult to estimate and use in the
estimation. They can be efficiently used if the initial estimate of the
pose is close to the correct value. In our method we predict the change
in pose using the motion cues and correct the pose using static cues
such as edges and motion segmentation.
|