|
Dynamic 3D Video |
||||||||||||||||
Summary:Using a multi-resolution hierarchy of temporally-varying subdivision surfaces as a modeling primitive we recover the shape and non-rigid 3D motion of a talking head that is observed from multiple view points. The results capture the shape and motion accurately. Main MenuTopic Menu |
Spatio-temporal Models from VideoIntroduction.We demonstrate a method to compute three-dimensional (3D) motion fields on a face. Eleven synchronized and calibrated cameras are positioned around a talking person (see Fig.4), and observe its head in motion (Figs.1-3). We represent the head as a multi-resolution subdivision surface, which is fitted in a global optimization step to the spatio-temporal contour and multi-camera stereo data derived from all images. The non-rigid displacement of the mesh from frame to frame, the 3D motion field, is initialized from the spatio-temporal derivatives in all the images and then refined together with the shape using spatio-temporal stereo information. We integrate these cues over time, thus producing an animated representation of the talking head. Our ability to estimate 3D motion fields points to a new framework for the study of action. The 3D motion fields can serve as an intermediate representation, which can be analyzed using geometrical and statistical tools for the purpose of extracting representations of generic actions.
Results of the Spatial and Temporal Structure Reconstruction. The final result of the reconstruction algorithm can be seen in the 360 rotation around the model (Fig.5), unfortunately only 2 color cameras were available, thus we have this mixture of gray-scale and color textures.
Further Information. For more details about the multi-resolution mesh representation and the reconstruction algorithm please read the accompanying paper
|
||||||||||||||||
| © 2003 by Jan Neumann jn at videogeometry dot com (return to top of page) | |||||||||||||||||