We propose a method to estimate the pose of a hand in a sequence of stereo images. This is a difficult problem since a hand is a complex object with a high number of degrees of freedom, and automatically segmenting the hand in the images is not easy. Our method is intended to solve these problems. Two video cameras feed two images to a stereo correlation algorithm, allowing the 3D reconstruction of the scene. Then a 3D articulated model of the hand, made of truncated cones and spheres, is fitted to this reconstruction in order to estimate the pose of the palm and fingers. We are dealing with model-based tracking of hand movement, in which we suppose that the pose of the hand is known in the first images.
展开▼