Real-Time Automated Video and Audio Capture with Multiple Cameras and Microphones
VLSI Signal Processing, pp. 81-99, 2001.
person trackingtalker trackingacoustic localizationhead pose estimationspeech enhancementMore(2+)
This work presents the acoustic and visual-based tracking system functioning at the Harvard Intelligent Multi-Media Environments Laboratory (HIMMEL). The environment is populated with a number of microphones and steerable video cameras. Acoustic source localization, video-based face tracking and pose estimation, and multi-channel speech e...More
Full Text (Upload PDF)
PPT (Upload PPT)