Large-Scale Visual Speech Recognition
arXiv: Computer Vision and Pattern Recognition, Volume abs/1807.05162, 2019, Pages 4135-4139.
This work presents a scalable solution to open-vocabulary visual speech recognition. To achieve this, we constructed the largest existing visual speech recognition dataset, consisting of pairs of text and video clips of faces speaking (3,886 hours of video). In tandem, we designed and trained an integrated lipreading system, consisting of...More
PPT (Upload PPT)