Large-Scale Visual Speech Recognition

Yannis M. Assael
Yannis M. Assael
Thomas Paine
Thomas Paine
Cían Hughes
Cían Hughes
Utsav Prabhu
Utsav Prabhu
Lorrayne Bennett
Lorrayne Bennett
Marie Mulville
Marie Mulville
Ben Coppin
Ben Coppin

arXiv: Computer Vision and Pattern Recognition, Volume abs/1807.05162, 2019, Pages 4135-4139.

Cited by: 43|Bibtex|Views140|Links
EI

Abstract:

This work presents a scalable solution to open-vocabulary visual speech recognition. To achieve this, we constructed the largest existing visual speech recognition dataset, consisting of pairs of text and video clips of faces speaking (3,886 hours of video). In tandem, we designed and trained an integrated lipreading system, consisting of...More

Code:

Data:

Your rating :
0

 

Tags
Comments