Scaling Up Sign Spotting Through Sign Language Dictionaries

Gül Varol,Liliane Momeni,Samuel Albanie,Triantafyllos Afouras,Andrew Zisserman

International Journal of Computer Vision（2022）

引用 12|浏览101

暂无评分

摘要

The focus of this work is sign spotting –given a video of an isolated sign, our task is to identify whether and where it has been signed in a continuous, co-articulated sign language video. To achieve this sign spotting task, we train a model using multiple types of available supervision by: (1) watching existing footage which is sparsely labelled using mouthing cues; (2) reading associated subtitles (readily available translations of the signed content) which provide additional weak-supervision ; (3) looking up words (for which no co-articulated labelled examples are available) in visual sign language dictionaries to enable novel sign spotting. These three tasks are integrated into a unified learning framework using the principles of Noise Contrastive Estimation and Multiple Instance Learning. We validate the effectiveness of our approach on low-shot sign spotting benchmarks. In addition, we contribute a machine-readable British Sign Language (BSL) dictionary dataset of isolated signs, BslDict , to facilitate study of this task. The dataset, models and code are available at our project page.

查看译文

关键词

Sign language recognition, Sign spotting, Few-shot learning

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要