Practical Comparable Data Collection for Low-Resource Languages via Images

Abstract:

We propose a method for curating high-quality comparable training data for low-resource languages that does not require bilingual annotators. The method uses a carefully selected set of images as a pivot between the source and target languages: captions for each image are collected independently in both languages. Human e...
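The image-pivot idea can be illustrated with a minimal sketch, assuming each caption is stored as an `(image_id, caption)` tuple. The function name `pair_captions_by_image` and the data layout are hypothetical and chosen for illustration only; this is not the authors' pipeline, just one way captions written independently in two languages could be aligned through a shared image.

```python
from collections import defaultdict


def pair_captions_by_image(source_captions, target_captions):
    """Align independently written captions through their shared image ID.

    Both arguments are lists of (image_id, caption) tuples (an assumed layout).
    Returns a list of (image_id, source_caption, target_caption) tuples.
    """
    # Group target-language captions by the image they describe.
    target_by_image = defaultdict(list)
    for image_id, caption in target_captions:
        target_by_image[image_id].append(caption)

    # Every source caption is paired with every target caption of the same image.
    pairs = []
    for image_id, source_caption in source_captions:
        for target_caption in target_by_image.get(image_id, []):
            pairs.append((image_id, source_caption, target_caption))
    return pairs


# Placeholder captions; real data would come from monolingual annotators.
source = [("img1", "a dog runs on the beach"), ("img2", "a red car")]
target = [("img1", "<target-language caption for img1>"),
          ("img3", "<target-language caption for img3>")]
pairs = pair_captions_by_image(source, target)
```

Because the two captions of an image are written independently, the resulting pairs are comparable rather than strictly parallel: they describe the same scene but are not translations of each other. Images captioned in only one language yield no pair.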
