GECKO - A Tool for Effective Annotation of Human Conversations.

Golan Levy,Raquel Sitman,Ido Amir, Eduard Golshtein, Ran Mochary, Eilon Reshef,Roi Reichart,Omri Allouche

INTERSPEECH(2019)

引用 5|浏览5
暂无评分
摘要
With the dramatic improvement in automated speech recognition (ASR) accuracy, a variety of machine learning (ML) and natural language processing (NLP) algorithms are designed for human conversation data. Supervised machine learning and particularly deep neural networks (DNNs) require large annotated datasets in order to train high quality models. In this paper we describe Gecko, a tool for annotation of speech and language features of conversations. Gecko allows efficient and effective segmentation of the voice signal by speaker as well as annotation of the linguistic content of the conversation. A key feature of Gecko is the presentation of the output of automatic segmentation and transcription systems in an intuitive user interface for editing. Gecko allows annotation of Voice Activity Detection (VAD), Diarization, Speaker Identification and ASR outputs on a large scale. Both annotators and data scientists have reported improvement in the speed and accuracy of work. Gecko is publicly available for the benefit of the community at https://github.com/gong-io/gecko.
更多
查看译文
关键词
Annotation, Labeling, Speaker segmentation, Diarization, Speech recognition, VAD
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要