Developing a Generic Focus Modality for Multimodal Interactive Environments

ICMI '23 Companion: Companion Publication of the 25th International Conference on Multimodal Interaction (2023)

Abstract
In human communication, we need to establish the target of our message and understand whether someone is addressing us. Such mechanisms facilitate communication in environments where several potential interlocutors exist. With advances in speech technologies supporting interaction with computers, establishing a device as an interlocutor has often relied on wake-up words. While this addresses the issue, it is far from the naturalness and efficiency of human-human communication. Research has considered alternatives, such as the visual focus of attention, but the implementations are often scenario-specific and not readily available for generalized use. In this paper, we argue that establishing a machine as an interlocutor, particularly for speech interaction, should consider a wide range of verbal and nonverbal aspects, and we conceptualize and present a first proof-of-concept of its integration as a core feature in a multimodal interactive framework.
Keywords
generic focus modality, environments