The VoxWorld Platform for Multimodal Embodied Agents.

Nikhil Krishnaswamy,William Pickard,Brittany Cates,Nathaniel Blanchard,James Pustejovsky

International Conference on Language Resources and Evaluation (LREC)（2022）

引用 0|浏览22

暂无评分

摘要

We present a five-year retrospective on the development of the VoxWorld platform, first introduced as a multimodal platform for modeling motion language, that has evolved into a platform for rapidly building and deploying embodied agents with contextual and situational awareness, capable of interacting with humans in multiple modalities, and exploring their environments. In particular, we discuss the evolution from the theoretical underpinnings of the VoxML modeling language to a platform that accommodates both neural and symbolic inputs to build agents capable of multimodal interaction and hybrid reasoning. We focus on three distinct agent implementations and the functionality needed to accommodate all of them: Diana, a virtual collaborative agent; Kirby, a mobile robot; and BabyBAW, an agent who self-guides its own exploration of the world.

查看译文

关键词

multimodality, multimodal interaction, situated grounding, embodied agent, modeling platform, simulation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要