Object-and-Action Aware Model for Visual Language Navigation
european conference on computer vision, pp. 303-317, 2020.
Vision-and-Language Navigation (VLN) is unique in that it requires turning relatively general natural-language instructions into robot agent actions, on the basis of the visible environment. This requires to extract value from two very different types of natural-language information. The first is object description (e.g., 'table', 'door...More
PPT (Upload PPT)