Basic information
Views: 80

Biography
Research highlights:
Multisensory supervision: Today's computer vision methods need human supervision, such as object labels, to learn about the world. Humans, on the other hand, learn a great deal from associations between senses: vision trains hearing, touch trains vision, etc. Inspired by this idea, I've been developing models that learn about the world by finding structure in multimodal sensation—especially "self-supervised" computer vision methods that learn from sound.
Touch sensing: When humans interact with objects, they use many modalities: touch sensing helps us position our hands and select which forces to exert, while vision is useful for choosing where to grip. Recently, I've been developing multimodal methods for robotic grasping.
Spotting fake images: Computer vision researchers face a dilemma: as our methods get better, so do the tools for malicious image manipulation. To address this growing issue, I've been developing methods for detecting fake images.
3D reconstruction: To interact with the world, we need to know not just what is in a scene, but what is where. I've developed methods for reconstructing scenes in 3D from multiple visual cues.
Research interests
Papers (38 in total)
CoRR (2023): 2426-2436
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6945-6956 (2023)
CoRR (2023): 6430-6440
arXiv (2023)
ACM Trans. Graph., no. 4 (2023): 46:1-46:10
arXiv (2023)
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), no. 1 (2022): 3355-3366