谷歌浏览器插件
订阅小程序
在清言上使用

MLLM-Tool: A Multimodal Large Language Model for Tool Agent Learning

Chenyu Wang,Weixin Luo,Qianyu Chen, Haonan Mai, Jindi Guo,Sixun Dong, Xiaohua, Xuan,Zhengxin Li,Lin Ma,Shenghua Gao

IEEE/CVF Winter Conference on Applications of Computer Vision(2025)

引用 2|浏览45
关键词
Large Language Models,Functional Identification,Real Purpose,External Tools,Multimodal Input,Training Set,Image Quality,Computational Resources,Hallucinations,Selection Tool,Inference Time,Mapping Relationship,Performance In Areas,Test Subset,Unique Treatment,Single Instruction,API Calls,Types Of Ambiguity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要