SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

CVPR 2024

Keywords
Language Model, Large Language Models, Vision Language, Training Dataset, Input Image, Bounding Box, Training Stage, Training Loss, Training Step, Training Iterations, Reward Function, Target Network, Alignment Quality, Training Framework, Human Preferences, Deep Q-network, Text Annotation, Visual Encoding, Multiple Benchmarks, Proximal Policy Optimization, Bridge Components, Reward Model