Adaptive Adam-Based Optimizers Using Second-Order Weight Decoupling and Gradient-Aware Weight Decay for Vision Transformer
Machine Vision and Applications(2025)
Key words
Adam-based optimizer,Weight decoupling,Transformers,Weight decay,Adaptive optimizers
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined