Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning

arXiv (2021)

Abstract
Overestimation in $Q$-learning is an important problem that has been extensively studied in single-agent reinforcement learning, but has received comparatively little attention in the multi-agent setting. In this work, we empirically demonstrate that QMIX, a popular $Q$-learning algorithm for cooperative multi-agent reinforcement learning (MARL), suffers from a particularly severe overestimation problem which is not mitigated by existing approaches. We rectify this by designing a novel regularization-based update scheme that penalizes large joint action-values deviating from a baseline and demonstrate its effectiveness in stabilizing learning. We additionally propose to employ a softmax operator, which we efficiently approximate in the multi-agent setting, to further reduce the potential overestimation bias. We demonstrate that our Softmax with Regularization (SR) method, when applied to QMIX, accomplishes its goal of avoiding severe overestimation and significantly improves performance in a variety of cooperative multi-agent tasks. To demonstrate the versatility of our method, we apply it to other $Q$-learning based MARL algorithms and achieve similar performance gains. Finally, we show that our method provides a consistent performance improvement on a set of challenging StarCraft II micromanagement tasks.
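To make the two ideas in the abstract concrete, the sketch below (Python/NumPy) shows a Boltzmann-softmax value estimate over an explicitly enumerated set of joint action-values and a TD target with a penalty on values that exceed a baseline. The function names (softmax_value, regularized_td_target), the parameters (beta, lam, baseline), and the exact penalty form are illustrative assumptions; the paper instead approximates the softmax efficiently over the exponentially large joint action space and defines its own regularizer, so this is a sketch of the general idea, not the authors' method.

import numpy as np

def softmax_value(q_values, beta=5.0):
    # Boltzmann softmax operator: a weighted average of Q-values whose
    # weights are softmax probabilities with inverse temperature beta.
    # As beta -> infinity this approaches max(q_values); a finite beta
    # softens the hard max that drives overestimation.
    z = q_values - q_values.max()          # stabilize the exponentials
    w = np.exp(beta * z)
    w /= w.sum()
    return float(np.dot(w, q_values))

def regularized_td_target(reward, next_joint_q, baseline, gamma=0.99,
                          beta=5.0, lam=0.1):
    # TD target combining the softmax value estimate with a penalty on
    # next-state joint action-values that deviate upward from a baseline
    # (e.g. a Monte Carlo return); the penalty form is an assumption.
    soft_v = softmax_value(next_joint_q, beta)
    penalty = lam * max(soft_v - baseline, 0.0)
    return reward + gamma * (soft_v - penalty)

# Toy usage: Q_tot evaluated on a small enumerated set of joint actions.
next_joint_q = np.array([1.2, 0.7, 3.5, 0.9])
target = regularized_td_target(reward=1.0, next_joint_q=next_joint_q,
                               baseline=2.0)
print(target)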
Keywords
reinforcement learning, better value estimation, regularization, multi-agent