Off-Policy Actor-Critic with Shared Experience Replay.
ICLR 2020(2020)
Key words
Reinforcement Learning,Scheduling Policies
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined