Approximation of Convex Envelope Using Reinforcement Learning.

Vivek S. Borkar, Adit Akarsh

CoRR(2023)

引用 0|浏览1
暂无评分
摘要
Oberman gave a stochastic control formulation of the problem of estimating the convex envelope of a non-convex function. Based on this, we develop a reinforcement learning scheme to approximate the convex envelope, using a variant of Q-learning for controlled optimal stopping. It shows very promising results on a standard library of test problems.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要