Module Roland TD 10 - Search News

Distributed TD(0) With Almost No Communication

Abstract: We provide a new non-asymptotic analysis of distributed temporal difference learning with linear function approximation. Our approach relies on “one-shot averaging,” where N agents run ...
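The snippet above is cut off, so the following is only a minimal sketch of what "one-shot averaging" could look like, assuming each of the N agents runs TD(0) with linear function approximation entirely on its own and the parameter vectors are averaged a single time at the end. The toy two-state Markov reward process, the step size, and all function names (`td0_linear`, `one_shot_average`, `run`) are hypothetical choices made for illustration and do not come from the paper.

```python
import random

def td0_linear(features, transitions, rewards, gamma, alpha, steps, seed):
    """Run TD(0) with linear value approximation V(s) = phi(s) . theta for one agent."""
    rng = random.Random(seed)          # each agent gets its own random stream
    n_states = len(features)
    theta = [0.0] * len(features[0])
    state = rng.randrange(n_states)
    for _ in range(steps):
        # Sample the next state from the current state's transition row.
        next_state = rng.choices(range(n_states), weights=transitions[state])[0]
        phi, phi_next = features[state], features[next_state]
        v = sum(p * t for p, t in zip(phi, theta))
        v_next = sum(p * t for p, t in zip(phi_next, theta))
        # Temporal-difference error and semi-gradient update.
        delta = rewards[state] + gamma * v_next - v
        theta = [t + alpha * delta * p for t, p in zip(theta, phi)]
        state = next_state
    return theta

def one_shot_average(thetas):
    """Average the agents' parameter vectors once (the only communication step)."""
    n = len(thetas)
    return [sum(column) / n for column in zip(*thetas)]

# Hypothetical toy problem: two states, one-hot features, uniform transitions.
FEATURES = [[1.0, 0.0], [0.0, 1.0]]
TRANSITIONS = [[0.5, 0.5], [0.5, 0.5]]
REWARDS = [1.0, 0.0]

def run(num_agents=8, steps=2000, gamma=0.9, alpha=0.05):
    # Agents never exchange information while learning (assumption of this sketch).
    thetas = [
        td0_linear(FEATURES, TRANSITIONS, REWARDS, gamma, alpha, steps, seed)
        for seed in range(num_agents)
    ]
    return one_shot_average(thetas)

if __name__ == "__main__":
    print(run())
```

For this toy problem the exact values solve V = R + gamma P V, which gives V = (5.5, 4.5), so the averaged parameters can be compared against that target. Averaging only at the end keeps communication to a single round, at the cost of agents not sharing information while they learn.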
