ReadFlow shows previews for discovered articles. Full reading happens at the original publisher unless you save an article to your personal library.
bair.berkeley.edu

RL without TD learning

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer. Unlike traditional methods, this algorithm is not based on temporal difference (TD)...