Abstract: In this paper, we establish error bounds of adaptive dynamic programming algorithms for solving undiscounted infinite-horizon optimal control problems of ...
Reinforcement Learning: An Introduction, 2nd Edition, written by Richard S. Sutton and Andrew G. Barto, is something of a bible of reinforcement learning. It is required reading for students and ...
This research was accepted by the 2023 International Conference on Learning Representations (ICLR), which is dedicated to the advancement of the branch of artificial intelligence ...
Abstract: This paper proposes a dynamic programming algorithm for the management of an aircraft power supply system, which takes reliability as the optimization goal, and the power supply priority, the basic ...
The Smith-Waterman algorithm is a dynamic programming algorithm that finds the optimal local alignment of two sequences. This repository provides a basic implementation of the Smith-Waterman algorithm that has ...
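For orientation, here is a minimal sketch of the Smith-Waterman recurrence in Python. It is not the repository's implementation. The function name `smith_waterman`, the linear gap penalty, and the scoring values (`match=2`, `mismatch=-1`, `gap=-1`) are all assumptions chosen for illustration. Each cell of the score matrix is the maximum of zero and the three possible extensions (diagonal, up, left), and the traceback starts at the highest-scoring cell and stops at the first zero.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-1):
    """Return (best_score, aligned_a, aligned_b) for one best local alignment.

    Scoring values and the linear gap penalty are illustrative assumptions.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # H[i][j] = best score of a local alignment ending at a[i-1], b[j-1]
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(
                0,                    # start a new alignment here
                H[i - 1][j - 1] + sub,  # match or mismatch
                H[i - 1][j] + gap,      # gap in b
                H[i][j - 1] + gap,      # gap in a
            )
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the best cell until a zero-score cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and H[i][j] > 0:
        sub = match if a[i - 1] == b[j - 1] else mismatch
        if H[i][j] == H[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(smith_waterman("XXACGTXX", "YYACGTYY"))
```

The zero floor in the recurrence is what makes the alignment local: a poorly matching prefix is discarded rather than carried forward as a penalty. Real tools typically use substitution matrices and affine gap penalties, which this sketch omits.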
Considering that the probability distribution of random variables in stochastic programming is usually only partially known because of imperfect sample data in many real applications, this paper ...
China is once more tightening its grip on internet content, and this time algorithms are in the spotlight. The Cyberspace Administration of China has published upcoming rules that dictate how internet ...
A health care algorithm makes black patients substantially less likely than their white counterparts to receive important medical treatment. The major flaw affects millions of patients, and was just ...