Andrew G. Barto
University of Pennsylvania
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Andrew G. Barto.
Archive | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
This chapter focuses on learning to act in a near-optimal manner through reinforcement learning for problems that either have no model or whose model is very complex. The emphasis here is on continuous action space (CAS) methods. Monte-Carlo approaches are employed to estimate function values in an iterative, incremental procedure. Derivative-free line search methods are used to find a near-optimal action in the continuous action space for a discrete subset of the state space. This near-optimal policy is then extended to the entire continuous state space using a fuzzy additive model. To compensate for approximation errors, a modified procedure for perturbing the generated control policy is developed. Convergence results, under moderate assumptions and stopping criteria, are established. References to sucessful applications of the controller are provided.
Handbook of Learning and Approximate Dynamic Programming | 2012
Michael T. Rosenstein; Andrew G. Barto; Jennie Si; Andy Barto; Warren Buckler Powell; Don Wunsch
Handbook of Learning and Approximate Dynamic Programming | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
Handbook of Learning and Approximate Dynamic Programming | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
Handbook of Learning and Approximate Dynamic Programming | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
Archive | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
Handbook of Learning and Approximate Dynamic Programming | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
Handbook of Learning and Approximate Dynamic Programming | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
Handbook of Learning and Approximate Dynamic Programming | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch
Archive | 2004
Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch