Publication


Featured research published by Andrew G. Barto.


Archive | 2004

Near-Optimal Control Through Reinforcement Learning and Hybridization

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch

This chapter focuses on learning to act near-optimally through reinforcement learning in problems that either have no model or have a very complex one. The emphasis is on continuous action space (CAS) methods. Monte Carlo approaches estimate function values in an iterative, incremental procedure. Derivative-free line search methods then find a near-optimal action in the continuous action space for each state in a discrete subset of the state space. This near-optimal policy is extended to the entire continuous state space using a fuzzy additive model. To compensate for approximation errors, a modified procedure for perturbing the generated control policy is developed. Convergence results are established under moderate assumptions and stopping criteria. References to successful applications of the controller are provided.
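The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the chapter's algorithm: the one-step cost `(s + a + w)^2`, the golden-section search, the triangular membership functions, and every function name below are assumptions chosen only to make the three stages concrete (Monte Carlo value estimation, derivative-free line search over a continuous action, and fuzzy blending of the grid actions across the continuous state space).

```python
import math
import random

def mc_cost(state, action, noise):
    """Monte Carlo estimate of the expected one-step cost (s + a + w)^2.

    The toy cost and the shared noise samples are illustrative assumptions.
    """
    return sum((state + action + w) ** 2 for w in noise) / len(noise)

def golden_section_min(f, lo, hi, tol=1e-4):
    """Derivative-free line search: minimize a unimodal f on [lo, hi]."""
    inv_phi = (math.sqrt(5) - 1) / 2
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:
            # Minimum lies in [lo, d]; reuse c as the new upper probe.
            hi, d, fd = d, c, fc
            c = hi - inv_phi * (hi - lo)
            fc = f(c)
        else:
            # Minimum lies in [c, hi]; reuse d as the new lower probe.
            lo, c, fc = c, d, fd
            d = lo + inv_phi * (hi - lo)
            fd = f(d)
    return (lo + hi) / 2

def fuzzy_policy(state, grid, actions):
    """Blend grid actions using triangular membership weights.

    A simplified stand-in for a fuzzy additive model on a uniform grid.
    """
    state = min(max(state, grid[0]), grid[-1])  # clamp to the grid range
    width = grid[1] - grid[0]
    weights = [max(0.0, 1.0 - abs(state - g) / width) for g in grid]
    return sum(w * a for w, a in zip(weights, actions)) / sum(weights)

# Fixed noise samples (common random numbers) keep the estimate repeatable.
rng = random.Random(0)
noise = [rng.gauss(0.0, 0.1) for _ in range(500)]

# Discrete subset of the state space and the searched action for each state.
grid = [-1.0, -0.5, 0.0, 0.5, 1.0]
actions = [
    golden_section_min(lambda a, s=s: mc_cost(s, a, noise), -2.0, 2.0)
    for s in grid
]
```

In this toy problem the best action for state `s` is close to `-s`, so `fuzzy_policy(0.25, grid, actions)` should return a value near `-0.25`. Reusing the same noise samples for every candidate action is a design choice that keeps the estimated cost a smooth function of the action, which the line search relies on.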


Handbook of Learning and Approximate Dynamic Programming | 2012

Supervised Actor‐Critic Reinforcement Learning

Michael T. Rosenstein; Andrew G. Barto; Jennie Si; Warren Buckler Powell; Don Wunsch


Handbook of Learning and Approximate Dynamic Programming | 2004

Guidance in the Use of Adaptive Critics for Control

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch


Handbook of Learning and Approximate Dynamic Programming | 2004

ADP: Goals, Opportunities and Principles

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch


Handbook of Learning and Approximate Dynamic Programming | 2004

Hierarchical Decision Making

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch


Archive | 2004

Handbook of Learning and Approximate Dynamic Programming (IEEE Press Series on Computational Intelligence)

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch


Handbook of Learning and Approximate Dynamic Programming | 2004

Robust Reinforcement Learning for Heating, Ventilation, and Air Conditioning Control of Buildings

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch


Handbook of Learning and Approximate Dynamic Programming | 2004

Learning and Optimization: From a System Theoretic Perspective

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch


Handbook of Learning and Approximate Dynamic Programming | 2004

Toward Dynamic Stochastic Optimal Power Flow

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch


Archive | 2004

Supervised Actor-Critic Reinforcement Learning

Jennie Si; Andrew G. Barto; Warren Buckler Powell; Don Wunsch

Collaboration


Dive into Andrew G. Barto's collaborations.

Top Co-Authors

Don Wunsch

Massachusetts Institute of Technology

Jennie Si

University of Massachusetts Amherst

Warren Buckler Powell

University of Massachusetts Amherst

Michael T. Rosenstein

University of Massachusetts Amherst
