Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Leemon C. Baird is active.

Publication


Featured researches published by Leemon C. Baird.


Adaptive Behavior | 1993

A hierarchical network of provably optimal learning control systems: extensions of the Associative Control Process (ACP) network

Leemon C. Baird; A. Harry Klopf

An associative control process (ACP) network is a learning control system that can reproduce a variety of animal learning results from classical and instrumental conditioning experiments (Klopf, Morgan, & Weaver, 1993; see also the article, A Hierarchical Network of Control Systems that Learn). The ACP networks proposed and tested by Klopf, Morgan, and Weaver are not guaranteed, however, to learn optimal policies for maximizing reinforcement. Optimal behavior is guaranteed for a reinforcement learning system such as Q-learning (Watkins, 1989), but simple Q-learning is incapable of reproducing the animal learning results that ACP networks reproduce. We propose two new models that reproduce the animal learning results and are provably optimal. The first model, the modified ACP network, embodies the smallest number of changes necessary to the ACP network to guarantee that optimal policies will be learned while still reproducing the animal learning results. The second model, the single-layer ACP network, embodies the smallest number of changes necessary to Q-learning to guarantee that it reproduces the animal learning results while still learning optimal policies. We also propose a hierarchical network architecture within which several reinforcement learning systems (e.g., Q-learning systems, single-layer ACP networks, or any other learning controller) can be combined in a hierarchy. We implement the hierarchical network architecture by combining four of the single-layer ACP networks to form a controller for a standard inverted pendulum dynamic control problem. The hierarchical controller is shown to learn more reliably and more than an order of magnitude faster than either the single-layer ACP network or the Barto, Sutton, and Anderson (1983) learning controller for the benchmark problem.


Archive | 2001

Apparatus and method for authenticating access to a network resource

Leemon C. Baird; Mance E. Harmon; R. Reed Young; James E. Armstrong


international conference on machine learning | 1996

Residual Q-Learning Applied to Visual Attention

Cesar Bandera; Francisco J. Vico; José Manuel Bravo; Mance E. Harmon; Leemon C. Baird


IEEE Transactions on Reliability | 1996

Multi-Agent Residual Advantage Learning with General Function Approximation.

Mance E. Harmon; Leemon C. Baird


neural information processing systems | 1994

Advantage Updating Applied to a Differential Game

Mance E. Harmon; Leemon C. Baird; A. Harry Klopf


Archive | 2007

Low power wireless communication method

Leemon C. Baird; Mance E. Harmon; John Kelly Hughes


Archive | 2015

DEVICES AND METHODS FOR THREAT-BASED AUTHENTICATION FOR ACCESS TO COMPUTING RESOURCES

Mance E. Harmon; Leemon C. Baird; David Chase; David Waite


simulation of adaptive behavior | 1993

Extensions of the associative control process (ACP) network: hierarchies and provable optimality

Leemon C. Baird; A. Harry Klopf


Archive | 2008

Power over data cable system and method

Leemon C. Baird; John Kelly Hughes


Archive | 1996

Reinforcement Learning: An Alternative Approach to Machine Intelligence

Leemon C. Baird; Mance E. Harmon; A. Harry Klopf

Collaboration


Dive into the Leemon C. Baird's collaboration.

Top Co-Authors

Avatar

Mance E. Harmon

Wright-Patterson Air Force Base

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Cesar Bandera

New Jersey Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge