Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sarangapani Jagannathan is active.

Publication


Featured researches published by Sarangapani Jagannathan.


IEEE Transactions on Neural Networks | 2010

Output Feedback Control of a Quadrotor UAV Using Neural Networks

Travis Dierks; Sarangapani Jagannathan

In this paper, a new nonlinear controller for a quadrotor unmanned aerial vehicle (UAV) is proposed using neural networks (NNs) and output feedback. The assumption on the availability of UAV dynamics is not always practical, especially in an outdoor environment. Therefore, in this work, an NN is introduced to learn the complete dynamics of the UAV online, including uncertain nonlinear terms like aerodynamic friction and blade flapping. Although a quadrotor UAV is underactuated, a novel NN virtual control input scheme is proposed which allows all six degrees of freedom (DOF) of the UAV to be controlled using only four control inputs. Furthermore, an NN observer is introduced to estimate the translational and angular velocities of the UAV, and an output feedback control law is developed in which only the position and the attitude of the UAV are considered measurable. It is shown using Lyapunov theory that the position, orientation, and velocity tracking errors, the virtual control and observer estimation errors, and the NN weight estimation errors for each NN are all semiglobally uniformly ultimately bounded (SGUUB) in the presence of bounded disturbances and NN functional reconstruction errors while simultaneously relaxing the separation principle. The effectiveness of proposed output feedback control scheme is then demonstrated in the presence of unknown nonlinear dynamics and disturbances, and simulation results are included to demonstrate the theoretical conjecture.


IEEE Transactions on Neural Networks | 1996

Multilayer discrete-time neural-net controller with guaranteed performance

Sarangapani Jagannathan; Frank L. Lewis

A family of novel multilayer discrete-time neural-net (NN) controllers is presented for the control of a class of multi-input multi-output (MIMO) dynamical systems. The neural net controller includes modified delta rule weight tuning and exhibits a learning while-functioning-features. The structure of the NN controller is derived using a filtered error/passivity approach. Linearity in the parameters is not required and certainty equivalence is not used. This overcomes several limitations of standard adaptive control. The notion of persistency of excitation (PE) for multilayer NN is defined and explored. New online improved tuning algorithms for discrete-time systems are derived, which are similar to sigma or epsilon-modification for the case of continuous-time systems, that include a modification to the learning rate parameter plus a correction term. These algorithms guarantee tracking as well as bounded NN weights in nonideal situations so that PE is not needed. An extension of these novel weight tuning updates to NN with an arbitrary number of hidden layers is discussed. The notions of discrete-time passive NN, dissipative NN, and robust NN are introduced. The NN makes the closed-loop system passive.


IEEE Transactions on Neural Networks | 2012

Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update

Travis Dierks; Sarangapani Jagannathan

In this paper, the Hamilton-Jacobi-Bellman equation is solved forward-in-time for the optimal control of a class of general affine nonlinear discrete-time systems without using value and policy iterations. The proposed approach, referred to as adaptive dynamic programming, uses two neural networks (NNs), to solve the infinite horizon optimal regulation control of affine nonlinear discrete-time systems in the presence of unknown internal dynamics and a known control coefficient matrix. One NN approximates the cost function and is referred to as the critic NN, while the second NN generates the control input and is referred to as the action NN. The cost function and policy are updated once at the sampling instant and thus the proposed approach can be referred to as time-based ADP. Novel update laws for tuning the unknown weights of the NNs online are derived. Lyapunov techniques are used to show that all signals are uniformly ultimately bounded and that the approximated control signal approaches the optimal control input with small bounded error over time. In the absence of disturbances, an optimal control is demonstrated. Simulation results are included to show the effectiveness of the approach. The end result is the systematic design of an optimal controller with guaranteed convergence that is suitable for hardware implementation.


Automatica | 2012

Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses

Hao Xu; Sarangapani Jagannathan; Frank L. Lewis

In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.


Neural Networks | 2009

2009 Special Issue: Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence

Travis Dierks; Balaje T. Thumati; Sarangapani Jagannathan

The optimal control of linear systems accompanied by quadratic cost functions can be achieved by solving the well-known Riccati equation. However, the optimal control of nonlinear discrete-time systems is a much more challenging task that often requires solving the nonlinear Hamilton-Jacobi-Bellman (HJB) equation. In the recent literature, discrete-time approximate dynamic programming (ADP) techniques have been widely used to determine the optimal or near optimal control policies for affine nonlinear discrete-time systems. However, an inherent assumption of ADP requires the value of the controlled system one step ahead and at least partial knowledge of the system dynamics to be known. In this work, the need of the partial knowledge of the nonlinear system dynamics is relaxed in the development of a novel approach to ADP using a two part process: online system identification and offline optimal control training. First, in the system identification process, a neural network (NN) is tuned online using novel tuning laws to learn the complete plant dynamics so that a local asymptotic stability of the identification error can be shown. Then, using only the learned NN system model, offline ADP is attempted resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics as only the learned NN model is needed. The proof of convergence is demonstrated. Simulation results verify theoretical conjecture.


IEEE Transactions on Automatic Control | 1996

Discrete-time neural net controller for a class of nonlinear dynamical systems

Sarangapani Jagannathan; Frank L. Lewis

A family of two-layer discrete-time neural net (NN) controllers is presented for the control of a class of mnth-order MIMO dynamical system. No initial learning phase is needed so that the control action is immediate; in other words, the neural network (NN) controller exhibits a learning-while-functioning-feature instead of a learning-then-control feature. A two-layer NN is used which is linear in the tunable weights. The structure of the neural net controller is derived using a filtered error approach. It is indicated that delta-rule-based tuning, when employed for closed-loop control, can yield unbounded NN weights if: 1) the net cannot exactly reconstruct a certain required function, or 2) there are bounded unknown disturbances acting on the dynamical system. Certainty equivalence is not used, overcoming a major problem in discrete-time adaptive control. In this paper, new online tuning algorithms for discrete-time systems are derived which are similar to /spl epsiv/-modification for the case of continuous-time systems that include a modification to the learning rate parameter and a correction term to the standard delta rule.


Automatica | 1996

Identification of nonlinear dynamical systems using multilayered neural networks

Sarangapani Jagannathan; Frank L. Lewis

A novel multilayer discrete-time neural net paradigm is presented for the identification of multi-input multi-output (MIMO) nonlinear dynamical systems. The major novelty of this approach is a rigorous proof of identification error convergence that reveals a requirement for a new identifier structure and nonstandard weight tuning algorithms. The NN identifier includes modified delta rule weight tuning and exhibits a learning-while-functioning feature instead of learning-then-functioning, so that the identification is on-line with no explicit off-line learning phase needed. The structure of the neural net (NN) identifier is derived using a passivity aproach. Linearity in the parameters is not required and certainty equivalence is not used. The notion of persistency of excitation (PE) and passivity properties of the multilayer NN are defined and used in the convergence analysis of both the identification error and the weight estimates.


advances in computing and communications | 2010

Optimal control of affine nonlinear continuous-time systems

Travis Dierks; Sarangapani Jagannathan

Solving the Hamilton-Jacobi-Isaacs (HJI) equation, commonly used in H∞ optimal control, is often referred to as a two-player differential game where one player tries to minimize the cost function while the other tries to maximize it. In this paper, the HJI equation is formulated online and forward-in-time using a novel single online approximator (SOLA)-based scheme to achieve optimal regulation and tracking control of affine nonlinear continuous-time systems. The SOLA-based adaptive approach is designed to learn the infinite horizon HJI equation, the corresponding optimal control input, and the worst case disturbance. A novel parameter tuning algorithm is derived which not only achieves the optimal cost function, control input, and the disturbance, but also ensures the system states remain bounded during the online learning. Lyapunov methods are used to show that all signals are uniformly ultimately bounded (UUB) while ensuring the approximated signals approach their optimal values with small bounded error. In the absence of OLA reconstruction errors, asymptotic convergence to the optimal signals is demonstrated, and simulation results illustrate the effectiveness of the approach.


systems man and cybernetics | 2012

Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators

Qinmin Yang; Sarangapani Jagannathan

In this paper, reinforcement learning state- and output-feedback-based adaptive critic controller designs are proposed by using the online approximators (OLAs) for a general multi-input and multioutput affine unknown nonlinear discretetime systems in the presence of bounded disturbances. The proposed controller design has two entities, an action network that is designed to produce optimal signal and a critic network that evaluates the performance of the action network. The critic estimates the cost-to-go function which is tuned online using recursive equations derived from heuristic dynamic programming. Here, neural networks (NNs) are used both for the action and critic whereas any OLAs, such as radial basis functions, splines, fuzzy logic, etc., can be utilized. For the output-feedback counterpart, an additional NN is designated as the observer to estimate the unavailable system states, and thus, separation principle is not required. The NN weight tuning laws for the controller schemes are also derived while ensuring uniform ultimate boundedness of the closed-loop system using Lyapunov theory. Finally, the effectiveness of the two controllers is tested in simulation on a pendulum balancing system and a two-link robotic arm system.


IEEE Transactions on Neural Networks | 2013

Neural Network-Based Optimal Adaptive Output Feedback Control of a Helicopter UAV

David Nodland; Hassan Zargarzadeh; Sarangapani Jagannathan

Helicopter unmanned aerial vehicles (UAVs) are widely used for both military and civilian operations. Because the helicopter UAVs are underactuated nonlinear mechanical systems, high-performance controller design for them presents a challenge. This paper introduces an optimal controller design via an output feedback for trajectory tracking of a helicopter UAV, using a neural network (NN). The output-feedback control system utilizes the backstepping methodology, employing kinematic and dynamic controllers and an NN observer. The online approximator-based dynamic controller learns the infinite-horizon Hamilton-Jacobi-Bellman equation in continuous time and calculates the corresponding optimal control input by minimizing a cost function, forward-in-time, without using the value and policy iterations. Optimal tracking is accomplished by using a single NN utilized for the cost function approximation. The overall closed-loop system stability is demonstrated using Lyapunov analysis. Finally, simulation results are provided to demonstrate the effectiveness of the proposed control design for trajectory tracking.

Collaboration


Dive into the Sarangapani Jagannathan's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Travis Dierks

Missouri University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Maciej J. Zawodniok

Missouri University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Frank L. Lewis

University of Texas at Arlington

View shared research outputs
Top Co-Authors

Avatar

Avimanyu Sahoo

Missouri University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Balaje T. Thumati

Missouri University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Vignesh Narayanan

Missouri University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Shahab Mehraeen

Louisiana State University

View shared research outputs
Top Co-Authors

Avatar

Mariesa L. Crow

Missouri University of Science and Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge