Is this you? Create Your Porfile

Shubhendu Bhasin

Indian Institute of Technology Delhi

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Shubhendu Bhasin is active.

Explore More

Publication

Featured researches published by Shubhendu Bhasin.

Automatica | 2013

A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems

Shubhendu Bhasin; Rushikesh Kamalapurkar; Marcus Johnson; Kyriakos G. Vamvoudakis; Frank L. Lewis; Warren E. Dixon

An online adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem for continuous-time uncertain nonlinear systems. A novel actor-critic-identifier (ACI) is proposed to approximate the Hamilton-Jacobi-Bellman equation using three neural network (NN) structures-actor and critic NNs approximate the optimal control and the optimal value function, respectively, and a robust dynamic neural network identifier asymptotically approximates the uncertain system dynamics. An advantage of using the ACI architecture is that learning by the actor, critic, and identifier is continuous and simultaneous, without requiring knowledge of system drift dynamics. Convergence of the algorithm is analyzed using Lyapunov-based adaptive control methods. A persistence of excitation condition is required to guarantee exponential convergence to a bounded region in the neighborhood of the optimal control and uniformly ultimately bounded (UUB) stability of the closed-loop system. Simulation results demonstrate the performance of the actor-critic-identifier method for approximate optimal control.

Automatica | 2015

Approximate optimal trajectory tracking for continuous-time nonlinear systems

Rushikesh Kamalapurkar; Huyen T. Dinh; Shubhendu Bhasin; Warren E. Dixon

Adaptive dynamic programming has been investigated and used as a method to approximately solve optimal regulation problems. However, the extension of this technique to optimal tracking problems for continuous-time nonlinear systems has remained a non-trivial open problem. The control development in this paper guarantees ultimately bounded tracking of a desired trajectory, while also ensuring that the enacted controller approximates the optimal controller.

IEEE Transactions on Automatic Control | 2012

RISE-Based Adaptive Control of a Control Affine Uncertain Nonlinear System With Unknown State Delays

Nitin Sharma; Shubhendu Bhasin; Qiang Wang; Warren E. Dixon

A continuous robust adaptive control method is designed for a class of uncertain nonlinear systems with unknown constant time-delays in the states. Specifically, a robust adaptive control method and a delay-free gradient-based desired compensation adaptation law (DCAL) are utilized to compensate for unknown time-delays, linearly parameterizable uncertainties, and additive bounded disturbances for a general nonlinear system. Despite these disturbances, a Lyapunov Krasovskii-based analysis is used to conclude that the system output asymptotically tracks a desired time varying bounded trajectory.

chinese control conference | 2010

Composite Adaptation for Neural Network-Based Controllers

Parag M. Patre; Shubhendu Bhasin; Z. D. Wilcox; Warren E. Dixon

This paper presents a novel approach to design a composite adaptation law for neural networks that uses both the system tracking errors and a prediction error containing parametric information by devising an innovative swapping procedure that uses the recently developed Robust Integral of the Sign of the Error (RISE) feedback method. Semi-global asymptotic tracking is proven for an Euler-Lagrange system.

conference on decision and control | 2011

Nonlinear two-player zero-sum game approximate solution using a Policy Iteration algorithm

Marcus Johnson; Shubhendu Bhasin; Warren E. Dixon

An approximate online solution is developed for a two-player zero-sum game subject to continuous-time nonlinear uncertain dynamics and an infinite horizon quadratic cost. A novel actor-critic-identifier (ACI) structure is used to implement the Policy Iteration (PI) algorithm, wherein a robust dynamic neural network (DNN) is used to asymptotically identify the uncertain system, and a critic NN is used to approximate the value function. The weight update laws for the critic NN are generated using a gradient-descent method based on a modified temporal difference error, which is independent of the system dynamics. This method finds approximations of the optimal value function, and the saddle point feedback control policies. These policies are computed using the critic NN and the identifier DNN and guarantee uniformly ultimately bounded (UUB) stability of the closed-loop system. The actor, critic and identifier structures are implemented in real-time, continuously and simultaneously.

american control conference | 2011

Nonlinear control of an autonomous underwater vehicle: A RISE-based approach

Nicholas R. Fischer; Shubhendu Bhasin; Warren E. Dixon

Autonomous and remotely operated marine vehicles such as ships and submarines are becoming a key component in several aspects of maritime industry and defense. This paper explores the development of a nonlinear controller for a fully actuated autonomous underwater vehicle (AUV) using a robust integral of the sign of the error (RISE) feedback term with a neural network (NN) based feedforward term to achieve semi-global asymptotic tracking results in the presence of complete model uncertainty and unknown disturbances. A simulation is provided to demonstrate the proposed controller on an experimentally validated AUV model.

IEEE Transactions on Automatic Control | 2013

Robust Identification-Based State Derivative Estimation for Nonlinear Systems

Shubhendu Bhasin; Rushikesh Kamalapurkar; Huyen T. Dinh; Warren E. Dixon

A robust identification-based state derivative estimation method for uncertain nonlinear systems is developed. The identifier architecture consists of a recurrent multilayer dynamic neural network which approximates the system dynamics online, and a continuous robust feedback Robust Integral of the Sign of the Error (RISE) term which accounts for modeling errors and exogenous disturbances. Numerical simulations provide comparisons with existing robust derivative estimation methods including: a high gain observer, a 2-sliding mode robust exact differentiator, and numerical differentiation methods, such as backward difference and central difference.

conference on decision and control | 2010

A model-free robust policy iteration algorithm for optimal control of nonlinear systems

Shubhendu Bhasin; Marcus Johnson; Warren E. Dixon

An online model-free solution is developed for the infinite-horizon optimal control problem for continuous-time nonlinear systems. A novel actor-critic-identifier (ACI) structure is used to implement the Policy Iteration algorithm, wherein two neural network structures are used - a robust dynamic neural network (DNN) to asymptotically identify the uncertain system with additive disturbances, and a critic NN to approximate the value function. The weight update laws for the critic NN are generated using a gradient-descent method based on a modified temporal difference error, which is independent of the system dynamics. The optimal control law (or the actor) is computed using the critic NN and the identifier DNN. Uniformly ultimately bounded (UUB) stability of the closed-loop system is guaranteed. The actor, critic and identifier structures are implemented in real-time, continuously and simultaneously.

advances in computing and communications | 2010

Predictor-based control for an uncertain Euler-Lagrange system with input delay

Nitin Sharma; Shubhendu Bhasin; Qiang Wang; Warren E. Dixon

Control of nonlinear systems with actuator delay is a challenging problem because of the need to develop some form of prediction of the nonlinear dynamics. The problem becomes more difficult for systems with uncertain dynamics. In this paper, tracking controllers are developed for an Euler-Lagrange system with time-delayed actuation, parametric uncertainty, and additive bounded disturbances. One controller is developed under the assumption that the inertia is known, and a second controller is developed when the inertia is unknown. For each case a predictor-like method is developed to address the time delay in the control input. Lyapunov-Krasovskii functionals are used within a Lyapunov-based stability analysis to prove semi-global uniformly ultimately bounded tracking.

american control conference | 2008

Adaptive neural network satellite attitude control in the presence of inertia and CMG actuator uncertainties

William MacKunis; Keith Dupree; Shubhendu Bhasin; Warren E. Dixon

A neural network-based adaptive attitude tracking controller is developed in this paper, which achieves attitude tracking in the presence of parametric uncertainty, nonlinear actuator disturbances, and unmodeled external disturbance torques, which do not satisfy the linear-in-the-parameters assumption (i.e., non-LP). The satellite control torques are produced by means of a cluster of control moment gyroscopes (CMGs), which have uncertain dynamic and static friction in the gimbals in addition to unknown electromechanical disturbances. Some challenges encountered in the control design are that the control input is premultiplied by a non-square, time-varying, nonlinear, uncertain matrix and is embedded in a discontinuous nonlinearity. Controller performance is proven via Lyapunov stability analysis.

Explore More