Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ruizhuo Song is active.

Publication


Featured researches published by Ruizhuo Song.


IEEE Transactions on Neural Networks | 2014

Adaptive Dynamic Programming for a Class of Complex-Valued Nonlinear Systems

Ruizhuo Song; Wendong Xiao; Huaguang Zhang; Chang-Yin Sun

In this brief, an optimal control scheme based on adaptive dynamic programming (ADP) is developed to solve infinite-horizon optimal control problems of continuous-time complex-valued nonlinear systems. A new performance index function is established on the basis of complex-valued state and control. Using system transformations, the complex-valued system is transformed into a real-valued one, which overcomes Cauchy-Riemann conditions effectively. With the transformed system and the performance index function, a new ADP method is developed to obtain the optimal control law by using neural networks. A compensation controller is developed to compensate the approximation errors of neural networks. Stability properties of the nonlinear system are analyzed and convergence properties of the weights for neural networks are presented. Finally, simulation results demonstrate the performance of the developed optimal control scheme for complex-valued nonlinear systems.


IEEE Transactions on Neural Networks | 2016

Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP

Qinglai Wei; Ruizhuo Song; Pengfei Yan

This paper is concerned with a new data-driven zero-sum neuro-optimal control problem for continuous-time unknown nonlinear systems with disturbance. According to the input-output data of the nonlinear system, an effective recurrent neural network is introduced to reconstruct the dynamics of the nonlinear system. Considering the system disturbance as a control input, a two-player zero-sum optimal control problem is established. Adaptive dynamic programming (ADP) is developed to obtain the optimal control under the worst case of the disturbance. Three single-layer neural networks, including one critic and two action networks, are employed to approximate the performance index function, the optimal control law, and the disturbance, respectively, for facilitating the implementation of the ADP method. Convergence properties of the ADP method are developed to show that the system state will converge to a finite neighborhood of the equilibrium. The weight matrices of the critic and the two action networks are also convergent to finite neighborhoods of their optimal ones. Finally, the simulation results will show the effectiveness of the developed data-driven ADP methods.


IEEE Transactions on Neural Networks | 2015

Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data

Ruizhuo Song; Frank L. Lewis; Qinglai Wei; Huaguang Zhang; Zhong Ping Jiang; Daniel S. Levine

In industrial process control, there may be multiple performance objectives, depending on salient features of the input-output data. Aiming at this situation, this paper proposes multiple actor-critic structures to obtain the optimal control via input-output data for unknown nonlinear systems. The shunting inhibitory artificial neural network (SIANN) is used to classify the input-output data into one of several categories. Different performance measure functions may be defined for disparate categories. The approximate dynamic programming algorithm, which contains model module, critic network, and action network, is used to establish the optimal control in each category. A recurrent neural network (RNN) model is used to reconstruct the unknown system dynamics using input-output data. NNs are used to approximate the critic and action networks, respectively. It is proven that the model error and the closed unknown system are uniformly ultimately bounded. Simulation results demonstrate the performance of the proposed optimal control scheme for the unknown nonlinear system.


IEEE Transactions on Systems, Man, and Cybernetics | 2016

Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems With Disturbances

Ruizhuo Song; Frank L. Lewis; Qinglai Wei; Huaguang Zhang

An optimal control method is developed for unknown continuous-time systems with unknown disturbances in this paper. The integral reinforcement learning (IRL) algorithm is presented to obtain the iterative control. Off-policy learning is used to allow the dynamics to be completely unknown. Neural networks are used to construct critic and action networks. It is shown that if there are unknown disturbances, off-policy IRL may not converge or may be biased. For reducing the influence of unknown disturbances, a disturbances compensation controller is added. It is proven that the weight errors are uniformly ultimately bounded based on Lyapunov techniques. Convergence of the Hamiltonian function is also proven. The simulation study demonstrates the effectiveness of the proposed optimal control method for unknown systems with disturbances.


Neurocomputing | 2013

Multi-objective optimal control for a class of unknown nonlinear systems based on finite-approximation-error ADP algorithm

Ruizhuo Song; Wendong Xiao; Huaguang Zhang

In this paper, an optimal control method for a class of unknown discrete-time nonlinear systems with general multi-objective performance indices is proposed. In the design of the optimal controller, only available input-output data are required instead of known system dynamics, and the data-based identifier is established with stability proof. By the weighted sum technology, the multi-objective optimal control problem is transformed into the single objective optimization. To obtain the solution of the HJB equation, the novel finite-approximation-error adaptive dynamic programming (ADP) algorithm is presented with convergence proof. The detailed theoretic analyses for the relationship of the approximation accuracy and the algorithm convergence are given. It is shown that, as convergence conditions are satisfied, the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the proposed method.


systems man and cybernetics | 2018

Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis

Qinglai Wei; Frank L. Lewis; Derong Liu; Ruizhuo Song; Hanquan Lin

In this paper, convergence properties are established for the newly developed discrete-time local value iteration adaptive dynamic programming (ADP) algorithm. The present local iterative ADP algorithm permits an arbitrary positive semidefinite function to initialize the algorithm. Employing a state-dependent learning rate function, for the first time, the iterative value function and iterative control law can be updated in a subset of the state space instead of the whole state space, which effectively relaxes the computational burden. A new analysis method for the convergence property is developed to prove that the iterative value functions will converge to the optimum under some mild constraints. Monotonicity of the local value iteration ADP algorithm is presented, which shows that under some special conditions of the initial value function and the learning rate function, the iterative value function can monotonically converge to the optimum. Finally, three simulation examples and comparisons are given to illustrate the performance of the developed algorithm.


IEEE Transactions on Systems, Man, and Cybernetics | 2017

Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming

Qinglai Wei; Derong Liu; Qiao Lin; Ruizhuo Song

In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value function and iterative control law can be updated in a subset of the state space, where the computational burden is relaxed compared with the traditional policy iteration algorithm. Convergence properties of the local policy iteration algorithm are presented to show that the iterative value function is monotonically nonincreasing and converges to the optimum under some mild conditions. The admissibility of the iterative control law is proven, which shows that the control system can be stabilized under any of the iterative control laws, even if the iterative control law is updated in a subset of the state space. Finally, two simulation examples are given to illustrate the performance of the developed method.


IEEE Transactions on Neural Networks | 2017

Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games

Ruizhuo Song; Frank L. Lewis; Qinglai Wei

This paper establishes an off-policy integral reinforcement learning (IRL) method to solve nonlinear continuous-time (CT) nonzero-sum (NZS) games with unknown system dynamics. The IRL algorithm is presented to obtain the iterative control and off-policy learning is used to allow the dynamics to be completely unknown. Off-policy IRL is designed to do policy evaluation and policy improvement in the policy iteration algorithm. Critic and action networks are used to obtain the performance index and control for each player. The gradient descent algorithm makes the update of critic and action weights simultaneously. The convergence analysis of the weights is given. The asymptotic stability of the closed-loop system and the existence of Nash equilibrium are proved. The simulation study demonstrates the effectiveness of the developed method for nonlinear CT NZS games with unknown system dynamics.


Science in China Series F: Information Sciences | 2014

A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture

Ruizhuo Song; Wendong Xiao; Chang-Yin Sun

A novel self-learning optimal control method for a class of discrete-time nonlinear systems is proposed based on iteration adaptive dynamic programming (ADP) algorithm. It is proven that the iteration costate functions converge to the optimal one, and a detailed convergence analysis of the iteration ADP algorithm is given. Furthermore, echo state network (ESN) architecture is used as the approximator of the costate function for each iteration. To ensure the reliability of the ESN approximator, the ESN mean square training error is constrained in the satisfactory range. Two simulation examples are given to demonstrate that the proposed control method has a fast response speed due to the special structure and the fast training process.


IEEE/CAA Journal of Automatica Sinica | 2017

Optimal constrained self-learning battery sequential management in microgrid via adaptive dynamic programming

Qinglai Wei; Derong Liu; Yu Liu; Ruizhuo Song

This paper concerns a novel optimal self-learning battery sequential control scheme for smart home energy systems. The main idea is to use the adaptive dynamic programming U+0028 ADP U+0029 technique to obtain the optimal battery sequential control iteratively. First, the battery energy management system model is established, where the power efficiency of the battery is considered. Next, considering the power constraints of the battery, a new non-quadratic form performance index function is established, which guarantees that the value of the iterative control law cannot exceed the maximum charging/discharging power of the battery to extend the service life of the battery. Then, the convergence properties of the iterative ADP algorithm are analyzed, which guarantees that the iterative value function and the iterative control law both reach the optimums. Finally, simulation and comparison results are given to illustrate the performance of the presented method.

Collaboration


Dive into the Ruizhuo Song's collaboration.

Top Co-Authors

Avatar

Qinglai Wei

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Wendong Xiao

University of Science and Technology Beijing

View shared research outputs
Top Co-Authors

Avatar

Derong Liu

Guangdong University of Technology

View shared research outputs
Top Co-Authors

Avatar

Benkai Li

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Frank L. Lewis

University of Texas at Arlington

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Chang-Yin Sun

University of Science and Technology Beijing

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Qiao Lin

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Pengfei Yan

Chinese Academy of Sciences

View shared research outputs
Researchain Logo
Decentralizing Knowledge