Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Yunpeng Pan is active.

Publication


Featured researches published by Yunpeng Pan.


advances in computing and communications | 2015

Data-driven differential dynamic programming using Gaussian processes

Yunpeng Pan; Evangelos A. Theodorou

We present a Bayesian nonparametric trajectory optimization framework for systems with unknown dynamics using Gaussian Processes (GPs), called Gaussian Process Differential Dynamic Programming (GPDDP). Rooted in the Dynamic Programming principle and second-order local approximations of the value function, GPDDP learns time-varying optimal control policies from sampled data. Based on this framework, we propose two algorithms for implementations. We demonstrate the effectiveness and efficiency of the proposed framework using three numerical examples.


ieee symposium on adaptive dynamic programming and reinforcement learning | 2014

Nonparametric infinite horizon Kullback-Leibler stochastic control

Yunpeng Pan; Evangelos A. Theodorou

We present two nonparametric approaches to Kullback-Leibler (KL) control, or linearly-solvable Markov decision problem (LMDP) based on Gaussian processes (GP) and Nyström approximation. Compared to recently developed parametric methods, the proposed data-driven frameworks feature accurate function approximation and efficient on-line operations. Theoretically, we derive the mathematical connection of KL control based on dynamic programming with earlier work in control theory which relies on information theoretic dualities for the infinite time horizon case. Algorithmically, we give explicit optimal control policies in nonparametric forms, and propose on-line update schemes with budgeted computational costs. Numerical results demonstrate the effectiveness and usefulness of the proposed frameworks.


robotics science and systems | 2015

Robust Trajectory Optimization: A Cooperative Stochastic Game Theoretic Approach

Yunpeng Pan; Evangelos A. Theodorou; Kaivalya Bakshi

We present a novel trajectory optimization framework to address the issue of robustness, scalability and efficiency in optimal control and reinforcement learning. Based on prior work in Cooperative Stochastic Differential Game (CSDG) theory, our method performs local trajectory optimization using cooperative controllers. The resulting framework is called Cooperative Game-Differential Dynamic Programming (CG-DDP). Compared to related methods, CG-DDP exhibits improved performance in terms of robustness and efficiency. The proposed framework is also applied in a data-driven fashion for belief space trajectory optimization under learned dynamics. We present experiments showing that CG-DDP can be used for optimal control and reinforcement learning under external disturbances and internal model uncertainties.


advances in computing and communications | 2017

Belief space stochastic control under unknown dynamics

Yunpeng Pan; Kamil Saigol; Evangelos A. Theodorou

We present a sampling-based stochastic optimal control (SOC) framework for systems with unknown dynamics based on the path integral formulation and probabilistic inference. This work is motivated by three major limitations of related SOC methods: first, full knowledge of the dynamics model is usually required. Second, model uncertainty is neglected. Third, convergence of the iterative scheme is quite slow. In order to cope with these issues, our method performs sampling in belief space using approximate inference in probabilistic models. When performing probability-weighted averaging, each sample is weighted by its predictive uncertainty. In addition, our method leverages covariance matrix adaptation to achieve faster convergence. We demonstrate the effectiveness and efficiency of the proposed method using a simulated cart-pole swing up task.


AIAA Guidance, Navigation, and Control Conference | 2017

Pseudospectral Model Predictive Control under Partially Learned Dynamics

Manan Gandhi; Yunpeng Pan; Evangelos A. Theodorou

Trajectory optimization of a controlled dynamical system is an essential part of autonomy, however many trajectory optimization techniques are limited by the fidelity of the underlying parametric model. In the field of robotics, a lack of model knowledge can be overcome with machine learning techniques, utilizing measurements to build a dynamical model from the data. This paper aims to take the middle ground between these two approaches by introducing a semi-parametric representation of the underlying system dynamics. Our goal is to leverage the considerable information contained in a traditional physics based model and combine it with a data-driven, non-parametric regression technique known as a Gaussian Process. Integrating this semi-parametric model with model predictive pseudospectral control, we demonstrate this technique on both a cart pole and quadrotor simulation with unmodeled damping and parametric error. In order to manage parametric uncertainty, we introduce an algorithm that utilizes Sparse Spectrum Gaussian Processes (SSGP) for online learning after each rollout. We implement this online learning technique on a cart pole and quadrator, then demonstrate the use of online learning and obstacle avoidance for the dubin vehicle dynamics.


international conference of the ieee engineering in medicine and biology society | 2016

Cross-entropy optimization for neuromodulation

Harleen K. Brar; Yunpeng Pan; Babak Mahmoudi; Evangelos A. Theodorou

This study presents a reinforcement learning approach for the optimization of the proportional-integral gains of the feedback controller represented in a computational model of epilepsy. The chaotic oscillator model provides a feedback control systems view of the dynamics of an epileptic brain with an internal feedback controller representative of the natural seizure suppression mechanism within the brain circuitry. Normal and pathological brain activity is simulated in this model by adjusting the feedback gain values of the internal controller. With insufficient gains, the internal controller cannot provide enough feedback to the brain dynamics causing an increase in correlation between different brain sites. This increase in synchronization results in the destabilization of the brain dynamics, which is representative of an epileptic seizure. To provide compensation for an insufficient internal controller an external controller is designed using proportional-integral feedback control strategy. A cross-entropy optimization algorithm is applied to the chaotic oscillator network model to learn the optimal feedback gains for the external controller instead of hand-tuning the gains to provide sufficient control to the pathological brain and prevent seizure generation. The correlation between the dynamics of neural activity within different brain sites is calculated for experimental data to show similar dynamics of epileptic neural activity as simulated by the network of chaotic oscillators.


neural information processing systems | 2014

Probabilistic Differential Dynamic Programming

Yunpeng Pan; Evangelos A. Theodorou


international conference on artificial intelligence and statistics | 2017

Learning from Conditional Distributions via Dual Embeddings

Bo Dai; Niao He; Yunpeng Pan; Byron Boots; Le Song


arXiv: Robotics | 2017

Agile Off-Road Autonomous Driving Using End-to-End Deep Imitation Learning.

Yunpeng Pan; Ching-An Cheng; Kamil Saigol; Keuntaek Lee; Xinyan Yan; Evangelos A. Theodorou; Byron Boots


neural information processing systems | 2015

Sample efficient path integral control under uncertainty

Yunpeng Pan; Evangelos A. Theodorou; Michail Kontitsis

Collaboration


Dive into the Yunpeng Pan's collaboration.

Top Co-Authors

Avatar

Evangelos A. Theodorou

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Byron Boots

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Kamil Saigol

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Xinyan Yan

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Manan Gandhi

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Bo Dai

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Ching-An Cheng

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

George I. Boutselis

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Le Song

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge