Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zachary N. Sunberg is active.

Publication


Featured researches published by Zachary N. Sunberg.


IEEE Transactions on Systems, Man, and Cybernetics | 2016

Information Space Receding Horizon Control for Multisensor Tasking Problems

Zachary N. Sunberg; Suman Chakravorty; Richard Scott Erwin

In this paper, we present a receding horizon solution to the problem of optimal scheduling for multiple sensors monitoring a group of dynamical targets. The term target is used here in the classic sense of being the object that is being sensed or observed by the sensors. This problem is motivated by the space situational awareness (SSA) problem. The multisensor optimal scheduling problem can be posed as a multiagent Markov decision process on the information space which has a dynamic programming (DP) solution. We present a simulation-based stochastic optimization technique that exploits the structure inherent in the problem to obtain variance reduction along with a distributed solution. This stochastic optimization technique is combined with a receding horizon approach which uses online solution of the control problems to obviate the need to solve the computationally intractable multiagent information space DP problem and hence, makes the technique computationally tractable. The technique is tested on a moderate scale SSA example which is nonetheless computationally intractable for existing solution techniques.


international conference on robotics and automation | 2016

Optimized and trusted collision avoidance for unmanned aerial vehicles using approximate dynamic programming

Zachary N. Sunberg; Mykel J. Kochenderfer; Marco Pavone

Safely integrating unmanned aerial vehicles into civil airspace is contingent upon development of a trustworthy collision avoidance system. This paper proposes an approach whereby a parameterized resolution logic that is considered trusted for a given range of its parameters is adaptively tuned online. Specifically, to address the potential conservatism of the resolution logic with static parameters, we present a dynamic programming approach for adapting the parameters dynamically based on the encounter state. We compute the adaptation policy offline using a simulation-based approximate dynamic programming method that accommodates the high dimensionality of the problem. Numerical experiments show that this approach improves safety and operational performance compared to the baseline resolution logic, while retaining trustworthiness.


advances in computing and communications | 2017

The value of inferring the internal state of traffic participants for autonomous freeway driving

Zachary N. Sunberg; Christopher Ho; Mykel J. Kochenderfer

Safe interaction with human drivers is one of the primary challenges for autonomous vehicles. In order to plan driving maneuvers effectively, the vehicles control system must infer and predict how humans will behave based on their latent internal state (e.g., intentions and aggressiveness). This research uses a simple model for human behavior with unknown parameters that make up the internal states of the traffic participants and presents a method for quantifying the value of estimating these states and planning with their uncertainty explicitly modeled. An upper performance bound is established by an omniscient Monte Carlo Tree Search (MCTS) planner that has perfect knowledge of the internal states. A baseline lower bound is established by planning with MCTS assuming that all drivers have the same internal state. MCTS variants are then used to solve a partially observable Markov decision process (POMDP) that models the internal state uncertainty to determine whether inferring the internal state offers an advantage over the baseline. Applying this method to a freeway lane changing scenario reveals that there is a significant performance gap between the upper bound and baseline. POMDP planning techniques come close to closing this gap, especially when important hidden model parameters are correlated with measurable parameters.


advances in computing and communications | 2014

Information space sensor tasking for Space Situational Awareness

Zachary N. Sunberg; Suman Chakravorty; Richard Scott Erwin

In this paper, we apply a receding horizon control approach to the sensor tasking aspect of a simplified version of the Space Situational Awareness (SSA) problem: “Given a small number of sensors and a large number of satellites, how should the sensors be used to maximize the information gained about the states of the satellites” Finding the globally optimal solution to this partially observed Markov decision process is computationally intractable. However, by using a stochastic gradient ascent algorithm proposed in previous work to improve an open-loop control policy over a shortened horizon, large performance improvements can be made over a baseline myopic tasking policy in a computationally tractable manner. The structure of this approach also allows for a distributed implementation in which each sensor acts as an agent that is semi-independent from the others.


Journal of Machine Learning Research | 2017

POMDPs.jl: a framework for sequential decision making under uncertainty

Maxim Egorov; Zachary N. Sunberg; Edward Balaban; Tim Allan Wheeler; Jayesh K. Gupta; Mykel J. Kochenderfer


intelligent robots and systems | 2017

Simultaneous active parameter estimation and control using sampling-based Bayesian reinforcement learning

Patrick Slade; Preston Culbertson; Zachary N. Sunberg; Mykel J. Kochenderfer


arXiv: Robotics | 2016

Optimized and Trusted Collision Avoidance for Unmanned Aerial Vehicles using Approximate Dynamic Programming (Technical Report)

Zachary N. Sunberg; Mykel J. Kochenderfer; Marco Pavone


international conference on automated planning and scheduling | 2018

Online Algorithms for POMDPs with Continuous State, Action, and Observation Spaces.

Zachary N. Sunberg; Mykel J. Kochenderfer


ieee intelligent vehicles symposium | 2018

Exploiting Hierarchy for Scalable Decision Making in Autonomous Driving

Ekhlas Sonu; Zachary N. Sunberg; Mykel J. Kochenderfer


arXiv: Systems and Control | 2018

Estimation and Control Using Sampling-Based Bayesian Reinforcement Learning.

Patrick Slade; Zachary N. Sunberg; Mykel J. Kochenderfer

Collaboration


Dive into the Zachary N. Sunberg's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Richard Scott Erwin

Air Force Research Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge