Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Daniela Pucci de Farias is active.

Publication


Featured researches published by Daniela Pucci de Farias.


IEEE Transactions on Automatic Control | 2000

Output feedback control of Markov jump linear systems in continuous-time

Daniela Pucci de Farias; José Claudio Geromel; J.B.R. do Val; Oswaldo Luiz V. Costa

This paper addresses the dynamic output feedback control problem of continuous-time Markovian jump linear systems. The fundamental point in the analysis is an LMI characterization, comprising all dynamical compensators that stabilize the closed-loop system in the mean square sense. The H/sub 2/ and H/sub /spl infin//-norm control problems are studied, and the H/sub 2/ and H/sub /spl infin// filtering problems are solved as a by product.


Mathematics of Operations Research | 2004

On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming

Daniela Pucci de Farias; Benjamin Van Roy

In the linear programming approach to approximate dynamic programming, one tries to solve a certain linear program--the ALP--that has a relatively small numberK of variables but an intractable numberM of constraints. In this paper, we study a scheme that samples and imposes a subset ofm <


american control conference | 2007

Embedding Health Management into Mission Tasking for UAV Teams

Mario Valenti; Brett Bethke; Jonathan P. How; Daniela Pucci de Farias; John Vian

Coordinated multi-vehicle autonomous systems can provide incredible functionality, but off-nominal conditions and degraded system components can render this capability ineffective. This paper presents techniques to improve mission-level functional reliability through better system self-awareness and adaptive mission planning. In particular, we extend the traditional definition of health management, which has historically referred to the process of actively monitoring and managing vehicle sub-systems (e.g., avionics) in the event of component failures, to the context of multiple vehicle operations and autonomous multi-agent teams. In this case, health management information about each mission system component is used to improve the mission systems self-awareness and adapt vehicle, guidance, task and mission plans. This paper presents the theoretical foundations of our approach and recent experimental results on a new UAV testbed.


AIAA Guidance, Navigation and Control Conference and Exhibit | 2007

Mission Health Management for 24/7 Persistent Surveillance Operations

Mario Valenti; D. Dale; Jonathan P. How; Daniela Pucci de Farias; John Vian; Boeing Phantom

This paper presents the development and implementation of techniques used to manage autonomous unmanned aerial vehicles (UAVs) performing 24/7 persistent surveillance operations. Using an indoor flight testbed, flight test results are provided to demonstrate the complex issues encountered by operators and mission managers when executing an extended persistent surveillance operation in realtime. This paper presents mission health monitors aimed at identifying and improving mission system performance to avoid down time, increase mission system eciency and reduce operator loading. This paper discusses the infrastructure needed to execute an autonomous persistent surveillance operation and presents flight test results from one of our recent automated UAV recharging experiments. Using the RAVEN at MIT, we present flight test results from a 24 hr, fully-autonomous air vehicle flight-recharge test and an autonomous, multi-vehicle extended mission test using small, electric-powered air vehicles.


Journal of Optimization Theory and Applications | 2000

On the existence of fixed points for approximate value iteration and temporal-difference learning

Daniela Pucci de Farias; B. Van Roy

Approximate value iteration is a simple algorithm that combats the curse of dimensionality in dynamic programs by approximating iterates of the classical value iteration algorithm in a spirit reminiscent of statistical regression. Each iteration of this algorithm can be viewed as an application of a modified dynamic programming operator to the current iterate. The hope is that the iterates converge to a fixed point of this operator, which will then serve as a useful approximation of the optimal value function. In this paper, we show that, in general, the modified dynamic programming operator need not possess a fixed point; therefore, approximate value iteration should not be expected to converge. We then propose a variant of approximate value iteration for which the associated operator is guaranteed to possess at least one fixed point. This variant is motivated by studies of temporal-difference (TD) learning, and existence of fixed points implies here existence of stationary points for the ordinary differential equation approximated by a version of TD that incorporates exploration.


Mathematics of Operations Research | 2006

A Cost-Shaping Linear Program for Average-Cost Approximate Dynamic Programming with Performance Guarantees

Daniela Pucci de Farias; Benjamin Van Roy

We introduce a new algorithm based on linear programming for optimization of average-cost Markov decision processes (MDPs). The algorithm approximates the differential cost function of a perturbed MDP via a linear combination of basis functions. We establish a bound on the performance of the resulting policy that scales gracefully with the number of states without imposing the strong Lyapunov condition required by its counterpart in de Farias and Van Roy [de Farias, D. P., B. Van Roy. 2003. The linear programming approach to approximate dynamic programming. Oper. Res.51(6) 850--865]. We investigate implications of this result in the context of a queueing control problem.


Journal of the ACM | 2006

Combining expert advice in reactive environments

Daniela Pucci de Farias; Nimrod Megiddo

“Experts algorithms” constitute a methodology for choosing actions repeatedly, when the rewards depend both on the choice of action and on the unknown current state of the environment. An experts algorithm has access to a set of strategies (“experts”), each of which may recommend which action to choose. The algorithm learns how to combine the recommendations of individual experts so that, in the long run, for any fixed sequence of states of the environment, it does as well as the best expert would have done relative to the same sequence. This methodology may not be suitable for situations where the evolution of states of the environment depends on past chosen actions, as is usually the case, for example, in a repeated non-zero-sum game.A general exploration-exploitation experts method is presented along with a proper definition of value. The definition is shown to be adequate in that it both captures the impact of an experts actions on the environment and is learnable. The new experts method is quite different from previously proposed experts algorithms. It represents a shift from the paradigms of regret minimization and myopic optimization to consideration of the long-term effect of a players actions on the environment. The importance of this shift is demonstrated by the fact that this algorithm is capable of inducing cooperation in the repeated Prisoners Dilemma game, whereas previous experts algorithms converge to the suboptimal non-cooperative play. The method is shown to asymptotically perform as well as the best available expert. Several variants are analyzed from the viewpoint of the exploration-exploitation tradeoff, including explore-then-exploit, polynomially vanishing exploration, constant-frequency exploration, and constant-size exploration phases. Complexity and performance bounds are proven.


Siam Journal on Optimization | 2008

Decentralized Resource Allocation in Dynamic Networks of Agents

Hariharan Lakshmanan; Daniela Pucci de Farias

We consider the problem of n agents that share m common resources. The objective is to derive an optimal allocation that maximizes a global objective expressed as a separable concave objective function. We propose a decentralized, asynchronous gradient-descent method that is suitable for implementation in the case where the communication between agents is described in terms of a dynamic network. This communication model accommodates situations such as mobile agents and communication failures. The method is shown to converge provided that the objective function has Lipschitz-continuous gradients. We further consider a randomized version of the same algorithm for the case where the objective function is nondifferentiable but has bounded subgradients. We show that both algorithms converge to near-optimal solutions and derive convergence rates in terms of the magnitude of the gradient of the objective function. We show how to accommodate nonnegativity constraints on the resources using the results derived. Ex...


conference on decision and control | 2003

On constraint sampling in the linear programming approach to approximate linear programming

Daniela Pucci de Farias; B. Van Roy

In the linear programming approach to approximate dynamic programming, one tries to solve a certain linear program - the ALP -, which has a relatively small number K of variables but an intractable number M of constraints. In this paper, we study a scheme that samples and imposes a subset of m /spl Lt/ M constraints. A natural question that arises in this context is: How must m scale with respect to K and M in order to ensure that the resulting approximation is almost as good as one given by exact solution of the ALP? We show that, under certain idealized conditions, m can be chosen independently of M and need grow only as a polynomial in K.In the linear programming approach to approximate dynamic programming, one tries to solve a certain linear program - the ALP -, which has a relatively small number K of variables but an intractable number M of constraints. In this paper, we study a scheme that samples and imposes a subset of m /spl Lt/ M constraints. A natural question that arises in this context is: How must m scale with respect to K and M in order to ensure that the resulting approximation is almost as good as one given by exact solution of the ALP? We show that, under certain idealized conditions, m can be chosen independently of M and need grow only as a polynomial in K.


Journal of The Optical Society of America A-optics Image Science and Vision | 2008

Analytical optical solution of the extension of the depth of field using cubic-phase wavefront coding. Part II. Design and optimization of the cubic phase

Saeed Bagheri; Paulo E. X. Silveira; Ramkumar Narayanswamy; Daniela Pucci de Farias

In this paper we use our derived approximate representation of the modulation transfer function to analytically solve the problem of the extension of the depth of field for two cases of interest: uniform quality imaging and task-based imaging. We derive the optimal result for each case as a function of the problem specifications. We also compare the two different imaging cases and discuss the advantages of using our optimization approach for each case. We also show how the analytical solutions given in this paper can be used as a convenient design tool as opposed to previous lengthy numerical optimizations.

Collaboration


Dive into the Daniela Pucci de Farias's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Paulo E. X. Silveira

University of Colorado Boulder

View shared research outputs
Top Co-Authors

Avatar

George Barbastathis

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Jonathan P. How

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Hariharan Lakshmanan

Massachusetts Institute of Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge