William Uther
University of New South Wales
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by William Uther.
Journal of Artificial Intelligence Research | 2011
Joel Veness; Kee Siong Ng; Marcus Hutter; William Uther; David Silver
This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a new Monte-Carlo Tree Search algorithm along with an agent-specific extension to the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a variety of stochastic and partially observable domains. We conclude by proposing a number of directions for future research.
Journal of Applied Logic | 2013
Marcus Hutter; John W. Lloyd; Kee Siong Ng; William Uther
Abstract Automated reasoning about uncertain knowledge has many applications. One difficulty when developing such systems is the lack of a completely satisfactory integration of logic and probability. We address this problem directly. Expressive languages like higher-order logic are ideally suited for representing and reasoning about structured knowledge. Uncertain knowledge can be modeled by using graded probabilities rather than binary truth values. The main technical problem studied in this paper is the following: Given a set of sentences, each having some probability of being true, what probability should be ascribed to other (query) sentences? A natural wish-list, among others, is that the probability distribution (i) is consistent with the knowledge base, (ii) allows for a consistent inference procedure and in particular (iii) reduces to deductive logic in the limit of probabilities being 0 and 1, (iv) allows (Bayesian) inductive reasoning and (v) learning in the limit and in particular (vi) allows confirmation of universally quantified hypotheses/sentences. We translate this wish-list into technical requirements for a prior probability and show that probabilities satisfying all our criteria exist. We also give explicit constructions and several general characterizations of probabilities that satisfy some or all of the criteria and various (counter)examples. We also derive necessary and sufficient conditions for extending beliefs about finitely many sentences to suitable probabilities over all sentences, and in particular least dogmatic or least biased ones. We conclude with a brief outlook on how the developed theory might be used and approximated in autonomous reasoning agents. Our theory is a step towards a globally consistent and empirically satisfactory unification of probability and logic.
international conference on intelligent transportation systems | 2009
Carlos Aydos; Bernhard Hengst; William Uther
Faced with increasing congestion on urban roads, authorities need better real-time traffic information to manage traffic. Kalman Filters are efficient algorithms that can be adapted to track vehicles in urban traffic given noisy sensor data. A Kalman Filter process model that approximates dynamic vehicle behaviour is a reusable subsystem for modelling the dynamics of a multi-vehicle traffic system. The challenge is choosing an appropriate process model that produces the smallest estimation errors. This paper provides a comparative analysis and evaluation of Linear and Unscented Kalman Filters process models for urban traffic applications.
Archive | 2003
Min Sub Kim; William Uther
In: (pp. pp. 1937-1945). (2009) | 2009
Joel Veness; David Silver; William Uther; Alan D. Blair
international joint conference on artificial intelligence | 2005
Kathryn Kasmarik; William Uther; Mary-Lou Maher
adaptive agents and multi agents systems | 2010
Scott Sanner; William Uther; Karina Valdivia Delgado
Archive | 2003
Jin Chen; Eric Chung; Ross Edwards; Nathan Wong; Eileen Mak; Raymond Sheh; Min Sub Kim; Alex Tang; Nicodemus Sutanto; Bernhard Hengst; Claude Sammut; William Uther; Ict Australia
international joint conference on artificial intelligence | 2013
Marcus Hutter; John W. Lloyd; Kee Siong Ng; William Uther
Archive | 2008
John W. Lloyd; Kee Siong Ng; William Uther