William Uther | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where William Uther is active.

Explore More

Publication

Featured researches published by William Uther.

Journal of Artificial Intelligence Research | 2011

A Monte-Carlo AIXI approximation

Joel Veness; Kee Siong Ng; Marcus Hutter; William Uther; David Silver

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a new Monte-Carlo Tree Search algorithm along with an agent-specific extension to the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a variety of stochastic and partially observable domains. We conclude by proposing a number of directions for future research.

Journal of Applied Logic | 2013

Probabilities on Sentences in an Expressive Logic

Marcus Hutter; John W. Lloyd; Kee Siong Ng; William Uther

Abstract Automated reasoning about uncertain knowledge has many applications. One difficulty when developing such systems is the lack of a completely satisfactory integration of logic and probability. We address this problem directly. Expressive languages like higher-order logic are ideally suited for representing and reasoning about structured knowledge. Uncertain knowledge can be modeled by using graded probabilities rather than binary truth values. The main technical problem studied in this paper is the following: Given a set of sentences, each having some probability of being true, what probability should be ascribed to other (query) sentences? A natural wish-list, among others, is that the probability distribution (i) is consistent with the knowledge base, (ii) allows for a consistent inference procedure and in particular (iii) reduces to deductive logic in the limit of probabilities being 0 and 1, (iv) allows (Bayesian) inductive reasoning and (v) learning in the limit and in particular (vi) allows confirmation of universally quantified hypotheses/sentences. We translate this wish-list into technical requirements for a prior probability and show that probabilities satisfying all our criteria exist. We also give explicit constructions and several general characterizations of probabilities that satisfy some or all of the criteria and various (counter)examples. We also derive necessary and sufficient conditions for extending beliefs about finitely many sentences to suitable probabilities over all sentences, and in particular least dogmatic or least biased ones. We conclude with a brief outlook on how the developed theory might be used and approximated in autonomous reasoning agents. Our theory is a step towards a globally consistent and empirically satisfactory unification of probability and logic.

international conference on intelligent transportation systems | 2009

Kalman filter process models for urban vehicle tracking

Carlos Aydos; Bernhard Hengst; William Uther

Faced with increasing congestion on urban roads, authorities need better real-time traffic information to manage traffic. Kalman Filters are efficient algorithms that can be adapted to track vehicles in urban traffic given noisy sensor data. A Kalman Filter process model that approximates dynamic vehicle behaviour is a reusable subsystem for modelling the dynamics of a multi-vehicle traffic system. The challenge is choosing an appropriate process model that produces the smallest estimation errors. This paper provides a comparative analysis and evaluation of Linear and Unscented Kalman Filters process models for urban traffic applications.

Archive | 2003

Automatic Gait Optimisation for Quadruped Robots

Min Sub Kim; William Uther

In: (pp. pp. 1937-1945). (2009) | 2009

Bootstrapping from game tree search

Joel Veness; David Silver; William Uther; Alan D. Blair

international joint conference on artificial intelligence | 2005

Motivated agents

Kathryn Kasmarik; William Uther; Mary-Lou Maher

adaptive agents and multi agents systems | 2010

Approximate dynamic programming with affine ADDs

Scott Sanner; William Uther; Karina Valdivia Delgado

Archive | 2003

A Description of the rUNSWift 2003 Legged Robot Soccer Team

Jin Chen; Eric Chung; Ross Edwards; Nathan Wong; Eileen Mak; Raymond Sheh; Min Sub Kim; Alex Tang; Nicodemus Sutanto; Bernhard Hengst; Claude Sammut; William Uther; Ict Australia

international joint conference on artificial intelligence | 2013