Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Erik Talvitie is active.

Publication


Featured researches published by Erik Talvitie.


Journal of Artificial Intelligence Research | 2018

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents

Marlos C. Machado; Marc G. Bellemare; Erik Talvitie; Joel Veness; Matthew J. Hausknecht; Michael H. Bowling

The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge of building AI agents with general competency across dozens of Atari 2600 games. It supports a variety of different problem settings and it has been receiving increasing attention from the scientific community, leading to some high-profile success stories such as the much publicized Deep Q-Networks (DQN). In this article we take a big picture look at how the ALE is being used by the research community. We show how diverse the evaluation methodologies in the ALE have become with time, and highlight some key concerns when evaluating agents in the ALE. We use this discussion to present some methodological best practices and provide new benchmark results using these best practices. To further the progress in the field, we introduce a new version of the ALE that supports multiple game modes and provides a form of stochasticity we call sticky actions. We conclude this big picture look by revisiting challenges posed when the ALE was introduced, summarizing the state-of-the-art in various problems and highlighting problems that remain open.


Journal of Artificial Intelligence Research | 2011

Learning to make predictions in partially observable environments without a generative model

Erik Talvitie; Satinder P. Singh

When faced with the problem of learning a model of a high-dimensional environment, a common approach is to limit the model to make only a restricted set of predictions, thereby simplifying the learning problem. These partial models may be directly useful for making decisions or may be combined together to form a more complete, structured model. However, in partially observable (non-Markov) environments, standard model-learning methods learn generative models, i.e. models that provide a probability distribution over all possible futures (such as POMDPs). It is not straightforward to restrict such models to make only certain predictions, and doing so does not always simplify the learning problem. In this paper we present prediction profile models: non-generative partial models for partially observable systems that make only a given set of predictions, and are therefore far simpler than generative models in some cases. We formalize the problem of learning a prediction profile model as a transformation of the original model-learning problem, and show empirically that one can learn prediction profile models that make a small set of important predictions even in systems that are too complex for standard generative models.


international joint conference on artificial intelligence | 2018

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract).

Marlos C. Machado; Marc G. Bellemare; Erik Talvitie; Joel Veness; Matthew J. Hausknecht; Michael H. Bowling

The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge of building AI agents with general competency across dozens of Atari 2600 games. It supports a variety of different problem settings and it has been receiving increasing attention from the scientific community, leading to some high-profile success stories such as the much publicized Deep Q-Networks (DQN). In this article we take a big picture look at how the ALE is being used by the research community. We show how diverse the evaluation methodologies in the ALE have become with time, and highlight some key concerns when evaluating agents in the ALE. We use this discussion to present some methodological best practices and provide new benchmark results using these best practices. To further the progress in the field, we introduce a new version of the ALE that supports multiple game modes and provides a form of stochasticity we call sticky actions. We conclude this big picture look by revisiting challenges posed when the ALE was introduced, summarizing the state-of-the-art in various problems and highlighting problems that remain open.


international joint conference on artificial intelligence | 2007

An experts algorithm for transfer learning

Erik Talvitie; Satinder P. Singh


adaptive agents and multi-agents systems | 2016

State of the Art Control of Atari Games Using Shallow Reinforcement Learning

Yitao Liang; Marlos C. Machado; Erik Talvitie; Michael H. Bowling


neural information processing systems | 2008

Simple Local Models for Complex Dynamical Systems

Erik Talvitie; Satinder P. Singh


international conference on machine learning | 2014

Skip Context Tree Switching

Marc G. Bellemare; Joel Veness; Erik Talvitie


uncertainty in artificial intelligence | 2014

Model regularization for stable sample rollouts

Erik Talvitie


national conference on artificial intelligence | 2015

Improving exploration in UCT using local manifolds

Sriram Srinivasan; Erik Talvitie; Michael H. Bowling


national conference on artificial intelligence | 2016

Self-Correcting Models for Model-Based Reinforcement Learning.

Erik Talvitie

Collaboration


Dive into the Erik Talvitie's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Matthew J. Hausknecht

University of Texas at Austin

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge