Efficient abstraction selection in reinforcement learning

Article
This article addresses reinforcement learning problems based on factored Markov decision processes (MDPs) in which the agent must choose among a set of candidate abstractions, each built from a different combination of state components. We present and evaluate a new approach that performs effective abstraction selection while being more resource-efficient and/or more general than existing approaches. The core of the approach is to make abstraction selection part of the learning agent's decision-making process by augmenting the agent's action space with internal actions that select the abstraction it uses. We prove that, under certain conditions, this approach yields a derived MDP whose solution gives both the optimal abstraction for the original MDP and the optimal policy under that abstraction. We examine our approach in three domains of increasing complexity: contextual bandit problems, episodic MDPs, and general MDPs with context-specific structure.
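The mechanism the abstract describes, augmenting the action space with internal abstraction-selection actions to form a derived MDP, can be sketched as follows. This is a hedged illustration only, not the authors' implementation; all names here (Abstraction, DerivedMDP, the "select"/"act" action tags, and the base_env interface) are hypothetical.

# A minimal sketch (not the authors' implementation) of augmenting an
# agent's action space with internal actions that choose among candidate
# abstractions. All names are hypothetical.

from dataclasses import dataclass
from typing import Hashable, List, Tuple

@dataclass(frozen=True)
class Abstraction:
    # A candidate abstraction keeps a subset of the factored state's components.
    name: str
    components: Tuple[int, ...]  # indices of the state factors it retains

    def apply(self, state: Tuple[Hashable, ...]) -> Tuple[Hashable, ...]:
        return tuple(state[i] for i in self.components)

class DerivedMDP:
    """Wraps a base factored-MDP environment so the agent first takes an
    internal action choosing an abstraction, then acts on abstracted states."""

    def __init__(self, base_env, abstractions: List[Abstraction],
                 base_actions: List[int]):
        self.base_env = base_env          # assumed to expose reset() and step(a)
        self.abstractions = abstractions  # the candidate set from the problem
        self.base_actions = base_actions
        self.current = None               # abstraction chosen this episode

    def reset(self) -> Tuple[Hashable, ...]:
        self.current = None
        self._state = self.base_env.reset()
        return self._state  # full state; only internal actions are legal yet

    def legal_actions(self):
        if self.current is None:
            # Internal actions: one per candidate abstraction.
            return [("select", i) for i in range(len(self.abstractions))]
        return [("act", a) for a in self.base_actions]

    def step(self, action):
        kind, idx = action
        if kind == "select":
            # Internal action: commit to an abstraction; no reward and no
            # transition in the underlying environment.
            self.current = self.abstractions[idx]
            return self.current.apply(self._state), 0.0, False
        # Ordinary action: forward to the base environment and report the
        # successor state through the chosen abstraction.
        next_state, reward, done = self.base_env.step(idx)
        self._state = next_state
        return self.current.apply(next_state), reward, done

A standard tabular learner run on such a wrapper would jointly learn which abstraction to select and how to act under it, which is the effect the abstract's derived-MDP result describes.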
TNO Identifier
520172
ISSN
0824-7935
Source
Computational Intelligence, 30(4), pp. 657-699.
Pages
657-699
Files
To receive the publication files, please send an e-mail request to TNO Repository.