Dynamic choice of state abstraction in Q-learning

Q-learning associates state-action pairs of a Markov Decision Process with expected future reward through online learning. In practice, however, when the state space is large and experience is still limited, the algorithm will often fail to match the current state against past experience unless some details describing states are ignored. On the other hand, reducing state information hurts long-term performance, because decisions must then be made on less informative inputs. We propose a variation of Q-learning that gradually enriches state descriptions once enough experience has been accumulated. This is coupled with an ad hoc exploration strategy that aims to collect the key information that allows the algorithm to enrich state descriptions earlier. Experimental results obtained by applying our algorithm to the arcade game Pac-Man show that our approach significantly outperforms Q-learning during the learning process while not penalizing long-term performance.
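The abstract describes the idea only in prose; the sketch below is a minimal, hypothetical illustration of tabular Q-learning in which the state description is enriched once a state has been visited often enough. The abstraction functions (`coarse`, `fine`), the visit-count `threshold`, and all parameter names are assumptions made for illustration, not details taken from the paper.

```python
import random
from collections import defaultdict


def coarse(state):
    # Hypothetical coarse abstraction: keep only Pac-Man's position.
    return (state["pacman"],)


def fine(state):
    # Hypothetical fine abstraction: also include the nearest ghost.
    return (state["pacman"], state["nearest_ghost"])


def choose_abstraction(visit_count, threshold=50):
    # Switch to the richer description once the coarse state has been
    # seen often enough (an illustrative criterion only).
    return fine if visit_count >= threshold else coarse


class AbstractionQLearner:
    def __init__(self, actions, alpha=0.1, gamma=0.95, epsilon=0.1):
        self.q = defaultdict(float)      # Q-values keyed by (abstract state, action)
        self.visits = defaultdict(int)   # visit counts for coarse states
        self.actions = actions
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def _abstract(self, state):
        # Map the raw state through whichever abstraction is currently active.
        key = coarse(state)
        return choose_abstraction(self.visits[key])(state)

    def act(self, state):
        # Epsilon-greedy action selection over the abstracted state.
        s = self._abstract(state)
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(s, a)])

    def update(self, state, action, reward, next_state):
        # Standard Q-learning update applied to abstracted states.
        self.visits[coarse(state)] += 1
        s, s2 = self._abstract(state), self._abstract(next_state)
        best_next = max(self.q[(s2, a)] for a in self.actions)
        self.q[(s, action)] += self.alpha * (
            reward + self.gamma * best_next - self.q[(s, action)]
        )
```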

Authors: 
Marco Tamassia
Fabio Zambetta
William L. Raffe
Florian 'Floyd' Mueller
Xiaodong Li
Presented At: 
ECAI 2016, Frontiers in Artificial Intelligence and Applications (FAIA), vol. 285, pp. 46-54
Year: 
2016
Type: 
Conference Proceedings