Nichols, B.

A comparison of action selection methods for implicit policy method reinforcement learning in continuous action-space

Conference paper

Nichols, B. 2016. A comparison of action selection methods for implicit policy method reinforcement learning in continuous action-space. International Joint Conference on Neural Networks (IJCNN 2016). Vancouver, Canada, 24-29 Jul 2016. IEEE. pp. 3785-3792. https://doi.org/10.1109/IJCNN.2016.7727688

Type	Conference paper
Title	A comparison of action selection methods for implicit policy method reinforcement learning in continuous action-space
Authors	Nichols, B.
Abstract	In this paper I investigate methods of applying reinforcement learning to continuous state- and action-space problems without a policy function. I compare the performance of four methods, one of which is the discretisation of the action-space, and the other three are optimisation techniques applied to finding the greedy action without discretisation. The optimisation methods I apply are gradient descent, Nelder-Mead and Newton's Method. The action selection methods are applied in conjunction with the SARSA algorithm, with a multilayer perceptron utilized for the approximation of the value function. The approaches are applied to two simulated continuous state- and action-space control problems: Cart-Pole and double Cart-Pole. The results are compared both in terms of action selection time and the number of trials required to train on the benchmark problems.
Conference	International Joint Conference on Neural Networks (IJCNN 2016)
Page range	3785-3792
ISSN	2161-4407
ISBN
Hardcover	9781509006205
Publisher	IEEE
Publication dates
Print	03 Nov 2016
Publication process dates
Deposited	19 May 2016
Accepted	15 Mar 2016
Output status	Published
Accepted author manuscript	PID4199377.pdf
Copyright Statement	Full text: © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Digital Object Identifier (DOI)	https://doi.org/10.1109/IJCNN.2016.7727688
Language	English
Book title	2016 International Joint Conference on Neural Networks (IJCNN)

Permalink: https://repository.mdx.ac.uk/item/86677
