Publicidad

Publicidad

becas.universia.netBiblioteca.Net

Buscar recursos:

Buscador Google

Resource data



Ver

Importance Sampling for Reinforcement Learning with Multiple Objectives
Shelton, Christian Robert
Location: AITR-2001-003
CBCL-204
http://hdl.handle.net/1721.1/5568

This thesis considers three complications that arise from applying reinforcement learning to a real-world application. In the process of using reinforcement learning to build an adaptive electronic market-maker, we find the sparsity of data, the partial observability of the domain, and the multiple objectives of the agent to cause serious problems for existing reinforcement learning algorithms. We employ importance sampling (likelihood ratios) to achieve good performance in partially observable Markov decision processes with few data. Our importance sampling estimator requires no knowledge about the environment and places few restrictions on the method of collecting data. It can be used efficiently with reactive controllers, finite-state controllers, or policies with function approximation. We present theoretical analyses of the estimator and incorporate it into a reinforcement learning algorithm. Additionally, this method provides a complete return surface which can be used to balance multiple objectives dynamically. We demonstrate the need for multiple goals in a variety of applications and natural solutions based on our sampling method. The thesis concludes with example results from employing our algorithm to the domain of automated electronic market-making.

Belongs to: DSpace at MIT

Descargar SCORM

¡Sea el primero en solicitar este recurso!

Para poder solicitar este recurso debe identificarse como usuario de la biblioteca

Users rating

No hay ninguna valoración para este recurso. Sea el primero en valorar este recurso.

Detalles del recurso

Importance Sampling for Reinforcement Learning with Multiple Objectives
Id. 25089
Idioma inglés (Estados Unidos)
Titulo Importance Sampling for Reinforcement Learning with Multiple Objectives
Autor(es) Shelton, Christian Robert
Location AITR-2001-003
CBCL-204
http://hdl.handle.net/1721.1/5568
Versión 1.0
Estado Final
Descripción This thesis considers three complications that arise from applying reinforcement learning to a real-world application. In the process of using reinforcement learning to build an adaptive electronic market-maker, we find the sparsity of data, the partial observability of the domain, and the multiple objectives of the agent to cause serious problems for existing reinforcement learning algorithms. We employ importance sampling (likelihood ratios) to achieve good performance in partially observable Markov decision processes with few data. Our importance sampling estimator requires no knowledge about the environment and places few restrictions on the method of collecting data. It can be used efficiently with reactive controllers, finite-state controllers, or policies with function approximation. We present theoretical analyses of the estimator and incorporate it into a reinforcement learning algorithm. Additionally, this method provides a complete return surface which can be used to balance multiple objectives dynamically. We demonstrate the need for multiple goals in a variety of applications and natural solutions based on our sampling method. The thesis concludes with example results from employing our algorithm to the domain of automated electronic market-making.
Tipo 108 p.
10551422 bytes
1268632 bytes
application/postscript
application/pdf
Palabras clave AI
Tipo de Interactividad Expositivo
Nivel de Interactividad muy bajo
Audiencia Estudiante
Profesor
Autor
Estructura Atomic
Coste no
Copyright
Formatos 108 p.
10551422 bytes
1268632 bytes
application/postscript
application/pdf
Requerimientos técnicos Browser: Any
Relación [References] AITR-2001-003
[References] CBCL-204
Fecha de contribución 07-may-2008
Contacto