Simple statistical gradient-following algorithms for connectionist reinforcement learning

Abstract: This article presents a general class of associative reinforcement learning algorithms for connectionist networks containing stochastic units. These algorithms, called REINFORCE algorithms, are shown to make weight adjustments in a direction that lies along the gradient of expected reinforc...

Author:

Williams, Ronald J. [author]

Format:

Article

Language:

English

Published:

1992

Keywords:

Reinforcement learning

connectionist networks

gradient descent

mathematical analysis

Note:

© Kluwer Academic Publishers 1992

Parent work:

Contained in: Machine learning - Kluwer Academic Publishers, 1986, 8(1992), 3-4, May, pages 229-256

Parent work:

volume:8 ; year:1992 ; number:3-4 ; month:05 ; pages:229-256

DOI / URN:

10.1007/BF00992696

Catalog ID:

OLC2026512213
