A two step algorithm for learning from unspecific reinforcement

We study a simple learning model based on the Hebb rule to cope with "delayed", unspecific reinforcement. In spite of the unspecific nature of the information-feedback, convergence to asymptotically perfect generalization is observed, with a rate depending, however, in a non- universal way...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Kühn, Reimer (VerfasserIn) , Stamatescu, Ion-Olimpiu (VerfasserIn)
Dokumenttyp:	Article (Journal) Kapitel/Artikel
Sprache:	Englisch
Veröffentlicht:	1999
In:	Arxiv
Online-Zugang:	Verlag, kostenfrei, Volltext: http://arxiv.org/abs/cond-mat/9902354
Verfasserangaben:	Reimer Kühn, Ion-Olimpiu Stamatescu

Beschreibung
Zusammenfassung:	We study a simple learning model based on the Hebb rule to cope with "delayed", unspecific reinforcement. In spite of the unspecific nature of the information-feedback, convergence to asymptotically perfect generalization is observed, with a rate depending, however, in a non- universal way on learning parameters. Asymptotic convergence can be as fast as that of Hebbian learning, but may be slower. Moreover, for a certain range of parameter settings, it depends on initial conditions whether the system can reach the regime of asymptotically perfect generalization, or rather approaches a stationary state of poor generalization.
Beschreibung:	Gesehen am 06.03.2018
Beschreibung:	Online Resource

A two step algorithm for learning from unspecific reinforcement

Ähnliche Einträge