A two step algorithm for learning from unspecific reinforcement
We study a simple learning model based on the Hebb rule to cope with "delayed", unspecific reinforcement. In spite of the unspecific nature of the information-feedback, convergence to asymptotically perfect generalization is observed, with a rate depending, however, in a non-universal way...
| Main Authors: | Kühn, Reimer; Stamatescu, Ion-Olimpiu |
|---|---|
| Format: | Article (Journal) Chapter/Article |
| Language: | English |
| Published: | 1999 |
| In: | arXiv |
| Online Access: | Publisher, free of charge, full text: http://arxiv.org/abs/cond-mat/9902354 |
| Author Notes: | Reimer Kühn, Ion-Olimpiu Stamatescu |