Skip to content

Commit 0a5eb08

Browse files
author
Alexander Ororbia
committed
added some text to rl-snn lesson
1 parent 7b6aed2 commit 0a5eb08

File tree

1 file changed

+5
-0
lines changed

1 file changed

+5
-0
lines changed

docs/museum/rl_snn.md

+5
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,11 @@ exhibit can be found
1111

1212
## Modeling Operant Conditioning through Modulation
1313

14+
Operant conditioning refers to the idea that there are environmental stimuli that can either increase or decrease the occurrence of (voluntary) behaviors; in other words, positive stimuli can lead to future repeats of a certain behavior whereas negative stimuli can lead to (i.e., punish) a decrease in future occurences. Ultimately, operant conditioning, through consequences, shapes voluntary behavior where actions followed by rewards are repeated and actions followed by punished/negative outcomes diminish.
15+
16+
In this lesson, we will model very simple case of operant conditioning for a neuronal motor circuit used to engage in the navigation of a simple maze. The maze's design will be the rat T-maze and the "rat" will be allowed to move, at a particular point in the maze, in one of four directions (North, South, West, and East). A positive reward will be supplied to our rat neuronal circuit if it makes progress towards the direction of food (placed in the upper right corner of the T-maze) and a negative reward will be provided if fails to make progress/gets stuck, i.e., a dense reward functional will be employed.
17+
18+
1419

1520
### Reward-Modulated Spike-Timing-Dependent Plasticity (R-STDP)
1621

0 commit comments

Comments
 (0)