|
1 |
| -# Lecture 4C: Reward-Modulated Spike-Timing-Dependent Plasticity |
| 1 | +# Lecture 4D: Reward-Modulated Spike-Timing-Dependent Plasticity |
2 | 2 |
|
3 | 3 | In this lesson, we will build on the notions of spike-timing-dependent
|
4 | 4 | plasticity (STDP), covered [earlier here](../neurocog/stdp.md), to construct
|
@@ -247,9 +247,8 @@ for i in range(T_max):
|
247 | 247 | ```
|
248 | 248 |
|
249 | 249 | which will run all three models simultaneously for `200` simulated milliseconds
|
250 |
| -and collect statistics of interest. We may then finally make several |
251 |
| -plots of what happens under each STDP mode. First, we will plot the resulting |
252 |
| -synaptic magnitude over time, like so: |
| 250 | +and collect statistics of interest. We may then finally make several plots of what happens under each STDP mode |
| 251 | +(reproducing some key results in <b>[1]</b>. First, we will plot the resulting synaptic magnitude over time, like so: |
253 | 252 |
|
254 | 253 | ```python
|
255 | 254 | import matplotlib.pyplot as plt
|
@@ -374,3 +373,8 @@ modulated STDP updates will only occur when the signal is non-zero; this is
|
374 | 373 | the advantage that MSTDP-ET offers over MSTDP as the synaptic change
|
375 | 374 | dynamics persist (yet decay) in between reward presentation times and thus
|
376 | 375 | MSTDP-ET will be more effective in cases when the reward signal is delayed.
|
| 376 | + |
| 377 | +## References |
| 378 | + |
| 379 | +<b>[1]</b> Florian, Răzvan V. "Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity." |
| 380 | +Neural computation 19.6 (2007): 1468-1502. |
0 commit comments