Cholinergic modulation enables scalable action selection learning in a computational model of the striatum
Metadatos
Mostrar el registro completo del ítemAutor
González-Redondo, Álvaro; Garrido Alcázar, Jesús Alberto; Hellgren Kotaleski, Jeanette; Grillner, Sten; Ros, EduardoEditorial
Nature Publishing Group
Materia
Dopamine Acetylcholine Spiking neural network
Fecha
2025-10-07Referencia bibliográfica
González-Redondo, Á., Garrido, J.A., Hellgren Kotaleski, J. et al. Cholinergic modulation enables scalable action selection learning in a computational model of the striatum. Sci Rep 15, 34902 (2025). https://doi.org/10.1038/s41598-025-18776-3
Patrocinador
MICIU/AEI/10.13039/501100011033 - FEDER, UE (PID2022-140095NB-I00, SENSCOMP); Vetenskapsrådet (VR-M-2020-01652); Horizon 2020 Framework Programme (945539, HBP SGA3); EU Horizon Europe Programme (101147319, EBRAINS 2.0 Project); European Union - Horizon Europe (101137289); Universidad de Granada / CBUA (Open access)Resumen
The striatum plays a central role in action selection and reinforcement learning, integrating cortical
inputs with dopaminergic signals encoding reward prediction errors. While dopamine modulates
synaptic plasticity underlying value learning, the mechanisms that enable selective reinforcement of
behaviorally relevant stimulus-action associations–the structural credit assignment problem–remain
poorly understood, especially in environments with multiple competing stimuli and actions. Here,
we present a computational model in which acetylcholine (ACh), released by striatal cholinergic
interneurons, acts as a channel-specific gating signal that restricts plasticity to brief temporal windows
following action execution. The model implements a biologically plausible three-factor learning rule
requiring presynaptic activity, postsynaptic depolarization, and phasic dopamine, with plasticity
gated by cholinergic pauses that temporally align with behaviorally relevant events. This mechanism
ensures that only synapses involved in the selected behavior are eligible for modification. Through
systematic evaluation across tasks with distractors and contingency reversals, we show that AChgated learning promotes synaptic specificity, suppresses cross-channel interference, and yields
increasingly competitive performance relative to Q-learning in complex tasks, reflecting the scalability
of the proposed learning mechanism. Moreover, the model reveals distinct roles for striatal pathways:
direct pathway (D1) neurons maintain stimulus-specific responses, while indirect pathway (D2)
neurons are progressively recruited to suppress outdated associations during policy adaptation. These
findings provide a mechanistic account of how coordinated cholinergic and dopaminergic signaling can
support scalable and efficient reinforcement learning in the striatum, consistent with experimental
observations of pathway-specific plasticity.





