ePoster

​EMERGENCE OF COGNITIVE VARIABLES IS SUBJECT TO ARCHITECTURAL CONSTRAINTS IN RECURRENT DECISION CIRCUITS

Carlos Antolínand 3 co-authors

Universidad Autónoma de Madrid

FENS Forum 2026 (2026)
Barcelona, Spain
Board PS01-07AM-348

Presentation

Date TBA

Board: PS01-07AM-348

Poster preview

​EMERGENCE OF COGNITIVE VARIABLES IS SUBJECT TO ARCHITECTURAL CONSTRAINTS IN RECURRENT DECISION CIRCUITS poster preview

Event Information

Poster Board

PS01-07AM-348

Abstract

Reinforcement learning specifies goals for neural circuits (i.e., maximize reward), but it does not determine how internal representations should be organized. In biological systems, however, value, choice, and confidence appear in distinct circuit stages, suggesting that architecture constrains which cognitive variables can exist at all. We investigate this tension between learning objectives and circuit structure using a bimodular actor–critic recurrent network trained on an economic choice task and transferred into a spiking twin while preserving latent geometry (Fig. a). The architecture comprises a control-oriented “actor” that selects actions and an evaluative “critic” that predicts reward from the actor’s state. Although both modules are optimized for the same task, population analyses reveal a systematic division of representational roles: action selection is supported by low-dimensional value-difference coordinates in the actor, whereas object-centered value representations emerge selectively in the critic (Fig. b). Decision confidence is not explicitly trained and is not localized to specific neurons; instead, it arises as a population-level invariant of the same geometry that supports choice (Fig. c). By expanding network size while preserving dynamics, we show that valuation variables require population redundancy for stability (Fig. d-e), whereas action selection remains robust in compact circuits. Finally, we demonstrate that a minimal actor–critic architecture cannot generate categorical commitment states, despite successful learning, revealing a principled architectural limit. These results show that circuit hierarchy and population structure fundamentally constrain which cognitive variables can emerge in recurrent neural systems. Therefore, learning goals alone are insufficient to determine internal codes.

Figure. Spiking actor–critic network and population coding of confidence and value. (a) Schematic of the spiking actor–critic system obtained by transferring trained continuous networks. Offers enter the actor through random, non-transferred input weights (purple), propagate through transferred recurrent weights (black), and are passed to the critic through another set of random weights, where activity evolves in an analogous recurrent structure. In both modules, population activity encodes continuous decision variables: action bias (left vs. right) in the actor and graded object values for juices a and b in the critic. (b) Raster plot of a critic gLIF neuron tuned to the offered value of juice b, grouped by offer magnitude (color). The gray shaded region (1100–1800 ms) indicates the analysis window used for sector classification. The Fano factor (0.94) indicates near–Poisson variability. (c) Difference population activity (left axis) and Bayesian confidence predicted by a trained logistic readout (right axis) across offers, shown for correct (circles) and error (triangles) trials. Population activity closely tracks Bayesian confidence (Spearman ρ = 0.80, p = 0.013), indicating that confidence emerges at the population level rather than from specialized neurons. (d) Choice patterns reconstructed from the difference populations in the actor (left) and critic (right) before and after increasing module size from N = 100 to N = 400. Actor performance is preserved, whereas critic performance improves with expansion. (e) Sector distributions and coefficients of determination for population codes. Increasing population size raises the number of encoding neurons while preserving sector organization, selectively stabilizing value-related representations while leaving action selection largely unchanged, revealing an asymmetry between valuation and control computations.

Recommended posters

Cookies

We use essential cookies to run the site. Analytics cookies are optional and help us improve World Wide. Learn more.