ePoster

Understanding rat behavior in a complex task via non-deterministic policies

Johannes Niediek,Maciej M. Jankowski,Ana Polterovich,Alexander Kazakov,Israel Nelken
COSYNE 2022(2022)
Lisbon, Portugal
Presented: Mar 19, 2022

Conference

COSYNE 2022

Lisbon, Portugal

Resources

Authors & Affiliations

Johannes Niediek,Maciej M. Jankowski,Ana Polterovich,Alexander Kazakov,Israel Nelken

Abstract

We trained five rats to perform a complex auditory-guided task in a large environment (diameter 160 cm) with twelve nose-poke ports. To obtain rewards, rats had to position themselves at specific locations indicated by sounds. Despite the nontrivial task, rats reached high success rates within two 70-minute sessions. We modeled the task as a Markov Decision Process. Observed rat trajectories resembled the model's optimal policies. However, while optimal policies were deterministic, observed behavior was non-deterministic. We introduced non-deterministic, information-limited policies that realize optimal reward rates under constraints on the Kullback-Leibler divergence from a default, non-informative policy (Tishby’s complexity, TC). We estimated the TC of rat movement and nose-poking over more than 10 months by comparing observed behavior with TC-limited policies. Our model revealed a prolonged, large increase in the TC over time. Significantly, this prolonged behavioral refinement was not discernible via reward rates, and to our knowledge has not been described previously. The model also captured individual propensities for preferring some foraging strategies over others. Specifically, one strategy required sharp-angled body-turns. By transiently altering the task structure, we successfully encouraged rats to increase their preference for that strategy. Concurrently, our model uncovered a permanent decrease in body-turn cost in every rat, with new costs that differed between rats but were constant over time within rat. Recording with chronically implanted silicon probes from the left insular cortex, we found that in many neurons, firing rates (averaged over ten minutes) strongly correlated with TC, computed in the same time periods. Significantly, our model is based on first principles of information theory, and does not employ ad-hoc measures of behavior. Thus, we present here novel insights into rat behavioral refinement over very long time scales, and introduce TC as a regressor for cortical activity.

Unique ID: cosyne-22/understanding-behavior-complex-task-b408d6e7