Reward Outcome
reward outcome
Do Capuchin Monkeys, Chimpanzees and Children form Overhypotheses from Minimal Input? A Hierarchical Bayesian Modelling Approach
Abstract concepts are a powerful tool to store information efficiently and to make wide-ranging predictions in new situations based on sparse data. Whereas looking-time studies point towards an early emergence of this ability in human infancy, other paradigms like the relational match to sample task often show a failure to detect abstract concepts like same and different until the late preschool years. Similarly, non-human animals have difficulties solving those tasks and often succeed only after long training regimes. Given the huge influence of small task modifications, there is an ongoing debate about the conclusiveness of these findings for the development and phylogenetic distribution of abstract reasoning abilities. Here, we applied the concept of “overhypotheses” which is well known in the infant and cognitive modeling literature to study the capabilities of 3 to 5-year-old children, chimpanzees, and capuchin monkeys in a unified and more ecologically valid task design. In a series of studies, participants themselves sampled reward items from multiple containers or witnessed the sampling process. Only when they detected the abstract pattern governing the reward distributions within and across containers, they could optimally guide their behavior and maximize the reward outcome in a novel test situation. We compared each species’ performance to the predictions of a probabilistic hierarchical Bayesian model capable of forming overhypotheses at a first and second level of abstraction and adapted to their species-specific reward preferences.
The Desire to Know: Non-Instrumental Information Seeking in Mice
Animals are motivated to acquire knowledge. A particularly striking example is information seeking behavior: animals often seek out sensory cues that will inform them about the properties of uncertain future rewards, even when there is no way for them to use this information to influence the reward outcome, and even when this information comes at a considerable cost. Evidence from monkey electrophysiology and human fMRI studies suggests that orbitofrontal cortex and midbrain dopamine neurons represent the subjective value of knowledge during information seeking behavior. However, it remains unclear how the brain assigns value to information and how it integrates this with other incentives to drive behavior. We have therefore developed a task to test if information preferences are present in mice and study how informational value is imparted on stimuli. Mice are trained to enter a center port and receive an initial odor that instructs them to either go to an informative side port, go to an uninformative side port, or choose freely between them. The chosen side port then yields a second odor cue followed by a delayed probabilistic water reward. The informative port’s odor cue indicates whether the upcoming reward will be big or small. The uninformative port’s odor cue is uncorrelated with the trial outcome. Crucially, the two ports only differ in their odor cues, not in their water value since both offer identical probabilities of big and small rewards. We find that mice prefer the informative port. This preference is evident as a higher percentage choice of the informative port when given a free choice (67% +/- 1.7%, n = 14, p < 0.03), as well as by faster reaction times when instructed to go to the informative port (544ms +/- 21ms vs 795ms +/- 21ms, n = 14, p < 0.001). The preference for information is robust to within-animal reversals of informative and uninformative port locations, and, moreover, mice are willing to pay for information by choosing the informative port even if its reward amount is reduced to be substantially lower than the uninformative port. These behavioral observations suggest that odor stimuli are imparted with informational value as mice learn the information seeking task. We are currently imaging neural activity in orbitofrontal cortex with microendoscopes to identify changes in neural activity that may reflect value associated with the acquisition of knowledge.