ePoster

TRUTH BIAS IN HUMAN REINFORCEMENT LEARNING

Juan Vidal-Perezand 2 co-authors

University College London (UCL)

FENS Forum 2026 (2026)
Barcelona, Spain
Board PS02-07PM-074

Presentation

Date TBA

Board: PS02-07PM-074

Poster preview

TRUTH BIAS IN HUMAN REINFORCEMENT LEARNING poster preview

Event Information

Poster Board

PS02-07PM-074

Abstract

Truth bias, the inclination to accept information as true even when cues of deception are present, is a well-documented phenomenon in social and cognitive psychology. However, much of this evidence derives from belief reports concerning abstract beliefs shaped by complex ideological priors and episodic memory, while its influence on instrumental reinforcement learning (RL) remains underexplored. We investigated this using a disinformation two-armed bandit task where 201 participants repeatedly chose between two options to maximize reward outcomes. They received outcome-feedback about their choices from sources with different propensities for lying (0%, 33%, 67%, or 100%; explicitly instructed to participants). When a source lied, the feedback was the opposite of the actual reward-outcome (Fig. 1a-c). This ensured that every truthful source (e.g., 100%) had an equally informative deceitful source (e.g., 0%). We applied computational RL modeling to capture how participants integrated this feedback. The model estimated a "credit assignment" parameter for each source-feedback, reflecting the magnitude of belief updates based on such feedback. Our computational modeling revealed a clear truth bias: participants learned significantly more from truthful feedback compared to deceitful feedback (Fig. 1d). Furthermore, since the feedback source's honesty was forewarned before each choice (Fig. 1c), we could examine how expected truthfulness influenced decisions. We found that participants were both less accurate (Fig. 1e) and slower (Fig. 1f) in their choices when they expected to receive deceitful (rather than truthful) feedback. Our findings demonstrate a deeply ingrained truth bias that influences both fundamental learning mechanisms and decision-making behavior within reinforcement learning.

A six-panel figure summarizing the experimental design and results. Top Row (Task Design): Panel a: Diagrams showing the feedback mechanism. For a truthful agent, "True" feedback matches the outcome (e.g., a sad face for a loss). For a lying agent, feedback is inverted (e.g., a sad face represents a monetary reward). Panel b: An "Honesty" scale presented to participants, ranging from 0% (orange, 1 star) to 100% (blue, 4 stars), with intermediate levels at 33% and 67%. Panel c: An example trial timeline showing the presentation of an agent (e.g., 2-star honesty) alongside two bandit options, followed by the agent providing feedback (e.g., a sad face) after a choice is made. Bottom Row (Results): Panel d: A graph of "Credit Assignment" (y-axis) versus "Uncertainty level" (x-axis). A blue line (Truth) tracks significantly higher than an orange line (Lie). Individual data points show a dense distribution where participants assign more value weight to truthful feedback (approx. 1.5 for 100% honesty) compared to deceitful feedback (approx. 1.0 for 0% honesty), even though both are equally informative. Panel e: A line graph of "Accuracy rate" (y-axis). The blue line (Truth) remains above the orange line (Lie) across both uncertainty levels, showing participants make correct choices more often when the source is truthful. Panel f: A line graph of "Reaction time" in milliseconds (y-axis). The orange line (Lie) spikes upward in the "Certain" condition (0% honesty) to approx. 815ms, significantly higher than the blue line (100% honesty) at approx. 790ms, indicating slower decision speeds when expecting lies. Asterisks indicate statistical significance (p<0.001).

Recommended posters

Cookies

We use essential cookies to run the site. Analytics cookies are optional and help us improve World Wide. Learn more.