ePoster
Modeling the trial-by-trial dynamics of a reversal learning task with reinforcement learning
Nicolas Diekmann and 2 co-authors
FENS Forum 2024
Messe Wien Exhibition & Congress Center, Vienna, Austria
Abstract
Analysis of behavioral data usually averages responses across subjects and/or across trials. However, this approach obscures the actual dynamics of learning, which can occur within a few trials and differ between subjects. For instance, reexamination of previous data (Uengoer and Lachnit, 2006) reveals that, during learning, subjects change their responses not gradually but abruptly (Fig. A). These “jumps” occur at different trials for each subject, and averaging therefore paints a misleading picture of gradual learning curves. These data are not easily accounted for by simple associative learning models such as the Rescorla-Wagner model. Hence, we used Deep Reinforcement Learning (Deep RL) to model the choice behavior of subjects in reversal learning. In the task, subjects were shown stimuli (food items) in either of two contexts (restaurants) on each trial and had to predict whether the presented item would result in stomach trouble. The Deep RL agent takes simplified observations representing the item-restaurant pairings as input (Fig. B) and outputs Q-values, i.e., expected future rewards, for two actions that reflect the possible choices of the subjects. Deep RL agents were trained on the experiences of each individual subject. A simple grid search was used to determine the hyperparameters (e.g., the learning rate) that yielded the best fit. Finally, we compared our results to the Rescorla-Wagner model. In summary, we show the importance of analyzing subject-level learning dynamics and test the ability of RL and associative models to account for them.
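
For context on the associative baseline: the Rescorla-Wagner model updates the associative strength V of the presented cue toward the trial outcome λ by a fixed fraction of the prediction error. A minimal statement of the rule, with the salience parameters collapsed into a single learning rate α (a common simplification; the abstract does not specify the exact parameterization), is

V_{t+1} = V_t + \alpha \, (\lambda_t - V_t)

Because every update moves V by a constant fraction of the remaining prediction error, the model produces smooth, gradual learning curves, which is why it struggles to capture the abrupt, subject-specific response changes described above.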
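
As a sketch of the per-subject fitting procedure, the following Python code trains a small Q-network on one subject's trial sequence and grid-searches the learning rate. The network size, the bandit-style training target (no bootstrapping across trials), and the candidate learning rates are illustrative assumptions, not the exact setup of the poster.

import torch
import torch.nn as nn

N_PAIRINGS = 8   # hypothetical number of item-restaurant pairings
N_ACTIONS = 2    # predict "stomach trouble" vs. "no trouble"

def make_agent():
    # Small feed-forward network mapping an observation to Q-values.
    return nn.Sequential(nn.Linear(N_PAIRINGS, 16), nn.ReLU(),
                         nn.Linear(16, N_ACTIONS))

def fit_subject(observations, choices, rewards, lr):
    # Train on the subject's own experiences: the Q-value of the chosen
    # action is regressed toward the obtained reward (a bandit-style
    # target, i.e., no bootstrapping across trials -- an assumption).
    agent = make_agent()
    opt = torch.optim.SGD(agent.parameters(), lr=lr)
    for obs, a, r in zip(observations, choices, rewards):
        loss = (agent(obs)[a] - r) ** 2
        opt.zero_grad()
        loss.backward()
        opt.step()
    return agent

def grid_search(observations, choices, rewards, lrs=(0.01, 0.05, 0.1)):
    # Simple grid search over the learning rate; fit is scored here as
    # the summed squared error between Q-values and observed outcomes.
    best_err, best_lr, best_agent = None, None, None
    for lr in lrs:
        agent = fit_subject(observations, choices, rewards, lr)
        with torch.no_grad():
            err = sum(float((agent(o)[a] - r) ** 2)
                      for o, a, r in zip(observations, choices, rewards))
        if best_err is None or err < best_err:
            best_err, best_lr, best_agent = err, lr, agent
    return best_lr, best_agent

# Toy usage: 20 trials with one-hot observations and random choices/outcomes.
obs = [torch.eye(N_PAIRINGS)[torch.randint(N_PAIRINGS, (1,)).item()]
       for _ in range(20)]
choices = [torch.randint(N_ACTIONS, (1,)).item() for _ in range(20)]
rewards = [float(torch.randint(2, (1,)).item()) for _ in range(20)]
best_lr, agent = grid_search(obs, choices, rewards)

In this sketch, each observation is a one-hot tensor over item-restaurant pairings, mirroring the simplified observations described in the abstract, and the best-fitting hyperparameters can then be compared across subjects.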