Each subject
where
Each trial is associated with a color type. Conditioned on the color type
This is a sequential decision making problem as the subject should improve estimate of AD to optimize the next choice. Here we model an ideal Bayesian player, who updates belief by the Bayes rule and score the choices accordingly.
Here we assume the variance of the likelihood
(Motive for the assumption : Subjects already played a practice version of the game before the actual experiment, where virtual predators have an identical AD distribution mean to the ones in the actual experiment.)
The conjugate prior on the mean is also a Gaussian with broad variance
By observing a sequence of
By independence of AD across trials, the total risk (the expected total negative reward) is a sum of risks over trials.
Fixing color type
where
For arbitrary utility
The money reward
Hence,
Pain, as a negative reward, occurs from electric shock if unable to escape.
For given type
Subject
The gap between the cumulative rewards.
Note that this is defined w.r.t. the expected reward from the Bayesian player.
Furthermore, we can quantify the actual cumulative reward w.r.t. the distribution of reward from the Bayesian player.
The cumulative reward gained by playing the optimal strategy
Again, the quantile can be computed from Monte Carlo.
Each subject is parameterized by parameter
By the independent of choices,
The problem is convex and hence
The parameter captures the rationality of subject:
-
$\beta \rightarrow +\infty$ : rational -
$\beta \rightarrow -\infty$ : anti-rational -
$\beta = 0 $ : dumb