Neural Circuits Trained with Standard Reinforcement Learning Can Accumulate Probabilistic Information during Decision Making.
2017. Neural Comput, 29(2):368-393.
PMC5462093