« Mid-thesis milestone prompts a new thesis | Main | Monkey Monastery: Call for feedback »

Apples to Oranges: Lessons in reward design, taught by neurons

According to news of paper by Padoa-Schioppa & Assad. in Nature Neuroscience, neurons in a primate's orbitofrontal cortex correlate to behavioral preference for a juice when presented with a ratio of apple and grape juice. Moreover, the neural correlation was invariant regardless of other options, suggesting that preference transitivity (A to B, B to C, therefore A to C) is reflected in neural activity of the orbitofrontal cortex. As commenter Neil Farbstein wrote, it's "comparing apples to oranges!"

This research was performed on rhesus monkeys. I wonder what their preference was in terms of a ratio of duration of how many days in testing 600 to 2400 trials of sips of juice compared to how many days surviving in the wild. Their methods on one male and female rhesus monkey were:


Under general anesthesia, we implanted a head-restraining device and a recording chamber on the skull of the monkeys, and implanted a scleral eye coil. We used large, oval, custom-made chambers (main axes 50 x 30 mm), centered on stereotaxic coordinates (A30, L0), with the longer axis parallel to a coronal plane. Following surgery, monkeys were given antibiotics (cefazolin, 20 mg per kg of body weight) and analgesics (buprenorphine, 0.005 mg per kg; flunixin, 1 mg per kg) for 3 d. During the experiments, monkeys sat in a monkey chair in a darkened room. The head was restrained and the eye position was monitored continuously using a scleral eye-coil system (Riverbend Instruments). A computer monitor was placed 57 cm in front of the monkeys, and the behavioral task was controlled by custom-written software.

Related to preference functions in Nature Neuroscience is Kable & Glimcher's article "The neural correlates of subjective value during intertemporal choice", which "provides unambiguous evidence that the subjective value of potential rewards is explicitly represented in the human brain."

The actual interface of the simulation was a trivial computer game (or computer investing) of sorts: An offer appears on screen as money and delay (like "$40 / 30 days"), a button is held down, six seconds pass, a green dot appears, and the subject releases the button. For motor task independence, half the trials release indicated immediate ($20) or delayed reward (variable). The amount on the screen was money that would be given after the duration of delay.

The article recounted the hyperbolic function of subjective value: SV = 1 / (1 + kD); in which subjective value (SV) equals inverse of subject-specific constant (k from 0.0005 to 0.1189) and delay of reward (D in days). This function might have application to reward design in games, especially if a similar formula holds true on the time scale of seconds or minutes of a single playing session. An example exists in harvesting or saving the little girls in BioShock. Harvesting has an immediate reward whereas saving offers a delayed reward. Are there other examples that illustrate delayed reward design?

Especially for multiplayer games, the relative ranking of the reward compared to others is important, at least for men. Experiment in neuroeconomics lab at University of Bonn says the activitation of neural reward evaluators in the brain correlate to getting a higher reward or performing better than another participant. So competition, is rewarding for the winners to the extent that there are losers. And advance in a society is measured by surpassing the Joneses.

There is a hint that this psychological mechanism may extend to women, that they at least compare themselves to other women when making decisions. When mitigating risks in news of one study on breast cancer, the woman was more likely mitigate risk when she was in a risk category above average, less likely when in a risk category below average. Interestingly, the comparison influence remained independent from a range of absolute risks. Thus their decision correlated less to absolute risk, and more to comparison with something like "losing" to their peers.

Comments (2)

Wiggledog [TypeKey Profile Page]:

Good to see you're reading neuro papers, Ethan. You might want to check out Robert Sapolsky's papers on how alpha- and beta-male baboons interact.

Thanks for the recommendation! In an Edge article, Robert's survey of gratification postponement and altruism is relevant to reward design, and his discussion of a baboon dropping out of the rat race is eye-opening.

The typical male baboon career trajectory is to fight your way to the top while building some good coalitional skills. When you're relatively high-ranking and if you're going to stay up there, you switch from physical prowess to psychological intimidation and social skills. But eventually it catches up with you and you finally get into a key fight and get killed or crippled or are utterly defeated and you crash way down. However, every decade you'll get some guy who's fought his way up, and six months into his ascendancy suddenly decides, "Who needs this?" and voluntarily walks away from it. They seem to have some sort of epiphanal mid-life crisis and go on to spend the rest of their lives hanging out with infants and forming social attachments with females.

He's high on my reading list this winter; I guess I'll start with Monkeyluv.

Happy holiday!

Post a comment

(If you haven't left a comment here before, you may need to be approved by the site owner before your comment will appear. Until then, it won't appear on the entry. Thanks for waiting.)

About

This page contains a single entry from the blog posted on December 15, 2007 2:43 AM.

The previous post in this blog was Mid-thesis milestone prompts a new thesis.

The next post in this blog is Monkey Monastery: Call for feedback.

Many more can be found on the main index page or by looking through the archives.

Powered by
Movable Type 3.31