Cerebral Cortex, Vol. 10, No. 3, 263-271,
March 2000
© 2000 Oxford University Press
Delay Activity of Orbital and Lateral Prefrontal Neurons of the Monkey Varying with Different Rewards
Department of Psychology, Tokyo Metropolitan Institute for Neuroscience, Musashidai 2-6, Fuchu, Tokyo 183-0042, Japan
Abstract
We examined neuronal activity in the orbitofrontal cortex (OFC) in relation to reward expectancy and compared findings with those of the lateral prefrontal cortex (LPFC) in the monkey. Activity of OFC neurons was examined in a delayed reaction time task where every four trials constituted one block within which three kinds of rewards and no reward were delivered in a fixed order. More than half of OFC delay neurons were related to the expectancy of delivery or no-delivery of a reward as the response outcome, while some neurons showed nature-of-reward-specific anticipatory activity changes. These delay-related activities reflected the preference of the animal for each kind of reward and were modulated by the motivational state of the animal. LPFC neurons are reported to show nature-of-reward-specific anticipatory activity changes in a delayed response task when several different kinds of rewards are used. Such reward-dependent activity is observed in LPFC delay neurons both with and without spatially differential delay (working memory-related) activity. Although reward expectancy-related activity is commonly observed in both OFC and LPFC, it is suggested that the OFC is more concerned with motivational aspects, while the LPFC is related to both the cognitive and motivational aspects of the expectancy of response outcome.
Introduction
The orbitofrontal cortex (OFC) plays important roles in motivation and emotion (Stuss and Benson, 1986).
The lateral prefrontal cortex (LPFC) has been shown to play important roles in higher cognitive operations such as retaining working memory in both human and non-human primates (Petrides, 1994
; Goldman-Rakic, 1996
; Fuster, 1997
; Courtney et al., 1998
; D'Esposito et al., 1998
). Neuronal activity in the primate LPFC has been extensively studied in working memory task situations, and working memory-related activity changes have commonly been observed (Niki, 1974
; Funahashi et al., 1989
; Miller et al., 1996
; Rao et al., 1997
). Recently, we have shown that delay neurons of the LPFC also participate in the expectancy of response outcome in relation to the reward that is expected to be delivered in given trials (Watanabe, 1996
). LPFC neurons have also been shown to respond to reward, reinforcement and error (Niki and Watanabe, 1979
; Watanabe, 1989
). Because the OFC is more related to motivational operations than the LPFC (Fuster, 1997
), and delay-related activity changes of primate OFC neurons have not been examined sufficiently since the pioneering study by Rosenkilde et al. (Rosenkilde et al., 1981
), we investigated whether delay neurons of the OFC are also involved in reward expectancy and whether there are differences in the characteristics of reward expectancy-related activity between the OFC and LPFC.
Behavioral experiments on rodents indicate that when different magnitudes of rewards and no reward are delivered in response to the animal's action in a fixed order, for example in the order of 4, 2, 1 and 0 pellets, the animal comes to expect the delivery of a specific magnitude of reward or no reward as the response outcome in each trial (Hulse and Dorsky, 1977
). We examined OFC neuronal activity in relation to the expectancy, not of different magnitudes but of different kinds of reward, by training monkeys in a delayed reaction time task where every four trials constituted one block and three different kinds of rewards and no reward were delivered in a fixed order.
In this paper we first report the results of the experiment where we examined the delay-related activities of OFC neurons. Second, we briefly describe reward expectancy-related LPFC neuronal activities that have already been reported (Watanabe, 1996
). Then we compare the characteristics of delay activity of OFC and LPFC neurons in relation to the expectancy of a response outcome to investigate the possible roles of the OFC and LPFC in goal-directed behavior.
Delay Activity of Orbitofrontal Neurons in Relation to Reward Expectancy
Materials and Methods
Experimental Design and Recording
We trained two monkeys (Macaca fuscata) on three kinds of delayed reaction time tasks (Fig. 1
). Each monkey was seated on a primate chair facing a panel that contained a rectangular window, a circular key and a hold lever below them. The window contained two screens, one opaque and one transparent with thin vertical lines. Food reward was given in the window while liquid reward was given through one of three tubes attached to the animal's mouth.
In the Cued liquid reward task (Fig. 1a
In the Cued food reward task (Fig. 1b
), food rewards instead of liquid rewards were used. Within a block of four trials, the color cue was again presented in the order red-red-red-green. Following a correct response, the two screens of the window were raised and the animal could obtain a piece (~0.3 g) of food reward on red cue trials, while an empty tray was presented on green cue trials. The outcome of the animal's correct responses within each block was the delivery of different kinds of rewards in the fixed order of (1) sweet potato, (2) raisin, (3) cabbage and (4) no reward.
In the Visible food reward task (Fig. 1c
), instead of the color light as a cue, the presence or absence of a particular food indicated the outcome of the animal's response. During the cue period, the opaque screen was raised and the animal could see a food reward or empty tray behind the transparent screen. The order of presenting three different kinds of food rewards and empty tray as a cue, and thus the order of the animal's response outcomes, was the same as that in the Cued food reward task.
In these tasks, the animal was only required to press the key within 1 s after the go signal presentation to obtain the reward. Although the animal was not explicitly required to memorize the serial position of each kind of reward in a block, nor required to expect the specific reward in each trial, behavioral experiments (Hulse and Dorsky, 1977
) have suggested that the animal would do so.
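The fixed within-block order can be made concrete with a short sketch. It covers only the Cued food reward task as described above; the names `FOOD_BLOCK` and `trial_schedule` are illustrative and are not taken from the original study.

```python
from itertools import cycle, islice

# Fixed within-block order described for the Cued food reward task:
# three red-cue trials with different foods, then one green-cue no-reward trial.
FOOD_BLOCK = [
    ("red", "sweet potato"),
    ("red", "raisin"),
    ("red", "cabbage"),
    ("green", None),  # no-reward trial: an empty tray is presented
]


def trial_schedule(n_trials, block=FOOD_BLOCK):
    """Return the (cue color, outcome) pair for n_trials consecutive trials."""
    return list(islice(cycle(block), n_trials))


# One recording epoch was ~40 trials (10 blocks); print the first two blocks.
for number, (cue, outcome) in enumerate(trial_schedule(8), start=1):
    print(number, cue, outcome or "no reward")
```

Because the red cue is identical on the three rewarded trials, only the position in this repeating sequence tells the animal which food to expect.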
Preferences for different kinds of foods by individual animals were examined separately from the experiment, by free choice tests among potato, raisin and cabbage rewards, and also by choice tests between each pair. Preferences for different kinds of liquid rewards were examined by testing the animal's willingness to perform the task with one kind of reward after refusing to perform the task with another kind of reward.
Details of the surgery and recording methods have been described previously (Watanabe, 1990
; Hikosaka, 1999
). Extracellular recordings were made using an Elgiloy electrode (Suzuki and Azuma, 1976
), and impulses recorded from isolated single neurons were fed through a window discriminator to a computer. During the recording, the activity of each neuron was first examined in a certain task for ~40 trials (10 blocks), which constituted one recording epoch. Then the activity was reexamined in this task after one or two recording epochs in another task(s). The data obtained from different epochs in the same task were compiled for analysis. To record neuronal activity, electrodes were inserted vertically in the frontal plane. For precise placement of electrodes, a guide tube was used. The guide tube was placed on the dura of the cortex and electrodes were advanced through the guide tube. The recording sites extended from 29-38 mm anterior to the interaural plane and from medial to lateral portions of the ventral surface of the prefrontal cortex (Fig. 2
).
During the recording, probe tests were sometimes conducted where the three different kinds of rewards were delivered in a different order from the original fixed order within a block.
Data Analysis
The data were analyzed off-line. Raster displays and frequency histograms were used for graphic representation. Non-parametric tests (U and H tests) were used for statistical analysis. In this paper, we focus on neuronal activity in the OFC during the delay period. Changes in neuronal activity during the delay period were first compared with those during the pre-cue control period, and were then compared with each other among the four reward conditions (three different kinds of rewards and no-reward).
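As an illustration of the first comparison (delay period versus pre-cue control period), the following is a minimal sketch of a two-sided Mann-Whitney U test. It uses the normal approximation without tie correction, which is an assumption of this sketch and not necessarily the procedure used in the study. The firing rates are hypothetical values invented for the example.

```python
import math


def mann_whitney_u(x, y):
    """Two-sided Mann-Whitney U test (normal approximation, no tie correction).

    Returns (U, p) where U is the smaller of the two U statistics.
    """
    # Pool both samples, remembering which group each value came from.
    pooled = sorted((value, group) for group, sample in enumerate((x, y)) for value in sample)
    n = len(pooled)
    rank_sum_x = 0.0
    i = 0
    while i < n:
        # Find the run of tied values starting at position i.
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        average_rank = (i + 1 + j) / 2.0  # mean of ranks i+1 .. j
        rank_sum_x += average_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    n1, n2 = len(x), len(y)
    u1 = rank_sum_x - n1 * (n1 + 1) / 2.0
    u = min(u1, n1 * n2 - u1)
    mean_u = n1 * n2 / 2.0
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u - mean_u) / sd_u
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return u, p


# Hypothetical per-trial firing rates (spikes/s), not data from the paper.
delay_rates = [12.0, 15.5, 11.0, 14.2, 13.3, 16.1]
control_rates = [4.0, 5.5, 3.2, 6.1, 4.8, 5.0]
u_stat, p_value = mann_whitney_u(delay_rates, control_rates)
print(u_stat, p_value)
```

The same function could be applied pairwise to the four reward conditions, although the H test mentioned above is the natural choice when all four are compared at once.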
To evaluate characteristics of activity changes in OFC neurons during the delay period in relation to the difference in response outcome (three kinds of rewards and no-reward), two kinds of indices were employed: the reward-no-reward discrimination ratio (RNRDR) and the reward preference ratio (RPR). The RNRDR was calculated by the following formula:
[Equation images for the RNRDR and RPR formulas are missing from this copy.]
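Since the formulas themselves are not reproduced here, the sketch below uses an assumed form for both indices, chosen only so that it agrees with the ranges reported in the Results: an RNRDR below 1 when the neuron is more active before a reward than before no reward, and an RPR between 0 and 1 with lower values meaning sharper discrimination. The functions `activity_change`, `rnrdr` and `rpr`, and the use of the absolute change from the pre-cue control rate, are assumptions of this sketch and should not be read as the authors' definitions.

```python
def activity_change(delay_rate, control_rate):
    """Magnitude of the delay-period change relative to the pre-cue control (assumed measure)."""
    return abs(delay_rate - control_rate)


def rnrdr(no_reward_delay, best_reward_delay, control_rate):
    """Assumed form: change on no-reward trials divided by change on best-reward trials."""
    denominator = activity_change(best_reward_delay, control_rate)
    if denominator == 0:
        raise ValueError("no activity change on best-reward trials")
    return activity_change(no_reward_delay, control_rate) / denominator


def rpr(reward_delay_rates, control_rate):
    """Assumed form: smallest change across reward types divided by the largest change."""
    changes = [activity_change(rate, control_rate) for rate in reward_delay_rates]
    if max(changes) == 0:
        raise ValueError("no activity change on any reward trial")
    return min(changes) / max(changes)


# Hypothetical mean firing rates (spikes/s) for one neuron, invented for illustration.
control = 2.0
print(rnrdr(no_reward_delay=4.0, best_reward_delay=10.0, control_rate=control))
print(rpr([10.0, 8.0, 9.0], control_rate=control))
```

Under this assumed form, a neuron that fires only before rewards gives an RNRDR near 0, and a neuron that fires equally for all rewards gives an RPR of 1.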
Histology
After the final recording session, each monkey was deeply anesthetized with pentobarbital sodium (45 mg/kg) and perfused transcardially with warm saline followed by 10% formal saline. The brain was removed and blocks of the brain were placed in fixative containing 10% formalin and 30% sucrose until they sank. The brains were frozen and sectioned at a thickness of 50 µm along the coronal plane. Every fifth section was stained for cell bodies with cresyl violet. The electrode tracks were reconstructed from both traces of electrode penetration and electrolytic lesions that were made at selected penetration sites.
Results
Preference tests for different kinds of food rewards revealed that the animal consistently preferred cabbage to potato to raisin. Preference tests for different kinds of liquids revealed that the animal preferred orange juice and grape juice far more than water in that the animal was willing to perform the task for an orange juice or grape juice reward after refusing to perform it for water reward, while the reverse never occurred. There was no consistent difference in the animal's preference between orange and grape juice rewards.
There were 207 (130 and 77) penetrations in two monkeys. Of 501 OFC neurons isolated in two monkeys, 235 (47%) were task-related. Of these 235 neurons, 88 (18%) could be examined in at least two different kinds of tasks. We focus here on 50 neurons that showed delay-related differential activity depending on differences in the response outcome (three kinds of rewards and no-reward) on at least two of three different kinds of tasks. Half (n = 25) of the neurons showed activations during the delay period for all kinds of reward trials without nature-of-reward specificity, but not in no-reward trials. Six neurons showed activations during the delay period only in no-reward trials without showing activation in any reward trials. Five neurons showed nature-of-reward specificity showing anticipatory activity changes during the delay period only before obtaining a specific reward. Two neurons showed activity changes during the delay period in the trials of both no-reward and least preferred reward (water). In the remaining neurons (n = 12), the characteristics of delay-related activity changes in relation to the reward were not consistent among different tasks, for example some showed delay-related activation in certain reward trials in one task while discriminating reward and no-reward trials in another task.
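The counts in this paragraph can be cross-checked with a few lines of arithmetic. The category labels are paraphrases introduced for this sketch; the numbers are those given in the text.

```python
# Categories of the 50 OFC delay neurons, with counts as reported in the text.
categories = {
    "all reward trials, not no-reward": 25,
    "no-reward trials only": 6,
    "one specific reward only": 5,
    "no-reward and least preferred reward": 2,
    "inconsistent across tasks": 12,
}

total = sum(categories.values())
reward_vs_no_reward = (
    categories["all reward trials, not no-reward"] + categories["no-reward trials only"]
)

print("focus neurons:", total)
print("task-related share of 501 isolated neurons: %.0f%%" % (100 * 235 / 501))
print("examined in two or more tasks: %.0f%%" % (100 * 88 / 501))
print("reward vs no-reward only: %d (%.0f%%)" % (reward_vs_no_reward, 100 * reward_vs_no_reward / total))
```

The last line corresponds to the 31 neurons (62%) that discriminated reward from no-reward without discriminating among kinds of reward.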
Some examples of OFC delay neurons showing reward-related activity changes are presented in Figure 3
. The example in Figure 3a
discriminated between reward and no-reward but not among different kinds of rewards. This neuron, which was examined in the Cued food reward task, showed delay-related activations for all reward trials but not in no-reward trials. Since the same characteristic activity changes were observed in the Visible food reward task (not shown), where actual food or an empty tray was presented as the cue, the differential activity observed in this neuron is not considered to be related to the difference in the color (red versus green) of the cue indicating the presence or absence of reward.
An example of a neuron that demonstrated selectivity to the nature-of-reward is shown in Figure 3b
Another OFC neuron is shown in Figure 3c
. This neuron showed activations in both water and no-reward trials during the delay period. Similar activations in no-reward trials but no activation in any reward trials were observed in the Visible and Cued food reward tasks (not shown). It appears that this neuron responded similarly to no-reward and to the least preferred reward. In this neuron, no-reward and least preferred reward-related activation started before the cue presentation.
In food reward tasks, the animal sometimes refused to ingest the least preferred food reward (raisin) despite the fact the animal continued to perform the task as before. An example of neuronal activity examined during such periods is shown in Figure 4
. This neuron, which is the same as the one shown in Figure 3a
, was examined in the Visible food reward task before, during and after the animal refused to ingest the raisin. After performing several hundred trials of Visible and cued food reward tasks, and thus after ingesting a substantial amount of food, the animal became reluctant to ingest raisin and the magnitude of activation during the delay period in raisin reward trials decreased (Fig. 4a
, after eight trials). When the animal finally refused to ingest raisin, this neuron did not show any activation during the delay period in raisin reward trials (Fig. 4b
), although the animal continued to perform the task in order to advance to the next trial where a more preferred reward (cabbage) could be obtained. However, this neuron continued to show (slightly reduced) activations during the delay period in potato and cabbage reward trials (Fig. 4d
), while the animal refused to ingest raisin. After another hundred trials in the Cued liquid reward task, the animal again began to ingest raisin during the Visible food reward task. This neuron now showed activations again during the delay period in raisin reward trials (Fig. 4c
). The mean firing rate of this neuron during the delay period in raisin reward trials was 6.4, 0.2 and 5.8 spikes/s, (1) before the animal refused to ingest raisin, (2) while the animal was refusing it and (3) when the animal began to ingest it again, respectively. There was a significant difference between the refusal period and each ingestion period (P < 0.01, Student's t-test). The reaction time (RT) of the animal for each period (before, during and after raisin refusal) in raisin reward trials was 524, 672 and 530 ms, respectively (Fig. 4e
), with significant differences in RT between the refusal period and each ingestion period (P < 0.01, Student's t-test).
To quantitatively examine how and to what extent the difference in response outcomes is reflected in delay activity of OFC neurons, we analyzed two kinds of indices that measured the discriminability of individual neurons between the best reward and no reward (RNRDR) and between the best and worst rewards (RPR) (see Materials and Methods).
Since there was no significant difference in the distribution of RNRDR values observed in OFC neurons among the three different kinds of tasks, we present the mean RNRDR value obtained from all three tasks in Figure 5
. There were two clusters in the distribution: one cluster of neurons (shaded part in the figure) showed more delay-related activations in reward than in no-reward trials and demonstrated smaller values, ranging from 0.01 to 0.69, while the other cluster of neurons showed more delay-related activations in no-reward than in reward trials and showed larger values, ranging from 1.02 to 4.58.
RPR values were calculated only on those neurons that showed delay-related activity changes in reward trials. Distributions of RPR values of OFC neurons for three different kinds of tasks are shown in Figure 6
When the serial position of each of three kinds of rewards within a block was changed in probe tests, neurons with nature-of-reward sensitivity showed activity changes according to the relational order among sequential components, that is, according to which reward had been delivered in the previous trial. For example, neurons such as the one in Figure 3b
The RT of the animal was significantly longer on no-reward than on any reward trial. Although there was almost no difference in RT among different kinds of reward trials when the animal was well motivated at the beginning of the daily recording session, there were sometimes differences during the middle of the recording. Although there was no significant correlation between RT and the magnitude of delay-related activity changes in most OFC neurons within reward trials or within no-reward trials, significant correlations were sometimes observed in a few OFC neurons on specific occasions, as shown in Figure 4
.
Histological examination revealed that reward-related delay neurons were found in all areas explored in the OFC, with a slight but not significant tendency for more such neurons to be located in the lateral portions (medial area 12) (Fig. 2
).
Discussion
In the delayed reaction time task, we found OFC neurons that showed differential activity during the delay period depending on the presence or absence of reward, or depending on the nature of reward that would be delivered in given trials. Similar discriminative responses between reward and no-reward trials were observed in most OFC delay neurons for both Visible and Cued food reward tasks. Thus, the differential delay activity observed is considered not to be associated with the difference in the color of the cue which indicated the presence or absence of future reward, nor with the difference in the appearance of reward.
There was clear clustering in the RNRDR values, indicating that there were two types of OFC neurons (Fig. 5
): one type of neuron, with a value of >1, was more activated in the absence of reward, while the other type, which constituted the majority and had a value of <0.7, was more activated in the presence of reward. The existence of two clusters indicates that most OFC neurons showed clear activity changes either on reward or on no-reward trials, but not on both, suggesting that OFC delay neurons are more concerned with the presence or absence of reward.
Considering that the RPR value can range from 0 to 1.0, with a lower value reflecting better discrimination between the best and worst rewards, the mean values obtained (0.70-0.72) may indicate that discriminations among different kinds of rewards are not very sharp in OFC neurons (Fig. 6
). These results may also indicate that OFC neurons are more sensitive to the difference between reward and no-reward than to the nature-of-reward, at least in the task situation where there are trials in which no-reward can be expected. Indeed, the majority of OFC neurons (n = 31, 62%) discriminated between reward and no-reward but not among different kinds of rewards. These neurons are considered to be involved in the expectancy of delivery or no-delivery of reward as a response outcome, information that may be of more interest to the animal than information concerning the nature-of-reward.
Reward expectancy-related activity was found to be modified by the motivational state of the animal. When the animal refused to ingest a specific food (raisin), which was the least preferred, there were no delay-related activations that had previously been observed in association with the animal's ingesting the reward (Fig. 4
). Neurons of the OFC and lateral hypothalamus have been reported to stop responding to the sight and taste of the food or liquid after an animal is fed until satiety (Burton et al., 1976
; Rolls et al., 1989
; Critchley and Rolls, 1996
). This process is nature-of-reward specific and these neurons continue to respond to other kinds of rewards with which the animal is not satiated (Burton et al., 1976
; Rolls et al., 1989
; Critchley and Rolls, 1996
). The present results indicate that the delay activity of reward expectancy-related OFC neurons is also nature-of-reward specific, because motivation-dependent modification of neuronal activity was observed only in a certain reward (raisin) but not in other reward trials (Fig. 4
). The fact also indicates that delay activity of OFC neurons is related not to the appearance of reward such as the shape or color, which does not vary, but to the degree of preference of the animal for each reward, which does vary during the task performance. It seems that the raisin reward, which had previously been estimated to be, to some extent, preferred by the animal, became non-preferred after having been ingested in a sufficient amount, and this process was reflected in the activity of OFC neurons. In other words, activity of OFC neurons seems to reflect the degree of the animal's preference for a certain reward determined by the animal's motivational state. Similarly, after obtaining a good amount of liquid, water may become as non-preferable as no-reward. Thus, an OFC neuron such as that shown in Figure 3c
, which showed activations on both water and no-reward trials, may be related to discriminating between two kinds (preferred and non-preferred) of outcomes.
In the present experiment, the method of reward delivery (fixed order of delivery of three different kinds of rewards and no reward within a block of four trials) allowed us to examine the order-related expectancy process in the OFC. Although the presence or absence of reward was indicated by the color cue, the red cue itself did not explicitly inform the animal of what specific reward would be delivered in a given trial in Cued food and liquid reward tasks. The animal could only deduce what reward would be delivered in a given trial from the fixed order of delivery of different kinds of rewards within a block. Thus, delay neurons showing nature-of-reward-specific activity changes (e.g. Fig. 3b
) are considered to be involved in this deduction process as well as being involved in the expectancy of the specific reward. The animal could deduce the kind of reward in each trial either (1) from the relational order among sequence components (raisin always comes after potato and cabbage always comes after raisin) or (2) from the numerical order within a block in relation to whether the current trial was the first, second or third. By changing the serial position of each of three kinds of rewards within a block, it was possible to examine which strategy the animal was employing. It was found that the animal was using the former strategy, since nature-of-reward-specific anticipatory activity was found to be determined by the reward that had been delivered in the previous trial, but not by the ordinal position of a certain reward within a block.
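The logic of the probe test can be sketched by listing what each strategy predicts when the order within a block is changed. The probe order used below is a hypothetical example, since the orders actually used are not stated here, and the function names are illustrative. The sketch also assumes that the probe block is preceded by a no-reward trial, as it would be after a normal block.

```python
ORIGINAL_ORDER = ["potato", "raisin", "cabbage", None]  # None marks the no-reward trial


def relational_prediction(probe_order, original=ORIGINAL_ORDER):
    """Expected outcome per trial if the animal predicts from the previous trial's outcome."""
    # Map each outcome to the outcome that followed it in the trained sequence.
    successor = {
        original[i]: original[(i + 1) % len(original)] for i in range(len(original))
    }
    predictions = []
    previous = original[-1]  # assumption: the preceding block ended with no reward
    for outcome in probe_order:
        predictions.append(successor[previous])
        previous = outcome
    return predictions


def ordinal_prediction(probe_order, original=ORIGINAL_ORDER):
    """Expected outcome per trial if the animal predicts from the position within the block."""
    return list(original[: len(probe_order)])


probe = ["raisin", "potato", "cabbage", None]  # hypothetical reordered block
print("relational:", relational_prediction(probe))
print("ordinal:   ", ordinal_prediction(probe))
```

With this probe order the two strategies disagree on the second and third trials, so a neuron with anticipatory activity specific to one reward would be active on different trials under each strategy. That difference is what allows the recordings to separate the two accounts.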
Reward Expectancy-related Neuronal Activity in the LPFC
To compare characteristics of delay activity of OFC and LPFC neurons in relation to the response outcome, we introduce here reward expectancy-related neuronal activities of the LPFC which were reported previously (Watanabe, 1996
Of 124 LPFC neurons that showed delay-related activity changes, 42 were intensively examined using several kinds of rewards. Half of these neurons (n = 21) showed different activity changes with different rewards. Figure 7b
There were many LPFC delay neurons that showed spatial specificity. It is of interest whether those spatially differential delay neurons also showed reward dependency. The neuron in Figure 7c
showed a higher rate of firing on left trials than on right trials during the delay period irrespective of the nature of the reward. Besides that, this neuron showed reward-dependent delay activity, showing the largest activity changes in cabbage reward trials, intermediate activity changes in potato reward trials and the least activity changes in raisin reward trials. The proportion of neurons showing reward-dependent activity was about the same in delay neurons with spatial specificity and those without spatial specificity.
RPR was also calculated for LPFC neurons. There was no significant difference in RPR values among three different kinds of delayed response tasks. The range was 0.22-0.96 and the mean was 0.63 when the data for all tasks were combined. Furthermore, there was no significant difference in the values or distributions of RPR between OFC and LPFC neurons.
The majority of neurons (17/21) examined in all three different kinds of tasks showed different patterns or magnitudes of delay-related activity changes between food- and liquid-reward tasks and/or between visible and cued task situations. The majority of reward-dependent delay neurons showed more activity changes in response to the preferred than to the nonpreferred reward, indicating that their activities also reflect the animal's preference for each kind of reward.
Although the foods or liquids themselves were not presented during the cue period in Cued food and liquid reward tasks, the animal learned what reward was being used by experiencing a newly given reward for two or three trials, since the same reward was used in a block of ~50 trials. It is thought that the animal deduced information about the currently used reward from its experience in previous trials and expected that specific reward. Reward-dependent delay activity of LPFC neurons is thus considered to reflect the expectancy of visual, gustatory and/or olfactory images of the specific food as well as its motivational value. The mean RPR value of 0.63 indicates that the discriminability of LPFC neurons among different kinds of rewards was not very sharp, either. Many LPFC neurons showed differences in the characteristics of activity changes among different kinds of tasks, indicating that expectancy of a specific reward may be attained by the ensemble of activities of reward-specific and task-dependent differential delay neurons.
In the LPFC, there were many delay neurons that showed both reward dependency and spatial specificity. These neurons are considered to be involved in two different kinds of information processing: one retaining spatial information in working memory and the other related to retrieving and expecting the specific reward.
Comparison of the Characteristics of Delay-related Activity of Orbital and Lateral Prefrontal Neurons
OFC and LPFC neurons were found to show differential delay activity depending on whether there would be reward or no-reward and/or on what kind of reward would be delivered. These neurons are considered to be involved in the expectancy of response outcome. Since both OFC and LPFC, especially OFC, play important roles in motivation and emotion (Stuss and Benson, 1986
The animal was not explicitly required to retain in memory the reward information during the delay period in either the delayed reaction time or delayed response tasks. Thus, the results of our experiments indicate that the OFC and LPFC are involved in representing incidental reward information, which is not indispensable for the correct task performance. What, then, is the functional significance of such reward-related anticipatory neuronal activity observed in the OFC and LPFC when such activity is not indispensable for the correct task performance? It is considered that the animal behaves to attain goals such as obtaining food and mating partners or escaping from danger. Expectancy-related neuronal activity in the OFC and LPFC may be useful for guiding the animal to pay attention to the most relevant dimensions in the (task) situation so that the goal would be attained more effectively. Indeed, focal attention induces selective representation of only relevant information in LPFC neurons during a working memory task (Rainer et al., 1998
). The fact that the majority of OFC and LPFC neurons showed greater activation when expecting the preferred rather than the nonpreferred reward may indicate that the animal is guided to pay more attention to task situations involving the more preferred reward.
Such neuronal activity may also be useful for processing the response outcome more efficiently. As far as there is no discrepancy between the animal's expectancy and the response outcome, even if the outcome is the absence of reward, the outcome is not surprising to the animal and thus would be processed automatically without receiving much attention. However, when there is a discrepancy between the two, the outcome is surprising and should receive more attention for further processing. Indeed, surprise is considered to be important for learning to occur (Rescorla and Wagner, 1972
). Without expectancy-related neurons, the animal may become relatively indifferent to the outcome, and thus may not efficiently process the outcome even when required to do so. This may in turn induce disturbance in the learning of new behavior according to the change in reinforcement contingency. The deficit in reversal learning and extinction of learned operant responses which is observed in OFC-ablated monkeys (Butter, 1969
) may be caused by such disturbance. By facilitating the current goal-directed behavior and by constituting the basis of learning, expectancy-related neuronal activity, even if it is not directly associated with correctly performing a task in each trial, is considered to have survival value to the animal.
Although OFC neurons were more sensitive to the presence or absence of reward rather than to the nature-of-reward, it was also shown that OFC neurons were not simply involved in discriminating the presence or absence of reward. Some OFC neurons showed clear differential delay activity in relation to the expectancy of different kinds of rewards. Even those OFC delay neurons that apparently represented only the presence or absence of reward were found to be nature-of-reward sensitive as well because their activities in response to a particular reward were modified by changes in the animal's motivational state (e.g. Fig. 4
). It seems that the activity of expectancy-related OFC neurons is dependent on the attraction of the reward in relation to how preferable it is to the animal. It appears that the majority of OFC neurons discriminate simply between two (preferred and non-preferred) situations and only some neurons discriminate among individual rewards with different degrees of preference.
A recent study on the monkey indicates that OFC neurons are not related to coding and retaining spatial and object information (Tremblay and Schultz, 1999
). Even though the human OFC is indicated to be involved in cognitive operations such as decision making (Bechara et al., 1998
), such cognitive operations cannot be achieved without support of motivational operations within the OFC, as the somatic marker hypothesis indicates (Damasio, 1994
). Considering that the activity of OFC neurons was found to be dependent on the motivational state of the animal, OFC neurons may be more related to the expectancy of hedonistic aspects of reward such as the degree of pleasure or aversion associated with delivery or no-delivery of a certain reward. Interestingly, a recent study by Tremblay and Schultz indicated that OFC neurons reflect the relative, but not the absolute, preference of each reward to the animal (Tremblay and Schultz, 1999
).
Although LPFC neurons are well documented to be involved in cognitive operations such as retaining working memory (Goldman-Rakic, 1996; Miller et al., 1996), we found LPFC delay neurons that appear to be involved in both cognitive and motivational operations, since activity changes occurred in relation to both working memory and reward expectancy (Watanabe, 1996). Concerning the characteristics of motivational operations in the LPFC, delay neurons of this area were reported to be sensitive to the presence or absence of expected reward in previous studies on delayed reaction time tasks (Watanabe, 1990, 1992). Activities of LPFC delay neurons were also found to reflect the animal's preference for each kind of expected reward (Watanabe, 1996). The value and distribution of the RPR were not different between OFC and LPFC neurons. Thus, there appears to be no significant difference in the characteristics of reward expectancy-related neuronal activity between the OFC and LPFC.
LPFC neurons code the correctness of the response independent of the presence or absence of reward (Watanabe, 1989), or code the discrepancy between the expectancy of a specific reward and the response outcome (Watanabe, 1996). The LPFC is proposed to be involved in corollary discharge or efference copy (sending neural impulses to sensory structures that somehow prepare those structures for anticipated changes in sensory input as the result of an impending movement) (Teuber, 1964). Thus, the anticipatory neuronal activity observed in the LPFC, besides reflecting the preference of the expected outcome, may be concerned with the cognitive aspects of reward expectancy such as anticipating visual, tactile or olfactory images of the response outcome, preparing for the reception of a certain reward and not any other reward, or preparing for the no-reward outcome.
LPFC neurons have been proposed to have domain-specific properties, with 'what' and 'where' aspects of the stimulus being retained in working memory in different LPFC areas (Wilson et al., 1993). However, it has recently been shown that delay neurons related to 'what' and those related to 'where' are not differently distributed within the LPFC, and that many LPFC neurons are involved in the integration of 'what' and 'where' (Rao et al., 1997). We have observed that LPFC neurons are involved in two different kinds of information processing: retaining spatial ('where') information in working memory and the expectancy of a specific reward. If we consider that reward-dependent anticipatory activity is concerned with the 'what' aspects of the stimulus, our results also indicate that neurons related to 'what' and those related to 'where' are intermingled in the LPFC, and that some LPFC delay neurons are involved in the integration of 'what' and 'where' aspects of the stimulus.
The OFC plays important roles in motivational operations through its intimate connections with the amygdala, which is itself important in motivation and emotion (Rolls, 1999). However, connections between the LPFC and amygdala are sparse (Barbas, 1995). Thus, the motivational operation concerning reward expectancy may be conducted first in the OFC and then the information may be transmitted to the LPFC, where integration of motivational and cognitive operations would be achieved. In conclusion, although further studies are needed to examine what aspects of reward (visual appearance, taste, smell or degree of preference) are represented in prefrontal neurons, it is suggested that the OFC is more concerned with the motivational aspects, while the LPFC is related to both the cognitive and motivational aspects, of the expectancy of the response outcome (Watanabe, 1998).
| Notes |
|---|
We express our thanks to the anonymous referees for providing critical comments that guided improvement of the manuscript, and to M. Sakagami, S. Shirakawa, M. Odagiri, T. Kojima, K. Tsutui and H. Takenaka for their assistance during the experiment. This study was supported by a Grant-in-Aid for Scientific Research on Priority Areas from the Ministry of Education, Science, Sports and Culture of Japan (nos 08279248, 09268242, 10164250, 11145244).
Address correspondence to Masataka Watanabe, Ph.D., Department of Psychology, Tokyo Metropolitan Institute for Neuroscience, Musashidai 2-6, Fuchu, Tokyo 183-0042, Japan. Email: masataka{at}tmin.ac.jp.
| References |
|---|
Barbas H (1995) Anatomic basis of cognitive-emotional interactions in the primate prefrontal cortex. Neurosci Biobehav Rev 19:499–510.
Baylis LL, Gaffan D (1991) Amygdalectomy and ventromedial prefrontal ablation produce similar deficits in food choice and in simple object discrimination learning for an unseen reward. Exp Brain Res 86:617–622.
Bechara A, Damasio H, Tranel D, Anderson SW (1998) Dissociation of working memory from decision making within the human prefrontal cortex. J Neurosci 18:428–437.
Burton MJ, Rolls ET, Mora F (1976) Effects of hunger on the responses of neurons in the lateral hypothalamus to the sight and taste of food. Exp Neurol 51:668–677.
Butter CM (1969) Perseveration in extinction and in discrimination reversal tasks following selective frontal ablations in Macaca mulatta. Physiol Behav 4:163–171.
Courtney SM, Petit L, Haxby JV, Ungerleider LG (1998) The role of prefrontal cortex in working memory: examining the contents of consciousness. Phil Trans R Soc Lond B 353:1819–1828.
Critchley HD, Rolls ET (1996) Hunger and satiety modify the responses of olfactory and visual neurons in the primate orbitofrontal cortex. J Neurophysiol 75:1673–1686.
Damasio AR (1994) Descartes's error. New York: Grosset/Putnam.
D'Esposito M, Aguirre GK, Zarahn E, Ballard D, Shin RK, Lease J (1998) Functional MRI studies of spatial and nonspatial working memory. Cogn Brain Res 7:1–13.
Funahashi S, Bruce CJ, Goldman-Rakic PS (1989) Mnemonic coding of visual space in the monkey's dorsolateral prefrontal cortex. J Neurophysiol 61:331–349.
Fuster JM (1997) The prefrontal cortex. Anatomy, physiology and neuropsychology of the frontal lobe, 3rd edn. New York: Lippincott-Raven.
Goldman-Rakic PS (1996) The prefrontal landscape: implications of functional architecture for understanding human mentation and the central executive. Phil Trans R Soc Lond B 351:1445–1453.
Hikosaka K (1999) Tolerances of responses to visual patterns in neurons of the posterior inferotemporal cortex in the macaque against changing stimulus size and orientation, and deleting patterns. Behav Brain Res 100:67–76.
Hulse SH, Dorsky NP (1977) Structural complexity as a determinant of serial pattern learning. Learn Motiv 8:488–506.
McEnaney KW, Butter CM (1969) Perseveration of responding and nonresponding in monkeys with orbital frontal ablations. J Comp Physiol Psychol 68:558–561.
Miller EK, Erickson CA, Desimone R (1996) Neural mechanisms of visual working memory in prefrontal cortex of the macaque. J Neurosci 16:5154–5167.
Niki H (1974) Differential activity of prefrontal units during right and left delayed response trials. Brain Res 70:346–349.
Niki H, Watanabe M (1979) Prefrontal and cingulate unit activity during timing behavior in the monkey. Brain Res 171:213–224.
Petrides M (1994) Frontal lobes and working memory: evidence from investigations of the effects of cortical excisions in nonhuman primates. In: Handbook of neuropsychology, Vol 9: The frontal lobes (Boller F, Grafman J, eds), pp. 59–82. Amsterdam: Elsevier.
Rainer G, Asaad WF, Miller EK (1998) Selective representation of relevant information by neurons in the primate prefrontal cortex. Nature 393:577–579.
Rao SC, Rainer G, Miller EK (1997) Integration of what and where in the primate prefrontal cortex. Science 276:821–824.
Rescorla RA, Wagner AR (1972) A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In: Classical conditioning II: Current research and theory (Black AH, Prokasy WF, eds), pp. 64–99. New York: Appleton-Century-Crofts.
Rolls ET (1999) The brain and emotion. Oxford: Oxford University Press.
Rolls ET, Sienkiewicz ZJ, Yaxley S (1989) Hunger modulates the responses to gustatory stimuli of single neurons in the caudolateral orbitofrontal cortex of the macaque monkey. Eur J Neurosci 1:53–60.
Rosenkilde CE, Bauer RH, Fuster JM (1981) Single cell activity in ventral prefrontal cortex of behaving monkeys. Brain Res 209:375–394.
Schoenbaum G, Chiba AA, Gallagher M (1998) Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning. Nature Neurosci 1:155–159.
Stuss DT, Benson DF (1986) The frontal lobes. New York: Raven Press.
Suzuki H, Azuma M (1976) A glass-insulated Elgiloy microelectrode for recording unit activity in chronic monkey experiments. Electroencephalogr Clin Neurophysiol 41:93–95.
Teuber H-L (1964) The riddle of frontal lobe function in man. In: The frontal granular cortex and behavior (Warren JM, Akert K, eds), pp. 410–477. New York: McGraw-Hill.
Thorpe SJ, Rolls ET, Maddison S (1983) The orbitofrontal cortex: neuronal activity in the behaving monkey. Exp Brain Res 49:93–115.
Tinklepaugh OL (1928) An experimental study of representation factors in monkeys. J Comp Psychol 8:197–236.
Tremblay L, Schultz W (1999) Relative reward preference coded in primate orbitofrontal cortex. Nature 398:704–708.
Watanabe M (1989) The appropriateness of behavioral responses coded in post-trial activity of primate prefrontal units. Neurosci Lett 101:113–117.
Watanabe M (1990) Prefrontal unit activity during associative learning in the monkey. Exp Brain Res 80:296–309.
Watanabe M (1992) Frontal units of the monkey coding the associative significance of visual and auditory stimuli. Exp Brain Res 89:233–247.
Watanabe M (1996) Reward expectancy in primate prefrontal neurons. Nature 382:629–632.
Watanabe M (1998) Cognitive and motivational operations in primate prefrontal neurons. Rev Neurosci 9:225–241.
Wilson FAW, Ó Scalaidhe SP, Goldman-Rakic PS (1993) Dissociation of object and spatial processing domains in primate prefrontal cortex. Science 260:1955–1958.