Cerebral Cortex Advance Access originally published online on August 31, 2005
Cerebral Cortex 2006 16(6):761-778; doi:10.1093/cercor/bhj021
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuning Curve Shift by Attention Modulation in Cortical Neurons: a Computational Study of its Mechanisms
1 Instituto de Neurociencias de Alicante, Universidad Miguel Hernández Consejo Superior de Investigaciones Científicas, 03550 Sant Joan d'Alacant, Spain and 2 Volen Center for Complex Systems, Brandeis University, Waltham, MA 02454, USA
Address correspondence to Albert Compte, Instituto de Neurociencias de Alicante, Universidad Miguel Hernández Consejo Superior de Investigaciones Científicas, 03550 Sant Joan d'Alacant, Spain. Email: acompte{at}umh.es.
| Abstract |
|---|
|
|
|---|
Physiological studies of visual attention have demonstrated that focusing attention near a visual cortical neuron's receptive field (RF) results in enhanced evoked activity and RF shift. In this work, we explored the mechanisms of attention induced RF shifts in cortical network models that receive an attentional spotlight. Our main results are threefold. First, whereas a spotlight input always produces toward-attention shift of the population activity profile, we found that toward-attention shifts in RFs of single cells requires multiplicative gain modulation. Secondly, in a feedforward two-layer model, focal attentional gain modulation in first-layer neurons induces RF shift in second-layer neurons downstream. In contrast to experimental observations, the feedforward model typically fails to produce RF shifts in second-layer neurons when attention is directed beyond RF boundaries. We then show that an additive spotlight input combined with a recurrent network mechanism can produce the observed RF shift. Inhibitory effects in a surround of the attentional focus accentuate this RF shift and induce RF shrinking. Thirdly, we considered interrelationship between visual selective attention and adaptation. Our analysis predicts that the RF size is enlarged (respectively reduced) by attentional signal directed near a cell's RF center in a recurrent network (resp. in a feedforward network); the opposite is true for visual adaptation. Therefore, a refined estimation of the RF size during attention and after adaptation would provide a probe to differentiate recurrent versus feedforward mechanisms for RF shifts.
Key Words: computational model feedforward network receptive field recurrent network selective attention sensory adaptation spotlight
| Introduction |
|---|
|
|
|---|
Attention is a mechanism by which the brain gates the access of sensory stimuli to its limited processing resources (Treisman and Gelade, 1980
Connor et al. (1996
, 1997
) looked more closely at the spatial properties of the attentional modulation of neuronal receptive fields in area V4. They found a response gradient surrounding the attended target, as if nearby receptive fields shifted towards the attentional focus (shift effect). They also found that neuronal responses were differentially scaled as attention was directed in opposite directions around the receptive field (directionality effect). RF shift may seem to be an obvious consequence of spatial attention being mediated by a spotlight bias input to the recorded cortical area. Here we show that, in fact, this is not the case. A previous attempt to explain the shift effect (McAdams and Maunsell, 1999
; Maunsell and McAdams, 2001
) made use of the layered feedforward architecture of the visual processing pathway. This feedforward scenario posits multiplicative scaling by attention of neurons in an early visual area (V1, V2), which project to a secondary area (V4, MT) and induce RF shift in neurons therein. This feedforward model was proposed as a scheme or word-model, but it has not been tested quantitatively. The purpose of this paper is to study a mathematical implementation of this feedforward model and to consider an alternative scenario within a single recurrent circuit receiving a focal additive excitatory input (a plausible physiological substrate for the attentional spotlight). This study focuses on attention induced RF shifts, and will not deal with the directionality effect observed also by Connor et al. (1996
, 1997
).
It is known from psychological studies that selective attention interacts with adaptation mechanisms in the visual system (Zucker, 1990
; Chaudhuri, 1990
; Lankheet and Verstraten, 1995
; Alais and Blake, 1999
). Recent physiological work explored the effects of attention on stimuli of varying contrasts, and it was found that neuronal sensitivity to stimulus contrast is affected by attention in an inverse manner to adaptation (Reynolds et al., 2000
; Martinez-Trujillo and Treue, 2002
).
Suggestively, experiments (Müller et al., 1999
; Dragoi et al., 2000
; Yao and Dan, 2001
; Fu et al., 2002
; Felsen et al., 2002
; Kohn and Movshon, 2004
) have shown that visual adaptation protocols induced tuning curve partial shifts along a variety of stimulus feature dimensions (orientation and motion direction). These observations motivated us to study both the RF shift properties for attention and adaptation in a model network. To this end, we shall assume that adaptation occurs because of the reduction in excitability of neurons, which can be implemented as a substractive current into the neurons (Carandini and Ferster, 1997
; Sanchez-Vives and McCormick, 2000
; Wang et al., 2003
), or a negative additive spotlight. We focus our modeling on the shifts in receptive field mapping of neurons in V4 (Connor et al., 1996
, 1997
). However, attention may be focused on a stimulus feature (such as orientation or velocity), rather than location-based (Treue and Martínez Trujillo, 1999
). Even though different attentional mechanisms may be involved along different stimulus dimensions (Lee et al., 1997
; Corbetta and Shulman, 2002
), our results should apply equally well to feature-based selective attention.
| Materials and Methods |
|---|
|
|
|---|
We used a computational approach to explore mechanistically the involvement of the local and feedforward circuitry in the shift of the RF tuning of extrastriate neurons induced by attention (Connor et al., 1996
The recurrent model is schematically represented in Figure 2A and presented in the Results section. It obeys the self-consistent equation
![]() |
if |x xS| < l, and IS(x) = 0 otherwise, xS being the position where the stimulus is presented, and l the spatial spread of the feedforward afferent projections. The attentional additive bias is
if |x xA| < l, and IA(x) = 0 otherwise, xA being the location of the attentional focus. A0 is either zero or negative. If A0 is negative, IA(x) provides an inhibitory input to neurons with receptive fields peripheral to the attentional focus (assuming
'A >
A). Finally, the recurrent input into cell xi is
where J(xixj) is the strength of the connection between the postsynaptic neuron at xi and its presynaptic partner at xj:
if |xi xj| < l, and J (xi xj) = 0 otherwise. In all simulations shown here N = 512, T = 1, L = 4l, l = 3.14, and
S =
J = 1.31,
A = 0.35,
'A = 0.87, unless otherwise indicated. With this parameter choice, the unattended RF radius (half width at half height) measures 0.81, and it is larger than the attentional focus size by
50%. The rest of parameters are typically illustrated in two different conditions: strong excitatory recurrence (Fig. 5B) and strong inhibitory recurrence (Fig. 5A). For strong excitatory recurrence: S0 = 0.46, S1 = 0.66, A0 = 0, A1 = 0.089, J0 = 2.5, and J1 = 8.5. For strong inhibitory recurrence: S0 = 0.34, S1 = 1.09, A0 = 0, A1 = 0.28, J0 = 11.9, and J1 = 15.3.
|
|
The feedforward model is represented in Figure 2B and discussed in Results. It contains two layers of neurons and their steady-state activations are described by the equations
![]() |
S =
A = 0.21,
J = 0.71, S0 = 0, S1 = 0.42, J0 = 0, J1 = 6.38, A0 = 0 and A1 = 0.5. With this choice of parameters tuning curves in the second layer have approximately the same tuning width than receptive fields in the recurrent model, and are
3.5 times larger than first-layer RFs. Notice also that by choosing T = 0 and S0 = 0, neurons in the first layer never use the rectification mechanism in their input-output relationships. We choose this particular case because one can then substitute [I]+ = I and this allows for precise analytical calculations (shown in the Appendix). We prove, however, that our main points regarding this model do not depend on this particular choice (see Fig. 4C). When we simulate an attentional signal with inhibitory surround effect, we use
'A = 0.52, A0 = 0.48 and A1 = 1.5.
|
For each of these models, and each parameter set explored, we found the network activity pattern in response to a single stimulus (centered at xS), and the spatial tuning curve of a given single neuron (with RF centered at x) in response to different spatial stimuli. This is done as follows. Once the parameters for model connectivity and input stimulation are chosen, we solve self-consistently the network activity equations (Fig. 2). The solution thus obtained, the steady-state response of each neuron in the network to fixed stimulus and attentional signal, is what we call the population activity profile R(x|xS). We then repeat this procedure for all possible locations xS of the stimulus signal, keeping everything else fixed. We thus obtain a family of population activity profiles R(x,xS). If we look at a single neuron (fix x, typically in our graphs x = 0) and plot its responses to different locations of the stimulus signal, we obtain the spatial tuning curve (or RF) of that given neuron at x:R(xS|x). It is this curve that we can compare to results of single unit recordings (Connor et al., 1996
In order to quantify the effects of attention on the receptive field of a neuron, we define two quantities: the receptive field shift and the shrinking factor. Without attention, the neuron at x has a symmetric, bell-shaped receptive field and its maximum response occurs when the stimulus is presented at x. Under attention, the receptive field might shift and/or change size. In order to assess the shift of the receptive field we typically use a measure based on the location of the maximum firing in the receptive field: If in the presence of attention at location xA (xA > x), the maximum response occurs when the stimulus peaks at xM
x, the receptive field shift is defined as xM x, i.e. the distance in cortical space between the positions of the stimuli that elicit a maximum response when attention is present and when it is absent. When this quantity is positive (negative), the shift is towards (away from) attention. In a few cases (Fig. 4C) we also tried systematically another measure of RF shift based on a Gaussian fit to ensure that our conclusions are not dependent on the particular measure of shift used. Specifically, we fitted a Gaussian function (least-squares fit) to the RF points, but only for those firing rates that exceeded one-half of the maximum rate in the unattended tuning curve for the corresponding set of parameters. The center xM of the fitted Gaussian was taken as the RF center and, thus, this measure of shift was defined as xM x. The shrinking factor is defined as the width at half height of the attended RF divided by the width at half height of the unattended RF. A shrinking factor smaller (larger) than unity indicates that the spatial tuning curve (or RF) shrinks (expands) under attention.
In all population activity profiles and spatial tuning curves shown here, only the central half of the network is plotted to avoid showing effects due to the free boundary conditions. In order to check for the robustness of the effects discussed in the recurrent network model, we carried out parameter sweeps around the values of the parameters reported. We found that any parameter could be changed by ±10%, and the qualitative results of the recurrent model in the paper would still hold. In particular, the two modes of operation illustrated in Figure 5 are robust in their qualitative features (direction of shift in neuronal RFs relative to shift in population activity profile) to a change of ± 10% in their parameters. In addition, we checked whether the shape of the input-output relationship could be critical in generating the towards-attention RF shift: we tried with the inputoutput function
and we could still see the toward-attention shift effect for a set of parameter values very close to that reported in Figure 5B. With respect to the feedforward model simulations and calculations (see Appendix), we always used Gaussian functions for input profiles and distance-dependent network connectivity, because they are usually seen to approximate well experimental data. However, we have checked that our results are not critically dependent on this choice: we have repeated our calculations and simulations when all curves are given by Cauchy distributions (with much longer tails than Gaussians: 1/(
2 + x2)) and our conclusions still hold, even quantitatively (not shown).
In addition, we have also checked that our conclusion regarding the limited range of RF shifts holds irrespective of the firing threshold used. To this end we checked the full range of threshold values that still evoke some sensory response in the first-layer network (see Fig. 4C). Furthermore, for the purpose of generality we have studied how dependent our conclusions are on the form of f(I) in the calculations of the Appendix: our analysis (data not shown) demonstrated that our conclusions hold qualitatively as long as cellular input-output relationships are monotonously increasing functions of the input, and under the condition that cells operate far from their strongly saturated output regime. Both of these requirements are biologically plausible and generally considered true for typical cortical neurons.
All these robustness checks point at the fact that the results reported do not require fine-tuning of parameters.
| Results |
|---|
|
|
|---|
Our simulations were primarily motivated by the shift effect observed by Connor et al. (1996
Spotlight Bias and Shift Effect: A Heuristic Argument
In order to clarify the challenge in explaining the shift effect (Connor et al., 1996
, 1997
) within a local network model where attention acts via a spotlight bias input, we consider the following heuristic constructions. First, let us assume that neurons within the network do not interact with each other, their only inputs are bottom-up and top-down signals from outside the network, and their only function is to add them and transduce them into their output firing rate. We further assume that the bottom-up sensory signal and the top-down attentional signal are independent from each other, so that they carry purely stimulus and attention information, respectively. We can then plot the population activity profile for given stimuli (Fig. 1A, upper panel) and the spatial tuning curve of a single neuron for all stimuli (Fig. 1A, lower panel). In this oversimplified scenario, the attentional spotlight obviously shifts the population activity profile towards the attentional input location (Fig. 1A, upper panel). However, the receptive field of a single neuron does not show any kind of shift (Fig. 1A, lower panel). This is intuitively easy to understand: with a fixed attentional signal, when one records from a given neuron while the stimulus is varied, the attentional signal is just an additive constant on the sensory input and it does not change where the maximum of the curve occurs. This example illustrates the general point that shifts in population activity profiles usually do not carry over to spatial tuning curves (RFs of single neurons). Interactions between neurons in the network and/or between sensory and attentional signals are necessary ingredients for the shift effect reported by Connor et al. (1996
, 1997
) to occur in the RFs of cortical neurons.
|
In the following argument we dissociate two different effects in population activity profiles that might be induced by attention: pure shift and pure multiplicative scaling. In realistic scenarios (as we will consider later on) both of these effects are likely to participate in attentional modulation of the network activity. However, we can heuristically dissociate these two effects first and then explore their interrelationship in a more realistic setting. To this end, we first consider a situation in which recurrent connections and/or stimulus-attention interactions are such that the only action of an attentional top-down signal on the population activity profile is to shift the unattended population activity profile by a fixed relative amount toward attention. If we plot population activity profiles and the spatial tuning curve for this case (Fig. 1B) we obtain that a pure shift in the population activity profile induces a shift of the spatial tuning curve in the opposite direction, i.e. away from attention. In this case, the neuron shows the maximum response when it is located at the peak of the population activity profile. This happens when the stimulus is more to the left and further away from the attentional focus than in the unattended case, and the attentional signal shifts the population activity profile to the right so that it peaks at the recorded neuron. This example provides a more striking illustration of how population activity profiles and spatial tuning curves can differ largely in their qualitative properties. The spotlight bias does explain the shift in population activity profile, but it does not account by itself for the receptive field shifts observed by Connor et al. (1996
Finally, we imagine a different scenario, where shifts are completely absent in the population activity profile and the only effect of attention is a multiplicative scaling of the profile, the modulation factor is larger when the stimulus and attentional signals are closer to each other. For now we do not need to specify the mechanisms for such a multiplicative modulation. The population activity profiles and the spatial tuning curve (Fig. 1C) can be plotted for this situation, and we see that spatial tuning curves now shift in the right direction (towards the attentional focus), even though the population activity profiles remain centered around the stimulus location. This observation suggests a possible role of multiplicative scaling in bringing about receptive field shifts.
To summarize, our heuristic discussions show that: (i) qualitative properties of spatial tuning curve (the experimental observable neuronal receptive fields) do not necessarily carry over to population activity profiles (which are the behaviorally relevant counterpart); (ii) either recurrent connectivity within the local network or extrinsic interactions between stimulus and attention signals to the local network, or both, are needed to account for the receptive field shifts of Connor et al. (1996)
; and (iii) under appropriate network interactions, the network activity can shift towards attention and/or scale multiplicatively as a result of attention, resulting in opposite effects on the direction of the neuronal RF shift. Similar arguments have been presented to link perceptual effects like the tilt effect with the contextual modifications of tuning curves of V1 neurons (Gilbert and Wiesel, 1990
). We now turn to mechanistic conditions under which RF shifts might be induced by attention in cortical networks.
Recurrent versus Feedforward Cortical Architectures for RF Shifts
There are arguably many different ways in which receptive field modulations can be generated in biologically plausible neural circuits. It is known that both recurrent circuitry, feedforward connectivity and feedback connectivity are fundamental elements of cortical information processing (Douglas and Martin, 2004
), and the observed attention effect is possibly produced by a combination of these various mechanisms. However, it remains an open question as to which of these may be responsible for the RF shift effect observed in a given cortical area. To shed light on this important issue, we studied two general and contrasting scenarios from the point of view of the connectivity: in the first case the neural circuit under consideration is endowed with dense local horizontal connectivity (a recurrent network), whereas in the second case the shift is primarily induced in the course of activity propagation along multiple layers of a feedforward network. Specifically we will compare the two models depicted in Figure 2. In both models, neurons are positioned in their network according to the center of their receptive field on a line of length L.
In the recurrent model (Fig. 2A), only one layer of neurons is simulated, which receive additive external inputs from both the stimulus (IS(x)) and the attentional control system (IA(x)) and recurrent inputs dependent on the activity of neighboring neurons [according to a connectivity profile J(xi xj) that typically incorporates an inhibitory surround, in what is known as a Mexican Hat connectivity (Amari, 1977
; Salinas and Abbott, 1996
; Kang et al., 2003
)]. Mathematically, the firing rate of a cell with the RF center at xi, R(xi), obeys the equation displayed in Figure 2A.
The sensory input IS(x) and the attentional input IA(x) are given by truncated Gaussian functions (see Materials and Methods for details). Depending on the parameter values, IA(x) can include an inhibitory input to neurons with receptive fields peripheral to the attentional focus. Finally, the recurrent input that a given cell at xi feels is a sum over all neurons in the network,
where J(xi xj) is the strength of the connection between the postsynaptic neuron at xi and its presynaptic partner at xj. This Gaussian-shaped connectivity J(xi xj) provides cooperative excitatory interactions between neurons nearby in cortical space (and, therefore, with overlapping receptive fields) and possibly inhibitory coupling between neurons with non-overlapping receptive fields.
The feedforward model is a formal implementation of a mechanism described by McAdams and Maunsell (1999)
. Its schematic representation and its defining equations are displayed in Figure 2B. We simulate two layers of neurons, one corresponding to an upstream area (V1 or V2) and the other one to a downstream area (V4). We will refer henceforth to the network representing the upstream area as first-layer network and to the one representing the downstream area as second-layer network. In accordance with the known properties of the early visual pathway, first-layer neurons (upstream area) have smaller receptive fields than second-layer neurons (downstream area). This is accomplished by having the sensory input IS(y) impinge on the first layer as a narrow Gaussian, and the activity elicited in those neurons is then propagated to the second layer via a fan-out feedforward connectivity profile J(x y) that generates much wider receptive fields. Attention enters the feedforward model as a factor fA(y) in the slope of the input-output relationship of first-layer neurons, and it typically affects neurons in a region commensurate with the size of the receptive field in that area, as suggested by McAdams and Maunsell (1999)
.
The attentional factor is given by fA(y) = 1 + IA(y), so that attention acts in the first-layer neurons by controlling their response gain in a location-specific manner: positive attentional modulations IA(y) generate a steepening of their input-output relationship, whereas negative attentional modulations (typically in a surround of the attentional focus) will result in a reduced neuronal response gain. We use consistent nomenclature with the recurrent network above so that IS(y), IA(y) and J(x y) are given by the truncated Gaussian functions explicited in Materials and Methods.
Parameters are chosen so that tuning curves in the second layer have approximately the same tuning width than receptive fields in the recurrent model (see Materials and Methods).
The Feedforward Model for RF Shifts
We first consider the feedforward model of Figure 2B. The feedforward model proposed by McAdams and Maunsell (1999)
emphasizes the fact that the attentional input targets neurons in a cortical area such that the size of the attentional beam (as required by the task) matches the size of the targeted neurons' RFs. Confirming previous proposals (McAdams and Maunsell, 1999
), we found that such a model indeed produces receptive field shifts in the downstream layer that represents area V4. When shifts are produced, the magnitude of the shifts can be quite large, if one assesses this through the maximum of the tuning curve (see Methods). Note that, by definition, maximal RF shift occurs when the tuning curve moves all the way to the location of the attentional focus. In the feedforward model, when RF shifts are observed the maximum of the tuning curve can indeed be moved close to the attention focus in the feedforward model (Fig. 3A, right panel), although in this case the tuning curve is no longer Gaussian but has a bimodal shape. Another interesting feature of this mechanism is the fact that the size of the RF (assessed as half width at half height) is reduced when attention is focused on the neuron's RF center (Fig. 3B), we will come back to this later. Both of this effects are enhanced by, but not dependent on, the presence of an inhibitory surround in the attentional beam. Thus, the feedforward model is an effective architecture for generating RF shifts in areas downstream from the area where responses are multiplicatively scaled. As we now show, however, there are limitations that make this mechanism unable on its own to replicate the data of Connor et al. (1996
, 1997
).
|
We found that, under the assumption that the attentional focus footprint (spatial size) matches the RF size of neurons in the first layer, the feedforward model is able to produce RF shifts in a second-layer neuron only when attention is focused in the vicinity of the neuron's RF center, and not beyond the RF boundary (Fig. 4A). This limit can be shown rigorously by a quantitative mathematical analysis (see the Appendix). We have checked that this result is not dependent on the value of the firing threshold in neurons of the first-layer network, nor on the method used to assess the magnitude of the RF shift (Fig. 4C). Hence, the limited range for RF shifts is an intrinsic feature of the feedforward model, not a mere consequence of choice of parameter values. In contrast, the experiments of Connor et al. (1996
|
Receptive Field Shift in a Recurrent Spotlight Network Model
We have shown that to produce a toward-attention RF shift in the original feedforward network (with the attentional footprint commensurate with first-layer neurons' RF), attention needs to be focused well within the radius of a downstream neuron's RF (in contrast with the results of Connor et al. (1996
, 1997
)). This feedforward mechanism can only replicate the observations in area V4 by Connor et al. (1996
, 1997
) if one assumes that attentional modulation in the upstream area (V1) is strong and has a very broad footprint, of the size of RFs in the downstream area (V4). Although there is no conclusive evidence ruling out a broad attentional beam in V1, attentional modulations in V1 are reportedly very weak at the neuronal level. Therefore, we now turn to analyzing whether mechanisms intrinsic to a local network can generate RF shifts on their own. Our candidate mechanism is reverberatory interactions within the local circuit.
What recurrent circuitry mechanisms can implement the attentional shift effect in the local network? To address this question we considered a model of a cortical module of neurons in area V4 which have different receptive fields but the same stimulus selectivity (orientation, spatial frequency, etc.). The network model consists of recurrently connected firing rate neurons ordered along a line according to the center of their receptive fields. Recurrent connections are strongest between neighboring neurons and the network receives two types of external input: a topographic stimulus signal, and a topographic spotlight attentional bias input which is spatially more focused than the stimulus input (see Fig. 2A and Materials and Methods for details). Interestingly, if recurrent connections are purely inhibitory (light gray and squares in Fig. 5C, left panel), the receptive fields shift away from the attentional focus (Fig. 5A, bottom panel). By contrast, if recurrent excitation is strong (black and circles in Fig. 5C, left panel), the RFs shift towards the attentional focus (Fig. 5B, bottom panel). Under these two different operational regimes, the recurrent connections within the network determine different ways in which the attention signal modulates the population activity profile. Thus, in one case attention induces an important shift towards attention in the population activity profile (Fig. 5A, upper panel), while in the other case it mostly results in a multiplicative scaling of the network activity without significant spatial shift (Fig. 5B, upper panel). Strong recurrent excitatory connections favor the multiplicative effect, hence a receptive field shift towards attention (see Fig. 1). As shown in Figure 5C, a gradual increase in the overall excitatory coupling of the network leads to a transition from away-from-attention shift (data in light gray and squares) to towards-attention-shift (data in black and circles).
Therefore, local recurrent circuitry can account by itself for the shift effect observed by Connors et al. (1996
, 1997
) provided the internal circuitry of extrastriate cortex operates in the regime with sufficiently strong recurrent excitatory connections illustrated in Figure 5B rather than that of Figure 5A.
In Figure 6 we simulate the Connor's experiment using the same parameters as Figure 5B, except with the attentional focus well outside of the neuron's RF. In close similarity with the single unit recording data of Connor et al. (1997)
(compare with their fig. 2), in response to five stimuli presented in the receptive field, the neural activity is shifted towards the attentional signal, and the maximum response is slightly enhanced compared to the unattended case. These results suggest that the observed receptive field shift in V4 neurons can be accounted for if the underlying local circuit is endowed with sufficient recurrent connections to operate in the multiplicative regime of Figure 5B.
|
Shift and Shrinking Effects through Attentional Surround Inhibition
Figure 5 shows that a recurrent network in an appropriate operational regime can generate the receptive field shift towards attention in a spotlight single recurrent network model. However, the receptive field shifts thus obtained are relatively small (see Fig. 5B). So, we asked under what conditions the receptive field shift is maximal. We found that the maximal shift occurs if the attentional signal contains a surround inhibitory component, i.e. when the attentional input excites neurons with receptive field near the attentional locus but it inhibits neurons with receptive fields peripheral to the attentional focus (Fig. 7). Not only does the receptive field shift become larger with an attentional inhibitory surround, but the receptive field also appears to shrink around the position that now elicits the maximal response in the neuron, which is shifted towards the attentional locus (Fig. 7). This shrinking is seen to progressively increase as the attentional inhibitory surround becomes stronger (Fig. 7B). However, the RF shrinking occurs only when attention has an offset with respect to the neuronal preferred location. For attention focused right on top of the neuron's RF, one would see RF enlargement rather than shrinking (see Fig. 9). Notice that this effect does not require attentional inhibitory surround, but is a common feature of a strongly recurrent network, which we argue can be used to distinguish a recurrent from a feedforward architecture (see below).
|
|
In the regime of Fig. 5A where the receptive field shifts away from attention, we observed that the addition of an attentional surround inhibition increases the magnitude of the receptive field shift away from attention. On the other hand, the shrinking of the receptive field under inhibitory surround attentional input occurs also in this network regime (data not shown, but see Fig. 9).
RF Shifts Induced by an Adapting Stimulus
So far, we have focused on explaining the RF shift caused by attention (Connor et al., 1996
, 1997
) by using two alternative computational architectures. Recently, however, there have been reports on shifts of neuronal tuning curves following protocols of visual adaptation in V1 (Dragoi et al., 2000
; Yao and Dan, 2001
; Fu et al., 2002
; Felsen et al., 2002
) and in MT (Kohn and Movshon, 2004
). A link between attention and adaptation is suggested by the fact that adaptation is known to reduce the sensitivity to contrast, and attention has recently been shown to act as an effective increase in contrast (Reynolds et al., 2000
; Martinez-Trujillo and Treue, 2002
). Because of the potential interaction that such a relationship implies, we considered in our model how adaptation might shift the receptive fields of V4 neurons. The effect of a prior, long-lasting presentation of an adapting stimulus to our model neurons can be modeled as a reduction of excitability in a subset of neurons in our network during the course of our receptive field mapping procedure. The underlying, plausible assumption is that the time course of adaptation is much slower than the dynamics of receptive field changes that we are studying. Therefore, if an adapting stimulus is presented at location xA for a long time and it induces visual adaptation, subsequent presentation of a test stimulus at location x will result in a network response to the test stimulus as if the network was receiving a constant hyperpolarizing bias current peaked at location xA. We implement this by injecting a biasing hyperpolarizing input to the network (Carandini and Ferster, 1997
; Sanchez-Vives and McCormick, 2000
; Wang et al., 2003
).
Our level of modeling is not explicit enough so as to identify a synaptic or intrinsic neuronal mechanism for this adaptation: both mechanisms effectively reduce the excitability of neurons and would be modeled in the same way here. A scenario where adaptation occurs upstream from the modeled cortical circuit is also consistent with this implementation. Specifically, in our recurrent network model we use the same mathematical framework presented in Figure 2A but taking
A = 0.71, A0 = 0 and A1 < 0. The broader extent of adapting, relative to attentional, additive input footprints (
A) is justified by the fact that the adaptation bias current is triggered by neuronal activity and therefore is comparable to the receptive field size, whereas the attentional beam can be more focused. For the simulation of adaptation-induced modulations in the feedforward scenario (Fig. 2B), we modeled the adaptation effect at the level of neural responses in the first layer, consistent with the finding of Kohn and Movshon (2004)
: we modify the equation for the first network to read: R(yj) = [IS(xi) + IA(yj) T]+, and we use A0 = 0, A1 = 0.2.
The simulations show that in the two cortical networks that we consider here (Fig. 2), adaptation induces RF shifts in the opposite direction than attention (Fig. 9). We illustrate this explicitly for the recurrent network model (Fig. 2A) in Figure 8: adaptation gives rise to shifts of receptive fields towards or away from the adapted location, depending on whether recurrent connections are predominantly inhibitory or strongly excitatory, respectively (Fig. 8C).
|
Distinguishing the Models by their RF Modulations
We compared the different models quantitatively by examining how the receptive field of a neuron changes shape as an attentional spotlight is progressively moved away from the center of the RF (Fig. 9). We observe that neither of the three models (strong recurrent inhibition, strong recurrent excitation or feedforward) can be distinguished by means of the change in the neuron's maximal rate under attention (or adaptation). As is clear in Figure 9A, in all three cases, the presence of an inhibitory surround around the central focus of attention (dashed lines) is manifested by a firing rate reduction when attention comes to the flanks of the receptive field, and a firing rate increase for attention near the preferred location (as illustrated in Fig. 7B, left panel, when increasing the inhibitory surround, the focal excitation is compensatorily increased as well). Figure 9B shows the RF shifts for the three models. In contrast to the other two models, in a strongly inhibitory recurrent network attention induces away-from-attention shifts and adaptation induces towards-adaptation shifts, albeit small (compare shaded areas). In all three models, inhibitory attentional surround also manifests itself through a reversal of the sense of the receptive field shift when the location of attention focus is larger than a critical value. Note, again, that when the attention focus is beyond the RF boundary, RF shift is significant only in a recurrent network but not in a feedforward network.
Figure 9C displays the width of the receptive field (defined at half height). Interestingly, when attention is directed to the center of the RF (x = 0), both recurrent network models yield a larger RF width with attention than without it (left and middle panels, shaded areas). By contrast, the feedforward model predicts a narrower receptive field in the attended than in the unattended case (right panel, shaded area, see also Fig. 3B). The effect is also evident with adaptation: it narrows RF in recurrent models, whereas the opposite occurs in the feedforward model. Intuitively, one can understand this difference in the light of the different ways in which the attention/adaptation signals interact with the stimulus. In recurrent models, the modulatory signal is added together with the stronger sensory signal. When stimuli are varied in order to compute the tuning curve of a neuron, the modulatory signal received by this neuron is fixed. Because of this extra excitatory input, it is not surprising that the neuron's response to each stimulus is larger, hence the tuning curve (measured at half-height) is wider. The situation is different in the feedforward model. The output of a neuron yi (in the first-layer) subject to an attentional bias input centered on yA in the first layer is R(yj) = fA(xA yj)[IS(yj) T]+, where fA(xA yj) is a Gaussian function of the distance between the attention focus xA and yj. A neuron xi in the second layer receives the weighted input J(xi yj)R(yj). When the attention focus is near the center of its RF, xA
xi. Therefore, we can rewrite J(xi yj)R(yj) as Jeff(xi yj)[IS(yi) T]+, with Jeff(yj xi) = J(xi yj)fA(xiyj). Because the product of two Gaussian functions is still a Gaussian with a narrower width
eff, given by








0: xA/
= 


