Cerebral Cortex Advance Access originally published online on August 14, 2006
Cerebral Cortex 2007 17(6):1260-1273; doi:10.1093/cercor/bhl050
Feature Article
Neural Bases of Stereopsis across Visual Field of the Alert Macaque Monkey
Centre de Recherche Cerveau & Cognition, Centre National de la Recherche Scientifique, Université Paul Sabatier, Faculté de Médecine de Rangueil Toulouse 3, France
Address correspondence to Yves Trotter, Centre de Recherche Cerveau & Cognition (UMR 5549), Faculté de Médecine de Rangueil, 31062 Toulouse Cédex 9, France. Email: Yves.Trotter{at}cerco.ups-tlse.fr.
| Abstract |
Left and right retinal images of an object seen by the 2 eyes can occupy slightly disparate horizontal and/or vertical locations. The role of horizontal disparity (HD) in stereoscopic vision is well established, but the functional contribution of vertical disparity (VD) remains unclear. Various psychophysical studies have shown that HD and VD are used differently by the visual system depending on their location in the visual field, whether near the center of gaze or more peripheral. We show this horizontal/vertical distinction at the cellular level in monkey primary visual cortex (area V1). The range of VD encoding is reduced in central but not in the peripheral representation of the visual field. Moreover, neurons respond selectively to particular combinations of both types of disparities depending on the coded orientation as predicted by the disparity energy model. The preferred orientations of neurons near the fovea present a vertical bias that is well suited for stereopsis based on HD selectivity alone. In the periphery, instead, preferred orientations are radially biased, which allows a peripheral detector to convey the same depth signal based on either HD or VD. Such an organization has functional implications in both the perceptual and oculomotor domains.
Key Words: central and peripheral V1, disparity energy model, extracellular recordings, horizontal and vertical disparities
| Introduction |
Due to the lateral separation between the eyes, an object lying in the binocular field of view can form images in disparate locations in the left and right retinas, along both the horizontal and vertical dimensions. Because the eyes are displaced horizontally, there is an anisotropy between horizontal disparity (HD) and vertical disparity (VD), both in their natural range of occurrence and in the information they carry. This is reflected by the fact that stereo images with disparate horizontal positions create a perception of depth when they are fused (Wheatstone 1838
To understand this functional anisotropy, it is useful to consider the geometry of binocular vision. The horizontal separation between the eyes means that, for a given feature in the left retinal image, its possible matches in the right retina lie along a line (the epipolar line) whose orientation is always close to horizontal (Fig. 1, epipolar lines for a gaze straight ahead, in black, or in tertiary position, dotted lines). For this reason, HD is generally considered the main, or unique, signal for stereoscopic vision. However, VD also appears as retinal eccentricity increases and is mainly expressed as a vertical shift of the epipolar lines that depends on eye position. The pale gray areas in Figure 1 show the range of VD that can be encountered at various retinal locations for a range of gaze azimuth (±25°), elevation (±25°), and fixation distance (from 10 cm to +∞) along epipolar line segments extending ±1° horizontally (see Materials and Methods for details). It is clear that, even with the large range of binocular fixation conditions considered in this simulation, VD is usually very weak close to the fovea (for a special case where VD can be created in the central field by placing vertical occluders in front of oblique lines, see Farell 1998) and increases with increasing retinal eccentricity.
There is psychophysical evidence that the visual system does not strictly follow the epipolar constraint and searches for corresponding features along both the horizontal and vertical dimensions (Stevenson and Schor 1997
From an oculomotor point of view, corrective vergence eye movements (horizontal, vertical, or cyclorotational) are used to keep the eyes accurately aligned and thus constrain positional disparity in a range allowing fusion. Input signals for these vergence eye movements are respectively HD, VD, or particular combinations of both (cyclodisparities) (Howard 2002). Whereas horizontal vergence exhibits a gain that is already maximal for small stimuli (same gain for a central stimulus 0.75° or 65° in diameter), the gain for vertical and cyclovergence increases as the stimulus diameter increases up to 20° (Howard and others 1994, 2000). Altogether these observations lead to the conclusion that HD and VD are exploited differently by the visual system and as a function of the eccentricity at which they appear.
The primary visual area (V1) is the first cortical site where monocular signals emanating from the left and right retinas converge onto single cells. Some of these cells are positional disparity detectors in the sense that they are optimally activated by binocular stimuli that fall in slightly disparate retinal locations due to the existence of slight mismatches between their left and right receptive field (RF) locations (Barlow and others 1967; Nikara and others 1968; Joshua and Bishop 1970) and/or internal structure (Tsao and others 2003). As the first binocular stage, V1 is of prime importance for computing the signal used by both the stereovision and the oculomotor control systems. However, very few studies have tackled the issue of the functional organization of V1 for disparity encoding as it relates to the geometrical and functional considerations described above. In trying to reveal the neural basis of stereoscopic vision, most previous studies have focused on HD encoding in the central visual field representation, and very little is known about VD encoding (Poggio 1995; Trotter 1995; Gonzalez and Perez 1998; Cumming and DeAngelis 2001). Pioneering work performed in the primary visual cortex of anesthetized paralyzed cats revealed the existence of both horizontal and vertical positional disparities but suffered from an inherent imprecision due to the acute preparation, in which the exact vergence state of the animal is not controlled (Barlow and others 1967; Nikara and others 1968; Joshua and Bishop 1970).
In the central visual field representation of V1, of the 3 studies performed in behaving primates, 2 offer indirect evidence (Gonzalez and others 1993, 2003) and one gives direct proof (Cumming 2002) of a specialization for HD processing, with a wider range of encoding along the HD than along the VD dimension. We have recently reported VD selectivity in peripheral V1 (Durand and others 2002) with tuning curve profiles similar to those found for HD at the same retinal eccentricities.
Because these studies suggest a difference in VD processing between the central and peripheral representations, our first objective was to compare the tuning of central and peripheral disparity detectors in the same monkeys and with the same stimuli and experimental conditions. A second objective was to study the potential link that exists between HD/VD encoding and orientation tuning in V1. The disparity energy model (Freeman and Ohzawa 1990) predicts a strong link for V1 disparity detectors between their preferred orientation for contours and their disparity sensitivity, with a higher sensitivity orthogonal to the preferred orientation. Surprisingly, such a relationship has not been reported in central V1 (Cumming 2002; Gonzalez and others 2003). In Cumming's study, a horizontal elongation of the disparity response surfaces (visual responses as a function of HD and VD) is reported for a majority of cells, irrespective of their preferred orientation. This was interpreted by the author as another expression of the HD specialization in V1. Thus, we were also interested to know if, in the peripheral representation where such a specialization is not expected, this cornerstone prediction of the disparity energy model could be verified.
| Materials and Methods |
A detailed description of the general methods has been reported elsewhere (Trotter and Celebrini, 1999; Durand and others 2002
Visual Stimulation
Stereoscopic stimulation was performed using dynamic random dot stereograms (dRDS) generated through ferroelectric stereo glasses (60 frames per second per eye), size 6° × 6°, dot density 20%, and dot size 3.5 min of arc. Generally, HD and VD were varied within the range [−0.6°, +0.6°], with a sampling step of 0.2°. For cells with very fine disparity tuning, the disparity range was set to [−0.3°, +0.3°] with a 0.1° step size. For cells with very coarse disparity tuning, the disparity range was set to [−1.2°, +1.2°] with a 0.4° step. Orientation selectivity was tested using square-wave gratings (circular window 6° in diameter) with a spatial frequency of 2 cycles/degree, in 8 steps of 22.5° covering a 180° range. The different stimuli were flashed binocularly for 500 ms, centered on the RF, and presented 5 times, randomly interleaved. Spike activity was collected from 300 ms before the appearance of the fixation target (spontaneous activity) until 500 ms after the stimulus onset (visual evoked activity).
Simulation of the Natural VD Range
We used simulation to estimate the range of naturally occurring VD as a function of retinal eccentricity in our experimental conditions but also over a broad range of binocular fixation conditions. A Helmholtz coordinate system was used to specify eye position (positive directions clockwise, right, and up). The torsional state of the eyes was calculated using an intermediate between L2 and Listing's law, similar to the one used by Schreiber and others (2001), where the cyclorotational state of the left and right eyes ($C_L$ and $C_R$) is specified by $C_L = H_L V/2 + 0.15\,D V$ and $C_R = H_R V/2 - 0.15\,D V$. $H_L$ and $H_R$ are the horizontal angles of the left and right eyes, $V$ is their common vertical angle, and $D$ is the vergence angle ($H_L - H_R$). We used a planar projection plane situated at one unit distance from the eyes' projection center to calculate the positional disparities arising in the left eye for fixed points on the right retina using the equations given in Garding and others (1995). The only addition to the left-to-right-eye transformation matrices was a rotational term accounting for the relative cyclorotational state of the 2 eyes. For each point on the right retina, positional disparity was calculated for an object situated slightly in front of and slightly behind the horizontal horopter (position of the points in space projecting without HD). The epipolar line was then defined as the line segment passing through their projections on the left eye projection plane and extending ±1° horizontally (which is an approximation of the fusion range for HD). This was repeated over a broad range of fixation distance (vergence angle from 17° to 0°), gaze azimuth (±25°), and elevation (±25°). Monkey interocular distance was set to 3 cm.
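The relation between the fixation distances and vergence angles quoted above can be checked with a short sketch. It assumes symmetric fixation straight ahead and angles expressed in radians for the torsion formulas; the function names and the radian convention are illustrative assumptions, not taken from the original simulation code.

```python
import math

INTEROCULAR_CM = 3.0  # monkey interocular distance stated in the text


def vergence_deg(fixation_distance_cm, interocular_cm=INTEROCULAR_CM):
    """Vergence angle (degrees) for symmetric fixation at a given distance.

    Assumes the fixation point lies straight ahead on the midline.
    """
    half_angle = math.atan(interocular_cm / (2.0 * fixation_distance_cm))
    return math.degrees(2.0 * half_angle)


def torsion(h_left, h_right, v, k=0.15):
    """Cyclorotation of each eye following the formulas quoted in the text.

    Angles are assumed to be in radians (an assumption of this sketch).
    h_left, h_right: horizontal angles of the eyes; v: common vertical angle.
    """
    d = h_left - h_right                      # vergence angle D
    c_left = h_left * v / 2.0 + k * d * v
    c_right = h_right * v / 2.0 - k * d * v
    return c_left, c_right


# A 10 cm fixation distance gives about 17 deg of vergence, and an
# infinitely distant target gives 0 deg, matching the simulated range.
print(round(vergence_deg(10.0), 2), vergence_deg(math.inf))
```

With zero vergence the two torsion values coincide, so the 0.15 term only matters for near fixation combined with vertical gaze.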
Data Analysis
Spatial Analysis
The criterion for disparity (horizontal and vertical) and orientation selectivity was a P value < 0.05 in a 1-way analysis of variance (ANOVA). We computed an ANOVA-based selectivity index to quantify its strength. This index corresponds to the mean intercondition variance of the visual responses divided by the mean inter- plus intracondition variance. The maximum value this index can reach is 1, if the response strength varies when changing the condition (disparity or orientation) while remaining constant for repetitions of the same condition (Prince, Pointon, and others 2002).
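One way to read the index described above is as a ratio of ANOVA mean squares. The sketch below takes that reading as an assumption (the text does not say whether mean squares or sums of squares were used), and the function name is hypothetical.

```python
import numpy as np


def anova_selectivity_index(responses):
    """ANOVA-style selectivity index in [0, 1].

    responses: array of shape (n_conditions, n_repetitions), one row per
    disparity or orientation value. Uses between-condition and
    within-condition mean squares (an assumption of this sketch).
    """
    r = np.asarray(responses, dtype=float)
    n_cond, n_rep = r.shape
    cond_means = r.mean(axis=1)
    grand_mean = r.mean()
    # Mean intercondition variance (between-condition mean square)
    ms_between = n_rep * np.sum((cond_means - grand_mean) ** 2) / (n_cond - 1)
    # Mean intracondition variance (within-condition mean square)
    ms_within = np.sum((r - cond_means[:, None]) ** 2) / (n_cond * (n_rep - 1))
    total = ms_between + ms_within
    return ms_between / total if total > 0 else 0.0
```

The index reaches 1 only when repetitions of a condition are identical while the condition means differ, which is the limiting case described in the text.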
Disparity tuning curves were fit with a Gabor function. We followed the method described by Prince, Pointon, and others (2002) to set the disparity frequency parameter according to the frequency spectrum obtained from the mean raw tuning curve before Gabor fitting. Preferred disparity was the disparity value evoking the most extreme response level (peak or trough) relative to the baseline level of activity. Orientation tuning curves were fit with a Gaussian function, and the peak of the Gaussian bell was used as an estimate of the preferred orientation. Only cells for which the fit accounted for more than 75% of the intercondition variability were used in this analysis.
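As an illustration of how a preferred disparity can be read from a fitted tuning curve, the sketch below evaluates a 1D Gabor on a fine grid and returns the disparity with the largest deviation from baseline. The parameterization and the helper names are assumptions made for this example; the fitting step itself is not shown.

```python
import numpy as np


def gabor_1d(d, baseline, amplitude, d0, sigma, freq, phase):
    """1D Gabor: Gaussian envelope centred on d0 times a cosine carrier."""
    envelope = np.exp(-((d - d0) ** 2) / (2.0 * sigma ** 2))
    carrier = np.cos(2.0 * np.pi * freq * (d - d0) + phase)
    return baseline + amplitude * envelope * carrier


def preferred_disparity(params, d_min=-0.6, d_max=0.6, n=2401):
    """Disparity evoking the most extreme response relative to baseline.

    params: (baseline, amplitude, d0, sigma, freq, phase) of a fitted Gabor.
    The extremum may be a peak (excitatory) or a trough (inhibitory).
    """
    d = np.linspace(d_min, d_max, n)
    r = gabor_1d(d, *params)
    baseline = params[0]
    return d[np.argmax(np.abs(r - baseline))]
```

For a zero-phase carrier the extremum is a peak at `d0`; shifting the phase by π turns it into a trough at the same disparity.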
Visual responses evoked by various HD/VD combinations were recorded for 78 cells to build disparity response surfaces. For the majority of cells (n = 42), 49 combinations were tested with at least 5 repetitions: 7 HD × 7 VD, both between −0.6° and +0.6° (some cells were tested with very fine, ±0.3°, or coarse, ±1.2°, disparity ranges). For the other cells (n = 36), we used a reduced protocol in which VD selectivity was measured for several (3–5) HD values after measurement of the HD tuning curve.
Prior to the 2-dimensional (2D) Gabor fitting, a 2D fast Fourier transform was used to obtain the frequency spectrum of each surface after removal of the DC component (i.e., mean overall visual response), and the zero-frequency component was shifted to the center of the spectrum. Disparity frequency was then set to the distance between the peak of highest energy and the spectrum center. The disparity modulation axis (perpendicular to the parallel stripes of the Gabor function) was set to match the orientation of the axis passing through the spectrum center and the peak of highest energy. A 2D Gabor function of the form

$$R(h, v) = B + A \exp\left(-\frac{h'^2 + \gamma^2 v'^2}{2\sigma^2}\right) \cos\left(2\pi f h' + \varphi\right)$$

was then fitted to the surface, with $h' = (h - h_0)\cos\theta + (v - v_0)\sin\theta$ and $v' = -(h - h_0)\sin\theta + (v - v_0)\cos\theta$, where $\theta$ is the orientation of the disparity modulation axis (set by the method described above). $A$ is the amplitude of the Gabor relative to $B$, its baseline level. The Gaussian envelope is defined by its width, $\sigma$, and its peak position ($h_0$, $v_0$). The parameter $\gamma$ is the aspect ratio that specifies how elliptical the Gaussian envelope is ($\gamma$ = 1 for a circular envelope). The cosine term is defined by its frequency, $f$, and its phase, $\varphi$, relative to the peak of the Gaussian envelope. Preferred disparity and surface elongation axis were determined from the fitted surface. Preferred disparity was the HD/VD combination eliciting the most extreme response relative to baseline (this could be a peak, for a "tuned excitatory" [TE] cell, or a trough, for a "tuned inhibitory" [TI] cell).
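The spectral initialization and the surface model described above can be sketched as follows. The Gabor parameterization used here is a standard one chosen to match the parameter definitions in the text, and the function names, argument order, and row/column convention (rows for VD, columns for HD) are assumptions of this example rather than the authors' code.

```python
import numpy as np


def spectrum_peak(surface, step_deg):
    """Starting values for the 2D Gabor fit from the surface spectrum.

    surface: 2D array of mean responses, rows = VD samples, cols = HD samples.
    Returns (disparity frequency in cycles/deg, modulation-axis angle in deg).
    """
    s = np.asarray(surface, dtype=float)
    s = s - s.mean()                                  # remove the DC component
    power = np.abs(np.fft.fftshift(np.fft.fft2(s))) ** 2
    fv = np.fft.fftshift(np.fft.fftfreq(s.shape[0], d=step_deg))
    fh = np.fft.fftshift(np.fft.fftfreq(s.shape[1], d=step_deg))
    iv, ih = np.unravel_index(np.argmax(power), power.shape)
    freq = np.hypot(fh[ih], fv[iv])                   # distance to the center
    theta = np.degrees(np.arctan2(fv[iv], fh[ih])) % 180.0
    return freq, theta


def gabor_2d(h, v, baseline, amplitude, h0, v0, sigma, gamma, freq, phase,
             theta_deg):
    """2D Gabor surface with modulation axis at theta_deg from horizontal."""
    t = np.radians(theta_deg)
    hp = (h - h0) * np.cos(t) + (v - v0) * np.sin(t)    # along modulation axis
    vp = -(h - h0) * np.sin(t) + (v - v0) * np.cos(t)   # along the stripes
    envelope = np.exp(-(hp ** 2 + (gamma * vp) ** 2) / (2.0 * sigma ** 2))
    carrier = np.cos(2.0 * np.pi * freq * hp + phase)
    return baseline + amplitude * envelope * carrier
```

On a 7 × 7 grid the spectrum is very coarse, so the values returned by `spectrum_peak` are only suitable as starting or fixed parameters for a subsequent least-squares fit, which is how the text uses them.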
To determine the disparity elongation axis following the method proposed by Cumming (2002), only the main response peak was kept (i.e., the level of activity higher than half the peak amplitude relative to baseline), from which the orientation of the long axis was measured. Note that for this analysis, disparity frequency and modulation axis extracted from the 2D spectrum were used as starting parameters but not fixed for the fit, in order to allow direct comparison of the results.
The treatment described so far was also applied to a set of 999 synthetic surfaces generated from each experimental surface by parametric bootstrap (Efron and Tibshirani 1993) to assess the 95% confidence intervals of the various parameters.
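A parametric bootstrap of this kind might look like the sketch below. The text does not state the noise model used to generate the synthetic surfaces, so Gaussian noise with the per-condition mean and standard deviation is assumed here, and the function name and percentile-based interval are illustrative choices.

```python
import numpy as np


def parametric_bootstrap_ci(responses, statistic, n_boot=999, alpha=0.05,
                            seed=0):
    """Percentile confidence interval of a statistic by parametric bootstrap.

    responses: array (n_conditions, n_repetitions) of measured responses.
    statistic: function mapping a synthetic response array to a scalar,
               e.g. a fitted preferred disparity or elongation angle.
    Synthetic data are drawn from a Gaussian with each condition's mean and
    SD (an assumed noise model, not taken from the text).
    """
    r = np.asarray(responses, dtype=float)
    mean = r.mean(axis=1, keepdims=True)
    sd = r.std(axis=1, ddof=1, keepdims=True)
    rng = np.random.default_rng(seed)
    boot = np.empty(n_boot)
    for i in range(n_boot):
        synthetic = rng.normal(mean, sd, size=r.shape)
        boot[i] = statistic(synthetic)
    lo, hi = np.percentile(boot, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return lo, hi
```

For angular parameters such as the elongation axis, a plain percentile interval would need to be adapted to handle wrap-around; that step is left out of this sketch.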
Temporal Analysis
Visual onset latency was determined from the visual responses evoked by the optimal disparity (or orientation). A mean poststimulus time histogram (PSTH) was constructed and smoothed with a Gaussian kernel (sigma between 3 and 8 ms, depending on the overall response strength) in order to compensate for the relatively small number (5) of responses (Lu and others 1995). The distribution of spike counts in the 1-ms bins before stimulus onset (from −200 to 0 ms) was fitted with a Poisson distribution, and visual onset was set to the first of 20 successive poststimulus bins for which the probability of belonging to the baseline Poisson distribution was <0.001. Because the smoothing can lead to an underestimation of visual response onset, it was corrected by sliding this first visual bin on the smoothed PSTH until the inflection of the first visual response peak (the peak of its first derivative). Selectivity appearance was then calculated on the unsmoothed PSTH by running successive ANOVA tests in a temporal window starting from the visual response onset and iteratively incremented by 1 ms. Selectivity onset was set to the first of 10 successive bins for which the ANOVA P value was < 0.05.
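The Poisson criterion for visual onset can be illustrated with a simplified sketch. It works on raw spike counts per 1-ms bin and omits the Gaussian smoothing and the inflection-point correction described above; the function names and the use of an upper-tail probability are assumptions of this example.

```python
import math
import numpy as np


def poisson_tail(k, lam):
    """P(X >= k) for X ~ Poisson(lam), with k a non-negative integer."""
    cdf = sum(math.exp(-lam) * lam ** i / math.factorial(i) for i in range(k))
    return max(0.0, 1.0 - cdf)


def onset_latency(counts, n_baseline=200, run=20, p_crit=0.001):
    """First poststimulus bin opening a run of bins unlikely under baseline.

    counts: spike counts per 1-ms bin (summed over trials); the first
    n_baseline bins precede stimulus onset. Returns the latency in ms
    relative to stimulus onset, or None if no such run exists.
    """
    counts = np.asarray(counts, dtype=int)
    lam = counts[:n_baseline].mean()          # baseline Poisson rate per bin
    post = counts[n_baseline:]
    significant = np.array([poisson_tail(int(c), lam) < p_crit for c in post])
    for t in range(len(post) - run + 1):
        if significant[t:t + run].all():
            return t
    return None
```

Requiring a run of consecutive significant bins, rather than a single one, guards against isolated baseline bursts being taken for the response onset.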
| Results |
General Results
Data were obtained from 3 monkeys. In total, 633 cells were recorded in area V1 for at least one of the following tests: orientation, HD, or VD selectivity. Two regions of area V1 were explored: 1) central V1 (cV1; N = 319), corresponding to the operculum, in which cells were recorded within 7° of the visual field center (median RF eccentricity, 3.2° [0.5°–6.3°]) and had RF sizes ranging from 0.5° to 1.5°, and 2) peripheral V1 (pV1; N = 314), lying along the bank of the calcarine sulcus, where RFs are slightly larger (from 1° to 4°) at retinal eccentricities ranging from 7° to 27° (median 14.8°) (Fig. 2A,B; see Daniel and Whitteridge 1961; Gattass and others 1981; Van Essen and others 1984 for a detailed visual topography). Orientation- and disparity-selective cells were encountered across the whole range of retinal eccentricities we explored (Fig. 2C), and we did not observe any difference in the proportions of selective cells between central and peripheral V1 either for orientation (about 90%) or for disparities (about 55%). Only disparity-selective cells for which the Gabor fit accounted for at least 75% of the tuning curve variance were kept for further analysis (93% of the total population of disparity-selective cells). Similarly, the 86% of orientation-selective cells for which the Gaussian fit described at least 75% of the tuning curve variance were kept (Table 1).
Spatial Analysis: Encoding of Central and Peripheral Disparities
Categories of HD and VD Detectors
In their pioneering work with behaving monkeys, Poggio and colleagues (Poggio and Fischer 1977; Poggio and others 1985, 1988) introduced the idea of categories of HD-selective cells: "Tuned" cells have sharp tuning profiles with a preferred disparity close to 0° for TE and TI cells and farther from 0° for the "tuned near" (TN) and "tuned far" (TF) cells. "Reciprocal" cells exhibit symmetric profiles often centered on 0°, with a preference for positive versus negative disparities ("far" cells) or the opposite ("near" cells).
We used similar terminology for VD-selective cells, without any a priori functional meaning (see preliminary data in Durand and others 2002). Examples of VD tuning profiles illustrating these 6 categories are shown in Figure 3 with their corresponding raster displays.
Disparity tuning curves were parameterized using 1D Gabor functions and quantitatively classified according to both disparity frequency and preferred disparity. Tuning measures did not reveal clusters (Fig. 4A,B), in agreement with previous studies reporting that these categories (at least for HD selectivity) do not represent distinct cell populations but rather prototypes along a continuum of tuning profile shapes (Levay and Voigt 1988
When comparing the disparity encoding ranges (Fig. 4A,B top histograms), it appears that preferred VDs are essentially concentrated around 0° in the central field and are distributed over a much wider range in the periphery (standard deviation [SD] ratio = 2.70; F-test; P < 0.0001). This same trend was also apparent for HD in peripheral V1 (SD ratio = 1.33; P < 0.005). In central V1, the range of preferred HDs was more than 2 times broader than that for VDs (SD ratio = 2.40, P < 0.0001), whereas no such anisotropy was found in peripheral V1 (SD ratio = 1.18, P > 0.05).
To evaluate the capacity of the same disparity detectors to encode both horizontal and vertical components, we show in Figure 5 the distribution of preferred disparities for the subset of 78 cells for which we recorded disparity response surfaces. In that case, the preferred disparity is a 2D parameter (combination of HD and VD) extracted from the 2D Gabor fitting of the response surfaces. It confirms a specialization for HD coding in central V1 (SD ratio = 2.78, P < 0.0001), as shown by the horizontal shape of the scatter plot also reported in previous studies (Cumming 2002; see also Fig. 4C in Pack and others 2003). By comparison, in peripheral V1, the dispersion of the data points is similar in both horizontal and vertical dimensions (SD ratio = 1.20, P > 0.05).
We calculated the VD that would naturally occur in these cells' RFs for a gaze straight ahead and a fixation distance of 50 cm (experimental conditions; see Materials and Methods for details), in order to compare it with the VD actually preferred by the same cells. Naturally occurring VD ranged within ±0.005° and ±0.30° for the central and peripheral populations of V1 cells, respectively. The VD encoding range was significantly wider than the range of naturally occurring VD (F-test, P < 0.0001 in both cV1 and pV1), and no significant correlation was found between these 2 measures in either cV1 (r = 0.34, P > 0.05) or pV1 (r = 0.25, P > 0.05). This suggests that VD tuning is not set to match the VD naturally occurring for a particular gaze configuration but rather the VD occurring across a wider range of viewing conditions.
Temporal Analysis
Because the central and peripheral disparity detectors differed in their spatial tuning characteristics, we assessed possible differences in the temporal domain by analyzing both response latencies and selectivity onsets for random-dot stimuli and oriented gratings (see Materials and Methods and Table 2). Some cells (9%) were discarded from this analysis because their responses were too weak to compute response latencies.
The distributions of visual latencies in response to HD, VD, and orientation, for central and for peripheral V1 cells, are shown in Figure 6A. No difference was found between the central and peripheral representations of V1 (Wilcoxon rank sum test, P > 0.05) either for HD and VD or for orientation (Table 2). Similarly, for the time needed by individual cells to show a disparity- or orientation-selective response, no difference was observed between the cells with central versus peripheral RFs. Thus, similar temporal characteristics were found for the central and peripheral population of cells. Moreover, no difference was found among the categories of disparity-selective cells, either for the visual latency (median latencies are 49 ms for TE/TI cells, 49 ms for TN/TF cells, and 50 ms for near/far cells) or for the selectivity onset (respectively, 77 ms, 86 ms, and 83 ms; Wilcoxon rank sum test, P > 0.05). The only significant difference was found between onsets for disparity selectivity and orientation selectivity, in both regions (Fig. 6B). Selectivity for grating orientation appears very rapidly after the beginning of the visual responses (median of 8 and 9 ms for central and peripheral V1), confirming previous studies (Vogels and Orban 1991
Orientation/Disparity Relationship
Central and Peripheral Distributions of Preferred Orientation
RF orientation of disparity-selective neurons is an important issue because, theoretically, the current model for disparity selectivity (binocular energy model; Ohzawa and others 1990) explicitly requires an orthogonal relationship between orientation and disparity axes, implying that disparity detectors in V1 should be most sensitive to disparities introduced orthogonal to their preferred orientation. To address this issue, we first asked, at the population level, whether the differences we found in disparity coding between central and peripheral regions were reflected in the orientation distribution as well. Second, we studied, at the level of individual neurons (n = 51), the relationship between RF orientation and disparity coding (by comparing their preferred orientation with the orientation of their disparity response surface). Among the cells that were tested for orientation and at least one disparity dimension in cV1 (n = 86)/pV1 (n = 107), 49%/48% were selective to both, 35%/29% were only orientation selective, 6%/9% were only disparity selective, and 10%/14% were neither orientation nor disparity selective. No difference was observed in the orientation tuning bandwidth, assessed by the Gaussian fit parameters, between central and peripheral V1 cell populations (Wilcoxon test, P > 0.05, see Table 1).
We defined 2 angles to quantify the preferred orientation coded by V1 neurons. The first one ($\theta_H$) is classically determined relative to the horizontal axis, and the second one ($\theta_P$) is relative to the polar axis passing from the visual field center to the cell's RF center. An example of an orientation-selective cell is displayed in Figure 7A, with a preferred orientation angle of 53° and a polar axis of 45°. Thus, this peripheral V1 neuron was oriented obliquely relative to the horizontal axis ($\theta_H$ = 53°) but nearly parallel (or radial) relative to the polar axis [$\theta_P$ = 8° (53° − 45°)]. The distributions of $\theta_H$ and $\theta_P$ are shown in Figure 7B for central (left column) and peripheral (right column) cells. We found that, at the population level, the previously shown specialization of central V1 for HD coding is accompanied by a vertical bias of the distribution of preferred orientation ($\theta_H$: the bias is just below significance with a Rayleigh test of uniformity, P = 0.065, but it reaches significance when uniformity is tested against the alternative of an expected nonuniform distribution with 90° mean, V test/90°, P < 0.05; Fischer 1993; Zar 1998) (Fig. 7B top left). Others (Mansfield 1974; Poggio and Fischer 1977; Bauer and others 1980; De Valois and others 1982; Celebrini and others 1993) have already described this vertical bias. We did not find such a vertical bias in the peripheral representation of the visual field (V test/90°, P = 0.24) (Fig. 7B top right).
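The two orientation measures and the two uniformity tests can be sketched as follows. Because orientation is axial (period 180°), angles are doubled before applying circular statistics; this doubling, the large-sample P value approximations, and the function names are assumptions of the example, not details given in the text.

```python
import math
import numpy as np


def polar_relative_orientation(theta_h_deg, polar_axis_deg):
    """Preferred orientation relative to the polar axis, in [-90, 90) deg."""
    return (theta_h_deg - polar_axis_deg + 90.0) % 180.0 - 90.0


def axial_tests(orientations_deg, expected_deg):
    """Rayleigh and V tests for orientations (period 180 deg).

    Returns (rayleigh_p, v_test_p). Angles are doubled so that standard
    circular statistics apply; P values use common approximations.
    """
    a = np.radians(2.0 * np.asarray(orientations_deg, dtype=float))
    n = a.size
    c, s = np.cos(a).sum(), np.sin(a).sum()
    r_n = math.hypot(c, s)                      # resultant length
    # Rayleigh test: approximate P value from n and the resultant length
    rayleigh_p = math.exp(
        math.sqrt(1 + 4 * n + 4 * (n ** 2 - r_n ** 2)) - (1 + 2 * n))
    # V test: resultant component along the expected (doubled) direction
    mu = math.radians(2.0 * expected_deg)
    v = c * math.cos(mu) + s * math.sin(mu)
    u = v * math.sqrt(2.0 / n)
    v_p = 0.5 * math.erfc(u / math.sqrt(2.0))   # one-sided normal tail
    return min(1.0, rayleigh_p), v_p


# Example from the text: 53 deg preferred orientation, 45 deg polar axis.
print(polar_relative_orientation(53.0, 45.0))
```

The V test is more powerful than the Rayleigh test when the expected direction is known in advance, which is consistent with the vertical bias reaching significance only with the V test against 90°.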
Furthermore, we observed a significant radial preference in the peripheral field ($\theta_P$: Rayleigh test, P < 0.05, V test/0°, P < 0.01) (Fig. 7B bottom right) but not in central V1 (V test/0°, P = 0.42) (Fig. 7B bottom left). Such a radial organization of the RFs, reported once in the superficial layers of central V1 (Bauer and Dow 1989
Link between Orientation and HD/VD Encoding
To test more specifically the disparity energy model, we studied the relationship between 2D disparity encoding and orientation selectivity on the same cells by recording disparity response surfaces for 51 (23 in cV1 and 28 in pV1) orientation-selective cells. Three examples are shown in Figure 8A.
In order to compare the orientation of these disparity response surfaces with the preferred orientation of the same cells, we defined the disparity modulation axis as the axis along which the visual response varies maximally for changes in angular disparity values. According to the disparity energy model (Freeman and Ohzawa 1990
These results are in agreement with the predictions of the disparity energy model (Freeman and Ohzawa 1990
| Discussion |
The main objective of this study was a quantitative comparison of retinal disparity encoding in the central and peripheral visual field representations of primary visual cortex. We measured neuronal selectivity for both HD and VD, as well as the relationship between their joint encoding and the preferred orientation.
Our results confirm the existence of V1 neurons in the peripheral field representation (up to 30°) selective for both orientation (Battaglini and others 1993) and disparity (Durand and others 2002). They are similar in proportion, strength, tuning bandwidth, and also regarding their temporal characteristics (response and selectivity onsets) to the neuronal selectivity found in the central field representation (<7°). In contrast, marked differences were found in the spatial organization of disparity and orientation coding between the center and the periphery: differences in the coding range for disparities and in the distributions of preferred orientation. We also gave clear evidence of a relationship between neuronal sensitivity to disparities and orientation coding, in line with the prediction of the disparity energy model. On the basis of these results, we propose a functional organization of V1 disparity detectors that is retinal eccentricity dependent.
HD/VD Encoding Range
In the representation of the peripheral visual field, preferred HD and VD ranges are similar, whereas, near the representation of central vision, the ranges are very different, with preferred VDs tightly clustered around 0°. This result might be expected given that VDs are naturally very small in central vision (Fig. 1). This anisotropy in the ranges of HD and VD encoding confirms a previous report from perifoveal V1 (Cumming 2002) and further reveals that this anisotropy is restricted to the representation of the central visual field.
VDs occurring naturally for a monkey fixating straight ahead at 50 cm are within ±0.02° in the central visual field (up to 7°) and cover a larger range of ±0.45° in the peripheral field (up to 30°). These ranges increase drastically when considering a wide range of eye positions: up to ±1° in the center and up to ±7° within the periphery (see Fig. 1). Thus, the VD encoding ranges found for the central and peripheral populations of V1 cells (about ±0.25° in central V1 and ±0.60° in peripheral V1, see Fig. 5) do not cover the full range of possibly arising VD. However, most of the viewing conditions considered in this simulation are quite uncommon (for instance, a gaze directed 25° to the right and 25° down with a fixation distance of 10 cm). When considering VD arising within 20° of retinal eccentricity (our peripheral population was mainly composed of cells with RFs within 20°) with more common fixation conditions (gaze direction ±10° for both azimuth and elevation and fixation distance of 20 cm or more), the VD ranges found in the center and in the periphery are ±0.24° and ±1.00°, respectively, close to the encoding ranges we reported in the central and peripheral field representations of V1 (see Figs 4A,B and 5). It has been reported that binocular fusion (and thus stereoscopic vision) can fail for very eccentric gaze associated with close fixation distance, probably because the VDs generated in such conditions are out of the binocular fusion range (Schreiber and others 2001). This could reflect the fact that V1 disparity detectors are tuned to encode a range of commonly encountered VD (i.e., associated with common viewing conditions) rather than the full range of possibly arising VD.
Disparity Energy Model
According to the disparity energy model (Ohzawa and others 1990), the orientation of the disparity response surface of V1 neurons should match their RF orientation, with highest disparity sensitivity orthogonal to the latter. Our results directly validate this prediction as we found precisely this relationship for neurons in the central as well as in the peripheral representations of V1. They are in agreement, although indirectly, with recent observations based on surface responses obtained by reverse correlation technique, which report an orthogonal relationship between V1-oriented subunits and disparity interactions (Pack and others 2003).
However, our results partly contradict a study reporting a horizontal elongation of the disparity response surfaces irrespective of the cells' preferred orientation in central V1 (Cumming 2002). To confirm that the apparent discrepancy was not due to a difference in the analytical methods, we reprocessed our data using the same method as in Cumming's study. When looking at the disparity elongation axis after 2D Gabor fitting of the response surfaces, we still found a clear parallel relationship between the elongation axis of the disparity response surfaces and the preferred orientation in both central and peripheral V1. However, we also found that among the 9 cells in the central field representation that do not exhibit this parallel relationship, 6 have a horizontal elongation axis (Fig. 9C, black arrow). A possible overrepresentation of such cells might have been responsible for the weak relationship reported between disparity and orientation encoding (Cumming 2002). In peripheral V1, we did not observe such a tendency. Thus, if a horizontal elongation of the disparity response surfaces exists, it is likely restricted to the central field representation. Despite the claim that it represents another aspect of the specialization for HD (Cumming 2002; Read and Cumming 2004), its functional advantages remain unclear because it implies a coarser tuning to HD.
Another study reported a lack of relationship between disparity and orientation encodings in central V1 (Gonzalez and others 2003). However, the sampling precision used in this study to investigate VD selectivity was too coarse (steps of 0.45°) as we have shown that, in central V1, preferred VDs are found over only a narrow range of ±0.25° (see also Cumming 2002). This alone explains why these authors failed to find neurons with nonzero preferred VD as well as why they could not demonstrate any relationship with orientation selectivity.
Distribution of the Preferred Orientation
An important difference between central and peripheral fields found in our study is the vertical bias of the preferred orientation in the center versus the radial bias in the periphery. This difference is consistent with the observed encoding ranges in both regions because, according to the disparity energy model, a vertically oriented detector will be optimal for HD encoding (DeAngelis and others 1991), whereas an oblique detector will have similar characteristics along the HD and VD dimensions. It should be noted that these biases are also observed with the smaller samples of disparity response surfaces recorded in central (n = 26) and peripheral (n = 52) representations of V1. When comparing the cell populations whose disparity axes (modulation/elongation) are roughly parallel (0°–30°), oblique (30°–60°), or perpendicular (60°–90°) relative to the horizontal and to the RF polar axis, we found a significant horizontal bias of the disparity modulation in cV1 (χ² test, P < 0.05) and a significant radial bias of the disparity elongation axis in pV1 (χ² test, P < 0.05).
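The binning and test described above can be sketched as follows. This is only an illustration: the null hypothesis of equal counts in the three 30° bins is an assumption (the expected distribution is not specified here), the function names are hypothetical, and the example counts are invented rather than taken from the recorded cells.

```python
def bin_axis_angles(angles_deg):
    """Count axis angles falling in the 0-30, 30-60, and 60-90 deg bins.

    Angles are axial (an axis at 170 deg is treated like one at 10 deg),
    so each angle is first folded into the 0-90 deg range.
    """
    counts = [0, 0, 0]
    for a in angles_deg:
        a = abs(a) % 180.0
        if a > 90.0:
            a = 180.0 - a
        index = min(int(a // 30), 2)  # 90 deg falls in the last bin
        counts[index] += 1
    return counts

def chi_square_uniform(counts):
    """Chi-square statistic against equal expected counts in every bin."""
    expected = sum(counts) / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)

# Critical value of the chi-square distribution for df = 2 at P = 0.05.
CHI2_CRIT_DF2_P05 = 5.991

if __name__ == "__main__":
    # Invented counts for a sample of 26 cells (parallel, oblique, perpendicular).
    example_counts = [18, 5, 3]
    stat = chi_square_uniform(example_counts)
    print(stat, stat > CHI2_CRIT_DF2_P05)
```

With three bins the test has 2 degrees of freedom, so a statistic above 5.991 corresponds to P < 0.05 under the assumed uniform null.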
Our results show that at the first cortical stage of binocular processing, many V1 neurons are selective for both HD and VD and that their responses to these disparity components are tightly related to their RF orientation. In the central part of the visual field, the specialization for HD encoding is reflected by the encoding range anisotropy and the fact that cells with vertically oriented RFs are well suited for encoding HD (Ohzawa and others 1990; DeAngelis and others 1991). In contrast, in the periphery, HD and VD are encoded over similar ranges and disparity detectors have a radial organization. Such an organization has probable implications in both the oculomotor and perceptual domains that are discussed below.
V1 Disparity Detectors and the Oculomotor Control of Binocular Fixation
Poggio (1995) proposed an involvement of the TE/TI cells in the fine oculomotor control of horizontal eye alignment. Extending this idea to the TE/TI-like cells for alignment control in the vertical dimension could explain the predominance of these categories in the central visual field representation (because horizontal and vertical vergence eye movements exhibit maximum gain for central stimuli). The fact that VD will rarely occur in central vision for reasons other than error in vertical alignment can explain why it is in the vertical dimension that these categories are the most prominent.
A notable difference between horizontal vergence and vertical vergence is that the gain of horizontal vergence is already maximal for small stimuli, whereas the gain for vertical vergence increases with increasing stimulus size up to 20° in diameter (Howard and others 2000). Because we have shown that the VD encoding range is narrow in the central field representation and increases quickly with increasing retinal eccentricity, it is possible that larger stimuli will recruit populations of peripheral disparity detectors that are more suited to encode the range of VD (±0.5°) used to elicit vertical vergence in the study of Howard and others (2000).
The third type of alignment error (torsional) generates cyclodisparity, which arises along isoeccentricity circles centered on the fovea and increases with increasing retinal eccentricity. Theoretically, a radial organization of the disparity detectors is suited to deal with cyclodisparity because it provides both a better tolerance to cyclodisparity and a finer encoding to control cyclovergence eye movements. Our results show that such a radial bias is actually found at peripheral eccentricities (but not in the central field representation), where it is accompanied by an isotropic HD/VD encoding range.
V1 Disparity Detectors and Stereoscopic Vision
Stereoscopic vision is preferentially expressed in central vision, where HD drives the stereoscopic percept, whereas VD generally disturbs or cancels it. The vertical bias found for the preferred orientations in central V1 is compatible with a higher sensitivity to HD than to VD and, in combination with the wider HD encoding range, creates a specialization for HD coding in the central visual field. The very narrow VD encoding range is useful in reducing the binocular correspondence search zone in the nonpertinent vertical dimension, thus reducing the chances of false matching. It can also explain the elliptical shape of Panum's binocular fusion area observed in the central visual field (Ogle and Prangen 1953; Tyler 1991) and the weak tolerance to an artificially added VD in a stereoscopic stimulus (with the deleterious effect of this signal on central stereoscopic vision; Nielsen and Poggio 1984; Prazdny 1987; Stevenson and Schor 1997).
For large stereoscopic stimuli extending into the peripheral visual field, VD has been shown to contribute to stereoscopic depth perception. The "induced effect" documented by Ogle (1938, 1962) was the first demonstration of such an influence. When a vertical magnifying lens is put in front of one eye while looking straight ahead at a large frontoparallel surface, this surface appears to be slanted in depth about its vertical axis, away from the eye with the lens. This effect defies any explanation of stereoscopic vision based only on an HD signal because the vertical magnifying lens produces only VD.
Because VD is insensitive to local depth variations but varies with distance and direction of the gaze, it can theoretically be used to recover these viewing parameters (Mayhew and Longuet-Higgins 1982; Gillam and Lawergren 1983), which are required for a correct interpretation of HD in terms of stereoscopic depth. However, although the models proposed so far are mathematically valid and fit with psychophysical observations for large stereoscopic surfaces (Rogers and Bradshaw 1993; Howard and Rogers 2002), none of them has received physiological support. For instance, neurons decoupling the HD and VD signals and integrating VD globally or regionally across the visual field have not yet been documented.
More recently, Matthews and others (2003) have proposed a model accounting for stereoscopic depth perception from VD based on disparity encoding by V1-like neurons. In this model, a vertically oriented disparity detector will not produce any depth signal from VD, whereas an oblique detector will produce a depth signal with a local sign (near/far) that depends on its radial/perpendicular orientation. The model assumes that the disparity energy model is valid and thus that the neuronal response evoked by VD is a function of the disparity detector's RF orientation. To account for the induced effect, a radial organization of the disparity detectors is also assumed to prevent a cancellation of the depth signal produced by disparity detectors at all orientations for stimuli with no dominant orientation. Our results directly demonstrate both assumptions of this model for the peripheral representation of V1: the validity of the disparity energy model and the radial organization of the peripheral disparity detectors. Because our results also show that disparity detectors in central V1 are not organized radially but rather vertically, they can also explain why only weak effects of VD are encountered in the central field representation (because vertically oriented detectors are not suited to encode VD) and only from oriented stimuli (because radial/perpendicular detectors are equally represented). Thus, the functional organization of disparity encoding in V1 can account for at least a part of the effects of VD in stereoscopic vision. However, it does not rule out the possibility of further processing of VD, notably its regional and/or global pooling, in order to gather information about absolute distance and azimuth of visual objects or to extract the viewing parameters (fixation distance and gaze direction).
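The dependence of the VD-evoked depth signal on detector orientation can be made concrete with a small geometric sketch. It assumes that an oriented detector is sensitive only to the disparity component orthogonal to its RF orientation, so that a pure VD is indistinguishable from the HD producing the same orthogonal component. The function name and the sign convention (orientation measured counterclockwise from horizontal) are assumptions made for illustration; this is a projection argument, not the published implementation of the model.

```python
import math

def equivalent_hd(vd, orientation_deg):
    """HD giving the same orthogonal disparity component as a pure VD.

    orientation_deg is the RF orientation measured from horizontal
    (90 = vertical). The unit normal to the RF is (-sin t, cos t), so a
    disparity (h, v) projects to -h*sin(t) + v*cos(t). Setting the
    projections of (h_eq, 0) and (0, vd) equal gives h_eq = -vd / tan(t).
    """
    t = math.radians(orientation_deg)
    s = math.sin(t)
    if abs(s) < 1e-12:
        # A horizontally oriented detector is insensitive to HD altogether.
        raise ValueError("no equivalent HD for a horizontal detector")
    return -vd * math.cos(t) / s

if __name__ == "__main__":
    for orientation in (90, 45, 135):
        print(orientation, round(equivalent_hd(0.5, orientation), 3))
```

Under these assumptions a vertical detector yields an equivalent HD of zero, whereas detectors at 45° and 135° yield equivalent HDs of equal magnitude and opposite sign. This is why, without an orientation bias, signals from mirror-symmetric orientations would cancel for stimuli with no dominant orientation.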
In conclusion, this study reveals distinct functional organizations for the encoding of binocular disparity within the central and peripheral representations of the visual field. In addition, our results confirm the previously reported specialization for HD processing in V1 (Cumming 2002) while revealing that this specialization is confined to the central field representation. In the periphery, we show a radial organization of disparity detectors and an isotropic encoding along the HD and VD dimensions over a range consistent with naturally occurring VDs at the retinal eccentricities considered. Finally, we demonstrate experimentally one of the cornerstone assumptions of the disparity energy model that predicts a relationship between disparity and orientation encodings. Overall, these results show that area V1, the first binocular stage in visual processing, is organized according to the geometrical characteristics of binocular vision in the central and peripheral field and that this organization can explain various functional features associated with the perceptual and oculomotor aspects of binocular vision.
| Acknowledgments |
|---|
This work was supported by the Human Frontier Science Program and Centre National de la Recherche Scientifique. We thank C. Lummert, E. Galy, and S.P. Zhu for their participation in early experiments. We are grateful to Dr Rick Born for valuable comments on the manuscript. Conflict of Interest: None declared.
| References |
|---|
Backus BT, Banks MS, van Ee R, Crowell JA. Horizontal and vertical disparity, eye position, and stereoscopic slant perception. Vision Res (1999) 39:1143–1170.
Barlow HB, Blakemore C, Pettigrew JD. The neural mechanism of binocular depth discrimination. J Physiol (1967) 193:327–342.
Battaglini PP, Galletti C, Fattori P. Functional properties of neurons in area V1 of the awake monkeys: peripheral versus central visual field representation. Arch Ital Biol (1993) 131:303–315.
Bauer JA Jr, Owens DA, Thomas J, Held R. Monkeys show an oblique effect. Perception (1980) 8:247–253.
Bauer R, Dow BM. Complementary visual maps for orientation coding in upper and lower layers of the monkey's foveal striate cortex. Exp Brain Res (1989) 76:503–509.
Celebrini S, Thorpe S, Trotter Y, Imbert M. Dynamics of orientation coding in area V1 of the awake primate. Vis Neurosci (1993) 10:811–825.
Chen Y, Wang Y, Qian N. Modeling V1 disparity tuning to time-varying stimuli. J Neurophysiol (2001) 86:143–155.
Cumming BG. An unexpected specialization for horizontal disparity in primate primary visual cortex. Nature (2002) 418:633–636.
Cumming BG, DeAngelis GC. The physiology of stereopsis. Annu Rev Neurosci (2001) 24:203–238.
Daniel PM, Whitteridge D. The representation of the visual field on the cerebral cortex in monkeys. J Physiol (1961) 159:203–221.
DeAngelis GC, Ohzawa I, Freeman RD. Depth is encoded in the visual cortex by a specialized receptive field structure. Nature (1991) 352:156–159.
De Valois RL, Yund EW, Hepler N. The orientation and direction selectivity of cells in macaque visual cortex. Vision Res (1982) 22:531–544.
Durand JB, Zhu S, Celebrini S, Trotter Y. Neurons in parafoveal areas V1 and V2 encode vertical and horizontal disparities. J Neurophysiol (2002) 88:2874–2879.
Efron B, Tibshirani RJ. An introduction to the bootstrap (1993) New York: Chapman and Hall.
Farell B. Two-dimensional matches from one-dimensional stimulus components in human stereopsis. Nature (1998) 395:689–693.
Fischer NI. Statistical analysis of circular data (1993) Cambridge (UK): Cambridge University Press.
Freeman RD, Ohzawa I. On the neurophysiological organization of binocular vision. Vision Res (1990) 30:1661–1676.
Garding J, Porrill J, Mayhew JE, Frisby JP. Stereopsis, vertical disparity and relief transformations. Vision Res (1995) 35:703–722.
Gattass R, Gross CG, Sandell JH. Visual topography of V2 in the macaque. J Comp Neurol (1981) 201:519–539.
Gillam B, Lawergren B. The induced effect, vertical disparity, and stereoscopic theory. Percept Psychophys (1983) 34:121–130.
Gonzalez F, Justo MS, Bermudez MA, Perez R. Sensitivity to horizontal and vertical disparity and orientation preference in areas V1 and V2 of the monkey. Neuroreport (2003) 14:829–832.
Gonzalez F, Perez R. Neural mechanisms underlying stereoscopic vision. Prog Neurobiol (1998) 55:191–224.
Gonzalez F, Relova JL, Perez R, Acuna C, Alonso JM. Cell responses to vertical and horizontal retinal disparities in the monkey visual cortex. Neurosci Lett (1993) 160:167–170.
Howard IP. Basic mechanisms (2002) Toronto, Canada: I. Porteus.
Howard IP, Fang X, Allison RS, Zacher JE. Effects of stimulus size and eccentricity on horizontal and vertical vergence. Exp Brain Res (2000) 130:124–132.
Howard IP, Rogers BJ. Depth perception (2002) Toronto, Canada: I. Porteus.
Howard IP, Sun L, Shen X. Cycloversion and cyclovergence: the effects of the area and position of the visual display. Exp Brain Res (1994) 100:509–514.
Joshua DE, Bishop PO. Binocular single vision and depth discrimination. Receptive field disparities for central and peripheral vision and binocular interaction on peripheral single units in cat striate cortex. Exp Brain Res (1970) 10:389–416.
Julesz B. Foundations of cyclopean perception (1971) Chicago (IL): University of Chicago Press.
Levay S, Voigt TO. Ocular dominance and disparity coding in cat visual cortex. Vis Neurosci (1988) 1:395–414.
Leventhal AG. Relationship between preferred orientation and receptive field position of neurons in cat striate cortex. J Comp Neurol (1983) 220:476–483.
Lu SM, Guido W, Vaughan JW, Sherman SM. Latency variability of responses to visual stimuli in cells of the cat's lateral geniculate nucleus. Exp Brain Res (1995) 105:7–17.
Mansfield RJ. Neural basis of orientation perception in primate vision. Science (1974) 186:1133–1135.
Matthews N, Meng X, Xu P, Qian N. A physiological theory of depth perception from vertical disparity. Vision Res (2003) 43:85–99.
Mayhew JE, Longuet-Higgins HC. A computational model of binocular depth perception. Nature (1982) 297:376–378.
Mazer JA, Vinje WE, McDermott J, Schiller PH, Gallant JL. Spatial frequency and orientation tuning dynamics in area V1. Proc Natl Acad Sci USA (2002) 99:1645–1650.
Nielsen KR, Poggio T. Vertical image registration in stereopsis. Vision Res (1984) 24:1133–1140.
Nikara T, Bishop PO, Pettigrew JD. Analysis of retinal correspondence by studying receptive fields of binocular single units in cat striate cortex. Exp Brain Res (1968) 6:353–372.
Ogle KN. Induced size effect. I. A new phenomenon in binocular space-perception associated with the relative sizes of the images of the two eyes. Arch Ophthalmol (1938) 20:604–623.
Ogle KN. The optical space sense. In: Davson H, ed. The eye (1962) New York: Academic Press. 211–432.
Ogle KN, Prangen AD. Observations on vertical divergences and hyperphorias. AMA Arch Ophthalmol (1953) 49:313–334.
Ohzawa I, DeAngelis GC, Freeman RD. Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. Science (1990) 249:1037–1041.
Pack CC, Born RT, Livingstone MS. Two-dimensional substructure of stereo and motion interactions in macaque visual cortex. Neuron (2003) 37:525–535.
Pigarev IN, Nothdurft HC, Kastner S. Neurons with radial receptive fields in monkey area V4A: evidence of a subdivision of prelunate gyrus based on neuronal response properties. Exp Brain Res (2002) 22:633–636.
Poggio GF. Mechanisms of stereopsis in monkey visual cortex. Cereb Cortex (1995) 3:193–204.
Poggio GF, Fischer B. Binocular interaction and depth sensitivity in striate and prestriate cortex of behaving rhesus monkey. J Neurophysiol (1977) 40:1392–1405.
Poggio GF, Gonzalez F, Krause F. Stereoscopic mechanisms in monkey visual cortex: binocular correlation and disparity selectivity. J Neurosci (1988) 8:4531–4550.
Poggio GF, Motter BC, Squatrito S, Trotter Y. Responses of neurons in visual cortex (V1 and V2) of the alert macaque to dynamic random-dot stereograms. Vision Res (1985) 25:397–406.
Prazdny K. Vertical disparity nulling in random-dot stereograms. Biol Cybern (1987) 56:61–67.
Prince SJ, Cumming BG, Parker AJ. Range and mechanism of encoding of horizontal disparity in macaque V1. J Neurophysiol (2002) 87:209–221.
Prince SJ, Pointon AD, Cumming BG, Parker AJ. Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. J Neurophysiol (2002) 87:191–208.
Pugh MC, Ringach DL, Shapley R, Shelley MJ. Computational modeling of orientation tuning dynamics in monkey primary visual cortex. J Comput Neurosci (2000) 8:143–159.
Read JC, Cumming BG. Understanding the cortical specialization for horizontal disparity. Neural Comput (2004) 16:1983–2020.
Rodionova EI, Revishchin AV, Pigarev IN. Distant cortical locations of the upper and lower quadrants of the visual field represented by neurons with elongated and radially oriented receptive fields. Exp Brain Res (2004) 158:373–377.
Rogers BJ, Bradshaw MF. Vertical disparities, differential perspective and binocular stereopsis. Nature (1993) 361:253–255.
Schreiber K, Crawford JD, Fetter M, Tweed D. The motor side of depth vision. Nature (2001) 410:819–822.
Stevenson SB, Schor CM. Human stereo matching is not restricted to epipolar lines. Vision Res (1997) 37:2717–2723.
Trotter Y. Cortical representation of visual three-dimensional space. Perception (1995) 24:287–298.
Trotter Y, Celebrini S. Gaze direction controls response gain in primary visual-cortex neurons. Nature (1999) 398:239–242.
Tsao DY, Conway BR, Livingstone MS. Receptive fields of disparity-tuned simple cells in macaque V1. Neuron (2003) 38:103–114.
Tyler CJ. The horopter and binocular fusion (1991) Boston: CRC.
Van Essen DC, Newsome WT, Maunsell JH. The visual field representation in striate cortex of the macaque monkey: asymmetries, anisotropies, and individual variability. Vision Res (1984) 24:429–448.
Vogels R, Orban GA. Quantitative study of striate single unit responses in monkeys performing an orientation discrimination task. Exp Brain Res (1991) 84:1–11.
Wheatstone C. Contribution to the physiology of vision. Part the first. On some remarkable and hitherto unobserved phenomena of binocular vision. Philos Trans R Soc Lond B Biol Sci (1838) 128:371–394.
Zar JH. Biostatistical analysis (1998) 4th ed. New Jersey: Prentice Hall.