Cerebral Cortex Advance Access originally published online on September 12, 2006
Cerebral Cortex 2007 17(7):1672-1679; doi:10.1093/cercor/bhl077
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Multisensory Interactions within Human Primary Cortices Revealed by BOLD Dynamics
1 Service de Radiodiagnostic et Radiologie Interventionnelle, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland, 2 Institut de Traitement des Signaux, Ecole Polytechnique Fédérale, Lausanne, Switzerland, 3 Division Autonome de Neuropsychologie, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland, 4 Functional Brain Mapping Laboratory, Department of Neurology, University Hospital, Geneva, Switzerland
Address correspondence to: Roberto Martuzzi, Radiology Service, CHUV, rue du Bugnon 46, 1011 Lausanne, Switzerland. Email: roberto.martuzzi{at}chuv.ch or to Micah M. Murray, The Functional Electrical Neuroimaging Laboratory, Neuropsychology Division, CHUV, Hôpital Nestlé, 5 Av. Pierre Decker, 1011 Lausanne Switzerland. Email: micah.murray{at}chuv.ch.
| Abstract |
|---|
|
|
|---|
Whether signals from different sensory modalities converge and interact within primary cortices in humans is unresolved, despite emerging evidence in animals. This is partially because of debates concerning the appropriate analyses of functional magnetic resonance imaging (fMRI) data in response to multisensory phenomena. Using event-related fMRI, we observed that simple auditory stimuli (noise bursts) activated primary visual cortices and that simple visual stimuli (checkerboards) activated primary auditory cortices, indicative of multisensory convergence. Moreover, analyses of blood oxygen leveldependent response dynamics revealed facilitation of hemodynamic response peak latencies and slopes for multisensory auditoryvisual stimuli versus either unisensory condition, indicative of multisensory interactions within primary sensory cortices. Neural processing at the lowest cortical levels can be modulated by interactions between the senses. Temporal information in fMRI data can reveal these modulations and overcome analytic and interpretational challenges of more traditional procedures. In addition to providing an essential translational link with animal models, these results suggest that longstanding notions of cortical organization need to be revised to include multisensory interactions as an inherent component of functional brain organization.
Key Words: auditory cross-modal fMRI multisensory integration primary cortex visual
| Introduction |
|---|
|
|
|---|
Sensory inputs converge and interact, influencing perception and behavior (e.g. Stein and Meredith 1993
Regarding interactions between the auditory and visual systems, several questions remain unresolved. For example, it is unknown whether monosynaptic projections between primary cortices in monkeys also exist in humans and, if so, whether they produce multisensory interactions that are measurable noninvasively. Although the former is currently not feasible with existing tracing methods, the latter can be assessed using brain imaging techniques. Convergence and interaction effects have been obtained for speech/faces (e.g. Calvert 2001
; Pekkola and others 2005
), letters/vocalizations (e.g. van Atteveldt and others 2004
), and environmental objects (e.g. Beauchamp and others 2004
).
However, the use of functional magnetic resonance imaging (fMRI) to identify multisensory effects is debated (Calvert 2001
; Beauchamp 2005
; Laurienti and others 2005
). In particular, it is unclear whether criteria that have been applied in electrophysiological studies at the single neuron level are valid for fMRI analyses. For example, criteria for convergence (i.e. responding to multiple senses), multisensory enhancement (i.e. responding more to multisensory than to both unisensory stimuli), supraadditivity (i.e. responding more to multisensory than to the summed unisensory responses), and sensitivity to congruent stimulus features (e.g. spatial position, temporal coincidence, or object-related/semantic attributes) have been argued as prone to reporting falsely positive results (Laurienti and others 2005
). Despite these considerations, some studies have reported multisensory effects within primary and adjacent auditory cortices in response to somatosensory (Foxe and others 2002
) or visual stimuli (e.g. Pekkola and others 2005
). To date, none have observed effects within primary visual cortex or examined whether multisensory effects in low-level cortical regions can similarly be elicited by meaningless, rudimentary stimuli. Instead, investigations have thus far been limited to meaningful stimuli, and it remains unknown whether effects in low-level cortices are mediated by higher-order processes. Resolving such questions is critical for determining whether interaction mechanisms between sensory systems are a general, perhaps automatic, property or are instead regulated by stimulus-specific processes.
Interpretational caveats of standard fMRI were bypassed by analyzing dynamics of the blood oxygenation leveldependent (BOLD) signal. Recent developments in event-related fMRI indicate that latency analyses can be performed on the directly measured BOLD signal (e.g. Josephs and others 1997
; Henson and others 2002; Bellgowan and others 2003; Martuzzi and others 2006
). As such, we analyzed both the spatial and the temporal pattern of responses during auditoryvisual (AV) integration.
| Materials and Methods |
|---|
|
|
|---|
Subjects
Twelve healthy subjects (mean ± SD age = 29.4 ± 7.1 years; 6 female) with normal hearing, normal or corrected-to-normal vision, and no history of neurological or psychiatric disease participated. Each provided written informed consent to procedures approved by the Ethics Committee of the Faculty of Biology and Medicine at the University of Lausanne.
Stimuli and Task
Subjects performed a simple reaction time task to visual (a nonalternating yellow on black centrally presented checkerboard, measuring 24° x 32° in total size and each square covering 0.8° x 0.8° of visual angle), auditory (a binaural noise burst), or simultaneous AV stimuli (each 150-ms duration). Stimulus conditions were pseudorandomly intermixed across trials. Subjects were instructed to respond as fast as possible with their right hand upon detection of any stimulus by pressing keys. Behavioral data were acquired using button presses on a MRI-compatible device (Photon Control Inc., Burnaby, BC, Canada). Subjects pressed 4 keys, one per finger, in one swift and continuous movement like tapping one's fingers on a table, and reaction times were recorded as the latency at which the first of the keys was pressed. Stimulus delivery and the acquisition of behavioral data were controlled by E-prime (Psychology Software Tools, Inc., Pittsburgh, PA). Behavioral data from 2 of the twelve subjects were lost due to technical failures.
The interstimulus interval varied pseudorandomly from 14.2 to 17.8 s in steps of 200 ms, allowing the BOLD signal to return to baseline between stimulus presentations (e.g. Josephs and others 1997
). There was a pseudorandom, variable delay between stimulus onset time and volume acquisition of 01.8 s at steps of 200 ms, yielding a total of 10 different delays. Jittering stimulus presentation relative to volume acquisition permitted the BOLD response to be effectively sampled with a temporal resolution of 200 ms (see Supplementary Animation and Martuzzi and others 2006
for additional details). The experiment consisted of 4 sessions, each including 10 repetitions per experimental condition. Therefore, each of the 3 experimental conditions collectively included 4 volume acquisitions at each of the 10 delays used.
Magnetic Resonance Imaging
fMRI data were acquired using an event-related design on a 3.0-T Philips Intera system equipped with an 8-channel head coil. BOLD signals were obtained with a single shot gradient-echo echo-planer imaging sequence (repetition time [TR] = 2 s, echo time [TE] = 30 ms, field of view [FoV] = 224 mm, flip angle = 90°, matrix size 64 x 64). Each volume was comprised of 16 slices (slice thickness 5 mm, gap 1 mm) covering the entire cerebral hemispheres and acquired in ascending order (i.e. first slice at the bottom of the head). To provide precise structural and anatomical localization of brain activity, a sagittal T1-weighted 3D gradient-echo sequence was acquired for each subject (160 contiguous sagittal slices, slice thickness 1 mm, matrix size 256 x 256, TR = 9.9 ms, TE = 4.6 ms, FoV = 256 mm, flip angle = 8°).
Spatial fMRI Analyses
Two types of fMRI analyses were conducted in order to investigate multisensory interactions: the first in terms of the spatial pattern of activated brain regions, the second in terms of changes in temporal dynamics within activated brain regions. The latter analysis was done by measuring the shift in peak latency of the estimated hemodynamic response function within a given area across stimulus conditions (detailed below).
Activation maps were obtained using SPM2 software (Wellcome Department of Cognitive Neurology, London, UK). Functional volumes were first spatially realigned to the first volume acquired and temporarily realigned to the first slice acquired. Volumes were then normalized to the Montreal Neurological Institute (MNI) template, resampled to a voxel size of 3 x 3 x 3 mm3, and smoothed with an isotropic Gaussian kernel (full width half maximum = 6 mm). For each subject, a high-pass filter was applied on the time series to minimize possible effects of baseline drift. The statistical analysis was performed with the General Linear Model, using as a basis function the canonical hemodynamic response function and its temporal derivative, as defined in SPM2. Structural and functional volumes were coregistered within the same coordinate system by normalizing structural images to the MNI template brain and resampling voxels to a 1 x 1 x 1 mm3 size. Inference on the population (group analysis) was obtained by means of second-level statistics, according to the random effects theory. Analyses were conducted to determine the active regions in each condition, separately (voxel-level threshold at P < 0.001, uncorrected; 15 voxel spatial-extent threshold) both on a single subject basis and on the group results.
Temporal fMRI Analyses
The hemodynamic response was reconstructed by averaging the 4 samples collected at each time point relative to stimulus onset and then filtering to remove high-frequency noise. Peak latency and intensity of the hemodynamic signals were measured after fitting the signal around the peak (±0.8 s, equal to 4 data points before and after the peak) with a cubic curve. We would emphasize that cubic fitting was solely applied to these data points encompassing the peak of the acquired time course response, and that time courses shown always display the filtered raw data.
For each experimental condition, peak latencies were derived for each voxel within the brain. To ensure a valid measure of peak latency, analyses were spatially restricted to the regions identified as responsive to auditory and visual stimuli. For each subject, visual-responsive regions (VRRs) were defined according to the overlap between activation maps from the visual and multisensory conditions and auditory-responsive regions (ARRs) by the overlap between activation maps from the auditory and multisensory conditions (see Beauchamp 2005
for a similar approach in analyses of BOLD amplitude). This overlap criterion also minimizes the likelihood of falsely considering a voxel as responsive to either visual or auditory stimulation. Within VRRs we statistically tested (paired t-test) the difference between BOLD peak latencies from the visual and multisensory conditions, whereas within ARRs this analysis was between auditory and multisensory conditions. It should be noted that these tests were conducted on a voxel-wise level and that not all subjects necessarily exhibited the same VRRs and ARRs. In such instances, no peak latency would be measured for a particular subject at a specific voxel. This would result in an empty cell in the analysis matrix and thus increases the propensity for falsely negative results. This constituted the group-level analysis of shifts in BOLD peak latency. We considered only those clusters meeting both a P < 0.05 alpha criterion (uncorrected) and also a 15-voxel spatial-extent criterion.
Although the above approach can identify modulation in BOLD peak latency throughout the entire brain, it does not account for intersubject variability in cortical functional geometry. To partially overcome this limitation without restricting our analyses to particular anatomical subdivisions (i.e. to conduct analyses throughout the entire brain volume), for each subject we identified voxels within individual VRRs and ARRs that were also within the regions defined by the aforementioned group-level analysis of shifts in BOLD peak latency. This analysis yielded a subset of 4 regionsprimary visual and auditory cortices, bilaterally (see Results). Within each of these 4 individually identified regions, we calculated the mean BOLD response so as to obtain a single time curve per condition and per subject. The peaks of these time curves, within each region separately, were then statistically analyzed (multivariate test and post hoc paired t-tests) across stimulus conditions. One subject did not show a reliable response (i.e. exceeding the noise level) to auditory stimulation within primary visual areas bilaterally and was therefore excluded from analyses including this condition in these regions. Peak intensity values of these time curves were similarly analyzed. Lastly, we also analyzed the slope of these curves as a post hoc supplement to our analyses of peak latency. For each curve, the slope was defined in the following manner. First, we defined the positions and intensities of the peak and the immediately preceding minimum. From these 2 points, we then fit a line within the acquired data points lying between the 20% and 90% intensity values of this range. The slope of this line was then analyzed, as above, with a multivariate test and follow-up t-tests.
| Results |
|---|
|
|
|---|
Behavioral data confirmed that multisensory interactions occurred. Mean reaction times were faster for the multisensory than either visual or auditory condition (mean ± SEM = 355 ± 28 ms, 379 ± 25 ms, and 400 ± 30 ms, respectively; F2,8 = 36.38; P < 0.001; see Fig. 1a), replicating prior demonstrations of a redundant signals effect between audition and vision (e.g. Raab 1962
|
Our behavioral results are suggestive of an "asymmetry" in the mean reaction time data, such that the difference between the multisensory and visual conditions is approximately the same magnitude as the difference between the visual and auditory conditions. It is important to note, however, that although the difference between the multisensory and each unisensory condition is significant, the difference between the visual and auditory conditions is not (please see caption to Fig. 1). This pattern replicates previously published studies (Schröger and Widmann 1998
Two series of event-related fMRI data analyses were conductedone in terms of spatial activation maps, following standard procedures and another in terms of BOLD response peak latencies (see Materials and Methods). Activation maps (Fig. 2) and BOLD times series (Fig. 3) show that primary cortices of each sensory modality (i.e. calcarine cortex and Heschl's gyrus) responded to both visual and auditory stimulation, indicative of multisensory convergence. In addition, the activation maps also show robust responses within the left primary motor cortex, left somatosensory areas, the supplementary motor area (SMA), and thalamic regions. Because subjects performed a button-press to each stimulus presentation, irrespective of sensory modality, the tactile input likely elicited responses within somatosensory regions. This notion is in part supported by the left-lateralized activations in somatosensory and motor regions (see Fig. 2). Consequently, it would be difficult to disambiguate any multisensory convergence (i.e. responses to the auditory and/or visual stimuli themselves) in these areas.
|
|
In order to assess multisensory interactions (i.e. where these convergent inputs alter responses to simultaneous AV stimulation) while also minimizing issues in fMRI investigations of multisensory interactions that stem, in part, from analyses of BOLD signal amplitude (Calvert 2001
To this point, our analyses revealed both multisensory convergence and shifts in BOLD dynamics within primary auditory and visual cortices following simultaneous AV stimulation. This method allows us to investigate interactions throughout the entire brain but one possible shortcoming is that it does not account for intersubject anatomical and functional variability (Amunts and others 2000
; Rademacher and others 2001
). However, to our knowledge there is no universally accepted method for intersubject alignment of functional activations, and existing approaches minimally require the preselection of anatomical subdivisions for analyses (Pekkola and others 2005
). More specific to our study, it is possible that superposition of activated areas across individuals was incomplete. That is, for this voxel-wise analysis it is possible that a given subject did not exhibit a robust BOLD response at a particular voxel, in which case no peak latency would be measured. The above analysis can therefore be considered conservative.
To partially overcome these issues, we identified voxels within individual subject VRRs and ARRs that were also within the regions defined by the aforementioned group-level analysis of BOLD peak latency shifts performed on each voxel (see Materials and Methods for details). This yielded a subset of 4 regionsprimary visual and auditory cortices, bilaterally. For each subject we calculated the mean BOLD response time curve in response to each condition in each of these individually defined regions (Fig. 3b). In other words, the cluster-wise analysis used the contiguous regions defined in the above voxel-wise analyses to screen for contiguous voxels at the individual subject level. That is, we determined which voxels showed a robust BOLD response for each subject and then took the average across them before measuring the peak response latency. Importantly, this cluster-level analysis differs from the above voxel-level analysis in that each subject contributes a value to each test, thereby maintaining the degrees of freedom while also partially allowing for variation in functional anatomy. Peak latencies and intensities were statistically compared using experimental condition as the within-subjects factor. Each region showed a significant main effect of experimental condition on peak latencies that was explained by earlier peak latencies for the multisensory than either unisensory condition (Fig. 3b and upper portion of Table 1). It is important to note that the differences in peak latencies were larger than our 200 ms temporal sampling frequency, excluding the cubic fitting procedure as an explanation for the present results. A significant main effect of condition on peak intensity was also shown in each region, which was due to significantly smaller auditory responses within visual areas and smaller visual responses within auditory cortices (Fig. 3b and lower portion of Table 1). Conversely, no significant differences were obtained either between multisensory and auditory intensities within auditory areas or between multisensory and visual intensities within visual areas. Thus, the present effects on peak latency cannot follow from a simple trade-off between lower response intensity and earlier peak latencies because responses with significantly earlier peak latencies could also have significantly larger intensities (see Table 1). To further exclude such a possibility, we also assessed whether the slope of the response to the AV condition was steeper than that of either unisensory condition (see Materials and Methods for details). The results of these analyses can be found in Supplementary Table 1. In agreement with our analyses of peak latency, each region showed a significant main effect of experimental condition that was explained by a steeper slope for the multisensory than either unisensory condition (Fig. 3b and Supplementary Table 1).
|
As a final step, we investigated whether or not the observed shifts in BOLD peak latency correlated with the observed reaction time facilitation. To do this, we first calculated the peak latency difference between the visual (V) and AV conditions and between the auditory (A) and AV conditions for each region exhibiting a significant facilitation in BOLD peak latency for the AV condition (i.e. bilateral primary auditory and visual cortices). We also calculated the reaction time difference between the V and AV conditions and between the A and AV conditions. In no case was a significant correlation observed (all P values > 0.05), providing no evidence for a direct correspondence between effects on BOLD peak latency and performance (see also Formisano and others 2002
| Discussion |
|---|
|
|
|---|
This is the first demonstration of multisensory interactions in primary visual and auditory cortices, which manifested as dynamic shifts in BOLD response latencies. No other regions showed significant effects on peak BOLD latency. Conjointly, we observed robust responses, in terms of BOLD amplitude, to both senses within low-level cortices. Simple visual stimuli lead to responses within auditory cortices and vice versa. These findings raise the question of their underlying neurophysiologic bases. Given the emerging anatomical and electrophysiological evidence for AV multisensory interactions within the earliest processing stages (e.g. Schroeder and Foxe 2005
A principal finding of the present study is that primary cortices responded to stimuli of other sensory systems. This is indicative of multisensory convergence, according to the criteria defined by Stein and Meredith (1993)
. In primary auditory cortex bilaterally, there were robust responses to visual stimuli that were nonetheless significantly smaller than the response to either auditory or multisensory stimuli (see Fig. 3 and Table 1). Similarly, in primary visual cortices bilaterally, there were responses to auditory stimuli that were significantly smaller (i.e. approximately half the magnitude) than the responses to either visual or multisensory stimuli (Fig. 3 and Table 1). Several recent studies have also documented AV convergence within primary or near-primary cortices (Calvert 2001
; Pekkola and others 2005
; Tanabe and others 2005
). Others, using a block design, have shown deactivation in auditory cortices in response to visual stimuli and vice versa (e.g. Laurienti and others 2002
; see also Haxby and others 1994
; Kawashima and others 1995
). However, these modulations were not present on multisensory blocks (Laurienti and others 2002
), and the other studies were limited to examinations of selective attention to specific visual features. Thus, selective attention to one sensory modality might hinder the observation of multisensory effects such that positive multisensory convergence (i.e. stimuli of both senses leading to positive-going activations) may depend on paradigms that include a multisensory context and attention to multiple sensory modalities (see Laurienti and others 2002
; Brosch and others 2005
; Tanabe and others 2005
). In the present paradigm, attention was continuously allocated to both the auditory and visual modalities because there was equal likelihood that either sense would be stimulated on a given trial.
Even if attention could account for multisensory convergence (i.e. frank responses to both sensory modalities), it cannot readily account for either the interaction effects on reaction times or the facilitation of BOLD response peak latencies for multisensory relative to both unisensory conditions in the present study (i.e. significantly faster reaction times and BOLD peak latencies to the AV condition). This is corroborated by the fact that the facilitation of reaction times (i.e. the redundant signals effect) exceeded probability summation. That is, likelihood of a fast reaction time following a multisensory stimulus was greater than the summed likelihoods of an equally fast reaction time following either unisensory stimulus (see Fig. 1). This is indicative of facilitative integrative processing exceeding any contribution of selective attention. Similarly, selective attention cannot account for the fact that BOLD peak latency was facilitated for the AV condition. If such were the case, one would expect that BOLD peak latency for the AV condition would be equivalent to the faster of the 2 unisensory conditions. Our results, however, indicate that the peak latency in response to the AV condition was earlier than either unisensory condition and that the peak latencies in response to the 2 unisensory conditions did not significantly differ from each other (see Table 1).
Task demands, by contrast, do not appear to be a determining factor for observing multisensory convergence or interactions. For example, auditorysomatosensory multisensory convergence and supraadditive interactions have been shown using passive paradigms where subjects were nonetheless aware that stimuli would be presented in either or both sensory modalities (Foxe and others 2000
, 2002
). Whether or not this applies to effects on BOLD dynamics will be a topic for future experiments. However, electrophysiological studies in nonhuman primates provide one line of evidence that multisensory convergence and interactions occur under passive conditions and even under anesthesia. Frank responses to both visual and somatosensory stimuli have been recorded within primary and belt auditory cortices (e.g. Schroeder and others 2001
; Fu and others 2003
; Ghazanfar and others 2005
; see also Kayser and others 2006
for recent fMRI results in macaques); though, to our knowledge, similar experiments within visual cortices have not yet been conducted. This collective pattern of results suggests that unisensory stimulation in a paradigm lacking a multisensory context can indeed elicit multisensory effects (in particular convergence). To date, such effects have been most consistently observed using intracranial electrophysiological methods in animals. Our results nonetheless support the future use of event-related designs in combination with high-field fMRI in noninvasively identifying multisensory phenomena under unisensory conditions.
Analyses of BOLD dynamics represent a methodological advancement for identifying multisensory brain regions with fMRI. Here, multisensory interactions led to changes in BOLD latency but not amplitude (see Fig. 3). Importantly, these latency effects did not follow from a simple amplitude/latency trade-off. In auditory cortices, responses to the multisensory condition peaked earlier than either unisensory condition, even though its amplitude was equal to that following auditory stimulation and larger than that following visual stimulation. Likewise, in visual cortices, responses to the multisensory condition peaked earlier than either unisensory condition, even though its amplitude was equal to that following visual stimulation and larger than that following auditory stimulation (see Table 1 for detailed statistics).
Several laboratories have recently been examining the validity of different analyses for identifying multisensory interactions (Calvert 2001
; Beauchamp 2005
; Laurienti and others 2005
). As noted by one laboratory, these approaches inherently assume that signals within a given voxel emanate from a singular, homogenous neural population in terms of its responsiveness (Laurienti and others 2005
). When this is not the case, canonical criteria of convergence and enhancement can be overly liberal and yield falsely positive results. Supraadditivity as an analysis criterion has also been subject to criticism. Laurienti and others (2005)
contended, based largely on the frequency of observing supra- and subadditive interaction profiles at the individual neuron level, that at the level of fMRI voxels such populations would not be separable and that a supraadditive criterion would be prone to falsely negative results. Beauchamp (2005)
also suggests that this criterion is overly strict. Instead, Beauchamp supports first restricting analyses to those voxels showing activation to any experimental condition and then applying a threshold wherein the multisensory response must exceed the mean of the unisensory responses. In accord with this proposal, we first spatially restricted our analyses to voxels responsive to both audition and vision. This was done to ensure that peak latency shifts occurred in locations active under unisensory conditions, and that a peak latency shift was apparent in voxels showing a robust, positive BOLD response. More importantly, we would contend that analyses of BOLD response latency bypass the aforementioned interpretational concerns associated with analyses of BOLD response amplitude and provide a clear metric of multisensory interactions. Our methods highlight that the full range of effects may go undetected by typical analysis approaches of BOLD amplitude. One proposition is that an earlier peak reflects facilitated neural processing time (Henson and others 2002). Although appealing, it will be important for future investigations to detail more fully the bases for latency shifts in the BOLD signal.
The present data do not allow us to differentiate feedforward from feedback activity within a cortical region. Still, the anatomical studies that first identified direct projections between primary cortices noted that axon terminals were situated predominantly within layers 1 and 6, consistent with a functionally feedback profile (Rockland and Ojima 2003
). This interpretation is likewise supported by electrophysiological recordings in the case of visual inputs into auditory cortices, which were distributed across the cortical laminae (Schroeder and others 2003
). One speculative possibility is that the magnitude of the observed BOLD responses in the present study might be representative of the distribution of inputs into the region. In the case of visual inputs into auditory cortices, this distribution may be diffuse, whereas auditory inputs into visual cortices may be rather limited or focused. One level of support for this possibility stems from the work of Logothetis (2003)
demonstrating a higher level of coupling between local field potentials, considered to be a measure of input activity within a region, and the BOLD response than between multiunit activity, considered to be a measure of the output activity within a region, and the BOLD response. Substantiating the above speculation will require further experimentation in animal models that specify the laminar origin of signals, though some work has begun in this direction (e.g. Schroeder and others 1998
; Goense and Logothetis 2006
). A further speculation is that direct interactions between primary auditory and visual cortices are the basis for the observed latency shifts. As mentioned above, several laboratories have now independently identified monosynaptic projections between these cortices in nonhuman primates (Falchier and others 2002
; Rockland and Ojima 2003
; Cappe and Barone 2005
). Although such information in humans would be of immense importance, it is presently not feasible with existing staining methods. An alternative viewpoint would be that the present effects are instead mediated by another region. Although we cannot unequivocally exclude such a possibility, it is surprising that such a region did not itself show a BOLD latency shift or amplitude modulation. In addition, one might also have expected that latency shifts in primary cortices would be mirrored by effects in motor-related cortices that would in turn underlie the observed facilitation in reaction times. The present study provides no evidence that such is occurring. For one, no effects on peak latency were observed in M1, the SMA, or subcortical regions that were nonetheless identified as active under all stimulation conditions (see Supplementary Figure). Second, there was no evidence of a significant correlation between shifts in BOLD peak latency and behavioral facilitation. Finally, as discussed above, the evidence for multisensory interactions during both passive and active conditions suggests that task performance is not directly linked with interactions within low-level cortices. We therefore contend that the most parsimonious interpretation of the present results is that there are direct, but not forcibly feedforward, interactions between primary auditory and visual cortices of humans.
Multisensory interactions within primary cortices and between rudimentary stimuli require that long-standing notions of cortical organization be revised to include multisensory interactions as a fundamental component of neural organization (e.g. Wallace and others 2004
). Here, we show how investigation of BOLD dynamics can address the current gap in knowledge regarding the neurophysiological bases of and brain regions contributing to multisensory interactions.
| Supplementary Material |
|---|
|
|
|---|
Supplementary material can be found at: http://www.cercor.oxfordjournals.org/.
| Notes |
|---|
|
|
|---|
The first 2 authors contributed equally to this study.
| Acknowledgments |
|---|
We thank Dr Charles Schroeder and Dr Glenn Wylie, as well as 2 anonymous reviewers for helpful comments on a preliminary version of this manuscript. This study has been supported by the Swiss National Science Foundation (SNSF 3200B0-100606 and SNSF 3200BO-105680/1) and the Leenaards Foundation (2005 prize for the promotion of scientific research). Conflict of Interest: None declared.
| References |
|---|
|
|
|---|
Amunts K, Malikovic A, Mohlberg H, Schormann T, Zilles K. Brodmann's areas 17 and 18 brought into stereotaxic spacewhere and how variable? Neuroimage (2000) 11:6684.[CrossRef][ISI][Medline]
Beauchamp MS. Statistical criteria in FMRI studies of multisensory integration. Neuroinformatics (2005) 3:93113.[CrossRef][ISI][Medline]
Beauchamp MS, Argall BD, Bodurka J, Duyn JH, Martin A. Unraveling multisensory integration: patchy organization within human STS multisensory cortex. Nat Neurosci (2004) 7:11901192.[CrossRef][ISI][Medline]
Bellgowan PS, Saad ZS, Bandettini PA. Understanding neural system dynamics through task modulation and measurement of functional MRI amplitude, latency, and width. Proc Natl Acad Sci USA (2003) 100:14151419.
Brosch M, Selezneva E, Scheich H. Nonauditory events of a behavioral procedure activate auditory cortex of highly trained monkeys. J Neurosci (2005) 25:67976806.
Calvert GA. Crossmodal processing in the human brain: insights from functional neuroimaging studies. Cereb Cortex (2001) 11:11101123.
Cappe C, Barone P. Heteromodal connections supporting multisensory integration at low levels of cortical processing in the monkey. Eur J Neurosci (2005) 22:28862902.[CrossRef][ISI][Medline]
Clavagnier S, Falchier A, Kennedy H. Long-distance feedback projections to area V1: implications for multisensory integration, spatial awareness, and visual consciousness. Cogn Affect Behav Neurosci (2004) 4:117126.[Medline]
Falchier A, Clavagnier S, Barone P, Kennedy H. Anatomical evidence of multimodal integration in primate striate cortex. J Neurosci (2002) 22:57495759.
Formisano E, Linden DE, Di Salle F, Trojano L, Esposito F, Sack AT, Grossi D, Zanella FE, Goebel R. Tracking the mind's image in the brain I: time-resolved fMRI during visuospatial mental imagery. Neuron (2002) 35:185194.[CrossRef][ISI][Medline]
Foxe JJ, Morocz IA, Murray MM, Higgins BA, Javitt DC, Schroeder CE. Multisensory auditory-somatosensory interactions in early cortical processing revealed by high-density electrical mapping. Brain Res Cogn Brain Res (2000) 10:7783.[CrossRef][Medline]
Foxe JJ, Wylie GR, Martinez A, Schroeder CE, Javitt DC, Guilfoyle D, Ritter W, Murray MM. Auditory-somatosensory multisensory processing in auditory association cortex: an fMRI study. J Neurophysiol (2002) 88:540543.
Fu KM, Johnston TA, Shah AS, Arnold L, Smiley J, Hackett TA, Garraghty PE, Schroeder CE. Auditory cortical neurons respond to somatosensory stimulation. J Neurosci (2003) 23:75107515.
Ghazanfar AA, Maier JX, Hoffman KL, Logothetis NK. Multisensory integration of dynamic faces and voices in rhesus monkey auditory cortex. J Neurosci (2005) 25:50045012.
Giard MH, Peronnet F. Auditory-visual integration during multimodal object recognition in humans: a behavioral and electrophysiological study. J Cogn Neurosci (1999) 11:473490.
Goense JB, Logothetis NK. Laminar specificity in monkey V1 using high-resolution SE-fMRI. Magn Reson Imaging (2006) 24:381392.[CrossRef][ISI][Medline]
Gonzalez Andino SL, Murray MM, Foxe JJ, de Peralta Menendez RG. How single-trial electrical neuroimaging contributes to multisensory research. Exp Brain Res (2005) 166:298304.[CrossRef][ISI][Medline]
Haxby JV, Horwitz B, Ungerleider LG, Maisog JM, Pietrini P, Grady CL. The functional organization of human extrastriate cortex: a PET-rCBF study of selective attention to faces and locations. J Neurosci (1994) 14:63366353.[Abstract]
Henson RN, Price CJ, Rugg MD, Turner R, Friston KJ. Detecting latency differences in event-related BOLD responses: application to words versus nonwords and initial versus repeated face presentations. Neuroimage (2002) 15:8397.[CrossRef][ISI][Medline]
Josephs O, Turner R, Friston K. Event-related fMRI. Hum Brain Mapp (1997) 5:243248.[CrossRef][ISI]
Kawashima R, O'Sullivan BT, Roland PE. Positron-emission tomography studies of cross-modality inhibition in selective attentional tasks: closing the "mind's eye". Proc Natl Acad Sci USA (1995) 92:59695972.
Kayser C, Petkov CI, Augath M, Logothetis NK. Integration of touch and sound in auditory cortex. Neuron (2006) 48:373384.[CrossRef][ISI]
Laurienti PJ, Burdette JH, Wallace MT, Yen YF, Field AS, Stein BE. Deactivation of sensory-specific cortex by cross-modal stimuli. J Cogn Neurosci (2002) 14:420429.
Laurienti PJ, Perrault TJ, Stanford TR, Wallace MT, Stein BE. On the use of superadditivity as a metric for characterizing multisensory integration in functional neuroimaging studies. Exp Brain Res (2005) 166:289297.[CrossRef][ISI][Medline]
Logothetis NK. The underpinnings of the BOLD functional magnetic resonance imaging signal. J Neurosci (2003) 23:39633971.
Martuzzi R, Murray MM, Maeder PP, Fornari E, Thiran JP, Clarke S, Michel CM, Meuli RA. Visuo-motor pathways in humans revealed by event-related fMRI. Exp Brain Res (2006) 170:472487.[CrossRef][ISI][Medline]
Miller J. Divided attention: evidence for coactivation with redundant signals. Cognit Psychol (1982) 14:247279.[CrossRef][ISI][Medline]
Molholm S, Ritter W, Murray MM, Javitt DC, Schroeder CE, Foxe JJ. Multisensory auditory-visual interactions during early sensory processing in humans: a high-density electrical mapping study. Brain Res Cogn Brain Res (2002) 14:115128.[CrossRef][Medline]
Murray MM, Molholm S, Michel CM, Heslenfeld DJ, Ritter W, Javitt DC, Schroeder CE, Foxe JJ. Grabbing your ear: rapid auditory-somatosensory multisensory interactions in low-level sensory cortices are not constrained by stimulus alignment. Cereb Cortex (2005) 15:963974.
Pekkola J, Ojanen V, Autti T, Jaaskelainen IP, Mottonen R, Tarkiainen A, Sams M. Primary auditory cortex activation by visual speech: an fMRI study at 3 T. Neuroreport (2005) 16:125128.[CrossRef][ISI][Medline]
Raab D. Statistical facilitation of simple reaction times. Trans N Y Acad Sci (1962) 24:574590.[ISI][Medline]
Rademacher J, Morosan P, Schormann T, Schleicher A, Werner C, Freund HJ, Zilles K. Probabilistic mapping and volume measurement of human primary auditory cortex. Neuroimage (2001) 13:669683.[ISI][Medline]
Rockland KS, Ojima H. Multisensory convergence in calcarine visual areas in macaque monkey. Int J Psychophysiol (2003) 50:1926.[CrossRef][ISI][Medline]
Schroeder CE, Foxe JJ. Multisensory contributions to low-level, unisensory processing. Curr Opin Neurobiol (2005) 15:454458.[CrossRef][ISI][Medline]
Schroeder CE, Lindsley RW, Specht C, Marcovici A, Smiley JF, Javitt DC. Somatosensory input to auditory association cortex in the macaque monkey. J Neurophysiol (2001) 85:13221327.
Schroeder CE, Mehta A, Givre SJ. A spatiotemporal profile of visual system activation revealed by current source density analysis in the awake macaque. Cereb Cortex (1998) 8:575592.
Schroeder CE, Smiiley J, Fu KG, McGinnis T, O'Connell MN, Hackett TA. Anatomical mechanisms and functional implications of multisensory convergence in early cortical processing. Int J Psychophysiol (2003) 50:518.[CrossRef][ISI][Medline]
Schröger E, Widmann A. Speeded responses to audiovisual signal changes result from bimodal integration. Psychophysiology (1998) 35:755759.[CrossRef][ISI][Medline]
Stein BE, Meredith MA. The merging of the senses (1993) Cambridge, MA: MIT Press.
Tanabe HC, Honda M, Sadato T. Functionally segregated neural substrates for arbitrary audiovisual paired-association learning. J Neurosci (2005) 25:64096418.
van Atteveldt N, Formisano E, Goebel R, Blomert L. Integration of letters and speech sounds in the human brain. Neuron (2004) 43:271282.[CrossRef][ISI][Medline]
Wallace MT, Ramachandran R, Stein BE. A revised view of sensory cortical parcellation. Proc Natl Acad Sci USA (2004) 101:21672172.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. F. Alpert, G. Hein, N. Tsai, M. J. Naumer, and R. T. Knight Temporal Characteristics of Audiovisual Information Processing J. Neurosci., May 14, 2008; 28(20): 5344 - 5349. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Romei, M. M. Murray, L. B. Merabet, and G. Thut Occipital Transcranial Magnetic Stimulation Has Opposing Effects on Visual and Auditory Stimulus Detection: Implications for Multisensory Interactions J. Neurosci., October 24, 2007; 27(43): 11465 - 11472. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



