Cerebral Cortex Advance Access originally published online on May 15, 2007
Cerebral Cortex 2008 18(1):230-242; doi:10.1093/cercor/bhm049
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Beyond Superior Temporal Cortex: Intersubject Correlations in Narrative Speech Comprehension
1 Ahmanson-Lovelace Brain Mapping Center, 2 Neuroscience Interdepartmental Program, 3 Department of Psychiatry and Biobehavioral Sciences, 4 Semel Institute for Neuroscience and Human Behavior, 5 Center for the Biology of Creativity, 6 Brain Research Institute, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
Address correspondence to Stephen M. Wilson, Laboratory for Cognitive Brain Research, Department of Cognitive Sciences, University of California, Irvine, CA 92697, USA. Email: stephenw{at}ucla.edu.
| Abstract |
|---|
|
|
|---|
The role of superior temporal cortex in speech comprehension is well established, but the complete network of regions involved in understanding language in ecologically valid contexts is less clearly understood. In a functional magnetic resonance imaging (fMRI) study, we presented 24 subjects with auditory or audiovisual narratives, and used model-free intersubject correlational analyses to reveal brain areas that were modulated in a consistent way across subjects during the narratives. Conventional comparisons to a resting state were also performed. Both analyses showed the expected recruitment of superior temporal areas, however, the intersubject correlational analyses also revealed an extended network of areas involved in narrative speech comprehension. Two findings stand out in particular. Firstly, many areas in the "default mode" network (typically deactivated relative to rest) were systematically modulated by the time-varying properties of the auditory or audiovisual input. These areas included the anterior cingulate and adjacent medial frontal cortex, and the posterior cingulate and adjacent precuneus. Secondly, extensive bilateral inferior frontal and premotor regions were implicated in auditory as well as audiovisual language comprehension. This extended network of regions may be important for higher-level linguistic processes, and interfaces with extralinguistic cognitive, affective, and interpersonal systems.
Key Words: default mode narrative speech comprehension
| Introduction |
|---|
|
|
|---|
The central role of the superior temporal cortex in speech comprehension has been known for over a century, since the pioneering work of Wernicke (1874)
However, it is clear that in everyday use, language must interface with numerous other systems such as working memory, conceptual knowledge, emotion, and social cognition. So we would expect that many brain regions beyond superior temporal cortex must be involved in narrative speech comprehension. Several neuroimaging studies of narrative comprehension have indeed suggested the involvement of a number of regions beyond classical perisylvian language areas (Fletcher et al. 1995
; Xu et al. 2005
; for review see Mar 2004
). In particular, medial prefrontal cortex has been implicated in a number of studies, and has been interpreted as reflecting social cognitive processes such as "theory of mind" (Fletcher et al. 1995
; Gallagher et al. 2000
; Ferstl and von Cramon 2002
; Xu et al. 2005
). Several studies have also shown the involvement of more posterior midline regions in posterior cingulate cortex and/or the precuneus, which may be involved in linking incoming information with prior knowledge, or episodic memory retrieval (Fletcher et al. 1995
; Maguire et al. 1999
; Ferstl and von Cramon 2002
; Xu et al. 2005
; Schmithorst et al. 2006
). Narrative-related activations have been observed in the posterior superior temporal sulcus (STS) or angular gyrus; these regions are important for a range of cognitive processes including attention, mental imagery and social cognition, all of which would plausibly be components of understanding discourse (Fletcher et al. 1995
; Gallagher et al. 2000
; Ferstl and von Cramon 2002
; Xu et al. 2005
; Schmithorst et al. 2006
). Another theme is that a shift toward greater right-hemisphere involvement of numerous regions is associated with language in context (e.g., St. George et al. 1999
; Robertson et al. 2000
; Xu et al. 2005
).
Many of the studies that have succeeded in identifying extra-perisylvian regions involved in narrative comprehension have employed written materials (Fletcher et al. 1995
; St. George et al. 1999
; Gallagher et al. 2000
; Robertson et al. 2000
; Xu et al. 2005
) or have relied on subtle manipulations of the extent to which sentences cohere with one another (Ferstl and von Cramon 2002
) or with prior context (Maguire et al. 1999
). However, in marked contrast to these findings, studies of auditory narrative comprehension, where listening to narratives has been contrasted with resting blocks or acoustic control conditions, have not consistently identified any regions besides the superior temporal cortex bilaterally and the left IFG (Mazoyer et al. 1993
; Dehaene et al. 1997
; Perani et al. 1998
; Giraud et al. 2000
; Papathanassiou et al. 2000
; Ahmad et al. 2003
; Crinion et al. 2003
; Tzourio-Mazoyer et al. 2004
; Crinion and Price 2005
; Skipper et al. 2005
; Schmithorst et al. 2006
; Alho et al. 2006
). Extra-perisylvian regions identified in small subsets of these studies include the right IFG (Dehaene et al. 1997
; Tzourio-Mazoyer et al. 2004
), the precuneus (Perani et al. 1998
; Schmithorst et al. 2006
), and regions in the vicinity of the angular gyrus (Perani et al. 1998
; Crinion et al. 2003
; Schmithorst et al. 2006
).
One possibility is that the necessity of comparing speech comprehension with some baseline obscures activity in brain areas involved in higher levels of comprehension, beyond auditory processing. Some studies of speech comprehension have used resting baselines (e.g., Mazoyer et al. 1993
; Skipper et al. 2005
), whereas many others have used acoustically matched control conditions such as backwards speech (e.g., Dehaene et al. 1997
; Crinion et al. 2003
), but in either case, higher-level cognitive processes which are difficult to constrain presumably take place during the baseline conditions. Even regions which are neither activated nor deactivated relative to a baseline might nevertheless be involved in speech comprehension, because mean signal could be statistically equivalent even though distinct processes are taking place in each condition.
In particular, a set of brain areas termed the "default mode" network (Raichle et al. 2001
) have been observed to be consistently deactivated relative to rest or passive sensory processing when subjects engage in a variety of different tasks; these default mode areas include the anterior cingulate and adjacent medial frontal cortex, the posterior cingulate and adjacent precuneus, and the left and right angular gyri (Shulman et al. 1997
; Binder et al. 1999
; Gusnard and Raichle 2001
; Mazoyer et al. 2001
; Raichle et al. 2001
; McKiernan et al. 2003
, 2006
). These areas are thought to be involved in ongoing internal processes at rest, such as semantic processing, and monitoring of internal states and the external environment. Semantic processing is an important aspect of speech comprehension, so some default mode areas may be essential components of a wider language comprehension network (Binder et al. 1999
; McKiernan et al. 2003
, 2006
; Iacoboni et al. 2004
). Furthermore, the content of perceived speech can provide information concerning the environment, or influence the listener's internal state directly, so other default mode areas may also interface with areas involved in speech perception.
To circumvent the issues which arise when comparing a condition of interest with a baseline, we presented subjects with naturalistic auditory or audiovisual narratives, and used model-free intersubject correlational analysis (Hasson et al. 2004
) to identify cortical areas which are systematically modulated by the linguistic input and the processing it entails. This method of analysis requires no control condition, instead identifying as significant those voxels which tend to respond similarly across subjects over the course of a stimulus that varies in time along dimensions of interest. This implies that neural activity in these voxels must be sensitive to time-varying properties of the stimulus, such as dynamic changes in demands on phonological, syntactic, semantic, or extralinguistic processing. Our results revealed the involvement of numerous regions not typically reported in studies of auditory narrative comprehension, including much of the default mode network, and extensive bilateral inferior frontal and premotor areas. This extended set of regions may be important for higher-level linguistic processes and interfaces with conceptual and affective representations.
| Materials and Methods |
|---|
|
|
|---|
Participants
A total of 24 native English-speaking participants were scanned with functional magnetic resonance imaging (fMRI). Twelve subjects (3 males, mean age 24.2, range 19–33 years) listened to auditory cartoon narratives, and 12 subjects (6 males, mean age 24.7 years, range 20–31 years) viewed and listened to audiovisual cartoon narratives. All participants gave written informed consent and were compensated for their participation, and the study was approved by the UCLA Institutional Review Board.
Experimental Design
The auditory and audiovisual stimuli consisted of cartoon narrations (McNeill 1992
). We showed an actor Looney Tunes cartoons from the video "Carrotblanca" (Fig. 1a, Warner Brothers Family Entertainment) and she was videotaped while recounting the plots of various stories to a listener behind the camera (Fig. 1b). The actor's hands and face were visible at all times, so language-related visual stimuli included mouth movements, head movements, and numerous beat, iconic and other gestures. The actor, who was not a professional, was given no instructions regarding the storytelling, however, she was chosen because she naturally produced prolific and expressive gestures.
|
In the fMRI experiment, each subject was scanned during 2 runs. In one run, the narratives "Carrotblanca" (6'32'') and "Hare Do" (6'41'') were presented, and in the other run "Dripalong Daffy" (4'40''), "The Scarlet Pumpernickel" (4'31'') and "Box Office Bunny" (2'57'') were presented. There were 16 s of rest (with blank screen) between narratives, as well as at the start and end of each run. The order of runs, and of narratives within runs, was counterbalanced across subjects. Subjects in the auditory-only and audiovisual groups heard exactly the same soundtracks, so the only difference between the groups was the presence or absence of visual information.
Subjects were instructed simply to watch and/or listen to the narratives, and were told that they would be asked questions about the plots. The soundtracks were presented through scanner-compatible headphones at a volume sufficiently loud that the speech could be readily perceived over the scanner noise. The sound volume was set individually for each subject to a comfortable level during preliminary scans. Subjects in the auditory-only condition in particular reported that it was necessary to concentrate and pay attention in order to follow the plots of the narratives over the background scanner noise. When asked questions after the scanning session, subjects in both groups had no difficulty in recalling elements of the plots.
The visual stimuli were presented through custom-made goggles (Resonance Technology Inc., Northridge, CA).
Image Acquisition
Functional images were acquired on a 3-T Siemens Allegra scanner at the Ahmanson-Lovelace Brain Mapping Center at UCLA. There were 2 functional runs (repetition time [TR] = 2000 ms; echo time [TE] = 25 ms; flip angle = 90°; 36 axial slices with interleaved acquisition; 3 x 3 x 4 mm resolution; field of view = 192 x 192 x 144 mm). The number of volumes acquired was 421 for the 2 longer narratives, or 397 for the 3 shorter narratives. In addition, 2 volumes were acquired and discarded to allow for magnetization to reach steady state.
For registration purposes, high-resolution T2-weighted images coplanar with the functional images were acquired (TR = 5000 ms; TE = 33 ms; flip angle = 90°; 36 axial slices; 1.5 x 1.5 x 4 mm resolution; field of view = 192 x 192 x 144 mm).
Image Processing
The fMRI data were preprocessed using tools from FMRIB Software Library (Smith et al., 2004
): after skull stripping and motion correction, the data were smoothed with a Gaussian kernel (8mm FWHM) and mean signal intensity was normalized across subjects.
Functional images were aligned using FMRIB's Linear Image Registration Tool to high-resolution coplanar images via an affine transformation with 6 degrees of freedom. High-resolution coplanar images were then aligned to the standard Montreal Neurological Institute (MNI) average of 152 brains using an affine transformation with 12 degrees of freedom.
Standard Analysis
A standard subtraction analysis comparing auditory or audiovisual language comprehension with rest was performed with the FMRISTAT toolbox (Worsley et al. 2002
) in MATLAB (Mathworks, Natick, MA). A general linear model was fit to the data from each voxel in each subject, in functional space. The boxcar design matrix was convolved with a hemodynamic response function modeled as a difference of 2 gamma functions. Temporal drift was removed by adding a cubic spline in the frame times to the design matrix (one covariate per 2 min of scan time), and spatial drift was removed by adding a covariate in the whole volume average. Six motion parameters (3 each for translation and rotation) were also included as confounds of no interest. Autocorrelation parameters were estimated at each voxel and used to whiten the data and design matrix. The 2 runs within each subject were combined using a fixed effects model, then the resulting statistical images were registered to MNI space by concatenating the transformation matrices derived above.
Group analysis was performed for each of the 2 groups (auditory only, and audiovisual) with FMRISTAT, using a mixed effects linear model (Worsley et al. 2002
). Standard deviations from individual subject analyses were passed up to the group level. Variance ratio images were not smoothed (i.e., a conventional group analysis was performed). The resulting t statistic images were thresholded at t > 3.106 (df = 11, P < 0.005 uncorrected) at the voxel level, with a minimum cluster size then applied so that only clusters significant at P < 0.05 (corrected) according to Gaussian Random Field (GRF) theory were reported.
The 2 groups were compared with one another using a mixed effects linear model implemented with FMRISTAT. In this case, t statistic images were thresholded at t > 2.819 (df = 22, P < 0.005 uncorrected), before being corrected based on GRF theory as above.
Statistical parameter maps were displayed as overlays on a high-resolution single subject T1 image (Holmes et al. 1998
) using Analysis of Functional Neuroimages (Cox 1996
). In the tables of regions showing significant signal increases or decreases, anatomical labels were determined manually by inspecting significant regions in relation to the anatomical data averaged across the subjects, with reference to an atlas of neuroanatomy (Duvernoy 1999
). In cases of large activated areas spanning more than one region, prominent local maxima were identified and tabulated separately.
Supplementary analyses were performed including further continuously varying explanatory variables in addition to the "boxcar" variable which modeled the narratives, in order to model some of the internal structure of the narrative blocks. In the auditory-only condition, the root mean square (RMS) energy of the speech signal was included, and for the audiovisual condition, this auditory variable was included along with 2 additional variables quantifying the speed of motion of the actor's left and right hands. Each of these variables varied continuously with bins of 100 ms. The RMS energy was determined using a custom MATLAB script, and the actor's hand positions were tracked manually on the videos and the difference between positions at each 100-ms interval was calculated using another MATLAB script. Each continuous variable was convolved with the same hemodynamic response function as the boxcar variable.
Intersubject Correlational Analysis
The intersubject correlational analysis was based on the methods described by Hasson et al. (2004)
. Each subject's preprocessed functional data was transformed to MNI space, and split up according to narrative. Then a model was fit for each narrative consisting of temporal drift terms (a cubic spline in the frame times, one covariate per 2 min of scan time), 6 motion parameters as above, and the whole volume average, none of which were convolved with a hemodynamic response function. Removing the whole volume average is similar to factoring out what is termed the "nonspecific component" by Hasson et al. (2004)
. The whole volume average was highly correlated across subjects watching the same movies, and removing it reduces estimates of intersubject correlation (Hasson et al. 2004
). Furthermore, the first 16 s of each narrative were excluded, so that common responses to the onset of the narrative (following on from rest) could not account for intersubject correlations. Model fitting was performed with FMRISTAT, and the residuals from this analysis were saved and used for the next stage.
Intersubject correlation maps were then constructed for every pair of subjects belonging to the same group (auditory or audiovisual). There were 12 subjects in each group, so there were 66 pairwise maps created for each group (Fig. 1c). These maps were created by a custom MATLAB program that computed the correlation coefficient r between residual timecourses obtained above at each voxel. The r statistic is not normally distributed, so it was converted to a normal distribution using the Fisher z transformation: z = log((1 + r)/(1 – r))/2. In practice, this correction makes little difference for relatively small values of r such as were obtained in this study.
Group analyses were performed to discover at which voxels the intersubject correlations were significantly greater than zero. Note that under the null hypothesis, the expected value of r, and hence of z, is 0, because correlations would be positive or negative at random if the voxel in question is insensitive to the stimulus.
However, we were concerned that for each comparison we have 66 z scores, but only 12 subjects. To discover the distribution of the t statistic in this case, a null data set was created by shifting the data in time, such that time series were no longer aligned across subjects. The algorithm was run as above, except that at each voxel, the 2 time series being compared were both offset by a random number of volumes. For instance, supposing that a narrative was 50 volumes long, and the randomly chosen first volume was 10, then the volumes were rearranged in the order (10, 11, 12, ..., 49, 50, 1, 2, 3, ..., 8, 9). The 2 time series being compared were offset from one another by at least 5 volumes. Note that the discontinuity created by wrapping the data around does not significantly distinguish the null data from the real data, because temporal autocorrelation was very low in the residual data sets (
< 0.03 in most voxels). This was confirmed based on simulations with randomly generated data based on autoregressive models with various parameters.
The null data set was analyzed with FMRISTAT to derive a t statistical parameter map, and we examined the distribution of the t statistic. We found that it was distributed approximately as t (65) (Fig. 1d). In particular, to threshold a t (65) map at voxelwise P < 0.005 requires a threshold of t > 2.654. The proportion of observations with t > 2.654 in the null data set was 0.0039. Finally, note that the observed distribution of the unshifted real data, also depicted in Figure 1d, is very different: many voxels were significantly correlated across subjects.
In sum, it appears that a t statistic generated based on the 66 pairwise images is distributed as approximately t (65) under the null hypothesis, and can be treated as such for the purpose of thresholding. Group analyses of the intersubject correlational maps were therefore performed as above, except t statistic images were thresholded at t > 2.654 (df = 65, P < 0.005 uncorrected) at the voxel level for each group, and at t > 2.614 (df = 130, P < 0.005 uncorrected) for the between-group comparison, then a minimum cluster size based on GRF theory was applied. Statistical parameter maps were displayed and tables created as described above.
| Results |
|---|
|
|
|---|
The group data for auditory-only speech comprehension are shown in Figure 2a and Table 1. The standard subtraction analysis (green outlines) revealed signal increases in bilateral superior temporal cortex, consistent with numerous previous studies of narrative comprehension (e.g., Mazoyer et al. 1993
|
|
The intersubject correlational analysis (red–yellow–white color scale) also demonstrated robust intersubject correlations in bilateral superior temporal cortex, paralleling the results of the standard analysis. However, numerous additional regions showed reliable intersubject correlations. These included several midline areas: the anterior cingulate gyrus, medial superior frontal gyrus, posterior cingulate, and precuneus, which were mostly deactivated relative to rest in the standard analysis. The intercorrelated regions in superior temporal cortex extended much more posteriorly and dorsally into the angular gyri in both hemispheres. There were extensive bilateral inferior frontal regions that were intercorrelated among subjects, extending into premotor cortex in the precentral gyrus.
For the subjects in the audiovisual speech comprehension group, the results were similar in many respects (Fig. 2b, Table 2). The most prominent differences were that activations, as well as reliable intersubject correlations, were observed in early visual areas and visual motion areas, reflecting the fact that the stimuli also involved the visual modality. Signal decreases, though only modest intersubject correlations, occurred in anterior occipital regions, where the peripheral visual field (which was not stimulated) is represented (Engel et al. 1994
). Similar signal decreases have been shown to most likely reflect reduced neural activity in nonstimulated visual areas, perhaps a form of surround suppression (Shmuel et al. 2002
).
|
As in the auditory-only condition, sizeable bilateral inferior frontal regions extending into premotor areas were intercorrelated across subjects. In this case, bilateral inferior frontal activity was also found relative to rest in the standard analysis, albeit considerably more circumscribed.
The audiovisual and auditory-only groups were then directly compared (Fig. 3, Tables 3 and 4). In the standard analysis, the only regions showing greater signal change in the audiovisual condition relative to the auditory condition were early visual and visual motion areas (Fig. 3a). The intersubject correlational analysis also showed significantly greater correlations across subjects in these areas, along with one additional region: the right posterior STS, previously implicated in perception of biological motion (Allison et al. 2000
; Pelphrey et al. 2005
).
|
|
|
Although in the standard analysis bilateral IFG activations were observed only for the audiovisual group, this difference between groups did not prove to be significant. No frontal regions were significantly more correlated among audiovisual subjects, but there were such areas that did not reach the minimum cluster size; peak coordinates were (–56, 16, 20; t = 3.0) in the left dorsal pars opercularis, and (42, 12, 24; t = 3.7) in the right inferior frontal junction.
The reverse comparison—auditory-only relative to audiovisual—is reported in Figure 3b and Table 4. The standard analysis showed greater activity relative to rest in the auditory group in bilateral primary auditory cortex in the transverse temporal gyri (Rademacher et al. 2001
). The intersubject correlational analysis did not show reliable correlations across groups in the transverse temporal gyri, however, reliable differences in intersubject correlations were observed more ventrally, centered in the anterior STS, in both hemispheres. These STS regions extended as far anteriorly as the temporal pole; clusters extended from y = –42 to y = 32 on the left, and from y = –36 to y = 24 on the right. A number of premotor and prefrontal areas were also more closely correlated across auditory-only than audiovisual subjects: the left ventral precentral gyrus, left orbital gyrus, left inferior frontal sulcus/middle frontal gyrus and left anterior superior frontal gyrus, the right inferior frontal sulcus/middle frontal gyrus, and the right anterior superior frontal gyrus.
Because the standard analysis employed only a simple "boxcar" variable to model each narrative, we carried out further analyses including the RMS energy of the speech signal in both the auditory-only and the audiovisual conditions, and 2 additional variables quantifying the speed of motion of the actor's left and right hands for the audiovisual condition. The regions activated in these analyses are shown in Supplementary Tables 1 and 2. The sets of regions activated by the boxcar regressors in these fuller models were very similar to the analyses reported above where the boxcar regressors were the only explanatory variables in the models. The RMS energy of the speech signal was positively correlated with the transverse temporal gyri bilaterally in each group, reflecting activation of primary auditory cortex. The hand motion regressors were correlated with ipsilateral early visual areas (because, for instance, the actor's left hand appears in the subject's right visual field, which projects to left visual cortex), as well as bilateral visual motion areas, in some cases extending into the STS. Almost all voxels activated by these additional regressors were also activated by the boxcar regressors, so although these variables confirmed the roles of various sensory regions, they did not reveal additional areas which may have been missed by the simple boxcar analyses.
| Discussion |
|---|
|
|
|---|
Both the standard analysis and the intersubject correlational analysis replicated the involvement of bilateral temporal areas in speech comprehension, which has been shown in numerous prior studies (for review see Hickok and Poeppel 2004
Default Mode Network
A consistent set of brain regions are deactivated in multiple different active task conditions in comparison with passive or resting conditions. Regions commonly deactivated include the ventral anterior cingulate gyrus, dorsomedial frontal cortex, posterior cingulate cortex and the precuneus, and the angular gyrus (Shulman et al. 1997
; Binder et al. 1999
; Mazoyer et al. 2001
; Gusnard and Raichle 2001
; McKiernan et al. 2003
).
In the standard analysis, deactivations relative to rest were observed in all of these regions in the present study (see Fig. 2, Tables 1 and 2). The most widely accepted explanation for these signal changes is that they represent the attenuation of a default mode involving processes such as monitoring of internal and external states, and "stream of consciousness" (Shulman et al. 1997
; Binder et al. 1999
; Gusnard and Raichle 2001
; McKiernan et al. 2003
).
A novel finding of the present study is that many of these regions were robustly correlated across subjects, as revealed in the intersubject correlational analysis. Data from the rest condition, as well as transitional volumes between rest and task, did not even enter into this analysis, so these correlations cannot reflect processes related to the resting state per se. Rather, the correlations must reflect modulation of these regions by the time-varying content of the narratives, and the linguistic, conceptual and affective processing which they entail. This demonstrates that default mode regions are not simply shut off in response to an active task. Instead, the data suggest 2 possible interpretations, which are not mutually exclusive. The first is that the narratives make differential demands as a function of time on the processes subserved by the default mode network. This appears likely given the evidence that semantic processing is one function of default mode areas (Binder et al. 1999
; McKiernan et al. 2003
). For instance, some parts of the narratives may be more semantically complex than other parts, so regions involved in semantic processing may be more active during the more complex stages of the narratives, consistently across subjects. The second interpretation is that the global level of engagement may vary in the narratives as a function of time, and this may contribute to the intersubject correlations observed in default mode areas. It has been shown that default mode regions are systematically downregulated as a function of task difficulty (Greicius and Menon 2004
; McKiernan et al. 2006
), so it is plausible that during parts of the narratives that are more engaging, default mode activity is more downregulated, which would result in correlations across subjects to the extent that subjects find the same parts of the narratives more or less engaging.
The functions of the various regions which make up the default mode network are not well understood, however, functional interpretations have been proposed for each area. The ventral, rostral section of the anterior cingulate gyrus appears to be involved with affective and emotional processes, whereas dorsal anterior cingulate cortex is more concerned with cognitive and motor functions (Bush et al. 2000
). The adjacent dorsomedial prefrontal cortex is thought to be concerned with monitoring one's own internal state, as well as attributing mental states to others (Frith and Frith 1999
), or with social processing more generally (Iacoboni et al. 2004
). As for the posterior midline regions, Gusnard and Raichle (2001)
have proposed that the role of these areas in the default mode network is to represent and monitor the external environment. Activations of posterior midline regions in narrative comprehension studies have been interpreted as reflecting the linking of incoming information with prior knowledge, and episodic memory retrieval (e.g., Xu et al. 2005
).
The dorsal part of the angular gyrus bilaterally was deactivated relative to rest in both the auditory and audiovisual groups, consistent with its part in the default mode network. However, unlike the other major default mode regions, significant intersubject correlations were not observed in this part of the angular gyrus. Importantly though, bilateral superior temporal regions showing correlations across subjects extended dorsally and posteriorly to include the posterior STS and the ventral part of the angular gyrus. This contrasted with the standard analysis, where these superior temporal activations did not extend so far back. Thus, there is a discrepancy between the 2 methods, in that the intersubject correlational analysis implies the involvement of posterior superior temporal and inferior parietal regions that are not more active than rest in the standard analysis. The results from the intersubject correlational analysis are more consistent with lesion studies, which have demonstrated that lesions to this region produce conduction aphasia (Green and Howes 1978
). In general, this area has been argued to be important for auditory to articulatory mapping in language comprehension and production (Hickok and Poeppel 2000
, 2004
). We suggest that in the standard analysis the involvement of this region in speech comprehension is obscured, because it lies adjacent to the deactivated dorsal part of the angular gyrus. But the dorsal part of the angular gyrus that was deactivated relative to rest was not correlated across subjects and so appears to be concerned with internal processes that are not systematically modulated by linguistic input.
Previous studies of auditory narrative comprehension have rarely reported deactivations relative to baseline, and default mode regions have usually not been activated relative to baseline; exceptions in a handful of studies include the precuneus (Perani et al. 1998
; Schmithorst et al. 2006
) and regions in the vicinity of the angular gyrus (Perani et al. 1998
; Crinion et al. 2003
; Schmithorst et al. 2006
).
Our results demonstrating intersubject correlations in default mode regions are at variance with those of Golland et al. (2007)
, who argued for a partition of cortical areas into an "extrinsic" system concerned with processing of sensory input, which was correlated across multiple presentations of the same time-varying audiovisual stimulus (a movie), and an "intrinsic" system important for internal processes, which was not correlated across multiple presentations of the same movie. The intrinsic system was argued to have much in common with the default mode network. Golland et al. (2007)
defined the intrinsic system as voxels correlated with the timecourse of "seed" regions of interest in the inferior parietal cortex (IPC), which was chosen because it was the area which most consistently did not show correlations between repeated presentations of the same movie (similar to the angular gyri in our study). Significant intersubject correlations were not observed in the intrinsic system, which included most default mode areas with the exception of the anterior cingulate gyrus.
We propose 2 possible reasons for this discrepancy with our results. Firstly, Golland et al. (2007)
assessed correlations between signal in response to 2 presentations of the same movie to each subject, rather than calculating correlations across subjects. If default mode regions are especially important for higher-level cognitive and affective processes, rather than more basic sensory processes, then it is logical that they respond differently to a movie which had already been seen recently. This might contribute to explaining the lack of correlations observed. In a previous study by the same group, correlated regions potentially in the default mode network were reported in the cingulate gyrus and retrosplenial cortex (Hasson et al. 2004
).
A second major difference between our study and Golland et al. (2007)
is that we used videos with constant linguistic content, whereas they presented subjects with a segment of a feature movie which contained language only some of the time. It is possible that the default mode regions we observed to be intercorrelated across subjects are especially involved in higher-level linguistic processes in particular, and are not engaged in such a consistent manner across individuals for different kinds of stimuli.
IFG and Premotor Cortex
Intersubject correlational analyses revealed extensive bilateral regions in the IFG and adjacent premotor cortex where there were significant intersubject correlations. This implies that these regions are sensitive to time-varying properties of the input and the computations entailed. The left IFG in particular (i.e., Broca's area) has been demonstrated to be involved in semantic, syntactic and phonological processes in both speech production and comprehension (Bookheimer 2002
). Because the information content in each of these domains is constantly varying in the course of a narrative, the intersubject correlations in this region are not surprising. Left frontal activations have been observed in most previous studies of auditory narrative comprehension, though the precise regions reported have generally been much more circumscribed and have varied considerably from study to study. In the standard analysis in the present study, there were actually no significant activations in the IFG in either hemisphere in the auditory-only group. Although small clusters of voxels were observed in the pars triangularis of each hemisphere exceeding the threshold corresponding to P < 0.005, their cluster sizes were not close to significance: 800 mm3 in the left hemisphere (P = 0.78), and 336 mm3 in the right hemisphere (P = 1.00).
Activity in the right IFG was also shown to be highly significantly correlated across subjects, to a degree similar to the left IFG. Right IFG involvement has rarely been reported in previous studies of auditory narrative comprehension, with occasional exceptions (Dehaene et al. 1997
; Tzourio-Mazoyer et al. 2004
). However, right-hemisphere areas, including the IFG, are thought to play a role in a range of linguistic processes including prosody (Ross 1981
; Wildgruber et al. 2005
) and understanding of higher-level discourse (St. George et al. 1999
; Robertson et al. 2000
; Xu et al. 2005
; see Bookheimer 2002
; Jung-Beeman 2005
, for review). We propose that the robust correlations across subjects that we observed in the right IFG reflect the sensitivity of the right IFG to modulation of such higher-level processes.
Why are inferior frontal activations in either hemisphere so much less extensive in previous studies of narrative speech comprehension, and in the standard analysis in the present study? High activity at rest or in passive conditions probably cannot account for the failure to observe bilateral IFG activity in narrative comprehension studies, because only parts of the left IFG (and not the right) have been suggested to belong to the default mode network (Shulman et al. 1997
; Binder et al. 1999
), and even the left IFG has not been identified in all studies (McKiernan et al. 2003
; Greicius and Menon 2004
). Rather, our results suggest that the left and right IFG do not exhibit a consistent signal increase or decrease during narrative comprehension, but rather they show a consistent signal fluctuation which tracks one or more aspects of the input. Precisely which aspects are tracked cannot be determined from our study, but recent reviews of the literature shed some light on the kinds of processes the left and right IFG might be concerned with (Bookheimer 2002
; Jung-Beeman 2005
).
The left ventral precentral gyrus (ventral premotor cortex), and bilateral regions spanning the inferior frontal sulcus and middle frontal gyrus, were more correlated across subjects in the auditory-only group than the audiovisual group. Comprehension of the narratives was considerably more difficult in the auditory-only condition, due to the lack of visual phonemic cues and the interference of the scanner noise with the auditory stimuli. This suggests that the differential recruitment of these frontal areas may reflect increased processing difficulty. In particular, we propose that frontal areas may play a role in generating top-down models of hypothesized linguistic structures, which would be assessed with respect to the acoustic input in superior temporal regions. A recent study has argued for a similar role for premotor cortex in low-level phonetic perception (Wilson and Iacoboni 2006
). Under this view, increased intersubject correlations in the auditory-only group would reflect common modulations across subjects for parts of the narratives that were more difficult to understand and made increased demands on top-down processes.
Regions Differentially Implicated in Audiovisual Speech Perception
Besides early visual and visual motion areas, there was just one region that showed significantly greater correlations within the audiovisual group compared with the auditory group: the right STS. The STS, particularly in the right hemisphere, has been demonstrated in numerous studies to be important for perception of biological motion (Allison et al. 2000
; Pelphrey et al. 2005
). Our audiovisual stimuli contained movements of the arms, hands, head, mouth, and eyes. Another context in which the STS is often implicated is crossmodal binding in audiovisual speech perception (Calvert et al. 2000
; Macaluso et al. 2004
). In a previous study comparing audiovisual narrative comprehension with auditory-only narrative comprehension, Skipper et al. (2005)
also reported greater activation of bilateral posterior superior and middle temporal regions for audiovisual speech.
Although there were no frontal regions which responded significantly more strongly to audiovisual narratives, nor that were more intercorrelated across subjects in the audiovisual condition, bilateral posterior inferior frontal areas were activated relative to rest in the standard analysis for the audiovisual group but not for the auditory-only group. Furthermore, the left dorsal pars opercularis and right inferior frontal junction (adjacent to the pars opercularis) showed greater intersubject correlations for the audiovisual subjects which did not reach the minimum cluster size criterion. These findings are consistent with a large body of research that has implicated regions in the IFG in the coding of actions (Rizzolatti and Craighero 2004
), the actions in the present study being the speech-related gestures produced by the actor, as well as possibly the head, eye and mouth movements. Our identification of the dorsal pars opercularis in particular is consistent with recent data showing that this is the inferior frontal region most systematically implicated in action observation (Iacoboni et al. 2005
; Molnar-Szakacs et al. 2005
, 2006
).
Superior Temporal Cortex
Both the standard analysis and the intersubject correlational analysis revealed greater involvement of superior temporal regions in the more difficult auditory condition relative to the audiovisual condition. However, the precise regions implicated were not identical across the 2 analyses. The standard analysis showed that there was greater activity in the transverse temporal gyri bilaterally, that is, primary auditory cortex. In contrast, the intersubject analysis did not reveal enhanced correlations between subjects in this area, but rather more ventrally and anterior in the STS, extending as far anteriorly as the temporal pole. It is likely that the more challenging auditory-only condition required increased auditory attention, which is known to increase signal in primary sensory areas (Pugh et al. 1996
). However, because the temporal patterns of activity in these areas would simply reflect acoustic properties that are identical in the auditory-only and audiovisual conditions, there was no difference in the extent of intersubject correlations, even though there was more signal change in the auditory condition. On the other hand, activity in the anterior STS regions which showed increased intersubject correlations must reflect not only acoustic information but also linguistic processing, which we suggest would have had a qualitatively different temporal structure in the more heavily taxed auditory-only group. This constitutes evidence in support of a ventral, anterior route for speech perception in superior temporal cortex that has been proposed by several groups (Scott et al. 2000
; Scott and Wise 2004
; Liebenthal et al. 2005
). It is noteworthy though that we observed increased intersubject correlations in the STS bilaterally, supporting the idea that the earliest stages of speech perception are bilateral (Hickok and Poeppel 2000
, 2004
).
Conclusion
Intersubject correlational analysis proved to be a useful complement to conventional subtraction analysis, as it revealed a wide network of regions involved in auditory or audiovisual narrative comprehension. Several "default mode" areas—ventral and dorsal anterior cingulate and adjacent medial frontal regions, and the posterior cingulate and adjacent precuneus—were modulated in a consistent manner across subjects by the narratives, despite being largely deactivated relative to rest. Extensive bilateral inferior frontal and premotor regions were also highly correlated across subjects. We propose that this network of regions beyond the superior temporal cortex is important for higher-level linguistic processes, and interfaces with extralinguistic cognitive, affective, and interpersonal systems.
| Supplementary Material |
|---|
|
|
|---|
Supplementary material can be found at: http://www.cercor.oxfordjournals.org/.
| Acknowledgments |
|---|
We thank Susan Arick, Lisa Aziz-Zadeh, Susan Duncan, Susan Goldin-Meadow, Amy Hubbard, Jonas Kaplan, and Roy Mukamel for their assistance in the design, implementation and analysis of this study, and 2 anonymous reviewers for their helpful comments. For generous support we thank the Brain Mapping Medical Research Organization, Brain Mapping Support Foundation, Pierson-Lovelace Foundation, The Ahmanson Foundation, William M. and Linda R. Dietel Philanthropic Fund at the Northern Piedmont Community Foundation, Tamkin Foundation, Jennifer Jones-Simon Foundation, Capital Group Companies Charitable Foundation, Robson Family, and Northstar Fund. The project described was supported by grants from the National Science Foundation (REC0107077), National Institute of Mental Health (MH63680), and grant numbers RR12169, RR13642, and RR00865 from the National Center for Research Resources (NCRR), a component of the National Institutes of Health (NIH); its contents are solely the responsibility of the authors and do not necessarily represent the official views of NCRR or NIH. Conflict of Interest: None declared.
| References |
|---|
|
|
|---|
Ahmad Z, Balsamo LM, Sachs BC, Xu B, Gaillard WD. Auditory comprehension of language in young children: neural networks identified with fMRI. Neurology (2003) 60:1598–1605.
Alho K, Vorobyev VA, Medvedev SV, Pakhomov SV, Starchenko MG, Tervaniemi M, Naatanen R. Selective attention to human voice enhances brain activity bilaterally in the superior temporal sulcus. Brain Res (2006) 1075:142–150.[CrossRef][Web of Science][Medline]
Allison T, Puce A, McCarthy G. Social perception from visual cues: role of the STS region. Trends Cogn Sci (2000) 4:267–278.[CrossRef][Web of Science][Medline]
Binder JR, Frost JA, Hammeke TA, Bellgowan PS, Rao SM, Cox RW. Conceptual processing during the conscious resting state. A functional MRI study. J Cogn Neurosci (1999) 11:80–95.[CrossRef][Web of Science][Medline]
Bookheimer S. Functional MRI of language: new approaches to understanding the cortical organization of semantic processing. Annu Rev Neurosci (2002) 25:151–188.[CrossRef][Web of Science][Medline]
Bush G, Luu P, Posner MI. Cognitive and emotional influences in anterior cingulate cortex. Trends Cogn Sci (2000) 4:215–222.[CrossRef][Web of Science][Medline]
Calvert GA, Campbell R, Brammer MJ. Evidence from functional magnetic resonance imaging of crossmodal binding in human heteromodal cortex. Curr Biol (2000) 10:649–657.[CrossRef][Web of Science][Medline]
Cox RW. AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. Comput Biomed Res (1996) 29:162–173.[CrossRef][Web of Science][Medline]
Crinion JT, Lambon-Ralph MA, Warburton EA, Howard D, Wise RJ. Temporal lobe regions engaged during normal speech comprehension. Brain (2003) 126:1193–1201.
Crinion J, Price CJ. Right anterior superior temporal activation predicts auditory sentence comprehension following aphasic stroke. Brain (2005) 128:2858–2871.
Dehaene S, Dupoux E, Mehler J, Cohen L, Paulesu E, Perani D, van de Moortele PF, Lehericy S, Le Bihan D. Anatomical variability in the cortical representation of first and second language. Neuroreport (1997) 8:3809–3815.[Web of Science][Medline]
Duvernoy HM. The human brain: surface, three-dimensional sectional anatomy with MRI, and blood supply (1999) New York: Springer.
Engel SA, Rumelhart DE, Wandell BA, Lee AT, Glover GH, Chichilnisky EJ, Shadlen MN. fMRI of human visual cortex. Nature (1994) 369:525.[CrossRef][Medline]
Ferstl EC, von Cramon DY. What does the frontomedian cortex contribute to language processing: coherence or theory of mind? NeuroImage (2002) 17:1599–1612.[CrossRef][Web of Science][Medline]
Fletcher PC, Happe F, Frith U, Baker SC, Dolan RJ, Frackowiak RS, Frith CD. Other minds in the brain: a functional imaging study of "theory of mind" in story comprehension. Cognition (1995) 57:109–128.[CrossRef][Web of Science][Medline]
Frith CD, Frith U. Interacting minds—a biological basis. Science (1999) 286:1692–1695.
Gallagher HL, Happe F, Brunswick N, Fletcher PC, Frith U, Frith CD. Reading the mind in cartoons and stories: an fMRI study of "theory of mind" in verbal and nonverbal tasks. Neuropsychologia (2000) 38:11–21.[CrossRef][Web of Science][Medline]
Giraud AL, Truy E, Frackowiak RS, Gregoire MC, Pujol JF, Collet L. Differential recruitment of the speech processing system in healthy subjects and rehabilitated cochlear implant patients. Brain (2000) 123:1391–1402.
Golland Y, Bentin S, Gelbard H, Benjamini Y, Heller R, Nir Y, Hasson U, Malach R. Extrinsic and intrinsic systems in the posterior cortex of the human brain revealed during natural sensory stimulation. Cereb Cortex (2007) 17:766–777.
Green E, Howes DH. The nature of conduction aphasia: a study of anatomic and clinical features and of underlying mechanisms. In: Studies in neurolinguistics—Whitaker A, Whitaker HA, eds. (1978) San Diego: Academic Press. 123–156.
Greicius MD, Menon V. Default-mode activity during a passive sensory task: uncoupled from deactivation but impacting activation. J Cogn Neurosci (2004) 16:1484–1492.[CrossRef][Web of Science][Medline]
Gusnard DA, Raichle ME. Searching for a baseline: functional imaging and the resting human brain. Nat Rev Neurosci (2001) 2:685–694.[CrossRef][Web of Science][Medline]
Hasson U, Nir Y, Levy I, Fuhrmann G, Malach R. Intersubject synchronization of cortical activity during natural vision. Science (2004) 303:1634–1640.
Hickok G, Poeppel D. Towards a functional neuroanatomy of speech perception. Trends Cogn Sci (2000) 4:131–138.[CrossRef][Web of Science][Medline]
Hickok G, Poeppel D. Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition (2004) 92:67–99.[CrossRef][Web of Science][Medline]
Holmes CJ, Hoge R, Collins L, Woods R, Toga AW, Evans AC. Enhancement of MR images using registration for signal averaging. J Comput Assist Tomogr (1998) 22:324–333.[CrossRef][Web of Science][Medline]
Iacoboni M, Lieberman MD, Knowlton BJ, Molnar-Szakacs I, Moritz M, Throop CJ, Fiske AP. Watching social interactions produces dorsomedial prefrontal and medial parietal BOLD fMRI signal increases compared to a resting baseline. NeuroImage (2004) 21:1167–1173.[CrossRef][Web of Science][Medline]
Iacoboni M, Molnar-Szakacs I, Gallese V, Buccino G, Mazziotta JC, Rizzolatti G. Grasping the intentions of others with one's own mirror neuron system. PLoS Biol (2005) 3(3):e79.[CrossRef][Medline]
Jung-Beeman M. Bilateral brain processes for comprehending natural language. Trends Cogn Sci (2005) 9:512–518.[CrossRef][Web of Science][Medline]
Liebenthal E, Binder JR, Spitzer SM, Possing ET, Medler DA. Neural substrates of phonemic perception. Cereb Cortex (2005) 15:1621–1631.
Macaluso E, George N, Dolan R, Spence C, Driver J. Spatial and temporal factors during processing of audiovisual speech: a PET study. NeuroImage (2004) 21:725–732.[CrossRef][Web of Science][Medline]
Maguire EA, Frith CD, Morris RG. The functional neuroanatomy of comprehension and memory: the importance of prior knowledge. Brain (1999) 122:1839–1850.
Mar RA. The neuropsychology of narrative: story comprehension, story production and their interrelation. Neuropsychologia (2004) 42:1414–1434.[CrossRef][Web of Science][Medline]
Mazoyer BM, Tzourio N, Frak V, Syrota A, Murayama N, Levrier O, Salamon G, Dehaene S, Cohen L, Mehler J. The cortical representation of speech. J Cogn Neurosci (1993) 5:467–479.[CrossRef][Web of Science]
Mazoyer B, Zago L, Mellet E, Bricogne S, Etard O, Houde O, Crivello F, Joliot M, Petit L, Tzourio-Mazoyer N. Cortical networks for working memory and executive functions sustain the conscious resting state in man. Brain Res Bull (2001) 54:287–298.[CrossRef][Web of Science][Medline]
McKiernan KA, D'Angelo BR, Kaufman JN, Binder JR. Interrupting the "stream of consciousness": an fMRI investigation. NeuroImage (2006) 29:1185–1191.[CrossRef][Web of Science][Medline]
McKiernan KA, Kaufman JN, Kucera-Thompson J, Binder JR. A parametric manipulation of factors affecting task-induced deactivation: an fMRI study. J Cogn Neurosci (2003) 15:394–408.[CrossRef][Web of Science][Medline]
McNeill D. Hand and mind (1992) Chicago: University of Chicago Press.
Molnar-Szakacs I, Iacoboni M, Koski L, Mazziotta JC. Functional segregation within pars opercularis of the inferior frontal gyrus: evidence from fMRI studies of imitation and action observation. Cereb Cortex (2005) 15:986–994.
Molnar-Szakacs I, Kaplan JT, Greenfield PM, Iacoboni M. Observing complex action sequences: the role of the fronto-parietal mirror neuron system. NeuroImage (2006) 33:923–935.[CrossRef][Web of Science][Medline]
Papathanassiou D, Etard O, Mellet E, Zago L, Mazoyer B, Tzourio-Mazoyer N. A common language network for comprehension and production: a contribution to the definition of language epicenters with PET. NeuroImage (2000) 11:347–357.[CrossRef][Web of Science][Medline]
Pelphrey KA, Morris JP, Michelich CR, Allison T, McCarthy G. Functional anatomy of biological motion perception in posterior temporal cortex: an FMRI study of eye, mouth and hand movements. Cereb Cortex (2005) 15:1866–1876.
Perani D, Paulesu E, Galles NS, Dupoux E, Dehaene S, Bettinardi V, Cappa SF, Fazio F, Mehler J. The bilingual brain. Proficiency and age of acquisition of the second language. Brain (1998) 121:1841–1852.
Pugh KR, Shaywitz BA, Shaywitz SE, Fulbright RK, Byrd D, Skudlarski P, Shankweiler DP, Katz L, Constable RT, Fletcher J, et al. Auditory selective attention: an fMRI investigation. NeuroImage (1996) 4:159–173.[CrossRef][Web of Science][Medline]
Rademacher J, Morosan P, Schormann T, Schleicher A, Werner C, Freund HJ, Zilles K. Probabilistic mapping and volume measurement of human primary auditory cortex. NeuroImage (2001) 13:669–683.[Web of Science][Medline]
Raichle ME, MacLeod AM, Snyder AZ, Powers WJ, Gusnard DA, Shulman GL. A default mode of brain function. Proc Natl Acad Sci USA (2001) 98:676–682.
Rizzolatti G, Craighero L. The mirror-neuron system. Annu Rev Neurosci (2004) 27:169–192.[CrossRef][Web of Science][Medline]
Robertson DA, Gernsbacher MA, Guidotti SJ, Robertson RR, Irwin W, Mock BJ, Campana ME. Functional neuroanatomy of the cognitive process of mapping during discourse comprehension. Psychol Sci (2000) 11:255–260.[CrossRef][Web of Science][Medline]
Rodd JM, Davis MH, Johnsrude IS. The neural mechanisms of speech comprehension: fMRI studies of semantic ambiguity. Cereb Cortex (2005) 15:1261–1269.
Ross ED. The aprosodias. Functional-anatomic organization of the affective components of language in the right hemisphere. Arch Neurol (1981) 38:561–569.
Schmithorst VJ, Holland SK, Plante E. Cognitive modules utilized for narrative comprehension in children: a functional magnetic resonance imaging study. NeuroImage (2006) 29:254–266.[CrossRef][Web of Science][Medline]
Scott SK, Blank CC, Rosen S, Wise RJ. Identification of a pathway for intelligible speech in the left temporal lobe. Brain (2000) 123:2400–2406.
Scott SK, Wise RJS. The functional neuroanatomy of prelexical processing in speech perception. Cognition (2004) 92:13–45.[CrossRef][Web of Science][Medline]
Shmuel A, Yacoub E, Pfeuffer J, Van de Moortele PF, Adriany G, Hu X, Ugurbil K. Sustained negative BOLD, blood flow and oxygen consumption response and its coupling to the positive response in the human brain. Neuron (2002) 36:1195–1210.[CrossRef][Web of Science][Medline]
Shulman GL, Fiez JA, Corbetta M, Buckner RL, Miezin FM, Raichle ME, Petersen SE. Common blood flow changes across visual tasks: II. Decreases in cerebral cortex. J Cogn Neurosci (1997) 9:648–663.[Web of Science]
Skipper JI, Nusbaum HC, Small SL. Listening to talking faces: motor cortical activation during speech perception. NeuroImage (2005) 25:76–89.[CrossRef][Web of Science][Medline]
Smith SM, Jenkinson M, Woolrich MW, Beckmann CF, Behrens TE, Johansen-Berg H, Bannister PR, De Luca M, Drobnjak I, Flitney DE, et al. Advances in functional and structural MR image analysis and implementation as FSL. NeuroImage (2004) 23:S208–S219.[CrossRef][Web of Science][Medline]
St. George M, Kutas M, Martinez A, Sereno MI. Semantic integration in reading: engagement of the right hemisphere during discourse processing. Brain (1999) 122:1317–1325.
Tzourio-Mazoyer N, Josse G, Crivello F, Mazoyer B. Interindividual variability in the hemispheric organization for speech. NeuroImage (2004) 21:422–435.[CrossRef][Web of Science][Medline]
Wernicke C. Der Aphasische Symptomencomplex (1874) Breslau: Cohn and Weigert.
Wildgruber D, Riecker A, Hertrich I, Erb M, Grodd W, Ethofer T, Ackermann H. Identification of emotional intonation evaluated by fMRI. NeuroImage (2005) 24:1233–1241.[CrossRef][Web of Science][Medline]
Wilson SM, Iacoboni M. Neural responses to non-native phonemes varying in producibility: evidence for the sensorimotor nature of speech perception. NeuroImage (2006) 33:316–325.[CrossRef][Web of Science][Medline]
Wilson SM, Saygin AP, Sereno MI, Iacoboni M. Listening to speech activates motor areas involved in speech production. Nat Neurosci (2004) 7:701–702.[CrossRef][Web of Science][Medline]
Wise RJ, Scott SK, Blank SC, Mummery CJ, Murphy K, Warburton EA. Separate neural subsystems within Wernicke's area. Brain (2001) 124:83–95.
Worsley KJ, Liao C, Aston J, Petre V, Duncan GH, Morales F, Evans AC. A general statistical analysis for fMRI data. NeuroImage (2002) 15:1–15.[CrossRef][Web of Science][Medline]
Xu J, Kemeny S, Park G, Frattali C, Braun A. Language in context: emergent features of word, sentence, and narrative comprehension. NeuroImage (2005) 25:1002–1015.[CrossRef][Web of Science][Medline]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
U. Hasson, E. Yang, I. Vallines, D. J. Heeger, and N. Rubin A Hierarchy of Temporal Receptive Windows in Human Cortex J. Neurosci., March 5, 2008; 28(10): 2539 - 2550. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



