Publication | Closed Access
Acoustic correlates of information structure
426
Citations
55
References
2010
Year
PsychoacousticsNeurolinguisticsAcoustic CorrelatesFocus LocationPhonologyLanguage ProcessingApplied LinguisticsSyntaxExperimental PragmaticProsody (Film Studies)Corpus AnalysisLanguage StudiesHealth SciencesAuditory ModelingSpeech ProductionSpeech CommunicationLanguage PerceptionNarrow Object FocusSpeech AcousticsInformation StructureSpeech ProcessingLinear ResidualisationParalinguisticsSpeech PerceptionLinguistics
The study investigates whether English speakers prosodically mark information structure, identifies acoustic features linked to different aspects of information structure, and evaluates listeners’ ability to retrieve this information from the signal. The authors manipulated information structure in subject–verb–object sentences by preceding them with questions that made elements either focused or given, with focused elements further distinguished by implicit or explicit contrast sets and by narrow versus wide focus. Across all experiments, speakers reliably marked focus location and breadth with greater intensity, longer duration, and higher mean and maximum F0 for focused elements, listeners could detect these prosodic cues, contrastive focus was marked with greater intensity, longer duration, and lower F0, and linear residualisation and discriminant modelling revealed cross‑speaker generalisations and acoustic features underlying meaning differences.
This paper reports three studies aimed at addressing three questions about the acoustic correlates of information structure in English: (1) do speakers mark information structure prosodically, and, to the extent they do; (2) what are the acoustic features associated with different aspects of information structure; and (3) how well can listeners retrieve this information from the signal? The information structure of subject–verb–object sentences was manipulated via the questions preceding those sentences: elements in the target sentences were either focused (i.e., the answer to a wh-question) or given (i.e., mentioned in prior discourse); furthermore, focused elements had either an implicit or an explicit contrast set in the discourse; finally, either only the object was focused (narrow object focus) or the entire event was focused (wide focus). The results across all three experiments demonstrated that people reliably mark (1) focus location (subject, verb, or object) using greater intensity, longer duration, and higher mean and maximum F0, and (2) focus breadth, such that narrow object focus is marked with greater intensity, longer duration, and higher mean and maximum F0 on the object than wide focus. Furthermore, when participants are made aware of prosodic ambiguity present across different information structures, they reliably mark focus type, so that contrastively focused elements are produced with greater intensity, longer duration, and lower mean and maximum F0 than noncontrastively focused elements. In addition to having important theoretical consequences for accounts of semantics and prosody, these experiments demonstrate that linear residualisation successfully removes individual differences in people's productions thereby revealing cross-speaker generalisations. Furthermore, discriminant modelling allows us to objectively determine the acoustic features that underlie meaning differences.
| Year | Citations | |
|---|---|---|
Page 1
Page 1