Publication | Closed Access
An analysis of transcription consistency in spontaneous speech from the buckeye corpus
26
Citations
1
References
2002
Year
Unknown Venue
Buckeye CorpusSpeech CorpusPsycholinguisticsSpoken Language ProcessingCommunicationPhonologyCorpus LinguisticsSpeech RecognitionPhoneticsComputational LinguisticsLanguage StudiesStatisticsInformal SpeechSegmentation AgreementHealth SciencesSpeech SynthesisTranscription ConsistencySpontaneous SpeechSpeech CommunicationSpeech TechnologySpeech AnalysisSpeech ProcessingSpeech PerceptionLinguistics
We present a preliminary analysis of transcriber consistency in labeling and segmentation of words and phones in the Buckeye corpus of spontaneous, informal speech. We find that pairwise inter-transcriber agreement on exact phone label match was 76%, and segmentation agreement within 20% of phone pair length was 75%, though longer phones are more consistently segmented than shorter phones. Patterns of consistency variation in labeling are observed as a function of phonetic categories that are similar to patterns reported for read speech. More agreement is seen on consonants than on vowels, and on fricatives and labials than on other consonant classes. In general, we find that shorter, more reduced words and phones result in more transcriber disagreement.
| Year | Citations | |
|---|---|---|
Page 1
Page 1