SUMMARY : Session P19-E

 

Title Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Authors E. Coussé, S. Gillis
Abstract In this paper, we assess one aspect of the quality of the broad phonetic transcriptions in the Spoken Dutch Corpus (CGN). The corpus contains speech from native speakers of Dutch originating from the Netherlands and the Dutch-speaking part of Belgium. The phonetic transcriptions were made by transcribers from both regions. In previous research, we identified regional differences in the transcribers' behaviour. In this paper, we explore the precise sources of the regional bias in the CGN transcriptions and evaluate its impact on the phonetic transcriptions. More specifically, (1) we critically analyse the regional bias in the canonical transcriptions that served as the basis for the transcribers' verification task, and (2) we verify experimentally the regional bias introduced by the transcribers themselves. The possible effects of this inherent regional bias in the CGN transcriptions on subsequent linguistic analyses are briefly discussed.
Keywords evaluation, phonetic transcriptions, regional bias, Spoken Dutch Corpus
Full paper Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus