Summary of the paper

Title LDC Forced Aligner
Authors Xiaoyi Ma
Abstract This paper describes the LDC forced aligner, which was designed to align audio with transcripts. Unlike existing forced aligners, the LDC forced aligner can align partially transcribed audio files, as well as audio files with large chunks of non-speech segments, such as noise, music, silence, etc., by inserting optional wildcard phoneme sequences between sentence or paragraph boundaries. Built on the HTK toolkit, the LDC forced aligner can align audio and transcripts at the sentence or word level. This paper also reports on its use with English and Mandarin Chinese data.
Topics Corpus (creation, annotation, etc.), Speech resource/database, Tools, systems, applications
Full paper LDC Forced Aligner
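
Illustration The abstract's central idea, optional wildcard sequences placed around sentences so that untranscribed or non-speech audio has somewhere to go, can be sketched roughly as follows. This is a minimal sketch and not the aligner's actual implementation: the function name `build_alignment_grammar`, the `WILDCARD` symbol, and the bracket notation (square brackets marking an optional item) are all assumptions made for illustration. In a real system the wildcard would presumably expand to a phoneme sequence or loop, which is not modeled here.

```python
from typing import List

# Hypothetical symbol standing in for a model that absorbs
# untranscribed speech, noise, music, or silence.
WILDCARD = "WILDCARD"


def build_alignment_grammar(sentences: List[str], wildcard: str = WILDCARD) -> str:
    """Return a flat grammar string with an optional wildcard
    before the first sentence and after every sentence.

    Square brackets mark an optional item (illustrative notation only).
    """
    optional = f"[ {wildcard} ]"
    parts = [optional]
    for sentence in sentences:
        words = sentence.upper().split()
        if not words:
            continue  # skip empty sentences
        parts.append(" ".join(words))
        parts.append(optional)
    return "( " + " ".join(parts) + " )"


if __name__ == "__main__":
    print(build_alignment_grammar(["hello world", "good morning"]))
```

Because each wildcard is optional, a recognizer following such a grammar may skip it where the transcript is contiguous and take it only where the audio contains material the transcript does not cover.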
