Title

TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit

Authors

Steven Bird (Linguistic Data Consortium, University of Pennsylvania)

Kazuaki Maeda (Linguistic Data Consortium, University of Pennsylvania)

Xiaoyi Ma (Linguistic Data Consortium, University of Pennsylvania)

Haejoong Lee (Linguistic Data Consortium, University of Pennsylvania)

Beth Randall (Linguistic Data Consortium, University of Pennsylvania)

Salim Zayat (Linguistic Data Consortium, University of Pennsylvania)

Session

MMP1: Multimodal Resources And Tools

Abstract

Four diverse tools built on the Annotation Graph Toolkit are described. Each tool associates linguistic codes and structures with time-series data. All are based on the same software library and tool architecture. TableTrans is for observational coding, using a spreadsheet whose rows are aligned to a signal. MultiTrans is for transcribing multi-party communicative interactions recorded using multi-channel signals. InterTrans is for creating interlinear text aligned to audio. TreeTrans is for creating and manipulating syntactic trees. This work demonstrates that the development of diverse tools and the re-use of software components are greatly facilitated by a common high-level application programming interface for representing the data and managing input/output, together with a common architecture for managing the interaction of multiple components.

Keywords

Speech transcription, Conversation, Observational coding, Interlinear text, Tree annotation

Full Paper

285.pdf