Title

Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan, Guatemala)

Author(s)

Jonas Kuhn, B’alam Mateo-Toledo

The University of Texas at Austin, Department of Linguistics

Session

P19-SW

Abstract

This paper reports on a number of experiments in which we applied standard techniques from NLP in the context of documentation of endangered languages. We concentrated on the use of existing, freely available toolkits. Specifically, we explore the use of Finite-State Morphological Analysis, Maximum Entropy Part-of-Speech Tagging, and N-Gram Language Modeling.

Keyword(s)

Endangered languages, corpora, finite-state morphology, Maximum Entroy tagging, N-gram language models

Language(s)

Q’anjob’al (Mayan, Guatemala)

Full Paper

412.pdf