Summary of the paper

Title UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
Authors Toshinobu Ogiso, Mamoru Komachi, Yasuharu Den and Yuji Matsumoto
Abstract In order to construct an annotated diachronic corpus of Japanese, we propose to create a new dictionary for morphological analysis of Early Middle Japanese (Classical Japanese) based on UniDic, a dictionary for Contemporary Japanese. Differences between Early Middle Japanese and Contemporary Japanese, which prevent a naïve adaptation of UniDic to Early Middle Japanese, are found at the levels of lexicon, morphology, grammar, orthography and pronunciation. To overcome these problems, we extended the dictionary entries and created a training corpus of Early Middle Japanese to adapt UniDic for Contemporary Japanese to Early Middle Japanese. Experimental results show that the proposed UniDic-EMJ, a new dictionary for Early Middle Japanese, achieves accuracy (97%) high enough for linguistic research on lexicon and grammar in Japanese classical text analysis.
Topics Corpus (creation, annotation, etc.), Lexicon, lexical database, Part of speech tagging
Bibtex @InProceedings{OGISO12.906,
  author = {Toshinobu Ogiso and Mamoru Komachi and Yasuharu Den and Yuji Matsumoto},
  title = {UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese},
  booktitle = {Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
 }