Annotating the functional chunks in Chinese sentences
Qiang Zhou (State Key Laboratory of Intelligent Technology and Systems Dept. of Computer Science and Technology, Tsinghua University, Beijing 100084, P. R. China)
Elliott Franco Drabek (State Key Laboratory of Intelligent Technology and Systems Dept. of Computer Science and Technology, Tsinghua University, Beijing 100084, P. R. China)
Fuji Ren (Dept. of Information Science and Intelligent Systems Faculty of Engineering, The University of Tokushima 2-1 Minamijosanjima, Tokushima 770-8506, Japan)
WO5: Syntactic Annotation
The paper proposed a new syntactic annotation scheme --- functional chunk, which tried to represent information about grammatical relations between sentence-level predicates and their arguments. Under this scheme, we built a Chinese chunk bank with about two million Chinese characters, and developed some learned models for automatically annotating fresh text with functional chunks. We also proposed a two-stages approach to build Chinese tree bank on the top of chunk bank, and gave some experimental results of chunk-based syntactic parser to show the advantage of functional chunk for parsing performance increase. All these work lays good foundations for further research project to build a large scale Chinese tree bank.
Corpus annotation, Functional chunk, Partial parsing, Chunk bank, Treebank