Title

PIA-Core: Semantic Annotation through Example-based Learning

Authors

Nigel Collier (National Institute of Informatics (NII) National Center of Sciences, 2-1-2 Hitotsubashi Chiyoda-ku, Tokyo 101-8430, Japan)

Koichi Takeuchi (National Institute of Informatics (NII) National Center of Sciences, 2-1-2 Hitotsubashi Chiyoda-ku, Tokyo 101-8430, Japan)

Session

WP4: Corpus Annotation

Abstract

This paper summarizes the aims and scope of the PIA (Portable Information Access) project’s PIA-Core system for automatic annotation of documents on the Semantic Web, i.e. the next-generation World Wide Web. The focus of the project is to develop a portable information extraction system that can be easily adapted to new domains. PIA is founded on three resources: the PIA-Core information extraction module, application modules, and the PIA guidelines for ensuring consistent annotation. We are currently developing PIA-Core, which uses advanced machine learning methods to learn from examples of annotated documents and then automatically annotate documents with items such as terminology, names, and temporal and quantity expressions.

Keywords

Semantic annotation

Full Paper

328.pdf