PIA-Core: Semantic Annotation through Example-based Learning
Nigel Collier (National Institute of Informatics (NII) National Center of Sciences, 2-1-2 Hitotsubashi Chiyoda-ku, Tokyo 101-8430, Japan)
Koichi Takeuchi (National Institute of Informatics (NII) National Center of Sciences, 2-1-2 Hitotsubashi Chiyoda-ku, Tokyo 101-8430, Japan)
WP4: Corpus Annotation
This paper summarizes the aims and scope of the PIA (Portable Information Access) projectís PIA-Core system for automatic annotation of documents on the Semantic Web, i.e. the next generation World Wide Web. The focus of the project is to develop a portable information extraction system that can be easily adapted to new domains. PIA has its foundations on three resources: the PIA-Core information extraction module, application modules and PIA guidelines for ensuring consistent annotation. We are currently developing PIA-Core based on advanced machines learning methods to automatically annotate documents with terminology, names, temporal and quantity expressions etc. using examples of annotated documents.