Title

Extracting Information for Automatic Indexing of Multimedia Material

Authors

Horacio Saggion (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK)

Hamish Cunningham (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK)

Diana Maynard (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK)

Kalina Bontcheva (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK)

Oana Hamza (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK)

Christian Ursu (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK)

Yorick Wilks (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK)

Session

MMO3: Collection & Indexing Of Multimodal LR

Abstract

This paper discusses our work on information extraction (IE) from multilingual, multimedia, multi-genre language resources in a domain with many different event types. The work is being carried out within MUMIS, an EU-funded project that aims to develop basic technology for creating a composite index from multiple, multilingual sources. Our approach to IE relies on finite-state machinery provided by GATE, a General Architecture for Text Engineering, pipelined with full syntactic analysis and discourse interpretation implemented in Prolog.

Keywords

Automatic indexing, Multimedia material

Full Paper

157.pdf