LREC 2006 Conference
Main Conference Programme

Click on each session title for more details.

Each session title is preceded by a unique code. The letters "O" and "P" refer to the type of presentation. The letters "G", "T", "W", "S", "E", and "M", referring to the domain(s) treated, may be combined.

  • O: Oral presentation
  • P: Poster presentation
  • G: General
  • T: Terminology
  • W: Written
  • S: Speech
  • E: Evaluation
  • M: Multimodality

Day 1 | Day 2 | Day 3

Day 1, Wednesday 24th May 2006
Schedule Programme Room
10:00-10:45 Opening Session
10:45-11:05 Coffee Break
11:05-13:25 5 Oral Sessions in Parallel
  O1-W: Lexicons, Semantics & Infrastructural Issues 
  O2-EW: Named Entity Recognition & Time Annotation
  O3-EW: Authoring Tools, Information Extraction and Retrieval 
  O4-S: Speech Corpora and Dialogue
  O5-E: Machine Translation and Evaluation
13:30-14:40 Lunch Break
14:40-16:40 5 Oral Sessions in Parallel
  O6-WM: Ontologies
  O7-GWT: Standards
  O8-W: Lexicon Acquisition
  O9-SE: TTS & Units for TTS
  O10-E: Parsing Evaluation
16:40-17:00 Coffee Break
17:00-18:20 Poster Sessions
  P1-W: Anaphora & Coreference
Module 7 - 2nd floor
  P2-W: Corpora & LR Infrastructures
Module 7 - 2nd floor
  P3-W: Machine Translation
Module 7 - 2nd floor
  P4-W: Question Answering, Summarisation, Classification
Module 7 - 2nd floor
  P5-W: Learning and Acquisition
Module 7 - 2nd floor
  P6-W: Corpora: Creation, Annotation
Module 7 - 2nd floor
  P7-T: Terminology & Ontologies
Module 7 - 2nd floor
  P8-E: Evaluation of Question Answering & Summarisation
Module 7 - 3rd floor
  P9-M: Multimodal Tools
Module 7 - 3rd floor
  P10-S: Speech Corpora & Annotation
Module 7 - 3rd floor
18:20-18:25 5-minute Break
18:25-19:45 1 Panel & 4 Oral Sessions in Parallel
18:25-19:45 Panel 1: Perspectives for Ontologies in Linguistics

Moderator: Peter Wittenburg

  O11-W: Corpus & Lexicon
  O12-W: Tokenization and Tagging
  O13-M: Multimodal Corpora & Natural Interaction
  O14-W: Sentiments & Semantics

Day 1 | Day 2 | Day 3

O1-W Lexicons, Semantics & Infrastructural Issues
Chairperson:Dan Tufiş

11.05-11.25 A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
Yoshihiko Hayashi, Toru Ishida
11.25-11.45 Moving to dynamic computational lexicons with LeXFlow
Claudia Soria, Maurizio Tesconi, Francesca Bertagna, Nicoletta Calzolari, Andrea Marchetti, Monica Monachini
11.45-12.05 IMORPHĒ : An Inheritance and Equivalence Based Morphology Description Compiler
Violetta Cavalli-Sforza, Abdelhadi Soudi
12.05-12.25 A Computational Lexicon of Contemporary Hebrew
Alon Itai, Shuly Wintner, Shlomo Yona
12.25-12.45 A methodology for the joint development of the Basque WordNet and Semcor
Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Izagirre, Karmele Mendizabal, Eli Pociello and Mikel Quintian
12.45-13.05 Building a WordNet for Arabic
Sabri Elkateb, William Black, Horacio Rodríguez, Musa Alkhalifa, Piek Vossen, Adam Pease, Christiane Fellbaum
13.05-13.25 User-friendly ontology authoring using a controlled language
Valentin Tablan, Tamara Polajnar, Hamish Cunningham, Kalina Bontcheva
O2-EW Named Entity Recognition & Time Annotation
Chairperson:James Pustejovsky
11.05-11.25 What in the World is a Shahab? Wide Coverage Named Entity Recognition for Arabic
Luke Nezda, Andrew Hickl, John Lehmann and Sarmand Fayyaz
11.25-11.45 Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime
Ionas Michailidis, Konstantinos Diamantaras, Spiros Vasileiadis, Yannick Frère
11.45-12.05 Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
Bruno Pouliquen, Marco Kimler, Ralf Steinberger, Camelia Ignat, Tamara Oellinger, Ken Blackler, Flavio Fluart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best
12.05-12.25 Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue
Voula Giouli, Alexis Konstandinidis, Elina Desypri, Harris Papageorgiou
12.25-12.45 Temporality in relation with discourse structure
Corina Forăscu , Ionut Cristian Pistol, Dan Cristea
12.45-13.05 Analysis of TimeBank as a Resource for TimeML Parsing
Branimir Boguraev and Rie Kubota Ando
13.05-13.25 An Annotated Corpus of Typical Durations of Events
Feng Pan, Rutu Mulkar and Jerry R. Hobbs
O3-EW Authoring Tools, Information Extraction and Retrieval
Chairperson:Virach Sornlertlamvanich

11.05-11.25 A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Flora Ramírez Bustamante, Alfredo Arnaiz, Mar Ginés
11.25-11.45 Corpus-Induced Corpus Clean-up
Martin Reynaert
11.45-12.05 Spelling Error Patterns in Spanish for Word Processing Applications
Flora Ramírez Bustamante, Enrique López Díaz
12.05-12.25 Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods
Ernesto William De Luca and Andreas Nürnberger
12.25-12.45 A Methodology and Tool for Representing Language Resources for Information Extraction
José Iria, Fabio Ciravegna
12.45-13.05 Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh and Páraic Sheridan
13.05-13.25 Comparison of Resource Discovery Methods
Alex Klassmann, Freddy Offenga, Daan Broeder, Romuald Skiba, Peter Wittenburg
O4-S Speech Corpora and Dialogue
Chairperson:Asunción Moreno
11.05-11.25 The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Christopher Cieri, Walt Andrews, Joseph P. Campbell, George Doddington, Jack Godfrey, Shudong Huang, Mark Liberman, Alvin Martin, Hirotaka Nakasone, Mark Przybocki, Kevin Walker
11.25-11.45 Spoken Russian in the Russian National Corpus (RNC)
Elena Grishina
11.45-12.05 Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives
Emma Barker, Ryuichiro Higashinaka, François Mairesse, Robert Gaizauskas, Marilyn Walker, Jonathan Foster
12.05-12.25 Towards automatic transcription of Somali language
Nimaan Abdillahi, Nocera Pascal, Bonastre Jean-François
12.25-12.45 JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Catia Cucchiarini, Hugo Van hamme, Olga van Herwijnen and Felix Smits
12.45-13.05 Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
Sylvain Galliano, Edouard Geoffrois, Guillaume Gravier, Jean-François Bonastre, Djamel Mostefa, Khalid Choukri
13.05-13.25 Learning Database Content for Spoken Dialogue System Design
Joseph Polifroni and Marilyn Walker
O5-E Machine Translation and Evaluation
Chairperson:Margaret King
11.05-11.25 Evaluation of Automatic Speech Recognition and Speech Translation within TC-STAR: Results from the first evaluation campaign
Djamel Mostefa, Olivier Hamon, Khalid Choukri
11.25-11.45 X-Score: Automatic Evaluation of Machine Translation Grammaticality
O Hamon, M Rajman
11.45-12.05 Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm
Keith J. Miller and Michelle Vanni
12.05-12.25 TransType2 : The Last Word
Elliott Macklovitch
12.25-12.45 Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components
Ulrich Schäfer, Daniel Beck
12.45-13.05 CESTA: First Conclusions of the Technolanguage MT Evaluation Campaign
O. Hamon , A. Popescu-Belis, K. Choukri, M. Dabbadie, A. Hartley, W. Mustafa El Hadi, M. Rajman, I. Timimi
13.05-13.25 Integrated Linguistic Resources for Language Exploitation Technologies
Stephanie Strassel, Christopher Cieri, Andrew Cole, Denise DiPersio, Mark Liberman, Xiaoyi Ma, Mohamed Maamouri, Kazuaki Maeda
O6-WMT Ontologies
Chairperson:Paul Buitelaar
14.40-15.00 Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
P. Cimiano, M. Hartung, E. Ratsch
15.00-15.20 An Incremental Tri-Partite Approach To Ontology Learning
José Iria, Christopher Brewster, Fabio Ciravegna and Yorick Wilks
15.20-15.40 Exploiting text for extracting image processing resources
Gregory Grefenstette, Fathi Debili, Christian Fluhr, Svitlana Zinger
15.40-16.00 Using Core Ontology for Domain Lexicon Structuring
Rita Marinelli, Adriana Roventini, Giovanni Spadoni
16.00-16.20 Interaction between Lexical Base and Ontology with Formal Concept Analysis
Sujian Li, Qin Lu, Wenjie Li, Ruifeng Xu
16.20-16.40 Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Roberto Bartolini, Caterina Caracciolo, Emiliano Giovannetti, Alessandro Lenci, Simone Marchi, Vito Pirrelli, Chiara Renso, Laura Spinsanti
O7-GWT Standards
Chairperson:Christopher Cieri
14.40-15.00 Representing Linguistic Corpora and Their Annotations
Nancy Ide, Laurent Romary
15.00-15.20 SynAF: Towards a Standard for Syntactic Annotation
Thierry Declerck
15.20-15.40 Lexical Markup Framework (LMF)
Gil Francopoulo, Monte George, Nicoletta Calzolari, Monica Monachini, Nuria Bel, Mandy Pet, Claudia Soria
15.40-16.00 Conversion of WordNet to a standard RDF/OWL representation
Mark van Assem, Aldo Gangemi, Guus Schreiber
16.00-16.20 EuroTermBank – a Terminology Resource based on Best Practice
Lina Henriksen, Claus Povlsen and Andrejs Vasiljevs
16.20-16.40 A novel Textual Encoding paradigm based on Semantic Web tools and semantics
G. Tummarello, C. Morbidoni, F. Kepler, F. Piazza, P. Puliti
O8-W Lexicon Acquisition
Chairperson:Leonardo Lesmo
14.40-15.00 Automatic extraction of subcategorization frames for French
Paula Chesley, Susanne Salmon-Alt
15.00-15.20 Extraction of Temporal Information from Texts in Swedish
Anders Berglund, Richard Johansson, Pierre Nugues
15.20-15.40 The Mass-Count Distinction: Acquisition and Disambiguation
Michael Schiehlen, Kristina Spranger
15.40-16.00 Automatic Acquisition of Semantics-Extraction Patterns from Parallel Texts
Pavel Smrž
16.00-16.20 Automated Deep Lexical Acquisition for Robust Open Texts Processing
Yi Zhang, Valia Kordoni
16.20-16.40 Data-driven Amharic-English Bilingual Lexicon Acquisition
Saba Amsalu
O9-SE TTS & Units for TTS
Chairperson:Christoph Draxler
14.40-15.00 Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names
Qian Yang, Jean-Pierre Martens, Nanneke Konings, Henk van den Heuvel
15.00-15.20 Set-up of a Unit-Selection Synthesis with a Prominent Voice
Stefan Breuer, Sven Bergmann, Ralf Dragon and Sebastian Möller
15.20-15.40 Creation and analysis of a Polish speech database for use in unit selection synthesis
Dominika Oliver, Krzysztof Szklanny
15.40-16.00 ECESS Inter-Module Interface Specification for Speech Synthesis
Javier Pérez, Antonio Bonafonte, Horst-Udo Hain, Eric Keller, Stefan Breuer, Jilei Tian
16.00-16.20 A joint prosody evaluation of French text-to-speech synthesis systems
Marie-Neige Garcia, Christophe d’Alessandro, Gérard Bailly, Philippe Boula de Mareüil, Michel Morel
16.20-16.40 TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis
A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. van den Heuvel, H.-U. Hain, X.S. Wang, M.N. Garcia
O10-E Parsing Evaluation
Chairperson:Massimo Poesio
14.40-15.00 Data, Annotations and Measures in EASY: the Evaluation Campaign for Parsers of French
Patrick Paroubek, Isabelle Robba, Anne Vilnat, Christelle Ayache
15.00-15.20 Exploring HPSG-based Treebanks for Probabilistic Parsing
Günter Neumann and Berthold Crysmann
15.20-15.40 Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Rajen Subba, Barbara Di Eugenio, Elena Terenzi
15.40-16.00 SParseval: Evaluation Metrics for Parsing Speech
Brian Roark, Mary Harper, Eugene Charniak, Bonnie Dorr, Mark Johnson, Jeremy G. Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna Krasnyanskaya, Matthew Lease, Izhak Shafran, Matthew Snover, Robin Stewart, Lisa Yung
16.00-16.20 A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
Alexandre Denis, Matthieu Quignard, Guillaume Pitel
16.20-16.40 Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Tristan Vanrullen, Philippe Blache, Jean-Marie Balfourier
O11-W Corpus & Lexicon
Chairperson:Karel Pala
18.25-18.45 A translated corpus of 30,000 French SMS
Cédrick Fairon, Sébastien Paumier
18.45-19.05 The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
Irene Castellón, Ana Fernández-Montraveta, Gloria Vázquez, Laura Alonso Alemany, Joan Antoni Capilla
19.05-19.25 A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
Stefan Klatt
19.25-19.45 Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita and Yuji Matsumoto
O12-W Tokenization and Tagging
Chairperson:Patrick Paroubek
18.25-18.45 The importance of precise tokenizing for deep grammars
Martin Forst, Ronald M. Kaplan
18.45-19.05 Towards Unified Chinese Segmentation Algorithm
Fu Lee Wang, Xiaotie Deng and Feng Zou
19.05-19.25 Tagset Mapping and Statistical Training Data Cleaning-up
Felix Pîrvan, Dan Tufiş
19.25-19.45 New Approach to Frequency Dictionaries — Czech Example
Jaroslava Hlaváčová
O13-M Multimodal Corpora & Natural Interaction
Chairperson:Jean-Claude Martin
18.25-18.45 A Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
Nick Campbell, Toshiyuki Sadanobu, Masataka Imura, Naoto Iwahashi, Suzuki Noriko & Damien Douxchamps
18.45-19.05 The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
Donna K. Byron, Eric Fosler-Lussier
19.05-19.25 Developing a Contextualized Multimodal Corpus for Human-Robot Interaction
Anders Green, Helge Hüttenrauch, Elin Anna Topp, Kerstin Severinson Eklundh
19.25-19.45 Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction
Saturnino Luz, Matt-Mouley Bouamrane, Masood Masoodian
O14-W Sentiments & Semantics
Chairperson:Yorick Wilks
18.25-18.45 Semantic Tag Extraction from WordNet Glosses
Alina Andreevskaia, Sabine Bergler
18.45-19.05 SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
Andrea Esuli and Fabrizio Sebastiani
19.05-19.25 The Affective Weight of Lexicon
Carlo Strapparava, Alessandro Valitutti, Oliviero Stock
19.25-19.45 Methods for Creating Semantic Orientation Dictionaries
Maite Taboada, Caroline Anthony and Kimberly Voll
P1-W Anaphora & Coreference
Chairperson:Véronique Moriceau
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Anders Nøklestad, Øystein Reigem, Christer Johansson
NPs for Events: Experiments in Coreference Annotation
Laura Hasler, Constantin Orasan, Karin Naumann
Annotating Bridging Anaphors in Italian: in Search of Reliability
Tommaso Caselli and Irina Prodanof
Exploiting logical document structure for anaphora resolution
Daniela Goecke, Andreas Witt
RefRef: A Tool for Viewing and Exploring Coreference Space
Hisami Suzuki and Gary Kacmarcik
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
Whitney Gegg-Harrison, Donna K. Byron
An Anaphora Resolution-Based Anonymization Module
M. Poesio, M. A. Kabadjov, P. Goux, U. Kruschwitz, E. Bishop and L. Corti
If it were then, then when was it? Establishing the anaphoric role of then
Georgiana Puşcaşu, Ruslan Mitkov
P2-W Corpora & LR Infrastructures
Chairperson:Sonja Bosch
Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience
Dimitrios Kokkinakis
User requirements analysis for the design of a reference corpus of written Dutch
Nelleke Oostdijk and Lou Boves
The DiaCORIS project: a diachronic corpus of written Italian
C. Onelli, D. Proietti, C. Seidenari, F. Tamburini
Annotating COMPARA, a Grammar-aware Parallel Corpus
Diana Santos, Susana Inácio
A Closer Look at Skip-gram Modelling
David Guthrie, Ben Allison, Wei Liu, Louise Guthrie, Yorick Wilks
All Greek to me! An automatic Greeklish to Greek transliteration system
Aimilios Chalamandaris, Athanassios Protopapas, Pirros Tsiakoulis, Spyros Raptis
P3-W Machine Translation
Chairperson:Anna Sågvall-Hein
Lexical similarity can distinguish between automatic and manual translations
Agam Patel and Dragomir R. Radev
Czech-English Word Alignment
Ondřej Bojar and Magdalena Prokopová
Building a resource for studying translation shifts
Lea Cyrus
A Hebrew Tree Bank Based on Cantillation Marks
Andi Wu, Kirk Lowery
Discriminant-Based MRS Banking
Stephan Oepen and Jan Tore Lønning
Annotation Guidelines for Czech-English Word Alignment
Ivana Kruijff-Korbayová, Klára Chvátalová, Oana Postolache
Automatic Construction of Japanese WordNet
Hiroyuki Kaji and Mariko Watanabe
Example-Based Machine Translation Using a Dictionary of Word Pairs
Reinhard Rapp, Carlos Martin Vide
Non-probabilistic alignment of rare German and English nominal expressions
Bettina Schrader
POS-based Word Reorderings for Statistical Machine Translation
Maja Popović , Hermann Ney
METIS-II: Machine Translation for Low Resource Languages
Vincent Vandeghinste, Ineke Schuurman, Michael Carl, Stella Markantonatou, Toni Badia
Dependency-Based Phrase Alignment
Radu Ion, Alexandru Ceauşu and Dan Tufiş
P4-W Question Answering, Summarisation, Classification
Chairperson:Claire Grover
Bilingual Machine-Aided Indexing
Jorge Civera and Alfons Juan
Data for question answering: The case of why
Suzan Verberne, Lou Boves, Nelleke Oostdijk and Peter-Arno Coppen
Multilingual Multidocument Summarization Tools and Evaluation
Horacio Saggion
Language Resources for Background Gathering
Horacio Saggion and Robert Gaizauskas
Mining Implicit Entities in Queries
Wei Li, Wenjie Li, Qin Lu
Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons
Francesca Bertagna
SlinkET: A Partial Modal Parser for Events
Roser Saurí, Marc Verhagen, James Pustejovsky
P5-W Learning and Acquisition
Chairperson:Timothy Baldwin
Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
Vít Nováček, Pavel Smrž, Jan Pomikálek
Case Frame Compilation from the Web using High-Performance Computing
Daisuke Kawahara, Sadao Kurohashi
The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
Benoît Sagot, Lionel Clément, Éric Villemonte de La Clergerie and Pierre Boullier
Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives
Kyoko Kanzaki, Qing Ma, Eiko Yamamoto and Hitoshi Isahara
Leveraging Machine Readable Dictionaries in Discriminative Sequence Models
Ben Wellner and Marc Vilain
New tools for the encoding of lexical data extracted from corpus
Núria Bel, Sergio Espeja, Montserrat Marimon
Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Yasunori Ohishi, Katunobu Itou, Kazuya Takeda, Atsushi Fujii
P6-W Corpora: Creation, Annotation
Chairperson:Costanza Navarretta
Mixing WordNet, VerbNet and PropBank for studying verb relations
Maria Teresa Pazienza, Marco Pennacchiotti, Fabio Massimo Zanzotto
A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, Victor Sizov
Annotating the Predicate-Argument Structure of Chinese Nominalizations
Nianwen Xue
Towards a Slovene Dependency Treebank
Sašo Džeroski, Tomaž Erjavec, Nina Ledinek, Petr Pajas, Zdenek Žabokrtsky, Andreja Žele
Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation
Joakim Nivre, Jens Nilsson, Johan Hall
POS tagset design for Italian
Raffaella Bernardi, Andrea Bolognesi, Corrado Seidenari, Fabio Tamburini
Linguistic and Biological Annotations of Biological Interaction Events
Tomoko Ohta, Yuka Tateisi, Jin-Dong Kim, Akane Yakushiji, Jun-ichi Tsujii
Structure, Annotation and Tools in the Basque ZT Corpus
N. Areta, A. Gurrutxaga, I. Leturia, Z. Polin, R. Saiz, I. Alegria, X. Artola, A. Diaz de Ilarraza, N. Ezeiza, A. Sologaistoa, A. Soroa, A. Valverde
A new approach to syntactic annotation
Noguchi Masaki, Ichikawa Hiroshi, Hashimoto Taiichi and Tokunaga Takenobu
An Annotated Corpus Management Tool: ChaKi
Yuji Matsumoto, Masayuki Asahara, Kiyota Hashimoto, Yukio Tono, Akira Ohtani, Toshio Morita
Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
Yunqing Xia, Kam-Fai Wong and Wenjie Li
Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network
Jaeyoung Jung, Maki Miyake and Hiroyuki Akama
KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
Véronique Hoste and Guy De Pauw
Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Florbela Barreto, António Branco, Eduardo Ferreira, Amália Mendes, Maria Fernanda Bacelar do Nascimento, Filipe Nunes, João Ricardo Silva
Methodological Aspects of Semantic Annotation
Harry Bunt and Amanda Schiffrin
P7-T Terminology & Ontologies
Chairperson:Bolette Sandford Pedersen
Constructing a Named Entity Ontology from Web Corpora
Ming-Shun Lin and Hsin-Hsi Chen
Khurshid Ahmad, Maria Teresa Musacchio, Giuseppe Palumbo
Khurshid Ahmad, Maria Teresa Musacchio, Giuseppe Palumbo
From Natural Language to Databases via Ontologies
Leonardo Lesmo, Livio Robaldo
T2O - Recycling Thesauri into a Multilingual Ontology
José João Almeida, Alberto Simões
Towards an Ontology for Art and Colours
Luciana Bordoni, Tiziana Mazzoli
SYMBERED - a Symbol-Concept Editing Tool
Mats Lundälv, Katarina Mühlenbock, Bengt Farre, Annika Brännström,
A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Yoji Kiyota, Hiroshi Nakagawa
A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
Hennie Brugman, Véronique Malaisé, Luit Gazendam
Multilingual Lexical Semantic Resources for Ontology Translation
Thierry Declerck, Asunción Gómez Pérez, Ovidiu Vela, Zeno Gantner, David Manzano-Macho
Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology
Anna Estellés, Amparo Alcina, Victoria Soler
Corpógrafo V3 From Terminological Aid to Semi-automatic Knowledge Engineering
Luís Sarmento, Belinda Maia, Diana Santos, Ana Pinto and Luís Cabral
Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Yasuko Senda, Yasusi Sinohara, Manabu Okumura
P8-E Evaluation of Question Answering & Summarisation
Chairperson:Barbara Di Eugenio
A Tree Kernel approach to Question and Answer Classification in Question Answering Systems
Alessandro Moschitti and Roberto Basili
Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
Irene Cramer, Jochen L. Leidner, Dietrich Klakow
Component Evaluation in a Question Answering System
Luís Fernando Costa, Luís Sarmento
FRASQUES : A Question Answering system in the EQueR evaluation campaign
Brigitte Grau, Anne-Laure Ligozat, Isabelle Robba, Anne Vilnat, Laura Monceaux
Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue
Tomoyosi Akiba
Evaluation for Scenario Question Answering Systems
Matthew W. Bilotti and Eric Nyberg
Towards Holistic Summarization - Selecting Summaries, Not Sentences
Martin Hassel & Jonas Sjöbergh
Computer-aided summarisation - what the user really wants
Constantin Orăsan and Laura Hasler
P9-M Multimodal Tools
Chairperson:Stelios Piperidis
ANNEX - a web-based Framework for Exploiting Annotated Media Resources
Peter Berck, Albert Russel
ELAN: a Professional Framework for Multimodality Research
Peter Wittenburg, Hennie Brugman, Albert Russel, Alex Klassmann, Han Sloetjes
TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
Andrei Popescu-Belis and Maria Georgescul
The Eclipse Annotator: an extensible system for multimodal corpus creation
Fabian Behrens, Jan-Torsten Milde
A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
Kazuaki Maeda, Haejoong Lee, Julie Medero, Stephanie Strassel
Visual Surveillance and Video Annotation and Description
Khurshid Ahmad, Craig Bennett & Tim Oliver
P10-S Speech Corpora & Annotation
Chairperson:Bruce Millar
DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus
Nina Grønnum
Layered Speech-Act Annotation for Spoken Dialogue Corpus
Yuki Irie, Shigeki Matsubara, Nobuo Kawaguchi, Yukiko Yamaguchi, Yasuyoshi Inagaki
A Syntactically Annotated Corpus of Japanese Spoken Monologue
Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Naoto Kato, Yasuyoshi Inagaki
Slips and errors in spoken data transcription
Isabella Chiari
A BLARK extension for temporal annotation mining
Dafydd Gibbon, Flaviane Romani Fernandes, Thorsten Trippel
Annotating Information Structure in a Corpus of Spoken Danish
Patrizia Paggio
A French Non-Native Corpus for Automatic Speech Recognition
Tien-Ping Tan, Laurent Besacier
General and Task-Specific Corpus Resources for Polish Adult Learners of English
Anna Bogacka, Katarzyna Dziubalska- Kołaczyk, Grzegorz Krynicki, Dawid Pietrala, Mikołaj Wypych
SINOD - Slovenian non-native speech database
Andrej Žgank , Darinka Verdonik, Aleksandra Zögling Markuš, Zdravko Kačič
Bilingual speech corpus in two phonetically similar languages
Vicente Alabau, Carlos D. Martínez
Bikers Accessing the Web: The SmartWeb Motorbike Corpus
Moritz Kaiser, Hannes Mögele, Florian Schiel
Generation of Language Resources for the Development of Speech Technologies in Catalan
Asunción Moreno, Albert Febrer, Lluis Márquez
Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
José-Miguel Benedí, Eduardo Lleida, Amparo Varona, María-José Castro, Isabel Galiano, Raquel Justo, Iñigo López de Letona, Antonio Miguel
The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus
Sophie Rosset, Sandra Petel
Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
Hynek Bořil, Tomáš Bořil, Petr Pollák
A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)
Eline Westerhout, Paola Monachesi
Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb)
Renata Savy, Francesco Cutugno, Claudia Crocco


Day 1 | Day 2 | Day 3

Day 2, Thursday 25th May 2006
Schedule Programme Room
09:00-09:40 2 Keynote Speeches
  Enrico Motta:The role of language and mining technologies in engineering and utilizing the semantic web
  Noriko Kando: Evaluation of Information Access Technologies with Asian Languages at NTCIR Workshop
09:40-09:45 5-minute Break
09:45-11:05 5 Oral Sessions in Parallel
  O15-W: Treebanks and Acquisition
  O16-W: MWEs and Collocations
  O17-TGW: Multilingual Tools and Terminology
  O18-M: Multimodal Corpora Annotation and Tools
  O19-GW: Annotation and Annotation Tools
11:05-11:25 Coffee Break
11:25-13:25 5 Oral Sessions in Parallel
  O20-W: Multilingual Corpora
  O21-TE: Structuring Terminology
  O22-GW: Infrastructures, Methodologies and Standards
  O23-SG: Speech Corpora & Annotation
  O24-EW: Evaluation of Information Retrieval
13:30-14:40 Lunch Break
14:40-16:40 5 Oral Sessions in Parallel
  O25-WE: Machine Translation and Evaluation
  O26-W: Question Answering
  O27-G: Large Programs, Data Centers and International Cooperation
  O28-SE: ASR, Tools and Evaluation
  O29-E: Evaluation Semantics and Senses
16:40-17:00 Coffee Break
17:00-18:20 Poster Sessions
  P11-W: Computational Lexicons
Module 7 - 2nd floor
  P12-W: Named Entity
Module 7 - 2nd floor
  P13-W: Corpora
Module 7 - 2nd floor
  P14-GW: General Issues and Large Programs
Module 7 - 2nd floor
  P15-W: Multiwords & Collocations
Module 7 - 2nd floor
  P16-T: Terminology Multiwords & Collocations
Module 7 - 2nd floor
  P17-E: Evaluation: annotation, acquisition, alignment, recognition
Module 7 - 2nd floor
  P18-M: Multimodal Corpora and Annotation
Module 7 - 3rd floor
  P19-ES: Evaluation : Speech
Module 7 - 3rd floor
  P20-S: Speech Corpora for TTS Systems and Emotion and Speech tools
Module 7 - 3rd floor
18:20-18:25 5-minute Break
18:25-19:45 1 Panel & 4 Oral Sessions in Parallel
18:25-19:45 Panel:
Resources for the Processing of Affect in Interactions
Moderator: Nick Campbell
  O30-W: Parallel Corpora & MT
  O31-W: Corpus Construction & Annotation
  O32-EW: Coreference, Summarisation and Evaluation
  O33-EMT: Evaluation of Multimodality & Dialogue

Day 1 | Day 2 | Day 3


O15-W Treebanks and Acquisition
Chairperson:Dan Cristea
9.45-10.05 Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
Felice Dell’Orletta, Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli
10.05-10.25 Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
Václav Novák, Jan Hajič
10.25-10.45 Developing and Using a Pilot Dialectal Arabic Treebank
Mohamed Maamouri, Ann Bies, Tim Buckwalter, Mona Diab, Nizar Habash, Owen Rambow, Dalila Tabessi
10.45-11.05 Generating Typed Dependency Parses from Phrase Structure Parses
Marie-Catherine de Marneffe, Bill MacCartney and Christopher D. Manning
O16-W MWEs and Collocations
Chairperson:Monica Monachini
9.45-10.05 The Italian Metaphor Database
Antonietta Alonge
10.05-10.25 The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
Rainer Osswald, Hermann Helbig, Sven Hartrumpf
10.25-10.45 Using collocations from comparable corpora to find translation equivalents
Serge Sharoff, Bogdan Babych, Anthony Hartley
10.45-11.05 The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research
Manfred Sailer and Beata Trawiński
O17-TGW Multilingual Tools and Terminology
Chairperson:Hitoshi Isahara
9.45-10.05 Aligning Multilingual Thesauri
Dan Ştefănescu, Dan Tufiş
10.05-10.25 A Development Tool For Multilingual Ontology-based Conceptual Dictionaries
G. Ajani, G. Boella, L. Lesmo, M. Martin, A. Mazzei, P. Rossi
10.25-10.45 Reconsidering Language Identification for Written Language Resources
Baden Hughes, Timothy Baldwin, Steven Bird, Jeremy Nicholson and Andrew MacKinlay
10.45-11.05 Champollion: A Robust Parallel Text Sentence Aligner
Xiaoyi Ma
O18-M Multimodal Corpora Annotation and Tools
Chairperson:Gregory Grefenstette
9.45-10.05 NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
John Niekrasz, Alexander Gruenstein
10.05-10.25 Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms
Katerina Pastra
10.25-10.45 H. C. Andersen Conversation Corpus
Niels Ole Bernsen, Laila Dybkjær and Svend Kiilerich
10.45-11.05 A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Daniel Sonntag, Massimo Romanelli
O19-GW Annotation and Annotation Tools
Chairperson:Chu-Ren Huang
9.45-10.05 SALTO – A Versatile Multi-Level Annotation Tool
Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski and Sebastian Pado
10.05-10.25 Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources
Magesh Balasubramanya, Michael Higgins, Peter Lucas, Jeff Senn and Dominic Widdows
10.25-10.45 SHALMANESER – A Toolchain For Shallow Semantic Parsing
Katrin Erk and Sebastian Padó
10.45-11.05 Building Annotated Written and Spoken Arabic LR’s in NEMLAR Project
M. Yaseen, M. Attia, B. Maegaard, K. Choukri, N. Paulsson, S. Haamid, S. Krauwer, C. Bendahman, H. Fersøe, M. Rashwan, B. Haddad, C. Mukbel, A. Mouradi, A. Al-Kufaishi, M. Shahin, N. Chenfour, A. Ragheb
O20-W Multilingual Corpora
Chairperson:Steven Krauwer
11.25-11.45 A Uniform Interface to Large-Scale Linguistic Resources
Serge Sharoff
11.45-12.05 Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Dragoş Ciobanu , Tony Hartley and Serge Sharoff
12.05-12.25 A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources
Anna Feldman, Jirka Hana, Chris Brew
12.25-12.45 Multilingual parallel treebanking: a lean and flexible approach
Jonas Kuhn, Michael Jellinghaus
12.45-13.05 Parallel Syntactic Annotation of Multiple Languages
Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J. Miller, Teruko Mitamura, Florence Reeder, Advaith Siddharthan
13.05-13.25 CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Jonas Granfeldt, Pierre Nugues, Malin Ågren, Jonas Thulin, Emil Persson, Suzanne Schlyter
O21-TE Structuring Terminology
Chairperson:Maria Teresa Pazienza
11.25-11.45 Hierarchical Relationships "is-a": Distinguishing Belonging, Inclusion and Part/of Relationships
Christophe Jouis
11.45-12.05 Building a network of topical relations from a corpus
Olivier Ferret
12.05-12.25 Lemma-oriented dictionaries, concept-oriented terminology and translation memories
André Le Meur, Marie-Jeanne Derouin
12.25-12.45 Hantology - A Linguistic Resource for Chinese Language Processing and Studying
Ya-Min Chou, Chu-Ren Huang
12.45-13.05 Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Jean-François Couturier, Sylvain Neuvel, Patrick Drouin
13.05-13.25 The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Beatrice Alex, Malvina Nissim, Claire Grover
O22-GW Infrastructures, Methodologies and Standards
Chairperson:Harry Bunt
11.25-11.45 Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community
Baden Hughes
11.45-12.05 LAMUS – the Language Archive Management and Upload System
Daan Broeder, Andreas Claus, Freddy Offenga, Romuald Skiba, Paul Trilsbeek, Peter Wittenburg
12.05-12.25 Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Maria Gavrilidou, Penny Labropoulou, Stelios Piperidis, Voula Giouli, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Khalid Choukri
12.25-12.45 Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
Felix Sasaki
12.45-13.05 Integrating Linguistic Resources: The American National Corpus Model
Nancy Ide, Keith Suderman
13.05-13.25 Foundations of Modern Language Resource Archives
Peter Wittenburg, Daan Broeder, Wolfgang Klein, Stephen Levinson, Laurent Romary
O23-SG Speech Corpora & Annotation
Chairperson:Shuichi Itahashi
11.25-11.45 Linguistic Resources for Speech Parsing
Ann Bies, Stephanie Strassel, Haejoong Lee, Kazuaki Maeda, Seth Kulick, Yang Liu, Mary Harper, Matthew Lease
11.45-12.05 Dependency-structure Annotation to Corpus of Spontaneous Japanese
Kiyotaka Uchimoto, Ryoji Hamabe, Takehiko Maruyama, Katsuya Takanashi, Tatsuya Kawahara and Hitoshi Isahara
12.05-12.25 Interoperability of audio corpora: the case of the French corpora
Olivier Baude, Michel Jacobson, Atanas Tchobanov, Richard Walter
12.25-12.45 An observatory on Spoken Italian linguistic resources and descriptive standards
Miriam Voghera, Francesco Cutugno
12.45-13.05 Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)
Annalisa Sandrelli, Claudio Bendazzoli
13.05-13.25 Low-cost Customized Speech Corpus Creation for Speech Technology Applications
Kazuaki Maeda, Christopher Cieri, Kevin Walker
O24-EW Evaluation of Information Retrieval
Chairperson:Dragomir Radev
11.25-11.45 Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents
Abdelrahim Abdelsapor, Noha Adly, Kareem Darwish, Ossama Emam, Walid Magdy, Magdi Nagi
11.45-12.05 Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST’s Lemmatiser
Hercules Dalianis and Bart Jongejan
12.05-12.25 Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks
Lun-Wei Ku, Yu-Ting Liang and Hsin-Hsi Chen
12.25-12.45 Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop
Atsushi Fujii, Makoto Iwayama, Noriko Kando
12.45-13.05 The Impact of Evaluation on Multilingual Information Retrieval System Development
Carol Peters
13.05-13.25 Blind Evaluation for Thai Search Engines
Shisanu Tongchim, Prapass Srichaivattana, Virach Sornlertlamvanich, Hitoshi Isahara
O25-WE Machine Translation and Evaluation
Chairperson:Alan Melby
14.40-15.00 IQмт: A Framework for Automatic Machine Translation Evaluation
Jesús Giménez, Enrique Amigó
15.00-15.20 A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
Andrei Popescu-Belis, Paula Estrella, Margaret King, Nancy Underwood
15.20-15.40 Error Analysis of Statistical Machine Translation Output
David Vilar, Jia Xu, Luis Fernando D’Haro, Hermann Ney
15.40-16.00 Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida and Hitoshi Isahara
16.00-16.20 MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
Alexandre Patry, Fabrizio Gotti and Philippe Langlais
16.20-16.40 Training a Statistical Machine Translation System without GIZA++
Arne Mauser, Evgeny Matusov, Hermann Ney
O26-W Question Answering
Chairperson:Bernardo Magnini
14.40-15.00 Towards Natural Interactive Question Answering
Gerhard Fliedner
15.00-15.20 Mining Knowledge from Wikipedia for the Question Answering task
Davide Buscaldi, Paolo Rosso
15.20-15.40 Language Challenges for Data Fusion in Question-Answering
Véronique Moriceau
15.40-16.00 Summarizing Answers for Complicated Questions
Liang Zhou, Chin-Yew Lin and Eduard Hovy
16.00-16.20 An Answer Bank for Temporal Inference
Sanda Harabagiu and Cosmin Adrian Bejan
16.20-16.40 Using Semantic Overlap Scoring in Answering TREC Relationship Questions
Gregory Marton, Boris Katz
O27-G Large Programs, Data Centers and International Cooperation
Chairperson:Makoto Nagao
14.40-15.00 Oriental COCOSDA: Past, Present and Future
Shuichi Itahashi, Chiu-yu Tseng, Satoshi Nakamura
15.00-15.20 KUNSTI – Knowledge Generation for Norwegian Language Technology
Bente Maegaard, Jens-Erik Fenstad, Lars Ahrenberg, Knut Kvale, Katarina Mühlenbock, Bernt-Erik Heid
15.20-15.40 The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources
Elisabeth D’Halleweyn, Jan Odijk, Lisanne Teunissen, Catia Cucchiarini
15.40-16.00 Techno-langue: The French National Initiative for Human Language Technologies (HLT)
Stéphane Chaudiron & Joseph Mariani
16.00-16.20 The BLARK concept and BLARK for Arabic
Bente Maegaard, Steven Krauwer, Khalid Choukri, Lise Damsgaard Jørgensen
16.20-16.40 More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Christopher Cieri, Mark Liberman
O28-SE ASR, Tools and Evaluation
Chairperson:Mary Harper
14.40-15.00 REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Manny Rayner, Pierrette Bouillon, Beth Ann Hockey, Nikos Chatzichrisafis
15.00-15.20 Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria
Tomoyuki Kato, Tomiki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
15.20-15.40 Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Julie Mauclair, Yannick Estève, Simon Petit-Renaud, Paul Deléglise
15.40-16.00 Discourse functions of duration in Mandarin: resource design and implementation
Shu-Chuan Tseng, Dafydd Gibbon
16.00-16.20 Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech
Jonathan G. Fiscus, Jerome Ajot, Nicolas Radde and Christophe Laprun
16.20-16.40 Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages
Susanne Burger, Zachary A. Sloane and Jie Yang
O29-E Evaluation Semantics and Senses
Chairperson:Branimir Boguraev
14.40-15.00 Construction of a FrameNet Labeler for Swedish Text
Richard Johansson, Pierre Nugues
15.00-15.20 Towards pertinent evaluation methodologies for word-space models
Magnus Sahlgren
15.20-15.40 Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
Sabine Schulte im Walde
15.40-16.00 Measuring Agreement on Set-valued Items (MASI) for Semantic and Pragmatic Annotation
Rebecca Passonneau
16.00-16.20 Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora
Anna Rumshisky, James Pustejovsky
16.20-16.40 Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
Roberto Navigli
O30-W Parallel Corpora & MT
Chairperson:Keith Miller
18.25-18.45 Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries
Andreas Eisele
18.45-19.05 Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Yoshihiko Nitta, Masashi Saraki and Satoru Ikehara
19.05-19.25 Creating a Large-Scale Arabic to French Statistical Machine Translation System
Saša Hasan, Anas El Isbihani, Hermann Ney
19.25-19.45 Corpus Support for Machine Translation at LDC
Xiaoyi Ma, Christopher Cieri
O31-W Corpus Construction & Annotation
Chairperson:Benjamin K. Tsou
18.25-18.45 Language Specific and Topic Focused Web Crawling
Olena Medelyan, Stefan Schulz, Jan Paetzold, Michael Poprat, Kornél Markó
18.45-19.05 RoCo-News: A Hand Validated Journalistic Corpus of Romanian
Dan Tufiş, Elena Irimia
19.05-19.25 Rule-Based Chunking and Reusability
Claire Grover and Richard Tobin
19.25-19.45 Corpus Annotation as a Test of a Linguistic Theory
Eva Hajičová and Petr Sgall
O32-EW Coreference, Summarisation and Evaluation
Chairperson:Baden Hughes
18.25-18.45 Evaluating Automatically Generated Timelines from the Web
Roberta Catizone, Angelo Dalli and Yorick Wilks
18.45-19.05 Transferring Coreference Chains through Word Alignment
Oana Postolache, Dan Cristea and Constantin Orasan
19.05-19.25 Coreference Resolution with and without Linguistic Knowledge
Olga Uryupina
19.25-19.45 Automated Summarization Evaluation with Basic Elements
Eduard Hovy, Chin-Yew Lin, Liang Zhou and Junichi Fukumoto
O33-EMT Evaluation of Multimodality & Dialogue
Chairperson:Dafydd Gibbon
18.25-18.45 Usability evaluation of 3G multimodal services in Telefónica Móviles España
Juan José Rodríguez Soler, Pedro Concejero Cerezo, Carlos Lázaro Ávila, Daniel Tapias Merino
18.45-19.05 Act-Topic Patterns for Automatically Checking Dialogue Models
Hans Dybkjær and Laila Dybkjær
19.05-19.25 Evaluation of multimodal components within CHIL
Djamel Mostefa ; Marie-Neige Garcia, Khalid Choukri
19.25-19.45 Dimensions in Dialogue Act Annotation
Harry Bunt
P11-W Computational Lexicons
Chairperson:Kyoko Kanzaki
  A Unified Structure for Dutch Dialect Dictionary Data
Folkert de Vriend, Lou Boves, Henk van den Heuvel, Roeland van Hout, Joep Kruijsen, Jos Swanenberg
  Dictionary Building with the Jibiki Platform: the GDEF case
Mathieu Mangeot, Antoine Chalvin Hungarian lexical database and morphological grammar
Viktor Trón, Péter Halácsy, Péter Rebrus, András Rung, Péter Vajda, Eszter Simon
  Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes
Bruno Cartoni
  Building Slovene WordNet
Tomaž Erjavec, Darja Fišer
  Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MORPHOSAURUS Subword Lexicon
Stefan Schulz, Kornél Markó, Philipp Daumke, Udo Hahn, Susanne Hanser, Percy Nohama, Roosewelt Leite de Andrade, Edson Pacheco, Martin Romacker
  LEXADV - a multilingual semantic Lexicon for Adverbs
Sanni Nimb
  WS4LR: A Workstation for Lexical Resources
Cvetana Krstev, Ranka Stankovic, Duško Vitas and Ivan Obradovic
  LexikoNet - a lexical database based on type and role hierarchies
Alexander Geyken, Norbert Schrader
  Towards a Generative Lexical Resource: The Brandeis Semantic Ontology
James Pustejovsky, Catherine Havasi, Jessica Littman, Anna Rumshisky, Marc Verhagen
  Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning
Toshiyuki Kanamaru, Masaki Murata and Hitoshi Isahara
  Linking Verbal Entries of Different Lexical Resources
Adriana Roventini
  Merging two Ontology-based Lexical Resources
Nilda Ruimy
  Towards machine-readable lexicons for South African Bantu languages
Sonja E. Bosch, Laurette Pretorius and Jackie Jones
  Valency Lexicon of Czech Verbs: Alternation-Based Model
Markéta Lopatková, Zdenek Žabokrtský, Karolina Skwarska
  Syntactic Lexicon of Polish Predicative Nouns
Grazyna Vetulani, Zygmunt Vetulani, Tomasz Obrebski
  Building a Lexical Database for an Interactive Joke-Generator
R. Manurung, D. O’Mara, H. Pain, G. Ritchie, A. Waller
P12-W Named Entity
Chairperson:Irina Prodanof
  Identifying Named Entities in Text Databases from the Natural History Domain
Caroline Sporleder, Marieke van Erp, Tijn Porcelijn, Antal van den Bosch, Pim Arntzen
  The Information Commons Gazetteer: A Public Resource of Populated Places and Worldwide Administrative Divisions
Peter Lucas, Magesh Balasubramanya, Dominic Widdows and Michael Higgins
  Named Entity Extraction with Conjunction Disambiguation
Pawel Mazur and Robert Dale
  OntoNERdIE - Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources
Ulrich Schäfer
P13-W Corpora
Chairperson:Thierry Declerck
  Creating Tools for Morphological Analysis of Sumerian
Valentin Tablan, Wim Peters, Diana Maynard, Hamish Cunningham
  A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
Christoph Benzmüller, Helmut Horacek, Henri Lesourd, Ivana Kruijff-Korbayova, Marvin Schiller, Magdalena Wolska
  Comparing linguistic information in treebank annotations
Cristina Bosco, Vincenzo Lombardo
  A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation
Irene Langkilde-Geary, Justin Betteridge
  The ALVIS Format for Linguistically Annotated Documents
A. Nazarenko, E. Alphonse, J. Derivière, T. Hamon, G. Vauvert, D. Weissenbacher
  BACO - A large database of text and co-occurrences
Luís Sarmento
  The African Varieties of Portuguese: Compiling Comparable Corpora and Analysing Data-derived Lexicon
Maria Fernanda Bacelar do Nascimento, José Bettencourt Gonçalves, Luísa Pereira, Antónia Estrela, Afonso Pereira, Rui Santos and Sancho M. Oliveira
  On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Liviu P. Dinu, Anca Dinu
  Corpus Portal for Search in Monolingual Corpora
Uwe Quasthoff, Matthias Richter, Christian Biemann
  Morphological annotation of Korean with Directly Maintainable Resources
Ivan Berlocher, Hyun-gue Huh, Eric Laporte, Jee-sun Nam
  Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
Antal van den Bosch, Ineke Schuurman, Vincent Vandeghinste
  Syntactic Annotation of Large Corpora in STEVIN
Gertjan van Noord, Ineke Schuurman, Vincent Vandeghinste
  Skeleton Parsing in Chinese: Annotation Scheme and Guidelines
May Lai-Yin Wong
  The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language
Kari Tenfjord, Paul Meurer, Knut Hofland
  Corpus Development and Publication
Andrew W. Cole
  Building a historical corpus for Classical Portuguese: some technological aspects
Maria Clara Paixão de Sousa, Thorsten Trippel
  UAM Text Tools - a flexible NLP architecture
Tomasz Obrebski, Michal Stolarski
P14-GW General Issues and Large Programs
Chairperson:Stephanie Strassel
  Development of the First LRs for Macedonian: Current Projects
Ruska Ivanovska-Naskova
  Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
Tonio Wandmacher, Jean-Yves Antoine
  Dialectal resources on-line: the ALT-Web experience
Nella Cucurullo, Simonetta Montemagni, Matilde Paoli, Eugenio Picchi, Eva Sassolini
  Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Monica Monachini, Nicoletta Calzolari, Khalid Choukri, Jochen Friedrich, Giulio Maltese, Michele Mammini, Jan Odijk, Marisa Ulivieri
  Next Generation Language Resources using Grid
Federico Calzolari, Eva Sassolini, Manuela Sassi, Sebastiana Cucurullo, Eugenio Picchi, Francesca Bertagna, Alessandro Enea, Monica Monachini, Claudia Soria, Nicoletta Calzolari
  LEXUS, a web-based tool for manipulating lexical resources
Marc Kemps-Snijders, Mark-Jan Nederhof, Peter Wittenburg
  Metadata Profile in the ISO Data Category Registry
Freddy Offenga, Daan Broeder, Peter Wittenburg, Julien Ducret, Laurent Romary
  The pragmatic combination of different cross-lingual resources for multilingual information services
Hans Uszkoreit, Feiyu Xu, Jörg Steffen, Ilhan Aslan
  Exploring opportunities for comparability and enrichment by linking lexical databases
Isa Maks, Bob Boelhouwer
P15-W Multiwords & Collocations
Chairperson:Marko Tadic
  The Design and Construction of A Chinese Collocation Bank
Ruifeng Xu, Qin Lu, Sujian Li
  Local Document Relevance Clustering in IR Using Collocation Information
Leo Wanner, Margarita Alonso Ramos
  Semi-automatic Building of Swedish Collocation Lexicon
Silvie Cinková, Pavel Pecina, Petr Podveský and Pavel Schlesinger
  Elaborating the parameterized Equivalence Class Method for Dutch
Nicole Grégoire
  COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Amália Mendes, Sandra Antunes, Maria Fernanda Bacelar do Nascimento, João Miguel Casteleiro, Luísa Pereira and Tiago Sá
  Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho
Winston N. Anderson and Petronella M. Kotzé
P16-T Terminology Multiwords & Collocations
Chairperson:Khurshid Ahmad
  KB-N: Automatic Term Extraction from Knowledge Bank of Economics
Magnar Brekke, Kai Innselset, Marita Kristiansen, Kari Øvsthus
  Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
Panunzi Alessandro, Fabbri Marco, Moneglia Massimo
  Extraction of Cross Language Term Correspondences
Hans Hjelm
  Extraction tools for collocations and their morphosyntactic specificities
Julia Ritz, Ulrich Heid
  Semantic-Based Keyword Recovery Function for Keyword Extraction System
Rachada Kongkachandra and Kosin Chamnongthai
P17-E Evaluation: annotation, acquisition, alignment, recognition
Chairperson:Hanne Fersøe
  A Grapheme-Based Approach for Accent Restoration in Gikuyu
Peter W. Wagacha, Guy De Pauw and Pauline W. Githinji
  On Automatic Assignment of Verb Valency Frames in Czech
Jirí Semecký
  A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)
Kyo Kageura and Genichiro Kikui
  Inter-annotator Agreement on a Multilingual Semantic Annotation Task
Rebecca Passonneau, Nizar Habash, Owen Rambow
  A highly accurate Named Entity corpus for Hungarian
György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik
  Turning a Dependency Treebank into a PSG-style Constituent Treebank
Eckhard Bick
  Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval
Motoko Ueyama
  Recognizing Acronyms and their Definitions in Swedish Medical Texts
Dimitrios Kokkinakis and Dana Dannélls
  Evaluation of multilingual text alignment systems: the ARCADE II project
Yun-Chuang Chiao, Olivier Kraif, Dominique Laurent, Thi Minh Huyen Nguyen, Nasredine Semmar, François Stuck, Jean Véronis, Wajdi Zaghouani
  Finding representative sets of dialect words for geographical regions
Marko Salmenkivi
  HAREM: An Advanced NER Evaluation Contest for Portuguese
Diana Santos, Nuno Seco, Nuno Cardoso and Rui Vilela
P18-M Multimodal Corpora and Annotation
Chairperson:Patrizia Paggio
  BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction
Jan Frederik Maas, Britta Wrede
  Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data
Jérémie Segouat, Annelies Braffort and Emilie Martin
  A German Sign Language Corpus of the Domain Weather Report
Jan Bungeroth, Daniel Stein, Philippe Dreuw, Morteza Zahedi, Hermann Ney
  Annotation of Emotions in Real-Life Video Interviews: Variability between Coders
S. Abrilian, L. Devillers, J-C. Martin
  Creation of a Corpus of Multimodal Spontaneous Expressions of Emotions in Human-Machine Interaction
G. Le Chenadec, V. Maffiolo, N. Chateau and J.M. Colletta
  Wizard-of-Oz Data Collection for Perception and Interaction in Multi-User Environments
Petra-Maria Strauß, Holger Hoffmann, Wolfgang Minker, Heiko Neumann, Günther Palm, Stefan Scherer, Friedhelm Schwenker, Harald Traue, Welf Walter and Ulrich Weidenbacher
  The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Ivana Kruijff-Korbayová, Tilman Becker, Nate Blaylock, Ciprian Gerstenberger, Michael Kaißer, Peter Poller, Verena Rieser, Jan Schehl
P19-ES Evaluation : Speech
Chairperson:Laila Dybkaer
  Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
Jeongwoo Ko, Fumihiko Murase, Teruko Mitamura, Eric Nyberg, Masahiko Tateishi, Ichiro Akahori
  SUS-based Method for Speech Reception Threshold Measurement in French
Alexander Raake, Brian FG Katz
  A joint intelligibility evaluation of French text-to-speech systems: the EvaSy SUS/ACR campaign
Philippe Boula de Mareüil, Christophe d’ Alessandro, Alexander Raake, Gérard Bailly, Marie-Neige Garcia, Michel Morel
  Edit Distance: A Metric for Machine Translation Evaluation
Mark Przybocki, Gregory Sanders, Audrey Le
  Evaluation Methods of a Linguistically Enriched Translation Memory System
Gábor Hodász
  Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding
Jamal Laoudi, Calandra Tate, Clare R. Voss
  Results of the French Evalda-Media evaluation campaign for literal understanding
H. Bonneau-Maynard, C. Ayache, F. Bechet, A. Denis, A. Kuhn, F. Lefevre, D. Mostefa, M. Quignard, S. Rosset, C. Servan, J. Villaneau
  Human and machine recognition as a function of SNR
Bernt Andrassy and Harald Hoege
  Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries
Jan Hoidekr, J.V. Psutka, Aleš Pražák, Josef Psutka
  Field Evaluation of a Single-Word Pronunciation Training System
Niels Ole Bernsen, Thomas K. Hansen, Svend Kiilerich and Torben Kruchov Madsen
  Acceptance Testing of a Spoken Language Translation System
Rafael Banchs, Antonio Bonafonte, Javier Pérez
  Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Evie Coussé & Steven Gillis
  Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
Dirk Bühler and Wolfgang Minker
  MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España
Juan José Rodríguez Soler, Pedro Concejero Cerezo, Daniel Tapias Merino, Alberto José Sánchez
P20-S Speech Corpora for TTS Systems and Emotion and Speech tools
Chairperson:Justus Roux
  ProGmatica: a Prosodic and Pragmatic Database for European Portuguese
Daniela Braga, Luís Coelho, João P. Teixeira, Diamantino Freitas
  FonDat1: A Speech Synthesis Corpus for Norwegian
Ingunn Amdal and Torbjørn Svendsen
  Spanish Synthesis Corpora
Martí Umbert, Asunción Moreno, Pablo Agüero, Antonio Bonafonte
  SmartWeb UMTS Speech Data Collection - The SmartWeb Handheld Corpus
Hannes Mögele, Moritz Kaiser, Florian Schiel
  Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation
Christoph Draxler, Klaus Jänsch
  An Open Source Prosodic Feature Extraction Tool
Zhongqiang Huang, Lei Chen, Mary Harper
  Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort
Benjamin K. Tsou, Tom B. Y. Lai, K. K. Sin, Lawrence Y. L. Cheung
  Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Iker Luengo

Day 1 | Day 2 | Day 3

Day 3, Friday 26th May 2006
Schedule Programme Room
09:00-09:40 2 Keynote Speeches
  Eduard Hovy: Toward Semantic Corpora: Creating Concepts from Words via Senses, and Storing them in an Ontology
  Jean Carletta: Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus
09:40-09:45 5-minute Break
09:45-11:05 5 Oral Sessions in Parallel
  O34-WE: Morphology & Tagging
  O35-TW: Terminology & Knowledge Acquisition
  O36-W: Semantically Annotated Corpora
  O37-S: Lexicon and Pronunciation
  O38-W: Lexicon Syntax and Semantics
11:05-11:25 Coffee Break
11:25-13:25 1 Panel & 4 Oral Sessions in Parallel

Strategic Directions of National & International Research Funding

Moderator: Hans Uszkoreit

  O39-WT: Lexicons, Semantics, Clustering Tools
  O40-T: Creating Terminologies
  O41-M: Emotions
  O42-EW: Question Answering Evaluation
13:30-14:40 Lunch Break
14:40-16:00 Poster Sessions
  P21-W: Corpora, Lexicons & Multilinguality
Module 7 - 2nd floor
  P22-W: Tools for Corpora and Lexicons
Module 7 - 2nd floor
  P23-G: Web services and Digital Archives and Libraries
Module 7 - 2nd floor
  P24-W: Semantics, Ontologies, Semantic Web
Module 7 - 2nd floor
  P25-T: Terminology Lexicons & Tools
Module 7 - 2nd floor
  P26-E: Evaluation: Lexicons, Tools
Module 7 - 2nd floor
  P27-E: Evaluation methodologies & Evaluation of Language Technologies
Module 7 - 3rd floor
  P28-M: Multimodal: Methodologies & Natural Interaction
Module 7 - 3rd floor
  P29-S: S2S Translation
Module 7 - 3rd floor
  P30-S: Speech: TTS & ASR
Module 7 - 3rd floor
16:00-16:20 Coffee Break
16:20-17:20 Antonio Zampolli Prize Talk
17:20-17:25 5-minute Break
17:25-18:20 Closing Session by Programme Committee

Day 1 | Day 2 | Day 3

O34-WE Morphology & Tagging
Chairperson:Kiril Simov
9.45-10.05 Morphological Tools for Six Small Uralic Languages
Attila Novák
10.05-10.25 A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Joris Vaneyghen, Guy De Pauw, Dirk Van Compernolle, Walter Daelemans
10.25-10.45 Part-of-Speech Tagging of Transcribed Speech
Margot Mieskes, Michael Strube
10.45-11.05 Feature-Based Encoding and Querying Language Resources with Character Semantics
Baden Hughes, Dafydd Gibbon and Thorsten Trippel
O35-TW Terminology & Knowledge Acquisition
Chairperson:Louise Guthrie
9.45-10.05 Terminological Resources Acquisition Tools: Towards a User-Oriented Evaluation Model
Widad Mustafa el Hadi, Ismail Timimi, Marianne Dabbadie, Khalid Choukri, Olivier Hamon, Yun-Chuang Chiao
10.05-10.25 Word Knowledge Acquisition for Computational Lexicon Construction
Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich and Hitoshi Isahara
10.25-10.45 Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
Mathieu Lafourcade
10.45-11.05 Clustering acronyms in biomedical text for disambiguation
Naoaki Okazaki and Sophia Ananiadou
O36-W Semantically Annotated Corpora
Chairperson:Eva Hajicova
9.45-10.05 I-CAB: the Italian Content Annotation Bank
B. Magnini, E. Pianta, C. Girardi, M. Negri, L. Romano, M. Speranza, V. Bartalesi Lenzi and R. Sprugnoli
10.05-10.25 The SALSA Corpus: a German corpus resource for lexical semantics
Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Padó and Manfred Pinkal
10.25-10.45 Adding multi-layer semantics to the Greek Dependency Treebank
Harris Papageorgiou, Elina Desipri, Maria Koutsombogera, Kanella Pouli, Prokopis Prokopidis
10.45-11.05 A Preliminary Study for Building the Basque PropBank
Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Pociello
O37-S Lexicon and Pronunciation
Chairperson:Ute Ziegenhain
9.45-10.05 SI-PRON: a Pronunciation Lexicon for Slovenian
Jerneja Žganec Gros, Varja Cvetko-Orešnik, Primož Jakopin, Aleš Mihelič
10.05-10.25 “Casselberveetovallarga” and Other Unpronounceable Places: The CrossTowns Corpus
Stefan Schaden, Ute Jekosch
10.25-10.45 Lexicon Development for Varieties of Spoken Colloquial Arabic
David Graff, Tim Buckwalter, Hubert Jin, Mohamed Maamouri
10.45-11.05 Experimental detection of vowel pronunciation variants in Amharic
Thomas Pellegrini and Lori Lamel
O38-W Lexicon Syntax and Semantics
Chairperson:Vito Pirrelli
9.45-10.05 Semantic Descriptors: The Case of Reflexive Verbs
Milena Slavcheva
10.05-10.25 A Large Subcategorization Lexicon for Natural Language Processing Applications
Anna Korhonen, Yuval Krymolowski, Ted Briscoe
10.25-10.45 PrepNet: a Multilingual Lexical Description of Prepositions
Patrick Saint-Dizier
10.45-11.05 Extending VerbNet with Novel Verb Classes
Karin Kipper, Anna Korhonen, Neville Ryant, Martha Palmer
O39-WT Lexicons, Semantics, Clustering Tools
Chairperson:Nancy Ide
11.25-11.45 Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
Md. Aminul Islam and Diana Inkpen
11.45-12.05 Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain
Dominic Widdows, Adil Toumouh, Beate Dorow, Ahmed Lehireche
12.05-12.25 Dealing with Imbalanced Data using Bayesian Techniques
Manolis Maragoudakis, Katia Kermanidis, Aristogiannis Garbis and Nikos Fakotakis
12.25-12.45 An Introduction to NLP-based Textual Anonymisation
Ben Medlock
12.45-13.05 Proper Names and Linguistic Dynamics
Rita Marinelli, Remo Bindi
13.05-13.25 The Look and Feel of a Confident Entailer
Vasile Rus, Art Graesser
O40-T Creating Terminologies
Chairperson:Udo Hahn
11.25-11.45 A Methodology for Developing Multilingual Resources for Terminology
Marie-Claude L’ Homme, Hee Sook Bae
11.45-12.05 Towards a terminological resource for biomedical text mining
Goran Nenadic, Naoki Okazaki, Sophia Ananiadou
12.05-12.25 Development of Linguistic Ontology on Natural Sciences and Technology
B. Dobrov, N. Loukachevitch
12.25-12.45 Compiling large language resources using lexical similarity metrics for domain taxonomy learning
Ronny Melz, Pum-Mo Ryu, Key-Sun Choi
12.45-13.05 The LOIS Project
Wim Peters, Maria Teresa Sagri, Daniela Tiscornia, Sara Castagnoli
13.05-13.25 Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Stefanie Anstein, Gerhard Kremer, Uwe Reyle
O41-M Emotions
Chairperson:Niels Ole Bernsen
11.25-11.45 Fear-type emotions of the SAFE Corpus: annotation issues
Chloé Clavel, Ioana Vasilescu, Laurence Devillers, Thibaut Ehrette and Gaël Richard
11.45-12.05 Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
L. Devillers, R. Cowie, J-C. Martin, E. Douglas-Cowie, S. Abrilian, M. McRorie
12.05-12.25 Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
Kornel Laskowski and Susanne Burger
12.25-12.45 Annotating Emotion in Meetings
Dennis Reidsma, Dirk Heylen, Roeland Ordelman
12.45-13.05 Improving Automatic Emotion Recognition from Speech via Gender Differentiation
Thurid Vogt, Elisabeth André
13.05-13.25 Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
J.-C. Martin, G. Caridakis, L. Devillers, K. Karpouzis, S. Abrilian
O42-EW Question Answering Evaluation
Chairperson:Carol Peters
11.25-11.45 Question Answering Evaluation Survey
L. Gillard, P. Bellot, M. El-Bèze
11.45-12.05 Exploiting Multiple Semantic Resources for Answer Selection
Jeongwoo Ko, Laurie Hiyakumoto, Eric Nyberg
12.05-12.25 Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering
Hideki Shima, Mengqiu Wang, Frank Lin, Teruko Mitamura
12.25-12.45 Impact of Question Decomposition on the Quality of Answer Summaries
Finley Lacatusu, Andrew Hickl, Sanda Harabagiu
12.45-13.05 The Multilingual Question Answering Track at CLEF
Bernardo Magnini, Danilo Giampiccolo, Lili Aunimo, Christelle Ayache, Petya Osenova, Anselmo Peñas, Maarten de Rijke, Bogdan Saccaleanu, Diana Santos, Richard Sutcliffe
13.05-13.25 EQueR: the French Evaluation campaign of Question-Answering Systems
Christelle Ayache, Brigitte Grau, Anne Vilnat
P21-W Corpora, Lexicons & Multilinguality
Chairperson:Bayan Shawar
  Building a Swedish-Turkish Parallel Corpus
Beáta Bandmann Megyesi, Anna Sågvall Hein, Éva Csató Johanson
  Acquis Communautaire Sentence Alignment using Support Vector Machines
Alexandru Ceausu, Dan Stefanescu, Dan Tufis
  The English-Slovene ACQUIS corpus
Tomaž Erjavec
  The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
Ralf Steinberger, Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaž Erjavec, Dan Tufis, Dániel Varga
  The MULINCO corpus and corpus platform
Bente Maegaard, Lene Offersgaard, Lina Henriksen, Hanne Jansen, Xavier Lepetit, Costanza Navarretta, Claus Povlsen
  ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts
Jörg Tiedemann
  MORBO/COMP: a multilingual database of compound words
Emiliano Guevara, Sergio Scalise, Antonietta Bisetto, Chiara Melloni
  Documenting variation across Europe and the Mediterranean: The Pavia Typological Database
Andrea Sansò
  From PropBank to EngValLex: Adapting PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Silvie Cinková
  Building a Parallel Multilingual Corpus (Arabic-Spanish-English)
Doaa Samy, Antonio Moreno Sandoval, José M. Guirao, Enrique Alfonseca
  Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Wei-Yun Ma, Chu-Ren Huang
  SAM - an annotation editor for parallel texts
Markus Geilfuss, Jan-Torsten Milde
P22-W Tools for Corpora and Lexicons
Chairperson:Takenobu Tokunaga
  CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus
Jorge Kinoshita, Laís do Nascimento Salvador, Carlos Eduardo Dantas de Menezes
  Tree Searching/Rewriting Formalism
Petr Nemec
  A Lexicalized Tree-Adjoining Grammar for Vietnamese
H. Phuong Le, T. M. Huyen Nguyen, Laurent Romary, Azim Roussanaly
  Improving coverage and parsing quality of a large-scale LFG for German
Christian Rohrer, Martin Forst
  Open Source Corpus Analysis Tools for Malay
Timothy Baldwin and Su’ad Awab
  MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Joakim Nivre, Johan Hall and Jens Nilsson
  Deep non-probabilistic parsing of large corpora
Benoît Sagot and Pierre Boullier
  FreP: An Electronic Tool for Extracting Frequency Information of Phonological Units from Portuguese Written Text
S. Frota, M. Vigário and & F. Martins
  Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Roger Levy and Galen Andrew
  Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho
Ulrich Heid, Elsabé Taljard, Danie J. Prinsloo
  A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation
E. Uí Dhonnchadha, J. Van Genabith
  Using a morphological analyzer in high precision POS tagging of Hungarian
Péter Halácsy, András Kornai, Csaba Oravecz, Viktor Trón, Dániel Varga
  Annotation of Temporal Relations with Tango
Marc Verhagen, Robert Knippen, Inderjeet Mani, James Pustejovsky
  BULB: A Unified Lexical Browser
Catherine Havasi, James Pustejovsky, Marc Verhagen
  Preprocessing and Tokenisation Standards in DELPH-IN Tools
Benjamin Waldron, Ann Copestake, Ulrich Schäfer, Bernd Kiefer
  A Corpus Search System Utilizing Lexical Dependency Structure
Yoshihide Kato, Shigeki Matsubara, Yasuyoshi Inagaki
  A framework for real-time dictionary updating
Cédrick Fairon, Sébastien Paumier
  The wraetlic NLP suite
Enrique Alfonseca, Antonio Moreno-Sandoval, José María Guirao and Maria Ruiz-Casado
  FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
J. Atserias, B. Casas, E. Comelles, M. González, L. Padró, M. Padró
P23-G Web services and Digital Archives and Libraries
Chairperson:Zygmunt Vetulani
  Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
R. Bernardi, D. Calvanese, L. Dini, V. Di Tommaso, E. Frasnelli, U. Kugler, B. Plank
  Technologies for a Federation of Language Resource Archives
Daan Broeder, Freddy Offenga, Peter Wittenburg, Peter van der Kamp, David Nathan, Sven Strömqvist
  Ontology-based Language Archive Utilization
Peter Berck, Hans-Jörg Bibiko, Marc Kemps-Snijders, Albert Russel, Peter Wittenburg
  An API for accessing the Data Category Registry
Marc Kemps-Snijders, Julien Ducret, Laurent Romary, Peter Wittenburg
  Functioning of the Centre for Dutch Language and Speech Technology
Michel Boekestein, Griet Depoorter, Remco van Veenendaal
  On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language
Emiko Suzuki, Tomomi Suzuki and Kyoko Kakihana
P24-W Semantics, Ontologies, Semantic Web
Chairperson:Núria Bel
  LexiPass methodology: a conceptual path from frames to senses and back
Alessandro Oltramari
  Principles for annotating and reasoning with spatial information
Paul C. Morarescu
  Ontology-based Information Extraction with SOBA
Paul Buitelaar, Philipp Cimiano, Stefania Racioppa, Melanie Siegel
  Detection of inconsistencies in concept classifications in a large dictionary - Toward an improvement of the EDR electronic dictionary
Eiko Yamamoto, Kyoko Kanzaki and Hitoshi Isahara
  Alexandria: A Powerful Multilingual Resource For web
Dominique Dutoit
  Ontology Driven K-Portal Construction and K-Service Provision
Asanee Kawtrakul, Chaveevan Pechsiri, Trakul Permpool, Dussadee Thamvijit, Phukao Sornprasert, Chaiyakorn Yingsaeree, Mukda Suktarachan
  The role of lexical resources in matching classification schemas
P. Bouquet, L. Serafini, S. Zanobini
  Shallow Semantic Annotation of Bulgarian
Kiril Simov and Petya Osenova
  Generic NLP Tools for Supporting Shallow Ontology Building
Thierry Declerck, Mihaela Vela
  Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars
Dorothee Beermann and Lars Hellan
  A Framework to Integrate Ubiquitous Knowledge Modeling
Porfírio Filipe, Nuno Mamede
P25-T Terminology Lexicons & Tools
Chairperson:Anthony Hartley
  Features of Terms in Actual Nursing Activities
Hiromi itoh Ozaku, Akinori Abe, Kaoru Sagara, Noriaki Kuwahara and Kiyoshi Kogure
  Automated detection and annotation of term definitions in German text corpora
Angelika Storrer, Sandra Wellinghoff
  SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
Judit Feliu, Jorge Vivaldi, M. Teresa Cabré
  A Study on Terminology Extraction Based on Classified Corpora
Chen Yirong, Lu Qin, Li Wenjie, Sui Zhifang, Ji Luning
  Automatic Detection of Orthographics Cues for Cognate Recognition
Andrea Mulloni and Viktor Pekar
  Toward a Pan-Chinese Thesaurus
Benjamin K. Tsou and Oi Yee Kwong
  Natural Language Processing: A Terminological And Statistical Approach
Gabriella Pardelli, Manuela Sassi, Sara Goggi, Paola Orsolini
  Detecting Inter-domain Semantic Shift using Syntactic Similarity
Masaki Itagaki, Anthony Aue, Takako Aikawa
P26-E Evaluation: Lexicons, Tools
Chairperson:Catia Cucchiarini
  Linguistic features modeling based on Partial New Cache
Kamel Smaïli, Caroline Lavecchia and Jean-Paul Haton
  Structuring a Domain Vocabulary in a General Knowledge Environment
Nilda Ruimy
  CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project
Rebecca Passonneau, Roberta Blitz, David Elson, Angela Giral, Judith Klavans
  A Conditional Random Field Framework for Thai Morphological Analysis
Canasai Kruengkrai, Virach Sornlertlamvanich, Hitoshi Isahara
  Getting Deeper Semantics than Berkeley FrameNet with MSFA
Kow Kuroda, Masao Utiyama, Hitoshi Isahara
  Integrating Language Knowledge Resources to Extend the English Inclusion Classifier to a New Language
Beatrice Alex
  Building a Large-Scale Repository of Textual Entailment Rules
Milen Kouylekov, Bernardo Magnini
  Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
Stefan Schaden
  Evaluating Morphosyntactic Tagging of Croatian Texts
Željko Agic, Marko Tadic
  Querying Both Parallel And Treebank Corpora: Evaluation Of A Corpus Query System
Ulrik Petersen
  An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Julie Medero, Kazuaki Maeda, Stephanie Strassel, Christopher Walker
  Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
Anna Sinopalnikova and Pavel Smrž
  Automatic Evaluation and Compositon of NLP Pipelines with Web Services
Harry Halpin
  Romanian Valence Dictionary in XML Format
Ana-Maria Barbu, Emil Ionescu, Verginica Barbu Mititelu
P27-E Evaluation methodologies & Evaluation of Language Technologies
Chairperson:Andrei Popescu-Belis
  Rote: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics
Agnes Lisowska and Nancy L. Underwood
  Evaluating Symbiotic Systems: the challenge
Margaret King and Nancy Underwood
  The Evolution of an Evaluation Framework for a Text Mining System
Nancy L. Underwood, Agnes Lisowska
  A task-oriented framework for evaluating theme detection systems: A discussion paper
Fidelia Ibekwe-SanJuan
  Tools and methods for objective or contextual evaluation of topic segmentation
Laurianne Sitbon, Patrice Bellot
  Evaluation of Stop Word Lists in Chinese Language
Feng Zou, Fu Lee Wang, Xiaotie Deng, Song Han
  A Dependency-based Algorithm for Grammar Conversion
Alessandro Bahgat Shehata and Fabio Massimo Zanzotto
  A Deep Linguistic Analysis for Cross-language Information Retrieval
Nasredine Semmar, Meriama Laib, Christian Fluhr
  Linguistic Suite for Polish Cadastral System
Witold Abramowicz, Agata Filipowska, Jakub Piskorski, Krzysztof Wecel, Karol Wieloch
  Sentiments on a Grid: Analysis of Streaming News and Views
Khurshid Ahmad, Lee Gillam, David Cheng
  Query Expansion on Compounds
Bolette Sandford Pedersen
  Predicting MT Quality as a Function of the Source Language
David M. Rojas, Takako Aikawa
P28-M Multimodal: Methodologies & Natural Interaction
Chairperson:Laurence Devillers
  Extending the Wizard of Oz Methodology for Language-enabled Multimodal Systems
Martin Rajman, Marita Ailomaa, Agnes Lisowska, Miroslav Melichar, Susan Armstrong
  Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?
Stephan Raidt, Gérard Bailly & Frederic Elisei
  Tangible Objects for the Acquisition of Multimodal Interaction Patterns
Ronnie Taib, Natalie Ruiz
P29-S S2S Translation
Chairperson:Nelleke Oostdijk
  Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems
Darinka Verdonik, Matej Rojc
  GAIA: Common Framework for the Development of Speech Translation Technologies
Javier Pérez and Antonio Bonafonte
  Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus
Hitomi Tohyama, Shigeki Matsubara
  TC-STAR: New language resources for ASR and SLT purposes
Henk van den Heuvel, Khalid Choukri, Christian Gollan, Asuncion Moreno, Djamel Mostefa
P30-S Speech: TTS & ASR
Chairperson:Florian Schiel
  Tools and resources for speech synthesis arising from a Welsh TTS project
Briony Williams, Rhys James Jones and Ivan Uemlianin
  Developing Speech Synthesis for Under-Resourced Languages by “Faking it”: An Experiment with Somali
Harold Somers, Gareth Evans and Zeinab Mohamed
  Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations
Athanassia Lida Dimou & Aimilios Chalamandaris
  Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
Hiromichi Kawanami, Takahiro Kitamura and Kiyohiro Shikano
  Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems
M. Bahrani, H. Sameti, N. Hafezi and H. Movasagh
  Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
Jim Talley
  Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
Pavel Ircing, Jan Hoidekr, Josef Psutka

Day 1 | Day 2 | Day 3

Copyright © 2006 ELRA