LREC 2006 Conference
List of accepted papers





 Nina Grønnum

 DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus


 Andi Wu, Kirk Lowery

 A Hebrew Tree Bank Based on Cantillation Marks


 Stéphane CHAUDIRON and Joseph MARIANI

 Technolangue: The French National Initiative for Human Language Technologies (HLT)


 Manny Rayner, Pierrette Bouillon, Beth Ann Hockey and Nikos Chatzichrisafis

 REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications


 Anders Berglund, Richard Johansson, Pierre Nugues

 Extraction of Temporal Information from Texts in Swedish


 Hlavacova Jaroslava

 New Approach to Frequency Dictionaries - Czech Example


 Flora Ramírez Bustamante, Alfredo Arnaiz, Mar Ginés

 A Spell Checker for a World Language


 Elliott Macklovitch

 TransType2: The last word


 Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Iker Luengo

 Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque


 Dimitrios Kokkinakis

 Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience


 Noguchi Masaki, Ichikawa Hiroshi, Hashimoto Taiichi and Tokunaga Takenobu

 A new approach to syntactic annotation


 Tien-Ping Tan, Laurent Besacier

 A French Non-Native Corpus for Automatic Speech Recognition


 Andrea Sansò

 Documenting variation across Europe and the Mediterranean: The Pavia Typological Database


 H. Movasagh, H. Sameti, N. Hafezi, M. Bahrani

 Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems


 Shisanu Tongchim, Prapass Srichaivattana, Virach Sornlertlamvanich, Hitoshi Isahara

 Blind Evaluation for Thai Search Engines


 Yuji Matsumoto, Masayuki Asahara, Kiyota Hashimoto, Yukio Tono, Akira Ohtani, and Toshio Morita

 An Annotated Corpus Management Tool: ChaKi


 Christophe Jouis

 Hierarchical relation "is-a": distinguishing belonging, inclusion and part/of relationships.


 Ivan Berlocher, Hyun-gue Huh, Eric Laporte, Jee-sun Nam

 Morphological annotation of Korean with Directly Maintainable Resources


 Patrick Saint-Dizier

 PrepNet: a multilingual lexical description of prepositions


 Marie-Claude L’Homme and Hee Sook Bae

 A Methodology for Developing Multilingual Resources for Terminology


 Stephan Raidt, Gérard Bailly, Frédéric Elisei

 Does a virtual talking face generate proper multimodal cues to draw the user’s attention to points of interest?


 May Lai-Yin Wong

 Skeleton parsing in written Chinese: annotation scheme and guidelines


 Constantin Orasan and Laura Hasler

 Computer-aided summarisation — what the user really wants


 Nianwen Xue

 Predicate-Argument structure annotation of nominalized verbs in Chinese


 Kyo Kageura and Gen’ichiro Kikui

 A self-referring quantitative evaluation of the ATR basic travel expression corpus BTEC


 Hiromi Itoh Ozaku, Akinori Abe, Kaoru Sagara, Noriaki Kuwahara, Kiyoshi Kogure

 Features of terms in actual nursing activities


 Nuno Seco, Diana Santos, Nuno Cardoso and Rui Vilela

 HAREM: An Advanced NER Evaluation Contest for Portuguese


 Rafael Banchs, Antonio Bonafonte, Javier Pérez

 Acceptance Testing of a Spoken Language Translation System


 Valentin Tablan, Wim Peters, Diana Maynard

 Creating Tools for Morphological Analysis of Sumerian


 Ingunn Amdal and Torbjørn Svendsen

 FonDat1: A Speech Synthesis Corpus for Norwegian


 Alon Itai, Shuly Wintner and Shlomo Yona

 A Computational Lexicon of Contemporary Hebrew


 Ruska Ivanovska-Naskova

 Development of the first LRs for Macedonian: current projects


 Reinhard Rapp, Carlos Martin Vide

 Example-based Machine Translation Using a Dictionary of Word Pairs


 Widad MUSTAFA EL HADI, Ismail TIMIMI, Marianne DABBADIE, Khalid CHOUKRI, Yun-Chuang CHIAO, Olivier HAMON

 Terminological Resources Acquisition Tools: Towards a User-Oriented Evaluation Model


 Núria Bel, Sergio Espeja, Montserrat Marimon

 New tools for the encoding of lexical data extracted from corpus


 Daniela Braga, Luis Coelho, Joao Paulo Teixeira, Diamantino Freitas

 Progmatica: A Prosodic Database for European Portuguese


 Enrique Amigó, Jesús Giménez

 IQMT: A Framework for Machine Translation Evaluation


 Tommaso Caselli & Irina Prodanof

 Annotating bridging anaphors in Italian: in search of reliability.


 Henk van den Heuvel, Khalid Choukri, Christian Gollan, Asuncion Moreno, Djamel Mostefa

 TC-STAR: New language resources for ASR and SST purposes


 Yunqing Xia, Kam-Fai Wong and Wenjie Li

 Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach


 Baden Hughes

 Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community


 Yasunori OHISHI, Katunobu ITOU, Kazuya TAKEDA, and Atsushi FUJII

 Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus


 Catherine Havasi, James Pustejovsky, and Marc Verhagen

 BULB: A Unified Lexical Browser


 Ulrich Schäfer, Daniel Beck

 Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components


 Elena Grishina

 Spoken Russian in Russian National Corpus (RNC)


 Paul Buitelaar, Philipp Cimiano, Stefania Racioppa, Melanie Siegel

 Ontology-based Information Extraction with SOBA


 Dominique Dutoit

 Alexandria: a web 2.0 interface displaying linguistic data for the web semantic


 Jan Frederik Maas, Britta Wrede

 BITT: A corpus for Topic Tracking evaluation on multimodal Human-Robot-Interaction


 Hercules Dalianis, Bart Jongejan

 Hand-crafted versus machine-learned inflectional rules: The Euroling-SiteSeeker stemmer and CST’s lemmatiser


 Christian Rohrer & Martin Forst

 Improving coverage and parsing quality of a large-scale LFG for German


 Bettina Schrader

 Non-probabilistic alignment of rare German and English nominal expressions


 Paula Chesley, Susanne Salmon-Alt

 Automatic extraction of subcategorization frames for French


 Yuki Irie, Shigeki Matsubara, Nobuo Kawaguchi, Yukiko Yamaguchi, Yasuyoshi Inagaki

 Layered Speech-Act Annotation for Spoken Dialogue Corpus


 Khurshid Ahmad, Craig Bennett and Tim Oliver

 Visual Surveillance Annotation and Description


 Mathieu MANGEOT and Antoine CHALVIN

 Dictionary Building with the Papillon Platform: the GDEF case


 Tomohiro OHNO, Shigeki MATSUBARA, Hideki KASHIOKA, Naoto KATO, Yasuyoshi INAGAKI

 A Syntactically Annotated Corpus of Japanese Spoken Monologue


 Špela Vintar, Gaël Dias, Tomaž Erjavec, Darja Fišer, Jerneja Žganec Gros

 Bilingual lexicon enrichment for a speech-to-speech translation system


 Jerneja Žganec Gros, Varja Cvetko-Orešnik, Primož Jakopin, Aleš Mihelic

 SI-PRON: A Reference Pronunciation Lexicon for Slovenian


 Silvie Cinková

 From PropBank to EngValLex: Adapting PropBank-Lexicon to the Valency Theory of Functional Generative Description



 Training language models without appropriate language resources : experiments with an augmentative and alternative communication system for disabled people


 Brian Roark, Mary Harper, Eugene Charniak, Bonnie Dorr, Mark Johnson, Jeremy Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna Krasnyanskaya, Matt Lease, Izhak Shafran, Matt Snover, Robin Stewart, Lisa Yung

 SParseval: Evaluation Metrics for Parsing Speech


 Ming-Shun Lin and Hsin-Hsi Chen

 Constructing a Named Entity Ontology from Web Corpora


 Flora Ramírez Bustamante, Enrique López Díaz

 Spelling Error Patterns in Spanish for Word Processing Applications


 Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano

 Non-audible murmur (NAM) recognition in clean and noisy environments using non-linear and human perception-based features


 Christer Johansson, Anders Nøklestad, Øystein Reigem

 Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of co-reference chains.


 Hitomi Tohyama, Shigeki Matsubara

 Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus


 Shuichi ITAHASHI, Chiu-yu TSENG, Satoshi NAKAMURA

 Oriental COCOSDA: Past, Present and Future


 Angelika Storrer, Sandra Wellinghoff

 Automated detection and annotation of term definitions in German text corpora


 M. Yaseen, M. Atteya, K. Choukri, B. Haddad, B. Maegaard, N. Paulsson, S. Krauwer, C. Mokbel, H. Fersøe, A. Ali, C. Bendahman

 Building Annotated Written Arabic Corpus Resources in NEMLAR Project


 Sašo Džeroski, Tomaž Erjavec, Nina Ledinek, Petr Pajas, Zdeněk Žabokrtský, Andreja Žele

 Towards a Slovene Dependency Treebank


 Canasai Kruengkrai, Virach Sornlertlamvanich, Hitoshi Isahara

 A Conditional Random Field Framework for Thai Morphological Analysis


 Yoshihide Kato, Shigeki Matsubara and Yasuyoshi Inagaki

 A Corpus Search System Utilizing Lexical Dependency Structure


 Peter Berck, Albert Russel

 ANNEX - a web-based framework for exploiting annotated media resources


 Tomaž Erjavec

 The English-Slovene ACQUIS corpus


 Daan Broeder, Andreas Claus, Freddy Offenga, Romuald Skiba, Paul Trilsbeek, Peter Wittenburg

 LAMUS - the Language Archive Management and Upload System


 Daan Broeder, Freddy Offenga, Peter Wittenburg, Peter van der Kamp, David Nathan, Sven Strömqvist

 Technologies for a Federation of Language Resource Archives


 Marc Kemps-Snijders, Julien Ducret, Laurent Romary, Peter Wittenburg

 An API for Accessing the ISO Data Category Registry


 Peter Wittenburg, Daan Broeder, Wolfgang Klein, Stephen Levinson, Laurent Romary

 Foundations of Modern Language Resource Archives


 Freddy Offenga, Daan Broeder, Peter Wittenburg, Julien Ducret, Laurent Romary

 The Metadata Profile in the ISO Data Category Registry


 Marc Kemps-Snijders, Peter Wittenburg

 LEXUS - a web-based tool for manipulating lexical resources


 Tomaž Erjavec, Darja Fišer

 Building Slovene Wordnet


 Luciana Bordoni, Tiziana Mazzoli

 Towards an ontology for art and colours


 Richard Johansson and Pierre Nugues

 Construction of a FrameNet Labeler for Swedish Text


 Han Sloetjes, Albert Russel, Alex Klassmann, Peter Wittenburg, Hennie Brugman

 ELAN V2.4: a professional framework for multimodality research


 Peter Berck, Hans-Jörg Bibiko, Marc Kemps-Snijders, Albert Russel, Peter Wittenburg

 Ontology-based Language Archive Utilization


 Joakim Nivre, Johan Hall and Jens Nilsson

 MaltParser: A Data-Driven Parser-Generator for Dependency Parsing


 Andrej Zgank, Darinka Verdonik, Aleksandra Zogling Markus, Zdravko Kacic

 SINOD - Slovenian non-native speech database


 Mark van Assem, Aldo Gangemi, and Guus Schreiber

 Conversion of WordNet to a standard RDF/OWL representation


 Antal van den Bosch, Ineke Schuurman, Vincent Vandeghinste

 Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development


 Mark Przybocki, Gregory Sanders, Audrey Le

 Edit-Distance - A Metric for Machine Translation Evaluation


 Niels Ole Bernsen, Laila Dybkjær and Svend Kiilerich

 H. C. Andersen Conversation Corpus


 Magnus Sahlgren

 Towards a pertinent evaluation methodology for word-space models


 Andrei Popescu-Belis, Paula Estrella, Margaret King, Nancy Underwood

 A model for context-based evaluation of language processing systems and its application to machine translation evaluation


 Martin Forst & Ronald M. Kaplan

 The importance of precise tokenizing for deep grammars


 Annalisa Sandrelli & Claudio Bendazzoli

 Tagging a corpus of interpreted speeches: the European Parliament Interpreting Corpus (EPIC)


 Hisami Suzuki, Gary Kacmarcik

 RefRef: a Tool for Viewing and Exploring Coreference Space


 Bernt Andrassy and Harald Hoege

 Evaluation of in-car ASR using an Articulation Index based WER(SNR)-measure


 R. Manurung, D. O’Mara, H. Pain, G. Ritchie, A. Waller

 Building a Lexical Database for an Interactive Joke-Generator


 Bruno Cartoni

 Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.


 Serge Sharoff

 A uniform interface to large-scale linguistic resources


 Carlo Strapparava, Alessandro Valitutti, Oliviero Stock

 The Affective Weight of Lexicon


 Agnes Lisowska, Nancy Underwood

 ROTE: a tool to support users in defining the relative importance of quality characteristics


 Serge Sharoff, Bogdan Babych, Anthony Hartley

 Using collocations from comparable corpora to find translation equivalents


 Folkert de Vriend, Jos Swanenberg, Joep Kruijsen, Henk van den Heuvel, Lou Boves & Roeland van Hout

 A Unified Structure for Dutch Dialect Dictionary Data


 E. Uí Dhonnchadha & J. Van Genabith

 A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation


 Véronique Moriceau

 Language Challenges for Data Fusion in Question-Answering


 Luís Sarmento, Luís Cabral

 BACO - A large database of text and co-occurrences


 Ulrich Schäfer

 OntoNERdIE — Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources


 Jonathan Fiscus, Jerome Ajot, Nicolas Radde, and Christophe Laprun

 Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech


 J. Atserias, B. Casas, E. Comelles, M. Gonzàlez, L. Padró, M. Padró

 FreeLing 1.3: Syntactic and semantic services in an open-source NLP library


 Jiri Semecky

 On Automatic Assignment of Verb Valency Frames in Czech


 Ben Medlock

 An Introduction to NLP-based Textual Anonymisation


 Hennie Brugman, Luit Gazendam, Véronique Malaisé

 A web based general thesaurus browser to support indexing of television and radio programs


 Judit Feliu, Jorge Vivaldi, M. Teresa Cabré

 SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations


 Irene Cramer, Jochen L. Leidner, Dietrich Klakow

 Building An Evaluation Corpus for German Question Answering by Harvesting Wikipedia


 Roald Eiselen

 Suggestions on the evaluation of suggestion adequacy for spelling checkers


 Gerhard Fliedner

 Towards Natural Interactive Question Answering


 B. Waldron, A. Copestake, U. Schaefer, B. Kiefer

 Preprocessing and Tokenisation Standards in DELPH-IN Tools


 Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, Victor Sizov

 A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects


 Martin Hassel, Jonas Sjöbergh

 Towards Holistic Summarization - Selecting Summaries, Not Sentences


 Stefan Schaden, Ute Jekosch

 Casselberveetovallarga and other unpronounceable places: The CrossTowns Corpus released


 Dimitrios Kokkinakis, Dana Dannélls

 Recognizing Acronyms and their Definitions in Swedish Medical Texts


 Lun-Wei Ku, Yu-Ting Liang and Hsin-Hsi Chen

 Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks


 Joakim Nivre, Jens Nilsson and Johan Hall

 Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation


 Oana Postolache, Dan Cristea and Constantin Orasan

 Transferring coreference chains through word alignment.


 G. Tummarello, C. Morbidoni, F. Kepler, F. Piazza, P. Puliti

 Semantic Web based Textual Markup: a preliminary evaluation


 Anna Bogacka

 A task-specific corpus resource for Polish adult learners of English


 Olena Medelyan, Stefan Schulz, Jan Paetzold, Michael Poprat, Kornél Marcó

 Language Specific and Topic Focused Web Crawling


 Martin Reynaert

 Corpus-Induced Corpus Clean-up


 Andrei Popescu-Belis and Maria Georgescul

 TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface


 Feng Pan, Rutu Mulkar, and Jerry R. Hobbs

 An Annotated Corpus of Typical Durations of Events


 Agam Patel and Dragomir Radev

 Lexical similarity can distinguish between automatic and manual translations


 Nick Campbell, Toshiyuki Sadanobu, Damien Douxchamps, Masataka Imura, Suzuki Noriko and Mayumi Bono

 A multimedia database of meetings and informal interactions for tracking participant involvement and discourse flow


 Fu Lee Wang, Xiaotie Deng, Feng Zou

 Towards Unified Chinese Segmentation Algorithm


 Doaa Samy, Antonio Moreno Sandoval, José M. Guirao, Enrique Alfonseca

 Building and Assessment of a Parallel Multilingual Corpus (Arabic-Spanish-English)


 Doaa Samy and Antonio Moreno Sandoval

 XML-Based Arabic Part of Speech Tagger


 Donna K. Byron and Eric Fosler-Lussier

 The OSU Quake 2004 corpus of two-party situated problem-solving dialogs


 Md. Aminul Islam and Diana Inkpen

 Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words


 Yoji Kiyota, Hiroshi Nakagawa

 A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames


 Ronnie Taib, Natalie Ruiz

 Tangible Objects for the Acquisition of Multimodal Interaction Patterns


 Václav Novák and Jan Hajič

 Perspectives of Turning Prague Dependency Treebank into a Knowledge Base


 Jonas Granfeldt, Pierre Nugues, Emil Persson, Jonas Thulin, Malin Ågren, Suzanne Schlyter

 CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling


 Hui Tan, Claire Cardie

 Annotating a multi-entity cross-document noun phrase coreference corpus: issues and preliminary results


 Qian Yang, Jean-Pierre Martens, Nanneke Konings, Henk van den Heuvel

 Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names


 Jaeyoung Jung, Maki Miyake, Hiroyuki Akama

 Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network


 Catia Cucchiarini, Hugo Van hamme, Felix Smits

 JASMIN-CGN: Extension of the Spoken Dutch Corpus with speech of elderly people, children and non-natives in the human-machine interaction modality.


 Cédrick Fairon, Sébastien Paumier

 A framework for real time dictionary updating


 Vicente Alabau and Carlos D. Martínez

 Bilingual speech corpus in two phonetically similar languages


 Vincent Vandeghinste, Ineke Schuurman, Michael Carl, Stella Markantonatou, Toni Badia

 METIS-II: Machine Translation for Low Resource Languages


 Elisabeth D’Halleweyn, Jan Odijk, Lisanne Teunissen, Catia Cucchiarini

 The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources


 Nuria GALA and André Valli

 A computational lexicon of French verbal subcategorization frames


 Emiko Suzuki and Kyoko Kakihana

 On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language


 Daniela Goecke, Andreas Witt

 Exploiting logical document structure and text type structure for anaphora resolution


 Catalina Hallett, Donia Scott and Richard Power

 Textual rendering of complex post-coordinated clinical concepts from a medical ontology


 Cédrick Fairon, Sébastien Paumier

 A translated corpus of 30,000 French SMS


 Feng Zou, Fu Lee Wang, Xiaotie Deng, Song Han

 Evaluation of Stop Word List in Chinese Language


 Anna Sinopalnikova and Pavel Smrz

 Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements


 Florian Schiel, Moritz Kaiser, Hannes Mögele

 German SmartWeb UMTS Speech Data Collection


 Žabokrtský, Z., Lopatková, M., Skwarska, K.

 Valency Lexicon of Czech Verbs: Alternation-Based Model


 Darinka Verdonik, Matej Rojc

 Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems


 Florian Schiel, Hannes Mögele, Moritz Kaiser

 Bikers Accessing the Web: The SmartWeb Motorbike Corpus


 Asanee Kawtrakul

 Ontology Driven Knowledge Portal Construction


 Dan Cristea, Corina Forascu, Ionut Cristian Pistol

 Temporality in relation with discourse structure


 Eva Hajičová, Petr Sgall

 Corpus Annotation As a Test of a Linguistic Theory


 Ondrej Bojar and Magdalena Prokopova

 Czech-English Word Alignment


 Sergio Scalise, Antonietta Bisetto, Emiliano Guevara, Chiara Melloni

 MORBO/COMP: a multilingual database of compound words


 Daniel Sonntag and Massimo Romanelli

 A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications


 Nicole Grégoire

 A Standard Lexical Representation of Multiword Expressions


 Anders Green, Helge Hüttenrauch, Elin Anna Topp, Kerstin Severinson Eklundh

 Developing a Contextualized Multimodal Corpus for Human-Robot Interaction


 Wei-Yun Ma, Chu-Ren Huang

 Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus


 Ana-Maria Barbu, Emil Ionescu, Verginica Barbu Mititelu

 Romanian Valence Dictionary in XML Format


 Niels Ole Bernsen, Thomas K. Hansen, Svend Kiilerich and Torben Kruchov Madsen

 Field Evaluation of a Single-Word Pronunciation Training System


 Wei Li, Wenjie Li, Qin Lu

 Mining the Implicit Entities in the Queries


 Kiyotaka Uchimoto, Ryoji Hamabe, Takehiko Maruyama, Katsuya Takanashi, Tatsuya Kawahara, and Hitoshi Isahara

 Dependency-structure Annotation to Corpus of Spontaneous Japanese


 N. Areta, A. Gurrutxaga, I. Leturia, Z. Polin, R. Saiz (Elhuyar), I. Alegria, X. Artola, A. Diaz de Ilarraza, N. Ezeiza, A. Sologaistoa, A. Soroa, A. Valverde (IXA)



 Joseph Polifroni and Marilyn Walker

 Learning Database Content for Spoken Dialogue System Design


 Jorge Civera and Alfons Juan

 Bilingual Machine-Aided Indexing


 Panunzi Alessandro, Fabbri Marco, Moneglia Massimo



 Luís Fernando Costa

 Component evaluation in a question answering system


 Stefan Breuer, Sven Bergmann, Ralf Dragon, Sebastian Möller

 Set-up of a Unit-Selection Synthesis with a Prominent Voice


 Nasredine Semmar, Meriama Laib and Christian Fluhr

 Using a Deep Linguistic Analysis in Cross-language Information Retrieval


 Susana Inácio and Diana Santos

 Annotating COMPARA, a grammar-aware parallel corpus


 Lina Henriksen, Claus Povlsen, Andrejs Vasiljevs

 EuroTermBank - a terminology resource based on best practice


 Florbela Barreto, António Branco, Eduardo Ferreira, Amália Mendes, Maria Fernanda Bacelar do Nascimento, Filipe Nunes, João Ricardo Silva

 A Large High Quality Linguistically Interpreted Open Corpus of Portuguese: the TagShare Corpus


 Luís Sarmento, Belinda Maia, Diana Santos, Luís Cabral, Ana Sofia Pinto

 Corpógrafo V3 - From simple word concordance to semi-automatic knowledge engineering


 Liviu P. Dinu

 On the data base of Romanian syllables


 Alessandro Bahgat, Fabio Massimo Zanzotto

 A Dependency-based Algorithm for Grammar Conversion


 Khurshid Ahmad, Maria Teresa Musacchio and Giuseppe Palumbo

 Ontological Terminological Commitments and the Discourse of Specialist Communities


 Alessandro Oltramari

 FrameNet as a conceptual window to WordNet


 Chloe Clavel, Ioana Vasilescu, Laurence Devillers, Thibaut Ehrette, Gaël Richard

 The SAFE Corpus: fear-type emotions detection for surveillance applications


 Arne Mauser, Evgeny Matusov, Hermann Ney

 Training a Statistical Machine Translation System without GIZA++


 Evie Coussé and Steven Gillis

 Regional Bias in the Broad Phonetic Transcription of the Spoken Dutch Corpus


 Atsushi Fujii, Makoto Iwayama, Noriko Kando

 Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop


 Željko Agić, Marko Tadić

 Evaluating Morphosyntactic Tagging of Croatian Texts


 Martin Rajman, Marita Ailomaa, Agnes Lisowska, Mirek Melichar, Susan Armstrong

 Extending the Wizard of Oz Technique for Multimodal Language-enabled Systems


 Gertjan van Noord, Ineke Schuurman, Vincent Vandeghinste

 Syntactic Annotation of Large Corpora in STEVIN


 Davide Buscaldi, Paolo Rosso

 Mining Knowledge from Wikipedia for the Question Answering Task


 Sabine Schulte im Walde

 Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet


 Peter W. Wagacha, Guy De Pauw and Pauline W. Githinji

 Automatic Diacritic Placement using Memory-Based Learning: a Gikuyu Language Case Study


 Juan José Rodríguez Soler, Pedro Concejero Cerezo, Daniel Tapias Merino, Alberto José Sánchez

 MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España


 Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado, Manfred Pinkal

 The SALSA corpus: a German corpus resource for lexical semantics


 Ralf Steinberger, Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaž Erjavec, Dan Tufiş, Dániel Varga

 The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages


 Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado

 SALTO — A Versatile Multi-Level Annotation Tool


 Veronique Hoste and Guy De Pauw

 KNACK-2002: a Richly Annotated Corpus of Dutch Written Text


 Philipp Cimiano, Matthias Hartung and Esther Ratsch

 Finding the appropriate generalization level for binary ontological relations extracted from the Genia corpus


 Tomoyuki Kato, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano

 Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria


 Margot Mieskes, Michael Strube

 Part-of-Speech Tagging of Transcribed Speech


 Branimir Boguraev and Rie Kubota Ando

 Analysis of TimeBank as a resource for TimeML Parsing


 Daisuke Kawahara and Sadao Kurohashi

 Case Frame Compilation from the Web using High-Performance Computing


 Patrick Paroubek, Anne Vilnat, Isabelle Robba, Christelle Ayache, Louis-Gabriel Pouillot

 EASY: Evaluation Campaign of Syntactic Parsers of French


 Toshiyuki Kanamaru, Masaki Murata, Hitoshi Isahara

 Creating a Japanese Adverb Dictionary That Includes Information About Speaker’s Communicative Intention Using Machine Learning


 Gregory Grefenstette, Fathi Debili, Christian Fluhr, Svitlana Zinger

 Exploiting text for extracting image processing resources


 Naoaki Okazaki and Sophia Ananiadou

 Clustering acronyms in biomedical text for disambiguation


 Goran Nenadic, Naoki Okazaki, Sophia Ananiadou

 Towards a terminological resource for biomedical text mining


 Hans Hjelm

 Extraction of Cross Language Term Correspondences


 David Guthrie, Ben Allison, Wei Liu, Louise Guthrie, Yorick Wilks

 A Closer Look at Skip-gram Modelling


 Dobrov B., Loukachevitch N.

 Development of Linguistic Ontology on Natural Sciences and Technology


 Matthew W. Bilotti and Eric Nyberg

 Evaluation for Scenario Question Answering Systems


 Loukachevitch N., Dobrov B.

 Information Significance as a Factor of Inclusion of Multiword Expressions to Computational Resources


 Dirk Bühler, Wolfgang Minker



 Stephan Oepen and Jan Tore Lønning

 Discriminant-Based MRS Banking


 György Szarvas, Richárd Farkas, László Felföldi, András Kocsor

 Highly accurate Named Entity corpus for Hungarian


 Thierry Declerck and Mihaela Vela

 A Generic NLP Tool for Supporting Shallow Ontology Building


 Julia Ritz, Ulrich Heid

 Extraction tools for collocations and their morphosyntactic specificities


 Luke Nezda, Andrew Hickl, John Lehmann, and Sarmad Fayyaz

 What in the world is a Shahab? Wide Coverage Named Entity Recognition for Arabic


 Massimo Poesio, Philippe Goux, and Mijail A. Kabadjov



 Manfred Sailer and Beata Trawinski

 The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research


 Danie J. Prinsloo, Elsabé Taljard, Ulrich Heid

 Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho


 Maria Clara Paixão de Sousa and Thorsten Trippel

 Building a historical corpus for Classical Portuguese: some technological aspects


 Maria Teresa Pazienza, Marco Pennacchiotti, Fabio Massimo Zanzotto

 Mixing WordNet, VerbNet and PropBank for studying verb relations


 Leo Wanner and Margarita Alonso Ramos

 Local Document Relevance Clustering in IR Using Collocation Information


 Andrea Esuli and Fabrizio Sebastiani

 SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining


 Alexandre Denis, Matthieu Quignard, Guillaume Pitel

 A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation


 Felix Sasaki

 Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources


 Margaret King and Nancy Underwood

 Evaluating Symbiotic Systems: the challenge


 A. Chalamandaris, A. Protopapas, P. Tsiakoulis, S. Raptis

 All Greek to me! An automatic Greeklish to Greek transliteration system


 Thurid Vogt, Elisabeth Andre

 Improving Automatic Emotion Recognition from Speech via Gender Differentiation


 Khurshid Ahmad, Lee Gillam and David Cheng

 Sentiments on a Grid: Analysis of Streaming News and Views


 Briony Williams and Ivan Uemlianin

 A procedure for facilitating the development of manual letter-to-sound rules for the Festival speech synthesis framework


 Thierry Declerck, Ovidiu Vela, Asunción Gómez Pérez, Zeno Gantner, and David Manzano-Macho

 Multilingual Lexical Resources for Ontology translation


 Beatrice Alex, Malvina Nissim and Claire Grover

 The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text


 Ben Wellner and Marc Vilain

 Leveraging Machine Readable Dictionaries in Discriminative Sequence Models


 Sasa Hasan, Anas El Isbihani, Hermann Ney

 Creating a Large-Scale Arabic to French Statistical Machine Translation System


 Chen Yirong, Lu Qin, Li Wenjie, Sui Zhifang

 A Study on Terminology Extraction Based on Classified Corpora


 Anna Estelles Palanca, María Amparo Alcina Caudet, Victoria Soler Puertes

 Retrieving terminological data from the TxtCeram tagged domain corpus: first step on a terminological ontology.


 Violetta Cavalli-Sforza, Abdelhadi Soudi

 IMORPHE: An Inheritance and Equivalence Based Morphology Description Compiler


 Laurianne Sitbon and Patrice Bellot

 Tools and methods for topic segmentation of texts and contextual evaluation


 L. Devillers, R. Cowie, J-C. Martin, E. Douglas-Cowie

 Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches


 Maja Popovic, Hermann Ney

 POS-based Word Reorderings for Statistical Machine Translation


 David Vilar, Jia Xu, Hermann Ney, Luis Fernando D’Haro

 Error Analysis of Statistical Machine Translation Output


 Castellón, Irene; Fernández-Montraveta, Ana; Vázquez, Gloria; Alonso, Laura; Capilla, Joan Antonio

 The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level


 Javier Pérez, Antonio Bonafonte

 GAIA: A Common Framework for the Development of Speech Translation Technologies


 Attila Novák

 Morphological Tools for Six Small Uralic Languages


 Javier Pérez, Antonio Bonafonte, Horst-Udo Hain, Eric Keller, Stefan Breuer and Jilei Tian

 ECESS Inter-Module Interface Specification for Speech Synthesis


 Petr Nemec

 Tree Searching/Rewriting Interpreter Tool


 Maite Taboada, Caroline Anthony and Kimberly Voll

 Methods for creating semantic orientation dictionaries


 Mahmoud Shokrollahi-Far, Antal van den Bosch, Harry Bunt, Walter Daelemans, Ayaz Isazadeh

 Bootstrapping POS Tags on the Vocalism of Classical Arabic…


 Hans C. Boas

 CoLangDoc: A web-based tool for collaborative research on endangered languages


 Masaki Itagaki, Anthony Aue, and Takako Aikawa

 Detecting Inter-domain Semantic Shift using Syntactic Similarity


 Pablo D. Agüero, Jordi Adell, Mónica Caballero, Antonio Bonafonte and Asunción Moreno

 Database pruning and unit recovering for unit selection speech synthesis



 Methodology of Lombard Speech Database Acquisition: Experiences with CLSD


 Harry Bunt

 Design Principles for Multidimensional Dialogue Act Annotation Schemes


 Gabriel Murray and Ben Hachey

 Improved Automatic Evaluation of Summaries Using a Large Semantic Space


 Olivier Baude, Michel Jacobson, Atanas Tchobanov & Richard Walter

 Interoperability of the sound corpora: the case of the French corpora


 Pavel Pecina, Pavel Schlesinger

 Evaluation of Classification Methods for Collocation Extraction


 R. Bernardi, D. Calvanese, L. Dini, V. Di Tommaso, E. Frasnelli, B. Plank

 Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano


 Irene Langkilde-Geary and Justin Betteridge

 A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation


 Jim Talley

 Bootstrapping new language ASR capabilities: Achieving best letter-to-sound performance under resource constraints


 Eduard Hovy, Chin-Yew Lin, Liang Zhou, and Junichi Fukumoto

 Automated Summarization Evaluation with Basic Elements


 Hiroyuki Kaji and Mariko Watanabe

 Automatic Construction of Japanese WordNet


 Marie-Catherine de Marneffe, Bill MacCartney, Christopher D. Manning

 Generating Typed Dependency Parses from Phrase Structure Parses


 Sónia Frota, Marina Vigário & Fernando Martins

 FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text


 Ulrik Petersen

 Querying both parallel and treebank corpora: Evaluation of a corpus query system


 Liang Zhou, Chin-Yew Lin, and Eduard Hovy

 Summarizing Answers for Complicated Questions


 Monica Monachini, Nicoletta Calzolari, Khalid Choukri, Jochen Friedrich, Michele Mammini, Jan Odijk, Marisa Ulivieri

 Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian


 Ronny Melz, Pum-Mo Ryu, Key-Sun Choi

 Compiling a large Korean IT-related language resource by language independent methods for domain-specific taxonomy learning


 Felix Pîrvan, Dan Tufiş

 Tagsets Mapping and Statistical Training Data Cleaning-up


 Dan Tufiş, Elena Irimia

 RoCo-News: A Hand Validated Journalistic Corpus of Romanian


 Eckhard Bick

 Turning a Dependency Treebank into a PSG-style Constituent Treebank


 Dan Ştefănescu, Dan Tufiş

 Aligning multilingual thesauri


 Radu Ion, Alexandru Ceauşu, Dan Tufiş

 Dependency-Based Phrase Alignment


 Alexandru Ceauşu, Dan Ştefănescu, Dan Tufiş

 Acquis Communautaire sentence alignment using Support Vector Machines


 Claire Grover and Richard Tobin

 Rule-Based Chunking and Reusability


 Baden Hughes, Timothy Baldwin, Steven Bird, Jeremy Nicholson and Andrew MacKinlay

 Reconsidering Language Identification for Written Language Resources


 Sun-Mee Bae and Key-Sun Choi

 A Comparison of Conceptual Maps Between Korean and Chinese Words in CoreNet


 Yasuko SENDA, Yasusi SINOHARA, Manabu OKUMURA

 Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing


 Lundälv, Mats, Mühlenbock, Katarina, Farre, Bengt and Brännström, Annika

 SYMBERED - a symbol-concept editing tool


 Yi Zhang, Valia Kordoni

 Automated Deep Lexical Acquisition for Robust Open Texts Processing


 Martin, J.-C., Caridakis, G., Devillers, L., Karpouzis, K., Abrilian, S.

 Manual Annotation and Image Processing of Multimodal Emotional Behaviours in TV Interviews


 Cvetana Krstev, Ranka Stanković, Duško Vitas, and Ivan Obradović

 WS4LR: A Workstation for Lexical Resources


 Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer

 Extending VerbNet with novel verb classes


 James Pustejovsky, Catherine Havasi, Roser Sauri, Patrick Hanks, Jessica Littman, Anna Rumshisky, Jose Castano, Marc Verhagen

 Towards a Generative Lexical Resource: The Brandeis Semantic Ontology


 Hans Dybkjær and Laila Dybkjær

 Act-Topic Patterns for Automatically Checking Dialogue Models


 Rojas, David M. and Takako Aikawa

 Predicting MT quality as a function of the source language


 Pawel Mazur and Robert Dale

 Named Entity Extraction with Conjunction Disambiguation


 Beatrice Alex

 Integrating Language Knowledge Resources to Extend the English Inclusion Classifier to a New Language


 Griet Depoorter, Remco van Veenendaal

 Functioning of the Centre for Dutch Language and Speech Technology (TST Centre)


 Bente Maegaard, Lene Offersgaard, Lina Henriksen, Costanza Navarretta, Claus Povlsen, Hanne Jansen, Xavier Lepetit

 The MULINCO corpus and corpus platform


 Claudia Soria, Maurizio Tesconi, Francesca Bertagna, Nicoletta Calzolari, Andrea Marchetti, Monica Monachini

 Moving to dynamic computational lexicons with LeXFlow


 Caroline Sporleder, Marieke van Erp, Tijn Porcelijn, Antal van den Bosch and Pim Arntzen

 Identifying Named Entities in Text Databases from the Natural History Domain


 Harold Somers, Gareth Evans and Zeinab Mohamed

 Developing Speech Synthesis for Under-Resourced Languages by ‘Faking it’ : An Experiment with Somali



 Using richly annotated trilingual language resources for acquiring reading skills in a foreign language


 Guido Boella, Alessandro Mazzei, Piercarlo Rossi, Marco Martin

 A Development Tool For Multilingual Ontology-based Conceptual Dictionaries


 Jens-Erik Fenstad, Lars Ahrenberg, Knut Kvale, Bente Maegaard, Katarina Mühlenbock, Bernt-Erik Heid

 KUNSTI - Knowledge Generation for Norwegian Language Technology


 Péter Halácsy, Dániel Varga, Csaba Oravecz, András Kornai, Viktor Trón

 Using a morphological analyzer in high precision POS tagging of Hungarian


 Toumouh Adil, Widdows Dominic, Lehireche Ahmed, Malki Mimoun

 Adapting WordNet to the Medical Domain using Lexicosyntactic Patterns in the Ohsumed Corpus


 Renata Savy, Francesco Cutugno, Claudia Crocco

 Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).


 Baden Hughes, Dafydd Gibbon and Thorsten Trippel

 Feature-Based Encoding and Querying Language Resources with Character Semantics


 Rajen Subba, Barbara Di Eugenio, Elena Terenzi

 Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations


 Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida, and Hitoshi Isahara

 Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence


 Isa Maks and Michel Boekestein

 Exploring opportunities for enrichment by linking lexical databases


 Horacio Saggion

 Multilingual Multidocument Summarization Tools and Evaluation


 Olivier Ferret

 Building a network of topical relations from a corpus


 P. Bouquet, L. Serafini, S. Zanobini

 The role of lexical resources in matching classification schemas


 Manolis Maragoudakis, Katia Kermanidis, Aristogiannis Garbis and Nikos Fakotakis

 Dealing with imbalanced data using Bayesian techniques


 J.M. Benedí, E. Lleida, A. Varona, M.J. Castro, I. Galiano, R. Justo, I. López and A. Miguel

 Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA


 Rainer Osswald, Hermann Helbig, Sven Hartrumpf

 The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon


 Jean Véronis, Yun-Chuang Chiao

 The ARCADE II project : Evaluation of multilingual text alignment systems


 Francesca Bertagna

 Representation and Inference for Open-Domain QA: strength and limits of two Italian Semantic Lexicons


 Abdelrahim Abdelsapor, Noha Adly, Kareem Darwish, Ossama Emam, Magdi Nagi

 Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents


 Saturnino Luz, Matt-Mouley Bouamrane, Masood Masoodian

 Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction


 A. L. Dimou, A. Chalamandaris

 Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations


 Roger Levy and Galen Andrew

 Tregex and Tsurgeon: tools for querying and manipulating tree data structures


 L. Gillard, P. Bellot, M. El-Bèze



 B. Magnini, E. Pianta, C. Girardi, M. Negri, L. Romano, M. Speranza, and R. Sprugnoli

 I-CAB: the Italian Content Annotation Bank


 Bente Maegaard, Steven Krauwer, Khalid Choukri, Lise Damsgaard Jørgensen

 The BLARK concept and BLARK for Arabic


 Gabriella Pardelli, Manuela Sassi, Sara Goggi, Paola Orsolini

 Natural Language Processing: A Terminological and Statistical Approach


 Suzan Verberne, Lou Boves, Nelleke Oostdijk

 Data for question answering: the case of ’why’


 Kiril Simov and Petya Osenova

 Shallow Semantic Annotation of Bulgarian


 Christopher Cieri, Walt Andrews, Joseph P. Campbell, George Doddington, Jack Godfrey, Shudong Huang, Mark Liberman, Alvin Martin, Hirotaka Nakasone, Mark Przybocki, Kevin Walker

 The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research


 Laurence Devillers, Jean-Claude Martin, Sarkis Abrilian

 Annotation of Emotions in Real-Life Video Interviews: Variability between Coders


 Ya-Min Chou, Chu-Ren Huang

 Hantology-A Linguistic Resource for Chinese Language Processing and Studying


 Voula Giouli, Penny Labropoulou, Maria Gavrilidou, Stelios Piperidis, Monica Monachini, Claudia Soria, Nicoletta Calzolari, Khalid Choukri

 Language resources production models: the case of INTERA multilingual corpus and terminology


 Kyoko Kanzaki, Qing Ma, Eiko Yamamoto and Hitoshi Isahara

 Semantic Analysis of Abstract Nouns to compile a thesaurus of Adjectives


 Katrin Erk and Sebastian Pado

 FRED and ROSY — A Flexible Toolbox For Semantic Role Assignment


 Valentin Tablan, Tamara Polajnar, Hamish Cunningham, Kalina Bontcheva

 User-friendly ontology authoring using a controlled language


 Laura Hasler and Constantin Orasan

 Towards cross-document event annotation


 Amália Mendes, Maria Fernanda Bacelar do Nascimento, Sandra Antunes, Luísa Pereira

 COMBINA-PT: a large corpus-extracted and hand-checked lexical database of Portuguese multiword expressions


 David Graff, Tim Buckwalter, Mohamed Maamouri, Hubert Jin

 Lexicon Development for Varieties of Spoken Dialectal Arabic


 Alexandre Patry, Fabrizio Gotti, Philippe Langlais

 MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation


 Mohamed Maamouri, Ann Bies, Tim Buckwalter, Mona Diab, Nizar Habash, Owen Rambow, Dalila Tabessi

 Developing and Using a Pilot Dialectal Arabic Treebank


 Beata Bandmann Megyesi, Eva Csato Johansson, Anna Sågvall Hein

 Building a Swedish-Turkish Parallel Corpus


 Horacio Saggion, Robert Gaizauskas

 Language Resources for Background Gathering


 Jennifer Doyon, Keith Miller, John Henderson, Linda Van Guilder, Steven Van Guilder, Kristian Concepcion, Florence Reeder

 Document Exploitation Improvement by Component Evaluation


 Julie Medero, Kazuaki Maeda, Stephanie Strassel, Christopher Walker

 An Efficient Approach to Gold Standard Annotation: Decision Points for Complex Tasks


 Stefan Klatt

 A Corpus-Based Approach to the Interpretation of Unknown Words with an Application to German


 Sophie Rosset, Sandra Petel

 The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus


 Anna Feldman, Jirka Hana, Chris Brew

 A cross-language approach to rapid creation of new morpho-syntactically annotated resources


 Konstantinos I. Diamantaras, Ionas Michailidis, Spyros Vasileiadis, Yannick Frère

 Greek Named Entities Recognition using Support Vector Machines, Maximum Entropy Models and Onetime


 Anna Korhonen, Yuval Krymolowski, Ted Briscoe

 A Large Subcategorization Lexicon for Natural Language Processing Applications


 Nancy Ide, Keith Suderman

 An Open Linguistic Infrastructure for American English


 Nancy Ide, Laurent Romary

 Representing Linguistic Corpora and Their Annotations


 Zhongqiang Huang, Lei Chen, Mary Harper

 An Open Source Prosodic Feature Extraction Tool


 Alina Andreevskaia and Sabine Bergler

 SEMANTIC Tag Extraction from WordNet Glosses


 Kow Kuroda and Hitoshi Isahara

 Getting Deeper Semantics than Berkeley FrameNet with MSFA


 Enrique Alfonseca, Antonio Moreno-Sandoval, Jose Maria Guirao and Maria Ruiz-Casado

 The wraetlic NLP suite


 Tomoko Ohta, Yuka Tateisi, Jin-Dong Kim, Akane Yakushiji, Jun-ichi Tsujii

 Linguistic and Biological Annotations of Biological Interaction Events


 Knut Hofland, Paul Meurer, Kari Tenfjord

 The ASK corpus - a language learner corpus of Norwegian as a second language


 Ivana Kruijff-Korbayova, Klara Chvatalova, Oana Postolache

 Annotation Guidelines for Czech-English Word Alignment


 Jérémie Segouat, Annelies Braffort and Emilie Martin

 Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data


 Gil Francopoulo, Monte George, Nicoletta Calzolari, Monica Monachini, Nuria Bel, Mandy Pet, Claudia Soria

 Lexical Markup Framework (LMF)


 Bruno Pouliquen, Marco Kimler, Ralf Steinberger, Camelia Ignat, Tamara Oellinger, Ken Blackler, Flavio Fluart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best

 Geocoding multilingual texts: recognition, disambiguation and visualisation


 Bolette Sandford Pedersen

 Search on Danish compounds using shallow linguistic techniques


 Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich, and Hitoshi Isahara

 Word Knowledge Acquisition for Computational Lexicon Construction


 Wim Peters, Maria Teresa Sagri, Daniela Tiscornia, Sara Castagnoli

 The LOIS Project


 Joris Vaneyghen, Dirk Van Compernolle

 A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora


 Adriana Roventini, Nilda Ruimy

 Linking Verbal Entries of Different Lexical Resources


 K Choukri, M Dabbadie, O Hamon, A Hartley, W Mustafa El Hadi, A Popescu-Belis, M Rajman, S Surcin, I Timimi

 CESTA Machine Translation Evaluation Campaign: towards a reliable, reusable protocol


 Marie-Jeanne Derouin, André Le Meur

 Lemma-oriented dictionaries, concept-oriented terminology and translation memories


 Martí Umbert, Asunción Moreno, Pablo Aguero, Antonio Bonafonte

 Spanish Synthesis Corpora


 Pavel Ircing, Jan Hoidekr, Josef Psutka

 Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech


 Milena Slavcheva

 Semantic Descriptors: The Case of Reflexive Verbs


 Guenter Neumann and Berthold Crysmann

 Exploring HPSG-based Treebanks for Probabilistic Parsing


 Rita Marinelli, Remo Bindi

 Proper Names and Linguistic Dynamics


 Sonja Bosch, Jackie Jones, Laurette Pretorius

 Towards machine-readable lexicons for South African Bantu languages


 Le Chenadec G., Maffiolo V., Chateau N. and Colletta J.M.

 Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction


 Yoshihiko Hayashi, Toru Ishida

 A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons


 Rita Marinelli, Adriana Roventini

 Using Core Ontology for Domain Lexicon Structuring


 W. Abramowicz, A. Filipowska, K. Wecel, K. Wieloch, J. Piskorski

 Linguistic Suite for Polish Cadastral System


 Hiromichi Kawanami, Takahiro Kitamura and Kiyohiro Shikano

 Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech


 Pavel Smrz, Vit Novacek

 Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator


 Bernardi R., Bolognesi A., Seidenari C., Tamburini F.

 POS tagset design for Italian


 Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita and Yuji Matsumoto

 Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences


 Onelli C., Proietti D., Seidenari C., Tamburini F.

 The DiaCORIS project: a diachronic corpus of written Italian


 Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Pociello

 A preliminary study for building the Basque PropBank


 Eiko Yamamoto, Kyoko Kanzaki, and Hitoshi Isahara

 Detection of inconsistencies in concept classifications in a large dictionary — Toward an improvement of the EDR electronic dictionary —


 Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Izagirre, Karmele Mendizabal, Eli Pociello and Mikel Quintian

 A methodology for the joint development of the Basque WordNet and Semcor


 Jan Hoidekr, J.V. Psutka, Aleš Pražák, Josef Psutka

 Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries


 Javier Cuétara-Priede, Luis Villaseñor-Pineda, Manuel Montes-y-Gómez, Manuel Alberto Pérez-Coutiño

 TranscríbEMex: An automatic phonetic transcriber for the Mexican Spanish


 Stefanie Anstein, Gerhard Kremer, Uwe Reyle

 Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology


 Mathieu Lafourcade

 Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence


 Mariona Taulé, Roser Morante, Juan Aparicio, M Antònia Martí

 3LB-LEX and 3LB SEM lexical resources for Spanish and Catalan


 Ernesto William De Luca and Andreas Nürnberger

 Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods.


 Pavel Smrz, Vit Novacek

 Automatic Acquisition of Semantics-Extraction Patterns from Parallel Texts


 Lea Cyrus

 Building a resource for studying translation shifts


 Christoph Draxler

 Speech Recordings in Public Schools in Germany — the Perfect Show Case for Web-based Recordings and Annotation


 B K Tsou, B Y Lai, K K Sin, L Y L Cheung

 Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort


 Lars Hellan & Dorothee Beermann

 Word Sense Disambiguation and Semantic Construction Analysis for Deep Processing Grammars


 Dennis Reidsma, Dirk Heylen, Roeland Ordelman

 Annotating emotion in meetings


 H. Bonneau-Maynard, C. Ayache, F. Bechet, A. Denis, A. Kuhn, F. Lefevre, D. Mostefa, M. Quignard, S. Rosset, C. Servan, J. Villaneau

 Results of the French Evalda-Media evaluation campaign for literal understanding


 Jonas Kuhn, Michael Jellinghaus

 Multilingual parallel treebanking: a lean and flexible approach


 Julie Mauclair, Yannick Estève, Simon Petit-Renaud, Paul Deléglise

 Automatic detection of well recognized words in automatic speech transcription


 Francesca Bertagna, Federico Calzolari, Nicoletta Calzolari, Sebastiana Cucurullo, Alessandro Enea, Monica Monachini, Eugenio Picchi, Manuela Sassi, Eva Sassolini, Claudia Soria

 Next Generation Language Resources using GRID


 Sanni Nimb

 LEXADV - a multilingual semantic lexicon for adverbs


 Voula Giouli, Alexis Konstadinidis, Elina Desipri, Harris Papageorgiou, Stelios Piperidis

 Multi-domain Multi-lingual Named Entity Recognition: Re-visiting & grounding the resources issue


 Rebecca Passonneau and Nizar Habash and Owen Rambow

 Inter-annotator agreement on a multilingual semantic annotation task


 Rebecca Passonneau

 Measuring Agreement on Set-Valued Items (MASI) for Semantic and Pragmatic Annotation


 Tomoyosi Akiba

 Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue


 Marc Verhagen, Robert Knippen, Inderjeet Mani, and James Pustejovsky

 Annotation of TimeML Relations with Tango


 Patrizia Paggio

 Information structure and pauses in a corpus of spoken Danish


 Quasthoff, Uwe; Biemann, Chris; Richter, Matthias

 Corpus Portal for Search in 16 Monolingual Corpora


 Tristan Vanrullen, Philippe Blache & Jean-Marie Balfourier

 Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy


 Andreas Eisele

 Parallel corpora and phrase-based statistical MT for new language pairs via multiple intermediaries


 Jae-Ho Kim, Sun-Mee Bae and Key-Sun Choi

 Automatic Generation of Medical Term Definition from Large Corpora


 Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J. Miller, Teruko Mitamura, Florence Reeder, Advaith Siddharthan

 Parallel Syntactic Annotation of Multiple Languages


 Sylvain Galliano, Edouard Geoffrois, Jean-François Bonastre, Guillaume Gravier, Djamel Mostefa, Khalid Choukri

 Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News


 Juan José Rodríguez Soler, Lucía Palacios Fernández, Pedro Concejero Cerezo, Carlos Lázaro Ávila, Daniel Tapias Merino

 Usability evaluation of 3G multimodal services in Telefónica Móviles España


 Márton Miháltz, Gábor Pohl

 Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation


 Porfírio Filipe and Nuno Mamede



 Felice Dell’Orletta, Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli

 Searching Treebanks for Functional Constraints: Crosslingual Experiments in Grammatical Relations Assignment


 Declerck Thierry

 SynAF: Towards a standard for Syntactic Annotation


 Christelle Ayache, Brigitte Grau, Anne Vilnat

 Evaluation of question-answering systems : The French EQueR-EVALDA Evaluation Campaign


 Maria Fernanda Bacelar do Nascimento, José Bettencourt Gonçalves, Luísa Pereira, Antónia Estrela, Afonso Pereira

 The African Varieties of Portuguese: Compiling Comparable Corpora and analysing data-derived lexicons


 Richard Evans and Viktor Pekar

 BiRD: A prototype system to build and exploit resource databases for researchers


 Benjamin K. Tsou and Oi Yee Kwong

 Toward a Pan-Chinese Thesaurus


 Nelleke Oostdijk, Lou Boves

 User requirements analysis for the design of a reference corpus of written Dutch


 Brigitte Grau, Anne-Laure Ligozat, Isabelle Robba, Anne Vilnat

 FRASQUES : a Question Answering system in the EQueR evaluation campaign


 Gábor Hodász

 Evaluation Methods of a Linguistically Enriched Translation Memory System


 José João Almeida, Alberto Simões

 T2O - Recycling Thesauri into a Multilingual Ontology


 Saba Amsalu

 Data-driven Amharic-English Bilingual Lexicon Acquisition


 Jörg Tiedemann

 ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts


 Wolfgang Minker, Heiko Neumann, Günther Palm, Petra-Maria Strauß, Harald Traue, Ulrich Weidenbacher



 Nancy Underwood, Agnes Lisowska

 The evolution of an evaluation framework for a text mining system


 Eline Westerhout, Paola Monachesi

 A pilot study for a Corpus of Dutch Aphasic Speech (CODAS)


 Jan Bungeroth, Daniel Stein, Philippe Dreuw, Morteza Zahedi and Hermann Ney

 A German Sign Language Corpus of the Domain Weather Report


 Roberto Bartolini, Caterina Caracciolo, Emiliano Giovannetti, Alessandro Lenci, Simone Marchi, Vito Pirrelli, Chiara Renso, Laura Spinsanti

 Creation and Use of Lexicons and Ontologies for Natural Language Interface to Databases


 MULLONI, Andrea; PEKAR, Viktor



 Timothy Baldwin and Su’ad Awab

 Open Source Corpus Analysis Tools for Malay


 Fidelia Ibekwe-SanJuan

 A task-oriented framework for evaluating theme detection systems.


 Asunción Moreno, Albert Febrer, Lluis Márquez

 Generation of Language Resources for the Development of Speech Technologies in Catalan


 Georgiana Puscasu and Ruslan Mitkov

 If it were then, then when was it? Establishing the anaphoric role of “then”


 Viktor Trón, Péter Halácsy, Péter Rebrus, András Rung, Péter Vajda, Eszter Simon

 Hungarian lexical database and morphological grammar


 Hong Phuong Le, Thi Minh Huyen Nguyen, Laurent Romary, Azim Roussanaly

 A Lexicalized Tree Adjoining Grammar for Vietnamese


 Silvie Cinkova, Pavel Pecina, Petr Podvesky, Pavel Schlesinger

 Semi-automatic Building of Swedish Collocation Lexicon


 Dominika Oliver and Krzysztof Szklanny

 Creation and analysis of a Polish speech database for use in unit selection speech synthesis


 Jorge Kinoshita, Laís do Nascimento Salvador, Carlos Eduardo Dantas de Menezes

 CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus


 Stefan Schaden

 Evaluation of automatically generated transcriptions of non-native pronunciations using a phonetic distance measure


 Isabella Chiari

 Slips and errors in spoken data transcription


 Motoko Ueyama

 Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval


 José Iria, Christopher Brewster, Fabio Ciravegna and Yorick Wilks



 Thomas Pellegrini and Lori Lamel

 Experimental detection of vowel pronunciation variants in Amharic


 R. Catizone, A. Dalli and Y. Wilks

 A Methodology for Evaluating Timelines


 Ivana Kruijff-Korbayova, Tilman Becker, Nate Blaylock, Ciprian Gerstenberger, Michael Kaisser, Peter Poller, Verena Rieser, Jan Schehl

 The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player


 Rebecca Passonneau, Roberta Blitz, David Elson, Angela Giral, and Judith Klavans

 CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project


 Anna Rumshisky, James Pustejovsky

 Inducing sense-discriminating context patterns from sense-tagged corpora


 Milen Kouylekov and Bernardo Magnini

 Building a Large-Scale Repository of Textual Entailment Rules


 Alessandro Moschitti and Roberto Basili

 A Tree Kernel approach to Question and Answer Classification in Question Answering Systems


 C. d’Alessandro, P. Boula de Mareüil, G. Bailly, M.-N. Garcia, M. Morel

 A joint intelligibility evaluation of French text to speech systems: the EVASY/SUS campaign


 Winston Anderson and Petro Kotzé

 Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho


 Couturier, Jean-François; Drouin, Patrick; Neuvel, Sylvain

 Applying lexical constraints on morpho-syntactic patterns for the identification of conceptual relational content in specialized texts.


 Katerina Pastra

 Beyond Multimedia Integration Mechanisms: in need of extending multimedia annotation schemes


 Yoshihiko Nitta, Masashi Saraki, Satoru Ikehara

 Building Carefully Tagged Bi-lingual Corpora to Cope with Linguistic Idiosyncrasy


 Roser Saurí, Marc Verhagen, James Pustejovsky

 SlinkET: A Partial Modal Parser for Events


 Christopher Cieri, Mark Liberman

 More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities


 A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. van den Heuvel, H.U. Hain, X. S. Wang, M. N. Garcia

 TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis


 Miriam Voghera, Francesco Cutugno

 An Observatory on Spoken Italian Linguistic Resources and Descriptive Standards


 Kamel Smaïli, Caroline Lavecchia and J.P. Haton

 Linguistic Features Modeling Based on Partial New Cache Model


 Pum-Mo Ryu, Key-Sun Choi, and Ronny Melz

 Taxonomy Learning using Term Specificity and Similarity


 Stefan Schulz, Kornél Markó, Percy Nohama, Roosewelt Leite de Andrade, Edson José Pacheco, Udo Hahn, Martin Romacker

 The MORPHOSAURUS Medical Subword Lexicon


 Marko Salmenkivi

 Finding representative sets of dialect words for geographical regions


 Olga Uryupina

 Coreference resolution with and without linguistic knowledge


 Keith Miller and Michelle Vanni

 Modern Standard Arabic v. Dialect: Comparing Arabic-English MT Performance


 O Hamon, M Rajman

 X-Score : Automatic Evaluation of Machine Translation Grammaticality


 Andrew Roberts, Latifa Al-Sulaiti and Eric Atwell

 A comparison of concordancers for Arabic: WordSmith, Mono-Conc, Xaira and aConCorde


 Roberto Navigli

 Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory


 Dafydd Gibbon, Flaviane Romani Fernandes, Thorsten Trippel

 A BLARK extension for temporal annotation mining


 Michael Schiehlen and Kristina Spranger

 The Mass-Count Distinction: Acquisition and Disambiguation


 Lars Nygaard & Stephan Oepen

 Identifying Transparent Compounds in a Large Computational Lexicon


 Andrew W. Cole

 Corpus Development and Publication


 Shu-Chuan Tseng, Dafydd Gibbon

 Discourse functions of duration in Mandarin


 Leonardo Lesmo, Livio Robaldo

 From Natural Language to Databases via Ontologies


 Adeline Nazarenko, Erick Alphonse, Julien Derivière, Thierry Hamon, Guillaume Vauvert, Davy Weissenbacher

 The ALVIS Format for Linguistically Annotated Documents


 Alexander Raake and Brian FG Katz

 SUS-based Method for Speech Reception Threshold Measurement in French


 Stephanie Strassel, Christopher Cieri, Andrew Cole, Denise DiPersio, Mark Liberman, Xiaoyi Ma, Mohamed Maamouri

 Integrated Linguistic Resources for Language Exploitation Technologies


 Xiaoyi Ma

 Champollion: A Robust Parallel Text Sentence Aligner


 Fabian Behrens & Jan-Torsten Milde

 The Eclipse Annotator: an extensible system for multimodal corpus creation


 Harris Papageorgiou, Elina Desipri, Maria Koutsombogera, Kanella Pouli, Andromaxi Griva, Prokopis Prokopidis

 Adding multi-layer semantics to the Greek Dependency Treebank


 Cristina BOSCO, Vincenzo LOMBARDO

 Comparing linguistic information in treebank annotations


 Nella Cucurullo, Simonetta Montemagni, Matilde Paoli, Eugenio Picchi, Eva Sassolini

 Dialectal resources on-line: the ALT-Web experience


 Xiaoyi Ma, Chris Cieri

 Corpus Support for Machine Translation at LDC


 Ann Bies, Stephanie Strassel, Haejoong Lee, Kazuaki Maeda, Seth Kulick, Yang Liu, Mary Harper, Matthew Lease

 Linguistic Resources for Speech Parsing


 Tomasz Obrebski, Michal Stolarski

 UAM Text Tools - a flexible NLP architecture


 Markus Geilfuss & Jan-Torsten Milde

 SAM - an annotation editor for parallel texts


 Hans Uszkoreit, Feiyu Xu, Joerg Steffen, Ilhan Aslan

 The pragmatic combination of different crosslingual resources for multilingual information services


 Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh and Paraic Sheridan

 Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval


 Grażyna Vetulani, Zygmunt Vetulani, Tomasz Obrębski

 Syntactic Lexicon of Polish Predicative Nouns


 Alonge Antonietta

 The Italian Metaphor Database


 Georgiana Puscasu

 How to identify temporal clauses?


 Jose Iria, Fabio Ciravegna

 A Methodology and Tool for Representing Language Resources for Information Extraction


 Harry Halpin

 Automatic Evaluation and Composition of NLP Pipelines with Web Services


 C. de Loupy

 The OuRAL Project


 Harry Bunt and Amanda Schiffrin

 Methodological aspects of semantic annotation


 Whitney Gegg-Harrison, Donna Byron

 PYCOT: An OT-Based Pronoun Resolution Tool


 Emma Barker, Ryuichiro Higashinaka, François Mairesse, Robert Gaizauskas, Marilyn Walker, Jonathan Foster

 Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives


 Jeongwoo Ko, Laurie Hiyakumoto, Eric Nyberg

 Exploiting Multiple Semantic Resources for Answer Selection


 Kazuaki Maeda, Christopher Cieri and Kevin Walker

 Low-cost Customized Speech Corpus Creation for Speech Technology Applications


 John Niekrasz and Alexander Gruenstein

 NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Discourse


 Christoph Benzmueller, Helmut Horacek, Henri Lesourd, Ivana Kruijff-Korbayova, Marvin Schiller, Magdalena Wolska

 A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material.


 J. Laoudi, C. Tate, C. Voss

 Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding


 Kazuaki Maeda, Haejoong Lee, Julie Medero, Stephanie Strassel

 A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit


 Hideki Shima, Mengqiu Wang, Frank Lin, Teruko Mitamura

 Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering


 Jeongwoo Ko, Fumihiko Murase, Teruko Mitamura, Eric Nyberg, Masahiko Tateishi, and Ichiro Akahori

 Analyzing the Effects of Spoken Dialog Systems on Driving Behavior


 Magesh Balasubramanya, Peter Lucas, Michael Higgins and Dominic Widdows

 Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources


 Vasile Rus, Art Graesser

 The Look and Feel of a Textual Entailer


 R. Gaizauskas, M. Hepple and H. Harkema

 Indirect Evaluation of Parsers Using Dependency Relations to Assess Semantic Output


 Gregory Marton and Boris Katz

 Automatic TREC Relationship Answering and Nugget Evaluation


 Finley Lacatusu, Andrew Hickl, Sanda Harabagiu

 Impact of Question Decomposition on the Quality of Answer Summaries


 Sanda M. Harabagiu, Cristina Nicolae, Gabriel Nicolae

 Coreference by Consolidation


 Sanda Harabagiu and Cosmin Adrian Bejan

 An Answer Bank for Temporal Inference


 Paul C Morarescu

 Principles for annotating and reasoning with spatial information


 Li Sujian, Lu Qin, Li Wenjie, Xu Ruifeng

 Interaction between Lexical Base and Ontology with Formal Concept Analysis


 Rachada Kongkachandra and Kosin Chamnongthai



 Ruifeng Xu, Qin Lu and Sujian Li

 The Design and Construction of Chinese Collocation Bank


 Nilda Ruimy, Adriana Roventini

 Merging two Ontology-based Lexical Resources


 Nimaan Abdillahi, Nocera Pascal

 Towards automatic transcription of Somali language


 Susanne Burger, Zachary A. Sloane, Jie Yang

 Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages


 Kornel Laskowski and Susanne Burger

 Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus


 W. Black, S. Elkateb, H. Rodriguez, M. Alkhalifa, P. Vossen, A. Pease, C. Fellbaum

 Introducing the Arabic WordNet Project


 Benoît Sagot and Pierre Boullier

 Deep non-probabilistic parsing of large corpora


 Magnar Brekke, Kai Innselset, Marita Kristiansen & Kari Øvsthus

 KB-N: Automatic Term Extraction from Knowledge Bank of Economics


 Alexander Klassmann, Freddy Offenga, Daan Broeder, Romuald Skiba, Peter Wittenburg

 Comparison of Resource Discovery Methods


 Peter Lucas, Magesh Balasubramanya, and Dominic Widdows

 The Information Commons Gazetteer: A Public Resource of Populated Places and Worldwide Administrative Divisions


 Benoît Sagot, Lionel Clément, Eric Villemonte de La Clergerie and Pierre Boullier

 The Lefff 2 syntactic lexicon for French: architecture, acquisition, use


 Nilda Ruimy

 Structuring a Domain Vocabulary in a General Knowledge Environment


 Alexander Geyken, Norbert Schrader

 LexikoNet - a lexical database based on type and role hierarchies


 Djamel Mostefa, Olivier Hamon, Khalid CHOUKRI et al.

 TC-STAR: Evaluation of speech to speech components


 Djamel Mostefa, Marie-Neige Garcia, Olivier Hamon, Khalid CHOUKRI et al.

 CHIL: Evaluation of multimodal technology components


 Carol Peters

 The Impact of Evaluation on Multilingual Information Retrieval System Development


 Carol Peters, Bernardo Magnini, Christelle Ayache, et al.

 CLEF Q-A special track


 Marie-Neige Garcia, Christophe d’Alessandro, Philippe Boula de Mareüil

 A joint prosody evaluation of French text-to-speech systems: the EVASY Prosody campaign

Copyright © 2006 ELRA