PAPERS: Browse articles of the conference sorted by title

1 - A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - Y

15 Years of Language Resource Creation and Sharing: a Progress Report on LDC Activities Christopher Cieri and Mark Liberman


A Bilingual Corpus of Inter-linked Events Tommaso Caselli, Nancy Ide and Roberto Bartolini
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Carmen Banea, Rada Mihalcea and Janyce Wiebe
A Bottom-up Comparative Study of EuroWordNet and WordNet 3.0 Lexical and Semantic Relations Maria Teresa Pazienza, Armando Stellato and Alexandra Tudorache
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture Arno Hartholt, Thomas Russ, David Traum, Eduard Hovy and Susan Robinson
A Common Multimedia Annotation Framework for Cross Linking Cultural Heritage Digital Collections Hennie Brugman, Véronique Malaisé and Laura Hollink
A Common Parts-of-Speech Tagset Framework for Indian Languages Baskaran Sankaran, Kalika Bali, Monojit Choudhury, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Girish Nath Jha, S. Rajendran, K. Saravanan, L. Sobha and K.V. Subbarao
A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields Mohamed Attia, Mohsen Rashwan, Ahmed Ragheb, Mohamed Al-Badrashiny and Husein Al-Basoumy
A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora Susanne Burger, Kornel Laskowski and Matthias Woelfel
A Comparative Evaluation of Term Recognition Algorithms Ziqi Zhang, Jose Iria, Christopher Brewster and Fabio Ciravegna
A Comparative Study on Language Identification Methods Lena Grothe, Ernesto William De Luca and Andreas Nürnberger
A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding Stefan Hahn, Patrick Lehnen, Christian Raymond and Hermann Ney
A Conceptual Approach to Web Image Retrieval Adrian Popescu and Gregory Grefenstette
A Contextual Dynamic Network Model for WSD Using Associative Concept Dictionary Jun Okamoto, Kiyoko Uchiyama and Shun Ishizaki
A Coreference Corpus and Resolution System for Dutch Iris Hendrickx, Gosse Bouma, Frederik Coppens, Walter Daelemans, Veronique Hoste, Geert Kloosterman, Anne-Marie Mineur, Joeri Van Der Vloet and Jean-Luc Verschelde
A Corpus for Cross-Document Co-reference David Day, Janet Hitzeman, Michael Wick, Keith Crouch and Massimo Poesio
A Dependency Parser for Thai Shisanu Tongchim, Randolf Altmeyer, Virach Sornlertlamvanich and Hitoshi Isahara
A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture Youssef Drissi, Branimir Boguraev, David Ferrucci, Paul Keyser and Anthony Levas
A Flexible Wizard of Oz Environment for Rapid Prototyping Stefan Scherer and Petra-Maria Strauß
A Framework for Identity Resolution and Merging for Multi-source Information Extraction Milena Yankova, Horacio Saggion and Hamish Cunningham
A Framework for Multilingual Ontology Mapping Cássia Trojahn, Paulo Quaresma and Renata Vieira
A Framework for Standardized Syntactic Annotation Thierry Declerck
A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems Kallirroi Georgila, Maria Wolters, Vasilis Karaiskos, Melissa Kronenthal, Robert Logie, Neil Mayo, Johanna Moore and Matt Watson
A General Methodology for Mapping EuroWordNets to the Suggested Upper Merged Ontology Dennis Spohr
A Grid of Regional Language Archives Paul Trilsbeek, Daan Broeder, Tobias Valkenhoef and Peter Wittenburg
A Ground Truth Dataset for Matching Culturally Diverse Romanized Person Names Mark Arehart and Keith J Miller
A Guide for the Production of Reusable Language Resources Victoria Arranz, Franck Gandcher, Valérie Mapelli and Khalid Choukri
A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions Amalia Todiraşcu, Dan Tufiş, Ulrich Heid, Christopher Gledhill, Dan Ştefanescu, Marion Weller and François Rousselot
A Hybrid Morphology-Based POS Tagger for Persian Mehrnoush Shamsfard and Hakimeh Fadaee
A Japanese-English Technical Lexicon for Translation and Language Research Fredric Gey, David Evans and Noriko Kando
A Knowledge-Modeling Approach for Multilingual Regulus Lexica Marianne Santaholma and Nikos Chatzichrisafis
A LAF/GrAF based Encoding Scheme for underspecified Representations of syntactic Annotations. Manuel Kountz, Ulrich Heid and Kerstin Eckart
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure Keiji Shinzato, Daisuke Kawahara, Chikara Hashimoto and Sadao Kurohashi
A lexicon for biology and bioinformatics: the BOOTStrep experience. Valeria Quochi, Monica Monachini, Riccardo Del Gratta and Nicoletta Calzolari
A Lightweight and Efficient Tool for Cleaning Web Pages Stefan Evert
A Linguistic Resource for Discovering Event Structures and Resolving Event Coreference Cosmin Bejan and Sanda Harabagiu
A Method for Automatically Constructing Case Frames for English Daisuke Kawahara and Kiyotaka Uchimoto
A Multi-Genre SMT System for Arabic to French Saša Hasan and Hermann Ney
A Multilingual Database of Polarity Items Beata Trawinski and Jan-Philipp Soehn
A Multi-Lingual Dictionary of Dirty Words Jonas Sjöbergh and Kenji Araki
A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions Shigeyoshi Kitazawa, Shinya Kiriyama, Tomohiko Kasami, Shogo Ishikawa, Naofumi Otani, Hiroaki Horiuchi and Yoichi Takebayashi
A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments Tomas Dekens, Yorgos Patsis, Werner Verhelst, Frédéric Beaugendre and François Capman
A Multi-Word Term Extraction Program for Arabic Language Siham Boulaknadel, Beatrice Daille and Driss Aboutajdine
A Pilot Arabic Propbank Martha Palmer, Olga Babko-Malaya, Ann Bies, Mona Diab, Mohamed Maamouri, Aous Mansouri and Wajdi Zaghouani
A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso and Hideki Ogura
A Question Answering System for German. Experiments with Morphological Linguistic Resources Florian Koehler, Hinrich Schuetze and Michaela Atterer
A Real-World Emotional Speech Corpus for Modern Greek Theodoros Kostoulas, Todor Ganchev, Iosif Mporas and Nikos Fakotakis
A Research on Automatic Chinese Catchword Extraction Han Ren, Donghong Ji and Lei Han
A Semantic Memory for Incremental Ontology Population Berenike Loos and Lasse Schwarten
A Semantically Annotated Swedish Medical Corpus Dimitrios Kokkinakis
A Simple Method for Tagset Comparision Markus Dickinson and Charles Jochim
A Study of Parentheticals in Discourse Corpora - Implications for NLG Systems Eva Banik and Alan Lee
A Suite to Compile and Analyze an LSP Corpus Rogelio Nazar, Jorge Vivaldi and Teresa Cabré
A Taxonomy of Lexical Metadata Categories Bodil Nistrup Madsen and Hanne Erdman Thomsen
A Test Suite for Inference Involving Adjectives Marilisa Amoia and Claire Gardent
A Text-based Query Interface to OWL Ontologies Danica Damljanovic, Valentin Tablan and Kalina Bontcheva
A Three-stage Disfluency Classifier for Multi Party Dialogues Margot Mieskes and Michael Strube
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization Oana Frunza
A Unified Database of Dependency Treebanks: Integrating, Quantifying & Evaluating Dependency Data Olga Pustylnikov, Alexander Mehler and Rüdiger Gleim
A Web Browser Extension for Growing-up Ontological Knowledge from Traditional Web Content Maria Teresa Pazienza, Marco Pennacchiotti and Armando Stellato
Acquiring a Poor Man’s Inflectional Lexicon for German Peter Adolphs
Acquiring a Taxonomy from the German Wikipedia Laura Kassner, Vivi Nastase and Michael Strube
Acquiring Naturalistic Concept Descriptions from the Web Tony Veale and Yanfen Hao
Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language Briony Williams and Rhys James Jones
Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques David Griol, Lluís F. Hurtado, Encarna Segarra and Emilio Sanchis
Active Annotation in the LUNA Italian Corpus of Spontaneous Dialogues Christian Raymond, Kepa Joseba Rodriguez and Giuseppe Riccardi
Adaptation of Relation Extraction Rules to New Domains Feiyu Xu, Hans Uszkoreit, Hong Li and Niko Felger
Adapting International Standard for Asian Language Technologies Tokunaga Takenobu, Dain Kaplan, Chu-Ren Huang, Shu-Kai Hsieh, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Kiyoaki Shirai, Virach Sornlertlamvanich, Thatsanee Charoenporn and Xia YingJu
Adjectives in the Dutch Semantic Lexical Database CORNETTO Isa Maks, Piek Vossen, Roxane Segers and Hennie van der Vliet
Adjudicator Agreement and System Rankings for Person Name Search Mark Arehart, Chris Wolf and Keith J. Miller
ALC: Alcohol Language Corpus Florian Schiel, Christian Heinrich, Sabine Barfüßer and Thomas Gilg
All, and only, the Errors: more Complete and Consistent Spelling and OCR-Error Correction Evaluation Martin Reynaert
Amazigh Language Terminology in Morocco or Management of a “Multidimensional” Variation Aicha Bouhjar
An AI-inspired intelligent agent/student architecture to combine Language Resources research and teaching Bayan Abu Shawar and Eric Atwell
An Approach to Modeling Heterogeneous Resources for Information Extraction Lei Xia and Jose Iria
An Automatic Close Copy Speech Synthesis Tool for Large-Scale Speech Corpus Evaluation Dafydd Gibbon and Jolanta Bachan
An Economic View on Human Language Technology Evaluation Edouard Geoffrois
An Empirical Approach to a Preliminary Successful Identification and Resolution of Temporal Expressions in Spanish News Corpora María Teresa Vicente-Díez, Doaa Samy and Paloma Martínez
An eRulemaking Corpus: Identifying Substantive Issues in Public Comments Claire Cardie, Cynthia Farina, Matt Rawding and Adil Aijaz
An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System Dave Toney, Sophie Rosset, Aurélien Max, Olivier Galibert and Eric Bilinski
An Evaluation Resource for Geographic Information Retrieval Thomas Mandl, Fredric Gey, Giorgio Di Nunzio, Nicola Ferro, Mark Sanderson, Diana Santos and Christa Womser-Hacker
An Exchange Format for Multimodal Annotations Thomas Schmidt, Susan Duncan, Oliver Ehmer, Jeffrey Hoyt, Michael Kipp, Dan Loehr, Magnus Magnusson, Travis Rose and Han Sloetjes
An Experimental Methodology for an End-to-End Evaluation in Speech-to-Speech Translation Olivier Hamon and Djamel Mostefa
An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems Keith J. Miller, Mark Arehart, Catherine Ball, John Polk, Alan Rubenstein, Kenneth Samuel, Elizabeth Schroeder Eva Vecchi and Chris Wolf
An Inverted Index for Storing and Retrieving Grammatical Dependencies Michaela Atterer and Hinrich Schütze
An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora David Guthrie, Louise Guthrie and Yorick Wilks
Analysis and Performance of Morphological Query Expansion and Language-Filtering Words on Basque Web Searching Igor Leturia, Antton Gurrutxaga, Nerea Areta and Eli Pociello
Anaphora Resolution Exercise: an Overview Constantin Orăsan, Dan Cristea, Ruslan Mitkov and António Branco
Anaphoric Annotation in the ARRAU Corpus Massimo Poesio and Ron Artstein
ANAWIKI: Creating Anaphorically Annotated Resources through Web Cooperation Massimo Poesio, Udo Kruschwitz and Chamberlain Jon
AnCora: Multilevel Annotated Corpora for Catalan and Spanish Mariona Taulé, M. Antònia Martí and Marta Recasens
AnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora Juan Aparicio, Mariona Taulé and M.Antònia Martí
ANNALIST - ANNotation ALIgnment and Scoring Tool George Demetriou, Robert Gaizauskas, Haotian Sun and Angus Roberts
Annotating “tense” in a Tense-less Language Nianwen Xue, Hua Zhong and Kai-Yun Chen
Annotating Abstract Pronominal Anaphora in the DAD Project Costanza Navarretta and Sussi Olsen
Annotating an Arabic Learner Corpus for Error Ghazi Abuhakema, Reem Faraj, Anna Feldman and Eileen Fitzpatrick
Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank Andrea Esuli, Fabrizio Sebastiani and Ilaria Urciuoli
Annotating Students’ Understanding of Science Concepts Rodney D. Nielsen, Wayne Ward, James Martin and Martha Palmer
Annotating Subjective Content in Meetings Theresa Wilson
Annotating Superlatives Silke Scheible
Annotating Topics of Opinions Veselin Stoyanov and Claire Cardie
Annotation and analysis of overlapping speech in political interviews Martine Adda-Decker, Claude Barras, Gilles Adda, Patrick Paroubek, Philippe Boula de Mareuil and Benoit Habert
Annotation by Category: ELAN and ISO DCR Han Sloetjes and Peter Wittenburg
Annotation Guidelines for Chinese-Korean Word Alignment Jin-Ji Li, Dong-Il Kim and Jong-Hyeok Lee
Annotation of Information Structure: an Evaluation across different Types of Texts Julia Ritz, Stefanie Dipper and Michael Götze
Annotation of Nuggets and Relevance in GALE Distillation Evaluation Olga Babko-Malaya
Annotation of WordNet Verbs with TimeML Event Classes Georgiana Puscasu and Verginica Barbu Mititelu
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium Kazuaki Maeda, Haejoong Lee, Shawn Medero, Julie Medero, Robert Parker and Stephanie Strassel
Answering List Questions using Co-occurrence and Clustering Majid Razmara and Leila Kosseim
Application of Resource-based Machine Translation to Real Business Scenes Hitoshi Isahara, Masao Utiyama, Eiko Yamamoto, Akira Terada and Yasunori Abe
Applying Automated Metrics to Speech Translation Dialogs Sherri Condon, Jon Phillips, Christy Doran, John Aberdeen, Dan Parvaz, Beatrice Oshika, Greg Sanders and Craig Schlenoff
Approximating Learning Curves for Active-Learning-Driven Annotation Tomanek Katrin and Udo Hahn
Arabic WordNet: Semi-automatic Extensions using Bayesian Inference Horacio Rodríquez, David Farwell, Javi Ferreres, Manuel Bertran, Musa Alkhalifa and M. Antonia Martí
Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study Eric Ringger, Marc Carmen, Robbie Haertel, Kevin Seppi, Deryle Lonsdale, Peter McClanahan, James Carroll and Noel Ellison
ASV Toolbox: a Modular Collection of Language Exploration Tools Chris Biemann, Uwe Quasthoff, Gerhard Heyer and Florian Holz
Audio Database in Support of Potentiel Threat and Crisis Situation Management Stavros Ntalampiras, Ilyas Potamitis, Todor Ganchev and Nikos Fakotakis
Authorship Attribution of E-Mail: Comparing Classifiers over a New Corpus for Evaluation Ben Allison and Louise Guthrie
Authorship Identification of Romanian Texts with Controversial Paternity Liviu Dinu, Marius Popescu and Anca Dinu
Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings Yoji Kiyota, Noriyuki Tamura, Satoshi Sakai, Hiroshi Nakagawa and Hidetaka Masuda
Automatic Acquisition for low frequency lexical items Núria Bel, Sergio Espeja and Montserrat Marimon
Automatic Acquisition of Usage Information for Language Resources Shunsuke Kozawa, Hitomi Tohyama, Kiyotaka Uchimoto and Shigeki Matsubara
Automatic Assessment of Japanese Text Readability Based on a Textbook Corpus Satoshi Sato, Suguru Matsuyoshi and Yohsuke Kondoh
Automatic Construction of a Japanese-Chinese Dictionary via English Hiroyuki Kaji, Shin’ichi Tamamura and Dashtseren Erdenebat
Automatic Document Quality Control Neil Newbold and Lee Gillam
Automatic Emotional Degree Labeling for Speakers’ Anger Utterance during Natural Japanese Dialog Yoshiko Arimoto, Sumio Ohno and Hitoshi Iida
Automatic Evaluation Measures for Statistical Machine Translation System Optimization Arne Mauser, Sasa Hasan and Hermann Ney
Automatic extraction of subcategorization frames for Italian Dino Ienco, Serena Villata and Cristina Bosco
Automatic Extraction of Textual Elements from News Web Pages Hossam Ibrahim, Kareem Darwish and Abdel-Rahim Madany
Automatic Identification of Temporal Information in Tourism Web Pages Stéphanie Weiser, Philippe Laublet and Jean-Luc Minel
Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation Verena Rieser and Oliver Lemon
Automatic Phone Segmentation of Expressive Speech Laure Charonnat, Gaëlle Vidal and Olivier Boeffard
Automatic Phoneme Segmentation with Relaxed Textual Constraints Pierre Lanchantin, Andrew C. Morris, Xavier Rodet and Christophe Veaux
Automatic Rewriting of Patient Record Narratives Catalina Hallett and David Hardcastle
Automatic Rich Annotation of Large Corpus of Conversational transcribed speech: the Chunking Task of the EPAC Project Jean-Yves Antoine, Abdenour Mokrane and Nathalie Friburger
Automatic Translation of Biomedical Terms by Supervised Machine Learning Vincent Claveau


Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages Michael Mohler and Rada Mihalcea
BART: A modular toolkit for coreference resolution Yannick Versley, Simone Ponzetto, Massimo Poesio, Vladimir Eidelman, Alan Jern, Jason Smith, Xiaofeng Yang and Alessandro Moschitti
Benchmark Databases for Video-Based Automatic Sign Language Recognition Philippe Dreuw, Carol Neidle, Vassilis Athitsos, Stan Sclaroff and Hermann Ney
Benchmarking Textual Annotation Tools for the Semantic Web Diana Maynard
Bilingual Text Classification using the IBM 1 Translation Model Jorge Civera and Alfons Juan-Císcar
BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition Doroteo Toledano, Daniel Hernandez-Lopez, Cristina Esteve-Elizalde, Julian Fierrez, Javier Ortega-Garcia, Daniel Ramos and Joaquin Gonzalez-Rodriguez
BLEU+: a Tool for Fine-Grained BLEU Computation A. Cuneyd Tantug, Kemal Oflazer and Ilknur Durgar El-Kahlout
BOEMIE Ontology-Based Text Annotation Tool Pavlina Fragkou, Georgios Petasis, Aris Theodorakos, Vangelis Karkaletsis and Constantine Spyropoulos
Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia Asuka Sumida, Naoki Yoshinaga and Kentaro Torisawa
Boot-Strapping a WordNet Using Multiple Existing WordNets Francis Bond, Hitoshi Isahara, Kyoko Kanzaki and Kiyotaka Uchimoto
Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic) Harald Hammarström, Christina Thornell, Malin Petzell and Torbjörn Westerlund
Borrowing Language Resources for Development of Automatic Speech Recognition for Low- and Middle-Density Languages Lynette Melnar and Chen Liu
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition Christopher Cieri, Stephanie Strassel, Meghan Glenn, Reva Schwartz, Wade Shen and Joseph Campbell
Building a Bilingual Representation of the Roget Thesaurus for French to English Machine Translation Violaine Prince and Jacques Chauché
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora Paul Thompson, Philip Cotter, John McNaught, Sophia Ananiadou, Simonetta Montemagni, Andrea Trabucco and Giulia Venturi
Building a Corpus of Temporal-Causal Structure Steven Bethard, William Corvey, Sara Klingenstein and James H. Martin
Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN. Daan Broeder, David Nathan, Sven Strömqvist and Remco van Veenendaal
Building a Golden Collection of Parallel Multi-Language Word Alignment Joao Graca, Joana Paulo Pardal, Luisa Coheur and Diamantino Caseiro
Building a Greek corpus for Textual Entailment Evi Marzelou, Maria Zourari, Voula Giouli and Stelios Piperidis
Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration Wolodja Wentland, Johannes Knopp, Carina Silberer and Matthias Hartung
Building Affective Lexicons from Specific Corpora for Automatic Sentiment Analysis Yves Bestgen
Building an Annotated Corpus for Text Summarization and Question Answering Patcharee Varasai, Chaveevan Pechsiri, Thana Sukvari, Vee Satayamas and Asanee Kawtrakul
Building Bilingual Lexicons using Lexical Translation Probabilities via Pivot Languages Takashi Tsunakawa, Naoaki Okazaki and Jun’ichi Tsujii
Building Mobile Spoken Dialogue Applications Using Regulus Nikos Tsourakis, Maria Georgescul, Pierrette Bouillon and Manny Rayner
Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis Jindřich Matoušek, Daniel Tihelka and Jan Romportl
Building the Valency Lexicon of Arabic Verbs Viktor Bielický and Otakar Smrž


CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content Martine Garnier-Rizet, Gilles Adda, Frederik Cailliau, Sylvie Guillemin-Lanne, Claire Waast-Richard, Lori Lamel, Stephan Vanni and Claire Waast-Richard
Can we Evaluate the Quality of Generated Text? David Hardcastle and Donia Scott
Causal Relation Extraction Eduardo Blanco, Nuria Castell and Dan Moldovan
Centering Theory for Evaluation of Coherence in Computer-Aided Summaries Laura Hasler
Certification and Cleaning up of a Text Corpus: Towards an Evaluation of the “Grammatical” Quality of a Corpus Cyril Grouin
Challenges in Pronoun Resolution System for Biomedical Text Ngan Nguyen, Jin-Dong Kim and Jun’ichi Tsujii
Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian Lorraine Goeuriot, Natalia Grabar and Béatrice Daille
Children’s Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement Leen Cleuren, Jacques Duchateau, Pol Ghesquière and Hugo Van hamme
Chinese Core Ontology Construction from a Bilingual Term Bank Yirong Chen, Qin Lu, Wenjie Li and Gaoying Cui
Chinese Term Extraction Based on Delimiters Yuhang Yang, Qin Lu and Tiejun Zhao
Chooser: a Multi-Task Annotation Tool Svetla Koeva, Borislav Rizov and Svetlozara Leseva
CLARIN: Common Language Resources and Technology Infrastructure Tamás Váradi, Steven Krauwer, Peter Wittenburg, Martin Wynne and Kimmo Koskenniemi
Classification Procedures for Software Evaluation Muriel Amar, Sophie David, Rachel Panckhurst and Lisa Whistlecroft
Cleaneval: a Competition for Cleaning Web Pages Marco Baroni, Francis Chantree, Adam Kilgarriff and Serge Sharoff
CLIoS: Cross-lingual Induction of Speech Recognition Grammars Nadine Perera, Michael Pitz and Manfred Pinkal
Clustering of Terms from Translation Dictionaries and Synonyms Lists to Automatically Build more Structured Linguistic Resources Maria Teresa Pazienza and Armando Stellato
Clustering Related Terms with Definitions Scott Piao, John McNaught and Sophia Ananiadou
Coding Emotional Events in Audiovisual Corpora Laurence Devillers and Jean-Claude Martin
COLDIC, a Lexicographic Platform for LMF compliant lexica Núria Bel, Sergio Espeja, Montserrat Marimon and Marta Villegas
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition Pavel Campr, Marek Hrúz and Jana Trojanová
Combined Systems for Automatic Phonetic Transcription of Proper Nouns Antoine Laurent, Téva Merlin, Sylvain Meignier, Yannick Estève and Paul Deléglise
Combining Multiple Models for Speech Information Retrieval Muath Alzghool and Diana Inkpen
Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation Angus Roberts, Robert Gaizasukas, Mark Hepple and Yikun Guo
Communicating Unknown Words in Machine Translation Matthias Eck, Stephan Vogel and Alex Waibel
Comparing Corpus-based to Web-based Lookup Techniques for Automatic English Inclusion Detection Beatrice Alex
Comparing Dependency and Constituent Syntax for Frame-semantic Analysis Richard Johansson and Pierre Nugues
Comparing Italian parsers on a common Treebank: the EVALITA experience Cristina Bosco, Alessandro Mazzei, Vincenzo Lombardo, Giuseppe Attardi, Anna Corazza, Alberto Lavelli, Leonardo Lesmo, Giorgio Satta and Maria Simi
Comparing Set-Covering Strategies for Optimal Corpus Design Jonathan Chevelu, Nelly Barbot, Olivier Boeffard and Arnaud Delhay
Complete and Consistent Annotation of WordNet using the Top Concept Ontology Javier Álvez, Jordi Atserias, Jordi Carrera, Salvador Climent, Egoitz Laparra, Antoni Oliver and German Rigau
Computational Models for Event Type Classification in Context Alessandra Zarcone and Alessandro Lenci
Conceptual Modeling of Ontology-based Linguistic Resources with a Focus on Semantic Relations Francisco Alvarez Montero, Antonio Vaquero Sanchez and Fernando Saenz Perez
Condensing Sentences for Subtitle Generation Prokopis Prokopidis, Vassia Karra, Aggeliki Papagianopoulou and Stelios Piperidis
Connecting Text Mining and Pathways using the PathText Resource Rune Sætre, Brian Kemper, Kanae Oda, Naoaki Okazaki, Yukiko Matsuoka, Norihiro Kikuchi, Hiroaki Kitano, Yoshimasa Tsuruoka, Sophia Ananiadou and Jun’ichi Tsujii
Constructing a Corpus that Indicates Patterns of Modification between Draft and Final Translations by Human Translators Takeshi Abekawa and Kyo Kageura
Constructing a Database of Non-Japanese Pronunciations of Different Japanese Romanizations Reiko Kaji and Hajime Mochizuki
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition Philip Ogren, Guergana Savova and Christopher Chute
Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus Takahiro Ono, Hitomi Tohyama and Shigeki Matsubara
Construction of a Metadata Database for Efficient Development and Use of Language Resources Hitomi Tohyama, Shunsuke Kozawa, Kiyotaka Uchimoto, Shigeki Matsubara and Hitoshi Isahara
Controlling Redundancy in Referring Expressions Jette Viethen, Robert Dale, Emiel Krahmer, Mariet Theune and Pascal Touset
Converting Romanized Persian to the Arabic Writing Systems Jalal Maleki and Lars Ahrenberg
CORP-ORAL: Spontaneous Speech Corpus for European Portuguese Fabíola Santos and Tiago Freitas
Corpus Analysis of Spoken Smart-Home Interactions with Older Users Sebastian Möller, Florian Gödde and Maria Wolters
Corpus and Voices for Catalan Speech Synthesis Antonio Bonafonte, Jordi Adell, Ignasi Esquerra, Silvia Gallego, Asunción Moreno and Javier Pérez
Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information Michael Roth and Sabine Schulte im Walde
Corpus Exploitation from Wikipedia for Ontology Construction Gaoying Cui, Qin Lu, Wenjie Li and Yirong Chen
Corpus-based Semantic Relatedness for the Construction of Polish WordNet Bartosz Broda, Magdalena Derwojedowa, Maciej Piasecki and Stanislaw Szpakowicz
Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages Svitlana Kurella, Serge Sharoff and Anthony Hartley
Cost-Sensitive Learning in Answer Extraction Michael Wiegand, Jochen L. Leidner and Dietrich Klakow
Creating a Research Collection of Question Answer Sentence Pairs with Amazon’s Mechanical Turk Michael Kaisser and John Lowe
Creating and Exploiting Multimodal Annotated Corpora Philippe Blache, Roxane Bertrand and Gaëlle Ferré
Creating and Using a Correlated Corpus to Glean Communicative Commonalities Jade Goldstein-Stewart, Kerri Goodwin, Roberta Sabin and Ransom Winder
Creating Glossaries Using Pattern-Based and Machine Learning Techniques Eline Westerhout and Paola Monachesi
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion Kazuaki Maeda, Xiaoyi Ma and Stephanie Strassel
Creation of Learner Corpus and Its Application to Speech Recognition Hiroki Yamazaki, Keisuke Kitamura, Takashi Harada and Seiichi Yamamoto
Cross-Corpus Evaluation of Word Alignment Sylwia Ozdowska
Cross-Domain Dialogue Act Tagging Nick Webb, Ting Liu, Mark Hepple and Yorick Wilks
Czech MWE Database Karel Pala, Lukáš Svoboda and Pavel Šmerk
CzEng 0.7: Parallel Corpus with Community-Supplied Translations Ondřej Bojar, Miroslav Janíček, Zdeněk žabokrtský, Pavel Češka and Peter Beňa


Data Collection for the CHIL CLEAR 2007 Evaluation Campaign Nicolas Moreau, Djamel Mostefa, Rainer Stiefelhagen, Susanne Burger and Khalid Choukri
Definition Extraction Using a Sequential Combination of Baseline Grammars and Machine Learning Classifiers Łukasz Degórski, Michał Marcińczuk and Adam Przepiórkowski
♠ Demo: An Open Source Tool for Partial Parsing and Morphosyntactic Disambiguation Aleksander Buczyński and Adam Przepiórkowski
Dependency-Based Relation Mining for Biomedical Literature Fabio Rinaldi, Gerold Schneider, Kaarel Kaljurand and Michael Hess
Deriving Rhetorical Complexity Data from the RST-DT Corpus Sandra Williams and Richard Power
Design and Data Collection for Spoken Polish Dialogs Database Krzysztof Marasek and Ryszard Gubrynowicz
Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition Jana Trojanová, Marek Hrúz, Pavel Campr and Milos Zelezny
Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases Rubén Fernández, Luis A. Hernández, Eduardo López, José Alcázar, Guillermo Portillo and Doroteo T. Toledano
Designing and Evaluating a Russian Tagset Serge Sharoff, Mikhail Kopotev, Tomaz Erjavec, Anna Feldman and Dagmar Divjak
Detecting Co-Derivative Documents in Large Text Collections Jan Pomikálek and Pavel Rychlý
Detecting Errors in Semantic Annotation Markus Dickinson and Chong Min Lee
Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories Antonio Moreno Sandoval, Doroteo Torre Toledano, Raúl de la Torre, Marta Garrote and José M. Guirao
Developing a TT-MCTAG for German with an RCG-based Parser Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier and Johannes Dellert
Developing Corpus of Japanese Classroom Lecture Speech Contents Masatoshi Tsuchiya, Satoru Kogure, Hiromitsu Nishizaki, Kengo Ohta and Seiichi Nakagawa
Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System Pierrette Bouillon, Sonia Halimi, Yukie Nakao, Kyoko Kanzaki, Hitoshi Isahara, Nikos Tsourakis, Marianne Starlander, Beth Ann Hockey and Manny Rayner
Developing Verb Frames for Hindi Rafiya Begum, Samar Husain, Lakshmi Bai and Dipti Misra Sharma
Development and Alignment of a Domain-Specific Ontology for Question Answering Shiyan Ou, Viktor Pekar, Constantin Orăsan, Christian Spurk and Matteo Negri
Development of the Japanese WordNet Hitoshi Isahara, Fransis Bond, Kiyotaka Uchimoto, Masao Utiyama and Kyoko Kanzaki
Developments of “Lëtzebuergesch” Resources for Automatic Speech Processing and Linguistic Studies Martine Adda-Decker, Thomas Pellegrini, Eric Bilinski and Gilles Adda
DIAC+: a Professional Diacritics Recovering System Dan Tufiş and Alexandru Ceauşu
Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation Mohamed Maamouri, Seth Kulick and Ann Bies
Dialogue, Speech and Images: the Companions Project Data Set Yorick Wilks, David Benyon, Christopher Brewster, Pavel Ircing and Oli Mival
Dictionary of Multiword Expressions for Translation into highly Inflected Languages Daiga Deksne, Raivis Skadins and Inguna Skadina
Division of Example Sentences Based on the Meaning of a Target Word Using Semi-Supervised Clustering Hiroyuki Shinnou and Minoru Sasaki
Do we Still Need Gold Standards for Evaluation? Thierry Poibeau and Cédric Messiant
Does Netgraph Fit Prague Dependency Treebank? Jiří Mírovský
Domain-Specific English-To-Spanish Translation of FrameNet Mario Crespo Miguel and Paul Buitelaar


EASY, Evaluation of Parsers of French: what are the Results? Patrick Paroubek, Isabelle Robba, Anne Vilnat and Christelle Ayache
Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text Katia Lida Kermanidis, Aristomenis Thanopoulos, Manolis Maragoudakis and Nikos Fakotakis
Elicited Imitation as an Oral Proficiency Measure with ASR Scoring C. Ray Graham, Deryle Lonsdale, Casey Kennington, Aaron Johnson and Jeremiah McGhee
Emotion Recognition from Speech: Stress Experiment Stefan Scherer, Hansjörg Hofmann, Malte Lampmann, Martin Pfeil, Steffen Rhinow, Friedhelm Schwenker and Günther Palm
Encoding Terms from a Scientific Domain in a Terminological Database: Methodology and Criteria Rita Marinelli, Melissa Tiberi and Remo Bindi
Enhancing an English-Polish Electronic Dictionary for Multiword Expression Research Piotr Bański and Radosław Moszczyński
Enhancing the Arabic Treebank: a Collaborative Effort toward New Annotation Guidelines Mohamed Maamouri, Ann Bies and Seth Kulick
Enriching Frame Semantic Resources with Dependency Graphs Hagen Fürstenau
Enriching GermaNet with verb-noun relations - a case study of lexical acquisition Lothar Lemnitzer, Holger Wunsch and Piklu Gupta
Enriching the Venice Italian Treebank with Dependency and Grammatical Relations Sara Tonelli, Rodolfo Delmonte and Antonella Bristot
Ensuring Semantic Interoperability on Lexical Resources Marc Kemps-Snijders, Claus Zinn, Jacquelijn Ringersma, and Menzo Windhouwer
Entity Translation and Alignment in the ACE-07 ET Task Zhiyi Song and Stephanie Strassel
Error Analysis for Learning-based Coreference Resolution Olga Uryupina
Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language Anil Kumar Singh, Kiran Pala and Harshit Surana
Estimating Word Phonosemantics Victoria Bobicev and Tatiana Zidraşco
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case Kremena Ivanova, Ulrich Heid, Sabine Schulte im Walde, Adam Kilgarriff and Jan Pomikalek
Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German Jeremy Nicholson, Valia Kordoni, Yi Zhang, Timothy Baldwin and Rebecca Dridan
Evaluating Complement-Modifier Distinctions in a Semantically Annotated Corpus Mark McConville and Myroslava O. Dzikovska
Evaluating Dialogue Act Tagging with Naive and Expert Annotators Jeroen Geertzen, Volha Petukhova and Harry Bunt
Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection Diana Maynard, Wim Peters and Yaoyong Li
Evaluating Robustness Of A QA System Through A Corpus Of Real-Life Questions Laurianne Sitbon, Patrice Bellot and Philippe Blache
Evaluating Summaries Automatically - A system Proposal Paulo C F de Oliveira, Edson Wilson Torrens, Alexandre Cidral, Sidney Schossland and Evandro Bittencourt
Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication Simon Scerri, Myriam Mencke, Brian Davis and Siegfried Handschuh
Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization Folkert De Vriend, Jan Pieter Kunst, Louis Ten Bosch, Charlotte Giesbers and Roeland Van Hout
Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series - Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda and Satoshi Nakamura
Evaluation Metrics for Automatic Temporal Annotation of Texts Xavier Tannier and Philippe Muller
Evaluation of a Cross-lingual Romanian-English Multi-document Summariser Constantin Orasan and Oana Andreea Chiorean
Evaluation of a Machine Translation System for Low Resource Languages: METIS-II Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, Olga Yannoutsou, Toni Badia, Maite Melero, Gemma Boleda, Michael Carl and Paul Schmidt
Evaluation of Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation Marine Carpuat and Dekai Wu
Evaluation of Different Segmentation Techniques for Dialogue Turns Carlos D. Martínez-Hinarejos and Vicent Tamarit
Evaluation of Lexical Resources and Semantic Networks on a Corpus of Mental Associations Laurianne Sitbon, Patrice Bellot and Philippe Blache
Evaluation of Linguistics-Based Translation Janne Bondi Johannessen, Torbjørn Nordgård and Lars Nygaard
Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework Harald Höge, Zdravko Kacic, Bojan Kotnik, Matej Rojc, Nicolas Moreau and Horst-Udo Hain
Evaluation of Natural Language Tools for Italian: EVALITA 2007 Bernardo Magnini, Amedeo Cappelli, Fabio Tamburini, Cristina Bosco, Alessandro Mazzei, Vincenzo Lombardo, Francesca Bertagna, Nicoletta Calzolari, Antonio Toral, Valentina Bartalesi Lenzi, Rachele Sprugnoli and Manuela Speranza
Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation. Míriam Luján Mares, Carlos David Martínez Hinarejos and Vicent Alabau Gonzalvo
Evaluation of Spoken Document Retrieval for Historic Speech Collections Willemijn Heeren, Franciska de Jong, Laurens van der Werff, Marijn Huijbregts and Roeland Ordelman
Evaluation of Virtual Keyboards for West-African Languages Chantal Enguehard and Harouna Naroua
Event Detection and Summarization in Weblogs with Temporal Collocations Chun-Yuan Teng and Hsin-Hsi Chen
Evolutionary Basic Notions for a Thematic Representation of General Knowledge Alain Joubert and Mathieu Lafourcade
Experimental Fast-Tracking of Morphological Analysers for Nguni Languages Sonja Bosch, Laurette Pretorius, Kholisa Podile and Axel Fleisch
Experiments on Processing Overlapping Parallel Corpora Mark Fishel and Heiki-Jaan Kaalep
Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retrieval Setting Jussi Karlgren, Hercules Dalianis and Bart Jongejan
Exploitation of an Arabic Language Resource for Machine Translation Evaluation: using Buckwalter-based Lookup Tool to Augment CMU Alignment Algorithm Clare Voss, Jamal Laoudi and Jeffrey Micher
Exploiting Lexical Resources for Disambiguating CJK and Arabic Orthographic Variants Jack Halpern
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks Barry Haddow and Beatrice Alex
Exploiting the Role of Position Feature in Chinese Relation Extraction Peng Zhang, Wenjie Li, Furu Wei, Qin Lu and Yuexian Hou
Exploring and Enriching a Language Resource Archive via the Web Marc Kemps-Snijders, Alex Klassmann, Claus Zinn, Peter Berck, Albert Russel and Peter Wittenburg
Exploring and Navigating: Tools for GermaNet Marc Finthammer and Irene Cramer
Extended Named Entity Ontology with Attribute Information Satoshi Sekine
Extracting and Querying Relations in Scientific Papers on Language Technology Ulrich Schäfer, Hans Uszkoreit, Christian Federmann, Torsten Marek and Yajing Zhang
Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies Siaw-Fong Chung, Laurent Prévot, Mingwei Xu, Kathleen Ahrens, Shu-Kai Hsieh and Chu-Ren Huang
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary Torsten Zesch, Christof Müller and Iryna Gurevych
Extraction and Evaluation of Keywords from Learning Objects: a Multilingual Approach Lothar Lemnitzer and Paola Monachesi
Extraction of Attribute Concepts from Japanese Adjectives Kyoko Kanzaki, Francis Bond, Noriko Tomuro and Hitoshi Isahara
Extraction of Informative Expressions from Domain-specific Documents Eiko Yamamoto, Hitoshi Isahara, Akira Terada and Yasunori Abe


F0 of Adolescent Speakers - First Results for the German Ph@ttSessionz Database Christoph Draxler, Florian Schiel and Tania Ellbogen
FATE: a FrameNet-Annotated Corpus for Textual Entailment Aljoscha Burchardt and Marco Pennacchiotti
Finding the Sources and Targets of Subjective Expressions Josef Ruppenhofer, Swapna Somasundaran and Janyce Wiebe
Fine-grained Opinion Topic and Polarity Identification Xiwen Cheng and Feiyu Xu
First Broadcast News Transcription System for Khmer Language Sopheap Seng, Sethserey Sam, Laurent Besacier, Brigitte Bigi and Eric Castelli
Foundation of a Component-based Flexible Registry for Language Resources and Technology Daan Broeder, Thierry Declerck, Erhard Hinrichs, Stelios Piperidis, Laurent Romary, Nicoletta Calzolari and Peter Wittenburg
Frame Information Transfer from English to Italian Sara Tonelli and Emanuele Pianta
From D-Coi to SoNaR: a reference corpus for Dutch Nelleke Oostdijk, Martin Reynaert, Paola Monachesi, Gertjan Van Noord, Roeland Ordelman, Ineke Schuurman and Vincent Vandeghinste
From Extracting to Abstracting: Generating Quasi-abstractive Summaries Zhuli Xie, Barbara Di Eugenio and Peter C. Nelson
From Field Notes towards a Knowledge Base Piroska Lendvai and Steve Hunt
From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian Tiit Hennoste, Olga Gerassimenko, Riina Kasterpalu, Mare Koit, Andriela Rääbis and Krista Strandson
From Research to Application in Multilingual Information Access: the Contribution of Evaluation Carol Peters, Martin Braschler, Giorgio Di Nunzio, Nicola Ferro, Julio Gonzalo, Mark Sanderson
From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank Lucie Mladová, Sárka Zikánová and Eva Hajicová


Generalising Lexical Translation Strategies for MT Using Comparable Corpora Bogdan Babych, Serge Sharoff and Anthony Hartley
Generating a Morphological Lexicon of Organization Entity Names Nikola Ljubesic, Tomislava Lauc and Damir Boras
Generating Bilingual Dictionaries by Transitivity Luka Nerima and Eric Wehrli
GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain Yuka Tateisi, Yusuke Miyao, Kenji Sagae and Jun’ichi Tsujii
Geo-WordNet: Automatic Georeferencing of WordNet Davide Buscaldi and Paolo Rosso
German Today: a really extensive Corpus of Spoken Standard German Caren Brinckmann, Stefan Kleiner, Ralf Knöbl and Nina Berend
Glossa: a Multilingual, Multimodal, Configurable User Interface Lars Nygaard, Joel Priestley, Anders Nøklestad and Janne Bondi Johannessen
GMT to +2 or how can TimeML be used in Romanian Corina Forăscu


Harvesting Multi-Word Expressions from Parallel Corpora Špela Vintar and Darja Fišer
Head or Non-head? Semi-automatic Procedures for Extracting and Classifying Subcategorisation Properties of Compounds. Ekaterina Lapshinova-Koltunski and Ulrich Heid
Holy Moses! Leveraging Existing Tools and Resources for Entity Translation Jean Tavernier, Rosa Cowan and Michelle Vanni
How to Compare Treebanks Sandra Kübler, Wolfgang Maier, Ines Rehbein and Yannick Versley
How to Evaluate and Raise the Quality in a Collaborative Lexicographic Approach Dan Cristea, Corina Forăscu, Marius Răschip and Michael Zock
Hungarian Word-Sense Disambiguated Corpus Veronika Vincze, György Szarvas, Attila Almási, Dóra Szauter, Róbert Ormándi, Richárd Farkas, Csaba Hatvani and János Csirik
Hydra: a Modal Logic Tool for Wordnet Development, Validation and Exploration Borislav Rizov


I saw TREE trees in the park: How to Correct Real-Word Spelling Mistakes Davide Fossati and Barbara Di Eugenio
Identification of Comparable Argument-Head Relations in Parallel Corpora Kathrin Spreyer, Jonas Kuhn and Bettina Schrader
Identification of Naturally Occurring Numerical Expressions in Arabic Nizar Habash and Ryan Roth
Identifying Foreign Person Names in Chinese Text Stephan Busemann and Yajing Zhang
Identifying Strategic Information from Scientific Articles through Sentence Classification Fidelia Ibekwe-SanJuan, Chaomei Chen and Roberto Pinho
Improving Contextual Quality Models for MT Evaluation Based on Evaluators’ Feedback Paula Estrella, Andrei Popescu-Belis and Maghi King
Improving NER in Arabic Using a Morphological Tagger Benjamin Farber, Dayne Freitag, Nizar Habash and Owen Rambow
Improving Statistical Machine Translation Efficiency by Triangulation Yu Chen, Andreas Eisele and Martin Kay
In Contrast - A Complex Discourse Connective Erhard Hinrichs and Monica Lau
In-car Speech Data Collection along with Various Multimodal Signals Akira Ozaki, Sunao Hara, Takashi Kusakawa, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Katunobu Itou and Kazuya Takeda
Induction of Treebank-Aligned Lexical Resources Tejaswini Deoskar and Mats Rooth
Influence of Text Type and Text Length on Anaphoric Annotation Daniela Goecke, Maik Stührenberg and Andreas Witt
Information Extraction Tools and Methods for Understanding Dialogue in a Companion Roberta Catizone, Alexiei Dingli, Hugo Pinto and Yorick Wilks
Integrating Audio and Visual Information for Modelling Communicative Behaviours Perceived as Different Michelina Savino, Laura Scivetti and Mario Refice
Integrating Lexical Units, Synsets and Ontology in the Cornetto Database Piek Vossen, Isa Maks, Roxane Segers and Hennie VanderVliet
Integrating Metaphor Information into RDF/OWL EuroWordNet Ernesto William De Luca and Birte Lönneker-Rodman
Integration of a Multilingual Keyword Extractor in a Document Management System Andrea Agili, Marco Fabbri, Alessandro Panunzi and Manuel Zini
Inter-sentential Coreferences in Semantic Networks: An Evaluation of Manual Annotation Václav Novák and Keith Hall
Introducing DRS (The Digital Replay System): a Tool for the Future of Corpus Linguistic Research and Analysis Dawn Knight and Paul Tennent
Investigating the Structure of Procedural Texts for Answering How-to Questions Estelle Delpech and Patrick Saint-Dizier
IrcamCorpusTools: an Extensible Platform for Spoken Corpora Exploitation Christophe Veaux, Gregory Beller and Xavier Rodet
Is this NE tagger getting old? Cristina Mota and Ralph Grishman
ISOcat: Corralling Data Categories in the Wild Marc Kemps-Snijders, Menzo Windhouwer, Peter Wittenburg and Sue Ellen Wright


JMWNL: an Extensible Multilingual Library for Accessing Wordnets in Different Languages Maria Teresa Pazienza, Armando Stellato and Alexandra Tudorache
JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts Grazyna Demenko, Stefan Grocholewski, Katarzyna Klessa, Jerzy Ogórkiewicz, Agnieszka Wagner, Marek Lange, Daniel Śledziński and Natalia Cylwik


Keywords, k-NN and Neural Networks: a Support for Hierarchical Categorization of Texts in Brazilian Portuguese Susana Azeredo, Silvia Moraes and Vera Lima
KnoFusius: a New Knowledge Fusion System for Interpretation of Gene Expression Data Pavel Smrz
Knowledge Sources for Bridging Resolution in Multi-Party Dialog Mark-Christoph Mueller, Margot Mieskes and Michael Strube
Knowledge-based Coreference Resolution for Hungarian Márton Miháltz
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures Piek Vossen, Eneko Agirre, Nicoletta Calzolari, Christiane Fellbaum, Shu-kai Hsieh, Chu-Ren Huang, Hitoshi Isahara, Kyoko Kanzaki, Andrea Marchetti, Monica Monachini, Federico Neri, Remo Raffaelli, German Rigau, Maurizio Tescon and Joop VanGent


Language Resources and Chemical Informatics C.J. Rupp, Ann Copestake, Peter Corbett, Peter Murray-Rust, Advaith Siddharthan, Simone Teufel and Benjamin Waldron
Language Resources and Tools for Swedish: A Survey Kjell Elenius, Eva Forsbom and Beáta Megyesi
Language Resources for Semantic Document Annotation and Crosslingual Retrieval Petya Osenova, Kiril Simov and Eelco Mossel
Language Resources for Studying Argument Chris Reed, Raquel Mochales Palau, Glenn Rowe and Marie-Francine Moens
Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems Dieter van Uytvanck, Alex Dukers, Jacquelijn Ringersma and Paul Trilsbeek
Latest Developments in ELRA’s Services Valérie Mapelli, Victoria Arranz, Hélène Mazo and Khalid Choukri
LC-STAR II: Starring more Lexica Ute Ziegenhain, Hanne Fersoe, Henk van den Heuvel and Asuncion Moreno
Learning Morphology with Morfette Grzegorz Chrupala, Georgiana Dinu and Josef van Genabith
Learning Patterns for Building Resources about Semantic Relations in the Medical Domain Mehdi Embarek and Olivier Ferret
Learning properties of Noun Phrases: from data to functions Valeria Quochi and Basilio Calderone
Learning the Species of Biomedical Named Entities from Annotated Corpora Xinglong Wang and Claire Grover
Learning-based Detection of Scientific Terms in Patient Information Veronique Hoste, Els Lefever, Klaar Vanopstal and Isabelle Delaere
Let’s not Argue about Semantics Johan Bos
Lexical Ontology Extraction using Terminology Analysis: Automating Video Annotation Neil Newbold, Bogdan Vrusias and Lee Gillam
Lexical Resources for Automatic Translation of Constructed Neologisms: the Case Study of Relational Adjectives Bruno Cartoni
Lexical Resources for Semantics Extraction Rajat Mohanty and Pushpak Bhattacharyya
Lexical Substitution as a Framework for Multiword Evaluation Diana McCarthy
Lexicon Schemas and Related Data Models: when Standards Meet Users Thorsten Trippel, Michael Maxwell, Greville Corbett, Cambell Prince, Christopher Manning, Stephen Grimes and Steve Moran
Lexicon, Grammar, and Multilinguality in the Japanese FrameNet Kyoko Ohara
LexSchem: a Large Subcategorization Lexicon for French Verbs Cédric Messiant, Thierry Poibeau and Anna Korhonen
LILA: Cellular Telephone Speech Databases from Asia Eric Sanders, Asuncion Moreno, Herbert Tropf, Lynette Melnar, Nurit Dekel, Breanna Gillies and Niklas Paulsson
Linguistic Description and Automatic Extraction of Definitions from German Court Decisions Stephan Walter
Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction Stephanie Strassel, Mark Przybocki, Kay Peterson, Zhiyi Song and Kazuaki Maeda
Linguistic Resources for Reconstructing Spontaneous Speech Text Erin Fitzgerald and Frederick Jelinek
Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson and Katharina Probst
Linguistically Light Lexical Extensions for Ontologies Brian Davis, Siegfried Handschuh, Alexander Troussov, John Judge and Mikhail Sogrin
LIRICS Semantic Role Annotation: Design and Evaluation of a Set of Data Categories Volha Petukhova and Harry Bunt
L-ISA: Learning Domain Specific Isa-Relations from the Web Alessandra Potrich and Emanuele Pianta
LMM: an OWL-DL MetaModel to Represent Heterogeneous Lexical Knowledge Davide Picca, Alfio Massimiliano Gliozzo and Aldo Gangemi
Local Methods for On-Demand Out-of-Vocabulary Word Retrieval Stanislas Oger, Georges Linares and Frederic Bechet
Low-Complexity Heuristics for Deriving Fine-Grained Classes of Named Entities from Web Textual Data Marius Pasca
Low-Density Language Bootstrapping: the Case of Tajiki Persian Karine Megerdoomian and Dan Parvaz
LX-Service: Web Services of Language Technology for Portuguese António Branco, Francisco Costa, Pedro Martins, Filipe Nunes, João Silva and Sara Silveira


Making Text Resources Accessible to the Reader: the Case of Patent Claims Simon Mille and Leo Wanner
MaltEval: an Evaluation and Visualization Tool for Dependency Parsing Jens Nilsson and Joakim Nivre
Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, Haejoong Lee and Shawn Medero
Manual vs Assisted Transcription of Prepared and Spontaneous Speech Thierry Bazillon, Yannick Estève and Daniel Luzzati
Mapping Events and Abstract Entities from PAROLE-SIMPLE-CLIPS to ItalWordNet Adriana Roventini and Nilda Ruimy
Mapping Roget’s Thesaurus and WordNet to French Gerard de Melo and Gerhard Weikum
MASC: the Manually Annotated Sub-Corpus of American English Nancy Ide, Collin Baker, Christiane Fellbaum, Charles Fillmore and Rebecca Passonneau
Measures for Term and Sentence Relevances: an Evaluation for German Heike Bieler and Stefanie Dipper
MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic Bente Maegaard, Mohammed Atiyya, Khalid Choukri, Steven Krauwer, Chafic Mokbel and Mustafa Yaseen
memasysco: XML schema based metadata management system for speech corpora Joachim Gasch, Caren Brinckmann and Sylvia Dickgießer
Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet Bolette Sandford Pedersen, Anna Braasch, Lina Henriksen, Sussi Olsen and Claus Povlsen
MeSH©: from a Controlled Vocabulary to a Processable Resource Dimitrios Kokkinakis
Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis Luís Oliveira, Sérgio Paulo, Luís Figueira, Carlos Mendes, Ana Nunes and Joaquim Godinho
Methodology for Evaluating the Usability of User Interfaces in Mobile Services Pedro Concejero, Daniel Tapias, Juan José Rodríguez, Juan Carlos Luengo and Sebastián Sánchez
Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles Ralf Krestel, Sabine Bergler and René Witte
MISTRAL: a Statistical Machine Translation Decoder for Speech Recognition Lattices Alexandre Patry and Philippe Langlais
Modeling Document Dynamics: an Evolutionary Approach Jahna Otterbacher and Dragomir Radev
Modelling Word Similarity: an Evaluation of Automatic Synonymy Extraction Algorithms. Kris Heylen, Yves Peirsman, Dirk Geeraerts and Dirk Speelman
More Semantic Links in the SIMPLE-CLIPS Database Nilda Ruimy and Antonio Toral
Morphosyntactic Resources for Automatic Speech Recognition Stéphane Huet, Guillaume Gravier and Pascale Sébillot
MTriage: Web-enabled Software for the Creation, Machine Translation, and Annotation of Smart Documents Reginald Hobbs, Jamal Laoudi and Clare Voss
Multimodal Spontaneous Expressive Speech Corpus for Hungarian Márk Fék, Nicolas Audibert, János Szabó, Albert Rilliard, Géza Németh and Véronique Aubergé
Mutual Bilingual Terminology Extraction Le An Ha, Gabriela Fernandez, Ruslan Mitkov and Gloria Corpas


Named Entity Recognition for Digitised Historical Texts Claire Grover, Sharon Givon, Richard Tobin and Julian Ball
Named Entity Relation Mining using Wikipedia Adrian Iftene and Alexandra Balahur-Dobrescu
Named Entity WordNet Antonio Toral, Rafael Muñoz and Monica Monachini
Navigating through Dense Annotation Spaces Branimir Boguraev and Mary Neff
New Functions of FrameSQL for Multilingual FrameNets Hiroaki Sato
New Resources for Document Classification, Analysis and Translation Technologies Stephanie Strassel, Lauren Friedman, Safa Ismael and Linda Brandschain
New Telephone Speech Databases for French: a Children Database and an optimized Adult Corpus Djamel Mostefa and Arnaud Vallee
NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls Udhyakumar Nallasamy, Alan Black, Tanja Schultz and Robert Frederking


Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA’s TRANSTAC Program Gregory Sanders, Sebastien Bronsart, Sherri Condon and Craig Schlenoff
ODL: an Object Description Language for Lexical Information Michael Rosner
On Classifying Coherent/Incoherent Romanian Short Texts Anca Dinu
On the Durational Reduction of Repeated Mentions: Recency and Speaker Effects Viktor Tron
On the Role of the NIMITEK Corpus in Developing an Emotion Adaptive Spoken Dialogue System Milan Gnjatovic and Dietmar Roesner
On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems Gwénolé Lecorvé, Guillaume Gravier and Pascale Sébillot
Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy Yoshihiko Hayashi, Chiharu Narawa, Monica Monachini, Claudia Soria and Nicoletta Calzolari
Ontology Learning and Semantic Annotation: a Necessary Symbiosis Emiliano Giovannetti, Simone Marchi, Simonetta Montemagni and Roberto Bartolini
Ontology Search with the OntoSelect Ontology Library Paul Buitelaar and Thomas Eigner
Ontology-Based Interface Specifications for a NLP Pipeline Architecture Ekaterina Buyko, Christian Chiarcos and Antonio Pareja-Lora
Ontology-Based XQuery’ing of XML-Encoded Language Resources on Multiple Annotation Layers Georg Rehm, Richard Eckart, Christian Chiarcos and Johannes Dellert
OpenCCG Workbench and Visualization Tool Thepchai Supnithi, Suchinder Singh, Taneth Ruangrajitpakorn, Prachya Boonkwan and Monthika Boriboon
Opinion Annotation in On-line Chinese Product Reviews Ruifeng Xu, Yunqing Xia, Kam-Fai Wong and Wenjie Li


Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report Péter Halácsy, András Kornai, Péter Németh and Dániel Varga
Parallel Multi-Theory Annotations of Syntactic Structure Jerid Francom and Mans Hulden
Parameters for Topic Boundary Detection in Multi-Party Dialogues Margot Mieskes and Michael Strube
ParsCit: an Open-source CRF Reference String Parsing Package Isaac Councill, C Lee Giles and Min-Yen Kan
Parser Evaluation and the BNC: Evaluating 4 constituency parsers with 3 metrics Jennifer Foster and Josef van Genabith
PASSAGE: from French Parser Evaluation to Large Sized Treebank Eric Villemonte de la Clergerie, Olivier Hamon, Djamel Mostefa, Christelle Ayache, Patrick Paroubek and Anne Vilnat
Performance Evaluation of Speech Translation Systems Brian Weiss, Craig Schlenoff, Greg Sanders, Michelle Steves, Sherri Condon, Jon Phillips and Dan Parvaz
Personae: a Corpus for Author and Personality Prediction from Text Kim Luyckx and Walter Daelemans
Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database. Petr Pollák, Jan Volín and Radek Skarnitzl
Phrase-Based Machine Translation based on Simulated Annealing Caroline Lavecchia, David Langlois and Kamel Smaïli
Ping-pong Document Clustering using NMF and Linkage-Based Refinement Hiroyuki Shinnou and Minoru Sasaki
Portuguese-English Word Alignment: some Experiments Diana Santos and Alberto Simões
POS Tagging for German: how important is the Right Context? Steliana Ivanova and Sandra Kuebler
Post-MT Term Swapper: Supplementing a Statistical Machine Translation System with a User Dictionary Masaki Itagaki and Takako Aikawa
Pragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English) Doaa Samy and Ana González-Ledesma
Presentation of the New ISO-Standard for the Representation of Entries in Dictionaries: ISO 1951 Marie-Jeanne Derouin and André Le Meur
Process Model for Composing High-quality Text Corpora Mikko Lounela
Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop Atsushi Fujii, Masao Utiyama, Mikio Yamamoto and Takehito Utsuro
Producing an Encyclopedic Dictionary using Patent Documents Atsushi Fujii
Production in a Multimodal Corpus: how Speakers Communicate Complex Actions Carlos Gómez Gallo, T. Florian Jaeger, James Allen and Mary Swift
Professor or Screaming Beast? Detecting Anomalous Words in Chinese Wei Liu, Ben Allison and Louise Guthrie
Projecting Propbank Roles onto the CCGbank Stephen Boxwell and Michael White
Prolexbase: a Multilingual Relational Lexical Database of Proper Names Denis Maurel
ProPOSEL: A Prosody and POS English Lexicon for Language Engineering Claire Brierley and Eric Atwell


Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System Chu-Ren Huang, Lung-Hao Lee, Jia-Fei Hog, Wei-guang Qu and Shiwen Yu
Question Answering on Speech Transcriptions: the QAST evaluation in CLEF Lori Lamel, Sophie Rosset, Christelle Ayache, Djamel Mostefa, Jordi Turmo and Pere Comas
Quick Rich Transcriptions of Arabic Broadcast News Speech Data Chomicha Bendahman, Meghan Glenn, Djamel Mostefa, Niklas Paulsson and Stephanie Strassel


RACAI’s Linguistic Web Services Dan Tufiş, Radu Ion, Alexandru Ceauşu and Dan Ştefănescu
Rapid Deployment of a New METIS Language Pair: Catalan-English Toni Badia, Maite Melero and Oriol Valentín
Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus. Catia Cucchiarini, Joris Driesen, Hugo Van hamme and Eric Sanders
Relation between Agreement Measures on Human Labeling and Machine Learning Performance: Results from an Art History Domain Rebecca Passonneau, Tom Lippincott, Tae Yano and Judith Klavans
Relationships between Nursing Converstaions and Activities Hiromi Itoh Ozaku, Akinori Abe, Kaoru Sagara and Kiyoshi Kogure
Representation of Atypical Entities in Ontologies Christophe Jouis and Julien Bourdaillet
Resources for Persuasion Marco Guerini, Carlo Strapparava and Oliviero Stock
Reusable Tagset Conversion Using Tagset Drivers Daniel Zeman
Revealing Relations between Open and Closed Answers in Questionnaires through Text Clustering Evaluation Magnus Rosell and Sumithra Velupillai
Robust Parsing with a Large HPSG Grammar Yi Zhang and Valia Kordoni
Romanian Lexical Data Bases: Inflected and Syllabic Forms Dictionaries Ana-Maria Barbu
Romanian Semantic Role Resource Diana Trandabat and Maria Husarciuc
Rule-Based Chunker for Croatian Kristina Vuckovic, Marko Tadic and Zdravko Dovedan
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus Ingunn Amdal, Ole Morten Strand, Jørn Almberg and Torbjørn Svendsen


Saxon: an Extensible Multimedia Annotator Mark Greenwood, José Iria and Fabio Ciravegna
Scaling Answer Type Detection to Large Hierarchies Kirk Roberts and Andrew Hickl
SCARE: a Situated Corpus with Annotated Referring Expressions Laura Stoia, Darla Magdalena Shockley, Donna K. Byron and Eric Fosler-Lussier
SECTra_w.1: an Online Collaborative System for Evaluating, Post-editing and Presenting MT Translation Corpora Cong-phap Huynh, Christian Boitet and Hervé Blanchon
Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data Qing Ma, Koichi Nakao, Masaki Murata and Hitoshi Isahara
Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives Olga N. Lashevskaja and Olga Yu. Shemanaeva
Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab Udo Hahn, Elena Beisswanger, Ekaterina Buyko, Michael Poprat, Katrin Tomanek and Joachim Wermter
Semantic Frame Annotation on the French MEDIA corpus Marie-Jean Meurs, Frédéric Duvert, Frédéric Bechet, Fabrice Lefevre and Renato De Mori
Semantic Press Eugenio Picchi, Eva Sassolini, Sebastiana Cucurullo, Francesca Bertagna and Paola Baroni
Semantic Role Labeling Tools Trained on the Cast3LB-CoNNL-SemRol Corpus Roser Morante
Semantic Vectors: a Scalable Open Source Package and Online Technology Management Application Dominic Widdows and Kathleen Ferraro
Semantically Annotated Snapshot of the English Wikipedia Jordi Atserias, Hugo Zaragoza, Massimiliano Ciaramita and Giuseppe Attardi
Semi-automatic Building Method for a Multidimensional Affect Dictionary for a New Language Guillaume Pitel and Gregory Grefenstette
Semiotic-based Ontology Evaluation Tool (S-OntoEval) Renata Dividino, Massimo Romanelli and Daniel Sonntag
Sensitivity of Automated MT Evaluation Metrics on Higher Quality MT Output: BLEU vs Task-Based Evaluation Methods Bogdan Babych and Anthony Hartley
Sentence Alignment in DPC: Maximizing Precision, Minimizing Human Effort Julia Trushkina, Lieve Macken and Hans Paulussen
Sentiment Analysis and the Use of Extrinsic Datasets in Evaluation Ann Devitt and Khurshid Ahmad
Sentiment Analysis Based on Probabilistic Models Using Inter-Sentence Information Kugatsu Sadamitsu, Satoshi Sekine and Mikio Yamamoto
Sign Language Corpus Annotation: toward a new Methodology Emilie Chételat-Pelé and Annelies Braffort
Similar Term Discovery using Web Search Peter Anick, Vijay Murthi and Shaji Sebastian
Simple-Clips ongoing research: more information with less data by implementing inheritance Riccardo Del Gratta, Nilda Ruimy and Antonio Toral
Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema Simon Krek, Vojko Gorjanc and Špela Arhar
Smarty - Extendable Framework for Bilingual and Multilingual Comprehension Assistants Todor Arnaudov and Ruslan Mitkov
Some Fine Points of Hybrid Natural Language Parsing Peter Adolphs, Stephan Oepen, Ulrich Callmeier, Berthold Crysmann, Dan Flickinger and Bernd Kiefer
SpatialML: Annotation Scheme, Corpora, and Tools Inderjeet Mani, Janet Hitzeman, Justin Richer, Dave Harris, Rob Quimby and Ben Wellner
Spatiotemporal Annotation Using MiniSTEx: how to deal with Alternative, Foreign, Vague and/or Obsolete Names? Ineke Schuurman
Spatiotemporal Coding in ANVIL Michael Kipp
Speaker Recognition: Building the Mixer 4 and 5 Corpora Linda Brandschain, Christopher Cieri, David Graff, Abby Neely and Kevin Walker
Spectral Clustering for a Large Data Set by Reducing the Similarity Matrix Size Hiroyuki Shinnou and Minoru Sasaki
Speech Errors on Frequently Observed Homophones in French: Perceptual Evaluation vs Automatic Classification Rena Nemoto, Ioana Vasilescu and Martine Adda-Decker
Spelling Correction: from Two-Level Morphology to Open Source Iñaki Alegria, Klara Ceberio, Nerea Ezeiza, Aitor Soroa and Gregorio Hernandez
Spock - a Spoken Corpus Client Maarten Janssen and Tiago Freitas
Standardising Bilingual Lexical Resources According to the Lexicon Markup Framework Isa Maks, Carole Tiberius and Remco van Veenendaal
Statistical Evaluation of Information Distillation Systems J.V. White, D. Hunter and J.D. Goldstein
Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data Kirk Baker and Chris Brew
STC-TIMIT: Generation of a Single-channel Telephone Corpus Nicolas Morales, Javier Tejedor, Javier Garrido, Jose Colas and Doroteo T. Toledano
Strengthening the Estonian Language Technology Einar Meister and Jaak Vilo
Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations Jáchym Kolář and Jan Švec
Subdomain Sensitive Statistical Parsing using Raw Corpora Barbara Plank and Khalil Sima’an
Subjective Evaluation of an Emotional Speech Database for Basque Iñaki Sainz, Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Jon Sanchez, Iker Luengo and Igor Odriozola
Supersense Tagger for Italian Davide Picca, Alfio Massimiliano Gliozzo and Massimiliano Ciaramita
Swedish-Turkish Parallel Treebank Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson and Joakim Nivre
Synchronizing Translated Movie Subtitles Jörg Tiedemann
System Evaluation on a Named Entity Corpus from Clinical Notes Karin Schuler, Vinod Kaggal, James Masanz, Philip Ogren and Guergana Savova


Tagging a Hebrew Corpus: the Case of Participles Meni Adler, Yael Netzer, Yoav Goldberg, David Gabay and Michael Elhadad
Tagging Spanish Texts: the Problem of Problem of “SE” Guadalupe Aguado de Cea, Javier Puche and José Ángel Ramos
Talking and Looking: the SmartWeb Multimodal Interaction Corpus Florian Schiel and Hannes Mögele
Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze Sebastian Gottwald, Matthias Richter, Gerhard Heyer and Gerik Scheuermann
Targeting Chinese Nominal Compounds in Corpora Weiruo Qu, Christoph Ringlstetter and Randy Goebel
Task-Based Evaluation of Meeting Browsers: from Task Elicitation to User Behavior Analysis Andrei Popescu-Belis, Mike Flynn, Pierre Wellner and Philippe Baudrion
Temporal Aspects of Terminology for Automatic Term Recognition: Case Study on Women’s Studies Terms Junko Kubo, Keita Tsuji and Shigeo Sugimoto
Test Collections for Spoken Document Retrieval from Lecture Audio Data Tomoyosi Akiba, Kiyoaki Aikawa, Yoshiaki Itoh, Tatsuya Kawahara, Hiroaki Nanjo, Hiromitsu Nishizaki, Norihito Yasuda, Yoichi Yamashita and Katunobu Itou
Text Independent Speaker Identification in Multilingual Environments Iker Luengo, Eva Navas, Iñaki Sainz, Ibon Saratxaga, Jon Sanchez, Igor Odriozola and Inma Hernaez
Thai Broadcast News Corpus Construction and Evaluation Markpong Jongtaveesataporn, Chai Wutiwiwatchai, Koji Iwano and Sadaoki Furui
The 2008 Oriental COCOSDA Book Project: in Commemoration of the First Decade of Sustained Activities in Asia Shuichi Itahashi and Chiu-yu Tseng
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics Steven Bird, Robert Dale, Bonnie Dorr, Bryan Gibson, Mark Joseph, Min-Yen Kan, Dongwon Lee, Brett Powley, Dragomir Radev and Yee Fan Tan
The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin David Bamman, Marco Passarotti, Roberto Busa and Gregory Crane
The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech Konrad Hofbauer, Stefan Petrik and Horst Hering
The ATIS Sign Language Corpus Jan Bungeroth, Daniel Stein, Philippe Dreuw, Hermann Ney, Sara Morrissey, Andy Way and Lynette van Zijl
The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database Bento Carlos Dias-da-Silva, Ariani Di Felippo and Maria das Graças Volpe Nunes
The AUTONOMATA Spoken Names Corpus Henk van den Heuvel, Jean-Pierre Martens, Bart D’hoore, Kristof D’hanens, Nanneke Konings
The BNC Parsed with RASP4UIMA Øistein E. Andersen, Julien Nioche, Ted Briscoe and John Carroll
The Construction and Evaluation of Word Space Models Yves Peirsman, Simon De Deyne, Kris Heylen and Dirk Geeraerts
The Dutch-Flemish Comprehensive Approach to HLT Stimulation and Innovation: STEVIN, HLT Agency and beyond Peter Spyns, Elisabeth D’Halleweyn and Catia Cucchiarini
The Encoding of lexical implications in VerbNet Predicates of change of locations Annie Zaenen, Daniel Bobrow and Cleo Condoravdi
The European Thesaurus on International Relations and Area Studies - a Multilingual Resource for Indexing, Retrieval, and Translation Michael Kluck and Axel Huckstorf
The Extended Architecture of Hantology for Japan Kanji Ya-Min Chou, Chu-Ren Huang and Jia-Fei Hong
The IFADV Corpus: a Free Dialog Video Corpus Rob van Son, Wieneke Wesseling, Eric Sanders and Henk van den Heuvel
The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign Romaric Besançon, Stéphane Chaudiron, Djamel Mostefa, Ismail Timimi and Khalid Choukri
The Italian Particle “ne”: Corpus Construction and Analysis Malvina Nissim and Sara Perboni
The Japanese FrameNet Software Tools Hiroaki Saito, Shunta Kuboya, Takaaki Sone, Hayato Tagami and Kyoko Ohara
The JOS Morphosyntactically Tagged Corpus of Slovene Tomaš Erjavec and Simon Krek
The Kalshnikov 691 Dependency Bank Tomas By
The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese Isabel Trancoso, Rui Martins, Helena Moniz, Ana Isabel Mata and M. Céu Viana
The Linguistic Data Consortium Member Survey: Purpose, Execution and Results Marian Reed, Denise DiPersio and Christopher Cieri
The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources Georg Rehm, Oliver Schonefeld, Andreas Witt, Timm Lehmberg, Christian Chiarcos, Hanan Bechara, Florian Eishold, Kilian Evang, Magdalena Leshtanska, Aleksandar Savkov and Matthias Stark
The MoveOn Motorcycle Speech Corpus Thomas Winkler, Theodoros Kostoulas, Richard Adderley, Christian Bonkowski, Todor Ganchev, Joachim Köhler and Nikos Fakotakis
The Penn Discourse TreeBank 2.0. Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi and Bonnie Webber
The PIT Corpus of German Multi-Party Dialogues Petra-Maria Strauß, Holger Hoffmann, Wolfgang Minker, Heiko Neumann, Günther Palm, Stefan Scherer, Harald Traue and Ulrich Weidenbacher
The Pronouncing Dictionary of Austrian German (AGPD) and the Austrian Phonetic Database (ADABA): Report on a large Phonetic Resources Database of the three Major Varieties of German Rudolf Muhr
The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering Elena Cabrio, Milen Kouylekov, Bernardo Magnini, Matteo Negri, Laura Hasler, Constantin Orasan, David Tomas, Jose Luis Vicedo, Guenter Neumann and Corinna Weber
The TextPro Tool Suite Emanuele Pianta, Christian Girardi and Roberto Zanoli
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998 Stephen Purpura, John Wilkerson and Dustin Hillard
The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters D. Llorens, F. Prat, A. Marzal, J. M. Vilar, M. J. Castro, J. C. Amengual, S. Barrachina, A. Castellanos, S. España, J. A. Gómez, J. Gorbe, A. Gordo, V. Palazón, G. Peris, R. Ramos-Garijo and F. Zamora
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines Cvetana Krstev, Ranka Stanković, Duško Vitas and Ivan Obradović
Tools & Resources for Visualising Conversational-Speech Interaction Nick Campbell
Tools for Collocation Extraction: Preferences for Active vs. Passive Ulrich Heid and Marion Weller
Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation Jonathan Clark, Robert Frederking and Lori Levin
Towards a Glossary of Activities in the Ontology Engineering Field Mari Carmen Suárez-Figueroa and Asunción Gómez-Pérez
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems Georg Rehm, Marina Santini, Alexander Mehler, Pavel Braslavski, Rüdiger Gleim, Andrea Stubbe, Svetlana Symonenko, Mirko Tavosanis and Vedrana Vidulin
Towards a Vector Space Model for FrameNet-like Resources Marco Pennacchiotti, Diego De Cao, Paolo Marocco and Roberto Basili
Towards Formal Interpretation of Semantic Annotation Harry Bunt and Chwhynny Overbeeke
Towards Heterogeneous Automatic MT Error Analysis Jesús Giménez and Lluis Màrquez
Towards Semi Automatic Construction of a Lexical Ontology for Persian Mehrnoush Shamsfard
Towards Spanish Verbs’ Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus Jordi Carrrera, Irene Castellón, Salvador Climent and Marta Coll-Florit
Towards the National Corpus of Polish Adam Przepiórkowski, Rafał L. Górski, Barbara Lewandowska-Tomaszyk and Marek Łaziński
Training and Evaluation of POS Taggers on the French MULTITAG Corpus Alexandre Allauzen and Hélène Bonneau-Maynard
Translation Adequacy and Preference Evaluation Tool (TAP-ET) Mark Przybocki, Kay Peterson and Sebastien Bronsart
Translation-oriented Word Sense Induction Based on Parallel Corpora Marianna Apidianaki
Tree Distance and Some Other Variants of Evalb Martin Emms
Treebank-Based Acquisition of LFG Parsing Resources for French Natalie Schluter and Josef van Genabith
Turning a Term Extractor into a new Domain: first Experiences Jorge Vivaldi, Anna Joan and Mercè Lorente


UFRA: a UIMA-based Approach to Federated Language Resource Architecture Riccardo Del Gratta, Roberto Bartolini, Tommaso Caselli, Monica Monachini, Claudia Soria and Nicoletta Calzolari
Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems Kate Forbes-Riley, Diane Litman, Scott Silliman and Amruta Purandare
UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging Christian Hänig, Stefan Bordag and Uwe Quasthoff
Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora Alessandro Lenci, Barbara McGillivray, Simonetta Montemagni and Vito Pirrelli
Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence David Manzano-Macho, Asunción Gómez-Pérez and Daniel Borrajo
Unsupervised Learning-based Anomalous Arabic Text Detection Nasser Abouzakhar, Ben Allison and Louise Guthrie
Unsupervised Lexical Acquisition for Part of Speech Tagging Dan Tufis, Elena Irimia, Radu Ion and Alexandru Ceausu
Unsupervised Parts-of-Speech Induction for Bengali Joydeep Nath, Monojit Choudhury, Animesh Mukherjee, Christian Biemann and Niloy Ganguly
Unsupervised Relation Extraction From Web Documents Kathrin Eichler, Holmer Hemsen and Günter Neumann
Unsupervised Resource Creation for Textual Inference Applications Jeremy Bensley and Andrew Hickl
User-Centred Design of Error Correction Tools Martí Quixal, Toni Badia, Francesc Benavent, Jose R. Boullosa, Judith Domingo, Bernat Grau, Guillem Massó and Oriol Valentín
Using a Probabilistic Model of Context to Detect Word Obfuscation Sanaz Jabbari, Ben Allison and Louise Guthrie
Using Lexical Acquisition to Enrich a Predicate Argument Reusable Database Paula Cristina Vaz, David Martins de Matos and Nuno J. Mamede
Using Log-linear Models for Tuning Machine Translation Output Michael Carl
Using Movie Subtitles for Creating a Large-Scale Bilingual Corpora Einav Itamar and Alon Itai
Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars Germán Sanchis and Joan Andreu Sánchez
Using Random Indexing to improve Singular Value Decomposition for Latent Semantic Analysis Linus Sellberg and Arne Jönsson
Using Reordering in Statistical Machine Translation based on Alignment Block Classification Marta R. Costa-jussà, José A. R. Fonollosa and Enric Monte
Using Semantically Annotated Corpora to Build Collocation Resources Margarita Alonso Ramos, Owen Rambow and Leo Wanner
Using Similarity Measures to Extend the LinGO Lexicon Lynne Cahill
Using Similarity Metrics For Terminology Recognition Jonathan Butters and Fabio Ciravegna
Using the Complexity of the Distribution of Lexical Elements as a Feature in Authorship Attribution Leanne Spracklin, Diana Inkpen and Amiya Nayak
Using the Multilingual Central Repository for Graph-Based Word Sense Disambiguation Eneko Agirre and Aitor Soroa
Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors Matthieu Hermet, Alain Désilets and Stan Szpakowicz


Valentino: A Tool for Valence Shifting of Natural Language Texts Marco Guerini, Carlo Strapparava and Oliviero Stock
Validating the Quality of Full Morphological Annotation Drahomíra johanka Spoustová, Pavel Pecina, Jan Hajič and Miroslav Spousta
Verb-Noun Collocation SyntLex Dictionary: Corpus-Based Approach Grazyna Vetulani, Zygmunt Vetulani and Tomasz Obrębski
Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments Wei-Hao Lin and Alexander Hauptmann


WEB-Based Listening Test System for Speech Synthesis and Speech Conversion Evaluation Laurent Blin, Olivier Boeffard and Vincent Barreaud
What is poorly Said is a Little Funny Jonas Sjöbergh and Kenji Araki
What would you Ask a conversational Agent? Observations of Human-Agent Dialogues in a Museum Setting Susan Robinson, David Traum, Midhun Ittycheriah and Joe Henderer
What’s in a Colour? Studying and Contrasting Colours with COMPARA Diana Santos, Maria do Rosário Silva and Susana Inácio
WNTERM: Enriching the MCR with a Terminological Dictionary Eli Pociello, Antton Gurrutxaga, Eneko Agirre, Izaskun Aldezabal and German Rigau
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing Ma and Hitoshi Isahara
Word Segmentation of Vietnamese Texts: a Comparison of Approaches Quang Thắng Đinh, Hồng Phương Lê, Thi Minh Huyền Nguyễn, Cẩm Tú Nguyễn, Mathias Rossignol and Xuẩn Lương Vũ
Word-Based or Morpheme-Based? Annotation Strategies for Modern Hebrew Clitics Reut Tsarfaty and Yoav Goldberg
Word-level Dependency-structure Annotation to Corpus of Spontaneous Japanese and its Application Kiyotaka Uchimoto and Yasuharu Den
Words in Contexts: Digital Editions of Literary Journals in the “AAC - Austrian Academy Corpus” Hanno Biber, Evelyn Breiteneder and Karlheinz Mörth
Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance Mukda Suktarachan, Dussadee Thamvijit, Daoyos Noikongka, Panita Yongyuth, Puwarat Pavaputanont Na Mahasarakham, Asanee Kawtrakul, Asanee Kawtrakul and Margherita Sini
WOZ Acoustic Data Collection for Interactive TV Alessio Brutti, Luca Cristoforetti, Walter Kellermann, Lutz Marquardt and Maurizio Omologo


Yet another Platform for Extracting Knowledge from Corpora Francesca Fallucchi and Fabio Massimo Zanzotto

Powered by ELDA © 2008 ELDA/ELRA