A  
Afonso Susana Floresta Sintá(c)tica: A treebank for Portuguese.
Agua Ana Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
Aguilar Río José Miguel Compiling an Interactive Literary Translation Web Site for Education Purposes
Ahafhaf Mohamed Toward an objective and generic Method for Spoken Language Understanding Systems Evaluation: an extension of the DCR method.
Ahrenberg Lars  A System for Incremental and Interactive Word Linking.
Aibar Pablo A Labelling Proposal to Annotate Dialogues.
Alderuccio  Daniela An ontology-based approach in the literary research: two case-studies
Alfonseca  Enrique   Improving an Ontology Refinement Method with Hyponymy Patterns.
Proposal for Evaluating Ontology Refinement Methods.
Almqvist Ingrid   Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
Alonge Antonietta Which way should we go? Metaphoric expressions in lexical resources.
Alsina Alex CATCG: a general purpose parsing tool applied.
Amdal  Ingunn   Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles.
Ananiadou Sophia  Tuning Context Features with Genetic Algorithms.
Automatic Acronym Acquisition and Term Variation Management within Domain-Specific Texts.
Andersson Michael  A System for Incremental and Interactive Word Linking.
Ando Maya   Extraction of Associative Attributes from Nouns and Quantitative Expression of Prototype Concept.
Androutsopoulos  Ion  Ellogon: A New Text Engineering Platform.
Antoine Jean-Yves  Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.
Towards a large corpus of spoken dialogue in French that will be freely available: the "Parole Publique" project and its first realisations.
Armstrong Susan Electronic Dictionaries - from Publisher Data to a Distribution Server: the DicoPro, DicoEast and RERO Projects.
Artola X. A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation
Atkins Sue  From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Austin Birgit ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.
B  
Baayen Harald Experiences from the Spoken Dutch Corpus Project.
Bacelar do Nascimento F.   The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Badia Toni CATCG: a general purpose parsing tool applied.
Bagga Amit A Text-based Method for Detection and Filtering of Commercial Segments in Broadcast News.
Baglioni M. Knowledge Mining and Discovery for Searching in Literary Texts.
Baker Collin F. Seeing Arguments through Transparent Structures.
The FrameNet Database and Software Tools.
Baker Paul A Unicode-based Environment for Creation and Use of Language Resources.
EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
Bakx Ilse Usability Evaluation of a Dutch Multimodal System for Train Timetable Information.
Balabanova Elisaveta Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Baldridge Jason Leo: an Architecture for Sharing Resources for Unification-Based Grammars.
Baldwin Timothy   Enhanced Japanese Electronic Dictionary Look-up.
Multiword expressions: linguistic precision and reusability.
Balkan  Lorna    ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.
Ballester Almudena    Combining statistics on n-grams for automatic term recognition.
Barbu  Ana-Maria  Lexical token alignment: experiments, results and applications.
Barbu Catalina  Error analysis in anaphora resolution.
A corpus based investigation of morphological disagreement in anaphoric relations.
AR-Engine - a framework for unrestricted co-reference resolution.
Bartolini Roberto  Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project
The Lexicon-Grammar Balance in Robust Parsing of Italian.
Bay  Jörg Evaluation Corpora for Sense Disambiguation in the Medical Domain.
Bel Nuria Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
From DTD to relational dB. An automatic generation of a lexicographical station out off ISLE guidelines.
Belskis Alexander Multilingual Terminology Extraction and Validation.
Belz  Anja  PILLS:  Multilingual generation of medical information documents with overlapping content.
Benabbou A.   Study and quantification of the declination for the Arabic speech synthesis system PARADIS.
Bennison Peter  Translation Tracking System: A tool for managing translation archives.
Besançon Romaric Evaluation of a Vector Space Similarity Measure in a Multilingual Framework.
Bensimon Nelly  Multimodal and Adaptative Pedagogical Resources.
Bentivogli Luisa Opportunistic Semantic Tagging.
Beringer  Nicole  End-to-End Evaluation of Multimodal Dialogue Systems – can we Transfer Established Methods?
Bernard  Pascale  Computerized linguistic resources of the research laboratory ATILF for lexical and textual analysis: Frantext, TLFi, and the software Stella.
Bertagna Francesca From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Bia Alejandro   Building ancient Spanish dictionaries for spell-checking of DL texts
Biatov Konstantin   Methods and Tools for Speech Data Acquisition exploiting a Database of German Parliamentary Speeches and Transcripts from the Internet.
Bick Eckhard    Floresta Sintá(c)tica: A treebank for Portuguese.
Bilacy Slaven  Enhanced Japanese Electronic Dictionary Look-up.
Binnenpoorte Diana  Word Segmentation in the Spoken Dutch Corpus.
A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
Bird Steven TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.
An integrated framework for treebanks and multilayer annotations
Creating Annotation Tools with the Annotation Graph Toolkit.
Models and Tools for Collaborative Annotation
Bizouard Sophie  Evaluating resource acquisition tools for Information Extraction.
Blache Philippe An evaluation of different symbolic shallow parsing techniques.
Black Alan W  Field Testing the Tongues Speech-to-Speech Machine Translation System.
Evaluation and collection of proper name pronunciations online.
Black Ezra Beyond Tag Trigrams: New Local Features for Tagging.
Blum Claude   The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Boas Hans C.   Bilingual FrameNet Dictionaries for Machine Translation.
Boëffard  Olivier The Greedy Algorithm and its Application to the Construction of a Continuous Speech Database.
Boguslavsky Igor Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
Boitet Christian UNL Lexical Selection with Conceptual Vectors.
Boleda Gemma  CATCG: a general purpose parsing tool applied.
Bonafonte Antonio  Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.
Bond Francis Multiword expressions: linguistic precision and reusability.
Towards a Thesaurus of Predicates.
Bong-Wan Kim Speech Information Technology & Industry Promotion Center in Korea: Activities and Directions.
Bonneau-Maynard Hélène Annotations for Dynamic Diagnosis of the Dialog State
Bontcheva Kalina  A Unicode-based Environment for Creation and Use of Language Resources.
Extracting Information for Automatic Indexing of Multimedia Material.
Bordel G.  Automatic Morphological Segmentation for Continuous Speech Recognition of Basque.
Bordoni  Luciana An ontology-based approach in the literary research: two case-studies
Borin  Lars Living off the land: The Web as a source of practice texts for learners of less prevalent languages.
Bott Steffan CATCG: a general purpose parsing tool applied.
Bouayad-Agha  Nadjet PILLS:  Multilingual generation of medical information documents with overlapping content.
Bouillon Pierrette Acquisition of Qualia Elements from Corpora - Evaluation of a Symbolic Learning Method
From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Bouma Gosse  Querying Dependency Treebanks in XML.
Bousquet-Vernhettes C. Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.
Boves Lous  Experiences from the Spoken Dutch Corpus Project.
Bowker Lynne Translation Tracking System: A tool for managing translation archives.
Bozzi  Andrea LAperLA: an integrated graphical-linguistic System for old printed Latin Texts.
Braasch Anna  Current Developments of STO - the Danish Lexicon Project for NLP and HLT Applications.
Branco António Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning.
Brants Sabine Developments in the TIGER Annotation Scheme and their Realization in the Corpus.
Braschler Martin The Importance of Evaluation for Cross-Language System Development: the CLEF Experience.
Briscoe Ted Robust Accurate Statistical Annotation of General Text.
Subcategorization Acquisition as an Evaluation Method for WSD.
Broeder Dan Metadata Tools Supporting Controlled Vocabulary Services.
LREP: A Language Repository Exchange Protocol.
Metadata Proposals for Corpora and Lexica.
Brown Dunstan A typological database of agreement
Brown Ralf D. Field Testing the Tongues Speech-to-Speech Machine Translation System.
Brugman Hennie  Multimodal Annotations in Gesture and Sign Language Studies
Multimedia Annotation with Multilingual Input Methods and Search Support.
Buhmann Jeska Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.
Buitelaar Paul Evaluation Corpora for Sense Disambiguation in the Medical Domain.
An Efficient and Flexible Format for Linguistic and Semantic Annotation.
Burger Susanne NESPOLE!’s Multilingual and Multimodal Corpus.
Busemann Stephan  Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
C  
Caballero Mónica Multidialectal Spanish Modeling for ASR.
Cabré  M. Teresa Towards an Ontology for a Human Genome Knowledge Base
Cadée Niels Advanced Tools for the Study of Natural Interactivity.
Caelen Jean  Towards a large corpus of spoken dialogue in French that will be freely available: the "Parole Publique" project and its first realisations.
Cahill Aoife TTS - A Treebank Tool Suite.
Calle Javier Natural Language Dialogue in a Virtual Assistant Interface.
Calzolari Nicoletta From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
Integrating Two Semantic Lexicons, SIMPLE and ItalWordNet: What Can We Gain?
Towards Best Practice for Multiword Expressions in Computational Lexicons.
Caminero  Javier Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
A Multilingual Speaker Verification System: Architecture and Performance Evaluation.
Campbell Nick Automatic Detection of Acoustic Centres of Reliability for Tagging Paralinguistic Information in Expressive Speech.
Recording techniques for capturing natural every-day speech.
Capelli  Giuseppe Linguistic and Computational Problems for the Creation of an Italian Children's Corpus of Spoken Language.
Capelli A. Knowledge Mining and Discovery for Searching in Literary Texts.
Capstick Joanne COLLATE: Competence Center in Speech and Language Technology.
Cardeñosa Jesús The UNL System.
Carletta Jean Advanced Tools for the Study of Natural Interactivity.
Carreras Xavier A Flexible Distributed Architecture for Natural Language Analyzers.
Carroll John Robust Accurate Statistical Annotation of General Text.
Cartier Emmanuel Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
Casper Johanneke Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.
Cassán Adán BDCon: A Spanish knowledge database.
A step forward to hypertext.
Cassidy Steve XQuery as an Annotation Query Language: a Use Case Analysis.
Castelli Margherita  Which way should we go? Metaphoric expressions in lexical resources.
Catarsi M. N.  Knowledge Mining and Discovery for Searching in Literary Texts.
Cattoni Roldano  ADAM: The SI-TAL Corpus of Annotated Dialogues.
Cavaglia Gabriela Measuring corpus homogeneity using a range of measures for inter-document distance.
Cervell Sergi  BDCon: A Spanish knowledge database.
A step forward to hypertext.
Chali Yllias Experiments in Topic Detection.
Chan Chiu-ki  Using Parallel Corpora to enrich Multilingual Lexical Resources.
Chardin Ivan Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
Charfuelán Marcela A XML-based tool for evaluation of SLDS.
Chenfour N. Study and quantification of the declination for the Arabic speech synthesis system PARADIS.
Chevreau Karine Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
Choi  Key-Sun Word Sense Disambiguation with Information Retrieval Technique.
Choi Yong-Seok Word Sense Disambiguation with Information Retrieval Technique.
Choukri Khalid The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Give me a bug. a framework for a bug report service.
OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Choy-Kim  Chuah Computational Linguistics at Universiti Sains Malaysia
Chung-hye Han Development and Evaluation of a Korean Treebank and its Application to NLP
Cid Antonio The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Cieri Christopher  The DASL Project: a Case Study in Data Re-Annotation and Re-Use.
Language Resource Creation and Distribution at the Linguistic Data Consortium: A Progress Report.
TIDES Language Resources: A Resource Map for Translingual Information Access.
Claveau Vincent Acquisition of Qualia Elements from Corpora - Evaluation of a Symbolic Learning Method
Clough Paul Building and annotating a corpus for the study of journalistic text reuse
Coch José  Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
Collier Nigel PIA-Core: Semantic Annotation through Example-based Learning
Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).
Collin Olivier Using the Web as a Linguistic Resource for Learning Reformulations Automatically.
Colom Mireia BDCon: A Spanish knowledge database.
A step forward to hypertext.
Constant  Matthieu Methods for Constructing Lexicon-Grammar Resources: The Example of Measure Expressions.
Constantini Erica NESPOLE!’s Multilingual and Multimodal Corpus.
Copestake Ann Multiword expressions: linguistic precision and reusability.
Coppen P. A. Using Grammatical Description as a Metalanguage Resource.
Coray Giovanni SAM: System for Multi-criteria Text Alignment.
Corazzari Ornella The Lexico-semantic Annotation of an Italian Treebank
Corbett Greville A typological database of agreement
Cotsomrong Patcharika Phonetically Distributed Continuous Speech Corpus for Thai Language.
Cotton Scott An integrated framework for treebanks and multilayer annotations
Couvreur Christophe Database Adaptation for Speech Recognition in Cross-Environmental Conditions.
Cranen Bert Usability Evaluation of a Dutch Multimodal System for Train Timetable Information.
Cresti Emanuela The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Creswell Cassandre  A Dependency Treebank for English.
Cristea Dan The Use of Referential Constraints in Structuring Discourse.
AR-Engine - a framework for unrestricted co-reference resolution.
Cucchiarini Catia  A Human Language Technologies Platform for the Dutch language: awareness, management maintenance and distribution.
A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
Cucurullo Sebastiana Italian Arabic linguistic tools.
Cunningham Hamish A Unicode-based Environment for Creation and Use of Language Resources.
Extracting Information for Automatic Indexing of Multimedia Material.
EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
How feasible is the reuse of grammars for Named Entity Recognition?
D  
D'Halleweyn Elisabeth A Human Language Technologies Platform for the Dutch language: awareness, management maintenance and distribution.
Dabbadie Marianne  Terminological Enrichment for non-Interactive MT Evaluation
Daelemans Walter Evaluation of Machine Learning Methods for Natural Language Processing Tasks.
A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
Daille Beatrice  Extracting French-Japanese Word Pairs from Bilingual Corpora based on Transliteration Rules.
Incremental Recognition and Referential Categorization of French Proper Names.
Daleno Domenico Methods and Tools for Prosodic Analysis of a Spoken Italian Corpus.
Dalli Angelo Creation and Evaluation of Extensible Language Resources for Maltese.
Danieli Morena ADAM: The SI-TAL Corpus of Annotated Dialogues.
Dantsuji M.  English Speech Database Read by Japanese Learners for CALL System Development.
das Graças Volpe Nunes M. DIADORIM - A Lexical Database for Brazilian Portuguese.
De Vriend F. Using Grammatical Description as a Metalanguage Resource.
A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
Declerck Thierry LREP: A Language Repository Exchange Protocol.
COLLATE: Competence Center in Speech and Language Technology.
Demiros Iason Multi-level XML-based Corpus Annotation.
Demuynck  Kris  Word Segmentation in the Spoken Dutch Corpus.
An Improved Algorithm for the Automatic Segmentation of Speech Corpora.
Dendien  Jacques Computerized linguistic resources of the research laboratory ATILF for lexical and textual analysis: Frantext, TLFi, and the software Stella.
Denecke Matthias Signatures, Typed Feature Structures and RDFS.
Derouin Marie-Jeanne  Report on the Revision of the Lexicographical Standard ISO 1951 Presentation/Representation of Entries in Dictionaries.
Devillers Laurence Annotations for Dynamic Diagnosis of the Dialog State
Di Eugenio Barbara The binomial cumulative distribution function, or, is my system better than yours?
Díaz de Ilarraza A. A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation
Diehl Frank SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation.
Dienes Péter Efficient Stochastic Part-of-Speech Tagging for Hungarian.
Dieter Maas Bilingual Indexing for Information Retrieval with AUTINDEX.
Dima Gabriela-Eugenia AR-Engine - a framework for unrestricted co-reference resolution.
Dimitriadis Alexis  A unified system for accessing typological databases.
Dini Luca  SiSSA: An Infrastructure for Developing NLP Applications.
Distante Raffaella CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Dobrov Boris V.  Evaluation of Thesaurus on Sociopolitical Life as Information-Retrieval Tool.
Docio Fernandez  Laura  Acoustic Modeling and Training of a Bilingual ASR System when a Minority Language is Involved.
Doikoff Dimitar Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Domeij Rickard Different Ways of Evaluating a Swedish Grammar Checker.
Dorow Beate Using Parallel Corpora to enrich Multilingual Lexical Resources.
Dowdall James Technical Terminology as a Critical Resource.
Dowding John Leo: an Architecture for Sharing Resources for Unification-Based Grammars.
Draxler  Christoph  Three New Corpora at the Bavarian Archive for Speech Signals – and a First Step Towards Distributed Web-Based Recording.
Drozdzynsky  Witold A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources.
Drude S.  Analysis of Lexical Structures from Field Linguistics and Language Engineering.
Duchateau  Jacques Word Segmentation in the Spoken Dutch Corpus.
An Improved Algorithm for the Automatic Segmentation of Speech Corpora.
Duclaye Florence Using the Web as a Linguistic Resource for Learning Reformulations Automatically.
Dutilh Tilly Implementation and Evaluation of PAROLE PoS in a National Context
Dutoit Dominique  Evaluating resource acquisition tools for Information Extraction.
An Algorithm to Find Words from Definitions.
Dwyer A. Methods of Language Documentation in the DOBES project.
Dybkjær Laila THE NITE WORKBENCH. A Tool for Annotation of Natural Interactivity and Multimodal Data
Natural Interactivity Resources – Data, Annotation Schemes and Tools.
Advanced Tools for the Study of Natural Interactivity.
Dzeroski Saso A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank.
E  
Early Susana Leo: an Architecture for Sharing Resources for Unification-Based Grammars.
Eguchi Koji Sensitivity of IR systems Evaluation to Topic Difficulty.
Eickeler Stefan Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval.
Elhadad Noemie Collection and linguistic processing of a large-scale corpus of medical articles.
Eman Ossama  OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Eon-Suk Ko  Development and Evaluation of a Korean Treebank and its Application to NLP
Erbach Gregor COLLATE: Competence Center in Speech and Language Technology.
Esteban López Cristina A XML-based tool for evaluation of SLDS.
Etheridge  Anne ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.
Evans David A. Expanding lexicons by inducing paradigms and  validating attested forms.
Evans Richard  Assessing the difficulty of finding people in texts.
A corpus based investigation of morphological disagreement in anaphoric relations.
Bilingual alignment of anaphoric expressions.
Evert Stefan Advanced Tools for the Study of Natural Interactivity.
YAC - A Recursive Chunker for Unrestricted German Text.
Ezeiza N. Automatic Morphological Segmentation for Continuous Speech Recognition of Basque.
A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation
F  
Fabre Cécile  Acquisition of Qualia Elements from Corpora - Evaluation of a Symbolic Learning Method
Fakotakis Nikos Combining Bayesian and Support Vector Machines Learning to automatically complete Syntactical Information for HPSG-like Formalisms.
Comparative Evaluation of Collocation Extraction Metrics.
DELOS: An Automatically Tagged Economic Corpus for Modern Greek.
Falavigna Daniele The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Farmakiotou Dimitra Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
PatEdit: An Information Extraction Pattern Editor for Fast System Customization.
Feliu  Judit Towards an Ontology for a Human Genome Knowledge Base
Fellbaum Christiane From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Ferret Oliver Building domain specific lexical hierarchies from corpora.
Fillmore Charles J Seeing Arguments through Transparent Structures.
The FrameNet Database and Software Tools.
Towards Best Practice for Multiword Expressions in Computational Lexicons.
Finch Andrew Beyond Tag Trigrams: New Local Features for Tagging.
Fiscus Jonathan G. NIST Rich Transcription 2002 Evaluation: A Preview.
A Practical Introduction to ATLAS
Fitschen Arne Using Descriptive Generalisations in the Acquisition of Lexical Data for Word Formation.
Flickinger Dan Multiword expressions: linguistic precision and reusability.
Fluhr Christian  Building domain specific lexical hierarchies from corpora.
Font Llitjos Ariadna Evaluation and collection of proper name pronunciations online.
Forsbom Eva Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
Fourour Nordine Incremental Recognition and Referential Categorization of French Proper Names.
Franco Drabek  Elliott Annotating the functional chunks in Chinese sentences.
François Hélène The Greedy Algorithm and its Application to the Construction of a Continuous Speech Database.
Frederking Robert  Field Testing the Tongues Speech-to-Speech Machine Translation System.
Frid Nadezhda Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
Fujii Atsushi Producing a Large-scale Encyclopedic Corpus over the Web.
Fumitada Itakura Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research.
G  
Gaizauskas Robert  EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
Building and annotating a corpus for the study of journalistic text reuse
A Comparison of Machine Learning Algorithms for Prepositional Phrase Attachment.
Gallardo Carolina The UNL System.
Galvani Greghi Juliana DIADORIM - A Lexical Database for Brazilian Portuguese.
Gaminde Iñaki BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.
García Bernabé  Myriam ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.
García Mateo  Carmen Acoustic Modeling and Training of a Bilingual ASR System when a Minority Language is Involved.
García-Granada Fernando A Labelling Proposal to Annotate Dialogues.
García-Serrano Ana Integrating Spanish Linguistic Resources in a Web Site Assistant.
Natural Language Dialogue in a Virtual Assistant Interface.
Garofolo John NIST Rich Transcription 2002 Evaluation: A Preview.
A Practical Introduction to ATLAS
Gasperin  Caroline Evaluation of parsed corpora: Experiments in user-transparent and user-visible evaluation.
Gedge Oren SpeechDat across all America: SALA II.
OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Database Adaptation for Speech Recognition in Cross-Environmental Conditions.
Gendner Véronique A Protocol for Evaluating Analyzers of Syntax (PEAS).
Comparative study of oral and written French automatically tagged with morpho-syntactic information.
Geutner Petra Design of the VICO Spoken Dialogue System: Evaluation of User Expectations by Wizard-of-Oz Experiments.
Ghorbel Hatem SAM: System for Multi-criteria Text Alignment.
Gibbon Dafydd Annotation Driven Concordancing: the PAX Toolkit.
Enhanced Dialogue Markup for Crisis Talk Scenario Resources
Gil Àngel CATCG: a general purpose parsing tool applied.
Giouli Voula Multi-level XML-based Corpus Annotation.
Glass Michael The binomial cumulative distribution function, or, is my system better than yours?
Glesner Angelika User-State Labeling Procedures For The Multimodal Data Collection Of SmartKom.
Goedemans Rob A unified system for accessing typological databases.
Goedertier Wim Experiences from the Spoken Dutch Corpus Project.
Word Segmentation in the Spoken Dutch Corpus.
Gojenola K. A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation
Gomes Luís Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning.
González Rodríguez  Joaquin A Multilingual Speaker Verification System: Architecture and Performance Evaluation.
Goulian Jérôme Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.
Grefenstette Gregory Expanding lexicons by inducing paradigms and  validating attested forms.
Grigoriev  Nikolai Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
Grigorieva Svetlana Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
Grishman  Ralph From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Formal Mechanisms for Capturing Regularizations.
Diversity of Scenarios in Information extraction.
Summarization System Integrated with Named Entity Tagging and IE pattern Discovery.
Towards Best Practice for Multiword Expressions in Computational Lexicons.
Grosskopf Beate SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation.
Grover Claire Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
Gruenstein Alexander  Language Resources for Multi-Modal Dialogue Systems.
Guazzini Elisabetta CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Linguistic and Computational Problems for the Creation of an Italian Children's Corpus of Spoken Language.
Gut Ulrike The TASX-environment: an XML-based toolset for time aligned speech corpora
H  
Haber  Renato Floresta Sintá(c)tica: A treebank for Portuguese.
Haeseryn W. Using Grammatical Description as a Metalanguage Resource.
Hagège Caroline A Robust and Flexible Platform for Dependency Extraction.
Hahn  Udo Towards Very Large Ontologies for Medical Language Processing.
Hajicova Eva Argument/Valency Structure in PropBank, LCS Database and Prague Dependency Treebank: A Comparative Pilot Study.
Hamon Thierry How to evaluate necessary cooperative systems of terminology building?
Hamza Oana A Unicode-based Environment for Creation and Use of Language Resources.
Extracting Information for Automatic Indexing of Multimedia Material.
How feasible is the reuse of grammars for Named Entity Recognition?
Hanke Thomas iLex - A tool for Sign Language Lexicography and Corpus Analysis.
Hansen Silvia Developments in the TIGER Annotation Scheme and their Realization in the Corpus.
Hanzl Vaclav Tool for Czech Pronunciation Generation Combining Fixed Rules with Pronunciation Lexicon and Lexicon Management Tool
Hara Shoichiro Resource Sharing System for Humanity Researches
Harabagiu Sanda M. Multidocument Summarization with GISTexter.
Hardie Andrew EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
Hartley Anthony Automatic Ranking of MT Systems.
Hasida  Kôiti Construction of a Japanese Relevance-tagged Corpus.
Hathout Nabil From WordNet to CELEX: acquiring morphological links from dictionaries of synonyms.
Webaffix: Discovering Morphological Links on the WWW.
Heid Ulrich Using Descriptive Generalisations in the Acquisition of Lexical Data for Word Formation.
Advanced Tools for the Study of Natural Interactivity.
Hemsen Holmer A XML-based tool for evaluation of SLDS.
Hernáez  Inmaculada BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.
Hernández Gómez L. Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
A XML-based tool for evaluation of SLDS.
HMMs for Automatic Phonetic Segmentation
A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation
Hess Michael Technical Terminology as a Critical Resource.
Heuft Barbara OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Heyer Gerhard Information Extraction from Text Corpora: Using Filters on Collocation Sets.
Hinrichs Erhard W. A Hybrid Architecture for Robust Parsing of German.
Hiromichi Kawanami Designing speech database with prosodic variety for expressive TTS system.
Hiroyuki  Shinnou Learning of word sense disambiguation rules by Co-training, checking co-occurrence of features.
Hockenmaier Julia Acquiring Compact Lexicalized Grammars from a Cleaner Treebank.
Hoekstra Heleen Syntactic Analysis in the Spoken Dutch Corpus (CGN).
Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.
Höge Harald SpeechDat across all America: SALA II.
Project Proposal TC-STAR - Make Speech to Speech Translation Real
Give me a bug. a framework for a bug report service.
Höldrich Robert The Pronouncing Dictionary of Austrian German and the other Major Varieties of German - A Phonetic Resources Database on the Pronunciation of German.
Horbach Sabine SpeechDat across all America: SALA II.
Horvat  Bogomir Designing Prosodic Databases for Automatic Modeling of Slovenian Language in a Multilingual TTS System.
Preliminary Evaluation of Slovenian Mobile Database PoliDat.
Hoste Veronique Evaluation of Machine Learning Methods for Natural Language Processing Tasks.
Hovy  Eduard Computer-Aided Specification of Quality Models for Machine Translation Evaluation.
Hozjan Vladimir Objective analysis of emotional speech for English and Slovenian Interface emotional speech databases.
Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.
Hozumi  Tanaka Constructing a lexicon of action.
Enhanced Japanese Electronic Dictionary Look-up.
Towards a Thesaurus of Predicates.
Hu Oliver How to evaluate necessary cooperative systems of terminology building?
Hurault-Plantet Martine Cooperation between black box and glass box approaches for the evaluation of a question answering system.
Huttunen Silja Diversity of Scenarios in Information extraction.
I  
Ide Nancy Standards for Language Resources.
The American National Corpus: More Than the Web Can Provide.
Towards Best Practice for Multiword Expressions in Computational Lexicons.
Illouz Gabriel A Protocol for Evaluating Analyzers of Syntax (PEAS).
Imamura Kenji Comparing and Extracting Paraphrasing Words with 2-Way Bilingual Dictionaries.
Inagaki Yasuyoshi Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research.
Iomdin Leonid Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
Iraola Moreno Luis Multilingual Terminology Extraction and Validation.
Isahara Hitoshi Automatic extraction of differences between spoken and written languages, and automatic translation from the written to the spoken language.
The Valence Patterns of Japanese Verbs Extracted From The EDR Corpus.
Summarization System Integrated with Named Entity Tagging and IE pattern Discovery.
Isard Amy Advanced Tools for the Study of Natural Interactivity.
Ishikawa  Tetsuya Producing a Large-scale Encyclopedic Corpus over the Web.
Ishikawa Tomonori Acquiring Lexical Knowledge for Anaphora Resolution.
Ishizaki Shun Extraction of Associative Attributes from Nouns and Quantitative Expression of Prototype Concept.
Iskra Dorota SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation.
OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Itahashi Shuichi The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
Ito Akinori Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Ivanova Krassimira Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
J  
Jaerborg Jerker Lexical and Textual Resources for Sense Recognition and Description.
Jakopin Primož The feasibility of a complete text corpus.
Jameson Anthony COLLATE: Competence Center in Speech and Language Technology.
Jardino Michele A Protocol for Evaluating Analyzers of Syntax (PEAS).
Jong-Hoon Oh Word Sense Disambiguation with Information Retrieval Technique.
Jorg Brigitte COLLATE: Competence Center in Speech and Language Technology.
Jung Daniel  Humans as Corpus - Language Learning Strategies in Virtually Mediated Authentic Environments
Junichi Fukumoto Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).
K  
Kaalep Heiki-Jaan Using the Text Corpus to Create a Comprehensive List of Phrasal Verbs
Kacic Zdravko Preliminary Evaluation of Slovenian Mobile Database PoliDat.
Design and Implementation of the Slovenian Phonetic and Morphology Lexicons for the Use in Spoken Language Applications.
Objective analysis of emotional speech for English and Slovenian Interface emotional speech databases.
Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.
Kahusk Neeme Technical Terminology as a Critical Resource.
Kakihana  Kyoko Japanese and American Sign Language Dictionary System for Japanese and English Users.
Kaljurand  Kaarel Technical Terminology as a Critical Resource.
Kallmayer Laura VIQTORYA -- A Visual Query Tool for Syntactically Annotated Corpora
Kan  Min-Yen Using the Annotated Bibliography as a Resource for Indicative Summarization.
Kando  Noriko Sensitivity of IR systems Evaluation to Topic Difficulty.
Karger Reinhard COLLATE: Competence Center in Speech and Language Technology.
Karkaletsis Vangelis Ellogon: A New Text Engineering Platform.
Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
PatEdit: An Information Extraction Pattern Editor for Fast System Customization.
Kashioka Hideki Translation Unit Concerning Timing of Simultaneous Translation.
Comparing and Extracting Paraphrasing Words with 2-Way Bilingual Dictionaries.
Katunobu Itou Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Producing a Large-scale Encyclopedic Corpus over the Web.
Kavallieratou  Ergina Diagnostic Assessment of Telephone Transmission Impact on ASR Performance and Human-to-Human Speech Quality.
Kawaguchi  Nobuo Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research.
Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research.
Kawahara  Daisuke Construction of a Japanese Relevance-tagged Corpus.
Kawahara Tatsuya Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Kermanidis Katia Combining Bayesian and Support Vector Machines Learning to automatically complete Syntactical Information for HPSG-like Formalisms.
Kermes Hannah YAC - A Recursive Chunker for Unrestricted German Text.
Kiessling Andreas SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation.
Kikui  Genichiro Proposal of a very-large-corpus acquisition method by cell-formed registration.
King  Margaret Computer-Aided Specification of Quality Models for Machine Translation Evaluation.
Kingsbury Paul From TreeBank to PropBank.
Kinyon Alexandra Identifying Verb Arguments and their Syntactic Function in the Penn Treebank.
Kipp Michael Annotating and Measuring Multimodal Behaviour – Tycoon Metrics in the Anvil Tool.
Kita S. Multimodal Annotations in Gesture and Sign Language Studies
Klassmann Alexander Multimedia Annotation with Multilingual Input Methods and Search Support.
Klavans Judith A Method for Automatically Building and Evaluating Dictionary Resources.
Using the Annotated Bibliography as a Resource for Indicative Summarization.
Kloosterman Geert Querying Dependency Treebanks in XML.
Kluck Michael Inside the Evaluation Process of the Cross-Language Evaluation Forum (CLEF): Issues of Multilingual Topic Creation and Multilingual Relevance Assessment.
Knutsson Ola Different Ways of Evaluating a Swedish Grammar Checker.
Kobayashi Kazuhiro The Valence Patterns of Japanese Verbs Extracted From The EDR Corpus.
Köhler Joachim Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval.
Methods and Tools for Speech Data Acquisition exploiting a Database of German Parliamentary Speeches and Transcripts from the Internet.
Koichi Takeuchi PIA-Core: Semantic Annotation through Example-based Learning
Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).
Koit Mare Technical Terminology as a Critical Resource.
Koizumi Atsuko An Annotated Japanese Sign Language Corpus.
Kokkinakis Dimitrios Lexical and Textual Resources for Sense Recognition and Description.
Kokkinakis George Combining Bayesian and Support Vector Machines Learning to automatically complete Syntactical Information for HPSG-like Formalisms.
Comparative Evaluation of Collocation Extraction Metrics.
DELOS: An Automatically Tagged Economic Corpus for Modern Greek.
Kolkovska Sia Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Kolodnytsky Mykola THE NITE WORKBENCH. A Tool for Annotation of Natural Interactivity and Multimodal Data
Advanced Tools for the Study of Natural Interactivity.
Konstantinidis Alexis Multi-level XML-based Corpus Annotation.
Kopecek Ivan Databases of Heterogeneous Segments for Concatenative Speech Synthesis.
Korhonen Anna Subcategorization Acquisition as an Evaluation Method for WSD.
Kosaka Michiko Formal Mechanisms for Capturing Regularizations.
Kosho Shudo Modal Expressions in Natural Language Sentence and Their Similarity.
Koutsias Ioannis PatEdit: An Information Extraction Pattern Editor for Fast System Customization.
Kouylekov Milen  Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Kow Eric Towards Reusable NLP Components.
Kramer Markus Multimedia Annotation with Multilingual Input Methods and Search Support.
Kreidlin Leonid Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
Krishnamurthy Ramesh A corpus-based investigation of junk emails.
Kruyt Truus Implementation and Evaluation of PAROLE PoS in a National Context
Kübler Sandra A Hybrid Architecture for Robust Parsing of German.
Kucerova Ivona Argument/Valency Structure in PropBank, LCS Database and Prague Dependency Treebank: A Comparative Pilot Study.
Kunze Claudia GermaNet - representation, visualization, application.
Kuriyama Kazuko Sensitivity of IR systems Evaluation to Topic Difficulty.
Kurokawa Kazuya A Web-based English Abstract Writing Tool Using a Tagged E-J Parallel Corpus.
Kurz Daniela A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping
Kuwabara Hisao The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
Kyo Kageura Extracting French-Japanese Word Pairs from Bilingual Corpora based on Transliteration Rules.
L  
Lacatusu Finley Multidocument Summarization with GISTexter.
Lam  Wai Developing Infrastructure for the Evaluation of Single and Multi-document Summarization Systems in a Cross-lingual Environment.
Lambeau Fabre Multiword expressions: linguistic precision and reusability.
Lamel Lori Annotations for Dynamic Diagnosis of the Dialog State
Langlais  Philippe Translators at work with TRANSTYPE: Resource and Evaluation.
Lapalme Guy Translators at work with TRANSTYPE: Resource and Evaluation.
Laprun Christophe A Practical Introduction to ATLAS
Larson Martha Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval.
Lauer Christoph Advanced Tools for the Study of Natural Interactivity.
Lafourcade Mathieu UNL Lexical Selection with Conceptual Vectors.
Laureys  Tom Word Segmentation in the Spoken Dutch Corpus.
An Improved Algorithm for the Automatic Segmentation of Speech Corpora.
Lavelli Alberto SiSSA: An Infrastructure for Developing NLP Applications.
Le An Ha Learning description of term patterns using glossary resources.
Le Meur André Report on the Revision of the Lexicographical Standard ISO 1951 Presentation/Representation of Entries in Dictionaries.
Lecomte  Josette Computerized linguistic resources of the research laboratory ATILF for lexical and textual analysis: Frantext, TLFi, and the software Stella.
Lee Akinobu Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Lee Haejoong TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.
Creating Annotation Tools with the Annotation Graph Toolkit.
Models and Tools for Collaborative Annotation
Lee Young-Ju Speech Information Technology & Industry Promotion Center in Korea: Activities and Directions.
Leisher Mark A Unicode-based Environment for Creation and Use of Language Resources.
Leitão José Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning.
Lemnitzer Lothar GermaNet - representation, visualization, application.
Lemon Oliver Language Resources for Multi-Modal Dialogue Systems.
Lenci Alessandro From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
The Lexicon-Grammar Balance in Robust Parsing of Italian
Towards Best Practice for Multiword Expressions in Computational Lexicons.
Lenz  Eva Anna Converting a Corpus into a Hypertext: An Approach Using XML Topic Maps and XSLT.
Lesmo Leonardo Transformed Subcategorization Frames in Chunk Parsing.
Letellier-Zarshenas Sabine Towards a large corpus of spoken dialogue in French that will be freely available: the "Parole Publique" project and its first realisations.
Levinson St. Multimodal Annotations in Gesture and Sign Language Studies
Lezius Wolfgang Advanced Tools for the Study of Natural Interactivity.
Liberman Mark Language Resource Creation and Distribution at the Linguistic Data Consortium: A Progress Report.
TIDES Language Resources: A Resource Map for Translingual Information Access.
Lida Kermanidis Katia DELOS: An Automatically Tagged Economic Corpus for Modern Greek.
Lin  Jimmy The Web as a Resource for Question Answering: Perspectives and Challenges.
Linden André SAM: System for Multi-criteria Text Alignment.
Linhard Klaus Database Adaptation for Speech Recognition in Cross-Environmental Conditions.
Lombardo Vincenzo Transformed Subcategorization Frames in Chunk Parsing.
Lönneker Birte Building Concept Frames based on Text Corpora.
López de Ipiña K. Automatic Morphological Segmentation for Continuous Speech Recognition of Basque.
Loranger  Marie Translators at work with TRANSTYPE: Resource and Evaluation.
Louka Katerina End-to-End Evaluation of Multimodal Dialogue Systems – can we Transfer Established Methods?
Loukachevitch Natalia V. Evaluation of Thesaurus on Sociopolitical Life as Information-Retrieval Tool.
M  
Ma Xiaoyi TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.
Creating Annotation Tools with the Annotation Graph Toolkit.
Models and Tools for Collaborative Annotation
Maci Ermanno SiSSA: An Infrastructure for Developing NLP Applications.
MacLeod Catherine From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Lexical Annotation for Multi-word Entries Containing Nominalizations.
Towards Best Practice for Multiword Expressions in Computational Lexicons.
Madariaga Imanol BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.
Maeda  Kazuaki TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.
Creating Annotation Tools with the Annotation Graph Toolkit
Models and Tools for Collaborative Annotation
Magnini Bernardo Towards Automatic Evaluation of Question/Answering Systems
Maiorano Steven J. Multidocument Summarization with GISTexter.
Makino S. English Speech Database Read by Japanese Learners for CALL System Development.
Mana  Nadia The Lexico-semantic Annotation of an Italian Treebank
Manandhar  Suresh Improving an Ontology Refinement Method with Hyponymy Patterns.
Proposal for Evaluating Ontology Refinement Methods.
Manstetten Dietrich Design of the VICO Spoken Dialogue System: Evaluation of User Expectations by Wizard-of-Oz Experiments.
Manabu Okumura Constructing a lexicon of action.
Some Examinations of Intrinsic Methods for Summary Evaluation Based on the Text Summarization Challenge (TSC).
Mapelli Valerie The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Maragoudakis  Manolis Combining Bayesian and Support Vector Machines Learning to automatically complete Syntactical Information for HPSG-like Formalisms.
Maragoudakis Emmanuel OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Marasek Krzysztof SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation.
Marimón Montserrat Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
Marín Rafael BDCon: A Spanish knowledge database.
A step forward to hypertext.
Marinelli Rita Proper Names In A Semantic Database.
Mariño José B. Multidialectal Spanish Modeling for ASR.
Markert  Katja Towards a Corpus Annotated for Metonymies: the Case of Location Names.
Martell Craig FORM: An Extensible, Kinematically-based Gesture Annotation Scheme.
Martens Jean-Pierre  Experiences from the Spoken Dutch Corpus Project.
Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.
Word Segmentation in the Spoken Dutch Corpus.
Martin Alvin NIST Rich Transcription 2002 Evaluation: A Preview.
Martin Jean-Claude Annotating and Measuring Multimodal Behaviour – Tycoon Metrics in the Anvil Tool.
Multimodal and Adaptative Pedagogical Resources.
Martín Municio Ángel Combining statistics on n-grams for automatic term recognition.
Martin Patricia SpeechDat across all America: SALA II.
Martin Philippe The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Martínez Paloma Integrating Spanish Linguistic Resources in a Web Site Assistant.
Martínez-Hinarejos Carlos D. A Labelling Proposal to Annotate Dialogues.
Masayuki Asahara Use of XML and Relational Databases for Consistent Development and Maintenance of Lexicons and Annotated Corpora.
Masumi  Narita A Web-based English Abstract Writing Tool Using a Tagged E-J Parallel Corpus.
Matsubara  Shigeki Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research.
Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research.
Matsumoto Kenji Automatic Alignment of Japanese and English Newspaper Articles using an MT System and a Bilingual Company Name Dictionary.
Matsumoto Yuji Use of XML and Relational Databases for Consistent Development and Maintenance of Lexicons and Annotated Corpora.
Maxwell Mike Resources for Morphology Learning and Evaluation.
Maynard Diana A Unicode-based Environment for Creation and Use of Language Resources.
Extracting Information for Automatic Indexing of Multimedia Material.
How feasible is the reuse of grammars for Named Entity Recognition?
Mazzini Giampaolo SiSSA: An Infrastructure for Developing NLP Applications.
McDonald Scott Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
McEnery Tony A Unicode-based Environment for Creation and Use of Language Resources.
EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
McKeown  Kathleen Using the Annotated Bibliography as a Resource for Indicative Summarization.
Merenciano Josep M. BDCon: A Spanish knowledge database.
A step forward to hypertext.
Merkel Magnus A System for Incremental and Interactive Word Linking.
Metzing  Dieter Co-reference annotation and resources: A multilingual corpus of typologically diverse languages.
Meyers Adam Formal Mechanisms for Capturing Regularizations.
Michelassi P. Knowledge Mining and Discovery for Searching in Literary Texts.
Mihalcea Rada F. Bootstrapping Large Sense Tagged Corpora.
Miháltz Márton Automatism and User Interaction: Building a Hungarian WordNet.
Mikros George Quantitative parameters in corpus design: Estimating the optimum text size in Modern Greek language.
Milde Jan-Torsten The TASX-environment: an XML-based toolset for time aligned speech corpora
Miller  Ken ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.
Miller  Pam ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.
Miller Keith Scaling the ISLE Framework: Use of Existing Corpus Resources for Validation of MT Evaluation Metrics across Languages.
Mimura Masato Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Minematsu N. English Speech Database Read by Japanese Learners for CALL System Development.
Mineur Anne-Marie A unified system for accessing typological databases.
Mitchell Brian A Comparison of Machine Learning Algorithms for Prepositional Phrase Attachment.
Mitkov Ruslan A corpus based investigation of morphological disagreement in anaphoric relations.
Bilingual alignment of anaphoric expressions.
Modic  Robert A Contrastive Acoustic-Phonetic Analysis of Slovenian and English Diphthongs.
Mokhtari Parham Automatic Detection of Acoustic Centres of Reliability for Tagging Paralinguistic Information in Expressive Speech.
Molina Antonio Word Sense Disambiguation using Statistical Models and WordNet.
Molino Stefano CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Möller  Sebastian Diagnostic Assessment of Telephone Transmission Impact on ASR Performance and Human-to-Human Speech Quality.
Monachesi Paola A unified system for accessing typological databases.
Monachini Monica CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Monceaux Laura Cooperation between black box and glass box approaches for the evaluation of a question answering system.
A Protocol for Evaluating Analyzers of Syntax (PEAS).
Moneglia Massimo The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Montemagni Simonetta The Lexicon-Grammar Balance in Robust Parsing of Italian.
Moody John Field Testing the Tongues Speech-to-Speech Machine Translation System.
Moore Johanna Fish or Fowl: A Wizard of Oz Evaluation of Dialogue Strategies in the Restaurant Domain.
Moortgat Michael Experiences from the Spoken Dutch Corpus Project.
Using the Spoken Dutch Corpus for type-logical grammar induction.
Syntactic Analysis in the Spoken Dutch Corpus (CGN).
Moot Richard Using the Spoken Dutch Corpus for type-logical grammar induction.
Morarescu Paul Multidocument Summarization with GISTexter.
Moreno Asunción SpeechDat across all America: SALA II.
OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Multidialectal Spanish Modeling for ASR.
Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.
Moreno Lidia Word Sense Disambiguation using Statistical Models and WordNet.
Bilingual alignment of anaphoric expressions.
Moreno Ortiz Antonio New Developments in Ontological Semantics.
Moreno Sandoval Antonio The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Moretti L. Knowledge Mining and Discovery for Searching in Literary Texts.
Morin Emmanuel Incremental Recognition and Referential Categorization of French Proper Names.
Morlesín José F. Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
Mosel U. Methods of Language Documentation in the DOBES project.
Mouradi A. Study and quantification of the declination for the Arabic speech synthesis system PARADIS.
Moyal Ami Database Adaptation for Speech Recognition in Cross-Environmental Conditions.
Muhr Rudolf The Pronouncing Dictionary of Austrian German and the other Major Varieties of German - A Phonetic Resources Database on the Pronunciation of German.
Muischnek Kadri Using the Text Corpus to Create a Comprehensive List of Phrasal Verbs
Müller   Frank H. A Hybrid Architecture for Robust Parsing of German.
Müller  Achim F. Designing Prosodic Databases for Automatic Modeling of Slovenian Language in a Multilingual TTS System.
Muller Christoph An API for Discourse-level Access to XML-encoded Corpora.
Muñoz R. Bilingual alignment of anaphoric expressions.
Murata  Masaki Automatic extraction of differences between spoken and written languages, and automatic translation from the written to the spoken language.
Muresan Smaranda A Method for Automatically Building and Evaluating Dictionary Resources.
Mustafa El Hadi Widad Terminological Enrichment for non-Interactive MT Evaluation
N  
Nahli Ouafae Italian Arabic linguistic tools.
Nakagawa S. English Speech Database Read by Japanese Learners for CALL System Development.
Nakamura Satoshi The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
Nanba  Hidetsugu Some Examinations of Intrinsic Methods for Summary Evaluation Based on the Text Summarization Challenge (TSC).
Na-Rae Han Development and Evaluation of a Korean Treebank and its Application to NLP
Navas Eva BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.
Navigli Roberto Automatic Adaptation of WordNet to Domains.
Negri Matteo Towards Automatic Evaluation of Question/Answering Systems
Nenadić Goran Tuning Context Features with Genetic Algorithms.
Automatic Acronym Acquisition and Term Variation Management within Domain-Specific Texts.
Nic Gearailt Donnla Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
Nicolas Pascale Towards a large corpus of spoken dialogue in French that will be freely available: the "Parole Publique" project and its first realisations.
Nilsson  Kristina Living off the land: The Web as a source of practice texts for learners of less prevalent languages.
Nimb Sanni Adverbs in Semantic Lexica for NLP - The extension of the Danish SIMPLE lexicon with Time Adverbs.
Nirenburg Sergei New Developments in Ontological Semantics.
Nissim  Malvina Towards a Corpus Annotated for Metonymies: the Case of Location Names.
Nobata  Chikashi Summarization System Integrated with Named Entity Tagging and IE pattern Discovery.
Extended Named Entity Hierarchy.
Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).
Nogueiras  Albino OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.
Noldus Lucas P. J.J. Advanced Tools for the Study of Natural Interactivity.
Norihiro Ogata Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).
Nübel Rita Bilingual Indexing for Information Retrieval with AUTINDEX.
Nugues Pierre An Algorithm to Find Words from Definitions.
O  
Offenga Freddy Metadata Tools Supporting Controlled Vocabulary Services.
Okamoto  Jun Extraction of Associative Attributes from Nouns and Quantitative Expression of Prototype Concept.
Okumura Ryo Enhanced Japanese Electronic Dictionary Look-up.
Ole Bernsen Niels THE NITE WORKBENCH. A Tool for Annotation of Natural Interactivity and Multimodal Data
Natural Interactivity Resources – Data, Annotation Schemes and Tools.
Advanced Tools for the Study of Natural Interactivity.
Oliveira Luis  Morphosyntactic Disambiguation for TTS Systems.
Olsen Sussi  Lemma selection in domain specific computational lexica - some specific problems
Olsson Leif-Jöran Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
Oostdijk  Nelleke Experiences from the Spoken Dutch Corpus Project.
Orasan Constantin Assessing the difficulty of finding people in texts.
A corpus-based investigation of junk emails.
Building annotated resources for automatic text summarisation.
Oravecz Csaba Efficient Stochastic Part-of-Speech Tagging for Hungarian.
Ortega García  Javier A Multilingual Speaker Verification System: Architecture and Performance Evaluation.
Osenova Petya Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Osolsobe Klara A Procedure for Word Derivational Processes Concerning Lexicon Extension in Highly Inflected Languages
Otero  Josep M. P Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
P  
Padró Lluís A Flexible Distributed Architecture for Natural Language Analyzers.
Paggio Patrizia Semantic Lexical Resources Applied to Content-based Querying - the OntoQuery Project.
Pajot Sylvain A Practical Introduction to ATLAS
Pala Karel Databases of Heterogeneous Segments for Concatenative Speech Synthesis.
A Procedure for Word Derivational Processes Concerning Lexicon Extension in Highly Inflected Languages
Paliouras Georgios Ellogon: A New Text Engineering Platform.
Pallett David NIST Rich Transcription 2002 Evaluation: A Preview.
Palmer Martha From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Development and Evaluation of a Korean Treebank and its Application to NLP
From TreeBank to PropBank.
Palomar M. Bilingual alignment of anaphoric expressions.
Papageorgiou Harris Multi-level XML-based Corpus Annotation.
Pardos Fernando Combining statistics on n-grams for automatic term recognition.
Paroubek Patrick A Protocol for Evaluating Analyzers of Syntax (PEAS).
Pastra Katerina How feasible is the reuse of grammars for Named Entity Recognition?
Pazienza María Teresa Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
Pearce Darren A Comparative Evaluation of Collocation Extraction Techniques
Pease Catherine Bilingual Indexing for Information Retrieval with AUTINDEX.
Pecchia Laura Linguistic and Computational Problems for the Creation of an Italian Children's Corpus of Spoken Language.
Pedersen Bolette S. Semantic Lexical Resources Applied to Content-based Querying - the OntoQuery Project.
Penide-Lopez Victoria  End-to-End Evaluation of Multimodal Dialogue Systems – can we Transfer Established Methods?
Peral J. Bilingual alignment of anaphoric expressions.
Pérez Gema BDCon: A Spanish knowledge database.
A step forward to hypertext.
Petasis Georgios Ellogon: A New Text Engineering Platform.
Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
PatEdit: An Information Extraction Pattern Editor for Fast System Customization.
Petek  Bojan A Contrastive Acoustic-Phonetic Analysis of Slovenian and English Diphthongs.
Peters Carol The Importance of Evaluation for Cross-Language System Development: the CLEF Experience.
Peters W. Analysis of Lexical Structures from Field Linguistics and Language Engineering.
Metadata Proposals for Corpora and Lexica.
Pfitzinger Hartmut R. Reducing Segmental Duration Variation by Local Speech Rate Normalization of Large Spoken Language Resources.
Pianesi Fabio NESPOLE!’s Multilingual and Multimodal Corpus.
SiSSA: An Infrastructure for Developing NLP Applications.
Pianta Emanuele Opportunistic Semantic Tagging.
Piao S. L. Building and annotating a corpus for the study of journalistic text reuse
Picchi Eugenio Italian Arabic linguistic tools.
Pierrel  Jean-Marie Computerized linguistic resources of the research laboratory ATILF for lexical and textual analysis: Frantext, TLFi, and the software Stella.
Pinto Elisabeth SpeechDat across all America: SALA II.
Piperidis Stelios Multi-level XML-based Corpus Annotation.
Pirrelli Vito Advanced Tools for the Study of Natural Interactivity.
The Lexicon-Grammar Balance in Robust Parsing of Italian.
Piskorski  Jakub A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping
A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources.
Pius ten Hacken Word Formation and the Validation of Lexical Resources.
Pla Ferran Word Sense Disambiguation using Statistical Models and WordNet.
Poesio Massimo Acquiring Lexical Knowledge for Anaphora Resolution.
Poibeau Thierry Evaluating resource acquisition tools for Information Extraction.
Pollak Petr Tool for Czech Pronunciation Generation Combining Fixed Rules with Pronunciation Lexicon and Lexicon Management Tool
Pönninghaus  Jens Co-reference annotation and resources: A multilingual corpus of typologically diverse languages.
Popescu-Belis  Andrei Corpus-based Evaluation of a French Spelling and Grammar Checker.
Electronic Dictionaries - from Publisher Data to a Distribution Server: the DicoPro, DicoEast and RERO Projects.
Computer-Aided Specification of Quality Models for Machine Translation Evaluation.
Porta Zamorano Jordi Combining statistics on n-grams for automatic term recognition.
Postolache Oana-Diana AR-Engine - a framework for unrestricted co-reference resolution.
Power  Richard PILLS:  Multilingual generation of medical information documents with overlapping content.
Prasad Rashmi Automatic Evaluation: Using a DATE Dialogue Act Tagger for User Satisfaction and Task Completion Prediction.
Preiss Judita Subcategorization Acquisition as an Evaluation Method for WSD.
Prevete Roberto Towards Automatic Evaluation of Question/Answering Systems
Prodanof Irina SiSSA: An Infrastructure for Developing NLP Applications.
Prokopidis Prokopis Multi-level XML-based Corpus Annotation.
Prolo Carlos A. Identifying Verb Arguments and their Syntactic Function in the Penn Treebank.
Prószéky Gábor Automatism and User Interaction: Building a Hungarian WordNet.
Przybocki Mark NIST Rich Transcription 2002 Evaluation: A Preview.
Q  
Qi  Hong Evaluating Web-based Question Answering Systems.
Qu  Yan Expanding lexicons by inducing paradigms and  validating attested forms.
Quasthoff Uwe Information Extraction from Text Corpora: Using Filters on Collocation Sets.
Quixal Martí CATCG: a general purpose parsing tool applied.
R  
Raake Alexander Does the Content of Speech Influence its Perceived Sound Quality?
Radev Dragomir Developing Infrastructure for the Evaluation of Single and Multi-document Summarization Systems in a Cross-lingual Environment.
Evaluating Web-based Question Answering Systems.
Raileanu Diana Evaluation Corpora for Sense Disambiguation in the Medical Domain.
An Efficient and Flexible Format for Linguistic and Semantic Annotation.
Rajman Martin Automatic Ranking of MT Systems.
Evaluation of a Vector Space Similarity Measure in a Multilingual Framework.
Rambow Owen A Dependency Treebank for English.
Ramesh Ganesh A Text-based for Detection and Filtering of Commercial Segments in Broadcast News.
Randall Beth TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.
Rapp Reinhard A Part-of-Speech-Based Search Algorithm for Translation Memories.
Rapp Stefan An Iterative Data Collection Approach for Multimodal Dialogue Systems.
Raskin Victor New Developments in Ontological Semantics.
Refice Mario Methods and Tools for Prosodic Analysis of a Spoken Italian Corpus.
Reichel Uwe Multi-Tier Annotations in the Verbmobil Corpus.
Reithinger Norbert Advanced Tools for the Study of Natural Interactivity.
Relaño José Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
Ren  Fuji Annotating the functional chunks in Chinese sentences.
Renmans Bram Syntactic Analysis in the Spoken Dutch Corpus (CGN).
Reppen Randi The American National Corpus: More Than the Web Can Provide.
Réty Jean-Hugues Multimodal and Adaptative Pedagogical Resources.
Ribarov Kiril Old Sources and Modern Procedures: Computer Processing of Old-Church Slavonic.
Ribeiro Ricardo Morphosyntactic Disambiguation for TTS Systems.
Rinaldi Fabio Technical Terminology as a Critical Resource.
Rincón Antonio SpeechDat across all America: SALA II.
Ripplinger Bärbel An Efficient and Flexible Format for Linguistic and Semantic Annotation.
Robba Isabelle A Protocol for Evaluating Analyzers of Syntax (PEAS).
Robert Gilbert Electronic Dictionaries - from Publisher Data to a Distribution Server: the DicoPro, DicoEast and RERO Projects.
Rodrigo-Aguado Luis Natural Language Dialogue in a Virtual Assistant Interface.
Rodríguez  M. Carmen Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
Rojc Matej Design and Implementation of the Slovenian Phonetic and Morphology Lexicons for the Use in Spoken Language Applications.
Romary Laurent Standards for Language Resources.
Towards Reusable NLP Components.
LREP: A Language Repository Exchange Protocol.
Rose Tony The Reuters Corpus Volume 1 - from Yesterday's News to Tomorrow's Language Resources.
Rosner Michael The Future of Maltilex.
Rosset Sophie Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.
Annotations for Dynamic Diagnosis of the Dialog State
Rousseau-Hans Françoise Building domain specific lexical hierarchies from corpora.
Roux Claude A Robust and Flexible Platform for Dependency Extraction.
Roventini Adriana Proper Names In A Semantic Database.
Integrating Two Semantic Lexicons, SIMPLE and ItalWordNet: What Can We Gain?
Rudzionis Algimantas Lithuanian Speech Database LTDIGITS
Rudzionis Vytautas Lithuanian Speech Database LTDIGITS
Ruimy Nilda CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Ruiz Ureña Rafael J. Combining statistics on n-grams for automatic term recognition.
Ruiz-Cristina Alberto Integrating Spanish Linguistic Resources in a Web Site Assistant.
Rüter Wolff Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval.
Ruz  Pedro M Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
A Multilingual Speaker Verification System: Architecture and Performance Evaluation.
Rzewuska Maria  S Terminology Resources in the Context of a Major Translation Project.
S  
Sacaleanu Bogdan An Efficient and Flexible Format for Linguistic and Semantic Annotation.
Sadao Kurohashi Construction of a Japanese Relevance-tagged Corpus.
Sag Ivan A.  Multiword expressions: linguistic precision and reusability.
Sagawa Hirohiko An Annotated Japanese Sign Language Corpus.
Saggion Horacio Extracting Information for Automatic Indexing of Multimedia Material.
Developing Infrastructure for the Evaluation of Single and Multi-document Summarization Systems in a Cross-lingual Environment.
Sågvall Hein Anna Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
Salmen Angelika  Multi-Modal Menus And Traffic Interaction. Timing As A Crucial Factor For User Driven Mode Decision
Salmon-Alt Susanne Nominal Expressions in Multilingual Corpora: Definites and Demonstratives.
Samaritakis Georgios Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
Sánchez Juan Mª BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.
Sánchez León Fernando Combining statistics on n-grams for automatic term recognition.
Sanchez Quero Manuel Building ancient Spanish dictionaries for spell-checking of DL texts
Sanchis Emilio A Labelling Proposal to Annotate Dialogues.
Sandrini Vanessa ADAM: The SI-TAL Corpus of Annotated Dialogues.
Santos  Diana Evaluation of parsed corpora: Experiments in user-transparent and user-visible evaluation.
Floresta Sintá(c)tica: A treebank for Portuguese.
Sasaki  Felix Co-reference annotation and resources: A multilingual corpus of typologically diverse languages.
Sassen Claudia Enhanced Dialogue Markup for Crisis Talk Scenario Resources
Sassolini Eva Italian Arabic linguistic tools.
Sato  Hiroaki Seeing Arguments through Transparent Structures.
The FrameNet Database and Software Tools.
Säuberlich Bettina Using Descriptive Generalisations in the Acquisition of Lexical Data for Word Formation.
Savino Michelina Methods and Tools for Prosodic Analysis of a Spoken Italian Corpus.
Schaden Stefan A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names.
Schadle Igor Towards a large corpus of spoken dialogue in French that will be freely available: the "Parole Publique" project and its first realisations.
Scherf  Oliver A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources.
Schiel  Florian Three New Corpora at the Bavarian Archive for Speech Signals – and a First Step Towards Distributed Web-Based Recording.
The SmartKom Multimodal Corpus at BAS.
User-State Labeling Procedures For The Multimodal Data Collection Of SmartKom.
Multi-Tier Annotations in the Verbmobil Corpus.
Schiffman Barry Building a Resource for Evaluating the Importance of Sentences.
Schmeier Sven A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping
Schmidt Paul Bilingual Indexing for Information Retrieval with AUTINDEX.
Schmitz Klaus-Dirk Subject-field-specific Ontologies and Terminologies for the Web Community.
Schneider  René n-grams of Seeds: A Hybrid System for Corpus-Based Text Summarization.
Schulte im Walde Sabine Acquiring Lexical Knowledge for Anaphora Resolution.
A Subcategorisation Lexicon for German Verbs induced from a Lexicalised PCFG.
Schulz  Stefan Towards Very Large Ontologies for Medical Language Processing.
Schuurman Ineke Syntactic Analysis in the Spoken Dutch Corpus (CGN).
Scott  Donia PILLS:  Multilingual generation of medical information documents with overlapping content.
Scott Michael J. The binomial cumulative distribution function, or, is my system better than yours?.
Sébillot Pascale Acquisition of Qualia Elements from Corpora - Evaluation of a Symbolic Learning Method
Sedlacek Radek A Procedure for Word Derivational Processes Concerning Lexicon Extension in Highly Inflected Languages
Segarra Encarna  Word Sense Disambiguation using Statistical Models and WordNet.
Sekine Satoshi Summarization System Integrated with Named Entity Tagging and IE pattern Discovery.
Extended Named Entity Hierarchy.
Senia Franco SpeechDat across all America: SALA II.
Seretan Violeta The Use of Referential Constraints in Structuring Discourse.
Severinson Eklundh Kerstin Different Ways of Evaluating a Swedish Grammar Checker.
Sgall Petr A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank.
Shammass Shaunie Database Adaptation for Speech Recognition in Cross-Environmental Conditions.
Shammass Sherrie OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Sharoff  Serge Meaning as use: exploitation of aligned corpora for the contrastive study of lexical semantics.
Shikano Kiyohiro Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Designing speech database with prosodic variety for expressive TTS system.
Shimizu K. English Speech Database Read by Japanese Learners for CALL System Development.
Shimohata  Mitsuo Automatic paraphrasing based on parallel corpus for normalization.
Statistical Machine Translation on Paraphrased Corpora.
Shin Saim Word Sense Disambiguation with Information Retrieval Technique.
Shirai  Kiyoaki Construction of a Word Sense Tagged Corpus for SENSEVAL-2 Japanese Dictionary Task.
Shirai Satoshi  Towards a Thesaurus of Predicates.
Siemund Rainer OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Silva João Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning.
Simoni Jean-Luc Building domain specific lexical hierarchies from corpora.
Simov Alexander  Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Simov Kiril Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Slavcheva Milena Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
Solá  Mercedes A Multilingual Speaker Verification System: Architecture and Performance Evaluation.
Soria Claudia Advanced Tools for the Study of Natural Interactivity.
ADAM: The SI-TAL Corpus of Annotated Dialogues.
Soroa A. A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation
Spasić Irena Tuning Context Features with Genetic Algorithms.
Automatic Acronym Acquisition and Term Variation Management within Domain-Specific Texts.
Spenke Harriet Multimedia Annotation with Multilingual Input Methods and Search Support.
Spyropoulos Constantine D. Ellogon: A New Text Engineering Platform.
PatEdit: An Information Extraction Pattern Editor for Fast System Customization.
Starlander Marianne Corpus-based Evaluation of a French Spelling and Grammar Checker.
Steedman Mark  Acquiring Compact Lexicalized Grammars from a Cleaner Treebank.
Steffens Frank  Design of the VICO Spoken Dialogue System: Evaluation of User Expectations by Wizard-of-Oz Experiments.
Steinbrecher Eric Field Testing the Tongues Speech-to-Speech Machine Translation System.
Steiner  Ilona VIQTORYA -- A Visual Query Tool for Syntactically Annotated Corpora
Steininger Silke The SmartKom Multimodal Corpus at BAS.
User-State Labeling Procedures For The Multimodal Data Collection Of SmartKom.
Stergar  Janez Designing Prosodic Databases for Automatic Modeling of Slovenian Language in a Multilingual TTS System.
Stevenson Mark The Reuters Corpus Volume 1 - from Yesterday's News to Tomorrow's Language Resources.
Storrer Angelika Converting a Corpus into a Hypertext: An Approach Using XML Topic Maps and XSLT.
Stranáková-Lopatková M.  Valency Dictionary of Czech Verbs: Complex Tectogrammatical Annotation.
Strassel Stephanie Developing Infrastructure for the Evaluation of Single and Multi-document Summarization Systems in a Cross-lingual Environment.
The DASL Project: a Case Study in Data Re-Annotation and Re-Use.
Strik H.  A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
Strube Michael An API for Discourse-level Access to XML-encoded Corpora.
An Iterative Data Collection Approach for Multimodal Dialogue Systems.
Sturm Janienke Usability Evaluation of a Dutch Multimodal System for Train Timetable Information.
A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
Suderman Keith The American National Corpus: More Than the Web Can Provide.
Sudo Kiyoshi Extended Named Entity Hierarchy.
Suebvisai  Sinaporn Phonetically Distributed Continuous Speech Corpus for Thai Language.
Sugaya Fumiaki Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World.
Proposal of a very-large-corpus acquisition method by cell-formed registration.
Automatic machine translation selection scheme to output the best result.
Suguru Saito Constructing a lexicon of action.
Sukkar Rafid SpeechDat across all America: SALA II.
Sumita  Eiichiro Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World.
Automatic paraphrasing based on parallel corpus for normalization.
Statistical Machine Translation on Paraphrased Corpora.
Supphanat  Kanongphara Phonetically Distributed Continuous Speech Corpus for Thai Language.
Sutcliffe Richard F. E. Searching via Keywords or Concept Hierarchies - Which is Better?
Suzuki Emiko  Japanese and American Sign Language Dictionary System for Japanese and English Users.
Svendsen Torbjørn Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles.
Swerts Marc Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.
Szkely Rachel Automatism and User Interaction: Building a Hungarian WordNet.
T  
Taber Harriet A Dependency Treebank for English.
Tablan Valentin A Unicode-based Environment for Creation and Use of Language Resources.
Tadic Marko Building the Croatian National Corpus.
Takagi Akira  Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research.
Takano Ogino The Valence Patterns of Japanese Verbs Extracted From The EDR Corpus.
Takao Kazutaka Comparing and Extracting Paraphrasing Words with 2-Way Bilingual Dictionaries.
Takeda Kazuya  The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research.
Takehito Utsuro A Web-based English Abstract Writing Tool Using a Tagged E-J Parallel Corpus.
Takenobu Tokunaga Constructing a lexicon of action.
Enhanced Japanese Electronic Dictionary Look-up.
Takeuchi Masaru An Annotated Japanese Sign Language Corpus.
Takezawa Toshiyuki The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World.
Proposal of a very-large-corpus acquisition method by cell-formed registration.
Automatic machine translation selection scheme to output the best result.
Tamburini  Fabio Automatic detection of prosodic prominence in continuous speech.
A dynamic model for reference corpora structure definition
Tanaka Hideki Automatic Alignment of Japanese and English Newspaper Articles using an MT System and a Bilingual Company Name Dictionary.
Tanev Hristo  Towards Automatic Evaluation of Question/Answering Systems
Tanguy Ludovic Webaffix: Discovering Morphological Links on the WWW.
Tapias Daniel Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
A Multilingual Speaker Verification System: Architecture and Performance Evaluation.
Tavoni M. Knowledge Mining and Discovery for Searching in Literary Texts.
Teixeira Martins Ronaldo DIADORIM - A Lexical Database for Brazilian Portuguese.
Terken Jacques Usability Evaluation of a Dutch Multimodal System for Train Timetable Information.
Teufel Simone  Developing Infrastructure for the Evaluation of Single and Multi-document Summarization Systems in a Cross-lingual Environment.
Collection and linguistic processing of a large-scale corpus of medical articles.
Teunissen Lisanne  A Human Language Technologies Platform for the Dutch language: awareness, management maintenance and distribution.
Thanopoulos Aristomenis Comparative Evaluation of Collocation Extraction Metrics.
Thaning Sten Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
Thurmair Gregor From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
Tiberius Carole How to build a multilingual inheritance-based lexicon.
A typological database of agreement
Tiedemann Jörg Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
MatsLex - a Multilingual Lexical Database for Machine Translation
Timimi Ismaïl Terminological Enrichment for non-Interactive MT Evaluation
Todirascu Amalia Towards Reusable NLP Components.
Tomiyama Y. English Speech Database Read by Japanese Learners for CALL System Development.
Tomoki Toda  Designing speech database with prosodic variety for expressive TTS system.
Toporowska Gronostaj M. Lexical and Textual Resources for Sense Recognition and Description.
Torre Toledano Doroteo HMMs for Automatic Phonetic Segmentation
Toshifumi Tanabe Modal Expressions in Natural Language Sentence and Their Similarity.
Tovar Edmundo The UNL System.
Trancoso Isabel Morphosyntactic Disambiguation for TTS Systems.
Trippel Thorsten Annotation Driven Concordancing: the PAX Toolkit.
Tropf Herbert OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
Tsuji Keita Extracting French-Japanese Word Pairs from Bilingual Corpora based on Transliteration Rules.
Tsuyoshi Masuda Designing speech database with prosodic variety for expressive TTS system.
Tufis  Dan Lexical token alignment: experiments, results and applications.
Turini F. Knowledge Mining and Discovery for Searching in Literary Texts.
Türk Uli End-to-End Evaluation of Multimodal Dialogue Systems – can we Transfer Established Methods?
Türk Ulrich The SmartKom Multimodal Corpus at BAS.
U  
Uí Dhonnchadha Elaine A Two-level Morphological Analyser and Generator for Irish using Finite-State Transducers.
Uibo Heli Experimental Two-Level Morphology of Estonian.
Ule Tylman A Hybrid Architecture for Robust Parsing of German.
Ulivieri Marisa CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Integrating Two Semantic Lexicons, SIMPLE and ItalWordNet: What Can We Gain?
Ursu Christian A Unicode-based Environment for Creation and Use of Language Resources.
Extracting Information for Automatic Indexing of Multimedia Material.
Uszkoreit Hans COLLATE: Competence Center in Speech and Language Technology.
V  
Valderrábanos Antonio S. Multilingual Terminology Extraction and Validation
Valentín Lluís BDCon: A Spanish knowledge database.
A step forward to hypertext.
Valentín Oriol CATCG: a general purpose parsing tool applied.
van den Heuvel Henk SpeechDat across all America: SALA II.
SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation.
Give me a bug. a framework for a bug report service.
van der Wouden Ton Syntactic Analysis in the Spoken Dutch Corpus (CGN).
van Eynde Frank Experiences from the Spoken Dutch Corpus Project.
van Genabith Josef TTS - A Treebank Tool Suite.
van Heuven Vincent J. Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.
Van Parys Ruben Word Segmentation in the Spoken Dutch Corpus.
Vandeghinste  Vincent Lexicon Optimization: Maximizing Lexical Coverage in Speech Recognition through Automated Compounding.
Vanni Michelle Scaling the ISLE Framework: Use of Existing Corpus Resources for Validation of MT Evaluation Metrics across Languages.
Vanrullen Tristan An evaluation of different symbolic shallow parsing techniques.
Váradi Tamás The Hungarian National Corpus.
Vargas M. Isabel Italian Arabic linguistic tools.
Veber Marek A Procedure for Word Derivational Processes Concerning Lexicon Extension in Highly Inflected Languages
Velardi Paola Automatic Adaptation of WordNet to Domains.
Verdonik Darinka  Design and Implementation of the Slovenian Phonetic and Morphology Lexicons for the Use in Spoken Language Applications.
Veronis Jean The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
Vichot Franz Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
Vider Kadri Technical Terminology as a Critical Resource.
Vieira Renata Acquiring Lexical Knowledge for Anaphora Resolution.
Nominal Expressions in Multilingual Corpora: Definites and Demonstratives.
Vigouroux Nadine Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.
Villaneau Jeanne Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.
Villavicencio Aline Multiword expressions: linguistic precision and reusability.
Villegas  Marta From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
From DTD to relational dB. An automatic generation of a lexicographical station out of ISLE guidelines.
Vilnat Anne A Protocol for Evaluating Analyzers of Syntax (PEAS).
Vindigni Michael Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
Vintar Spela Evaluation Corpora for Sense Disambiguation in the Medical Domain.
An Efficient and Flexible Format for Linguistic and Semantic Annotation.
Vivaldi  Jorge Towards an Ontology for a Human Genome Knowledge Base
Vögele Andreas Advanced Tools for the Study of Natural Interactivity.
W  
Wahlster Wolfgang  COLLATE: Competence Center in Speech and Language Technology.
Walker Marilyn Automatic Evaluation: Using a DATE Dialogue Act Tagger for User Satisfaction and Task Completion Prediction.
A Dependency Treebank for English.
Fish or Fowl: A Wizard of Oz Evaluation of Dialogue Strategies in the Restaurant Domain.
Wambacq  Patrick An Improved Algorithm for the Automatic Segmentation of Speech Corpora.
Wang Fusi Usability Evaluation of a Dutch Multimodal System for Train Timetable Information.
Ward Monica Issues in the design, construction and use of Language Resources (LR) for Endangered Languages (ELs).
Watanabe Taro Statistical Machine Translation on Paraphrased Corpora.
Wächter-Kollpacher Eva The Pronouncing Dictionary of Austrian German and the other Major Varieties of German - A Phonetic Resources Database on the Pronunciation of German.
Wathelet Ringo Beyond Tag Trigrams: New Local Features for Tagging.
Wegener  Claudia Co-reference annotation and resources: A multilingual corpus of typologically diverse languages.
Wegst Tillmann  COLLATE: Competence Center in Speech and Language Technology.
Weiguo Fan Evaluating Web-based Question Answering Systems.
Weijnitz Per Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
Weilhammer Karl Multi-Tier Annotations in the Verbmobil Corpus.
White Kieran Searching via Keywords or Concept Hierarchies - Which is Better?
Whitehead Miles The Reuters Corpus Volume 1 - from Yesterday's News to Tomorrow's Language Resources.
Whittaker Steve Fish or Fowl: A Wizard of Oz Evaluation of Dialogue Strategies in the Restaurant Domain.
Widdows Dominic  Using Parallel Corpora to enrich Multilingual Lexical Resources.
Wilks Yorick Extracting Information for Automatic Indexing of Multimedia Material.
How feasible is the reuse of grammars for Named Entity Recognition?
Willems Don Metadata Tools Supporting Controlled Vocabulary Services.
Witt  Andreas Co-reference annotation and resources: A multilingual corpus of typologically diverse languages.
Wittenburg Peter Methods of Language Documentation in the DOBES project.
Multimodal Annotations in Gesture and Sign Language Studies
Analysis of Lexical Structures from Field Linguistics and Language Engineering.
LREP: A Language Repository Exchange Protocol.
Metadata Proposals for Corpora and Lexica.
Wolff Christian Information Extraction from Text Corpora: Using Filters on Collocation Sets.
Wolinski Francis Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
Womser-Hacker Christa Inside the Evaluation Process of the Cross-Language Evaluation Forum (CLEF): Issues of Multilingual Topic Creation and Multilingual Relevance Assessment.
Wright Hastie Helen Automatic Evaluation: Using a DATE Dialogue Act Tagger for User Satisfaction and Task Completion Prediction.
Wu Harris Evaluating Web-based Question Answering Systems.
Wutiwiwatchai  Chai Phonetically Distributed Continuous Speech Corpus for Thai Language.
Wynne Martin  The Language Resource Archive of the 21st Century.
X  
Xu Feiyu  A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping
A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources
Y  
Yablonsky Serge A. Corpora as Object-Oriented System. From UML-notation to Implementation.
Yamada Atsushi Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models.
Yamamoto  Mikio The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research
Yamamoto Hirofumi Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World.
Yamamoto Kazuhide Towards a Thesaurus of Predicates.
Yamamoto Seiichi Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World.
Proposal of a very-large-corpus acquisition method by cell-formed registration.
Automatic machine translation selection scheme to output the best result.
Yamashita Akiko Use of XML and Relational Databases for Consistent Development and Maintenance of Lexicons and Annotated Corpora.
Yanagida Masuzo  Automatic machine translation selection scheme to output the best result.
Yangarber Roman Diversity of Scenarios in Information extraction.
Yasuda Keiji Automatic machine translation selection scheme to output the best result.
Yasuharu Den Use of XML and Relational Databases for Consistent Development and Maintenance of Lexicons and Annotated Corpora.
Yasunaga Hisashi Resource Sharing System for Humanity Researches
Yasuo Koyama Modal Expressions in Natural Language Sentence and Their Similarity.
Yoneda  Ryuichi Use of XML and Relational Databases for Consistent Development and Maintenance of Lexicons and Annotated Corpora.
Yongnam Um Speech Information Technology & Industry Promotion Center in Korea: Activities and Directions.
Yoshimoto K. English Speech Database Read by Japanese Learners for CALL System Development
Yoshimura Kenji  Modal Expressions in Natural Language Sentence and Their Similarity.
Yusoff  Zaharin Computational Linguistics at Universiti Sains Malaysia
Yvon François Using the Web as a Linguistic Resource for Learning Reformulations Automatically.
Z  
Zabokrtský Zdenek Valency Dictionary of Czech Verbs: Complex Tectogrammatical Annotation.
A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank.
Zakaria Kurdi Mohamed Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.
Toward an objective and generic Method for Spoken Language Understanding Systems Evaluation: an extension of the DCR method.
Zalbide Xabier BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.
Zampolli Antonio  From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
Towards Best Practice for Multiword Expressions in Computational Lexicons.
Zayat Salim TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.
Zgank Andrej Preliminary Evaluation of Slovenian Mobile Database PoliDat.
Zhou Qiang Annotating the functional chunks in Chinese sentences.
Zitouni  Imed OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.