| Keyword | Paper Title | Paper |
| Accessibility | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
| A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
| Acoustic Adaptation | SPEECON - Speech Data for Consumer Devices | 63 |
| Acquisition Tools | SpeechDat-Car Fixed Platform | 331 |
| Adjuncts | Automatic Assignment of Grammatical Relations | 218 |
| African Languages | Developing a Multilingual Telephone Based Information System in African Languages | 329 |
| Agglutinative Languages | A Word-level Morphosyntactic Analyzer for Basque | 44 |
| Alignment | Building the Croatian-English Parallel Corpus | 119 |
| Grammarless Bracketing in an Aligned Bilingual Corpus | 164 |
| The Bank of Swedish | 300 |
| Analogy-Based Acquisition | Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation | 99 |
| Analysis-Modification-Synthesis systems | The Cost258 Signal Generation Test Array | 1 |
| Analytic Tools | Introduction of KIBS (Korean Information Base System) Project | 239 |
| Analytical Functions | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
| Anaphora | Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
| Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
| Anaphora Resolution | CLinkA A Coreferential Links Annotator | 179 |
| Annotated Corpora | On the Usage of Kappa to Evaluate Agreement on Coding Tasks | 206 |
| Annotated Corpus | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
| Annotated Dialogue Corpora | Dialogue Annotation for Language Systems Evaluation | 33 |
| Annotation | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
| The EUDICO Project, Multi Media Annotation over the Internet | 45 |
| An XML-based Representation Format for Syntactically Annotated Corpora | 59 |
| The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
| Annotating Communication Problems Using the MATE Workbench | 134 |
| The MATE Workbench Annotation Tool, a Technical Description | 166 |
| Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
| Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
| Annotating Resources for Information Extraction | 263 |
| Annotation Errors | What are Transcription Errors and Why are They made? | 205 |
| Annotation frameworks | Dialogue Annotation for Language Systems Evaluation | 33 |
| Annotation Graphs | Transcribing with Annotation Graphs | 192 |
| Annotation Guidelines | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
| Annotation Schema | Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
| Annotation tool | FAST - Towards a Semi-automatic Annotation of Corpora | 130 |
| A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
| Annotation Tools | Dialogue Annotation for Language Systems Evaluation | 33 |
| Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
| SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
| Interactive Corpus Annotation | 334 |
| NaniTrans: a Speech Labelling Tool | 345 |
| Annotation of a Multichannel Noisy Speech Corpus | 358 |
| Applications | Portuguese Corpora at CLUL | 72 |
| Architecture | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
| Architectures | Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture | 244 |
| Argument Structure | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
| ASR | MHATLex: Lexical Resources for Modelling the French Pronunciation | 37 |
| Regional Pronunciation Variants for Automatic Segmentation | 307 |
| Assessment Techniques | COCOSDA - a Progress Report | 364 |
| Assessment and Evaluation | For a Repository of NLP Tools | 316 |
| Automated Style Categorisation | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
| Automatic annotation | Target Suites for Evaluating the Coverage of Text Generators | 92 |
| Automatic Evaluation | Evaluation of Word Alignment Systems | 137 |
| How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done | 203 |
| Automatic Generation of Constructed Words | GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
| Automatic Lexical Acquisition | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
| Automatic Term Extraction | Term-based Identification of Sentences for Text Summarisation | 106 |
| Automatic Text Summarization | A Comparison of Summarization Methods Based on Task-based Evaluation | 14 |
| Automatically Category Code Assignment | Automatically Expansion of Thesaurus Entries with a Different Thesaurus | 142 |
| Batch Download | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
| Benchmark | Dialogue Annotation for Language Systems Evaluation | 33 |
| Best Practice | A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
| Bilingual Comparable Corpora | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
| Bilingual Corpora | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
| Bilingual Corpora Processing | Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora | 208 |
| Bilingual Corpus | Grammarless Bracketing in an Aligned Bilingual Corpus | 164 |
| Bilingual Dictionary | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
| Bilingual Lexicon Extraction | Evaluation of Word Alignment Systems | 137 |
| Bilingual Pronunciation Dictionary | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
| Binational Policies | A Platform for Dutch in Human Language Technologies | 348 |
| Bi-textual Database | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
| BNC2 | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
| Bootstrapping Methods | Automatically Augmenting Terminological Lexicons from Untagged Text | 320 |
| Bracketed Corpus (Treebank) | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
| CALL | Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts | 78 |
| Call-Center Applications | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
| CAPI | SpeechDat-Car Fixed Platform | 331 |
| Car environment | SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
| Case Frame | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
| Cenario | Evaluating Multi-party Multi-modal Systems | 368 |
| CES | A Parallel Corpus of Italian/German Legal Texts | 140 |
| Chinese Information Processing | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
| Chinese Language Processing | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
| Chinese-English Lexicons | Chinese-English Semantic Resource Construction | 327 |
| Chunks | PoS Disambiguation and Partial Parsing Bidirectional Interaction | 169 |
| Classified Listing of NLP Software | The New Edition of the Natural Language Software Registry (an Initiative of ACL hosted at DFKI) | 267 |
| Client-Server | An Open Architecture for the Construction and Administration of Corpora | 176 |
| Client-Server Model | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
| Clustering | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
| Coding Schemes | Annotating Communication Problems Using the MATE Workbench | 134 |
| Collaborative System | Evaluating Multi-party Multi-modal Systems | 368 |
| Collocation | Collocations as Word Co-ocurrence Restriction Data - An Application to Japanese Word Processor - | 2 |
| Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
| Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
| Collocations | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
| A Flexible Infrastructure for Large Monolingual Corpora | 226 |
| Combination of Automatic and Manual Annotation | Interactive Corpus Annotation | 334 |
| Combined Classifiers | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
| Combining Systems | Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
| Complements | Automatic Assignment of Grammatical Relations | 218 |
| Completeness | A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
| Compound Nouns | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
| Compounding | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
| Computational Lexicography | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
| Computational Lexicon | CDB - A Database of Lexical Collocations | 189 |
| Computational Lexicons | SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
| An Approach to Lexical Development for Inflectional Languages | 256 |
| Computational Psycholinguistics | The Treatment of Adjectives in SIMPLE: Theoretical Observations | 366 |
| Computer-Assisted Translation Tools | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
| Computer-assisted Writing Tool | Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts | 78 |
| Concept Abstraction | Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
| Concept Accuracy | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
| Concept Description | Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
| Concept Hierarchy | Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
| Conceptual Representation | Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents | 202 |
| Constraint Grammar | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
| Construction of Linguistic Resources | Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation | 98 |
| Consumer Electronics | SPEECON - Speech Data for Consumer Devices | 63 |
| Context | The Context (not only) for Humans | 156 |
| The (Un)Deterministic Nature of Morphological Context | 200 |
| Context-Based Method | The (Un)Deterministic Nature of Morphological Context | 200 |
| Continuous Speech | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
| Continuous Speech Recognition over the Telephone | Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
| Controlled Languages | Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking | 299 |
| Copyright | An Open Architecture for the Construction and Administration of Corpora | 176 |
| Coreference | Coreference in Annotating a Large Corpus | 19 |
| Coreference Annotation: Whither? | 100 |
| Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
| Coreference Annotation | CLinkA A Coreferential Links Annotator | 179 |
| Coreferential Chain | CLinkA A Coreferential Links Annotator | 179 |
| Corpora | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
| Evaluating Translation Quality as Input to Product Development | 136 |
| The MATE Workbench Annotation Tool, a Technical Description | 166 |
| An Open Architecture for the Construction and Administration of Corpora | 176 |
| SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
| Corpora Workbench | The Bank of Swedish | 300 |
| Corporate Usage of HLR | The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting | 260 |
| Corpus | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
| Coreference in Annotating a Large Corpus | 19 |
| FAST - Towards a Semi-automatic Annotation of Corpora | 130 |
| Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
| The American National Corpus: A Standardized Resource for American English | 196 |
| Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
| Application of WordNet ILR in Czech Word-formation | 223 |
| Introduction of KIBS (Korean Information Base System) Project | 239 |
| ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
| Language Resources Development at the Spanish Royal Academy | 297 |
| Design and Implementation of the Online ILSP Greek Corpus | 336 |
| A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
| A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
| Corpus Administration | An Open Architecture for the Construction and Administration of Corpora | 176 |
| Corpus Analysis | Modern Greek Corpus Taxonomy | 351 |
| Corpus Annotation | FAST - Towards a Semi-automatic Annotation of Corpora | 130 |
| Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
| Building a Treebank for French | 230 |
| Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
| Inter-annotator Agreement for a German Newspaper Corpus | 333 |
| Interactive Corpus Annotation | 334 |
| Corpus Architecture | The American National Corpus: A Standardized Resource for American English | 196 |
| Corpus Based Linguistics | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
| Corpus Building | An Open Architecture for the Construction and Administration of Corpora | 176 |
| Corpus Design | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
| The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
| Corpus Encoding | A Parallel Corpus of Italian/German Legal Texts | 140 |
| XCES: An XML-based Encoding Standard for Linguistic Corpora | 172 |
| Corpus Linguistics | Building the Croatian-English Parallel Corpus | 119 |
| Building a Treebank for French | 230 |
| Corpus Preparation | LT TTT - A Flexible Tokenisation Tool | 93 |
| Corpus Processing | Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
| A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
| Corpus-Based Techniques | An Approach to Lexical Development for Inflectional Languages | 256 |
| Correctness | A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
| co-Training | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
| Creativity | Addizionario: an Interactive Hypermedia Tool for Language Learning | 374 |
| Croatian | Building the Croatian-English Parallel Corpus | 119 |
| Cross-language Information Retrieval | The Evaluation of Systems for Cross-language Information Retrieval | 70 |
| Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine | 250 |
| Cross-Lingual | Cross-lingual Interpolation of Speech Recognition Models | 147 |
| Czech National Corpus | Derivation in the Czech National Corpus | 153 |
| Danish verbs | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
| Data Architectures | XCES: An XML-based Encoding Standard for Linguistic Corpora | 172 |
| Data Centers | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
| Data Collection | SPEECON - Speech Data for Consumer Devices | 63 |
| MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
| Data Collection and Distribution | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
| Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
| Data interchange | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
| Data Set | IREX: IR & IE Evaluation Project in Japanese | 27 |
| Data Visualization | Evaluating Multi-party Multi-modal Systems | 368 |
| Database | Design and Implementation of the Online ILSP Greek Corpus | 336 |
| Database Access | Object-oriented Access to the Estonian Phonetic Database | 128 |
| The Universal XML Organizer: UXO | 253 |
| Databases | Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
| SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
| Data-Collection | What are Transcription Errors and Why are They made? | 205 |
| DCR Evaluation | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
| Decision List Learning | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
| Definitions | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| Deletions | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
| Dependency | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
| Coreference in Annotating a Large Corpus | 19 |
| Dependency Format | Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
| Dependency Syntax | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
| Dependency Tree | Automatic Assignment of Grammatical Relations | 218 |
| Derivation | Derivation in the Czech National Corpus | 153 |
| Application of WordNet ILR in Czech Word-formation | 223 |
| Derivational Morphology | GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
| Descriptive Specificity | Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
| Design Methodology | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
| Development Environment | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
| Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
| Diacritic Characters Restoration | A Web-based Text Corpora Development System | 105 |
| Dialog Interaction Model | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
| Dialog System | Predictive Performance of Dialog Systems | 303 |
| Dialogue | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
| Dialogue Corpora | The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
| Dictation Free Software | IPA Japanese Dictation Free Software Project | 261 |
| Dictation Systems | Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
| Dictionaries | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| Dictionary | The Concede Model for Lexical Databases | 335 |
| Dictionary Definitions | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
| Dictionary Formats | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
| Diphone Inventory | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
| Discourse | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
| Discourse Annotation | Annotating Events and Temporal Information in Newswire Texts | 321 |
| Discriminant Function Analysis | Modern Greek Corpus Taxonomy | 351 |
| Distributed Corpora | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
| Distributed Processing | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
| Distribution | Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Document Routing | An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
| Domains | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Dry sound source | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
| DSSSL | Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
| Dutch | Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
| NL-Translex: Machine Translation for Dutch | 294 |
| Dutch (spoken) | The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
| Dynamic Corpora | Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era | 214 |
| EAGLES | Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
| E-J Parallel Corpus | Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts | 78 |
| ELDA | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Electronic Dictionaries | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
| Coping with Lexical Gaps when Building Aligned Multilingual Wordnets | 236 |
| ELRA | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Empirical Methods | Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
| Encyclopedia | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
| End-User Environments | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| English | Building the Croatian-English Parallel Corpus | 119 |
| Morphological Tagging to Resolve Morphological Ambiguities | 277 |
| English and Korean | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
| English Pronunciation | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
| Environment sound recognition | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
| Error-Spotting | On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
| Evaluation | Enhancing the TDT Tracking Evaluation | 5 |
| English Senseval: Report and Results | 8 |
| Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
| IREX: IR & IE Evaluation Project in Japanese | 27 |
| Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
| Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
| Target Suites for Evaluating the Coverage of Text Generators | 92 |
| Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
| The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
| Galaxy-II as an Architecture for Spoken Dialogue Evaluation | 116 |
| Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
| A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
| Evaluating Translation Quality as Input to Product Development | 136 |
| Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
| Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets | 146 |
| Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
| Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
| CLinkA A Coreferential Links Annotator | 179 |
| Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
| The (Un)Deterministic Nature of Morphological Context | 200 |
| ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
| TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
| Evaluation of a Generic Lexical Semantic Resource in Information Extraction | 259 |
| An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research | 278 |
| Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
| Predictive Performance of Dialog Systems | 303 |
| Inter-annotator Agreement for a German Newspaper Corpus | 333 |
| A Platform for Dutch in Human Language Technologies | 348 |
| Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process | 367 |
| Evaluation (predictive) | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
| Evaluation Criteria | Using a Formal Approach to Evaluate Grammars | 285 |
| Evaluation in Written Language Processing | NL-Translex: Machine Translation for Dutch | 294 |
| Evaluation Methodology | Evaluating Multi-party Multi-modal Systems | 368 |
| Evaluation Metrics | Evaluation of Word Alignment Systems | 137 |
| Evaluation of Language Resources and Tools | Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine | 250 |
| Evaluation Protocols | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
| Event Recognition | Tuning Lexicons to New Operational Scenarios | 330 |
| Events | Annotating Events and Temporal Information in Newswire Texts | 321 |
| Exercise(s) | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
| Expanding Entry | Automatically Expansion of Thesaurus Entries with a Different Thesaurus | 142 |
| Extraction | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
| ExTrECC | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
| Feature Structures | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
| Finite State Technologies | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
| Finite State Transducers | A Robust Parser for Unrestricted Greek Text | 174 |
| Finite-state tools | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
| Formal Approach | Using a Formal Approach to Evaluate Grammars | 285 |
| Formal Descriptions | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
| Formalism | Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process | 367 |
| Format Independent | The EUDICO Project, Multi Media Annotation over the Internet | 45 |
| Formation of Company (Motorola) Organization | The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting | 260 |
| Frames | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
| Framework | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
| French Market | For a Repository of NLP Tools | 316 |
| French Phonetic Lexicon | A French Phonetic Lexicon with Variants for Speech and Language Processing | 133 |
| Frequency | Rarity of Words in a Language and in a Corpus | 295 |
| Functional Annotation | Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation | 99 |
| Generation | Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
| Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
| Generative Lexicon | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
| Genericity | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
| Genre | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
| German Dialectal Regions | Regional Pronunciation Variants for Automatic Segmentation | 307 |
| Gold Standard | Evaluation of Word Alignment Systems | 137 |
| Grammar Acquisition | A Treebank of Spanish and its Application to Parsing | 66 |
| Grammar Checking | Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking | 299 |
| Grammars | Using a Formal Approach to Evaluate Grammars | 285 |
| Language Resources Development at the Spanish Royal Academy | 297 |
| An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG | 371 |
| Grammatical coverage | Target Suites for Evaluating the Coverage of Text Generators | 92 |
| Grammatical Data | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
| Grammatical Dictionary | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
| Grammatical Relations | Automatic Assignment of Grammatical Relations | 218 |
| Grammatical Rules | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
| Grammatical Tagging | A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
| Grapheme-Phoneme Conversion | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
| Grapheme-to-Phoneme Conversion | A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
| Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System | 177 |
| Graphic User Interface | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
| Graphical | A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
| Greek | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
| Named Entity Recognition in Greek Texts | 173 |
| A Robust Parser for Unrestricted Greek Text | 174 |
| A Unified POS Tagging Architecture and its Application to Greek | 181 |
| Greek Characters | GRUHD: A Greek database of Unconstrained Handwriting | 289 |
| Greek Language | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
| Design and Implementation of the Online ILSP Greek Corpus | 336 |
| GSM signals | SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
| Haitian Creole | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| Hands-free speech recognition | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
| Hebrew | Creation of Spoken Hebrew Databases | 52 |
| HPSG | An HPSG-Annotated Test Suite for Polish | 272 |
| HTML-to-Text Conversion | A Web-based Text Corpora Development System | 105 |
| Human Assessors | The TREC-8 Question Answering Track | 26 |
| Human Intervention | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
| Human-Computer Interaction | Evaluating Multi-party Multi-modal Systems | 368 |
| Hyperlexicon | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
| Idiom | Collocations as Word Co-ocurrence Restriction Data - An Application to Japanese Word Processor - | 2 |
| Implicit Relations | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
| Impulse response | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
| In-Car Speech Data | Annotation of a Multichannel Noisy Speech Corpus | 358 |
| Induction | Tuning Lexicons to New Operational Scenarios | 330 |
| Industrial Human Language Resource Center | The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting | 260 |
| Inflectional Languages | An Approach to Lexical Development for Inflectional Languages | 256 |
| Information Base | Introduction of KIBS (Korean Information Base System) Project | 239 |
| Information Dissemination | Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Information Extraction | IREX: IR & IE Evaluation Project in Japanese | 27 |
| LT TTT - A Flexible Tokenisation Tool | 93 |
| Coreference Annotation: Whither? | 100 |
| Named Entity Recognition in Greek Texts | 173 |
| A Flexible Infrastructure for Large Monolingual Corpora | 226 |
| Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
| Evaluation of a Generic Lexical Semantic Resource in Information Extraction | 259 |
| Annotating Resources for Information Extraction | 263 |
| Automatically Augmenting Terminological Lexicons from Untagged Text | 320 |
| Annotating Events and Temporal Information in Newswire Texts | 321 |
| Information Retrieval | A Comparison of Summarization Methods Based on Task-based Evaluation | 14 |
| IREX: IR & IE Evaluation Project in Japanese | 27 |
| Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
| Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
| ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
| Information Society | SPEECON - Speech Data for Consumer Devices | 63 |
| In-situ Evaluation | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
| Integration of NLP Tools | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
| Interactive Machine Translation | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
| Inter-Annotator Agreement | Semantic Tagging for the Penn Treebank | 197 |
| Inter-annotator Agreement for a German Newspaper Corpus | 333 |
| Intercoder Reliability | On the Usage of Kappa to Evaluate Agreement on Coding Tasks | 206 |
| Inter-Judge Agreement | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
| Internet | The EUDICO Project, Multi Media Annotation over the Internet | 45 |
| Towards a Standard for Meta-descriptions of Language Resources | 125 |
| Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
| Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era | 214 |
| Design and Implementation of the Online ILSP Greek Corpus | 336 |
| Intonation Labeling | Spontaneous Speech Corpus of Japanese | 262 |
| Intonation | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
| ISDN | SpeechDat-Car Fixed Platform | 331 |
| ISO | Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
| Italian | Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
| Iterative Learning | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
| Japanese | IREX: IR & IE Evaluation Project in Japanese | 27 |
| Spontaneous Speech Corpus of Japanese | 262 |
| Japanese Dictation | IPA Japanese Dictation Free Software Project | 261 |
| Japanese Named Entity Recognition | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
| Japanese Text Analysis | Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
| Java | Cairo: An Alignment Visualization Tool | 58 |
| Annotation of a Multichannel Noisy Speech Corpus | 358 |
| Kana-to-Kanji Conversion | Collocations as Word Co-ocurrence Restriction Data - An Application to Japanese Word Processor - | 2 |
| Knowledge | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
| Terminology in Korea: KORTERM | 276 |
| Knowledge Acquisition | The Treatment of Adjectives in SIMPLE: Theoretical Observations | 366 |
| Knowledge Base | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
| Knowledge Elicitation | Towards A Universal Tool For NLP Resource Acquisition | 28 |
| Knowledge Extraction | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
| Knowledge-Rich NLP | Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
| Korean Pronunciation | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
| Labeling | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
| Labelling | NaniTrans: a Speech Labelling Tool | 345 |
| Language Bank | The Bank of Swedish | 300 |
| Language Corpus | Rarity of Words in a Language and in a Corpus | 295 |
| Language Engineering | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
| Terminology in Korea: KORTERM | 276 |
| Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Developing a Multilingual Telephone Based Information System in African Languages | 329 |
| Language Knowledge | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
| Language Learning | Addizionario: an Interactive Hypermedia Tool for Language Learning | 374 |
| Language Modelling | Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
| Language Processing Resource Reusability | Experiences of Language Engineering Algorithm Reuse | 151 |
| Language Requirements | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Language Resource | Towards A Universal Tool For NLP Resource Acquisition | 28 |
| CDB - A Database of Lexical Collocations | 189 |
| Language Resource Creation | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
| Language Resources | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
| Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
| Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
| Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
| Language Resources Infrastructure | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
| Language Teaching | Spoken Portuguese: Geographic and Social Varieties | 71 |
| Language Variation | Modern Greek Corpus Taxonomy | 351 |
| Language-Independent | A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
| Languages Resources | Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Large-Scale Corpus | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
| Latin America | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
| LE Platforms | Experiences of Language Engineering Algorithm Reuse | 151 |
| Legal Issues | A Platform for Dutch in Human Language Technologies | 348 |
| Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Lemma and Part-of-Speech Based Alignment | Some Technical Aspects about Aligning Near Languages | 186 |
| Lexica | A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
| Lexical Acquisition | Chinese-English Semantic Resource Construction | 327 |
| Tuning Lexicons to New Operational Scenarios | 330 |
| Lexical Database | The Bank of Swedish | 300 |
| The Concede Model for Lexical Databases | 335 |
| Lexical Databases | Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
| Lexical Description | Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
| Lexical Function | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
| Lexical Gaps | Coping with Lexical Gaps when Building Aligned Multilingual Wordnets | 236 |
| Lexical Resources | Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
| MHATLex: Lexical Resources for Modelling the French Pronunciation | 37 |
| ItalWordNet: a Large Semantic Database for Italian | 129 |
| GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
| Lexical Semantics | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
| SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
| Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
| Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
| Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
| The Treatment of Adjectives in SIMPLE: Theoretical Observations | 366 |
| Lexical Tuning | Tuning Lexicons to New Operational Scenarios | 330 |
| Lexical-Semantic Database | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
| Lexicography | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
| Language Resources Development at the Spanish Royal Academy | 297 |
| Lexicon | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
| Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
| Language Resources Development at the Spanish Royal Academy | 297 |
| Tuning Lexicons to New Operational Scenarios | 330 |
| Lexicon Design | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
| Lexicon Implementation | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
| Lexicon Specification | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
| Lexicon-Cum-Corpus | CDB - A Database of Lexical Collocations | 189 |
| Linguistic Annotation | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
| Linguistic Data Acquisition | The Universal XML Organizer: UXO | 253 |
| Linguistic Database | CDB - A Database of Lexical Collocations | 189 |
| Linguistic Resources | Towards a Standard for Meta-descriptions of Language Resources | 125 |
| Linguistic Software | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
| Lingware Development | Using a Formal Approach to Evaluate Grammars | 285 |
| Listening and Understanding | Spoken Portuguese: Geographic and Social Varieties | 71 |
| Live Lexicon | Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era | 214 |
| Local grammars | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
| Log files | Dialogue Annotation for Language Systems Evaluation | 33 |
| Low Bit Rate Speech Compression | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
| LR Domains | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| LR Tools | An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG | 371 |
| LVCSR | IPA Japanese Dictation Free Software Project | 261 |
| LVCSR Workbench | IPA Japanese Dictation Free Software Project | 261 |
| Machine Learning | Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
| Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
| Machine Readable Dictionaries | Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
| Machine Translation | A Word Sense Disambiguation Method Using Bilingual Corpus | 15 |
| Evaluating Translation Quality as Input to Product Development | 136 |
| Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
| An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research | 278 |
| NL-Translex: Machine Translation for Dutch | 294 |
| Chinese-English Semantic Resource Construction | 327 |
| Maintenance | A Platform for Dutch in Human Language Technologies | 348 |
| MARTIF | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
| Masking-and-Matching Technique | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
| Maturity | For a Repository of NLP Tools | 316 |
| MAUS | Regional Pronunciation Variants for Automatic Segmentation | 307 |
| Medical Terms | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
| Membership | Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Meta-Data | Towards a Standard for Meta-descriptions of Language Resources | 125 |
| Method | ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
| Metrics | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
| Evaluating Multi-party Multi-modal Systems | 368 |
| Microphone array | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
| Minimally Supervised Approach | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
| Minority Languages | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
| Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| Modular Programming | An Open Architecture for the Construction and Administration of Corpora | 176 |
| Monolingual Corpora | A Flexible Infrastructure for Large Monolingual Corpora | 226 |
| Monolingual LR | An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG | 371 |
| Morpheme-Based Annotation | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
| Morphological ambiguities | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
| Morphological Analysis | Language Resources Development at the Spanish Royal Academy | 297 |
| Morphological Processing | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
| Morphological tagging | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
| Morphology | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
| A Word-level Morphosyntactic Analyzer for Basque | 44 |
| A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
| Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
| An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
| An Approach to Lexical Development for Inflectional Languages | 256 |
| Morphophonology | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
| Morpho-Syntactic Annotation | A Web-based Text Corpora Development System | 105 |
| Morphosyntax | A Word-level Morphosyntactic Analyzer for Basque | 44 |
| Morphological Annotation vs Tagging | The Context (not only) for Humans | 156 |
| MRD | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
| MUC-7 | CLinkA A Coreferential Links Annotator | 179 |
| Multi-Channel Databases | Annotation of a Multichannel Noisy Speech Corpus | 358 |
| Multi-Criteria Diagnosis | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
| Multidimensional Statistics | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
| Multi-Document Summarization | Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
| Multilingual | SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
| Multi-Lingual | Cross-lingual Interpolation of Speech Recognition Models | 147 |
| Multilingual Alignment | Lexical and Translation Equivalence in Parallel Corpora | 122 |
| Multilingual Corpora | Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
| Multilingual Information Schemes | Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents | 202 |
| Multilingual LR | Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
| Multilingual Terminological Resources | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
| Multilingual Test Collections | The Evaluation of Systems for Cross-language Information Retrieval | 70 |
| Multilingual Tool | CLinkA A Coreferential Links Annotator | 179 |
| Multilingual Wordnets | Coping with Lexical Gaps when Building Aligned Multilingual Wordnets | 236 |
| Multi-Linguality | The COST 249 SpeechDat Multilingual Reference Recogniser | 274 |
| Multimedia | Addizionario: an Interactive Hypermedia Tool for Language Learning | 374 |
| Multi-Media | The EUDICO Project, Multi Media Annotation over the Internet | 45 |
| Multi-Media and Multi-Modal Language Resources | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Named Entity Extraction | Annotating Resources for Information Extraction | 263 |
| Named Entity Recognition | LT TTT - A Flexible Tokenisation Tool | 93 |
| Named Entity Recognition in Greek Texts | 173 |
| Named Entity Tagging | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
| Natural Language Generation | Target Suites for Evaluating the Coverage of Text Generators | 92 |
| Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture | 244 |
| Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
| Natural Language Processing | A Word Sense Disambiguation Method Using Bilingual Corpus | 15 |
| Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
| Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Natural Language Processing and Information Retrieval | Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine | 250 |
| Natural Language Processing Systems | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| Neo-Classical Compounds | Use of Greek and Latin Forms for Term Detection | 55 |
| Neural Networks | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
| Newspapers | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
| NIST | Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
| NLG | Target Suites for Evaluating the Coverage of Text Generators | 92 |
| NLP | Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
| Morphological Tagging to Resolve Morphological Ambiguities | 277 |
| Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
| NLP Applications | For a Repository of NLP Tools | 316 |
| NLP Offer | For a Repository of NLP Tools | 316 |
| NLP Tools | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
| Language Resources Development at the Spanish Royal Academy | 297 |
| NLP-Lexicon | Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
| Non-Linear | The (Un)Deterministic Nature of Morphological Context | 200 |
| Non-Native Speech | The ISLE Corpus of Non-Native Spoken English | 313 |
| Non-Uniform Units | Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System | 177 |
| Objective evaluation | The Cost258 Signal Generation Test Array | 1 |
| Object-Oriented Approach | Object-oriented Access to the Estonian Phonetic Database | 128 |
| OCR | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| OCR Database | GRUHD: A Greek database of Unconstrained Handwriting | 289 |
| On-Line Evaluation | On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
| Ontologies | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| Ontology | Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
| Oral Corpora | Portuguese Corpora at CLUL | 72 |
| Oral Databases | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
| SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
| Organisational Issues | A Platform for Dutch in Human Language Technologies | 348 |
| Orthographic Labeling | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
| Orthographic Transcription | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
| Orthography Conversion | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| Paradigmatic and Semantic Properties of Suffixes | Derivation in the Czech National Corpus | 153 |
| Parallel Corpora | Lexical and Translation Equivalence in Parallel Corpora | 122 |
| Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
| Parallel Corpus | Building the Croatian-English Parallel Corpus | 119 |
| A Parallel Corpus of Italian/German Legal Texts | 140 |
| Parallel Multilingual Evaluation Materials | A Parallel English-Japanese Query Collection for the Evaluation of On-Line Help Systems | 248 |
| Parallel Text | Evaluation of Word Alignment Systems | 137 |
| Parallel Corpora | The Bank of Swedish | 300 |
| Parametric | A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
| PAROLE | Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
| Parsing | A Treebank of Spanish and its Application to Parsing | 66 |
| Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
| An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
| Building a Treebank for French | 230 |
| Parsing Evaluation | Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation | 98 |
| Partial Parsing | PoS Disambiguation and Partial Parsing Bidirectional Interaction | 169 |
| A Robust Parser for Unrestricted Greek Text | 174 |
| Partnership | Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Part-of-Speech Tagging | A Web-based Text Corpora Development System | 105 |
| The Bank of Swedish | 300 |
| Perception | On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
| Perceptual Evaluation | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
| Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
| Performance Variations | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
| Performance Measures | Predictive Performance of Dialog Systems | 303 |
| Phonetic Database | Object-oriented Access to the Estonian Phonetic Database | 128 |
| Phonetic Knowledge | Object-oriented Access to the Estonian Phonetic Database | 128 |
| Phonetics | Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
| Phrasal verbs | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
| Polish | An HPSG-Annotated Test Suite for Polish | 272 |
| Polish Morphology | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
| Polysemy | What's in a Thesaurus? | 180 |
| Portuguese Corpus | Spoken Portuguese: Geographic and Social Varieties | 71 |
| Portuguese Resources | Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
| Portuguese Varieties | Spoken Portuguese: Geographic and Social Varieties | 71 |
| Portuguese Corpora at CLUL | 72 |
| PoS Disambiguation | PoS Disambiguation and Partial Parsing Bidirectional Interaction | 169 |
| POS Tagging | Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
| A Unified POS Tagging Architecture and its Application to Greek | 181 |
| Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
| Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
| POS1 | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
| Predicate-Argument Structure | Semantic Tagging for the Penn Treebank | 197 |
| Priorities | A Platform for Dutch in Human Language Technologies | 348 |
| Privileged co-Occurrence | Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
| Probabilistic Method | Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation | 350 |
| Processing | A Strategy for the Syntactic Parsing of Corpora: from Constraint Grammar Output to Unification-based Processing | 111 |
| Productivity of Suffixes | Derivation in the Czech National Corpus | 153 |
| Project Introduction | Introduction of KIBS (Korean Information Base System) Project | 239 |
| Prompting Strategy | Dialogue and Prompting Strategies Evaluation in the DEMON System | 38 |
| Pronunciation | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
| Pronunciation Dictionary | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
| Pronunciation Lexica Editor | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
| Pronunciation Lexicon | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
| Pronunciation Model | MHATLex: Lexical Resources for Modelling the French Pronunciation | 37 |
| Pronunciation Training | The ISLE Corpus of Non-Native Spoken English | 313 |
| Pronunciation Variants | A French Phonetic Lexicon with Variants for Speech and Language Processing | 133 |
| Prosodic Boundaries | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
| Prosody | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
| On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
| Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
| Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
| Quality Control | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
| Query Language | The MATE Workbench Annotation Tool, a Technical Description | 166 |
| Question Answering | The TREC-8 Question Answering Track | 26 |
| How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done | 203 |
| Question-Answering | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
| Questionnaires | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Rarity | Rarity of Words in a Language and in a Corpus | 295 |
| Rating Scales | Evaluating Translation Quality as Input to Product Development | 136 |
| Reading Comprehension | How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done | 203 |
| Real environments | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
| Recruitment | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
| Reduced Frequency | Rarity of Words in a Language and in a Corpus | 295 |
| Reference Systems | The COST 249 SpeechDat Multilingual Reference Recogniser | 274 |
| Referential Expressions | Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
| Regional Pronunciation Rules | Regional Pronunciation Variants for Automatic Segmentation | 307 |
| Relational Model and Database for Lexical Collocations | CDB - A Database of Lexical Collocations | 189 |
| Research Center | Terminology in Korea: KORTERM | 276 |
| Resource Discovery | Towards a Standard for Meta-descriptions of Language Resources | 125 |
| Resources | SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
| Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
| Reusability as Easy Adaptability: A Substantial Advance in NL Technology | 298 |
| Reusability | Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture | 244 |
| Reusability as Easy Adaptability: A Substantial Advance in NL Technology | 298 |
| Reuse | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
| Reuse of Resources | Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
| Reusing LR | Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
| Lexical Semantic Networks | ItalWordNet: a Large Semantic Database for Italian | 129 |
| Robust Parsing | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
| Rule-Based | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
| Rule-Based Morphology | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
| Rule-Based Stemming | GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
| SALT | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
| Satellite-Framed Language | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
| Scheme-Scripting | The Universal XML Organizer: UXO | 253 |
| Scientific and Cultural Research | Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
| Segmentation | Annotation of a Multichannel Noisy Speech Corpus | 358 |
| Segmentation Tools | SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
| Semantic Classification | Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation | 99 |
| Semantic Lexicon | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
| Semantic Lexicons | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
| Semantic Network | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
| Semantic Resource Construction | Chinese-English Semantic Resource Construction | 327 |
| Semantic Similarity | What's in a Thesaurus? | 180 |
| Semantic Tagging | An Experiment of Lexical-Semantic Tagging of an Italian Corpus | 60 |
| Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
| Semantics | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| Semitic Language SR | Creation of Spoken Hebrew Databases | 52 |
| Sense Discrimination | Lexical and Translation Equivalence in Parallel Corpora | 122 |
| Sense Distinctions | Semantic Tagging for the Penn Treebank | 197 |
| Sense Tagging | An Experiment of Lexical-Semantic Tagging of an Italian Corpus | 60 |
| SENSEVAL | English Senseval: Report and Results | 8 |
| Sentence Alignment | Some Technical Aspects about Aligning Near Languages | 186 |
| Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora | 208 |
| Sentence Extraction | Term-based Identification of Sentences for Text Summarisation | 106 |
| Server | The Cost258 Signal Generation Test Array | 1 |
| SGML | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
| A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
| Shallow Parsing | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
| Building a Treebank for French | 230 |
| Shallow Processing | Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking | 299 |
| Signal-Level Annotation | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
| Similarity Measure | Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process | 367 |
| SIMPLE | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
| SIMPLE Computational Lexicon | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
| Size-Limited Resources | Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation | 350 |
| SLDSs Evaluation Procedures | Dialogue Annotation for Language Systems Evaluation | 33 |
| Slovene Language | Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets | 146 |
| Software Architecture | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
| SORIDA | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
| Sound Scene | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
| Spanish | A Treebank of Spanish and its Application to Parsing | 66 |
| Language Resources Development at the Spanish Royal Academy | 297 |
| Spanish and Portuguese | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
| Speaker Recognition | Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
| Speech | The Cost258 Signal Generation Test Array | 1 |
| Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
| Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
| Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
| Speech Act | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
| Speech Corpora | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
| Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
| A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
| Speech Corpus | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
| Speech Corpus Annotation | The ISLE Corpus of Non-Native Spoken English | 313 |
| Speech Corpus Annotation Tool | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
| Speech Corpus Design | The ISLE Corpus of Non-Native Spoken English | 313 |
| Speech Data Annotation | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
| Speech Database | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
| Speech Databases | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
| Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
| Developing a Multilingual Telephone Based Information System in African Languages | 329 |
| SpeechDat-Car Fixed Platform | 331 |
| COCOSDA - a Progress Report | 364 |
| Speech Dictation Task | Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
| Speech Generation | Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
| Speech Input/Output | COCOSDA - a Progress Report | 364 |
| Speech Lexica | Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
| Speech Processing | Object-oriented Access to the Estonian Phonetic Database | 128 |
| Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Speech Recognition | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
| Creation of Spoken Hebrew Databases | 52 |
| SPEECON - Speech Data for Consumer Devices | 63 |
| Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
| Cross-lingual Interpolation of Speech Recognition Models | 147 |
| Spontaneous Speech Corpus of Japanese | 262 |
| The ISLE Corpus of Non-Native Spoken English | 313 |
| Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
| SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
| Speech Synthesis | Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
| Speech Transcription Tool | Transcribing with Annotation Graphs | 192 |
| Speech Understanding | Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
| SpeechDat | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
| SpeechDat-Car | SpeechDat-Car Fixed Platform | 331 |
| Speech-to-Speech Machine Translation | Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation | 215 |
| Spoken Commands | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
| Spoken Corpora | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
| Spoken Corpus | Language Resources Development at the Spanish Royal Academy | 297 |
| Spoken Database | Creation of Spoken Hebrew Databases | 52 |
| Spoken Dialog Systems | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
| Spoken Dialogue Architectures | Galaxy-II as an Architecture for Spoken Dialogue Evaluation | 116 |
| Spoken Dialogue Systems | Dialogue and Prompting Strategies Evaluation in the DEMON System | 38 |
| On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
| Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
| Galaxy-II as an Architecture for Spoken Dialogue Evaluation | 116 |
| A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
| Spoken Dutch | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
| Spoken Language Corpora | The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
| Spoken Language Dialogue Systems | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
| Spoken Language Resources | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
| Annotating Communication Problems Using the MATE Workbench | 134 |
| Spoken Language Understanding | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
| Spoken Language Varieties | SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
| Spoken Portuguese | Spoken Portuguese: Geographic and Social Varieties | 71 |
| Spontaneous Speech | What are Transcription Errors and Why are They made? | 205 |
| The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
| Spontaneous Speech Corpus of Japanese | 262 |
| Standardisation | Annotating Communication Problems Using the MATE Workbench | 134 |
| COCOSDA - a Progress Report | 364 |
| Standardization | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
| Towards a Standard for Meta-descriptions of Language Resources | 125 |
| Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| Standards | The American National Corpus: A Standardized Resource for American English | 196 |
| Standards and Tools | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
| Statistical Language Model | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
| Statistical Linguistics | Modern Greek Corpus Taxonomy | 351 |
| Statistical Machine Translation | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
| Cairo: An Alignment Visualization Tool | 58 |
| Statistical NLP | Term-based Identification of Sentences for Text Summarisation | 106 |
| Statistics | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Stress | Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
| Structure Driven Processing | Interarbora and Thistle - Delivering Linguistic Structure by the Internet | 319 |
| Stylesheet | The MATE Workbench Annotation Tool, a Technical Description | 166 |
| Stylistic Analysis | Modern Greek Corpus Taxonomy | 351 |
| Subband Approach | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
| Subcategorization | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
| Automatic Assignment of Grammatical Relations | 218 |
| Subject Field Codes | Integrating Subject Field Codes into WordNet | 219 |
| Sublanguages | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
| ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
| Success Rate | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
| Surface Realization | Target Suites for Evaluating the Coverage of Text Generators | 92 |
| Surveys | For a Repository of NLP Tools | 316 |
| Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Recent Developments within the European Language Resources Association (ELRA) | 377 |
| SVD | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
| Syntactic Analysis | A Strategy for the Syntactic Parsing of Corpora: from Constraint Grammar Output to Unification-based Processing | 111 |
| Syntactic and Semantic Constraints | Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents | 202 |
| Syntactic Annotation of Corpora | Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation | 98 |
| Syntactic Normalization | An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
| Syntactic Patterns | Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
| Syntax | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
| Coreference in Annotating a Large Corpus | 19 |
| Towards A Universal Tool For NLP Resource Acquisition | 28 |
| An XML-based Representation Format for Syntactically Annotated Corpora | 59 |
| A Treebank of Spanish and its Application to Parsing | 66 |
| Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
| Syntax-Semantics Linking | SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
| Synthesizer | Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
| System Evaluation Methodologies | The Evaluation of Systems for Cross-language Information Retrieval | 70 |
| Tag Set for Chinese | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
| Tagged Corpus | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
| Tagger Evaluation | Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
| Tagging | Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets | 146 |
| Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
| Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
| Building a Treebank for French | 230 |
| TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
| Tagset | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
| Tagset Design | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
| Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
| Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
| Tagset Reduction | Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
| Target Suites | Target Suites for Evaluating the Coverage of Text Generators | 92 |
| Task-Based Evaluation | A Comparison of Summarization Methods Based on Task-based Evaluation | 14 |
| Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation | 215 |
| TDT | Enhancing the TDT Tracking Evaluation | 5 |
| TEI | The Concede Model for Lexical Databases | 335 |
| TEI-Conformant Feature Structures | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
| Telephone Speech | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
| Telephony Applications | Creation of Spoken Hebrew Databases | 52 |
| Telephony Speech Databases | The COST 249 SpeechDat Multilingual Reference Recogniser | 274 |
| Temporal Information | Annotating Events and Temporal Information in Newswire Texts | 321 |
| Term Detection | Use of Greek and Latin Forms for Term Detection | 55 |
| Term Extraction | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
| Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora | 208 |
| Terminological Resources | Term-based Identification of Sentences for Text Summarisation | 106 |
| Terminology | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
| Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
| Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
| Use of Greek and Latin Forms for Term Detection | 55 |
| Terminology in Korea: KORTERM | 276 |
| Terminology Extraction | ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
| Terminology Lexicons | Automatically Augmenting Terminological Lexicons from Untagged Text | 320 |
| Terminology Resource | Terminology in Korea: KORTERM | 276 |
| Test Suite | An HPSG-Annotated Test Suite for Polish | 272 |
| Test Suites | Target Suites for Evaluating the Coverage of Text Generators | 92 |
| Text | Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
| Text Analysis | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
| Text Categorization | Modern Greek Corpus Taxonomy | 351 |
| Text Classification | Integrating Subject Field Codes into WordNet | 219 |
| Text Corpora | A Web-based Text Corpora Development System | 105 |
| Text Processing | A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
| Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System | 177 |
| Text Summarisation | Term-based Identification of Sentences for Text Summarisation | 106 |
| Text Technology | Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
| Text Typology | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
| Text Understanding | Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
| Text-Handling Tasks | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
| Text-Independence | Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
| Text-To-Speech | Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
| Textual Typology | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
| Thematic Roles | Chinese-English Semantic Resource Construction | 327 |
| Thesaurus | Automatically Expansion of Thesaurus Entries with a Different Thesaurus | 142 |
| What's in a Thesaurus? | 180 |
| Tiered Tagging | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
| Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
| ToBI | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
| Tokenisation | LT TTT - A Flexible Tokenisation Tool | 93 |
| Tool Directory | For a Repository of NLP Tools | 316 |
| Tool Typology | For a Repository of NLP Tools | 316 |
| Tools | Portuguese Corpora at CLUL | 72 |
| Annotating Communication Problems Using the MATE Workbench | 134 |
| Reusability as Easy Adaptability: A Substantial Advance in NL Technology | 298 |
| Tools for Grammar Evaluation | An HPSG-Annotated Test Suite for Polish | 272 |
| Topic Detection | Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
| Topic Detection and Tracking | Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
| Topic Tracking | Enhancing the TDT Tracking Evaluation | 5 |
| Topic(s) of Interest | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
| Traditional Dictionaries | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
| Trained Systems | Annotating Resources for Information Extraction | 263 |
| Training Data | Semantic Tagging for the Penn Treebank | 197 |
| Transcription | What are Transcription Errors and Why are They made? | 205 |
| Transcription Tool | The Universal XML Organizer: UXO | 253 |
| Transformation Based Learning | A Unified POS Tagging Architecture and its Application to Greek | 181 |
| Translation Equivalents | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
| Translation Memory | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
| Translation Quality | Evaluating Translation Quality as Input to Product Development | 136 |
| Translation Resource | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
| Translation Studies | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
| Translational English | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
| Tree Bank | Introduction of KIBS (Korean Information Base System) Project | 239 |
| Tree-Annotated Corpus | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
| Treebank | A Treebank of Spanish and its Application to Parsing | 66 |
| Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
| Building a Treebank for French | 230 |
| Turn-Taking | The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
| Unconstrained Speech Recognition | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
| Unconstrained Writing | GRUHD: A Greek database of Unconstrained Handwriting | 289 |
| Up-Translation | The Concede Model for Lexical Databases | 335 |
| Usability | Dialogue and Prompting Strategies Evaluation in the DEMON System | 38 |
| Evaluating Multi-party Multi-modal Systems | 368 |
| Usage | For a Repository of NLP Tools | 316 |
| User Friendly Search System | A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
| User Interface | CLinkA A Coreferential Links Annotator | 179 |
| User Needs | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
| Users | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
| Valency | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
| Validation | The TREC-8 Question Answering Track | 26 |
| Recent Developments within the European Language Resources Association (ELRA) | 377 |
| Variants | Regional Pronunciation Variants for Automatic Segmentation | 307 |
| Variation | The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
| Verb | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
| Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
| Verb Classification | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
| Vernacular Languages | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
| Visualization | Cairo: An Alignment Visualization Tool | 58 |
| Interarbora and Thistle - Delivering Linguistic Structure by the Internet | 319 |
| Web | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
| Web Application | Design and Implementation of the Online ILSP Greek Corpus | 336 |
| Web Interface | A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
| Web Interfaces | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
| Web Search | A Flexible Infrastructure for Large Monolingual Corpora | 226 |
| Web-Based Concordance | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
| Welsh | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
| Wizard of Oz Simulation | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
| Word Accent | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
| Word Alignment | Cairo: An Alignment Visualization Tool | 58 |
| Evaluation of Word Alignment Systems | 137 |
| Word Clustering | Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
| Word Combinations | Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
| Word Formation | Application of WordNet ILR in Czech Word-formation | 223 |
| Word Formation Rules | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
| Word Segmentation | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
| Word Segmentation and Part-of-Speech Tagging | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
| Word Sense Disambiguation | English Senseval: Report and Results | 8 |
| Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
| Tuning Lexicons to New Operational Scenarios | 330 |
| Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation | 350 |
| Word Sense Tagging | A Word Sense Disambiguation Method Using Bilingual Corpus | 15 |
| WordNet | Integrating Subject Field Codes into WordNet | 219 |
| Application of WordNet ILR in Czech Word-formation | 223 |
| Evaluation of a Generic Lexical Semantic Resource in Information Extraction | 259 |
| Word-Sense Disambiguation | An Experiment of Lexical-Semantic Tagging of an Italian Corpus | 60 |
| World Wide Web | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
| Written Corpora | Portuguese Corpora at CLUL | 72 |
| Written Language Resources | NL-Translex: Machine Translation for Dutch | 294 |
| XCES | Building the Croatian-English Parallel Corpus | 119 |
| xfst | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
| XML | An XML-based Representation Format for Syntactically Annotated Corpora | 59 |
| Building the Croatian-English Parallel Corpus | 119 |
| A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
| The MATE Workbench Annotation Tool, a Technical Description | 166 |
| XCES: An XML-based Encoding Standard for Linguistic Corpora | 172 |
| A Unified POS Tagging Architecture and its Application to Greek | 181 |
| Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
| A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
| The Concede Model for Lexical Databases | 335 |
| XML Mark-Up | LT TTT - A Flexible Tokenisation Tool | 93 |
| XML Representation | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
| XML-Editor | The Universal XML Organizer: UXO | 253 |