KEYWORD: Browse articles of the conference sorted by keyword

 

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X

Keyword title
A  
Abbreviation Clustering acronyms in biomedical text for disambiguation
Accent restoration A Grapheme-Based Approach for Accent Restoration in Gikuyu
Access by meaning Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
Acquisition Building a network of topical relations from a corpus

Next Generation Language Resources using Grid
Acronym Clustering acronyms in biomedical text for disambiguation
Acronym recognition Towards a terminological resource for biomedical text mining
Acronyms and definitions Recognizing Acronyms and their Definitions in Swedish Medical Texts
Act-topic annotation Act-Topic Patterns for Automatically Checking Dialogue Models
Actantial structure A Methodology for Developing Multilingual Resources for Terminology
Adaptation Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
Adequacy CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality
Adjudication A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Adverbs LEXADV - a multilingual semantic Lexicon for Adverbs
Agglutinative Morphological annotation of Korean with Directly Maintainable Resources
Agreement Annotating Emotions in Meetings
Alignment Aligning Multilingual Thesauri

Czech-English Word Alignment
Alternation Valency Lexicon of Czech Verbs: Alternation-Based Model
Anagram-key spelling correction Corpus-Induced Corpus Clean-up
Analysis A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Anaphora Coreference Resolution with and without Linguistic Knowledge

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
And DUC. Automated Summarization Evaluation with Basic Elements.
Annotated corpora HAREM: An Advanced NER Evaluation Contest for Portuguese
Annotation A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

An Annotated Corpus Management Tool: ChaKi

An Introduction to NLP-based Textual Anonymisation

Annotating Emotions in Meetings

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

H. C. Andersen Conversation Corpus

KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

Lexicon Development for Varieties of Spoken Colloquial Arabic

Linguistic Resources for Speech Parsing

Morphological annotation of Korean with Directly Maintainable Resources

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

NPs for Events: Experiments in Coreference Annotation

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

The MULINCO corpus and corpus platform

UAM Text Tools - a flexible NLP architecture

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Annotation of Temporal Relations with Tango
Annotation evaluation Fear-type emotions of the SAFE Corpus: annotation issues
Annotation guidelines An Annotated Corpus of Typical Durations of Events

NPs for Events: Experiments in Coreference Annotation
Annotating COMPARA, a Grammar-aware Parallel Corpus
Annotation mining A BLARK extension for temporal annotation mining

Discourse functions of duration in Mandarin: resource design and implementation
Annotation of emotion Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
Annotation scheme Annotating Bridging Anaphors in Italian: in Search of Reliability

NPs for Events: Experiments in Coreference Annotation
Annotation tool Dependency-structure Annotation to Corpus of Spontaneous Japanese
Anonymisation An Introduction to NLP-based Textual Anonymisation
Answer selection Exploiting Multiple Semantic Resources for Answer Selection
Answer Time-Bank An Answer Bank for Temporal Inference
Appraisal Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
Arabic Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic

Parallel Syntactic Annotation of Multiple Languages

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic
Argument structure Romanian Valence Dictionary in XML Format

Valency Lexicon of Czech Verbs: Alternation-Based Model
Artificial intelligence MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
Asia Oriental COCOSDA: Past, Present and Future
Augmentation of lexicons Moving to dynamic computational lexicons with LeXFlow
Augmentative and Alternative Communication Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
Automatic alignment Building a Swedish-Turkish Parallel Corpus
Automatic generation of web portals Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
Automatic lexical acquisition A Large Subcategorization Lexicon for Natural Language Processing Applications
Automatic MT evaluation CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality
Automatic phonetic transcription Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
Automatic segmentation Creation and analysis of a Polish speech database for use in unit selection synthesis
Automatic Speech Recognition (ASR) Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign
Automatic Text Summarization Towards Holistic Summarization -- Selecting Summaries, Not Sentences

Computer-aided summarisation -- what the user really wants
Average reduced frequency New Approach to Frequency Dictionaries - Czech Example
B  
Back translation Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Basic Elements Automated Summarization Evaluation with Basic Elements.
Summarizing Answers for Complicated Questions
Bilingual Data-driven Amharic-English Bilingual Lexicon Acquisition
Bilingual speech corpus in two phonetically similar languages
Bilingual Machine-Aided Indexing
Bilingual corpora Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Bilingual lexicon acquisition Automatic Detection of Orthographic Cues for Cognate Recognition
Bio-NLP Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
Bioinformatics Linguistic and Biological Annotations of Biological Interaction Events
Biomedical corpora The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Biomedical literature Clustering acronyms in biomedical text for disambiguation
Biomedicine Towards a terminological resource for biomedical text mining
Bitext Building a Swedish-Turkish Parallel Corpus
BLARK A BLARK extension for temporal annotation mining
BLAST A BLARK extension for temporal annotation mining
Blind Evaluation Blind Evaluation for Thai Search Engines
Bracket scores SParseval: Evaluation Metrics for Parsing Speech
Bridging Annotating Bridging Anaphors in Italian: in Search of Reliability
Broadcast news Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

SINOD - Slovenian non-native speech database
C  
Case frame A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Case Frame Compilation from the Web using High-Performance Computing
Catalan Bilingual speech corpus in two phonetically similar languages
Categorial ambiguity Annotating COMPARA, a Grammar-aware Parallel Corpus
Centering PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
CESTA evaluation Creating a Large-Scale Arabic to French Statistical MachineTranslation System
CGN JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Character semantics Feature-based Encoding and Querying Language Resources with Character Semantics
Chat language Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
Chemical compounds Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Children JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Chinese collocation The Design and Construction of A Chinese Collocation Bank
Chinese Segmentation Towards Unified Chinese Segmentation Algorithm
Chinese Sketch Engine Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Chinese thesaurus Toward a Pan-Chinese Thesaurus
Chunking Rule-Based Chunking and Reusability
Class-based language models Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
Classification Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
CLEF The Multilingual Question Answering Track at CLEF
CLIR Extraction of Cross Language Term Correspondences
Clustering Clustering acronyms in biomedical text for disambiguation

Finding representative sets of dialect words for geographical regions
Co-occurence BACO - A large database of text and co-occurrences

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
COCOSDA Oriental COCOSDA: Past, Present and Future
Cognate detection Automatic Detection of Orthographic Cues for Cognate Recognition
Cognitive Theory of Metaphor The Italian Metaphor Database
Collocation bank The Design and Construction of A Chinese Collocation Bank
Collocation extraction Extraction tools for collocations and their morphosyntactic specificities

Semi-automatic Building of Swedish Collocation Lexicon
Collocations COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Extraction tools for collocations and their morphosyntactic specificities
Comparability and enrichment Exploring opportunities for Comparability and Enrichment by Linking lexical databases
Compounds Query Expansion on Compounds
Computational Lexicography Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Computational dialectology Dialectal resources on-line: the ALT-Web experience
Computational lexicon The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
A Large Subcategorization Lexicon for Natural Language Processing Applications
Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons
Computational morphology Morphological Tools for Six Small Uralic Languages
Computational Semantics Towards Holistic Summarization -- Selecting Summaries, Not Sentences
Computational Terminology Extraction of Cross Language Term Correspondences
Computer-aided summarization Computer-aided summarisation -- what the user really wants
Computer-assisted language learning Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Concept Set Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
Conceptual Interlingua Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Conceptual markers Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Conceptual Metaphor The Italian Metaphor Database
Conceptual relations Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Conceptual Vectors Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
Confidence measure Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Confusables Corpus-Induced Corpus Clean-up
Conjunction Disambiguation Named Entity Extraction with Conjunction Disambiguation
Constraint Grammar Turning a Dependency Treebank into a PSG-style Constituent Treebank
Construction of lexical resources Dialectal resources on-line: the ALT-Web experience
Content selection Learning Database Content for Spoken Dialogue System Design
Context-based evaluation A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
Context-independent spelling correction A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications
Contributing factor A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Controlled language Predicting MT Quality as a Function of the Source Language
Conversational speech Linguistic Resources for Speech Parsing
Conversational telephone speech The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Conversion From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Cooperative answers Language Challenges for Data Fusion in Question-Answering
CoreLex Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Coreference Coreference Resolution with and without Linguistic Knowledge

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Towards Natural Interactive Question Answering
Corpora A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German

Development of the First LRs for Macedonian: Current Projects

EQueR: the French Evaluation campaign of Question-Answering Systems

I-CAB: the Italian Content Annotation Bank

Oriental COCOSDA: Past, Present and Future

SALTO - A Versatile Multi-Level Annotation Tool

Structure, Annotation and Tools in the Basque ZT Corpus

The Multilingual Question Answering Track at CLEF
Corpus An Annotated Corpus Management Tool: ChaKi

An Annotated Corpus of Typical Durations of Events

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

Case Frame Compilation from the Web using High-Performance Computing

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Discourse functions of duration in Mandarin: resource design and implementation

Evaluation of multimodal components within CHIL: The evaluation packages and results

Extraction tools for collocations and their morphosyntactic specificities

Gathering a corpus of multimodal computer-mediated meetings

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

The MULINCO corpus and corpus platform

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Corpus acquisition and preprocessing Creating a Large-Scale Arabic to French Statistical MachineTranslation System
Corpus analysis tools FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Corpus annotation A methodology for the joint development of the Basque WordNet and Semcor

Annotating COMPARA, a Grammar-aware Parallel Corpus

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Corpus Annotation as a Test of a Linguistic Theory
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Corpus annotation tool A new approach to syntactic annotation
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
Corpus Based Synthesis Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Corpus construction KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
Corpus design User requirements analysis for the design of a reference corpus of written Dutch
Corpus development A Uniform Interface to Large-Scale Linguistic Resources

Linguistic Resources for Speech Parsing
Corpus processing UAM Text Tools - a flexible NLP architecture
Corpus query interface A Uniform Interface to Large-Scale Linguistic Resources
Corpus search A Corpus Search System Utilizing Lexical Dependency Structure

Annotating COMPARA, a Grammar-aware Parallel Corpus
Corpus size Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Corpus Spoken Dutch JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Corpus-cleanup Corpus-Induced Corpus Clean-up
Corpus-induced Corpus-Induced Corpus Clean-up
Correference Transferring Coreference Chains through Word Alignment
Correlation Automated Summarization Evaluation with Basic Elements.
Cross-language information retrieval The Impact of Evaluation on Multilingual Information Retrieval System Development
Cross-level queries Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
Cross-lingual and trans-lingual techniques The pragmatic combination of different crosslingual resources
Cross-lingual information retrieval A Deep Linguistic Analysis for Cross-language Information Retrieval

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Crosslingual Bilingual Machine-Aided Indexing
Czech Towards a Slovene Dependency Treebank
D  
Danish LEXADV - a multilingual semantic Lexicon for Adverbs

Query Expansion on Compounds
Data centers More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Integrated Linguistic Resources for Language Exploitation Technologies
Data collection Data for question answering: The case of why
Data sparsity A Closer Look at Skip-gram Modelling
Data-driven methods MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Database On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Spanish Synthesis Corpora
Database creation Creation and analysis of a Polish speech database for use in unit selection synthesis
Database Design Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Date integration Language Challenges for Data Fusion in Question-Answering
Decision points An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Decoder MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
Deep lexical acquisition Automated Deep Lexical Acquisition for Robust Open Texts Processing
Deep parsing A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

Preprocessing and Tokenisation Standards in DELPH-IN Tools
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Dependency grammar Towards a Slovene Dependency Treebank
Turning a Dependency Treebank into a PSG-style Constituent Treebank
Dependency parsing MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Dependency-Based Phrase Alignment
Dependency relations Corpus Annotation as a Test of a Linguistic Theory
Dependency scores SParseval: Evaluation Metrics for Parsing Speech
Dependency structure A Corpus Search System Utilizing Lexical Dependency Structure

An Annotated Corpus Management Tool: ChaKi

Dependency-structure Annotation to Corpus of Spontaneous Japanese
Dependency tree From Natural Language to Databases via Ontologies
Diacritic placement A Grapheme-Based Approach for Accent Restoration in Gikuyu
Diagrams Semantic Descriptors: The Case of Reflexive Verbs
Dialect A Unified Structure for Dutch Dialect Dictionary Data

Finding representative sets of dialect words for geographical regions
Dialectal Arabic Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic
Dialectal databases Dialectal resources on-line: the ALT-Web experience
Dialectal Treebank Developing and Using a Pilot Dialectal Arabic Treebank
Dialectometry Finding representative sets of dialect words for geographical regions
Dialog The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
Dialogue Towards Natural Interactive Question Answering
Dialogue annotation TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
Dialogue model Act-Topic Patterns for Automatically Checking Dialogue Models
Dialogue system user speech Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
Dialogue systems Learning Database Content for Spoken Dialogue System Design
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Dictionary A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

A Unified Structure for Dutch Dialect Dictionary Data

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Dictionary creation Developing and Using a Pilot Dialectal Arabic Treebank
Dictionary interfaces Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
Dictionary model A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
Digital libraries Natural Language Processing: A Terminological and Statistical Approach
Dimensions Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
Disambiguation On Automatic Assignment of Verb Valency Frames in Czech
Discourse phenomena Discourse functions of duration in Mandarin: resource design and implementation
Disfluencies Linguistic Resources for Speech Parsing
Disparate languages Data-driven Amharic-English Bilingual Lexicon Acquisition
Distillation Integrated Linguistic Resources for Language Exploitation Technologies
Distributed databases The Information Commons Gazetteer
Distributed index The Information Commons Gazetteer
Doctor-patient dialogue Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Document categorization Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods
DOLCE A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Domain ontology A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Using Core Ontology for Domain Lexicon Structuring
Domain specific knowledge Structuring a Domain Vocabulary in a General Knowledge Environment
Domain-specific corpora Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
Dutch User requirements analysis for the design of a reference corpus of written Dutch
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Dynamic lexicons Moving to dynamic computational lexicons with LeXFlow
E  
ECESS ECESS Inter-Module Interface Specification for Speech Synthesis
EContent users Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Editing Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Elderly JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Electronic dictionaries Development of the First LRs for Macedonian: Current Projects
Ellipsis Towards Natural Interactive Question Answering
EMMA A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Emotion Annotating Emotions in Meetings

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Emotion recognition Improving Automatic Emotion Recognition from Speech via Gender Differentiaion
Emotional database Fear-type emotions of the SAFE Corpus: annotation issues
Emotional valence Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
EngValLex From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
English Data-driven Amharic-English Bilingual Lexicon Acquisition

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Parallel Syntactic Annotation of Multiple Languages

The English-Slovene ACQUIS corpus
English inclusions Integrating Language Knowledge Resources to Extend the English Inclusion
EQUER Question Answering Evaluation Survey
Error mining Automated Deep Lexical Acquisition for Robust Open Texts Processing
EU legislation The English-Slovene ACQUIS corpus
Eurovoc Aligning Multilingual Thesauri
Evaluation A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

A joint prosody evaluation of French text-to-speech synthesis systems

Act-Topic Patterns for Automatically Checking Dialogue Models

Annotating Bridging Anaphors in Italian: in Search of Reliability

Automated Summarization Evaluation with Basic Elements.

Automatic Evaluation and Composition of NLP Pipelines with Web Services

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Czech-English Word Alignment

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

EQueR: the French Evaluation campaign of Question-Answering Systems

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

Evaluation of multilingual text alignment systems: the ARCADE II project

Evaluation of multimodal components within CHIL: The evaluation packages and results

HAREM: An Advanced NER Evaluation Contest for Portuguese

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

Predicting MT Quality as a Function of the Source Language

Question Answering Evaluation Survey

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

The Multilingual Question Answering Track at CLEF
Evaluation contest HAREM: An Advanced NER Evaluation Contest for Portuguese
Evaluation programs Integrated Linguistic Resources for Language Exploitation Technologies
Evaluation resources HAREM: An Advanced NER Evaluation Contest for Portuguese
Evaluation tools A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
Event annotation Adding multi-layer semantics to the Greek Dependency Treebank
Event coreference NPs for Events: Experiments in Coreference Annotation
Event durations An Annotated Corpus of Typical Durations of Events
Event taxonomy Adding multi-layer semantics to the Greek Dependency Treebank
Extraction from Spoken Text. In Proceeding of LREC 2004. Paris Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
Extractive Summarization Multilingual Multidocument Summarization Tools and Evaluation
F  
Fear detection Fear-type emotions of the SAFE Corpus: annotation issues
Feature extraction Improving Automatic Emotion Recognition from Speech via Gender Differentiaion
Feature structures Feature-based Encoding and Querying Language Resources with Character Semantics
Features Linguistic features modeling based on Partial New Cache
Features-Cache model Linguistic features modeling based on Partial New Cache
FGD Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
Figurative Language Proper Names and Linguistic Dynamics
Filed Recording SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Filtering data Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Finite-state transducers The importance of precise tokenizing for deep grammars
Fluency CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality
Focus Annotating Information Structure in a Corpus of Spoken Danish
Forensic The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Frame On Automatic Assignment of Verb Valency Frames in Czech
FrameNet Construction of a FrameNet Labeler for Swedish Text

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Framework GAIA: Common Framework for the Development of Speech Translation Technologies
France: ELRA Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
Free software FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
French Integrating Language Knowledge Resources to Extend the English Inclusion

Parallel Syntactic Annotation of Multiple Languages

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
French Question Answering Campaign Question Answering Evaluation Survey
Frequency FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

New Approach to Frequency Dictionaries - Czech Example
Functional Generative Description From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Fusion The Information Commons Gazetteer
G  
GALE Integrated Linguistic Resources for Language Exploitation Technologies
Gazetteer The Information Commons Gazetteer
Gender detection from speech Improving Automatic Emotion Recognition from Speech via Gender Differentiaion
General Lexical Resources The Italian Metaphor Database
Generation A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
German Integrating Language Knowledge Resources to Extend the English Inclusion

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

The importance of precise tokenizing for deep grammars
German parsers Improving coverage and parsing quality of a large-scale LFG for German
Gold standard An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Grammar A Lexicalized Tree-Adjoining Grammar for Vietnamese
Granularity Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Grapheme to phoneme conversion Creation and analysis of a Polish speech database for use in unit selection synthesis
Grapheme-based approach A Grapheme-Based Approach for Accent Restoration in Gikuyu
Greedy algorithm Tools and resources for speech synthesis arising from a Welsh TTS project
Grid Next Generation Language Resources using Grid
H  
Hand-crafted grammars Improving coverage and parsing quality of a large-scale LFG for German

The importance of precise tokenizing for deep grammars
Head gesture Tangible Objects for the Acquisition of Multimodal Interaction Patterns
Hebrew A Computational Lexicon of Contemporary Hebrew
Hesitation Discourse functions of duration in Mandarin: resource design and implementation
Heterogeneous resources Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
High-performance computing Case Frame Compilation from the Web using High-Performance Computing
Hindi Parallel Syntactic Annotation of Multiple Languages
Historical corpus Building a historical corpus for Classical Portuguese: some technological aspects
Human associations Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
Human error A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Human-computer communication Creation and analysis of a Polish speech database for use in unit selection synthesis
Human-machine interaction JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
I  
ICSI Meeting Recorder Project Part-of-Speech Tagging of Transcribed Speech
Idioms Extraction tools for collocations and their morphosyntactic specificities
Image processing Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Inconsistencies The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Information extraction COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Constructing a Named Entity Ontology from Web Corpora

I-CAB: the Italian Content Annotation Bank

Named Entity Extraction with Conjunction Disambiguation

Linguistic and Biological Annotations of Biological Interaction Events

Ontology-based Information Extraction with SOBA
Information commons The Information Commons Gazetteer
Information retrieval Query Expansion on Compounds

Development of Linguistic Ontology on Natural Sciences and Technology
Information structure Annotating Information Structure in a Corpus of Spoken Danish

Corpus Annotation as a Test of a Linguistic Theory
Information theory Evaluation of Stop Word Lists in Chinese Language
Infrastructure Automatic Evaluation and Composition of NLP Pipelines with Web Services

A Unified Structure for Dutch Dialect Dictionary Data
Integration GAIA: Common Framework for the Development of Speech Translation Technologies
Intelligibility Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Inter-annotator agreement An Annotated Corpus of Typical Durations of Events
Interaction Towards Natural Interactive Question Answering
Interface Computer-aided summarisation -- what the user really wants

ECESS Inter-Module Interface Specification for Speech Synthesis
Interlanguages “Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
Internationalization Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
Internet A Uniform Interface to Large-Scale Linguistic Resources
Internet search Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community
ISO-DIS-24610-1 Feature-based Encoding and Querying Language Resources with Character Semantics
J  
Japanese Parallel Syntactic Annotation of Multiple Languages
K  
Knowledge Engineering Ontology Driven K-Portal Construction and K-Service Provision
Knowledge extraction Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Knowledge Modeling A Framework to Integrate Ubiquitous Knowledge Modeling
Knowledge Portal Ontology Driven K-Portal Construction and K-Service Provision
Knowledge representation The LOIS Project
Knowledge Services Provision Ontology Driven K-Portal Construction and K-Service Provision
Knowledge sharing A new approach to syntactic annotation
Korean Morphological annotation of Korean with Directly Maintainable Resources

Parallel Syntactic Annotation of Multiple Languages
L  
L2S Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
Labelling Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
Language Engineering Ontology Driven K-Portal Construction and K-Service Provision
Language identification Reconsidering Language Identification for Written Language Resources

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
Language interaction Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Language modeling Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
Language modelling A Closer Look at Skip-gram Modelling
Language Models Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
Language resource interoperability Moving to dynamic computational lexicons with LeXFlow
Language resource A new approach to syntactic annotation

Development of the First LRs for Macedonian: Current Projects

Integrating Language Knowledge Resources to Extend the English Inclusion

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

LexikoNet - a lexical database based on type and role hierarchies

On the data base of Romanian syllables and some of its quantitative and cryptographic aspects

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

Syntactic Lexicon of Polish Predicative Nouns

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons


Corpus Portal for Search in Monolingual Corpora

Morphological annotation of Korean with Directly Maintainable Resources

BACO - A large database of text and co-occurrences
Language resources production models Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Language service A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
Language technology Query Expansion on Compounds
Language typology Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
LDC More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Learning by Emergence Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
Legal Ontologies A Development Tool For Multilingual Ontology-based Conceptual
Legitimate translation variation Using collocations from comparable corpora to find translation equivalents
Lemmatiser Open Source Corpus Analysis Tools for Malay
Lemmatization Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
Lemmatizer Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Less widely digitally available languages Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Letter-to-Sound Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
Letter-to-sound rules Tools and resources for speech synthesis arising from a Welsh TTS project
Levantine Developing and Using a Pilot Dialectal Arabic Treebank
Lexica A Unified Structure for Dutch Dialect Dictionary Data
Lexical acquisition The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
Lexical anaphora Annotating Bridging Anaphors in Italian: in Search of Reliability
Lexical association measures Semi-automatic Building of Swedish Collocation Lexicon
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Lexical database Building a network of topical relations from a corpus

Natural Language Processing: A Terminological and Statistical Approach

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Proper Names and Linguistic Dynamics

Using Core Ontology for Domain Lexicon Structuring
Lexical entailment Mixing WordNet, VerbNet and PropBank for studying verb relations
Lexical ontology LexikoNet - a lexical database based on type and role hierarchies
Lexical resource Merging two Ontology-based Lexical Resources

Structuring a Domain Vocabulary in a General Knowledge Environment

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
Lexical semantics The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Mixing WordNet, VerbNet and PropBank for studying verb relations
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Lexical variation Finding representative sets of dialect words for geographical regions
Lexical workflow Moving to dynamic computational lexicons with LeXFlow
Lexical-semantic knowledge bases A methodology for the joint development of the Basque WordNet and Semcor
Lexicon A Computational Lexicon of Contemporary Hebrew

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Building a network of topical relations from a corpus

Data-driven Amharic-English Bilingual Lexicon Acquisition

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Elaborating the parameterized Equivalence Class Method for Dutch

LEXADV - a multilingual semantic Lexicon for Adverbs

Lexicon Development for Varieties of Spoken Colloquial Arabic

Morphological annotation of Korean with Directly Maintainable Resources

Natural Language Processing: A Terminological and Statistical Approach

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Valency Lexicon of Czech Verbs: Alternation-Based Model
Lexicon Architecture Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Lexicon Construction Word Knowledge Acquisition for Computational Lexicon Construction
Lexicon extension Automated Deep Lexical Acquisition for Robust Open Texts Processing
Lexicon to Ontology Mapping Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Lexicon-grammar Syntactic Lexicon of Polish Predicative Nouns
LFG Improving coverage and parsing quality of a large-scale LFG for German

The importance of precise tokenizing for deep grammars
Linguistic analysis Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Linguistic annotations The English-Slovene ACQUIS corpus
Linguistic databases Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Linguistic ontology Development of Linguistic Ontology on Natural Sciences and Technology
Linguistic resources Integrated Linguistic Resources for Language Exploitation Technologies

Linguistic Resources for Speech Parsing
Linkage discovery Dependency-Based Phrase Alignment
Linking Merging two Ontology-based Lexical Resources
Linking of lexical databases Exploring opportunities for Comparability and Enrichment by Linking lexical databases
Localization Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
Location-aware computing The Information Commons Gazetteer
Lombard speech database Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
LR Infrastructures and Architectures Techno-langue: The French National Initiative for Human Language Technologies (HLT)
LR national/international projects Techno-langue: The French National Initiative for Human Language Technologies (HLT)
LTAG A Lexicalized Tree-Adjoining Grammar for Vietnamese
M  
Machine Multilingual parallel treebanking: a lean and flexible approach
Machine learning A Grapheme-Based Approach for Accent Restoration in Gikuyu

An Introduction to NLP-based Textual Anonymisation

Learning Database Content for Spoken Dialogue System Design

On Automatic Assignment of Verb Valency Frames in Czech

Recognizing Acronyms and their Definitions in Swedish Medical Texts

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
Machine translatability Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Machine translation Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Dependency-Based Phrase Alignment

Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

Predicting MT Quality as a Function of the Source Language
CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign
METIS-II: Machine Translation for Low Resource Languages
Machine translation evaluation A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
Machine-Aided Indexing Bilingual Machine-Aided Indexing
Machine-readable lexicons Towards machine-readable lexicons for South African Bantu languages
Malay Open Source Corpus Analysis Tools for Malay
Manual annotation Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Manual word-alignment Annotation Guidelines for Czech-English Word Alignment
Mark-up tools Rule-Based Chunking and Reusability
Markup Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
MDE Linguistic Resources for Speech Parsing
Meaning Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
Meaning elicitation The role of lexical resources in matching classification schemas
Medical corpus Recognizing Acronyms and their Definitions in Swedish Medical Texts
Meeting Gathering a corpus of multimodal computer-mediated meetings

Annotating Emotions in Meetings
Meeting browsers TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
Meeting corpora Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
Meeting data Evaluation of multimodal components within CHIL: The evaluation packages and results
Memory based learning Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Merging Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Metamodel Methodological Aspects of Semantic Annotation
Metaphor Database The Italian Metaphor Database
Methodology A Preliminary Study for Building the Basque PropBank
Minority and endangered languages Morphological Tools for Six Small Uralic Languages
Mixed-lingual texts Integrating Language Knowledge Resources to Extend the English Inclusion
Mixer The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Mobile and multimodal interaction The pragmatic combination of different crosslingual resources
Monolingual corpora Corpus Portal for Search in Monolingual Corpora
Morpheme Morphological annotation of Korean with Directly Maintainable Resources
Morphological analysis A Deep Linguistic Analysis for Cross-language Information Retrieval

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

New Approach to Frequency Dictionaries - Czech Example
Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Morphological Tools for Six Small Uralic Languages
Morphological annotation Developing and Using a Pilot Dialectal Arabic Treebank
Morphological Lexicon Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Morphology A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Creating Tools for Morphological Analysis of Sumerian

Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Extraction tools for collocations and their morphosyntactic specificities

HAREM: An Advanced NER Evaluation Contest for Portuguese

Lexicon Development for Varieties of Spoken Colloquial Arabic

Morphological annotation of Korean with Directly Maintainable Resources
Morphosyntax Extraction tools for collocations and their morphosyntactic specificities
Motorbike Bikers Accessing the Web: The SmartWeb Motorbike Corpus
Movement detection Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
MPEG-7 A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Multi-Document Summarization Impact of Question Decomposition on the Quality of Answer Summaries
MultiNet Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
Multidocument Multilingual Multidocument Summarization Tools and Evaluation
Multilevel database Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
Multilingual
Multilingual annotation Parallel Syntactic Annotation of Multiple Languages
Multilingual Corpora Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
Multilingual dabatase A Methodology for Developing Multilingual Resources for Terminology
Multilingual information services The pragmatic combination of different crosslingual resources
Multilingual parallel corpora Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Evaluation of multilingual text alignment systems: the ARCADE II project
Multilingual Question Answering Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

The Multilingual Question Answering Track at CLEF
Multilinguality Aligning Multilingual Thesauri

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Multimedia corpus The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
Multimedia Meeting Recordings Gathering a corpus of multimodal computer-mediated meetings
Multimodal Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Multimodal behavior Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Multimodal corpora H. C. Andersen Conversation Corpus

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
Multimodal interfaces Evaluation of multimodal components within CHIL: The evaluation packages and results

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
Multimodality A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Multiparty dialogues Part-of-Speech Tagging of Transcribed Speech
Multiword expressions Elaborating the parameterized Equivalence Class Method for Dutch

Using collocations from comparable corpora to find translation equivalents
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Mutation rules Automatic Detection of Orthographic Cues for Cognate Recognition
N  
Named Entity Ontology Constructing a Named Entity Ontology from Web Corpora
Named Entity Recognition Named Entity Extraction with Conjunction Disambiguation

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

HAREM: An Advanced NER Evaluation Contest for Portuguese

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Named entities A Deep Linguistic Analysis for Cross-language Information Retrieval
Named entity annotation A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
Named-Entity Tagging Identifying Named Entities in Text Databases from the Natural History Domain
National Project An observatory on Spoken Italian linguistic resources and descriptive standards.
Natural language access to database From Natural Language to Databases via Ontologies
Natural language engineering Automatic Evaluation and Composition of NLP Pipelines with Web Services
Natural Language Interfaces to GIS Databases Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Natural Language Processing (NLP) Automatic Evaluation and Composition of NLP Pipelines with Web Services

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Development of Linguistic Ontology on Natural Sciences and Technology

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Turning a Dependency Treebank into a PSG-style Constituent Treebank

Naturalistic Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
NLTK PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
Non-machine-translatable parts Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Non-native speech Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

SINOD - Slovenian non-native speech database

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
Non-words Corpus-Induced Corpus Clean-up
Noncompositionality The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Normalized edit distance Automatic Detection of Orthographic Cues for Cognate Recognition
Norwegian FonDat1: A Speech Synthesis Corpus for Norwegian
NP coreference NPs for Events: Experiments in Coreference Annotation
Nursing Terminologies Features of Terms in Actual Nursing Activities
O  
Ontology A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Feature-based Encoding and Querying Language Resources with Character Semantics

From Natural Language to Databases via Ontologies

LEXADV - a multilingual semantic Lexicon for Adverbs

Ontology-based Information Extraction with SOBA

Structuring a Domain Vocabulary in a General Knowledge Environment
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Aligning Multilingual Thesauri

The LOIS Project
Ontology acquisition Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
Ontology Development from Text Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Ontology Engineering Ontology Driven K-Portal Construction and K-Service Provision
Ontology learning Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
Ontology mapping Merging two Ontology-based Lexical Resources
Ontology Modelling Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Ontology modelling for trans/cross-lingual dialogues The pragmatic combination of different crosslingual resources
Open Source REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Open-domain Q&A evaluation Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
Open-domain free text translation The pragmatic combination of different crosslingual resources
Optimality Theory PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
Organizational/policy issues Techno-langue: The French National Initiative for Human Language Technologies (HLT)
Oriental Oriental COCOSDA: Past, Present and Future
Orthographic cues Automatic Detection of Orthographic Cues for Cognate Recognition
Orthography FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
P  
Pan-Chinese lexical resource Toward a Pan-Chinese Thesaurus
Parallel corpora Annotating COMPARA, a Grammar-aware Parallel Corpus

Building a Swedish-Turkish Parallel Corpus

Construction of a FrameNet Labeler for Swedish Text

Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

Multilingual parallel treebanking: a lean and flexible approach

The English-Slovene ACQUIS corpus

Transferring Coreference Chains through Word Alignment
Paraphrase Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Parse scoring SParseval: Evaluation Metrics for Parsing Speech
Parser evaluation Improving coverage and parsing quality of a large-scale LFG for German
Parsers Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Parsing A Lexicalized Tree-Adjoining Grammar for Vietnamese

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Improving coverage and parsing quality of a large-scale LFG for German

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

Multilingual parallel treebanking: a lean and flexible approach

The importance of precise tokenizing for deep grammars

Developing and Using a Pilot Dialectal Arabic Treebank
Parsing Evaluation Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.
Part-of-speech tagging A Deep Linguistic Analysis for Cross-language Information Retrieval
Pattern generation A Corpus Search System Utilizing Lexical Dependency Structure
Pattern search UAM Text Tools - a flexible NLP architecture
Pattern transformation Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
PDT Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
Phonetic distance Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
Phonetic lexicons Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
Phonetic transcriptions Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Phonetics Feature-based Encoding and Querying Language Resources with Character Semantics
Phonology FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
Phrase alignment Dependency-Based Phrase Alignment
Phrase-based statistical machine translation Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries
Phrase-book translation The pragmatic combination of different crosslingual resources
Platforms REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
PMI Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
POS tagging An Annotated Corpus Management Tool: ChaKi

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
Polish General and Task-Specific Corpus Resources for Polish Adult Learners of English
Populated places The Information Commons Gazetteer
Portuguese Building a historical corpus for Classical Portuguese: some technological aspects
Portuguese African varieties The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Prague-English Dependency Treebank From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Predicate – argument structure Adding multi-layer semantics to the Greek Dependency Treebank
Predicate-argument Building a resource for studying translation shifts
Predicate-argument structure Shalmaneser - A Toolchain For Shallow Semantic Parsing
Predicative nouns Syntactic Lexicon of Polish Predicative Nouns
Prefix Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.
Prepositions The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
Preprocessing Preprocessing and Tokenisation Standards in DELPH-IN Tools
Problem-solving language The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
Profile Creation Language Resources for Background Gathering
Projectivity Corpus Annotation as a Test of a Linguistic Theory
Pronoun Resolution PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
Pronunciation lexicon SI-PRON: A Pronunciation Lexicon for Slovenian
Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Pronunciation of names “Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
Pronunciation variation Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
Proofing tools A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications
PropBank From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Property Grammars Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Prosodic transplantation A joint prosody evaluation of French text-to-speech synthesis systems
Prosody A joint prosody evaluation of French text-to-speech synthesis systems
Prosody analysis Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
Protein names The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Pseudonymisation An Introduction to NLP-based Textual Anonymisation
Python PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
Q  
QA Question Answering Evaluation Survey
Quality model ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

The Evolution of an Evaluation Framework for a Text Mining System
Query expansion Query Expansion on Compounds

Using collocations from comparable corpora to find translation equivalents
Query-interpretation Summarizing Answers for Complicated Questions
Question Language Resources for Background Gathering
Question Answering An Answer Bank for Temporal Inference

Data for question answering: The case of why

Impact of Question Decomposition on the Quality of Answer Summaries

Question Answering Evaluation Survey

Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

Towards Natural Interactive Question Answering

Exploiting Multiple Semantic Resources for Answer Selection

EQueR: the French Evaluation campaign of Question-Answering Systems

The Multilingual Question Answering Track at CLEF

Summarizing Answers for Complicated Questions

Component Evaluation in a Question Answering System
Question Decomposition Impact of Question Decomposition on the Quality of Answer Summaries
R  
Random Indexing Towards Holistic Summarization -- Selecting Summaries, Not Sentences
Reader Model Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Recording methodology Low-cost Customized Speech Corpus Creation for Speech Technology Applications
Regional bias Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Regular changes in valency structure Valency Lexicon of Czech Verbs: Alternation-Based Model
Relation Annotation Shallow Semantic Annotation of Bulgarian
Relational database Lexicon Development for Varieties of Spoken Colloquial Arabic
Relative ordering ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics
Resource building The LOIS Project
Resource merging Exploiting Multiple Semantic Resources for Answer Selection
Resource-scarce languages A Grapheme-Based Approach for Accent Restoration in Gikuyu
Resource The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Results EQueR: the French Evaluation campaign of Question-Answering Systems

The Multilingual Question Answering Track at CLEF
Reusability Rule-Based Chunking and Reusability
Rhythm A BLARK extension for temporal annotation mining
Roadmap Reconsidering Language Identification for Written Language Resources
Robust parsing Automated Deep Lexical Acquisition for Robust Open Texts Processing
Role labelling A Preliminary Study for Building the Basque PropBank
Romanian syllables On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Romanized Pinyin Towards Unified Chinese Segmentation Algorithm
Rule patterns Act-Topic Patterns for Automatically Checking Dialogue Models
Rule-based Rule-Based Chunking and Reusability
S  
Scandinavian LEXADV - a multilingual semantic Lexicon for Adverbs
Search Query Expansion on Compounds
Search Engines Towards Unified Chinese Segmentation Algorithm
Second language speech corpus General and Task-Specific Corpus Resources for Polish Adult Learners of English
Securing interpretability Feature-based Encoding and Querying Language Resources with Character Semantics
Selectional restrictions Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

LexikoNet - a lexical database based on type and role hierarchies
Semantic A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

A Preliminary Study for Building the Basque PropBank

Shalmaneser - A Toolchain For Shallow Semantic Parsing
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
Semantic Analysis Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
Semantic Annotation SALTO - A Versatile Multi-Level Annotation Tool

Shallow Semantic Annotation of Bulgarian
Discriminant-Based MRS Banking

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

I-CAB: the Italian Content Annotation Bank

Methodological Aspects of Semantic Annotation
Semantic classification Semantic Descriptors: The Case of Reflexive Verbs
Semantic information Merging two Ontology-based Lexical Resources
Semantic lexicons Building Slovene WordNet
Semantic matching The role of lexical resources in matching classification schemas
Semantic network Structuring a Domain Vocabulary in a General Knowledge Environment
Semantic Parsing Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Shalmaneser - A Toolchain For Shallow Semantic Parsing
Semantic pattern Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Semantic relations Structuring a Domain Vocabulary in a General Knowledge Environment
Semantic relations detection SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
Semantic representation Parallel Syntactic Annotation of Multiple Languages

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
Semantic resources Exploiting Multiple Semantic Resources for Answer Selection
Semantic role labeling Adding multi-layer semantics to the Greek Dependency Treebank

Construction of a FrameNet Labeler for Swedish Text
Semantic roles The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Semantic Similarity Towards Holistic Summarization -- Selecting Summaries, Not Sentences
Semantic tagging Semantic Tag Extraction from WordNet Glosses
Semantic verb classes Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
Semantic Web A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Automatic Evaluation and Composition of NLP Pipelines with Web Services

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

The role of lexical resources in matching classification schemas
Semantically related terms A Methodology for Developing Multilingual Resources for Terminology
Semantics A methodology for the joint development of the Basque WordNet and Semcor

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Building a network of topical relations from a corpus

HAREM: An Advanced NER Evaluation Contest for Portuguese

Linguistic and Biological Annotations of Biological Interaction Events

The Look and Feel of a Confident Entailer
Semi-automatic annotation KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
Sense clustering Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
Sense mapping Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
Sensitivity An Introduction to NLP-based Textual Anonymisation
Sentence alignment Acquis Communautaire Sentence Alignment using Support Vector Machines
Sentence tokeniser Open Source Corpus Analysis Tools for Malay
Sentiment analysis Semantic Tag Extraction from WordNet Glosses
Serbian WordNet Building Slovene WordNet
Shallow analysis Query Expansion on Compounds
Shallow Parsing Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
The wraetlic NLP suite
Similar Event Search Language Resources for Background Gathering
Similarity Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
SIMPLE LEXADV - a multilingual semantic Lexicon for Adverbs
Situational Prompting Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Skip-gram A Closer Look at Skip-gram Modelling
SKOS A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
Slovene WordNet Building Slovene WordNet
Slovenian SI-PRON: A Pronunciation Lexicon for Slovenian

SINOD - Slovenian non-native speech database

The English-Slovene ACQUIS corpus

Towards a Slovene Dependency Treebank
SMILES strings Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Software GAIA: Common Framework for the Development of Speech Translation Technologies

UAM Text Tools - a flexible NLP architecture
Somali Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Source code GAIA: Common Framework for the Development of Speech Translation Technologies
Source quality Predicting MT Quality as a Function of the Source Language
Spanish Parallel Syntactic Annotation of Multiple Languages

Spanish Synthesis Corpora
Spatial ontology Principles for annotating and reasoning with spatial information
Spatial semantics Principles for annotating and reasoning with spatial information
Speaker recognition The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Special Terms Dictionaries Features of Terms in Actual Nursing Activities
Specifications TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis
ECESS Inter-Module Interface Specification for Speech Synthesis
Speech A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Gathering a corpus of multimodal computer-mediated meetings

Oriental COCOSDA: Past, Present and Future
Speech and pointing gesture H. C. Andersen Conversation Corpus
Speech Corpus Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Bilingual speech corpus in two phonetically similar languages
FonDat1: A Speech Synthesis Corpus for Norwegian
Speech corpus design Low-cost Customized Speech Corpus Creation for Speech Technology Applications
Speech database Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

SINOD - Slovenian non-native speech database
Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Speech feedback Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
Speech parsing Linguistic Resources for Speech Parsing
Speech recognition Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

GAIA: Common Framework for the Development of Speech Translation Technologies

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Speech synthesis Creation and analysis of a Polish speech database for use in unit selection synthesis

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

ECESS Inter-Module Interface Specification for Speech Synthesis

FonDat1: A Speech Synthesis Corpus for Norwegian

GAIA: Common Framework for the Development of Speech Translation Technologies

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Tools and resources for speech synthesis arising from a Welsh TTS project
Speech timing A BLARK extension for temporal annotation mining
Speech translation GAIA: Common Framework for the Development of Speech Translation Technologies

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Speech-to-Speech Translation (SST) Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign
Spell checking A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications
Spelling error detection A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications
Spelling error patterns A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications
Spoken Corpora Features of Terms in Actual Nursing Activities
Spoken dialogue systems REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
A Framework to Integrate Ubiquitous Knowledge Modeling
Spoken Dutch Corpus Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Spoken Italian corpus Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
Spoken Italian Linguistic Resources An observatory on Spoken Italian linguistic resources and descriptive standards.
Spoken Language Translation (SLT) Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign
Spoken language Annotating Information Structure in a Corpus of Spoken Danish
Spoken language corpora “Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
Spoken metadata Linguistic Resources for Speech Parsing
Spontaneous conversation Discourse functions of duration in Mandarin: resource design and implementation
Spontaneous speech corpus Dependency-structure Annotation to Corpus of Spontaneous Japanese
SSML ECESS Inter-Module Interface Specification for Speech Synthesis
Standard based archive Building a historical corpus for Classical Portuguese: some technological aspects
Standardisation A Unified Structure for Dutch Dialect Dictionary Data
Standardization A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

ECESS Inter-Module Interface Specification for Speech Synthesis

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
Standards for lexicons Exploring opportunities for Comparability and Enrichment by Linking lexical databases
Standoff annotation Preprocessing and Tokenisation Standards in DELPH-IN Tools
Statistical Machine Translation Creating a Large-Scale Arabic to French Statistical MachineTranslation System

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

Czech-English Word Alignment

Statistical language modeling Linguistic features modeling based on Partial New Cache
Statistical method Languages: Amharic Data-driven Amharic-English Bilingual Lexicon Acquisition
Statistical modeling Evaluation of Stop Word Lists in Chinese Language
Stochastic parsing Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
Stop word list Evaluation of Stop Word Lists in Chinese Language
Structural event detection Linguistic Resources for Speech Parsing
Stylistic alignment Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
Subcategorization A Large Subcategorization Lexicon for Natural Language Processing Applications

Automatic extraction of subcategorization frames for French
Sumerian Creating Tools for Morphological Analysis of Sumerian
Summarization Summarizing Answers for Complicated Questions
SUMO A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Support vector machines Acquis Communautaire Sentence Alignment using Support Vector Machines
Support verbs Syntactic Lexicon of Polish Predicative Nouns
Surface patterns Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Surface syntactic analysis Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Surveillance application Fear-type emotions of the SAFE Corpus: annotation issues
Syllabus Project A Development Tool For Multilingual Ontology-based Conceptual
Synchronous corpus Toward a Pan-Chinese Thesaurus
Syntacs Linguistic and Biological Annotations of Biological Interaction Events
Syntactic analysis A Deep Linguistic Analysis for Cross-language Information Retrieval

Data for question answering: The case of why
Syntactic annotation Developing and Using a Pilot Dialectal Arabic Treebank

Parallel Syntactic Annotation of Multiple Languages
Syntactic lexicon The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
Syntactic parsing Towards a Slovene Dependency Treebank
Syntactic structures A new approach to syntactic annotation
Syntax Multilingual parallel treebanking: a lean and flexible approach

The Look and Feel of a Confident Entailer
Synthesis Spanish Synthesis Corpora
System evaluation The Impact of Evaluation on Multilingual Information Retrieval System Development
System extension Integrating Language Knowledge Resources to Extend the English Inclusion
Systems EQueR: the French Evaluation campaign of Question-Answering Systems
T  
Tagging Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Morphological annotation of Korean with Directly Maintainable Resources

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Tagging errors detection and correction Tagset Mapping and Statistical Training Data Cleaning-up
Tagsets mapping Tagset Mapping and Statistical Training Data Cleaning-up
TC-STAR ECESS Inter-Module Interface Specification for Speech Synthesis
Teaching The MULINCO corpus and corpus platform
Techninal Term Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Temporal Expression Identification and Normalisation If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”
Temporal Inference An Answer Bank for Temporal Inference
Temporal information An Annotated Corpus of Typical Durations of Events
Term associations Towards a terminological resource for biomedical text mining
Term extraction Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
Term identification Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Term variation Clustering acronyms in biomedical text for disambiguation
Terminological databases Towards a terminological resource for biomedical text mining
Terminology A Methodology for Developing Multilingual Resources for Terminology

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Using Core Ontology for Domain Lexicon Structuring
Terminology acquisition Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
Terminology Extraction Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Terminology recognition Towards a terminological resource for biomedical text mining
Test suite A Lexicalized Tree-Adjoining Grammar for Vietnamese
Text Gathering a corpus of multimodal computer-mediated meetings
Text alignment Evaluation of multilingual text alignment systems: the ARCADE II project
Text Analysis A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
Text Classification Bilingual Machine-Aided Indexing
Text Database BACO - A large database of text and co-occurrences
Text mining A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Natural Language Processing: A Terminological and Statistical Approach

The Evolution of an Evaluation Framework for a Text Mining System
Text Prediction Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
Text-to-speech synthesis A joint prosody evaluation of French text-to-speech synthesis systems
Textual coreference Corpus Annotation as a Test of a Linguistic Theory
Textual entailment Mixing WordNet, VerbNet and PropBank for studying verb relations

The Look and Feel of a Confident Entailer
Building a Large-Scale Repository of Textual Entailment Rules
Thai Search Engine Blind Evaluation for Thai Search Engines
The Chinese Gigaword Corpus Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Thesauri Aligning Multilingual Thesauri

Development of Linguistic Ontology on Natural Sciences and Technology
Thesaurus browsing A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
Third language acquisition Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Time An Annotated Corpus of Typical Durations of Events
Tokenisation Preprocessing and Tokenisation Standards in DELPH-IN Tools

Tools and resources for speech synthesis arising from a Welsh TTS project
Tokenization The importance of precise tokenizing for deep grammars
Tool Act-Topic Patterns for Automatically Checking Dialogue Models

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Tool design An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Toolkits The wraetlic NLP suite
Tools Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

SALTO - A Versatile Multi-Level Annotation Tool

The MULINCO corpus and corpus platform
Tools for annotation Structure, Annotation and Tools in the Basque ZT Corpus
Topic classification Next Generation Language Resources using Grid
Training corpora Tagset Mapping and Statistical Training Data Cleaning-up
Training in an unsupervised way Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Transcription A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Integrated Linguistic Resources for Language Exploitation Technologies
Transcription evaluation Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
Transducer Morphological annotation of Korean with Directly Maintainable Resources
Translation Integrated Linguistic Resources for Language Exploitation Technologies
Translation aid Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Translation equivalents Using collocations from comparable corpora to find translation equivalents
Translation shifts Building a resource for studying translation shifts
Translation studies Building a resource for studying translation shifts

The MULINCO corpus and corpus platform
Translation training A Uniform Interface to Large-Scale Linguistic Resources
Tree Edit Distance Building a Large-Scale Repository of Textual Entailment Rules
Tree query Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Tree search Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Tree transformation Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Treebank Linguistic Resources for Speech Parsing

Shallow Semantic Annotation of Bulgarian
Treebanking Discriminant-Based MRS Banking

Multilingual parallel treebanking: a lean and flexible approach
Treebanks Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Towards a Slovene Dependency Treebank

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Turning a Dependency Treebank into a PSG-style Constituent Treebank
Typology of Multiword Expressions COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Typos Corpus-Induced Corpus Clean-up
U  
UBIQUITOUS DOMAIN MODEL A Framework to Integrate Ubiquitous Knowledge Modeling
UMTS Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Under-resourced languages Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Underspecification Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Unification grammar REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Unified Lexicon Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Unified Morphosyntactic Specifications Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Unit selection Creation and analysis of a Polish speech database for use in unit selection synthesis

Tools and resources for speech synthesis arising from a Welsh TTS project
Universal databases The Information Commons Gazetteer
Unknown Word Interpretation A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
Unknown words Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.
Unsupervised Corpus-Induced Corpus Clean-up
Unsupervised classifier Integrating Language Knowledge Resources to Extend the English Inclusion
Uralic languages Morphological Tools for Six Small Uralic Languages
User centred evaluation The Evolution of an Evaluation Framework for a Text Mining System
User-feedback Computer-aided summarisation -- what the user really wants
V  
Vagueness Annotating COMPARA, a Grammar-aware Parallel Corpus
Valence dictionary Romanian Valence Dictionary in XML Format
Valency From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Valency frames From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Validation of language resources Czech-English Word Alignment
Vallex From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Verb On Automatic Assignment of Verb Valency Frames in Czech
Verb classes From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
VerbNet Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Verbal lexicon Automatic extraction of subcategorization frames for French
Verbal predicates Semantic Descriptors: The Case of Reflexive Verbs
Verbo-nominal collocations Syntactic Lexicon of Polish Predicative Nouns
Verbs From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Vietnamese A Lexicalized Tree-Adjoining Grammar for Vietnamese
W  
Web Data Towards Unified Chinese Segmentation Algorithm
Web Mining Constructing a Named Entity Ontology from Web Corpora

Towards Unified Chinese Segmentation Algorithm
Web Services Automatic Evaluation and Composition of NLP Pipelines with Web Services
Web based language resouce access Building a historical corpus for Classical Portuguese: some technological aspects
Web-based tool Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Welsh Tools and resources for speech synthesis arising from a Welsh TTS project
Why-questions Data for question answering: The case of why
Wizard of Oz experiments Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Word alignment Non-probabilistic alignment of rare German and English nominal expressions

Dependency-Based Phrase Alignment

Transferring Coreference Chains through Word Alignment

Annotation Guidelines for Czech-English Word Alignment

Word sense disambiguation Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
WordNet Building Slovene WordNet

The Italian Metaphor Database

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Semantic Tag Extraction from WordNet Glosses

The role of lexical resources in matching classification schemas
Words Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
Written Dutch KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
Written language resource User requirements analysis for the design of a reference corpus of written Dutch
X  
XML A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

ECESS Inter-Module Interface Specification for Speech Synthesis

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Romanian Valence Dictionary in XML Format

Rule-Based Chunking and Reusability