LREC 2006 - Proceedings sorted by keywords

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Analysis

Anaphora

Coreference Resolution with and without Linguistic Knowledge

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

And DUC.

Automated Summarization Evaluation with Basic Elements.

Annotated corpora

HAREM: An Advanced NER Evaluation Contest for Portuguese

Annotation

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

An Annotated Corpus Management Tool: ChaKi

An Introduction to NLP-based Textual Anonymisation

Annotating Emotions in Meetings

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

H. C. Andersen Conversation Corpus

KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

Lexicon Development for Varieties of Spoken Colloquial Arabic

Linguistic Resources for Speech Parsing

Morphological annotation of Korean with Directly Maintainable Resources

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

NPs for Events: Experiments in Coreference Annotation

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

The MULINCO corpus and corpus platform

UAM Text Tools - a flexible NLP architecture

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Annotation of Temporal Relations with Tango

Annotation evaluation

Fear-type emotions of the SAFE Corpus: annotation issues

Annotation guidelines

An Annotated Corpus of Typical Durations of Events

NPs for Events: Experiments in Coreference Annotation
Annotating COMPARA, a Grammar-aware Parallel Corpus

Annotation mining

A BLARK extension for temporal annotation mining

Discourse functions of duration in Mandarin: resource design and implementation

Annotation of emotion

Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus

Annotation scheme

Annotating Bridging Anaphors in Italian: in Search of Reliability

NPs for Events: Experiments in Coreference Annotation

Annotation tool

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Anonymisation

An Introduction to NLP-based Textual Anonymisation

Answer selection

Exploiting Multiple Semantic Resources for Answer Selection

Answer Time-Bank

An Answer Bank for Temporal Inference

Appraisal

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Arabic

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic

Parallel Syntactic Annotation of Multiple Languages

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

Argument structure

Romanian Valence Dictionary in XML Format

Valency Lexicon of Czech Verbs: Alternation-Based Model

Artificial intelligence

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

Asia

Oriental COCOSDA: Past, Present and Future

Augmentation of lexicons

Moving to dynamic computational lexicons with LeXFlow

Augmentative and Alternative Communication

Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People

Automatic alignment

Building a Swedish-Turkish Parallel Corpus

Automatic generation of web portals

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

Automatic lexical acquisition

A Large Subcategorization Lexicon for Natural Language Processing Applications

Automatic MT evaluation

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality

Automatic phonetic transcription

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

Automatic segmentation

Creation and analysis of a Polish speech database for use in unit selection synthesis

Automatic Speech Recognition (ASR)

Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

Automatic Text Summarization

Towards Holistic Summarization -- Selecting Summaries, Not Sentences

Computer-aided summarisation -- what the user really wants

Average reduced frequency

New Approach to Frequency Dictionaries - Czech Example

Back translation

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Basic Elements

Automated Summarization Evaluation with Basic Elements.
Summarizing Answers for Complicated Questions

Bilingual

Data-driven Amharic-English Bilingual Lexicon Acquisition
Bilingual speech corpus in two phonetically similar languages
Bilingual Machine-Aided Indexing

Bilingual corpora

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Bilingual lexicon acquisition

Automatic Detection of Orthographic Cues for Cognate Recognition

Bio-NLP

Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

Bioinformatics

Linguistic and Biological Annotations of Biological Interaction Events

Biomedical corpora

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Biomedical literature

Clustering acronyms in biomedical text for disambiguation

Biomedicine

Towards a terminological resource for biomedical text mining

Bitext

Building a Swedish-Turkish Parallel Corpus

BLARK

BLAST

Blind Evaluation for Thai Search Engines

Blind Evaluation

Bracket scores

SParseval: Evaluation Metrics for Parsing Speech

Bridging

Annotating Bridging Anaphors in Italian: in Search of Reliability

Broadcast news

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

SINOD - Slovenian non-native speech database

Case frame

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Case Frame Compilation from the Web using High-Performance Computing

Catalan

Bilingual speech corpus in two phonetically similar languages

Categorial ambiguity

Annotating COMPARA, a Grammar-aware Parallel Corpus

Centering

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

CESTA evaluation

Creating a Large-Scale Arabic to French Statistical MachineTranslation System

CGN

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

Character semantics

Feature-based Encoding and Querying Language Resources with Character Semantics

Chat language

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Chemical compounds

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Children

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

Chinese collocation

The Design and Construction of A Chinese Collocation Bank

Chinese Segmentation

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

Chinese Sketch Engine

Chinese thesaurus

Toward a Pan-Chinese Thesaurus

Chunking

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

Class-based language models

Classification

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

CLEF

The Multilingual Question Answering Track at CLEF

CLIR

Extraction of Cross Language Term Correspondences

Clustering

Clustering acronyms in biomedical text for disambiguation

Finding representative sets of dialect words for geographical regions

Co-occurence

BACO - A large database of text and co-occurrences

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

COCOSDA

Oriental COCOSDA: Past, Present and Future

Cognate detection

Automatic Detection of Orthographic Cues for Cognate Recognition

Cognitive Theory of Metaphor

The Design and Construction of A Chinese Collocation Bank

Collocation bank

Collocation extraction

Extraction tools for collocations and their morphosyntactic specificities

Semi-automatic Building of Swedish Collocation Lexicon

Collocations

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Extraction tools for collocations and their morphosyntactic specificities

Comparability and enrichment

Exploring opportunities for Comparability and Enrichment by Linking lexical databases

Compounds

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Computational Lexicography

Computational dialectology

Dialectal resources on-line: the ALT-Web experience

Computational lexicon

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
A Large Subcategorization Lexicon for Natural Language Processing Applications
Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

Computational morphology

Morphological Tools for Six Small Uralic Languages

Computational Semantics

Towards Holistic Summarization -- Selecting Summaries, Not Sentences

Computational Terminology

Extraction of Cross Language Term Correspondences

Computer-aided summarization

Computer-aided summarisation -- what the user really wants

Computer-assisted language learning

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

Concept Set

Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence

Conceptual Interlingua

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Conceptual markers

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Conceptual Metaphor

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Conceptual relations

Conceptual Vectors

Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence

Confidence measure

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Confusables

Named Entity Extraction with Conjunction Disambiguation

Conjunction Disambiguation

Constraint Grammar

Turning a Dependency Treebank into a PSG-style Constituent Treebank

Construction of lexical resources

Dialectal resources on-line: the ALT-Web experience

Content selection

Learning Database Content for Spoken Dialogue System Design

Context-based evaluation

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

Context-independent spelling correction

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications

Contributing factor

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Controlled language

Predicting MT Quality as a Function of the Source Language

Conversational speech

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Conversational telephone speech

Conversion

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Cooperative answers

Language Challenges for Data Fusion in Question-Answering

CoreLex

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Coreference

Coreference Resolution with and without Linguistic Knowledge

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Towards Natural Interactive Question Answering

Corpora

A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German

Development of the First LRs for Macedonian: Current Projects

EQueR: the French Evaluation campaign of Question-Answering Systems

I-CAB: the Italian Content Annotation Bank

Oriental COCOSDA: Past, Present and Future

SALTO - A Versatile Multi-Level Annotation Tool

Structure, Annotation and Tools in the Basque ZT Corpus

The Multilingual Question Answering Track at CLEF

Corpus

An Annotated Corpus Management Tool: ChaKi

An Annotated Corpus of Typical Durations of Events

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

Case Frame Compilation from the Web using High-Performance Computing

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Discourse functions of duration in Mandarin: resource design and implementation

Evaluation of multimodal components within CHIL: The evaluation packages and results

Extraction tools for collocations and their morphosyntactic specificities

Gathering a corpus of multimodal computer-mediated meetings

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

The MULINCO corpus and corpus platform

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

Corpus acquisition and preprocessing

Creating a Large-Scale Arabic to French Statistical MachineTranslation System

Corpus analysis tools

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Corpus annotation

A methodology for the joint development of the Basque WordNet and Semcor

Annotating COMPARA, a Grammar-aware Parallel Corpus

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Corpus Annotation as a Test of a Linguistic Theory
The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Corpus annotation tool

A new approach to syntactic annotation
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Corpus Based Synthesis

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Corpus construction

KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

Corpus design

User requirements analysis for the design of a reference corpus of written Dutch

Corpus development

A Uniform Interface to Large-Scale Linguistic Resources

Linguistic Resources for Speech Parsing

Corpus processing

UAM Text Tools - a flexible NLP architecture

Corpus query interface

A Uniform Interface to Large-Scale Linguistic Resources

Corpus search

A Corpus Search System Utilizing Lexical Dependency Structure

Annotating COMPARA, a Grammar-aware Parallel Corpus

Corpus size

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

Corpus Spoken Dutch

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

Corpus-cleanup

Corpus-induced

Transferring Coreference Chains through Word Alignment

Correference

Correlation

Automated Summarization Evaluation with Basic Elements.

Cross-language information retrieval

The Impact of Evaluation on Multilingual Information Retrieval System Development

Cross-level queries

Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

Cross-lingual and trans-lingual techniques

The pragmatic combination of different crosslingual resources

Cross-lingual information retrieval

A Deep Linguistic Analysis for Cross-language Information Retrieval

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Crosslingual

Bilingual Machine-Aided Indexing

Czech

Towards a Slovene Dependency Treebank

Danish

LEXADV - a multilingual semantic Lexicon for Adverbs

Query Expansion on Compounds

Data centers

More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Integrated Linguistic Resources for Language Exploitation Technologies

Data collection

Data for question answering: The case of why

Data sparsity

A Closer Look at Skip-gram Modelling

Data-driven methods

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

Database

On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Spanish Synthesis Corpora

Database creation

Creation and analysis of a Polish speech database for use in unit selection synthesis

Database Design

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Date integration

Language Challenges for Data Fusion in Question-Answering

Decision points

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

Decoder

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

Deep lexical acquisition

Automated Deep Lexical Acquisition for Robust Open Texts Processing

Deep parsing

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

Preprocessing and Tokenisation Standards in DELPH-IN Tools
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Dependency grammar

Towards a Slovene Dependency Treebank
Turning a Dependency Treebank into a PSG-style Constituent Treebank

Dependency parsing

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Dependency-Based Phrase Alignment

Dependency relations

Corpus Annotation as a Test of a Linguistic Theory

Dependency scores

SParseval: Evaluation Metrics for Parsing Speech

Dependency structure

A Corpus Search System Utilizing Lexical Dependency Structure

An Annotated Corpus Management Tool: ChaKi

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Dependency tree

From Natural Language to Databases via Ontologies

Diacritic placement

A Grapheme-Based Approach for Accent Restoration in Gikuyu

Diagrams

Semantic Descriptors: The Case of Reflexive Verbs

Dialect

A Unified Structure for Dutch Dialect Dictionary Data

Finding representative sets of dialect words for geographical regions

Dialectal Arabic

Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic

Dialectal databases

Dialectal resources on-line: the ALT-Web experience

Dialectal Treebank

Developing and Using a Pilot Dialectal Arabic Treebank

Dialectometry

Finding representative sets of dialect words for geographical regions

Dialog

The OSU Quake 2004 corpus of two-party situated problem-solving dialogs

Dialogue

Towards Natural Interactive Question Answering

Dialogue annotation

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

Dialogue model

Act-Topic Patterns for Automatically Checking Dialogue Models

Dialogue system user speech

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

Dialogue systems

Learning Database Content for Spoken Dialogue System Design
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Dictionary

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

A Unified Structure for Dutch Dialect Dictionary Data

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Dictionary creation

Developing and Using a Pilot Dialectal Arabic Treebank

Dictionary interfaces

Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements

Dictionary model

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Digital libraries

Natural Language Processing: A Terminological and Statistical Approach

Dimensions

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Disambiguation

On Automatic Assignment of Verb Valency Frames in Czech

Discourse phenomena

Discourse functions of duration in Mandarin: resource design and implementation

Disfluencies

Data-driven Amharic-English Bilingual Lexicon Acquisition

Disparate languages

Distillation

Integrated Linguistic Resources for Language Exploitation Technologies

Distributed databases

Distributed index

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

Doctor-patient dialogue

Document categorization

Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods

DOLCE

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Domain ontology

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Using Core Ontology for Domain Lexicon Structuring

Domain specific knowledge

Structuring a Domain Vocabulary in a General Knowledge Environment

Domain-specific corpora

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

Dutch

User requirements analysis for the design of a reference corpus of written Dutch
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Dynamic lexicons

Moving to dynamic computational lexicons with LeXFlow

ECESS

ECESS Inter-Module Interface Specification for Speech Synthesis

EContent users

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Editing

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Elderly

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

Electronic dictionaries

Development of the First LRs for Macedonian: Current Projects

Ellipsis

Towards Natural Interactive Question Answering

EMMA

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Emotion

Annotating Emotions in Meetings

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Emotion recognition

Improving Automatic Emotion Recognition from Speech via Gender Differentiaion

Emotional database

Fear-type emotions of the SAFE Corpus: annotation issues

Emotional valence

Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus

EngValLex

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

English

Data-driven Amharic-English Bilingual Lexicon Acquisition

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Parallel Syntactic Annotation of Multiple Languages

The English-Slovene ACQUIS corpus

English inclusions

Integrating Language Knowledge Resources to Extend the English Inclusion

EQUER

Question Answering Evaluation Survey

Error mining

Automated Deep Lexical Acquisition for Robust Open Texts Processing

EU legislation

The English-Slovene ACQUIS corpus

Eurovoc

Aligning Multilingual Thesauri

Evaluation

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

A joint prosody evaluation of French text-to-speech synthesis systems

Act-Topic Patterns for Automatically Checking Dialogue Models

Annotating Bridging Anaphors in Italian: in Search of Reliability

Automated Summarization Evaluation with Basic Elements.

Automatic Evaluation and Composition of NLP Pipelines with Web Services

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Czech-English Word Alignment

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

EQueR: the French Evaluation campaign of Question-Answering Systems

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

Evaluation of multilingual text alignment systems: the ARCADE II project

Evaluation of multimodal components within CHIL: The evaluation packages and results

HAREM: An Advanced NER Evaluation Contest for Portuguese

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

Predicting MT Quality as a Function of the Source Language

Question Answering Evaluation Survey

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

The Multilingual Question Answering Track at CLEF

Evaluation contest

HAREM: An Advanced NER Evaluation Contest for Portuguese

Evaluation programs

Integrated Linguistic Resources for Language Exploitation Technologies

Evaluation resources

HAREM: An Advanced NER Evaluation Contest for Portuguese

Evaluation tools

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

Event annotation

Adding multi-layer semantics to the Greek Dependency Treebank

Event coreference

NPs for Events: Experiments in Coreference Annotation

Event durations

Adding multi-layer semantics to the Greek Dependency Treebank

Event taxonomy

Extraction from Spoken Text. In Proceeding of LREC 2004. Paris

Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts

Extractive Summarization

Multilingual Multidocument Summarization Tools and Evaluation

Fear detection

Fear-type emotions of the SAFE Corpus: annotation issues

Feature extraction

Improving Automatic Emotion Recognition from Speech via Gender Differentiaion

Feature structures

Feature-based Encoding and Querying Language Resources with Character Semantics

Features

Linguistic features modeling based on Partial New Cache

Features-Cache model

Linguistic features modeling based on Partial New Cache

FGD

Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

Figurative Language

Proper Names and Linguistic Dynamics

Filed Recording

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus

Filtering data

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Finite-state transducers

The importance of precise tokenizing for deep grammars

Fluency

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality

Focus

Annotating Information Structure in a Corpus of Spoken Danish

Forensic

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Frame

On Automatic Assignment of Verb Valency Frames in Czech

FrameNet

Construction of a FrameNet Labeler for Swedish Text

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Framework

GAIA: Common Framework for the Development of Speech Translation Technologies

France: ELRA

Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts

Free software

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

French

Integrating Language Knowledge Resources to Extend the English Inclusion

Parallel Syntactic Annotation of Multiple Languages

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

French Question Answering Campaign

Question Answering Evaluation Survey

Frequency

FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

New Approach to Frequency Dictionaries - Czech Example

Functional Generative Description

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Fusion

Integrated Linguistic Resources for Language Exploitation Technologies

GALE

Gazetteer

Improving Automatic Emotion Recognition from Speech via Gender Differentiaion

Gender detection from speech

General Lexical Resources

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Generation

German

Integrating Language Knowledge Resources to Extend the English Inclusion

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

The importance of precise tokenizing for deep grammars

German parsers

Improving coverage and parsing quality of a large-scale LFG for German

Gold standard

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

Grammar

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Granularity

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Grapheme to phoneme conversion

Creation and analysis of a Polish speech database for use in unit selection synthesis

Grapheme-based approach

A Grapheme-Based Approach for Accent Restoration in Gikuyu

Greedy algorithm

Tools and resources for speech synthesis arising from a Welsh TTS project

Grid

Next Generation Language Resources using Grid

Hand-crafted grammars

Improving coverage and parsing quality of a large-scale LFG for German

The importance of precise tokenizing for deep grammars

Head gesture

Tangible Objects for the Acquisition of Multimodal Interaction Patterns

Hebrew

A Computational Lexicon of Contemporary Hebrew

Hesitation

Discourse functions of duration in Mandarin: resource design and implementation

Heterogeneous resources

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

High-performance computing

Case Frame Compilation from the Web using High-Performance Computing

Hindi

Parallel Syntactic Annotation of Multiple Languages

Historical corpus

Building a historical corpus for Classical Portuguese: some technological aspects

Human associations

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

Human error

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Human-computer communication

Creation and analysis of a Polish speech database for use in unit selection synthesis

Human-machine interaction

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

ICSI Meeting Recorder Project

Part-of-Speech Tagging of Transcribed Speech

Idioms

Extraction tools for collocations and their morphosyntactic specificities

Image processing

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Inconsistencies

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Information extraction

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Constructing a Named Entity Ontology from Web Corpora

I-CAB: the Italian Content Annotation Bank

Named Entity Extraction with Conjunction Disambiguation

Linguistic and Biological Annotations of Biological Interaction Events

Ontology-based Information Extraction with SOBA

Information commons

Query Expansion on Compounds

Development of Linguistic Ontology on Natural Sciences and Technology

Information retrieval

Information structure

Annotating Information Structure in a Corpus of Spoken Danish

Corpus Annotation as a Test of a Linguistic Theory

Information theory

Evaluation of Stop Word Lists in Chinese Language

Infrastructure

Automatic Evaluation and Composition of NLP Pipelines with Web Services

A Unified Structure for Dutch Dialect Dictionary Data

Integration

GAIA: Common Framework for the Development of Speech Translation Technologies

Intelligibility

Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

Inter-annotator agreement

Towards Natural Interactive Question Answering

Interaction

Interface

Computer-aided summarisation -- what the user really wants

ECESS Inter-Module Interface Specification for Speech Synthesis

Interlanguages

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Internationalization

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

Internet

A Uniform Interface to Large-Scale Linguistic Resources

Internet search

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

ISO-DIS-24610-1

Feature-based Encoding and Querying Language Resources with Character Semantics

Japanese

Parallel Syntactic Annotation of Multiple Languages

Knowledge Engineering

Ontology Driven K-Portal Construction and K-Service Provision

Knowledge extraction

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Knowledge Modeling

A Framework to Integrate Ubiquitous Knowledge Modeling

Knowledge Portal

Ontology Driven K-Portal Construction and K-Service Provision

Knowledge representation

The LOIS Project

Knowledge Services Provision

Ontology Driven K-Portal Construction and K-Service Provision

Knowledge sharing

A new approach to syntactic annotation

Korean

Morphological annotation of Korean with Directly Maintainable Resources

Parallel Syntactic Annotation of Multiple Languages

L2S

Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints

Labelling

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Language Engineering

Ontology Driven K-Portal Construction and K-Service Provision

Language identification

Reconsidering Language Identification for Written Language Resources

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

Language interaction

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems

Language modeling

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

Language modelling

A Closer Look at Skip-gram Modelling

Language Models

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People

Language resource interoperability

Moving to dynamic computational lexicons with LeXFlow

Language resource

A new approach to syntactic annotation

Development of the First LRs for Macedonian: Current Projects

Integrating Language Knowledge Resources to Extend the English Inclusion

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

LexikoNet - a lexical database based on type and role hierarchies

On the data base of Romanian syllables and some of its quantitative and cryptographic aspects

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

Syntactic Lexicon of Polish Predicative Nouns

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Corpus Portal for Search in Monolingual Corpora

Morphological annotation of Korean with Directly Maintainable Resources

BACO - A large database of text and co-occurrences

Language resources production models

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Language service

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Language technology

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Language typology

LDC

More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities

Learning by Emergence

Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence

Legal Ontologies

A Development Tool For Multilingual Ontology-based Conceptual

Legitimate translation variation

Using collocations from comparable corpora to find translation equivalents

Lemmatiser

Open Source Corpus Analysis Tools for Malay

Lemmatization

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Lemmatizer

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Less widely digitally available languages

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Letter-to-Sound

Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints

Letter-to-sound rules

Tools and resources for speech synthesis arising from a Welsh TTS project

Levantine

Developing and Using a Pilot Dialectal Arabic Treebank

Lexica

A Unified Structure for Dutch Dialect Dictionary Data

Lexical acquisition

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

Lexical anaphora

Annotating Bridging Anaphors in Italian: in Search of Reliability

Lexical association measures

Semi-automatic Building of Swedish Collocation Lexicon
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Lexical database

Building a network of topical relations from a corpus

Natural Language Processing: A Terminological and Statistical Approach

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Proper Names and Linguistic Dynamics

Using Core Ontology for Domain Lexicon Structuring

Lexical entailment

Mixing WordNet, VerbNet and PropBank for studying verb relations

Lexical ontology

LexikoNet - a lexical database based on type and role hierarchies

Lexical resource

Merging two Ontology-based Lexical Resources

Structuring a Domain Vocabulary in a General Knowledge Environment

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

Lexical semantics

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Mixing WordNet, VerbNet and PropBank for studying verb relations
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Lexical variation

Finding representative sets of dialect words for geographical regions

Lexical workflow

Moving to dynamic computational lexicons with LeXFlow

Lexical-semantic knowledge bases

A methodology for the joint development of the Basque WordNet and Semcor

Lexicon

A Computational Lexicon of Contemporary Hebrew

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Building a network of topical relations from a corpus

Data-driven Amharic-English Bilingual Lexicon Acquisition

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Elaborating the parameterized Equivalence Class Method for Dutch

LEXADV - a multilingual semantic Lexicon for Adverbs

Lexicon Development for Varieties of Spoken Colloquial Arabic

Morphological annotation of Korean with Directly Maintainable Resources

Natural Language Processing: A Terminological and Statistical Approach

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Valency Lexicon of Czech Verbs: Alternation-Based Model

Lexicon Architecture

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Lexicon Construction

Word Knowledge Acquisition for Computational Lexicon Construction

Lexicon extension

Automated Deep Lexical Acquisition for Robust Open Texts Processing

Lexicon to Ontology Mapping

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Lexicon-grammar

Improving coverage and parsing quality of a large-scale LFG for German

The importance of precise tokenizing for deep grammars

LFG

Linguistic analysis

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Linguistic annotations

The English-Slovene ACQUIS corpus

Linguistic databases

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Linguistic ontology

Development of Linguistic Ontology on Natural Sciences and Technology

Linguistic resources

Integrated Linguistic Resources for Language Exploitation Technologies

Linguistic Resources for Speech Parsing

Linkage discovery

Dependency-Based Phrase Alignment

Linking

Merging two Ontology-based Lexical Resources

Linking of lexical databases

Exploring opportunities for Comparability and Enrichment by Linking lexical databases

Localization

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

Location-aware computing

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

Lombard speech database

LR Infrastructures and Architectures

Techno-langue: The French National Initiative for Human Language Technologies (HLT)

LR national/international projects

Techno-langue: The French National Initiative for Human Language Technologies (HLT)

LTAG

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Machine

Multilingual parallel treebanking: a lean and flexible approach

Machine learning

A Grapheme-Based Approach for Accent Restoration in Gikuyu

An Introduction to NLP-based Textual Anonymisation

Learning Database Content for Spoken Dialogue System Design

On Automatic Assignment of Verb Valency Frames in Czech

Recognizing Acronyms and their Definitions in Swedish Medical Texts

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Machine translatability

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Machine translation

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Dependency-Based Phrase Alignment

Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

Predicting MT Quality as a Function of the Source Language
CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign
METIS-II: Machine Translation for Low Resource Languages

Machine translation evaluation

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

Machine-Aided Indexing

Bilingual Machine-Aided Indexing

Machine-readable lexicons

Towards machine-readable lexicons for South African Bantu languages

Malay

Open Source Corpus Analysis Tools for Malay

Manual annotation

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Manual word-alignment

Annotation Guidelines for Czech-English Word Alignment

Mark-up tools

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

Markup

MDE

Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

Meaning

Meaning elicitation

The role of lexical resources in matching classification schemas

Medical corpus

Recognizing Acronyms and their Definitions in Swedish Medical Texts

Meeting

Gathering a corpus of multimodal computer-mediated meetings

Annotating Emotions in Meetings

Meeting browsers

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

Meeting corpora

Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus

Meeting data

Evaluation of multimodal components within CHIL: The evaluation packages and results

Memory based learning

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Merging

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Metamodel

Methodological Aspects of Semantic Annotation

Metaphor Database

A Preliminary Study for Building the Basque PropBank

Methodology

Minority and endangered languages

Morphological Tools for Six Small Uralic Languages

Mixed-lingual texts

Integrating Language Knowledge Resources to Extend the English Inclusion

Mixer

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Mobile and multimodal interaction

The pragmatic combination of different crosslingual resources

Monolingual corpora

Corpus Portal for Search in Monolingual Corpora

Morpheme

Morphological annotation of Korean with Directly Maintainable Resources

Morphological analysis

A Deep Linguistic Analysis for Cross-language Information Retrieval

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

New Approach to Frequency Dictionaries - Czech Example
Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Morphological Tools for Six Small Uralic Languages

Morphological annotation

Developing and Using a Pilot Dialectal Arabic Treebank

Morphological Lexicon

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Morphology

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Creating Tools for Morphological Analysis of Sumerian

Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Extraction tools for collocations and their morphosyntactic specificities

HAREM: An Advanced NER Evaluation Contest for Portuguese

Lexicon Development for Varieties of Spoken Colloquial Arabic

Morphological annotation of Korean with Directly Maintainable Resources

Morphosyntax

Extraction tools for collocations and their morphosyntactic specificities

Motorbike

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

Movement detection

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

MPEG-7

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Multi-Document Summarization

Impact of Question Decomposition on the Quality of Answer Summaries

MultiNet

Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

Multidocument

Multilingual Multidocument Summarization Tools and Evaluation

Multilevel database

Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

Multilingual

Multilingual annotation

Parallel Syntactic Annotation of Multiple Languages

Multilingual Corpora

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

Multilingual dabatase

A Methodology for Developing Multilingual Resources for Terminology

Multilingual information services

The pragmatic combination of different crosslingual resources

Multilingual parallel corpora

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Evaluation of multilingual text alignment systems: the ARCADE II project

Multilingual Question Answering

Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

The Multilingual Question Answering Track at CLEF

Multilinguality

Aligning Multilingual Thesauri

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

Multimedia corpus

The OSU Quake 2004 corpus of two-party situated problem-solving dialogs

Multimedia Meeting Recordings

Gathering a corpus of multimodal computer-mediated meetings

Multimodal

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems

Multimodal behavior

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Multimodal corpora

H. C. Andersen Conversation Corpus

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

Multimodal interfaces

Evaluation of multimodal components within CHIL: The evaluation packages and results

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

Multimodality

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Multiparty dialogues

Part-of-Speech Tagging of Transcribed Speech

Multiword expressions

Elaborating the parameterized Equivalence Class Method for Dutch

Using collocations from comparable corpora to find translation equivalents
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Mutation rules

Automatic Detection of Orthographic Cues for Cognate Recognition

Named Entity Ontology

Constructing a Named Entity Ontology from Web Corpora

Named Entity Recognition

Named Entity Extraction with Conjunction Disambiguation

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

HAREM: An Advanced NER Evaluation Contest for Portuguese

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Named entities

A Deep Linguistic Analysis for Cross-language Information Retrieval

Named entity annotation

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Named-Entity Tagging

Identifying Named Entities in Text Databases from the Natural History Domain

National Project

An observatory on Spoken Italian linguistic resources and descriptive standards.

Natural language access to database

From Natural Language to Databases via Ontologies

Natural language engineering

Automatic Evaluation and Composition of NLP Pipelines with Web Services

Natural Language Interfaces to GIS Databases

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Natural Language Processing (NLP)

Automatic Evaluation and Composition of NLP Pipelines with Web Services

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Development of Linguistic Ontology on Natural Sciences and Technology

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Turning a Dependency Treebank into a PSG-style Constituent Treebank

Naturalistic

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

NLTK

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

Non-machine-translatable parts

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Non-native speech

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

SINOD - Slovenian non-native speech database

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Non-words

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Noncompositionality

Normalized edit distance

Automatic Detection of Orthographic Cues for Cognate Recognition

Norwegian

FonDat1: A Speech Synthesis Corpus for Norwegian

NP coreference

NPs for Events: Experiments in Coreference Annotation

Nursing Terminologies

Features of Terms in Actual Nursing Activities

Ontology

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Feature-based Encoding and Querying Language Resources with Character Semantics

From Natural Language to Databases via Ontologies

LEXADV - a multilingual semantic Lexicon for Adverbs

Ontology-based Information Extraction with SOBA

Structuring a Domain Vocabulary in a General Knowledge Environment
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Aligning Multilingual Thesauri

The LOIS Project

Ontology acquisition

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

Ontology Development from Text

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Ontology Engineering

Ontology Driven K-Portal Construction and K-Service Provision

Ontology learning

Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

Ontology mapping

Merging two Ontology-based Lexical Resources

Ontology Modelling

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Ontology modelling for trans/cross-lingual dialogues

The pragmatic combination of different crosslingual resources

Open Source

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Open-domain Q&A evaluation

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

Open-domain free text translation

The pragmatic combination of different crosslingual resources

Optimality Theory

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

Organizational/policy issues

Techno-langue: The French National Initiative for Human Language Technologies (HLT)

Oriental

Oriental COCOSDA: Past, Present and Future

Orthographic cues

Automatic Detection of Orthographic Cues for Cognate Recognition

Orthography

FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

Pan-Chinese lexical resource

Toward a Pan-Chinese Thesaurus

Parallel corpora

Annotating COMPARA, a Grammar-aware Parallel Corpus

Building a Swedish-Turkish Parallel Corpus

Construction of a FrameNet Labeler for Swedish Text

Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

Multilingual parallel treebanking: a lean and flexible approach

The English-Slovene ACQUIS corpus

Transferring Coreference Chains through Word Alignment

Paraphrase

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Parse scoring

SParseval: Evaluation Metrics for Parsing Speech

Parser evaluation

Improving coverage and parsing quality of a large-scale LFG for German

Parsers

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Parsing

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Improving coverage and parsing quality of a large-scale LFG for German

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

Multilingual parallel treebanking: a lean and flexible approach

The importance of precise tokenizing for deep grammars

Developing and Using a Pilot Dialectal Arabic Treebank

Parsing Evaluation

Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

Part-of-speech tagging

A Deep Linguistic Analysis for Cross-language Information Retrieval

Pattern generation

A Corpus Search System Utilizing Lexical Dependency Structure

Pattern search

UAM Text Tools - a flexible NLP architecture

Pattern transformation

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

PDT

Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

Phonetic distance

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

Phonetic lexicons

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

Phonetic transcriptions

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

Phonetics

Feature-based Encoding and Querying Language Resources with Character Semantics

Phonology

FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

Phrase alignment

Dependency-Based Phrase Alignment

Phrase-based statistical machine translation

Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries

Phrase-book translation

The pragmatic combination of different crosslingual resources

Platforms

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

PMI

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

POS tagging

An Annotated Corpus Management Tool: ChaKi

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German

Polish

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Populated places

Building a historical corpus for Classical Portuguese: some technological aspects

Portuguese

Portuguese African varieties

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Prague-English Dependency Treebank

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Predicate – argument structure

Adding multi-layer semantics to the Greek Dependency Treebank

Predicate-argument

Building a resource for studying translation shifts

Predicate-argument structure

Shalmaneser - A Toolchain For Shallow Semantic Parsing

Predicative nouns

Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.

Prefix

Prepositions

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

Preprocessing

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Problem-solving language

The OSU Quake 2004 corpus of two-party situated problem-solving dialogs

Profile Creation

Language Resources for Background Gathering

Projectivity

Corpus Annotation as a Test of a Linguistic Theory

Pronoun Resolution

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

Pronunciation lexicon

SI-PRON: A Pronunciation Lexicon for Slovenian
Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Pronunciation of names

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Pronunciation variation

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Proofing tools

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications

PropBank

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Property Grammars

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Prosodic transplantation

A joint prosody evaluation of French text-to-speech synthesis systems

Prosody

A joint prosody evaluation of French text-to-speech synthesis systems

Prosody analysis

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

Protein names

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Pseudonymisation

An Introduction to NLP-based Textual Anonymisation

Python

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

Question Answering Evaluation Survey

Quality model

ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

The Evolution of an Evaluation Framework for a Text Mining System

Query expansion

Query Expansion on Compounds

Using collocations from comparable corpora to find translation equivalents

Query-interpretation

Summarizing Answers for Complicated Questions

Question

Language Resources for Background Gathering

Question Answering

An Answer Bank for Temporal Inference

Data for question answering: The case of why

Impact of Question Decomposition on the Quality of Answer Summaries

Question Answering Evaluation Survey

Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

Towards Natural Interactive Question Answering

Exploiting Multiple Semantic Resources for Answer Selection

EQueR: the French Evaluation campaign of Question-Answering Systems

The Multilingual Question Answering Track at CLEF

Summarizing Answers for Complicated Questions

Component Evaluation in a Question Answering System

Question Decomposition

Impact of Question Decomposition on the Quality of Answer Summaries

Random Indexing

Towards Holistic Summarization -- Selecting Summaries, Not Sentences

Reader Model

Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

Recording methodology

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

Regional bias

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

Regular changes in valency structure

Valency Lexicon of Czech Verbs: Alternation-Based Model

Relation Annotation

Shallow Semantic Annotation of Bulgarian

Relational database

Lexicon Development for Varieties of Spoken Colloquial Arabic

Relative ordering

ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

Resource building

The LOIS Project

Resource merging

Exploiting Multiple Semantic Resources for Answer Selection

Resource-scarce languages

A Grapheme-Based Approach for Accent Restoration in Gikuyu

Resource

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Results

EQueR: the French Evaluation campaign of Question-Answering Systems

The Multilingual Question Answering Track at CLEF

Reusability

Rhythm

Reconsidering Language Identification for Written Language Resources

Roadmap

Robust parsing

Automated Deep Lexical Acquisition for Robust Open Texts Processing

Role labelling

A Preliminary Study for Building the Basque PropBank

Romanian syllables

On the data base of Romanian syllables and some of its quantitative and cryptographic aspects

Romanized Pinyin

Act-Topic Patterns for Automatically Checking Dialogue Models

Rule patterns

Rule-based

LEXADV - a multilingual semantic Lexicon for Adverbs

Scandinavian

Search Engines

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Second language speech corpus

Securing interpretability

Feature-based Encoding and Querying Language Resources with Character Semantics

Selectional restrictions

Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

LexikoNet - a lexical database based on type and role hierarchies

Semantic

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

A Preliminary Study for Building the Basque PropBank

Shalmaneser - A Toolchain For Shallow Semantic Parsing
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

Semantic Analysis

Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence

Semantic Annotation

SALTO - A Versatile Multi-Level Annotation Tool

Shallow Semantic Annotation of Bulgarian
Discriminant-Based MRS Banking

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

I-CAB: the Italian Content Annotation Bank

Methodological Aspects of Semantic Annotation

Semantic classification

Semantic Descriptors: The Case of Reflexive Verbs

Semantic information

Merging two Ontology-based Lexical Resources

Semantic lexicons

Building Slovene WordNet

Semantic matching

The role of lexical resources in matching classification schemas

Semantic network

Structuring a Domain Vocabulary in a General Knowledge Environment

Semantic Parsing

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Shalmaneser - A Toolchain For Shallow Semantic Parsing

Semantic pattern

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Semantic relations

Structuring a Domain Vocabulary in a General Knowledge Environment

Semantic relations detection

SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

Semantic representation

Parallel Syntactic Annotation of Multiple Languages

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign

Semantic resources

Exploiting Multiple Semantic Resources for Answer Selection

Semantic role labeling

Adding multi-layer semantics to the Greek Dependency Treebank

Construction of a FrameNet Labeler for Swedish Text

Semantic roles

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Semantic Similarity

Towards Holistic Summarization -- Selecting Summaries, Not Sentences

Semantic tagging

Semantic Tag Extraction from WordNet Glosses

Semantic verb classes

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

Semantic Web

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Automatic Evaluation and Composition of NLP Pipelines with Web Services

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

The role of lexical resources in matching classification schemas

Semantically related terms

A Methodology for Developing Multilingual Resources for Terminology

Semantics

A methodology for the joint development of the Basque WordNet and Semcor

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Building a network of topical relations from a corpus

HAREM: An Advanced NER Evaluation Contest for Portuguese

Linguistic and Biological Annotations of Biological Interaction Events

The Look and Feel of a Confident Entailer

Semi-automatic annotation

KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

Sense clustering

Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory

Sense mapping

Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory

Sensitivity

An Introduction to NLP-based Textual Anonymisation

Sentence alignment

Acquis Communautaire Sentence Alignment using Support Vector Machines

Sentence tokeniser

Open Source Corpus Analysis Tools for Malay

Sentiment analysis

Semantic Tag Extraction from WordNet Glosses

Serbian WordNet

Building Slovene WordNet

Shallow analysis

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
The wraetlic NLP suite

Shallow Parsing

Similar Event Search

Language Resources for Background Gathering

SIMPLE

LEXADV - a multilingual semantic Lexicon for Adverbs

Situational Prompting

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus

Skip-gram

A Closer Look at Skip-gram Modelling

SKOS

A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs

Slovene WordNet

Building Slovene WordNet

Slovenian

SI-PRON: A Pronunciation Lexicon for Slovenian

SINOD - Slovenian non-native speech database

The English-Slovene ACQUIS corpus

Towards a Slovene Dependency Treebank

SMILES strings

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Software

GAIA: Common Framework for the Development of Speech Translation Technologies

UAM Text Tools - a flexible NLP architecture

Somali

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

Source code

GAIA: Common Framework for the Development of Speech Translation Technologies

Source quality

Predicting MT Quality as a Function of the Source Language

Spanish

Parallel Syntactic Annotation of Multiple Languages

Spanish Synthesis Corpora

Spatial ontology

Principles for annotating and reasoning with spatial information

Spatial semantics

Principles for annotating and reasoning with spatial information

Speaker recognition

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Special Terms Dictionaries

Features of Terms in Actual Nursing Activities

Specifications

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis
ECESS Inter-Module Interface Specification for Speech Synthesis

Speech

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Gathering a corpus of multimodal computer-mediated meetings

Oriental COCOSDA: Past, Present and Future

Speech and pointing gesture

H. C. Andersen Conversation Corpus

Speech Corpus

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Bilingual speech corpus in two phonetically similar languages
FonDat1: A Speech Synthesis Corpus for Norwegian

Speech corpus design

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

Speech database

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

SINOD - Slovenian non-native speech database
Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Speech feedback

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

Speech parsing

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

GAIA: Common Framework for the Development of Speech Translation Technologies

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Speech recognition

Speech synthesis

Creation and analysis of a Polish speech database for use in unit selection synthesis

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

ECESS Inter-Module Interface Specification for Speech Synthesis

FonDat1: A Speech Synthesis Corpus for Norwegian

GAIA: Common Framework for the Development of Speech Translation Technologies

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Tools and resources for speech synthesis arising from a Welsh TTS project

Speech timing

GAIA: Common Framework for the Development of Speech Translation Technologies

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Speech translation

Speech-to-Speech Translation (SST)

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

Spell checking

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications

Spelling error detection

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications

Spelling error patterns

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications

Spoken Corpora

Features of Terms in Actual Nursing Activities

Spoken dialogue systems

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
A Framework to Integrate Ubiquitous Knowledge Modeling

Spoken Dutch Corpus

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

Spoken Italian corpus

Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

Spoken Italian Linguistic Resources

An observatory on Spoken Italian linguistic resources and descriptive standards.

Spoken Language Translation (SLT)

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

Spoken language

Annotating Information Structure in a Corpus of Spoken Danish

Spoken language corpora

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Spoken metadata

Discourse functions of duration in Mandarin: resource design and implementation

Spontaneous conversation

Spontaneous speech corpus

Dependency-structure Annotation to Corpus of Spontaneous Japanese

SSML

ECESS Inter-Module Interface Specification for Speech Synthesis

Standard based archive

Building a historical corpus for Classical Portuguese: some technological aspects

Standardisation

A Unified Structure for Dutch Dialect Dictionary Data

Standardization

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

ECESS Inter-Module Interface Specification for Speech Synthesis

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

Standards for lexicons

Exploring opportunities for Comparability and Enrichment by Linking lexical databases

Standoff annotation

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Statistical Machine Translation

Creating a Large-Scale Arabic to French Statistical MachineTranslation System

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

Czech-English Word Alignment

Statistical language modeling

Linguistic features modeling based on Partial New Cache

Statistical method Languages: Amharic

Data-driven Amharic-English Bilingual Lexicon Acquisition

Statistical modeling

Evaluation of Stop Word Lists in Chinese Language

Stochastic parsing

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign

Stop word list

Evaluation of Stop Word Lists in Chinese Language

Structural event detection

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

Stylistic alignment

Subcategorization

A Large Subcategorization Lexicon for Natural Language Processing Applications

Automatic extraction of subcategorization frames for French

Sumerian

Creating Tools for Morphological Analysis of Sumerian

Summarization

Summarizing Answers for Complicated Questions

SUMO

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Support vector machines

Acquis Communautaire Sentence Alignment using Support Vector Machines

Support verbs

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Surface patterns

Surface syntactic analysis

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Surveillance application

Fear-type emotions of the SAFE Corpus: annotation issues

Syllabus Project

A Development Tool For Multilingual Ontology-based Conceptual

Synchronous corpus

Toward a Pan-Chinese Thesaurus

Syntacs

Linguistic and Biological Annotations of Biological Interaction Events

Syntactic analysis

A Deep Linguistic Analysis for Cross-language Information Retrieval

Data for question answering: The case of why

Syntactic annotation

Developing and Using a Pilot Dialectal Arabic Treebank

Parallel Syntactic Annotation of Multiple Languages

Syntactic lexicon

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

Syntactic parsing

Towards a Slovene Dependency Treebank

Syntactic structures

A new approach to syntactic annotation

Syntax

Multilingual parallel treebanking: a lean and flexible approach

The Look and Feel of a Confident Entailer

Synthesis

Spanish Synthesis Corpora

System evaluation

The Impact of Evaluation on Multilingual Information Retrieval System Development

System extension

Integrating Language Knowledge Resources to Extend the English Inclusion

Systems

EQueR: the French Evaluation campaign of Question-Answering Systems

Tagging

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Morphological annotation of Korean with Directly Maintainable Resources

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

Tagging errors detection and correction

Tagset Mapping and Statistical Training Data Cleaning-up

Tagsets mapping

Tagset Mapping and Statistical Training Data Cleaning-up

TC-STAR

ECESS Inter-Module Interface Specification for Speech Synthesis

Teaching

The MULINCO corpus and corpus platform

Techninal Term

Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

Temporal Expression Identification and Normalisation

If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”

Temporal Inference

An Answer Bank for Temporal Inference

Temporal information

Towards a terminological resource for biomedical text mining

Term associations

Term extraction

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

Term identification

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Term variation

Clustering acronyms in biomedical text for disambiguation

Terminological databases

Towards a terminological resource for biomedical text mining

Terminology

A Methodology for Developing Multilingual Resources for Terminology

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Using Core Ontology for Domain Lexicon Structuring

Terminology acquisition

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

Terminology Extraction

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Terminology recognition

Towards a terminological resource for biomedical text mining

Test suite

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Text

Gathering a corpus of multimodal computer-mediated meetings

Text alignment

Evaluation of multilingual text alignment systems: the ARCADE II project

Text Analysis

A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German

Text Classification

Bilingual Machine-Aided Indexing

Text Database

BACO - A large database of text and co-occurrences

Text mining

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Natural Language Processing: A Terminological and Statistical Approach

The Evolution of an Evaluation Framework for a Text Mining System

Text Prediction

Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People

Text-to-speech synthesis

A joint prosody evaluation of French text-to-speech synthesis systems

Textual coreference

Corpus Annotation as a Test of a Linguistic Theory

Textual entailment

Mixing WordNet, VerbNet and PropBank for studying verb relations

The Look and Feel of a Confident Entailer
Building a Large-Scale Repository of Textual Entailment Rules

Thai Search Engine

Blind Evaluation for Thai Search Engines

The Chinese Gigaword Corpus

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

Thesauri

Aligning Multilingual Thesauri

Development of Linguistic Ontology on Natural Sciences and Technology

Thesaurus browsing

A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs

Third language acquisition

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

Time

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Tools and resources for speech synthesis arising from a Welsh TTS project

Tokenisation

Tokenization

The importance of precise tokenizing for deep grammars

Tool

Act-Topic Patterns for Automatically Checking Dialogue Models

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Tool design

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

Toolkits

The wraetlic NLP suite

Tools

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

SALTO - A Versatile Multi-Level Annotation Tool

The MULINCO corpus and corpus platform

Tools for annotation

Structure, Annotation and Tools in the Basque ZT Corpus

Topic classification

Next Generation Language Resources using Grid

Training corpora

Tagset Mapping and Statistical Training Data Cleaning-up

Training in an unsupervised way

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Transcription

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Integrated Linguistic Resources for Language Exploitation Technologies

Transcription evaluation

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

Transducer

Morphological annotation of Korean with Directly Maintainable Resources

Translation

Integrated Linguistic Resources for Language Exploitation Technologies

Translation aid

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Translation equivalents

Using collocations from comparable corpora to find translation equivalents

Translation shifts

Building a resource for studying translation shifts

Translation studies

Building a resource for studying translation shifts

The MULINCO corpus and corpus platform

Translation training

A Uniform Interface to Large-Scale Linguistic Resources

Tree Edit Distance

Building a Large-Scale Repository of Textual Entailment Rules

Tree query

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Tree search

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Tree transformation

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Treebank

Linguistic Resources for Speech Parsing

Shallow Semantic Annotation of Bulgarian

Treebanking

Discriminant-Based MRS Banking

Multilingual parallel treebanking: a lean and flexible approach

Treebanks

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Towards a Slovene Dependency Treebank

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Turning a Dependency Treebank into a PSG-style Constituent Treebank

Typology of Multiword Expressions

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Typos

A Framework to Integrate Ubiquitous Knowledge Modeling

UBIQUITOUS DOMAIN MODEL

UMTS

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus

Under-resourced languages

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

Underspecification

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Unification grammar

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Unified Lexicon

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Unified Morphosyntactic Specifications

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Unit selection

Creation and analysis of a Polish speech database for use in unit selection synthesis

Tools and resources for speech synthesis arising from a Welsh TTS project

Universal databases

A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German

Unknown Word Interpretation

Unknown words

Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.

Unsupervised

Integrating Language Knowledge Resources to Extend the English Inclusion

Unsupervised classifier

Uralic languages

Morphological Tools for Six Small Uralic Languages

User centred evaluation

The Evolution of an Evaluation Framework for a Text Mining System

User-feedback

Computer-aided summarisation -- what the user really wants

Vagueness

Annotating COMPARA, a Grammar-aware Parallel Corpus

Valence dictionary

Romanian Valence Dictionary in XML Format

Valency

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Valency frames

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Validation of language resources

Czech-English Word Alignment

Vallex

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Verb

On Automatic Assignment of Verb Valency Frames in Czech

Verb classes

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

VerbNet

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Verbal lexicon

Automatic extraction of subcategorization frames for French

Verbal predicates

Semantic Descriptors: The Case of Reflexive Verbs

Verbo-nominal collocations

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Verbs

Vietnamese

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Web Data