Keywords

A / B / C / D / E / F / G / H / I / J / K / L / M / N / O / P / Q / R / S / T / U / V / W / X / Z


 
Keyword Title

"se-constructions"

Semantic categorization of Spanish se-constructions 

(bi)national action

Linguistic annotation of the Spoken Dutch Corpus: If we had to do it all over again ... 

3D images

AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition 

A

 

Aboutness

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Abridgement

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Abstract nouns

Extraction of Hyperonymy of Adjectives from Large Corpora by Using the Neural Network Model

Accessibility

The Centre for Dutch Language and Speech Technology (TST Centre) 

Accessing language resources

Architecture for Distributed Language Resource Management and Archiving

Acoustic adaptation

Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News

Acoustic model

Comparative Evaluations in the Domain of Automatic Speech Recognition 

Acoustic-phonetic decoding

Evaluation of a Spoken Phonetic Databse in Basque Language

Acquisition

 

Acquisition and Annotation of Slovenian Broadcast News Database

Automatic Acquisition of Sense Examples using ExRetriever

Bootstrapping a database of German multi-word expressions

Corpus based Enrichment of GermaNet Verb Frames

Acronyms

Automatic Building Gazetteers of Co-referring Named Entities

Acts

From Acts and Topics to Transactions and Dialogue Smoothness

Adaptation

 

Language Model Adaptation for Statistical Machine Translation based on Information Retrieval

Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News

Adapting NLP tools

Experiments on Building Language Resources for Multi-Modal Dialogue Systems

Adjectives

Extraction of Hyperonymy of Adjectives from Large Corpora by Using the Neural Network Model

Advanced Front-End

Development and Integration of the LDA-Toolkit into the COST249 SpeechDat (II) SIG Reference Recognizer

Adverb attachment in parse trees

A Corpus-based Syntactic Lexicon for Adverbs

Adverbs

A Corpus-based Syntactic Lexicon for Adverbs

Affective lexicon

WordNet-Affect: an Affective Extension of WordNet

African languages

The African Speech Technology Project: An Assessment

WALA: a multilingual resource repository for West African Languages

Afrikaans

A Spoken Afrikaans Language Resource Designed for Research on Pronunciation Variations

Agent platforms

A2Q: an agent-based architecure for multilingual Q&A

Aliases

Automatic Building Gazetteers of Co-referring Named Entities

Aligned multilingual corpora

Multilingual Corpus-based Approach to the Resolution of English -ing

Alignment

Annotation of anaphoric expressions in an aligned bilingual corpus

Automatic extraction of syntactic semantic patterns for multilingual resources

Construction of a Bilingual Arabic-Spanish Lexicon of Verbs Based on a Parallel Corpus

Ambiguity

A Search Tool for Corpora with Positional Tagsets and Ambiguities

American English

The American National Corpus First Release

Analogy

Using Paradigm Tables to Generate New Utterances Similar to those Existing in Linguistic Resources

Analysis

Generating an Arabic full-form lexicon for bidirectional morphology lookup

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

Analysis representation

A Model of Semantic Representations Analysis For Chinese Sentences

Analysis-synthesis

Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification

Anaphor

Information Extraction from Hindi Texts

Anaphora

Towards a Reference Annotation Framework

Anaphora resolution

A General-Purpose  off-the-shelf Anaphora Resolution Module: Implementation and Preliminary Evaluation

A Public Reference Implementation of the RAP Anaphora Resolution Algorithm

Annotation of anaphoric expressions in an aligned bilingual corpus

Can Anaphoric Definite Descriptions be Replaced by Pronouns?

Online Evaluation of Coreference Resolution

Pronominal Anaphora Resolution for Unrestricted Text

Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames

Anchor text

Exploiting Anchor Text as a Lexical Resource

Annotated Acoustic Databases

The African Speech Technology Project: An Assessment

Annotation

Abstracting a Dialogue Act Tagset for Meeting Processing

An Annotated German-Language Medical Text Corpus as Language Resource

An Annotation Scheme for Information Status in Dialogue

An XML Representation for Annotated Handwriting Datasets for Online Handwriting Recognition

Annotating a corpus for building a domain-specific knowledge base

Annotating Noun Argument Structure for NomBank

ANNOTATION OF COREFERENCE RELATIONS AMONG LINGUISTIC EXPRESSIONS AND IMAGES IN BIOLOGICAL ARTICLES

Automatic audio and manual transcripts alignment, time-code transfer and selection of exact transcripts

Building and Using a Corpus of Shallow Dialog Annotated Meetings

Callisto: A Configurable Annotation Workbench

Co-reference in Japanese Task-oriented Dialogues: A Contribution to the Development of Language-specific and Language-general Annotation Schemes and Resources

Development of Slovenian Broadcast News Speech Database

Evaluation of Transcription and Annotation Tools for a Multi-modal, Multi-party Dialogue Corpus

Experiences in Collection of Handwriting Data for Online Handwriting Recognition in Indic Scripts

Exploiting Coreference Annotations for Text-to-Hypertext Conversion

Image-Language Multimodal Corpora: Needs, Lacunae and an AI Synergy for Annotation

Online Evaluation of Coreference Resolution

OntoTag's Linguistic Ontologies: Enhancing Higher Level and Semantic Web Annotations

Phonological Treebanks - Issues in Generation and Application

Resources for Place Name Analysis

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

Towards Intelligent Written Cultural Heritage Processing - Lexical Processing

Annotation Editor

A Suite of Tools for Marking Up Textual Data for Temporal Text Mining Scenarios

Infrastructure for Collaborative Annotation of Speech

Annotation graphs

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Annotation infrastructure

A Multi-Modal Documentation System for Warao

Annotation Tools for Large-Scale Corpus Development: Using AGTK at the Linguistic Data Consortium

Annotation model

Annotation Multi-media / Multi-modal Resources with ELAN

Annotation schema

Towards a Reference Annotation Framework

Annotation scheme

An Annotation Scheme for a Rhetorical Analysis of Biology Articles

Annotation tool

Abar-Hitz: An Annotation Tool for the Basque Dependency Treebank

Orthographic and Phonetic Annotation of Very Large Czech Corpora with Quality Assessment

Annotation tools

Towards General-Purpose Annotation Tools – How far are we today?

Annotation vocabulary

Infrastructure for Collaborative Annotation of Speech

Answer Patterns

Multilingual Pattern Libraries for Question Answering: a Case Study for Definition Questions

Application design

Functional Requirements for an Interlinear Text Editor

Application of a deep grammar

Utilization of Multiple Language Resources for Robust Grammar-Based Tense and Aspect Classification

Arabic

Generating an Arabic full-form lexicon for bidirectional morphology lookup

OrienTel - Telephony Databases Across Northern Africa and the Middle East

Arabic Processing

Construction of a Bilingual Arabic-Spanish Lexicon of Verbs Based on a Parallel Corpus

Archiving language resources

Architecture for Distributed Language Resource Management and Archiving

Argument

Annotating Noun Argument Structure for NomBank

Argument structure

Towards a Dynamic Lexicon: Predicting the Syntactic Argument Structure of Complex Verbs

Argument--adjunct distinction

Automatic Methods to Supplement Broad-Coverage Subcategorization Lexicons

Argumentation

An argumentative annotation schema for meeting discussions

Artificial Neural Network (ANN)

Comparative Evaluations in the Domain of Automatic Speech Recognition 

Asia

Collection of SLR in the Asian-Pacific area

ASR

OrienTel - Telephony Databases Across Northern Africa and the Middle East

Associative concept dictionary

Automatic Extraction of Hyponyms from Japanese Newspapers Using Lexico-syntactic Patterns

Atlas

Callisto: A Configurable Annotation Workbench

Attribute-value pairs

COMPARATIVE EVALUATION OF A STOCHASTIC PARSER ON SEMANTIC AND SYNTACTIC-SEMANTIC LABELS

Audio library

SpeechRecorder - a Universal Platform Independent Multi-Channel Audio Recording Software

Audio segmentation

Development of Slovenian Broadcast News Speech Database

Audio-Visual Automatic Speech Recognition

AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition

Automated acquisition of IE patterns

Categorizing Web Pages as a Preprocessing Step for Information Extraction

Automated evaluation

Automated Morphological Segmentation and Evaluation

Automated methods

Calibrating Resource-light Automatic MT Evaluation: A Cheap Approach to Ranking MT Systems by the Usability of their Output

Automatic acquisition

Discarding noise in an automatically acquired lexicon of support verb constructions

Automatic and manual evaluation

Comparison of some automatic and manual methods for summary evaluation based on the Text Summarization Challenge 2

Automatic annotation

From Acts and Topics to Transactions and Dialogue Smoothness

Automatic annotations

Use and Evaluation of Prosodic Annotations in Dutch

Automatic aquisition

Morphology Based Automatic Acquisition of Large-coverage Lexica

Automatic conversion

Converting Treebank Annotations to Language Neutral Syntax

Automatic Corpus Acquisition

The Effect of Bias on an Automatically-built Word Sense Corpus

Automatic disambiguation

Semantic categorization of Spanish se-constructions

Automatic discovery

Automatic Extraction of Hyponyms from Japanese Newspapers Using Lexico-syntactic Patterns

Automatic error detection

The overview of the SST speech corpus of Japanese learner English and evaluation through the experiment on automatic detection of learners' errors

Automatic evaluation

Incremental Methods to Select Test Sentences for Evaluating Translation Ability

Interpreting Bleu/NIST Scores: How Much Improvement do We Need to Have a Better System?

Modelling Legitimate Translation Variation for Automatic Evaluation of MT Quality

Probabilistic Detection of Context-Sensitive Spelling Errors

Re-using high-quality resources for continued evaluation of automated summarization systems

Automatic extraction

 

Automatic extraction of syntactic semantic patterns for multilingual resources

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Automatic extraction of concepts

A complete understanding speech system based on semantic concepts

Automatic indexing

Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News

Automatic learning

An Automatic Method for Constructing Domain-Specific Ontology Resources

Automatic lexical acquisition

Automatic Acquisition of Paradigmatic Relations using Iterated Co-occurrences

Automatic parsing

Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation

Automatic Phone Segmentation

Automatic Phonemic Labeling and Segmentation of Spoken Dutch

Automatic phonetic transcription

Improving Automatic Phonetic Transcription of Spontaneous Speech through Variant-Based Pronunciation Variation Modelling

Automatic Phonetic Transcriptions

Automatic Phonemic Labeling and Segmentation of Spoken Dutch

Automatic scoring

Application of the BLEU Method for Evaluating Free-text Answers in an E-learning Environment

Automatic segmentation

MAUS Goes Iterative

Automatic Speech Recognition

Creation and Validation of Large Lexica for Speech-to-Speech Translation Purposes

Development and Integration of the LDA-Toolkit into the COST249 SpeechDat (II) SIG Reference Recognizer

Issues in Annotation of the Czech Spontaneous Speech Corpus in the MALACH Project

Automatic Speech Recognition (ASR)

Comparative Evaluations in the Domain of Automatic Speech Recognition 

Speech recognition simulation and its application for Wizard of Oz experiments

automatic summarization

 

A comparison of summarisation methods based on term specificity estimation

Comparison of some automatic and manual methods for summary evaluation based on the Text Summarization Challenge 2

Re-using high-quality resources for continued evaluation of automated summarization systems

automatic temporal annotation

A Framework for Temporal Resolution

Automatic term extraction

A practical competition of different filters used in automatic term extraction

Automatic Text Processing

 

Development of Bilingual Domain-Specific Ontology for Automatic Conceptual Indexing

Development of Ontologies with Minimal Set of Conceptual Relations

Automating

Automated Morphological Segmentation and Evaluation

Availability

Multifunctional Computational Lexicon of Contemporary Portuguese: An Available Resource for Multitype Applications

Providing on-line access to Portuguese language resources: corpora and lexicons

AVTTS

EVALUATION OF A SPEECH CUER: FROM MOTION CAPTURE TO A CONCATENATIVE TEXT-TO-CUED SPEECH SYSTEM

B

 

Basic language resource

The Language Belongs to the People!

Basque lanaguage

Evaluation of a Spoken Phonetic Databse in Basque Language.

Bayesian Information Criterion

Enriching a Thai Lexical Database with Selectional Preferences

Bayesian networks

A Bayesian Model for Shallow Syntactic Parsing of Natural Language Texts

Acquiring Bayesian Networks from Text

Bayesian Semantics Incorporation to Web Content for Natural Language Information Retrieval

Learning to predict Pitch Accents using Bayesian Belief Networks for Greek Language

Beamforming

Evaluation of Microphone Array Front-Ends for ASR - an Extension of the AURORA Framework

Benchmarking

Benchmarking ontology tools. A case study for the WebODE platform

Bengali language evaluation

A Framework for Evaluating the Suitability of Non-English Corpora for Language Engineering

Bibliography

The GENOMA-KB Platform: Queries Over Integrated Linguistic Resources

Bilingual

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

Bilingual corpus

A Comparative Study on Human Communication Behaviors and Linguistic Characteristics for Speech-to-Speech Translation

Bilingual terminology extraction

French-English multi-word term alignment based on lexical context analysis

Bilingualism

Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News

Biology texts

An Annotation Scheme for a Rhetorical Analysis of Biology Articles

Biomedicine

PBIE: A DATA PREPARATION TOOLKIT toward DEVELOPING a PARSING-BASED INFORMATION EXTRACTION SYSTEM

Bipartite graph

Grouping Synonymous Sentences from a Parallel Corpus

Blackbox

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

BLARK

NEMLAR - An Arabic Language Resources project

The new Dutch-Flemish HLT Programme: a concerted effort to stimulate the HLT sector

BLEU

Application of the BLEU Method for Evaluating Free-text Answers in an E-learning Environment

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

Interpreting Bleu/NIST Scores: How Much Improvement do We Need to Have a Better System?

Boosting

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Bootstrapping

Automatic Language-Independent Induction of Gazetteer Lists

Interpreting Bleu/NIST Scores: How Much Improvement do We Need to Have a Better System?

The Effect of Bias on an Automatically-built Word Sense Corpus

Border crossing domain

Developing Language Resources for a Transnational Digital Government System

Brazilian Portuguese

The Lácio-Web: Corpora and Tools to advance Brazilian Portuguese Language Investigations and Computational Linguistic Tools

Broadcast news

Acquisition and Annotation of Slovenian Broadcast News Database

Collection and Evaluation of Broadcast News Data for Arabic

Development of Slovenian Broadcast News Speech Database

Network of Data Centres (NetDC): BNSC – An Arabic Broadcast News Speech Corpus

The COST278 pan-European Broadcast News Database

The ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News

C

 

Canonical models

Rethinking Reusable Resources

Car

AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition

Car environment

Creation and Assessment of Korean Speech and Noise DB in Car Environment

Evaluation of Microphone Array Front-Ends for ASR - an Extension of the AURORA Framework

Card-files

Towards Intelligent Written Cultural Heritage Processing - Lexical Processing

CART

Learning to predict Pitch Accents using Bayesian Belief Networks for Greek Language

Case frames

Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames

Case roles

A Syntactically Annotated Corpus of Tibetan

Causal Knowledge acquisition

Acquiring Bayesian Networks from Text

Causal relationships

Acquiring Bayesian Networks from Text

Cellphone

The American English SALA-II Data Collection

Central and Eastern European languages

MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

CGN database

Use and Evaluation of Prosodic Annotations in Dutch

Character-level n-gram modeling

N-Gram Language Modeling for Robust Multi-Lingual Document Classification

Charniak's parser

Using the Penn Treebank to Evaluate Non-Treebank Parsers

Chat

A Chatbot as a Novel Corpus Visualization Tool

CHILDES

Adding Syntactic Annotations to Transcripts of Parent-Child Dialogs

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Children

Development of New Telephone Speech Databases for French: The NEOLOGOS Project

Children' speech

You stupid tin box' - children interacting with the AIBO robot: A cross-linguistic emotional speech corpus

Chinese

Korean-Chinese-Japanese Multilingual Wordnet with Shared Semantic Hierarchy

Class n-grams

A Galician Textual Corpus for Morphosyntactic Tagging with Application to Text-to-Speech Synthesis

Classification

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Classroom discourse

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

CLEF

Evaluation of Cross-Language Information Retrieval Using the Domain-Specific GIRT Data as Parallel German-English Corpus

CLIPS

Using Semantic Language Resources to Support Textual Inference for Question Answering

CLIR

EuroWordNet as a Resource for Cross-language Information Retrieval

Cluster search method

The Bilingual Web Dictionary on Demand

Clustering

Cluster Analysis and Classification of Named Entities

Highlighting latent structure in documents

Unsupervised Text Mining for Ontology Extraction: An Evaluation of Statistical Measures

Cognate

Semi-Automatic Derivation of a French Lexicon from CLIPS

Cognitive Metaphor

Metaphors in Wordnets: from Theory to Practice

Collaboration

An Emerging Transcontinental Collaborative Research and Education Agenda in Human Language Technologies

Collaborative annotation

Collaborative Annotation of Sign Language Data with Peer-to-Peer Technology

Infrastructure for Collaborative Annotation of Speech

Collaborative Commentary

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Collaborative Commentary: Opening Up Spoken Language Databases

Collocation

Improving Collocation Extraction for High Frequency Words

Collocation extraction

Collocation Extraction Using Web Statistics

Using the Web as a Corpus for the Syntactic-Based Collocation Identification

Collocations

Automatic Acquisition of Paradigmatic Relations using Iterated Co-occurrences

Bootstrapping a database of German multi-word expressions

Enriching the Spanish EuroWordNet by Collocations

Identifying Morphosyntactic Preferences in Collocations

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Colloquial words

Issues in Annotation of the Czech Spontaneous Speech Corpus in the MALACH Project

Common words

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

Comparable corpora

Constructing Word-Sense Association Networks from Bilingual Dictionary and Comparable Corpora

French-English multi-word term alignment based on lexical context analysis

Making Monolingual Corpora Comparable: a Case Study of Bulgarian and Croatian

Comparison

An Analysis of the Relative Difficulty of Reuters-21578 Subsets

Comparison between memory-based learning and maximum entropy modeling

Memory-based Classification of Proper Names in Norwegian

Complex nominals

Representing Italian Complex Nominals: a Pilot Study

Complex predicates

Extending WordNets to Implicit Information

Compound word lexicon

Automatic Generation of Compound Word Lexicon for Hindi Speech Synthesis

Compounds

Finding the Correct Interpretation of Swedish Compounds a Statistical Approach

Computational lexicography

Tools for Upgrading Printed Dictionaries by Means of Corpus-based Lexical Acquisition

Computational Lexicon

A Corpus-based Syntactic Lexicon for Adverbs

Computational morphological analyser

Software Tools for Morphological Tagging of Zulu Corpora and Lexicon Development

Computational morphology

Enlarging the Croatian Morphological Lexicon by Automatic Lexical Acquisition from Raw Corpora

SMOR: A German Computational Morphology Covering Derivation, Composition, and Inflection

Computational semantics

Some Meaning Procedures of Ontological Semantics

Computational Terminology

Using cooccurrence statistics and the web to discover synonyms in a technical language

Computer assisted language learning

ALLES: Integrating NLP in ICALL Applications

Computer vision

Framework for data-driven video-realistic audio-visual speech-synthesis

Concept annotation

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

Concept creation

Concept Creation in Lexical Ontologies

Concept dictionaries

Automatic Keyword Extraction from Spoken Text. A Comparison of two Lexical Resources: the EDR and WordNet

Concept modelling

Principles of a system for terminological concept modelling

Concept space

A Multi-Modal Documentation System for Warao

Concept system

Principles of a system for terminological concept modelling

Conceptual graph bank

Building a Conceptual GraphBank for Chinese Language

Conceptual taxonomy

Exploring Balkanet Shared Ontology for Multilingual Conceptual Indexing

Concordancer

A Search Tool for Corpora with Positional Tagsets and Ambiguities

Confidence estimation

Training a Sentence-Level Machine Translation Confidence Measure

Confidence intervals

Interpreting Bleu/NIST Scores: How Much Improvement do We Need to Have a Better System?

Confidence measures

An Efficient Word Confidence Measure Using Likelihood Ratio Scores

Connector usage

Connector Usage in the English Essay Writing of Japanese EFL Learners

Consistency

Consistent Storage of Metadata in Inference Lexica: The MetaLex Approach

Constraint grammar

A Named Entity Recognizer for Danish

Constraint-based grammars

A Lexicon Module for a Grammar Development Environment

Content Interoperability

Content Interoperability of Lexical Resources: Open Issues and “MILE” Perspectives

Content representation

Standardization in Multimodal Content Representation: Some Methodological Issues

Content-based indexing

Exploring Balkanet Shared Ontology for Multilingual Conceptual Indexing

Context

Generating Coreferential Descriptions from a Structured Model of the Context

Context-dependent rules

A Graphical Tool for Handling Rule Grammars in Java Speech Grammar Format

Context-sensitive spelling errors

Probabilistic Detection of Context-Sensitive Spelling Errors

Context-sensitive understanding capability

The French MEDIA/EVALDA project: the evaluation of the understanding capability of Spoken Language Dialogue Systems

Continuous Speech Recognition

Development of Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish

Contrastive topic

Annotators' Agreement: The Case of Topic-Focus Articulation

Controlled Language

Evaluation and Adaptation of a Specialised Language Checking Tool for Non-specialised Machine Translation and Non-expert MT Users for Multi-lingual Telecooperation

Controlled language

Tone-of-Voice and Controlled Language Techniques

Conversation

A Chatbot as a Novel Corpus Visualization Tool

Evaluating Conversation with Hans Christian Andersen

Usability Evaluation of Multimodal and Domain-Oriented Spoken Language Dialogue Systems

Conversation analysis

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Conversational speech

Speech & Expression - The Value of a Longitudinal Corpus

Conversational telephone speech

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

Conversion

Bypassing Greeklish!

Converstion of dependency structures to phrase structures

Mapping Dependency Structures to Phrase Structures and the Automatic Acquisition of Mapping Rules

Co-occurrences

Automatic Acquisition of Paradigmatic Relations using Iterated Co-occurrences

Core Lexicon

Distributional Consistency: As a General Method for Defining a Core Lexicon

Coreference

Computing Reliability for Coreference Annotation

Generating Coreferential Descriptions from a Structured Model of the Context

Towards a Reference Annotation Framework

Automatic Building Gazetteers of Co-referring Named Entities

Co-reference in Japanese Task-oriented Dialogues: A Contribution to the Development of Language-specific and Language-general Annotation Schemes and Resources

Coreference Hypertext

Exploiting Coreference Annotations for Text-to-Hypertext Conversion

Coreference Resolution

Evaluating Name-Matching for Coreference Resolution

Online Evaluation of Coreference Resolution

Corpora

A powerful and versatile XML format for representing role-semantic annotation

Annotating Noun Argument Structure for NomBank

Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan Guatemala)

Computing Reliability for Coreference Annotation

Evaluation and Adaptation of the Celex Dutch Morphological Database

Integration of Russian Language Resources

Providing on-line access to Portuguese language resources: corpora and lexicons

Rethinking readability of digital editions - the case of the AAC’s "Digital Brenner"

The AAC [Austrian Academy Corpus] An Enterprise to Develop Large Electronic Text Corpora

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

The Corpógrafo – a Web-based environment for corpora research

The Cross-Breeding of Dictionaries

Corpora associated tools

The Lácio-Web: Corpora and Tools to advance Brazilian Portuguese Language Investigations and Computational Linguistic Tools

Corpora creation

The CLaRK System:  XML-based Corpora Development System for Rapid Prototyping

Corpora tools

Querying both time-aligned and hierarchical corpora with NXT Search

Corporate Voice

Tone-of-Voice and Controlled Language Techniques

Corpus

A Chatbot as a Novel Corpus Visualization Tool

A Search Tool for Corpora with Positional Tagsets and Ambiguities

A Syntactically Annotated Corpus of Tibetan

Building a Paraphrase Corpus for Speech Translation

Corpus based Enrichment of GermaNet Verb Frames

Development of a Corpus Workbench for the METU Turkish Corpus

Enlarging the Croatian Morphological Lexicon by Automatic Lexical Acquisition from Raw Corpora

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

Evaluation of Transcription and Annotation Tools for a Multi-modal, Multi-party Dialogue Corpus

Multifunctional Computational Lexicon of Contemporary Portuguese: An Available Resource for Multitype Applications

Open Collaborative Development of the Thai Language Resources for Natural Language Processing

Parsing Ungrammatical Input: An Evaluation Procedure

Sejong Korean Corpora in the Making

Speech & Expression - The Value of a Longitudinal Corpus

Text Corpora, Local Grammars and Prediction

The American National Corpus First Release

The GENOMA-KB Platform: Queries Over Integrated Linguistic Resources

The GENOMA-KB project: towards the integration of concepts, terms, textual corpora and entities

The Lancaster Corpus of Mandarin Chinese: A Corpus for Monolingual and Contrastive Language Study

Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames

Word Association Norms as a Unique Supplement of Traditional Language Resources

Corpus analysis

Why do you ignore me? - Proof that not all direct speech is bad

Corpus annotation

Acquiring Reusable Multilingual Phonotactic Resources

An Annotated Corpus of Tutorial Dialogs on Mathematical Theorem Proving

Annotators' Agreement: The Case of Topic-Focus Articulation

Exploiting Semantic Web Technologies for Intelligent Access to Historical Documents

Issues in Annotation of the Czech Spontaneous Speech Corpus in the MALACH Project

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Semi-automatic Syntactic and Semantic Corpus Annotation with a Deep Parser

The American National Corpus First Release

The Penn Discourse Treebank

Toward an Annotation Software for Video of Sign Language, Including Image Processing Tools and Signing Space Modelling

Corpus annotation tool

Retrieving Annotated Corpora for Corpus Annotation

Corpus based method

Automatic Extraction of Hyponyms from Japanese Newspapers Using Lexico-syntactic Patterns

Corpus comparison

Detection of Domain Specific Terminology Using Corpora Comparison

Corpus conversion

Automatic transformation of phrase treebanks to dependency trees

Corpus creation

Exploiting Semantic Web Technologies for Intelligent Access to Historical Documents

Corpus design

Issues in Corpus Development for Multi-party Multi-modal Task-oriented Dialogue

Corpus Development

Creation of a Doctor-Patient Dialogue Corpus Using Standardized Patients

Corpus encoding

Term Translations in Parallel Corpora: Discovery and Consistency Check

Corpus exploration

Putting the Dutch PAROLE Corpus to Work

Corpus for Mexican Spanish

VOXMEX Speech Database: Design of a Phonetically Balanced Corpus

Corpus investigation

A New Approach to the Corpus-based Statistical Investigation of Hungarian Multi-word Lexemes

Corpus linguistics

Annotating a corpus for building a domain-specific knowledge base

MiniCors and Cast3LB: Two Semantically Tagged Spanish Corpora

Corpus Production

Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus

Corpus profiling

A Framework for Evaluating the Suitability of Non-English Corpora for Language Engineering

Corpus query

Querying both time-aligned and hierarchical corpora with NXT Search

Corpus statistics

Morphology Based Automatic Acquisition of Large-coverage Lexica

Corpus tools

The Corpógrafo – a Web-based environment for corpora research

Corpus, acoustic

An Acoustic Corpus Contemplating Regional Variation for Studies of European Portuguese Nasals

Corpus-based Analysis of Interlanguage

Connector Usage in the English Essay Writing of Japanese EFL Learners

Corpus-based learning

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

CQP

A Search Tool for Corpora with Positional Tagsets and Ambiguities

Creation of Basic Language Resources

A Language Resources Infrastructure for Bulgarian

Creation of LR

Linguistic annotation of the Spoken Dutch Corpus: If we had to do it all over again ...

Creativity

Concept Creation in Lexical Ontologies

Enriching WordNet Via Generative Metonymy and Creative Polysemy

Croatian

Enlarging the Croatian Morphological Lexicon by Automatic Lexical Acquisition from Raw Corpora

Cross Validation

Comparative Evaluations in the Domain of Automatic Speech Recognition 

Cross-channel

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

Cross-language comparison

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

Cross-Language Information Retrieval

Development of Bilingual Domain-Specific Ontology for Automatic Conceptual Indexing

The Future of Evaluation for Cross-Language Information Retrieval Systems

Cross-language retrieval

Evaluation of Cross-Language Information Retrieval Using the Domain-Specific GIRT Data as Parallel German-English Corpus

Cross-level observations

Querying both time-aligned and hierarchical corpora with NXT Search

Cross-Lingual

CrossTowns: Automatically Generated Phonetic Lexicons of Cross-Lingual Pronunciation Variants of European City Names

Crosslingual document classification

Cross-effective cross-lingual document classification

Cross-lingual information retrieval

EuroWordNet as a Resource for Cross-language Information Retrieval

Evaluation Resources for Concept-based Cross-Lingual Information Retrieval in the Medical Domain

Crosslingual information transfer

Exploring Portability of Syntactic Information from English to Basque

Crosslingual retrieval

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Cross-lingual speech recognition

The COST 278 MASPER initiative - crosslingual speech recognition with large telephone databases

Crosslingual summarization

MEAD - A Platform for Multidocument Multilingual Text Summarization

Cross-linguistics

You stupid tin box' - children interacting with the AIBO robot: A cross-linguistic emotional speech corpus.

Cross-system comparison

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

CrossTowns

CrossTowns: Automatically Generated Phonetic Lexicons of Cross-Lingual Pronunciation Variants of European City Names

Cued speech

EVALUATION OF A SPEECH CUER: FROM MOTION CAPTURE TO A CONCATENATIVE TEXT-TO-CUED SPEECH SYSTEM

Cyrrilic

Russian Information Retrieval Evaluation Seminar

Czech

Issues in Annotation of the Czech Spontaneous Speech Corpus in the MALACH Project

The Design of Czech Language Formal Listening Tests for the Evaluation of TTS Systems

Czech corpus

Semantic Annotating of Czech Corpus via WSD

Czech National Corpus

The Design of Czech Language Formal Listening Tests for the Evaluation of TTS Systems

D

 

DAML+OIL

Ontology Evaluation Functionalities of RDF(S), DAML+OIL, and OWL Parsers and Ontology Platforms

Danish language technology

A Named Entity Recognizer for Danish

Data architecture

Co-reference in Japanese Task-oriented Dialogues: A Contribution to the Development of Language-specific and Language-general Annotation Schemes and Resources

Data categories

A Global Data Category Registry for Interoperable Language Resources

Towards a Reference Annotation Framework

Data category registry

A Global Data Category Registry for Interoperable Language Resources

Data centers

A Progress Report from the Linguistic Data Consortium: Recent Activities in Resource Creation and Distribution and the Development of Tools and Standards

Linguistic Resources for Effective, Affordable, Reusable Speech-to-Text

Data collection

 

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Current Projects in Languages of Military Interest at the Defense Language Institute

SALA II across the finish line: a large collection of mobile telephone speech databases from North and Latin America completed

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

Data distribution

Current Projects in Languages of Military Interest at the Defense Language Institute

Data formats

Concept-based queries: Combining and Reusing Linguistic Corpus Formats and Query Languages

Data labeling

COMPARATIVE EVALUATION OF A STOCHASTIC PARSER ON SEMANTIC AND SYNTACTIC-SEMANTIC LABELS

Data meta-models

A Global Data Category Registry for Interoperable Language Resources

Data Mining

A Suite of Tools for Marking Up Textual Data for Temporal Text Mining Scenarios

CHeM: A System for the Automatic Analysis of e-mails in the Restoration and Conservation Domain

Exploiting Anchor Text as a Lexical Resource

Data repository

An Information Repository Model for Advanced Question Answering Systems

Data reuse

Rethinking Reusable Resources

Database

Bootstrapping a database of German multi-word expressions

Integration of Russian Language Resources

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Database annotation

Orthographic and Phonetic Annotation of Very Large Czech Corpora with Quality Assessment

Databases

A Large-Scale Resource for Storing and Recognizing Technical Terminology

Generating Coreferential Descriptions from a Structured Model of the Context

Methods of digital access for legal language documentation

The SPARTACUS-Database: a Spanish Sentence Database for Offline Handwriting Recognition

Data-driven adaptation

A data-driven adaptation of prosody in a multilingual TTS

Dataset profiling

Frequent Term Distribution Measures for Dataset Profiling

Datasets

An XML Representation for Annotated Handwriting Datasets for Online Handwriting Recognition

Experiences in Collection of Handwriting Data for Online Handwriting Recognition in Indic Scripts

DBT

Computational Lexicography and Carlo Emilio Gadda, Principe dell'Analisi e Duca della Buona Cognizione

Linguistic Miner: an Italian Linguistic Knowledge System

Declination phenomenon

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Deep

Semi-automatic Syntactic and Semantic Corpus Annotation with a Deep Parser

Deep and Shallow NLP Processing

The DeepThought Core Architecture Framework

Deep Linguistic Processing

Road-testing the English Resource Grammar over the British National Corpus

Deep NLP

Lexical Entry Templates for Robust Deep Parsing

Deep written language resources

Creating multi-purpose linguistic resources for Modern Greek: a deep Modern Greek Grammar

Definite descriptions

Can Anaphoric Definite Descriptions be Replaced by Pronouns?

Definition

Identifying Definitions in Text Collections for Question Answering

Definition Knowledge

Identifying Definitions in Text Collections for Question Answering

Depdendency structures

Mapping Dependency Structures to Phrase Structures and the Automatic Acquisition of Mapping Rules

Dependency annotated corpus

Phrase-Based Dependency Evaluation of a Japanese Parser

Dependency Parsing

Exploiting Language Resources for Semantic Web Annotations

Dependency trees

Automatic transformation of phrase treebanks to dependency trees

Derivation

The Core of the Czech Derivational Dictionary

Derivational dictionary

The Core of the Czech Derivational Dictionary

Derivational morphology

Derivational Relations in Flectional Languages - Czech Case

Design process

User Query Analysis for the Specification and Evaluation of a Dialogue Processing and Retrieval System

Devanagaric

Experiences in Collection of Handwriting Data for Online Handwriting Recognition in Indic Scripts

Development of lexical resources

Combining Heterogeneous Lexical Resources

DFSA

Automated Morphological Segmentation and Evaluation

Diachronic corpus

A natural language approach to information management: tracking scientific advances through the structure of words

Diachronic language database

The Integrated Language Database of 8th - 21st-Century Dutch

Dialect adaptation

Cypriot Speech Database: Data Collection and Greek to Cypriot Dialect Adaptation

Dialog Systems

Framework for data-driven video-realistic audio-visual speech-synthesis

Dialogue

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Evaluation of Transcription and Annotation Tools for a Multi-modal, Multi-party Dialogue Corpus

Dialogue Acts

Abstracting a Dialogue Act Tagset for Meeting Processing

An argumentative annotation schema for meeting discussions

Human dialogue modelling using annotated corpora

Dialogue annotation

An argumentative annotation schema for meeting discussions

Dialogue corpora collection

An Environment for Dialogue Corpora Collection (ENDIACC)

Dialogue Models

The African Speech Technology Project: An Assessment

Dialogue Processing

Building and Using a Corpus of Shallow Dialog Annotated Meetings

Human dialogue modelling using annotated corpora

Dialogue smoothness

From Acts and Topics to Transactions and Dialogue Smoothness

Dialogue system

Evaluation of a Multimodal Dialogue System for Small-screen Devices

Dialogues

Linguistic annotation of the Spoken Dutch Corpus: If we had to do it all over again ...

Diathesis alternations

Verb Valency Descriptors for a Syntactic Treebank

Dictionary

 

Augmenting Manual Dictionaries for Statistical Machine Translation Systems

Definition, dictionaries and tagger for Extended Named Entity Hierarchy

The Cross-Breeding of Dictionaries

Dictionary lookup

Evaluating the FOKS Error Model

Dictionary making

Tools for Upgrading Printed Dictionaries by Means of Corpus-based Lexical Acquisition

Dictionary of inflected forms

Lexical Analysis of Agglutinative Languages Using a Dictionary of Lemmas and Lexical Transducers

Dictionary of lemmas

Lexical Analysis of Agglutinative Languages Using a Dictionary of Lemmas and Lexical Transducers

Didactic

The OLISSIPO and LECTIO Projects

Digital access

Methods of digital access for legal language documentation

Digital Library

Development of Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish

Digitization

Rethinking readability of digital editions - the case of the AAC’s "Digital Brenner"

Direct speech

Why do you ignore me? - Proof that not all direct speech is bad

Disambiguation

Comparing the Ambiguity Reduction Abilities of Probabilistic Context-Free Grammars

Finding the Correct Interpretation of Swedish Compounds a Statistical Approach

Resources for Place Name Analysis

Discourse

A General-Purpose off-the-shelf Anaphora Resolution Module: Implementation and Preliminary Evaluation

CST Bank: A Corpus for the Study of Cross-document Structural Relationships

Information Extraction from Hindi Texts

RevisionBank: A Resource for Revision-based Multi-document Summarization and Evaluation

Using the NITE XML Toolkit on the Switchboard Corpus to Study Syntactic Choice: A Case Study

Discourse analysis

Using large multi-purpose corpora for specific research questions: discourse phenomena related to wh-questions in the Spoken Dutch Corpus

Discourse connectives

The Penn Discourse Treebank

Discourse entity

Computing Reliability for Coreference Annotation

Discourse Markers

Mining the Web for Discourse Markers

Discourse patterns

Multiple Sequence Alignment for characterizing the linear structure of revision

Discourse semantics

Utilization of Multiple Language Resources for Robust Grammar-Based Tense and Aspect Classification

Discovery

A Large Metadata Domain of Language Resources

Cross-Disciplinary Integration of Metadata Descriptions

Disfluency annotation

A Multi-Modal Documentation System for Warao

Annotation Tools for Large-Scale Corpus Development: Using AGTK at the Linguistic Data Consortium

Distance Measures

Using WordNet to Measure Semantic Orientations of Adjectives

Distributed Language Resources

Building Distributed Language Resources by Grid Computing

Distribution

The Centre for Dutch Language and Speech Technology (TST Centre)

Distributional consistency

Distributional Consistency: As a General Method for Defining a Core Lexicon

Distributional semantics

Publicly Available Topic Signatures for all WordNet Nominal Senses

Distributional similarity

A Freely Available Automatically Generated Thesaurus of Related Words

Utilizing the One-Sense-per-Discourse Constraint for Fully Unsupervised Word Sense Induction and Disambiguation

DM decomposition

Grouping Synonymous Sentences from a Parallel Corpus

Doctor-Patient Interaction

Creation of a Doctor-Patient Dialogue Corpus Using Standardized Patients

Document classification

Cross-effective cross-lingual document classification

Document clustering

Meaningful Clusters

Domain markers detection

Automatically selecting domain markers for terminology extraction

Domain specific language resources

Discovery of (New) Knowledge and the Analysis of Text Corpora

Domain-oriented system

Evaluating Conversation with Hans Christian Andersen

Domain-specific corpora

BootCaT: Bootstrapping Corpora and Terms from the Web

Domain-specific data

Evaluation of Cross-Language Information Retrieval Using the Domain-Specific GIRT Data as Parallel German-English Corpus

DTD

Bilingual Connections for Trilingual Corpora: An XML Approach

Duration

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Duration modeling

Duration Modeling for Turkish Text-to-Speech Synthesis System

Dutch

Putting the Dutch PAROLE Corpus to Work

Use and Evaluation of Prosodic Annotations in Dutch

Dynamic Bayesian networks

Language Modeling using Dynamic Bayesian Networks

Dynamic reordering

A Hybrid Strategy for Regular Grammar Parsing

E

 

EAGLES

Towards basic categories for describing properties of texts in a corpus

Education

An Emerging Transcontinental Collaborative Research and Education Agenda in Human Language Technologies

Edutainment

Evaluating Conversation with Hans Christian Andersen

ELAN

Annotation Multi-media / Multi-modal Resources with ELAN

Collaborative Annotation of Sign Language Data with Peer-to-Peer Technology

E-learning

Application of the BLEU Method for Evaluating Free-text Answers in an E-learning Environment

Incremental Methods to Select Test Sentences for Evaluating Translation Ability

Electroglotograph

An Acoustic Corpus Contemplating Regional Variation for Studies of European Portuguese Nasals

Electronic dictionaries

Making an XML-based Japanese-Slovene Learners' Dictionary

ELRA

ELRA Validation Methodology and Standard Promotion for Linguistic Resources

Recent Activities within the European Language Resources Association: Issues on Sharing Language Resources and Evaluation

Emotion detection

Reliability of Lexical and Prosodic Cues in two Real-life Spoken Dialog Corpora

Emotional features

Reusing Language Resources for Speech Applications involving Emotion

Emotional speech

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Designing and Recording an Audiovisual Database of Emotional Speech in Basque

Getting to the Heart of the Matter; Speech is more than just the Expression of Text or Language

Evaluating an Authentic Audio-Visual Expressive Speech Corpus

Endangered languages

A Multi-Modal Documentation System for Warao

Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan Guatemala)

English

Converting Treebank Annotations to Language Neutral Syntax

Tone-of-Voice and Controlled Language Techniques

English article

Detecting Errors in English Article Usage with a Maximum Entropy Classifier Trained on a Large, Diverse Corpus

English determiner

Detecting Errors in English Article Usage with a Maximum Entropy Classifier Trained on a Large, Diverse Corpus

English -ing ambiguity

Multilingual Corpus-based Approach to the Resolution of English -ing

English semantic tagger

Evaluating Lexical Resources for A Semantic Tagger

Ensemble learning

Semi-supervised learning by Fuzzy clustering and Ensemble learning

Entities

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

Entity

The GENOMA-KB Platform: Queries Over Integrated Linguistic Resources

Entity extraction

Exploiting Anchor Text as a Lexical Resource

Entropy

Automated Morphological Segmentation and Evaluation

Error correction

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Error detection

NLP-enhanced error Checking for Catalan unrestricted text

Unexpected Productions May Well be Errors

Error tagging

The overview of the SST speech corpus of Japanese learner English and evaluation through the experiment on automatic detection of learners' errors

Ethnolinguistics

A Multi-Modal Documentation System for Warao

Euro WordNet

EuroWordNet as a Resource for Cross-language Information Retrieval

Enriching EWN with Syntagmatic Information by means of WSD

Enriching the Spanish EuroWordNet by Collocations

Metaphors in Wordnets: from Theory to Practice

Spanish WordNet 1.6: Porting the Spanish WordNet Across Princeton Versions

Towards the MEANING Top Ontology: Sources of Ontological Meaning

EVALDA

Technolangue: A Permanent Evaluation and Information Infrastructure

Evaluation

A comparison of summarisation methods based on term specificity estimation

A practical competition of different filters used in automatic term extraction

A Public Reference Implementation of the RAP Anaphora Resolution Algorithm

Agreement in Human Factoid Annotation for Summarization Evaluation

Automatic Methods to Supplement Broad-Coverage Subcategorization Lexicons

Benchmarking ontology tools. A case study for the WebODE platform

Calibrating Resource-light Automatic MT Evaluation: A Cheap Approach to Ranking MT Systems by the Usability of their Output

Comparing the Ambiguity Reduction Abilities of Probabilistic Context-Free Grammars

Data Driven Ontology Evaluation

Enriching a French Treebank

Evaluating Conversation with Hans Christian Andersen

Evaluating Lexical Resources for A Semantic Tagger

Evaluating Multimodal NLG using Production Experiments

Evaluation and Adaptation of a Specialised Language Checking Tool for Non-specialised Machine Translation and Non-expert MT Users for Multi-lingual Telecooperation

Evaluation of a Multimodal Dialogue System for Small-screen Devices

Evaluation of Consensus on the Annotation of Prosodic Breaks in the Romance Corpus of Spontaneous Speech “C-ORAL-ROM”

Evaluation of Cross-Language Information Retrieval Using the Domain-Specific GIRT Data as Parallel German-English Corpus

Evaluation of Microphone Array Front-Ends for ASR - an Extension of the AURORA Framework

Evaluation of Multi-party Virtual Reality Dialogue Interaction

Evaluation Resources for Concept-based Cross-Lingual Information Retrieval in the Medical Domain

Frequent Term Distribution Measures for Dataset Profiling

Handling Subtle Sense Distinctions through Wordnet Semantic Types

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

MEAD - A Platform for Multidocument Multilingual Text Summarization

Methodology for Rapid Prototyping and Testing of ASR Based User Interfaces

On the problems of creating a golden standard of inflected forms in Portuguese

On the Usefulness of Large Spoken Language Corpora for Linguistic Research

Online Evaluation of Coreference Resolution

Parsing Ungrammatical Input: An Evaluation Procedure

Russian Information Retrieval Evaluation Seminar

Textual Distraction as a Basis for Evaluating Automatic Summarisers

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

The DeepThought Core Architecture Framework

The Future of Evaluation for Cross-Language Information Retrieval Systems

The Influence of the Labeller’s Regional Background on Phonetic Transcriptions: Implications for the Evaluation of Spoken Language Resources

Unsupervised Text Mining for Ontology Extraction: An Evaluation of Statistical Measures

Usability Evaluation of Multimodal and Domain-Oriented Spoken Language Dialogue Systems

Usability Evaluation of Spoken Dialogue Systems

User Query Analysis for the Specification and Evaluation of a Dialogue Processing and Retrieval System

Using PiTagger for Lemmatization and PoS Tagging of a Spontaneous Speech Corpus: C-ORAL-ROM Italian

Evaluation campaign

 

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

The ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Evaluation contests

On the problems of creating a golden standard of inflected forms in Portuguese

Evaluation data

Methodology For Building Thematic Indexes In Medecine For French

Evaluation Methodologies

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

Evaluation methodology

A Critical Survey of the Methodology for IE Evaluation

NIST Language Technology Evaluation Cookbook

The French MEDIA/EVALDA project: the evaluation of the understanding capability of Spoken Language Dialogue Systems

Evaluation methods

INSPIRE: Evaluation of a Smart-Home System for Infotainment Management and Device Control

Evaluation morphology

On the problems of creating a golden standard of inflected forms in Portuguese

Evaluation of annotation

Annotators' Agreement: The Case of Topic-Focus Articulation

Evaluation of resources

Extending a verb-lexicon using a semantically annotated corpus

Evaluation of subjective tasks

Re-using high-quality resources for continued evaluation of automated summarization systems

Evaluation paradigms

The Ongoing Evaluation Campaign of Syntactic Parsing of French: EASY

Evaluation plan

NIST Language Technology Evaluation Cookbook

Evaluation protocol

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

Evaluation Standards

Beyond TREC's Filtering Track

Event annotation

A Suite of Tools for Marking Up Textual Data for Temporal Text Mining Scenarios

Event Extraction

Text Corpora, Local Grammars and Prediction

Events

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

Example-Based MT

Automatic Translation Memory Fuzzy Match Post-Editing: A Step beyond Traditional TM/MT Integration

Experience Evaluation

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

Experimental setting

An Environment for Dialogue Corpora Collection (ENDIACC)

Exploitations

A Danish Lexicon Resource - Ready for Applications

Expressive speech processing

Speech & Expression - The Value of a Longitudinal Corpus

Extended data

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

Extraction

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

Textual Distraction as a Basis for Evaluating Automatic Summarisers

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

F

 

Fast transcription

Automatic audio and manual transcripts alignment, time-code transfer and selection of exact transcripts

Feature distributions

The Statistical Analysis of Morphosyntactic Distributions

Feature Structure

Towards an International Standard on Feature Structure Representation

Field recordings

Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Project

Filiation

Dynamic Lexicographic Data Modelling. A Diachronic Dictionary Development Report

Finite state filter

Bootstrapping a database of German multi-word expressions

Finite state machines

A Large-Scale Resource for Storing and Recognizing Technical Terminology

Finite state phonology

A Multilingual Phonological Resource Toolkit for Ubiquitous Speech Technology

Finite state transducer

SMOR: A German Computational Morphology Covering Derivation, Composition, and Inflection

Finite-state morphology

Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan Guatemala)

Finite-state parsing

Portuguese Large-scale Language Resources for NLP Applications

Fixed telephone speech

Development of New Telephone Speech Databases for French: The NEOLOGOS Project

Flectional languages

Derivational Relations in Flectional Languages - Czech Case

F-Measure

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

Forced Alignment

Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus

Formalization of terminology work

Principles of a system for terminological concept modelling

FrameNet

FrameNet as a ``Net''

Free distribution

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Free software

FreeLing: An Open-Source Suite of Language Analyzers

The Language Belongs to the People!

Freeware

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

French

Enriching a French Treebank

French-English corpus

Annotation of anaphoric expressions in an aligned bilingual corpus

Frequency

Multifunctional Computational Lexicon of Contemporary Portuguese: An Available Resource for Multitype Applications

Frequency dictionary

Creating open language resources for Hungarian

Functional tagging

Enriching a French Treebank

Fundamental frequency

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Future perspectives

A Danish Lexicon Resource - Ready for Applications

Fuzzy clustering

Semi-supervised learning by Fuzzy clustering and Ensemble learning

G

 

Gadda C.E.

Computational Lexicography and Carlo Emilio Gadda, Principe dell'Analisi e Duca della Buona Cognizione

GALEMU

Evaluation of Different Similarity Measures for the Extraction of Multiword Units in a Reinforcement Learning Environment

Galician

Parallel corpora for the Galician language: building and processing of the CLUVI (Linguistic Corpus of the University of Vigo)

Galician corpus

A Galician Textual Corpus for Morphosyntactic Tagging with Application to Text-to-Speech Synthesis

Gazetteer

Resources for Place Name Analysis

Gazetteers

Automatic Building Gazetteers of Co-referring Named Entities

General purpose

Towards General-Purpose Annotation Tools – How far are we today?

Generation

Generating an Arabic full-form lexicon for bidirectional morphology lookup

Generating Coreferential Descriptions from a Structured Model of the Context

Using Paradigm Tables to Generate New Utterances Similar to those Existing in Linguistic Resources

Genetic algorithms

Evaluation of Different Similarity Measures for the Extraction of Multiword Units in a Reinforcement Learning Environment

GENOMA-KB

The GENOMA-KB Platform: Queries Over Integrated Linguistic Resources

Geographical named entities

Automatic Classification of Geographical Named Entities

German

A High Quality Partial Parser for Annotating German Text Corpora

MAUS Goes Iterative

The BITS Speech Synthesis Corpus for German

GermaNet

Corpus based Enrichment of GermaNet Verb Frames

Gesture

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Gesture annotation

CoGesT: A Formal Transcription System for Conversational Gesture

Gigaword corpus

Creating open language resources for Hungarian

Gloss generation

Automatic Generation of Glosses in the OntoLearn System

Golden standard

On the problems of creating a golden standard of inflected forms in Portuguese

Grammar and style checker

A Grammar and Style Checker Based on Internet Searches

Grammar development

A Lexicon Module for a Grammar Development Environment

eGram - a Grammar Development Environment and its Usage for Language Generation

Grammar editor

eGram - a Grammar Development Environment and its Usage for Language Generation

Grammar extraction

Building a Large Grammar for Italian

Grammar Matrix

Creating multi-purpose linguistic resources for Modern Greek: a deep Modern Greek Grammar

Grammars

Semi-automatic Acquisition of Command Grammar

Grammatical Relations

Adding Syntactic Annotations to Transcripts of Parent-Child Dialogs

Grapheme-to-phoneme

A Graphical Tool for Handling Rule Grammars in Java Speech Grammar Format

Greek

Bypassing Greeklish!

Greeklish

Bypassing Greeklish!

Grid Computing

Building Distributed Language Resources by Grid Computing

GUI

Callisto: A Configurable Annotation Workbench

H

 

Handling productivity

Towards a Dynamic Lexicon: Predicting the Syntactic Argument Structure of Complex Verbs

Handwriting recognition

The SPARTACUS-Database: a Spanish Sentence Database for Offline Handwriting Recognition

Hearing-impaired

Using a Parallel Transcript/Subtitle Corpus for Sentence Compression

Hidden Markov Model (HMM)

Comparative Evaluations in the Domain of Automatic Speech Recognition 

Hidden Markov models

COMPARATIVE EVALUATION OF A STOCHASTIC PARSER ON SEMANTIC AND SYNTACTIC-SEMANTIC LABELS

Hierarchical relations

Extraction of Hyperonymy of Adjectives from Large Corpora by Using the Neural Network Model

High Frequency Word

Improving Collocation Extraction for High Frequency Words

Hindi Grapheme-to-Phoneme (G2P) conversion

Automatic Generation of Compound Word Lexicon for Hindi Speech Synthesis

HLT

Technolangue: A Permanent Evaluation and Information Infrastructure

HLT infrastructure

The new Dutch-Flemish HLT Programme: a concerted effort to stimulate the HLT sector

Towards A Language Infrastructure for the Semantic Web

Homogeneity measures

Frequent Term Distribution Measures for Dataset Profiling

HPSG

Creating multi-purpose linguistic resources for Modern Greek: a deep Modern Greek Grammar

Information Extraction from Hindi Texts

Human Computer Interaction

Framework for data-driven video-realistic audio-visual speech-synthesis

Human Genome

The GENOMA-KB Platform: Queries Over Integrated Linguistic Resources

Human Judgements

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

Human Language Technology

An Emerging Transcontinental Collaborative Research and Education Agenda in Human Language Technologies

Human translation

Incremental Methods to Select Test Sentences for Evaluating Translation Ability

Hybrid constraints

Hybrid Constraints for Robust Parsing: First Experiments and Evaluation

Hyperlinks

Exploiting Anchor Text as a Lexical Resource

Hyponym

Automatic Extraction of Hyponyms from Japanese Newspapers Using Lexico-syntactic Patterns

I

 

Idiom

A Multilingual Database of Idioms

Idioms

Reusable Lexical Representations for Idioms

IE Rule Acquisition

A Domain-Independent Approach to IE Rule Development 

Illustration

A Chatbot as a Novel Corpus Visualization Tool

ILR

The Effect of Text Difficulty on Machine Translation Performance -- A Pilot Study with ILR-Rated texts in Spanish, Farsi, Arabic, Russian and Korean

Image Processing

Toward an Annotation Software for Video of Sign Language, Including Image Processing Tools and Signing Space Modelling

IMDI metadata

Embedding IMDI Metadata into a Large Phonetic Corpus

Incremental development

A pattern extraction workbench combining multiple linguistic levels

Index System

Development of Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish

Indexing

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

Linguistic Corpus Search

Inductive logic programming

Automatic Classification of Geographical Named Entities

Inference

Acquiring Bayesian Networks from Text

Consistent Storage of Metadata in Inference Lexica: The MetaLex Approach

Generating Coreferential Descriptions from a Structured Model of the Context

Inflectional morphology

Creation of reusable components and language resources for Named Entity Recognition in Russian

Information Extraction 

 

A Critical Survey of the Methodology for IE Evaluation

A Domain-Independent Approach to IE Rule Development 

A pattern extraction workbench combining multiple linguistic levels

An Annotation Scheme for a Rhetorical Analysis of Biology Articles

Automatic Language-Independent Induction of Gazetteer Lists

Categorizing Web Pages as a Preprocessing Step for Information Extraction

Discovery of (New) Knowledge and the Analysis of Text Corpora

Identifying Definitions in Text Collections for Question Answering

Information Extraction from Hindi Texts

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Linguistic Miner: an Italian Linguistic Knowledge System

NameNet: A Self-Improving Resource for Name Classification

PBIE: A DATA PREPARATION TOOLKIT toward DEVELOPING a PARSING-BASED INFORMATION EXTRACTION SYSTEM

Semantic Mark-up of Italian Legal Texts Through NLP-based Techniques

Information Filtering

Beyond TREC's Filtering Track

Information management

A natural language approach to information management: tracking scientific advances through the structure of words

Information Retrieval

Automatic Keyword Extraction from Spoken Text. A Comparison of two Lexical Resources: the EDR and WordNet

CHeM: A System for the Automatic Analysis of e-mails in the Restoration and Conservation Domain

Collecting Spontaneously Spoken Queries for Information Retrieval

Development of Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish

Evaluation of Cross-Language Information Retrieval Using the Domain-Specific GIRT Data as Parallel German-English Corpus

Finding Semantic Associations on Express Lane

From Weaver to the ALPAC Report 

Highlighting latent structure in documents

Information Retrieval System Using Latent Contextual Relevance

Language Model Adaptation for Statistical Machine Translation based on Information Retrieval

Information Retrieval System

The Future of Evaluation for Cross-Language Information Retrieval Systems

Information Status

An Annotation Scheme for Information Status in Dialogue

Information structure

The MULI Project: Annotation and Analysis of Information Structure in German and English

Inheritance

Semi-automatic UNL Dictionary Generation using WordNet.PT

InkML

An XML Representation for Annotated Handwriting Datasets for Online Handwriting Recognition

Integrated System

Linguistic Miner: an Italian Linguistic Knowledge System

Interactive

Design of an Interactive Web-based User Interface for Speech Database Query Formation

Interactivity

A pattern extraction workbench combining multiple linguistic levels

Inter-annotator Agreement

An Annotation Scheme for Information Status in Dialogue

Interdisciplinary

Cross-Disciplinary Integration of Metadata Descriptions

Interface issues

The Lácio-Web: Corpora and Tools to advance Brazilian Portuguese Language Investigations and Computational Linguistic Tools

Interfaces

Rethinking readability of digital editions - the case of the AAC’s "Digital Brenner"

Interlinear text

Functional Requirements for an Interlinear Text Editor

Interlingua annotations

The Italian NESPOLE! Corpus: A Multilingual Database with Interlingua Annotation in Tourism and Medical Domains

Interlingual

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

Interlingual alignment

A Methodology and Associated Tools for Building Interlingual Wordnets

Internet

A Grammar and Style Checker Based on Internet Searches

Enlarging the Croatian Morphological Lexicon by Automatic Lexical Acquisition from Raw Corpora

Internet available tools

An Environment for Dialogue Corpora Collection (ENDIACC)

Internet filtering

NLP-enhanced Content Filtering within the POESIA Project

Interoperability

Cross-Disciplinary Integration of Metadata Descriptions

Infrastructure for Collaborative Annotation of Speech

Interpreter

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Interpreting

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Intranet

Intranet Try To Find Project (ITTF): An Approach for the Search of Relevant Information Inside an Organization

IPA scheme

The COST 278 MASPER initiative - crosslingual speech recognition with large telephone databases

ISO

Towards an International Standard on Feature Structure Representation

ISO 639

Standards for Language Codes: Developing ISO

ISO TC 37

A Global Data Category Registry for Interoperable Language Resources

Italian

Representing Italian Complex Nominals: a Pilot Study

Italian Literature

Computational Lexicography and Carlo Emilio Gadda, Principe dell'Analisi e Duca della Buona Cognizione

ItalWordNet

Using Semantic Language Resources to Support Textual Inference for Question Answering

Iterated co-occurrences

Automatic Acquisition of Paradigmatic Relations using Iterated Co-occurrences

ITTF

Intranet Try To Find Project (ITTF): An Approach for the Search of Relevant Information Inside an Organization

J

 

Japanese

Evaluating the FOKS Error Model

Korean-Chinese-Japanese Multilingual Wordnet with Shared Semantic Hierarchy

Making an XML-based Japanese-Slovene Learners' Dictionary

Japanese EFL learners

Connector Usage in the English Essay Writing of Japanese EFL Learners

Japanese spatial noun

Classification of Japanese Spatial Nouns

Java Speech Grammar Format (JSGF),

A Graphical Tool for Handling Rule Grammars in Java Speech Grammar Format

JavaScript

Putting the Dutch PAROLE Corpus to Work

J-ToBI, Japanese

Japanese MULTEXT: A Prosodic Corpus

K

 

Kanji

Evaluating the FOKS Error Model

Keyword Extraction

Automatic Keyword Extraction from Spoken Text. A Comparison of two Lexical Resources: the EDR and WordNet

Knowledge acquisition

Constructing Word-Sense Association Networks from Bilingual Dictionary and Comparable Corpora

Knowledge base

Annotating a corpus for building a domain-specific knowledge base

Knowledge Bases

Human Language Technology Elements in a Knowledge Organisation System -The VID project

The GENOMA-KB project: towards the integration of concepts, terms, textual corpora and entities

Knowledge discovery

Incremental Knowledge Acquisition from WordNet and EuroWordNet

Knowledge life-cycle

A natural language approach to information management: tracking scientific advances through the structure of words

Knowledge management

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Knowledge Managenemt

The Role of MultiWord Terminology in Knowledge Management

Knowledge representation

Investigation on Semantics to Improve the COVAX System

Knowledge/Domain structure

Intelligent Building of Language Resources for HLT Applications

Knowledge-rich NLP

Some Meaning Procedures of Ontological Semantics

Korean

Korean-Chinese-Japanese Multilingual Wordnet with Shared Semantic Hierarchy

L

 

Lancaster Corpus

The Lancaster Corpus of Mandarin Chinese: A Corpus for Monolingual and Contrastive Language Study

Language acquisition

A Flexible Language Acquisition Tool Kit for Natural Language Processing

Language agility

Automatic Language-Independent Induction of Gazetteer Lists

Language codes

Standards for Language Codes: Developing ISO

Language Database

Collaborative Commentary: Opening Up Spoken Language Databases

Language documentation

A Multi-Modal Documentation System for Warao

Securing Interpretability: The Case of Ega Language Documentation

WALA: a multilingual resource repository for West African Languages

Language engineering

Automatic Language-Independent Induction of Gazetteer Lists

Language Errors

Parsing Ungrammatical Input: An Evaluation Procedure

Language for special purposes

Mercedes, A Term-In-Context Highlighter

Language identification

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Language independent

Unexpected Productions May Well be Errors

Language learner

Evaluating the FOKS Error Model

Language learning

NLP-enhanced error Checking for Catalan unrestricted text

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Language model

Comparative Evaluations in the Domain of Automatic Speech Recognition 

Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News

Language Modeling

Language Model Adaptation for Statistical Machine Translation based on Information Retrieval

N-Gram Language Modeling for Robust Multi-Lingual Document Classification

Language models

Language Modeling using Dynamic Bayesian Networks

Language neutral syntax

Converting Treebank Annotations to Language Neutral Syntax

Language resource

Open Collaborative Development of the Thai Language Resources for Natural Language Processing

Industrial Needs for Language Resources

Language resource management

Developping tools and building linguistic resources for Vietnamese morpho-syntactic processing

Language resources

A Global Data Category Registry for Interoperable Language Resources

An Information Repository Model for Advanced Question Answering Systems

Content Interoperability of Lexical Resources: Open Issues and “MILE” Perspectives

Creation and Validation of Large Lexica for Speech-to-Speech Translation Purposes

Design and Implementation of a Semantic Search Engine for Portuguese

Development of New Telephone Speech Databases for French: The NEOLOGOS Project

Development of Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish

ENABLER Thematic Network of National Projects: Technical, Strategic and Political Issues of LRs

NEMLAR - An Arabic Language Resources project

Open Resources for Language Technology

Providing on-line access to Portuguese language resources: corpora and lexicons

Rethinking Reusable Resources

Steps Towards Semantically Annotated Language Resources

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

The new Dutch-Flemish HLT Programme: a concerted effort to stimulate the HLT sector

Towards Intelligent Written Cultural Heritage Processing - Lexical Processing

Using Semantic Language Resources to Support Textual Inference for Question Answering

Web Services Architecture for Language Resources

Web Services for Language Resources and Language Technology Applications

Language resources and evaluation

Recent Activities within the European Language Resources Association: Issues on Sharing Language Resources and Evaluation

Language resources development

Developing Language Resources for a Transnational Digital Government System

Language resources for language engineering

Portuguese Large-scale Language Resources for NLP Applications

Language resources management

Developing Language Resources for a Transnational Digital Government System

Language similarities

The COST 278 MASPER initiative - crosslingual speech recognition with large telephone databases

Language Study

The Lancaster Corpus of Mandarin Chinese: A Corpus for Monolingual and Contrastive Language Study

Language technology

NIST Language Technology Evaluation Cookbook

Language technology aplications

Web Services for Language Resources and Language Technology Applications

Large corpora

Linguistic Corpus Search

Using large multi-purpose corpora for specific research questions: discourse phenomena related to wh-questions in the Spoken Dutch Corpus

WinPitch Corpus, a Text to Speech Alignment Tool for Multimodal Corpora

Large corpus

Detecting Errors in English Article Usage with a Maximum Entropy Classifier Trained on a Large, Diverse Corpus

Large DataBase

Linguistic Miner: an Italian Linguistic Knowledge System

Large lexica

Creation and Validation of Large Lexica for Speech-to-Speech Translation Purposes

Large spoken corpora

Improving Automatic Phonetic Transcription of Spontaneous Speech through Variant-Based Pronunciation Variation Modelling

Large vocabulary continuous speech recognition

Collecting Spontaneously Spoken Queries for Information Retrieval

Large-scale corpus

A Multi-Modal Documentation System for Warao

Annotation Tools for Large-Scale Corpus Development: Using AGTK at the Linguistic Data Consortium

Latent Context Relevance

Information Retrieval System Using Latent Contextual Relevance

Latent Semantic Indexing

Information Retrieval System Using Latent Contextual Relevance

Latent Structure

Highlighting latent structure in documents

Latin

The OLISSIPO and LECTIO Projects

LC-STAR

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

Learner corpora

Connector Usage in the English Essay Writing of Japanese EFL Learners

Learner english

Connector Usage in the English Essay Writing of Japanese EFL Learners

Learning from unlabeled data

Corpus-based Learning of Lexical Resources for German Named Entity Recognition

Legacy data

Securing Interpretability: The Case of Ega Language Documentation

Legal language

Methods of digital access for legal language documentation

Legitimate translation variation

Modelling Legitimate Translation Variation for Automatic Evaluation of MT Quality

Lesk algorithm

Evaluating Variants of the Lesk Approach for Disambiguating Words

Lexemes

A New Approach to the Corpus-based Statistical Investigation of Hungarian Multi-word Lexemes

Lexica

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

ELRA Validation Methodology and Standard Promotion for Linguistic Resources

Lexical Acquisition

A Similarity Measure for Unsupervised Semantic Disambiguation

Clustering Concept Hierarchies from Text

Enlarging the Croatian Morphological Lexicon by Automatic Lexical Acquisition from Raw Corpora

Road-testing the English Resource Grammar over the British National Corpus

Tools for Upgrading Printed Dictionaries by Means of Corpus-based Lexical Acquisition

Lexical analysis

Lexical Analysis of Agglutinative Languages Using a Dictionary of Lemmas and Lexical Transducers

Lexical association measures

Using the Web as a Corpus for the Syntactic-Based Collocation Identification

Lexical change

Dynamic Lexicographic Data Modelling. A Diachronic Dictionary Development Report

Lexical cohesion

Summarization of Multimodal Information

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Lexical coverage

A Danish Lexicon Resource - Ready for Applications

Evaluating Lexical Resources for A Semantic Tagger

Lexical database

A Danish Lexicon Resource - Ready for Applications

A Multilingual Database of Idioms

Enriching a Thai Lexical Database with Selectional Preferences

Lexical entry templates

Lexical Entry Templates for Robust Deep Parsing

Lexical Functions

Enriching the Spanish EuroWordNet by Collocations

Lexical information

Discovery of (New) Knowledge and the Analysis of Text Corpora

Lexical ontologies

A Methodology and Associated Tools for Building Interlingual Wordnets

Lexical resources

Automatic Language-Independent Induction of Gazetteer Lists

Computational Lexicography and Carlo Emilio Gadda, Principe dell'Analisi e Duca della Buona Cognizione

Creation of reusable components and language resources for Named Entity Recognition in Russian

Extending a verb-lexicon using a semantically annotated corpus

FrameNet as a ``Net''

MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

Portuguese Large-scale Language Resources for NLP Applications

Utilization of Multiple Language Resources for Robust Grammar-Based Tense and Aspect Classification

Lexical semantic database

Building a Maritime Domain Lexicon: a Few Considerations on the Database Structure and the Semantic Coding

Proper Names and Polysemy: from a Lexicographic Experience

Lexical semantics

A Methodology and Associated Tools for Building Interlingual Wordnets

Extending a verb-lexicon using a semantically annotated corpus

Lexical transducer

Lexical Analysis of Agglutinative Languages Using a Dictionary of Lemmas and Lexical Transducers

Lexical Usuality

Distributional Consistency: As a General Method for Defining a Core Lexicon

Lexical-conceptual structure

Extending WordNets to Implicit Information

Lexical-semantic relations

Semi-automatic UNL Dictionary Generation using WordNet.PT

Lexicography

Dynamic Lexicographic Data Modelling. A Diachronic Dictionary Development Report

Identifying Morphosyntactic Preferences in Collocations

The Cross-Breeding of Dictionaries

Lexicology

The Integral Dictionary: An Ontological Resource for the Semantic Web Integration of EuroWordNet, Balkanet, TID and SUMO

Lexicon 

A Lexicon Module for a Grammar Development Environment

Building Part-of-speech Corpora through Histogram Hopping

Consistent Storage of Metadata in Inference Lexica: The MetaLex Approach

Generating an Arabic full-form lexicon for bidirectional morphology lookup

Integration of Russian Language Resources

Multifunctional Computational Lexicon of Contemporary Portuguese: An Available Resource for Multitype Applications

The Cross-Breeding of Dictionaries

The Rationale for Building Resources Expressly for NLP

Verb Valency Descriptors for a Syntactic Treebank

Lexicon acquisition

Automatic Bilingual Lexicon Acquisition Using Random Indexing of Aligned Bilingual Data

Lexicon development

Software Tools for Morphological Tagging of Zulu Corpora and Lexicon Development

Lexicons

Open-source Tools for Creation, Maintenance, and Storage of Lexical Resources for Language Generation from Ontologies

Providing on-line access to Portuguese language resources: corpora and lexicons

Reusable Lexical Representations for Idioms

Lexicosyntactic pattern

Automatic Extraction of Hyponyms from Japanese Newspapers Using Lexico-syntactic Patterns

LGPL

FreeLing: An Open-Source Suite of Language Analyzers

Limited-purpose

Towards General-Purpose Annotation Tools – How far are we today?

Linear Discriminant Analysis

Development and Integration of the LDA-Toolkit into the COST249 SpeechDat (II) SIG Reference Recognizer

Linear regression

Incremental Methods to Select Test Sentences for Evaluating Translation Ability

Linear SVM

Corpus-based Learning of Lexical Resources for German Named Entity Recognition

Linearization

Mapping Dependency Structures to Phrase Structures and the Automatic Acquisition of Mapping Rules

Linguistic annotation

A Registry of Standard Data Categories for Linguistic Annotation

Linguistic annotation of the Spoken Dutch Corpus: If we had to do it all over again ...

Using the NITE XML Toolkit on the Switchboard Corpus to Study Syntactic Choice: A Case Study

Linguistic constructions

Linguistic Corpus Search

Linguistic coverage

A Danish Lexicon Resource - Ready for Applications

Linguistic data repositories

Rethinking Reusable Resources

Linguistic difference

A Comparison of Two Variant Corpora: The Same Content with Different Sources

Linguistic expertise

A Flexible Language Acquisition Tool Kit for Natural Language Processing

Linguistic Infrastructure

ENABLER Thematic Network of National Projects: Technical, Strategic and Political Issues of LRs

Linguistic Knowledge

Linguistic Miner: an Italian Linguistic Knowledge System

The SPARTACUS-Database: a Spanish Sentence Database for Offline Handwriting Recognition

Linguistic Knowledge Base

Semi-automatic UNL Dictionary Generation using WordNet.PT

Linguistic methods

A word alignment system based on a translation equivalence extractor

Linguistic Ontologies

Human Language Technology Elements in a Knowledge Organisation System -The VID project

Linguistic Ontology

Development of Bilingual Domain-Specific Ontology for Automatic Conceptual Indexing

Development of Ontologies with Minimal Set of Conceptual Relations

Linguistic resources

ALLES: Integrating NLP in ICALL Applications

Human Language Technology Elements in a Knowledge Organisation System -The VID project

Linguistic similarity

A Comparison of Two Variant Corpora: The Same Content with Different Sources

Linguistic-based corpus processing

Portuguese Large-scale Language Resources for NLP Applications

Listening tests (MRT, SUS, MOS, CCR)

The Design of Czech Language Formal Listening Tests for the Evaluation of TTS Systems

Literature

Rethinking readability of digital editions - the case of the AAC’s "Digital Brenner"

The AAC [Austrian Academy Corpus] An Enterprise to Develop Large Electronic Text Corpora

Local Grammar

Text Corpora, Local Grammars and Prediction

Logical

Annotating Noun Argument Structure for NomBank

LRs

Recent Activities within the European Language Resources Association: Issues on Sharing Language Resources and Evaluation

The Centre for Dutch Language and Speech Technology (TST Centre)

LTAG

Building a Large Grammar for Italian

M

 

Machine Learning

A Chatbot as a Novel Corpus Visualization Tool

A Labelled Corpus for Prepositional Phrase Attachment

Automatic Classification of Geographical Named Entities

Large Scale Experiments for Semantic Labeling of Noun Phrases in Raw Text

Unexpected Productions May Well be Errors

Machine Learning for NLP

SVMTool: A general POS tagger generator based on Support Vector Machines

Machine Translation 

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

An Emerging Transcontinental Collaborative Research and Education Agenda in Human Language Technologies

Building a Paraphrase Corpus for Speech Translation

Calibrating Resource-light Automatic MT Evaluation: A Cheap Approach to Ranking MT Systems by the Usability of their Output

Creation of a Doctor-Patient Dialogue Corpus Using Standardized Patients

Developing Language Resources for a Transnational Digital Government System

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Modelling Legitimate Translation Variation for Automatic Evaluation of MT Quality

MT Goes Farming: Comparing Two Machine Translation Approaches on a New Domain

Multilingual Corpus-based Approach to the Resolution of English -ing

The Effect of Text Difficulty on Machine Translation Performance -- A Pilot Study with ILR-Rated texts in Spanish, Farsi, Arabic, Russian and Korean

The Translation Correction Tool: English-Spanish User Studies

Training a Sentence-Level Machine Translation Confidence Measure

Translation memories enrichment by statistical bilingual segmentation

Machine translation resources

Multimodal Multilingual Resources in the Subtitling Process

Machine-learning

Duration Modeling for Turkish Text-to-Speech Synthesis System

Evaluation and Adaptation of the Celex Dutch Morphological Database

Maintenance

The Centre for Dutch Language and Speech Technology (TST Centre)

MALACH project

Issues in Annotation of the Czech Spontaneous Speech Corpus in the MALACH Project

Management

The Centre for Dutch Language and Speech Technology (TST Centre)

Management of language resources

Architecture for Distributed Language Resource Management and Archiving

Man-robot-communication

You stupid tin box' - children interacting with the AIBO robot: A cross-linguistic emotional speech corpus.

Manual annotation

Annotation Multi-media / Multi-modal Resources with ELAN

RevisionBank: A Resource for Revision-based Multi-document Summarization and Evaluation

Unexpected Productions May Well be Errors

Mapudungun

Data Collection and Analysis of Mapudungun Morphology for Spelling Correction

Markup

Investigation on Semantics to Improve the COVAX System

Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

Markup languages

Bayesian Semantics Incorporation to Web Content for Natural Language Information Retrieval

Matching rules

Semi-Automatic Derivation of a French Lexicon from CLIPS

MAUS

MAUS Goes Iterative

Maximal Projections of Heads

Using the Penn Treebank to Evaluate Non-Treebank Parsers

Maximum entropy

Detecting Errors in English Article Usage with a Maximum Entropy Classifier Trained on a Large, Diverse Corpus

Large Scale Experiments for Semantic Labeling of Noun Phrases in Raw Text

Maximum Entroy tagging

Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan Guatemala)

MDL

Automated Morphological Segmentation and Evaluation

Measure of Dispersion

Distributional Consistency: As a General Method for Defining a Core Lexicon

Medical application

An Annotated German-Language Medical Text Corpus as Language Resource

Pumping Documents Through a Domain and Genre Classification Pipeline

Medical Dialogue Data

Creation of a Doctor-Patient Dialogue Corpus Using Standardized Patients

Medical domains

The Italian NESPOLE! Corpus: A Multilingual Database with Interlingua Annotation in Tourism and Medical Domains

Medical resources

Methodology For Building Thematic Indexes In Medecine For French

Meeting Processing and Retrieval

Building and Using a Corpus of Shallow Dialog Annotated Meetings

Memory-based learning

Memory-based Classification of Proper Names in Norwegian

MeSH thesaurus

Methodology For Building Thematic Indexes In Medecine For French

Metadata

A Large Metadata Domain of Language Resources

Consistent Storage of Metadata in Inference Lexica: The MetaLex Approach

Cross-Disciplinary Integration of Metadata Descriptions

Highlighting latent structure in documents

Human Language Technology Elements in a Knowledge Organisation System -The VID project

Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Project

Securing Interpretability: The Case of Ega Language Documentation

Using Profiles for IMDI Metadata Creatation

WALA: a multilingual resource repository for West African Languages

Metadata extraction

Linguistic Resources for Effective, Affordable, Reusable Speech-to-Text

Meta-model

Towards a Reference Annotation Framework

Metamodels

Standardization in Multimodal Content Representation: Some Methodological Issues

MetaMorpho TM

MetaMorpho TM: A Rule-Based Translation Corpus

Metrics

Measurements of Spoken Language Variability in a Multilingual Corpus. Predictable Aspects

Military languages

Current Projects in Languages of Military Interest at the Defense Language Institute

Minimum description length

Automated Morphological Segmentation and Evaluation

Minimum Edit Distance

Evaluating Name-Matching for Coreference Resolution

Minority language

Spoken and Written Language Resources for Vietnamese

Mobile phone

Evaluation of a Multimodal Dialogue System for Small-screen Devices

Mobile teleservices

SALA II across the finish line: a large collection of mobile telephone speech databases from North and Latin America completed

Mobile terminal

Terminal Device Oriented Comparable Corpora and its Alignment -- Towards Extracting Paraphrasing Patterns --

Modalities

Usability Evaluation of Multimodal and Domain-Oriented Spoken Language Dialogue Systems

Modern Greek

Creating multi-purpose linguistic resources for Modern Greek: a deep Modern Greek Grammar

Modified Frequency

Distributional Consistency: As a General Method for Defining a Core Lexicon

Monolingual

The Lancaster Corpus of Mandarin Chinese: A Corpus for Monolingual and Contrastive Language Study

Monolingual lexical resources

A Methodology and Associated Tools for Building Interlingual Wordnets

Morhological analyser

Integration of Russian Language Resources

Morph

Sejong Korean Corpora in the Making

Morphological analysers

How to Disassemble Alphabetical Processions - Morphological Treatment of Unknown Words

Morphological analysis

A Morphological Analyzer for Standard Albanian

FreeLing: An Open-Source Suite of Language Analyzers

How to Disassemble Alphabetical Processions - Morphological Treatment of Unknown Words

Morphological dictionaries

Combining Heterogeneous Lexical Resources

Morphological lexica

Morphology Based Automatic Acquisition of Large-coverage Lexica

Morphological parser

Lexical Analysis of Agglutinative Languages Using a Dictionary of Lemmas and Lexical Transducers

Morphological segmentation

Automated Morphological Segmentation and Evaluation

Morphological tagging

Software Tools for Morphological Tagging of Zulu Corpora and Lexicon Development

Morphology

Automated Morphological Segmentation and Evaluation

Evaluation and Adaptation of the Celex Dutch Morphological Database

Generating an Arabic full-form lexicon for bidirectional morphology lookup

Identifying Morphosyntactic Preferences in Collocations

On the problems of creating a golden standard of inflected forms in Portuguese

The Core of the Czech Derivational Dictionary

The Statistical Analysis of Morphosyntactic Distributions

Morphosyntactic annotation

Combining symbolic and statistical methods in morphological analysis and unknown word guessing

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Morpho-syntactic description

Developping tools and building linguistic resources for Vietnamese morpho-syntactic processing

Morphosyntactic preferences

Identifying Morphosyntactic Preferences in Collocations

Morphosyntactic specifications

MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

Morphosyntactic tagger

A Galician Textual Corpus for Morphosyntactic Tagging with Application to Text-to-Speech Synthesis

Motion capture

EVALUATION OF A SPEECH CUER: FROM MOTION CAPTURE TO A CONCATENATIVE TEXT-TO-CUED SPEECH SYSTEM

MRS

Creating multi-purpose linguistic resources for Modern Greek: a deep Modern Greek Grammar

MT

Evaluation and Adaptation of a Specialised Language Checking Tool for Non-specialised Machine Translation and Non-expert MT Users for Multi-lingual Telecooperation

MT correction

The Translation Correction Tool: English-Spanish User Studies

MT Evaluation

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

MT Goes Farming: Comparing Two Machine Translation Approaches on a New Domain

The Translation Correction Tool: English-Spanish User Studies

Training a Sentence-Level Machine Translation Confidence Measure

MULTEXT

Japanese MULTEXT: A Prosodic Corpus

Multi-agent

Evaluation of Multi-party Virtual Reality Dialogue Interaction

Multichannel

AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition

Multi-channel front-end

Evaluation of Microphone Array Front-Ends for ASR - an Extension of the AURORA Framework

Multidocument summarization

MEAD - A Platform for Multidocument Multilingual Text Summarization

RevisionBank: A Resource for Revision-based Multi-document Summarization and Evaluation

Multi-level annotation

A powerful and versatile XML format for representing role-semantic annotation

Multilexemic

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Multilingual

A Multi-Modal Documentation System for Warao

Annotation Tools for Large-Scale Corpus Development: Using AGTK at the Linguistic Data Consortium

The Italian NESPOLE! Corpus: A Multilingual Database with Interlingua Annotation in Tourism and Medical Domains

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

Multilingual corpora

MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

Multilingual corpus annotation

The MULI Project: Annotation and Analysis of Information Structure in German and English

Multi-lingual document classification

N-Gram Language Modeling for Robust Multi-Lingual Document Classification

Multilingual Information Access

The Future of Evaluation for Cross-Language Information Retrieval Systems

Multilingual Knowledge Acquisition

Cross-Language Acquisition of Semantic Models for Verbal Predicates

Multilingual language resources

Bilingual Connections for Trilingual Corpora: An XML Approach

Multilingual lexical resources

Cross-effective cross-lingual document classification

Multilingual LR

The COST278 pan-European Broadcast News Database

Multilingual morphology

Semi-Automatic Derivation of a French Lexicon from CLIPS

Multilingual resources

Exploring Balkanet Shared Ontology for Multilingual Conceptual Indexing

Multilingual Pattern Libraries for Question Answering: a Case Study for Definition Questions

Multilingual speech technology

A Multilingual Phonological Resource Toolkit for Ubiquitous Speech Technology

Multilingual subtitling

Multimodal Multilingual Resources in the Subtitling Process

Multilingual TTS

A data-driven adaptation of prosody in a multilingual TTS

Multilingual wordnets

A Methodology and Associated Tools for Building Interlingual Wordnets

Multilingualism

Korean-Chinese-Japanese Multilingual Wordnet with Shared Semantic Hierarchy

Multilinguality

Automatic extraction of syntactic semantic patterns for multilingual resources

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Intelligent Building of Language Resources for HLT Applications

Measurements of Spoken Language Variability in a Multilingual Corpus. Predictable Aspects

Multilingual Content Processing

The C-ORAL-ROM CORPUS. A Multilingual Resource of Spontaneous Speech for Romance Languages.

Multimedia

 

Annotation Multi-media / Multi-modal Resources with ELAN

The C-ORAL-ROM CORPUS. A Multilingual Resource of Spontaneous Speech for Romance Languages

Multimedia coreference

ANNOTATION OF COREFERENCE RELATIONS AMONG LINGUISTIC EXPRESSIONS AND IMAGES IN BIOLOGICAL ARTICLES

Multimedia Information Retrieval

An argumentative annotation schema for meeting discussions

Multimodal

AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition

Evaluation of a Multimodal Dialogue System for Small-screen Devices

The Italian NESPOLE! Corpus: A Multilingual Database with Interlingua Annotation in Tourism and Medical Domains

User Query Analysis for the Specification and Evaluation of a Dialogue Processing and Retrieval System

Evaluation of Multi-party Virtual Reality Dialogue Interaction

Multimodal content

Standardization in Multimodal Content Representation: Some Methodological Issues

Multimodal corpora 

Experiments on Building Language Resources for Multi-Modal Dialogue Systems

Framework for data-driven video-realistic audio-visual speech-synthesis

Image-Language Multimodal Corpora: Needs, Lacunae and an AI Synergy for Annotation

Multimodal data management

Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Project

Multimodal Dialog

Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus

Multi-modal dialogue

Issues in Corpus Development for Multi-party Multi-modal Task-oriented Dialogue

Multimodal LR

The COST278 pan-European Broadcast News Database

Multimodal man-machine dialogue

Experiments on Building Language Resources for Multi-Modal Dialogue Systems

Multimodal Meaning

Multimodal Meaning Representation for Generic Dialogue Systems Architectures

Multimodal resources

Multimodal Multilingual Resources in the Subtitling Process

Multimodality

Evaluating Multimodal NLG using Production Experiments

Towards General-Purpose Annotation Tools – How far are we today?

Multi-party dialogue

Issues in Corpus Development for Multi-party Multi-modal Task-oriented Dialogue

Multiple linguistic levels

A pattern extraction workbench combining multiple linguistic levels

Multiple sequence alignment

Multiple Sequence Alignment for characterizing the linear structure of revision

Multi-speaker

Evaluation of Multi-party Virtual Reality Dialogue Interaction

Multi-style LR

A Spoken Afrikaans Language Resource Designed for Research on Pronunciation Variations

Multi-word

A New Approach to the Corpus-based Statistical Investigation of Hungarian Multi-word Lexemes

Multiword detection

FreeLing: An Open-Source Suite of Language Analyzers

Multiword expression

A Multilingual Database of Idioms

Multiword expressions

Reusable Lexical Representations for Idioms

Bootstrapping a database of German multi-word expressions

Multiword Lexical Units

Cluster Analysis and Classification of Named Entities

MultiWord Terminology

The Role of MultiWord Terminology in Knowledge Management

Multi-word terms

French-English multi-word term alignment based on lexical context analysis

Multiword unit extraction

Evaluation of Different Similarity Measures for the Extraction of Multiword Units in a Reinforcement Learning Environment

Mutual information

Collocation Extraction Using Web Statistics

Improving Collocation Extraction for High Frequency Words

MWE - identification and tagging

Portuguese Large-scale Language Resources for NLP Applications

mWER

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

N

 

Naďve Bayes

Learning to predict Pitch Accents using Bayesian Belief Networks for Greek Language

Evaluating Variants of the Lesk Approach for Disambiguating Words

Name Matching

Evaluating Name-Matching for Coreference Resolution

Named entities

Automatic Building Gazetteers of Co-referring Named Entities

Automatic Classification of Geographical Named Entities

Cluster Analysis and Classification of Named Entities

Exploiting Language Resources for Semantic Web Annotations

Named Entity

Definition, dictionaries and tagger for Extended Named Entity Hierarchy

Named entity extraction

Pattern Discovery in Named Organization Corpus

Named entity recognition

A Named Entity Recognizer for Danish

Corpus-based Learning of Lexical Resources for German Named Entity Recognition

Creation of reusable components and language resources for Named Entity Recognition in Russian

Large Scale Experiments for Semantic Labeling of Noun Phrases in Raw Text

Memory-based Classification of Proper Names in Norwegian

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

NameNet: A Self-Improving Resource for Name Classification

Named organization corpus

Pattern Discovery in Named Organization Corpus

Named-entity annotation

A Multi-Modal Documentation System for Warao

Annotation Tools for Large-Scale Corpus Development: Using AGTK at the Linguistic Data Consortium

Nasal vowels

An Acoustic Corpus Contemplating Regional Variation for Studies of European Portuguese Nasals

Natural interactivity

Towards General-Purpose Annotation Tools – How far are we today?

Natural language approach

A natural language approach to information management: tracking scientific advances through the structure of words

Natural language generation

Evaluating Multimodal NLG using Production Experiments

Natural language parsing

A Named Entity Recognizer for Danish

Using the Penn Treebank to Evaluate Non-Treebank Parsers

Natural language processing

A Bayesian Model for Shallow Syntactic Parsing of Natural Language Texts

A Comparison of Two Variant Corpora: The Same Content with Different Sources

Data Driven Ontology Evaluation

Pronominal Anaphora Resolution for Unrestricted Text

Natural language processing in biology

Part-of-Speech Annotation of Biology Research Abstracts

Natural Language Question-Answering

INQUER: A WordNet-based Question-Answering Application

NE subclassification

Automatic Classification of Geographical Named Entities

Neologisms

The Bilingual Web Dictionary on Demand

NE-Recognition

Extraction of Polish Named-Entities

NetDC

Network of Data Centres (NetDC): BNSC – An Arabic Broadcast News Speech Corpus

Network Model

Extraction of Hyperonymy of Adjectives from Large Corpora by Using the Neural Network Model

Network of grammars

A Hybrid Strategy for Regular Grammar Parsing

Neural

Extraction of Hyperonymy of Adjectives from Large Corpora by Using the Neural Network Model

Neural networks

A data-driven adaptation of prosody in a multilingual TTS

News

Terminal Device Oriented Comparable Corpora and its Alignment -- Towards Extracting Paraphrasing Patterns --

n-gram language models

Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan Guatemala)

n-gram model

Pumping Documents Through a Domain and Genre Classification Pipeline

n-gram modelling

Human dialogue modelling using annotated corpora

NIST

 

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

Interpreting Bleu/NIST Scores: How Much Improvement do We Need to Have a Better System?

NITE XML Toolkit

Using the NITE XML Toolkit on the Switchboard Corpus to Study Syntactic Choice: A Case Study

NLG tools

Open-source Tools for Creation, Maintenance, and Storage of Lexical Resources for Language Generation from Ontologies

NLP

A Suite of Tools for Marking Up Textual Data for Temporal Text Mining Scenarios

ALLES: Integrating NLP in ICALL Applications

From Weaver to the ALPAC Report 

NLP Application

The DeepThought Core Architecture Framework

NLP Architecture

The DeepThought Core Architecture Framework

NLP processing

NLP-enhanced error Checking for Catalan unrestricted text

NLP resources

The Rationale for Building Resources Expressly for NLP

NLP-products

How to Disassemble Alphabetical Processions - Morphological Treatment of Unknown Words

NLPWin

Using the Penn Treebank to Evaluate Non-Treebank Parsers

Noise filtering

Discarding noise in an automatically acquired lexicon of support verb constructions

Nominalization

Annotating Noun Argument Structure for NomBank

Nominals

Annotating Noun Argument Structure for NomBank

Non-compositional translation

French-English multi-word term alignment based on lexical context analysis

Normalisation

Evaluating Name-Matching for Coreference Resolution

Norwegian

Memory-based Classification of Proper Names in Norwegian

Nouns

Annotating Noun Argument Structure for NomBank

NTPC

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Number/date/amounts recognition

FreeLing: An Open-Source Suite of Language Analyzers

O

 

Offline handwriting recognition

The SPARTACUS-Database: a Spanish Sentence Database for Offline Handwriting Recognition

Old-Church Slavonic

Towards Intelligent Written Cultural Heritage Processing - Lexical Processing

One sense per discourse

Utilizing the One-Sense-per-Discourse Constraint for Fully Unsupervised Word Sense Induction and Disambiguation

Online handwriting

An XML Representation for Annotated Handwriting Datasets for Online Handwriting Recognition

Experiences in Collection of Handwriting Data for Online Handwriting Recognition in Indic Scripts

Online tool

The Translation Correction Tool: English-Spanish User Studies

Ontologies 

An Automatic Method for Constructing Domain-Specific Ontology Resources

Concept Creation in Lexical Ontologies

Enriching WordNet Via Generative Metonymy and Creative Polysemy

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Intelligent Building of Language Resources for HLT Applications

OntoTag's Linguistic Ontologies: Enhancing Higher Level and Semantic Web Annotations

Semantic Mark-up of Italian Legal Texts Through NLP-based Techniques

The Integral Dictionary: An Ontological Resource for the Semantic Web Integration of EuroWordNet, Balkanet, TID and SUMO

Towards Ontology Engineering Based on Linguistic Analysis

Towards the MEANING Top Ontology: Sources of Ontological Meaning

Ontology

Annotating a corpus for building a domain-specific knowledge base

ANNOTATION OF COREFERENCE RELATIONS AMONG LINGUISTIC EXPRESSIONS AND IMAGES IN BIOLOGICAL ARTICLES

Concept-based queries: Combining and Reusing Linguistic Corpus Formats and Query Languages

Data Driven Ontology Evaluation

Design and Implementation of a Semantic Search Engine for Portuguese

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

Open-source Tools for Creation, Maintenance, and Storage of Lexical Resources for Language Generation from Ontologies

Securing Interpretability: The Case of Ega Language Documentation

Sinica BOW (Bilingual Ontological Wordnet): Integration of Bilingual WordNet and SUMO

The Bilingual Web Dictionary on Demand

The GENOMA-KB Platform: Queries Over Integrated Linguistic Resources

The GENOMA-KB project: towards the integration of concepts, terms, textual corpora and entities

The Rationale for Building Resources Expressly for NLP

The Role of MultiWord Terminology in Knowledge Management

Thesaurus or logical ontology, which do we need for mining text?

Ontology construction

 

An Automatic Method for Constructing Domain-Specific Ontology Resources

Principles of a system for terminological concept modelling

Ontology evaluation

An Automatic Method for Constructing Domain-Specific Ontology Resources

Automatic Generation of Glosses in the OntoLearn System

Ontology Evaluation Functionalities of RDF(S), DAML+OIL, and OWL Parsers and Ontology Platforms

Ontology learning

Automatic Generation of Glosses in the OntoLearn System

Clustering Concept Hierarchies from Text

Ontology platforms

Ontology Evaluation Functionalities of RDF(S), DAML+OIL, and OWL Parsers and Ontology Platforms

Open source

Open Resources for Language Technology

The Language Belongs to the People!

The OPUS corpus - parallel and free

FreeLing: An Open-Source Suite of Language Analyzers

Open source LRs

Creating open language resources for Hungarian

Oral Databases

Collection of SLR in the Asian-Pacific area

Orthographic transcription

Orthographic and Phonetic Annotation of Very Large Czech Corpora with Quality Assessment

OWL

Ontology Evaluation Functionalities of RDF(S), DAML+OIL, and OWL Parsers and Ontology Platforms

P

 

P2P

Collaborative Annotation of Sign Language Data with Peer-to-Peer Technology

Paradigmatic relations

Automatic Acquisition of Paradigmatic Relations using Iterated Co-occurrences

Paradigms

Using Paradigm Tables to Generate New Utterances Similar to those Existing in Linguistic Resources

PARADISE

Usability Evaluation of Spoken Dialogue Systems

Parallel corpora

Annotation of anaphoric expressions in an aligned bilingual corpus

Construction of a Bilingual Arabic-Spanish Lexicon of Verbs Based on a Parallel Corpus

Parallel corpora for the Galician language: building and processing of the CLUVI (Linguistic Corpus of the University of Vigo)

The OPUS corpus - parallel and free

Word Sense Disambiguation as a Wordnets' Validation Method in Balkanet

 parallel corpus

A word alignment system based on a translation equivalence extractor

Evaluation of Cross-Language Information Retrieval Using the Domain-Specific GIRT Data as Parallel German-English Corpus

Grouping Synonymous Sentences from a Parallel Corpus

Parallel treebank

Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation

Paraphrase

Building a Paraphrase Corpus for Speech Translation

Grouping Synonymous Sentences from a Parallel Corpus

Terminal Device Oriented Comparable Corpora and its Alignment -- Towards Extracting Paraphrasing Patterns --

PAROLE

Putting the Dutch PAROLE Corpus to Work

Parser evaluation

Phrase-Based Dependency Evaluation of a Japanese Parser

Parsers

Ontology Evaluation Functionalities of RDF(S), DAML+OIL, and OWL Parsers and Ontology Platforms

Parsing

Parsing Ungrammatical Input: An Evaluation Procedure

Semi-automatic Syntactic and Semantic Corpus Annotation with a Deep Parser

Partial Parsing

A High Quality Partial Parser for Annotating German Text Corpora

A Hybrid Strategy for Regular Grammar Parsing

Automatic Methods to Supplement Broad-Coverage Subcategorization Lexicons

Part-of-speech

A Galician Textual Corpus for Morphosyntactic Tagging with Application to Text-to-Speech Synthesis

A Search Tool for Corpora with Positional Tagsets and Ambiguities

Building Part-of-speech Corpora through Histogram Hopping

Part-of-speech (POS)

Developping tools and building linguistic resources for Vietnamese morpho-syntactic processing

Part-of-speech corpus

Part-of-Speech Annotation of Biology Research Abstracts

Part-of-speech tagging

A Morphological Analyzer for Standard Albanian

MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

Patent maps

Test Collections for Patent-to-Patent Retrieval and Patent Map Generation in NTCIR-4 Workshop

Patent retrieval

Test Collections for Patent-to-Patent Retrieval and Patent Map Generation in NTCIR-4 Workshop

Pattern discovery

Pattern Discovery in Named Organization Corpus

Pattern extraction

A pattern extraction workbench combining multiple linguistic levels

Pattern matching

Unsupervised Text Mining for Ontology Extraction: An Evaluation of Statistical Measures

Patterns

From Acts and Topics to Transactions and Dialogue Smoothness

Identifying Definitions in Text Collections for Question Answering

PCFG

Comparing the Ambiguity Reduction Abilities of Probabilistic Context-Free Grammars

Peer-to-peer

Collaborative Annotation of Sign Language Data with Peer-to-Peer Technology

Penn Discourse Treebank

The Penn Discourse Treebank

Penn Treebank

Converting Treebank Annotations to Language Neutral Syntax

Using the Penn Treebank to Evaluate Non-Treebank Parsers

Perception

The Influence of the Labeller’s Regional Background on Phonetic Transcriptions: Implications for the Evaluation of Spoken Language Resources

Perception test

Evaluating an Authentic Audio-Visual Expressive Speech Corpus

Perceptual evaluation

Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification

Performance

Benchmarking ontology tools. A case study for the WebODE platform

Phoneme confusion matrix

The COST 278 MASPER initiative - crosslingual speech recognition with large telephone databases

Phoneme-to-grapheme

A Graphical Tool for Handling Rule Grammars in Java Speech Grammar Format

Phonetic analysis

WinPitch Corpus, a Text to Speech Alignment Tool for Multimodal Corpora

Phonetic balance

The American English SALA-II Data Collection

Phonetic corpus

Embedding IMDI Metadata into a Large Phonetic Corpus

Phonetic Lexicons

CrossTowns: Automatically Generated Phonetic Lexicons of Cross-Lingual Pronunciation Variants of European City Names

Phonetic transcription

Orthographic and Phonetic Annotation of Very Large Czech Corpora with Quality Assessment

The Influence of the Labeller’s Regional Background on Phonetic Transcriptions: Implications for the Evaluation of Spoken Language Resources

Phonetic transcriptions

On the Usefulness of Large Spoken Language Corpora for Linguistic Research

Phonetic units

Evaluation of a Spoken Phonetic Databse in Basque Language

Phonetically balanced corpus

VOXMEX Speech Database: Design of a Phonetically Balanced Corpus

Phonological and Morphological Lexical DB

Unifying Lexicons in View of a Phonological and Morphological Lexical DB

Phonological resource

A Multilingual Phonological Resource Toolkit for Ubiquitous Speech Technology

Phonology

Phonological Treebanks - Issues in Generation and Application

Phonotactics

Acquiring Reusable Multilingual Phonotactic Resources

Phrase structure trees

Automatic transformation of phrase treebanks to dependency trees

Phrase structures

Mapping Dependency Structures to Phrase Structures and the Automatic Acquisition of Mapping Rules

Pipping

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Pitch Accent Tones

Learning to predict Pitch Accents using Bayesian Belief Networks for Greek Language

Pitch contour

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Place name

Resources for Place Name Analysis

Platform independence

SpeechRecorder - a Universal Platform Independent Multi-Channel Audio Recording Software

Polish

Extraction of Polish Named-Entities

Political Issues

ENABLER Thematic Network of National Projects: Technical, Strategic and Political Issues of LRs

Polygenesis

Dynamic Lexicographic Data Modelling. A Diachronic Dictionary Development Report

Polysemy

Polysemy and Category Structure in WordNet: An Evidential Approach

Portuguese

Evaluating Solutions for the Rapid Development of State-of-the-Art POS taggers for Portuguese

On the problems of creating a golden standard of inflected forms in Portuguese

Providing on-line access to Portuguese language resources: corpora and lexicons

What is my Style? Using Stylistic Features of Portuguese Web Texts to classify Web pages according to Users' Needs

POS

A Search Tool for Corpora with Positional Tagsets and Ambiguities

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

PoS and headword encoding

The Integrated Language Database of 8th - 21st-Century Dutch

POS annotated corpus

The Lácio-Web: Corpora and Tools to advance Brazilian Portuguese Language Investigations and Computational Linguistic Tools

POS information

Multifunctional Computational Lexicon of Contemporary Portuguese: An Available Resource for Multitype Applications

POS tagging

Combining symbolic and statistical methods in morphological analysis and unknown word guessing

FreeLing: An Open-Source Suite of Language Analyzers

SVMTool: A general POS tagger generator based on Support Vector Machines

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Using PiTagger for Lemmatization and PoS Tagging of a Spontaneous Speech Corpus: C-ORAL-ROM Italian

POS tags

Towards the Use of Word Stems and Suffixes for Statistical Machine Translation

Positional tagset

A Search Tool for Corpora with Positional Tagsets and Ambiguities

Potential function

Classification of Japanese Spatial Nouns

PP attachment

A Labelled Corpus for Prepositional Phrase Attachment

Exploring Portability of Syntactic Information from English to Basque

Precedence units

Mapping Dependency Structures to Phrase Structures and the Automatic Acquisition of Mapping Rules

Predicate

Annotating Noun Argument Structure for NomBank

Predicate argument structure

Converting Treebank Annotations to Language Neutral Syntax

Prefix verbs

Automatic Methods to Supplement Broad-Coverage Subcategorization Lexicons

Prepositional phrase attachment

A Labelled Corpus for Prepositional Phrase Attachment

Procedural semantics

Some Meaning Procedures of Ontological Semantics

Processing questions

Semi-Automatic Construction of a Question Treebank

Production experiment

Evaluating Multimodal NLG using Production Experiments

Pronouns

Can Anaphoric Definite Descriptions be Replaced by Pronouns?

Pronunciation dictionary

Spoken and Written Language Resources for Vietnamese

Pronunciation Variants

CrossTowns: Automatically Generated Phonetic Lexicons of Cross-Lingual Pronunciation Variants of European City Names

Pronunciation variation

A Spoken Afrikaans Language Resource Designed for Research on Pronunciation Variations

On the Usefulness of Large Spoken Language Corpora for Linguistic Research

Pronunciation variation modeling

Improving Automatic Phonetic Transcription of Spontaneous Speech through Variant-Based Pronunciation Variation Modelling

Proper names

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

Exploiting Anchor Text as a Lexical Resource

Prosodic and lexical cues

Reliability of Lexical and Prosodic Cues in two Real-life Spoken Dialog Corpora

Prosodic Corpus

Japanese MULTEXT: A Prosodic Corpus

Prosody

A data-driven adaptation of prosody in a multilingual TTS

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Designing and Recording an Audiovisual Database of Emotional Speech in Basque

Evaluation of Consensus on the Annotation of Prosodic Breaks in the Romance Corpus of Spontaneous Speech “C-ORAL-ROM”

Learning to predict Pitch Accents using Bayesian Belief Networks for Greek Language

The C-ORAL-ROM CORPUS. A Multilingual Resource of Spontaneous Speech for Romance Languages

Use and Evaluation of Prosodic Annotations in Dutch

You stupid tin box' - children interacting with the AIBO robot: A cross-linguistic emotional speech corpus

Prosody modification

Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification

Protocol

AN EVALUATION PROTOCOL FOR TEXT MINING TOOLS: ALCESTE SAS TEXT MINER SPAD-CRM AND TEMIS TEXT MINING SOLUTIONS TESTING

Protocols and Measures

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

Prototype

The Integrated Language Database of 8th - 21st-Century Dutch

Prototyping

The CLaRK System:  XML-based Corpora Development System for Rapid Prototyping

Proxy/Search logs

Collocation Extraction Using Web Statistics

Psycholinguistics

An Emerging Transcontinental Collaborative Research and Education Agenda in Human Language Technologies

Public available

The BITS Speech Synthesis Corpus for German

Q

 

Qualitative

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

Quantitative

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

Query

Design of an Interactive Web-based User Interface for Speech Database Query Formation

Query elicitation

User Query Analysis for the Specification and Evaluation of a Dialogue Processing and Retrieval System

Query Expansion

Information Retrieval System Using Latent Contextual Relevance

Query formation

Design of an Interactive Web-based User Interface for Speech Database Query Formation

Query languages

Concept-based queries: Combining and Reusing Linguistic Corpus Formats and Query Languages

Querying

Querying both time-aligned and hierarchical corpora with NXT Search

Question Answering

A2Q: an agent-based architecure for multilingual Q&A

Identifying Definitions in Text Collections for Question Answering

Multilingual Pattern Libraries for Question Answering: a Case Study for Definition Questions

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

Using large multi-purpose corpora for specific research questions: discourse phenomena related to wh-questions in the Spoken Dutch Corpus

Semi-Automatic Construction of a Question Treebank

The DeepThought Core Architecture Framework

Using Semantic Language Resources to Support Textual Inference for Question Answering

Question answering systems

An Information Repository Model for Advanced Question Answering Systems

Quick transcription

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

R

 

Radio dialogue

Issues in Corpus Development for Multi-party Multi-modal Task-oriented Dialogue

Random indexing

Automatic Bilingual Lexicon Acquisition Using Random Indexing of Aligned Bilingual Data

Word Sense Disambiguation Using Random Indexing

RDB

Retrieving Annotated Corpora for Corpus Annotation

RDF(S)

Ontology Evaluation Functionalities of RDF(S), DAML+OIL, and OWL Parsers and Ontology Platforms

Real-life call centers database

Reliability of Lexical and Prosodic Cues in two Real-life Spoken Dialog Corpora

Recognition of emotion

You stupid tin box' - children interacting with the AIBO robot: A cross-linguistic emotional speech corpus

Recognition of terminological variants

Using Weighted Abduction to Align Term Variant Translations in Bilingual Texts

Recruitment

The American English SALA-II Data Collection

Reference

Towards a Reference Annotation Framework

Reference frame

Classification of Japanese Spatial Nouns

Reference speakers

Development of New Telephone Speech Databases for French: The NEOLOGOS Project

Referential

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

RefRec Reference Recognizer

Development and Integration of the LDA-Toolkit into the COST249 SpeechDat (II) SIG Reference Recognizer

Regional language variation

The Influence of the Labeller’s Regional Background on Phonetic Transcriptions: Implications for the Evaluation of Spoken Language Resources

Regional variation

An Acoustic Corpus Contemplating Regional Variation for Studies of European Portuguese Nasals

Regular inference

Acquiring Reusable Multilingual Phonotactic Resources

Relations

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

Reliability

Computing Reliability for Coreference Annotation

Repetition

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Repository

Securing Interpretability: The Case of Ega Language Documentation

WALA: a multilingual resource repository for West African Languages

Representation

A powerful and versatile XML format for representing role-semantic annotation

Representing Italian Complex Nominals: a Pilot Study

Representation Format

Multimodal Meaning Representation for Generic Dialogue Systems Architectures

Research Funding

Strategic Directions of National and International Research Funding

Resource centers

A Large Metadata Domain of Language Resources

Resource enhancement

Incremental Knowledge Acquisition from WordNet and EuroWordNet

Resource maps

A Progress Report from the Linguistic Data Consortium: Recent Activities in Resource Creation and Distribution and the Development of Tools and Standards

Resources

Semi-automatic Acquisition of Command Grammar

The Cross-Breeding of Dictionaries

Resources for machine translation

Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation

Response appropriateness

Evaluation of Multi-party Virtual Reality Dialogue Interaction

Reusability

Infrastructure for Collaborative Annotation of Speech 39

Securing Interpretability: The Case of Ega Language Documentation

WALA: a multilingual resource repository for West African Languages

Reusability and Management of LRs

Unifying Lexicons in View of a Phonological and Morphological Lexical DB

Reuters-21578

An Analysis of the Relative Difficulty of Reuters-21578 Subsets

Rhetorical analysis

An Annotation Scheme for a Rhetorical Analysis of Biology Articles

Rhetorical structure

CST Bank: A Corpus for the Study of Cross-document Structural Relationships

Rich transcription

The ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Robust parsing

Hybrid Constraints for Robust Parsing: First Experiments and Evaluation

Robust recognition

Creation and Assessment of Korean Speech and Noise DB in Car Environment

Robust Semantics

The DeepThought Core Architecture Framework

Robustness

COMPARATIVE EVALUATION OF A STOCHASTIC PARSER ON SEMANTIC AND SYNTACTIC-SEMANTIC LABELS

Lexical Entry Templates for Robust Deep Parsing

Romance languages

Evaluation of Consensus on the Annotation of Prosodic Breaks in the Romance Corpus of Spontaneous Speech “C-ORAL-ROM”

Measurements of Spoken Language Variability in a Multilingual Corpus. Predictable Aspects

The C-ORAL-ROM CORPUS. A Multilingual Resource of Spontaneous Speech for Romance Languages

Rule grammars

A Graphical Tool for Handling Rule Grammars in Java Speech Grammar Format

Rules

Why do you ignore me? - Proof that not all direct speech is bad

Russian

Creation of reusable components and language resources for Named Entity Recognition in Russian

Integration of Russian Language Resources

S

 

SALA

SALA II across the finish line: a large collection of mobile telephone speech databases from North and Latin America completed

Schwa deletion

Automatic Generation of Compound Word Lexicon for Hindi Speech Synthesis

Search

Linguistic Corpus Search

Second language acquisition

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Second-language learning

A Grammar and Style Checker Based on Internet Searches

Detecting Errors in English Article Usage with a Maximum Entropy Classifier Trained on a Large, Diverse Corpus

Second-language teaching

Detecting Errors in English Article Usage with a Maximum Entropy Classifier Trained on a Large, Diverse Corpus

Selectional Preferences

Enriching a Thai Lexical Database with Selectional Preferences

Semantic Annotation

Evaluation Resources for Concept-based Cross-Lingual Information Retrieval in the Medical Domain

Handling Subtle Sense Distinctions through Wordnet Semantic Types

Large Scale Experiments for Semantic Labeling of Noun Phrases in Raw Text

MiniCors and Cast3LB: Two Semantically Tagged Spanish Corpora

Semantic Annotating of Czech Corpus via WSD

The French MEDIA/EVALDA project: the evaluation of the understanding capability of Spoken Language Dialogue Systems

Semantic association

Finding Semantic Associations on Express Lane

Semantic classification

Utilization of Multiple Language Resources for Robust Grammar-Based Tense and Aspect Classification

Semantic corpus annotation

Steps Towards Semantically Annotated Language Resources

Semantic frame

COMPARATIVE EVALUATION OF A STOCHASTIC PARSER ON SEMANTIC AND SYNTACTIC-SEMANTIC LABELS

Semantic Inference

INQUER: A WordNet-based Question-Answering Application

Semantic information

FrameNet as a ``Net''

Standardization in Multimodal Content Representation: Some Methodological Issues

Semantic lexicon

Constructing Word-Sense Association Networks from Bilingual Dictionary and Comparable Corpora

Evaluating Lexical Resources for A Semantic Tagger

Semi-Automatic Derivation of a French Lexicon from CLIPS

Semantic mark-up

Semantic Mark-up of Italian Legal Texts Through NLP-based Techniques

Semantic network

Finding Semantic Associations on Express Lane

Semantic Orientation

Using WordNet to Measure Semantic Orientations of Adjectives

Semantic overlap

Agreement in Human Factoid Annotation for Summarization Evaluation

Semantic relations

Building a Maritime Domain Lexicon: a Few Considerations on the Database Structure and the Semantic Coding

Extending WordNets to Implicit Information

FrameNet as a "Net''

Proper Names and Polysemy: from a Lexicographic Experience

Semantic relationship

A Model of Semantic Representations Analysis For Chinese Sentences

Semantic resources

Building a Conceptual GraphBank for Chinese Language

Semantic roles

A powerful and versatile XML format for representing role-semantic annotation

Top Ontology as a Tool for Semantic Role Tagging

Semantic similarity

A Freely Available Automatically Generated Thesaurus of Related Words

Meaningful Clusters

Semantic Tagging

A Similarity Measure for Unsupervised Semantic Disambiguation

Semantic types

Handling Subtle Sense Distinctions through Wordnet Semantic Types

Semantic Web

Bayesian Semantics Incorporation to Web Content for Natural Language Information Retrieval

Data Driven Ontology Evaluation

Exploiting Language Resources for Semantic Web Annotations

Exploiting Semantic Web Technologies for Intelligent Access to Historical Documents

Human Language Technology Elements in a Knowledge Organisation System -The VID project

OntoTag's Linguistic Ontologies: Enhancing Higher Level and Semantic Web Annotations

Sinica BOW (Bilingual Ontological Wordnet): Integration of Bilingual WordNet and SUMO

The Integral Dictionary: An Ontological Resource for the Semantic Web Integration of EuroWordNet, Balkanet, TID and SUMO

Towards A Language Infrastructure for the Semantic Web

Semantics

Design and Implementation of a Semantic Search Engine for Portuguese

Investigation on Semantics to Improve the COVAX System

Multimodal Meaning Representation for Generic Dialogue Systems Architectures

Representing Italian Complex Nominals: a Pilot Study

Semi-automatic Syntactic and Semantic Corpus Annotation with a Deep Parser

The Automatic Content Extraction (ACE) Program - Tasks, Data, and Evaluation

Semi-automatic refinement of translation grammar and lexicon

The Translation Correction Tool: English-Spanish User Studies

Semi-supervised learning

Semi-supervised learning by Fuzzy clustering and Ensemble learning

Sense

Sejong Korean Corpora in the Making

Sense indicator

Semi-Automatic Derivation of a French Lexicon from CLIPS

Sentence alignment

Using a Parallel Transcript/Subtitle Corpus for Sentence Compression

Sentence Chunking

Evaluating Solutions for the Rapid Development of State-of-the-Art POS taggers for Portuguese

Sentence compression

Multimodal Multilingual Resources in the Subtitling Process

Using a Parallel Transcript/Subtitle Corpus for Sentence Compression

Sentence reduction

Using a Parallel Transcript/Subtitle Corpus for Sentence Compression

Sentence similarity

CST Bank: A Corpus for the Study of Cross-document Structural Relationships

Sentence simplification

Automatic Sentence Simplification for Subtitling in Dutch and English

Separable verbs

Towards a Dynamic Lexicon: Predicting the Syntactic Argument Structure of Complex Verbs

SGML

Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

Shallow Processing

Evaluating Solutions for the Rapid Development of State-of-the-Art POS taggers for Portuguese

Shallow syntactic parsing

A Bayesian Model for Shallow Syntactic Parsing of Natural Language Texts

Verb Valency Descriptors for a Syntactic Treebank

Shallow Text Processing

Extraction of Polish Named-Entities

Shared semantic hierarchy

Korean-Chinese-Japanese Multilingual Wordnet with Shared Semantic Hierarchy

Sign Language

Toward an Annotation Software for Video of Sign Language, Including Image Processing Tools and Signing Space Modelling

Similarity measure

A Similarity Measure for Unsupervised Semantic Disambiguation

Evaluation of Different Similarity Measures for the Extraction of Multiword Units in a Reinforcement Learning Environment

Similarity score

A Comparison of Two Variant Corpora: The Same Content with Different Sources

Singular value decomposition

A Freely Available Automatically Generated Thesaurus of Related Words

Utilizing the One-Sense-per-Discourse Constraint for Fully Unsupervised Word Sense Induction and Disambiguation

Slovene

Making an XML-based Japanese-Slovene Learners' Dictionary

Slovenian

A data-driven adaptation of prosody in a multilingual TTS

Acquisition and Annotation of Slovenian Broadcast News Database

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

Small screen devices

Evaluation of a Multimodal Dialogue System for Small-screen Devices

Smart home system

INSPIRE: Evaluation of a Smart-Home System for Infotainment Management and Device Control

SMIL

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Smoothing techniques

N-Gram Language Modeling for Robust Multi-Lingual Document Classification

SMT

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

MT Goes Farming: Comparing Two Machine Translation Approaches on a New Domain

Syntax

A Syntactically Annotated Corpus of Tibetan

Software development

Open Resources for Language Technology

Software evaluation

Functional Requirements for an Interlinear Text Editor

Source language

A Comparison of Two Variant Corpora: The Same Content with Different Sources

Spanish Processing

Construction of a Bilingual Arabic-Spanish Lexicon of Verbs Based on a Parallel Corpus

Spanish sentences

The SPARTACUS-Database: a Spanish Sentence Database for Offline Handwriting Recognition

Spanish Wordnet

Selecting the Correct English Synset for a Spanish Sense

Spatial Representation

Toward an Annotation Software for Video of Sign Language, Including Image Processing Tools and Signing Space Modelling

Speaker corpus

Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004

Speaker detection

Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004

Speaker evaluation

Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004

Speaker recognition

Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

Speaker verification

Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004

Specialist texts

A natural language approach to information management: tracking scientific advances through the structure of words

Specialized corpora

The Corpógrafo – a Web-based environment for corpora research

Specialized corpus

A Xml-Based Term Extraction Tool for Basque

Special-purpose

Towards General-Purpose Annotation Tools – How far are we today?

Speech

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

Speech and noise DB

Creation and Assessment of Korean Speech and Noise DB in Car Environment

Speech corpora

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Evaluation of Consensus on the Annotation of Prosodic Breaks in the Romance Corpus of Spontaneous Speech “C-ORAL-ROM”

Measurements of Spoken Language Variability in a Multilingual Corpus. Predictable Aspects

Speech corpus

Network of Data Centres (NetDC): BNSC – An Arabic Broadcast News Speech Corpus

Spoken and Written Language Resources for Vietnamese

The BITS Speech Synthesis Corpus for German

Open Collaborative Development of the Thai Language Resources for Natural Language Processing

Speech data collection

Collecting Spontaneously Spoken Queries for Information Retrieval

Speech database

Design of an Interactive Web-based User Interface for Speech Database Query Formation

Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification

Speech databases

Cypriot Speech Database: Data Collection and Greek to Cypriot Dialect Adaptation

OrienTel - Telephony Databases Across Northern Africa and the Middle East

SALA II across the finish line: a large collection of mobile telephone speech databases from North and Latin America completed

SLR Validation: Current Trends and Developments

Speech intensity

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Speech learner corpus

The overview of the SST speech corpus of Japanese learner English and evaluation through the experiment on automatic detection of learners' errors

Speech mode

A Spoken Afrikaans Language Resource Designed for Research on Pronunciation Variations

Speech processing

Development of Slovenian Broadcast News Speech Database

Speech recognition

A complete understanding speech system based on semantic concepts

An Efficient Word Confidence Measure Using Likelihood Ratio Scores

Automatic Phonemic Labeling and Segmentation of Spoken Dutch

Collection and Evaluation of Broadcast News Data for Arabic

Collection of SLR in the Asian-Pacific area

Development of New Telephone Speech Databases for French: The NEOLOGOS Project

Evaluation of a Spoken Phonetic Databse in Basque Language.

Evaluation of Microphone Array Front-Ends for ASR - an Extension of the AURORA Framework

The African Speech Technology Project: An Assessment

The ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

Speech recording

SpeechRecorder - a Universal Platform Independent Multi-Channel Audio Recording Software

Speech resources

Evaluation of a Spoken Phonetic Databse in Basque Language.

Infrastructure for Collaborative Annotation of Speech

Speech styles

On the Usefulness of Large Spoken Language Corpora for Linguistic Research

Speech synthesis

Automatic Generation of Compound Word Lexicon for Hindi Speech Synthesis

Corpus Design, Recording and Phonetic Analysis of Greek Emotional Database

Learning to predict Pitch Accents using Bayesian Belief Networks for Greek Language

Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification

The BITS Speech Synthesis Corpus for German

Framework for data-driven video-realistic audio-visual speech-synthesis

Speech to text

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

Speech translation

A Comparative Study on Human Communication Behaviors and Linguistic Characteristics for Speech-to-Speech Translation

Building a Paraphrase Corpus for Speech Translation

Speech understanding

A complete understanding speech system based on semantic concepts

Speechdat

SALA II across the finish line: a large collection of mobile telephone speech databases from North and Latin America completed

SpeechDat database

The COST 278 MASPER initiative - crosslingual speech recognition with large telephone databases

SpeechDat (II)

Development and Integration of the LDA-Toolkit into the COST249 SpeechDat (II) SIG Reference Recognizer

Speech-recognition

Evaluation and Adaptation of the Celex Dutch Morphological Database

Speech-to-speech MT

A Fine-Grained Evaluation Method for Speech-to-Speech Machine Translation Using Concept Annotations

Speech-to-speech translation

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

Creation and Validation of Large Lexica for Speech-to-Speech Translation Purposes

Speech-to-text

Linguistic Resources for Effective, Affordable, Reusable Speech-to-Text

Spelling Correction

Data Collection and Analysis of Mapudungun Morphology for Spelling Correction

Spoken and written language resources

A Progress Report from the Linguistic Data Consortium: Recent Activities in Resource Creation and Distribution and the Development of Tools and Standards

Spoken corpora

The C-ORAL-ROM CORPUS. A Multilingual Resource of Spontaneous Speech for Romance Languages

Spoken Dialog Corpora

An argumentative annotation schema for meeting discussions

Automatic Keyword Extraction from Spoken Text. A Comparison of two Lexical Resources: the EDR and WordNet

Spoken dialogue

A Comparative Study on Human Communication Behaviors and Linguistic Characteristics for Speech-to-Speech Translation

From Acts and Topics to Transactions and Dialogue Smoothness

Spoken dialogue system

A New ITU-T Recommendation on the Evaluation of Telephone-Based Spoken Dialogue Systems

Can We Talk? Prospects for Automatically Training Spoken Dialogue Systems

INSPIRE: Evaluation of a Smart-Home System for Infotainment Management and Device Control

Usability Evaluation of Spoken Dialogue Systems

Spoken dialogues

The Italian NESPOLE! Corpus: A Multilingual Database with Interlingua Annotation in Tourism and Medical Domains

Spoken Dutch Corpus

The Influence of the Labeller’s Regional Background on Phonetic Transcriptions: Implications for the Evaluation of Spoken Language Resources

Using large multi-purpose corpora for specific research questions: discourse phenomena related to wh-questions in the Spoken Dutch Corpus

Spoken language

A Comparative Study on Human Communication Behaviors and Linguistic Characteristics for Speech-to-Speech Translation

Linguistic annotation of the Spoken Dutch Corpus: If we had to do it all over again ...

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Spoken language corpora

On the Usefulness of Large Spoken Language Corpora for Linguistic Research

Spoken language dialog system

The French MEDIA/EVALDA project: the evaluation of the understanding capability of Spoken Language Dialogue Systems

Spoken language dialogue systems

Usability Evaluation of Multimodal and Domain-Oriented Spoken Language Dialogue Systems

Spoken language processing

Evaluating Multimodal NLG using Production Experiments

Spoken language resources

Linguistic Resources for Effective, Affordable, Reusable Speech-to-Text

Spontaneous human-machine interaction

COMPARATIVE EVALUATION OF A STOCHASTIC PARSER ON SEMANTIC AND SYNTACTIC-SEMANTIC LABELS

Spontaneous speech

Automatic audio and manual transcripts alignment, time-code transfer and selection of exact transcripts

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Collecting Spontaneously Spoken Queries for Information Retrieval

Improving Automatic Phonetic Transcription of Spontaneous Speech through Variant-Based Pronunciation Variation Modelling

Spontaneous speech corpora

Using PiTagger for Lemmatization and PoS Tagging of a Spontaneous Speech Corpus: C-ORAL-ROM Italian

Square Mean Root (SMR)

Distributional Consistency: As a General Method for Defining a Core Lexicon

Stacked transformation-based learning

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Stacking

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Stand off annotation

Callisto: A Configurable Annotation Workbench

Standardization

A New ITU-T Recommendation on the Evaluation of Telephone-Based Spoken Dialogue Systems

MED-TYP: A Typological Database for Mediterranean Languages

Standardized Patients

Creation of a Doctor-Patient Dialogue Corpus Using Standardized Patients

Standards

A Registry of Standard Data Categories for Linguistic Annotation

Content Interoperability of Lexical Resources: Open Issues and “MILE” Perspectives

ELRA Validation Methodology and Standard Promotion for Linguistic Resources

Reusable Lexical Representations for Idioms

Towards an International Standard on Feature Structure Representation

Statistical

Translation memories enrichment by statistical bilingual segmentation

Statistical Bilingual Segmentation

Translation memories enrichment by statistical bilingual segmentation

Statistical domain knowledge representation

Bayesian Semantics Incorporation to Web Content for Natural Language Information Retrieval

Statistical machine translation

Augmenting Manual Dictionaries for Statistical Machine Translation Systems

Bilingual Connections for Trilingual Corpora: An XML Approach

Language Model Adaptation for Statistical Machine Translation based on Information Retrieval

MT Goes Farming: Comparing Two Machine Translation Approaches on a New Domain

Towards the Use of Word Stems and Suffixes for Statistical Machine Translation

Statistical measures

Unsupervised Text Mining for Ontology Extraction: An Evaluation of Statistical Measures

Statistical methods

Finding the Correct Interpretation of Swedish Compounds a Statistical Approach

Statistics

The Statistical Analysis of Morphosyntactic Distributions

STBL

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Stem

Towards the Use of Word Stems and Suffixes for Statistical Machine Translation

Strategic

ENABLER Thematic Network of National Projects: Technical, Strategic and Political Issues of LRs

Structure learning

Language Modeling using Dynamic Bayesian Networks

Structure retrieval

Retrieving Annotated Corpora for Corpus Annotation

Structured cross-language sense inventory

Exploring Balkanet Shared Ontology for Multilingual Conceptual Indexing

Student essay evaluation

Application of the BLEU Method for Evaluating Free-text Answers in an E-learning Environment

Stylistic features

What is my Style? Using Stylistic Features of Portuguese Web Texts to classify Web pages according to Users' Needs

Subcategorization

Hybrid Constraints for Robust Parsing: First Experiments and Evaluation

Subcategorization extraction

Automatic Methods to Supplement Broad-Coverage Subcategorization Lexicons

Subjective evaluation

A New ITU-T Recommendation on the Evaluation of Telephone-Based Spoken Dialogue Systems

Sublanguage

An Annotated German-Language Medical Text Corpus as Language Resource

Reusing Language Resources for Speech Applications involving Emotion

Subtitling

Using a Parallel Transcript/Subtitle Corpus for Sentence Compression

Suffix

Towards the Use of Word Stems and Suffixes for Statistical Machine Translation

Summarisation

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Why do you ignore me? - Proof that not all direct speech is bad

Agreement in Human Factoid Annotation for Summarization Evaluation

Automatic Sentence Simplification for Subtitling in Dutch and English

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

Multi-Document Summarization using Multiple-Sequence Alignment

Summarization of Multimodal Information

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Summary evaluation

Summarization of Multimodal Information

Supervised word sense disambiguation

Word Sense Disambiguation Using Random Indexing

Supplementing lexicons

Automatic Methods to Supplement Broad-Coverage Subcategorization Lexicons

Support Vector Machines

SVMTool: A general POS tagger generator based on Support Vector Machines

Support verb constructions

Discarding noise in an automatically acquired lexicon of support verb constructions

Supranational Coordination

ENABLER Thematic Network of National Projects: Technical, Strategic and Political Issues of LRs

Survey

NEMLAR - An Arabic Language Resources project

Sustainable resources

Securing Interpretability: The Case of Ega Language Documentation

WALA: a multilingual resource repository for West African Languages

SVMTool

SVMTool: A general POS tagger generator based on Support Vector Machines

Switchboard Corpus

Using the NITE XML Toolkit on the Switchboard Corpus to Study Syntactic Choice: A Case Study

Synonymous sentence group

Grouping Synonymous Sentences from a Parallel Corpus

Synonymy

Using cooccurrence statistics and the web to discover synonyms in a technical language

Syntactic analysis

Using the Web as a Corpus for the Syntactic-Based Collocation Identification

Syntactic Annotation

Adding Syntactic Annotations to Transcripts of Parent-Child Dialogs

The Ongoing Evaluation Campaign of Syntactic Parsing of French: EASY

Syntactic classes

A Corpus-based Syntactic Lexicon for Adverbs

Syntactic dependencies

Exploring Portability of Syntactic Information from English to Basque

Syntactic Lexical properties

A Corpus-based Syntactic Lexicon for Adverbs

Syntactic lexicon

Towards a Dynamic Lexicon: Predicting the Syntactic Argument Structure of Complex Verbs

Syntactic parsers evaluation

The Ongoing Evaluation Campaign of Syntactic Parsing of French: EASY

Syntactic semantic patterns

Automatic extraction of syntactic semantic patterns for multilingual resources

Syntactico-semantic interface

Semantic categorization of Spanish se-constructions

Syntagmatic relations

Automatic Acquisition of Paradigmatic Relations using Iterated Co-occurrences

Syntax

Abar-Hitz: An Annotation Tool for the Basque Dependency Treebank

Representing Italian Complex Nominals: a Pilot Study

Semi-automatic Syntactic and Semantic Corpus Annotation with a Deep Parser

Synthetic speech evaluation

The Design of Czech Language Formal Listening Tests for the Evaluation of TTS Systems

T

 

Tag

Sejong Korean Corpora in the Making

Tagged corpus

Open Collaborative Development of the Thai Language Resources for Natural Language Processing

Tagger

Definition, dictionaries and tagger for Extended Named Entity Hierarchy

Tagging

An Annotated German-Language Medical Text Corpus as Language Resource

Evaluating Solutions for the Rapid Development of State-of-the-Art POS taggers for Portuguese

Term Translations in Parallel Corpora: Discovery and Consistency Check

Tagset design

Tiered Tagging Revisited

Tagset evaluation

Tiered Tagging Revisited

Tagset mapping rules

Tiered Tagging Revisited

Tagsets

Abstracting a Dialogue Act Tagset for Meeting Processing

TalkBank

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Tamil

Experiences in Collection of Handwriting Data for Online Handwriting Recognition in Indic Scripts

Task-oriented dialogue

Issues in Corpus Development for Multi-party Multi-modal Task-oriented Dialogue

Task-oriented human-computer dialogue

From Acts and Topics to Transactions and Dialogue Smoothness

Taxonomy alignment

Selecting the Correct English Synset for a Spanish Sense

TDMT

How Does Automatic Machine Translation Evaluation Correlate With Human Scoring as the Number of Reference Translations Increases?

Technolangue

Technolangue: A Permanent Evaluation and Information Infrastructure

TEI

Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

Putting the Dutch PAROLE Corpus to Work

Towards basic categories for describing properties of texts in a corpus

TEI Encoding

The Integrated Language Database of 8th - 21st-Century Dutch

Towards an International Standard on Feature Structure Representation

TEIRESIAS algorithm

Pattern Discovery in Named Organization Corpus

Telecommunication

A New ITU-T Recommendation on the Evaluation of Telephone-Based Spoken Dialogue Systems

Telephony applications

OrienTel - Telephony Databases Across Northern Africa and the Middle East

Telicity

Extending WordNets to Implicit Information

Temporal resolution

A Framework for Temporal Resolution

Term base

The GENOMA-KB project: towards the integration of concepts, terms, textual corpora and entities

Term clustering

Clustering Concept Hierarchies from Text

Term distribution

Frequent Term Distribution Measures for Dataset Profiling

Term extraction

Acquiring Bayesian Networks from Text

Automatically selecting domain markers for terminology extraction

BootCaT: Bootstrapping Corpora and Terms from the Web

Detection of Domain Specific Terminology Using Corpora Comparison

Mercedes, A Term-In-Context Highlighter

Term recognition

A Large-Scale Resource for Storing and Recognizing Technical Terminology

Mercedes, A Term-In-Context Highlighter

Term specificity summarization

A comparison of summarisation methods based on term specificity estimation

Term variation

AUTOMATISATION OF THE ACTIVITY OF TERM COLLECTION IN DIFFERENT LANGUAGES

Term weighting

Modelling Legitimate Translation Variation for Automatic Evaluation of MT Quality

Termbase Management System

XTERM: A Flexible Standard-Compliant XML-Based Termbase Management System

Terminological concept modelling

Principles of a system for terminological concept modelling

Terminological resources

Building a Maritime Domain Lexicon: a Few Considerations on the Database Structure and the Semantic Coding

Terminological units

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Terminological verbal collocation

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Terminological verbs

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Terminology

A Large-Scale Resource for Storing and Recognizing Technical Terminology

Automatic Bilingual Lexicon Acquisition Using Random Indexing of Aligned Bilingual Data

Automatically selecting domain markers for terminology extraction

Detection of Domain Specific Terminology Using Corpora Comparison

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

From Weaver to the ALPAC Report 

Mercedes, A Term-In-Context Highlighter

Principles of a system for terminological concept modelling

The Bilingual Web Dictionary on Demand

The Corpógrafo – a Web-based environment for corpora research

The GENOMA-KB Platform: Queries Over Integrated Linguistic Resources

The GENOMA-KB project: towards the integration of concepts, terms, textual corpora and entities

The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoComp

Using Profiles for IMDI Metadata Creatation

Terminology checker

AUTOMATISATION OF THE ACTIVITY OF TERM COLLECTION IN DIFFERENT LANGUAGES

Terminology databases

AUTOMATISATION OF THE ACTIVITY OF TERM COLLECTION IN DIFFERENT LANGUAGES

Terminology extraction

A Xml-Based Term Extraction Tool for Basque

Methodology For Building Thematic Indexes In Medecine For French

Terms

Discovery of (New) Knowledge and the Analysis of Text Corpora

Terms extraction

Term Translations in Parallel Corpora: Discovery and Consistency Check

Terms translation based on parallel corpora

Term Translations in Parallel Corpora: Discovery and Consistency Check

Test collections

Test Collections for Patent-to-Patent Retrieval and Patent Map Generation in NTCIR-4 Workshop

Test material

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

Test set construction

Incremental Methods to Select Test Sentences for Evaluating Translation Ability

Test table

AN EVALUATION PROTOCOL FOR TEXT MINING TOOLS: ALCESTE SAS TEXT MINER SPAD-CRM AND TEMIS TEXT MINING SOLUTIONS TESTING

Text alignment

Making Monolingual Corpora Comparable: a Case Study of Bulgarian and Croatian

Text categorization

NLP-enhanced Content Filtering within the POESIA Project

An Analysis of the Relative Difficulty of Reuters-21578 Subsets

Categorizing Web Pages as a Preprocessing Step for Information Extraction

CHeM: A System for the Automatic Analysis of e-mails in the Restoration and Conservation Domain

Pumping Documents Through a Domain and Genre Classification Pipeline

Text cohesion

RevisionBank: A Resource for Revision-based Multi-document Summarization and Evaluation

Text corpus

An Annotated German-Language Medical Text Corpus as Language Resource

Open Collaborative Development of the Thai Language Resources for Natural Language Processing

Russian Information Retrieval Evaluation Seminar

Spoken and Written Language Resources for Vietnamese

Text Encoding

Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

Text encoding initiative

Making an XML-based Japanese-Slovene Learners' Dictionary

Text genre

Pumping Documents Through a Domain and Genre Classification Pipeline

Text Mining

AN EVALUATION PROTOCOL FOR TEXT MINING TOOLS: ALCESTE SAS TEXT MINER SPAD-CRM AND TEMIS TEXT MINING SOLUTIONS TESTING

Thesaurus or logical ontology, which do we need for mining text?

Towards Ontology Engineering Based on Linguistic Analysis

Unsupervised Text Mining for Ontology Extraction: An Evaluation of Statistical Measures

Text processing

A Large-Scale Resource for Storing and Recognizing Technical Terminology

Text retrieval

Design and Implementation of a Semantic Search Engine for Portuguese

Development of a Corpus Workbench for the METU Turkish Corpus

Text summarization

Generic Text Summarization Using WordNet

Text Summarization Challenge 2

Comparison of some automatic and manual methods for summary evaluation based on the Text Summarization Challenge 2

Text to speech alignment

WinPitch Corpus, a Text to Speech Alignment Tool for Multimodal Corpora

Text typology

The Lácio-Web: Corpora and Tools to advance Brazilian Portuguese Language Investigations and Computational Linguistic Tools

Towards basic categories for describing properties of texts in a corpus

Text understanding

Pronominal Anaphora Resolution for Unrestricted Text

Text-independent evaluation

Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004

Texts

The AAC [Austrian Academy Corpus] An Enterprise to Develop Large Electronic Text Corpora

Text-to-Hypertext Conversion

Exploiting Coreference Annotations for Text-to-Hypertext Conversion

Text-to-speech

A Galician Textual Corpus for Morphosyntactic Tagging with Application to Text-to-Speech Synthesis

Creation and Validation of Large Lexica for Speech-to-Speech Translation Purposes

Textual DataBase

Computational Lexicography and Carlo Emilio Gadda, Principe dell'Analisi e Duca della Buona Cognizione

Textual distraction

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Thai Lexical Database

Enriching a Thai Lexical Database with Selectional Preferences

Thematic index

Methodology For Building Thematic Indexes In Medecine For French

Thesaurus

Development of Bilingual Domain-Specific Ontology for Automatic Conceptual Indexing

Development of Ontologies with Minimal Set of Conceptual Relations

Thesaurus or logical ontology, which do we need for mining text?

Thesaurus generation

A Freely Available Automatically Generated Thesaurus of Related Words

Three layered architecture

A Framework for Temporal Resolution

Tibetan

A Syntactically Annotated Corpus of Tibetan

Tiered tagging

Tiered Tagging Revisited

Time Alignment

Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus

Time series analysis

Summarization of Multimodal Information

TMX

Parallel corpora for the Galician language: building and processing of the CLUVI (Linguistic Corpus of the University of Vigo)

TnT

SVMTool: A general POS tagger generator based on Support Vector Machines.

Tokenisation

Term Translations in Parallel Corpora: Discovery and Consistency Check

Evaluating Solutions for the Rapid Development of State-of-the-Art POS taggers for Portuguese

Tone-of-voice

Tone-of-Voice and Controlled Language Techniques

Tool

A Flexible Language Acquisition Tool Kit for Natural Language Processing

Tool

PBIE: A DATA PREPARATION TOOLKIT toward DEVELOPING a PARSING-BASED INFORMATION EXTRACTION SYSTEM

Tools

Evaluation of Transcription and Annotation Tools for a Multi-modal, Multi-party Dialogue Corpus

Methodology for Rapid Prototyping and Testing of ASR Based User Interfaces

Using Profiles for IMDI Metadata Creatation

Tools evaluation

AN EVALUATION PROTOCOL FOR TEXT MINING TOOLS: ALCESTE SAS TEXT MINER SPAD-CRM AND TEMIS TEXT MINING SOLUTIONS TESTING

Tools for lexicography

Tools for Upgrading Printed Dictionaries by Means of Corpus-based Lexical Acquisition

Tools for linguistic analysis

Utilization of Multiple Language Resources for Robust Grammar-Based Tense and Aspect Classification

Top ontology

Top Ontology as a Tool for Semantic Role Tagging

Topic

Textual Distraction as a Basis for Evaluating Automatic Summarisers

Topic Detection

Results of the 2003 Topic Detection and Tracking Evaluation

Topic Maps

Exploiting Coreference Annotations for Text-to-Hypertext Conversion

Topic signatures

Publicly Available Topic Signatures for all WordNet Nominal Senses

Topic-focus articulation

Annotators' Agreement: The Case of Topic-Focus Articulation

Topics

From Acts and Topics to Transactions and Dialogue Smoothness

Topological Field Model

A High Quality Partial Parser for Annotating German Text Corpora

Toponym

Resources for Place Name Analysis

Tourism domains

The Italian NESPOLE! Corpus: A Multilingual Database with Interlingua Annotation in Tourism and Medical Domains

Tracking Evaluation

Results of the 2003 Topic Detection and Tracking Evaluation

Training simulation

Issues in Corpus Development for Multi-party Multi-modal Task-oriented Dialogue

Transaction success

From Acts and Topics to Transactions and Dialogue Smoothness

Transcrigal

Transcrigal: A Bilingual System for Automatic Indexing of Broadcast News

Transcription

Acquisition and Annotation of Slovenian Broadcast News Database

Development of Slovenian Broadcast News Speech Database

The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

Evaluation of Transcription and Annotation Tools for a Multi-modal, Multi-party Dialogue Corpus

Transcription system

CoGesT: A Formal Transcription System for Conversational Gesture

Transfer-based MT

MT Goes Farming: Comparing Two Machine Translation Approaches on a New Domain

Transformation-based learning

Raising the Bar: Stacked Conservative Error Correction Beyond Boosting

Translatability limits

Modelling Legitimate Translation Variation for Automatic Evaluation of MT Quality

Translation

EVALDA-CESART Project: Terminological Resources Acquisition Tools Evaluation Campaign

Parallel corpora for the Galician language: building and processing of the CLUVI (Linguistic Corpus of the University of Vigo)

The Bilingual Web Dictionary on Demand

Translation ability

Incremental Methods to Select Test Sentences for Evaluating Translation Ability

Translation Corpus

MetaMorpho TM: A Rule-Based Translation Corpus

Translation equivalents

Word Sense Disambiguation as a Wordnets' Validation Method in Balkanet

Translation equivalents in parallel corpora

Term Translations in Parallel Corpora: Discovery and Consistency Check

Translation Memories

Translation memories enrichment by statistical bilingual segmentation

Translation reference tools

BootCaT: Bootstrapping Corpora and Terms from the Web

Translation-equivalents

A word alignment system based on a translation equivalence extractor

Transnational digital government

Developing Language Resources for a Transnational Digital Government System

TREC-style evaluation

Collecting Spontaneously Spoken Queries for Information Retrieval

Tree Adjoining Grammar (TAG)

Developping tools and building linguistic resources for Vietnamese morpho-syntactic processing

Tree Extraction

Highlighting latent structure in documents

Treebank

A Language Resources Infrastructure for Bulgarian

Abar-Hitz: An Annotation Tool for the Basque Dependency Treebank

Automatic transformation of phrase treebanks to dependency trees

Building a Large Grammar for Italian

Enriching a French Treebank

Sejong Korean Corpora in the Making

Treebank generation

Semi-Automatic Construction of a Question Treebank

Treebanks

Phonological Treebanks - Issues in Generation and Application

The MULI Project: Annotation and Analysis of Information Structure in German and English

Unexpected Productions May Well be Errors

TTS

Duration Modeling for Turkish Text-to-Speech Synthesis System

Turkish

Duration Modeling for Turkish Text-to-Speech Synthesis System

Tutorial dialog systems

An Annotated Corpus of Tutorial Dialogs on Mathematical Theorem Proving

Typological databases

MED-TYP: A Typological Database for Mediterranean Languages

U

 

Ungrammatical Input

Parsing Ungrammatical Input: An Evaluation Procedure

Unicode and multimedia prompts

SpeechRecorder - a Universal Platform Independent Multi-Channel Audio Recording Software

Unified Lexicon

Unifying Lexicons in View of a Phonological and Morphological Lexical DB

Universal concepts/words

Semi-automatic UNL Dictionary Generation using WordNet.PT

Unknown word guessing

Combining symbolic and statistical methods in morphological analysis and unknown word guessing

Unknown words

How to Disassemble Alphabetical Processions - Morphological Treatment of Unknown Words

Unsupervised learning

Building Part-of-speech Corpora through Histogram Hopping

Usability

Calibrating Resource-light Automatic MT Evaluation: A Cheap Approach to Ranking MT Systems by the Usability of their Output

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

Methodology for Rapid Prototyping and Testing of ASR Based User Interfaces

Usability Evaluation of Multimodal and Domain-Oriented Spoken Language Dialogue Systems

Usability Evaluation of Spoken Dialogue Systems

User

Designing a Realistic Evaluation of an End-to-end Interactive Question Answering System

User feedback

The Translation Correction Tool: English-Spanish User Studies

User interface

Callisto: A Configurable Annotation Workbench

User requirements

User Query Analysis for the Specification and Evaluation of a Dialogue Processing and Retrieval System

User-centred design

The Corpógrafo – a Web-based environment for corpora research

User's perspective

The Integrated Language Database of 8th - 21st-Century Dutch

Users’ needs

What is my Style? Using Stylistic Features of Portuguese Web Texts to classify Web pages according to Users' Needs

Utterance length

A Comparative Study on Human Communication Behaviors and Linguistic Characteristics for Speech-to-Speech Translation

V

 

Vagueness

Classification of Japanese Spatial Nouns

Validation

A Methodology and Associated Tools for Building Interlingual Wordnets

Creation and Validation of Large Lexica for Speech-to-Speech Translation Purposes

ELRA Validation Methodology and Standard Promotion for Linguistic Resources

SLR Validation: Current Trends and Developments

Word Sense Disambiguation as a Wordnets' Validation Method in Balkanet

Validation of Annotations

Abstracting a Dialogue Act Tagset for Meeting Processing

Variants of a corpus

A Comparison of Two Variant Corpora: The Same Content with Different Sources

Various linguistic datatypes

The Integrated Language Database of 8th - 21st-Century Dutch

Vector space model

Automatic Bilingual Lexicon Acquisition Using Random Indexing of Aligned Bilingual Data

Verb valencies

Top Ontology as a Tool for Semantic Role Tagging

Verb valency

Verb Valency Descriptors for a Syntactic Treebank

Video Corpus

Toward an Annotation Software for Video of Sign Language, Including Image Processing Tools and Signing Space Modelling

Vietnamese

Developping tools and building linguistic resources for Vietnamese morpho-syntactic processing

Vietnamese language

Spoken and Written Language Resources for Vietnamese

Virtual agents

Issues in Corpus Development for Multi-party Multi-modal Task-oriented Dialogue

Visualization

A Chatbot as a Novel Corpus Visualization Tool

Viterbi Alignment

Automatic Phonemic Labeling and Segmentation of Spoken Dutch

Viterbi decoding

Speech recognition simulation and its application for Wizard of Oz experiments

Vocal interfaces

Semi-automatic Acquisition of Command Grammar

W

 

Wavelets

Summarization of Multimodal Information

Web

Creating open language resources for Hungarian

Terminal Device Oriented Comparable Corpora and its Alignment -- Towards Extracting Paraphrasing Patterns --

Web as a corpus

Using the Web as a Corpus for the Syntactic-Based Collocation Identification

Web as corpus

The Bilingual Web Dictionary on Demand

Web Content

Bayesian Semantics Incorporation to Web Content for Natural Language Information Retrieval

Web corpus

The OPUS corpus - parallel and free

Web data acquisition

NLP-enhanced Content Filtering within the POESIA Project

Web dictionary

The Bilingual Web Dictionary on Demand

Web Information Retrieval

What is my Style? Using Stylistic Features of Portuguese Web Texts to classify Web pages according to Users' Needs

Web mining

BootCaT: Bootstrapping Corpora and Terms from the Web

Collocation Extraction Using Web Statistics

Using cooccurrence statistics and the web to discover synonyms in a technical language

Web retrieval

Russian Information Retrieval Evaluation Seminar

Web search service

Using the Web as a Corpus for the Syntactic-Based Collocation Identification

Web services

The Corpógrafo – a Web-based environment for corpora research

Web services arquitecture

Web Services Architecture for Language Resources

Web statistics

Collocation Extraction Using Web Statistics

Web-based

Design of an Interactive Web-based User Interface for Speech Database Query Formation

WebODE

Benchmarking ontology tools. A case study for the WebODE platform

Web-site classification

Russian Information Retrieval Evaluation Seminar

Weighted abduction of term templates

Using Weighted Abduction to Align Term Variant Translations in Bilingual Texts

Wizard of Oz

Evaluating Conversation with Hans Christian Andersen

Wizard-of-Oz

An Environment for Dialogue Corpora Collection (ENDIACC)

Wizard-Of-Oz experiment (WoZ)

Speech recognition simulation and its application for Wizard of Oz experiments

Wizard-of-Oz experiments

An Annotated Corpus of Tutorial Dialogs on Mathematical Theorem Proving

Word alignment

 

Term Translations in Parallel Corpora: Discovery and Consistency Check

Word Sense Disambiguation as a Wordnets' Validation Method in Balkanet

Word association

Improving Collocation Extraction for High Frequency Words

Word Association Norms as a Unique Supplement of Traditional Language Resources

Word association norms

Word Association Norms as a Unique Supplement of Traditional Language Resources

Word co-occurrence

A Freely Available Automatically Generated Thesaurus of Related Words

Utilizing the One-Sense-per-Discourse Constraint for Fully Unsupervised Word Sense Induction and Disambiguation

Word Error Rate (WER)

Comparative Evaluations in the Domain of Automatic Speech Recognition 

Speech recognition simulation and its application for Wizard of Oz experiments

Word extraction

Related Word-pairs Extraction without Dictionaries

Word graphs

An Efficient Word Confidence Measure Using Likelihood Ratio Scores

Word list

Creating Slovenian Language Resources for Development of Speech-to-Speech Translation Components

Word order

Mapping Dependency Structures to Phrase Structures and the Automatic Acquisition of Mapping Rules

Word sense

Constructing Word-Sense Association Networks from Bilingual Dictionary and Comparable Corpora

Word Sense Disambiguation

Enriching EWN with Syntagmatic Information by means of WSD

EuroWordNet as a Resource for Cross-language Information Retrieval

Evaluating Variants of the Lesk Approach for Disambiguating Words

Handling Subtle Sense Distinctions through Wordnet Semantic Types

Meaningful Clusters

MiniCors and Cast3LB: Two Semantically Tagged Spanish Corpora

Portuguese Large-scale Language Resources for NLP Applications

Semi-supervised learning by Fuzzy clustering and Ensemble learning

The Effect of Bias on an Automatically-built Word Sense Corpus

Utilizing the One-Sense-per-Discourse Constraint for Fully Unsupervised Word Sense Induction and Disambiguation

Word Sense Disambiguation as a Wordnets' Validation Method in Balkanet

Word sense induction

Utilizing the One-Sense-per-Discourse Constraint for Fully Unsupervised Word Sense Induction and Disambiguation

Word vector representation

A complete understanding speech system based on semantic concepts

Word-formation

Derivational Relations in Flectional Languages - Czech Case

WordNet

A Similarity Measure for Unsupervised Semantic Disambiguation

Automatic Acquisition of Sense Examples using ExRetriever

Combining Heterogeneous Lexical Resources

Concept Creation in Lexical Ontologies

Enriching the Spanish EuroWordNet by Collocations

Enriching WordNet Via Generative Metonymy and Creative Polysemy

Evaluating Variants of the Lesk Approach for Disambiguating Words

Exploring Balkanet Shared Ontology for Multilingual Conceptual Indexing

Generic Text Summarization Using WordNet

Handling Subtle Sense Distinctions through Wordnet Semantic Types

INQUER: A WordNet-based Question-Answering Application

Integration of Russian Language Resources

Korean-Chinese-Japanese Multilingual Wordnet with Shared Semantic Hierarchy

Meaningful Clusters

Metaphors in Wordnets: from Theory to Practice

Polysemy and Category Structure in WordNet: An Evidential Approach

Publicly Available Topic Signatures for all WordNet Nominal Senses

Sinica BOW (Bilingual Ontological Wordnet): Integration of Bilingual WordNet and SUMO

Spanish WordNet 1.6: Porting the Spanish WordNet Across Princeton Versions

Top Ontology as a Tool for Semantic Role Tagging

Towards the MEANING Top Ontology: Sources of Ontological Meaning

Word Association Norms as a Unique Supplement of Traditional Language Resources

Word Sense Disambiguation as a Wordnets' Validation Method in Balkanet

WordNet-Affect: an Affective Extension of WordNet

WordNet Adjectives

Using WordNet to Measure Semantic Orientations of Adjectives

Word-pair extraction

Related Word-pairs Extraction without Dictionaries

Word-structure

A natural language approach to information management: tracking scientific advances through the structure of words

Word-to-word alignment

A word alignment system based on a translation equivalence extractor

Workbench

Development of a Corpus Workbench for the METU Turkish Corpus

World Wide Web

Exploiting Anchor Text as a Lexical Resource

Mining the Web for Discourse Markers

Written and Spoken

ENABLER Thematic Network of National Projects: Technical, Strategic and Political Issues of LRs

Written corpora

The Lácio-Web: Corpora and Tools to advance Brazilian Portuguese Language Investigations and Computational Linguistic Tools

Written corpus construction

Introducing the La Repubblica Corpus: A large Annotated TEI(XML)-Compliant Corpus of Newspaper Italian

Written language resources

ELRA Validation Methodology and Standard Promotion for Linguistic Resources

Written-spoken corpus

Open Collaborative Development of the Thai Language Resources for Natural Language Processing

WSD

Automatic Acquisition of Sense Examples using ExRetriever

EuroWordNet as a Resource for Cross-language Information Retrieval

Semantic Annotating of Czech Corpus via WSD

WWW

Pumping Documents Through a Domain and Genre Classification Pipeline

X

 

XCES

A Search Tool for Corpora with Positional Tagsets and Ambiguities

XML

A Multilingual Phonological Resource Toolkit for Ubiquitous Speech Technology

A pattern extraction workbench combining multiple linguistic levels

A powerful and versatile XML format for representing role-semantic annotation

A Syntactically Annotated Corpus of Tibetan

A Xml-Based Term Extraction Tool for Basque

Acquiring Reusable Multilingual Phonotactic Resources

Bilingual Connections for Trilingual Corpora: An XML Approach

Highlighting latent structure in documents

Investigation on Semantics to Improve the COVAX System

Making an XML-based Japanese-Slovene Learners' Dictionary

Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

Parallel corpora for the Galician language: building and processing of the CLUVI (Linguistic Corpus of the University of Vigo)

Phonological Treebanks - Issues in Generation and Application

Retrieving Annotated Corpora for Corpus Annotation

SLR Validation: Current Trends and Developments

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

The DeepThought Core Architecture Framework

XTERM: A Flexible Standard-Compliant XML-Based Termbase Management System

XML annotation

MED-TYP: A Typological Database for Mediterranean Languages

Towards Ontology Engineering Based on Linguistic Analysis

XML corpora

Querying both time-aligned and hierarchical corpora with NXT Search

The CLaRK System:  XML-based Corpora Development System for Rapid Prototyping

XSLT

Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction

Z

 

Zone

An Annotation Scheme for a Rhetorical Analysis of Biology Articles

Zulu corpora

Software Tools for Morphological Tagging of Zulu Corpora and Lexicon Development