AFFILIATIONS: Browse articles of the conference sorted by institution

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z -

A
Aalto University School of Business, FINLAND Gold-standard for Topic-specific Sentiment Analysis of Economic Texts
Acapela Group S.A., BELGIUM The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
Access for All Foundation, SWITZERLAND Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study
AgroKnow, GREECE A Gold Standard for CLIR evaluation in the Organic Agriculture Domain
Aholab, University of the Basque Country, SPAIN New Bilingual Speech Databases for Audio Diarization
Aix Marseille University, FRANCE Automatically Enriching Spoken Corpora with Syntactic Information for Linguistic Studies
Alexandru Ioan Cuza University of Iaşi, ROMANIA Transliteration and Alignment of Parallel Texts from Cyrillic to Latin
How Could Veins Speed Up the Process of Discourse Parsing
ALM, UNITED STATES Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
Alpage (Univ Paris Diderot / INRIA), FRANCE Developing a French FrameNet: Methodology and First results
Alpheios Project, GREECE The Making of Ancient Greek WordNet
AMI Software, FRANCE Investigating the Image of Entities in Social Media: Dataset Design and First Results
ARL, UNITED STATES Finding Romanized Arabic Dialect in Code-Mixed Tweets
ARTI, INDIA Finding Romanized Arabic Dialect in Code-Mixed Tweets
EHU, SPAIN A Stream Computing Approach Towards Scalable NLP
Athena, Research and Innovation Center in Information, Communication and Knowledge Technologies, Institute for Language and Speech Processing, GREECE The Strategic Impact of META-NET on the Regional, National and International Level
META-SHARE: One Year After
The DIRHA simulated corpus
Word Semantic Similarity for Morphologically Rich Languages
Augsburg University of Applied Sciences, GERMANY Single-Person and Multi-Party 3D Visualizations for Nonverbal Communication Analysis
Austrian Academy of Sciences, AUSTRIA Towards Automatic Quality Assessment of Component Metadata

 

B
Barcelona Media Innovació, SPAIN Adapting Freely Available Resources to Build an Opinion Mining Pipeline in Portuguese
Bavarian Archive for Speech Signals, GERMANY Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS
German Alcohol Language Corpus - the Question of Dialect
Bavarian Archive for Speech Signals, University of Munich, GERMANY German Alcohol Language Corpus - the Question of Dialect
Beaute-test.com, FRANCE Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain.
Beijing Language and Culture University, CHINA Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems
Bibliotheca Alexandria, EGYPT MUHIT: A Multilingual Harmonized Dictionary
Boğaziçi Üniversitesi, TURKEY Self-training a Constituency Parser using n-gram Trees
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
Brigham Young University, UNITED STATES Combining Elicited Imitation and Fluency Features for Oral Proficiency Measurement
Momresp: A Bayesian Model for Multi-Annotator Document Labeling
Evaluating Lemmatization Models for Machine-Assisted Corpus-Dictionary Linkage
Using Transfer Learning to Assist Exploratory Corpus Annotation
Student Achievement and French Sentence Repetition Test Scores
LexTerm Manager: Design for an Integrated Lexicography and Terminology System
Brno University of Technology, CZECH REPUBLIC Semantic Search in Documents Enriched by LOD-based Annotations
Budapest University of Technology, HUNGARY DCEP -Digital Corpus of the European Parliament
Bulgarian Academy of Sciences, BULGARIA A System for Experiments with Dependency Parsers
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing

 

C
CA, GERMANY Conceptual Transfer: Using Local Classifiers for Transfer Selection
Capita TI, UNITED KINGDOM Machine Translation for Subtitling: A Large-Scale Evaluation
Carnegie Mellon University, UNITED STATES Augmenting English Adjective Senses with Supersenses
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness
Dual Subtitles as Parallel Corpora
Two Approaches to Metaphor Detection
Can Crowdsourcing be used for Effective Annotation of Arabic?
Resources for the Detection of Conventionalized Metaphors in Four Languages
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus
A Multidialectal Parallel Corpus of Arabic
YouDACC: the Youtube Dialectal Arabic Comment Corpus
The CMU METAL Farsi NLP Approach
Supervised Within-Document Event Coreference using Information Propagation
Morphological Parsing of Swahili using Crowdsourced Lexical Resources
The Language Application Grid
Large Scale Arabic Error Annotation: Guidelines and Framework
Detecting Subevent Structure for Event Coreference Resolution
Catalogue et Index des Sites Médicaux de langue Française, FRANCE Language Resources for French in the Biomedical Domain
CCLS, Columbia, UNITED STATES Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
CEA ― LIST, FRANCE Adapting VerbNet to French using Existing Resources
The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction
Developing a French FrameNet: Methodology and First results
Evaluation of Different Strategies for Domain Adaptation in Opinion Mining
Compounds and Distributional Thesauri
CEISIC, Royal Institut of Amazigh Culture, MOROCCO Amazigh Verb Conjugator
CELI, ITALY Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain
Center for Language Engineering, KICS, UET, PAKISTAN The CLE Urdu POS Tagset
Center of Linguistics of the University of Porto, PORTUGAL Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
Central Institute of Indian Languages, INDIA Towards Building a Kashmiri Treebank: Setting up the Annotation Pipeline
Centre de Recherche Informatique de Montréal, CANADA Linked Open Data and Web Corpus Data for noun compound bracketing
Centro de Linguística Universidade de Lisboa, PORTUGAL LexTec - a Rich Language Resource for Technical Domains in Portuguese
Charles University in Prague, CZECH REPUBLIC Votter Corpus: A Corpus of Social Polling Language
Genres in the Prague Discourse Treebank
Mapping Diatopic and Diachronic Variation in Spoken Czech: the Ortofon and Dialekt Corpora
Corpus of 19th-century Czech Texts: Problems and Solutions
Resources in Conflict: A Bilingual Valency Lexicon vs. a Bilingual Treebank vs. a Linguistic Theory
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Three Dimensions of the so-called "Interoperability" of Annotation Schemes
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Valency and Word Order in Czech ― A Corpus Probe
Free English and Czech Telephone Speech Corpus Shared Under the CC-BY-SA 3.0 License
Automatic Mapping Lexical Resources: A Lexical Unit as the Keystone
To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses
Verbs of Saying with a Textual Connecting Function in the Prague Discourse Treebank
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation
HamleDT 2.0: Thirty Dependency Treebanks Stanfordized
Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain
Votter Corpus: A Corpus of Social Polling Language
The MERLIN corpus: Learner Language and the CEFR
Charles University in Prague, Faculty of Arts, CZECH REPUBLIC Improvements to Dependency Parsing Using Automatic Simplification of Data
Charles University in Prague, Faculty of Mathematics and Physics, CZECH REPUBLIC Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Word-Formation Network for Czech
A Tagged Corpus and a Tagger for Urdu
Two-Step Machine Translation with Lattices
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation
HamleDT 2.0: Thirty Dependency Treebanks Stanfordized
Improving Evaluation of English-Czech MT through Paraphrasing
Charles University in Prague, Institute of Formal and Applied Linguistics, CZECH REPUBLIC A Tagged Corpus and a Tagger for Urdu
Two-Step Machine Translation with Lattices
Charles University in Prague, Institute of the Czech National Corpus, CZECH REPUBLIC Aligning Parallel Texts with InterText
The SYN-series Corpora of Written Czech
Charles University in Prague, Institute of Theoretical and Computational Linguistics, CZECH REPUBLIC The SYN-series Corpora of Written Czech
City University of Hong Kong, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA Narrowing the Gap Between Termbases and Corpora in Commercial Environments
The Halliday Centre Tagger: An Online Platform for Semi-automatic Text Annotation and Analysis
Clustering Tweets using Wikipedia Concepts
City University of New York, UNITED STATES Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese
CLE, KICS, UET, PAKISTAN The CLE Urdu POS Tagset
CLILLAC-ARP (Univ Paris Diderot), FRANCE Developing a French FrameNet: Methodology and First results
CLLE-ERSS/CNRS&University of Toulouse 2, FRANCE GLÀFF, a Large Versatile French Lexicon
CLS/CLST, Radboud University Nijmegen, NETHERLANDS The Evolving Infrastructure for Language Resources and the Role for Data Scientists
The Dutch LESLLA Corpus
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
CLUL, PORTUGAL Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
CMU-LTI & INESC-ID, UNITED STATES Dual Subtitles as Parallel Corpora
CNGL at Trinity College Dublin, IRELAND Global Intelligent Content: Active Curation of Language Resources using Linked Data
CNR/ILC, ITALY META-SHARE: One Year After
CNRS LTCI, Télécom ParisTech, FRANCE Emilya: Emotional Body Expression in Daily Actions Database
CNRS/LIMSI, ENSIIE, FRANCE A Hierarchical Taxonomy for Classifying Hardness of Inference Tasks
Multiple Choice Question Corpus Analysis for Distractor Characterization
CNRS, FRANCE Language Resources for French in the Biomedical Domain
CNRS/LIMSI, FRANCE Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
TVD: A Reproducible and Multiply Aligned TV Series Dataset
Cognitive Science Department, METU Informatics, TURKEY Turkish Resources for Visual Word Recognition
Cognizant Setcon, UNITED STATES Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
Columbia University, Center for Computational Learning Systems, UNITED STATES Large Scale Arabic Error Annotation: Guidelines and Framework
Common Language Resources and Technology Infrastructure, EUROPE The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars
Computer Science Department, Technion, ISRAEL How to Construct a Multi-Lingual Domain Ontology
Cooperative State University Baden-Württemberg, GERMANY A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification
Copenhagen Business School, DENMARK CFT13: a Resource for Research into the Post-editing Process
Evaluating the Effects of Interactivity in a Post-Editing Workbench
Czech Technical University in Prague, CZECH REPUBLIC The Nijmegen Corpus of Casual Czech

 

D
DANS, NETHERLANDS The CMD Cloud
Delux Media, UNITED KINGDOM Machine Translation for Subtitling: A Large-Scale Evaluation
Department of Computational linguistics, Institute for Bulgarian, BAS, BULGARIA The Strategic Impact of META-NET on the Regional, National and International Level
Department of Computer Science and Engineering, Indian Institute of Technology, INDIA Efficient Reuse of Structured and Unstructured Resources for Ontology Population
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control
Department of Computer Science, University of Bath, UNITED KINGDOM The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
Department of Linguistics, Radboud University Nijmegen, NETHERLANDS ASR-based CALL Systems and Learner Speech Data: New Resources and Opportunities for Research and Development in Second Language Learning
Department of Telecommunications & Media Informatics, Budapest University of Technology & Economics, HUNGARY The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
DePaul University, UNITED STATES Expanding N-gram Analytics in ELAN and a Case Study for Sign Synthesis
Deutsche Forschungszentrum für Künstliche Intelligenz - Language Techology Laboratory, GERMANY Annotating Relation Mentions in Tabloid Press
Sprinter: Language Technologies for Interactive and Multimedia Language Learning
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
Deutsche Forschungszentrum für Künstliche Intelligenz, GERMANY A SKOS-based Schema for TEI encoded Dictionaries at ICLTT
Annotating Relation Mentions in Tabloid Press
The taraXÜ Corpus of Human-Annotated Machine Translations
The Strategic Impact of META-NET on the Regional, National and International Level
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
Harmonization of German Lexical Resources for Opinion Mining
META-SHARE: One Year After
MAT: a Tool for L2 Pronunciation Errors Annotation
Deutsche Forschungszentrum für Künstliche Intelligenz, Saarland University, GERMANY Annotating Relation Mentions in Tabloid Press
The taraXÜ Corpus of Human-Annotated Machine Translations
The Strategic Impact of META-NET on the Regional, National and International Level
Sprinter: Language Technologies for Interactive and Multimedia Language Learning
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
DFKI / W3C Fellow, GERMANY Global Intelligent Content: Active Curation of Language Resources using Linked Data
DFKI LT-lab, GERMANY Annotating Relation Mentions in Tabloid Press
DHA Suffa University, PAKISTAN The CLE Urdu POS Tagset
Automatic Acquisition of Urdu Nouns (along with Gender and Irregular Plurals))
Dipartimento Studi Umanistici - Università di Salerno, ITALY VOLIP: a Corpus of Spoken Italian and a Virtuous Example of Reuse of Linguistic Resources
Documenta, UNITED STATES Croatian Memories
Doshisha University, JAPAN Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems
Dr. Bhim Rao Ambedkar University, Department of Linguistics, INDIA Developing Politeness Annotated Corpus of Hindi Blogs
Dublin City University, IRELAND TLAXCALA: a Multilingual Corpus of Independent News
The Strategic Impact of META-NET on the Regional, National and International Level
Quality Estimation for Synthetic Parallel Data Generation
caWaC - a Web Corpus of Catalan and its Application to Language Modeling and Machine Translation
Dublin City University, Centre for Next Generation Localisation - University of North Texas, IRELAND Modeling Language Proficiency Using Implicit Feedback
Dublin Institute of Technology, IRELAND TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Dundee University, UNITED KINGDOM A Model for Processing Illocutionary Structures and Argumentation in Debates

 

E
Eberhard Karls Universität Tübingen, GERMANY Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers
Ecole des Hautes Etudes en Sciences Sociales, FRANCE Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
Ecole Nationale d'Ingénieurs de Brest, FRANCE A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
Ecole Nationale Supérieure de l'Electronique et de ses Applications, CANADA An Effortless Way To Create Large-Scale Datasets For Famous Speakers
Ecole Polytechnique Fédérale de Lausanne, SWITZERLAND Collaboration in the Production of a Massively Multilingual Lexicon
Ecole Polytechnique Fédérale de Zurich, SWITZERLAND Semantic Approaches to Software Component Retrieval with English Queries
Ecole Polytechnique Montreal, CANADA Improving Entity Linking using Surface Form Refinement
EDF R&D, FRANCE Investigating the Image of Entities in Social Media: Dataset Design and First Results
ELDA/ELRA, FRANCE The ETAPE Speech Processing Evaluation
VERTa: Facing a Multilingual Experience of a Linguistically-based MT Evaluation
META-SHARE: One Year After
ELRA's Consolidated Services for the HLT Community
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
Elhuyar Foundation, IXA NLP Group, University of the Basque Country, SPAIN TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Eptica-Lingway, FRANCE Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
Erasmus University of Rotterdam, NETHERLANDS Croatian Memories
ERIC lab., Université Lyon 2, FRANCE Investigating the Image of Entities in Social Media: Dataset Design and First Results
Etudes Interculturelles de Langues Appliquées, Université Paris Diderot, FRANCE A Corpus of Machine Translation Errors Extracted from Translation Students Exercises
EURECOM, FRANCE Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web
European Academy of Bozen/Bolzano, ITALY 'interHist' - an Interactive Visual Interface for Corpus Exploration
The MERLIN corpus: Learner Language and the CEFR
KoKo: an L1 Learner Corpus for German
European Commission, Joint Research Centre, ITALY Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
Clustering of Multi-Word Named Entity Variants: Multilingual Evaluation
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts
Named Entity Recognition on Turkish Tweets
European Parliament - Intrasoft, LUXEMBOURG DCEP -Digital Corpus of the European Parliament
Euroscript Deutschland GmbH, GERMANY Pre-ordering of Phrase-based Machine Translation Input in Translation Workflow
Euroscript Germany, GERMANY The taraXÜ Corpus of Human-Annotated Machine Translations
Euroscript Luxembourg s.à r.l., LUXEMBOURG Pre-ordering of Phrase-based Machine Translation Input in Translation Workflow
Euskal Herriko Unibertsitatae, SPAIN NewsReader: Recording History from Daily News Streams
ExB Group, DENMARK PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and CEllphone
Expert System, ITALY A Modular System for Rule-based Text Categorisation

 

F
Faculdade de Letras da Universidade de Lisboa / Centro de Linguistica da Universidade de Lisboa, PORTUGAL Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
Faculdade de Letras da Universidade de Lisboa / Instituto de Engenharia de Sistemas e Computadores ― Investigação e Desenvolvimento em Lisboa, PORTUGAL Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
El-WOZ: a Client-Server Wizard-of-Oz Interface
FamilySearch, UNITED STATES Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records
Federal University of Juiz de Fora, SPAIN NIF4OGGD - NLP Interchange Format for Open German Governmental Data
Federal University of Rio Grande do Sul - Institute of Informatics, BRAZIL Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them
Boosting the Creation of a Treebank
Icelandic Quirks: Testing Linguistic Theories and Language Technology
Comparing Similarity Measures for Distributional Thesauri
Federal University of Rio Grande do Sul, BRAZIL Icelandic Quirks: Testing Linguistic Theories and Language Technology
Federal University of Sâo Carlos, BRAZIL Automatic Semantic Relation Extraction from Portuguese Texts
FG Language Technology, Technische Universität, Darmstadt, GERMANY Distributed Distributional Similarities of Google Books over the Centuries
NoSta-D Named Entity Annotation for German: Guidelines and Dataset
Finnish Association of the Deaf, FINLAND S-pot - a Benchmark in Spotting Signs Within Continuous Signing
Flemish Department of Economy, Science and Innovation, BELGIUM A Decade of HLT Agency Activities in the Low Countries: from Resource Maintenance (BLARK) to Service Offerings (BLAISE)
Fondazione Bruno Kessler, ITALY Crowdsourcing for the Identification of Event Nominals: an Experiment
Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources
The Strategic Impact of META-NET on the Regional, National and International Level
T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing
Creative Language Explorations through a high-Expressivity N-grams Query Language
Enriching the "Senso Comune" Platform with Automatically Acquired Data
Mapping CPA Patterns onto OntoNotes Senses
Euronews: a Multilingual Speech Corpus for ASR
CROMER: a Tool for Cross-Document Event and Entity Coreference
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain
META-SHARE: One Year After
A SICK Cure for the Evaluation of Compositional Distributional Semantic Models
NewsReader: Recording History from Daily News Streams
Automatic Annotation of Machine Translation Datasets with Binary Quality Judgements
The DIRHA simulated corpus
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool
An Efficient and User-friendly Tool for Machine Translation Quality Estimation
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts
Fraunhofer IAIS, GERMANY Extracting Information for Context-aware Meeting Preparation
Exploiting the Large-Scale German Broadcast Corpus to Boost the Fraunhofer IAIS Speech Recognition System
Freelance Annotator, JAPAN Annotation of Computer Science Papers for Semantic Relation Extrac-tion
Freie Universitaet Berlin, GERMANY Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing
Friedrich Alexander Universität, Erlangen-Nürnberg, GERMANY New Spanish Speech Corpus Database for the Analysis of People Suffering from Parkinson's Disease
Erlangen-CLP: A Large Annotated Corpus of Speech from Children with Cleft Lip and Palate
Friedrich-Schiller-Universität Jena, GERMANY Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs
Disclose Models, Hide the Data - How to Make Use of Confidential Corpora without Seeing Sensitive Raw Data
FTW Telecommunications Research Center Vienna, AUSTRIA The MMASCS Multi-Modal Annotated Synchronous Corpus of Audio, Video, Facial Motion and Tongue Motion Data of Normal, Fast and Slow Speech
Fujitsu Limited, JAPAN A Method for Building Burst-Annotated Co-Occurrence Networks for Analysing Trends in Textual Data
Future University of Hakodate, Department of Complex Systems, JAPAN Japanese Conversation Corpus for Training and Evaluation of Backchannel Prediction Model.

 

G
George Washington University , UNITED STATES Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis
German Research Center for Artificial Intelligence, GERMANY TMO ― The Federated Ontology of the TrendMiner Project
Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies
Annotating Relation Mentions in Tabloid Press
Sprinter: Language Technologies for Interactive and Multimedia Language Learning
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
Harmonization of German Lexical Resources for Opinion Mining
The taraXÜ Corpus of Human-Annotated Machine Translations
Gjøvik University College, NORWAY An Open Source Part-of-Speech Tagger for Norwegian: Building on Existing Language Resources
Goethe University, Text Technology Laboratory, GERMANY ColLex.en: Automatically Generating and Evaluating a Full-form Lexicon for English
Google Inc., UNITED STATES On the Importance of Text Analysis for Stock Price Prediction
A Gold Standard Dependency Corpus for English
Momresp: A Bayesian Model for Multi-Annotator Document Labeling
Computer-Aided Quality Assurance of an Icelandic Pronunciation Dictionary
A Database for Measuring Linguistic Information Content
Graduate Institute of Linguistics, National Taiwan University, TAIWAN Why Chinese Web-as-Corpus is Wacky? Or: How Big Data is Killing Chinese Corpus Linguistics
Graduate School of Advanced Integration Science, Chiba University, JAPAN Japanese Conversation Corpus for Training and Evaluation of Backchannel Prediction Model.
Graduate School of Informatics, Kyoto University, JAPAN Constructing a Chinese―Japanese Parallel Corpus from Wikipedia
Graduate School of Maritime Sciences, Kobe University, JAPAN Bilingual Dictionary Induction as an Optimization Problem
Graz University of Technology, AUSTRIA The DIRHA simulated corpus
Graz University of Technology, Signal Processing and Speech Communication Laboratory, AUSTRIA GRASS: the Graz corpus of Read And Spontaneous Speech
Göteborgs Universitet, SWEDEN Sharing Resources Between Free/Open-Source Rule-based Machine Translation Systems: Grammatical Framework and Apertium

 

H
Hapara, UNITED STATES ClearTK 2.0: Design Patterns for Machine Learning in UIMA
Heidelberg University, GERMANY Computational Narratology: Extracting Tense Clusters from Narrative Texts
Extending HeidelTime for Temporal Expressions Referring to Historic Dates
Heidelberg University, Institute of Computer Science, GERMANY Computational Narratology: Extracting Tense Clusters from Narrative Texts
Extending HeidelTime for Temporal Expressions Referring to Historic Dates
Heriot - Watt University, UNITED KINGDOM An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis
Historical Archives of the Pontifical Gregorian University, ITALY Sharing Cultural Heritage: the Clavius on the Web Project
Holmes Semantic Solutions, FRANCE The Dangerous Myth of the Star System
Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain.
Hong Kong University of Science & Technology, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA Evaluating Improvised Hip Hop Lyrics - Challenges and Observations
On the Reliability and Inter-Annotator Agreement of Human Semantic MT Evaluation via HMEANT
A Hindi-English Code-Switching Corpus
Humboldt University Berlin, GERMANY Lexical Substitution Dataset for German
Multiword Expressions in Machine Translation
Hungarian Academy of Sciences, HUNGARY Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
Pivot-based Multilingual Dictionary Building using Wiktionary

 

I
IBM Research and EMAp / FGV, UNITED STATES A Study on Expert Sourcing Enterprise Question Collection and Classification
Enriching the "Senso Comune" Platform with Automatically Acquired Data
NomLex-PT: A Lexicon of Portuguese Nominalizations
IBM Research India, INDIA When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control
IBM Research Lab, UNITED STATES Author-Specific Sentiment Aggregation for Polarity Prediction of Reviews
IDA Center for Computing Sciences, UNITED STATES A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
IDIAP Research Institute / École Polytechnique Fédérale de Lausanne (EPFL), SWITZERLAND Cross-Linguistic Annotation of Narrativity for English/French Verb Tense Disambiguation
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
IDIAP Research Institute, SWITZERLAND English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling
IDS Mannheim, GERMANY The Database for Spoken German ― DGD2
Exploring and Visualizing Variation in Language Resources
The Liability of Service Providers in e-Research Infrastructures: Killing the Messenger?
The Research and Teaching Corpus of Spoken German ― FOLK
Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers
Access Control by Query Rewriting: the Case of KorAP
Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers
IIIT-Hyderabad, INDIA Enrichment of Bilingual Dictionary through News Stream Data
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT
IIT Bombay, INDIA Efficient Reuse of Structured and Unstructured Resources for Ontology Population
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control
IIT Kharagpur, INDIA Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages
IITB, Mumbai, INDIA Shata-Anuvadak: Tackling Multiway Translation of Indian Languages
IITB-Monash Research Academy, INDIA Efficient Reuse of Structured and Unstructured Resources for Ontology Population
ILC CNR - Pisa, ITALY Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
Presenting a System of Human-Machine Interaction for Performing Map Tasks.
Generating Polarity Lexicons with WordNet Propagation in 5 Languages
From Synsets to Videos: Enriching ItalWordNet Multimodally
ILS Institute, University at Albany, SUNY, UNITED STATES Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
IMS, Universität Stuttgart, GERMANY Turkish Treebank as a Gold Standard for Morphological Disambiguation and Its Influence on Parsing
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
An Out-of-Domain Test Suite for Dependency Parsing of German
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data
Optimizing a Distributional Semantic Model for the Prediction of German Particle Verb Compositionality
Independent Consultant, QATAR Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests
Indian Institute of Technology of Guwahati, Department of Computer Science and Engineering, INDIA MTWatch: A Tool for the Analysis of Noisy Parallel Data
INESC-ID, ISCTE-IUL, PORTUGAL Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
INESC-ID/IST, PORTUGAL JUST.ASK, a QA System that Learns to Answer New Questions from Previous Interactions
The DIRHA simulated corpus
Innoetics ltd, GREECE Using Audio Books for Training a Text-to-Speech System
INRA-SenS, IFRIS & ISC-PIF, FRANCE Reconstructing the Semantic Landscape of Natural Language Processing
INRIA - Alpage, FRANCE Deep Syntax Annotation of the Sequoia French Treebank
Inria Nancy Grand-Est/LORIA, FRANCE Deep Syntax Annotation of the Sequoia French Treebank
Inria, FRANCE DeLex, a Freely-available, Large-scale and Linguistically Grounded Morphological Lexicon for German
A Language-independent Approach to Extracting Derivational Relations from an Inflectional Lexicon
Developing a French FrameNet: Methodology and First results
An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora
A Language-independent and fully Unsupervised Approach to Lexicon Induction and Part-of-Speech Tagging for Closely Related Languages
Insight Data Centre for Analytics, Galway, IRELAND Extracting Information for Context-aware Meeting Preparation
INSIGHT, National University of Ireland, Galway, IRELAND Missed Opportunities in Translation Memory Matching
Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings
A Method for Building Burst-Annotated Co-Occurrence Networks for Analysing Trends in Textual Data
Institut für Deutsche Sprache, GERMANY Extending Standoff Annotation
Access Control by Query Rewriting: the Case of KorAP
Recent Developments in DeReKo
Institut für Maschinelle Sprachverarbeitung, University of Stuttgart, GERMANY A Corpus of Comparisons in Product Reviews
Institut für Medizinische Informatik/Charite, GERMANY Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies
Institut Mines -Télécom, Télécom ParisTech, CNRS LTCI, FRANCE Comparative Analysis of Verbal Alignment in Human-Human and Human-Agent Interactions
Mining a Multimodal Corpus for Non-Verbal Behavior Sequences Conveying Attitudes
Emilya: Emotional Body Expression in Daily Actions Database
Institut National de l'Audiovisuel, FRANCE An Effortless Way To Create Large-Scale Datasets For Famous Speakers
Institut National de Recherche en Informatique et en Automatique , FRANCE Towards an Environment for the Production and the Validation of Lexical Semantic Resources
Classifying Inconsistencies in DBpedia Language Specific Chapters
Institute for Computational Linguistics, National Research Council CNR, ITALY Sharing Cultural Heritage: the Clavius on the Web Project
Institute for Corpus Linguistics and Text Technology (Austrian Academy of Sciences), AUSTRIA A SKOS-based Schema for TEI encoded Dictionaries at ICLTT
Institute for Corpus Linguistics and Text Technology, AUSTRIA The CMD Cloud
Institute for Dutch Lexicology, NETHERLANDS Taalportaal: an Online Grammar of Dutch and Frisian
Institute for German Language (IDS), GERMANY GenitivDB ― a Corpus-Generated Database for German Genitive Classification
Institute for Infocomm Research, SINGAPORE TaLAPi ― A Thai Linguistically Annotated Corpus for Language Processing
Institute for Infocommunication Technologies, Bay Zoltán Nonprofit Ltd. for Applied Research, The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
Institute for Language and Speech Processing / Innoetics LTD, GREECE Using Audio Books for Training a Text-to-Speech System
Institute for Language and Speech Processing / RC Athens, GREECE Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
Institute for Language and Speech Processing of Athena, GREECE Using Audio Books for Training a Text-to-Speech System
Developing a Framework for Describing Relations among Language Resources
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
Institute for Specialised Communication and Multilingualism, European Academy of Bozen / Bolzano, ITALY 'interHist' - an Interactive Visual Interface for Corpus Exploration
The MERLIN corpus: Learner Language and the CEFR
KoKo: an L1 Learner Corpus for German
Institute of Communication Acoustics, Ruhr-Universität Bochum, GERMANY The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
Institute of Computational Linguistics - CNR, FRANCE The Making of Ancient Greek WordNet
The IMAGACT Visual Ontology. an Extendable Multilingual Infrastructure for the Representation of Lexical Encoding of Action
The Strategic Impact of META-NET on the Regional, National and International Level
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
Presenting a System of Human-Machine Interaction for Performing Map Tasks.
From Synsets to Videos: Enriching ItalWordNet Multimodally
Institute of Computer Science at the Polish Academy of Sciences, POLAND Walenty: Towards a Comprehensive Valence Dictionary of Polish
The Strategic Impact of META-NET on the Regional, National and International Level
Projection-based Annotation of a Polish Dependency Treebank
PoliTa: a Multitagger for Polish
Morfeusz Reloaded
Institute of Formal and Applied Linguistics, Charles University in Prague, CZECH REPUBLIC To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses
Institute of Informatics and Telematics, National Research Council CNR, ITALY Sharing Cultural Heritage: the Clavius on the Web Project
Accommodations in Tuscany as Linked Data
Institute of Linguistics, Academia Sinica, TAIWAN Phone Boundary Annotation in Conversational Speech
Institute of Mathematics and Computer Science of Academy of Sciences of Moldova, Chișinău, RUSSIAN FEDERATION Transliteration and Alignment of Parallel Texts from Cyrillic to Latin
Institute of the Lithuanian Language, LITHUANIA The Strategic Impact of META-NET on the Regional, National and International Level
Instituto de Engenharia de Sistemas e Computadores ― Investigação e Desenvolvimento em Lisboa, Faculdade de Letras da Universidade de Lisboa, PORTUGAL OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
JUST.ASK, a QA System that Learns to Answer New Questions from Previous Interactions
Translation Errors from English to Portuguese: an Annotated Corpus
Instituto de Engenharia de Sistemas e Computadores ― Investigação e Desenvolvimento em Lisboa, PORTUGAL Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries
JUST.ASK, a QA System that Learns to Answer New Questions from Previous Interactions
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
Translation Errors from English to Portuguese: an Annotated Corpus
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
El-WOZ: a Client-Server Wizard-of-Oz Interface
Instituto de Linguística Teórica e Computacional / Universidade de Lisboa, PORTUGAL Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
Instituto de Telecomunicações, University of Porto, PORTUGAL VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
Instituto de Telecomunicações, University of Porto, Faculty of Engineering, PORTUGAL HESITA(te) in Portuguese
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
Interinstitutional Center for Research and Development in Computational Linguistics, BRAZIL A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words
International Institute of Information Technology, Hyderabad, INDIA Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT
Intersect, UNITED KINGDOM The Alveo Virtual Laboratory: A Web based Repository API
Invision Ondertiteling, GERMANY Machine Translation for Subtitling: A Large-Scale Evaluation
IRCAM, UMR STMS UPMC-CNRS, FRANCE Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
IRISA - CNRS, FRANCE Generating and using Probabilistic Morphological Resources for the Biomedical Domain
IRISA, Université de Rennes 1, FRANCE The ETAPE Speech Processing Evaluation
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
ROOTS: a Toolkit for Easy, Fast and Consistent Processing of Large Sequential Annotated Data Collections
Named Entity Corpus Construction using Wikipedia and DBpedia Ontology
Generating and using Probabilistic Morphological Resources for the Biomedical Domain
IRIT ― CNRS, FRANCE A Model for Processing Illocutionary Structures and Argumentation in Debates
Developing a French FrameNet: Methodology and First results
Enriching the "Senso Comune" Platform with Automatically Acquired Data
ISC-PIF, FRANCE Reconstructing the Semantic Landscape of Natural Language Processing
ISCTE-IUL - Instituto Universitário de Lisboa, Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
ISCTE/INESC-ID, PORTUGAL Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
iSOCO, S.A., SPAIN Language Processing Infrastructure in the XLike Project
ISTC CNR, ITALY Discovering the Italian Literature: Interactive Access to Audio-indexed Text Resources
Istituto di Linguistica Computazionale "Antonio Zampolli" ― Consiglio Nazionale delle Ricerche, ITALY The Making of Ancient Greek WordNet
The IMAGACT Visual Ontology. an Extendable Multilingual Infrastructure for the Representation of Lexical Encoding of Action
The Strategic Impact of META-NET on the Regional, National and International Level
The LRE Map disclosed
From Synsets to Videos: Enriching ItalWordNet Multimodally
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
T2K2: a System for Automatically Extracting and Organizing Knowledge from Texts
Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies
Italian Institute of Technology, ITALY The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech
IXA NLP Group, University of the Basque Country (UPV/EHU), SPAIN IXA pipeline: Efficient and Ready to Use Multilingual NLP tools
Generating Polarity Lexicons with WordNet Propagation in 5 Languages

 

J
Jadavpur University, INDIA Word Alignment-Based Reordering of Source Chunks in PB-SMT
Japan Science and Technology Agency, JAPAN Bilingual Dictionary Construction with Transliteration Filtering
Jobandtalent, UNITED KINGDOM Collection of a Simultaneous Translation Corpus for Comparative Analysis
Collocation or Free Combination? ― Applying Machine Translation Techniques to identify collocations in Japanese
Language Resource Addition: Dictionary or Corpus?
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
Ranking Job Offers for Candidates: Learning Hidden Knowledge from Big Data
Parsing Chinese Synthetic Words with a Character-based Dependency Model
Johns Hopkins University - Human Language Technology Center of Excellence, UNITED STATES Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
Johns Hopkins University, UNITED STATES A Wikipedia-based Corpus for Contextualized Machine Translation
A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic
The Multilingual Paraphrase Database
The American Local News Corpus
Joint Research Centre (JRC), ITALY Clustering of Multi-Word Named Entity Variants: Multilingual Evaluation
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts
Jožef Stefan Institute, Department of Knowledge Technologies, SLOVENIA sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks
TweetCaT: a Tool for Building Twitter Corpora of Smaller Languages
Jožef Stefan Institute, SLOVENIA sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks
The Strategic Impact of META-NET on the Regional, National and International Level
Language Processing Infrastructure in the XLike Project
JRC European Commission, ITALY DCEP -Digital Corpus of the European Parliament
JSPS/ National Institute of Informatics, JAPAN A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
JULIE Lab, Friedrich Schiller Universität Jena, GERMANY Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs
Disclose Models, Hide the Data - How to Make Use of Confidential Corpora without Seeing Sensitive Raw Data

 

K
Karlsruhe Institute of Technology, GERMANY GlobalPhone: Pronunciation Dictionaries in 20 Languages
xLiD-Lexica: Cross-lingual Linked Data Lexica
A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification
A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation
Manual Analysis of Structurally Informed Reordering in German-English Machine Translation
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation
Kazan Federal University, RUSSIAN FEDERATION Finite-State Morphological Transducers for Three Kypchak Languages
Keio University, JAPAN Relating Frames and Constructions in Japanese FrameNet
KIIT College of Engineering, Gurgaon, INDIA Statistical Analysis of Multilingual Text Corpus and Development of Language Models
Know-Center, AUSTRIA Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool
KnowBel Technologies, BELGIUM FLELex: a graded Lexical Resource for French Foreign Learners
Knowledge Society Association, POLAND The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
KOEMEI SA, SWITZERLAND The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
Korea advanced Institute of Science and Technology, KOREA Named Entity Corpus Construction using Wikipedia and DBpedia Ontology
KTH, School of Computer Science and Communication, Department of Speech Music and Hearing, SWEDEN Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish
The WaveSurfer Automatic Speech Recognition Plugin
KULeuven, BELGIUM HiEve: A Corpus for Extracting Event Hierarchies from News Stories
Kyoto University, JAPAN Bilingual Dictionary Construction with Transliteration Filtering
Crowdsourcing for Evaluating Machine Translation Quality
Integration of Workflow and Pipeline for Language Service Composition A Large Scale Database of Strongly-related Events in Japanese
Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations
Bilingual Dictionary Induction as an Optimization Problem
A Japanese Word Dependency Corpus
Language Resource Addition: Dictionary or Corpus?
Flow Graph Corpus from Recipe Texts
A Framework for Compiling High Quality Knowledge Resources From Raw Corpora
Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features

 

L
Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, SLOVAKIA The Strategic Impact of META-NET on the Regional, National and International Level
L2F@INESC-ID and Department of Computer Science, University of Évora, PORTUGAL UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation
L3I - Université de La Rochelle, FRANCE Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
Laboratoire d'Informatique de Grenoble, FRANCE The Sweet-Home Speech and Multimodal Corpus for Home Automation Interaction
Laboratoire d'Informatique, de Robotique et de Microélectronique de Montpellier, FRANCE Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A Purpose
Relation Inference in Lexical Networks ... with Refinements
New Functions for a Multipurpose Multimodal Tool for Phonetic and Linguistic Analysis of Very Large Speech Corpora
Laboratoire de Sciences Cognitives et Psycholinguistique - Ecole des Hautes Etudes en Sciences Sociales / Ecole Nationale Supérieure / CNRS, FRANCE Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
Laboratoire National de métrologie et d'Essais, FRANCE The ETAPE Speech Processing Evaluation
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition
Laboratoire Parole et Langage, CNRS, FRANCE Representing Multimodal Linguistic Annotated Data
Automatic Detection of Other-Repetition Occurrences: Application to French Conversational Speech
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
Laboratoire Parole et Langage, Université Aix-Marseille, FRANCE Automatic Detection of Other-Repetition Occurrences: Application to French Conversational Speech
Language Computer Corporation, UNITED STATES Semi-Supervised Methods for Expanding Psycholinguistics Norms by Integrating Distributional Similarity with the Structure of WordNet
#mygoal: Finding Motivations on Twitter
Language Technologies Institute, Carnegie Mellon University, UNITED STATES Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
LAS, PORTUGAL Construction and Annotation of a French Folkstale Corpus
LATTICE - CNRS, FRANCE Reconstructing the Semantic Landscape of Natural Language Processing
LaTTiCe, FRANCE Evaluation of Different Strategies for Domain Adaptation in Opinion Mining
Lexical Computing Ltd., UNITED KINGDOM Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
Extrinsic Corpus Evaluation with a Collocation Dictionary Task
Lexiques, Dictionnaires, Informatique ― Université Paris 13, FRANCE Developing a French FrameNet: Methodology and First results
Lexis Nexis BIS, FRANCE NewsReader: Recording History from Daily News Streams
LIA ― Université d'Avignon, FRANCE Characterizing and Predicting Bursty Events: the Buzz Case Study on Twitter
Investigating the Image of Entities in Social Media: Dataset Design and First Results
A LDA-based Topic Classification Approach from highly Imperfect Automatic Transcriptions
LIA-CERI (Laboratoire d'Informatique d'Avignon-CERI), FRANCE NASTIA: Negotiating Appointment Setting Interface
LIF, Aix-Marseille Université, FRANCE Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them
Comparing Similarity Measures for Distributional Thesauri
LIF-CNRS (Aix Marseille Université), FRANCE FLELex: a graded Lexical Resource for French Foreign Learners
LiLPa, Université de Strasbourg, FRANCE From Non Word to New Word: Automatically Identifying Neologisms in French Newspapers
LIMSI - CNRS & ENSIIE, FRANCE Construction and Annotation of a French Folkstale Corpus
Annotation of Specialized Corpora using a Comprehensive Entity and Relation Scheme
Multiple Choice Question Corpus Analysis for Distractor Characterization
LIMSI - CNRS & IMMI - CNRS, FRANCE The ETAPE Speech Processing Evaluation
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
The Strategic Impact of META-NET on the Regional, National and International Level
Facing the Identification Problem in Language-Related Scientific Data Analysis.
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
LIMSI - CNRS et Université Paris-Sud, FRANCE Construction and Annotation of a French Folkstale Corpus
A Hierarchical Taxonomy for Classifying Hardness of Inference Tasks
Extracting News Web Page Creation Time with DCTFinder
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime
Multiple Choice Question Corpus Analysis for Distractor Characterization
Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory
Thematic Cohesion: Measuring Terms Discriminatory Power Toward Themes
Bidirectionnal Converter Between Syntactic Annotations: from French Treebank Dependencies to PASSAGE Annotations, and back
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
LIMSI - CNRS, FRANCE Toward a Unifying Model for Opinion, Sentiment and Emotion Information Extraction
A Corpus of Machine Translation Errors Extracted from Translation Students Exercises
Biomedical Entity Extraction using Machine-Learning Based Approaches
Morpho-Syntactic Study of Errors from Speech Recognition System
Use of Unsupervised Word Classes for Entity Recognition: Application to the Detection of Disorders in Clinical Reports
Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime
Semantic Clustering of Pivot Paraphrases
Annotation of Specialized Corpora using a Comprehensive Entity and Relation Scheme
Language Resources for French in the Biomedical Domain
Multiple Choice Question Corpus Analysis for Distractor Characterization
Rule-based Reordering Space in Statistical Machine Translation
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
Facing the Identification Problem in Language-Related Scientific Data Analysis.
Smile and Laughter in Human-Machine Interaction: a Study of Engagement
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition
Bidirectionnal Converter Between Syntactic Annotations: from French Treebank Dependencies to PASSAGE Annotations, and back
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
LINA ― Université de Nantes, FRANCE Validation Issues induced by an Automatic Pre-Annotation Mechanism in the Building of Non-projective Dependency Treebanks
LINA, FRANCE Semi-Compositional Method for Synonym Extraction of Multi-Word Terms
LingoChamp, JAPAN A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
LinguaTV GmbH, GERMANY Sprinter: Language Technologies for Interactive and Multimedia Language Learning
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus
The RATS Collection: Supporting HLT Research with Degraded Audio Data
Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development
Incorporating Alternate Translations into English Translation Treebank
New Directions for Language Resource Development and Distribution
The Language Application Grid
Facing the Identification Problem in Language-Related Scientific Data Analysis.
Developing a Framework for Describing Relations among Language Resources
Linköping University, SWEDEN The Impact of Cohesion Errors in Extraction Based Summaries
LIUM, FRANCE Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks
LLF ― Université Paris Diderot / CNRS, FRANCE Developing a French FrameNet: Methodology and First results
LLL, Université François Rabelais & CNRS, Tours-Orléans, FRANCE Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
Logos Institute, FRANCE Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
Lomonosov Moscow State University, RUSSIAN FEDERATION Summarizing News Clusters on the Basis of Thematic Chains
LORIA / Inria Nancy Grand-Est, FRANCE Mapping the Lexique des Verbes du français (Lexicon of French Verbs) to a NLP Lexicon using Examples
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
LORIA, FRANCE Building and Modelling Multilingual Subjective Corpora
LORIA/CNRS, FRANCE Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
LPP (Laboratoire Phonétique & Phonologie) / LIMSI-CNRS, FRANCE Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
"Human Annotation of ASR Error Regions: is ""gravity"" a Sharable Concept for Human Annotators?"
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition
LPP (Laboratoire Phonétique & Phonologie) / Université Paris Sorbonne Nouvelle, FRANCE Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
LSCP, EHESS-ENS-CNRS, Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
LTRC, IIIT Hyderabad, INDIA Benchmarking of English-Hindi Parallel Corpora
Towards Building a Kashmiri Treebank: Setting up the Annotation Pipeline
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT
Lund University, SWEDEN REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations
LUSI-Lab @DIETI University "Federico II", Naples, ITALY VOLIP: a Corpus of Spoken Italian and a Virtuous Example of Reuse of Linguistic Resources

 

M
Macquarie University, AUSTRALIA Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
Macquarie University, Department of Computing, AUSTRALIA AusTalk: an Audio-Visual Corpus of Australian English
The Alveo Virtual Laboratory: A Web based Repository API
Macquarie University, Department of Linguistics, AUSTRALIA AusTalk: an Audio-Visual Corpus of Australian English
Manchester University, UNITED KINGDOM Detecting Document Structure in a Very Large Corpus of UK Financial Reports
Masaryk University, CZECH REPUBLIC Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing
Masaryk University, Natural Language Processing Centre, CZECH REPUBLIC Extrinsic Corpus Evaluation with a Collocation Dictionary Task
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation
Massachusetts Institute of Technology, IBM, UNITED STATES A Study on Expert Sourcing Enterprise Question Collection and Classification
Massachusetts Institute of Technology, Lincoln Laboratory, UNITED STATES A Wikipedia-based Corpus for Contextualized Machine Translation
Massachusetts Institute of Technology, UNITED STATES The N2 Corpus: a Semantically Annotated Collection of Islamist Extremist Stories
Max Planck Institut for Psycholinguistics, GERMANY Towards Automatic Quality Assessment of Component Metadata
Improving the Exploitation of Linguistic Annotations in ELAN
VarClass: An Open-source Language Identification Tool for Language Varieties
Max Planck Institute for Evolutionary Anthropology, GERMANY Linguistic Landscaping of South Asia using Digital Language Resources: Genetic vs. Areal Linguistics
Max Planck Institute for Psycholinguistics, Nijmegen, GERMANY Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
Microsoft Language Development Center / IEETA, Universidade de Aveiro, UNITED STATES Multimodal Corpora for Silent Speech Interaction
Microsoft Language Development Center, UNITED STATES Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
Multimodal Corpora for Silent Speech Interaction
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
Microsoft Research India, INDIA Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
Microsoft Research, UNITED STATES Enriching ODIN
Annotating Clinical Events in Text Snippets for Phenotype Detection
Microsoft, UNITED STATES Modern Chinese Helps Archaic Chinese Processing: Finding and Exploiting the Shared Properties
Middle East Technical University, TURKEY Turkish Resources for Visual Word Recognition
Mipumi Games GmbH, AUSTRIA The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
MLDC/ADETTI-ISCTE, PORTUGAL Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
Modyco, Université Paris Ouest Nanterre & CNRS, FRANCE Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie
Modyco, Université Paris Ouest Nanterre, FRANCE Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
MTA-PPKE Hungarian Language Technology Research Group, Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, HUNGARY A New Form of Humor ― Mapping Constraint-Based Computational Morphologies to a Finite-State Representation

 

N
Nagoya University, JAPAN Text Readability and Word Distribution in Japanese
Nanyang Technological University, SINGAPORE Identifying Idioms in Chinese Translations
Building The Sense-Tagged Multilingual Parallel Corpus
Nat'l Institute for Japanese Language and Linguistics, JAPAN Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
National Institute of Informatics, JAPAN Annotation of Computer Science Papers for Semantic Relation Extrac-tion
National Institute for Japanese Language and Linguistics, JAPAN Building a Database of Japanese Adjective Examples from Special Purpose Web Corpora
Collocation or Free Combination? ― Applying Machine Translation Techniques to identify collocations in Japanese
Towards Automatic Transformation Between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features
National Institute of Informatics; The University of Tokyo, Corpus for Coreference Resolution on Scientific Papers
National Library of Medicine, UNITED STATES Annotating Question Decomposition on Complex Medical Questions
National Library of Norway, NORWAY Towards Automatic Quality Assessment of Component Metadata
National Taiwan University, TAIWAN Sentence Rephrasing for Parsing Sentences with OOV Words
National Technical University of Athens, School of ECE, GREECE Word Semantic Similarity for Morphologically Rich Languages
National Tsing Hua University, Institute of Information Systems and Applications, TAIWAN Phone Boundary Annotation in Conversational Speech
National University of Ireland, Altocloud, Galway, IRELAND Extracting Information for Context-aware Meeting Preparation
National University of Ireland, Galway, IRELAND Evaluation of Technology Term Recognition with Random Indexing
National University of Ireland, Maynooth, Department of Computer Science, IRELAND A Deep Context Grammatical Model For Authorship Attribution
National University of Taiwan, Department of Computer Science & Information Engineering, TAIWAN Phone Boundary Annotation in Conversational Speech
Sentence Rephrasing for Parsing Sentences with OOV Words
NCSR Demokritos, NOMAD: Linguistic Resources and Tools Aimed at Policy Formulation and Validation
The Ellogon Pattern Engine: Context-free Grammars over Annotations
Annotating Arguments: The NOMAD Collaborative Annotation Tool
NEC Corporation, JAPAN Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
Nederlandse Taalunie / HLT Agency, NETHERLANDS A Decade of HLT Agency Activities in the Low Countries: from Resource Maintenance (BLARK) to Service Offerings (BLAISE)
New York University, UNITED STATES Corpus and Method for Identifying Citations in Non-Academic Text
Annotating Relations in Scientific Articles
Newcastle University, UNITED KINGDOM Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
NLP Technologies, CANADA Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government
NLP2CT Lab / Department of Computer and Information Science, MOROCCO UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation
Nomotika SRL, ITALY Exploiting networks in Law
Norges teknisk-naturvitenskapelige universitet, NORWAY MultiVal - Towards a Multilingual Valence Lexicon
North-West University, UNITED STATES The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis.
North-West University,Centre for Text Technology, UNITED STATES Developing Text Resources for Ten South African Languages
Northwestern University, UNITED STATES Using Resource-Rich Languages to Improve Morphological Analysis of Under-Resourced Languages
Discosuite - A Parser Test Suite for German Discontinuous Structures
Norwegian School of Economics, NORWAY CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Norwegian University of Science and Technology, NORWAY A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence
NTT, JAPAN Extraction of Daily Changing Words for Question Answering
Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations
Nuance Communications, BRAZIL NomLex-PT: A Lexicon of Portuguese Nominalizations

 

O
Oberlin College, UNITED STATES Morphological Parsing of Swahili using Crowdsourced Lexical Resources
Objet Direct, FRANCE Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain.
Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
OntoChem GmbH, GERMANY Creating a Gold Standard Corpus for the Extraction of Chemistry-Disease Relations from Patent Texts
Ontology Engineering Group, Universidad Politécnica de Madrid, SPAIN Enabling Language Resources to Expose Translations as Linked Data on the Web
Open University of Catalonia, SPAIN Machine Translationness: Machine-likeness in Machine Translation Evaluation
Orange Labs, FRANCE NASTIA: Negotiating Appointment Setting Interface
DINASTI: Dialogues with a Negotiating Appointment Setting Interface
Ortolang, FRANCE Representing Multimodal Linguistic Annotated Data
Otto-von-Guericke Universität, GERMANY Modeling and Evaluating Dialog Success in the LAST MINUTE Corpus
Owlin, NETHERLANDS N-gram Counts and Language Models from the Common Crawl

 

P
PAVIS, Italian Institute of Technology, ITALY The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech
Peking University, CHINA Construction of Diachronic Ontologies from People's Daily of Fifty Years
Peking University, Key Laboratory of Computational Linguistics, Ministry of Education, CHINA Construction of Diachronic Ontologies from People's Daily of Fifty Years
Philipps-University of Marburg, GERMANY Creating a Massively Parallel Bible Corpus
A Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology
Pixformance GmbH, GERMANY The taraXÜ Corpus of Human-Annotated Machine Translations
Polish Academy of Science, POLAND A Model for Processing Illocutionary Structures and Argumentation in Debates
Polytechnique Montréal, CANADA A Multimodal Interpreter for 3D Visualization and Animation of Verbal Concepts
Improving Entity Linking using Surface Form Refinement
Pontifícia Universidade Católica do Rio Grande do Sul, BRAZIL Building Domain Specific Bilingual Dictionaries
Comparative Analysis of Portuguese Named Entities Recognition Tools
VOAR: A Visual and Integrated Ontology Alignment Environment
Praxiling UMR 5267 CNRS & Université Paul-Valéry Montpellier 3, FRANCE Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
Priberam, Instituto de Telecomunicacoes, PORTUGAL Priberam Compressive Summarization Corpus: A New Multi-Document Summarization Corpus for European Portuguese
Prompsit Language Engineering, SPAIN Quality Estimation for Synthetic Parallel Data Generation
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
PSD/LDS Church, UNITED STATES Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records
PUCRS, BRAZIL Boosting Open Information Extraction with Noun-Based Relations
Pázmány Péter Catholic University, HUNGARY An Efficient Language Independent Toolkit for Complete Morphological Disambiguation

 

Q
Qatar Computing Research Institute, QATAR SenTube: A Corpus for Sentiment Analysis on YouTube Social Media
Simple Effective Microblog Named Entity Recognition: Arabic as an Example
Using Stem-Templates to Improve Arabic POS and Gender/Number Tagging
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
The AMARA Corpus: Building Parallel Language Resources for the Educational Domain
Qatar University, QATAR Development of a TV Broadcasts Speech Recognition System for Qatari Arabic
Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech

 

R
RACAI, ROMANIA CoRoLa ― The Reference Corpus of Contemporary Romanian Language
Radboud University Nijmegen, Center for Language Studies, NETHERLANDS The Dutch LESLLA Corpus
The Gulf of Guinea Creole Corpora
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
Radboud University Nijmegen, Department of Linguistics, NETHERLANDS ASR-based CALL Systems and Learner Speech Data: New Resources and Opportunities for Research and Development in Second Language Learning
Radboud University Nijmegen, NETHERLANDS Creating and Using Large Monolingual Parallel Corpora for Sentential Paraphrase Generation
The Nijmegen Corpus of Casual Czech
ASR-based CALL Systems and Learner Speech Data: New Resources and Opportunities for Research and Development in Second Language Learning
Semi-Automatic Annotation of the UCU Accents Speech Corpus
Improving the Exploitation of Linguistic Annotations in ELAN
Rakuten Institute of Technology, New York, UNITED STATES Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese
RALI, CANADA Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government
Rensselaer Polytechnic Institute, UNITED STATES Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese
Research Institute for Artificial Intelligence, Romanian Academy, ROMANIA Large SMT Data-sets Extracted from Wikipedia
CoRoLa ― The Reference Corpus of Contemporary Romanian Language
The Strategic Impact of META-NET on the Regional, National and International Level
RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus
Research institute for Linguistics, Hungarian Academy of Sciences, HUNGARY The Strategic Impact of META-NET on the Regional, National and International Level
The Hungarian Gigaword Corpus
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
Research Scholar, Indian Institute of Technology Kharagpur, INDIA Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages
Reykjavik University, ICELAND Correcting Errors in a New Gold Standard for Tagging Icelandic Text
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
Rheinisch-Westfaelische Technische Hochschule Aachen , GERMANY Extensions of the Sign Language Recognition and Translation Corpus RWTH-PHOENIX-Weather
Riken Brain Science Institute, JAPAN Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
Ritsumeikan University, JAPAN A Japanese Word Dependency Corpus
Royal Institute of Technology, AUSTRALIA The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
Royal Institute of Technology, School of Computer Science and Communication, Department of Speech, Music and Hearing, AUSTRALIA Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish
The WaveSurfer Automatic Speech Recognition Plugin
Royal Melbourne Institute of Technology, AUSTRALIA The Alveo Virtual Laboratory: A Web based Repository API
Ruhr - Universität Bochum, GERMANY Building a Reference Lexicon for Countability in English
Rutgers University, UNITED STATES A New Framework for Sign Language Recognition Based on 3D Handshape Identification and Linguistic Modeling
3D Face Tracking and Multi-Scale, Spatio-temporal Analysis of Linguistically Significant Facial Expressions and Head Positions in ASL

 

S
Saarland University, GERMANY A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence
Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction
Interoperability of Dialogue Corpora through ISO 24617-2-based Querying
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization
Multiword Expressions in Machine Translation
Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers
Saint John's College, Santa Fe, UNITED STATES Identifying Idioms in Chinese Translations
Samsung Advanced Institute of Technology, KOREA Can the Crowd be Controlled?: A Case Study on Crowd Sourcing and Automatic Validation of Completed Tasks based on User Modeling
Samsung Research America, UNITED STATES Building a Dataset for Summarization and Keyword Extraction from Emails
Sapienza University of Rome, ITALY Annotating the MASC Corpus with BabelNet
Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
Clustering of Multi-Word Named Entity Variants: Multilingual Evaluation
HuRIC: a Human Robot Interaction Corpus
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0
An Analysis of Ambiguity in Word Sense Annotations
Sarah M. Taylor Consulting, LLC, UNITED STATES Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors
ScraperWiki, UNITED KINGDOM NewsReader: Recording History from Daily News Streams
SfS Uni Tübingen, GERMANY Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D
Siberian State Aerospace University named after academician M. F. Reshetnev, RUSSIAN FEDERATION Speech-Based Emotion Recognition: Feature Selection by Self-Adaptive Multi-Criteria Genetic Algorithm
Siemens AG, GERMANY LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain
Sikom Software GmbH, The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
Simon Fraser University, CANADA Building a Reference Lexicon for Countability in English
Slovak Academy of Sciences, SLOVAKIA Alert!... Calm Down, There is Nothing to Worry About. Warning and Soothing Speech Synthesis
Sofia University and IICT-BAS, BULGARIA A System for Experiments with Dependency Parsers
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing
Soochow University, TAIWAN Annotating Events in an Emotion Corpus
Speech Group - LORIA CNRS, FRANCE Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
Spoken Language Systems Laboratory, INESC ID, PORTUGAL The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
Spräkbanken, The National Library of Norway, NORWAY The Norwegian Dependency Treebank
Spräkbanken, University of Gothenburg, SWEDEN Linguistic Landscaping of South Asia using Digital Language Resources: Genetic vs. Areal Linguistics
HFST-SweNER ― A New NER Resource for Swedish
Bring vs. MTRoget: Evaluating Automatic Thesaurus Translation
The Strategic Impact of META-NET on the Regional, National and International Level
Computer-aided Morphology Expansion for Old Swedish
A Flexible Language Learning Platform Based on Language Resources and Web Services
Stanford Universiy, UNITED STATES A Gold Standard Dependency Corpus for English
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
Stofnun Árna Magnússonar í íslenskum fræðum, ICELAND Mörkum Njálu. An Annotated Corpus to Analyse and Explain Grammatical Divergences Between 14th-century Manuscripts of Njál's Saga.
Syllabs, FRANCE Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory
Thematic Cohesion: Measuring Terms Discriminatory Power Toward Themes
Synerscope, NETHERLANDS NewsReader: Recording History from Daily News Streams
Synthema, RUSSIAN FEDERATION SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling

 

T
Tagmatica + IMMI-CNRS, FRANCE Facing the Identification Problem in Language-Related Scientific Data Analysis.
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
Taif University, SAUDI ARABIA DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment
Tallinn University of Technology, ESTONIA Dense Components in the Structure of WordNet
Technical University Munich, GERMANY LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain
Technical University of Cluj-Napoca, Speech Processing Group, ROMANIA GRASS: the Graz corpus of Read And Spontaneous Speech
RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus
Technical University of Kosice, SLOVAKIA The Slovak Categorized News Corpus
TUKE-BNews-SK: Slovak Broadcast News Corpus Construction and Evaluation
Technische Universität Berlin, GERMANY The Weltmodell: A Data-Driven Commonsense Knowledge Base
Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction
Technische Universität Darmstadt, GERMANY Distributed Distributional Similarities of Google Books over the Centuries
NoSta-D Named Entity Annotation for German: Guidelines and Dataset
Lexical Substitution Dataset for German
Technische Universität Dresden, GERMANY The MERLIN corpus: Learner Language and the CEFR
Technische Universität München / Imperial College of London, GERMANY The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production
Technische Universität München, GERMANY The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production
Telecommunication Laboratories, Chunghwa Telecom Co., Ltd., TAIWAN Sentence Rephrasing for Parsing Sentences with OOV Words
TETIS, Cirad, Irstea, AgroParisTech, FRANCE Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
Text Shuttle GmbH, FRANCE Machine Translation for Subtitling: A Large-Scale Evaluation
Text Shuttle GmbH, SWITZERLAND Machine Translation for Subtitling: A Large-Scale Evaluation
The Chinese University of Hong Kong, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank
The Halliday Centre Tagger: An Online Platform for Semi-automatic Text Annotation and Analysis
The Hong Kong Polytechnic University, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA Annotating Events in an Emotion Corpus
The Hong Kong University of Science and Technology, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA Co-Training for Classification of Live or Studio Music Recordings
The Language Archive - DANS, GERMANY Experiences with the ISOcat Data Category Registry
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework
Linguistic Resources and Cats: How to Use ISOcat, RELcat and SCHEMAcat
The Language Archive, Max Planck Institute for Psycholinguistics, GERMANY The DWAN Framework: Application of a Web Annotation Framework for the General Humanities to the Domain of Language Resources
Experiences with the ISOcat Data Category Registry
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
The Long Now Foundation, UNITED STATES PanLex: Building a Resource for Panlingual Lexical Translation
The National Institute for Japanese Language and Linguistics, JAPAN Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
Towards Automatic Transformation Between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features
The Ohio State University, UNITED STATES Universal Stanford Dependencies: a Cross-Linguistic Typology
A Gold Standard Dependency Corpus for English
The Open University, UNITED KINGDOM On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter
The Telecommunications Research Center Vienna (FTW), AUSTRIA The MMASCS Multi-Modal Annotated Synchronous Corpus of Audio, Video, Facial Motion and Tongue Motion Data of Normal, Fast and Slow Speech
The University of Electro-Communications, JAPAN A Character-based Approach to Distributional Semantic Models: Exploiting Kanji Characters for Constructing JapaneseWord Vectors
The Árni Magnússon Institute for Icelandic Studies, ICELAND Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
Annotation Pro + TGA: Automation of Speech Timing Analysis
ISLEX ― a Multilingual Web Dictionary
Correcting Errors in a New Gold Standard for Tagging Icelandic Text
A 500 Million Word POS-Tagged Icelandic Corpus
Utilizing Constituent Structure for Compound Analysis
Tilburg Center for Cognition and Communication, NETHERLANDS The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis.
Tilburg University, NETHERLANDS Semantic Approaches to Software Component Retrieval with English Queries
Creating and Using Large Monolingual Parallel Corpora for Sentential Paraphrase Generation
Interoperability of Dialogue Corpora through ISO 24617-2-based Querying
The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis.
Synergy of Nederlab and @PhilosTEI: Diachronic and Multilingual Text-induced Corpus Clean-up
Tilde, LATVIA Terminology Localization Guidelines for the National Scenario
Designing the Latvian Speech Recognition Corpus
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Bilingual dictionaries for all EU languages
Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus
Terminology Resources and Terminology Work Benefit from Cloud Services
Titelbild Subtitling and Translation, GERMANY Machine Translation for Subtitling: A Large-Scale Evaluation
Tokyo Institute of Technology, JAPAN Building a Corpus of Manually Revised Texts from Discourse Perspective
Toulouse Université & CNRS, Octogone, FRANCE Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
Trendiction S.A., GERMANY Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction
Trento-Rise, ITALY Creative Language Explorations through a high-Expressivity N-grams Query Language
Enriching the "Senso Comune" Platform with Automatically Acquired Data
Trinity College of Dublin, Centre for Global Intelligent Content, IRELAND Global Intelligent Content: Active Curation of Language Resources using Linked Data
Trinity College of Dublin, Centre for Next Generation Localisation, IRELAND Global Intelligent Content: Active Curation of Language Resources using Linked Data
Trojina, Institute for Applied Slovene Studies, SLOVENIA The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech
The Slovene BNSI Broadcast News Database and Reference Speech Corpus GOS: Towards the Uniform Guidelines for Future Work
Université Technologique de Troyes, FRANCE The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
Tsinghua University, CHINA NomLex-PT: A Lexicon of Portuguese Nominalizations
Etymological WordNet: Tracing the History of Words
Bring vs. MTRoget: Evaluating Automatic Thesaurus Translation
Language Processing Infrastructure in the XLike Project
Clustering Tweets using Wikipedia Concepts
Tsukuba University of Technology, JAPAN A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
Télécom ParisTech, FRANCE A Model to Generate Adaptive Multimodal Job Interviews with a Virtual Recruiter

 

U
Ubiquitous Knowledge Processing (UKP) Lab, GERMANY WordNet―Wikipedia―Wiktionary: Construction of a Three-way Alignment
Lexical Substitution Dataset for German
Ugo Bordoni Foundation, ITALY EMOVO Corpus: an Italian Emotional Speech Database
UKP Lab, Technische Universität Darmstadt, GERMANY WordNet―Wikipedia―Wiktionary: Construction of a Three-way Alignment
Ulm University, GERMANY First Insight into Quality-Adaptive Dialogue
Comparison of Gender- and Speaker-adaptive Emotion Recognition
Speech-Based Emotion Recognition: Feature Selection by Self-Adaptive Multi-Criteria Genetic Algorithm
UMass CS, IBM, UNITED STATES A Study on Expert Sourcing Enterprise Question Collection and Classification
UMI 2958 CNRS - GeorgiaTech, FRANCE NASTIA: Negotiating Appointment Setting Interface
DINASTI: Dialogues with a Negotiating Appointment Setting Interface
UMR CLLE-ERSS 5263, Université Toulouse 2/CNRS, TALC-Sef a Manually-revised POS-Tagged Literary Corpus in Serbian, English and French
UMR STL 8163, Université Lille 3, FRANCE TALC-Sef a Manually-revised POS-Tagged Literary Corpus in Serbian, English and French
Union College, UNITED KINGDOM Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
Unister GmbH, Leipzig, Germany, NIF4OGGD - NLP Interchange Format for Open German Governmental Data
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
Universidad Católica "Nuestra Señora de la Asunción", BRAZIL Guampa: a Toolkit for Collaborative Translation
Universidad Complutense de Madrid (UCM) / ATLAS (UNED), SPAIN Standardisation and Interoperation of Morphosyntactic and Syntactic Annotation Tools for Spanish and their Annotations
Universidad Complutense de Madrid (UCM), SPAIN Standardisation and Interoperation of Morphosyntactic and Syntactic Annotation Tools for Spanish and their Annotations
NoSta-D Named Entity Annotation for German: Guidelines and Dataset
Can Numerical Expressions Be Simpler? Implementation and Demonstration of a Numerical Simplification System for Spanish
Universidad de Antioquia, COLOMBIA New Spanish Speech Corpus Database for the Analysis of People Suffering from Parkinson's Disease
Universidad de Valladolid, SPAIN On the Use of a Fuzzy Classifier to Speed Up the Sp ToBI Labeling of the Glissando Spanish Corpus
Assessment of Non-native Prosody for Spanish as L2 using Quantitative Scores and Perceptual Evaluation
Universidad Nacional Autónoma de México, MEXICO CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus
Universidad Nacional de Educación a Distancia, SPAIN TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Universidad Nacional de Guinea Ecuatorial, EQUATORIAL GUINEA The Gulf of Guinea Creole Corpora
Universidad Politécnica de Cataluña, SPAIN TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Universidad Politécnica de Madrid, SPAIN Enabling Language Resources to Expose Translations as Linked Data on the Web
Universidade de Lisboa, Centro de Linguística, PORTUGAL An Evaluation of the Role of Statistical Measures and Frequency for MWE Identification
LexTec - a Rich Language Resource for Technical Domains in Portuguese
The Gulf of Guinea Creole Corpora
A Cascade Approach for Complex-type Classification
Combining Dependency Information and Generalization in a Pattern-based Approach to the Classification of Lexical-Semantic Relation Instances
Extracting Semantic Relations from Portuguese Corpora using Lexical-Syntactic Patterns
Universidade de Lisboa, PORTUGAL Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
The Strategic Impact of META-NET on the Regional, National and International Level
The Gulf of Guinea Creole Corpora
A Corpus of European Portuguese Child and Child-directed Speech
Universidade de São Paulo, BRAZIL Generating a Lexicon of Errors in Portuguese to Support an Error Identification System for Spanish Native Learners
Universidade de Vigo, SPAIN Introducing a Framework for the Evaluation of Music Detection Tools
Universidade Federal do Paraná, BRAZIL NomLex-PT: A Lexicon of Portuguese Nominalizations
Universitaet Bielefeld, GERMANY ALICO: a Multimodal Corpus for the Study of Active Listening
Universitaet Frankfurt am Main, GERMANY Towards Interoperable Discourse Annotation. Discourse Features in the Ontologies of Linguistic Annotation
Universitat Autònoma de Barcelona, SPAIN On the Use of a Fuzzy Classifier to Speed Up the Sp ToBI Labeling of the Glissando Spanish Corpus
DysList: An Annotated Resource of Dyslexic Errors
Discovering Frames in Specialized Domains
Universitat d'Alacant, SPAIN On the Annotation of TMX Translation Memories for Advanced Leveraging in Computer-aided Translation
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
Sharing Resources Between Free/Open-Source Rule-based Machine Translation Systems: Grammatical Framework and Apertium
Universitat de Barcelona, SPAIN VERTa: Facing a Multilingual Experience of a Linguistically-based MT Evaluation
MultiVal - Towards a Multilingual Valence Lexicon
The IULA Spanish LSP Treebank
Universitat Oberta de Catalunya, SPAIN Automatic Creation of WordNets from Parallel Corpora
The NewSoMe Corpus: a Unifying Opinion Annotation Framework Across Genres and in Multiple Languages
Universitat Politecnica de Valencia, SPAIN Enrichment of Bilingual Dictionary through News Stream Data
Universitat Politècnica de Catalunya, SPAIN The Strategic Impact of META-NET on the Regional, National and International Level
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Language Processing Infrastructure in the XLike Project
Universitat Politècnica de València, SPAIN Enrichment of Bilingual Dictionary through News Stream Data
Evaluating the Effects of Interactivity in a Post-Editing Workbench
Online Optimisation of Log-linear Weights in Interactive Machine Translation
University Pompeu Fabra, SPAIN How to Use Less Features and Reach Better Performance in Author Gender Identification
Boosting the Creation of a Treebank
Creating Summarization Systems with SUMMA
Modelling Irony in Twitter: Feature Analysis and Evaluation
Adapting Freely Available Resources to Build an Opinion Mining Pipeline in Portuguese
Can Numerical Expressions Be Simpler? Implementation and Demonstration of a Numerical Simplification System for Spanish
The NewSoMe Corpus: a Unifying Opinion Annotation Framework Across Genres and in Multiple Languages
The IULA Spanish LSP Treebank
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Crowdsourcing as a Preprocessing for Complex Semantic Annotation Tasks
A Cascade Approach for Complex-type Classification
Choosing which to Use? A Study of Distributional Models for Nominal Lexical Semantic Classification
DysList: An Annotated Resource of Dyslexic Errors
Metadata as Linked Open Data: mapping disparate XML metadata registries into one RDF/OWL registry.
Ranking Job Offers for Candidates: Learning Hidden Knowledge from Big Data
Combining Dependency Information and Generalization in a Pattern-based Approach to the Classification of Lexical-Semantic Relation Instances
An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation
University Pompeu Fabra, Department of Information and Communication Technologies, SPAIN An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation
University Pompeu Fabra, Department of Translation and Language Sciences, SPAIN TexAFon 2.0: a Text Processing Tool for the Generation of Expressive Speech in TTS Applications
Universitatea Bucuresti, Facultatea de Matematica si Informatica, ROMANIA On the Romance Languages Mutual Intelligibility
Using a Machine Learning Model to Assess the Complexity of Stress Systems
Building a Dataset of Multilingual Cognates for the Romanian Lexicon
Universite de Bretagne Sud, IRISA, FRANCE Variations on Quantitative Comparability Measures and their Evaluations on Synthetic French-English Comparable Corpora
Co-clustering of Bilingual Datasets as a Mean for Assisting the Construction of Thematic Bilingual Comparable Corpora
Universite de Geneve, SWITZERLAND C-PhonoGenre: a 7-hour Corpus of 7 Speaking Styles in French: Relations between Situational Features and Prosodic Properties
Universiteit Antwerp, BELGIUM Legal Aspects of Text Mining
Universiteit Antwerp, CliPS, BELGIUM CLIPS Stylometry Investigation (CSI) Corpus: a Dutch Corpus for the Detection of Age, Gender, Personality, Sentiment and Deception in Text
The Strategic Impact of META-NET on the Regional, National and International Level
Universitetet i Oslo, Department of Informatics, NORWAY The Norwegian Dependency Treebank
Universitetet i Oslo, Department of Linguistics and Scandinavian Studies, NORWAY The Norwegian Dependency Treebank
Universitetet i Oslo, NORWAY Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows
Universitetet i Tromsø, NORWAY Finite-State Morphological Transducers for Three Kypchak Languages
University at Albany, Institute for Informatics, Logics and Security Studies, UNITED STATES Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors
University at Albany, UNITED STATES Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors
University College Utrecht, NETHERLANDS Semi-Automatic Annotation of the UCU Accents Speech Corpus
University Lille 1 - LIFL (CNRS/Lille 1) - SequeL team, FRANCE NASTIA: Negotiating Appointment Setting Interface
DINASTI: Dialogues with a Negotiating Appointment Setting Interface
University of Aalto, FINLAND SLMotion - an Extensible Sign Language Oriented Video Analysis Tool
S-pot - a Benchmark in Spotting Signs Within Continuous Signing
A Toolkit for Efficient Learning of Lexical Units for Speech Recognition
University of Aalto, School of Business, FINLAND S-pot - a Benchmark in Spotting Signs Within Continuous Signing
University of Alabama at Birmingham, UNITED STATES ClearTK 2.0: Design Patterns for Machine Learning in UIMA
Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities
University of Alberta, UNITED STATES Improving Open Relation Extraction via Sentence Re-Structuring
University of Amsterdam, Institute for Logic, Language, and Computation, NETHERLANDS All Fragments Count in Parser Evaluation
University of Amsterdam, NETHERLANDS Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS
All Fragments Count in Parser Evaluation
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
University of Arizona, UNITED STATES ACTIV-ES: a Comparable, Cross-Dialect Corpus of "everyday" Spanish from Argentina, Mexico, and Spain
Eliciting and Annotating Uncertainty in Spoken Language
Event Extraction Using Distant Supervision
On the Importance of Text Analysis for Stock Price Prediction
The N2 Corpus: a Semantically Annotated Collection of Islamist Extremist Stories
University of Aston, UNITED KINGDOM On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter
University of Austin, Texas, UNITED STATES Exploring Factors that Contribute to Successful Fingerspelling Comprehension
University of Aveiro, PORTUGAL Multimodal Corpora for Silent Speech Interaction
University of Bamberg, GERMANY PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and CEllphone
University of Belgrade, SERBIA The Strategic Impact of META-NET on the Regional, National and International Level
University of Bergen, Department of Linguistic, Literary, and Aesthetic Studies, NORWAY Off-Road LAF: Encoding and Processing Annotations in NLP Workflows
University of Bergen, NORWAY The Interplay Between Lexical and Syntactic Resources in Incremental Parsebanking
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser
Chasing the Perfect Splitter: A Comparison of Different Compound Splitting Tools
University of Birmingham, UNITED KINGDOM Mining Online Discussion Forums for Metaphors
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment
University of Bologna, ITALY A Modular System for Rule-based Text Categorisation
University of Boston, UNITED STATES A New Framework for Sign Language Recognition Based on 3D Handshape Identification and Linguistic Modeling
3D Face Tracking and Multi-Scale, Spatio-temporal Analysis of Linguistically Significant Facial Expressions and Head Positions in ASL
University of Brandeis, UNITED STATES Image Annotation with ISO-Space: Distinguishing Content from Structure
Buy One Get One Free: Distant Annotation of Chinese Tense, Event Type and Modality
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Identification of Multiword Expressions in the brWaC
The Language Application Grid
University of Brighton, UNITED KINGDOM A Comparative Evaluation Methodology for NLG in Interactive Systems
University of British Columbia, CANADA Morphological Parsing of Swahili using Crowdsourced Lexical Resources
University of Bucharest, ROMANIA On the Romance Languages Mutual Intelligibility
Aggregation Methods for Efficient Collocation Detection
Using a Machine Learning Model to Assess the Complexity of Stress Systems
Building a Dataset of Multilingual Cognates for the Romanian Lexicon
University of California Santa Cruz, UNITED STATES Getting Reliable Annotations for Sarcasm in Online Dialogues
University of Cambridge (DTAL), UNITED KINGDOM Semantic Clustering of Pivot Paraphrases
University of Cambridge, UNITED KINGDOM Native Language Identification Using Large, Longitudinal Data
Evaluation of Simple Distributional Compositional Operations on Longer Texts
TagNText: a Parallel Corpus for the Induction of Resource-specific non-Taxonomical Relations from Tagged Images
University of Chiba, Faculty of Letters, JAPAN Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
Towards Automatic Transformation Between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features
Japanese Conversation Corpus for Training and Evaluation of Backchannel Prediction Model.
University of Coastal Carolina, UNITED STATES The N2 Corpus: a Semantically Annotated Collection of Islamist Extremist Stories
University of Coimbra, Centre for Informatics and Systems, PORTUGAL Exploiting Portuguese Lexical Knowledge Bases for Answering Open Domain Cloze Questions Automatically
University of Coimbra, PORTUGAL HESITA(te) in Portuguese
University of Cologne, GERMANY Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
Learning from Domain Complexity
VarClass: An Open-source Language Identification Tool for Language Varieties
University of Colorado at Boulder, UNITED STATES PropBank: Semantics of New Predicate Types
ClearTK 2.0: Design Patterns for Machine Learning in UIMA
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
Mapping CPA Patterns onto OntoNotes Senses
University of Colorado School of Medicine, UNITED STATES Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
University of Colorado, UNITED STATES PropBank: Semantics of New Predicate Types
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features
Focusing Annotation for Semantic Role Labeling
University of Columbia, Center for Computational Learning Systems, UNITED STATES Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
University of Columbia, UNITED STATES Mapping Between English Strings and Reentrant Semantic Graphs
Towards Automatic Detection of Narrative Structure
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
A Conventional Orthography for Tunisian Arabic
Annotating the MASC Corpus with BabelNet
A Multidialectal Parallel Corpus of Arabic
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
Large Scale Arabic Error Annotation: Guidelines and Framework
University of Concordia, UNITED STATES Improving Entity Linking using Surface Form Refinement
University of Copenhagen, Center for Language Technology, DENMARK Augmenting English Adjective Senses with Supersenses
Using TEI, CMDI and ISOcat in CLARIN-DK
Crowdsourcing and Annotating NER for Twitter #drift
When POS Data Sets Don't Add Up: Combatting Sample Bias
University of Copenhagen, DENMARK SenTube: A Corpus for Sentiment Analysis on YouTube Social Media
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Crowdsourcing and Annotating NER for Twitter #drift
Crowdsourcing as a Preprocessing for Complex Semantic Annotation Tasks
When POS Data Sets Don't Add Up: Combatting Sample Bias
Transfer Learning of Feedback Head Expressions in Danish and Polish Comparable Multimodal Corpora
Encompassing a Spectrum of LT Users in the CLARIN-DK Infrastructure
University of Cornell, UNITED STATES Annotating Relations in Scientific Articles
University of Coventry, UNITED KINGDOM The Pragmatic Annotation of a Corpus of Academic Lectures
University of Dallas, Texas, UNITED STATES SinoCoreferencer: An End-to-End Chinese Event Coreference Resolver
Annotating Inter-Sentence Temporal Relations in Clinical Notes
Clinical Data-Driven Probabilistic Graph Processing
Multilingual eXtended WordNet Knowledge Base: Semantic Parsing and Translation of Glosses
University of Deusto, LinguaMedia Group, SPAIN IXA pipeline: Efficient and Ready to Use Multilingual NLP tools
University of Dublin City, Centre for Global Intelligent Content, IRELAND Cross-Language Authorship Attribution
University of Dublin City, School of Computing, Centre for Global Intelligent Content, IRELAND MTWatch: A Tool for the Analysis of Noisy Parallel Data
University of Dundee, UNITED KINGDOM A Model for Processing Illocutionary Structures and Argumentation in Debates
University of Düsseldorf, GERMANY Discosuite - A Parser Test Suite for German Discontinuous Structures
On Complex Word Alignment Configurations
University of Eastern Michigan ― LINGUIST List, UNITED STATES Visualization of Language Relations and Families: MultiTree
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework
University of Economics and Business, Athens, GREECE/a> A Tool Suite for Creating Question Answering Benchmarks
University of Economics, Prague, CZECH REPUBLIC Towards Linked Hypernyms Dataset 2.0: Complementing DBpedia with Hypernym Discovery
University of Edinburgh, Centre for Speech Technology Research, UNITED KINGDOM RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus
University of Edinburgh, PPLS and HCRC, UNITED KINGDOM Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
University of Edinburgh, Schoolf of Informatics, UNITED KINGDOM Re-using an Argument Corpus to Aid in the Curation of Social Media Collections
University of Edinburgh, UNITED KINGDOM N-gram Counts and Language Models from the Common Crawl
ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT
An Analysis of Older Users' Interactions with Spoken Dialogue Systems
Re-using an Argument Corpus to Aid in the Curation of Social Media Collections
University of Education Karlsruhe, GERMANY A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification
University of Essex, UNITED KINGDOM AraNLP: a Java-based Library for the Processing of Arabic Text.
University of Florence, ITALY The IMAGACT Visual Ontology. an Extendable Multilingual Infrastructure for the Representation of Lexical Encoding of Action
University of Gallaudet, UNITED STATES Expanding N-gram Analytics in ELAN and a Case Study for Sign Synthesis
The Use of a FileMaker Pro Database in Evaluating Sign Language Notation Systems
University of Geneva, SWITZERLAND English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling
A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users
Cross-Linguistic Annotation of Narrativity for English/French Verb Tense Disambiguation
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation
C-PhonoGenre: a 7-hour Corpus of 7 Speaking Styles in French: Relations between Situational Features and Prosodic Properties
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech
University of Georgetown, UNITED STATES A Framework for Public Health Surveillance
University of Ghana, INDIA MultiVal - Towards a Multilingual Valence Lexicon
University of Ghent, BELGIUM Speech Recognition Web Services for Dutch
On the Origin of Errors: a Fine-Grained Analysis of MT and PE Errors and their Relationship
Recognising Suicidal Messages in Dutch Social Media
University of Ghent, Language and Translation Technology Team, BELGIUM Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch
Towards Shared Datasets for Normalization Research
University of Glasgow, UNITED KINGDOM The SSPNet-Mobile Corpus: Social Signal Processing Over Mobile Phones.
Experiences with Parallelisation of an Existing NLP Pipeline: Tagging Hansard
University of Gothenburg and Chalmers University of Technology, SWEDEN Bootstrapping Open-Source English-Bulgarian Computational Dictionary
University of Gothenburg, SWEDEN The DWAN Framework: Application of a Web Annotation Framework for the General Humanities to the Domain of Language Resources
Extracting a bilingual semantic grammar from FrameNet-annotated corpora
HFST-SweNER ― A New NER Resource for Swedish
Bring vs. MTRoget: Evaluating Automatic Thesaurus Translation
Computer-aided Morphology Expansion for Old Swedish
A Flexible Language Learning Platform Based on Language Resources and Web Services
Reusing Swedish FrameNet for Training Semantic Roles
University of Gothenburg/ JSPS/ National Institute of Informatics, SWEDEN A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
University of Granada, SPAIN Definition Patterns for Predicative Terms in Specialized Lexical Resources
A Model to Generate Adaptive Multimodal Job Interviews with a Virtual Recruiter
University of Groningen, NETHERLANDS A Set of Open Source Tools for Turkish Natural Language Processing
Treelet Probabilities for HPSG Parsing and Error Correction
University of Hamburg, GERMANY Adapting a Part-of-Speech Tagset to Non-Standard Text: the Case of STTS
Because Size Does Matter: The Hamburg Dependency Treebank
University of Harvard, UNITED STATES Eliciting and Annotating Uncertainty in Spoken Language
University of Helsinki, Department of Modern Languages, FINLAND HFST-SweNER ― A New NER Resource for Swedish
University of Helsinki, FINLAND Open-domain Interaction and Online Content in the Sami Language
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Computer-aided Morphology Expansion for Old Swedish
ACTIV-ES: a Comparable, Cross-Dialect Corpus of "everyday" Spanish from Argentina, Mexico, and Spain
Heuristic Hyper-minimization of Finite State Lexicons
University of Hildesheim, GERMANY Adapting a Part-of-Speech Tagset to Non-Standard Text: the Case of STTS
University of Iceland, ICELAND The Strategic Impact of META-NET on the Regional, National and International Level
Correcting Errors in a New Gold Standard for Tagging Icelandic Text
An Innovative World Language Centre : Challenges for the Use of Language Technology
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
University of Illinois at Urbana-Champaign, UNITED STATES ILLINOISCLOUDNLP: Text Analytics Services in the Cloud
University of Illinois, UNITED STATES Development of a TV Broadcasts Speech Recognition System for Qatari Arabic
Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech
University of Indiana, UNITED STATES Finite-State Morphological Transducers for Three Kypchak Languages
Guampa: a Toolkit for Collaborative Translation
Discosuite - A Parser Test Suite for German Discontinuous Structures
SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer
SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis
University of International Relations, CHINA The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank
University of Jordan, JORDAN Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody
University of Jyväskylä, FINLAND SLMotion - an Extensible Sign Language Oriented Video Analysis Tool
S-pot - a Benchmark in Spotting Signs Within Continuous Signing
University of Konstanz, GERMANY The CLE Urdu POS Tagset
University of Lancaster, UNITED KINGDOM Detecting Document Structure in a Very Large Corpus of UK Financial Reports
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
Experiences with Parallelisation of an Existing NLP Pipeline: Tagging Hansard
University of Latvia, Faculty of Computing, LATVIA Coreference Resolution for Latvian
University of Latvia, Institute of Mathematics and Computer Science, LATVIA Designing the Latvian Speech Recognition Corpus
Dependency Parsing Representation Effects on the Accuracy of Semantic Applications - an Example of an Inflective Language
University of Latvia, LATVIA Extracting a bilingual semantic grammar from FrameNet-annotated corpora
Terminology Localization Guidelines for the National Scenario
Using C5.0 and Exhaustive Search for Boosting Frame-Semantic Parsing Accuracy
University of Leeds, School of Computing, UNITED KINGDOM Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing
University of Leeds, School of Modern Languages and Cultures, UNITED KINGDOM Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing
University of Leeds, UNITED KINGDOM Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody
Can Crowdsourcing be used for Effective Annotation of Arabic?
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing
University of Ljubljana, SLOVENIA sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks
TweetCaT: a Tool for Building Twitter Corpora of Smaller Languages
University of Lorraine, FRANCE Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
University of Macau, MOROCCO UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation
University of Malta, MALTA Automatic Methods for the Extension of a Bilingual Dictionary using Comparable Corpora
The Strategic Impact of META-NET on the Regional, National and International Level
Crowd-sourcing Evaluation of Automatically Acquired, Morphologically Related Word Groupings
University of Manchester ― National Centre for Text Mining, UNITED KINGDOM Interoperability and Customisation of Annotation Schemata in Argo
The Meta-knowledge of Causality in Biomedical Scientific Discourse
The Strategic Impact of META-NET on the Regional, National and International Level
Locating Requests among Open Source Software Communication Messages
University of Manchester, UNITED KINGDOM Interoperability and Customisation of Annotation Schemata in Argo
The Strategic Impact of META-NET on the Regional, National and International Level
Out in the Open: Finding and Categorising Errors in the Lexical Simplification Pipeline
Locating Requests among Open Source Software Communication Messages
University of Mannheim, GERMANY DBpedia Domains: Augmenting DBpedia with Domain Information
University of Maribor, SLOVENIA Machine Translation for Subtitling: A Large-Scale Evaluation
The Slovene BNSI Broadcast News Database and Reference Speech Corpus GOS: Towards the Uniform Guidelines for Future Work
University of Melbourne, AUSTRALIA Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources
University of Memphis, Department of Computer Science, UNITED STATES Latent Semantic Analysis Models on Wikipedia and TASA
University of Memphis, UNITED STATES On Paraphrase Identification Corpora
The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems
Latent Semantic Analysis Models on Wikipedia and TASA
University of Michigan - Flint, UNITED STATES A Multimodal Dataset for Deception Detection
University of Michigan Ann Arbor, UNITED STATES A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
University of Michigan, UNITED STATES Building a Dataset for Summarization and Keyword Extraction from Emails
Modeling Language Proficiency Using Implicit Feedback
A Multimodal Dataset for Deception Detection
University of Mons, BELGIUM The AV-LASYN Database: a Synchronous Corpus of Audio and 3D Facial Marker Data for Audio-Visual Laughter Synthesis
University of North Texas, UNITED STATES Building a Dataset for Summarization and Keyword Extraction from Emails
Modeling Language Proficiency Using Implicit Feedback
A Multimodal Dataset for Deception Detection
University of Osaka, JAPAN Web-imageability of the Behavioral Features of Basic-level Concepts
University of Oslo, NORWAY Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data
Treelet Probabilities for HPSG Parsing and Error Correction
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows
University of Padua, ITALY A Vector Space Model for Syntactic Distances Between Dialects
University of Pennsylvania, Linguistic Data Consortium, UNITED STATES Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus
The RATS Collection: Supporting HLT Research with Degraded Audio Data
New Directions for Language Resource Development and Distribution
University of Pennsylvania, UNITED STATES RESTful Annotation and Efficient Collaboration
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus
New Directions for Language Resource Development and Distribution
A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic
The Multilingual Paraphrase Database
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
The American Local News Corpus
University of Pisa, ITALY The Making of Ancient Greek WordNet
Crowdsourcing for the Identification of Event Nominals: an Experiment
Bootstrapping an Italian VerbNet: data-driven analysis of verb alternations
Choosing which to Use? A Study of Distributional Models for Nominal Lexical Semantic Classification
University of Pittsburgh, Computer Science Department, UNITED STATES A Comparison of MT Errors and ESL Errors
University of Pittsburgh, Intelligent Systems Program, UNITED STATES A Comparison of MT Errors and ESL Errors
University of Pittsburgh, UNITED STATES A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness
University of Porto - Faculty of Law, School of Criminology, PORTUGAL VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
University of Porto - Faculty of Psychology and Educational Sciences, PORTUGAL VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
University of Rochester, UNITED KINGDOM The CMU METAL Farsi NLP Approach
University of Rome Tor Vergata, ITALY A Meta-data Driven Platform for Semi-automatic Configuration of Ontology Mediators
HuRIC: a Human Robot Interaction Corpus
EMOVO Corpus: an Italian Emotional Speech Database
University of Rome Tor Vergata, Ugo Bordoni Foundation, ITALY EMOVO Corpus: an Italian Emotional Speech Database
University of Saarbrücken, GERMANY Exploring and Visualizing Variation in Language Resources
University of Salerno, ITALY From Natural Language to Ontology Population in the Cultural Heritage Domain. A Computational Linguistics-based approach.
University of Santiago de Compostela, SPAIN CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis
Multilingual Corpora with Coreferential Annotation of Person Entities
University of Sassari, ITALY Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
University of Sfax, ANLP Research group, MIRACL Lab., TUNISIA A Conventional Orthography for Tunisian Arabic
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
University of Sheffield, UNITED KINGDOM Bootstrapping Term Extractors for Multiple Languages
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests
A Quality-based Active Sample Selection Strategy for Statistical Machine Translation
Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis.
Bilingual dictionaries for all EU languages
An Efficient and User-friendly Tool for Machine Translation Quality Estimation
Creating Summarization Systems with SUMMA
Modelling Irony in Twitter: Feature Analysis and Evaluation
University of South Africa, SOUTH AFRICA Missed Opportunities in Translation Memory Matching
University of Southern California, Information Sciences Institute, UNITED STATES Mapping Between English Strings and Reentrant Semantic Graphs
University of Southern California, Institute for Creative Technologies, UNITED STATES The Distress Analysis Interview Corpus of Human and Computer Interviews
A Multimodal Corpus of Rapid Dialogue Games
University of Southern California, SAIL, UNITED STATES Word Semantic Similarity for Morphologically Rich Languages
University of Southern California, UNITED STATES A Corpus of Participant Roles in Contentious Discussions
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
The Distress Analysis Interview Corpus of Human and Computer Interviews
Clustering Tweets using Wikipedia Concepts
University of Southern Denmark, DENMARK ML-Optimization of Ported Constraint Grammars
A 500 Million Word POS-Tagged Icelandic Corpus
University of Southern Denmark, Odense, Department of Language and Communication, DENMARK High Quality Word Lists as a Resource for Multiple Purposes
University of Stanford, UNITED STATES Universal Stanford Dependencies: a Cross-Linguistic Typology
On the Importance of Text Analysis for Stock Price Prediction
A Gold Standard Dependency Corpus for English
N-gram Counts and Language Models from the Common Crawl
Event Extraction Using Distant Supervision
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar
University of Suttgart, Institut für Maschinelle Sprachverarbeitung, GERMANY A Rank-based Distance Measure to Detect Polysemy and to Determine Salient Vector-Space Features for German Prepositions
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data
University of Szeged, HUNGARY Szeged Corpus 2.5: Morphological Modifications in a Manually POS-tagged Hungarian Corpus
4FX: Light Verb Constructions in a Multilingual Parallel Corpus
Automatic Error Detection Concerning the Definite and Indefinite Conjugation in the HunLearner Corpus
University of São Paulo, BRAZIL Generating a Lexicon of Errors in Portuguese to Support an Error Identification System for Spanish Native Learners
A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words
University of Tartu, ESTONIA The Strategic Impact of META-NET on the Regional, National and International Level
Towards an Integration of Syntactic and Temporal Annotations in Estonian
Dense Components in the Structure of WordNet
University of the Basque Country (UPV/EHU), SPAIN The Strategic Impact of META-NET on the Regional, National and International Level
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Basque Speecon-like and Basque SpeechDAT MDB-600: Speech Databases for the Development of ASR Technology for Basque
First Approach toward Semantic Role Labeling for Basque
KALAKA-3: a Database for the Recognition of Spoken European Languages on YouTube Audios
A Stream Computing Approach Towards Scalable NLP
University of Tokyo, JAPAN Corpus for Coreference Resolution on Scientific Papers
University of Tokyo, National Institute of Informatics, JAPAN A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
Corpus for Coreference Resolution on Scientific Papers
Annotation of Computer Science Papers for Semantic Relation Extrac-tion
Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving
University of Torino, Department of Computer Science, ITALY Exploiting Catenae in a Parallel Treebank Alignment
University of Torino, Department of Foreign Languages and Literatures and Modern Cultures, ITALY Exploiting Catenae in a Parallel Treebank Alignment
University of Torino, ITALY Exploiting networks in Law
University of Trento, ITALY Cross-Language Authorship Attribution
SenTube: A Corpus for Sentiment Analysis on YouTube Social Media
A SICK Cure for the Evaluation of Compositional Distributional Semantic Models
The Development of the Multilingual LUNA Corpus for Spoken Language System Porting
University of Turku, FINLAND Universal Stanford Dependencies: a Cross-Linguistic Typology
University of Twente, NETHERLANDS Croatian Memories
Twente Debate Corpus ― A Multimodal Corpus for Head Movement Analysis
University of Tübingen, GERMANY Towards Automatic Quality Assessment of Component Metadata
Automatic Refinement of Syntactic Categories in Chinese Word Structures
How to Tell a Schneemann from a Milchmann: An Annotation Scheme for Compound-Internal Relations
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars
The MERLIN corpus: Learner Language and the CEFR
Adapting a Part-of-Speech Tagset to Non-Standard Text: the Case of STTS
University of Valladolid, SPAIN On the Use of a Fuzzy Classifier to Speed Up the Sp ToBI Labeling of the Glissando Spanish Corpus
Assessment of Non-native Prosody for Spanish as L2 using Quantitative Scores and Perceptual Evaluation
University of Vienna, AUSTRIA The Strategic Impact of META-NET on the Regional, National and International Level
University of Vienna, MODUL, AUSTRIA Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
University of Vigo, SPAIN Introducing a Framework for the Evaluation of Music Detection Tools
The Strategic Impact of META-NET on the Regional, National and International Level
CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis
University of Virginia, UNITED STATES Benchmarking Twitter Sentiment Analysis Tools
University of Warsaw, POLAND Walenty: Towards a Comprehensive Valence Dictionary of Polish
University of Waseda, Faculty of Human Sciences, JAPAN Estimation of Speaking Style in Speech Corpora Focusing on Speech Transcriptionss
University of Washington, UNITED STATES Enriching ODIN
A Finite-State Morphological Analyzer for a Lakota HPSG Grammar
Modern Chinese Helps Archaic Chinese Processing: Finding and Exploiting the Shared Properties
Annotating Clinical Events in Text Snippets for Phenotype Detection
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar
Language CoLLAGE: Grammatical Description with the LinGO Grammar Matrix
University of Western Sydney, The MARCS Institute, AUSTRALIA AusTalk: an Audio-Visual Corpus of Australian English
The Alveo Virtual Laboratory: A Web based Repository API
University of Wolverhampton, Research Institute in Information and Language Processing, UNITED KINGDOM Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing
University of Wolverhampton, UNITED KINGDOM Using a Machine Learning Model to Assess the Complexity of Stress Systems
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing
Mapping CPA Patterns onto OntoNotes Senses
University of Yamanashi, JAPAN Annotating the Focus of Negation in Japanese Text
University of York, UNITED KINGDOM DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment
University of Zagreb, CROATIA HiEve: A Corpus for Extracting Event Hierarchies from News Stories
CroDeriV: a New Resource for Processing Croatian Morphology
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
Language Processing Infrastructure in the XLike Project
The SETimes.HR Linguistically Annotated Corpus of Croatian
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
Quality Estimation for Synthetic Parallel Data Generation
TweetCaT: a Tool for Building Twitter Corpora of Smaller Languages
caWaC - a Web Corpus of Catalan and its Application to Language Modeling and Machine Translation
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation
University of Zagreb, Faculty of Electrical Engineering and Computing, CROATIA HiEve: A Corpus for Extracting Event Hierarchies from News Stories
DerivBase.hr: A High-Coverage Derivational Morphology Resource for Croatian
BiographyNet: Methodological Issues when NLP Supports Historical Research
Hope and Fear: How Opinions Influence Factuality
University of Zagreb, Faculty of Humanities and Social Sciences, CROATIA CroDeriV: a New Resource for Processing Croatian Morphology
The Strategic Impact of META-NET on the Regional, National and International Level
Language Processing Infrastructure in the XLike Project
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation
University of Zurich, SWITZERLAND Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs
Using Large Biomedical Databases as Gold Annotations for Automatic Relation Extraction
Zmorge: A German Morphological Lexicon Extracted from Wiktionary
A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users
Constructing and Exploiting an Automatically Annotated Resource of Legislative Texts
Innovations in Parallel Corpus Search Tools
University of Lódz, POLAND The Strategic Impact of META-NET on the Regional, National and International Level
Università Cattolica del Sacro Cuore, Centro Interdisciplinare di Ricerche per la Computerizzazione dei Segni dell’Espressione, ITALY Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D
Università Cattolica del Sacro Cuore, ITALY A Compact Interactive Visualization of Dependency Treebank Query Results
Università degli studi di Torino, ITALY Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web
Università di Modena e Reggio Emilia, ITALY Presenting a System of Human-Machine Interaction for Performing Map Tasks.
Università di Pavia, ITALY T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing
Università di Pisa, ITALY Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies
Università di Salerno, DSPSC, ITALY From Natural Language to Ontology Population in the Cultural Heritage Domain. A Computational Linguistics-based approach.
Università di Torino, ITALY Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies
Università di Trento, ITALY A SICK Cure for the Evaluation of Compositional Distributional Semantic Models
Universität Bielefeld, GERMANY ALICO: a Multimodal Corpus for the Study of Active Listening
Annotation Pro + TGA: Automation of Speech Timing Analysis
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0
The USAGE Review Corpus for Fine Grained Multi Lingual Opinion Analysis
Universität Des Saarlandes, Saarbrücken, Word Alignment-Based Reordering of Source Chunks in PB-SMT
Universität des Saarlandes, Institut der Gesellschaft zur Förderung der Angewandten Informationsforschung, GERMANY Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
Universität Frankfurt, GERMANY ColLex.en: Automatically Generating and Evaluating a Full-form Lexicon for English
Universität Hamburg, GERMANY Because Size Does Matter: The Hamburg Dependency Treebank
Universität Hildesheim, GERMANY The eIdentity Text Exploration Workbench
Universität Leipzig, Agile Knowledge Engineering and Semantic Web research Group, GERMANY Global Intelligent Content: Active Curation of Language Resources using Linked Data
NIF4OGGD - NLP Interchange Format for Open German Governmental Data
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
Universität Leipzig, Department of English, GERMANY High Quality Word Lists as a Resource for Multiple Purposes
Universität Leipzig, GERMANY The Making of Ancient Greek WordNet
PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and CEllphone
Vocabulary-Based Language Similarity using Web Corpora
Learning from Domain Complexity
Creating a Gold Standard Corpus for the Extraction of Chemistry-Disease Relations from Patent Texts
High Quality Word Lists as a Resource for Multiple Purposes
NIF4OGGD - NLP Interchange Format for Open German Governmental Data
A 500 Million Word POS-Tagged Icelandic Corpus
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
Open Philology at the University of Leipzig
A Tool Suite for Creating Question Answering Benchmarks
Universität Leipzig, Natural Language Processing Group, Computer Science Department, GERMANY Vocabulary-Based Language Similarity using Web Corpora
High Quality Word Lists as a Resource for Multiple Purposes
A 500 Million Word POS-Tagged Icelandic Corpus
Universität Munchen, Institute of Phonetics and Speech Processing, GERMANY The MMASCS Multi-Modal Annotated Synchronous Corpus of Audio, Video, Facial Motion and Tongue Motion Data of Normal, Fast and Slow Speech
Online Experiments with the Percy Software Framework - Experiences and some Early Results
Universität München, GERMANY Introducing a Web Application for Labeling, Visualizing Speech and Correcting Derived Speech Signals
Universität Potsdam, GERMANY The KiezDeutsch Korpus (KiDKo) Release 1.0
A German Twitter Snapshot
The eIdentity Text Exploration Workbench
Potsdam Commentary Corpus 2.0: Annotation for Discourse Research
A Model for Processing Illocutionary Structures and Argumentation in Debates
GraPAT: a Tool for Graph Annotations
Universität Stuttgart, GERMANY Parsing Heterogeneous Corpora with a Rich Dependency Grammar
Automatic Extraction of Synonyms for German Particle Verbs from Parallel Data with Distributional Similarity as a Re-Ranking Feature
A Corpus of Comparisons in Product Reviews
Textual Emigration Analysis (TEA)
An Out-of-Domain Test Suite for Dependency Parsing of German
Textual Emigration Analysis (TEA)
The eIdentity Text Exploration Workbench
Mapping Between English Strings and Reentrant Semantic Graphs
Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
A Rank-based Distance Measure to Detect Polysemy and to Determine Salient Vector-Space Features for German Prepositions
UnixMan Corpus: A Resource for Language Learning in the Unix Domain
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data
The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution
Optimizing a Distributional Semantic Model for the Prediction of German Particle Verb Compositionality
Universität Stuttgart, Institute for Natural Language Processing, GERMANY Exploring the Utility of Coreference Chains for Improved Identification of Personal Names
Universität Tübingen, GERMANY CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
The MERLIN corpus: Learner Language and the CEFR
A Compact Interactive Visualization of Dependency Treebank Query Results
Universität Bielefeld, GERMANY Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0
Université Aix-Marseille, CNRS, FRANCE Representing Multimodal Linguistic Annotated Data
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
Segmentation Evaluation Metrics, a Comparison Grounded on Prosodic and Discourse Units
Université Aix-Marseille, FRANCE A Graph-Based Approach for Computing Free Word Associations
Corpus-Based Computation of Reverse Associations
Using Word Familiarities and Word Associations to Measure Corpus Representativeness
A Collection of Scholarly Book Reviews from the Platforms of Electronic Sources in Humanities and Social Sciences OpenEdition.org
Université Aix-Marseille, Laboratoire d'Informatique Fondamentale, FRANCE A Collection of Scholarly Book Reviews from the Platforms of Electronic Sources in Humanities and Social Sciences OpenEdition.org
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
Université Aix-Marseille, Laboratoire des Sciences de l'Information et des Systèmes, FRANCE A Collection of Scholarly Book Reviews from the Platforms of Electronic Sources in Humanities and Social Sciences OpenEdition.org
Université Aix-Marseille, Laboratoire Parole et Langage, CNRS, FRANCE Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
Université Catholique de Louvain, BELGIUM HiEve: A Corpus for Extracting Event Hierarchies from News Stories
FLELex: a graded Lexical Resource for French Foreign Learners
Linking Pictographs to Synsets: Sclera2Cornetto
Speech Recognition Web Services for Dutch
Linguistic Resources and Cats: How to Use ISOcat, RELcat and SCHEMAcat
Multiple Choice Question Corpus Analysis for Distractor Characterization
TermWise: A CAT-tool with Context-Sensitive Terminological Support.
Praaline: Integrating Tools for Speech Corpus Research
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech
Université Catholique de Louvain, CCL, BELGIUM Polish Coreference Corpus in Numbers
The Polish Summaries Corpus
Digital Library 2.0: Source of Knowledge and Research Collaboration Platform
The Strategic Impact of META-NET on the Regional, National and International Level
Measuring Readability of Polish Texts: Baseline Experiments
The Procedure of Lexico-Semantic Annotation of Składnica Treebank
Projection-based Annotation of a Polish Dependency Treebank
Université d'Orléans, Laboratoire Ligérien de Linguistique, FRANCE ANCOR_Centre, a Large Free Spoken French Coreference Corpus: Description of the Resource and Reliability Measures
Université de Bretagne-Sud, FRANCE A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
Co-clustering of Bilingual Datasets as a Mean for Assisting the Construction of Thematic Bilingual Comparable Corpora
Université de Genève, Laboratoire d'Analyse et de Traitement du Langage ― Centre Universitaire d'Informatique, SWITZERLAND SwissAdmin: a Multilingual Tagged Parallel Corpus of Press Releases
A Language-independent and fully Unsupervised Approach to Lexicon Induction and Part-of-Speech Tagging for Closely Related Languages
Université de Genève, SWITZERLAND Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation
Using a Serious Game to Collect a Child Learner Speech Corpus
Université de Grenoble, Groupe d'Étude pour la Traduction/le Traitement Automatique des Langues et de la Parole ― Laboratoire d'Informatique de Grenoble, FRANCE MotàMot Project: Conversion of a French-Khmer Published Dictionary for Building a Multilingual Lexical System
Université de Lorraine / LORIA, FRANCE Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
Deep Syntax Annotation of the Sequoia French Treebank
Mapping the Lexique des Verbes du français (Lexicon of French Verbs) to a NLP Lexicon using Examples
Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A Purpose
Université de Montréal, CANADA An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus
Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government
Université de Montréal, Observatoire de linguistique Sens-Texte, CANADA Definition Patterns for Predicative Terms in Specialized Lexical Resources
Discovering Frames in Specialized Domains
Université du Maine, LIUM, FRANCE A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
"Human Annotation of ASR Error Regions: is ""gravity"" a Sharable Concept for Human Annotators?"
Université Européenne de Bretagne, IRISA, FRANCE ANCOR_Centre, a Large Free Spoken French Coreference Corpus: Description of the Resource and Reliability Measures
Université François Rabelais de Tours, LI, FRANCE ANCOR_Centre, a Large Free Spoken French Coreference Corpus: Description of the Resource and Reliability Measures
Polish Coreference Corpus in Numbers
Université Montpellier 1, Centre d’Etudes Politiques de l’Europe Latine, FRANCE Investigating the Image of Entities in Social Media: Dataset Design and First Results
Université Montpellier 2, LIRMM, FRANCE Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
Université Paris 3 - Sorbonne Nouvelle, LaTTiCe, FRANCE ACTIV-ES: a Comparable, Cross-Dialect Corpus of "everyday" Spanish from Argentina, Mexico, and Spain
Macrosyntactic Segmenters of a French Spoken Corpus
Université Paris Diderot / INRIA ― Alpage, FRANCE Deep Syntax Annotation of the Sequoia French Treebank
Developing a French FrameNet: Methodology and First results
Université Paris Diderot, Centre de Linguistique Inter-langues, de Lexicologie, de Linguistique Anglaise et de Corpus , FRANCE Using a Machine Learning Model to Assess the Complexity of Stress Systems
Université Paris Diderot, FRANCE Adapting VerbNet to French using Existing Resources
Semantic Clustering of Pivot Paraphrases
Extending the Coverage of a MWE Database for Persian CPs Exploiting Valency Alternations
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech
Université Paris Ouest Nanterre, FRANCE Macrosyntactic Segmenters of a French Spoken Corpus
Université Paris Sorbonne (Paris IV), FRANCE Deep Syntax Annotation of the Sequoia French Treebank
Université Paris Sud / CNRS ― LIMSI, FRANCE A Corpus of Machine Translation Errors Extracted from Translation Students Exercises
TVD: A Reproducible and Multiply Aligned TV Series Dataset
Université Paris-Sud / LIMSI-CNRS, FRANCE Rule-based Reordering Space in Statistical Machine Translation
Université Saint-Louis, BELGIUM Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
Université Sorbonne Nouvelle, FRANCE Extending the Coverage of a MWE Database for Persian CPs Exploiting Valency Alternations
Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie
Université Toulouse 2 Le Mirail & IRIT, FRANCE VOAR: A Visual and Integrated Ontology Alignment Environment
Uppsala Universitet, SWEDEN Universal Stanford Dependencies: a Cross-Linguistic Typology
Linguistic Landscaping of South Asia using Digital Language Resources: Genetic vs. Areal Linguistics
ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT
Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus
A Flexible Language Learning Platform Based on Language Resources and Web Services
Uppsala University, Department of Linguistics and Philology, SWEDEN A Persian Treebank with Stanford Typed Dependencies
UPS, FRANCE A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
UPV/EHU, SPAIN Predicate Matrix: extending SemLink through WordNet mappings
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools
USC ICT, UNITED STATES The Distress Analysis Interview Corpus of Human and Computer Interviews
Utrecht University, NETHERLANDS The Strategic Impact of META-NET on the Regional, National and International Level
CLARIN-NL: Major Results
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
Utrecht University, Université Catholique de Louvain, NETHERLANDS Experiences with the ISOcat Data Category Registry

 

V
Vassar College, UNITED STATES The Language Application Grid
Verbatim Analysis - Vera & Inria, FRANCE An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora
Viavoo & Université Paris-Diderot, FRANCE A Language-independent Approach to Extracting Derivational Relations from an Inflectional Lexicon
Vicomtech-IK4, SPAIN SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling
Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling
Machine Translation for Subtitling: A Large-Scale Evaluation
Generating Polarity Lexicons with WordNet Propagation in 5 Languages
Voice & Script International, UNITED KINGDOM Machine Translation for Subtitling: A Large-Scale Evaluation
VoiceInteraction, PORTUGAL SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling
Vrije Universiteit Amsterdam, NETHERLANDS BiographyNet: Methodological Issues when NLP Supports Historical Research
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web
Hope and Fear: How Opinions Influence Factuality
NewsReader: Recording History from Daily News Streams
Discovering and Visualising Stories in News
Using a Sledgehammer to Crack a Nut? Lexical Diversity and Event Coreference Resolution
Generating Polarity Lexicons with WordNet Propagation in 5 Languages
VTEX, BRAZIL Language Editing Dataset of Academic Texts

 

W
Warsaw School of Social Sciences and Humanities, POLAND Digital Library 2.0: Source of Knowledge and Research Collaboration Platform
Measuring Readability of Polish Texts: Baseline Experiments
Waseda University, JAPAN Production of Phrase Tables in 11 European Languages using an Improved Sub-sentential Aligner
Web Research Group, UPF & Yahoo Labs, Barcelona, SPAIN DysList: An Annotated Resource of Dyslexic Errors
Wroclaw University of Technology, POLAND Ruled-based, Interlingual Motivated Mapping of plWordNet onto SUMO Ontology

 

X
XRCE, FRANCE Investigating the Image of Entities in Social Media: Dataset Design and First Results

 

Y
Yahoo Labs, UNITED STATES VERTa: Facing a Multilingual Experience of a Linguistically-based MT Evaluation
Yamagata University, JAPAN Multimodal Dialogue Segmentation with Gesture Post-Processing
Yandex, RUSSIAN FEDERATION Measuring the Impact of Spelling Errors on the Quality of Machine Translation
Yeungnam University, SOUTH KOREA Named Entity Corpus Construction using Wikipedia and DBpedia Ontology

 

Z
Zurich University of Applied Sciences, SWITZERLAND Meta-Classifiers Easily Improve Commercial Sentiment Detection Tools

 

Powered by ELDA © 2014 ELDA/ELRA