AUTHORS: Browse articles of the conference sorted by author

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z

A
Abad, Alberto The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.
The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
SPA: Web-based Platform for easy Access to Speech Processing Modules
Abanmy, Nora MADAD: A Readability Annotation Tool for Arabic Text
Abbas, Noorhan Compilation of an Arabic Children’s Corpus
Abbott, Rob Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it
Abdelali, Ahmed Arabic to English Person Name Transliteration using Twitter
Abdulrahim, Dana A Large Scale Corpus of Gulf Arabic
Abercrombie, Gavin A Rule-based Shallow-transfer Machine Translation System for Scots and English
Abouammoh, Murad Creation of comparable corpora for English-{Urdu, Arabic, Persian}
Abouda, Lotfi Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Abromeit, Frank Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data
Acar, Elif Ahsen A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability
Ackermann, Markus FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Adda-Decker, Martine French Learners Audio Corpus of German Speech (FLACGS)
Adda, Gilles The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Adeel Nawab, Rao Muhammad UPPC - Urdu Paraphrase Plagiarism Corpus
Adesam, Yvonne A Multi-domain Corpus of Swedish Word Sense Annotation
Adolphs, Peter SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German
Adouane, Wafia Gulf Arabic Linguistic Resource Building for Sentiment Analysis
Afantenos, Stergos Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus
Parallel Discourse Annotations on a Corpus of Short Texts
Afli, Haithem Using SMT for OCR Error Correction of Historical Texts
Aga, Rosa Tsegaye Learning Thesaurus Relations from Distributional Features
Agić, Željko New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
Agirre, Eneko QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models
Addressing the MFS Bias in WSD systems
Evaluating Translation Quality and CLIR Performance of Query Sessions
A comparison of Named-Entity Disambiguation and Word Sense Disambiguation
Agnès, Frédéric A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
Agosti, Maristella Designing A Long Lasting Linguistic Project: The Case Study of ASIt
Ah-Pine, Julien Hypergraph Modelization of a Syntactically Annotated English Wikipedia Dump
Aichinger, Philipp A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices
Aizawa, Akiko English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting
Typed Entity and Relation Annotation on Computer Science Papers
Aizawa, Masao Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation
Ajili, Moez FABIOLE, a Speech Database for Forensic Speaker Comparison
Akarun, Lale BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
Aker, Ahmet What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Creation of comparable corpora for English-{Urdu, Arabic, Persian}
Akhtar, Md Shad Aspect based Sentiment Analysis in Hindi: Resource Creation and Evaluation
Alageel, Sinaa MADAD: A Readability Annotation Tool for Arabic Text
Alagić, Domagoj Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation
Alam, Firoj Multilevel Annotation of Agreement and Disagreement in Italian News Blogs
Alba Castro, José Luis CORILSE: a Spanish Sign Language Repository for Linguistic Analysis
Al-Badrashiny, Mohamed Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
Albogamy, Fahad Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping
Aldabe, Itziar A Multilingual Predicate Matrix
Al-Dayel, Abeer MADAD: A Readability Annotation Tool for Arabic Text
Alegria, Iñaki Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
TweetMT: A Parallel Microblog Corpus
Evaluating Translation Quality and CLIR Performance of Query Sessions
Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation
Alex, Beatrice Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations
Alghamdi, Ayman An Empirical Study of Arabic Formulaic Sequence Extraction Methods
Compilation of an Arabic Children’s Corpus
AlGhamdi, Fahad Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Algra, Jouke A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Alharbi, Ghada The OpenCourseWare Metadiscourse (OCWMD) Corpus
Alhelbawy, Ayman Towards a Corpus of Violence Acts in Arabic Social Media
Alikaniotis, Dimitrios Predicting Author Age from Weibo Microblog Posts
Al-Khalifa, Hend MADAD: A Readability Annotation Tool for Arabic Text
Al-Khalil, Muhamed Exploiting Arabic Diacritization for High Quality Automatic Annotation
AlMarwani, Nada Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Almeida, Hayda SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking
Almeida, José João Enriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary
Alonso, Miguel A. EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis
Alqahtani, Sawsan Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
Al-Shargi, Faisal Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
Al-Shenaifi, Nouf MADAD: A Readability Annotation Tool for Arabic Text
Al-Sulaiti, Latifa Compilation of an Arabic Children’s Corpus
Altuna, Begoña MEANTIME, the NewsReader Multilingual Event and Time Corpus
Al-Twairesh, Nora MADAD: A Readability Annotation Tool for Arabic Text
Alva-Manchengo, Fernando Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish
Alvarez, Aitor Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles
Alves, Ana Can Topic Modelling benefit from Word Sense Information?
Al-Yahya, Maha MADAD: A Readability Annotation Tool for Arabic Text
Al Zaatari, Ayman Arabic Corpora for Credibility Analysis
Aman, Frederic Ecological Gestures for HRI: the GEE Corpus
CirdoX: an on/off-line multisource speech and sound analysis software
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Amanova, Dilafruz Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification
Amaral, Daniela Summ-it++: an Enriched Version of the Summ-it Corpus
Amilevičius, Darius NLP Infrastructure for the Lithuanian Language
Amitabh, Unnayan A Machine Learning based Music Retrieval and Recommendation System
Amsler, Michael Sentiframes: A Resource for Verb-centered German Sentiment Inference
Anand, Pranav Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it
Ananiadou, Sophia Identifying Content Types of Messages Related to Open Source Software Projects
Ensemble Classification of Grants using LDA-based Features
Andersson, Linda Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation
Andersson, Marta Annotating Topic Development in Information Seeking Queries
Andriamakaoly, Jérémy Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context
Andringa, Maaike A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Andrzejczuk, Anna Semantic Layer of the Valence Dictionary of Polish Walenty
Anikina, Tatjana InScript: Narrative texts annotated with script information
Antoine, Jean-Yves Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
António Rodrigues, João Use of Domain-Specific Language Resources in Machine Translation
Bootstrapping a Hybrid MT System to a New Language Pair
Antonitsch, André Summ-it++: an Enriched Version of the Summ-it Corpus
Antunes, Sandra The COPLE2 corpus: a learner corpus for Portuguese
Anwar, Maaz Towards Building Semantic Role Labeler for Indian Languages
A Proposition Bank of Urdu
Apidianaki, Marianna Datasets for Aspect-Based Sentiment Analysis in French
Aranberri, Nora QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
TweetMT: A Parallel Microblog Corpus
Tools and Guidelines for Principled Machine Translation Development
Arauco, Alejandro Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects
Araujo, Lourdes A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers
A R, Balamurali Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations
Arcan, Mihael PE2rr Corpus: Manual Error Annotation of Automatically Pre-annotated MT Post-edits
IRIS: English-Irish Machine Translation System
Archer, Dawn Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Ariga, Michiaki A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research
Arimoto, Yoshiko Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization
Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication Corpus
Arndt, Natanael Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory
Arndt, Timotheus Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory
Aroyo, Lora The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound Database
GRaSP: A Multilayered Annotation Scheme for Perspectives
Crowdsourcing Salient Information from News and Tweets
Arppe, Antti Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida
Arsevska, Elena Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
Artola, Xabier Two Architectures for Parallel Processing of Huge Amounts of Text
Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface
Artstein, Ron ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions
The Negochat Corpus of Human-agent Negotiation Dialogues
Arzelus, Haritz Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles
Asahara, Masayuki Universal Dependencies for Japanese
Asano, Hisako Name Translation based on Fine-grained Named Entity Recognition in a Single Language
Asher, Nicholas Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus
Parallel Discourse Annotations on a Corpus of Short Texts
Aslam, Saba Urdu Summary Corpus
Asooja, Kartik Forecasting Emerging Trends from Scientific Literature
Athanasakou, Vasiliki Learning Tone and Attribution for Financial Text Mining
Attardi, Giuseppe Adapting the TANL tool suite to Universal Dependencies
Attia, Mohammed Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
Atwell, Eric An Empirical Study of Arabic Formulaic Sequence Extraction Methods
Compilation of an Arabic Children’s Corpus
Auberge, Veronique Ecological Gestures for HRI: the GEE Corpus
Aufrant, Lauriane Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on Romanian
Augenstein, Isabelle Monolingual Social Media Datasets for Detecting Contradiction and Entailment
Augustinus, Liesbeth AfriBooms: An Online Treebank for Afrikaans
Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions
Auziņa, Ilze Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian
Avgustinova, Tania Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility
Avramidis, Eleftherios Tools and Guidelines for Principled Machine Translation Development
Aziz, Wilker Cohere: A Toolkit for Local Coherence
Azpeitia, Andoni Exploiting a Large Strongly Comparable Corpus

 

B
Babych, Bogdan MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment
Bachan, Jolanta Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis
Baeza-Yates, Ricardo CASSAurus: A Resource of Simpler Spanish Synonyms
Baisa, Vít Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
European Union Language Resources in Sketch Engine
VPS-GradeUp: Graded Decisions on Usage Patterns
Balahur, Alexandra Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties
Baldwin, Timothy Evaluating a Topic Modelling Approach to Measuring Corpus Similarity
Balenciaga, Marina Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles
Bali, Kalika Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments
Banea, Carmen Building a Dataset for Possessions Identification in Text
Banjade, Rajendra SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores
DT-Neg: Tutorial Dialogues Annotated for Negation Scope and Focus in Context
Banski, Piotr Corpus Query Lingua Franca (CQLF)
KorAP Architecture ― Diving in the Deep Sea of Corpus Data
Baptista, Jorge metaTED: a Corpus of Metadiscourse for Spoken Language
Barackman, Casey PersonaBank: A Corpus of Personal Narratives and Their Story Intention Graphs
Barancikova, Petra Manual and Automatic Paraphrases for MT Evaluation
Barbagli, Alessia CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence
Barbieri, Francesco What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis
Barbu Mititelu, Verginica The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language
Bargmann, Sascha PARSEME Survey on MWE Resources
Barker, Emma What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Barras, Claude Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Barreaux, Sabine TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation
Barreiro, Anabela Port4NooJ v3.0: Integrated Linguistic Resources for Portuguese NLP
Bartie, Phil The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes
Bartolini, Roberto LREC as a Graph: People and Resources in a Network
Bartosiak, Tomasz Accessing and Elaborating Walenty - a Valence Dictionary of Polish - via Internet Browser
Semantic Layer of the Valence Dictionary of Polish Walenty
Barzdins, Guntis Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
Basile, Angelo D(H)ante: A New Set of Tools for XIII Century Italian
Basili, Roberto A Language Independent Method for Generating Large Scale Polarity Lexicons
Batanović, Vuk Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset
Bateman, Leila Building Language Resources for Exploring Autism Spectrum Disorders
Batista, Fernando SPA: Web-based Platform for easy Access to Speech Processing Modules
Batliner, Anton Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Battistelli, Delphine Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Baumann, Timo Mining the Spoken Wikipedia for Speech Data and Beyond
Baumgartner Jr., William A. SuperCAT: The (New and Improved) Corpus Analysis Toolkit
Baur, Claudia A Shared Task for Spoken CALL?
Bayol, Clarisse Ecological Gestures for HRI: the GEE Corpus
Bayyr-ool, Aziyana A Finite-state Morphological Analyser for Tuvan
Béchet, Frédéric Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks
Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations
Becker, Alex A Web Tool for Building Parallel Corpora of Spoken and Sign Languages
Bedjeti, Adriatik A Corpus of Images and Text in Online News
Bedrick, Steven On Developing Resources for Patient-level Information Retrieval
Begum, Rafiya Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments
Behera, Pitambar Issues and Challenges in Annotating Urdu Action Verbs on the IMAGACT4ALL Platform
Beijer, Lilian A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research
Bejček, Eduard MWEs in Treebanks: From Survey to Guidelines
Distribution of Valency Complements in Czech Complex Predicates: Between Verb and Noun
Bekavac, Marko Graph-Based Induction of Word Senses in Croatian
Bekkadja, Slima The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Bell, Dane Sieve-based Coreference Resolution in the Biomedical Domain
Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness
Bellot, Patrice Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers
Bel, Núria Using Contextual Information for Machine Translation Evaluation
Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
Assessing the Potential of Metaphoricity of verbs using corpus data
Towards producing bilingual lexica from monolingual corpora
Beloki, Zuhaitz Two Architectures for Parallel Processing of Huge Amounts of Text
Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface
Beltrami, Daniela Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
Ben Abacha, Asma Annotating Named Entities in Consumer Health Questions
Benikova, Darina SemRelData ― Multilingual Contextual Annotation of Semantic Relations between Nominals: Dataset and Guidelines
Ben Jannet, Mohamed Ameur Generating Task-Pertinent sorted Error Lists for Speech Recognition
Benko, Vladimír Two Years of Aranea: Increasing Counts and Tuning the Pipeline
Bentivogli, Luisa WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words
Bentz, Christian Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora
Berard, Alexandre MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP
Berkling, Kay Corpus for Children’s Writing with Enhanced Output for Specific Spelling Patterns (2nd and 3rd Grade)
Bernard, Guillaume FABIOLE, a Speech Database for Forensic Speaker Comparison
Bernotat, Jasmin How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Bertero, Dario Deep Learning of Audio and Language Features for Humor Prediction
Bertrand, Roxane Laughter in French Spontaneous Conversational Dialogs
A CUP of CoFee: A large Collection of feedback Utterances Provided with communicative function annotations
Besacier, Laurent A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP
Besançon, Romaric A Dataset for Open Event Extraction in English
Beskow, Jonas A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction
Bethard, Steven Age and Gender Prediction on Health Forum Data
A Semantically Compositional Annotation Scheme for Time Normalization
Betz, Simon DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
Bhat, Riyaz Ahmad A Proposition Bank of Urdu
Bhattacharya, Pushpak Synset Ranking of Hindi WordNet
Multiword Expressions Dataset for Indian Languages
Bhattacharyya, Pushpak Lexical Resources to Enrich English Malayalam Machine Translation
That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models
Aspect based Sentiment Analysis in Hindi: Resource Creation and Evaluation
SlangNet: A WordNet like resource for English Slang
Bhingardive, Sudha Synset Ranking of Hindi WordNet
Multiword Expressions Dataset for Indian Languages
Biagioni, Stefania Two Decades of Terminology: European Framework Programmes Titles
Bianchi, Francesca Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Bick, Eckhard A Morphological Lexicon of Esperanto with Morpheme Frequencies
Biemann, Chris SemRelData ― Multilingual Contextual Annotation of Semantic Relations between Nominals: Dataset and Guidelines
Domain-Specific Corpus Expansion with Focused Webcrawling
Bierkandt, Lennart Enriching TimeBank: Towards a more precise annotation of temporal relations in a text
Bies, Ann Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Rapid Development of Morphological Analyzers for Typologically Diverse Languages
Parallel Chinese-English Entities, Relations and Events Corpora
Bigenzahn, Wolfgang A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices
Bigi, Brigitte The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Laughter in French Spontaneous Conversational Dialogs
Billawala, Youssef Extractive Summarization under Strict Length Constraints
Bingel, Joachim KorAP Architecture ― Diving in the Deep Sea of Corpus Data
Bittar, André Emotion Analysis on Twitter: The Hidden Challenge
Bizer, Christian A Large DataBase of Hypernymy Relations Extracted from the Web.
Blache, Philippe MarsaGram: an excursion in the forests of parsing trees
4Couv: A New Treebank for French
Black, Alan W Speech Synthesis of Code-Mixed Text
Blain, Frédéric Phrase Level Segmentation and Labelling of Machine Translation Errors
Blanco, Eduardo Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles
Bleicken, Julian Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data
Bobillier Chaumon, Marc-Eric The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Bod, Rens POS-tagging of Historical Dutch
Boella, Guido Automatic Enrichment of WordNet with Common-Sense Knowledge
Bogantes, Diana Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects
Bonastre, Jean-françois FABIOLE, a Speech Database for Forensic Speaker Comparison
Bond, Francis The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Wow! What a Useful Extension! Introducing Non-Referential Concepts to Wordnet
Bonial, Claire Comprehensive and Consistent PropBank Light Verb Annotation
Bonneau, Anne The IFCASL Corpus of French and German Non-native and Native Read Speech
Bontcheva, Kalina Challenges of Evaluating Sentiment Analysis Tools on Social Media
Monolingual Social Media Datasets for Detecting Contradiction and Entailment
Borchmann, Łukasz “He Said She Said” ― a Male/Female Corpus of Polish
Bordea, Georgeta Forecasting Emerging Trends from Scientific Literature
Boroș, Tiberiu The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language
Bosco, Cristina Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola
Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous
Bosc, Tom DART: a Dataset of Arguments and their Relations on Twitter
Bott, Stefan GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds
Bouakaz, Saïda The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Bouamor, Dhouha Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality
Managing Linguistic and Terminological Variation in a Medical Dialogue System
Bouamor, Houda Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
DALILA: The Dialectal Arabic Linguistic Learning Assistant
Boudin, Florian TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation
Bougouin, Adrien TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation
Bouhafs Hafsia, Asma Integration of Lexical and Semantic Knowledge for Sentiment Analysis in SMS
Bouma, Gerlof A Multi-domain Corpus of Swedish Word Sense Annotation
Bourlon, Antoine Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons
Bowden, Kevin PersonaBank: A Corpus of Personal Narratives and Their Story Intention Graphs
A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives
Boye, Johan SpaceRef: A corpus of street-level geographic descriptions
Bozşahin, Cem A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability
Braasch, Anna The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Branco, António QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models
Use of Domain-Specific Language Resources in Machine Translation
CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese
Bootstrapping a Hybrid MT System to a New Language Pair
Evaluating Machine Translation in a Usage Scenario
Brandes, Jasper Effect Functors for Opinion Inference
Brasoveanu, Adrian A Regional News Corpora for Contextualized Entity Discovery and Linking
Braunger, Patricia A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems
Bredin, Hervé Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Brierley, Claire An Empirical Study of Arabic Formulaic Sequence Extraction Methods
Compilation of an Arabic Children’s Corpus
Bristot, Antonella ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions
Broadwell, George Aaron ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
Brognaux, Sandrine Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis
Brugman, Hennie Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora
Brümmer, Martin DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training Corpus
Bruneau, Pierrick Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Brunson, Mary Introducing the LCC Metaphor Datasets
Buchner, Karolina Extractive Summarization under Strict Length Constraints
Budnik, Mateusz The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Budzynska, Katarzyna A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive Issues
Buitelaar, Paul Forecasting Emerging Trends from Scientific Literature
Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text
IRIS: English-Irish Machine Translation System
Bunt, Harry The DialogBank
Burchardt, Aljoscha Evaluating Machine Translation in a Usage Scenario
Tools and Guidelines for Principled Machine Translation Development
Burga, Alicia Towards Multiple Antecedent Coreference Resolution in Specialized Discourse
Burghardt, Manuel Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing
Burgos, Pepi Palabras: Crowdsourcing Transcriptions of L2 Speech
Burkhardt, Felix A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance
Buscaldi, Davide Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature
Busso, Lucia Italian VerbNet: A Construction-based Approach to Italian Verb Classification
Buttery, Paula Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora
Predicting Author Age from Weibo Microblog Posts

 

C
Cabeza-Pereiro, María del Carmen CORILSE: a Spanish Sign Language Repository for Linguistic Analysis
Cabrio, Elena DART: a Dataset of Arguments and their Relations on Twitter
Caines, Andrew Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora
Predicting Author Age from Weibo Microblog Posts
Cajal, Sergio Towards Multiple Antecedent Coreference Resolution in Specialized Discourse
Cakmak, Huseyin AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis
Calixto, Iacer Developing a Dataset for Evaluating Approaches for Document Expansion with Images
Calvo, Arturo Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects
Calzà, Laura Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
Calzolari, Nicoletta New Developments in the LRE Map
LREC as a Graph: People and Resources in a Network
Camacho-Collados, José A Large-Scale Multilingual Disambiguation of Glosses
Camelin, Nathalie Word Embedding Evaluation and Combination
Camgöz, Necati Cihan BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
Campbell, Nick Capturing Chat: Annotation and Tools for Multiparty Casual Conversation.
The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus
CHATR the Corpus; a 20-year-old archive of Concatenative Speech Synthesis
Campillos Llanos, Leonardo Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality
Managing Linguistic and Terminological Variation in a Medical Dialogue System
Campos, Marisa CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese
Candeias, Sara A Web Tool for Building Parallel Corpora of Spoken and Sign Languages
The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation
Candito, Marie Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology
A General Framework for the Annotation of Causality Based on FrameNet
Hard Time Parsing Questions: Building a QuestionBank for French
Čapka, Tomáš SYN2015: Representative Corpus of Contemporary Written Czech
Cardeñoso-Payo, Valentín On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities
Cardoso, Aida CEPLEXicon ― A Lexicon of Child European Portuguese
Carlini, Roberto Example-based Acquisition of Fine-grained Collocation Resources
Carlmeyer, Birte How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Carl, Michael English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting
Carlotto, Talvany Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface
Carman, Mark James That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models
Caroli, Frederico Tommasi NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network Models
Carpenter, Jordan An Empirical Exploration of Moral Foundations Theory in Partisan News Sources
Carrive, Jean Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context
Carvalho, Paula Port4NooJ v3.0: Integrated Linguistic Resources for Portuguese NLP
Caselli, Tommaso GRaSP: A Multilayered Annotation Scheme for Perspectives
NLP and Public Engagement: The Case of the Italian School Reform
Crowdsourcing Salient Information from News and Tweets
Temporal Information Annotation: Crowd vs. Experts
Cassidy, Steve Publishing the Trove Newspaper Corpus
Castellucci, Giuseppe A Language Independent Method for Generating Large Scale Polarity Lexicons
Castilho, Sheila Evaluating the Impact of Light Post-Editing on Usability
Castillo, Carlos Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages
Cavar, Damir Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR
Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project
Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)
Cavar, Malgorzata Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR
Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project
Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)
Cavazza, Marc A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations
Cavicchio, Federica ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions
Cebović, Ines Building the Macedonian-Croatian Parallel Corpus
Celebi, Arda Segmenting Hashtags using Automatically Created Training Data
Celli, Fabio Multilevel Annotation of Agreement and Disagreement in Italian News Blogs
Celorico, Dirce The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation
Čermáková, Anna SYN2015: Representative Corpus of Contemporary Written Czech
Cerrato, Loredana The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus
Çetinoğlu, Özlem A Turkish-German Code-Switching Corpus
Cettolo, Mauro WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words
Chakrabarty, Abhisek A Neural Lemmatizer for Bengali
Chakraborty, Nilesh FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Chalub, Fabricio Semantic Links for Portuguese
Chamberlain, Jon Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference.
Chanfreau, Agustin Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Chang, Angel A comparison of Named-Entity Disambiguation and Word Sense Disambiguation
Chang, Chung-Ning A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives
Charlet, Delphine Web Chat Conversations from Contact Centers: a Descriptive Study
Charnois, Thierry Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature
Charton, Eric SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking
Chaturvedi, Akshay A Neural Lemmatizer for Bengali
Chavernac, David Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
Chen, Francine Corpus for Customer Purchase Behavior Prediction in Social Media
Chen, Hsin-Hsi Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign Language
Fine-Grained Chinese Discourse Relation Labelling
Subtask Mining from Search Query Logs for How-Knowledge Acceleration
Chen, Huan-Yuan Fine-Grained Chinese Discourse Relation Labelling
Chen, Jiajun Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing
Chen, Lei Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Chen, Xi Building a Dataset for Possessions Identification in Text
Chen, Yan-Ying Corpus for Customer Purchase Behavior Prediction in Social Media
Chen, Yun-Nung AIMU: Actionable Items for Meeting Understanding
AppDialogue: Multi-App Dialogues for Intelligent Assistants
Cherry, Colin A Dataset for Detecting Stance in Tweets
Che, Xiaoyin Punctuation Prediction for Unsegmented Transcript Based on Word Vector
Chiarcos, Christian Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data
Word Segmentation for Akkadian Cuneiform
Combining Ontologies and Neural Networks for Analyzing Historical Language Varieties. A Case Study in Middle Low German
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Chiu, Billy Syllable based DNN-HMM Cantonese Speech to Text System
Chiu, Tin-Shing Nine Features in a Random Forest to Learn Taxonomical Semantic Relations
What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets
Chlumská, Lucie SYN2015: Representative Corpus of Contemporary Written Czech
Chodroff, Eleanor New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification
Choi, Eunsol Extracting Structured Scholarly Information from the Machine Translation Literature
Choi, Ho-Jin Korean TimeML and Korean TimeBank
Choi, Key-Sun Korean TimeML and Korean TimeBank
Cho, Kit The Validation of MRCPD Cross-language Expansions on Imageability Ratings
ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
Cholakov, Kostadin Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Chollet, Mathieu A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety
Chorianopoulou, Arodami The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
Choudhury, Monojit Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments
Choukri, Khalid ELRA Activities and Services
Language Resource Citation: the ISLRN Dissemination and Further Developments
The ELRA License Wizard
New Developments in the LRE Map
Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities
Chowdhury, Shammur Absar Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it?
Christensen, Heidi A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus
Christodoulopoulos, Christos EDISON: Feature Extraction for NLP, Simplified
Chu, Chenhui Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons
Cieri, Christopher Trends in HLT Research: A Survey of LDC's Data Scholarship Program
The Language Application Grid and Galaxy
Selection Criteria for Low Resource Language Programs
Building Language Resources for Exploring Autism Spectrum Disorders
Data Management Plans and Data Centers
Cimiano, Philipp How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Crowdsourcing Ontology Lexicons
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Cinkova, Silvie Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
VPS-GradeUp: Graded Decisions on Usage Patterns
Coreference in Prague Czech-English Dependency Treebank
Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Ciobanu, Alina Maria A Computational Perspective on the Romanian Dialects
Ciravegna, Fabio JATE 2.0: Java Automatic Term Extraction with Apache Solr
Claessen, Koen Analysing Constraint Grammars with a SAT-solver
Clare, Amanda Applying Core Scientific Concepts to Context-Based Citation Recommendation
Claveau, Vincent Distributional Thesauri for Information Retrieval and vice versa
Evaluating Lexical Similarity to build Sentiment Similarity
Clematide, Simon Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus
Cleve, Anthony Modelling a Parallel Corpus of French and French Belgian Sign Language
Cnossen, Fokie Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
Codina-Filba, Joan Towards Multiple Antecedent Coreference Resolution in Specialized Discourse
Cohan, Arman Revisiting Summarization Evaluation for Scientific Articles
Cohen, K. Bretonnel SuperCAT: The (New and Improved) Corpus Analysis Toolkit
Coheur, Luisa Building a Corpus of Errors and Quality in Machine Translation: Experiments on Error Impact
Cohn, Trevor Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection
Collins, Kathryn J. Towards a Multi-dimensional Taxonomy of Stories in Dialogue
Collovini, Sandra Summ-it++: an Enriched Version of the Summ-it Corpus
A Sequence Model Approach to Relation Extraction in Portuguese
Colotte, Vincent The IFCASL Corpus of French and German Non-native and Native Read Speech
Conger, Kathryn Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Cook, Paul Evaluating a Topic Modelling Approach to Measuring Corpus Similarity
Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus
Copestake, Ann Resources for building applications with Dependency Minimal Recursion Semantics
Corcoglioniti, Francesco PreMOn: a Lemon Extension for Exposing Predicate Models as Linked Data
Cordeiro, Silvio mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing
Corrales-Astorgano, Mario On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities
Correia, Rui Building a Corpus of Errors and Quality in Machine Translation: Experiments on Error Impact
metaTED: a Corpus of Metadiscourse for Spoken Language
Costa, Angela Building a Corpus of Errors and Quality in Machine Translation: Experiments on Error Impact
Couillault, Alain Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Yes, We Care! Results of the Ethics and Natural Language Processing Surveys
Courtin, Antoine Automatic Classification of Tweets for Analyzing Communication Behavior of Museums
Coutinho, Eduardo Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Couto-Vale, Daniel Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs
Crevier-Buchman, Lise The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Croce, Danilo A Language Independent Method for Generating Large Scale Polarity Lexicons
Cruz, Hilaria Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR
Cuadros, Montse A Comparison of Domain-based Word Polarity Estimation using different Word Embeddings
Cuba Gyllensten, Amaru The Gavagai Living Lexicon
Cucchiarini, Catia Palabras: Crowdsourcing Transcriptions of L2 Speech
A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research
Cucurullo, Sebastiana ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect Atlas
Cunningham, Stuart A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus
Curto, Pedro SPA: Web-based Platform for easy Access to Speech Processing Modules
Cvrček, Václav SYN2015: Representative Corpus of Contemporary Written Czech
Cysouw, Michael Concepticon: A Resource for the Linking of Concept Lists

 

D
Dabre, Raj Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
da Costa Pereira, Célia DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion Mining
Daelemans, Walter Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource
TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling
Dagan, Ido The Negochat Corpus of Human-agent Negotiation Dialogues
Daiber, Joachim The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions
Daille, Béatrice Evaluating Lexical Similarity to build Sentiment Similarity
Ambiguity Diagnosis for Terms in Digital Humanities
Bilingual Lexicon Extraction at the Morpheme Level Using Distributional Analysis
TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation
Dai, Xin-Yu Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing
Damnati, Geraldine Web Chat Conversations from Contact Centers: a Descriptive Study
Danforth, Douglas A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System
Danieli, Morena Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations
Darģis, Roberts Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Darwish, Kareem Farasa: A New Fast and Accurate Arabic Word Segmenter
Das, Amitava Comparing the Level of Code-Switching in Corpora
Dash, Arnab AppDialogue: Multi-App Dialogues for Intelligent Assistants
da Silva, João Carlos Pereira NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network Models
David, Jérôme Cross-lingual RDF Thesauri Interlinking
Dayrell, Carmen Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
de Carvalho, Rita CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese
Declerck, Thierry The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Monolingual Social Media Datasets for Detecting Contradiction and Entailment
De Clercq, Orphee Rude waiter but mouthwatering pastries! An exploratory study into Dutch Aspect-Based Sentiment Analysis
Dediu, Dan Defining and Counting Phonological Classes in Cross-linguistic Segment Databases
Degaetano-Ortlieb, Stefania The Royal Society Corpus: From Uncharted Data to Corpus
de Juan, Paloma Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
De Kuthy, Kordula Focus Annotation of Task-based Data: A Comparison of Expert and Crowd-Sourced Annotation in a Reading Comprehension Corpus
Delais-Roussarie, Elisabeth The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Deléglise, Paul Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks
Del Gratta, Riccardo New Developments in the LRE Map
LREC as a Graph: People and Resources in a Network
Delli Bovi, Claudio A Large-Scale Multilingual Disambiguation of Glosses
Dell'Orletta, Felice CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence
del Pozo, Arantza Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles
Del Tredici, Marco Assessing the Potential of Metaphoricity of verbs using corpus data
de Marneffe, Marie-Catherine Universal Dependencies v1: A Multilingual Treebank Collection
Demberg, Vera Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks
Dembowski, Julia CASSAurus: A Resource of Simpler Spanish Synonyms
de Melo, Gerard Medical Concept Embeddings via Labeled Background Corpora
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Demir, Hakan Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings
Demner-Fushman, Dina Annotating Logical Forms for EHR Questions
Annotating Named Entities in Consumer Health Questions
de Montcheuil, Gregoire 4Couv: A New Treebank for French
Demuynck, Kris SCALE: A Scalable Language Engineering Toolkit
Denk-Linnert, Doris-Maria A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices
Den, Yasuharu Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation
de Paiva, Valeria Semantic Links for Portuguese
Derczynski, Leon Complementarity, F-score, and NLP Evaluation
GATE-Time: Extraction of Temporal Expressions and Events
de Ruiter, Laura DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
Derval, Mathieu Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context
De Smedt, Koenraad MWEs in Treebanks: From Survey to Guidelines
NorGramBank: A ‘Deep’ Treebank for Norwegian
Deulofeu, José DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
DeVault, David PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
de Weerd, Harmen Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
Dhuliawala, Shehzaad SlangNet: A WordNet like resource for English Slang
Diab, Mona Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
Dias Cardoso, Pedro Domain Adaptation for Named Entity Recognition Using CRFs
Diaz, Alberto Improving Information Extraction from Wikipedia Texts using Basic English
Di Buccio, Emanuele Designing A Long Lasting Linguistic Project: The Case Study of ASIt
di Buono, Maria Pia Semi-automatic Parsing for Web Knowledge Extraction through Semantic Annotation
Di Caro, Luigi Automatic Enrichment of WordNet with Common-Sense Knowledge
Dick, Melanie A Lexical Resource for the Identification of “Weak Words” in German Specification Documents
Dick, Michelle A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives
Diewald, Nils KorAP Architecture ― Diving in the Deep Sea of Corpus Data
Dijkstra, Jelske A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Dima, Emanuel Crosswalking from CMDI to Dublin Core and MARC 21
Dimitrova, Vanya Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data
Dimitrov, Stefan Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel
Dimou, Athanasia-Lida Multimodal Resources for Human-Robot Communication Modelling
Dinarelli, Marco Domain Adaptation for Named Entity Recognition Using CRFs
Dini, Luca Emotion Analysis on Twitter: The Hidden Challenge
Dinu, Liviu P. A Computational Perspective on the Romanian Dialects
Using Word Embeddings to Translate Named Entities
A Corpus of Native, Non-native and Translated Texts
Di Nunzio, Giorgio Maria Designing A Long Lasting Linguistic Project: The Case Study of ASIt
DiPersio, Denise Trends in HLT Research: A Survey of LDC's Data Scholarship Program
Data Management Plans and Data Centers
Dirix, Peter AfriBooms: An Online Treebank for Afrikaans
Djemaa, Marianne Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology
A General Framework for the Annotation of Causality Based on FrameNet
Dobrovoljc, Kaja The Universal Dependencies Treebank of Spoken Slovenian
Do, Hyun-Woo Korean TimeML and Korean TimeBank
Doi, Syunya Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Dojchinovski, Milan Crowdsourced Corpus with Entity Salience Annotations
FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training Corpus
Dragoni, Mauro DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion Mining
Dras, Mark Modeling Language Change in Historical Corpora: The Case of Portuguese
Draxler, Christoph The BAS Speech Data Repository
BAS Speech Science Web Services - an Update of Current Developments
Drumond, Lucas Learning Thesaurus Relations from Distributional Features
Druskat, Stephan Enriching TimeBank: Towards a more precise annotation of temporal relations in a text
corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic Corpora
Dubuisson Duplessis, Guillaume Purely Corpus-based Automatic Conversation Authoring
Duclot, William CirdoX: an on/off-line multisource speech and sound analysis software
Dufour, Barbara Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
Du, Jinhua Using BabelNet to Improve OOV Coverage in SMT
ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool
Duma, Daniel Applying Core Scientific Concepts to Context-Based Citation Recommendation
Dumitrescu, Ștefan Daniel The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language
Dumont, Corentin Question-Answering with Logic Specific to Video Games
Dupont, Stéphane AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis
Dutoit, Thierry AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis
Dyer, Chris Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik
Dyvik, Helge NorGramBank: A ‘Deep’ Treebank for Norwegian

 

E
Eckart de Castilho, Richard Sense-annotating a Lexical Substitution Data Set with Ubyline
Eckart, Thomas Features for Generic Corpus Querying
Ecker, Brian Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it
Ecker, Stefan Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages
Eckert, Kai A Large DataBase of Hypernymy Relations Extracted from the Web.
Eckle-Kohler, Judith Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations
Edlund, Jens Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives
Efthimiou, Eleni Multimodal Resources for Human-Robot Communication Modelling
Eger, Steffen Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art
Ehrmann, Maud Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms
Named Entity Resources - Overview and Outlook
Eibl, Maximilian A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation
Eichler, Kathrin TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns
Eiselen, Roald South African Language Resources: Phrase Chunking
Government Domain Named Entity Recognition for South African Languages
Ekbal, Asif Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi
Aspect based Sentiment Analysis in Hindi: Resource Creation and Evaluation
Ekenel, Hazim The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
El Ballouli, Rim Arabic Corpora for Credibility Analysis
ELbassouni, Shady Arabic Corpora for Credibility Analysis
El-Beltagy, Samhaa R. NileULex: A Phrase and Word Level Sentiment Lexicon for Egyptian and Modern Standard Arabic
Elhadad, Michael The Hebrew FrameNet Project
El Haddad, Kevin AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis
El-Hajj, Wassim Arabic Corpora for Credibility Analysis
El-Haj, Mahmoud Learning Tone and Attribution for Financial Text Mining
Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
OSMAN ― A Novel Arabic Readability Metric
Elingui, Uriel Pascal Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
Ellendorff, Tilia The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological Factors
Elliott, Desmond A Corpus of Images and Text in Online News
1 Million Captioned Dutch Newspaper Images
Emerson, Guy Resources for building applications with Dependency Minimal Recursion Semantics
Emmery, Chris Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource
Engelmann, Kai Frederic An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes
How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Enström, Ingegerd SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
Erdmann, Johnsey A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System
Eriksson, Robin Quality Assessment of the Reuters Vol. 2 Multilingual Corpus
Erjavec, Tomaž Corpus-Based Diacritic Restoration for South Slavic Languages
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene
Erro, Daniel A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza
Escudero-Mancebo, David On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities
Eshkol, Iris Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Eshkol-Taravela, Iris Detection of Reformulations in Spoken French
Eskander, Ramy Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
Eskenazi, Maxine metaTED: a Corpus of Metadiscourse for Spoken Language
España-Bonet, Cristina TweetMT: A Parallel Microblog Corpus
Espinosa Anke, Luis Example-based Acquisition of Fine-grained Collocation Resources
ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain
Espinoza, Fredrik The Gavagai Living Lexicon
Esplà-Gomis, Miquel Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
Estève, Yannick Word Embedding Evaluation and Combination
Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks
Etchegoyhen, Thierry Exploiting a Large Strongly Comparable Corpus
Etcheverry, Mathias Spanish Word Vectors from Wikipedia
Etxeberria, Izaskun Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
Euzenat, Jérôme Cross-lingual RDF Thesauri Interlinking
Eyssel, Friederike How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment

 

F
Faessler, Erik UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP Pipelines
Fairon, Cédrick Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource
Falala, Sylvain Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
Falk, Ingrid Aspectual Flexibility Increases with Agentivity and Concreteness\\ A Computational Classification Experiment on Polysemous Verbs
"LVF-lemon ― Towards a Linked Data Representation of ""Les Verbes français"""
Fandrych, Christian User, who art thou? User Profiling for Oral Corpus Platforms
Fang, Alex The DialogBank
Farah, Benamara Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus
Farajian, M. Amin WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words
Faralli, Stefano A Large DataBase of Hypernymy Relations Extracted from the Web.
Farzand, Omer Urdu Summary Corpus
Fatema, Kaniz Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects
Fäth, Christian Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data
Fauth, Camille The IFCASL Corpus of French and German Non-native and Native Read Speech
Favre, Benoit A Document Repository for Social Media and Speech Conversations
Word Embedding Evaluation and Combination
Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations
Fawei, Biralatei Passing a USA National Bar Exam: a First Corpus for Experimentation
Fazly, Afsaneh Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus
Federico, Marcello WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words
Feldman, Laurie ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
Fellbaum, Christiane Encoding Adjective Scales for Fine-grained Resources
Feltracco, Anna Acquiring Opposition Relations among Italian Verb Senses using Crowdsourcing
Ferguson, Emily Building Language Resources for Exploring Autism Spectrum Disorders
Fernández Barrera, Meritxell Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities
The ELRA License Wizard
Fernandez, Raquel PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
Fernandez Rei, Elisa Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech
Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis
Ferreira, Eduardo B2SG: a TOEFL-like Task for Portuguese
Ferreira, Jaime SPA: Web-based Platform for easy Access to Speech Processing Modules
Ferrero, Jérémy A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
Ferret, Olivier A Dataset for Open Event Extraction in English
Ferrugento, Adriana Can Topic Modelling benefit from Word Sense Information?
Figueira, Anny Summ-it++: an Enriched Version of the Summ-it Corpus
Finatto, Maria José Bocorny VerbLexPor: a lexical resource with semantic roles for Portuguese
Finch, Andrew Introducing the Asian Language Treebank (ALT)
Fisas, Beatriz A Multi-Layered Annotated Corpus of Scientific Papers
Fischer, Andrea Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility
Fischer, Stefan Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific Texts
Fišer, Darja Corpus-Based Diacritic Restoration for South Slavic Languages
Flickinger, Dan Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Flores-Lucas, Valle On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities
Fohr, Dominique The IFCASL Corpus of French and German Non-native and Native Read Speech
How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News
Fokkens, Antske Two Architectures for Parallel Processing of Huge Amounts of Text
GRaSP: A Multilayered Annotation Scheme for Perspectives
Fomicheva, Marina Using Contextual Information for Machine Translation Evaluation
Fonseca, Evandro Summ-it++: an Enriched Version of the Summ-it Corpus
Adapting an Entity Centric Model for Portuguese Coreference Resolution
Forkel, Robert Concepticon: A Resource for the Linking of Concept Lists
Forsberg, Markus Deriving Morphological Analyzers from Example Inflections
Fort, Karën Yes, We Care! Results of the Ethics and Natural Language Processing Surveys
Foster, Jonathan What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Foster, Simon The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological Factors
Fothergill, Richard Evaluating a Topic Modelling Approach to Measuring Corpus Similarity
Fotinea, Stavroula―Evita Multimodal Resources for Human-Robot Communication Modelling
Foucault, Nicolas Automatic Classification of Tweets for Analyzing Communication Behavior of Museums
Fougeron, Cecile The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Fournier, Sebastien Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers
Fox Tree, Jean A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives
Coordinating Communication in the Wild: The Artwalk Dialogue Corpus of Pedestrian Navigation and Mobile Referential Communication
A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character
A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs
Frain, Alice SatiricLR: a Language Resource of Satirical News Articles
Francisco, Virginia Riddle Generation using Word Associations
Francois, Thomas SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners
Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis
Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource
Francopoulo, Gil The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
A Study of Reuse and Plagiarism in LREC papers
Predictive Modeling: Guessing the NLP Terms of Tomorrow
Frank, Anette Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data
Frankenberg, Claudia Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging
Fredouille, Corinne Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech
The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Freitag, Dayne An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in Text
Freitas, André NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network Models
Freitas, Bianca QUEMDISSE? Reported speech in Portuguese
Freitas, Cláudia QUEMDISSE? Reported speech in Portuguese
Freitas, Maria João CEPLEXicon ― A Lexicon of Child European Portuguese
Frick, Elena Corpus Query Lingua Franca (CQLF)
User, who art thou? User Profiling for Oral Corpus Platforms
Fried, Daniel Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness
Frieder, Ophir Effects of Sampling on Twitter Trend Detection
Frontini, Francesca Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF
LREC as a Graph: People and Resources in a Network
Füchsel, Silke A Language Resource of German Errors Written by Children with Dyslexia
Fujita, Akira Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency Tests
Fulgoni, Dean An Empirical Exploration of Moral Foundations Theory in Partisan News Sources
Funakoshi, Kotaro The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics
Fünfer, Sarah Evaluation of the KIT Lecture Translation System
Fung, Pascale A Machine Learning based Music Retrieval and Recommendation System
Deep Learning of Audio and Language Features for Humor Prediction
Funk, Adam What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
A Document Repository for Social Media and Speech Conversations
Furrer, Lenz Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus

 

G
Gábor, Kata Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature
Gabryszak, Aleksandra Relation- and Phrase-level Linking of FrameNet with Sar-graphs
Gagliardi, Gloria Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
Gaizauskas, Robert A Document Repository for Social Media and Speech Conversations
Cross-validating Image Description Datasets and Evaluation Metrics
What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Galibert, Olivier Generating Task-Pertinent sorted Error Lists for Speech Recognition
Galvan, Paloma Riddle Generation using Word Associations
Gamallo, Pablo TweetMT: A Parallel Microblog Corpus
Gambäck, Björn Comparing the Level of Code-Switching in Corpora
Ganguly, Debasis Developing a Dataset for Evaluating Approaches for Document Expansion with Images
Ganguly, Niloy Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments
Ganzeboom, Mario A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research
Gao, Jie JATE 2.0: Java Automatic Term Extraction with Apache Solr
Garain, Utpal A Neural Lemmatizer for Bengali
Garcia, Marcos Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level
García Mateo, Carmen CORILSE: a Spanish Sign Language Repository for Linguistic Analysis
Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech
Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis
García-Miguel, José Mª CORILSE: a Spanish Sign Language Repository for Linguistic Analysis
García Pablos, Aitor A Comparison of Domain-based Word Polarity Estimation using different Word Embeddings
Garnier, Marie Error Typology and Remediation Strategies for Requirements Written in English by Non-Native Speakers
Gaspari, Federico Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities
Gast, Volker Enriching TimeBank: Towards a more precise annotation of temporal relations in a text
corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic Corpora
Gaudio, Rosa Evaluating Machine Translation in a Usage Scenario
Gauthier, Elodie Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
Geoffrois, Edouard Evaluating Interactive System Adaptation
Georgeton, Laurianne The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Georg, Gersende A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations
Georgiladakis, Spiros Cognitively Motivated Distributional Representations of Meaning
Gerlach, Johanna A Shared Task for Spoken CALL?
Ghaddar, Abbas WikiCoref: An English Coreference-annotated Corpus of Wikipedia Articles
Ghannay, Sahar Word Embedding Evaluation and Combination
Ghidoni, Enrico Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
Ghio, Alain The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Ghoneim, Mahmoud Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
Giannini, Silvia Two Decades of Terminology: European Framework Programmes Titles
Gibbon, Dafydd Legacy language atlas data mining: mapping Kru languages
Gilmartin, Emer Capturing Chat: Annotation and Tools for Multiparty Casual Conversation.
Ginter, Filip Universal Dependencies v1: A Multilingual Treebank Collection
Universal Dependencies for Persian
Ginzburg, Jonathan DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
Girard-Rivier, Maxence Ecological Gestures for HRI: the GEE Corpus
Gkatzia, Dimitra The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes
Glaser, Elvira ArchiMob - A Corpus of Spoken Swiss German
Gleim, Rüdiger Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art
Gobert, Maxime Modelling a Parallel Corpus of French and French Belgian Sign Language
Godfrey, John New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification
Goeuriot, Lorraine Building Evaluation Datasets for Consumer-Oriented Information Retrieval
Goggi, Sara Two Decades of Terminology: European Framework Programmes Titles
Goharian, Nazli Revisiting Summarization Evaluation for Scientific Articles
Effects of Sampling on Twitter Trend Detection
Gokcen, Ajda A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System
Goldberg, Yoav Universal Dependencies v1: A Multilingual Treebank Collection
Gomes, Luís Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models
First Steps Towards Coverage-Based Sentence Alignment
Gómez Guinovart, Xavier Enriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary
Gomez, Randy Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
Gómez-Rodríguez, Carlos EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis
Gonçalo Oliveira, Hugo Discovering Fuzzy Synsets from the Redundancy in Different Lexical-Semantic Resources
TweetMT: A Parallel Microblog Corpus
Can Topic Modelling benefit from Word Sense Information?
Gonçalves, Anabela The COPLE2 corpus: a learner corpus for Portuguese
González-Ferreras, César On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities
Gonzàlez, Meritxell Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse
González Saavedra, Berta Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
Goodman, Michael Wayne Resources for building applications with Dependency Minimal Recursion Semantics
Goodwin, Travis Embedding Open-domain Common-sense Knowledge from Text
Gorisch, Jan A CUP of CoFee: A large Collection of feedback Utterances Provided with communicative function annotations
Gornostaja, Tatjana FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Gosko, Didzis Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
Götze, Jana SpaceRef: A corpus of street-level geographic descriptions
Goulas, Theodore Multimodal Resources for Human-Robot Communication Modelling
Goutte, Cyril Discriminating Similar Languages: Evaluations and Explorations
Goyal, Kartik Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik
Grabar, Natalia A Large Rated Lexicon with French Medical Words
Detection of Reformulations in Spoken French
Gracia, Jorge Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Graff, David Multi-language Speech Collection for NIST LRE
Graham, Calbert Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora
Graliński, Filip “He Said She Said” ― a Male/Female Corpus of Polish
Granvogl, Daniel Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing
Green, Phil A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus
Greenwood, Mark A. GATE-Time: Extraction of Temporal Expressions and Events
Grefenstette, Gregory Extracting Weighted Language Lexicons from Wikipedia
Griffitt, Kira The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval
Grimes, Stephen Uzbek-English and Turkish-English Morpheme Alignment Corpora
Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Grishman, Ralph Entity Linking with a Paraphrase Flavor
Grouas, Thibault Review on the Existing Language Resources for Languages of France
Grouin, Cyril Text Segmentation of Digitized Clinical Texts
Identification of Drug-Related Medical Conditions in Social Media
Controlled Propagation of Concept Annotations in Textual Corpora
Grover, Claire Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations
Grūzītis, Normunds Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Guerraz, Aleksandra Web Chat Conversations from Contact Centers: a Descriptive Study
Guillou, Erwan The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Guillou, Liane PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation
Gulordava, Kristina Discontinuous Verb Phrases in Parsing and Machine Translation of English and German
Gupta, Palash Coreference Annotation Scheme and Relation Types for Hindi
Gurevych, Iryna Sense-annotating a Lexical Substitution Data Set with Ubyline
Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data
C4Corpus: Multilingual Web-size Corpus with Free License
Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations
Gurrutxaga, Antton Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
Gustafson, Joakim Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives
Gutiérrez-González, Yurena On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities
Gutierrez-Vasques, Ximena Axolotl: a Web Accessible Parallel Corpus for Spanish-Nahuatl
Gutkin, Alexander TTS for Low Resource Languages: A Bangla Synthesizer

 

H
Haaf, Susanne Corpus Analysis based on Structural Phenomena in Texts: Exploiting TEI Encoding for Linguistic Research
Habash, Nizar Arabic Corpora for Credibility Analysis
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
DALILA: The Dialectal Arabic Linguistic Learning Assistant
A Large Scale Corpus of Gulf Arabic
Applying the Cognitive Machine Translation Evaluation Approach to Arabic
Exploiting Arabic Diacritization for High Quality Automatic Annotation
Habernal, Ivan C4Corpus: Multilingual Web-size Corpus with Free License
Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations
HaCohen-Kerner, Yaakov A Lexical Resource of Hebrew Verb-Noun Multi-Word Expressions
Hagen, Kristin Constructing a Norwegian Academic Wordlist
Hagmüller, Martin AMISCO: The Austrian German Multi-Sensor Corpus
Hahn-Powell, Gus Sieve-based Coreference Resolution in the Biomedical Domain
Odin's Runes: A Rule Language for Information Extraction
Hahn, Udo CodE Alltag: A German-Language E-Mail Corpus
UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP Pipelines
Hain, Thomas The OpenCourseWare Metadiscourse (OCWMD) Corpus
A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus
Hajic, Jan QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Universal Dependencies v1: A Multilingual Treebank Collection
Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities
UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Hajj, Hazem Arabic Corpora for Credibility Analysis
Hajnicz, Elżbieta Accessing and Elaborating Walenty - a Valence Dictionary of Polish - via Internet Browser
Semantic Layer of the Valence Dictionary of Polish Walenty
Hakkani-Tur, Dilek AIMU: Actionable Items for Meeting Understanding
Halabi, Nawar Phonetic Inventory for an Arabic Speech Corpus
Halfaker, Aaron Edit Categories and Editor Role Identification in Wikipedia
Ha, Linne TTS for Low Resource Languages: A Bangla Synthesizer
Hamfors, Ola The Gavagai Living Lexicon
Hamon, Thierry A Large Rated Lexicon with French Medical Words
Hanbury, Allan Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation
Handschuh, Siegfried NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network Models
Hangya, Viktor A Hungarian Sentiment Corpus Manually Annotated at Aspect Level
Han, Jingyi Towards producing bilingual lexica from monolingual corpora
Hanke, Thomas Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data
Hanl, Michael KorAP Architecture ― Diving in the Deep Sea of Corpus Data
Han, Qi Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification
Hansen, Dorte Haltrup Facilitating Metadata Interoperability in CLARIN-DK
Hantke, Simone Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Harabagiu, Sanda Embedding Open-domain Common-sense Knowledge from Text
H. Arai, Noriko Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency Tests
Harashima, Jun Japanese Word―Color Associations with and without Contexts
A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research
Hardmeier, Christian PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation
Harige, Ravindra Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text
Hartmann, Silvana Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data
Hasanuzzaman, Mohammed Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi
Hasida, Koiti Graphical Annotation for Syntax-Semantics Mapping
Hassan, Sara A Large Scale Corpus of Gulf Arabic
Hateva, Neli BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology
Hathout, Nabil Giving Lexical Resources a Second Life: Démonette, a Multi-sourced Morpho-semantic Network for French
Wiktionnaire's Wikicode GLAWIfied: a Workable French Machine-Readable Dictionary
Hätty, Anna GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds
Haugereid, Petter NorGramBank: A ‘Deep’ Treebank for Norwegian
Hawwari, Abdelati Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
Hayakawa, Akira The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus
Hayashi, Yoshihiko A Framework for Cross-lingual/Node-wise Alignment of Lexical-Semantic Resources
Extending Monolingual Semantic Textual Similarity Task to Multiple Cross-lingual Settings
Hayoun, Avi The Hebrew FrameNet Project
Hazem, Amir Bilingual Lexicon Extraction at the Morpheme Level Using Distributional Analysis
Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models
Hedberg, Karin A Multi-domain Corpus of Swedish Word Sense Annotation
Hedeland, Hanna User, who art thou? User Profiling for Oral Corpus Platforms
Heid, Ulrich A Lexical Resource for the Identification of “Weak Words” in German Specification Documents
Hellmann, Sebastian FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training Corpus
Hellrich, Johannes UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP Pipelines
Hendrickx, Iris Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Hendrikx, Pascal Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
Hennig, Leonhard TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns
Relation- and Phrase-level Linking of FrameNet with Sar-graphs
Henriksen, Lina Providing a Catalogue of Language Resources for Commercial Users
Hensler, Andrea A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Hepple, Mark What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection
Hermann, Thomas How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Herms, Robert A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation
Hernaez, Inma A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza
Hernández Farías, Delia Irazú Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola
Hernandez, Nicolas Ubuntu-fr: A Large and Open Corpus for Multi-modal Analysis of Online Written Conversations
Hernandez Pompa, Isaac Axolotl: a Web Accessible Parallel Corpus for Spanish-Nahuatl
Hernando, Javier The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Hersh, William On Developing Resources for Patient-level Information Retrieval
Hervas, Raquel Improving Information Extraction from Wikipedia Texts using Basic English
Riddle Generation using Word Associations
He, Yifan Entity Linking with a Paraphrase Flavor
He, Yulan Detecting Expressions of Blame or Praise in Text
Hicks, Davyth Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
Higashinaka, Ryuichiro The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics
Hirayama, Naoki Parallel Speech Corpora of Japanese Dialects
Hládek, Daniel Evaluation Set for Slovak News Information Retrieval
Hladka, Barbora Czech Legal Text Treebank 1.0
Hnátková, Milena SYN2015: Representative Corpus of Contemporary Written Czech
Hoenen, Armin Wikipedia Titles As Noun Tag Predictors
TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics
Hofmann, Hansjörg A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems
Hohle, Petter Universal Dependencies for Norwegian
Hokamp, Chris MARMOT: A Toolkit for Translation Quality Estimation at the Word Level
Hollenstein, Nora Inconsistency Detection in Semantic Annotation
Hollink, Laura A Corpus of Images and Text in Online News
Holst, Anders The Gavagai Living Lexicon
Holthaus, Patrick An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes
How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Homburg, Timo Word Segmentation for Akkadian Cuneiform
Hongchao, Liu EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs
Hönig, Florian Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Horbach, Andrea Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages
Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario
A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Horsmann, Tobias FlexTag: A Highly Flexible PoS Tagging Framework
Horvat, Matic Extracting Structured Scholarly Information from the Machine Translation Literature
Resources for building applications with Dependency Minimal Recursion Semantics
Hoste, Véronique A Classification-based Approach to Economic Event Detection in Dutch News Text
Exploring the Realization of Irony in Twitter Data
Rude waiter but mouthwatering pastries! An exploratory study into Dutch Aspect-Based Sentiment Analysis
Hough, Julian DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
Hovy, Dirk Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics
Hovy, Eduard Edit Categories and Editor Role Identification in Wikipedia
Htait, Amal Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers
Huang, Chu-Ren A lexicon of perception for the identification of synaesthetic metaphors in corpora
Nine Features in a Random Forest to Learn Taxonomical Semantic Relations
What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets
EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs
Database of Mandarin Neighborhood Statistics
Huangfu, Luwen Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness
Huang, Hen-Hsen Fine-Grained Chinese Discourse Relation Labelling
Huang, Shujian Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing
Hua, Zhenhao AppDialogue: Multi-App Dialogues for Intelligent Assistants
Hubert, Isabell Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida
Huck, Matthias Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Huet, Stéphane Automatic Corpus Extension for Data-driven Natural Language Generation
Hu, Junfeng Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia
Hulden, Mans Deriving Morphological Analyzers from Example Inflections
Morphological Analysis of Sahidic Coptic for Automatic Glossing
Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
Humayoun, Muhammad Urdu Summary Corpus
Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization
Hunter, Julie Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus
Hupkes, Dieuwke POS-tagging of Historical Dutch
Husic, Halima A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon
Huygen, Paul Two Architectures for Parallel Processing of Huge Amounts of Text
Hu, Zhichao A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives

 

I
Ide, Nancy The Language Application Grid and Galaxy
Idiart, Marco Multiword Expressions in Child Language
Ijuin, Koki Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations
Iliakopoulou, Aikaterini Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Iliash, Anna User, who art thou? User Profiling for Oral Corpus Platforms
Ilievski, Filip Context-enhanced Adaptive Entity Linking
Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
Illina, Irina How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News
Imada, Takakazu Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Imran, Muhammad Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages
Inaba, Michimasa The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics
Indig, Balázs Mapping Ontologies Using Ontologies: Cross-lingual Semantic Role Information Transfer
Inel, Oana Crowdsourcing Salient Information from News and Tweets
Temporal Information Annotation: Crowd vs. Experts
Inoue, Masashi Dialogue System Characterisation by Back-channelling Patterns Extracted from Dialogue Corpus
Inoue, Yusuke Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Inui, Kentaro Question-Answering with Logic Specific to Video Games
Ioki, Masayuki A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research
Iosif, Elias The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
Crossmodal Network-Based Distributional Semantic Models
Cognitively Motivated Distributional Representations of Meaning
Affective Lexicon Creation for the Greek Language
Iribe, Yurie Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese
Irimia, Elena The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language
Isahara, Hitoshi ASPEC: Asian Scientific Paper Excerpt Corpus
Isard, Amy The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts
Ishida, Mitsuru Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations
Ishida, Toru Constraint-Based Bilingual Lexicon Induction for Closely Related Languages
Towards a Language Service Infrastructure for Mobile Environments
Itoyama, Katsutoshi Parallel Speech Corpora of Japanese Dialects
Ivanova, Angelina Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Izquierdo, Ruben Addressing the MFS Bias in WSD systems

 

J
Jabaian, Bassam Automatic Corpus Extension for Data-driven Natural Language Generation
Jackl, Bernhard BAS Speech Science Web Services - an Update of Current Developments
Jacquet, Guillaume Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms
Jacquey, Evelyne Ambiguity Diagnosis for Terms in Digital Humanities
Jadi, Grégoire Evaluating Lexical Similarity to build Sentiment Similarity
Jaffe, Evan A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System
Jagrova, Klara Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility
Jaimes, Alejandro Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Jain, Rohit Using lexical and Dependency Features to Disambiguate Discourse Connectives in Hindi
Jakubicek, Milos European Union Language Resources in Sketch Engine
Janier, Mathilde Corpus Resources for Dispute Mediation Discourse
Jansche, Martin TTS for Low Resource Languages: A Bangla Synthesizer
Janssen, Maarten The COPLE2 corpus: a learner corpus for Portuguese
TEITOK: Text-Faithful Annotated Corpora
Jaquette, Daniel Data Management Plans and Data Centers
Jauch, Ronny A Lexical Resource for the Identification of “Weak Words” in German Specification Documents
Jazbec, Ivo-Pavao New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
Jean-Louis, Ludovic SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking
Jelínek, Tomáš SYN2015: Representative Corpus of Contemporary Written Czech
Jeong, Young-Seob Korean TimeML and Korean TimeBank
Jettka, Daniel User, who art thou? User Profiling for Oral Corpus Platforms
Jezek, Elisabetta Acquiring Opposition Relations among Italian Verb Senses using Crowdsourcing
Jha, Girish Issues and Challenges in Annotating Urdu Action Verbs on the IMAGACT4ALL Platform
Jha, Rahul Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Ji, Donghong Multi-prototype Chinese Character Embedding
Jiménez, Ricardo-María Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Jimeno Yepes, Antonio The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine
Johannessen, Janne M Constructing a Norwegian Academic Wordlist
Johannsen, Anders The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics
Johansson, Richard Gulf Arabic Linguistic Resource Building for Sentiment Analysis
A Multi-domain Corpus of Swedish Word Sense Annotation
Jones, Dewi Bryn Cysill Ar-lein: A Corpus of Written Contemporary Welsh Compiled from an On-line Spelling and Grammar Checker
Jones, Gareth Developing a Dataset for Evaluating Approaches for Document Expansion with Images
Jones, Karen Multi-language Speech Collection for NIST LRE
Jonquet, Clement Automatic Biomedical Term Polysemy Detection
Joo, Won-Tae Korean TimeML and Korean TimeBank
Joscelyne, Andrew Providing a Catalogue of Language Resources for Commercial Users
Joshi, Aditya That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models
Jouvet, Denis The IFCASL Corpus of French and German Non-native and Native Read Speech
Jügler, Jeanin The IFCASL Corpus of French and German Non-native and Native Read Speech
Juhár, Jozef Evaluation Set for Slovak News Information Retrieval
An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
Juhn, Young Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events
Junczys-Dowmunt, Marcin The United Nations Parallel Corpus v1.0
Jung, Manuel GATE-Time: Extraction of Temporal Expressions and Events
Jurgens, David Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel

 

K
Kaalep, Heiki-Jaan EstNLTK - NLP Toolkit for Estonian
Kabadjov, Mijail The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
Kabashi, Besim A Proposal for a Part-of-Speech Tagset for the Albanian Language
Kachkovskaia, Tatiana CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous Speech
Kahn, Juliette FABIOLE, a Speech Database for Forensic Speaker Comparison
Generating Task-Pertinent sorted Error Lists for Speech Recognition
Kalamboukis, Theodore Using a Cross-Language Information Retrieval System based on OHSUMED to Evaluate the Moses and KantanMT Statistical Machine Translation Systems
Kameko, Hirotaka A Japanese Chess Commentary Corpus
Kaminski, Steve Crosswalking from CMDI to Dublin Core and MARC 21
Kamocki, Pawel Privacy Issues in Online Machine Translation Services - European Perspective
The Public License Selector: 
Making Open Licensing Easier
Kampstra, Frederik A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Kanayama, Hiroshi Universal Dependencies for Japanese
Kanojia, Diptesh That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models
SlangNet: A WordNet like resource for English Slang
Kaplan, Aidan Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
Kaplan, Dain Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction
Karabüklü, Serpil BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
Karkaletsis, Vangelis CLARIN-EL Web-based Annotation Tool
Karlgren, Jussi The Gavagai Living Lexicon
Kashyap, Laxmi Synset Ranking of Hindi WordNet
Katakis, Ioannis Manousos CLARIN-EL Web-based Annotation Tool
Katayama, Taichi Name Translation based on Fine-grained Named Entity Recognition in a Single Language
Katerenchuk, Denys RankDCG: Rank-Ordering Evaluation Measure
Kato, Akihiko Construction of an English Dependency Corpus incorporating Compound Function Words
Kato, Tsuneo Joining-in-type Humanoid Robot Assisted Language Learning System
Kato, Yoshihide Correcting Errors in a Treebank Based on Tree Mining
Katris, Nikolaos Using a Cross-Language Information Retrieval System based on OHSUMED to Evaluate the Moses and KantanMT Statistical Machine Translation Systems
Kattenberg, Mathijs Two Architectures for Parallel Processing of Huge Amounts of Text
Kawada, Yasuhide Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Kawasaki, Yoshifumi Discriminative Analysis of Linguistic Features for Typological Study
Keiper, Lena Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario
Kelepir, Meltem BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
Kelly, Liadh Building Evaluation Datasets for Consumer-Oriented Information Retrieval
Kemmerer, Steffen SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German
Kemps-Snijders, Marc FLAT: Constructing a CLARIN Compatible Home for Language Resources
Kennington, Casey PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
Kepler, Fabio A Web Tool for Building Parallel Corpora of Spoken and Sign Languages
Kerler, Dov-Ber Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project
Kermanidis, Katia Lida Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Kermes, Hannah The Royal Society Corpus: From Uncharted Data to Corpus
Kettnerová, Václava Distribution of Valency Complements in Czech Complex Predicates: Between Verb and Noun
Kettunen, Kimmo Measuring Lexical Quality of a Historical Finnish Newspaper Collection ― Analysis of Garbled OCR Data with Basic Language Technology Tools and Means
Khalfi, Mustapha Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF
Khalifa, AlBara Joining-in-type Humanoid Robot Assisted Language Learning System
Khalifa, Salam DALILA: The Dialectal Arabic Linguistic Learning Assistant
A Large Scale Corpus of Gulf Arabic
Khamis, Ashraf The Royal Society Corpus: From Uncharted Data to Corpus
Khan, Fahad Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF
LREC as a Graph: People and Resources in a Network
Khan, R. A. The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Khan, Tafseer Ahmed A Proposition Bank of Urdu
Khashabi, Daniel EDISON: Feature Extraction for NLP, Simplified
Khemakhem, Mohamed Sense-annotating a Lexical Substitution Data Set with Ubyline
Khiari, Wejdene Integration of Lexical and Semantic Knowledge for Sentiment Analysis in SMS
Khudanpur, Sanjeev New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification
Khvtisavrishvili, Nana GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds
Kieraś, Witold The on-line version of Grammatical Dictionary of Polish
Kijak, Ewa Distributional Thesauri for Information Retrieval and vice versa
Kilicoglu, Halil Annotating Named Entities in Consumer Health Questions
Kındıroğlu, Ahmet Alp BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
Kingma, Sigrid A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Kiritchenko, Svetlana A Dataset for Detecting Stance in Tweets
Sentiment Lexicons for Arabic Social Media
Happy Accident: A Sentiment Composition Lexicon for Opposing Polarity Phrases
Kirov, Christo Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages
Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms
Kisler, Thomas The BAS Speech Data Repository
BAS Speech Science Web Services - an Update of Current Developments
Kiss, Tibor A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon
Kitaoka, Norihide Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese
Klakow, Dietrich Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility
Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification
Klang, Marcus WIKIPARQ: A Tabulated Wikipedia Resource Using the Parquet Format
Klassen, Prescott Annotating and Detecting Medical Events in Clinical Notes
Klein, Ewan Applying Core Scientific Concepts to Context-Based Citation Recommendation
Klejch, Ondrej Tools and Guidelines for Principled Machine Translation Development
Klenner, Manfred Sentiframes: A Resource for Verb-centered German Sentiment Inference
Kleppe, Martijn 1 Million Captioned Dutch Newspaper Images
Klessa, Katarzyna Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis
Kliegr, Tomáš Crowdsourced Corpus with Entity Salience Annotations
Klimek, Bettina Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Klinger, Roman SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German
Kloppenburg, Lennart Leveraging Native Data to Correct Preposition Errors in Learners' Dutch
Klubička, Filip New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
Klyueva, Natalia Improving corpus search via parsing
Knappen, Jörg The Royal Society Corpus: From Uncharted Data to Corpus
Knight, Dawn Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Knight, Kevin Extracting Structured Scholarly Information from the Machine Translation Literature
Kobayashi, Yuka The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics
Kobourov, Stephen Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness
Kochanowski, Bartłomiej Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0
Kocharov, Daniil CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous Speech
Phoneme Alignment Using the Information on Phonological Processes in Continuous Speech
Koch, Steffen Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification
Koctúr, Tomáš An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
Kohl, Matt Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse
Köhn, Arne Mining the Spoken Wikipedia for Speech Data and Beyond
Koidl, Kevin FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Koiso, Hanae Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation
Kolcz, Alek Effects of Sampling on Twitter Trend Detection
Komachi, Mamoru Analysis of English Spelling Errors in a Word-Typing Game
Konat, Barbara A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive Issues
Konovalov, Vasily The Negochat Corpus of Human-agent Negotiation Dialogues
Köper, Maximilian Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification
Automatically Generated Affective Norms of Abstractness, Arousal, Imageability and Valence for 350 000 German Lemmas
Kordjamshidi, Parisa EDISON: Feature Extraction for NLP, Simplified
Kordoni, Valia Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Korkontzelos, Yannis Identifying Content Types of Messages Related to Open Source Software Projects
Ensemble Classification of Grants using LDA-based Features
Kornai, Andras Detecting Optional Arguments of Verbs
Korpusik, Mandy Corpus for Customer Purchase Behavior Prediction in Social Media
Köster, Norman How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Koto, Fajri A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization
Kousidis, Spyros DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
Koutsakis, Polychronis Affective Lexicon Creation for the Greek Language
Koutsombogera, Maria Multimodal Resources for Human-Robot Communication Modelling
Kováříková, Dominika SYN2015: Representative Corpus of Contemporary Written Czech
Kovář, Vojtěch Finding Definitions in Large Corpora with Sketch Engine
Krause, Sebastian Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory
TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns
Relation- and Phrase-level Linking of FrameNet with Sar-graphs
Krause, Thomas corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic Corpora
Kraut, Robert Edit Categories and Editor Role Identification in Wikipedia
Krejčová, Ema Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
VPS-GradeUp: Graded Decisions on Usage Patterns
Křen, Michal SYN2015: Representative Corpus of Contemporary Written Czech
Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Krenn, Brigitte The OFAI Multi-Modal Task Description Corpus
Krieg-Holz, Ulrike CodE Alltag: A German-Language E-Mail Corpus
Krilavičius, Tomas NLP Infrastructure for the Lithuanian Language
Krisch, Jennifer A Lexical Resource for the Identification of “Weak Words” in German Specification Documents
Krishnaswamy, Nikhil VoxML: A Visualization Modeling Language
Kríž, Vincent Czech Legal Text Treebank 1.0
Krome, Sabine A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Krstev, Cvetana Rule-based Automatic Multi-word Term Extraction and Lemmatization
Kruschwitz, Udo The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference.
Towards a Corpus of Violence Acts in Arabic Social Media
Kuhlmann, Marco Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Kuhn, Jonas Learning from Within? Comparing PoS Tagging Approaches for Historical Text
IMS HotCoref DE: A Data-driven Co-reference Resolver for German
Kuhnle, Alexander Resources for building applications with Dependency Minimal Recursion Semantics
Kulick, Seth Rapid Development of Morphological Analyzers for Typologically Diverse Languages
Ku, Lun-Wei ANTUSD: A Large Chinese Sentiment Dictionary
Kummert, Franz How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Kunz, Kerstin Anna From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of Discourse
Kuo, Chung-Lun Subtask Mining from Search Query Logs for How-Knowledge Acceleration
Kupietz, Marc KorAP Architecture ― Diving in the Deep Sea of Corpus Data
Kuras, Christoph Features for Generic Corpus Querying
Kurfalı, Murathan A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability
Kurfürst, Dennis Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus
Kurohashi, Sadao Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons
ASPEC: Asian Scientific Paper Excerpt Corpus
Kurtic, Emina What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Kutuzov, Andrey Neural Embedding Language Models in Semantic Clustering of Web Search Results
Kuvač Kraljević, Jelena Croatian Error-Annotated Corpus of Non-Professional Written Language
Kuzmenko, Elizaveta Neural Embedding Language Models in Semantic Clustering of Web Search Results
Kyaw Thu, Ye Introducing the Asian Language Treebank (ALT)
Kyuseva, Maria Typology of Adjectives Benchmark for Compositional Distributional Models

 

L
Laaridh, Imed Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech
The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Labaka, Gorka Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation
Lachler, Jordan Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida
Lafourcade, Mathieu Semantic Relation Extraction with Semantic Patterns Experiment on Radiology Reports
Lailler, Carole Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks
Lai, Mirko Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous
Lam, Sam Syllable based DNN-HMM Cantonese Speech to Text System
Lancelot, Renaud Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
Landeau, Anaïs Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks
Lane, Caoilfhionn IRIS: English-Irish Machine Translation System
Lanfrey, Damien NLP and Public Engagement: The Case of the Italian School Reform
Langlais, Phillippe WikiCoref: An English Coreference-annotated Corpus of Wikipedia Articles
Lanser, Bettina Crowdsourcing Ontology Lexicons
Laparra, Egoitz The Event and Implied Situation Ontology (ESO): Application and Evaluation
A Multilingual Predicate Matrix
Laprie, Yves The IFCASL Corpus of French and German Non-native and Native Read Speech
Lapshinova-Koltunski, Ekaterina From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of Discourse
Laur, Sven EstNLTK - NLP Toolkit for Estonian
Lawrence, John A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive Issues
Lazic, Biljana Rule-based Automatic Multi-word Term Extraction and Lemmatization
Lebani, Gianluca LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon
Lecouteux, Benjamin CirdoX: an on/off-line multisource speech and sound analysis software
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Le, Dieu-Thu Construction and Analysis of a Large Vietnamese Text Corpus
Lee, Annie Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus
Lee, John An Annotated Corpus of Direct Speech
A Dependency Treebank of the Chinese Buddhist Canon
Lefeuvre-Halftermeyer, Anaïs Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Lefever, Els A Classification-based Approach to Economic Event Detection in Dutch News Text
Exploring the Realization of Irony in Twitter Data
Lefevre, Fabrice Automatic Corpus Extension for Data-driven Natural Language Generation
Léger, Serge Discriminating Similar Languages: Evaluations and Explorations
Legou, Thierry The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Le, Ha Challenges and Solutions for Consistent Annotation of Vietnamese Treebank
Leichsenring, Christian How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Lejeune, Gaël Ambiguity Diagnosis for Terms in Digital Humanities
Lenci, Alessandro LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon
Italian VerbNet: A Construction-based Approach to Italian Verb Classification
Nine Features in a Random Forest to Learn Taxonomical Semantic Relations
What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets
Evaluating Context Selection Strategies to Build Emotive Vector Space Models
Lendvai, Piroska Monolingual Social Media Datasets for Detecting Contradiction and Entailment
Leonhard, Matthias A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices
Leser, Ulf SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German
Lesnikova, Tatiana Cross-lingual RDF Thesauri Interlinking
Letard, Vincent Purely Corpus-based Automatic Conversation Authoring
Levchik, Anatolii Creating a General Russian Sentiment Lexicon
Levin, Lori Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik
Lewis, David Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects
Liakata, Maria Applying Core Scientific Concepts to Context-Based Citation Recommendation
Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
Liao, Wan-Shan Fine-Grained Chinese Discourse Relation Labelling
Liberman, Mark Building Language Resources for Exploring Autism Spectrum Disorders
Libovický, Jindřich Neural Scoring Function for MST Parser
Li, Claire Syllable based DNN-HMM Cantonese Speech to Text System
Liddy, Elizabeth D. EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis
Liebeskind, Chaya A Lexical Resource of Hebrew Verb-Noun Multi-Word Expressions
Lien, John ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
Lier, Florian How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Liew, Jasy Suet Yan EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis
Ligozat, Anne-Laure Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality
Purely Corpus-based Automatic Conversation Authoring
Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource
Li, Junyi Jessy Improving the Annotation of Sentence Specificity
Limburská, Adéla Merging Data Resources for Inflectional and Derivational Morphology in Czech
Lim, Chae-Gyun Korean TimeML and Korean TimeBank
Li, Minglei Emotion Corpus Construction Based on Selection from Hashtags
Syllable based DNN-HMM Cantonese Speech to Text System
Lin, Donghui Towards a Language Service Infrastructure for Mobile Environments
Lison, Pierre OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles
Listenmaa, Inari Analysing Constraint Grammars with a SAT-solver
List, Johann-Mattis Concepticon: A Resource for the Linking of Concept Lists
Littell, Patrick Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik
Little, Alexa EasyTree: A Graphical Tool for Dependency Tree Annotation
Liu, Hongfang Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events
On Developing Resources for Patient-level Information Retrieval
Liu, Kris Coordinating Communication in the Wild: The Artwalk Dialogue Corpus of Pedestrian Navigation and Mobile Referential Communication
A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character
A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs
Liu, Lin Language Resource Citation: the ISLRN Dissemination and Further Developments
The ELRA License Wizard
New Developments in the LRE Map
Liu, Qun ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool
Automatic Construction of Discourse Corpora for Dialogue Translation
Liu, Ting The Validation of MRCPD Cross-language Expansions on Imageability Ratings
ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
Liu, Wuying How does Dictionary Size Influence Performance of Vietnamese Word Segmentation?
Liu, Yang A Bilingual Discourse Corpus and Its Applications
Li, Wenjie Emotion Corpus Construction Based on Selection from Hashtags
Li, Xuansong Uzbek-English and Turkish-English Morpheme Alignment Corpora
Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Liyanapathirana, Jeevanthi Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation
Ljubešić, Nikola Croatian Error-Annotated Corpus of Non-Professional Written Language
New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
Corpus-Based Diacritic Restoration for South Slavic Languages
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
Llewellyn, Clare Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations
Llozhi, Lorena SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
Loáiciga, Sharid Discontinuous Verb Phrases in Parsing and Machine Translation of English and German
Löfberg, Laura Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Logacheva, Varvara MARMOT: A Toolkit for Translation Quality Estimation at the Word Level
Phrase Level Segmentation and Labelling of Machine Translation Errors
Loginova Clouet, Elizaveta Ubuntu-fr: A Large and Open Corpus for Multi-modal Analysis of Online Written Conversations
Lojka, Martin An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
Long, Yunfei Emotion Corpus Construction Based on Selection from Hashtags
Lopes, Carla The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation
Lopes, José The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
Lopez, Cédric Encoding Adjective Scales for Fine-grained Resources
Lopez de Lacalle, Maddalen A Multilingual Predicate Matrix
Lopez de Lacalle, Oier Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models
Losnegaard, Gyri Smørdal MWEs in Treebanks: From Survey to Guidelines
NorGramBank: A ‘Deep’ Treebank for Norwegian
PARSEME Survey on MWE Resources
Lossio-Ventura, Juan Antonio Automatic Biomedical Term Polysemy Detection
Loudcher, Sabine Hypergraph Modelization of a Syntactically Annotated English Wikipedia Dump
Loukachevitch, Natalia Creating a General Russian Sentiment Lexicon
Louka, Katerina The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
Lovick, Olga The Alaskan Athabascan Grammar Database
Lowe, John B. A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C)
Loza Mencía, Eneldo Medical Concept Embeddings via Labeled Background Corpora
Lubis, Nurul Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
Lucisano, Pietro CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence
Luecking, Andy TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics
Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus
Lu, Jing Event Coreference Resolution with Multi-Pass Sieves
Lukin, Stephanie PersonaBank: A Corpus of Personal Narratives and Their Story Intention Graphs
Lundkvist, Peter SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
Luo, Wentao Extending Monolingual Semantic Textual Similarity Task to Multiple Cross-lingual Settings
Lupu, Mihai Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation
Lu, Qin Nine Features in a Random Forest to Learn Taxonomical Semantic Relations
Syllable based DNN-HMM Cantonese Speech to Text System
What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets
Lusicky, Vesna Providing a Catalogue of Language Resources for Commercial Users
Lu, Yanan Multi-prototype Chinese Character Embedding
Luz, Saturnino The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus
Lyding, Verena Design and Development of the MERLIN Learner Corpus Platform
Lyse, Gunn Inger NorGramBank: A ‘Deep’ Treebank for Norwegian

 

M
Maamouri, Mohamed Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Machado, Gabriel A Sequence Model Approach to Relation Extraction in Portuguese
Maciejewski, Matthew New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification
Mackaness, William The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes
Maegaard, Bente Providing a Catalogue of Language Resources for Commercial Users
Magnani, Romain Ecological Gestures for HRI: the GEE Corpus
Magnini, Bernardo Acquiring Opposition Relations among Italian Verb Senses using Crowdsourcing
Magnolini, Simone Acquiring Opposition Relations among Italian Verb Senses using Crowdsourcing
Maharjan, Nabin SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores
Mahlow, Cerstin C-WEP―Rich Annotated Collection of Writing Errors by Professionals
Maier, Wolfgang An Arabic-Moroccan Darija Code-Switched Corpus
Makrai, Márton Filtering Wiktionary Triangles by Linear Mbetween Distributed Word Models
Maks, Isa GRaSP: A Multilayered Annotation Scheme for Perspectives
Malchanau, Andrei Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
The DialogBank
Malcuori, Marisa Factuality Annotation and Learning in Spanish Texts
Maldonado, Alfredo Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects
Malmasi, Shervin Discriminating Similar Languages: Evaluations and Explorations
Modeling Language Change in Historical Corpora: The Case of Portuguese
Mamede, Nuno metaTED: a Corpus of Metadiscourse for Spoken Language
Mamprin, Sara Information structure in the Potsdam Commentary Corpus: Topics
Manishina, Elena Automatic Corpus Extension for Data-driven Natural Language Generation
Mankoff, Robert Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Mannens, Erik FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Manning, Christopher D. Universal Dependencies v1: A Multilingual Treebank Collection
A comparison of Named-Entity Disambiguation and Word Sense Disambiguation
Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks
Manuvinakurike, Ramesh PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
Mapelli, Valérie ELRA Activities and Services
Language Resource Citation: the ISLRN Dissemination and Further Developments
The ELRA License Wizard
Review on the Existing Language Resources for Languages of France
Marcello, Norina Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
Marchi, Erik Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification
Marciniak, Malgorzata TermoPL - a Flexible Tool for Terminology Extraction
Marcu, Daniel Extracting Structured Scholarly Information from the Machine Translation Literature
Mareček, David If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers
Margaretha, Eliza KorAP Architecture ― Diving in the Deep Sea of Corpus Data
Marg, Lena The Trials and Tribulations of Predicting Post-Editing Productivity
Mariani, Joseph The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
A Study of Reuse and Plagiarism in LREC papers
Predictive Modeling: Guessing the NLP Terms of Tomorrow
Martínez Alonso, Héctor The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Martinez Calvo, Adela Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis
Martinez Garcia, Eva TweetMT: A Parallel Microblog Corpus
Martínez-Hinarejos, Carlos-D. Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles
Martinez, Marta Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech
Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis
Martínez Martínez, José Manuel SubCo: A Learner Translation Corpus of Human and Machine Subtitles
Martinez-Romo, Juan A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers
Martin, Fabienne Aspectual Flexibility Increases with Agentivity and Concreteness\\ A Computational Classification Experiment on Polysemous Verbs
Martin, James H. A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C)
Martins de Matos, David SPA: Web-based Platform for easy Access to Speech Processing Modules
Marti, Roland Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility
Marton, Yuval E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic Parses
Massimo, Poesio Towards a Corpus of Violence Acts in Arabic Social Media
Matamala, Anna Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles
Matos, Miguel The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.
Matsubara, Shigeki Correcting Errors in a Treebank Based on Tree Mining
Matsumoto, Yuji Universal Dependencies for Japanese
Construction of an English Dependency Corpus incorporating Compound Function Words
Matsuo, Yoshihiro Name Translation based on Fine-grained Named Entity Recognition in a Single Language
Matsuzaki, Takuya Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency Tests
Matthies, Franz CodE Alltag: A German-Language E-Mail Corpus
UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP Pipelines
Maurel, Denis Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Mauri, Marcel Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus
Maxwell, Mike Selection Criteria for Low Resource Language Programs
May, Jonathan Extracting Structured Scholarly Information from the Machine Translation Literature
Maynard, Diana Challenges of Evaluating Sentiment Analysis Tools on Social Media
GATE-Time: Extraction of Temporal Expressions and Events
Mazo, Hélène ELRA Activities and Services
Mazura, Margaretha Providing a Catalogue of Language Resources for Commercial Users
McCrae, John Philip The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
McDonald, Ryan Universal Dependencies v1: A Multilingual Treebank Collection
Medveď, Marek European Union Language Resources in Sketch Engine
Megyesi, Beata The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis
Mehdad, Yashar Extractive Summarization under Strict Length Constraints
Mehler, Alexander TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics
Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus
Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art
TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields
Meinel, Christoph Punctuation Prediction for Unsegmented Transcript Based on Word Vector
Meißner, Cordula User, who art thou? User Profiling for Oral Corpus Platforms
Melamud, Oren The Negochat Corpus of Human-agent Negotiation Dialogues
Melero, Maite Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
Melese, Michael Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
Mella, Odile The IFCASL Corpus of French and German Non-native and Native Read Speech
Melo, Luis Felipe Ambiguity Diagnosis for Terms in Digital Humanities
Mendes, Amália The COPLE2 corpus: a learner corpus for Portuguese
Mendes, Pablo Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
Mendez, Gonzalo Riddle Generation using Word Associations
Menini, Stefano “Who was Pietro Badoglio?” Towards a QA system for Italian History
Metaxas, Dimitris Detection of Major ASL Sign Types in Continuous Signing For ASL Recognition
Meunier, Christine Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech
The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Meurant, Laurence Modelling a Parallel Corpus of French and French Belgian Sign Language
Meurer, Paul NorGramBank: A ‘Deep’ Treebank for Norwegian
Meurers, Detmar Focus Annotation of Task-based Data: A Comparison of Expert and Crowd-Sourced Annotation in a Reading Comprehension Corpus
Meurs, Marie-Jean SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking
Meusel, Robert A Large DataBase of Hypernymy Relations Extracted from the Web.
Meyer zu Borgsen, Sebastian How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Michelfeit, Jan European Union Language Resources in Sketch Engine
Mihalcea, Rada Building a Dataset for Possessions Identification in Text
Miháltz, Márton Mapping Ontologies Using Ontologies: Cross-lingual Semantic Role Information Transfer
Mihov, Stoyan BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology
Mikulová, Marie Coreference in Prague Czech-English Dependency Treebank
Miličević, Maja A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from Corpora
Miller, Tristan Sense-annotating a Lexical Substitution Data Set with Ubyline
Milosavljević, Milan Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset
Minard, Anne-Lyse MEANTIME, the NewsReader Multilingual Event and Time Corpus
The Event and Implied Situation Ontology (ESO): Application and Evaluation
Minker, Wolfgang A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances
Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition?
Mírovský, Jiří Coreference in Prague Czech-English Dependency Treebank
Searching in the Penn Discourse Treebank Using the PML-Tree Query
Mirzaei, Azadeh Persian Proposition Bank
Mirzaei, Mehrdad The Validation of MRCPD Cross-language Expansions on Imageability Ratings
Misra Sharma, Dipti Coreference Annotation Scheme and Relation Types for Hindi
Mitankin, Petar BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology
Mitkov, Ruslan A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults
Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities
Mitra, Prasenjit Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages
Miwa, Makoto Ensemble Classification of Grants using LDA-based Features
Miyao, Yusuke Universal Dependencies for Japanese
Typed Entity and Relation Annotation on Computer Science Papers
Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Challenges and Solutions for Consistent Annotation of Vietnamese Treebank
Möbius, Bernd The IFCASL Corpus of French and German Non-native and Native Read Speech
Močiariková, Monika Finding Definitions in Large Corpora with Sketch Engine
Modi, Ashutosh InScript: Narrative texts annotated with script information
Moe, Lwin Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)
Moens, Marie-Francine Semi-automatically Alignment of Predicates between Speech and OntoNotes data
Mohammad, Saif A Dataset for Detecting Stance in Tweets
Sentiment Lexicons for Arabic Social Media
Happy Accident: A Sentiment Composition Lexicon for Opposing Polarity Phrases
Mohit, Behrang Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
Mohler, Michael Introducing the LCC Metaphor Datasets
Moisik, Scott Defining and Counting Phonological Classes in Cross-linguistic Segment Databases
Mojica de la Vega, Luis Gerardo Markov Logic Networks for Text Mining: A Qualitative and Empirical Comparison with Integer Linear Programming
Mokaddem, Sidahmed Sentiment Analysis in Social Networks through Topic modeling
Moloodi, Amirsaeid Persian Proposition Bank
Monachini, Monica Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF
LREC as a Graph: People and Resources in a Network
Monceaux, Laura Evaluating Lexical Similarity to build Sentiment Similarity
Moniz, Helena The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
SPA: Web-based Platform for easy Access to Speech Processing Modules
Montcheuil, Grégoire MarsaGram: an excursion in the forests of parsing trees
Montemagni, Simonetta CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence
ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect Atlas
Monti, Johanna PARSEME Survey on MWE Resources
Moore, Andrew Learning Tone and Attribution for Financial Text Mining
Moran, Steven The ACQDIV Database: Min(d)ing the Ambient Language
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Morante, Roser GRaSP: A Multilayered Annotation Scheme for Perspectives
Moreira, André FLAT: Constructing a CLARIN Compatible Home for Language Resources
Morency, Louis-Philippe A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety
Moretti, Giovanni NLP and Public Engagement: The Case of the Italian School Reform
Morey, Mathieu Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus
Morgado da Costa, Luís Wow! What a Useful Extension! Introducing Non-Referential Concepts to Wordnet
Mori, Hiroki Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization
Morin, Emmanuel Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models
Mori, Shinsuke Universal Dependencies for Japanese
Language Resource Addition Strategies for Raw Text Parsing
Wikification for Scriptio Continua
A Japanese Chess Commentary Corpus
Parallel Speech Corpora of Japanese Dialects
Morlane-Hondère, François Identification of Drug-Related Medical Conditions in Social Media
Morros, Ramon The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Mortensen, David R. Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik
Mostafa, Naziba A Machine Learning based Music Retrieval and Recommendation System
Mota, Cristina Port4NooJ v3.0: Integrated Linguistic Resources for Portuguese NLP
Motlani, Raveesh A Finite-State Morphological Analyser for Sindhi
Mott, Justin Parallel Chinese-English Entities, Relations and Events Corpora
Mrabet, Yassine Annotating Named Entities in Consumer Health Questions
Mubarak, Hamdy Farasa: A New Fast and Accurate Arabic Word Segmenter
Arabic to English Person Name Transliteration using Twitter
Mudraya, Olga Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Muischnek, Kadri Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies
Mujadia, Vandan Coreference Annotation Scheme and Relation Types for Hindi
Mújdricza-Maydt, Éva Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data
Müller, Markus Evaluation of the KIT Lecture Translation System
Muller, Philippe Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology
A General Framework for the Annotation of Causality Based on FrameNet
Münch, Stefanie A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation
Murakami, Yohei Constraint-Based Bilingual Lexicon Induction for Closely Related Languages
Murata, Kenta A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research
Murawaki, Yugo Wikification for Scriptio Continua
Muszyńska, Ewa Resources for building applications with Dependency Minimal Recursion Semantics
Müürisep, Kaili Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies
Muzaffar, Sharmin Issues and Challenges in Annotating Urdu Action Verbs on the IMAGACT4ALL Platform
Mykowiecka, Agnieszka TermoPL - a Flexible Tool for Terminology Extraction

 

N
Nabi, Hakim Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context
Nagaoka, Atsushi Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization
Nagata, Ryo Discriminative Analysis of Linguistic Features for Typological Study
Nahli, Ouafae Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF
Nakadai, Kazuhiro Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
Nakaguchi, Takao Towards a Language Service Infrastructure for Mobile Environments
Nakamura, Keisuke Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
Nakamura, Satoshi Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces
Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
Nakazawa, Toshiaki Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons
ASPEC: Asian Scientific Paper Excerpt Corpus
Namer, Fiammetta Giving Lexical Resources a Second Life: Démonette, a Multi-sourced Morpho-semantic Network for French
Nam, Jinseok Medical Concept Embeddings via Labeled Background Corpora
Naskar, Debashis Sentiment Analysis in Social Networks through Topic modeling
Naskar, Sudip Kumar CATaLog Online: Porting a Post-editing Tool to the Web
Näsman, Jesper The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis
Nasr, Alexis DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
Nasution, Arbi Haza Constraint-Based Bilingual Lexicon Induction for Closely Related Languages
Navarretta, Costanza Mirroring Facial Expressions and Emotions in Dyadic Conversations
Navarro, Borja Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation
Navas, Eva A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza
Navigli, Roberto A Large-Scale Multilingual Disambiguation of Glosses
Nawab, Rao Muhammad Adeel Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Urdu Summary Corpus
Nayak, Tapas CATaLog Online: Porting a Post-editing Tool to the Web
Nazar, Rogelio A Taxonomy of Spanish Nouns, a Statistical Algorithm to Generate it and its Implementation in Open Source Code
Neale, Steven QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models
Nedoluzhko, Anna From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of Discourse
Coreference in Prague Czech-English Dependency Treebank
Neergaard, Karl EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs
Database of Mandarin Neighborhood Statistics
Neff, Michael A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives
A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character
A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs
Neidle, Carol Detection of Major ASL Sign Types in Continuous Signing For ASL Recognition
Nemeskey, Dávid Márk Detecting Optional Arguments of Verbs
Nenkova, Ani Improving the Annotation of Sentence Specificity
Neubig, Graham Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces
Neudecker, Clemens An Open Corpus for Named Entity Recognition in Historic Newspapers
Neumann, Stella Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs
Névéol, Aurélie The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine
Neves, Mariana The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine
Nguyen, Kiem-Hieu A Dataset for Open Event Extraction in English
Nguyen, Ngan Challenges and Solutions for Consistent Annotation of Vietnamese Treebank
Nguyen, Ngoc Towards a Language Service Infrastructure for Mobile Environments
Nguyen, Quy Challenges and Solutions for Consistent Annotation of Vietnamese Treebank
Ng, Vincent Event Coreference Resolution with Multi-Pass Sieves
Markov Logic Networks for Text Mining: A Qualitative and Empirical Comparison with Integer Linear Programming
Ng, Vincent T.Y. Syllable based DNN-HMM Cantonese Speech to Text System
Ní Chasaide, Ailbhe Chatbot Technology with Synthetic Voices in the Acquisition of an Endangered Language: Motivation, Development and Evaluation of a Platform for Irish
Ní Chiaráin, Neasa Chatbot Technology with Synthetic Voices in the Acquisition of an Endangered Language: Motivation, Development and Evaluation of a Platform for Irish
Nicolao, Mauro A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus
Niekrasz, John An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in Text
Niemietz, Paula Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs
Nie, Tian Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Nikolić, Boško Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset
Nimb, Sanni The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Niraula, Nobal Bikram SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores
Nisioi, Sergiu Comparing Speech and Text Classification on ICNALE
Using Word Embeddings to Translate Named Entities
A Corpus of Native, Non-native and Translated Texts
Nissim, Malvina Leveraging Native Data to Correct Preposition Errors in Learners' Dutch
Nitoń, Bartłomiej Accessing and Elaborating Walenty - a Valence Dictionary of Polish - via Internet Browser
Nivre, Joakim Universal Dependencies v1: A Multilingual Treebank Collection
The Universal Dependencies Treebank of Spoken Slovenian
Universal Dependencies for Persian
Nixon, Lyndon J.B. A Regional News Corpora for Contextualized Entity Discovery and Linking
Noferesti, Samira Using Data Mining Techniques for Sentiment Shifter Identification
Nordhoff, Sebastian The Alaskan Athabascan Grammar Database
Extracting Interlinear Glossed Text from LaTeX Documents
Nöth, Elmar Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Nouri, Javad A Novel Evaluation Method for Morphological Segmentation
Nouvel, Damien Named Entity Resources - Overview and Outlook
Novák, Attila A New Integrated Open-source Morphological Analyzer for Hungarian
Novák, Michal Coreference in Prague Czech-English Dependency Treebank
Nugues, Pierre WIKIPARQ: A Tabulated Wikipedia Resource Using the Parquet Format

 

O
Obeid, Ossama Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
Oberlander, Jon Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations
Obradovic, Ivan Rule-based Automatic Multi-word Term Extraction and Lemmatization
O'Brien, Sharon Evaluating the Impact of Light Post-Editing on Usability
O'Daniel, Bridget Improving the Annotation of Sentence Specificity
Odijk, Jan CLARIAH in the Netherlands
Ó Droighneáin, Eoin IRIS: English-Irish Machine Translation System
Oellrich, Anika Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
Oepen, Stephan Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Offersgaard, Lene Facilitating Metadata Interoperability in CLARIN-DK
Oflazer, Kemal Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
Ohta, Tomoko Typed Entity and Relation Annotation on Computer Science Papers
Ohya, Kazushi Data Formats and Management Strategies from the Perspective of Language Resource Producers ― Personal Diachronic and Social Synchronic Data Sharing ―
Okanoya, Kazuo Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication Corpus
Okuno, Hiroshi G. Parallel Speech Corpora of Japanese Dialects
Okur, Eda Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings
Olsen, Sussi Providing a Catalogue of Language Resources for Commercial Users
The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Olsson, Fredrik The Gavagai Living Lexicon
Onaindia, Eva Sentiment Analysis in Social Networks through Topic modeling
Onambele, Christophe Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
Oostdijk, Nelleke Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans
Oramas, Sergio ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain
Orasmaa, Siim EstNLTK - NLP Toolkit for Estonian
Oravecz, Csaba A New Integrated Open-source Morphological Analyzer for Hungarian
O'Regan, Jim Privacy Issues in Online Machine Translation Services - European Perspective
Orizu, Udochukwu Detecting Expressions of Blame or Praise in Text
Ortiz Rojas, Sergio Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
Osella, Michele FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Osenova, Petya QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
MWEs in Treebanks: From Survey to Guidelines
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Ostermann, Simon InScript: Narrative texts annotated with script information
Otegi, Arantxa QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Otrusina, Lubomir WTF-LOD - A New Resource for Large-Scale NER Evaluation
Outahajala, Mohamed Using a Small Lexicon with CRFs Confidence Measure to Improve POS Tagging Accuracy
Øvrelid, Lilja Universal Dependencies for Norwegian
Özateş, Şaziye Betül Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization
Özbal, Gözde PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
Özgür, Arzucan Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization
Segmenting Hashtags using Automatically Created Training Data
Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings
Özsoy, Ayşe Sumru BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
Ozturel, Adnan Annotating Topic Development in Information Seeking Queries

 

P
Pääkkönen, Tuula Measuring Lexical Quality of a Historical Finnish Newspaper Collection ― Analysis of Garbled OCR Data with Basic Language Technology Tools and Means
Paetzold, Gustavo Benchmarking Lexical Simplification Systems
Paikens, Pēteris Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Pajkossy, Katalin The hunvec framework for NN-CRF-based sequential tagging
Palmér, Anne The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis
Palmer, Martha Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Comprehensive and Consistent PropBank Light Verb Annotation
A Proposition Bank of Urdu
Palmero Aprosio, Alessio PreMOn: a Lemon Extension for Exposing Predicate Models as Linked Data
Palogiannidi, Elisavet The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
Affective Lexicon Creation for the Greek Language
Palotti, Joao Building Evaluation Datasets for Consumer-Oriented Information Retrieval
Pal, Santanu CATaLog Online: Porting a Post-editing Tool to the Web
Panchenko, Alexander Best of Both Worlds: Making Word Sense Embeddings Interpretable
Pan, Jeff Passing a USA National Bar Exam: a First Corpus for Experimentation
Papavassiliou, Vassilis Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories
Pa Pa, Win Introducing the Asian Language Treebank (ALT)
Paperno, Denis Typology of Adjectives Benchmark for Compositional Distributional Models
Pappu, Aasish Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Paramita, Monica What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Pardelli, Gabriella Two Decades of Terminology: European Framework Programmes Titles
LREC as a Graph: People and Resources in a Network
Pareja-Lora, Antonio The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Pareti, Silvia PARC 3.0: A Corpus of Attribution Relations
Annotating Topic Development in Information Seeking Queries
Parish-Morris, Julia Building Language Resources for Exploring Autism Spectrum Disorders
Parker, Jonathan A Semantically Compositional Annotation Scheme for Time Normalization
Park, Joonsuk A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive Issues
Park, SoHyun Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus
Paroubek, Patrick A Study of Reuse and Plagiarism in LREC papers
Predictive Modeling: Guessing the NLP Terms of Tomorrow
Parra Escartín, Carla PARSEME Survey on MWE Resources
Parvizi, Artemis Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse
Pasha, Arfath SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
Passaro, Lucia C. Evaluating Context Selection Strategies to Build Emotive Vector Space Models
Passarotti, Marco Differentia compositionem facit. A Slower-Paced and Reliable Parser for Latin
Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
Patti, Viviana Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola
Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous
Paulheim, Heiko A Large DataBase of Hypernymy Relations Extracted from the Web.
Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
Pawar, Dipawesh Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi
Pedersen, Bolette The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Pedersen, Ted Age and Gender Prediction on Health Forum Data
Peldszus, Andreas Parallel Discourse Annotations on a Corpus of Short Texts
Pelemans, Joris SCALE: A Scalable Language Engineering Toolkit
Pelletier, Francis Jeffry A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon
Perdigão, Fernando The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation
Pereira Lopes, Gabriel First Steps Towards Coverage-Based Sentence Alignment
Pereira, Rita QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Pérez, Naiara Exploiting a Large Strongly Comparable Corpus
Perez, Walter Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish
Perret, Jérémy Parallel Discourse Annotations on a Corpus of Short Texts
Pershina, Maria Entity Linking with a Paraphrase Flavor
Persson, Per The Gavagai Living Lexicon
Pessentheiner, Hannes AMISCO: The Austrian German Multi-Sensor Corpus
Petasis, Georgios CLARIN-EL Web-based Annotation Tool
Peters, Wim Legal Text Interpretation: Identifying Hohfeldian Relations from Text
Petkevič, Vladimír SYN2015: Representative Corpus of Contemporary Written Czech
Petmanson, Timo EstNLTK - NLP Toolkit for Estonian
Petrov, Slav Universal Dependencies v1: A Multilingual Treebank Collection
Petukhova, Volha Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification
The DialogBank
Piao, Scott Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Pichler, Thomas AMISCO: The Austrian German Multi-Sensor Corpus
Pietquin, Olivier MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP
Pilán, Ildikó SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners
Pillot-Loiseau, Claire The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Pincus, Eli Towards Automatic Identification of Effective Clues for Team Word-Guessing Games
Pinkal, Manfred A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
InScript: Narrative texts annotated with script information
A Crowdsourced Database of Event Sequence Descriptions for the Acquisition of High-quality Script Knowledge
Pinnis, Mārcis Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian
Pipatsrisawat, Knot TTS for Low Resource Languages: A Bangla Synthesizer
Piper, Andrew Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel
Piperidis, Stelios Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories
Plancq, Clément More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic Parsing
Plank, Barbara TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling
Plu, Julien Context-enhanced Adaptive Entity Linking
Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
Podlaska, Katarzyna Challenges of Adjective Mapping between plWordNet and Princeton WordNet
Poesio, Massimo ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions
The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference.
Pohling, Marian How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Poibeau, Thierry More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic Parsing
Poignant, Johann Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Poláková, Lucie Searching in the Penn Discourse Treebank Using the PML-Tree Query
Poletto, Cecilia Designing A Long Lasting Linguistic Project: The Case Study of ASIt
Polzehl, Tim Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora
Ponti, Edoardo Maria Differentia compositionem facit. A Slower-Paced and Reliable Parser for Latin
Ponzetto, Simone Paolo A Large DataBase of Hypernymy Relations Extracted from the Web.
Pool, Jonathan The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
Popel, Martin QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Tools and Guidelines for Principled Machine Translation Development
Popescu-Belis, Andrei Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation
Popescu, Octavian Corpora for Learning the Mutual Relationship between Semantic Relatedness and Textual Entailment
Popescu, Vladimir Language Resource Citation: the ISLRN Dissemination and Further Developments
The ELRA License Wizard
New Developments in the LRE Map
Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities
ELRA Activities and Services
Popović, Maja PE2rr Corpus: Manual Error Annotation of Automatically Pre-annotated MT Post-edits
Tools and Guidelines for Principled Machine Translation Development
Poppek, Johanna Marie A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon
Pörner, Nina The BAS Speech Data Repository
BAS Speech Science Web Services - an Update of Current Developments
Portet, François CirdoX: an on/off-line multisource speech and sound analysis software
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Postma, Marten Addressing the MFS Bias in WSD systems
Potamianos, Alexandros The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
Crossmodal Network-Based Distributional Semantic Models
Cognitively Motivated Distributional Representations of Meaning
Affective Lexicon Creation for the Greek Language
Pouchoulin, Gilles The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
Pouliquen, Bruno The United Nations Parallel Corpus v1.0
Pouli, Vasiliki Linguistically Inspired Language Model Augmentation for MT
Povlsen, Claus Providing a Catalogue of Language Resources for Commercial Users
Prabhakaran, Vinodkumar A Corpus of Wikipedia Discussions: Over the Years, with Topic, Power and Gender Labels
Prange, Jakob A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Preoţiuc-Pietro, Daniel An Empirical Exploration of Moral Foundations Theory in Partisan News Sources
Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection
Pretkalniņa, Lauma Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Prévot, Laurent 4Couv: A New Treebank for French
LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon
A CUP of CoFee: A large Collection of feedback Utterances Provided with communicative function annotations
Procházka, Pavel SYN2015: Representative Corpus of Contemporary Written Czech
Proença, Jorge The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation
Proisl, Thomas A Proposal for a Part-of-Speech Tagset for the Albanian Language
Prokopidis, Prokopis Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories
Prys, Delyth Cysill Ar-lein: A Corpus of Written Contemporary Welsh Compiled from an On-line Spelling and Grammar Checker
Prys, Gruffudd Cysill Ar-lein: A Corpus of Written Contemporary Welsh Compiled from an On-line Spelling and Grammar Checker
Puolakainen, Tiina Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies
Pustejovsky, James VoxML: A Visualization Modeling Language
The Language Application Grid and Galaxy
Pyysalo, Sampo Universal Dependencies v1: A Multilingual Treebank Collection
Typed Entity and Relation Annotation on Computer Science Papers

 

Q
Qin, Lu Emotion Corpus Construction Based on Selection from Hashtags
Qiu, Zhengwei Using SMT for OCR Error Correction of Historical Texts
Quasthoff, Uwe Features for Generic Corpus Querying
Construction and Analysis of a Large Vietnamese Text Corpus
Quénot, Georges The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Querido, Andreia Use of Domain-Specific Language Resources in Machine Translation
CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese
Bootstrapping a Hybrid MT System to a New Language Pair
Que, Roger Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms
Quilitzsch, Anya Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project
Quispersaravia, Andre Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish
Quochi, Valeria Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
QasemiZadeh, Behrang The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition Methods

 

R
Rabadan, Adrian Improving Information Extraction from Wikipedia Texts using Basic English
Rabinovich, Ella A Corpus of Native, Non-native and Translated Texts
Rademaker, Alexandre Semantic Links for Portuguese
Radev, Dragomir Extractive Summarization under Strict Length Constraints
Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization
Raganato, Alessandro A Large-Scale Multilingual Disambiguation of Glosses
Ramadier, Lionel Semantic Relation Extraction with Semantic Patterns Experiment on Radiology Reports
Rambelli, Giulia LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon
Rambow, Owen A Corpus of Wikipedia Discussions: Over the Years, with Topic, Power and Gender Labels
Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
Ramisch, Carlos mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing
DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
Ramsay, Allan Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping
Ramshaw, Lance Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Rauschenberger, Maria A Language Resource of German Errors Written by Children with Dyslexia
Rauzy, Stéphane MarsaGram: an excursion in the forests of parsing trees
4Couv: A New Treebank for French
Ravenscroft, James Applying Core Scientific Concepts to Context-Based Citation Recommendation
Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
Ray, Jessica Operational Assessment of Keyword Search on Oral History
Rayner, Manny A Shared Task for Spoken CALL?
Rayson, Paul Learning Tone and Attribution for Financial Text Mining
Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
UPPC - Urdu Paraphrase Plagiarism Corpus
OSMAN ― A Novel Arabic Readability Metric
Read, Jonathon A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations
Real, Livy Semantic Links for Portuguese
Rebollo, Miguel Sentiment Analysis in Social Networks through Topic modeling
Recski, Gábor Building Concept Graphs from Monolingual Dictionary Entries
Detecting Optional Arguments of Verbs
Reddy, Dinesh Crowdsourced Corpus with Entity Salience Annotations
Redling, Benjamin CodE Alltag: A German-Language E-Mail Corpus
Reed, Chris Corpus Resources for Dispute Mediation Discourse
A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive Issues
Regueira, Xose Luis Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech
Rehbein, Ines Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks
Rehm, Georg The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources
Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities
Reichel, Uwe The BAS Speech Data Repository
BAS Speech Science Web Services - an Update of Current Developments
Reichel, Uwe D. A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance
Rekabsaz, Navid Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation
Rello, Luz CASSAurus: A Resource of Simpler Spanish Synonyms
A Language Resource of German Errors Written by Children with Dyslexia
Remus, Steffen Domain-Specific Corpus Expansion with Focused Webcrawling
Renals, Steve Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
Renau, Irene A Taxonomy of Spanish Nouns, a Statistical Algorithm to Generate it and its Implementation in Open Source Code
Rendeiro, Nuno Use of Domain-Specific Language Resources in Machine Translation
Bootstrapping a Hybrid MT System to a New Language Pair
Renner-Westermann, Heike Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data
Reynaert, Martin Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora
OCR Post-Correction Evaluation of Early Dutch Books Online - Revisited
Rey-Villamizar, Nicolas Age and Gender Prediction on Health Forum Data
Ribeiro, Eugénio SPA: Web-based Platform for easy Access to Speech Processing Modules
Ribeiro, Ricardo SPA: Web-based Platform for easy Access to Speech Processing Modules
Ribes-Lafoz, María Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation
Ribeyre, Corentin Accurate Deep Syntactic Parsing of Graphs: The Case of French
Riccardi, Giuseppe Multilevel Annotation of Agreement and Disagreement in Italian News Blogs
Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations
Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it?
Richardson, John A Japanese Chess Commentary Corpus
Richart, Cécile Datasets for Aspect-Based Sentiment Analysis in French
Richter, Viktor How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Rieser, Verena The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes
Rigau, German A Comparison of Domain-based Word Polarity Estimation using different Word Embeddings
Addressing the MFS Bias in WSD systems
The Event and Implied Situation Ontology (ESO): Application and Evaluation
A Multilingual Predicate Matrix
Rikters, Matīss Syntax-based Multi-system Machine Translation
Rinaldi, Fabio The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological Factors
Rink, Bryan Introducing the LCC Metaphor Datasets
Rinke, Esther Designing A Long Lasting Linguistic Project: The Case Study of ASIt
Ritchie, Phil FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Rituma, Laura Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Rizzo, Giuseppe Context-enhanced Adaptive Entity Linking
Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
Roberts, Kirk Annotating Logical Forms for EHR Questions
Annotating Named Entities in Consumer Health Questions
Roche, Mathieu Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
Integration of Lexical and Semantic Knowledge for Sentiment Analysis in SMS
Automatic Biomedical Term Polysemy Detection
Rodrigues, Filipe Can Topic Modelling benefit from Word Sense Information?
Rodríguez, Alejandro Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects
Rodríguez, Eric Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects
Rodríguez-Fernández, Sara Example-based Acquisition of Fine-grained Collocation Resources
Rodriguez-Ferreira, Teresa Improving Information Extraction from Wikipedia Texts using Basic English
Rodriguez, Kepa ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions
Rodriguez, Laritza Annotating Named Entities in Consumer Health Questions
Roesiger, Ina IMS HotCoref DE: A Data-driven Co-reference Resolver for German
SciCorp: A Corpus of English Scientific Articles Annotated for Information Status Analysis
Roesner, Immer A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices
Rohwer, Richard An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in Text
Romary, Laurent TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation
Ronzano, Francesco A Multi-Layered Annotated Corpus of Scientific Papers
What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis
Rosá, Aiala Factuality Annotation and Learning in Spanish Texts
Rosenberg, Andrew RankDCG: Rank-Ordering Evaluation Measure
Rosén, Victoria MWEs in Treebanks: From Survey to Guidelines
NorGramBank: A ‘Deep’ Treebank for Norwegian
Rospocher, Marco The Event and Implied Situation Ontology (ESO): Application and Evaluation
PreMOn: a Lemon Extension for Exposing Predicate Models as Linked Data
Rossato, Solange FABIOLE, a Speech Database for Forensic Speaker Comparison
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Rosset, Sophie Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality
Purely Corpus-based Automatic Conversation Authoring
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Managing Linguistic and Terminological Variation in a Medical Dialogue System
Generating Task-Pertinent sorted Error Lists for Speech Recognition
Named Entity Resources - Overview and Outlook
Rossini Favretti, Rema Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
Rosso, Paolo Using a Small Lexicon with CRFs Confidence Measure to Improve POS Tagging Accuracy
Roth, Dan EDISON: Feature Extraction for NLP, Simplified
Roux, Justus South African National Centre for Digital Language Resources
Roziewski, Szymon LanguageCrawl: A Generic Tool for Building Language Models Upon Common-Crawl
Rozis, Roberts Collecting Language Resources for the Latvian e-Government Machine Translation Platform
Ruan, Chong Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia
Rubens, Neil Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction
Rudnicka, Ewa Challenges of Adjective Mapping between plWordNet and Princeton WordNet
Rudnicky, Alexander AppDialogue: Multi-App Dialogues for Intelligent Assistants
Rudra, Koustav Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments
Ruiz, Pablo More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic Parsing
Ruppenhofer, Josef Effect Functors for Opinion Inference
Russell, Martin A Shared Task for Spoken CALL?
Russo, Irene Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
LREC as a Graph: People and Resources in a Network
Rus, Vasile SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores
DT-Neg: Tutorial Dialogues Annotated for Negation Scope and Focus in Context
Ruths, Derek Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel
Rychlik, Piotr TermoPL - a Flexible Tool for Terminology Extraction
Rychlý, Pavel Finding Definitions in Large Corpora with Sketch Engine
Ryzhova, Daria Typology of Adjectives Benchmark for Compositional Distributional Models
Rzymski, Christoph Enriching TimeBank: Towards a more precise annotation of temporal relations in a text

 

S
Sabetghadam, Serwah Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation
Sack, Harald Crowdsourced Corpus with Entity Salience Annotations
Sadamitsu, Kugatsu Name Translation based on Fine-grained Named Entity Recognition in a Single Language
Sadeque, Farig Age and Gender Prediction on Health Forum Data
Saerens, Marco Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis
Saggion, Horacio A Multi-Layered Annotated Corpus of Scientific Papers
ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain
What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis
Saha, Shyamasree Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
Sahlgren, Magnus The Gavagai Living Lexicon
Saidi, Arash Constructing a Norwegian Academic Wordlist
Saint-Dizier, Patrick Argument Mining: the Bottleneck of Knowledge and Language Resources
LELIO: An Auto-Adaptative System to Acquire Domain Lexical Knowledge in Technical Texts
Error Typology and Remediation Strategies for Requirements Written in English by Non-Native Speakers
Saito, Itsumi Name Translation based on Fine-grained Named Entity Recognition in a Single Language
Sajous, Franck Wiktionnaire's Wikicode GLAWIfied: a Workable French Machine-Readable Dictionary
Sakaki, Shigeyuki Corpus for Customer Purchase Behavior Prediction in Social Media
Sakti, Sakriani Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
Salameh, Mohammad Sentiment Lexicons for Arabic Social Media
Salchak, Aelita A Finite-state Morphological Analyser for Tuvan
Salden, Uta Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data
Salesky, Elizabeth Operational Assessment of Keyword Search on Oral History
Salimbajevs, Askars Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian
Salim, Soufian Ubuntu-fr: A Large and Open Corpus for Multi-modal Analysis of Online Written Conversations
Salliau, Frank FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Salloum, Wael SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
Salvetti, Franco A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C)
Samardzic, Tanja ArchiMob - A Corpus of Spoken Swiss German
A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from Corpora
Samier, Quentin Review on the Existing Language Resources for Languages of France
Samih, Younes An Arabic-Moroccan Darija Code-Switched Corpus
Sammons, Mark EDISON: Feature Extraction for NLP, Simplified
Sánchez, Noelia Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation
Sandell, Monica SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
Sanders, Eric Curation of Dutch Regional Dictionaries
Palabras: Crowdsourcing Transcriptions of L2 Speech
Can Tweets Predict TV Ratings?
Sangati, Federico D(H)ante: A New Set of Tools for XIII Century Italian
PARSEME Survey on MWE Resources
Sänger, Mario SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German
Santos, Ana Lúcia CEPLEXicon ― A Lexicon of Child European Portuguese
Santos, Diana QUEMDISSE? Reported speech in Portuguese
Santos, Eddie Antonio Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida
Santos, Fábio Discovering Fuzzy Synsets from the Redundancy in Different Lexical-Semantic Resources
Santus, Enrico Nine Features in a Random Forest to Learn Taxonomical Semantic Relations
What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets
EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs
San Vicente, Iñaki TweetMT: A Parallel Microblog Corpus
Polarity Lexicon Building: to what Extent Is the Manual Effort Worth?
Saralegi, Xabier Polarity Lexicon Building: to what Extent Is the Manual Effort Worth?
Evaluating Translation Quality and CLIR Performance of Query Sessions
Sarasola, Kepa Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation
Sarasola, Xabier A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza
Saraswati, Jaya Synset Ranking of Hindi WordNet
Saratxaga, Ibon A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza
Sarhimaa, Anneli Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
Sasada, Tetsuro Language Resource Addition Strategies for Raw Text Parsing
A Japanese Chess Commentary Corpus
Sasaki, Felix FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Sasa, Yuko Ecological Gestures for HRI: the GEE Corpus
Sassolini, Eva ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect Atlas
Saulīte, Baiba Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Saurí, Roser Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse
Savary, Agata MWEs in Treebanks: From Survey to Guidelines
Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects
PARSEME Survey on MWE Resources
Scarton, Carolina A Reading Comprehension Corpus for Machine Translation Evaluation
Schäfer, Roland CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws
Schang, Emmanuel Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
Scharl, Arno A Regional News Corpora for Contextualized Entity Discovery and Linking
Scheffler, Tatjana Adding Semantic Relations to a Large-Coverage Connective Lexicon of German
Schenner, Mathias Extracting Interlinear Glossed Text from LaTeX Documents
Scherer, Stefan A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety
Scherrer, Yves ArchiMob - A Corpus of Spoken Swiss German
Schiel, Florian The BAS Speech Data Repository
BAS Speech Science Web Services - an Update of Current Developments
Schiffhauer, Birte How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Schlangen, David How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
Schlechtweg, Dominik Exploitation of Co-reference in Distributional Semantics
Schleicher, Thomas Learning Tone and Attribution for Financial Text Mining
Schmidt, Maria A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems
Schmidt-Thieme, Lars Learning Thesaurus Relations from Distributional Features
Schmidt, Thomas User, who art thou? User Profiling for Oral Corpus Platforms
FOLK-Gold ― A Gold Standard for Part-of-Speech-Tagging of Spoken German
Schmitt, Alexander Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition?
Schneider, Nathan Inconsistency Detection in Semantic Annotation
Schneider-Stickler, Berit A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices
Schoen, Anneleen MEANTIME, the NewsReader Multilingual Event and Time Corpus
Scholman, Merel Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks
Scholze-Stubenrecht, Werner A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Schöne, Karin Design and Development of the MERLIN Learner Corpus Platform
Schreitter, Stephanie The OFAI Multi-Modal Task Description Corpus
Schröder, Johannes Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging
Schuller, Björn Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Schulte im Walde, Sabine GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds
Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification
Automatically Generated Affective Norms of Abstractness, Arousal, Imageability and Valence for 350 000 German Lemmas
Schultz, Robert T. Building Language Resources for Exploring Autism Spectrum Disorders
Schultz, Tanja Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging
Schulz, Sarah Learning from Within? Comparing PoS Tagging Approaches for Historical Text
Schulz, Simon How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Schumann, Anne-Kathrin Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific Texts
The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition Methods
Schuschnig, Christian CodE Alltag: A German-Language E-Mail Corpus
Schuster, Sebastian Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks
Schuurman, Ineke AfriBooms: An Online Treebank for Afrikaans
Schwab, Didier A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
Seara, Roberto Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech
Seddah, Djamé Accurate Deep Syntactic Parsing of Graphs: The Case of French
Hard Time Parsing Questions: Building a QuestionBank for French
Sedlák, Michal The Public License Selector: 
Making Open Licensing Easier
Seelig, Laura A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation
Segawa, Shuhei Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese
Segers, Roxane The Event and Implied Situation Ontology (ESO): Application and Evaluation
Segond, Frederique Encoding Adjective Scales for Fine-grained Resources
Seibel, Brandon Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus
Seitner, Julian A Large DataBase of Hypernymy Relations Extracted from the Web.
Sekulić, Ivan VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian
Semenkin, Eugene Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition?
Sepesy Maucec, Mirjam The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource
Seraji, Mojgan Universal Dependencies for Persian
Sergienko, Roman A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances
Serralheiro, António The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.
Serra, Xavier ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain
Servan, Christophe MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP
Sevcikova, Magda Merging Data Resources for Inflectional and Derivational Morphology in Czech
Shaban, Khaled Arabic Corpora for Credibility Analysis
Shafi, Jawad Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Shah, Kashif Creation of comparable corpora for English-{Urdu, Arabic, Persian}
Shahrour, Anas Exploiting Arabic Diacritization for High Quality Automatic Annotation
Shaikh, Samira The Validation of MRCPD Cross-language Expansions on Imageability Ratings
ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
Shamsfard, Mehrnoush Using Data Mining Techniques for Sentiment Shifter Identification
Shan, Muhammad A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances
Sharjeel, Muhammad UPPC - Urdu Paraphrase Plagiarism Corpus
Sharma, Dipti A Finite-State Morphological Analyser for Sindhi
Using lexical and Dependency Features to Disambiguate Discourse Connectives in Hindi
Towards Building Semantic Role Labeler for Indian Languages
A Proposition Bank of Urdu
Sharma, Himanshu Using lexical and Dependency Features to Disambiguate Discourse Connectives in Hindi
Sharoff, Serge MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment
Sheikh, Imran How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News
Shen, Wade Operational Assessment of Keyword Search on Oral History
Sheridan, Páraic Using SMT for OCR Error Correction of Historical Texts
Shi, Huaxing Building A Case-based Semantic English-Chinese Parallel Treebank
Shindo, Hiroyuki Construction of an English Dependency Corpus incorporating Compound Function Words
Shiue, Yow-Ting Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign Language
Shooshan, Sonya Annotating Named Entities in Consumer Health Questions
Shrestha, Niraj Semi-automatically Alignment of Predicates between Speech and OntoNotes data
Shrestha, Prasha Age and Gender Prediction on Health Forum Data
Shukla, Rajita Synset Ranking of Hindi WordNet
Sidarenka, Uladzimir PotTS: The Potsdam Twitter Sentiment Corpus
Sidorov, Maxim Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition?
Sierra, Gerardo Axolotl: a Web Accessible Parallel Corpus for Spanish-Nahuatl
Siklósi, Borbála A New Integrated Open-source Morphological Analyzer for Hungarian
Silva, Guilherme FLAT: Constructing a CLARIN Compatible Home for Language Resources
Silva, João QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese
Silveira, Natalia Universal Dependencies v1: A Multilingual Treebank Collection
Simi, Maria Adapting the TANL tool suite to Universal Dependencies
Simkó, Katalin Ilona A Hungarian Sentiment Corpus Manually Annotated at Aspect Level
Simões, Alberto Enriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary
Simonyi, András Mapping Ontologies Using Ontologies: Cross-lingual Semantic Role Information Transfer
Simov, Kiril QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
Sim Smith, Karin Cohere: A Toolkit for Local Coherence
Simunic, Roman Nino A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon
Singh, Dhirendra Synset Ranking of Hindi WordNet
Multiword Expressions Dataset for Indian Languages
Sitaram, Sunayana Speech Synthesis of Code-Mixed Text
Skadina, Inguna Syntax-based Multi-system Machine Translation
Skadiņš, Raivis Collecting Language Resources for the Latvian e-Government Machine Translation Platform
Skoumalová, Hana SYN2015: Representative Corpus of Contemporary Written Czech
Škrabal, Michal SYN2015: Representative Corpus of Contemporary Written Czech
Skrelin, Pavel CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous Speech
Smith, Daniel Morphological Analysis of Sahidic Coptic for Automatic Glossing
Smrz, Pavel WTF-LOD - A New Resource for Large-Scale NER Evaluation
Šnajder, Jan VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian
Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation
Graph-Based Induction of Word Senses in Croatian
Sobhani, Parinaz A Dataset for Detecting Stance in Tweets
Sobrevilla, Marco Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish
Søgaard, Anders The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Sohn, Sunghwan Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events
Solda Kutzmann, Donatella NLP and Public Engagement: The Case of the Italian School Reform
Soler, Juan A Semi-Supervised Approach for Gender Identification
Solorio, Thamar Age and Gender Prediction on Health Forum Data
Sommerdijk, Bridget Can Tweets Predict TV Ratings?
Song, Zhiyi Parallel Chinese-English Entities, Relations and Events Corpora
Sordo, Mohamed ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain
Sørensen, Nicolai Hartvig The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
Soria, Claudia Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
LREC as a Graph: People and Resources in a Network
Soriano Morales, Edmundo Pavel Hypergraph Modelization of a Syntactically Annotated English Wikipedia Dump
Soroa, Aitor Two Architectures for Parallel Processing of Huge Amounts of Text
Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface
Sosoni, Vilelmini Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Specia, Lucia MARMOT: A Toolkit for Translation Quality Estimation at the Word Level
Phrase Level Segmentation and Labelling of Machine Translation Errors
Benchmarking Lexical Simplification Systems
A Reading Comprehension Corpus for Machine Translation Evaluation
Cohere: A Toolkit for Local Coherence
Spektors, Andrejs Tēzaurs.lv: the Largest Open Lexical Database for Latvian
Speranza, Manuela MEANTIME, the NewsReader Multilingual Event and Time Corpus
Sperber, Matthias Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces
Spitkovsky, Valentin I. A comparison of Named-Entity Disambiguation and Word Sense Disambiguation
Sproat, Richard TTS for Low Resource Languages: A Bangla Synthesizer
Sprugnoli, Rachele “Who was Pietro Badoglio?” Towards a QA system for Italian History
NLP and Public Engagement: The Case of the Italian School Reform
Temporal Information Annotation: Crowd vs. Experts
Srijith, P. K. Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection
Srikumar, Vivek EDISON: Feature Extraction for NLP, Simplified
S, Sreelekha Lexical Resources to Enrich English Malayalam Machine Translation
Štajner, Sanja Use of Domain-Specific Language Resources in Machine Translation
Bootstrapping a Hybrid MT System to a New Language Pair
Stankovic, Ranka Rule-based Automatic Multi-word Term Extraction and Lemmatization
Staš, Ján Evaluation Set for Slovak News Information Retrieval
An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
Stede, Manfred Information structure in the Potsdam Commentary Corpus: Topics
Adding Semantic Relations to a Large-Coverage Connective Lexicon of German
Parallel Discourse Annotations on a Corpus of Short Texts
Steen, Julius Detecting Annotation Scheme Variation in Out-of-Domain Treebanks
Štefanec, Vanja Croatian Error-Annotated Corpus of Non-Professional Written Language
Stefanov, Kalin A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction
Stefas, Mickael Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Steffen, Diana A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Stegen, Florian Mining the Spoken Wikipedia for Speech Data and Beyond
Stein, Achim "LVF-lemon ― Towards a Linked Data Representation of ""Les Verbes français"""
Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View
Steinberger, Josef The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
Steinberger, Ralf Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms
Steiner, Petra Refurbishing a Morphological Database for German
Stenger, Irina Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility
Stent, Amanda Extractive Summarization under Strict Length Constraints
Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Štěpánek, Jan Searching in the Penn Discourse Treebank Using the PML-Tree Query
Stepanov, Evgeny Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations
Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it?
Stevens, Christopher Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
Stoitsis, Giannis FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
Stokowiec, Wojciech LanguageCrawl: A Generic Tool for Building Language Models Upon Common-Crawl
Straka, Milan UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
Merging Data Resources for Inflectional and Derivational Morphology in Czech
Straková, Jana UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
Straňák, Pavel Improving corpus search via parsing
The Public License Selector: 
Making Open Licensing Easier
Stranisci, Marco Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola
Strapparava, Carlo PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
Strassel, Stephanie LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource Languages
The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval
Multi-language Speech Collection for NIST LRE
Selection Criteria for Low Resource Language Programs
Uzbek-English and Turkish-English Morpheme Alignment Corpora
Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Parallel Chinese-English Entities, Relations and Events Corpora
Strik, Helmer A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research
A Shared Task for Spoken CALL?
Strik Lievers, Francesca A lexicon of perception for the identification of synaesthetic metaphors in corpora
Strötgen, Jannik GATE-Time: Extraction of Temporal Expressions and Events
Strzalkowski, Tomek The Validation of MRCPD Cross-language Expansions on Imageability Ratings
ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
Stüker, Sebastian Evaluation of the KIT Lecture Translation System
Suderman, Keith The Language Application Grid and Galaxy
Su, Keh-Yih Building A Case-based Semantic English-Chinese Parallel Treebank
Sukhareva, Maria Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations
Combining Ontologies and Neural Networks for Analyzing Historical Language Varieties. A Case Study in Middle Low German
Şulea, Octavia-Maria Using Word Embeddings to Translate Named Entities
Sumita, Eiichiro Introducing the Asian Language Treebank (ALT)
ASPEC: Asian Scientific Paper Excerpt Corpus
Sundberg, Gunlög SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
Sun, Ming AppDialogue: Multi-App Dialogues for Intelligent Assistants
Surdeanu, Mihai Sieve-based Coreference Resolution in the Biomedical Domain
Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness
Odin's Runes: A Rule Language for Information Extraction
Sutcliffe, Richard Using a Cross-Language Information Retrieval System based on OHSUMED to Evaluate the Moses and KantanMT Statistical Machine Translation Systems
Suzuki, Kanta Correcting Errors in a Treebank Based on Tree Mining
Sylak-Glassman, John Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages
Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms
Szabó, Martina Katalin A Hungarian Sentiment Corpus Manually Annotated at Aspect Level

 

T
Taatgen, Niels Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
Tachibana, Ryuichi Analysis of English Spelling Errors in a Word-Typing Game
Tack, Anaïs SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners
Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource
Tadić, Marko Building the Macedonian-Croatian Parallel Corpus
Takahashi, Fumihiko Parallel Speech Corpora of Japanese Dialects
Takamura, Hiroya Discriminative Analysis of Linguistic Features for Typological Study
Takeuchi, Moe Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations
Tambouratzis, George Linguistically Inspired Language Model Augmentation for MT
Tamburini, Fabio Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
Specialising Paragraph Vectors for Text Polarity Detection
Tamchyna, Aleš Manual and Automatic Paraphrases for MT Evaluation
Tamisier, Thomas Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
Tamres-Rudnicky, Yulian AppDialogue: Multi-App Dialogues for Intelligent Assistants
Tanaka, Takaaki Universal Dependencies for Japanese
Tanev, Hristo Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties
Tannier, Xavier Datasets for Aspect-Based Sentiment Analysis in French
A Dataset for Open Event Extraction in English
Tateisi, Yuka Typed Entity and Relation Annotation on Computer Science Papers
Tavarez, David A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza
Teh, Phoey Lee Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
Teich, Elke The Royal Society Corpus: From Uncharted Data to Corpus
Teisseire, Maguelonne Automatic Biomedical Term Polysemy Detection
Tekiroglu, Serra Sinem PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
Telaar, Dominic Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging
Tellier, Isabelle Domain Adaptation for Named Entity Recognition Using CRFs
Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature
Temnikova, Irina A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults
Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities
SuperCAT: The (New and Improved) Corpus Analysis Toolkit
Applying the Cognitive Machine Translation Evaluation Approach to Arabic
Teng, Zhiyang LibN3L:A Lightweight Package for Neural NLP
Teraoka, Takehiro Metonymy Analysis Using Associative Relations between Words
Terbeh, Naim Vocal Pathologies Detection and Mispronounced Phonemes Identification: Case of Arabic Continuous Speech
Tetreault, Joel Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Tettamanzi, Andrea DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion Mining
Teufel, Simone Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction
Thadani, Kapil Extractive Summarization under Strict Length Constraints
Thater, Stefan Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages
Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario
A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
A Crowdsourced Database of Event Sequence Descriptions for the Acquisition of High-quality Script Knowledge
Thomas, Beverley Ensemble Classification of Grants using LDA-based Features
Thomaschewski, Jörg A Language Resource of German Errors Written by Children with Dyslexia
Thompson, Paul Identifying Content Types of Messages Related to Open Source Software Projects
Thunes, Martha NorGramBank: A ‘Deep’ Treebank for Norwegian
Tian, Ran Question-Answering with Logic Specific to Video Games
Tian, Tian Domain Adaptation for Named Entity Recognition Using CRFs
Tian, Ye DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
Tiedemann, Jörg Finding Alternative Translations in a Large Corpus of Movie Subtitle
OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles
Timmermans, Benjamin The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound Database
Timmons, Tamara On Developing Resources for Patient-level Information Retrieval
Tim, Oates A Gold Standard for Scalar Adjectives
Tjong Kim Sang, Erik Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora
Tkachenko, Alexander EstNLTK - NLP Toolkit for Estonian
Tobin, Richard Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations
Todo, Naoya Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency Tests
Tokunaga, Takenobu Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction
Tolins, Jackson A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character
A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs
Tomlinson, Marc Introducing the LCC Metaphor Datasets
Tonelli, Sara NLP and Public Engagement: The Case of the Italian School Reform
PreMOn: a Lemon Extension for Exposing Predicate Models as Linked Data
Toral, Antonio Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities
TweetMT: A Parallel Microblog Corpus
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
Toussaint, Yannick Ambiguity Diagnosis for Terms in Digital Humanities
Toutanova, Kristina E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic Parses
Tracey, Jennifer LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource Languages
Selection Criteria for Low Resource Language Programs
Uzbek-English and Turkish-English Morpheme Alignment Corpora
Trancoso, Isabel SPA: Web-based Platform for easy Access to Speech Processing Modules
Tratz, Stephen EasyTree: A Graphical Tool for Dependency Tree Annotation
Traum, David Towards a Multi-dimensional Taxonomy of Stories in Dialogue
Towards Automatic Identification of Effective Clues for Team Word-Guessing Games
Trilsbeek, Paul FLAT: Constructing a CLARIN Compatible Home for Language Resources
Trippel, Thorsten Crosswalking from CMDI to Dublin Core and MARC 21
Trips, Carola Syntactic Analysis of Phrasal Compounds in Corpora: a Challenge for NLP Tools
Trmal, Jan New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification
Troncy, Raphael Context-enhanced Adaptive Entity Linking
Trouvain, Juergen The IFCASL Corpus of French and German Non-native and Native Read Speech
Trtovac, Aleksandra Rule-based Automatic Multi-word Term Extraction and Lemmatization
Truneček, Petr SYN2015: Representative Corpus of Contemporary Written Czech
Tsarfaty, Reut Universal Dependencies v1: A Multilingual Treebank Collection
Tsuchiya, Tomoyuki Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation
Tsuruoka, Yoshimasa A Japanese Chess Commentary Corpus
Tsvetanova, Liliya Ecological Gestures for HRI: the GEE Corpus
Tufiș, Dan The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language
Tulkens, Stephan Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource
Tuomisto, Matti Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
Turtle, Howard R. EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis
Tuttle, Siri The Alaskan Athabascan Grammar Database
Tu, Zhaopeng Automatic Construction of Discourse Corpora for Dialogue Translation
Tyers, Francis A Finite-state Morphological Analyser for Tuvan
A Finite-State Morphological Analyser for Sindhi

 

U
Uchimoto, Kiyotaka ASPEC: Asian Scientific Paper Excerpt Corpus
Uematsu, Sumire Universal Dependencies for Japanese
Ueno, Hiroshi Dialogue System Characterisation by Back-channelling Patterns Extracted from Dialogue Corpus
Umata, Ichiro Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations
Ungar, Lyle An Empirical Exploration of Moral Foundations Theory in Partisan News Sources
Unger, Christina Crowdsourcing Ontology Lexicons
Uresova, Zdenka Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Czech Legal Text Treebank 1.0
Uria, Larraitz Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
Urizar, Ruben MEANTIME, the NewsReader Multilingual Event and Time Corpus
Uro, Jim Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context
Uryupina, Olga ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions
Ushiku, Atsushi Language Resource Addition Strategies for Raw Text Parsing
A Japanese Chess Commentary Corpus
Uszkoreit, Hans TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns
Relation- and Phrase-level Linking of FrameNet with Sar-graphs
Utiyama, Masao Introducing the Asian Language Treebank (ALT)
ASPEC: Asian Scientific Paper Excerpt Corpus
Utka, Andrius NLP Infrastructure for the Lithuanian Language
Utsuro, Takehito Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Uva, Antonio “Who was Pietro Badoglio?” Towards a QA system for Italian History
Uzair, Muhammad Urdu Summary Corpus

 

V
Vacher, Michel CirdoX: an on/off-line multisource speech and sound analysis software
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Vaidya, Ashwini A Proposition Bank of Urdu
Valadas Pereira, Rita CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese
Vala, Hardik Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel
Valderrama, Jorge The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
Valenzuela-Escárcega, Marco A. Sieve-based Coreference Resolution in the Biomedical Domain
Odin's Runes: A Rule Language for Information Extraction
Vallet, Félicien Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context
Valli, André DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
Vallmitjana, Jordi Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
Valmaseda, Carlos A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers
Vanallemeersch, Tom Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions
Vandeghinste, Vincent AfriBooms: An Online Treebank for Afrikaans
Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions
van den Bosch, Antal Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Can Tweets Predict TV Ratings?
Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora
van den Heuvel, Henk Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans
Curation of Dutch Regional Dictionaries
A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
van der Goot, Rob The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions
Van der Kuip, Frits A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
van der Sijs, Nicoline Curation of Dutch Regional Dictionaries
Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora
Van der Veen, Bas FLAT: Constructing a CLARIN Compatible Home for Language Resources
Van de Velde, Hans A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
van Erp, Marieke MEANTIME, the NewsReader Multilingual Event and Time Corpus
Context-enhanced Adaptive Entity Linking
Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
Van Eynde, Frank AfriBooms: An Online Treebank for Afrikaans
van Genabith, Josef CATaLog Online: Porting a Post-editing Tool to the Web
Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities
Van hamme, Hugo SCALE: A Scalable Language Engineering Toolkit
van Harmelen, Martin A Corpus of Images and Text in Online News
Van Hee, Cynthia Exploring the Realization of Irony in Twitter Data
van Hout, Roeland Palabras: Crowdsourcing Transcriptions of L2 Speech
Van Huyssteen, Gerhard AfriBooms: An Online Treebank for Afrikaans
Vanin, Aline Adapting an Entity Centric Model for Portuguese Coreference Resolution
van Leeuwen, David A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
van Miltenburg, Emiel The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound Database
Van Niekerk, Daniel AfriBooms: An Online Treebank for Afrikaans
van Son, Chantal GRaSP: A Multilayered Annotation Scheme for Perspectives
MEANTIME, the NewsReader Multilingual Event and Time Corpus
van Stipriaan, René Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora
Varela, Rocio Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech
Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis
Varga, Viktor A Hungarian Sentiment Corpus Manually Annotated at Aspect Level
Vasilaki, Kyriaki Multimodal Resources for Human-Robot Communication Modelling
Vasiļjevs, Andrejs Collecting Language Resources for the Latvian e-Government Machine Translation Platform
Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities
Väyrynen, Jaakko Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms
Vela, Mihaela SubCo: A Learner Translation Corpus of Human and Machine Subtitles
CATaLog Online: Porting a Post-editing Tool to the Web
Velldal, Erik A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations
Vempala, Alakananda Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles
Venturi, Giulia CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence
Verdonik, Darinka The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource
Verhagen, Marc The Language Application Grid and Galaxy
Verhoeven, Ben TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling
Vernerová, Anna Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
VPS-GradeUp: Graded Decisions on Usage Patterns
Versley, Yannick Detecting Annotation Scheme Variation in Out-of-Domain Treebanks
Verstoep, Kees Two Architectures for Parallel Processing of Huge Amounts of Text
Verwimp, Lyan SCALE: A Scalable Language Engineering Toolkit
Vetulani, Grażyna Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0
Vetulani, Zygmunt Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0
Vidra, Jonáš Merging Data Resources for Inflectional and Derivational Morphology in Czech
Vieira, Renata Summ-it++: an Enriched Version of the Summ-it Corpus
Adapting an Entity Centric Model for Portuguese Coreference Resolution
A Sequence Model Approach to Relation Extraction in Portuguese
Vieu, Laure Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology
A General Framework for the Annotation of Causality Based on FrameNet
Vilares, David EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis
Villata, Serena DART: a Dataset of Arguments and their Relations on Twitter
Villavicencio, Aline mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing
Multiword Expressions in Child Language
B2SG: a TOEFL-like Task for Portuguese
VerbLexPor: a lexical resource with semantic roles for Portuguese
Villegas, Marta Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
Villemonte de la Clergerie, Eric Accurate Deep Syntactic Parsing of Graphs: The Case of French
Vincze, Veronika A Hungarian Sentiment Corpus Manually Annotated at Aspect Level
Virone, Daniela Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous
Viswanathan, Akshay The Gavagai Living Lexicon
Viszlay, Peter An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
Vitkutė-Adžgauskienė, Daiva NLP Infrastructure for the Lithuanian Language
Vitvar, Tomas Crowdsourced Corpus with Entity Salience Annotations
Vogel, Stephan Applying the Cognitive Machine Translation Evaluation Approach to Arabic
Voisin, Sylvie Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
Volk, Martin Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus
Volodina, Elena SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners
Volskaya, Nina CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous Speech
Vondřička, Pavel SYN2015: Representative Corpus of Contemporary Written Czech
Vo, Ngoc Phuoc An Corpora for Learning the Mutual Relationship between Semantic Relatedness and Textual Entailment
Von Reihn, Daniel FLAT: Constructing a CLARIN Compatible Home for Language Resources
vor der Brück, Tim TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields
Vossen, Piek Addressing the MFS Bias in WSD systems
GRaSP: A Multilayered Annotation Scheme for Perspectives
The Event and Implied Situation Ontology (ESO): Application and Evaluation
Vulcu, Gabriela Forecasting Emerging Trends from Scientific Literature

 

W
Wachsmuth, Sven How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Wacker, Philippe Providing a Catalogue of Language Resources for Commercial Users
Wagner, Agnieszka Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis
Wagner, Petra How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Wagner, Sven Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data
Waibel, Alex Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces
Evaluation of the KIT Lecture Translation System
Waitelonis, Joerg Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
Wald, Mike Phonetic Inventory for an Arabic Speech Corpus
Walker, Kevin Multi-language Speech Collection for NIST LRE
Walker, Marilyn Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it
PersonaBank: A Corpus of Personal Narratives and Their Story Intention Graphs
A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives
Coordinating Communication in the Wild: The Artwalk Dialogue Corpus of Pedestrian Navigation and Mobile Referential Communication
A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character
A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs
Walker, Martin Learning Tone and Attribution for Financial Text Mining
Wallner, Franziska User, who art thou? User Profiling for Oral Corpus Platforms
Walshe, Brian Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects
Walther, Désirée Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus
Wambacq, Patrick SCALE: A Scalable Language Engineering Toolkit
Wang, Cheng Punctuation Prediction for Unsegmented Transcript Based on Word Vector
Wang, Josiah Cross-validating Image Description Datasets and Evaluation Metrics
Wang, Lin How does Dictionary Size Influence Performance of Vietnamese Word Segmentation?
Wang, Longyue Automatic Construction of Discourse Corpora for Dialogue Translation
Wang, Meikun On Developing Resources for Patient-level Information Retrieval
Wang, Shih-Ming ANTUSD: A Large Chinese Sentiment Dictionary
Wang, Yingying A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs
Wanner, Leo Example-based Acquisition of Fine-grained Collocation Resources
A Semi-Supervised Approach for Gender Identification
Towards Multiple Antecedent Coreference Resolution in Specialized Discourse
Wan, Yan A Machine Learning based Music Retrieval and Recommendation System
Wanzare, Lilian D. A. A Crowdsourced Database of Event Sequence Descriptions for the Acquisition of High-quality Script Knowledge
Wartena, Christian Learning Thesaurus Relations from Distributional Features
Washington, Jonathan A Finite-state Morphological Analyser for Tuvan
Watanabe, Ryoko Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation
Wawer, Aleksander OPFI: A Tool for Opinion Finding in Polish
Way, Andy Using SMT for OCR Error Correction of Historical Texts
Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Using BabelNet to Improve OOV Coverage in SMT
ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool
Automatic Construction of Discourse Corpora for Dialogue Translation
Webber, Bonnie Inconsistency Detection in Semantic Annotation
Weichselbraun, Albert A Regional News Corpora for Contextualized Entity Discovery and Linking
Weigert, Kathrin User, who art thou? User Profiling for Oral Corpus Platforms
Weiner, Jochen Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging
Wellner, Christian A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Wendelstein, Britta Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging
Werner, Steffen A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems
Westpfahl, Swantje User, who art thou? User Profiling for Oral Corpus Platforms
FOLK-Gold ― A Gold Standard for Part-of-Speech-Tagging of Spoken German
White, Michael A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System
Wi, Chung-Il Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events
Wieling, Martijn ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect Atlas
Wierzchoń, Piotr “He Said She Said” ― a Male/Female Corpus of Polish
Wijnhoven, Kars The DialogBank
Wilkens, Rodrigo Multiword Expressions in Child Language
B2SG: a TOEFL-like Task for Portuguese
Wilkinson, Bryan A Gold Standard for Scalar Adjectives
Windhouwer, Menzo FLAT: Constructing a CLARIN Compatible Home for Language Resources
Wintner, Shuly A Corpus of Native, Non-native and Translated Texts
Wisniewski, Guillaume Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on Romanian
Witkowski, Wojciech Challenges of Adjective Mapping between plWordNet and Princeton WordNet
Witt, Andreas Corpus Query Lingua Franca (CQLF)
KorAP Architecture ― Diving in the Deep Sea of Corpus Data
Wolff, Christian Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing
Woliński, Marcin The on-line version of Grammatical Dictionary of Polish
Wong, Tak-sum A Dependency Treebank of the Chinese Buddhist Canon
Wong, Timothy Syllable based DNN-HMM Cantonese Speech to Text System
Wonsever, Dina Factuality Annotation and Learning in Spanish Texts
Spanish Word Vectors from Wikipedia
Wörtwein, Torsten A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety
Wottawa, Jane French Learners Audio Corpus of German Speech (FLACGS)
Wrede, Britta An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes
How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Wrede, Sebastian An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes
How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Wright, Jonathan Multi-language Speech Collection for NIST LRE
Wubben, Sander SatiricLR: a Language Resource of Satirical News Articles
Wu, Stephen Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events
On Developing Resources for Patient-level Information Retrieval
Wu, Xiaofeng ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool
Wu, Yi Improving the Annotation of Sentence Specificity
Wyner, Adam Passing a USA National Bar Exam: a First Corpus for Experimentation
Legal Text Interpretation: Identifying Hohfeldian Relations from Text

 

X
Xia, Fei Annotating and Detecting Medical Events in Clinical Notes
Xiao, Liumingjing Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia
Xiong, Dan Syllable based DNN-HMM Cantonese Speech to Text System
Xue, Nianwen Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
Xu, Feiyu TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns
Relation- and Phrase-level Linking of FrameNet with Sar-graphs
Xu, Hongzhi Database of Mandarin Neighborhood Statistics
Xu, Yong Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts

 

Y
Yaguchi, Manabu ASPEC: Asian Scientific Paper Excerpt Corpus
Yahya, Emad Arabic Corpora for Credibility Analysis
Yamada, Masaru English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting
Yamamoto, Seiichi Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations
Joining-in-type Humanoid Robot Assisted Language Learning System
Yaneva, Victoria A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults
Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities
Yang, An Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia
Yangarber, Roman A Novel Evaluation Method for Morphological Segmentation
Yang, Diyi Edit Categories and Editor Role Identification in Wikipedia
Yang, Haojin Punctuation Prediction for Unsegmented Transcript Based on Word Vector
Yang, Jie LibN3L:A Lightweight Package for Neural NLP
Yang, Yating A Bilingual Discourse Corpus and Its Applications
Yanovich, Polina Detection of Major ASL Sign Types in Continuous Signing For ASL Recognition
Yarowsky, David Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages
Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms
Yates, Amy On Developing Resources for Patient-level Information Retrieval
Yates, Andrew Effects of Sampling on Twitter Trend Detection
Yeh, Eric An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in Text
Yetisgen, Meliha Annotating and Detecting Medical Events in Clinical Notes
Yeung, Chak Yan An Annotated Corpus of Direct Speech
Yilmaz, Emre A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research
A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Yokomori, Daisuke Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation
Yoshino, Koichiro Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
Parallel Speech Corpora of Japanese Dialects
Young, Steve Learning Tone and Attribution for Financial Text Mining
Yuan, Yu MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment
Yu, Hwanjo Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization
Yu, Roy Shing Syllable based DNN-HMM Cantonese Speech to Text System
Yu, Zhiwei If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers
Yvon, François Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts
Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on Romanian

 

Z
Žabokrtský, Zdeněk If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers
Merging Data Resources for Inflectional and Derivational Morphology in Czech
Zaghouani, Wajdi Guidelines and Framework for a Large Scale Arabic Diacritized Corpus
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
Applying the Cognitive Machine Translation Evaluation Approach to Arabic
Zaiß, Melanie Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification
Zampieri, Marcos CATaLog Online: Porting a Post-editing Tool to the Web
Discriminating Similar Languages: Evaluations and Explorations
Modeling Language Change in Historical Corpora: The Case of Portuguese
Zaragoza, Hugo The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
Zarcone, Alessandra A Crowdsourced Database of Event Sequence Descriptions for the Acquisition of High-quality Script Knowledge
Zargayouna, Haifa Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature
Zarghili, Arsalan Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF
Zarrieß, Sina PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
Zasina, Adrian Jan SYN2015: Representative Corpus of Contemporary Written Czech
Zayed, Omnia C4Corpus: Multilingual Web-size Corpus with Free License
Zeman, Daniel Universal Dependencies v1: A Multilingual Treebank Collection
If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers
Towards Comparability of Linguistic Graph Banks for Semantic Parsing
Zesch, Torsten FlexTag: A Highly Flexible PoS Tagging Framework
Zeyrek, Deniz A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability
Zgank, Andrej The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource
Zhang, Jiajun A Bilingual Discourse Corpus and Its Applications
Zhang, Junhao Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia
Zhang, Meishan LibN3L:A Lightweight Package for Neural NLP
Zhang, Wanru Predicting Author Age from Weibo Microblog Posts
Zhang, Xiaojun Automatic Construction of Discourse Corpora for Dialogue Translation
Zhang, Yue Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing
LibN3L:A Lightweight Package for Neural NLP
Multi-prototype Chinese Character Embedding
Zhang, Ziqi JATE 2.0: Java Automatic Term Extraction with Apache Solr
Zhao, Chen Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Zhao, Tiejun Building A Case-based Semantic English-Chinese Parallel Treebank
Zhao, Wenli Improving the Annotation of Sentence Specificity
Zhou, Hao Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing
Zhou, Xi A Bilingual Discourse Corpus and Its Applications
Zhu, Xiaodan A Dataset for Detecting Stance in Tweets
Ziai, Ramon Focus Annotation of Task-based Data: A Comparison of Expert and Crowd-Sourced Annotation in a Reading Comprehension Corpus
Ziemski, Michał The United Nations Parallel Corpus v1.0
Zilio, Leonardo B2SG: a TOEFL-like Task for Portuguese
VerbLexPor: a lexical resource with semantic roles for Portuguese
Zimmerer, Frank The IFCASL Corpus of French and German Non-native and Native Read Speech
Zinn, Claus Crosswalking from CMDI to Dublin Core and MARC 21
Zipser, Florian corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic Corpora
Zi, Wenjie Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus
Zong, Chengqing A Bilingual Discourse Corpus and Its Applications
Zorn, René How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Zrigui, Mounir Vocal Pathologies Detection and Mispronounced Phonemes Identification: Case of Arabic Continuous Speech
Zséder, Attila The hunvec framework for NN-CRF-based sequential tagging
Zubiaga, Arkaitz TweetMT: A Parallel Microblog Corpus
Zuccon, Guido Building Evaluation Datasets for Consumer-Oriented Information Retrieval
Zweigenbaum, Pierre Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality
Identification of Drug-Related Medical Conditions in Social Media
Managing Linguistic and Terminological Variation in a Medical Dialogue System
Zydron, Andrzej Using BabelNet to Improve OOV Coverage in SMT

Powered by ELDA © 2016 ELDA/ELRA