Silver Sponsors
Bronze Sponsors

» Conference Programme » List of accepted papers

List of accepted papers


ID Title Authors
106 Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data Kristiina Jokinen and Mare Koit
108 “You Seem Aggressive!” Monitoring Anger in a Practical Application Felix Burkhardt
110 Fast Labeling and Transcription with the Speechalyzer Toolkit Felix Burkhardt
112 Smooth Sailing for STEVIN Peter Spyns and Elisabeth D’Halleweyn
113 Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel Daniel Stein and Bela Usabaev
114 Building a Basque-Chinese Dictionary by Using English as Pivot Xabier Saralegi, Iker Manterola and Iñaki San Vicente
117 Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition Yi-jie Tang and Hsin-Hsi Chen
120 A High-Quality Web Corpus of Czech Johanka Spoustová and Miroslav Spousta
121 German verb patterns and their implementation in an electronic dictionary Marc Luder
122 Large Scale Semantic Annotation, Indexing and Search at The National Archives Diana Maynard and Mark A. Greenwood
123 QurAna: Corpus of the Quran annotated with Pronominal Anaphora Abdul-Baquee Sharaf and Eric Atwell
124 Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank Christian Scheible and Hinrich Schütze
125 A Multi-layered Reference Corpus for German Sentiment Analysis Simon Clematide, Stefan Gindl, Manfred Klenner, Stefanos Petrakis, Robert Remus, Josef Ruppenhofer, Ulli Waltinger and Michael Wiegand
126 Versatile Speech Databases for High Quality Synthesis for Basque Iñaki Sainz, Daniel Erro, Eva Navas, Inma Hernáez, Jon Sánchez, Ibon Saratxaga and Igor Odriozola
128 TIMEN: An Open Temporal Expression Normalization Resource Hector Llorens, Leon Derczynski, Robert Gaizauskas and Estela Saquete
129 Measuring Interlanguage: Native Language Identification with L1-influence Metrics Julian Brooke and Graeme Hirst
130 TextCoop: A logic-based language for processing discourse structures Patrick SAINT-DIZIER
136 A Gold Standard for Relation Extraction in the Food Domain Michael Wiegand, Benjamin Roth, Eva Lasarcyk, Stephanie Köser and Dietrich Klakow
137 A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures Sarah BOURSE and Patrick SAINT-DIZIER
139 Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring Flore BARCELLINI, Marie GARNIER, Corinne GROSSE and Patrick SAINT-DIZIER
148 WebAnnotator, an Annotation Tool for Web Pages Xavier Tannier
149 Towards a comprehensive open repository of Polish language resources Maciej Ogrodniczuk, Piotr Pęzik and Adam Przepiórkowski
150 Towards an LFG parser for Polish: An exercise in parasitic grammar development Agnieszka Patejuk and Adam Przepiórkowski
151 Constructive Interaction for Talking about Interesting Topics Kristiina Jokinen and Graham Wilcock
152 Corpus-based Referring Expressions Generation Hilder Pereira, Eder Novais, Andre Mariotti and Ivandre Paraboni
153 Portuguese Text Generation from Large Corpora Eder Novais, Ivandre Paraboni and Douglas Silva
154 SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles Volha Petukhova, Rodrigo Agerri, Mark Fishel, Sergio Penkale, Arantza del Pozo, Mirjam Sepesy Maucec, Andy Way, Panayota Georgakopoulou and Martin Volk
159 Collecting, Interpreting and Exploiting Affective Common Sense Knowledge Erik Cambria, Amir Hussain, Catherine Havasi and Chris Eckl
160 Body-conductive acoustic sensors in human-robot communication Panikos Heracleous, Carlos Ishi, Takahiro Miyashita and Norihiro Hagita
161 From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information Lieve Macken, Veronique Hoste, Marielle Leijten and Luuk Van Waes
164 A Comparative Evaluation of Word Sense Disambiguation Algorithms for German Verena Henrich and Erhard Hinrichs
165 SemScribe: Natural Language Generation for Medical Reports Sebastian Varges, Heike Bieler, Manfred Stede, Lukas C. Faulstich, Kristin Irsig and Malik Atalla
166 Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC Erhard Hinrichs and Thomas Zastrow
167 A new dynamic approach for lexical networks evaluation Alain Joubert and Mathieu Lafourcade
168 Grammatical Error Annotation for Korean Learners of Spoken English Hongsuck Seo, Kyusong Lee, Gary Geunbae Lee, Soo-Ok Kweon and Hae-Ri Kim
169 Adaptive Speech Recognition for Intuitive Model‐based Spoken Dialogues Tobias Heinroth, Maximilian Grotz, Florian Nothdurft and Wolfgang Minker
170 First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin Marco Passarotti and Francesco Mambrini
172 Using Parallel and Comparable Data for Abstract Anaphora Resolution in German and English Heike Zinsmeister, Melanie Seiss and Stefanie Dipper
175 A Tool for Extracting Conversational Implicatures Marta Tatu and Dan Moldovan
176 Polaris: Lymba’s Semantic Parsing Dan Moldovan and Eduardo Blanco
177 Light Verb Constructions in the SzegedParalellFX English—Hungarian Parallel Corpus Veronika Vincze
179 Balanced data repository of spontaneous spoken Czech Lucie Válková, Martina Waclawičová and Michal Křen
180 The coding and annotation of multimodal dialogue acts Volha Petukhova and Harry Bunt
181 Learning Categories and their Instances by Contextual Features Antje Schlaf and Robert Remus
182 Textual Characteristics for Language Processing Systems Mathias Bank and Martin Schierle
183 A Survey of Text Mining Architectures and the UIMA Standard Mathias Bank and Martin Schierle
184 A Rule-based Morphological Analyzer for Murrinh-Patha Melanie Seiss
187 Optimising WSD through Active Learning Piek Vossen, Attila Görög, Fons Laan, Maarten Van Gompel, Rubén Izquierdo and Antal Van den Bosch
188 Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies Bruno Cartoni and Thomas Meyer
190 QurSim: A corpus for evaluation of relatedness in short texts Abdul-Baquee Sharaf and Eric Atwell
192 Eye Tracking as a Tool for Machine Translation Error Analysis Sara Stymne, Henrik Danielsson, Sofia Bremin, Hongzhan Hu, Johanna Karlsson, Anna Prytz Lillkull and Martin Wester
193 Automatic annotation of head velocity and acceleration in Anvil Bart Jongejan
194 Representing the Translation Relation in a Bilingual Wordnet Jyrki Niemi and Krister Lindén
196 Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine? Marianna J. Martindale
198 The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish GULSEN ERYIGIT
199 Detecting Reduplication in Videos of American Sign Language Zoya Gavrilov, Stan Sclaroff, Carol Neidle and Sven Dickinson
200 Building a multilingual parallel corpus for human users Alexandr Rosen and Martin Vavřín
201 EmpaTweet: Annotating and Detecting Emotions on Twitter Kirk Roberts, Michael A. Roach, Joseph Johnson, Josh Guthrie and Sanda M. Harabagiu
204 EVALIEX – A Proposal for an Extended Evaluation Methodology for Information Extraction Systems Christina Feilmayr, Birgit Pröll and Elisabeth Linsmayr
205 BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users Nedelina Ivanova and Olle Eriksen
206 The BladeMistress Corpus: From Talk to Action in Virtual Worlds Anton Leuski, Carsten Eickhoff, James Ganis and Victor Lavrenko
207 Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French Arnulphy Béatrice, Tannier Xavier and Vilnat Anne
208 Confrontation of two models of language for the automatic phonetic labeling of an unknown ethnic language of the South-Asia: the case of Mo Piu Geneviève CAELEN-HAUMONT and Sethserey SAM
209 The Parallel-TUT: a multilingual and multiformat treebank Cristina Bosco, Manuela Sanguinetti and Leonardo Lesmo
210 Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources Arno Scharl, Marta Sabou, Stefan Gindl, Walter Rafelsberger and Albert Weichselbraun
211 Using multimodal resources for explanation approaches in intelligent systems Florian Nothdurft and Wolfgang Minker
212 Cloud Logic Programming for Integrating Language Technology Resources Markus Forsberg and Torbjörn Lager
213 AnIta: a powerful morphological analyser for Italian Fabio Tamburini and Matias Melandri
214 Automatic classification of German "an" particle verbs Sylvia Springorum, Sabine Schulte im Walde and Antje Roßdeutscher
215 LIE: Leadership, Influence and Expertise Roberta Catizone, Louise Guthrie, Arthur Thomas and Yorick Wilks
216 CAT: the CELCT Annotation Tool Valentina Bartalesi Lenzi, Giovanni Moretti and Rachele Sprugnoli
217 Quantising Opinions for Political Tweets Analysis Yulan He, Hassan Saif, Zhongyu Wei and Kam-Fai Wong
218 ROMBAC - The ROManian Balanced Annotated Corpus radu ion, elena irimia, dan stefanescu and dan tufis
219 Task-Driven Linguistic Analysis based on an Underspecified Features Representation Stasinos Konstantopoulos, Valia Kordoni, Nicola Cancedda, Vangelis Karkaletsis, Dietrich Klakow and Jean-Michel Renders
220 A French Fairy Tale Corpus syntactically and semantically annotated Ismaïl El Maarouf and Jeanne Villaneau
221 ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories Roser Morante and Walter Daelemans
222 GerNED: A Corpus in German for Named Entity Disambiguation Danuta Ploch, Leonhard Hennig, Angelina Duka, Ernesto William De Luca and Sahin Albayrak
223 A classification of adjectives for polarity lexicons enhancement Silvia Vázquez and Núria Bel
225 A voting scheme to detect semantic underspecification Héctor Martínez Alonso, Núria Bel and Bolette Sandford Pedersen
226 Aspects of a Legal Framework for Language Resource Management Aditi Sharma Grover, Annamart Nieman, Gerhard Van Huyssteen and Justus Roux
228 Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component Lucie Poláková, Pavlína Jínová and Jiří Mírovský
229 Robust clause boundary identification for corpus annotation Heiki-Jaan Kaalep and Kadri Muischnek
230 CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy R.P. Clapham, L. van der Molen, R.J.JH. van Son, M. van den Brekel and F.J.M. Hilgers
231 PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web Iñaki San Vicente and Iker Manterola
232 Mapping WordNet synsets to Wikipedia articles Samuel Fernando and Mark Stevenson
233 BLEU Evaluation of Machine-Translated English-Croatian Legislation Sanja Seljan, Marija Brkić and Tomislav Vičić
235 Making Ellipses Explicit in Dependency Conversion for a German Treebank Wolfgang Seeker and Jonas Kuhn
236 SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis Jorge Carrillo de Albornoz, Laura Plaza and Pablo Gervás
238 Distractorless Authorship Verification John Noecker Jr and Michael Ryan
239 Predicting Phrase Breaks in Classical and Modern Standard Arabic Text Majdi Sawalha, Claire Brierley and Eric Atwell
240 Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing Claire Brierley, Majdi Sawalha and Eric Atwell
241 Item Development and Scoring for Japanese Oral Proficiency Testing Hitokazu Matsushita and Deryle Lonsdale
244 Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing Ziqi Zhang, Philip Webster, Victoria Uren and Fabio Ciravegna
245 Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology Janine Pimentel
246 TimeBankPT: A TimeML Annotated Corpus of Portuguese Francisco Costa and António Branco
247 Towards a richer wordnet representation of properties exploiting semantic and thematic information from thesauri Sanni Nimb and Bolette Sandford Pedersen
248 Korp – the corpus infrastructure of Språkbanken Lars Borin, Markus Forsberg and Johan Roxendal
249 The open lexical infrastructure of Språkbanken Lars Borin, Markus Forsberg, Leif-Jöran Olsson and Jonatan Uppström
251 Further Developments in Treebank Error Detection Using Derivation Trees Seth Kulick, Ann Bies and Justin Mott
252 Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution Chris Biemann
254 MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions. Charlotte Alazard, Corine Astésano and Michel Billières
255 Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni and Sandrine Zufferey
256 Challenges in the TAC-KBP Slot Filling task Bonan Min and Ralph Grishman
259 Logical metonymies and qualia structures: an annotated database of logical metonymies for German Alessandra Zarcone and Stefan Rued
262 HunOr: A Hungarian–Russian Parallel Corpus Martina Katalin Szabó, Veronika Vincze and István Nagy T.
263 PoliMorf: a (not so) new open morphological dictionary for Polish Marcin Woliński, Marcin Miłkowski, Maciej Ogrodniczuk and Adam Przepiórkowski
264 Introducing the Swedish Kelly-list, a new free e-resource for Swedish Elena Volodina and Sofie Johansson Kokkinakis
266 A Cross-Lingual Dictionary for English Wikipedia Concepts Valentin I. Spitkovsky and Angel X. Chang
267 Language Richness of the Web Martin Majliš and Zdeněk Žabokrtský
268 Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski and Elke Teich
269 Using Verb Subcategorization for Word Sense Disambiguation Will Roberts and Valia Kordoni
270 Danish parallel corpus for text simplification Sigrid Klerke and Anders Søgaard
272 Propbank-Br: a Brazilian Treebank annotated with semantic role labels Magali Sanches Duran and Sandra Maria Aluísio
274 A Universal Part-of-Speech Tagset Slav Petrov, Dipanjan Das and Ryan McDonald
276 A large scale annotated child language construction database Aline Villavicencio, Beracah Yankama and Robert Berwick
277 Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Mohamed Maamouri, Ann Bies and Nianwen Xue
278 Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual Xuansong Li, Stephanie Strassel, Heng Ji, Kira Griffitt and Joe Ellis
279 Constructing a Class-Based Lexical Dictionary using Interactive Topic Models Kugatsu Sadamitsu, Kuniko Saito, Kenji Imamura and Yoshihiro Matsuo
280 Multimodal Corpus of Multi-party Conversations in Foreign Language Shota Yamazaki, Hirohisa Furukawa, Masafumi Nishida, Kristiina Jokinen and Seiichi Yamamoto
282 Dictionary Look-up with Katakana Variant Recognition Satoshi Sato
284 SUTime: A library for recognizing and normalizing time expressions Angel Chang and Christopher Manning
287 Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking Nicholas Michael, Fei Yang, Dimitris Metaxas and Carol Neidle
288 Two Database Resources for Processing Social Media English Text Eleanor Clark and Kenji Araki
290 A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties Maristella Agosti, Birgit Alber, Giorgio Maria Di Nunzio, Marco Dussin, Stefan Rabanus and Alessandra Tomaselli
291 Evaluation of Online Dialogue Policy Learning Techniques Alexandros Papangelis, Vangelis Karkaletsis and Fillia Makedon
292 Experiences in Resource Generation for Machine Translation through Crowdsourcing Anoop Kunchukuttan, Shourya Roy, Pratik Patel, Kushal Ladha, Somya Gupta, Mitesh M. Khapra and Pushpak Bhattacharyya
293 Multilingual Central Repository version 3.0 Aitor González, Egoitz Laparra and German Rigau
294 Involving Language Professionals in the Evaluation of Machine Translation Eleftherios Avramidis, Aljoscha Burchardt, Christian Federmann, Maja Popović, Cindy Tscherwinka and David Vilar
295 A New Method for Evaluating Automatically Learned Terminological Taxonomies Paola Velardi, Roberto Navigli, Stefano Faralli and Juana Maria Ruiz-Martinez
296 A structural view of Topic and Focus marking in Italian Gloria Gagliardi, Edoardo Lombardi Vallauri and Fabio Tamburini
297 Improving the Recall of a Discourse Parser by Constraint-based Postprocessing Sucheta Ghosh, Richard Johansson, Giuseppe Riccardi and Sara Tonelli
299 Adding Morpho-semantic Relations to the Romanian Wordnet Verginica Barbu Mititelu
300 Cross-lingual studies of ASR errors: paradigms for perceptual evaluations Ioana Vasilescu, Martine Adda-Decker and Lori Lamel
301 The Rocky Road towards a Swedish FrameNet - Creating SweFN Karin Friberg Heppin and Maria Toporowska Gronostaj
302 Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis Dietmar Schabus, Michael Pucher and Gregor Hofer
303 Towards automated annotation of multimodal resources – the AVATecH project Przemyslaw Lenkiewicz
304 Acquisition of Syntactic Text Simplification Rules for French Violeta Seretan
305 An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora K Saravanan, Monojit Choudhury, Raghavendra Udupa and A Kumaran
306 Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi
307 Iula2Standoff: a tool for creating standoff documents for the IULACT Carlos Morell, Jorge Vivaldi and Núria Bel
308 Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement André Bittar, Caroline Hagège, Véronique Moriceau, Xavier Tannier and Charles Tesseidre
309 Introducing the Reference Corpus of Contemporary Portuguese Michel Généreux, Iris Hendrickx and Amália Mendes
311 A Corpus-based Study of the German Recipient Passive Patrick Ziering, Sina Zarrieß and Jonas Kuhn
312 “Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives. Tom De Smedt and Walter Daelemans
313 Rule-Based Detection of Clausal Coordinate Ellipsis Kristiina Muhonen and Tanja Purtonen
314 Evolution of Event Designation in Media: Preliminary Study Xavier Tannier, Véronique Moriceau, Béatrice Arnulphy and Ruixin He
315 Evaluating Machine Reading Systems through Comprehension Tests Anselmo Peñas, Eduard Hovy, Pamela Forner, Álvaro Rodrigo, Richard Sutcliffe, Corina Forascu and Caroline Sporleder
316 Chinese-English CLIR in Biomedicine Using the Extended CMeSH Terms to Expand Queries Xinkai Wang, Paul Thompson and Sophia Ananiadou
319 A proposal for improving WordNet Domains Aitor González and German Rigau
320 An Oral History Annotation Tool for INTER-VIEWs Henk van den Heuvel, Eric Sanders, Robin Rutten, Stef Scagliola and Paula Witkamp
321 Comparing computer vision analysis of signed language video with motion capture recordings Matti Karppa, Tommi Jantunen, Ville Viitaniemi, Jorma Laaksonen, Birgitta Burger and Danny De Weerdt
326 Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese Juan Pablo Martínez Cortés, Jim O’Regan and Francis Tyers
327 Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages Dirk Goldhahn, Thomas Eckart and Uwe Quasthoff
328 Constraint Relaxation in temporal Evaluation Metrics: The Case of DEFT 2010 and 2011 Challenges Cyril Grouin, Patrick Paroubek, Pierre Zweigenbaum and Rémy Kessler
330 Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies Thomas Ulrich Christiansen and Peter Juel Henrichsen
331 The acquisition and dialog act labeling of the EDECAN-SPORTS corpus Lluis-F. Hurtado, Fernando Garcia, Emilio Sanchis and Encarna Segarra
333 A Parameterized and Annotated Corpus of the CMU Let’s Go Bus Information System Alexander Schmitt, Stefan Ultes and Wolfgang Minker
334 Pragmatic identification of the witness sets Livio Robaldo and Jakub Szymanik
335 A good space: Lexical predictors in word space evaluation Christian Smith, Henrik Danielsson and Arne Jönsson
336 Automatic MT Error Analysis: Hjerson Helping Addicter Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović and Daniel Zeman
337 An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output Daniele Pighin, Lluís Màrquez and Jonathan May
338 A Basic Language Resource Kit for Persian Mojgan Seraji, Beáta Megyesi and Joakim Nivre
340 Re-ordering Source Sentences for SMT Amit Sangodkar and Om Damani
341 Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles Alex Judea, Vivi Nastase and Michael Strube
342 Semi-Supervised Technical Term Tagging With Minimal User Feedback Behrang QasemiZadeh, Paul Buitelaar, Tianqi Chen and Georgeta Bordea
343 Extended Named Entities Annotation on OCRized Documents: From Corpus Constitution to Evaluation Campaign Olivier Galibert, Sophie Rosset, Cyril Grouin, Pierre Zweigenbaum and Ludovic Quintard
345 Joint Grammar and Treebank Development for Mandarin Chinese with HPSG Yi Zhang, Rui Wang and Yu Chen
347 Wordnet Based Lexicon Grammar for Polish Zygmunt Vetulani
350 A tree is a Baum is an árbol is a sach’a: Creating a trilingual treebank Annette Rios and Anne Göhring
352 INVESTIGATING VERBAL INTELLIGENCE USING THE TF-IDF APPROACH Kseniya Zablotskaya, Fernando Fernandez Martınez and Wolfgang Minker
354 RELATING DOMINANCE OF DIALOGUE PARTICIPANTS WITH THEIR VERBAL INTELLIGENCE SCORES Kseniya Zablotskaya, Umair Rahim, Fernando Fernandez Martınez and Wolfgang Minker
355 Diachronic Changes of Text Complexity in 20th Century English Language: NLP Approach Sanja Štajner and Ruslan Mitkov
356 An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT Ângela Costa, Tiago Luís, Joana Ribeiro, Ana Cristina Mendes and Luísa Coheur
357 SMALLWorlds — a multi-lingual speech corpus for cognitive research Peter Juel Henrichsen and Marcus Uneson
359 A database of semantic clusters of verb usages Silvie Cinková, Martin Holub and Lenka Smejkalová
360 Combining language resources into a grammar-driven Swedish parser Malin Ahlberg and Ramona Enache
361 Annotating dropped pronouns in Chinese newswire text Elizabeth Baran, Yaqin Yang and Nianwen Xue
362 Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions Maria Aloni, Andreas van Cranenburgh, Raquel Fernandez and Marta Sznajder
365 Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics Kanika Gupta, Monojit Choudhury and Kalika Bali
366 Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology Marie-Claude L’Homme and Janine Pimentel
370 A Corpus of Adequacy Assessments for Real-World Machine Translation Output Daniele Pighin, Lluís Màrquez and Lluís Formiga
371 Annotating Story Timelines as Temporal Dependency Structures Bethard Steven, Oleksandr Kolomiyets and Marie-Francine Moens
372 An ontological approach to model and query multimodal concurrent linguistic annotations Julien Seinturier, Elisabeth Murisasco, Emmanuel Bruno and Philippe Blache
373 A PropBank for Portuguese: the CINTIL-PropBank António Branco, Catarina Carvalheiro, Sílvia Pereira, Sara Silveira, João Silva, Sérgio Castro and João Graça
374 Highlighting relevant concepts from Topic Signatures Montse Cuadros, Lluís Padró and German Rigau
375 A tool for enhanced search of multilingual digital libraries of e-journals Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac and Miloš Utvić
376 Extending a wordnet framework for simplicity and scalability Pedro Fialho, Sérgio Curto, Ana Cristina Mendes and Luísa Coheur
377 DeCour: a corpus of DEceptive statements in Italian COURts Tommaso Fornaciari and Massimo Poesio
378 Irish Treebanking and Parsing: A Preliminary Evaluation Teresa Lynn, Ozlem Cetinoglu, Jennifer Foster, Elaine Uí Dhonnchadha, Mark Dras and Josef van Genabith
379 Expertise Mining for Enterprise Content Management Georgeta Bordea, Sabrina Kirrane, Paul Buitelaar and Bianca Pereira
380 Word Alignment for English-Turkish Language Pair Mehmet Talha Çakmak, Süleyman Acar and Gülşen Eryiğit
381 Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora Nelly Barbot, Olivier Boeffard and Arnaud Delhay
382 PEXACC: A Parallel Data Mining Algorithm from Comparable Corpora Radu Ion
384 A Framework for Spelling Correction in Persian Language Using Noisy Channel Model Mohammad Hoseyn Sheykholeslam, Behrouz Minaei-Bidgoli and Hossein Juzi
387 Wiktionary as a Multilingual Wordnet Gilles Sérasset
388 Dysarthric Speech Database for Development of QoLT Software Technology Dae-Lim Choi, Bong-Wan Kim, Yeon-Whoa Kim, Yong-Ju Lee, Yongnam Um and Minhwa Chung
389 CLTC: A Chinese-English Cross-lingual Topic Corpus Yunqing Xia, Guoyu Tang, Peng Jin and Xia Yang
390 Customizable SCF Acquisition in Italian Tommaso Caselli, Francesca Frontini, Valeria Quochi, Francesco Rubino and Irene Russo
391 Statistical Evaluation of Pronunciation Encoding Iris Merkus and Florian Schiel
393 Improving corpus annotation productivity: a method and experiment with interactive tagging Atro Voutilainen
394 Semantic relations established by processes expressed by nouns and verbs: identification in a corpus by means of syntaxico-semantic annotation Nava Maroto, Marie-Claude L’Homme and Amparo Alcina
396 The Language Library: supporting community effort for collective resource production Nicoletta Calzolari, Riccardo Del Gratta, Francesca Frontini, Francesco Rubino and Irene Russo
397 MISTRAL: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels Binh Hai PHAM, Benoît WEBER, Geneviève CAELEN-HAUMONT and Do-Dat TRAN
398 German and English Treebanks and Lexica for Tree-Adjoining Grammars Miriam Kaeshammer and Vera Demberg
399 Annotating a corpus of human interaction with prosodic profiles – focusing on Mandarin repair/disfluency Helen Kaiyun CHEN
400 The Australian National Corpus: National Infrastructure for Language Resources Steve Cassidy, Michael Haugh, Pam Peters and Mark Fallu
401 A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies Hongzhi Xu, Helen Kaiyun Chen, Chu-Ren Huang, Qin Lu and Dingxu Shi
402 ELAN development, keeping pace with communities’ needs Han Sloetjes and Aarthy Somasundaram
403 Revealing Contentious Concepts Across Social Groups Zumrut Akcam, Ching-Sheng Lin, Samira Shaikh, Sharon Small, Ken Stahl, Tomek Strzalkowski and Nick Webb
404 Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber) Fabrizio Borgia, Claudia S. Bianchini, Patrice Dalle and Maria De Marsico
405 Cost and Benefit of Using WordNet Senses for Sentiment Analysis BALAMURALI AR, ADITYA JOSHI and PUSHPAK BHATTACHARYYA
409 Rembrandt - a named-entity recognition framework Nuno Cardoso
411 An Adaptive Framework for Named Entity Combination Bogdan Sacaleanu and Günter Neumann
413 Texto4Science: a Quebec French Database of Annotated Short Text Messages Philippe Langlais, Patrick Drouin, Amélie Paulus, Eugénie Rompré Brodeur and Florent Cottin
416 Collecting and Analysing Chats and Tweets in SoNaR Eric Sanders
418 A generalized morphological network for French: the Daemonette lexicon Nabil Hathout and Fiammetta Namer
420 Alternative Lexicalizations of Discourse Connectives in Czech Magdalena Rysova
422 Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system Kikuo Maekawa
424 PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases Maria Teresa Pazienza, Armando Stellato and Andrea Turbati
425 Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards Jannik Strötgen and Michael Gertz
426 Correlation between Similarity Measures for Wikipedia Monica Lestari Paramita and Paul Clough
427 Linguistic knowledge for specialized text production MIRIAM BUENDÍA-CASTRO and BEATRIZ SÁNCHEZ-CÁRDENAS
428 The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation Massimo Moneglia, Monica Monachini, Omar Calabrese, Alessandro Panunzi, Francesca Frontini, Gloria Gagliardi and Irene Russo
429 HamleDT: To Parse or Not to Parse? David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský and Jan Hajič
430 FreeLing 3.0: Towards Wider Multilinguality Lluís Padró and Evgeny Stanilovsky
434 Evaluating Appropriateness Of System Responses In A Spoken CALL Game Manny Rayner, Pierrette Bouillon and Johanna Gerlach
436 A Scalable Architecture For Web Deployment of Spoken Dialogue Systems Matthew Fuchs, Nikos Tsourakis and Manny Rayner
437 Semantic metadata mapping in practice: the Virtual Language Observatory Dieter Van Uytvanck, Herman Stehouwer and Lari Lampen
440 The Icelandic Parsed Historical Corpus (IcePaHC) Eiríkur Rögnvaldsson, Anton Karl Ingason, Einar Freyr Sigurðsson and Joel Wallenberg
442 Empty Argument Insertion in the Hindi PropBank Ashwini Vaidya, Jinho D. Choi, Martha Palmer and Bhuvana Narasimhan
443 A Rough Set Formalization of Quantitative Evaluation with Ambiguity Patrick Paroubek and Xavier Tannier
444 A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation Eleftherios Avramidis, Marta R. Costa-jussa, Christian Federmann, Josef van Genabith, Maite Melero and Pavel Pecina
445 Reference corpus of historical Slovene Tomaž Erjavec
446 Inforex — a web-based tool for text corpus management and semantic annotation Michał Marcińczuk, Jan Kocoń and Bartosz Broda
447 A new semantically annotated corpus with syntax-based and cross-lingual senses RAKHO Myriam, LAPORTE Éric and CONSTANT Matthieu
448 Recent Developments in CLARIN-NL Jan Odijk
450 Unsupervised acquisition of concatenative morphology Lionel Nicolas, Jacques Farré and Cécile Darme
451 Massively Increasing TIMEX3 Resources: A Transduction Approach Leon Derczynski and Estela Saquete
452 Constructing Large Proposition Databases Peter Exner and Pierre Nugues
453 Visualizing Sentiment Analysis on a User Forum Rasmus Sundberg, Anders Eriksson, Johan Bini and Pierre Nugues
454 Towards Automatic Gesture Stroke Detection Binyam Gebrekidan Gebre and Peter Wittenburg
455 Semantic Role Labeling with the Swedish FrameNet Richard Johansson, Karin Friberg Heppin and Dimitrios Kokkinakis
456 Prague Dependency Style Treebank for Tamil Loganathan Ramasamy and Zdeněk Žabokrtský
457 A Distributed Resource Repository for Cloud-Based Machine Translation Jörg Tiedemann, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen and Matthias Zumpe
460 Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese Patricia Gonçalves, Rita Santos and António Branco
461 Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects David Graff and Mohamed Maamouri
463 Parallel Data, Tools and Interfaces in OPUS Jörg Tiedemann
464 SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks Elias Iosif and Alexandros Potamianos
465 Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic Emad Mohamed, Behrang Mohit and Kemal Oflazer
466 A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content Julia Maria Schulz, Daniela Becks, Christa Womser-Hacker and Thomas Mandl
467 Kitten: a tool for normalizing HTML and extracting its textual content Mathieu-Henri Falco, Véronique Moriceau and Anne Vilnat
468 A Metadata Editor to Support the Description of Linguistic Resources Emanuel Dima, Christina Hoppermann, Thorsten Trippel and Claus Zinn
469 A Comparable Portuguese-Spanish Corpus with Ellipsis Annotations Luz Rello and Iria Gayo
470 A Repository for the Sustainable Management of Research Data Emanuel Dima, Erhard Hinrichs, Christina Hoppermann, Thorsten Trippel and Claus Zinn
471 A Galician Syntactic Corpus with Application to Intonation Modeling Montserrat Arza, José M. García-Miguel, Francisco Campillo and Miguel Cuevas - Alonso
474 A Reference Dependency Bank for Analyzing Complex Predicates Tafseer Ahmed, Miriam Butt, Annette Hautli and Sebastian Sulger
475 UBY — A Large-Scale Unified Lexical-Semantic Resource Based on LMF Iryna Gurevych, Judith Eckle-Kohler, Silvana Hartmann, Michael Matuschek, Christian M. Meyer and Christian Wirth
476 The Influence of Corpus Quality on Statistical Measurements on Language Resources Thomas Eckart, Uwe Quasthoff and Dirk Goldhahn
478 Applying cross-lingual WSD to wordnet development Marianna Apidianaki and Benoît Sagot
480 Annotating Qualia Relations in Italian and French Complex Nominals Pierrette Bouillon, Elisabetta Jezek, Chiara Melloni and Aurélie Picton
481 Terra: a Collection of Translation Error-Annotated Corpora Mark Fishel, Ondřej Bojar and Maja Popović
482 Identifying Nuggets of Information in GALE Distillation Evaluation Olga Babko-Malaya, Greg Milette, Michael Schneider and Sarah Scogin
483 SUTAV: A Turkish Audio-Visual Database Ibrahim Saygin Topkaya and Hakan Erdogan
485 Speech and Language Resources for LVCSR of Russian Sergey Zablotskiy, Alexander Shvets, Maxim Sidorov, Eugene Semenkin and Wolfgang Minker
486 KALAKA-2: a TV broadcast speech database for the recognition of Iberian languages in clean and noisy environments Luis Javier Rodriguez-Fuentes, Mikel Penagarikano, Amparo Varona, Mireia Diez and German Bordel
487 Automatic word alignment tools to scale production of manually aligned parallel text Stephen Grimes, Katherine Peterson and Xuansong Li
488 Developing and evaluating an emergency scenario dialogue corpus Jolanta Bachan
490 A Framework for Evaluating Text Correction Robert Dale and George Narroway
493 Large Scale Lexical Analysis Gregor Thurmair, Vera Aleksic and Christoph Schwarz
494 NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks Chieh-Jen Wang, Shuk-Man Cheng, Lung-Hao Lee, Hsin-Hsi Chen, Wen-shen Liu, Pei-Wen Huang and Shih-Peng Lin
495 The ETAPE corpus for the evaluation of speech-based TV content processing in the French language Guillaume Gravier, Gilles Adda, Niklas Paulsson, Matthieu Carré, Aude Giraudel and Olivier Galibert
496 Automatic lexical semantic classification of nouns Núria Bel, Lauren Romeo and Muntsa Padró
502 Is it Useful to Support Users with Lexical Resources? A User Study. Ernesto William De Luca
503 In the same boat and other idiomatic seafaring expressions Rita Marinelli and Laura Cignoni
504 Creation and use of Language Resources in a Question-Answering eHealth System Anna Braasch, Lina Henriksen, Csaba Huszka, Anders Johannsen, Lars Kayser, Bente Maegaard, Ole Norgaard, Stefan Schulz and Jürgen Wedekind
505 Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents Lina M. Rojas-Barahona, Alejandra Lorenzo and Claire Gardent
506 Collection of a Large Database of French-English SMT Output Corrections Marion Potet, Emmanuelle Esperança-Rodier, Laurent Besacier and Hervé Blanchon
507 German "nach"-Particle Verbs in Semantic Theory and Corpus Data Boris Haselbach, Wolfgang Seeker and Kerstin Eckart
508 A Prototype Tool to Discover Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation Els Lefever, Veronique Hoste and Martine De Cock
509 Arabic-Segmentation Combination Strategies for Statistical Machine Translation Saab Mansour and Hermann Ney
510 Announcing Prague Czech-English Dependency Treebank 2.0 Ondřej Bojar, Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová and Zdeněk Žabokrtský
511 First Results in a Study Evaluating Pre-labeling and Correction Propagation for Machine-Assisted Syriac Morphological Analysis Paul Felt, Eric Ringger, Kevin Seppi, Kristian Heal, Robbie Haertel and Deryl Lonsdale
513 A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English Emina Kurtic, Bill Wells, Guy J. Brown, Timothy Kempton and Ahmet Aker
514 Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques Antton Gurrutxaga and Iñaki Alegria
515 Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks Jorge Vivaldi, Luis Adrián Cabrera-Diego, Gerardo Sierra and María Pozzi
516 The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends Javier Caminero, Mari Carmen Rodríguez, Jean Vanderdonckt, Fabio Paternò, Joerg Rett, Dave Raggett, Jean-Loup Comeliau and Ignacio Marín
518 French and German corpora for audience-based text type classification Amalia Todirascu, Sebastian Pado, Jennifer Krisch, Max Kisselew and Ulrich Heid
519 The IULA Treebank Montserrat Marimon, Beatríz Fisas, Núria Bel, Marta Villegas, Jorge Vivaldi, Sergi Torner and Mercè Lorente
520 Modality in Text: a Proposal for Corpus Annotation Iris Hendrickx, Amália Mendes and Silvia Mencarelli
521 Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text Skeppstedt Maria, Kvist Maria and Dalianis Hercules
522 An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction Md. Faisal Mahbub Chowdhury and Alberto Lavelli
524 Federated Search: Towards a Common Search Infrastructure Herman Stehouwer, Matej Durco, Eric Auer and Daan Broeder
525 Evaluating and improving syntactic lexica by plugging them within a parser Elsa Tolone, Benoît Sagot and Éric Villemonte de la Clergerie
526 The Herme Database of Spontaneous Multimodal Human-Robot Dialogues frank han, emer gilmartin, celine delooze, brian vaughan and nick campbell
527 Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation Víctor M. Sánchez-Cartagena, Miquel Esplà-Gomis and Juan Antonio Pérez-Ortiz
529 EXMARaLDA and the FOLK tools – two toolsets for transcribing and annotating spoken language Thomas Schmidt
530 ISO 24617-2: A semantically-based standard for dialogue annotation Harry Bunt, Jan Alexandersson, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Volha Petukhova, Andre Popescu-Belis and David Traum
531 Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary Inga Gheorghita and Jean-Marie Pierrel
533 A review corpus annotated for negation, speculation and their scope Natalia Konstantinova, Sheila C.M. de Sousa, Noa P. Cruz, Manuel J. Maña, Maite Taboada and Ruslan Mitkov
534 Developing a large semantically annotated corpus Valerio Basile, Johan Bos, Kilian Evang and Noortje Venhuizen
535 Dynamic web service deployment in a cloud environment Marc Kemps-Snijders, Matthijs Brouwer, JanPieter Kunst and Tom Visser
536 Associative and Semantic Features Extracted From Web-Harvested Corpora Elias Iosif, Maria Giannoudaki, Eric Fosler-Lussier and Alexandros Potamianos
537 Collection of a corpus of Dutch SMS Maaske Treurniet, Orphée De Clercq, Henk van den Heuvel and Nelleke Oostdijk
538 Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access Marcello Ferro, Claudia Marzi, Claudia Caudai and Vito Pirrelli
539 A Corpus for Gesture-Controlled Mobile Spoken Dialogue Systems Nikos Tsourakis and Manny Rayner
543 Towards a User-Friendly Platform for Building Language Resources based on Web Services Marc Poch, Antonio Toral, Olivier Hamon, Valeria Quochi and Núria Bel
544 Collaborative semantic editing of linked data lexica John McCrae, Elena Montiel-Ponsoda and Philipp Cimiano
545 Evaluating the Impact of Phrase Recognition on Semantic Annotation Pablo Mendes, Joachim Daiber, Rohana Rajapakse, Felix Sasaki and Christian Bizer
547 Proper Language Resource Centers Willem Elbers, Daan Broeder and Dieter van Uytvanck
548 RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building Alessandro Panunzi, Marco Fabbri, Massimo Moneglia, Lorenzo Gregori and Samuele Paladini
549 Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign Karën Fort, Claire François, Olivier Galibert and Maha Ghribi
550 LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions Dietmar Rösner, Jörg Frommer, Rafael Friesen, Matthias Haase, Julia Lange and Mirko Otto
552 Semantic annotation of French corpora: animacy and verb semantic classes Juliette Thuilier
553 Evaluation of Unsupervised Information Extraction Wei Wang, Romaric Besançon, Olivier Ferret and Brigitte Grau
555 A contrastive review of paraphrase acquisition techniques Houda Bouamor, Aurélien Max, Gabriel Illouz and Anne Vilnat
557 Expanding Arabic Treebank to Speech: Results from Broadcast News Mohamed Maamouri, Ann Bies and Seth Kulick
559 Typing Race Games as a Method to Create Spelling Error Corpora Paul Rodrigues and C. Anton Rytting
561 A treebank-based study on the influence of Italian word order on parsing performance Anita Alicante, Cristina Bosco, Anna Corazza and Alberto Lavelli
562 Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems Kallirroi Georgila, Alan Black, Kenji Sagae and David Traum
563 A Search Tool for FrameNet Constructicon Hiroaki Sato
566 Extraction of unmarked quotations in Newspapers Stéphanie Weiser and Patrick Watrin
567 Ontoterminology: How to unify terminology and ontology into a single paradigm Christophe Roche
568 A system for automatic tonal annotation of speech corpora Piet Mertens
569 Corpus Annotation as a Psycholinguistic Task Donia Scott, Rossano Barone and Rob Koeling
570 DBpedia: A Multilingual Cross-domain Knowledge Base Pablo Mendes, Max Jakob and Christian Bizer
572 A Morphological Analyzer For Wolof Using Finite-State Techniques Cheikh M. Bamba Dione
574 Designing a search interface for a Spanish learner spoken corpus: the end-user’s evaluation Leonardo Campillos Llanos
576 What is a “linguistic” ontology? The creation and purpose of a bottom-up ontology for nouns in context Patrick Hanks, Elisabetta Ježek and Francesca Frontini
577 Design and compilation of a specialized parallel corpus Spanish-German Carla Parra Escartín
578 Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples Minoru Sasaki and Hiroyuki Shinnou
579 Conventional Orthography for Dialectal Arabic Nizar Habash, Mona Diab and Owen Rambow
581 Standardizing a Component Metadata Infrastructure Daan Broeder, Dieter van Uytvanck, Maria Gavrilidou and Thorsten Trippel
583 Assessing Crowdsourcing Quality through Objective Tasks Ahmet Aker, Mahmoud El-Haj, M-Dyaa Albakour and Udo Kruschwitz
584 Association Norms of German Noun Compounds Sabine Schulte im Walde and Susanne Borgwaldt
585 Word Sketches for Turkish Bharat Ram Ambati, Siva Reddy and Adam Kilgarriff
586 The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser Eckhard Bick, Heliana Mello, Alessandro Panunzi and Tommaso Raso
587 Bulgarian X-language Parallel Corpus Svetla Koeva, Ivelina Stoyanova, Rositsa Dekova, Borislav Rizov and Angel Genov
589 The MASC Word Sense Corpus Rebecca Passonneau, Collin Baker, Christiane Fellbaum and Nancy Ide
590 Págico: Evaluating Wikipedia-based information retrieval in Portuguese Cristina Mota, Alberto Simões, Cláudia Freitas, Luís Costa and Diana Santos
593 Representation of linguistic and domain knowledge for second language learning in virtual worlds Alexandre Denis, Ingrid Falk, Claire Gardent and Laura Perez-Beltrachini
594 A Multilingual Natural Stress Emotion Database Xin Zuo, Tian Li and Pascale Fung
595 The Twins Corpus of Museum Visitor Questions Priti Aggarwal, Ron Artstein, Jillian Gerten, Athanasios Katsamanis, Angela Nazarian and David Traum
596 Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information Chi-Hsin Yu and Hsin-Hsi Chen
597 Medical term extraction in an Arabic medical corpus Doaa Samy, Antonio Moreno-Sandoval, Conchi Bueno-Díaz, Marta Garrote-Salazar and José M Guirao
598 Annotation Facilities for the Reliable Analysis of Human Motion Michael Kipp
601 Addressing polysemy in bilingual lexicon extraction from comparable corpora Darja Fišer and Nikola Ljubešić
603 Arabic Word Generation and Modelling for Spell Checking Mohammed Attia, Pavel Pecina, Younes Samih and Josef van Genabith
605 Annotation of response tokens and their triggering expressions in Japanese multi-party conversations Yasuharu Den, Hanae Koiso, Katsuya Takanashi and Nao Yoshida
606 Automatically Generated Online Dictionaries Enikő Héja and Dávid Takács
607 Method for Collection of Acted Speech Using Various Situation Scripts Takahiro Miyajima, Hideaki Kikuchi, Katsuhiko Shirai and Shigeki Okawa
608 Citing on-line Language Resources Daan Broeder, Dieter van Uytvanck and Gunter Senft
609 Automatic Extraction and Evaluation of Arabic LFG Resources Mohammed Attia, Lamia Tounsi and Josef van Genabith
610 Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger Matthieu Constant and Isabelle Tellier
611 The Minho Quotation Resource Brett Drury and JJ Almeida
614 Translog-II: a Program for Recording User Activity Data for Empirical Translation Process Research Michael Carl
616 Morphosyntactic Analysis of the CHILDES and TalkBank Corpora Brian MacWhinney and Wintner Shuly
618 Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification Jan Rygl and Aleš Horák
619 Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi Ritesh Kumar
622 LexIt: A Computational Resource on Italian Argument Structure Alessandro Lenci, Gabriella Lapesa and Giulia Bonansinga
623 Annotating Football Matches: Influence of the Source Medium on Manual Annotation Karën Fort and Vincent Claveau
624 The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese Tommaso Raso and Heliana Mello
626 A light way to collect comparable corpora from the Web Ahmet Aker, Evangelos Kanoulas and Robert Gaizauskas
627 Holaaa!! writin like u talk is kewl but kinda hard 4 NLP Maite Melero, Judith Domingo, Montse Marquina and Martí Quixal
628 Applying Random Indexing to Structured Data to Find Contextually Similar Words Danica Damljanovic, Udo Kruschwitz, M-Dyaa Albakour, Johann Petrak and Mihai Lupu
629 Coreference in Spoken vs. Written Texts: a Corpus-based Analysis Marilisa Amoia, Kerstin Kunz and Ekaterina Lapshinova-Koltunski
632 Towards Fully Automatic Annotation of Audio Books for TTS Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer and Damien Lolive
633 Centroids: Gold standards with distributional variation Ian Lewin, Senay Kafkas and Dietrich Rebholz-Schuhmann
635 Multimodal Behaviour and Feedback in Different Types of Interaction Costanza Navarretta and Patrizia Paggio
636 On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market David Lewis, Alexander O’Connor, Andrzej Zydroń, Gerd Sjögren and Rahzeb Choudhury
637 Multimedia database of the cultural heritage of the Balkans Ivana Tanasijević, Biljana Sikimić and Gordana Pavlović-Lažetić
638 ANALEC: a New Tool for the Dynamic Annotation of Textual Data Frederic Landragin and Thierry Poibeau
639 Feedback in Nordic First-Encounters: a Comparative Study Costanza Navarretta, Elisabeth Ahlsén, Jens Allwood, Kristiina Jokinen and Patrizia Paggio
640 Annotating Opinions in German Political News Kristina Adson, Hong Li, Tal Kirshboim, Xiwen Cheng and Feiyu Xu
641 MultiUN v2: UN Documents with Multilingual Alignments Yu Chen and Andreas Eisele
642 The CONCISUS Corpus of Event Summaries Horacio Saggion and Sandra Szasz
643 Building and Exploring Semantic Equivalences Resources Gracinda Carvalho, David Martins de Matos and Vitor Rocio
644 IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus Septina Dian Larasati
645 The Joy of Parallelism with CzEng 1.0 Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel and Aleš Tamchyna
646 LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis Kais Dukes and Eric Atwell
648 Web Service integration platform for Polish linguistic resources Maciej Ogrodniczuk and Michał Lenart
649 Suffix Trees as Language Models Casey Redd Kennington and Martin Kay
651 The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System Liviu P. Dinu, Vlad Niculae and Octavia-Maria Şulea
652 Using semi-experts to derive judgments on word sense alignment: a pilot study Soojeong Eom, Markus Dickinson and Graham Katz
653 The Polish Sejm Corpus Maciej Ogrodniczuk
655 Creating and Curating a Cross-Language Entity Linking Collection Dawn Lawrie, James Mayfield, Paul McNamee and Douglas Oard
656 The TARSQI Toolkit Marc Verhagen and James Pustejovsky
657 A corpus of general and specific sentences from news Annie Louis and Ani Nenkova
658 Annotation Trees: LDC’s customizable, extensible, scalable, annotation infrastructure Jonathan Wright, Kira Griffitt, Joe Ellis, Stephanie Strassel and Brendan Callahan
661 Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing Elena Filatova
663 YADAC: Yet another Dialectal Arabic Corpus Rania Al-Sabbagh and Roxana Girju
664 An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) James Clarke, Vivek Srikumar, Mark Sammons and Dan Roth
665 Korean learners’ English accentuation corpus for automatic accent detection Ho-Young Lee, Jieun Song and Kiduk Yoon
672 Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization Luís Marujo, Anatole Gershman, Jaime Carbonell, Robert Frederking and João P. Neto
673 Hindi Subjective Lexicon (HSL): A Lexical Resource for Hindi Adjective Polarity Classification Akshat Bakliwal, Piyush Arora and Vasudeva Varma
674 Annotating Near-Identity from Coreference Disagreements Marta Recasens, M. Antònia Martí and Constantin Orasan
676 The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogue Takenobu Tokunaga, Ryu Iida, Asuka Terai and Naoko Kuriyama
677 Statistical Machine Translation without Source-side Parallel Corpus using Word Lattice and Phrase Extension Takanori Kusumoto and Tomoyoshi Akiba
678 Extracting Knowledge-Rich Contexts in Russian and German Anne-Kathrin Schumann
679 Brand Pitt: A Corpus to Explore the Art of Naming Gozde Ozbal, Carlo Strapparava and Marco Guerini
680 Evaluating automatic cross-domain Dutch semantic role annotation Orphée De Clercq, Veronique Hoste and Paola Monachesi
682 Syntactic annotation of spontaneous speech: application to call-center conversation data Thierry Bazillon, Melanie Delplano, Frederic Bechet, Alexis Nasr and Benoit Favre
683 Korean Children’s Spoken English Corpus and an Analysis of its Pronunciation Variability Hyejin Hong, Sunhee Kim and Minhwa Chung
684 DECODA: a call-centre human-human spoken conversation corpus Frederic Bechet, Benjamin Maza, Nicolas Bigouroux, Thierry Bazillon, Marc El-Beze, Renato De Mori and Eric Arbillot
685 The ALLEGRA corpus: a trilingual resource for Romansh, an under-represented language of Switzerland Yves Scherrer and Bruno Cartoni
686 Intelligibility assessment in forensic applications Giovanni Costantini, Andrea Paoloni and Massimiliano Todisco
687 From medical language processing to BioNLP domain Sara Goggi, Manuela Sassi, Gabriella Pardelli and Stefania Biagioni
693 Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task Tomoyosi Akiba, Hiromitsu Nishizaki, Kiyoaki Aikawa, Tatsuya Kawahara and Tomoko Matsui
697 Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese Antonio Moreno-Sandoval, Leonardo Campillos, Yang Dong, Emi Takamori, José M. Guirao, Paula Gozalo, Chieko Kimura, Kengo Matsui and Marta Garrote
698 TED-LIUM: an Automatic Speech Recognition dedicated corpus Anthony Rousseau, Paul Deléglise and Yannick Estève
700 The SYNC3 Collaborative Annotation Tool Georgios Petasis
701 ELRA in the heart of a cooperative HLT world Valérie Mapelli, Victoria Arranz, Matthieu Carré, Hélène Mazo, Djamel Mostefa and Khalid Choukri
703 Automatic Translation of Scientific Documents in the HAL Archive Lambert Patrik, Holger Schwenk and Frédéric Blain
706 Irregularity Detection in Categorized Document Corpora Borut Sluban, Senja Pollak, Roel Coesemans and Nada Lavrac
707 The REPERE Corpus : a multimodal corpus for person recognition Aude Giraudel, Matthieu Carré, Valérie Mapelli, Juliette Kahn, Olivier Galibert and Ludovic Quintard
708 Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification andrea gesmundo and Tanja Samardzic
709 Efficient Dependency Graph Matching with the IMS Open Corpus Workbench Thomas Proisl and Peter Uhrig
711 Strategies to Improve a Speaker Diarisation Tool David Tavarez, Eva Navas, Daniel Erro and Ibon Saratxaga
712 Exploring self-training and co-training for dependency parsing Rahul Goutam and Bharat Ambati
714 Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions Atsushi Fujii, Yuya Fujii and Takenobu Tokunaga
715 MaltOptimizer: A System for MaltParser Optimization Miguel Ballesteros and Joakim Nivre
716 Building Text-To-Speech Voices in the Cloud Alistair Conkie, Thomas Okken, Yeon-Jun Kim and Giuseppe Di Fabbrizio
717 On the practice of error analysis for machine translation evaluation Sara Stymne and Lars Ahrenberg
718 The Database of Modern Icelandic Inflection Kristín Bjarnadóttir
719 Croatian Dependency Treebank: Recent Development and Initial Experiments Dasa Berovic, Zeljko Agic and Marko Tadic
720 The Tagged Icelandic Corpus (MÍM) Sigrún Helgadóttir, Ásta Svavarsdóttir, Eiríkur Rögnvaldsson, Kristín Bjarnadóttir and Hrafn Loftsson
721 Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World Tina Kluewer, Peter Adolphs, Feiyu Xu and Hans Uszkoreit
722 Using LRs in Humanities research Marta Villegas, Nuria Bel, Carlos Gonzalo, Amparo Moreno and Nuria Simelio
724 A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian Petya Osenova, Kiril Simov, Laska Laskova and Stanislava Kancheva
726 Integrating/Interchanging NLP tools in a distributed environment: a case study chaining Freeling to the DESR parser Francesco Rubino, Francesca Frontini and Valeria Quochi
727 Spell Checking for Chinese Shaohua Yang, Hai Zhao, Xiaolin Wang and Bao-liang Lu
729 Customization of the Europarl Corpus for Translation Studies Zahurul Islam and Alexander Mehler
730 A Parallel Corpus of Music and Lyrics Annotated with Emotions Carlo Strapparava, Rada Mihalcea and Alberto Battocchi
732 Creation of a bottom-up corpus-based ontology for Italian Linguistics Elisa Bianchi and Mirko Tavosanis
733 Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages Sebastian Nordhoff and Harald Hammarström
734 Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora Carmen Dayrell, Arnaldo Candido Jr., Gabriel Lima, Danilo Machado Jr., Ann Copestake, Valéria Feltrim, Stella Tagnin and Sandra Aluisio
735 NgramQuery - Smart Information Extraction from Google N-gram using External Resources Martin Aleksandrov and Carlo Strapparava
736 Error profiling for task-based evaluation of machine-translated text: a Polish-English case study Sandra Weiss and Lars Ahrenberg
737 Polish Multimodal Corpus – a collection of referential gestures Magdalena Lis
740 DEGELS1: A comparable corpus of French Sign Language and co-speech gestures Annelies Braffort and Leïla Boutora
741 Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs Matilde GONZALEZ, Michael FILHOL and Christophe COLLET
743 Expanding Parallel Resources for Medium-Density Languages for Free Georgi Iliev
744 Creation of Shared Language Resource Repository in the Nordic and Baltic Countries Andrejs Vasiljevs, Markus Forsberg, Tatiana Gornostay, Dorte Hansen, Kristín Jóhannsdóttir, Gunn Lyse, Krister Lindén, Lene Offersgaard, Sussi Olsen, Bolette Pedersen, Eiríkur Rögnvaldsson, Inguna Skadiņa and Koenraad De Smedt
746 Adapting and evaluating a generic term extraction tool Anita Gojun, Ulrich Heid, Bernd Weißbach, Carola Loth and Insa Mingers
748 Beyond SoNaR: towards the facilitation of large corpus building efforts Martin Reynaert, Ineke Schuurman, Veronique Hoste and Nelleke Oostdijk
751 Robustness and adaptation of spoken language understanding systems among languages and domains: the PORTMEDIA project Fabrice Lefèvre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, Nathalie Camelin, Lina Rojas-Barahona, Bassam Jabaian and Benoit Favre
753 A GUI to Detect and Correct Errors in Hindi Dependency Treebank Rahul Agarwal, Bharat Ram Ambati and Anil Kumar Singh
754 Spell Checking in Spanish: The Case of Diacritic Accents jordi atserias, maria fuentes, rogelio nazar and irene renau
755 Iterative Refinement of Annotation Guidelines for Semantically Fuzzy Named Entity Types, such as Pathological Phenomena Kerstin Hornbostel, Elena Beisswanger, Ekaterina Buyko, Erik Faessler, Jennifer Traumüller, Susann Schröder and Udo Hahn
756 Example-Based Treebank Querying Liesbeth Augustinus, Vincent Vandeghinste and Frank Van Eynde
757 Annotating Factive Verbs Alvin Grisson II and Yusuke Miyao
758 Annotating Errors in a Hungarian Learner Corpus Markus Dickinson and Scott Ledbetter
761 Evaluation of a Complex Information Extraction Application in Specific Domain Romaric Besançon, Olivier Ferret and Ludovic Jean-Louis
762 Text Simplification Tools for Spanish Stefan Bott and Horacio Saggion
763 VERTa: Linguistic features in MT evaluation Elisabet Comelles, Jordi Atserias, Victoria Arranz and Irene Castellón
766 Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3 Atro Voutilainen, Kristiina Muhonen, Tanja Purtonen and Krister Linden
767 W-PhAMT: A web tool for phonetic multilevel timeline visualization Francesco Cutugno and Vincenza Anna Leano
769 The LRE Map. Harmonising Community Descriptions of Resources Nicoletta Calzolari, Riccardo Del Gratta, Gil Francopoulo, Joseph Mariani, Francesco Rubino, Irene Russo and Claudia Soria
770 Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information Corina Forascu and Dan Tufis
772 Chinese Whispers: Cooperative Paraphrase Acquisition Matteo Negri, Yashar Mehdad, Alessandro Marchetti, Danilo Giampiccolo and Luisa Bentivogli
773 The Nordic Dialect Corpus Janne Bondi Johannessen, Joel Priestley, Kristin Hagen, Anders Nøklestad and André Lynum
774 WeSearch UGC: A Comprehensive Sample of User-Generated Content Jonathon Read, Rebecca Dridan, Stephan Oepen and Lilja Øvrelid
775 Legal electronic dictionary for Czech František Cvrček, Karel Pala and Pavel Rychlý
776 This also affects the context - Errors in extraction based summaries Thomas Kaspersson, Christian Smith, Henrik Danielsson and Arne Jönsson
777 The FLaReNet Strategic Language Resource Agenda Claudia Soria, Nuria Bel, Khalid Choukri, Joseph Mariani, Monica Monachini, Jan Odijk, Stelios Piperidis, Valeria Quochi and Nicoletta Calzolari
778 Yes we can!? Annotating English modal verbs Josef Ruppenhofer and Ines Rehbein
779 Building a Multimodal Laughter Database for Emotion Recognition Merlin Teodosia Suarez, Jocelynn Cu and Madelene Sta. Maria
782 Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus Jörg Frommer, Bernd Michaelis, Dietmar Rösner, Andreas Wendemuth, Rafael Friesen, Matthias Haase, Manuela Kunze, Rico Andrich, Julia Lange, Axel Panning and Ingo Siegert
783 Rapid creation of large-scale corpora and frequency dictionaries Attila Zséder, Gábor Recski, Dániel Varga and András Kornai
784 Adaptive dictionary for bilingual lexicon extraction from comparable corpora Amir Mohamed HAZEM and Emmanuel MORIN
785 Linguistic Resources for Handwriting Recognition and Translation Evaluation Zhiyi Song, Safa Ismael, Steven Grimes, Gary Krug and Stephanie Strassel
786 Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi-fication Antonio Origlia and Iolanda Alfano
787 The DISCO ASR-based CALL system: practicing L2 oral skills and beyond Helmer Strik, Jozef Colpaert, Joost Van Doremalen and Catia Cucchiarini
788 METU Turkish Discourse Bank Browser Utku Şirin, Ruket Çakıcı and Deniz Zeyrek
789 The New IDS Corpus Analysis Platform: Challenges and Prospects Piotr Bański, Peter M. Fischer, Elena Frick, Erik Ketzan, Marc Kupietz, Carsten Schnober, Oliver Schonefeld and Andreas Witt
790 Evaluating Multi-focus Natural Language Queries over Data Services Silvia Quarteroni, Vincenzo Guerrisi and Pietro La Torre
794 Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation Maria Teresa Pazienza, Noemi Scarpato and Armando Stellato
796 Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian Mladen Karan, Jan Šnajder and Bojana Dalbelo Bašić
800 Evaluating Query Languages for a Corpus Processing System Elena Frick, Carsten Schnober and Piotr Bański
802 Two Phase Evaluation for Selecting Machine Translation Services Chunqi Shi, Donghui Lin, Masahiko Shimada and Toru Ishida
804 Development and Application of a Cross-language Document Comparability Metric Fangzhong Su and Bogdan Babych
805 A Graphical Citation Browser for the ACL Anthology Benjamin Weitz and Ulrich Schäfer
806 Document Attrition in Web Corpora: an Exploration Stephen Wattam, Paul Rayson and Damon Berridge
807 A Repository of Data and Evaluation Resources for Natural Language Generation Anja Belz and Albert Gatt
808 Scenario Oriented Translation Service Composition Chunqi Shi, Donghui Lin and Toru Ishida
810 Compatibility and interoperability of multi-modal standardized language resources Yoshinobu Kano
812 The Quaero Evaluation Campaign on Term Extraction Thibault Mondary, Adeline Nazarenko, Haïfa Zargayouna and Sabine Barreaux
813 Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective. Lorenza Russo, Sharid Loáiciga and Asheesh Gulati
814 DGT-TM: A freely available Translation Memory in 22 languages Ralf Steinberger, Andreas Eisele, Szymon Klocek, Spyridon Pilos and Patrick Schlüter
815 Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations Mona Diab and Heba Elfardy
816 META-SHARE v1: An Open Network of Repositories for Language Resources including Data and Tools Christian Federmann, Byron Georgantopoulos, Christian Girardi, Olivier Hamon, Dimitris Mavroeidis, Salvatore Minutoli and Marc Schröder
817 Analyzing and Aligning German compound nouns Marion Weller and Ulrich Heid
818 Identification of Manner in Bio-Events Raheel Nawaz, Paul Thompson and Sophia Ananiadou
819 A tool/database interface for multi-level analyses Kurt Eberle, Kerstin Eckart and Ulrich Heid
820 Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies Thierry Declerck and Karlheinz Mörth
822 Using an ASR database to design a pronunciation evaluation system in Basque Igor Odriozola, Eva Navas, Inma Hernáez, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez and Daniel Erro
823 A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Certainty and Uncertainty Alberto Lavelli, Bernardo Magnini, Ramona Bongelli, Carla Canestrari, Ilaria Riccioni, Cinzia Buldorini, Ricardo Pietrobon and Andrzej Zuczkowski
824 New language resources for the Pashto language Djamel Mostefa
825 Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language Ting Liu, Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley and Sarah Taylor
827 CALBC: Releasing the Final Corpora Şenay Kafkas, Ian Lewin, David Milward, Erik van Mulligen, Jan Kors, Udo Hahn and Dietrich Rebholz-Schuhmann
828 BUCEADOR, a multi-language search engine for digital libraries Jordi Adell, Antonio Bonafonte, Antonio Cardenal, Marta R. Costa-jussà, José A. R. Fonollosa and Asunción Moreno
829 Linguistic Analysis Processing Line for Bulgarian Aleksandar Savkov, Laska Laskova, Stanislava Kancheva, Petya Osenova and Kiril Simov
830 Getting more data — Schoolkids as annotators Jirka Hana and Barbora Hladka
831 WordNet Atlas: a web application for browsing WordNet as a zoomable map Matteo Abrate and Clara Bacciu
834 Building large corpora from the web using a new efficient tool chain Roland Schäfer and Felix Bildhauer
836 An empirical resource for discovering cognitive principles of discourse organization: the ANNODIS corpus Stergos Afantenos, Nicholas Asher, Farah Benamara, Myriam Bras, Cecile Fabre, Mai Ho-Dac, Anne Le Draoulec, Philippe Muller, Marie-Paul Pery-Woodley, Laurent Prevot, Josette Rebeyrolles, Ludovic Tanguy, Marianne Vergez-Couret and Laure Vieu
838 English to Indonesian Transliteration to Support English Pronunciation Learning Amalia Zahra and Julie Carson-Berndsen
839 Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations Kata Gábor, Marianna Apidianaki, Benoît Sagot and Eric Villemonte de La Clergerie
842 Using the International Standard Language Resource Number: Practical and Technical Aspects Jungyeul Park, Victoria Arranz, Olivier Hamon and Khalid Choukri
843 Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System Claire Jaja, Douglas Briesch, Jamal Laoudi and Clare Voss
844 RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus Jens Forster, Christoph Schmidt, Thomas Hoyoux, Oscar Koller, Uwe Zelle, Justus Piater and Hermann Ney
845 The KnowledgeStore: an Entity-Based Storage System Roldano Cattoni, Francesco Corcoglioniti, Christian Girardi, Bernardo Magnini, Luciano Serafini and Roberto Zanoli
846 Elaborating a sustainability model for LRs Khalid Choukri, Victoria Arranz and Jungyeul Park
848 CLIMB grammars: three projects using metagrammar engineering Antske Fokkens, Tania Avgustinova and Yi Zhang
850 Annotated Bibliographical Reference Corpora in Digital Humanities Young-Min Kim, Patrice Bellot, Elodie Faath and Marin Dacos
853 Corpus of children voices for mid-level social markers and affect bursts analysis Marie Tahon, Agnes Delaborde and Laurence Devillers
855 Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, Thomas Lavergne and François Yvon
857 Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies. Hanno Biber and Evelyn Breiteneder
860 Annotation of anaphoric relations and topic continuity in Japanese conversation Natsuko Nakagawa and Yasuharu Den
863 Classifying Standard Linguistic Processing Functionalities based on the Fundamental Data Operation Types Yoshihiko Hayashi and Chiharu Narawa
864 Evaluating expressive speech synthesis from audiobook corpora for conversational phrases Eva Szekely, Joao Cabral, Mohamed Abou-Zleikha, Peter Cahill and Julie Carson-Berndsen
865 The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech Juan María Garrido, Yesika Laplaza, Montse Marquina, Andrea Pearman, José Gregorio Escalada, Miguel Ángel Rodríguez and Ana Armenta
866 DramaBank: Annotating Agency in Narrative Discourse David Elson
867 Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. Alessio Bosca, Luca Dini, Milen Kouylekov and Marco Trevisan
868 Statistical Measures for the Acceptability and Semi-productivity of Persian Light Verb Constructions Shiva Taslimipoor, Afsaneh Fazly and Ali Hamzeh
872 On the Way to a Legal Sharing of Web Applications in NLP Victoria Arranz and Olivier Hamon
875 JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool Ralf Steinberger, Mohamed Ebrahim and Marco Turchi
876 Designing French Tale Corpora for Entertaining Text To Speech Synthesis David Doukhan, Sophie Rosset, Albert Rilliard, Christophe d’Alessandro and Martine Adda-Decker
878 „Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project Helen Aristar-Dry, Sebastian Drude, Jost Gippert, Irina Nevskaya and Menzo Windhouwer
879 Le Petit Prince in UNL Ronaldo Martins
881 Unsupervised document zone identification using probabilistic graphical models Andrea Varga, Daniel Preotiuc-Pietro and Fabio Ciravegna
882 Summarizing a multi-source set of documents in a Smart Room Maria Fuentes, Horacio Rodríguez and Jordi Turmo
884 Empirical Comparisons of MASC Word Sense Annotations: a First Look Gerard de Melo and Collin F. Baker
885 Creating HAVIC: The Heterogeneous Audio Visual Internet Collection Stephanie Strassel, Amanda Morris, Jonathan Fiscus, Christopher Caruso, Haejoong Lee, Paul Over, James Fiumara, Barbara Shaw, Brian Antonishek and Martial Michel
886 Identifying Multi-Word Expressions in Statistical Machine Translation Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum
887 Multi-Layer Discourse Annotation of a Dutch Text Corpus Gisela Redeker, Ildikó Berzlánovich, Nynke van der Vliet, Gosse Bouma and Markus Egg
888 Identifying Word Translations from Comparable Documents without a Seed Lexicon Reinhard Rapp, Serge Sharoff, Bogdan Babych and Richard Forsyth
891 The Language Archive – a new hub for language resources Sebastian Drude, Daan Broeder, Paul Trilsbeek and Peter Wittenburg
892 A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora Mahdi Khademian, Kaveh Taghipour and Shahram Khadivi
893 Automatic Term Recognition Needs Multiple Evidence Natalia Loukachevitch
894 Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank Sudheer Kolachina, Rashmi Prasad, Dipti Misra Sharma and Aravind Joshi
895 A methodology for the extraction of information about the usage of formulaic expressions in scientific texts Hannah Kermes
898 CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations Irina Temnikova
900 From Grammar Extraction to Treebanking: A Bootstrapping Approach Masood Ghayoomi
902 Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation Takafumi Suzuki, Yusuke Abe, Itsuki Toyota, Takehito Utsuro, Suguru Matsuyoshi and Masatoshi Tsuchiya
903 Improving K-Nearest Neighbor Efficacy for Farsi Text Classification Behrooz Minaei, Mohammad Hossein Elahimanesh and Hossein Malekinezhad
905 ULex: new data models and a mobile environment for corpus enrichment. Dafydd Gibbon
906 UniDic for Early Middle Japanese: an Electrical Dictionary for Morphological Analysis of Classical Japanese Toshinobu Ogiso, Mamoru Komachi, Yasuharu Den and Yuji Matsumoto
907 A platform-independent user-friendly dictionary from Italian to LIS Umar Shoaib, Gabriele Tiotto, Nadeem Ahmad and Paolo Prinetto
908 Mapping WordNet to the Kyoto ontology Egoitz Laparra, German Rigau and Piek Vossen
909 Tools for plWordNet Development. Presentation and Perspectives Bartosz Broda, Marek Maziarz and Maciej Piasecki
910 Creating a Data Collection for Evaluating Rich Speech Retrieval Maria Eskevich, Gareth J.F. Jones, Martha Larson and Roeland Ordelman
911 Ontologies of Linguistic Annotation: Survey and perspectives Christian Chiarcos
912 The Open Linguistics Working Group Christian Chiarcos, Sebastian Hellmann and Sebastian Nordhoff
914 An Annotation Scheme for Quantifier Scope Disambiguation Mehdi Manshadi, Eric Meinhardt, James Allen and Mary Swift
915 A generic formalism to represent linguistic corpora in RDF and OWL/DL Christian Chiarcos
916 LDC Language Resource Papers Catalog: Building a Bibliographic Database Eleftheria Ahtaridis, Christopher Cieri and Denise DiPersio
917 A hierarchical approach with feature selection for emotion recognition from speech Panagiotis Giannoulis and Gerasimos Potamianos
919 A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation Anil Kumar Singh
921 A Study of Word-Classing for MT Reordering Ananthakrishnan Ramanathan and Karthik Visweswariah
922 DOXA server : creating a generic web evaluation platform Pierre ALBERT, Olivier HAMON and Patrick PAROUBEK
924 Large aligned treebanks for syntax-based machine translation Gideon Kotzé, Vincent Vandeghinste, Scott Martens and Jörg Tiedemann
925 Collecting and Using Comparable Corpora for Statistical Machine Translation Inguna Skadiņa, Ahmet Aker, Nikos Glaros, Fangzhong Su, Dan Tufis, Mateja Verlic, Andrejs Vasiļjevs and Bogdan Babych
926 Recognition of Polish Derivational Relations Based on Supervised Learning Scheme Maciej Piasecki, Radoslaw Ramocki and Marek Maziarz
931 Combining Formal Concept Analysis and semantic information for building ontological structures from texts : an exploratory study Silvia Moraes and Vera Lima
934 Diversified Bootstrapping for Acquiring High-Coverage Paraphrase Resource Hideki Shima and Teruko Mitamura
938 The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language Mike Kestemont, Claudia Peersman, Benny De Decker, Guy De Pauw, Kim Luyckx, Roser Morante, Frederik Vaassen, Janneke van de Loo and Walter Daelemans
939 Clause-based Discourse Segmentation of Arabic Texts Iskandar KESKES, Farah BENAMARA and Lamia Hadrich Belguith
941 Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure Yuichiroh Matsubayashi, Yusuke Miyao and Akiko Aizawa
942 An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use Mohammad Fazleh Elahi and Paola Monachesi
944 Domain-specific vs. Uniform Modeling for Coreference Resolution Olga Uryupina and Massimo Poesio
945 Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text Alexandra Balahur and Jesús M. Hermida
946 Extending the adverbial coverage of a French morphological lexicon Elsa Tolone, Stavroula Voyatzi and Claude Martineau
947 Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations Dan Cristea, Radu Simionescu and Gabriela Haja
948 Latvian and Lithuanian Named Entity Recognition with TildeNER Mārcis Pinnis
951 Collecting humorous expressions from a community-based question-answering-service corpus Masashi Inoue
954 RELcat: a Relation Registry for ISOcat data categories Menzo Windhouwer
956 The Political Speech Corpus of Bulgarian Petya Osenova and Kiril Simov
957 NLTG-Eval: A Toolkit for Creating Online Language Evaluation Experiments Eric Kow and Anja Belz
958 A Database of Attribution Relations Silvia Pareti
960 Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench Rafal Rak, Andrew Rowley and Sophia Ananiadou
964 A Mandarin-English Code-switching Corpus Ying Li, Yue Yu and Pascale Fung
965 InfiKorp: Towards a Free Corpus of Polish Bartosz Broda, Michał Marcińczuk, Marek Maziarz, Adam Radziszewski and Adam Wardyński
966 A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever Paulo Fernandes, Lucelene Lopes, Carlos A. Prolo, Afonso Sales and Renata Vieira
967 Structural alignment of plain text books André Santos and José João Almeida
968 Turkish Paraphrase Corpus Seniz Demir, Ilknur Durgar El-Kahlout, Erdem Unal and Hamza Kaya
970 International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned Keith J Miller, Elizabeth Schroeder Richerson, Sarah McLeod and James Finley
971 Measuring Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms Ryan Georgi, Fei Xia and William Lewis
973 The Effectiveness of Unsupervised Learning on Domain Adaptation A Case Study on Chinese Word Segmentation Yan Song and Fei Xia
974 Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz Joao Paulo Cabral, Mark Kane, Zeeshan Ahmed, Mohamed Abou-Zleikha, Eva Szekely, Amalia Zahra, Kalu Ogbureke, Peter Cahill, Julie Carson-Berndsen and Stephan Schlogl
976 An implementation of a Latvian resource grammar in Grammatical Framework Peteris Paikens and Normunds Gruzitis
977 Resource Evaluation for Usable Speech Interfaces: Utilizing Human–Human Dialogue Pepi Stavropoulou, Dimitris Spiliotopoulos and Georgios Kouroupetroglou
978 GATEtoGerManC: A GATE-based annotation pipeline for historical German Silke Scheible, Richard J. Whitt, Martin Durrell and Paul Bennett
980 Dealing with unknown words in statistical machine translation João Silva and Luísa Coheur
983 A disambiguation resource for semantic annotation Eric Charton and Michel Gagnon
985 PET: a Tool for Post-editing and Assessing Machine Translation Wilker Aziz, Sheila Castilho and Lucia Specia
986 Enriching the ISST-TANL Corpus with Semantic Frames Alessandro Lenci, Simonetta Montemagni, Eva Maria Vecchi and Giulia Venturi
987 Developing Partially-Transcribed Speech Corpus from Edited Transcriptions Kengo Ohta, Masatoshi Tsuchiya and Seiichi Nakagawa
991 Construction of the Turkish National Corpus (TNC) Yeşim Aksan, Mustafa Aksan, Ahmet Koltuksuz, Taner Sezer, Ümit Mersinli, Umut Ufuk Demirhan, Hakan Yılmazer, Gülsüm Atasoy, Seda Öz and İpek Yıldız
992 Building a learner corpus Jirka Hana, Alexandr Rosen, Barbora Štindlová and Petr Jäger
996 Results of the ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation Christian Federmann, Eleftherios Avramidis, Marta R. Costa-Jussa, Josef van Genabith, Maite Melero and Pavel Pecina
998 The META-SHARE Metadata Schema for the Description of Language Resources Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Stelios Piperidis, Monica Monachini, Francesca Frontini, Thierry Declerck, Gil Francopoulo, Victoria Arranz and Valerie Mapelli
999 Fine-grained German Sentiment Analysis on Social Media Saeedeh Momtazi
1000 Alignment-based reordering for SMT Maria Holmqvist, Sara Stymne, Lars Ahrenberg and Magnus Merkel
1003 Same domain different discourse style- A case study on Language Resources for data-driven Machine Translation Monica Gavrila, Walther v. Hahn and Cristina Vertan
1006 Annotations for Power Relations on Email Threads Vinodkumar Prabhakaran, Huzaifa Neralwala, Owen Rambow and Mona Diab
1007 A Language Resources Framework supporting Conceptual Relationships among Biomedical Information to Chronic Disease Surveillance Juliana Pollettini, Evandro Ruiz and Alessandra Macedo
1008 Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events William J. Corvey, Sarah Vieweg, Sudha Verma, Martha Palmer and James H. Martin
1010 Pedagogical stances and their multimodal signals. Leone Giovanna, D’Errico Francesca and Poggi Isabella
1011 An audiovisual political speech analysis incorporating eye-tracking and perception data Stefan Scherer, Georg Layher, John Kane, Heiko Neumann and Nick Campbell
1012 Tajik-Farsi Persian Transliteration Using Statistical Machine Translation Chris Irwin Davis
1013 Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations Somayeh Bagherbeygi and Mehrnoush Shamsfard
1014 Word Sense Inventories by Non-Experts. Anna Rumshisky, Nick Botchan, Sophie Kushkuley and James Pustejovsky
1015 PAMOCAT: Automatic retrieval of specified postures Bernhard Brüning, Christian Schnier, Karola Pitsch and Sven Wasmuth
1016 Statistical Section Segmentation in Free-Text Clinical Records Michael Tepper, Daniel Capurro, Fei Xia, Lucy Vanderwende and Meliha Yetisgen-Yildiz
1017 Constructing a Question Corpus for Textual Semantic Relations Rui Wang and Shuguang Li
1018 Building a fine-grained subjectivity lexicon from a web corpus Isa Maks and Piek Vossen
1020 Building a database of French frozen adverbial phrases Aude Grezka and Céline Poudat
1021 Matching Cultural Heritage items to Wikipedia Eneko Agirre
1022 ATLIS: Identifying Locational Information in Text Automatically John Vogel and Marc Verhagen
1024 A Speech and Gesture Spatial Corpus in Assisted Living Dimitra Anastasiou
1026 Harnessing 3rd party observer gaze for a continuous measure of the appropriateness of a speaker change Jens Edlund, Simon Alexandersson, Jonas Beskow, Lisa Gustavsson, Mattias Heldner, Anna Hjalmarsson, Petter Kallionen and Ellen Marklund
1027 Constraint Based Description of Polish Multiword Expressions Roman Kurc and Maciej Piasecki
1028 An Open Source Persian Computational Grammar Shafqat Mumtaz Virk and Elnaz Abolahrar
1031 Project FLY: a multidisciplinary project within Linguistics Mariana Gomes and Ana Guilherme
1032 The Hausa Internet Corpus Christian Chiarcos
1034 The Common Orthographic Vocabulary of the Portuguese Language: a set of common open lexical resources for a pluricentric language José Pedro Ferreira
1035 NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge Guido Boella, Luigi di Caro, Llio Humphreys, Livio Robaldo and Leon van der Torre
1036 Pursing power in Arabic on-line discussion forums Marc Tomlinson
1037 The Dependency-parsed FrameNet Corpus Daniel Bauer, Hagen Fürstenau and Owen Rambow
1040 Incorporating an Error Corpus into a Spellchecker for Maltese Michael Rosner, Albert Gatt, Andrew Attard and Jan Joachimsen
1042 Building Synthetic Voices in the META-NET Framework Emília Garcia Casademont, Antonio Bonafonte and Asunción Moreno
1043 Automatic Translation of Scholarly Terms into Patent Terms Using Synonyms Extraction Techniques Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama and Akiko Aizawa
1046 Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results Marco Dinarelli and Sophie Rosset
1047 Building a 70 billion word corpus of English from ClueWeb Jan Pomikálek, Miloš Jakubíček, Pavel Rychlý and Adam Kilgarriff
1049 How genres influence performance of parsers: a case study on the Chinese TreeBank Dong Wang and Fei Xia
1051 Improving Short-String Language Identification using Many-Language Distance Classification Paul Rodrigues
1052 Dependency parsing for interaction detection in pharmacogenomics Gerold Schneider, Fabio Rinaldi and Simon Clematide
1053 Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English Kyoko Ohara
1055 Buildind a Resource of Semantic Types Octavian Popescu and Luisa Bentivoglio
1056 A data and analysis resource for an experiment in text mining a collection of micro-­blogs on a political topic. William Black, Rob Procter, Steven Gray and Sophia Ananiadou
1057 AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis Muhammad Abdul-Mageed and Mona Diab
1059 Causal analysis of task incompletion for spoken dialogs focused on interactions of user and the system Sunao Hara, Norihide Kitaoka and Kazuya Takeda
1061 Generation of Verbal Stems in Derivationally Rich Language Nives Mikelić Preradović, Krešimir Šojat and Marko Tadić
1062 Corpus+WordNet thesaurus generation for ontology enriching Fernando Castilho, Roger Granada, Breno Meneghetti, Leonardo Carvalho and Renata Vieira
1063 Reclassifying subcategorization frames for experimental analysis and stimulus generation Paula Buttery and Andrew Caines
1064 Creating a Coreference Resolution System for Polish Mateusz Kopeć and Maciej Ogrodniczuk
1065 LDC Forced Aligner Xiaoyi Ma
1066 Semantic Orientation PMI or Vector Space Models? An evaluation of automated sentiment acquisition methods. Aleksander Wawer
1069 Assessing the Comparability of News Texts Emma Barker and Rob Gaizauskas
1070 Building Text-to-Speech Systems for Resource Poor Languages Nur-Hana Samsudin and Mark Lee
1072 Representing General Relational Knowledge in ConceptNet 5 Robert Speer and Catherine Havasi
1075 Boosting statistical tagger accuracy with simple rule-based grammars Mans Hulden and Jerid Francom
1076 Twitter Event Lexicon for Natural Language Processing Jennifer Williams and Graham Katz
1077 A morphological transducer for Kyrgyz Jonathan Washington, Mirlan Ipasov and Francis Tyers
1078 A Corpus for Research on Deliberation and Debate Marilyn Walker, Jean Fox Tree, Pranav Anand, Rob Abbott and Joseph King
1080 Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System Alexandra Roshchina, John Cardiff and Paolo Rosso
1081 Learning Sentiment Lexicons in Spanish Veronica Perez-Rosas, Carmen Banea and Rada Mihalcea
1082 Unsupervised Word Sense Disambiguation with Multilingual Representations Erwin Fernandez-Ordonez, Rada Mihalcea and Samer Hassan
1086 The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions Stelios Piperidis
1087 Annotating progressive aspect constructions in the spoken section of the British National Corpus Andrew Caines and Paula Buttery
1091 Annotating Spatial Containment Relations Between Events Kirk Roberts, Travis Goodwin and Sanda Harabagiu
1095 Agreement and disagreement in threaded discussion Jacob Andreas, Sara Rosenthal and Kathleen McKeown
1096 Logic and Graph Based Methods for Terminological Assessment Benoît Robichaud
1097 Parsing Any Domain English text to CoNLL dependencies Sudheer Kolachina and Prasanth Kolachina
1098 POS Tagging for Grammaticalization and Grammatical Neologism Detection Maarten Janssen
1107 Using DiAML and ANVIL for multimodal dialogue annotations Harry Bunt, Michael Kipp and Volha Petukhova
1109 Annotated Corpora for Word Alignment between Japanese and English Tsuyoshi Okita
1111 Assigning Connotation Values to Events Tommaso Caselli, Irene Russo and Francesco Rubino
1112 Biographies or Blenders: Which Resource to Prefer for Cross-Domain Sentiment Analysis Natalia Ponomareva and Mike Thelwall
1114 An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style Marilyn Walker and Grace Lin
1116 SPPAS: a tool for Speech Phonetization Alignment and Syllabification Brigitte Bigi
1117 Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities Christopher Cieri and Mark Liberman
1120 A Phonemic Corpus of Polish Child-Directed Speech Luc Boruta and Justyna Jastrzebska
1121 The KIT Lecture Corpus for Speech Translation Sebastian Stüker, Teresa Herrmann, Florian Kraft, Christian Mohr and Alex Waibel
1122 Orthographic Transcription: which Enrichment is required for phonetization? Brigitte Bigi and Pauline Peri
1123 The Role of Model Testing in Standards Development: The Case of ISO-Space James Pustejovsky and Jessica Moszkowicz
1124 Aleda, a free large-scale entity database for French Benoît Sagot and Rosa Stern
1126 The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation Marcello Federico, Sebastian Stüker, Luisa Bentivogli, Michael Paul, Mauro Cettolo, Teresa Herrmann, Jan Niehues and Giovanni Moretti
1127 Cleaning noisy wordnets Benoît Sagot and Darja Fišer
1129 Tackling interoperability issues within UIMA work-flows Nicolas Hernandez
1130 Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus Djamé Seddah, Marie Candito, Benoit Crabbé and Enrique Henestroza Anguiano
1131 Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources Valérie Hanoka and Benoît Sagot
1132 Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment Shyam Agrawal, Shweta Sinha, Pooja Singh and Jesper Olson
Update Thursday 29 March 2012
The others articles in the same section :
Update Monday 14 May 2012
Update Tuesday 8 May 2012
Published on Thursday 17 May 2012
Update Thursday 24 May 2012

Printable version of this article Send the article by e-mail title=        


Home     |     News     |     Site Map     |     Restricted     
  Update Thursday 11 October 2012