ID | Title | Authors |
106 | Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data | Kristiina Jokinen and Mare Koit |
108 | “You Seem Aggressive!” Monitoring Anger in a Practical Application | Felix Burkhardt |
110 | Fast Labeling and Transcription with the Speechalyzer Toolkit | Felix Burkhardt |
112 | Smooth Sailing for STEVIN | Peter Spyns and Elisabeth D’Halleweyn |
113 | Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel | Daniel Stein and Bela Usabaev |
114 | Building a Basque-Chinese Dictionary by Using English as Pivot | Xabier Saralegi, Iker Manterola and Iñaki San Vicente |
117 | Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition | Yi-jie Tang and Hsin-Hsi Chen |
120 | A High-Quality Web Corpus of Czech | Johanka Spoustová and Miroslav Spousta |
121 | German verb patterns and their implementation in an electronic dictionary | Marc Luder |
122 | Large Scale Semantic Annotation, Indexing and Search at The National Archives | Diana Maynard and Mark A. Greenwood |
123 | QurAna: Corpus of the Quran annotated with Pronominal Anaphora | Abdul-Baquee Sharaf and Eric Atwell |
124 | Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank | Christian Scheible and Hinrich Schütze |
125 | A Multi-layered Reference Corpus for German Sentiment Analysis | Simon Clematide, Stefan Gindl, Manfred Klenner, Stefanos Petrakis, Robert Remus, Josef Ruppenhofer, Ulli Waltinger and Michael Wiegand |
126 | Versatile Speech Databases for High Quality Synthesis for Basque | Iñaki Sainz, Daniel Erro, Eva Navas, Inma Hernáez, Jon Sánchez, Ibon Saratxaga and Igor Odriozola |
128 | TIMEN: An Open Temporal Expression Normalization Resource | Hector Llorens, Leon Derczynski, Robert Gaizauskas and Estela Saquete |
129 | Measuring Interlanguage: Native Language Identification with L1-influence Metrics | Julian Brooke and Graeme Hirst |
130 | TextCoop: A logic-based language for processing discourse structures | Patrick Saint-Dizier |
136 | A Gold Standard for Relation Extraction in the Food Domain | Michael Wiegand, Benjamin Roth, Eva Lasarcyk, Stephanie Köser and Dietrich Klakow |
137 | A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures | Sarah Bourse and Patrick Saint-Dizier |
139 | Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring | Flore Barcellini, Marie Garnier, Corinne Grosse and Patrick Saint-Dizier |
148 | WebAnnotator, an Annotation Tool for Web Pages | Xavier Tannier |
149 | Towards a comprehensive open repository of Polish language resources | Maciej Ogrodniczuk, Piotr Pęzik and Adam Przepiórkowski |
150 | Towards an LFG parser for Polish: An exercise in parasitic grammar development | Agnieszka Patejuk and Adam Przepiórkowski |
151 | Constructive Interaction for Talking about Interesting Topics | Kristiina Jokinen and Graham Wilcock |
152 | Corpus-based Referring Expressions Generation | Hilder Pereira, Eder Novais, Andre Mariotti and Ivandre Paraboni |
153 | Portuguese Text Generation from Large Corpora | Eder Novais, Ivandre Paraboni and Douglas Silva |
154 | SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles | Volha Petukhova, Rodrigo Agerri, Mark Fishel, Sergio Penkale, Arantza del Pozo, Mirjam Sepesy Maucec, Andy Way, Panayota Georgakopoulou and Martin Volk |
159 | Collecting, Interpreting and Exploiting Affective Common Sense Knowledge | Erik Cambria, Amir Hussain, Catherine Havasi and Chris Eckl |
160 | Body-conductive acoustic sensors in human-robot communication | Panikos Heracleous, Carlos Ishi, Takahiro Miyashita and Norihiro Hagita |
161 | From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information | Lieve Macken, Veronique Hoste, Marielle Leijten and Luuk Van Waes |
164 | A Comparative Evaluation of Word Sense Disambiguation Algorithms for German | Verena Henrich and Erhard Hinrichs |
165 | SemScribe: Natural Language Generation for Medical Reports | Sebastian Varges, Heike Bieler, Manfred Stede, Lukas C. Faulstich, Kristin Irsig and Malik Atalla |
166 | Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC | Erhard Hinrichs and Thomas Zastrow |
167 | A new dynamic approach for lexical networks evaluation | Alain Joubert and Mathieu Lafourcade |
168 | Grammatical Error Annotation for Korean Learners of Spoken English | Hongsuck Seo, Kyusong Lee, Gary Geunbae Lee, Soo-Ok Kweon and Hae-Ri Kim |
169 | Adaptive Speech Recognition for Intuitive Model‐based Spoken Dialogues | Tobias Heinroth, Maximilian Grotz, Florian Nothdurft and Wolfgang Minker |
170 | First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin | Marco Passarotti and Francesco Mambrini |
172 | Using Parallel and Comparable Data for Abstract Anaphora Resolution in German and English | Heike Zinsmeister, Melanie Seiss and Stefanie Dipper |
175 | A Tool for Extracting Conversational Implicatures | Marta Tatu and Dan Moldovan |
176 | Polaris: Lymba’s Semantic Parsing | Dan Moldovan and Eduardo Blanco |
177 | Light Verb Constructions in the SzegedParalellFX English—Hungarian Parallel Corpus | Veronika Vincze |
178 | CoALT: A Software for Comparing Automatic Labelling Tools | Dominique Fohr and Odile Mella |
179 | Balanced data repository of spontaneous spoken Czech | Lucie Válková, Martina Waclawičová and Michal Křen |
180 | The coding and annotation of multimodal dialogue acts | Volha Petukhova and Harry Bunt |
181 | Learning Categories and their Instances by Contextual Features | Antje Schlaf and Robert Remus |
182 | Textual Characteristics for Language Processing Systems | Mathias Bank and Martin Schierle |
183 | A Survey of Text Mining Architectures and the UIMA Standard | Mathias Bank and Martin Schierle |
184 | A Rule-based Morphological Analyzer for Murrinh-Patha | Melanie Seiss |
187 | Optimising WSD through Active Learning | Piek Vossen, Attila Görög, Fons Laan, Maarten Van Gompel, Rubén Izquierdo and Antal Van den Bosch |
188 | Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies | Bruno Cartoni and Thomas Meyer |
190 | QurSim: A corpus for evaluation of relatedness in short texts | Abdul-Baquee Sharaf and Eric Atwell |
192 | Eye Tracking as a Tool for Machine Translation Error Analysis | Sara Stymne, Henrik Danielsson, Sofia Bremin, Hongzhan Hu, Johanna Karlsson, Anna Prytz Lillkull and Martin Wester |
193 | Automatic annotation of head velocity and acceleration in Anvil | Bart Jongejan |
194 | Representing the Translation Relation in a Bilingual Wordnet | Jyrki Niemi and Krister Lindén |
196 | Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine? | Marianna J. Martindale |
198 | The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish | Gülşen Eryiğit |
199 | Detecting Reduplication in Videos of American Sign Language | Zoya Gavrilov, Stan Sclaroff, Carol Neidle and Sven Dickinson |
200 | Building a multilingual parallel corpus for human users | Alexandr Rosen and Martin Vavřín |
201 | EmpaTweet: Annotating and Detecting Emotions on Twitter | Kirk Roberts, Michael A. Roach, Joseph Johnson, Josh Guthrie and Sanda M. Harabagiu |
204 | EVALIEX – A Proposal for an Extended Evaluation Methodology for Information Extraction Systems | Christina Feilmayr, Birgit Pröll and Elisabeth Linsmayr |
205 | BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users | Nedelina Ivanova and Olle Eriksen |
206 | The BladeMistress Corpus: From Talk to Action in Virtual Worlds | Anton Leuski, Carsten Eickhoff, James Ganis and Victor Lavrenko |
207 | Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French | Béatrice Arnulphy, Xavier Tannier and Anne Vilnat |
208 | Confrontation of two models of language for the automatic phonetic labeling of an unknown ethnic language of the South-Asia: the case of Mo Piu | Geneviève Caelen-Haumont and Sethserey Sam |
209 | The Parallel-TUT: a multilingual and multiformat treebank | Cristina Bosco, Manuela Sanguinetti and Leonardo Lesmo |
210 | Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources | Arno Scharl, Marta Sabou, Stefan Gindl, Walter Rafelsberger and Albert Weichselbraun |
211 | Using multimodal resources for explanation approaches in intelligent systems | Florian Nothdurft and Wolfgang Minker |
212 | Cloud Logic Programming for Integrating Language Technology Resources | Markus Forsberg and Torbjörn Lager |
213 | AnIta: a powerful morphological analyser for Italian | Fabio Tamburini and Matias Melandri |
214 | Automatic classification of German "an" particle verbs | Sylvia Springorum, Sabine Schulte im Walde and Antje Roßdeutscher |
215 | LIE: Leadership, Influence and Expertise | Roberta Catizone, Louise Guthrie, Arthur Thomas and Yorick Wilks |
216 | CAT: the CELCT Annotation Tool | Valentina Bartalesi Lenzi, Giovanni Moretti and Rachele Sprugnoli |
217 | Quantising Opinions for Political Tweets Analysis | Yulan He, Hassan Saif, Zhongyu Wei and Kam-Fai Wong |
218 | ROMBAC - The ROManian Balanced Annotated Corpus | Radu Ion, Elena Irimia, Dan Stefanescu and Dan Tufis |
219 | Task-Driven Linguistic Analysis based on an Underspecified Features Representation | Stasinos Konstantopoulos, Valia Kordoni, Nicola Cancedda, Vangelis Karkaletsis, Dietrich Klakow and Jean-Michel Renders |
220 | A French Fairy Tale Corpus syntactically and semantically annotated | Ismaïl El Maarouf and Jeanne Villaneau |
221 | ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories | Roser Morante and Walter Daelemans |
222 | GerNED: A Corpus in German for Named Entity Disambiguation | Danuta Ploch, Leonhard Hennig, Angelina Duka, Ernesto William De Luca and Sahin Albayrak |
223 | A classification of adjectives for polarity lexicons enhancement | Silvia Vázquez and Núria Bel |
225 | A voting scheme to detect semantic underspecification | Héctor Martínez Alonso, Núria Bel and Bolette Sandford Pedersen |
226 | Aspects of a Legal Framework for Language Resource Management | Aditi Sharma Grover, Annamart Nieman, Gerhard Van Huyssteen and Justus Roux |
228 | Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component | Lucie Poláková, Pavlína Jínová and Jiří Mírovský |
229 | Robust clause boundary identification for corpus annotation | Heiki-Jaan Kaalep and Kadri Muischnek |
230 | CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy | R.P. Clapham, L. van der Molen, R.J.J.H. van Son, M. van den Brekel and F.J.M. Hilgers |
231 | PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web | Iñaki San Vicente and Iker Manterola |
232 | Mapping WordNet synsets to Wikipedia articles | Samuel Fernando and Mark Stevenson |
233 | BLEU Evaluation of Machine-Translated English-Croatian Legislation | Sanja Seljan, Marija Brkić and Tomislav Vičić |
235 | Making Ellipses Explicit in Dependency Conversion for a German Treebank | Wolfgang Seeker and Jonas Kuhn |
236 | SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis | Jorge Carrillo de Albornoz, Laura Plaza and Pablo Gervás |
238 | Distractorless Authorship Verification | John Noecker Jr and Michael Ryan |
239 | Predicting Phrase Breaks in Classical and Modern Standard Arabic Text | Majdi Sawalha, Claire Brierley and Eric Atwell |
240 | Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing | Claire Brierley, Majdi Sawalha and Eric Atwell |
241 | Item Development and Scoring for Japanese Oral Proficiency Testing | Hitokazu Matsushita and Deryle Lonsdale |
244 | Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing | Ziqi Zhang, Philip Webster, Victoria Uren and Fabio Ciravegna |
245 | Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology | Janine Pimentel |
246 | TimeBankPT: A TimeML Annotated Corpus of Portuguese | Francisco Costa and António Branco |
247 | Towards a richer wordnet representation of properties exploiting semantic and thematic information from thesauri | Sanni Nimb and Bolette Sandford Pedersen |
248 | Korp – the corpus infrastructure of Språkbanken | Lars Borin, Markus Forsberg and Johan Roxendal |
249 | The open lexical infrastructure of Språkbanken | Lars Borin, Markus Forsberg, Leif-Jöran Olsson and Jonatan Uppström |
251 | Further Developments in Treebank Error Detection Using Derivation Trees | Seth Kulick, Ann Bies and Justin Mott |
252 | Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution | Chris Biemann |
254 | MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions | Charlotte Alazard, Corine Astésano and Michel Billières |
255 | Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns | Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni and Sandrine Zufferey |
256 | Challenges in the TAC-KBP Slot Filling task | Bonan Min and Ralph Grishman |
259 | Logical metonymies and qualia structures: an annotated database of logical metonymies for German | Alessandra Zarcone and Stefan Rued |
262 | HunOr: A Hungarian–Russian Parallel Corpus | Martina Katalin Szabó, Veronika Vincze and István Nagy T. |
263 | PoliMorf: a (not so) new open morphological dictionary for Polish | Marcin Woliński, Marcin Miłkowski, Maciej Ogrodniczuk and Adam Przepiórkowski |
264 | Introducing the Swedish Kelly-list, a new free e-resource for Swedish | Elena Volodina and Sofie Johansson Kokkinakis |
266 | A Cross-Lingual Dictionary for English Wikipedia Concepts | Valentin I. Spitkovsky and Angel X. Chang |
267 | Language Richness of the Web | Martin Majliš and Zdeněk Žabokrtský |
268 | Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach | Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski and Elke Teich |
269 | Using Verb Subcategorization for Word Sense Disambiguation | Will Roberts and Valia Kordoni |
270 | Danish parallel corpus for text simplification | Sigrid Klerke and Anders Søgaard |
272 | Propbank-Br: a Brazilian Treebank annotated with semantic role labels | Magali Sanches Duran and Sandra Maria Aluísio |
274 | A Universal Part-of-Speech Tagset | Slav Petrov, Dipanjan Das and Ryan McDonald |
276 | A large scale annotated child language construction database | Aline Villavicencio, Beracah Yankama and Robert Berwick |
277 | Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures | Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Mohamed Maamouri, Ann Bies and Nianwen Xue |
278 | Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual | Xuansong Li, Stephanie Strassel, Heng Ji, Kira Griffitt and Joe Ellis |
279 | Constructing a Class-Based Lexical Dictionary using Interactive Topic Models | Kugatsu Sadamitsu, Kuniko Saito, Kenji Imamura and Yoshihiro Matsuo |
280 | Multimodal Corpus of Multi-party Conversations in Foreign Language | Shota Yamazaki, Hirohisa Furukawa, Masafumi Nishida, Kristiina Jokinen and Seiichi Yamamoto |
282 | Dictionary Look-up with Katakana Variant Recognition | Satoshi Sato |
284 | SUTime: A library for recognizing and normalizing time expressions | Angel Chang and Christopher Manning |
287 | Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking | Nicholas Michael, Fei Yang, Dimitris Metaxas and Carol Neidle |
288 | Two Database Resources for Processing Social Media English Text | Eleanor Clark and Kenji Araki |
290 | A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties | Maristella Agosti, Birgit Alber, Giorgio Maria Di Nunzio, Marco Dussin, Stefan Rabanus and Alessandra Tomaselli |
291 | Evaluation of Online Dialogue Policy Learning Techniques | Alexandros Papangelis, Vangelis Karkaletsis and Fillia Makedon |
292 | Experiences in Resource Generation for Machine Translation through Crowdsourcing | Anoop Kunchukuttan, Shourya Roy, Pratik Patel, Kushal Ladha, Somya Gupta, Mitesh M. Khapra and Pushpak Bhattacharyya |
293 | Multilingual Central Repository version 3.0 | Aitor González, Egoitz Laparra and German Rigau |
294 | Involving Language Professionals in the Evaluation of Machine Translation | Eleftherios Avramidis, Aljoscha Burchardt, Christian Federmann, Maja Popović, Cindy Tscherwinka and David Vilar |
295 | A New Method for Evaluating Automatically Learned Terminological Taxonomies | Paola Velardi, Roberto Navigli, Stefano Faralli and Juana Maria Ruiz-Martinez |
296 | A structural view of Topic and Focus marking in Italian | Gloria Gagliardi, Edoardo Lombardi Vallauri and Fabio Tamburini |
297 | Improving the Recall of a Discourse Parser by Constraint-based Postprocessing | Sucheta Ghosh, Richard Johansson, Giuseppe Riccardi and Sara Tonelli |
299 | Adding Morpho-semantic Relations to the Romanian Wordnet | Verginica Barbu Mititelu |
300 | Cross-lingual studies of ASR errors: paradigms for perceptual evaluations | Ioana Vasilescu, Martine Adda-Decker and Lori Lamel |
301 | The Rocky Road towards a Swedish FrameNet - Creating SweFN | Karin Friberg Heppin and Maria Toporowska Gronostaj |
302 | Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis | Dietmar Schabus, Michael Pucher and Gregor Hofer |
303 | Towards automated annotation of multimodal resources – the AVATecH project | Przemyslaw Lenkiewicz |
304 | Acquisition of Syntactic Text Simplification Rules for French | Violeta Seretan |
305 | An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora | K Saravanan, Monojit Choudhury, Raghavendra Udupa and A Kumaran |
306 | Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese | Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi |
307 | Iula2Standoff: a tool for creating standoff documents for the IULACT | Carlos Morell, Jorge Vivaldi and Núria Bel |
308 | Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement | André Bittar, Caroline Hagège, Véronique Moriceau, Xavier Tannier and Charles Tesseidre |
309 | Introducing the Reference Corpus of Contemporary Portuguese | Michel Généreux, Iris Hendrickx and Amália Mendes |
311 | A Corpus-based Study of the German Recipient Passive | Patrick Ziering, Sina Zarrieß and Jonas Kuhn |
312 | “Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives | Tom De Smedt and Walter Daelemans |
313 | Rule-Based Detection of Clausal Coordinate Ellipsis | Kristiina Muhonen and Tanja Purtonen |
314 | Evolution of Event Designation in Media: Preliminary Study | Xavier Tannier, Véronique Moriceau, Béatrice Arnulphy and Ruixin He |
315 | Evaluating Machine Reading Systems through Comprehension Tests | Anselmo Peñas, Eduard Hovy, Pamela Forner, Álvaro Rodrigo, Richard Sutcliffe, Corina Forascu and Caroline Sporleder |
316 | Chinese-English CLIR in Biomedicine Using the Extended CMeSH Terms to Expand Queries | Xinkai Wang, Paul Thompson and Sophia Ananiadou |
319 | A proposal for improving WordNet Domains | Aitor González and German Rigau |
320 | An Oral History Annotation Tool for INTER-VIEWs | Henk van den Heuvel, Eric Sanders, Robin Rutten, Stef Scagliola and Paula Witkamp |
321 | Comparing computer vision analysis of signed language video with motion capture recordings | Matti Karppa, Tommi Jantunen, Ville Viitaniemi, Jorma Laaksonen, Birgitta Burger and Danny De Weerdt |
326 | Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese | Juan Pablo Martínez Cortés, Jim O’Regan and Francis Tyers |
327 | Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages | Dirk Goldhahn, Thomas Eckart and Uwe Quasthoff |
328 | Constraint Relaxation in temporal Evaluation Metrics: The Case of DEFT 2010 and 2011 Challenges | Cyril Grouin, Patrick Paroubek, Pierre Zweigenbaum and Rémy Kessler |
330 | Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies | Thomas Ulrich Christiansen and Peter Juel Henrichsen |
331 | The acquisition and dialog act labeling of the EDECAN-SPORTS corpus | Lluis-F. Hurtado, Fernando Garcia, Emilio Sanchis and Encarna Segarra |
333 | A Parameterized and Annotated Corpus of the CMU Let’s Go Bus Information System | Alexander Schmitt, Stefan Ultes and Wolfgang Minker |
334 | Pragmatic identification of the witness sets | Livio Robaldo and Jakub Szymanik |
335 | A good space: Lexical predictors in word space evaluation | Christian Smith, Henrik Danielsson and Arne Jönsson |
336 | Automatic MT Error Analysis: Hjerson Helping Addicter | Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović and Daniel Zeman |
337 | An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output | Daniele Pighin, Lluís Màrquez and Jonathan May |
338 | A Basic Language Resource Kit for Persian | Mojgan Seraji, Beáta Megyesi and Joakim Nivre |
340 | Re-ordering Source Sentences for SMT | Amit Sangodkar and Om Damani |
341 | Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles | Alex Judea, Vivi Nastase and Michael Strube |
342 | Semi-Supervised Technical Term Tagging With Minimal User Feedback | Behrang QasemiZadeh, Paul Buitelaar, Tianqi Chen and Georgeta Bordea |
343 | Extended Named Entities Annotation on OCRized Documents: From Corpus Constitution to Evaluation Campaign | Olivier Galibert, Sophie Rosset, Cyril Grouin, Pierre Zweigenbaum and Ludovic Quintard |
345 | Joint Grammar and Treebank Development for Mandarin Chinese with HPSG | Yi Zhang, Rui Wang and Yu Chen |
347 | Wordnet Based Lexicon Grammar for Polish | Zygmunt Vetulani |
350 | A tree is a Baum is an árbol is a sach’a: Creating a trilingual treebank | Annette Rios and Anne Göhring |
352 | Investigating Verbal Intelligence Using the TF-IDF Approach | Kseniya Zablotskaya, Fernando Fernandez Martínez and Wolfgang Minker |
354 | Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores | Kseniya Zablotskaya, Umair Rahim, Fernando Fernandez Martínez and Wolfgang Minker |
355 | Diachronic Changes of Text Complexity in 20th Century English Language: NLP Approach | Sanja Štajner and Ruslan Mitkov |
356 | An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT | Ângela Costa, Tiago Luís, Joana Ribeiro, Ana Cristina Mendes and Luísa Coheur |
357 | SMALLWorlds — a multi-lingual speech corpus for cognitive research | Peter Juel Henrichsen and Marcus Uneson |
359 | A database of semantic clusters of verb usages | Silvie Cinková, Martin Holub and Lenka Smejkalová |
360 | Combining language resources into a grammar-driven Swedish parser | Malin Ahlberg and Ramona Enache |
361 | Annotating dropped pronouns in Chinese newswire text | Elizabeth Baran, Yaqin Yang and Nianwen Xue |
362 | Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions | Maria Aloni, Andreas van Cranenburgh, Raquel Fernandez and Marta Sznajder |
365 | Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics | Kanika Gupta, Monojit Choudhury and Kalika Bali |
366 | Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology | Marie-Claude L’Homme and Janine Pimentel |
370 | A Corpus of Adequacy Assessments for Real-World Machine Translation Output | Daniele Pighin, Lluís Màrquez and Lluís Formiga |
371 | Annotating Story Timelines as Temporal Dependency Structures | Steven Bethard, Oleksandr Kolomiyets and Marie-Francine Moens |
372 | An ontological approach to model and query multimodal concurrent linguistic annotations | Julien Seinturier, Elisabeth Murisasco, Emmanuel Bruno and Philippe Blache |
373 | A PropBank for Portuguese: the CINTIL-PropBank | António Branco, Catarina Carvalheiro, Sílvia Pereira, Sara Silveira, João Silva, Sérgio Castro and João Graça |
374 | Highlighting relevant concepts from Topic Signatures | Montse Cuadros, Lluís Padró and German Rigau |
375 | A tool for enhanced search of multilingual digital libraries of e-journals | Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac and Miloš Utvić |
376 | Extending a wordnet framework for simplicity and scalability | Pedro Fialho, Sérgio Curto, Ana Cristina Mendes and Luísa Coheur |
377 | DeCour: a corpus of DEceptive statements in Italian COURts | Tommaso Fornaciari and Massimo Poesio |
378 | Irish Treebanking and Parsing: A Preliminary Evaluation | Teresa Lynn, Ozlem Cetinoglu, Jennifer Foster, Elaine Uí Dhonnchadha, Mark Dras and Josef van Genabith |
379 | Expertise Mining for Enterprise Content Management | Georgeta Bordea, Sabrina Kirrane, Paul Buitelaar and Bianca Pereira |
380 | Word Alignment for English-Turkish Language Pair | Mehmet Talha Çakmak, Süleyman Acar and Gülşen Eryiğit |
381 | Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora | Nelly Barbot, Olivier Boeffard and Arnaud Delhay |
382 | PEXACC: A Parallel Data Mining Algorithm from Comparable Corpora | Radu Ion |
384 | A Framework for Spelling Correction in Persian Language Using Noisy Channel Model | Mohammad Hoseyn Sheykholeslam, Behrouz Minaei-Bidgoli and Hossein Juzi |
387 | Wiktionary as a Multilingual Wordnet | Gilles Sérasset |
388 | Dysarthric Speech Database for Development of QoLT Software Technology | Dae-Lim Choi, Bong-Wan Kim, Yeon-Whoa Kim, Yong-Ju Lee, Yongnam Um and Minhwa Chung |
389 | CLTC: A Chinese-English Cross-lingual Topic Corpus | Yunqing Xia, Guoyu Tang, Peng Jin and Xia Yang |
390 | Customizable SCF Acquisition in Italian | Tommaso Caselli, Francesca Frontini, Valeria Quochi, Francesco Rubino and Irene Russo |
391 | Statistical Evaluation of Pronunciation Encoding | Iris Merkus and Florian Schiel |
393 | Improving corpus annotation productivity: a method and experiment with interactive tagging | Atro Voutilainen |
394 | Semantic relations established by processes expressed by nouns and verbs: identification in a corpus by means of syntaxico-semantic annotation | Nava Maroto, Marie-Claude L’Homme and Amparo Alcina |
396 | The Language Library: supporting community effort for collective resource production | Nicoletta Calzolari, Riccardo Del Gratta, Francesca Frontini, Francesco Rubino and Irene Russo |
397 | MISTRAL: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels | Binh Hai Pham, Benoît Weber, Geneviève Caelen-Haumont and Do-Dat Tran |
398 | German and English Treebanks and Lexica for Tree-Adjoining Grammars | Miriam Kaeshammer and Vera Demberg |
399 | Annotating a corpus of human interaction with prosodic profiles – focusing on Mandarin repair/disfluency | Helen Kaiyun Chen |
400 | The Australian National Corpus: National Infrastructure for Language Resources | Steve Cassidy, Michael Haugh, Pam Peters and Mark Fallu |
401 | A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies | Hongzhi Xu, Helen Kaiyun Chen, Chu-Ren Huang, Qin Lu and Dingxu Shi |
402 | ELAN development, keeping pace with communities’ needs | Han Sloetjes and Aarthy Somasundaram |
403 | Revealing Contentious Concepts Across Social Groups | Zumrut Akcam, Ching-Sheng Lin, Samira Shaikh, Sharon Small, Ken Stahl, Tomek Strzalkowski and Nick Webb |
404 | Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber) | Fabrizio Borgia, Claudia S. Bianchini, Patrice Dalle and Maria De Marsico |
405 | Cost and Benefit of Using WordNet Senses for Sentiment Analysis | Balamurali AR, Aditya Joshi and Pushpak Bhattacharyya |
409 | Rembrandt - a named-entity recognition framework | Nuno Cardoso |
411 | An Adaptive Framework for Named Entity Combination | Bogdan Sacaleanu and Günter Neumann |
413 | Texto4Science: a Quebec French Database of Annotated Short Text Messages | Philippe Langlais, Patrick Drouin, Amélie Paulus, Eugénie Rompré Brodeur and Florent Cottin |
416 | Collecting and Analysing Chats and Tweets in SoNaR | Eric Sanders |
418 | A generalized morphological network for French: the Daemonette lexicon | Nabil Hathout and Fiammetta Namer |
420 | Alternative Lexicalizations of Discourse Connectives in Czech | Magdalena Rysova |
422 | Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system | Kikuo Maekawa |
424 | PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases | Maria Teresa Pazienza, Armando Stellato and Andrea Turbati |
425 | Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards | Jannik Strötgen and Michael Gertz |
426 | Correlation between Similarity Measures for Wikipedia | Monica Lestari Paramita and Paul Clough |
427 | Linguistic knowledge for specialized text production | Miriam Buendía-Castro and Beatriz Sánchez-Cárdenas |
428 | The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation | Massimo Moneglia, Monica Monachini, Omar Calabrese, Alessandro Panunzi, Francesca Frontini, Gloria Gagliardi and Irene Russo |
429 | HamleDT: To Parse or Not to Parse? | David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský and Jan Hajič |
430 | FreeLing 3.0: Towards Wider Multilinguality | Lluís Padró and Evgeny Stanilovsky |
434 | Evaluating Appropriateness Of System Responses In A Spoken CALL Game | Manny Rayner, Pierrette Bouillon and Johanna Gerlach |
436 | A Scalable Architecture For Web Deployment of Spoken Dialogue Systems | Matthew Fuchs, Nikos Tsourakis and Manny Rayner |
437 | Semantic metadata mapping in practice: the Virtual Language Observatory | Dieter Van Uytvanck, Herman Stehouwer and Lari Lampen |
440 | The Icelandic Parsed Historical Corpus (IcePaHC) | Eiríkur Rögnvaldsson, Anton Karl Ingason, Einar Freyr Sigurðsson and Joel Wallenberg |
442 | Empty Argument Insertion in the Hindi PropBank | Ashwini Vaidya, Jinho D. Choi, Martha Palmer and Bhuvana Narasimhan |
443 | A Rough Set Formalization of Quantitative Evaluation with Ambiguity | Patrick Paroubek and Xavier Tannier |
444 | A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation | Eleftherios Avramidis, Marta R. Costa-jussa, Christian Federmann, Josef van Genabith, Maite Melero and Pavel Pecina |
445 | Reference corpus of historical Slovene | Tomaž Erjavec |
446 | Inforex — a web-based tool for text corpus management and semantic annotation | Michał Marcińczuk, Jan Kocoń and Bartosz Broda |
447 | A new semantically annotated corpus with syntax-based and cross-lingual senses | Myriam Rakho, Éric Laporte and Matthieu Constant |
448 | Recent Developments in CLARIN-NL | Jan Odijk |
450 | Unsupervised acquisition of concatenative morphology | Lionel Nicolas, Jacques Farré and Cécile Darme |
451 | Massively Increasing TIMEX3 Resources: A Transduction Approach | Leon Derczynski and Estela Saquete |
452 | Constructing Large Proposition Databases | Peter Exner and Pierre Nugues |
453 | Visualizing Sentiment Analysis on a User Forum | Rasmus Sundberg, Anders Eriksson, Johan Bini and Pierre Nugues |
454 | Towards Automatic Gesture Stroke Detection | Binyam Gebrekidan Gebre and Peter Wittenburg |
455 | Semantic Role Labeling with the Swedish FrameNet | Richard Johansson, Karin Friberg Heppin and Dimitrios Kokkinakis |
456 | Prague Dependency Style Treebank for Tamil | Loganathan Ramasamy and Zdeněk Žabokrtský |
457 | A Distributed Resource Repository for Cloud-Based Machine Translation | Jörg Tiedemann, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen and Matthias Zumpe |
460 | Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese | Patricia Gonçalves, Rita Santos and António Branco |
461 | Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects | David Graff and Mohamed Maamouri |
463 | Parallel Data, Tools and Interfaces in OPUS | Jörg Tiedemann |
464 | SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks | Elias Iosif and Alexandros Potamianos |
465 | Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic | Emad Mohamed, Behrang Mohit and Kemal Oflazer |
466 | A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content | Julia Maria Schulz, Daniela Becks, Christa Womser-Hacker and Thomas Mandl |
467 | Kitten: a tool for normalizing HTML and extracting its textual content | Mathieu-Henri Falco, Véronique Moriceau and Anne Vilnat |
468 | A Metadata Editor to Support the Description of Linguistic Resources | Emanuel Dima, Christina Hoppermann, Thorsten Trippel and Claus Zinn |
469 | A Comparable Portuguese-Spanish Corpus with Ellipsis Annotations | Luz Rello and Iria Gayo |
470 | A Repository for the Sustainable Management of Research Data | Emanuel Dima, Erhard Hinrichs, Christina Hoppermann, Thorsten Trippel and Claus Zinn |
471 | A Galician Syntactic Corpus with Application to Intonation Modeling | Montserrat Arza, José M. García-Miguel, Francisco Campillo and Miguel Cuevas-Alonso |
474 | A Reference Dependency Bank for Analyzing Complex Predicates | Tafseer Ahmed, Miriam Butt, Annette Hautli and Sebastian Sulger |
475 | UBY — A Large-Scale Unified Lexical-Semantic Resource Based on LMF | Iryna Gurevych, Judith Eckle-Kohler, Silvana Hartmann, Michael Matuschek, Christian M. Meyer and Christian Wirth |
476 | The Influence of Corpus Quality on Statistical Measurements on Language Resources | Thomas Eckart, Uwe Quasthoff and Dirk Goldhahn |
478 | Applying cross-lingual WSD to wordnet development | Marianna Apidianaki and Benoît Sagot |
480 | Annotating Qualia Relations in Italian and French Complex Nominals | Pierrette Bouillon, Elisabetta Jezek, Chiara Melloni and Aurélie Picton |
481 | Terra: a Collection of Translation Error-Annotated Corpora | Mark Fishel, Ondřej Bojar and Maja Popović |
482 | Identifying Nuggets of Information in GALE Distillation Evaluation | Olga Babko-Malaya, Greg Milette, Michael Schneider and Sarah Scogin |
483 | SUTAV: A Turkish Audio-Visual Database | Ibrahim Saygin Topkaya and Hakan Erdogan |
485 | Speech and Language Resources for LVCSR of Russian | Sergey Zablotskiy, Alexander Shvets, Maxim Sidorov, Eugene Semenkin and Wolfgang Minker |
486 | KALAKA-2: a TV broadcast speech database for the recognition of Iberian languages in clean and noisy environments | Luis Javier Rodriguez-Fuentes, Mikel Penagarikano, Amparo Varona, Mireia Diez and German Bordel |
487 | Automatic word alignment tools to scale production of manually aligned parallel text | Stephen Grimes, Katherine Peterson and Xuansong Li |
488 | Developing and evaluating an emergency scenario dialogue corpus | Jolanta Bachan |
490 | A Framework for Evaluating Text Correction | Robert Dale and George Narroway |
493 | Large Scale Lexical Analysis | Gregor Thurmair, Vera Aleksic and Christoph Schwarz |
494 | NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks | Chieh-Jen Wang, Shuk-Man Cheng, Lung-Hao Lee, Hsin-Hsi Chen, Wen-shen Liu, Pei-Wen Huang and Shih-Peng Lin |
495 | The ETAPE corpus for the evaluation of speech-based TV content processing in the French language | Guillaume Gravier, Gilles Adda, Niklas Paulsson, Matthieu Carré, Aude Giraudel and Olivier Galibert |
496 | Automatic lexical semantic classification of nouns | Núria Bel, Lauren Romeo and Muntsa Padró |
502 | Is it Useful to Support Users with Lexical Resources? A User Study. | Ernesto William De Luca |
503 | In the same boat and other idiomatic seafaring expressions | Rita Marinelli and Laura Cignoni |
504 | Creation and use of Language Resources in a Question-Answering eHealth System | Anna Braasch, Lina Henriksen, Csaba Huszka, Anders Johannsen, Lars Kayser, Bente Maegaard, Ole Norgaard, Stefan Schulz and Jürgen Wedekind |
505 | Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents | Lina M. Rojas-Barahona, Alejandra Lorenzo and Claire Gardent |
506 | Collection of a Large Database of French-English SMT Output Corrections | Marion Potet, Emmanuelle Esperança-Rodier, Laurent Besacier and Hervé Blanchon |
507 | German "nach"-Particle Verbs in Semantic Theory and Corpus Data | Boris Haselbach, Wolfgang Seeker and Kerstin Eckart |
508 | A Prototype Tool to Discover Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation | Els Lefever, Veronique Hoste and Martine De Cock |
509 | Arabic-Segmentation Combination Strategies for Statistical Machine Translation | Saab Mansour and Hermann Ney |
510 | Announcing Prague Czech-English Dependency Treebank 2.0 | Ondřej Bojar, Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová and Zdeněk Žabokrtský |
511 | First Results in a Study Evaluating Pre-labeling and Correction Propagation for Machine-Assisted Syriac Morphological Analysis | Paul Felt, Eric Ringger, Kevin Seppi, Kristian Heal, Robbie Haertel and Deryl Lonsdale |
513 | A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English | Emina Kurtic, Bill Wells, Guy J. Brown, Timothy Kempton and Ahmet Aker |
514 | Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques | Antton Gurrutxaga and Iñaki Alegria |
515 | Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks | Jorge Vivaldi, Luis Adrián Cabrera-Diego, Gerardo Sierra and María Pozzi |
516 | The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends | Javier Caminero, Mari Carmen Rodríguez, Jean Vanderdonckt, Fabio Paternò, Joerg Rett, Dave Raggett, Jean-Loup Comeliau and Ignacio Marín |
518 | French and German corpora for audience-based text type classification | Amalia Todirascu, Sebastian Pado, Jennifer Krisch, Max Kisselew and Ulrich Heid |
519 | The IULA Treebank | Montserrat Marimon, Beatríz Fisas, Núria Bel, Marta Villegas, Jorge Vivaldi, Sergi Torner and Mercè Lorente |
520 | Modality in Text: a Proposal for Corpus Annotation | Iris Hendrickx, Amália Mendes and Silvia Mencarelli |
521 | Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text | Maria Skeppstedt, Maria Kvist and Hercules Dalianis |
522 | An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction | Md. Faisal Mahbub Chowdhury and Alberto Lavelli |
524 | Federated Search: Towards a Common Search Infrastructure | Herman Stehouwer, Matej Durco, Eric Auer and Daan Broeder |
525 | Evaluating and improving syntactic lexica by plugging them within a parser | Elsa Tolone, Benoît Sagot and Éric Villemonte de la Clergerie |
526 | The Herme Database of Spontaneous Multimodal Human-Robot Dialogues | Frank Han, Emer Gilmartin, Celine Delooze, Brian Vaughan and Nick Campbell |
527 | Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation | Víctor M. Sánchez-Cartagena, Miquel Esplà-Gomis and Juan Antonio Pérez-Ortiz |
529 | EXMARaLDA and the FOLK tools – two toolsets for transcribing and annotating spoken language | Thomas Schmidt |
530 | ISO 24617-2: A semantically-based standard for dialogue annotation | Harry Bunt, Jan Alexandersson, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Volha Petukhova, Andre Popescu-Belis and David Traum |
531 | Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary | Inga Gheorghita and Jean-Marie Pierrel |
533 | A review corpus annotated for negation, speculation and their scope | Natalia Konstantinova, Sheila C.M. de Sousa, Noa P. Cruz, Manuel J. Maña, Maite Taboada and Ruslan Mitkov |
534 | Developing a large semantically annotated corpus | Valerio Basile, Johan Bos, Kilian Evang and Noortje Venhuizen |
535 | Dynamic web service deployment in a cloud environment | Marc Kemps-Snijders, Matthijs Brouwer, JanPieter Kunst and Tom Visser |
536 | Associative and Semantic Features Extracted From Web-Harvested Corpora | Elias Iosif, Maria Giannoudaki, Eric Fosler-Lussier and Alexandros Potamianos |
537 | Collection of a corpus of Dutch SMS | Maaske Treurniet, Orphée De Clercq, Henk van den Heuvel and Nelleke Oostdijk |
538 | Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access | Marcello Ferro, Claudia Marzi, Claudia Caudai and Vito Pirrelli |
539 | A Corpus for Gesture-Controlled Mobile Spoken Dialogue Systems | Nikos Tsourakis and Manny Rayner |
543 | Towards a User-Friendly Platform for Building Language Resources based on Web Services | Marc Poch, Antonio Toral, Olivier Hamon, Valeria Quochi and Núria Bel |
544 | Collaborative semantic editing of linked data lexica | John McCrae, Elena Montiel-Ponsoda and Philipp Cimiano |
545 | Evaluating the Impact of Phrase Recognition on Semantic Annotation | Pablo Mendes, Joachim Daiber, Rohana Rajapakse, Felix Sasaki and Christian Bizer |
547 | Proper Language Resource Centers | Willem Elbers, Daan Broeder and Dieter van Uytvanck |
548 | RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building | Alessandro Panunzi, Marco Fabbri, Massimo Moneglia, Lorenzo Gregori and Samuele Paladini |
549 | Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign | Karën Fort, Claire François, Olivier Galibert and Maha Ghribi |
550 | LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions | Dietmar Rösner, Jörg Frommer, Rafael Friesen, Matthias Haase, Julia Lange and Mirko Otto |
552 | Semantic annotation of French corpora: animacy and verb semantic classes | Juliette Thuilier |
553 | Evaluation of Unsupervised Information Extraction | Wei Wang, Romaric Besançon, Olivier Ferret and Brigitte Grau |
555 | A contrastive review of paraphrase acquisition techniques | Houda Bouamor, Aurélien Max, Gabriel Illouz and Anne Vilnat |
557 | Expanding Arabic Treebank to Speech: Results from Broadcast News | Mohamed Maamouri, Ann Bies and Seth Kulick |
559 | Typing Race Games as a Method to Create Spelling Error Corpora | Paul Rodrigues and C. Anton Rytting |
561 | A treebank-based study on the influence of Italian word order on parsing performance | Anita Alicante, Cristina Bosco, Anna Corazza and Alberto Lavelli |
562 | Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems | Kallirroi Georgila, Alan Black, Kenji Sagae and David Traum |
563 | A Search Tool for FrameNet Constructicon | Hiroaki Sato |
566 | Extraction of unmarked quotations in Newspapers | Stéphanie Weiser and Patrick Watrin |
567 | Ontoterminology: How to unify terminology and ontology into a single paradigm | Christophe Roche |
568 | A system for automatic tonal annotation of speech corpora | Piet Mertens |
569 | Corpus Annotation as a Psycholinguistic Task | Donia Scott, Rossano Barone and Rob Koeling |
570 | DBpedia: A Multilingual Cross-domain Knowledge Base | Pablo Mendes, Max Jakob and Christian Bizer |
572 | A Morphological Analyzer For Wolof Using Finite-State Techniques | Cheikh M. Bamba Dione |
574 | Designing a search interface for a Spanish learner spoken corpus: the end-user’s evaluation | Leonardo Campillos Llanos |
576 | What is a “linguistic” ontology? The creation and purpose of a bottom-up ontology for nouns in context | Patrick Hanks, Elisabetta Ježek and Francesca Frontini |
577 | Design and compilation of a specialized parallel corpus Spanish-German | Carla Parra Escartín |
578 | Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples | Minoru Sasaki and Hiroyuki Shinnou |
579 | Conventional Orthography for Dialectal Arabic | Nizar Habash, Mona Diab and Owen Rambow |
581 | Standardizing a Component Metadata Infrastructure | Daan Broeder, Dieter van Uytvanck, Maria Gavrilidou and Thorsten Trippel |
583 | Assessing Crowdsourcing Quality through Objective Tasks | Ahmet Aker, Mahmoud El-Haj, M-Dyaa Albakour and Udo Kruschwitz |
584 | Association Norms of German Noun Compounds | Sabine Schulte im Walde and Susanne Borgwaldt |
585 | Word Sketches for Turkish | Bharat Ram Ambati, Siva Reddy and Adam Kilgarriff |
586 | The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser | Eckhard Bick, Heliana Mello, Alessandro Panunzi and Tommaso Raso |
587 | Bulgarian X-language Parallel Corpus | Svetla Koeva, Ivelina Stoyanova, Rositsa Dekova, Borislav Rizov and Angel Genov |
589 | The MASC Word Sense Corpus | Rebecca Passonneau, Collin Baker, Christiane Fellbaum and Nancy Ide |
590 | Págico: Evaluating Wikipedia-based information retrieval in Portuguese | Cristina Mota, Alberto Simões, Cláudia Freitas, Luís Costa and Diana Santos |
593 | Representation of linguistic and domain knowledge for second language learning in virtual worlds | Alexandre Denis, Ingrid Falk, Claire Gardent and Laura Perez-Beltrachini |
594 | A Multilingual Natural Stress Emotion Database | Xin Zuo, Tian Li and Pascale Fung |
595 | The Twins Corpus of Museum Visitor Questions | Priti Aggarwal, Ron Artstein, Jillian Gerten, Athanasios Katsamanis, Angela Nazarian and David Traum |
596 | Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information | Chi-Hsin Yu and Hsin-Hsi Chen |
597 | Medical term extraction in an Arabic medical corpus | Doaa Samy, Antonio Moreno-Sandoval, Conchi Bueno-Díaz, Marta Garrote-Salazar and José M Guirao |
598 | Annotation Facilities for the Reliable Analysis of Human Motion | Michael Kipp |
601 | Addressing polysemy in bilingual lexicon extraction from comparable corpora | Darja Fišer and Nikola Ljubešić |
603 | Arabic Word Generation and Modelling for Spell Checking | Mohammed Attia, Pavel Pecina, Younes Samih and Josef van Genabith |
605 | Annotation of response tokens and their triggering expressions in Japanese multi-party conversations | Yasuharu Den, Hanae Koiso, Katsuya Takanashi and Nao Yoshida |
606 | Automatically Generated Online Dictionaries | Enikő Héja and Dávid Takács |
607 | Method for Collection of Acted Speech Using Various Situation Scripts | Takahiro Miyajima, Hideaki Kikuchi, Katsuhiko Shirai and Shigeki Okawa |
608 | Citing on-line Language Resources | Daan Broeder, Dieter van Uytvanck and Gunter Senft |
609 | Automatic Extraction and Evaluation of Arabic LFG Resources | Mohammed Attia, Lamia Tounsi and Josef van Genabith |
610 | Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger | Matthieu Constant and Isabelle Tellier |
611 | The Minho Quotation Resource | Brett Drury and JJ Almeida |
614 | Translog-II: a Program for Recording User Activity Data for Empirical Translation Process Research | Michael Carl |
616 | Morphosyntactic Analysis of the CHILDES and TalkBank Corpora | Brian MacWhinney and Shuly Wintner |
618 | Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification | Jan Rygl and Aleš Horák |
619 | Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi | Ritesh Kumar |
622 | LexIt: A Computational Resource on Italian Argument Structure | Alessandro Lenci, Gabriella Lapesa and Giulia Bonansinga |
623 | Annotating Football Matches: Influence of the Source Medium on Manual Annotation | Karën Fort and Vincent Claveau |
624 | The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese | Tommaso Raso and Heliana Mello |
626 | A light way to collect comparable corpora from the Web | Ahmet Aker, Evangelos Kanoulas and Robert Gaizauskas |
627 | Holaaa!! writin like u talk is kewl but kinda hard 4 NLP | Maite Melero, Judith Domingo, Montse Marquina and Martí Quixal |
628 | Applying Random Indexing to Structured Data to Find Contextually Similar Words | Danica Damljanovic, Udo Kruschwitz, M-Dyaa Albakour, Johann Petrak and Mihai Lupu |
629 | Coreference in Spoken vs. Written Texts: a Corpus-based Analysis | Marilisa Amoia, Kerstin Kunz and Ekaterina Lapshinova-Koltunski |
632 | Towards Fully Automatic Annotation of Audio Books for TTS | Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer and Damien Lolive |
633 | Centroids: Gold standards with distributional variation | Ian Lewin, Senay Kafkas and Dietrich Rebholz-Schuhmann |
635 | Multimodal Behaviour and Feedback in Different Types of Interaction | Costanza Navarretta and Patrizia Paggio |
636 | On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market | David Lewis, Alexander O’Connor, Andrzej Zydroń, Gerd Sjögren and Rahzeb Choudhury |
637 | Multimedia database of the cultural heritage of the Balkans | Ivana Tanasijević, Biljana Sikimić and Gordana Pavlović-Lažetić |
638 | ANALEC: a New Tool for the Dynamic Annotation of Textual Data | Frederic Landragin and Thierry Poibeau |
639 | Feedback in Nordic First-Encounters: a Comparative Study | Costanza Navarretta, Elisabeth Ahlsén, Jens Allwood, Kristiina Jokinen and Patrizia Paggio |
640 | Annotating Opinions in German Political News | Kristina Adson, Hong Li, Tal Kirshboim, Xiwen Cheng and Feiyu Xu |
641 | MultiUN v2: UN Documents with Multilingual Alignments | Yu Chen and Andreas Eisele |
642 | The CONCISUS Corpus of Event Summaries | Horacio Saggion and Sandra Szasz |
643 | Building and Exploring Semantic Equivalences Resources | Gracinda Carvalho, David Martins de Matos and Vitor Rocio |
644 | IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus | Septina Dian Larasati |
645 | The Joy of Parallelism with CzEng 1.0 | Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel and Aleš Tamchyna |
646 | LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis | Kais Dukes and Eric Atwell |
648 | Web Service integration platform for Polish linguistic resources | Maciej Ogrodniczuk and Michał Lenart |
649 | Suffix Trees as Language Models | Casey Redd Kennington and Martin Kay |
651 | The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System | Liviu P. Dinu, Vlad Niculae and Octavia-Maria Şulea |
652 | Using semi-experts to derive judgments on word sense alignment: a pilot study | Soojeong Eom, Markus Dickinson and Graham Katz |
653 | The Polish Sejm Corpus | Maciej Ogrodniczuk |
655 | Creating and Curating a Cross-Language Entity Linking Collection | Dawn Lawrie, James Mayfield, Paul McNamee and Douglas Oard |
656 | The TARSQI Toolkit | Marc Verhagen and James Pustejovsky |
657 | A corpus of general and specific sentences from news | Annie Louis and Ani Nenkova |
658 | Annotation Trees: LDC’s customizable, extensible, scalable, annotation infrastructure | Jonathan Wright, Kira Griffitt, Joe Ellis, Stephanie Strassel and Brendan Callahan |
661 | Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing | Elena Filatova |
663 | YADAC: Yet another Dialectal Arabic Corpus | Rania Al-Sabbagh and Roxana Girju |
664 | An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) | James Clarke, Vivek Srikumar, Mark Sammons and Dan Roth |
665 | Korean learners’ English accentuation corpus for automatic accent detection | Ho-Young Lee, Jieun Song and Kiduk Yoon |
672 | Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization | Luís Marujo, Anatole Gershman, Jaime Carbonell, Robert Frederking and João P. Neto |
673 | Hindi Subjective Lexicon (HSL): A Lexical Resource for Hindi Adjective Polarity Classification | Akshat Bakliwal, Piyush Arora and Vasudeva Varma |
674 | Annotating Near-Identity from Coreference Disagreements | Marta Recasens, M. Antònia Martí and Constantin Orasan |
676 | The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogue | Takenobu Tokunaga, Ryu Iida, Asuka Terai and Naoko Kuriyama |
677 | Statistical Machine Translation without Source-side Parallel Corpus using Word Lattice and Phrase Extension | Takanori Kusumoto and Tomoyoshi Akiba |
678 | Extracting Knowledge-Rich Contexts in Russian and German | Anne-Kathrin Schumann |
679 | Brand Pitt: A Corpus to Explore the Art of Naming | Gozde Ozbal, Carlo Strapparava and Marco Guerini |
680 | Evaluating automatic cross-domain Dutch semantic role annotation | Orphée De Clercq, Veronique Hoste and Paola Monachesi |
682 | Syntactic annotation of spontaneous speech: application to call-center conversation data | Thierry Bazillon, Melanie Delplano, Frederic Bechet, Alexis Nasr and Benoit Favre |
683 | Korean Children’s Spoken English Corpus and an Analysis of its Pronunciation Variability | Hyejin Hong, Sunhee Kim and Minhwa Chung |
684 | DECODA: a call-centre human-human spoken conversation corpus | Frederic Bechet, Benjamin Maza, Nicolas Bigouroux, Thierry Bazillon, Marc El-Beze, Renato De Mori and Eric Arbillot |
685 | The ALLEGRA corpus: a trilingual resource for Romansh, an under-represented language of Switzerland | Yves Scherrer and Bruno Cartoni |
686 | Intelligibility assessment in forensic applications | Giovanni Costantini, Andrea Paoloni and Massimiliano Todisco |
687 | From medical language processing to BioNLP domain | Sara Goggi, Manuela Sassi, Gabriella Pardelli and Stefania Biagioni |
693 | Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task | Tomoyosi Akiba, Hiromitsu Nishizaki, Kiyoaki Aikawa, Tatsuya Kawahara and Tomoko Matsui |
697 | Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese | Antonio Moreno-Sandoval, Leonardo Campillos, Yang Dong, Emi Takamori, José M. Guirao, Paula Gozalo, Chieko Kimura, Kengo Matsui and Marta Garrote |
698 | TED-LIUM: an Automatic Speech Recognition dedicated corpus | Anthony Rousseau, Paul Deléglise and Yannick Estève |
700 | The SYNC3 Collaborative Annotation Tool | Georgios Petasis |
701 | ELRA in the heart of a cooperative HLT world | Valérie Mapelli, Victoria Arranz, Matthieu Carré, Hélène Mazo, Djamel Mostefa and Khalid Choukri |
703 | Automatic Translation of Scientific Documents in the HAL Archive | Patrik Lambert, Holger Schwenk and Frédéric Blain |
706 | Irregularity Detection in Categorized Document Corpora | Borut Sluban, Senja Pollak, Roel Coesemans and Nada Lavrac |
707 | The REPERE Corpus: a multimodal corpus for person recognition | Aude Giraudel, Matthieu Carré, Valérie Mapelli, Juliette Kahn, Olivier Galibert and Ludovic Quintard |
708 | Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification | Andrea Gesmundo and Tanja Samardzic |
709 | Efficient Dependency Graph Matching with the IMS Open Corpus Workbench | Thomas Proisl and Peter Uhrig |
711 | Strategies to Improve a Speaker Diarisation Tool | David Tavarez, Eva Navas, Daniel Erro and Ibon Saratxaga |
712 | Exploring self-training and co-training for dependency parsing | Rahul Goutam and Bharat Ambati |
714 | Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions | Atsushi Fujii, Yuya Fujii and Takenobu Tokunaga |
715 | MaltOptimizer: A System for MaltParser Optimization | Miguel Ballesteros and Joakim Nivre |
716 | Building Text-To-Speech Voices in the Cloud | Alistair Conkie, Thomas Okken, Yeon-Jun Kim and Giuseppe Di Fabbrizio |
717 | On the practice of error analysis for machine translation evaluation | Sara Stymne and Lars Ahrenberg |
718 | The Database of Modern Icelandic Inflection | Kristín Bjarnadóttir |
719 | Croatian Dependency Treebank: Recent Development and Initial Experiments | Dasa Berovic, Zeljko Agic and Marko Tadic |
720 | The Tagged Icelandic Corpus (MÍM) | Sigrún Helgadóttir, Ásta Svavarsdóttir, Eiríkur Rögnvaldsson, Kristín Bjarnadóttir and Hrafn Loftsson |
721 | Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World | Tina Kluewer, Peter Adolphs, Feiyu Xu and Hans Uszkoreit |
722 | Using LRs in Humanities research | Marta Villegas, Nuria Bel, Carlos Gonzalo, Amparo Moreno and Nuria Simelio |
724 | A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian | Petya Osenova, Kiril Simov, Laska Laskova and Stanislava Kancheva |
726 | Integrating/Interchanging NLP tools in a distributed environment: a case study chaining Freeling to the DESR parser | Francesco Rubino, Francesca Frontini and Valeria Quochi |
727 | Spell Checking for Chinese | Shaohua Yang, Hai Zhao, Xiaolin Wang and Bao-liang Lu |
729 | Customization of the Europarl Corpus for Translation Studies | Zahurul Islam and Alexander Mehler |
730 | A Parallel Corpus of Music and Lyrics Annotated with Emotions | Carlo Strapparava, Rada Mihalcea and Alberto Battocchi |
732 | Creation of a bottom-up corpus-based ontology for Italian Linguistics | Elisa Bianchi and Mirko Tavosanis |
733 | Glottolog/Langdoc: Increasing the visibility of grey literature for low-density languages | Sebastian Nordhoff and Harald Hammarström |
734 | Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora | Carmen Dayrell, Arnaldo Candido Jr., Gabriel Lima, Danilo Machado Jr., Ann Copestake, Valéria Feltrim, Stella Tagnin and Sandra Aluisio |
735 | NgramQuery - Smart Information Extraction from Google N-gram using External Resources | Martin Aleksandrov and Carlo Strapparava |
736 | Error profiling for task-based evaluation of machine-translated text: a Polish-English case study | Sandra Weiss and Lars Ahrenberg |
737 | Polish Multimodal Corpus – a collection of referential gestures | Magdalena Lis |
740 | DEGELS1: A comparable corpus of French Sign Language and co-speech gestures | Annelies Braffort and Leïla Boutora |
741 | Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs | Matilde Gonzalez, Michael Filhol and Christophe Collet |
743 | Expanding Parallel Resources for Medium-Density Languages for Free | Georgi Iliev |
744 | Creation of Shared Language Resource Repository in the Nordic and Baltic Countries | Andrejs Vasiljevs, Markus Forsberg, Tatiana Gornostay, Dorte Hansen, Kristín Jóhannsdóttir, Gunn Lyse, Krister Lindén, Lene Offersgaard, Sussi Olsen, Bolette Pedersen, Eiríkur Rögnvaldsson, Inguna Skadiņa and Koenraad De Smedt |
746 | Adapting and evaluating a generic term extraction tool | Anita Gojun, Ulrich Heid, Bernd Weißbach, Carola Loth and Insa Mingers |
748 | Beyond SoNaR: towards the facilitation of large corpus building efforts | Martin Reynaert, Ineke Schuurman, Veronique Hoste and Nelleke Oostdijk |
751 | Robustness and adaptation of spoken language understanding systems among languages and domains: the PORTMEDIA project | Fabrice Lefèvre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, Nathalie Camelin, Lina Rojas-Barahona, Bassam Jabaian and Benoit Favre |
753 | A GUI to Detect and Correct Errors in Hindi Dependency Treebank | Rahul Agarwal, Bharat Ram Ambati and Anil Kumar Singh |
754 | Spell Checking in Spanish: The Case of Diacritic Accents | Jordi Atserias, Maria Fuentes, Rogelio Nazar and Irene Renau |
755 | Iterative Refinement of Annotation Guidelines for Semantically Fuzzy Named Entity Types, such as Pathological Phenomena | Kerstin Hornbostel, Elena Beisswanger, Ekaterina Buyko, Erik Faessler, Jennifer Traumüller, Susann Schröder and Udo Hahn |
756 | Example-Based Treebank Querying | Liesbeth Augustinus, Vincent Vandeghinste and Frank Van Eynde |
757 | Annotating Factive Verbs | Alvin Grissom II and Yusuke Miyao |
758 | Annotating Errors in a Hungarian Learner Corpus | Markus Dickinson and Scott Ledbetter |
761 | Evaluation of a Complex Information Extraction Application in Specific Domain | Romaric Besançon, Olivier Ferret and Ludovic Jean-Louis |
762 | Text Simplification Tools for Spanish | Stefan Bott and Horacio Saggion |
763 | VERTa: Linguistic features in MT evaluation | Elisabet Comelles, Jordi Atserias, Victoria Arranz and Irene Castellón |
766 | Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3 | Atro Voutilainen, Kristiina Muhonen, Tanja Purtonen and Krister Linden |
767 | W-PhAMT: A web tool for phonetic multilevel timeline visualization | Francesco Cutugno and Vincenza Anna Leano |
769 | The LRE Map. Harmonising Community Descriptions of Resources | Nicoletta Calzolari, Riccardo Del Gratta, Gil Francopoulo, Joseph Mariani, Francesco Rubino, Irene Russo and Claudia Soria |
770 | Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information | Corina Forascu and Dan Tufis |
772 | Chinese Whispers: Cooperative Paraphrase Acquisition | Matteo Negri, Yashar Mehdad, Alessandro Marchetti, Danilo Giampiccolo and Luisa Bentivogli |
773 | The Nordic Dialect Corpus | Janne Bondi Johannessen, Joel Priestley, Kristin Hagen, Anders Nøklestad and André Lynum |
774 | WeSearch UGC: A Comprehensive Sample of User-Generated Content | Jonathon Read, Rebecca Dridan, Stephan Oepen and Lilja Øvrelid |
775 | Legal electronic dictionary for Czech | František Cvrček, Karel Pala and Pavel Rychlý |
776 | This also affects the context - Errors in extraction based summaries | Thomas Kaspersson, Christian Smith, Henrik Danielsson and Arne Jönsson |
777 | The FLaReNet Strategic Language Resource Agenda | Claudia Soria, Nuria Bel, Khalid Choukri, Joseph Mariani, Monica Monachini, Jan Odijk, Stelios Piperidis, Valeria Quochi and Nicoletta Calzolari |
778 | Yes we can!? Annotating English modal verbs | Josef Ruppenhofer and Ines Rehbein |
779 | Building a Multimodal Laughter Database for Emotion Recognition | Merlin Teodosia Suarez, Jocelynn Cu and Madelene Sta. Maria |
782 | Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus | Jörg Frommer, Bernd Michaelis, Dietmar Rösner, Andreas Wendemuth, Rafael Friesen, Matthias Haase, Manuela Kunze, Rico Andrich, Julia Lange, Axel Panning and Ingo Siegert |
783 | Rapid creation of large-scale corpora and frequency dictionaries | Attila Zséder, Gábor Recski, Dániel Varga and András Kornai |
784 | Adaptive dictionary for bilingual lexicon extraction from comparable corpora | Amir Mohamed Hazem and Emmanuel Morin |
785 | Linguistic Resources for Handwriting Recognition and Translation Evaluation | Zhiyi Song, Safa Ismael, Steven Grimes, Gary Krug and Stephanie Strassel |
786 | Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabification | Antonio Origlia and Iolanda Alfano |
787 | The DISCO ASR-based CALL system: practicing L2 oral skills and beyond | Helmer Strik, Jozef Colpaert, Joost Van Doremalen and Catia Cucchiarini |
788 | METU Turkish Discourse Bank Browser | Utku Şirin, Ruket Çakıcı and Deniz Zeyrek |
789 | The New IDS Corpus Analysis Platform: Challenges and Prospects | Piotr Bański, Peter M. Fischer, Elena Frick, Erik Ketzan, Marc Kupietz, Carsten Schnober, Oliver Schonefeld and Andreas Witt |
790 | Evaluating Multi-focus Natural Language Queries over Data Services | Silvia Quarteroni, Vincenzo Guerrisi and Pietro La Torre |
794 | Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation | Maria Teresa Pazienza, Noemi Scarpato and Armando Stellato |
796 | Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian | Mladen Karan, Jan Šnajder and Bojana Dalbelo Bašić |
800 | Evaluating Query Languages for a Corpus Processing System | Elena Frick, Carsten Schnober and Piotr Bański |
802 | Two Phase Evaluation for Selecting Machine Translation Services | Chunqi Shi, Donghui Lin, Masahiko Shimada and Toru Ishida |
804 | Development and Application of a Cross-language Document Comparability Metric | Fangzhong Su and Bogdan Babych |
805 | A Graphical Citation Browser for the ACL Anthology | Benjamin Weitz and Ulrich Schäfer |
806 | Document Attrition in Web Corpora: an Exploration | Stephen Wattam, Paul Rayson and Damon Berridge |
807 | A Repository of Data and Evaluation Resources for Natural Language Generation | Anja Belz and Albert Gatt |
808 | Scenario Oriented Translation Service Composition | Chunqi Shi, Donghui Lin and Toru Ishida |
810 | Compatibility and interoperability of multi-modal standardized language resources | Yoshinobu Kano |
812 | The Quaero Evaluation Campaign on Term Extraction | Thibault Mondary, Adeline Nazarenko, Haïfa Zargayouna and Sabine Barreaux |
813 | Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective. | Lorenza Russo, Sharid Loáiciga and Asheesh Gulati |
814 | DGT-TM: A freely available Translation Memory in 22 languages | Ralf Steinberger, Andreas Eisele, Szymon Klocek, Spyridon Pilos and Patrick Schlüter |
815 | Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations | Mona Diab and Heba Elfardy |
816 | META-SHARE v1: An Open Network of Repositories for Language Resources including Data and Tools | Christian Federmann, Byron Georgantopoulos, Christian Girardi, Olivier Hamon, Dimitris Mavroeidis, Salvatore Minutoli and Marc Schröder |
817 | Analyzing and Aligning German compound nouns | Marion Weller and Ulrich Heid |
818 | Identification of Manner in Bio-Events | Raheel Nawaz, Paul Thompson and Sophia Ananiadou |
819 | A tool/database interface for multi-level analyses | Kurt Eberle, Kerstin Eckart and Ulrich Heid |
820 | Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies | Thierry Declerck and Karlheinz Mörth |
822 | Using an ASR database to design a pronunciation evaluation system in Basque | Igor Odriozola, Eva Navas, Inma Hernáez, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez and Daniel Erro |
823 | A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Certainty and Uncertainty | Alberto Lavelli, Bernardo Magnini, Ramona Bongelli, Carla Canestrari, Ilaria Riccioni, Cinzia Buldorini, Ricardo Pietrobon and Andrzej Zuczkowski |
824 | New language resources for the Pashto language | Djamel Mostefa |
825 | Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language | Ting Liu, Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley and Sarah Taylor |
827 | CALBC: Releasing the Final Corpora | Şenay Kafkas, Ian Lewin, David Milward, Erik van Mulligen, Jan Kors, Udo Hahn and Dietrich Rebholz-Schuhmann |
828 | BUCEADOR, a multi-language search engine for digital libraries | Jordi Adell, Antonio Bonafonte, Antonio Cardenal, Marta R. Costa-jussà, José A. R. Fonollosa and Asunción Moreno |
829 | Linguistic Analysis Processing Line for Bulgarian | Aleksandar Savkov, Laska Laskova, Stanislava Kancheva, Petya Osenova and Kiril Simov |
830 | Getting more data — Schoolkids as annotators | Jirka Hana and Barbora Hladka |
831 | WordNet Atlas: a web application for browsing WordNet as a zoomable map | Matteo Abrate and Clara Bacciu |
834 | Building large corpora from the web using a new efficient tool chain | Roland Schäfer and Felix Bildhauer |
836 | An empirical resource for discovering cognitive principles of discourse organization: the ANNODIS corpus | Stergos Afantenos, Nicholas Asher, Farah Benamara, Myriam Bras, Cecile Fabre, Mai Ho-Dac, Anne Le Draoulec, Philippe Muller, Marie-Paul Pery-Woodley, Laurent Prevot, Josette Rebeyrolles, Ludovic Tanguy, Marianne Vergez-Couret and Laure Vieu |
838 | English to Indonesian Transliteration to Support English Pronunciation Learning | Amalia Zahra and Julie Carson-Berndsen |
839 | Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations | Kata Gábor, Marianna Apidianaki, Benoît Sagot and Eric Villemonte de La Clergerie |
842 | Using the International Standard Language Resource Number: Practical and Technical Aspects | Jungyeul Park, Victoria Arranz, Olivier Hamon and Khalid Choukri |
843 | Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System | Claire Jaja, Douglas Briesch, Jamal Laoudi and Clare Voss |
844 | RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus | Jens Forster, Christoph Schmidt, Thomas Hoyoux, Oscar Koller, Uwe Zelle, Justus Piater and Hermann Ney |
845 | The KnowledgeStore: an Entity-Based Storage System | Roldano Cattoni, Francesco Corcoglioniti, Christian Girardi, Bernardo Magnini, Luciano Serafini and Roberto Zanoli |
846 | Elaborating a sustainability model for LRs | Khalid Choukri, Victoria Arranz and Jungyeul Park |
848 | CLIMB grammars: three projects using metagrammar engineering | Antske Fokkens, Tania Avgustinova and Yi Zhang |
850 | Annotated Bibliographical Reference Corpora in Digital Humanities | Young-Min Kim, Patrice Bellot, Elodie Faath and Marin Dacos |
853 | Corpus of children voices for mid-level social markers and affect bursts analysis | Marie Tahon, Agnes Delaborde and Laurence Devillers |
855 | Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier | Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, Thomas Lavergne and François Yvon |
857 | Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies. | Hanno Biber and Evelyn Breiteneder |
860 | Annotation of anaphoric relations and topic continuity in Japanese conversation | Natsuko Nakagawa and Yasuharu Den |
863 | Classifying Standard Linguistic Processing Functionalities based on the Fundamental Data Operation Types | Yoshihiko Hayashi and Chiharu Narawa |
864 | Evaluating expressive speech synthesis from audiobook corpora for conversational phrases | Eva Szekely, Joao Cabral, Mohamed Abou-Zleikha, Peter Cahill and Julie Carson-Berndsen |
865 | The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech | Juan María Garrido, Yesika Laplaza, Montse Marquina, Andrea Pearman, José Gregorio Escalada, Miguel Ángel Rodríguez and Ana Armenta |
866 | DramaBank: Annotating Agency in Narrative Discourse | David Elson |
867 | Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. | Alessio Bosca, Luca Dini, Milen Kouylekov and Marco Trevisan |
868 | Statistical Measures for the Acceptability and Semi-productivity of Persian Light Verb Constructions | Shiva Taslimipoor, Afsaneh Fazly and Ali Hamzeh |
872 | On the Way to a Legal Sharing of Web Applications in NLP | Victoria Arranz and Olivier Hamon |
875 | JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool | Ralf Steinberger, Mohamed Ebrahim and Marco Turchi |
876 | Designing French Tale Corpora for Entertaining Text To Speech Synthesis | David Doukhan, Sophie Rosset, Albert Rilliard, Christophe d’Alessandro and Martine Adda-Decker |
878 | „Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project | Helen Aristar-Dry, Sebastian Drude, Jost Gippert, Irina Nevskaya and Menzo Windhouwer |
879 | Le Petit Prince in UNL | Ronaldo Martins |
881 | Unsupervised document zone identification using probabilistic graphical models | Andrea Varga, Daniel Preotiuc-Pietro and Fabio Ciravegna |
882 | Summarizing a multi-source set of documents in a Smart Room | Maria Fuentes, Horacio Rodríguez and Jordi Turmo |
884 | Empirical Comparisons of MASC Word Sense Annotations: a First Look | Gerard de Melo and Collin F. Baker |
885 | Creating HAVIC: The Heterogeneous Audio Visual Internet Collection | Stephanie Strassel, Amanda Morris, Jonathan Fiscus, Christopher Caruso, Haejoong Lee, Paul Over, James Fiumara, Barbara Shaw, Brian Antonishek and Martial Michel |
886 | Identifying Multi-Word Expressions in Statistical Machine Translation | Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum |
887 | Multi-Layer Discourse Annotation of a Dutch Text Corpus | Gisela Redeker, Ildikó Berzlánovich, Nynke van der Vliet, Gosse Bouma and Markus Egg |
888 | Identifying Word Translations from Comparable Documents without a Seed Lexicon | Reinhard Rapp, Serge Sharoff, Bogdan Babych and Richard Forsyth |
891 | The Language Archive – a new hub for language resources | Sebastian Drude, Daan Broeder, Paul Trilsbeek and Peter Wittenburg |
892 | A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora | Mahdi Khademian, Kaveh Taghipour and Shahram Khadivi |
893 | Automatic Term Recognition Needs Multiple Evidence | Natalia Loukachevitch |
894 | Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank | Sudheer Kolachina, Rashmi Prasad, Dipti Misra Sharma and Aravind Joshi |
895 | A methodology for the extraction of information about the usage of formulaic expressions in scientific texts | Hannah Kermes |
898 | CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations | Irina Temnikova |
900 | From Grammar Extraction to Treebanking: A Bootstrapping Approach | Masood Ghayoomi |
902 | Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation | Takafumi Suzuki, Yusuke Abe, Itsuki Toyota, Takehito Utsuro, Suguru Matsuyoshi and Masatoshi Tsuchiya |
903 | Improving K-Nearest Neighbor Efficacy for Farsi Text Classification | Behrooz Minaei, Mohammad Hossein Elahimanesh and Hossein Malekinezhad |
905 | ULex: new data models and a mobile environment for corpus enrichment. | Dafydd Gibbon |
906 | UniDic for Early Middle Japanese: an Electrical Dictionary for Morphological Analysis of Classical Japanese | Toshinobu Ogiso, Mamoru Komachi, Yasuharu Den and Yuji Matsumoto |
907 | A platform-independent user-friendly dictionary from Italian to LIS | Umar Shoaib, Gabriele Tiotto, Nadeem Ahmad and Paolo Prinetto |
908 | Mapping WordNet to the Kyoto ontology | Egoitz Laparra, German Rigau and Piek Vossen |
909 | Tools for plWordNet Development. Presentation and Perspectives | Bartosz Broda, Marek Maziarz and Maciej Piasecki |
910 | Creating a Data Collection for Evaluating Rich Speech Retrieval | Maria Eskevich, Gareth J.F. Jones, Martha Larson and Roeland Ordelman |
911 | Ontologies of Linguistic Annotation: Survey and perspectives | Christian Chiarcos |
912 | The Open Linguistics Working Group | Christian Chiarcos, Sebastian Hellmann and Sebastian Nordhoff |
914 | An Annotation Scheme for Quantifier Scope Disambiguation | Mehdi Manshadi, Eric Meinhardt, James Allen and Mary Swift |
915 | A generic formalism to represent linguistic corpora in RDF and OWL/DL | Christian Chiarcos |
916 | LDC Language Resource Papers Catalog: Building a Bibliographic Database | Eleftheria Ahtaridis, Christopher Cieri and Denise DiPersio |
917 | A hierarchical approach with feature selection for emotion recognition from speech | Panagiotis Giannoulis and Gerasimos Potamianos |
919 | A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation | Anil Kumar Singh |
921 | A Study of Word-Classing for MT Reordering | Ananthakrishnan Ramanathan and Karthik Visweswariah |
922 | DOXA server: creating a generic web evaluation platform | Pierre Albert, Olivier Hamon and Patrick Paroubek |
924 | Large aligned treebanks for syntax-based machine translation | Gideon Kotzé, Vincent Vandeghinste, Scott Martens and Jörg Tiedemann |
925 | Collecting and Using Comparable Corpora for Statistical Machine Translation | Inguna Skadiņa, Ahmet Aker, Nikos Glaros, Fangzhong Su, Dan Tufis, Mateja Verlic, Andrejs Vasiļjevs and Bogdan Babych |
926 | Recognition of Polish Derivational Relations Based on Supervised Learning Scheme | Maciej Piasecki, Radoslaw Ramocki and Marek Maziarz |
931 | Combining Formal Concept Analysis and semantic information for building ontological structures from texts: an exploratory study | Silvia Moraes and Vera Lima |
934 | Diversified Bootstrapping for Acquiring High-Coverage Paraphrase Resource | Hideki Shima and Teruko Mitamura |
938 | The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language | Mike Kestemont, Claudia Peersman, Benny De Decker, Guy De Pauw, Kim Luyckx, Roser Morante, Frederik Vaassen, Janneke van de Loo and Walter Daelemans |
939 | Clause-based Discourse Segmentation of Arabic Texts | Iskandar Keskes, Farah Benamara and Lamia Hadrich Belguith |
941 | Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure | Yuichiroh Matsubayashi, Yusuke Miyao and Akiko Aizawa |
942 | An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use | Mohammad Fazleh Elahi and Paola Monachesi |
944 | Domain-specific vs. Uniform Modeling for Coreference Resolution | Olga Uryupina and Massimo Poesio |
945 | Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text | Alexandra Balahur and Jesús M. Hermida |
946 | Extending the adverbial coverage of a French morphological lexicon | Elsa Tolone, Stavroula Voyatzi and Claude Martineau |
947 | Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations | Dan Cristea, Radu Simionescu and Gabriela Haja |
948 | Latvian and Lithuanian Named Entity Recognition with TildeNER | Mārcis Pinnis |
951 | Collecting humorous expressions from a community-based question-answering-service corpus | Masashi Inoue |
954 | RELcat: a Relation Registry for ISOcat data categories | Menzo Windhouwer |
956 | The Political Speech Corpus of Bulgarian | Petya Osenova and Kiril Simov |
957 | NLTG-Eval: A Toolkit for Creating Online Language Evaluation Experiments | Eric Kow and Anja Belz |
958 | A Database of Attribution Relations | Silvia Pareti |
960 | Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench | Rafal Rak, Andrew Rowley and Sophia Ananiadou |
964 | A Mandarin-English Code-switching Corpus | Ying Li, Yue Yu and Pascale Fung |
965 | InfiKorp: Towards a Free Corpus of Polish | Bartosz Broda, Michał Marcińczuk, Marek Maziarz, Adam Radziszewski and Adam Wardyński |
966 | A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever | Paulo Fernandes, Lucelene Lopes, Carlos A. Prolo, Afonso Sales and Renata Vieira |
967 | Structural alignment of plain text books | André Santos and José João Almeida |
968 | Turkish Paraphrase Corpus | Seniz Demir, Ilknur Durgar El-Kahlout, Erdem Unal and Hamza Kaya |
970 | International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned | Keith J Miller, Elizabeth Schroeder Richerson, Sarah McLeod and James Finley |
971 | Measuring Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms | Ryan Georgi, Fei Xia and William Lewis |
973 | The Effectiveness of Unsupervised Learning on Domain Adaptation: A Case Study on Chinese Word Segmentation | Yan Song and Fei Xia |
974 | Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz | Joao Paulo Cabral, Mark Kane, Zeeshan Ahmed, Mohamed Abou-Zleikha, Eva Szekely, Amalia Zahra, Kalu Ogbureke, Peter Cahill, Julie Carson-Berndsen and Stephan Schlogl |
976 | An implementation of a Latvian resource grammar in Grammatical Framework | Peteris Paikens and Normunds Gruzitis |
977 | Resource Evaluation for Usable Speech Interfaces: Utilizing Human–Human Dialogue | Pepi Stavropoulou, Dimitris Spiliotopoulos and Georgios Kouroupetroglou |
978 | GATEtoGerManC: A GATE-based annotation pipeline for historical German | Silke Scheible, Richard J. Whitt, Martin Durrell and Paul Bennett |
980 | Dealing with unknown words in statistical machine translation | João Silva and Luísa Coheur |
983 | A disambiguation resource for semantic annotation | Eric Charton and Michel Gagnon |
985 | PET: a Tool for Post-editing and Assessing Machine Translation | Wilker Aziz, Sheila Castilho and Lucia Specia |
986 | Enriching the ISST-TANL Corpus with Semantic Frames | Alessandro Lenci, Simonetta Montemagni, Eva Maria Vecchi and Giulia Venturi |
987 | Developing Partially-Transcribed Speech Corpus from Edited Transcriptions | Kengo Ohta, Masatoshi Tsuchiya and Seiichi Nakagawa |
991 | Construction of the Turkish National Corpus (TNC) | Yeşim Aksan, Mustafa Aksan, Ahmet Koltuksuz, Taner Sezer, Ümit Mersinli, Umut Ufuk Demirhan, Hakan Yılmazer, Gülsüm Atasoy, Seda Öz and İpek Yıldız |
992 | Building a learner corpus | Jirka Hana, Alexandr Rosen, Barbora Štindlová and Petr Jäger |
996 | Results of the ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation | Christian Federmann, Eleftherios Avramidis, Marta R. Costa-jussà, Josef van Genabith, Maite Melero and Pavel Pecina |
998 | The META-SHARE Metadata Schema for the Description of Language Resources | Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Stelios Piperidis, Monica Monachini, Francesca Frontini, Thierry Declerck, Gil Francopoulo, Victoria Arranz and Valerie Mapelli |
999 | Fine-grained German Sentiment Analysis on Social Media | Saeedeh Momtazi |
1000 | Alignment-based reordering for SMT | Maria Holmqvist, Sara Stymne, Lars Ahrenberg and Magnus Merkel |
1003 | Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation | Monica Gavrila, Walther v. Hahn and Cristina Vertan |
1006 | Annotations for Power Relations on Email Threads | Vinodkumar Prabhakaran, Huzaifa Neralwala, Owen Rambow and Mona Diab |
1007 | A Language Resources Framework supporting Conceptual Relationships among Biomedical Information to Chronic Disease Surveillance | Juliana Pollettini, Evandro Ruiz and Alessandra Macedo |
1008 | Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events | William J. Corvey, Sarah Vieweg, Sudha Verma, Martha Palmer and James H. Martin |
1010 | Pedagogical stances and their multimodal signals. | Leone Giovanna, D’Errico Francesca and Poggi Isabella |
1011 | An audiovisual political speech analysis incorporating eye-tracking and perception data | Stefan Scherer, Georg Layher, John Kane, Heiko Neumann and Nick Campbell |
1012 | Tajik-Farsi Persian Transliteration Using Statistical Machine Translation | Chris Irwin Davis |
1013 | Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations | Somayeh Bagherbeygi and Mehrnoush Shamsfard |
1014 | Word Sense Inventories by Non-Experts. | Anna Rumshisky, Nick Botchan, Sophie Kushkuley and James Pustejovsky |
1015 | PAMOCAT: Automatic retrieval of specified postures | Bernhard Brüning, Christian Schnier, Karola Pitsch and Sven Wasmuth |
1016 | Statistical Section Segmentation in Free-Text Clinical Records | Michael Tepper, Daniel Capurro, Fei Xia, Lucy Vanderwende and Meliha Yetisgen-Yildiz |
1017 | Constructing a Question Corpus for Textual Semantic Relations | Rui Wang and Shuguang Li |
1018 | Building a fine-grained subjectivity lexicon from a web corpus | Isa Maks and Piek Vossen |
1020 | Building a database of French frozen adverbial phrases | Aude Grezka and Céline Poudat |
1021 | Matching Cultural Heritage items to Wikipedia | Eneko Agirre |
1022 | ATLIS: Identifying Locational Information in Text Automatically | John Vogel and Marc Verhagen |
1024 | A Speech and Gesture Spatial Corpus in Assisted Living | Dimitra Anastasiou |
1026 | Harnessing 3rd party observer gaze for a continuous measure of the appropriateness of a speaker change | Jens Edlund, Simon Alexandersson, Jonas Beskow, Lisa Gustavsson, Mattias Heldner, Anna Hjalmarsson, Petter Kallionen and Ellen Marklund |
1027 | Constraint Based Description of Polish Multiword Expressions | Roman Kurc and Maciej Piasecki |
1028 | An Open Source Persian Computational Grammar | Shafqat Mumtaz Virk and Elnaz Abolahrar |
1031 | Project FLY: a multidisciplinary project within Linguistics | Mariana Gomes and Ana Guilherme |
1032 | The Hausa Internet Corpus | Christian Chiarcos |
1034 | The Common Orthographic Vocabulary of the Portuguese Language: a set of common open lexical resources for a pluricentric language | José Pedro Ferreira |
1035 | NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge | Guido Boella, Luigi di Caro, Llio Humphreys, Livio Robaldo and Leon van der Torre |
1036 | Pursing power in Arabic on-line discussion forums | Marc Tomlinson |
1037 | The Dependency-parsed FrameNet Corpus | Daniel Bauer, Hagen Fürstenau and Owen Rambow |
1040 | Incorporating an Error Corpus into a Spellchecker for Maltese | Michael Rosner, Albert Gatt, Andrew Attard and Jan Joachimsen |
1042 | Building Synthetic Voices in the META-NET Framework | Emília Garcia Casademont, Antonio Bonafonte and Asunción Moreno |
1043 | Automatic Translation of Scholarly Terms into Patent Terms Using Synonyms Extraction Techniques | Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama and Akiko Aizawa |
1046 | Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results | Marco Dinarelli and Sophie Rosset |
1047 | Building a 70 billion word corpus of English from ClueWeb | Jan Pomikálek, Miloš Jakubíček, Pavel Rychlý and Adam Kilgarriff |
1049 | How genres influence performance of parsers: a case study on the Chinese TreeBank | Dong Wang and Fei Xia |
1051 | Improving Short-String Language Identification using Many-Language Distance Classification | Paul Rodrigues |
1052 | Dependency parsing for interaction detection in pharmacogenomics | Gerold Schneider, Fabio Rinaldi and Simon Clematide |
1053 | Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English | Kyoko Ohara |
1055 | Building a Resource of Semantic Types | Octavian Popescu and Luisa Bentivogli |
1056 | A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic. | William Black, Rob Procter, Steven Gray and Sophia Ananiadou |
1057 | AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis | Muhammad Abdul-Mageed and Mona Diab |
1059 | Causal analysis of task incompletion for spoken dialogs focused on interactions of user and the system | Sunao Hara, Norihide Kitaoka and Kazuya Takeda |
1061 | Generation of Verbal Stems in Derivationally Rich Language | Nives Mikelić Preradović, Krešimir Šojat and Marko Tadić |
1062 | Corpus+WordNet thesaurus generation for ontology enriching | Fernando Castilho, Roger Granada, Breno Meneghetti, Leonardo Carvalho and Renata Vieira |
1063 | Reclassifying subcategorization frames for experimental analysis and stimulus generation | Paula Buttery and Andrew Caines |
1064 | Creating a Coreference Resolution System for Polish | Mateusz Kopeć and Maciej Ogrodniczuk |
1065 | LDC Forced Aligner | Xiaoyi Ma |
1066 | Semantic Orientation PMI or Vector Space Models? An evaluation of automated sentiment acquisition methods. | Aleksander Wawer |
1069 | Assessing the Comparability of News Texts | Emma Barker and Rob Gaizauskas |
1070 | Building Text-to-Speech Systems for Resource Poor Languages | Nur-Hana Samsudin and Mark Lee |
1072 | Representing General Relational Knowledge in ConceptNet 5 | Robert Speer and Catherine Havasi |
1075 | Boosting statistical tagger accuracy with simple rule-based grammars | Mans Hulden and Jerid Francom |
1076 | Twitter Event Lexicon for Natural Language Processing | Jennifer Williams and Graham Katz |
1077 | A morphological transducer for Kyrgyz | Jonathan Washington, Mirlan Ipasov and Francis Tyers |
1078 | A Corpus for Research on Deliberation and Debate | Marilyn Walker, Jean Fox Tree, Pranav Anand, Rob Abbott and Joseph King |
1080 | Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System | Alexandra Roshchina, John Cardiff and Paolo Rosso |
1081 | Learning Sentiment Lexicons in Spanish | Veronica Perez-Rosas, Carmen Banea and Rada Mihalcea |
1082 | Unsupervised Word Sense Disambiguation with Multilingual Representations | Erwin Fernandez-Ordonez, Rada Mihalcea and Samer Hassan |
1086 | The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions | Stelios Piperidis |
1087 | Annotating progressive aspect constructions in the spoken section of the British National Corpus | Andrew Caines and Paula Buttery |
1091 | Annotating Spatial Containment Relations Between Events | Kirk Roberts, Travis Goodwin and Sanda Harabagiu |
1095 | Agreement and disagreement in threaded discussion | Jacob Andreas, Sara Rosenthal and Kathleen McKeown |
1096 | Logic and Graph Based Methods for Terminological Assessment | Benoît Robichaud |
1097 | Parsing Any Domain English text to CoNLL dependencies | Sudheer Kolachina and Prasanth Kolachina |
1098 | POS Tagging for Grammaticalization and Grammatical Neologism Detection | Maarten Janssen |
1107 | Using DiAML and ANVIL for multimodal dialogue annotations | Harry Bunt, Michael Kipp and Volha Petukhova |
1109 | Annotated Corpora for Word Alignment between Japanese and English | Tsuyoshi Okita |
1111 | Assigning Connotation Values to Events | Tommaso Caselli, Irene Russo and Francesco Rubino |
1112 | Biographies or Blenders: Which Resource to Prefer for Cross-Domain Sentiment Analysis | Natalia Ponomareva and Mike Thelwall |
1114 | An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style | Marilyn Walker and Grace Lin |
1116 | SPPAS: a tool for Speech Phonetization Alignment and Syllabification | Brigitte Bigi |
1117 | Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities | Christopher Cieri and Mark Liberman |
1120 | A Phonemic Corpus of Polish Child-Directed Speech | Luc Boruta and Justyna Jastrzebska |
1121 | The KIT Lecture Corpus for Speech Translation | Sebastian Stüker, Teresa Herrmann, Florian Kraft, Christian Mohr and Alex Waibel |
1122 | Orthographic Transcription: which Enrichment is required for phonetization? | Brigitte Bigi and Pauline Peri |
1123 | The Role of Model Testing in Standards Development: The Case of ISO-Space | James Pustejovsky and Jessica Moszkowicz |
1124 | Aleda, a free large-scale entity database for French | Benoît Sagot and Rosa Stern |
1126 | The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation | Marcello Federico, Sebastian Stüker, Luisa Bentivogli, Michael Paul, Mauro Cettolo, Teresa Herrmann, Jan Niehues and Giovanni Moretti |
1127 | Cleaning noisy wordnets | Benoît Sagot and Darja Fišer |
1129 | Tackling interoperability issues within UIMA work-flows | Nicolas Hernandez |
1130 | Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus | Djamé Seddah, Marie Candito, Benoit Crabbé and Enrique Henestroza Anguiano |
1131 | Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources | Valérie Hanoka and Benoît Sagot |
1132 | Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment | Shyam Agrawal, Shweta Sinha, Pooja Singh and Jesper Olson |