SESSIONS: Browse articles of the conference sorted by session number
|
Session O1 - Corpora for Machine Translation |
Chairperson: Josef van Genabith |
11:35-11:55 |
Iñaki San Vicente and Iker Manterola |
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web |
11:55-12:15 |
Mark Fishel, Ondřej Bojar and Maja Popović |
Terra: a Collection of Translation Error-Annotated Corpora |
12:15-12:35 |
Ahmet Aker, Evangelos Kanoulas and Robert Gaizauskas |
A light way to collect comparable corpora from the Web |
12:35-12:55 |
Volha Petukhova, Rodrigo Agerri, Mark Fishel, Sergio Penkale, Arantza del Pozo, Mirjam Sepesy Maucec, Andy Way, Panayota Georgakopoulou and Martin Volk |
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles |
12:55-13:15 |
Daniele Pighin, Lluís Màrquez and Lluís Formiga |
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output |
|
Session O2 - Infrastructures and Strategies for LRs (1) |
Chairperson: Hans Uszkoreit |
11:35-11:55 |
Stelios Piperidis |
The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions |
11:55-12:15 |
Riccardo Del Gratta, Francesca Frontini, Francesco Rubino, Irene Russo and Nicoletta Calzolari |
The Language Library: supporting community effort for collective resource production |
12:15-12:35 |
Khalid Choukri, Victoria Arranz, Olivier Hamon and Jungyeul Park |
Using the International Standard Language Resource Number: Practical and Technical Aspects |
12:35-12:55 |
Valérie Mapelli, Victoria Arranz, Matthieu Carré, Hélène Mazo, Djamel Mostefa and Khalid Choukri |
ELRA in the heart of a cooperative HLT world |
12:55-13:15 |
Christopher Cieri, Marian Reed, Denise DiPersio and Mark Liberman |
Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities |
|
Session O5 - Crowdsourcing (Special Session) |
Chairperson: Karen Fort and Iryna Gurevych |
14:45-15:05 |
Arno Scharl, Marta Sabou, Stefan Gindl, Walter Rafelsberger and Albert Weichselbraun |
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources |
15:05-15:25 |
Anoop Kunchukuttan, Shourya Roy, Pratik Patel, Kushal Ladha, Somya Gupta, Mitesh M. Khapra and Pushpak Bhattacharyya |
Experiences in Resource Generation for Machine Translation through Crowdsourcing |
15:25-15:45 |
Elena Filatova |
Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing |
15:45-16:05 |
Luís Marujo, Anatole Gershman, Jaime Carbonell, Robert Frederking and João P. Neto |
Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization |
|
Session O6 - Dialogue and Multimodality |
Chairperson: Jimmy Kunzmann |
14:45-15:05 |
Kristiina Jokinen and Graham Wilcock |
Constructive Interaction for Talking about Interesting Topics |
15:05-15:25 |
Florian Nothdurft and Wolfgang Minker |
Using multimodal resources for explanation approaches in intelligent systems |
15:25-15:45 |
Shota Yamasaki, Hirohisa Furukawa, Masafumi Nishida, Kristiina Jokinen and Seiichi Yamamoto |
Multimodal Corpus of Multi-party Conversations in Second Language |
15:45-16:05 |
Takenobu Tokunaga, Ryu Iida, Asuka Terai and Naoko Kuriyama |
The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues |
16:05-16:25 |
Harry Bunt, Jan Alexandersson, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Volha Petukhova, Andrei Popescu-Belis and David Traum |
ISO 24617-2: A semantically-based standard for dialogue annotation |
|
Session O7 - Machine Translation and Language Resources (1) |
Chairperson: Gregor Thurmair |
14:45-15:05 |
Inguna Skadiņa, Ahmet Aker, Nikos Mastropavlos, Fangzhong Su, Dan Tufiș, Mateja Verlic, Andrejs Vasiļjevs, Bogdan Babych, Paul Clough, Robert Gaizauskas, Nikos Glaros, Monica Lestari Paramita and Mārcis Pinnis |
Collecting and Using Comparable Corpora for Statistical Machine Translation |
15:05-15:25 |
Casey Redd Kennington, Martin Kay and Annemarie Friedrich |
Suffix Trees as Language Models |
15:25-15:45 |
Ralf Steinberger, Andreas Eisele, Szymon Klocek, Spyridon Pilos and Patrick Schlüter |
DGT-TM: A freely available Translation Memory in 22 languages |
15:45-16:05 |
Reinhard Rapp, Serge Sharoff and Bogdan Babych |
Identifying Word Translations from Comparable Documents Without a Seed Lexicon |
16:05-16:25 |
Gideon Kotzé, Vincent Vandeghinste, Scott Martens and Jörg Tiedemann |
Large aligned treebanks for syntax-based machine translation |
|
Session O8 - Corpus Processing and Infrastructure |
Chairperson: Bente Maegaard |
14:45-15:05 |
Lars Borin, Markus Forsberg and Johan Roxendal |
Korp ― the corpus infrastructure of Språkbanken |
15:05-15:25 |
Jonathan Wright, Kira Griffitt, Joe Ellis, Stephanie Strassel and Brendan Callahan |
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure |
15:25-15:45 |
Roland Schäfer and Felix Bildhauer |
Building Large Corpora from the Web Using a New Efficient Tool Chain |
15:45-16:05 |
Young-Min Kim, Patrice Bellot, Elodie Faath and Marin Dacos |
Annotated Bibliographical Reference Corpora in Digital Humanities |
16:05-16:25 |
Jan Pomikálek, Miloš Jakubíček and Pavel Rychlý |
Building a 70 billion word corpus of English from ClueWeb |
|
Session O14 - Machine Translation and Evaluation (1) |
Chairperson: Robert Frederking |
18:10-18:30 |
Sara Stymne, Henrik Danielsson, Sofia Bremin, Hongzhan Hu, Johanna Karlsson, Anna Prytz Lillkull and Martin Wester |
Eye Tracking as a Tool for Machine Translation Error Analysis |
18:30-18:50 |
Eleftherios Avramidis, Aljoscha Burchardt, Christian Federmann, Maja Popović, Cindy Tscherwinka and David Vilar |
Involving Language Professionals in the Evaluation of Machine Translation |
18:50-19:10 |
Daniele Pighin, Lluís Màrquez and Jonathan May |
An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output |
|
Session O17 - Infrastructures and Strategies for LRs (2) |
Chairperson: Andrejs Vasiljevs |
9:45-10:05 |
Claudia Soria, Núria Bel, Khalid Choukri, Joseph Mariani, Monica Monachini, Jan Odijk, Stelios Piperidis, Valeria Quochi and Nicoletta Calzolari |
The FLaReNet Strategic Language Resource Agenda |
10:05-10:25 |
Daan Broeder, Dieter van Uytvanck, Maria Gavrilidou, Thorsten Trippel and Menzo Windhouwer |
Standardizing a Component Metadata Infrastructure |
10:25-10:45 |
Daan Broeder, Dieter van Uytvanck and Gunter Senft |
Citing on-line Language Resources |
10:45-11:05 |
Khalid Choukri and Victoria Arranz |
An Analytical Model of Language Resource Sustainability |
11:05-11:25 |
David Lewis, Alexander O'Connor, Andrzej Zydroń, Gerd Sjögren and Rahzeb Choudhury |
On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market |
|
Session O18 - Dialogue |
Chairperson: Linne Ha |
9:45-10:05 |
Alexandros Papangelis, Vangelis Karkaletsis and Fillia Makedon |
Evaluation of Online Dialogue Policy Learning Techniques |
10:05-10:25 |
Lluis-F. Hurtado, Fernando Garcia, Emilio Sanchis and Encarna Segarra |
The acquisition and dialog act labeling of the EDECAN-SPORTS corpus |
10:25-10:45 |
Jolanta Bachan |
Developing and evaluating an emergency scenario dialogue corpus |
10:45-11:05 |
Lina M. Rojas-Barahona, Alejandra Lorenzo and Claire Gardent |
Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents |
11:05-11:25 |
Fabrice Lefèvre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, Nathalie Camelin, Benoit Favre, Bassam Jabaian and Lina M. Rojas-Barahona |
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora |
|
Session O28 - Sign Language |
Chairperson: Javier Caminero |
14:55-15:15 |
Dimitris Metaxas, Bo Liu, Fei Yang, Peng Yang, Nicholas Michael and Carol Neidle |
Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking |
15:15-15:35 |
Matti Karppa, Tommi Jantunen, Ville Viitaniemi, Jorma Laaksonen, Birgitta Burger and Danny De Weerdt |
Comparing computer vision analysis of signed language video with motion capture recordings |
15:35-15:55 |
Annelies Braffort and Leïla Boutora |
DEGELS1: A comparable corpus of French Sign Language and co-speech gestures |
15:55-16:15 |
Matilde Gonzalez, Michael Filhol and Christophe Collet |
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs |
16:15-16:35 |
Umar Shoaib, Nadeem Ahmad, Paolo Prinetto and Gabriele Tiotto |
A platform-independent user-friendly dictionary from Italian to LIS |
|
Session O30 - Computer Aided Language Learning |
Chairperson: Justus Roux |
16:55-17:15 |
Hitokazu Matsushita and Deryle Lonsdale |
Item Development and Scoring for Japanese Oral Proficiency Testing |
17:15-17:35 |
Manny Rayner, Pierrette Bouillon and Johanna Gerlach |
Evaluating Appropriateness Of System Responses In A Spoken CALL Game |
17:35-17:55 |
Antonio Moreno-Sandoval, Leonardo Campillos Llanos, Yang Dong, Emi Takamori, José M. Guirao, Paula Gozalo, Chieko Kimura, Kengo Matsui and Marta Garrote-Salazar |
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese |
17:55-18:15 |
Helmer Strik, Jozef Colpaert, Joost Van Doremalen and Catia Cucchiarini |
The DISCO ASR-based CALL system: practicing L2 oral skills and beyond |
|
Session O31 - Discourse (2) |
Chairperson: Aravind Joshi |
16:55-17:15 |
Marta Tatu and Dan Moldovan |
A Tool for Extracting Conversational Implicatures |
17:15-17:35 |
Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni and Sandrine Zufferey |
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns |
17:35-17:55 |
Steven Bethard, Oleksandr Kolomiyets and Marie-Francine Moens |
Annotating Story Timelines as Temporal Dependency Structures |
17:55-18:15 |
Stergos Afantenos, Nicholas Asher, Farah Benamara, Myriam Bras, Cecile Fabre, Mai Ho-Dac, Anne Le Draoulec, Philippe Muller, Marie-Paul Pery-Woodley, Laurent Prevot, Josette Rebeyrolles, Ludovic Tanguy, Marianne Vergez-Couret and Laure Vieu |
An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus |
|
Session O37 - Subjectivity and Emotions |
Chairperson: Julio Gonzalo |
9:45-10:05 |
Jörg Frommer, Bernd Michaelis, Dietmar Rösner, Andreas Wendemuth, Rafael Friesen, Matthias Haase, Manuela Kunze, Rico Andrich, Julia Lange, Axel Panning and Ingo Siegert |
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus |
10:05-10:25 |
Isa Maks and Piek Vossen |
Building a fine-grained subjectivity lexicon from a web corpus |
10:25-10:45 |
Veronica Perez-Rosas, Carmen Banea and Rada Mihalcea |
Learning Sentiment Lexicons in Spanish |
10:45-11:05 |
Tommaso Caselli, Irene Russo and Francesco Rubino |
Assigning Connotation Values to Events |
11:05-11:25 |
Balamurali AR, Aditya Joshi and Pushpak Bhattacharyya |
Cost and Benefit of Using WordNet Senses for Sentiment Analysis |
|
Session O38 - Named Entities |
Chairperson: Satoshi Sato |
9:45-10:05 |
Xuansong Li, Stephanie Strassel, Heng Ji, Kira Griffitt and Joe Ellis |
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual |
10:05-10:25 |
Dawn Lawrie, James Mayfield, Paul McNamee and Douglas Oard |
Creating and Curating a Cross-Language Person-Entity Linking Collection |
10:25-10:45 |
Keith J. Miller, Elizabeth Schroeder Richerson, Sarah McLeod, James Finley and Aaron Schein |
International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned |
10:45-11:05 |
K Saravanan, Monojit Choudhury, Raghavendra Udupa and A Kumaran |
An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora |
11:05-11:25 |
Olivier Galibert, Sophie Rosset, Cyril Grouin, Pierre Zweigenbaum and Ludovic Quintard |
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign |
|
Session O39 - Treebanks and Syntax |
Chairperson: Erhard Hinrichs |
9:45-10:05 |
Wolfgang Seeker and Jonas Kuhn |
Making Ellipses Explicit in Dependency Conversion for a German Treebank |
10:05-10:25 |
Hongzhi Xu, Helen Kaiyun Chen, Chu-Ren Huang, Qin Lu, Dingxu Shi and Tin-Shing Chiu |
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies |
10:25-10:45 |
Tafseer Ahmed, Miriam Butt, Annette Hautli and Sebastian Sulger |
A Reference Dependency Bank for Analyzing Complex Predicates |
10:45-11:05 |
Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová and Zdeněk Žabokrtský |
Announcing Prague Czech-English Dependency Treebank 2.0 |
11:05-11:25 |
Liesbeth Augustinus, Vincent Vandeghinste and Frank Van Eynde |
Example-Based Treebank Querying |
|
Session O41 - Machine Translation and Language Resources (2) |
Chairperson: Atsushi Fujii |
11:45-12:05 |
Víctor M. Sánchez-Cartagena, Miquel Esplà-Gomis and Juan Antonio Pérez-Ortiz |
Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation |
12:05-12:25 |
Christian Federmann, Eleftherios Avramidis, Marta R. Costa-Jussà, Josef van Genabith, Maite Melero and Pavel Pecina |
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation |
12:25-12:45 |
Maria Holmqvist, Sara Stymne, Lars Ahrenberg and Magnus Merkel |
Alignment-based reordering for SMT |
12:45-13:05 |
Monica Gavrila, Walther v. Hahn and Cristina Vertan |
Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation |
13:05-13:25 |
Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama and Akiko Aizawa |
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques |
|
Session O44 - Evaluation of Systems and Application |
Chairperson: Sabine Schulte im Walde |
11:45-12:05 |
Ioana Vasilescu, Martine Adda-Decker and Lori Lamel |
Cross-lingual studies of ASR errors: paradigms for perceptual evaluations |
12:05-12:25 |
Kallirroi Georgila, Alan Black, Kenji Sagae and David Traum |
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems |
12:25-12:45 |
Tomoyosi Akiba, Hiromitsu Nishizaki, Kiyoaki Aikawa, Tatsuya Kawahara and Tomoko Matsui |
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task |
12:45-13:05 |
Tina Kluewer, Feiyu Xu, Peter Adolphs and Hans Uszkoreit |
Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World |
13:05-13:25 |
Marcello Federico, Sebastian Stüker, Luisa Bentivogli, Michael Paul, Mauro Cettolo, Teresa Herrmann, Jan Niehues and Giovanni Moretti |
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation |
|
Session O48 - Named Entities and Subjectivity |
Chairperson: Dan Tufis |
14:55-15:15 |
Udo Hahn, Elena Beisswanger, Ekaterina Buyko, Erik Faessler, Jenny Traumüller, Susann Schröder and Kerstin Hornbostel |
Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena |
15:15-15:35 |
Danuta Ploch, Leonhard Hennig, Angelina Duka, Ernesto William De Luca and Sahin Albayrak |
GerNED: A German Corpus for Named Entity Disambiguation |
15:35-15:55 |
Ian Lewin, Şenay Kafkas and Dietrich Rebholz-Schuhmann |
Centroids: Gold standards with distributional variation |
15:55-16:15 |
Yulan He, Hassan Saif, Zhongyu Wei and Kam-Fai Wong |
Quantising Opinions for Political Tweets Analysis |
16:15-16:35 |
Muhammad Abdul-Mageed and Mona Diab |
AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis |
|
Session P2 - Tools, Systems and Evaluation |
Chair : Michael Kipp |
11:35-13:15 |
Felix Burkhardt |
Fast Labeling and Transcription with the Speechalyzer Toolkit |
11:35-13:15 |
Bart Jongejan |
Automatic annotation of head velocity and acceleration in Anvil |
11:35-13:15 |
Przemyslaw Lenkiewicz, Binyam Gebrekidan Gebre, Oliver Schreer, Stefano Masneri, Daniel Schneider and Sebastian Tschöpel |
AVATecH ― automated annotation through audio and video analysis |
11:35-13:15 |
Henk van den Heuvel, Eric Sanders, Robin Rutten, Stef Scagliola and Paula Witkamp |
An Oral History Annotation Tool for INTER-VIEWs |
11:35-13:15 |
Han Sloetjes and Aarthy Somasundaram |
ELAN development, keeping pace with communities' needs |
11:35-13:15 |
Michał Marcińczuk, Jan Kocoń and Bartosz Broda |
Inforex -- a web-based tool for text corpus management and semantic annotation |
11:35-13:15 |
Binyam Gebrekidan Gebre, Peter Wittenburg and Przemyslaw Lenkiewicz |
Towards Automatic Gesture Stroke Detection |
11:35-13:15 |
Thomas Schmidt |
EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language |
11:35-13:15 |
Leonardo Campillos Llanos |
Designing a search interface for a Spanish learner spoken corpus: the end-user's evaluation |
|
Session P5 - Information Extraction (1) |
Chair : Günter Neumann |
14:45-16:25 |
Michael Wiegand, Benjamin Roth, Eva Lasarcyk, Stephanie Köser and Dietrich Klakow |
A Gold Standard for Relation Extraction in the Food Domain |
14:45-16:25 |
Mathias Bank, Robert Remus and Martin Schierle |
Textual Characteristics for Language Engineering |
14:45-16:25 |
Ziqi Zhang, Philip Webster, Victoria Uren, Andrea Varga and Fabio Ciravegna |
Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing |
14:45-16:25 |
Xavier Tannier, Véronique Moriceau, Béatrice Arnulphy and Ruixin He |
Evolution of Event Designation in Media: Preliminary Study |
14:45-16:25 |
Yunqing Xia, Guoyu Tang, Peng Jin and Xia Yang |
CLTC: A Chinese-English Cross-lingual Topic Corpus |
14:45-16:25 |
Julia Maria Schulz, Daniela Becks, Christa Womser-Hacker and Thomas Mandl |
A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content |
14:45-16:25 |
Md. Faisal Mahbub Chowdhury and Alberto Lavelli |
An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction |
14:45-16:25 |
Wei Wang, Romaric Besançon, Olivier Ferret and Brigitte Grau |
Evaluation of Unsupervised Information Extraction |
14:45-16:25 |
Stéphanie Weiser and Patrick Watrin |
Extraction of unmarked quotations in Newspapers |
14:45-16:25 |
Martin Aleksandrov and Carlo Strapparava |
NgramQuery - Smart Information Extraction from Google N-gram using External Resources |
|
Session P7 - Multiword Expressions and Term Extraction |
Chair : Karel Pala |
14:45-16:25 |
Behrang QasemiZadeh, Paul Buitelaar, Tianqi Chen and Georgeta Bordea |
Semi-Supervised Technical Term Tagging With Minimal User Feedback |
14:45-16:25 |
Miriam Buendía-Castro and Beatriz Sánchez-Cárdenas |
Linguistic knowledge for specialized text production |
14:45-16:25 |
Rita Marinelli and Laura Cignoni |
In the same boat and other idiomatic seafaring expressions |
14:45-16:25 |
Sabine Schulte im Walde, Susanne Borgwaldt and Ronny Jauch |
Association Norms of German Noun Compounds |
14:45-16:25 |
Doaa Samy, Antonio Moreno-Sandoval, Conchi Bueno-Díaz, Marta Garrote-Salazar and José M. Guirao |
Medical Term Extraction in an Arabic Medical Corpus |
14:45-16:25 |
Matthieu Constant and Isabelle Tellier |
Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger |
14:45-16:25 |
Anita Gojun, Ulrich Heid, Bernd Weißbach, Carola Loth and Insa Mingers |
Adapting and evaluating a generic term extraction tool |
14:45-16:25 |
Mladen Karan, Jan Šnajder and Bojana Dalbelo Bašić |
Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian |
14:45-16:25 |
Thibault Mondary, Adeline Nazarenko, Haïfa Zargayouna and Sabine Barreaux |
The Quaero Evaluation Initiative on Term Extraction |
14:45-16:25 |
Shiva Taslimipoor, Afsaneh Fazly and Ali Hamzeh |
Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions |
14:45-16:25 |
Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum |
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation |
14:45-16:25 |
Takafumi Suzuki, Yusuke Abe, Itsuki Toyota, Takehito Utsuro, Suguru Matsuyoshi and Masatoshi Tsuchiya |
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation |
14:45-16:25 |
Aude Grezka and Céline Poudat |
Building a database of French frozen adverbial phrases |
14:45-16:25 |
Marc Luder |
German Verb Patterns and Their Implementation in an Electronic Dictionary |
|
Session P8 - Authoring Tools, Proofing |
Chair : Catia Cucchiarini |
14:45-16:25 |
Flore Barcellini, Camille Albert, Corinne Grosse and Patrick Saint-Dizier |
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring |
14:45-16:25 |
Mohammad Hoseyn Sheykholeslam, Behrouz Minaei-Bidgoli and Hossein Juzi |
A Framework for Spelling Correction in Persian Language Using Noisy Channel Model |
14:45-16:25 |
Nizar Habash, Mona Diab and Owen Rambow |
Conventional Orthography for Dialectal Arabic |
14:45-16:25 |
Khaled Shaalan, Mohammed Attia, Pavel Pecina, Younes Samih and Josef van Genabith |
Arabic Word Generation and Modelling for Spell Checking |
14:45-16:25 |
Jan Rygl and Aleš Horák |
Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification |
14:45-16:25 |
Shaohua Yang, Hai Zhao, Xiaolin Wang and Bao-liang Lu |
Spell Checking for Chinese |
14:45-16:25 |
Jordi Atserias, Maria Fuentes, Rogelio Nazar and Irene Renau |
Spell Checking in Spanish: The Case of Diacritic Accents |
14:45-16:25 |
Michael Rosner, Albert Gatt, Andrew Attard and Jan Joachimsen |
Incorporating an Error Corpus into a Spellchecker for Maltese |
|
Session P9 - Morphology |
Chair : Pushpak Bhattacharyya |
16:45-18:05 |
Marco Passarotti and Francesco Mambrini |
First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin |
16:45-18:05 |
Marcin Woliński, Marcin Miłkowski, Maciej Ogrodniczuk and Adam Przepiórkowski |
PoliMorf: a (not so) new open morphological dictionary for Polish |
16:45-18:05 |
Lionel Nicolas, Jacques Farré and Cécile Darme |
Unsupervised acquisition of concatenative morphology |
16:45-18:05 |
Emad Mohamed, Behrang Mohit and Kemal Oflazer |
Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic |
16:45-18:05 |
Paul Felt, Eric Ringger, Kevin Seppi, Kristian Heal, Robbie Haertel and Deryle Lonsdale |
First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis |
16:45-18:05 |
Claudia Marzi, Marcello Ferro, Claudia Caudai and Vito Pirrelli |
Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access |
16:45-18:05 |
Cheikh M. Bamba Dione |
A Morphological Analyzer For Wolof Using Finite-State Techniques |
16:45-18:05 |
Septina Dian Larasati |
IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus |
16:45-18:05 |
Liviu P. Dinu, Vlad Niculae and Octavia-Maria Şulea |
The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System |
16:45-18:05 |
Toshinobu Ogiso, Mamoru Komachi, Yasuharu Den and Yuji Matsumoto |
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese |
16:45-18:05 |
Maciej Piasecki, Radoslaw Ramocki and Marek Maziarz |
Recognition of Polish Derivational Relations Based on Supervised Learning Scheme |
16:45-18:05 |
Dan Cristea, Radu Simionescu and Gabriela Haja |
Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations |
16:45-18:05 |
Krešimir Šojat, Nives Mikelić Preradović and Marko Tadić |
Generation of Verbal Stems in Derivationally Rich Language |
16:45-18:05 |
Jonathan Washington, Mirlan Ipasov and Francis Tyers |
A finite-state morphological transducer for Kyrgyz |
16:45-18:05 |
Fabio Tamburini and Matias Melandri |
AnIta: a powerful morphological analyser for Italian |
|
Session P10 - Prosody and Phonetics |
Chair : Laurence Devillers |
16:45-18:05 |
Gloria Gagliardi, Edoardo Lombardi Vallauri and Fabio Tamburini |
A topologic view of Topic and Focus marking in Italian |
16:45-18:05 |
Geneviève Caelen-Haumont and Sethserey Sam |
Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu |
16:45-18:05 |
Benoît Weber, Geneviève Caelen-Haumont, Binh Hai Pham and Do-Dat Tran |
MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels |
16:45-18:05 |
Nelly Barbot, Olivier Boeffard and Arnaud Delhay |
Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora |
16:45-18:05 |
Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer and Damien Lolive |
Towards Fully Automatic Annotation of Audio Books for TTS |
16:45-18:05 |
Iris Merkus and Florian Schiel |
Statistical Evaluation of Pronunciation Encoding |
16:45-18:05 |
Helen Kaiyun Chen |
Annotating a corpus of human interaction with prosodic profiles ― focusing on Mandarin repair/disfluency |
16:45-18:05 |
Kikuo Maekawa |
Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system |
16:45-18:05 |
Antonio Origlia and Iolanda Alfano |
Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabification |
16:45-18:05 |
David Doukhan, Sophie Rosset, Albert Rilliard, Christophe d'Alessandro and Martine Adda-Decker |
Designing French Tale Corpora for Entertaining Text To Speech Synthesis |
16:45-18:05 |
Claire Brierley, Majdi Sawalha and Eric Atwell |
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing |
16:45-18:05 |
Luc Boruta and Justyna Jastrzebska |
A Phonemic Corpus of Polish Child-Directed Speech |
|
Session P11 - Language Resource Infrastructures (1) |
Chair : Daan Broeder |
16:45-18:05 |
Peter Spyns and Elisabeth D'Halleweyn |
Smooth Sailing for STEVIN |
16:45-18:05 |
Dieter van Uytvanck, Herman Stehouwer and Lari Lampen |
Semantic metadata mapping in practice: the Virtual Language Observatory |
16:45-18:05 |
Aditi Sharma Grover, Annamart Nieman, Gerhard Van Huyssteen and Justus Roux |
Aspects of a Legal Framework for Language Resource Management |
16:45-18:05 |
Elena Volodina and Sofie Johansson Kokkinakis |
Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish |
16:45-18:05 |
Philippe Langlais, Patrick Drouin, Amélie Paulus, Eugénie Rompré Brodeur and Florent Cottin |
Texto4Science: a Quebec French Database of Annotated Short Text Messages |
16:45-18:05 |
Jan Odijk |
Recent Developments in CLARIN-NL |
16:45-18:05 |
Emanuel Dima, Christina Hoppermann, Erhard Hinrichs, Thorsten Trippel and Claus Zinn |
A Metadata Editor to Support the Description of Linguistic Resources |
16:45-18:05 |
Hanno Biber and Evelyn Breiteneder |
Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies. |
16:45-18:05 |
José Pedro Ferreira, Maarten Janssen, Gladis Barcellos de Oliveira, Margarita Correia and Gilvan Müller de Oliveira |
The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language |
16:45-18:05 |
Andrejs Vasiljevs, Markus Forsberg, Tatiana Gornostay, Dorte Haltrup Hansen, Kristín Jóhannsdóttir, Gunn Lyse, Krister Lindén, Lene Offersgaard, Sussi Olsen, Bolette Pedersen, Eiríkur Rögnvaldsson, Inguna Skadiņa, Koenraad De Smedt, Ville Oksanen and Roberts Rozis |
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries |
16:45-18:05 |
Nicoletta Calzolari, Riccardo Del Gratta, Gil Francopoulo, Joseph Mariani, Francesco Rubino, Irene Russo and Claudia Soria |
The LRE Map. Harmonising Community Descriptions of Resources |
16:45-18:05 |
Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Stelios Piperidis, Haris Papageorgiou, Monica Monachini, Francesca Frontini, Thierry Declerck, Gil Francopoulo, Victoria Arranz and Valérie Mapelli |
The META-SHARE Metadata Schema for the Description of Language Resources |
16:45-18:05 |
Yoshinobu Kano |
Towards automation in using multi-modal language resources: compatibility and interoperability for multi-modal features in Kachako |
|
Session P12 - Subjectivity: Sentiments, Emotions, Opinions (1) |
Chair : Carlo Strapparava |
18:10-19:30 |
Xin Zuo, Tian Li and Pascale Fung |
A Multilingual Natural Stress Emotion Database |
18:10-19:30 |
Takahiro Miyajima, Hideaki Kikuchi, Katsuhiko Shirai and Shigeki Okawa |
Method for Collection of Acted Speech Using Various Situation Scripts |
18:10-19:30 |
Hong Li, Xiwen Cheng, Kristina Adson, Tal Kirshboim and Feiyu Xu |
Annotating Opinions in German Political News |
18:10-19:30 |
Akshat Bakliwal, Piyush Arora and Vasudeva Varma |
Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification |
18:10-19:30 |
Juan María Garrido, Yesika Laplaza, Montse Marquina, Andrea Pearman, José Gregorio Escalada, Miguel Ángel Rodríguez and Ana Armenta |
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech |
18:10-19:30 |
Panagiotis Giannoulis and Gerasimos Potamianos |
A hierarchical approach with feature selection for emotion recognition from speech |
18:10-19:30 |
Alexandra Balahur and Jesús M. Hermida |
Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text |
18:10-19:30 |
Saeedeh Momtazi |
Fine-grained German Sentiment Analysis on Social Media |
18:10-19:30 |
Felix Burkhardt |
You Seem Aggressive! Monitoring Anger in a Practical Application |
18:10-19:30 |
Yi-jie Tang and Hsin-Hsi Chen |
Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition |
18:10-19:30 |
Christian Scheible and Hinrich Schütze |
Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank |
|
Session P14 - Dialogue |
Chair : Ron Artstein |
18:10-19:30 |
Tobias Heinroth, Maximilian Grotz, Florian Nothdurft and Wolfgang Minker |
Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues |
18:10-19:30 |
Kseniya Zablotskaya, Umair Rahim, Fernando Fernández Martínez and Wolfgang Minker |
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores |
18:10-19:30 |
Volha Petukhova and Harry Bunt |
The coding and annotation of multimodal dialogue acts |
18:10-19:30 |
Harry Bunt, Michael Kipp and Volha Petukhova |
Using DiAML and ANVIL for multimodal dialogue annotations |
18:10-19:30 |
Matthew Fuchs, Nikos Tsourakis and Manny Rayner |
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems |
18:10-19:30 |
Nikos Tsourakis and Manny Rayner |
A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System |
18:10-19:30 |
Emina Kurtic, Bill Wells, Guy J. Brown, Timothy Kempton and Ahmet Aker |
A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English |
18:10-19:30 |
Jing Guang Han, Emer Gilmartin, Celine DeLooze, Brian Vaughan and Nick Campbell |
The Herme Database of Spontaneous Multimodal Human-Robot Dialogues |
18:10-19:30 |
Yasuharu Den, Hanae Koiso, Katsuya Takanashi and Nao Yoshida |
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations |
18:10-19:30 |
Thierry Bazillon, Melanie Deplano, Frederic Bechet, Alexis Nasr and Benoit Favre |
Syntactic annotation of spontaneous speech: application to call-center conversation data |
18:10-19:30 |
Frederic Bechet, Benjamin Maza, Nicolas Bigouroux, Thierry Bazillon, Marc El-Beze, Renato De Mori and Eric Arbillot |
DECODA: a call-centre human-human spoken conversation corpus |
18:10-19:30 |
Pepi Stavropoulou, Dimitris Spiliotopoulos and Georgios Kouroupetroglou |
Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue |
18:10-19:30 |
Jens Edlund, Simon Alexandersson, Jonas Beskow, Lisa Gustavsson, Mattias Heldner, Anna Hjalmarsson, Petter Kallionen and Ellen Marklund |
3rd party observer gaze as a continuous measure of dialogue flow |
18:10-19:30 |
Marc Tomlinson, David Bracewell, Mary Draper, Zewar Almissour, Ying Shi and Jeremy Bensley |
Pursuing power in Arabic on-line discussion forums |
18:10-19:30 |
Sunao Hara, Norihide Kitaoka and Kazuya Takeda |
Causal analysis of task completion errors in spoken music retrieval interactions |
18:10-19:30 |
Marilyn Walker, Grace Lin and Jennifer Sawyer |
An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style |
|
Session P15 - Semantic Annotation |
Chair : Aline Villavicencio |
9:45-11:25 |
Béatrice Arnulphy, Xavier Tannier and Anne Vilnat |
Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French |
9:45-11:25 |
Maria Aloni, Andreas van Cranenburgh, Raquel Fernandez and Marta Sznajder |
Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions |
9:45-11:25 |
António Branco, Catarina Carvalheiro, Sílvia Pereira, Sara Silveira, João Silva, Sérgio Castro and João Graça |
A PropBank for Portuguese: the CINTIL-PropBank |
9:45-11:25 |
Ashwini Vaidya, Jinho D. Choi, Martha Palmer and Bhuvana Narasimhan |
Empty Argument Insertion in the Hindi PropBank |
9:45-11:25 |
Pierrette Bouillon, Elisabetta Jezek, Chiara Melloni and Aurélie Picton |
Annotating Qualia Relations in Italian and French Complex Nominals |
9:45-11:25 |
Juliette Thuilier and Laurence Danlos |
Semantic annotation of French corpora: animacy and verb semantic classes |
9:45-11:25 |
Josef Ruppenhofer and Ines Rehbein |
Yes we can!? Annotating English modal verbs |
9:45-11:25 |
Mehdi Manshadi, James Allen and Mary Swift |
An Annotation Scheme for Quantifier Scope Disambiguation |
9:45-11:25 |
Yuichiroh Matsubayashi, Yusuke Miyao and Akiko Aizawa |
Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure |
9:45-11:25 |
Kyoko Ohara |
Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English |
9:45-11:25 |
Roser Morante and Walter Daelemans |
ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories |
|
Session P16 - Document Classification, Text Categorisation |
Chair : Serge Sharoff |
9:45-11:25 |
Mike Kestemont, Claudia Peersman, Benny De Decker, Guy De Pauw, Kim Luyckx, Roser Morante, Frederik Vaassen, Janneke van de Loo and Walter Daelemans |
The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language |
9:45-11:25 |
Kseniya Zablotskaya, Fernando Fernández Martínez and Wolfgang Minker |
Investigating Verbal Intelligence Using the TF-IDF Approach |
9:45-11:25 |
Sanja Štajner and Ruslan Mitkov |
Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach |
9:45-11:25 |
Tommaso Fornaciari and Massimo Poesio |
DeCour: a corpus of DEceptive statements in Italian COURts |
9:45-11:25 |
Amalia Todirascu, Sebastian Pado, Jennifer Krisch, Max Kisselew and Ulrich Heid |
French and German Corpora for Audience-based Text Type Classification |
9:45-11:25 |
Borut Sluban, Senja Pollak, Roel Coesemans and Nada Lavrac |
Irregularity Detection in Categorized Document Corpora |
9:45-11:25 |
Carmen Dayrell, Arnaldo Candido Jr., Gabriel Lima, Danilo Machado Jr., Ann Copestake, Valéria Feltrim, Stella Tagnin and Sandra Aluisio |
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora |
9:45-11:25 |
Andrea Varga, Daniel Preotiuc-Pietro and Fabio Ciravegna |
Unsupervised document zone identification using probabilistic graphical models |
9:45-11:25 |
Mohammad Hossein Elahimanesh, Behrouz Minaei and Hossein Malekinezhad |
Improving K-Nearest Neighbor Efficacy for Farsi Text Classification |
|
Session P18 - Digital Libraries |
Chair : Monica Monachini |
9:45-11:25 |
Jordi Adell, Antonio Bonafonte, Antonio Cardenal, Marta R. Costa-Jussà, José A. R. Fonollosa, Asunción Moreno, Eva Navas and Eduardo R. Banga |
BUCEADOR, a multi-language search engine for digital libraries |
9:45-11:25 |
Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac and Miloš Utvić |
A tool for enhanced search of multilingual digital libraries of e-journals |
9:45-11:25 |
Benjamin Weitz and Ulrich Schäfer |
A Graphical Citation Browser for the ACL Anthology |
9:45-11:25 |
Eleftheria Ahtaridis, Christopher Cieri and Denise DiPersio |
LDC Language Resource Database: Building a Bibliographic Database |
9:45-11:25 |
Eneko Agirre, Ander Barrena, Oier Lopez de Lacalle, Aitor Soroa, Samuel Fernando and Mark Stevenson |
Matching Cultural Heritage items to Wikipedia |
|
Session P19 - Treebanks |
Chair : Menno van Zaanen |
11:45-13:05 |
Seth Kulick, Ann Bies and Justin Mott |
Further Developments in Treebank Error Detection Using Derivation Trees |
11:45-13:05 |
Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Mohamed Maamouri, Ann Bies and Nianwen Xue |
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures |
11:45-13:05 |
Mohamed Maamouri, Ann Bies and Seth Kulick |
Expanding Arabic Treebank to Speech: Results from Broadcast News |
11:45-13:05 |
Magali Sanches Duran and Sandra Maria Aluísio |
Propbank-Br: a Brazilian Treebank annotated with semantic role labels |
11:45-13:05 |
Yi Zhang, Rui Wang and Yu Chen |
Joint Grammar and Treebank Development for Mandarin Chinese with HPSG |
11:45-13:05 |
Annette Rios and Anne Göhring |
A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank |
11:45-13:05 |
Miriam Kaeshammer and Vera Demberg |
German and English Treebanks and Lexica for Tree-Adjoining Grammars |
11:45-13:05 |
Loganathan Ramasamy and Zdeněk Žabokrtský |
Prague Dependency Style Treebank for Tamil |
11:45-13:05 |
Patricia Gonçalves, Rita Santos and António Branco |
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese |
11:45-13:05 |
Dasa Berovic, Zeljko Agic and Marko Tadić |
Croatian Dependency Treebank: Recent Development and Initial Experiments |
11:45-13:05 |
Rahul Agarwal, Bharat Ram Ambati and Anil Kumar Singh |
A GUI to Detect and Correct Errors in Hindi Dependency Treebank |
11:45-13:05 |
Masood Ghayoomi |
From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach |
11:45-13:05 |
Montserrat Marimon, Beatríz Fisas, Núria Bel, Marta Villegas, Jorge Vivaldi, Sergi Torner, Mercè Lorente and Silvia Vázquez |
The IULA Treebank |
11:45-13:05 |
Atro Voutilainen, Kristiina Muhonen, Tanja Purtonen and Krister Lindén |
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3 |
11:45-13:05 |
Cristina Bosco, Manuela Sanguinetti and Leonardo Lesmo |
The Parallel-TUT: a multilingual and multiformat treebank |
11:45-13:05 |
Teresa Lynn, Ozlem Cetinoglu, Jennifer Foster, Elaine Uí Dhonnchadha, Mark Dras and Josef van Genabith |
Irish Treebanking and Parsing: A Preliminary Evaluation |
|
Session P20 - Parsing |
Chair : Antonio Moreno-Sandoval |
11:45-13:05 |
Mohammed Attia, Khaled Shaalan, Lamia Tounsi and Josef van Genabith |
Automatic Extraction and Evaluation of Arabic LFG Resources |
11:45-13:05 |
Kristiina Muhonen and Tanja Purtonen |
Rule-Based Detection of Clausal Coordinate Ellipsis |
11:45-13:05 |
Gülşen Eryiğit |
The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish |
11:45-13:05 |
Stasinos Konstantopoulos, Valia Kordoni, Nicola Cancedda, Vangelis Karkaletsis, Dietrich Klakow and Jean-Michel Renders |
Task-Driven Linguistic Analysis based on an Underspecified Features Representation |
11:45-13:05 |
Malin Ahlberg and Ramona Enache |
Combining Language Resources Into A Grammar-Driven Swedish Parser |
11:45-13:05 |
Eiríkur Rögnvaldsson, Anton Karl Ingason, Einar Freyr Sigurðsson and Joel Wallenberg |
The Icelandic Parsed Historical Corpus (IcePaHC) |
11:45-13:05 |
Anita Alicante, Cristina Bosco, Anna Corazza and Alberto Lavelli |
A treebank-based study on the influence of Italian word order on parsing performance |
11:45-13:05 |
Dong Wang and Fei Xia |
Effect of Genre Variation and Prediction of System Performance |
|
Session P21 - Information Extraction (2) |
Chair : Paul Buitelaar |
11:45-13:05 |
Michael Tepper, Daniel Capurro, Fei Xia, Lucy Vanderwende and Meliha Yetisgen-Yildiz |
Statistical Section Segmentation in Free-Text Clinical Records |
11:45-13:05 |
Ramona Bongelli, Carla Canestrari, Ilaria Riccioni, Andrzej Zuczkowski, Cinzia Buldorini, Ricardo Pietrobon, Alberto Lavelli and Bernardo Magnini |
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty |
11:45-13:05 |
Cristina Mota, Alberto Simões, Cláudia Freitas, Luís Costa and Diana Santos |
Págico: Evaluating Wikipedia-based information retrieval in Portuguese |
11:45-13:05 |
Danica Damljanovic, Udo Kruschwitz, M-Dyaa Albakour, Johann Petrak and Mihai Lupu |
Applying Random Indexing to Structured Data to Find Contextually Similar Words |
11:45-13:05 |
Horacio Saggion and Sandra Szasz |
The CONCISUS Corpus of Event Summaries |
11:45-13:05 |
Gracinda Carvalho, David Martins de Matos and Vitor Rocio |
Building and Exploring Semantic Equivalences Resources |
11:45-13:05 |
Marc Verhagen and James Pustejovsky |
The TARSQI Toolkit |
11:45-13:05 |
Gabriella Pardelli, Manuela Sassi, Sara Goggi and Stefania Biagioni |
From medical language processing to BioNLP domain |
11:45-13:05 |
Romaric Besançon, Olivier Ferret and Ludovic Jean-Louis |
Evaluation of a Complex Information Extraction Application in Specific Domain |
11:45-13:05 |
Hannah Kermes |
A methodology for the extraction of information about the usage of formulaic expressions in scientific texts |
11:45-13:05 |
André Santos, José João Almeida and Nuno Carvalho |
Structural alignment of plain text books |
11:45-13:05 |
Gerold Schneider, Fabio Rinaldi and Simon Clematide |
Dependency parsing for interaction detection in pharmacogenomics |
11:45-13:05 |
William Black, Rob Procter, Steven Gray and Sophia Ananiadou |
A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic. |
|
Session P23 - Machine Translation (1) |
Chair : Philippe Langlais |
14:55-16:35 |
Bruno Cartoni and Thomas Meyer |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies |
14:55-16:35 |
Marianna J. Martindale |
Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine? |
14:55-16:35 |
Sanja Seljan, Marija Brkić and Tomislav Vičić |
BLEU Evaluation of Machine-Translated English-Croatian Legislation |
14:55-16:35 |
Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi |
Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese |
14:55-16:35 |
Juan Pablo Martínez Cortés, Jim O'Regan and Francis Tyers |
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese |
14:55-16:35 |
Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović and Daniel Zeman |
Automatic MT Error Analysis: Hjerson Helping Addicter |
14:55-16:35 |
Amit Sangodkar and Om Damani |
Re-ordering Source Sentences for SMT |
14:55-16:35 |
Ângela Costa, Tiago Luís, Joana Ribeiro, Ana Cristina Mendes and Luísa Coheur |
An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT |
14:55-16:35 |
Mehmet Talha Çakmak, Süleyman Acar and Gülşen Eryiğit |
Word Alignment for English-Turkish Language Pair |
14:55-16:35 |
Radu Ion |
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora |
14:55-16:35 |
Eleftherios Avramidis, Marta R. Costa-Jussà, Christian Federmann, Josef van Genabith, Maite Melero and Pavel Pecina |
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation |
14:55-16:35 |
Stephen Grimes, Katherine Peterson and Xuansong Li |
Automatic word alignment tools to scale production of manually aligned parallel texts |
14:55-16:35 |
Carla Parra Escartín |
Design and compilation of a specialized Spanish-German parallel corpus |
14:55-16:35 |
Jörg Tiedemann, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen and Matthias Zumpe |
A Distributed Resource Repository for Cloud-Based Machine Translation |
|
Session P24 - Corpus Creation, Processing, Usage (1) |
Chair : Takenobu Tokunaga |
14:55-16:35 |
Jörg Tiedemann |
Parallel Data, Tools and Interfaces in OPUS |
14:55-16:35 |
Maciej Ogrodniczuk |
The Polish Sejm Corpus |
14:55-16:35 |
Lieve Macken, Veronique Hoste, Marielle Leijten and Luuk Van Waes |
From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information |
14:55-16:35 |
Maristella Agosti, Birgit Alber, Giorgio Maria Di Nunzio, Marco Dussin, Stefan Rabanus and Alessandra Tomaselli |
A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties |
14:55-16:35 |
Michel Généreux, Iris Hendrickx and Amália Mendes |
Introducing the Reference Corpus of Contemporary Portuguese Online |
14:55-16:35 |
Mojgan Seraji, Beáta Megyesi and Joakim Nivre |
A Basic Language Resource Kit for Persian |
14:55-16:35 |
Eric Sanders |
Collecting and Analysing Chats and Tweets in SoNaR |
14:55-16:35 |
Tomaž Erjavec |
The goo300k corpus of historical Slovene |
14:55-16:35 |
Mathieu-Henri Falco, Véronique Moriceau and Anne Vilnat |
Kitten: a tool for normalizing HTML and extracting its textual content |
14:55-16:35 |
Maaske Treurniet, Orphée De Clercq, Henk van den Heuvel and Nelleke Oostdijk |
Collection of a corpus of Dutch SMS |
14:55-16:35 |
Alessandro Panunzi, Marco Fabbri, Massimo Moneglia, Lorenzo Gregori and Samuele Paladini |
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building |
14:55-16:35 |
Brett Drury and José João Almeida |
The Minho Quotation Resource |
14:55-16:35 |
Elena Frick, Carsten Schnober and Piotr Bański |
Evaluating Query Languages for a Corpus Processing System |
|
Session P26 - Multilinguality |
Chair : Gil Francopoulo |
16:55-18:15 |
Jyrki Niemi and Krister Lindén |
Representing the Translation Relation in a Bilingual Wordnet |
16:55-18:15 |
Alexandr Rosen and Martin Vavřín |
Building a multilingual parallel corpus for human users |
16:55-18:15 |
Martina Katalin Szabó, Veronika Vincze and István Nagy T. |
HunOr: A Hungarian-Russian Parallel Corpus |
16:55-18:15 |
Kanika Gupta, Monojit Choudhury and Kalika Bali |
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics |
16:55-18:15 |
Gilles Sérasset |
Dbnary: Wiktionary as a LMF based Multilingual RDF network |
16:55-18:15 |
Lluís Padró and Evgeny Stanilovsky |
FreeLing 3.0: Towards Wider Multilinguality |
16:55-18:15 |
Svetla Koeva, Ivelina Stoyanova, Rositsa Dekova, Borislav Rizov and Angel Genov |
Bulgarian X-language Parallel Corpus |
16:55-18:15 |
Enikő Héja and Dávid Takács |
Automatically Generated Online Dictionaries |
16:55-18:15 |
Costanza Navarretta, Elisabeth Ahlsén, Jens Allwood, Kristiina Jokinen and Patrizia Paggio |
Feedback in Nordic First-Encounters: a Comparative Study |
16:55-18:15 |
Yu Chen and Andreas Eisele |
MultiUN v2: UN Documents with Multilingual Alignments |
16:55-18:15 |
Zahurul Islam and Alexander Mehler |
Customization of the Europarl Corpus for Translation Studies |
16:55-18:15 |
Thierry Declerck, Karlheinz Mörth and Piroska Lendvai |
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies |
16:55-18:15 |
Ying Li, Yue Yu and Pascale Fung |
A Mandarin-English Code-Switching Corpus |
16:55-18:15 |
Paulo Fernandes, Lucelene Lopes, Carlos A. Prolo, Afonso Sales and Renata Vieira |
A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever |
16:55-18:15 |
Aitor Gonzalez-Agirre, Egoitz Laparra and German Rigau |
Multilingual Central Repository version 3.0 |
|
Session P27 - Question Answering and Summarisation |
Chair : Horacio Saggion |
16:55-18:15 |
Christian Smith, Henrik Danielsson and Arne Jönsson |
A good space: Lexical predictors in word space evaluation |
16:55-18:15 |
Ulrich Andersen, Anna Braasch, Lina Henriksen, Csaba Huszka, Anders Johannsen, Lars Kayser, Bente Maegaard, Ole Norgaard, Stefan Schulz and Jürgen Wedekind |
Creation and use of Language Resources in a Question-Answering eHealth System |
16:55-18:15 |
Atsushi Fujii, Yuya Fujii and Takenobu Tokunaga |
Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions |
16:55-18:15 |
Silvia Quarteroni, Vincenzo Guerrisi and Pietro La Torre |
Evaluating Multi-focus Natural Language Queries over Data Services |
16:55-18:15 |
Maria Fuentes, Horacio Rodríguez and Jordi Turmo |
Summarizing a multimodal set of documents in a Smart Room |
|
Session P28 - Multimodal Corpus for Interaction |
Chair : Vangelis Karkaletsis |
16:55-18:15 |
Dietmar Rösner, Jörg Frommer, Rafael Friesen, Matthias Haase, Julia Lange and Mirko Otto |
LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions |
16:55-18:15 |
Karën Fort and Vincent Claveau |
Annotating Football Matches: Influence of the Source Medium on Manual Annotation |
16:55-18:15 |
Stephanie Strassel, Amanda Morris, Jonathan Fiscus, Christopher Caruso, Haejoong Lee, Paul Over, James Fiumara, Barbara Shaw, Brian Antonishek and Martial Michel |
Creating HAVIC: Heterogeneous Audio Visual Internet Collection |
16:55-18:15 |
Charlotte Alazard, Corine Astésano and Michel Billières |
MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions. |
|
Session P29 - Ontologies |
Chair : Paola Velardi |
16:55-18:15 |
Egoitz Laparra, German Rigau and Piek Vossen |
Mapping WordNet to the Kyoto ontology |
16:55-18:15 |
Kugatsu Sadamitsu, Kuniko Saito, Kenji Imamura and Yoshihiro Matsuo |
Constructing a Class-Based Lexical Dictionary using Interactive Topic Models |
16:55-18:15 |
Verginica Barbu Mititelu |
Adding Morpho-semantic Relations to the Romanian Wordnet |
16:55-18:15 |
Julien Seinturier, Elisabeth Murisasco, Emmanuel Bruno and Philippe Blache |
An ontological approach to model and query multimodal concurrent linguistic annotations |
16:55-18:15 |
Massimo Moneglia, Monica Monachini, Omar Calabrese, Alessandro Panunzi, Francesca Frontini, Gloria Gagliardi and Irene Russo |
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation |
16:55-18:15 |
Inga Gheorghita and Jean-Marie Pierrel |
Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary |
16:55-18:15 |
John McCrae, Elena Montiel-Ponsoda and Philipp Cimiano |
Collaborative semantic editing of linked data lexica |
16:55-18:15 |
Christophe Roche |
Ontoterminology: How to unify terminology and ontology into a single paradigm |
16:55-18:15 |
Alexandre Denis, Ingrid Falk, Claire Gardent and Laura Perez-Beltrachini |
Representation of linguistic and domain knowledge for second language learning in virtual worlds |
16:55-18:15 |
Petya Osenova, Kiril Simov, Laska Laskova and Stanislava Kancheva |
A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian |
16:55-18:15 |
Elisa Bianchi, Mirko Tavosanis and Emiliano Giovannetti |
Creation of a bottom-up corpus-based ontology for Italian Linguistics |
16:55-18:15 |
Matteo Abrate and Clara Bacciu |
Visualizing word senses in WordNet Atlas |
|
Session P30 - Discourse |
Chair : David Traum |
18:20-19:40 |
Kristiina Jokinen and Silvi Tenjes |
Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data |
18:20-19:40 |
Patrick Saint-Dizier |
DISLOG: A logic-based language for processing discourse structures |
18:20-19:40 |
Sarah Bourse and Patrick Saint-Dizier |
A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures |
18:20-19:40 |
Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski and Elke Teich |
Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach |
18:20-19:40 |
Sucheta Ghosh, Richard Johansson, Giuseppe Riccardi and Sara Tonelli |
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing |
18:20-19:40 |
Elizabeth Baran, Yaqin Yang and Nianwen Xue |
Annotating dropped pronouns in Chinese newswire text |
18:20-19:40 |
Magdalena Rysova |
Alternative Lexicalizations of Discourse Connectives in Czech |
18:20-19:40 |
Utku Şirin, Ruket Çakıcı and Deniz Zeyrek |
METU Turkish Discourse Bank Browser |
18:20-19:40 |
David Elson |
DramaBank: Annotating Agency in Narrative Discourse |
18:20-19:40 |
Gisela Redeker, Ildikó Berzlánovich, Nynke van der Vliet, Gosse Bouma and Markus Egg |
Multi-Layer Discourse Annotation of a Dutch Text Corpus |
18:20-19:40 |
Iskandar Keskes, Farah Benamara and Lamia Hadrich Belguith |
Clause-based Discourse Segmentation of Arabic Texts |
18:20-19:40 |
Mariana Gomes, Ana Guilherme, Leonor Tavares and Rita Marquilhas |
Project FLY: a multidisciplinary project within Linguistics |
18:20-19:40 |
Ching-Sheng Lin, Zumrut Akcam, Samira Shaikh, Sharon Small, Ken Stahl, Tomek Strzalkowski and Nick Webb |
Revealing Contentious Concepts Across Social Groups |
|
Session P32 - Corpus Creation, Processing, Usage (2) |
Chair : Shyam Agrawal |
18:20-19:40 |
Ting Liu, Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley, Sarah Taylor, Umit Boz, Xiaoai Ren and Jingsi Wu |
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language |
18:20-19:40 |
Ivana Tanasijević, Biljana Sikimić and Gordana Pavlović-Lažetić |
Multimedia database of the cultural heritage of the Balkans |
18:20-19:40 |
Rania Al-Sabbagh and Roxana Girju |
YADAC: Yet another Dialectal Arabic Corpus |
18:20-19:40 |
Yves Scherrer and Bruno Cartoni |
The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction |
18:20-19:40 |
Martin Reynaert, Ineke Schuurman, Veronique Hoste, Nelleke Oostdijk and Maarten van Gompel |
Beyond SoNaR: towards the facilitation of large corpus building efforts |
18:20-19:40 |
Piotr Bański, Peter M. Fischer, Elena Frick, Erik Ketzan, Marc Kupietz, Carsten Schnober, Oliver Schonefeld and Andreas Witt |
The New IDS Corpus Analysis Platform: Challenges and Prospects |
18:20-19:40 |
Kurt Eberle, Kerstin Eckart, Ulrich Heid and Boris Haselbach |
A Tool/Database Interface for Multi-Level Analyses |
18:20-19:40 |
Djamel Mostefa, Khalid Choukri, Sylvie Brunessaux and Karim Boudahmane |
New language resources for the Pashto language |
18:20-19:40 |
Şenay Kafkas, Ian Lewin, David Milward, Erik van Mulligen, Jan Kors, Udo Hahn and Dietrich Rebholz-Schuhmann |
CALBC: Releasing the Final Corpora |
18:20-19:40 |
Martin Majliš and Zdeněk Žabokrtský |
Language Richness of the Web |
|
Session P33 - Web Services |
Chair : Nuria Bel |
18:20-19:40 |
Markus Forsberg and Torbjörn Lager |
Cloud Logic Programming for Integrating Language Technology Resources |
18:20-19:40 |
Marc Kemps-Snijders, Matthijs Brouwer, Jan Pieter Kunst and Tom Visser |
Dynamic web service deployment in a cloud environment |
18:20-19:40 |
Bharat Ram Ambati, Siva Reddy and Adam Kilgarriff |
Word Sketches for Turkish |
18:20-19:40 |
Chunqi Shi, Donghui Lin and Toru Ishida |
Service Composition Scenarios for Task-Oriented Translation |
18:20-19:40 |
Aleksandar Savkov, Laska Laskova, Stanislava Kancheva, Petya Osenova and Kiril Simov |
Linguistic Analysis Processing Line for Bulgarian |
18:20-19:40 |
Victoria Arranz and Olivier Hamon |
On the Way to a Legal Sharing of Web Applications in NLP |
18:20-19:40 |
Rafal Rak, Andrew Rowley and Sophia Ananiadou |
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench |
18:20-19:40 |
Javier Caminero, Mari Carmen Rodríguez, Jean Vanderdonckt, Fabio Paternò, Joerg Rett, Dave Raggett, Jean-Loup Comeliau and Ignacio Marín |
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends |
|
Session P34 - Corpus Creation, Processing, Usage (3) |
Chair : German Rigau |
9:45-11:25 |
Ronaldo Martins |
Le Petit Prince in UNL |
9:45-11:25 |
Christian Chiarcos |
A generic formalism to represent linguistic corpora in RDF and OWL/DL |
9:45-11:25 |
Silvia Pareti |
A Database of Attribution Relations |
9:45-11:25 |
Bartosz Broda, Michał Marcińczuk, Marek Maziarz, Adam Radziszewski and Adam Wardyński |
KPWr: Towards a Free Corpus of Polish |
9:45-11:25 |
Yeşim Aksan, Mustafa Aksan, Ahmet Koltuksuz, Taner Sezer, Ümit Mersinli, Umut Ufuk Demirhan, Hakan Yılmazer, Gülsüm Atasoy, Seda Öz, İpek Yıldız and Özlem Kurtoğlu |
Construction of the Turkish National Corpus (TNC) |
9:45-11:25 |
Jirka Hana, Alexandr Rosen, Barbora Štindlová and Petr Jäger |
Building a learner corpus |
9:45-11:25 |
Isabella Poggi, Francesca D'Errico and Giovanna Leone |
Pedagogical stances and their multimodal signals. |
9:45-11:25 |
Tsuyoshi Okita |
Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner |
9:45-11:25 |
Djamé Seddah, Marie Candito, Benoit Crabbé and Enrique Henestroza Anguiano |
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus |
|
Session P35 - Language Resource Infrastructures (2) |
Chair : Claudia Soria |
9:45-11:25 |
Herman Stehouwer, Matej Durco, Eric Auer and Daan Broeder |
Federated Search: Towards a Common Search Infrastructure |
9:45-11:25 |
Willem Elbers, Daan Broeder and Dieter van Uytvanck |
Proper Language Resource Centers |
9:45-11:25 |
Sebastian Drude, Daan Broeder, Paul Trilsbeek and Peter Wittenburg |
The Language Archive - a new hub for language resources |
9:45-11:25 |
Kais Dukes and Eric Atwell |
LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis |
9:45-11:25 |
James Clarke, Vivek Srikumar, Mark Sammons and Dan Roth |
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) |
9:45-11:25 |
Marta Villegas, Núria Bel, Carlos Gonzalo, Amparo Moreno and Nuria Simelio |
Using Language Resources in Humanities research |
9:45-11:25 |
Sebastian Nordhoff and Harald Hammarström |
Glottolog/Langdoc: Increasing the visibility of grey literature for low-density languages |
9:45-11:25 |
Steve Cassidy, Michael Haugh, Pam Peters and Mark Fallu |
The Australian National Corpus: National Infrastructure for Language Resources |
9:45-11:25 |
Christian Federmann, Ioanna Giannopoulou, Christian Girardi, Olivier Hamon, Dimitris Mavroeidis, Salvatore Minutoli and Marc Schröder |
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools |
9:45-11:25 |
Alessio Bosca, Luca Dini, Milen Kouylekov and Marco Trevisan |
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. |
|
Session P36 - Speech Synthesis |
Chair : Martine Garnier-Rizet |
9:45-11:25 |
Iñaki Sainz, Daniel Erro, Eva Navas, Inma Hernáez, Jon Sánchez, Ibon Saratxaga and Igor Odriozola |
Versatile Speech Databases for High Quality Synthesis for Basque |
9:45-11:25 |
Dietmar Schabus, Michael Pucher and Gregor Hofer |
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis |
9:45-11:25 |
Alistair Conkie, Thomas Okken, Yeon-Jun Kim and Giuseppe Di Fabbrizio |
Building Text-To-Speech Voices in the Cloud |
9:45-11:25 |
Emília Garcia Casademont, Antonio Bonafonte and Asunción Moreno |
Building Synthetic Voices in the META-NET Framework |
9:45-11:25 |
Nur-Hana Samsudin and Mark Lee |
Building Text-to-Speech Systems for Resource Poor Languages |
9:45-11:25 |
Eva Szekely, Joao Paulo Cabral, Mohamed Abou-Zleikha, Peter Cahill and Julie Carson-Berndsen |
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases |
|
Session P37 - Speech Resources |
Chair : Andrea Paoloni |
9:45-11:25 |
Panikos Heracleous, Carlos Ishi, Takahiro Miyashita and Norihiro Hagita |
Body-conductive acoustic sensors in human-robot communication |
9:45-11:25 |
Lucie Válková, Martina Waclawičová and Michal Křen |
Balanced data repository of spontaneous spoken Czech |
9:45-11:25 |
R.P. Clapham, L. van der Molen, R.J.J.H. van Son, M. van den Brekel and F.J.M. Hilgers |
NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy |
9:45-11:25 |
Thomas Ulrich Christiansen and Peter Juel Henrichsen |
Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies |
9:45-11:25 |
Peter Juel Henrichsen and Marcus Uneson |
SMALLWorlds -- Multilingual Content-Controlled Monologues |
9:45-11:25 |
Alexander Schmitt, Stefan Ultes and Wolfgang Minker |
A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System |
9:45-11:25 |
Sergey Zablotskiy, Alexander Shvets, Maxim Sidorov, Eugene Semenkin and Wolfgang Minker |
Speech and Language Resources for LVCSR of Russian |
9:45-11:25 |
Dae-Lim Choi, Bong-Wan Kim, Yeon-Whoa Kim, Yong-Ju Lee, Yongnam Um and Minhwa Chung |
Dysarthric Speech Database for Development of QoLT Software Technology |
9:45-11:25 |
Eckhard Bick, Heliana Mello, Alessandro Panunzi and Tommaso Raso |
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser |
9:45-11:25 |
Janne Bondi Johannessen, Joel Priestley, Kristin Hagen, Anders Nøklestad and André Lynum |
The Nordic Dialect Corpus |
9:45-11:25 |
Dafydd Gibbon |
ULex: new data models and a mobile environment for corpus enrichment. |
9:45-11:25 |
Kengo Ohta, Masatoshi Tsuchiya and Seiichi Nakagawa |
Developing Partially-Transcribed Speech Corpus from Edited Transcriptions |
9:45-11:25 |
Xiaoyi Ma |
LDC Forced Aligner |
9:45-11:25 |
Sebastian Stüker, Florian Kraft, Christian Mohr, Teresa Herrmann, Eunah Cho and Alex Waibel |
The KIT Lecture Corpus for Speech Translation |
9:45-11:25 |
Shyam Agrawal, Shweta Sinha, Pooja Singh and Jesper Olson |
Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment |
|
Session P38 - Subjectivity: Sentiments, Emotions, Opinions (2) |
Chair : Paolo Rosso |
11:45-13:25 |
Simon Clematide, Stefan Gindl, Manfred Klenner, Stefanos Petrakis, Robert Remus, Josef Ruppenhofer, Ulli Waltinger and Michael Wiegand |
MLSA - A Multi-layered Reference Corpus for German Sentiment Analysis |
11:45-13:25 |
Silvia Vázquez and Núria Bel |
A Classification of Adjectives for Polarity Lexicons Enhancement |
11:45-13:25 |
Jorge Carrillo de Albornoz, Laura Plaza and Pablo Gervás |
SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis |
11:45-13:25 |
Tom De Smedt and Walter Daelemans |
Vreselijk mooi! (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives. |
11:45-13:25 |
Rasmus Sundberg, Anders Eriksson, Johan Bini and Pierre Nugues |
Visualizing Sentiment Analysis on a User Forum |
11:45-13:25 |
Erik Cambria, Yunqing Xia and Amir Hussain |
Affective Common Sense Knowledge Acquisition for Sentiment Analysis |
|
Session P39 - Language Resource Infrastructures (3) |
Chair : Penny Labropoulou |
11:45-13:25 |
Emanuel Dima, Verena Henrich, Erhard Hinrichs, Marie Hinrichs, Christina Hoppermann, Thorsten Trippel, Thomas Zastrow and Claus Zinn |
A Repository for the Sustainable Management of Research Data |
11:45-13:25 |
Maciej Ogrodniczuk, Piotr Pęzik and Adam Przepiórkowski |
Towards a comprehensive open repository of Polish language resources |
11:45-13:25 |
Lars Borin, Markus Forsberg, Leif-Jöran Olsson and Jonatan Uppström |
The open lexical infrastructure of Språkbanken |
11:45-13:25 |
Christian Chiarcos, Sebastian Hellmann, Sebastian Nordhoff, Steven Moran, Richard Littauer, Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek and Christian M. Meyer |
The Open Linguistics Working Group |
11:45-13:25 |
Silke Scheible, Richard J. Whitt, Martin Durrell and Paul Bennett |
GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German |
11:45-13:25 |
Nicolas Hernandez |
Tackling interoperability issues within UIMA workflows |
|
Session P41 - Semantics |
Chair : Marc Verhagen |
11:45-13:25 |
Alain Joubert and Mathieu Lafourcade |
A new dynamic approach for lexical networks evaluation |
11:45-13:25 |
Roberta Catizone, Louise Guthrie, Arthur Thomas and Yorick Wilks |
LIE: Leadership, Influence and Expertise |
11:45-13:25 |
Richard Johansson, Karin Friberg Heppin and Dimitrios Kokkinakis |
Semantic Role Labeling with the Swedish FrameNet |
11:45-13:25 |
Pedro Fialho, Sérgio Curto, Ana Cristina Mendes and Luísa Coheur |
Extending a wordnet framework for simplicity and scalability |
11:45-13:25 |
Boris Haselbach, Wolfgang Seeker and Kerstin Eckart |
German "nach"-Particle Verbs in Semantic Theory and Corpus Data |
11:45-13:25 |
Alessandro Lenci, Gabriella Lapesa and Giulia Bonansinga |
LexIt: A Computational Resource on Italian Argument Structure |
11:45-13:25 |
Alessandro Lenci, Simonetta Montemagni, Giulia Venturi and Maria Grazia Cutrullà |
Enriching the ISST-TANL Corpus with Semantic Frames |
|
Session P43 - Sign Language |
Chair : Thomas Hanke |
11:45-13:25 |
Zoya Gavrilov, Stan Sclaroff, Carol Neidle and Sven Dickinson |
Detecting Reduplication in Videos of American Sign Language |
11:45-13:25 |
Nedelina Ivanova and Olle Eriksen |
BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users |
11:45-13:25 |
Fabrizio Borgia, Claudia S. Bianchini, Patrice Dalle and Maria De Marsico |
Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber) |
11:45-13:25 |
Jens Forster, Christoph Schmidt, Thomas Hoyoux, Oscar Koller, Uwe Zelle, Justus Piater and Hermann Ney |
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus |
|
Session P44 - Machine Translation (2) |
Chair : Jan Hajic |
14:55-16:35 |
Saab Mansour and Hermann Ney |
Arabic-Segmentation Combination Strategies for Statistical Machine Translation |
14:55-16:35 |
Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel and Aleš Tamchyna |
The Joy of Parallelism with CzEng 1.0 |
14:55-16:35 |
Takanori Kusumoto and Tomoyosi Akiba |
Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension |
14:55-16:35 |
Patrik Lambert, Holger Schwenk and Frédéric Blain |
Automatic Translation of Scientific Documents in the HAL Archive |
14:55-16:35 |
Georgi Iliev and Angel Genov |
Expanding Parallel Resources for Medium-Density Languages for Free |
14:55-16:35 |
Elisabet Comelles, Jordi Atserias, Victoria Arranz and Irene Castellón |
VERTa: Linguistic features in MT evaluation |
14:55-16:35 |
Zhiyi Song, Safa Ismael, Stephen Grimes, David Doermann and Stephanie Strassel |
Linguistic Resources for Handwriting Recognition and Translation Evaluation |
14:55-16:35 |
Fangzhong Su and Bogdan Babych |
Development and Application of a Cross-language Document Comparability Metric |
14:55-16:35 |
Claire Jaja, Douglas Briesch, Jamal Laoudi and Clare Voss |
Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System |
14:55-16:35 |
Ananthakrishnan Ramanathan and Karthik Visweswariah |
A Study of Word-Classing for MT Reordering |
14:55-16:35 |
João Silva, Luísa Coheur, Ângela Costa and Isabel Trancoso |
Dealing with unknown words in statistical machine translation |
14:55-16:35 |
Wilker Aziz, Sheila Castilho and Lucia Specia |
PET: a Tool for Post-editing and Assessing Machine Translation |
14:55-16:35 |
Chris Irwin Davis |
Tajik-Farsi Persian Transliteration Using Statistical Machine Translation |
14:55-16:35 |
Emma Barker and Robert Gaizauskas |
Assessing the Comparability of News Texts |
|
Session P48 - Speech/Multimodal Tools, Systems, Applications |
Chair : Tomaž Erjavec |
14:55-16:35 |
Michael Kipp |
Annotation Facilities for the Reliable Analysis of Human Motion |
14:55-16:35 |
Michael Carl |
Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research |
14:55-16:35 |
Giovanni Costantini, Andrea Paoloni and Massimiliano Todisco |
Intelligibility assessment in forensic applications |
14:55-16:35 |
David Tavarez, Eva Navas, Daniel Erro and Ibon Saratxaga |
Strategies to Improve a Speaker Diarisation Tool |
14:55-16:35 |
Igor Odriozola, Eva Navas, Inma Hernáez, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez and Daniel Erro |
Using an ASR database to design a pronunciation evaluation system in Basque |
14:55-16:35 |
Francesco Cutugno, Vincenza Anna Leano and Antonio Origlia |
W-PhAMT: A web tool for phonetic multilevel timeline visualization |
14:55-16:35 |
Amalia Zahra and Julie Carson-Berndsen |
English to Indonesian Transliteration to Support English Pronunciation Practice |
14:55-16:35 |
Joao Paulo Cabral, Mark Kane, Zeeshan Ahmed, Mohamed Abou-Zleikha, Eva Szekely, Amalia Zahra, Kalu Ogbureke, Peter Cahill, Julie Carson-Berndsen and Stephan Schlogl |
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz |
14:55-16:35 |
Bernhard Brüning, Christian Schnier, Karola Pitsch and Sven Wachsmuth |
PAMOCAT: Automatic retrieval of specified postures |
|
|