LREC 2010 Proceedings

INTRODUCTORY MESSAGES:

Nicoletta Calzolari - Introduction of the Conference Chair
Stelios Piperidis - Message from ELRA President
Khalid Choukri - Message from ELRA Secretary General and ELDA Managing Director
Mike Rosner - Message of the Chair of the Local Organizing Committee

INVITED TALK:

Ray Fabri - Maltese at the Crossroads of Technological Developments (Slides)

KEYNOTES SPEECHES:

Jaime Carbonell - Intelligence Resource Collection for Low-Density Languages (slides)
Ralf Steinberger - Challenges and Methods for Multilingual Text Mining (slides)

PANEL:

Keith Miller - Perspectives on Machine Translation Evaluation (Slides)

SESSIONS: Browse articles of the conference sorted by session number

	Session O1 - Semantic Acquisition	Chairperson : Maria Teresa Pazienza
11:35-11:55	Fabienne Fritzinger, Frank Richter and Marion Weller	Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text
11:55-12:15	Luca Dini and Giampaolo Mazzini	The Impact of Grammar Enhancement on Semantic Resources Induction
12:15-12:35	Alessandro Lenci, Martina Johnson and Gabriella Lapesa	Building an Italian FrameNet through Semi-automatic Corpus Analysis
12:35-12:55	Claire Mouton, Gaël de Chalendar and Benoît Richert	FrameNet Translation Using Bilingual Dictionaries with Evaluation on the English-French Pair
12:55-13:15	Paul Cook and Suzanne Stevenson	Automatically Identifying Changes in the Semantic Orientation of Words

	Session O2 - LR Infrastructures and Standards	Chairperson : Christopher Cieri
11:35-11:55	Lars Borin, Markus Forsberg and Dimitrios Kokkinakis	Diabase: Towards a Diachronic BLARK in Support of Historical Studies
11:55-12:15	Daan Broeder, Marc Kemps-Snijders, Dieter Van Uytvanck, Menzo Windhouwer, Peter Withers, Peter Wittenburg and Claus Zinn	A Data Category Registry- and Component-based Metadata Framework
12:15-12:35	Jan Odijk	The CLARIN-NL Project
12:35-12:55	Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary and Nasredine Semmar	MLIF : A Metamodel to Represent and Exchange Multilingual Textual Information
12:55-13:15	Peter Wittenburg, Nuria Bel, Lars Borin, Gerhard Budin, Nicoletta Calzolari, Eva Hajicova, Kimmo Koskenniemi, Lothar Lemnitzer, Bente Maegaard, Maciej Piasecki, Jean-Marie Pierrel, Stelios Piperidis, Inguna Skadina, Dan Tufis, Remco van Veenendaal, Tamas Váradi and Martin Wynne	Resource and Service Centres as the Backbone for a Sustainable Service Infrastructure

	Session O3 - Dialogue and Evaluation	Chairperson : Sophie Rosset
11:35-11:55	Susan Robinson, Antonio Roque and David Traum	Dialogues in Context: An Objective User-Oriented Evaluation Approach for Virtual Human Dialogue
11:55-12:15	Joshua B. Gordon and Rebecca J. Passonneau	An Evaluation Framework for Natural Language Understanding in Spoken Dialogue Systems
12:15-12:35	Sunao Hara, Norihide Kitaoka and Kazuya Takeda	Estimation Method of User Satisfaction Using N-gram-based Dialog History Model for Spoken Dialog System
12:35-12:55	Nick Webb, David Benyon, Preben Hansen and Oil Mival	Evaluating Human-Machine Conversation for Appropriateness
12:55-13:15	Svetlana Stoyanchev and Paul Piwek	Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues

	Session O4 - Text-to-Speech Corpora	Chairperson : Harald Höge
11:35-11:55	Didier Cadic, Cédric Boidin and Christophe d'Alessandro	Towards Optimal TTS Corpora
11:55-12:15	Michael Pucher, Friedrich Neubarth, Volker Strom, Sylvia Moosmüller, Gregor Hofer, Christian Kranzler, Gudrun Schuchmann and Dietmar Schabus	Resources for Speech Synthesis of Viennese Varieties
12:15-12:35	Pavel Skrelin, Nina Volskaya, Daniil Kocharov, Karina Evgrafova, Olga Glotova and Vera Evdokimova	A Fully Annotated Corpus of Russian Speech
12:35-12:55	Francisco Campillo, Daniela Braga, Ana Belén Mourín, Carmen García-Mateo, Pedro Silva, Miguel Sales Dias and Francisco Méndez	Building High Quality Databases for Minority Languages such as Galician
12:55-13:15	Alexandros Lazaridis, Theodoros Kostoulas, Todor Ganchev, Iosif Mporas and Nikos Fakotakis	Vergina: A Modern Greek Speech Database for Speech Synthesis

	Session O5 - Knowledge Discovery	Chairperson :
14:45-15:05	Danica Damljanovic, Milan Agatonovic and Hamish Cunningham	Identification of the Question Focus: Combining Syntactic Analysis and Ontology-based Lookup through the User Interaction
15:05-15:25	Paul McNamee, Hoa Trang Dang, Heather Simpson, Patrick Schone and Stephanie M. Strassel	An Evaluation of Technologies for Knowledge Base Population
15:25-15:45	Eneko Agirre, Montse Cuadros, German Rigau and Aitor Soroa	Exploring Knowledge Bases for Similarity
15:45-16:05	Francesca Fallucchi, Maria Teresa Pazienza and Fabio Massimo Zanzotto	Generic Ontology Learners on Application Domains
16:05-16:25	Jorge Vivaldi and Horacio Rodríguez	Finding Domain Terms using Wikipedia

	Session O6 - Temporal and Spatial Annotation - Special Session	Chairperson : James Pustejovsky
14:45-15:05	James Pustejovsky, Kiyong Lee, Harry Bunt and Laurent Romary	ISO-TimeML: An International Standard for Semantic Annotation
15:05-15:25	Leon Derczynski and Robert Gaizauskas	Analysing Temporally Annotated Corpora with CAVaT
15:25-15:45	Naushad UzZaman and James Allen	TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text
15:45-16:05	Parisa Kordjamshidi, Martijn Van Otterlo and Marie-Francine Moens	Spatial Role Labeling: Task Definition and Annotation Scheme

	Session O7 - Evaluation Methodologies	Chairperson :
14:45-15:05	Jerid Francom, Amy LaCross and Adam Ussishkin	How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese.
15:05-15:25	Yoshinobu Kano, Ruben Dorado, Luke McCrohon, Sophia Ananiadou and Jun'ichi Tsujii	U-Compare: An Integrated Language Resource Evaluation Platform Including a Comprehensive UIMA Resource Library
15:25-15:45	Haïfa Zargayouna and Adeline Nazarenko	Evaluation of Textual Knowledge Acquisition Tools: a Challenging Task
15:45-16:05	K. Bretonnel Cohen, Christophe Roeder, William A. Baumgartner Jr., Lawrence E. Hunter and Karin Verspoor	Test Suite Design for Biomedical Ontology Concept Recognition Systems
16:05-16:25	Ondřej Bojar, Adam Liška and Zdeněk Žabokrtský	Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9

	Session O8 - Sign Language	Chairperson : Eleni Efthimiou
14:45-15:05	Annelies Braffort, Laurence Bolot, Emilie Chételat-Pelé, Annick Choisier, Maxime Delorme, Michael Filhol, Jérémie Segouat, Cyril Verrecchia, Flora Badin and Nadège Devos	Sign Language Corpora for Analysis, Processing and Evaluation
15:05-15:25	Onno Crasborn	The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources
15:25-15:45	Kyle Duarte and Sylvie Gibet	Heterogeneous Data Sources for Signed Language Analysis and Synthesis: The SignCom Project
15:45-16:05	Antonio Balvet, Cyril Courtin, Dominique Boutet, Christian Cuxac, Ivani Fusellier-Souza, Brigitte Garcia, Marie-Thérèse L’Huillier and Marie-Anne Sallandre	The Creagest Project: a Digitized and Annotated Corpus for French Sign Language (LSF) and Natural Gestural Languages
16:05-16:25	Philippe Dreuw, Hermann Ney, Gregorio Martinez, Onno Crasborn, Justus Piater, Jose Miguel Moya and Mark Wheatley	The SignSpeak Project - Bridging the Gap Between Signers and Speakers

	Session O9 - Anaphora, Coreference	Chairperson : Bernardo Magnini
16:45-17:05	Costanza Navarretta	The DAD Parallel Corpora and their Uses
17:05-17:25	Massimo Poesio, Olga Uryupina and Yannick Versley	Creating a Coreference Resolution System for Italian
17:25-17:45	Arndt Riester, David Lorenz and Nina Seemann	A Recursive Annotation Scheme for Referential Information Status
17:45-18:05	Tommaso Caselli and Irina Prodanof	Annotating Event Anaphora: A Case Study

	Session O10 - Machine Translation	Chairperson : Robert Frederking
16:45-17:05	Sherri Condon, Dan Parvaz, John Aberdeen, Christy Doran, Andrew Freeman and Marwan Awad	Evaluation of Machine Translation Errors in English and Iraqi Arabic
17:05-17:25	Jörg Tiedemann	Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment
17:25-17:45	Maria Holmqvist	Heuristic Word Alignment with Parallel Phrases
17:45-18:05	Sylwia Ozdowska and Vincent Claveau	Inferring Syntactic Rules for Word Alignment through Inductive Logic Programming

	Session O11 - Authoring Tools and Text Analysis	Chairperson : Michael Kipp
16:45-17:05	Jennifer Pedler and Roger Mitton	A Large List of Confusion Sets for Spellchecking Assessed Against a Corpus of Real-word Errors
17:05-17:25	Na-Rae Han, Joel Tetreault, Soo-Hwa Lee and Jin-Young Ha	Using an Error-Annotated Learner Corpus to Develop an ESL/EFL Error Correction System
17:25-17:45	Alberto Barrón-Cedeño, Martin Potthast, Paolo Rosso, Benno Stein and Andreas Eiselt	Corpus and Evaluation Measures for Automatic Plagiarism Detection
17:45-18:05	Philip van Oosten, Dries Tanghe and Véronique Hoste	Towards an Improved Methodology for Automated Readability Prediction

	Session O12 - Parsing	Chairperson : Yoshihiko Hayashi
16:45-17:05	Danielle Ben-Gera, Yi Zhang and Valia Kordoni	Semantic Feature Engineering for Enhancing Disambiguation Performance in Deep Linguistic Processing
17:05-17:25	Jordi Atserias, Giuseppe Attardi, Maria Simi and Hugo Zaragoza	Active Learning for Building a Corpus of Questions for Parsing
17:25-17:45	Eckhard Bick	FrAG, a Hybrid Constraint Grammar Parser for French
17:45-18:05	Elaine Uí Dhonnchadha and Josef Van Genabith	Partial Dependency Parsing for Irish

	Session O13 - Ontologies	Chairperson : Thierry Declerck
18:10-18:30	Marta Tatu and Dan Moldovan	Inducing Ontologies from Folksonomies using Natural Language Understanding
18:30-18:50	Vivi Nastase, Michael Strube, Benjamin Boerschinger, Caecilia Zirn and Anas Elghafari	WikiNet: A Very Large Scale Multi-Lingual Concept Network
18:50-19:10	Gosse Bouma	Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia
19:10-19:30	Matthias Hartung and Anette Frank	A Semi-supervised Type-based Classification of Adjectives: Distinguishing Properties and Relations

	Session O14 - Terminology, Corpus and Lexicon	Chairperson : Adam Kilgarriff
18:10-18:30	Sylviane Cardey, Krzysztof Bogacki, Xavier Blanco and Ruslan Mitkov	Resources for Controlled Languages for Alert Messages and Protocols in the European Perspective
18:30-18:50	Klaar Vanopstal, Bart Desmet and Véronique Hoste	Towards a Learning Approach for Abbreviation Detection and Resolution.
18:50-19:10	Bruno Cartoni and Pierre Zweigenbaum	Semi-Automated Extension of a Specialized Medical Lexicon for French
19:10-19:30	Rogelio Nazar and Maarten Janssen	Combining Resources: Taxonomy Extraction from Multiple Dictionaries

	Session O15 - Trends in Speech Databases	Chairperson : Felix Burkhardt
18:10-18:30	Toomas Altosaar, Louis ten Bosch, Guillaume Aimetti, Christos Koniaris, Kris Demuynck and Henk van den Heuvel	A Speech Corpus for Modeling Language Acquisition: CAREGIVER
18:30-18:50	Florian Schiel	BAStat : New Statistical Resources at the Bavarian Archive for Speech Signals
18:50-19:10	Kseniya Zablotskaya, Steffen Walter and Wolfgang Minker	Speech Data Corpus for Verbal Intelligence Estimation
19:10-19:30	Janne Bondi Johannessen, Kristin Hagen, Anders Nøklestad and Joel Priestley	Enhancing Language Resources with Maps

	Session O16 - LRs: Infrastructures and Strategies	Chairperson : Hans Uszkoreit
9:45-10:05	Christopher Cieri and Mark Liberman	Adapting to Trends in Language Resource Development: A Progress Report on LDC Activities
10:05-10:25	Victoria Arranz and Khalid Choukri	ELRA’s Services 15 Years on...Sharing and Anticipating the Community
10:25-10:45	Nicoletta Calzolari and Claudia Soria	Preparing the field for an Open Resource Infrastructure: the role of the FLaReNet Network of Excellence
10:45-11:05	Jonathan H. Clark and Alon Lavie	LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows
11:05-11:25	Zhiyi Song, Stephanie Strassel, Gary Krug and Kazuaki Maeda	Enhanced Infrastructure for Creation and Collection of Translation Resources

	Session O17 - Opinion Mining and Emotions	Chairperson : Nick Campbell
9:45-10:05	Lun-Wei Ku, Ting-Hao Huang and Hsin-Hsi Chen	Construction of a Chinese Opinion Treebank
10:05-10:25	Alexander Pak and Patrick Paroubek	Twitter as a Corpus for Sentiment Analysis and Opinion Mining
10:25-10:45	Isa Maks and Piek Vossen	Annotation Scheme and Gold Standard for Dutch Subjective Adjectives
10:45-11:05	Matthieu Vernier, Laura Monceaux and Béatrice Daille	Learning Subjectivity Phrases missing from Resources through a Large Set of Semantic Tests
11:05-11:25	Carlo Strapparava, Marco Guerini and Oliviero Stock	Predicting Persuasiveness in Political Discourses

	Session O18 - Information Extraction	Chairperson : Nancy Ide
9:45-10:05	Yassine Benajiba and Imed Zitouni	Arabic Word Segmentation for Better Unit of Analysis
10:05-10:25	Xabier Saralegi and Maddalen Lopez de Lacalle	Dictionary and Monolingual Corpus-based Query Translation for Basque-English CLIR
10:25-10:45	Jana Straková and Pavel Pecina	Czech Information Retrieval with Syntax-based Language Models
10:45-11:05	Lukas Michelbacher, Florian Laws, Beate Dorow, Ulrich Heid and Hinrich Schütze	Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure
11:05-11:25	Walid Magdy, Jinming Min, Johannes Leveling and Gareth J. F. Jones	Building a Domain-specific Document Collection for Evaluating Metadata Effects on Information Retrieval

	Session O19 - Semantics	Chairperson : Evelyne Viegas
9:45-10:05	Torsten Zesch and Iryna Gurevych	The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures
10:05-10:25	Sabine Schulte im Walde	Comparing Computational Models of Selectional Preferences - Second-order Co-Occurrence vs. Latent Semantic Clusters
10:25-10:45	Daisuke Kawahara and Sadao Kurohashi	Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation
10:45-11:05	Ziqi Zhang, Anna Lisa Gentile, Lei Xia, José Iria and Sam Chapman	A Random Graph Walk based Approach to Computing Semantic Relatedness Using Knowledge from Wikipedia
11:05-11:25	Kathrin Baker, Michael Bloodgood, Bonnie Dorr, Nathaniel W. Filardo, Lori Levin and Christine Piatko	A Modality Lexicon and its use in Automatic Tagging

	Session O20 - Discourse Annotation and Parsing	Chairperson : Aravind Joshi
11:45-12:05	Nathanael Chambers and Dan Jurafsky	A Database of Narrative Schemas
12:05-12:25	Markus Egg and Gisela Redeker	How Complex is Discourse Structure?
12:25-12:45	Bonaventura Coppola and Alessandro Moschitti	A General Purpose FrameNet-based Shallow Semantic Parser
12:45-13:05	Daniel Cer, Marie-Catherine de Marneffe, Dan Jurafsky and Chris Manning	Parsing to Stanford Dependencies: Trade-offs between Speed and Accuracy

	Session O21 - Emotion, Sentiment	Chairperson : Inma Hernaez Rioja
11:45-12:05	Alexander Schmitt, Gregor Bertrand, Tobias Heinroth, Wolfgang Minker and Jackson Liscombe	WITcHCRafT: A Workbench for Intelligent exploraTion of Human ComputeR conversaTions
12:05-12:25	Ulli Waltinger	GermanPolarityClues: A Lexical Resource for German Sentiment Analysis
12:25-12:45	Björn Schuller, Riccardo Zaccarelli, Nicolas Rollet and Laurence Devillers	CINEMO ― A French Spoken Language Resource for Complex Emotions: Facts and Baselines
12:45-13:05	Gregor Bertrand, Florian Nothdurft, Steffen Walter, Andreas Scheck, Henrik Kessler and Wolfgang Minker	Towards Investigating Effective Affective Dialogue Strategies

	Session O22 - Corpus Building, Annotation and Methodology	Chairperson : Dimitrios Kokkinasis
11:45-12:05	Martin Volk, Noah Bubenhofer, Adrian Althaus, Maya Bangerter, Lenz Furrer and Beni Ruef	Challenges in Building a Multilingual Alpine Heritage Corpus
12:05-12:25	Marc Carmen, Paul Felt, Robbie Haertel, Deryle Lonsdale, Peter McClanahan, Owen Merkling, Eric Ringger and Kevin Seppi	Tag Dictionaries Accelerate Manual Annotation
12:25-12:45	Dan Flickinger, Stephan Oepen and Gisle Ytrestøl	WikiWoods: Syntacto-Semantic Annotation for English Wikipedia
12:45-13:05	Hai Zhao, Yan Song and Chunyu Kit	How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method

	Session O23 - Broadcast News	Chairperson : Carmen García-Mateo
11:45-12:05	Luis Javier Rodríguez-Fuentes, Mikel Penagarikano, Germán Bordel, Amparo Varona and Mireia Díez	KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems
12:05-12:25	Yannick Estève, Thierry Bazillon, Jean-Yves Antoine, Frédéric Béchet and Jérôme Farinas	The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News
12:25-12:45	Kwanchiva Saykham, Ananlada Chotimongkol and Chai Wutiwiwatchai	Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System
12:45-13:05	Doris Baum, Daniel Schneider, Rolf Bardeli, Jochen Schwenninger, Barbara Samlowski, Thomas Winkler and Joachim Köhler	DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain

	Session O24 - Machine Translation	Chairperson : Atsushi Fuji
14:55-15:15	Vamshi Ambati, Stephan Vogel and Jaime Carbonell	Active Learning and Crowd-Sourcing for Machine Translation
15:15-15:35	Sara Stymne and Lars Ahrenberg	Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation
15:35-15:55	Hiroyuki Kaji, Takashi Tsunakawa and Daisuke Okada	Using Comparable Corpora to Adapt a Translation Model to Domains
15:55-16:15	Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M. Strassel and Kazuaki Maeda	Enriching Word Alignment with Linguistic Tags
16:15-16:35	Sisay Adugna and Andreas Eisele	English ― Oromo Machine Translation: An Experiment Using a Statistical Approach

	Session O25 - Emotion, Sentiment - Special Session	Chairperson :
14:55-15:15	Stefano Baccianella, Andrea Esuli and Fabrizio Sebastiani	SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining
15:15-15:35	Mátyás Brendel, Riccardo Zaccarelli and Laurence Devillers	Building a System for Emotions Detection from Speech to Control an Affective Avatar
15:35-15:55	Martijn Goudbeek and Mirjam Broersma	The Demo / Kemo Corpus: A Principled Approach to the Study of Cross-cultural Differences in the Vocal Expression and Perception of Emotion
15:55-16:15	Alexandra Balahur, Ralf Steinberger, Mijail Kabadjov, Vanni Zavarella, Erik van der Goot, Matina Halkia, Bruno Pouliquen and Jenya Belyaeva	Sentiment Analysis in the News
16:15-16:35	Discussion

	Session O26 - Corpus Tools	Chairperson : Martha Palmer
14:55-15:15	Dekang Lin, Kenneth Church, Heng Ji, Satoshi Sekine, David Yarowsky, Shane Bergsma, Kailash Patil, Emily Pitler, Rachel Lathbury, Vikram Rao, Kapil Dalwani and Sushant Narsale	New Tools for Web-Scale N-grams
15:15-15:35	Verena Henrich and Erhard Hinrichs	GernEdiT - The GermaNet Editing Tool
15:35-15:55	Véronika Lux-Pogodalla, Dominique Besagni and Karën Fort	FastKwic, an “Intelligent“ Concordancer Using FASTR
15:55-16:15	Giuseppe Attardi, Stefano Dei Rossi, Giulia Di Pietro, Alessandro Lenci, Simonetta Montemagni and Maria Simi	A Resource and Tool for Super-sense Tagging of Italian Texts
16:15-16:35	Richard Schwarz, Hinrich Schütze, Fabienne Martin and Achim Stein	Identification of Rare & Novel Senses Using Translations in a Parallel Corpus

	Session O27 - Lexicon, Morphology	Chairperson : Sonja Bosch
14:55-15:15	Johannes Handl and Carsten Weber	A Multilayered Declarative Approach to Cope with Morphotactics and Allomorphy in Derivational Morphology
15:15-15:35	Helena Blancafort	Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica
15:35-15:55	Nuria Gala, Véronique Rey and Michael Zock	A Tool for Linking Stems and Conceptual Fragments to Enhance word Access
15:55-16:15	Patrice Lopez and Laurent Romary	GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains
16:15-16:35	Wauter Bosma and Piek Vossen	Bootstrapping Language Neutral Term Extraction

	Session O28 - Syntax and Semantics	Chairperson : António Branco
16:55-17:15	Ineke Schuurman, Véronique Hoste and Paola Monachesi	Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch
17:15-17:35	Anne Vilnat, Patrick Paroubek, Eric Villemonte de la Clergerie, Gil Francopoulo and Marie-Laure Guénot	PASSAGE Syntactic Representation: a Minimal Common Ground for Evaluation
17:35-17:55	Sara Rosenthal, William Lipovsky, Kathleen McKeown, Kapil Thadani and Jacob Andreas	Towards Semi-Automated Annotation for Prepositional Phrase Attachment
17:55-18:15	Max Jakob, Markéta Lopatková and Valia Kordoni	Mapping between Dependency Structures and Compositional Semantic Representations

	Session O29 - Metadata	Chairperson : Dafydd Gibbon
16:55-17:15	Raheel Nawaz, Paul Thompson, John McNaught and Sophia Ananiadou	Meta-Knowledge Annotation of Bio-Events
17:15-17:35	Christopher Cieri, Khalid Choukri, Nicoletta Calzolari, D. Terence Langendoen, Johannes Leveling, Martha Palmer, Nancy Ide and James Pustejovsky	A Road Map for Interoperable Language Resource Metadata
17:35-17:55	Josef Ruppenhofer, Caroline Sporleder and Fabian Shirokov	Speaker Attribution in Cabinet Protocols
17:55-18:15	Katrin Tomanek and Udo Hahn	Annotation Time Stamps ― Temporal Metadata from the Linguistic Annotation Process

	Session O30 - Tagging	Chairperson : Reinhard Rapp
16:55-17:15	Markus Dickinson and Charles Jochim	Evaluating Distributional Properties of Tagsets
17:15-17:35	Kais Dukes and Nizar Habash	Morphological Annotation of Quranic Arabic
17:35-17:55	Emad Mohamed and Sandra Kübler	Arabic Part of Speech Tagging
17:55-18:15	Tomaž Erjavec	MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

	Session O31 - Multimodal Annotation	Chairperson : Jean Claude Martin
16:55-17:15	Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria and David Traum	Towards an ISO Standard for Dialogue Act Annotation
17:15-17:35	Volha Petukhova and Harry Bunt	Towards an Integrated Scheme for Semantic Annotation of Multimodal Dialogue Data
17:35-17:55	Pierre Tirilly, Vincent Claveau and Patrick Gros	News Image Annotation on a Large Parallel Text-image Corpus
17:55-18:15	Isabella Poggi, Francesca D'Errico and Laura Vincze	Types of Nods. The Polysemy of a Social Signal

	Session O32 - Lexicon	Chairperson : German Rigau
18:20-18:40	Núria Bel	Handling of Missing Values in Lexical Acquisition
18:40-19:00	Josef Ruppenhofer, Jonas Sunde and Manfred Pinkal	Generating FrameNets of Various Granularities: The FrameNet Transformer
19:00-19:20	Benoît Sagot	The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French
19:20-19:40	Diego De Cao, Danilo Croce and Roberto Basili	Extensive Evaluation of a FrameNet-WordNet mapping resource

	Session O33 - Question Answering	Chairperson : Gilles Adda
18:20-18:40	Guillaume Bernard, Sophie Rosset, Martine Adda-Decker and Olivier Galibert	A Question-answer Distance Measure to Investigate QA System Progress
18:40-19:00	Peter Adolphs, Xiwen Cheng, Tina Klüwer, Hans Uszkoreit and Feiyu Xu	Question Answering Biographic Information and Social Network Powered by the Semantic Web
19:00-19:20	Nicolas Moreau, Olivier Hamon, Djamel Mostefa, Sophie Rosset, Olivier Galibert, Lori Lamel, Jordi Turmo, Pere R. Comas, Paolo Rosso, Davide Buscaldi and Khalid Choukri	Evaluation Protocol and Tools for Question-Answering on Speech Transcripts
19:20-19:40	Pamela Forner, Danilo Giampiccolo, Bernardo Magnini, Anselmo Peñas, Álvaro Rodrigo and Richard Sutcliffe	Evaluating Multilingual Question Answering Systems at CLEF

	Session O34 - Endangered Languages	Chairperson : Richard Sproat
18:20-18:40	Lene Antonsen, Trond Trosterud and Linda Wiechetek	Reusing Grammatical Resources for New Languages
18:40-19:00	Fei Xia, Carrie Lewis and William D. Lewis	The Problems of Language Identification within Hugely Multilingual Data Sets
19:00-19:20	Enikő Héja	The Role of Parallel Corpora in Bilingual Lexicography
19:20-19:40	Cheikh M. Bamba Dione, Jonas Kuhn and Sina Zarrieß	Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal)

	Session O35 - Disordered Speech Corpus	Chairperson : Florian Schiel
18:20-18:40	Oscar Saz, Eduardo Lleida, Carlos Vaquero and W.-Ricardo Rodríguez	The Alborada-I3A Corpus of Disordered Speech
18:40-19:00	Jakob Schou Pedersen and Lars Bo Larsen	A Speech Corpus for Dyslexic Reading Training
19:00-19:20	Caroline Williams, Andrew Thwaites, Paula Buttery, Jeroen Geertzen, Billi Randall, Meredith Shafto, Barry Devereux and Lorraine Tyler	The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals
19:20-19:40	Cécile Fougeron, Lise Crevier-Buchman, Corinne Fredouille, Alain Ghio, Christine Meunier, Claude Chevrie-Muller, Jean-Francois Bonastre, Antonia Colazo-Simon, Céline Delooze, Danielle Duez, Cédric Gendrot, Thierry Legou, Nathalie Lévêque, Claire Pillot-Loiseau, Serge Pinto, Gilles Pouchoulin, Danièle Robert, Jacqueline Vaissière, François Viallet and Coralie Vincent	The DesPho-APaDy Project: Developing an Acoustic-phonetic Characterization of Dysarthric Speech in French

	Session O36 - National and International projects	Chairperson : Marko Tadić
9:45-10:05	Marina B. Ruiter, Toni C. M. Rietveld, Catia Cucchiarini, Emiel J. Krahmer and Helmer Strik	Human Language Technology and Communicative Disabilities: Requirements and Possibilities for the Future
10:05-10:25	Aditi Sharma Grover, Gerhard B. van Huyssteen and Marthinus W. Pretorius	The South African Human Language Technologies Audit
10:25-10:45	Swaran Lata and Somnath Chandra Vijay Kumar	Development of Linguistic Resources and Tools for Providing Multilingual Solutions in Indian Languages ― A Report on National Initiative
10:45-11:05	Peter Spyns and Elisabeth D'Halleweyn	Flemish-Dutch HLT Policy: Evolving to New Forms of Collaboration
11:05-11:25	Bente Maegaard, Mohamed Attia, Khalid Choukri, Olivier Hamon, Steven Krauwer and Mustafa Yaseen	Cooperation for Arabic Language Resources and Tools ― The MEDAR Project

	Session O37 - Machine Translation	Chairperson : Gudrun Magnusdottir
9:45-10:05	Andreas Eisele and Yu Chen	MultiUN: A Multilingual Corpus from United Nation Documents
10:05-10:25	Chi-kiu Lo and Dekai Wu	Evaluating Machine Translation Utility via Semantic Role Labels
10:25-10:45	William D. Lewis, Chris Wendt and David Bullock	Achieving Domain Specificity in SMT without Overt Siloing
10:45-11:05	Billy Tak-Ming Wong	Semantic Evaluation of Machine Translation
11:05-11:25	David Guthrie, Mark Hepple and Wei Liu	Efficient Minimal Perfect Hash Language Models

	Session O38 - Corpus Tools	Chairperson : Oi Yee Kwong
9:45-10:05	Ting Qian, Kristy Hollingshead, Su-youn Yoon, Kyoung-young Kim and Richard Sproat	A Python Toolkit for Universal Transliteration
10:05-10:25	Sowmya V. B., Monojit Choudhury, Kalika Bali, Tirthankar Dasgupta and Anupam Basu	Resource Creation for Training and Testing of Transliteration Systems for Indian Languages
10:25-10:45	Fabienne Fritzinger, Marion Weller and Ulrich Heid	A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level
10:45-11:05	Meghan Lammie Glenn, Stephanie M. Strassel, Haejoong Lee, Kazuaki Maeda, Ramez Zakhary and Xuansong Li	Transcription Methods for Consistency, Volume and Efficiency
11:05-11:25	Muhammad Kamran Malik, Tafseer Ahmed, Sebastian Sulger, Tina Bögel, Atif Gulzar, Ghulam Raza, Sarmad Hussain and Miriam Butt	Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar

	Session O39 - Information Extraction	Chairperson : Martine Adda-Decker
9:45-10:05	Ralph Grishman	The Impact of Task and Corpus on Event Extraction Systems
10:05-10:25	Darja Fišer, Senja Pollak and Špela Vintar	Learning to Mine Definitions from Slovene Structured and Unstructured Knowledge-Rich Resources
10:25-10:45	Silvana Marianela Bernaola Biggio, Manuela Speranza and Roberto Zanoli	Entity Mention Detection using a Combination of Redundancy-Driven Classifiers
10:45-11:05	Klaar Vanopstal, Robert Vander Stichele, Godelieve Laureys and Joost Buysschaert	Assessing the Impact of English Language Skills and Education Level on PubMed Searches by Dutch-speaking Users
11:05-11:25	Andre Blessing and Hinrich Schütze	Fine-Grained Geographical Relation Extraction from Wikipedia

	Session O40 - Ontologies	Chairperson : Christopher Brewster
11:45-12:05	Ekaterina Ovchinnikova, Laure Vieu, Alessandro Oltramari, Stefano Borgo and Theodore Alexandrov	Data-Driven and Ontological Analysis of FrameNet for Natural Language Reasoning
12:05-12:25	Hans-Ulrich Krieger	A General Methodology for Equipping Ontologies with Time
12:25-12:45	Dan Tufiş and Dan Ştefănescu	A Differential Semantics Approach to the Annotation of Synsets in WordNet
12:45-13:05	Bolette S. Pedersen, Sanni Nimb and Anna Braasch	Merging Specialist Taxonomies and Folk Taxonomies in Wordnets - A case Study of Plants, Animals and Foods in the Danish Wordnet
13:05-13:25	Mithun Balakrishna, Dan Moldovan, Marta Tatu and Marian Olteanu	Semi-Automatic Domain Ontology Creation from Text Resources

	Session O41 - Multiword Expressions and Collocations	Chairperson : Benjamin Tsou
11:45-12:05	Marion Weller and Ulrich Heid	Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
12:05-12:25	Stefania Spina	The Dictionary of Italian Collocations: Design and Integration in an Online Learning Environment
12:25-12:45	Margarita Alonso Ramos, Leo Wanner, Orsolya Vincze, Gerard Casamayor del Bosque, Nancy Vázquez Veiga, Estela Mosqueira Suárez and Sabela Prieto González	Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora
12:45-13:05	Ulrich Heid, Fabienne Fritzinger, Erhard Hinrichs, Marie Hinrichs and Thomas Zastrow	Term and Collocation Extraction by Means of Complex Linguistic Web Services
13:05-13:25	Francesca Bonin, Felice Dell'Orletta, Simonetta Montemagni and Giulia Venturi	A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora

	Session O42 - Word Sense Disambiguation	Chairperson : Anne Vilnat
11:45-12:05	Amal Zouaq, Michel Gagnon and Benoit Ozell	Can Syntactic and Logical Graphs help Word Sense Disambiguation?
12:05-12:25	Susan Windisch Brown, Travis Rood and Martha Palmer	Number or Nuance: Which Factors Restrict Reliable Word Sense Annotation?
12:25-12:45	Rebecca J. Passonneau, Ansaf Salleb-Aoussi, Vikas Bhardwaj and Nancy Ide	Word Sense Annotation of Polysemous Words by Multiple Annotators
12:45-13:05	Sanaz Jabbari, Mark Hepple and Louise Guthrie	Evaluating Lexical Substitution: Analysis and New Measures
13:05-13:25	Ekaterina Shutova and Simone Teufel	Metaphor Corpus Annotated for Source - Target Domain Mappings

	Session O43 - Speech Corpus Processing	Chairperson : Catia Cucchiarini
11:45-12:05	Philippe Blache, Roxane Bertrand, Mathilde Guardiola, Marie-Laure Guénot, Christine Meunier, Irina Nesterenko, Berthille Pallaud, Laurent Prévot, Béatrice Priego-Valverde and Stéphane Rauzy	The OTIM Formal Annotation Model: A Preliminary Step before Annotation Scheme
12:05-12:25	Grégory Senay, Georges Linarès, Benjamin Lecouteux, Stanislas Oger and Thierry Michel	Transcriber Driving Strategies for Transcription Aid System
12:25-12:45	Rena Nemoto, Martine Adda-Decker and Jacques Durand	Word Boundaries in French: Evidence from Large Speech Corpora
12:45-13:05	Christina Leitner, Martin Schickbichler and Stefan Petrik	Example-Based Automatic Phonetic Transcription
13:05-13:25	Brigitte Bigi, Christine Meunier, Irina Nesterenko and Roxane Bertrand	Automatic Detection of Syllable Boundaries in Spontaneous Speech

	Session O44 - Web Services	Chairperson : Virach Sornlertlamvanich
14:55-15:15	Arif Bramantoro, Ulrich Schäfer and Toru Ishida	Towards an Integrated Architecture for Composite Language Services and Multiple Linguistic Processing Components
15:15-15:35	Marta Villegas, Núria Bel, Santiago Bel and Víctor Rodríguez	A Case Study on Interoperability for Language Resources and Applications
15:35-15:55	Nancy Ide, Keith Suderman and Brian Simms	ANC2Go: A Web Application for Customized Corpus Creation
15:55-16:15	Yohei Murakami, Donghui Lin, Masahiro Tanaka, Takao Nakaguchi and Toru Ishida	Language Service Management with the Language Grid
16:15-16:35	Jennifer DeCamp	Language Technology Resource Center

	Session O45 - Textual Entailment and Question Answering	Chairperson : Jerry Hobbs
14:55-15:15	Louise Deléger and Pierre Zweigenbaum	Identifying Paraphrases between Technical and Lay Corpora
15:15-15:35	Luisa Bentivogli, Elena Cabrio, Ido Dagan, Danilo Giampiccolo, Medea Lo Leggio and Bernardo Magnini	Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference
15:35-15:55	Milen Kouylekov, Yashar Mehdad and Matteo Negri	Mining Wikipedia for Large-scale Repositories of Context-Sensitive Entailment Rules
15:55-16:15	Daniel Sonntag and Bogdan Sacaleanu	Speech Grammars for Textual Entailment Patterns in Multimodal Question Answering
16:15-16:35	Anne Garcia-Fernandez, Sophie Rosset and Anne Vilnat	MACAQ : A Multi Annotated Corpus to Study how we Adapt Answers to Various Questions

	Session O46 - Discourse Annotation	Chairperson : Harry Bunt
14:55-15:15	Silvia Pareti and Irina Prodanof	Annotating Attribution Relations: Towards an Italian Discourse Treebank
15:15-15:35	Charles Teissèdre, Delphine Battistelli and Jean-Luc Minel	Resources for Calendar Expressions Semantic Tagging and Temporal Navigation through Texts
15:35-15:55	Stergos Afantenos, Pascal Denis, Philippe Muller and Laurence Danlos	Learning Recursive Segments for Discourse Parsing
15:55-16:15	Gerlof Bouma, Lilja Øvrelid and Jonas Kuhn	Towards a Large Parallel Corpus of Cleft Constructions
16:15-16:35	Livio Robaldo, Eleni Miltsakaki and Alessia Bianchini	Corpus-based Semantics of Concession: Where do Expectations Come from?

	Session O47 - Named Entity Recognition	Chairperson : Lluis Padrò
14:55-15:15	Mark Arehart	Indexing Methods for Faster and More Effective Person Name Search
15:15-15:35	Asif Ekbal and Sriparna Saha	Maximum Entropy Classifier Ensembling using Genetic Algorithm for NER in Bengali
15:35-15:55	Mohammed Attia, Antonio Toral, Lamia Tounsi, Monica Monachini and Josef van Genabith	An Automatically Built Named Entity Lexicon for Arabic
15:55-16:15	Agata Savary, Jakub Waszczuk and Adam Przepiórkowski	Towards the Annotation of Named Entities in the National Corpus of Polish
16:15-16:35	Cláudia Freitas, Cristina Mota, Diana Santos, Hugo Gonçalo Oliveira and Paula Carvalho	Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese

	Session P1 - Anaphora, Coreference and Evaluation	Chair : Antonio Pareja-Lora
11:35-13:15	Ruud Koolen and Emiel Krahmer	The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms
11:35-13:15	Azad Abad, Luisa Bentivogli, Ido Dagan, Danilo Giampiccolo, Shachar Mirkin, Emanuele Pianta and Asher Stern	A Resource for Investigating the Impact of Anaphora and Coreference on Inference.
11:35-13:15	Cristina Nicolae, Gabriel Nicolae and Kirk Roberts	C-3: Coherence and Coreference Corpus
11:35-13:15	Claudiu Mihăilă, Iustina Ilisei and Diana Inkpen	Romanian Zero Pronoun Distribution: A Comparative Study
11:35-13:15	Marta Recasens, Eduard Hovy and M. Antònia Martí	A Typology of Near-Identity Relations for Coreference (NIDENT)
11:35-13:15	Kepa Joseba Rodríguez, Francesca Delogu, Yannick Versley, Egon W. Stemle and Massimo Poesio	Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus
11:35-13:15	Samuel Broscheit, Simone Paolo Ponzetto, Yannick Versley and Massimo Poesio	Extending BART to Provide a Coreference Resolution System for German
11:35-13:15	Jiří Mírovský, Petr Pajas and Anna Nedoluzhko	Annotation Tool for Extended Textual Coreference and Bridging Anaphora
11:35-13:15	Petya Osenova, Laska Laskova and Kiril Simov	Exploring Co-Reference Chains for Concept Annotation of Domain Texts
11:35-13:15	Heather Simpson, Stephanie Strassel, Robert Parker and Paul McNamee	Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population

	Session P2 - Tools, Systems and Evaluation	Chair : Marc Verhagen
11:35-13:15	Athanasios Karasimos and Evanthia Petropoulou	A Crash Test with Linguistica in Modern Greek: The Case of Derivational Affixes and Bound Stems
11:35-13:15	Anil Kumar Singh and Bharat Ram Ambati	An Integrated Digital Tool for Accessing Language Resources
11:35-13:15	Paul Felt, Owen Merkling, Marc Carmen, Eric Ringger, Warren Lemmon, Kevin Seppi and Robbie Haertel	CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development
11:35-13:15	Rüdiger Gleim and Alexander Mehler	Computational Linguistics for Mere Mortals - Powerful but Easy-to-use Linguistic Processing for Scientists in the Humanities
11:35-13:15	Bernd Bohnet and Leo Wanner	Open Soucre Graph Transducer Interpreter and Grammar Development Environment
11:35-13:15	Federico Sangati, Willem Zuidema and Rens Bod	Efficiently Extract Rrecurring Tree Fragments from Large Treebanks
11:35-13:15	José João Almeida, André Santos and Alberto Simões	Bigorna -- A Toolkit for Orthography Migration Challenges
11:35-13:15	Carl Christensen, Ross Hendrickson and Deryle Lonsdale	Principled Construction of Elicited Imitation Tests
11:35-13:15	Jan Jona Javoršek and Tomaž Erjavec	Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
11:35-13:15	Peter Nabende	Applying a Dynamic Bayesian Network Framework to Transliteration Identification

	Session P3 - Lexical Resources	Chair : Anna Braasch
11:35-13:15	Adrien Lardilleux, Julien Gosme and Yves Lepage	Bilingual Lexicon Induction: Effortless Evaluation of Word Alignment Tools and Production of Resources for Improbable Language Pairs
11:35-13:15	Akira Utsumi	Exploring the Relationship between Semantic Spaces and Semantic Relations
11:35-13:15	C. Anton Rytting, Paul Rodrigues, Tim Buckwalter, David Zajic, Bridget Hirsch, Jeff Carnes, Nathanael Lynn, Sarah Wayland, Chris Taylor, Jason White, Charles Blake III, Evelyn Browne, Corey Miller and Tristan Purvis	Error Correction for Arabic Dictionary Lookup
11:35-13:15	Noureddine Loukil, Kais Haddar and Abdelmajid Benhamadou	A Syntactic Lexicon for Arabic Verbs
11:35-13:15	Amit Kirschenbaum and Shuly Wintner	A General Method for Creating a Bilingual Transliteration Dictionary
11:35-13:15	Thomas Proisl and Besim Kabashi	Using High-Quality Resources in NLP: The Valency Dictionary of English as a Resource for Left-Associative Grammars
11:35-13:15	Grigori Sidorov, Alberto Barrón-Cedeño and Paolo Rosso	English-Spanish Large Statistical Dictionary of Inflectional Forms
11:35-13:15	Majdi Sawalha and Eric Atwell	Constructing and Using Broad-coverage Lexical Resource for Enhancing Morphological Analysis of Arabic
11:35-13:15	Rania Al-Sabbagh and Roxana Girju	Mining the Web for the Induction of a Dialectical Arabic Lexicon
11:35-13:15	Benoît Sagot, Laurence Danlos and Rosa Stern	A Lexicon of French Quotation Verbs for Automatic Quotation Extraction
11:35-13:15	Benoît Sagot and Géraldine Walther	A Morphological Lexicon for the Persian Language
11:35-13:15	Jana Šindlerová and Ondřej Bojar	Building a Bilingual ValLex Using Treebank Token Alignment: First Observations
11:35-13:15	Óscar Ferrández, Michael Ellsworth, Rafael Muñoz and Collin F. Baker	Aligning FrameNet and WordNet based on Semantic Neighborhoods
11:35-13:15	Anca Dinu	Building a Generative Lexicon for Romanian
11:35-13:15	Hiroaki SATO	How FrameSQL Shows the Japanese FrameNet Data
11:35-13:15	Svetla Koeva	Lexicon and Grammar in Bulgarian FrameNet
11:35-13:15	Bento Carlos Dias-da-Silva and Ariani Di-Felippo	REBECA: Turning WordNet Databases into ""Ontolexicons""
11:35-13:15	Karel Pala, Christiane Fellbaum and Sonja Bosch	Lexical Resources for Noun Compounds in Czech, English and Zulu
11:35-13:15	Michael Gasser	Expanding the Lexicon for a Resource-Poor Language Using a Morphological Analyzer and a Web Crawler
11:35-13:15	Gerard de Melo and Gerhard Weikum	Providing Multilingual, Multimodal Answers to Lexical Database Queries
11:35-13:15	Sabine Ploux, Armelle Boussidan and Hyungsuk Ji	The Semantic Atlas: an Interactive Model of Lexical Representation

	Session P4 - Web Services	Chair : Bruno Cartoni
14:45-16:25	Adam Funk and Kalina Bontcheva	Ontology-Based Categorization of Web Services with Machine Learning
14:45-16:25	Marie Hinrichs, Thomas Zastrow and Erhard Hinrichs	WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure
14:45-16:25	Ulrich Heid, Helmut Schmid, Kerstin Eckart and Erhard Hinrichs	A Corpus Representation Format for Linguistic Web Services: The D-SPIN Text Corpus Format and its Relationship with ISO Standards
14:45-16:25	Donghui Lin, Yoshiaki Murakami, Toru Ishida, Yohei Murakami and Masahiro Tanaka	Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes
14:45-16:25	Bora Savas, Yoshihiko Hayashi, Monica Monachini, Claudia Soria and Nicoletta Calzolari	An LMF-based Web Service for Accessing WordNet-type Semantic Lexicons
14:45-16:25	Virach Sornlertlamvanich, Thatsanee Charoenporn and Hitoshi Isahara	Language Resource Management System for Asian WordNet Collaboration and Its Web Service Application

	Session P5 - Named Entity Recognition	Chair : Valia Kordoni
14:45-16:25	Rita Marinelli	Lexical Resources and Ontological Classifications for the Recognition of Proper Names Sense Extension
14:45-16:25	Damien Nouvel, Jean-Yves Antoine, Nathalie Friburger and Denis Maurel	An Analysis of the Performances of the CasEN Named Entities Recognition System in the Ester2 Evaluation Campaign
14:45-16:25	Olivier Galibert, Sophie Rosset, Xavier Tannier and Fanny Grandry	Hybrid Citation Extraction from Patents
14:45-16:25	Bart Desmet and Véronique Hoste	Towards a Balanced Named Entity Corpus for Dutch
14:45-16:25	Satoshi Sato and Sayoko Kaide	A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons
14:45-16:25	Michael Tanenblatt, Anni Coden and Igor Sominsky	The ConceptMapper Approach to Named Entity Recognition
14:45-16:25	Grzegorz Chrupała and Dietrich Klakow	A Named Entity Labeler for German: Exploiting Wikipedia and Distributional Clusters
14:45-16:25	Keith J. Miller, Sarah McLeod, Elizabeth Schroeder, Mark Arehart, Kenneth Samuel, James Finley, Vanesa Jurica and John Polk	Improving Personal Name Search in the TIGR System
14:45-16:25	Wajdi Zaghouani, Bruno Pouliquen, Mohamed Ebrahim and Ralf Steinberger	Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic
14:45-16:25	Dietrich Rebholz-Schuhmann, Antonio José Jimeno-Yepes, Erik M. van Mulligen, Ning Kang, Jan Kors, David Milward, Peter Corbett, Ekaterina Buyko, Katrin Tomanek, Elena Beisswanger and Udo Hahn	The CALBC Silver Standard Corpus for Biomedical Named Entities ― A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers
14:45-16:25	Ana Cristina Mendes, Luísa Coheur and Paula Vaz Lobo	Named Entity Recognition in Questions: Towards a Golden Collection

	Session P6 - Pronunciation Variants	Chair : Fernando Fernández Martínez
14:45-16:25	Alexander Schmitt, Tim Polzehl, Wolfgang Minker and Jackson Liscombe	The Influence of the Utterance Length on the Recognition of Aged Voices
14:45-16:25	Nikos Tsourakis, Agnes Lisowska, Manny Rayner and Pierrette Bouillon	Examining the Effects of Rephrasing User Input on Two Mobile Spoken Language Systems
14:45-16:25	Damjan Vlaj, Aleksandra Zögling Markuš, Marko Kos and Zdravko Kačič	Acquisition and Annotation of Slovenian Lombard Speech Database
14:45-16:25	Natalie D. Snoeren, Martine Adda-Decker and Gilles Adda	The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish
14:45-16:25	Jean-Luc Rouas, Mayumi Beppu and Martine Adda-Decker	Comparison of Spectral Properties of Read, Prepared and Casual Speech in French
14:45-16:25	Marijn Schraagen and Gerrit Bloothooft	Evaluating Repetitions, or how to Improve your Multilingual ASR System by doing Nothing
14:45-16:25	Elena Grishina, Svetlana Savchuk and Alexej Poljakov	Design and Data Collection for the Accentological Corpus of the Russian Language
14:45-16:25	Siim Orasmaa, Reina Käärik, Jaak Vilo and Tiit Hennoste	Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance

	Session P7 - Multiword Expressions and Collocations	Chair : Beatrice Daille
14:45-16:25	Meng Wang, Chu-Ren Huang, Shiwen Yu and Weiwei Sun	Automatic Acquisition of Chinese Novel Noun Compounds
14:45-16:25	Luka Nerima, Eric Wehrli and Violeta Seretan	A Recursive Treatment of Collocations
14:45-16:25	Caroline Sporleder, Linlin Li, Philip Gorinski and Xaver Koch	Idioms in Context: The IDIX Corpus
14:45-16:25	Laura Street, Nathan Michalov, Rachel Silverstein, Michael Reynolds, Lurdes Ruela, Felicia Flowers, Angela Talucci, Priscilla Pereira, Gabriella Morgon, Samantha Siegel, Marci Barousse, Antequa Anderson, Tashom Carroll and Anna Feldman	Like Finding a Needle in a Haystack: Annotating the American National Corpus for Idiomatic Expressions
14:45-16:25	Andrea Zaninello and Malvina Nissim	Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian
14:45-16:25	Carlos Ramisch, Aline Villavicencio and Christian Boitet	mwetoolkit: a Framework for Multiword Expression Identification
14:45-16:25	Junko Kubo, Keita Tsuji and Shigeo Sugimoto	Automatic Term Recognition Based on the Statistical Differences of Relative Frequencies in Different Corpora

	Session P8 - Validation of Language Resources	Chair : Zygmunt Vetulani
14:45-16:25	Claire Gardent and Alejandra Lorenzo	Identifying Sources of Weakness in Syntactic Lexicon Extraction
14:45-16:25	Bharat Ram Ambati, Mridul Gupta, Samar Husain and Dipti Misra Sharma	A High Recall Error Identification Tool for Hindi Treebank Validation

	Session P9 - Grammar and Syntax	Chair : Cristina Bosco
14:45-16:25	Anne Abeillé and Danièle Godard	The Grande Grammaire du Français Project
14:45-16:25	Marina Lloberes, Irene Castellón and Lluís Padró	Spanish FreeLing Dependency Grammar
14:45-16:25	Montserrat Marimon	The Spanish Resource Grammar

	Session P10 - Morphology	Chair : Miriam Butt
16:45-18:05	Gertrud Faaß, Ulrich Heid and Helmut Schmid	Design and Application of a Gold Standard for Morphological Analysis: SMOR as an Example of Morphological Evaluation
16:45-18:05	Niraj Aswani and Robert Gaizauskas	Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages
16:45-18:05	Cvetana Krstev, Ranka Stanković and Duško Vitas	A Description of Morphological Features of Serbian: a Revision using Feature System Declaration
16:45-18:05	Çağrı Çöltekin	A Freely Available Morphological Analyzer for Turkish
16:45-18:05	Iñaki Alegria, Garbiñe Aranbarri, Klara Ceberio, Gorka Labaka, Bittor Laskurain and Ruben Urizar	A Morphological Processor Based on Foma for Biscayan (a Basque dialect)
16:45-18:05	Yugo Murawaki and Sadao Kurohashi	Online Japanese Unknown Morpheme Detection using Orthographic Variation
16:45-18:05	Bruno Cartoni and Marie-Aude Lefer	The MuLeXFoR Database: Representing Word-Formation Processes in a Multilingual Lexicographic Environment
16:45-18:05	Ting-Hao Huang, Lun-Wei Ku and Hsin-Hsi Chen	Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches
16:45-18:05	Mohamed Altantawy, Nizar Habash, Owen Rambow and Ibrahim Saleh	Morphological Analysis and Generation of Arabic Nouns: A Morphemic Functional Approach
16:45-18:05	Mehrnoush Shamsfard, Hoda Sadat Jafari and Mahdi Ilbeygi	STeP-1: A Set of Fundamental Tools for Persian Text Processing
16:45-18:05	Sara Tonelli, Emanuele Pianta, Rodolfo Delmonte and Michele Brunelli	VenPro: A Morphological Analyzer for Venetan

	Session P11 - Tools for Multimodal Corpus	Chair : Katerina Pastra
16:45-18:05	Nick Campbell and Akiko Tabata	A Software Toolkit for Viewing Annotated Multimodal Data Interactively over the Web
16:45-18:05	Nick Webb, David Benyon, Jay Bradley, Preben Hansen and Oil Mival	Wizard of Oz Experiments for a Companion Dialogue System: Eliciting Companionable Conversation
16:45-18:05	Volker Fritzsch, Stefan Scherer and Friedhelm Schwenker	An Open Source Process Engine Framework for Realtime Pattern Recognition and Information Fusion Tasks
16:45-18:05	Jens Allwood, Harald Hammarström, Andries Hendrikse, Mtholeni N. Ngcobo, Nozibele Nomdebevana, Laurette Pretorius and Mac van der Merwe	Work on Spoken (Multimodal) Language Corpora in South Africa
16:45-18:05	Eric Auer, Albert Russel, Han Sloetjes, Peter Wittenburg, Oliver Schreer, S. Masnieri, Daniel Schneider and Sebastian Tschöpel	ELAN as Flexible Annotation Framework for Sound and Image Processing Detectors

	Session P12 - Language Resource Infrastructures	Chair : Hamish Cunningham
16:45-18:05	Claus Zinn, Peter Wittenburg and Jacquelijn Ringersma	An Evolving eScience Environment for Research Data in Linguistics
16:45-18:05	Dieter Van Uytvanck, Claus Zinn, Daan Broeder, Peter Wittenburg and Mariano Gardellini	Virtual Language Observatory: The Portal to the Language Resources and Technology Universe
16:45-18:05	Adam Kilgarriff, Siva Reddy, Jan Pomikálek and Avinesh PVS	A Corpus Factory for Many Languages
16:45-18:05	Erhard Hinrichs, Verena Henrich and Thomas Zastrow	Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment
16:45-18:05	Armando Stellato, Heiko Stoermer, Stefano Bortoli, Noemi Scarpato, Andrea Turbati, Paolo Bouquet and Maria Teresa Pazienza	Maskkot ― An Entity-centric Annotation Platform
16:45-18:05	Maite Melero, Gemma Boleda, Montse Cuadros, Cristina España-Bonet, Lluís Padró, Martí Quixal, Carlos Rodríguez and Roser Saurí	Language Technology Challenges of a ‘Small’ Language (Catalan)
16:45-18:05	Lluís Padró, Miquel Collado, Samuel Reese, Marina Lloberes and Irene Castellón	FreeLing 2.1: Five Years of Open-source Language Processing Tools
16:45-18:05	Bartosz Broda, Michał Marcińczuk and Maciej Piasecki	Building a Node of the Accessible Language Technology Infrastructure
16:45-18:05	Peter Menke and Alexander Mehler	The Ariadne System: A Flexible and Extensible Framework for the Modeling and Storage of Experimental Data in the Humanities.
16:45-18:05	Nicoletta Calzolari, Claudia Soria, Riccardo Del Gratta, Sara Goggi, Valeria Quochi, Irene Russo, Khalid Choukri, Joseph Mariani and Stelios Piperidis	The LREC Map of Language Resources and Technologies
16:45-18:05	Nick Rizzolo and Dan Roth	Learning Based Java for Rapid Development of NLP Systems
16:45-18:05	Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonathan Wright, Robert Parker, David Lee and Andrea Mazzucchi	Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation
16:45-18:05	Thepchai Supnithi, Taneth Ruangrajitpakorn, Kanokorn Trakultaweekool and Peerachet Porkaew	AutoTagTCG : A Framework for Automatic Thai CG Tagging
16:45-18:05	Javier Couto, Helena Blancafort, Somara Seng, Nicolas Kuchmann-Beauger, Anass Talby and Claude de Loupy	OAL: A NLP Architecture to Improve the Development of Linguistic Resources for NLP
16:45-18:05	Girish Nath Jha	The TDIL Program and the Indian Langauge Corpora Intitiative (ILCI)
16:45-18:05	Stephanie Strassel, Dan Adams, Henry Goldberg, Jonathan Herr, Ron Keesing, Daniel Oblinger, Heather Simpson, Robert Schrag and Jonathan Wright	The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks
16:45-18:05	Adam Przepiórkowski, Rafał L. Górski, Marek Łaziński and Piotr Pęzik	Recent Developments in the National Corpus of Polish
16:45-18:05	Drahomíra ""johanka"" Spoustová, Miroslav Spousta and Pavel Pecina	Building a Web Corpus of Czech
16:45-18:05	Brigitte Jörg, Hans Uszkoreit and Alastair Burt	LT World: Ontology and Reference Information Portal

	Session P13 - Subjectivity: Sentiments, Emotions, Opinions	Chair : Silke Scheible
18:10-19:30	Vassiliki Rentoumi, Stefanos Petrakis, Manfred Klenner, George A. Vouros and Vangelis Karkaletsis	United we Stand: Improving Sentiment Analysis by Joining Machine Learning and Rule Based Methods
18:10-19:30	Plaban Kr. Bhowmick, Anupam Basu and Pabitra Mitra	Determining Reliability of Subjective and Multi-label Emotion Annotation through Novel Fuzzy Agreement Measure
18:10-19:30	Aleksander Wawer	Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning
18:10-19:30	Patrick Paroubek, Alexander Pak and Djamel Mostefa	Annotations for Opinion Mining Evaluation in the Industrial Context of the DOXA project
18:10-19:30	Huan-An Kao and Hsin-Hsi Chen	Comment Extraction from Blog Posts and Its Applications to Opinion Mining
18:10-19:30	Sophia Yat Mei Lee, Ying Chen, Shoushan Li and Chu-Ren Huang	Emotion Cause Events: Corpus Construction and Analysis
18:10-19:30	Horacio Saggion and Adam Funk	Interpreting SentiWordNet for Opinion Classification
18:10-19:30	Polina Panicheva, John Cardiff and Paolo Rosso	Personal Sense and Idiolect: Combining Authorship Attribution and Opinion Analysis
18:10-19:30	Antonio Reyes, Martin Potthast, Paolo Rosso and Benno Stein	Evaluating Humour Features on Web Comments
18:10-19:30	Shu Zhang, Wenjie Jia, Yingju Xia, Yao Meng and Hao Yu	Extracting Product Features and Sentiments from Chinese Customer Reviews
18:10-19:30	Changqin Quan and Fuji Ren	Automatic Annotation of Word Emotion in Sentences Based on Ren-CECps
18:10-19:30	Bal Krishna Bal and Patrick Saint Dizier	Towards Building Annotated Resources for Analyzing Opinions and Argumentation in News Editorials
18:10-19:30	Irene Russo	Discovering Polarity for Ambiguous and Objective Adjectives through Adverbial Modification
18:10-19:30	Željko Agić, Nikola Ljubešić and Marko Tadić	Towards Sentiment Analysis of Financial Texts in Croatian
18:10-19:30	Robert Remus, Uwe Quasthoff and Gerhard Heyer	SentiWS - A Publicly Available German-language Resource for Sentiment Analysis
18:10-19:30	Stefan Scherer, Ingo Siegert, Lutz Bigalke and Sascha Meudt	Developing an Expressive Speech Labeling Tool Incorporating the Temporal Characteristics of Emotion

	Session P14 - Word Sense Disambiguation and Evaluation	Chair : Olivier Ferret
18:10-19:30	Kyota Tsutsumida, Jun Okamoto, Shun Ishizaki, Makoto Nakatsuji, Akimichi Tanaka and Tadasu Uchiyama	Study of Word Sense Disambiguation System that uses Contextual Features - Approach of Combining Associative Concept Dictionary and Corpus -
18:10-19:30	Jun Okamoto and Shun Ishizaki	Homographic Ideogram Understanding Using Contextual Dynamic Network
18:10-19:30	Christian Scheible	An Evaluation of Predicate Argument Clustering using Pseudo-Disambiguation
18:10-19:30	Lubomir Otrusina and Pavel Smrz	A New Approach to Pseudoword Generation
18:10-19:30	Myriam Rakho and Matthieu Constant	Evaluating the Impact of Some Linguistic Information on the Performances of a Similarity-based and Translation-oriented Word-Sense Disambiguation Method
18:10-19:30	Ines Rehbein and Josef Ruppenhofer	There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task
18:10-19:30	Egoitz Laparra and German Rigau	eXtended WordFrameNet
18:10-19:30	Attila Görög and Piek Vossen	Computer Assisted Semantic Annotation in the DutchSemCor Project

	Session P15 - Metadata and Digital Libraries	Chair : Sue Ellen Wright
18:10-19:30	Shunsuke Kozawa, Hitomi Tohyama, Kiyotaka Uchimoto and Shigeki Matsubara	Collection of Usage Information for Language Resources from Academic Articles
18:10-19:30	Cristina Vertan	Towards the Integration of Language Tools Within Historical Digital Libraries
18:10-19:30	Alistair Willis, David King, David Morse, Anton Dil, Chris Lyal and Dave Roberts	From XML to XML: The Why and How of Making the Biodiversity Literature Accessible to Researchers
18:10-19:30	Manuela Sassi, Gabriella Pardelli, Stefania Biagioni, Carlo Carlesi and Sara Goggi	A Digital Archive of Research Papers in Computer Science

	Session P16 - Part-of-Speech Tagging	Chair : Horacio Rodríguez
18:10-19:30	Yan Zhao and Gertjan van Noord	POS Multi-tagging Based on Combined Models
18:10-19:30	Mahdi Mohseni and Behrouz Minaei-bidgoli	A Persian Part-Of-Speech Tagger Based on Morphological Analysis
18:10-19:30	Majdi Sawalha and Eric Atwell	Fine-Grain Morphological Analyzer and Part-of-Speech Tagger for Arabic Text
18:10-19:30	Claire Brierley and Eric Atwell	ProPOSEC: A Prosody and PoS Annotated Spoken English Corpus
18:10-19:30	Boris Haselbach and Ulrich Heid	The Development of a Morphosyntactic Tagset for Afrikaans and its Use with Statistical Tagging
18:10-19:30	Jirka Hana and Anna Feldman	A Positional Tagset for Russian

	Session P17 - Semantic Annotation	Chair : Satoshi Sato
9:45-11:25	Antonio Balvet, Lucie Barque and Rafael Marín	Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus
9:45-11:25	Izaskun Aldezabal, María Jesús Aranzabe, Arantza Díaz de Ilarraza and Ainara Estarrona	Building the Basque PropBank
9:45-11:25	Samuel Reese, Gemma Boleda, Montse Cuadros, Lluís Padró and German Rigau	Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus
9:45-11:25	Aina Peris, Mariona Taulé, Gemma Boleda and Horacio Rodríguez	ADN-Classifier:Automatically Assigning Denotation Types to Nominalizations
9:45-11:25	Roser Morante	Descriptive Analysis of Negation Cues in Biomedical Texts
9:45-11:25	Diana Santos and Cristina Mota	Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora
9:45-11:25	Magali Sanches Duran, Marcelo Adriano Amâncio and Sandra Maria Aluísio	Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building
9:45-11:25	Stuart Moore, Sabine Buchholz and Anna Korhonen	Annotating the Enron Email Corpus with Number Senses
9:45-11:25	Suguru Matsuyoshi, Megumi Eguchi, Chitose Sao, Koji Murakami, Kentaro Inui and Yuji Matsumoto	Annotating Event Mentions in Text with Modality, Focus, and Source Information
9:45-11:25	Elisabetta Jezek and Valeria Quochi	Capturing Coercions in Texts: a First Annotation Exercise
9:45-11:25	Paula Vaz Lobo and David Martins de Matos	Fairy Tale Corpus Organization Using Latent Semantic Mapping and an Item-to-item Top-n Recommendation Algorithm

	Session P18 - Corpus and Morphological Annotation	Chair : Joan Soler Bou
9:45-11:25	Antonio Pareja-Lora and Guadalupe Aguado de Cea	Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish
9:45-11:25	Kikuo Maekawa, Makoto Yamazaki, Takehiko Maruyama, Masaya Yamaguchi, Hideki Ogura, Wakako Kashino, Toshinobu Ogiso, Hanae Koiso and Yasuharu Den	Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese
9:45-11:25	Bracha Nir, Brian MacWhinney and Shuly Wintner	A Morphologically-Analyzed CHILDES Corpus of Hebrew
9:45-11:25	Jarmila Panevová and Magda Ševčíková	Annotation of Morphological Meanings of Verbs Revisited
9:45-11:25	Seth Kulick, Ann Bies and Mohamed Maamouri	Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank

	Session P19 - Applications of Speech Technology	Chair : Norihide Kitaoka
9:45-11:25	Justus Roux, Pieter Scholtz, Daleen Klop, Claus Povlsen, Bart Jongejan and Asta Magnusdottir	Incorporating Speech Synthesis in the Development of a Mobile Platform for e-learning.
9:45-11:25	Alejandro Abejón, Doroteo T. Toledano, Danilo Spada, González Victor and Daniel Hernández López	A Study of the Influence of Speech Type on Automatic Language Recognition Performance
9:45-11:25	Joseph Polifroni, Imre Kiss and Mark Adler	Bootstrapping Named Entity Extraction for the Creation of Mobile Services
9:45-11:25	Jesús Tomás, Alejandro Canovas, Jaime Lloret, Miguel García Pineda and Jose L. Abad	Speech Translation in Pedagogical Environment Using Additional Sources of Knowledge
9:45-11:25	Koichiro Honda and Tomoyosi Akiba	Language Modeling Approach for Retrieving Passages in Lecture Audio Data
9:45-11:25	Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao and Claudia Baur	A Multilingual CALL Game Based on Speech Translation
9:45-11:25	Iker Luengo, Eva Navas, Igor Odriozola, Ibon Saratxaga, Inmaculada Hernaez, Iñaki Sainz and Daniel Erro	Modified LTSE-VAD Algorithm for Applications Requiring Reduced Silence Frame Misclassification
9:45-11:25	Michal Gishri, Vered Silber-Varod and Ami Moyal	Lexicon Design for Transcription of Spontaneous Voice Messages
9:45-11:25	Kevin Walker, Christopher Caruso and Denise DiPersio	Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development

	Session P20 - Speech Data Collection	Chair : Wolfgang Minker
9:45-11:25	Line Adde and Torbjørn Svendsen	NameDat: A Database of English Proper Names Spoken by Native Norwegians
9:45-11:25	Felix Burkhardt, Martin Eckert, Wiebke Johannsen and Joachim Stegmann	A Database of Age and Gender Annotated Telephone Speech
9:45-11:25	Patrick Bauer, David Scheler and Tim Fingscheidt	WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network
9:45-11:25	Petr Pollák and Josef Rajnoha	Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices
9:45-11:25	Ian McGraw, Chia-ying Lee, Lee Hetherington, Stephanie Seneff and Jim Glass	Collecting Voices from the Cloud

	Session P21 - Dialogue Evaluation	Chair : Claire Gardent
9:45-11:25	Els Lefever and Véronique Hoste	Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation
9:45-11:25	Marianne Laurent, Philippe Bretier and Carole Manquillet	Ad-hoc Evaluations Along the Lifecycle of Industrial Spoken Dialogue Systems: Heading to Harmonisation?
9:45-11:25	Xuchen Yao, Pravin Bhutada, Kallirroi Georgila, Kenji Sagae, Ron Artstein and David Traum	Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems
9:45-11:25	Barbara Plank	Improved Statistical Measures to Assess Natural Language Parser Performance across Domains
9:45-11:25	Carlos-D. Martínez-Hinarejos, Vicent Tamarit and José-M. Benedí	Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns

	Session P22 - Machine Translation and Evaluation	Chair :
11:45-13:05	Hercules Dalianis, Hao-chun Xing and Xin Zhang	Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction
11:45-13:05	Marta R. Costa-jussà, Mireia Farrús, José B. Mariño and José A. R. Fonollosa	Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems
11:45-13:05	Marta R. Costa-jussà and José A. R. Fonollosa	Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System
11:45-13:05	Maxim Khalilov, José A. R. Fonollosa, Inguna Skadina, Edgars Brālītis and Lauma Pretkalnina	Towards Improving English-Latvian Translation: A System Comparison and a New Rescoring Feature
11:45-13:05	Yanli Sun	Mining the Correlation between Human and Automatic Evaluation at Sentence Level
11:45-13:05	Christian Federmann	Appraise: An Open-Source Toolkit for Manual Phrase-Based Evaluation of Translations
11:45-13:05	Olivier Hamon	Is my Judge a good One?
11:45-13:05	Mark Fishel and Harri Kirik	Linguistically Motivated Unsupervised Segmentation for Machine Translation
11:45-13:05	Yu Chen and Andreas Eisele	Integrating a Rule-based with a Hierarchical Translation System
11:45-13:05	Aurélien Max, Josep Maria Crego and François Yvon	Contrastive Lexical Evaluation of Machine Translation
11:45-13:05	Yiou Wang, Kiyotaka Uchimoto, Jun’ichi Kazama, Canasai Kruengkrai and Kentaro Torisawa	Adapting Chinese Word Segmentation for Machine Translation Based on Short Units
11:45-13:05	Masaki Murata, Tomohiro Ohno, Shigeki Matsubara and Yasuyoshi Inagaki	Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation
11:45-13:05	Ondřej Bojar, Pavel Straňák and Daniel Zeman	Data Issues in English-to-Hindi Machine Translation
11:45-13:05	Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro and Suguru Matsuyoshi	Utilizing Semantic Equivalence Classes of Japanese Functional Expressions in Translation Rule Acquisition from Parallel Patent Sentences
11:45-13:05	Niraj Aswani and Robert Gaizauskas	English-Hindi Transliteration using Multiple Similarity Metrics

	Session P23 - Corpora and Treebanks, Grammar and Syntax	Chair : Patrick Saint Dizier
11:45-13:05	Cristina Bosco, Simonetta Montemagni, Alessandro Mazzei, Vincenzo Lombardo, Felice Dell'Orletta, Alessandro Lenci, Leonardo Lesmo, Giuseppe Attardi, Maria Simi, Alberto Lavelli, Johan Hall, Jens Nilsson and Joakim Nivre	Comparing the Influence of Different Treebank Annotations on Dependency Parsing
11:45-13:05	Olga Lyashevskaya	Bank of Russian Constructions and Valencies
11:45-13:05	Tomaž Erjavec, Darja Fišer, Simon Krek and Nina Ledinek	The JOS Linguistically Tagged Corpus of Slovene
11:45-13:05	António Branco, Francisco Costa, João Silva, Sara Silveira, Sérgio Castro, Mariana Avelãs, Clara Pinto and João Graça	Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank
11:45-13:05	Katarzyna Głowińska and Adam Przepiórkowski	The Design of Syntactic Annotation Levels in the National Corpus of Polish
11:45-13:05	Kais Dukes, Eric Atwell and Abdul-Baquee M. Sharaf	Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank
11:45-13:05	Jan Štěpánek and Petr Pajas	Querying Diverse Treebanks in a Uniform Way
11:45-13:05	Marie Mikulová and Jan Štěpánek	Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank
11:45-13:05	Marie Candito, Benoît Crabbé and Pascal Denis	Statistical French Dependency Parsing: Treebank Conversion and First Results
11:45-13:05	Marc Kupietz, Cyril Belica, Holger Keibel and Andreas Witt	The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research
11:45-13:05	Veronika Vincze, Dóra Szauter, Attila Almási, György Móra, Zoltán Alexin and János Csirik	Hungarian Dependency Treebank
11:45-13:05	Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya and Fei Xia	Empty Categories in a Hindi Treebank
11:45-13:05	Jinho D. Choi, Claire Bonial and Martha Palmer	Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee
11:45-13:05	Hiroki Hanaoka, Hideki Mima and Jun'ichi Tsujii	A Japanese Particle Corpus Built by Example-Based Annotation
11:45-13:05	Stephen A. Boxwell and Chris Brew	A Pilot Arabic CCGbank
11:45-13:05	Simon Mille and Leo Wanner	Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation
11:45-13:05	Adriane Boyd	EAGLE: an Error-Annotated Corpus of Beginning Learner German
11:45-13:05	José M. García-Miguel, Gael Vaamonde and Fita González Domínguez	ADESSE, a Database with Syntactic and Semantic Annotation of a Corpus of Spanish
11:45-13:05	Jan Strunk	Enriching a Treebank to Investigate Relative Clause Extraposition in German
11:45-13:05	John Lee and Dag Haug	Porting an Ancient Greek and Latin Treebank

	Session P24 - Parsing	Chair : Dan Flickinger
14:55-16:35	Alexis Baird and Christopher R. Walker	The Creation of a Large-Scale LFG-Based Gold Parsebank
14:55-16:35	Mridul Gupta, Vineet Yadav, Samar Husain and Dipti Misra Sharma	Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank
14:55-16:35	Djamé Seddah	Exploring the Spinal-STIG Model for Parsing French
14:55-16:35	Kristina Vučković, Željko Agić and Marko Tadić	Improving Chunking Accuracy on Croatian Texts by Morphosyntactic Tagging
14:55-16:35	Rui Wang and Yi Zhang	Hybrid Constituent and Dependency Parsing with Tsinghua Chinese Treebank
14:55-16:35	Valia Kordoni and Yi Zhang	Disambiguating Compound Nouns for a Dynamic HPSG Treebank of Wall Street Journal Texts
14:55-16:35	João Silva, António Branco and Patricia Gonçalves	Top-Performing Robust Constituency Parsing of Portuguese: Freely Available in as Many Ways as you Can Get it
14:55-16:35	Marco Passarotti and Felice Dell'Orletta	Improvements in Parsing the Index Thomisticus Treebank. Revision, Combination and a Feature Model for Medieval Latin
14:55-16:35	Violeta Seretan, Eric Wehrli, Luka Nerima and Gabriela Soare	FipsRomanian: Towards a Romanian Version of the Fips Syntactic Parser
14:55-16:35	Kathrin Spreyer, Lilja Øvrelid and Jonas Kuhn	Training Parsers on Partial Trees: A Cross-language Comparison
14:55-16:35	Lamia Tounsi and Josef van Genabith	Arabic Parsing Using Grammar Transforms
14:55-16:35	Yoshihiko Hayashi, Thierry Declerck and Chiharu Narawa	LAF/GrAF-grounded Representation of Dependency Structures

	Session P25 - Discourse Annotation	Chair : Dan Cristea
14:55-16:35	Piroska Lendvai, Thierry Declerck, Sándor Darányi, Pablo Gervás, Raquel Hervás, Scott Malec and Federico Peinado	Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case
14:55-16:35	Šárka Zikánová, Lucie Mladová, Jiří Mírovský and Pavlína Jínová	Typical Cases of Annotators’ Disagreement in Discourse Annotations in Prague Dependency Treebank
14:55-16:35	Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley, Sarah Taylor and Nick Webb	MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse
14:55-16:35	Raffaella Bernardi, Manuel Kirschner and Zorana Ratkovic	Context Fusion: The Role of Discourse Structure and Centering Theory
14:55-16:35	Xuchen Yao, Irina Borisova and Mehwish Alam	PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0
14:55-16:35	Horacio Saggion, Elena Stein-Sparvieri, David Maldavsky and Sandra Szasz	NLP Resources for the Analysis of Patient/Therapist Interviews
14:55-16:35	Nicole Novielli and Carlo Strapparava	Studying the Lexicon of Dialogue Acts
14:55-16:35	Nils Reiter, Oliver Hellwig, Anand Mishra, Anette Frank and Jens Burkhardt	Using NLP Methods for the Analysis of Rituals
14:55-16:35	Amal Al-Saif and Katja Markert	The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic
14:55-16:35	Maria Liakata, Simone Teufel, Advaith Siddharthan and Colin Batchelor	Corpora for the Conceptualisation and Zoning of Scientific Papers
14:55-16:35	Oi Yee Kwong	Constructing an Annotated Story Corpus: Some Observations and Issues
14:55-16:35	David K. Elson and Kathleen R. McKeown	Building a Bank of Semantically Encoded Narratives
14:55-16:35	Rashmi Prasad, Aravind Joshi and Bonnie Webber	Exploiting Scope for Shallow Discourse Parsing

	Session P26 - Dialogue Annotation	Chair : Jens Allwood
14:55-16:35	Sara Tonelli, Giuseppe Riccardi, Rashmi Prasad and Aravind Joshi	Annotation of Discourse Relations for Conversational Spoken Dialogs
14:55-16:35	Thomas Schmidt and Wilfried Schütte	FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction
14:55-16:35	Agnieszka Mykowiecka, Katarzyna Głowińska and Joanna Rabiega-Wiśniewska	Domain-related Annotation of Polish Spoken Dialogue Corpus LUNA.PL
14:55-16:35	Yasuharu Den, Hanae Koiso, Takehiko Maruyama, Kikuo Maekawa, Katsuya Takanashi, Mika Enomoto and Nao Yoshida	Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme
14:55-16:35	Olivier Blanc, Matthieu Constant, Anne Dister and Patrick Watrin	Partial Parsing of Spontaneous Spoken French
14:55-16:35	Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani, Dave Graff and Mike Ciul	From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News
14:55-16:35	Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka and Satoshi Nakamura	Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems
14:55-16:35	Iris Eshkol, Denis Maurel and Nathalie Friburger	Eslo: From Transcription to Speakers' Personal Information Annotation
14:55-16:35	Roberta Catizone, Alexiei Dingli and Robert Gaizauskas	Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue
14:55-16:35	Renata Savy	Pr.A.Ti.D: A Coding Scheme for Pragmatic Annotation of Dialogues.

	Session P27 - Evaluation of Speech Recognition and Speech Synthesis	Chair : Olivier Galibert
14:55-16:35	Bert Réveil, Jean-Pierre Martens and Henk van den Heuvel	Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon
14:55-16:35	Iñaki Sainz, Eva Navas, Inma Hernáez, Antonio Bonafonte and Francisco Campillo	TTS Evaluation Campaign with a Common Spanish Database
14:55-16:35	Timo Sowa, Fiorenza Arisio and Luca Cristoforetti	DICIT: Evaluation of a Distant-talking Speech Interface for Television

	Session P28 - Terminological Lexicons, Ontologies, Corpora	Chair : Monica Monachini
16:55-18:15	Ranka Stanković, Ivan Obradović and Olivera Kitanović	GIS Application Improvement with Multilingual Lexical and Terminological Resources
16:55-18:15	Rita Marinelli, Adriana Roventini, Giovanni Spadoni and Sebastiana Cucurullo	Lexical Semantic Resources in a Terminological Network
16:55-18:15	Nelleke Oostdijk, Suzan Verberne and Cornelis Koster	Constructing a Broad-coverage Lexicon for Text Mining in the Patent Domain
16:55-18:15	Rodrigo Agerri and Ana García-Serrano	Q-WordNet: Extracting Polarity from WordNet Senses
16:55-18:15	Aya Nishikawa, Ryo Nishimura, Yasuhiko Watanabe and Yoshihiro Okada	A Context Sensitive Variant Dictionary for Supporting Variant Selection
16:55-18:15	Montse Cuadros, Egoitz Laparra, German Rigau, Piek Vossen and Wauter Bosma	Integrating a Large Domain Ontology of Species into WordNet
16:55-18:15	Andrejs Vasiljevs and Kaspars Balodis	Corpus Based Analysis for Multilingual Terminology Entry Compounding
16:55-18:15	Arianne Reimerink, Pilar León Araúz and Pedro J. Magaña Redondo	EcoLexicon: An Environmental TKB
16:55-18:15	Dimitrios Kokkinakis and Ulla Gerdin	A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration

	Session P29 - Question Answering and Evaluation	Chair : Giuseppe Attardi
16:55-18:15	Silvia Quarteroni and Alessandro Moschitti	A Comprehensive Resource to Evaluate Complex Open Domain Question Answering
16:55-18:15	Alessandra Giordani and Alessandro Moschitti	Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries
16:55-18:15	Fang Xu and Dietrich Klakow	Paragraph Acquisition and Selection for List Question Using Amazon’s Mechanical Turk
16:55-18:15	Diana Santos, Luís Miguel Cabral, Corina Forascu, Pamela Forner, Fredric Gey, Katrin Lamm, Thomas Mandl, Petya Osenova, Anselmo Peñas, Álvaro Rodrigo, Julia Schulz, Yvonne Skalban and Erik Tjong Kim Sang	GikiCLEF: Crosscultural Issues in Multilingual Information Access
16:55-18:15	Sarra El Ayari, Brigitte Grau and Anne-Laure Ligozat	Fine-grained Linguistic Evaluation of Question Answering Systems
16:55-18:15	Arnaud Grappy, Brigitte Grau, Olivier Ferret, Cyril Grouin, Véronique Moriceau, Isabelle Robba, Xavier Tannier, Anne Vilnat and Vincent Barbier	A Corpus for Studying Full Answer Justification
16:55-18:15	Ludovic Quintard, Olivier Galibert, Gilles Adda, Brigitte Grau, Dominique Laurent, Véronique Moriceau, Sophie Rosset, Xavier Tannier and Anne Vilnat	Question Answering on Web Data: The QA Evaluation in Quæro
16:55-18:15	Xavier Tannier and Véronique Moriceau	FIDJI: Web Question-Answering at Quaero 2009
16:55-18:15	Bernard Jacquemin	A Derivational Rephrasing Experiment for Question Answering

	Session P30 - Natural Language Generation	Chair : Kristiina Jokinen
16:55-18:15	Roberto P. A. Araujo, Rafael L. de Oliveira, Eder M. de Novais, Thiago D. Tadeu, Daniel B. Pereira and Ivandré Paraboni	SINotas: the Evaluation of a NLG Application
16:55-18:15	Thiago D. Tadeu, Eder M. de Novais and Ivandré Paraboni	Extracting Surface Realisation Templates from Corpora
16:55-18:15	Sandra Williams and Richard Power	A Fact-aligned Corpus of Numerical Expressions
16:55-18:15	Andrew Gargett, Konstantina Garoufi, Alexander Koller and Kristina Striegnitz	The GIVE-2 Corpus of Giving Instructions in Virtual Environments

	Session P31 - Dialogue Corpora	Chair : Laurent Prevot
16:55-18:15	Keyan Zhou, Aijun Li, Zhigang Yin and Chengqing Zong	CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation
16:55-18:15	Yuki Kamiya, Tomohiro Ohno, Shigeki Matsubara and Hideki Kashioka	Construction of Back-Channel Utterance Corpus for Responsive Spoken Dialogue System Development
16:55-18:15	Werner Spiegl, Korbinian Riedhammer, Stefan Steidl and Elmar Nöth	FAU IISAH Corpus -- A German Speech Database Consisting of Human-Machine and Human-Human Interaction Acquired by Close-Talking and Far-Distance Microphones
16:55-18:15	Rodolfo Delmonte, Antonella Bristot and Vincenzo Pallotta	Deep Linguistic Processing with GETARUNS for Spoken Dialogue Understanding
16:55-18:15	Helena Spilková, Daniel Brenner, Anton Öttl, Pavel Vondřička, Wim van Dommelen and Mirjam Ernestus	The Kachna L1/L2 Picture Replication Corpus
16:55-18:15	Linda Brandschain, David Graff, Christopher Cieri, Kevin Walker, Chris Caruso and Abby Neely	Greybeard Longitudinal Speech Study
16:55-18:15	Linda Brandschain, David Graff, Chris Cieri, Kevin Walker, Chris Caruso and Abby Neely	Mixer 6

	Session P32 - Dialogue Management and Systems	Chair : Takenobu Tokunaga
16:55-18:15	Tobias Heinroth, Dan Denich, Alexander Schmitt and Wolfgang Minker	Efficient Spoken Dialogue Domain Representation and Interpretation
16:55-18:15	Ioana Vasilescu, Sophie Rosset and Martine Adda-Decker	On the Role of Discourse Markers in Interactive Spoken Question Answering Systems
16:55-18:15	Jette Viethen, Simon Zwarts, Robert Dale and Markus Guhe	Dialogue Reference in a Visual Domain
16:55-18:15	Anton Leuski and David Traum	NPCEditor: A Tool for Building Question-Answering Characters

	Session P33 - Information Extraction, Terminology, Corpora	Chair : Pierre Zweigenbaum
18:20-19:40	Claudia Borg, Mike Rosner and Gordon J. Pace	Automatic Grammar Rule Extraction and Ranking for Definitions
18:20-19:40	Alberto Tretti and Barbara Di Eugenio	Analysis and Presentation of Results for Mobile Local Search
18:20-19:40	Atsushi Fujii	Modeling Wikipedia Articles to Enhance Encyclopedic Search
18:20-19:40	Christian Federmann and Thierry Declerck	Extraction, Merging, and Monitoring of Company Data from Heterogeneous Sources
18:20-19:40	Alberto Simões, José João Almeida and Rita Farinha	Processing and Extracting Data from Dicionário Aberto
18:20-19:40	Ziqi Zhang, José Iria and Fabio Ciravegna	Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction
18:20-19:40	Jakob Halskov, Dorte Haltrup Hansen, Anna Braasch and Sussi Olsen	Quality Indicators of LSP Texts ― Selection and Measurements Measuring the Terminological Usefulness of Documents for an LSP Corpus
18:20-19:40	Eric Charton and Juan-Manuel Torres-Moreno	NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems
18:20-19:40	Cécile Grivaz	Human Judgements on Causation in French Texts
18:20-19:40	Heng Ji, Xiang Li, Angelo Lucia and Jianting Zhang	Annotating Event Chains for Carbon Sequestration Literature
18:20-19:40	Kumutha Swampillai and Mark Stevenson	Inter-sentential Relations in Information Extraction Corpora
18:20-19:40	Christopher R. Walker and Hannah Copperman	Evaluating Complex Semantic Artifacts
18:20-19:40	Marc Kemps-Snijders, Thomas Koller, Han Sloetjes and Huib Verwey	LAT Bridge: Bridging Tools for Annotation and Exploration of Rich Linguistic Data

	Session P34 - Knowledge Discovery	Chair : Leo Wanner
18:20-19:40	Paola Monachesi and Thomas Markus	Socially Driven Ontology Enrichment for eLearning
18:20-19:40	Avaré Stewart, Kerstin Denecke and Wolfgand Nejdl	Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence
18:20-19:40	Ekaterina Buyko, Elena Beisswanger and Udo Hahn	The GeneReg Corpus for Gene Expression Regulation Events ― An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability
18:20-19:40	Carlos Periñán-Pascual and Francisco Arcas-Túnez	The Architecture of FunGramKB
18:20-19:40	Jaouad Mousser	A Large Coverage Verb Taxonomy for Arabic
18:20-19:40	Satoshi Sekine and Kapil Dalwani	Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information

	Session P35 - Text Corpora and Language Resources	Chair : Toma? Erjavec
18:20-19:40	Henk van den Heuvel, René van Horik, Stef Scagliola, Eric Sanders and Paula Witkamp	The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities
18:20-19:40	Martin Reynaert, Nelleke Oostdijk, Orphée De Clercq, Henk van den Heuvel and Franciska de Jong	Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus
18:20-19:40	Youssef Aït Ouguengay and Aïcha Bouhjar	For Standardised Amazigh Linguistic Resources
18:20-19:40	Dafydd Gibbon, Moses Ekpenyong and Eno-Abasi Urua	Medefaidrin: Resources Documenting the Birth and Death Language Life-cycle
18:20-19:40	Nicolas Serrano, Francisco Castro and Alfons Juan	The RODRIGO Database
18:20-19:40	Cristina Sánchez-Marco, Gemma Boleda, Josep Maria Fontana and Judith Domingo	Annotation and Representation of a Diachronic Corpus of Spanish
18:20-19:40	Roser Sanromà and Gemma Boleda	The Database of Catalan Adjectives
18:20-19:40	Graham Neubig and Shinsuke Mori	Word-based Partial Annotation for Efficient Corpus Construction

	Session P36 - Multimodal and Audiovisual Corpora	Chair : Daniel Sonntag
9:45-11:25	Elena Grishina	Multimodal Russian Corpus (MURCO): First Steps
9:45-11:25	Kristiina Jokinen	Non-verbal Signals for Turn-taking and Feedback
9:45-11:25	Patrizia Paggio, Jens Allwood, Elisabeth Ahlsén, Kristiina Jokinen and Costanza Navarretta	The NOMCO Multimodal Nordic Resource - Goals and Characteristics
9:45-11:25	Fernando Fernández-Martínez, Juan Manuel Lucas-Cuesta, Roberto Barra Chicote, Javier Ferreiros and Javier Macías-Guarasa	HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
9:45-11:25	Francisco Torreira and Mirjam Ernestus	The Nijmegen Corpus of Casual Spanish
9:45-11:25	Rein Ove Sikveland, Anton Öttl, Ingunn Amdal, Mirjam Ernestus, Torbjørn Svendsen and Jens Edlund	Spontal-N: A Corpus of Interactional Spoken Norwegian
9:45-11:25	Jens Edlund, Jonas Beskow, Kjell Elenius, Kahl Hellmer, Sofia Strönbergsson and David House	Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture
9:45-11:25	Jérôme Urbain, Elisabetta Bevacqua, Thierry Dutoit, Alexis Moinet, Radoslaw Niewiadomski, Catherine Pelachaud, Benjamin Picart, Joëlle Tilmanne and Johannes Wagner	The AVLaughterCycle Database
9:45-11:25	Carlos Gómez Gallo, T. Florian Jaeger and Katrina Furth	A Database for the Exploration of Spanish Planning
9:45-11:25	Stavros Ntalampiras, Todor Ganchev, Ilyas Potamitis and Nikos Fakotakis	Heterogeneous Sensor Database in Support of Human Behaviour Analysis in Unrestricted Environments: The Audio Part
9:45-11:25	Theodoros Kostoulas, Otilia Kocsis, Todor Ganchev, Fernando Fernández-Aranda, Juan J. Santamaría, Susana Jiménez-Murcia, Maher Ben Moussa, Nadia Magnenat-Thalmann and Nikos Fakotakis	The PlayMancer Database: A Multimodal Affect Database in Support of Research and Development Activities in Serious Game Environment
9:45-11:25	Alexander Vorwerk, Xiaohui Wang, Dorothea Kolossa, Steffen Zeiler and Reinhold Orglmeister	WAPUSK20 - A Database for Robust Audiovisual Speech Recognition
9:45-11:25	Peng-Wen Chen, Snehal Kumar Chennuru and Ying Zhang	A Language Approach to Modeling Human Behaviors
9:45-11:25	Kathleen Eberhard, Hannele Nicholson, Sandra Kübler, Susan Gundersen and Matthias Scheutz	The Indiana ``Cooperative Remote Search Task"" (CReST) Corpus
9:45-11:25	Katerina Pastra, Christian Wallraven, Michael Schultze, Argyro Vataki and Kathrin Kaulard	The POETICON Corpus: Capturing Language Use and Sensorimotor Experience in Everyday Interaction
9:45-11:25	Quan Nguyen and Michael Kipp	Annotation of Human Gesture using 3D Skeleton Controls
9:45-11:25	Massimo Poesio, Marco Baroni, Oswald Lanz, Alessandro Lenci, Alexandros Potamianos, Hinrich Schütze, Sabine Schulte im Walde and Luca Surian	BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do

	Session P37 - Sign Language	Chair : Annelies Braffort
9:45-11:25	François Lefebvre-Albaret and Patrice Dalle	Video Retrieval in Sign Language Videos : How to Model and Compare Signs?
9:45-11:25	Antoinette Hawayek, Riccardo Del Gratta and Giuseppe Cappelli	A Bilingual Dictionary Mexican Sign Language-Spanish/Spanish-Mexican Sign Language

	Session P38 - Document Classification	Chair : Dan Tufiş
9:45-11:25	Serge Sharoff, Zhili Wu and Katja Markert	The Web Library of Babel: evaluating genre collections
9:45-11:25	Hercules Dalianis and Sumithra Velupillai	How Certain are Clinical Assessments? Annotating Swedish Clinical Text for (Un)certainties, Speculations and Negations
9:45-11:25	Magnus Rosell	Text Cluster Trimming for Better Descriptions and Improved Quality
9:45-11:25	Alberto Díaz, Pablo Gervás, Antonio García and Laura Plaza	Development and Use of an Evaluation Collection for Personalisation of Digital Newspapers
9:45-11:25	Michael Wiegand and Dietrich Klakow	Predictive Features for Detecting Indefinite Polar Sentences
9:45-11:25	Naoki Ishikawa, Ryo Nishimura, Yasuhiko Watanabe, Yoshihiro Okada and Masaki Murata	Detection of submitters suspected of pretending to be someone else in a community site
9:45-11:25	Nikola Ljubešić, Tomislava Lauc and Damir Boras	Building a Gold Standard for Event Detection in Croatian

	Session P39 - Summarisation	Chair : Luca Dini
9:45-11:25	Jorge Vivaldi, Iria da Cunha, Juan Manuel Torres-Moreno and Patricia Velázquez-Morales	Automatic Summarization Using Terminological and Semantic Resources
9:45-11:25	Claude de Loupy, Marie Guégan, Christelle Ayache, Somara Seng and Juan-Manuel Torres Moreno	A French Human Reference Corpus for Multi-Document Summarization and Sentence Compression
9:45-11:25	Ahmet Aker and Robert Gaizauskas	Model Summaries for Location-related Images
9:45-11:25	Masahiro Nakano, Hideyuki Shibuki, Rintaro Miyazaki, Madoka Ishioroshi, Koichi Kaneko and Tatsunori Mori	Construction of Text Summarization Corpus for the Credibility of Information on the Web

	Session P40 - Textual Entailment	Chair : Brigitte Grau
9:45-11:25	Paul Bedaride and Claire Gardent	Syntactic Testsuites and Textual Entailment Recognition
9:45-11:25	Rui Wang and Caroline Sporleder	Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank
9:45-11:25	Aurélien Max and Guillaume Wisniewski	Mining Naturally-occurring Corrections and Paraphrases from Wikipedia’s Revision History
9:45-11:25	Jana Z. Sukkarieh and Eleanor Bolge	Building a Textual Entailment Suite for the Evaluation of Automatic Content Scoring Technologies

	Session P41 - Semantics and Evaluation	Chair : Amália Mendes
11:45-13:05	Kirk Roberts, Srikanth Gullapalli, Cosmin Adrian Bejan and Sanda Harabagiu	A Linguistic Resource for Semantic Parsing of Motion Events
11:45-13:05	Zareen Syed, Evelyne Viegas and Savas Parastatidis	Automatic Discovery of Semantic Relations using MindNet
11:45-13:05	Ineke Schuurman and Vincent Vandeghinste	Cultural Aspects of Spatiotemporal Analysis in Multilingual Applications
11:45-13:05	Fabienne Venant	Meaning Representation: From Continuity to Discreteness
11:45-13:05	Dirk Goldhahn and Uwe Quasthoff	Automatic Annotation of Co-Occurrence Relations
11:45-13:05	Simon Scerri, Gerhard Gossen, Brian Davis and Siegfried Handschuh	Classifying Action Items for Semantic Email
11:45-13:05	Jiří Materna and Karel Pala	Using Ontologies for Semi-automatic Linking VerbaLex with FrameNet
11:45-13:05	Olivier Ferret	Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus

	Session P42 - Text Mining	Chair : Serge Sharoff
11:45-13:05	Sophia Ananiadou, John McNaught, James Thomas, Mark Rickinson and Sandy Oliver	Evaluating a Text Mining Based Educational Search Portal
11:45-13:05	Hiroyuki Shinnou and Minoru Sasaki	Detection of Peculiar Examples using LOF and One Class SVM
11:45-13:05	Agata Cybulska and Piek Vossen	Event Models for Historical Perspectives: Determining Relations between High and Low Level Events in Text, Based on the Classification of Time, Location and Participants.
11:45-13:05	Eva Sassolini and Alessandra Cinini	Cultural Heritage: Knowledge Extraction from Web Documents

	Session P43 - Multilingual Corpora for Machine Translation	Chair : Gregor Thurmair
11:45-13:05	Lieve Macken	An Annotation Scheme and Gold Standard for Dutch-English Word Alignment
11:45-13:05	Lucia Specia, Nicola Cancedda and Marc Dymetman	A Dataset for Assessing Machine Translation Evaluation Metrics
11:45-13:05	Gabor Recski, András Rung, Attila Zséder and András Kornai	NP Alignment in Bilingual Corpora
11:45-13:05	Orphée De Clercq and Maribel Montero Perez	Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus
11:45-13:05	Yulia Tsvetkov and Shuly Wintner	Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content
11:45-13:05	Beáta Megyesi, Bengt Dahlqvist, Éva Á. Csató and Joakim Nivre	The English-Swedish-Turkish Parallel Treebank
11:45-13:05	Lars Ahrenberg	Alignment-based Profiling of Europarl Data in an English-Swedish Parallel Corpus
11:45-13:05	Jesús González-Rubio, Jorge Civera, Alfons Juan and Francisco Casacuberta	Saturnalia: A Latin-Catalan Parallel Corpus for Statistical MT
11:45-13:05	Julia Maria Schulz, Christa Womser-Hacker and Thomas Mandl	Multilingual Corpus Development for Opinion Mining
11:45-13:05	Tom Vanallemeersch	Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents

	Session P44 - Language Identification	Chair : Alexander Mehler
11:45-13:05	Yu Fu, Feiyu Xu and Hans Uszkoreit	Determining the Origin and Structure of Person Names
11:45-13:05	Tommi Vatanen, Jaakko J. Väyrynen and Sami Virpioja	Language Identification of Short Text Segments with N-gram Models
11:45-13:05	Stasinos Konstantopoulos	Learning Language Identification Models: A Comparative Analysis of the Distinctive Features of Names and Common Words
11:45-13:05	Mohamed Belgacem, Georges Antoniadis and Laurent Besacier	Automatic Identification of Arabic Dialects

	Session P45 - Evaluation Methodologies	Chair : Alessandro Moschitti
11:45-13:05	Elin Carlsson and Hercules Dalianis	Influence of Module Order on Rule-Based De-identification of Personal Names in Electronic Patient Records Written in Swedish
11:45-13:05	Olga Babko-Malaya, Dan Hunter, Connie Fournelle and Jim White	Evaluation of Document Citations in Phase 2 Gale Distillation
11:45-13:05	Olivier Galibert, Ludovic Quintard, Sophie Rosset, Pierre Zweigenbaum, Claire Nédellec, Sophie Aubin, Laurent Gillard, Jean-Pierre Raysz, Delphine Pois, Xavier Tannier, Louise Deléger and Dominique Laurent	Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation
11:45-13:05	Marco Guerini, Carlo Strapparava and Oliviero Stock	Evaluation Metrics for Persuasive NLP with Google AdWords
11:45-13:05	Joana Hois	Inter-Annotator Agreement on a Linguistic Ontology for Spatial Language - A Case Study for GUM-Space
11:45-13:05	Petra-Maria Strauß, Stefan Scherer, Georg Layher and Holger Hoffmann	Evaluation of the PIT Corpus Or What a Difference a Face Makes?

	Session P46 - Corpora and Evaluation	Chair : Henk van den Heuvel
11:45-13:05	Ernesto William De Luca	A Corpus for Evaluating Semantic Multilingual Web Retrieval Systems: The Sense Folder Corpus
11:45-13:05	Sanja Seljan, Marko Tadić, Željko Agić, Jan Šnajder, Bojana Dalbelo Bašić and Vjekoslav Osmann	Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora
11:45-13:05	Irina Temnikova	Cognitive Evaluation Approach for a Controlled Language Post-Editing Experiment
11:45-13:05	Maria Khokhlova and Victor Zakharov	Studying Word Sketches for Russian
11:45-13:05	Hannah Copperman and Christopher R. Walker	Fred’s Reusable Evaluation Device: Providing Support for Quick and Reliable Linguistic Annotation
11:45-13:05	Emilia Apostolova, Sean Neilan, Gary An, Noriko Tomuro and Steven Lytinen	Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation

	Session P47 - Corpora, Annotation and Tools	Chair : Satoshi Sekine
14:55-16:35	Marc Verhagen	The Brandeis Annotation Tool
14:55-16:35	Georgios Petasis and Dimitrios Petasis	BlogBuster: A Tool for Extracting Corpora from the Blogosphere
14:55-16:35	Jinho D. Choi, Claire Bonial and Martha Palmer	Propbank Frameset Annotation Guidelines Using a Dedicated Editor, Cornerstone
14:55-16:35	Dain Kaplan, Ryu Iida and Takenobu Tokunaga	Annotation Process Management Revisited
14:55-16:35	Takeshi Abekawa, Masao Utiyama, Eiichiro Sumita and Kyo Kageura	Community-based Construction of Draft and Final Translation Corpus Through a Translation Hosting Site Minna no Hon'yaku (MNH)
14:55-16:35	Maarten Marx and Anne Schuth	DutchParl. The Parliamentary Documents in Dutch
14:55-16:35	Svetla Koeva, Diana Blagoeva and Siya Kolkovska	Bulgarian National Corpus Project
14:55-16:35	Khalil Dahab and Anja Belz	A Game-based Approach to Transcribing Images of Text
14:55-16:35	Ghulam Raza	Inferring Subcat Frames of Verbs in Urdu
14:55-16:35	Romaric Besançon, Gaël de Chalendar, Olivier Ferret, Faiza Gara, Olivier Mesnard, Meriama Laïb and Nasredine Semmar	LIMA : A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation
14:55-16:35	Catarina Magro	When CORDIAL Becomes Friendly: Endowing the CORDIAL Corpus with a Syntactic Annotation Layer
14:55-16:35	Richard Johansson and Alessandro Moschitti	A Flexible Representation of Heterogeneous Annotation Data
14:55-16:35	Roberto Navigli, Paola Velardi and Juana María Ruiz-Martínez	An Annotated Dataset for Extracting Definitions and Hypernyms from the Web

	Session P48 - Tools for Speech Corpus	Chair : Justus Roux
14:55-16:35	Kai Wörner	A Tool for Feature-Structure Stand-Off-Annotation on Transcriptions of Spoken Discourse
14:55-16:35	Andrew Thwaites, Jeroen Geertzen, William D. Marslen-Wilson and Paula Buttery	LIPS: A Tool for Predicting the Lexical Isolation Point of a Word
14:55-16:35	Ibon Saratxaga, Inmaculada Hernáez, Eva Navas, Iñaki Sainz, Iker Luengo, Jon Sanchez, Igor Odriozola and Daniel Erro	AhoTransf: A Tool for Multiband Excitation Based Speech Analysis and Modification
14:55-16:35	Sara Romano and Francesco Cutugno	New Features in Spoken Language Search Hawk (SpLaSH): Query Language and Query Sequence
14:55-16:35	Kornel Laskowski and Jens Edlund	A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm
14:55-16:35	Sathish Pammi, Marcela Charfuelan and Marc Schröder	Multilingual Voice Creation Toolkit for the MARY TTS Platform

	Session P49 - WordNet, Framenet, Ontologies	Chair : Karel Pala
14:55-16:35	Winston Anderson, Laurette Pretorius and Albert Kotzé	Base Concepts in the African Languages Compared to Upper Ontologies and the WordNet Top Ontology
14:55-16:35	Yue Ma, Adeline Nazarenko and Laurent Audibert	Formal Description of Resources for Ontology-based Semantic Annotation
14:55-16:35	Roxane Segers and Piek Vossen	Facilitating Non-expert Users of the KYOTO Platform: the TMEKO Editing Protocol for Synset to Ontology Mappings
14:55-16:35	Chris Irwin Davis and Dan Moldovan	Feasibility of Automatically Bootstrapping a Persian WordNet
14:55-16:35	Pushpak Bhattacharyya	IndoWordNet
14:55-16:35	Zygmunt Vetulani, Marek Kubis and Tomasz Obrębski	PolNet ― Polish WordNet: Data and Tools
14:55-16:35	Mehrnoush Shamsfard, Hakimeh Fadaei and Elham Fekri	Extracting Lexico-conceptual Knowledge for Developing Persian WordNet
14:55-16:35	Prasanth Kolachina, Sudheer Kolachina, Anil Kumar Singh, Samar Husain, Viswanath Naidu, Rajeev Sangal and Aksar Bharati	Grammar Extraction from Treebanks for Hindi and Telugu
14:55-16:35	Emiliano Giovannetti	An Unsupervised Approach for Semantic Relation Interpretation
14:55-16:35	Gabor Melli	Concept Mentions within KDD-2009 Abstracts (kdd09cma1) Linked to a KDD Ontology (kddo1)
14:55-16:35	Min-Jae Kwon, Hae-Yun Lee and Hee-Rahk Chae	Linking Korean Words with an Ontology
14:55-16:35	Hassina Aliane, Zaia Alimazighi and Ahmed Cherif Mazari	Al ―Khalil : The Arabic Linguistic Ontology Project
14:55-16:35	Cássia Trojahn, Paulo Quaresma and Renata Vieira	An API for Multi-lingual Ontology Matching
14:55-16:35	Thierry Declerck and Piroska Lendvai	Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems
14:55-16:35	Kiril Simov and Petya Osenova	Constructing of an Ontology-based Lexicon for Bulgarian
14:55-16:35	René Witte, Ninus Khamis and Juergen Rilling	Flexible Ontology Population from Text: The OwlExporter
14:55-16:35	Takehiro Teraoka, Jun Okamoto and Shun Ishizaki	An Associative Concept Dictionary for Verbs and its Application to Elliptical Word Estimation
14:55-16:35	Nao Tatsumi, Jun Okamoto and Shun Ishizaki	Evaluating Semantic Relations and Distances in the Associative Concept Dictionary using NIRS-imaging
14:55-16:35	Giulio Paci, Giorgio Pedrazzi and Roberta Turra	Wikipedia-based Approach for Linking Ontology Concepts to their Realisations in Text
14:55-16:35	Pradeep Dantuluri, Brian Davis and Siegfried Handschuh	A Use Case for Controlled Languages as Interfaces to Semantic Web Applications
14:55-16:35	Alessandro Oltramari, Guido Vetere, Maurizio Lenzerini, Aldo Gangemi and Nicola Guarino	Senso Comune