Session O1-W Lexicons, Semantics & Infrastructural Issues :  
Y. Hayashi, T. Ishida A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
C. Soria, M. Tesconi, F. Bertagna, N. Calzolari, A. Marchetti, M. Monachini Moving to dynamic computational lexicons with LeXFlow
V. Cavalli-sforza, A. Soudi IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler
A. Itai, S. Wintner, S. Yona A Computational Lexicon of Contemporary Hebrew
E. Agirre, I. Aldezabal, J. Etxeberria, E. Izagirre, K. Mendizabal, E. Pociello, M. Quintian A methodology for the joint development of the Basque WordNet and Semcor
S. Elkateb, W. Black, H. Rodríguez, M. Alkhalifa, E. Aribau, P. Vossen, A. Pease, C. Fellbaum Building a WordNet for Arabic
V. Tablan, T. Polajnar, H. Cunningham, K. Bontcheva User-friendly ontology authoring using a controlled language
Session O2-EW Named Entity Recognition & Time Annotation :  
L. Nezda, A. Hickl, J. Lehmann, S. Fayyaz What in the World is a Shahab? Wide Coverage Named Entity Recognition for Arabic
I. Michailidis, K. Diamantaras, S. Vasileiadis, Y. Frère Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime
B. Pouliquen, M. Kimler, R. Steinberger, C. Ignat, T. Oellinger, K. Blackler, F. Fluart, W. Zaghouani, A. Widiger, A. Forslund, C. Best Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
V. Giouli, A. Konstandinidis, E. Desypri, H. Papageorgiou Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue
C. Forăscu, I. Pistol, D. Cristea Temporality in relation with discourse structure
B. Boguraev, R. Ando Analysis of TimeBank as a Resource for TimeML Parsing
F. Pan, R. Mulkar, J. Hobbs An Annotated Corpus of Typical Durations of Events
Session O3-EW Authoring Tools, Information Extraction and Retrieval :  
F. Bustamante, A. Arnaiz, M. Ginés A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
M. Reynaert Corpus-Induced Corpus Clean-up
F. Bustamante, E. Díaz Spelling Error Patterns in Spanish for Word Processing Applications
E. Luca, A. Nürnberger Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods
J. Iria, F. Ciravegna A Methodology and Tool for Representing Language Resources for Information Extraction
N. Habash, C. Mah, S. Imran, R. Calistri-yeh, P. Sheridan Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
A. Klassmann, F. Offenga, D. Broeder, R. Skiba, P. Wittenburg Comparison of Resource Discovery Methods
Session O4-S Speech Corpora and Dialogue :  
C. Cieri, W. Andrews, J. Campbell, G. Doddington, J. Godfrey, S. Huang, M. Liberman, A. Martin, H. Nakasone, M. Przybocki, K. Walker The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
E. Grishina Spoken Russian in the Russian National Corpus (RNC)
E. Barker, R. Higashinaka, F. Mairesse, R. Gaizauskas, M. Walker, J. Foster Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives
A. Nimaan, P. Nocera, J. Bonastre Towards automatic transcription of Somali language
C. Cucchiarini, H. Hamme, O. Herwijnen, F. Smits JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
S. Galliano, E. Geoffrois, G. Gravier, J. Bonastre, D. Mostefa, K. Choukri Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
J. Polifroni, M. Walker Learning Database Content for Spoken Dialogue System Design
Session O5-E Machine Translation and Evaluation :  
D. Mostefa, O. Hamon, K. Choukri Evaluation of Automatic Speech Recognition and Speech Translation within TC-STAR: Results from the first evaluation campaign
O. Hamon, M. Rajman X-Score: Automatic Evaluation of Machine Translation Grammaticality
K. Miller, M. Vanni Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm
E. Macklovitch TransType2: The Last Word
U. Schäfer, D. Beck Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components
O. Hamon, A. Popescu-belis, K. Choukri, M. Dabbadie, A. Hartley, W. Mustafa El Hadi, M. Rajman, I. Timimi CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
S. Strassel, C. Cieri, A. Cole, D. Dipersio, M. Liberman, X. Ma, M. Maamouri, K. Maeda Integrated Linguistic Resources for Language Exploitation Technologies
Session O6-WMT Ontologies :  
P. Cimiano, M. Hartung, E. Ratsch Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
J. Iria, C. Brewster, F. Ciravegna, Y. Wilks An Incremental Tri-Partite Approach To Ontology Learning
G. Grefenstette, F. Debili, C. Fluhr, S. Zinger Exploiting text for extracting image processing resources
R. Marinelli, A. Roventini, G. Spadoni Using Core Ontology for Domain Lexicon Structuring
S. Li, Q. Lu, W. Li, R. Xu Interaction between Lexical Base and Ontology with Formal Concept Analysis
R. Bartolini, C. Caracciolo, E. Giovannetti, A. Lenci, S. Marchi, V. Pirrelli, C. Renso, L. Spinsanti Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Session O7-GWT Standards :  
N. Ide, L. Romary Representing Linguistic Corpora and Their Annotations
T. Declerck SynAF: Towards a Standard for Syntactic Annotation
G. Francopoulo, M. George, N. Calzolari, M. Monachini, N. Bel, M. Pet, C. Soria Lexical Markup Framework (LMF)
M. Assem, A. Gangemi, G. Schreiber Conversion of WordNet to a standard RDF/OWL representation
L. Henriksen, C. Povlsen, A. Vasiljevs EuroTermBank - a Terminology Resource based on Best Practice
G. Tummarello, C. Morbidoni, F. Kepler, F. Piazza, P. Puliti A novel Textual Encoding paradigm based on Semantic Web tools and semantics
Session O8-W Lexicon Acquisition :  
P. Chesley, S. Salmon-alt Automatic extraction of subcategorization frames for French
A. Berglund, R. Johansson, P. Nugues Extraction of Temporal Information from Texts in Swedish
M. Schiehlen, K. Spranger The Mass-Count Distinction: Acquisition and Disambiguation
P. Smrž Automatic Acquisition of Semantics-Extraction Patterns from Parallel Texts
Y. Zhang, V. Kordoni Automated Deep Lexical Acquisition for Robust Open Texts Processing
S. Amsalu Data-driven Amharic-English Bilingual Lexicon Acquisition
Session O9-SE TTS & Units for TTS :  
Q. Yang, J. Martens, N. Konings, H. Heuvel Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names
S. Breuer, S. Bergmann, R. Dragon, S. Möller Set-up of a Unit-Selection Synthesis with a Prominent Voice
D. Oliver, K. Szklanny Creation and analysis of a Polish speech database for use in unit selection synthesis
J. Pérez, A. Bonafonte, H. Hain, E. Keller, S. Breuer, J. Tian ECESS Inter-Module Interface Specification for Speech Synthesis
M. Garcia, C. D’alessandro, G. Bailly, P. Mareüil, M. Morel A joint prosody evaluation of French text-to-speech synthesis systems
A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. Heuvel, H. Hain, X. Wang, M. Garcia TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis
Session O10-E Parsing Evaluation :  
P. Paroubek, I. Robba, A. Vilnat, C. Ayache Data, Annotations and Measures in EASY: the Evaluation Campaign for Parsers of French
G. Neumann, B. Crysmann Exploring HPSG-based Treebanks for Probabilistic Parsing
R. Subba, B. Eugenio, E. Terenzi Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
B. Roark, M. Harper, E. Charniak, B. Dorr, M. Johnson, J. Kahn, Y. Liu, M. Ostendorf, J. Hale, A. Krasnyanskaya, M. Lease, I. Shafran, M. Snover, R. Stewart, L. Yung SParseval: Evaluation Metrics for Parsing Speech
A. Denis, M. Quignard, G. Pitel A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
T. Vanrullen, P. Blache, J. Balfourier Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Session O11-W Corpus & Lexicon :  
C. Fairon, S. Paumier A translated corpus of 30,000 French SMS
I. Castellón, A. Fernández-montraveta, G. Vázquez, L. Alonso, J. Capilla The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
S. Klatt A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
K. Inui, T. Hirano, R. Iida, A. Fujita, Y. Matsumoto Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Session O12-W Tokenization and Tagging :  
M. Forst, R. Kaplan The importance of precise tokenizing for deep grammars
F. Wang, X. Deng, F. Zou Towards Unified Chinese Segmentation Algorithm
F. Pîrvan, D. Tufiş Tagset Mapping and Statistical Training Data Cleaning-up
J. Hlavácová New Approach to Frequency Dictionaries — Czech Example
Session O13-M Multimodal Corpora & Natural Interaction :  
N. Campbell, T. Sadanobu, M. Imura, N. Iwahashi, S. Noriko, D. Douxchamps A Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
D. Byron, E. Fosler-lussier The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
A. Green, H. Hüttenrauch, E. Topp, K. Severinson Developing a Contextualized Multimodal Corpus for Human-Robot Interaction
S. Luz, M. Bouamrane, M. Masoodian Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction
Session O14-W Sentiments & Semantics :  
A. Andreevskaia, S. Bergler Semantic Tag Extraction from WordNet Glosses
A. Esuli, F. Sebastiani SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
C. Strapparava, A. Valitutti, O. Stock The Affective Weight of Lexicon
M. Taboada, C. Anthony, K. Voll Methods for Creating Semantic Orientation Dictionaries
Session O15-W Treebanks and Acquisition :  
F. Dell’orletta, A. Lenci, S. Montemagni, V. Pirrelli Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
V. Novak, J. Hajic Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
M. Maamouri, A. Bies, T. Buckwalter, M. Diab, N. Habash, O. Rambow, D. Tabessi Developing and Using a Pilot Dialectal Arabic Treebank
M. Marneffe, B. Maccartney, C. Manning Generating Typed Dependency Parses from Phrase Structure Parses
Session O16-W MWEs and Collocations :  
A. Alonge The Italian Metaphor Database
R. Osswald, H. Helbig, S. Hartrumpf The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
S. Sharoff, B. Babych, A. Hartley Using collocations from comparable corpora to find translation equivalents
M. Sailer, B. Trawinski The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research
Session O17-TGW Multilingual Tools and Terminology :  
D. Ştefănescu, D. Tufiş Aligning Multilingual Thesauri
G. Ajani, G. Boella, L. Lesmo, M. Martin, A. Mazze A Development Tool For Multilingual Ontology-based Conceptual Dictionaries
B. Hughes, T. Baldwin, S. Bird, J. Nicholson, A. Mackinlay Reconsidering Language Identification for Written Language Resources
X. Ma Champollion: A Robust Parallel Text Sentence Aligner
Session O18-M Multimodal Corpora Annotation and Tools :  
J. Niekrasz, J. Gruenstein NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
K. Pastra Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms
N. Bernsen, L. Dybkjær, S. Kiilerich H. C. Andersen Conversation Corpus
D. Sonntag, M. Romanelli A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Session O19-GW Annotation and Annotation Tools :  
A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado SALTO - A Versatile Multi-Level Annotation Tool
M. Balasubramanya, M. Higgins, P. Lucas, J. Senn, D. Widdows Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources
K. Erk, S. Pado SHALMANESER - A Toolchain For Shallow Semantic Parsing
M. Yaseen, M. Attia, B. Maegaard, K. Choukri, N. Paulsson, S. Haamid, S. Krauwer, C. Bendahman, H. Fersøe, M. Rashwan, B. Haddad, C. Mukbel, A. Mouradi, A. Al-kufaishi, M. Shahin, N. Chenfour, A. Ragheb Building Annotated Written and Spoken Arabic LR’s in NEMLAR Project
Session O20-W Multilingual Corpora :  
S. Sharoff A Uniform Interface to Large-Scale Linguistic Resources
D. Ciobanu, T. Hartley, S. Sharoff Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
A. Feldman, J. Hana, C. Brew A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources
J. Kuhn, M. Jellinghaus Multilingual parallel treebanking: a lean and flexible approach
K. Miller, M. Vanni, O. Rambow, B. Dorr, D. Farwell, R. Green, N. Habash, S. Helmreich, E. Hovy, L. Levin, K. Miller, T. Mitamura, F. Reeder, A. Siddharthan Parallel Syntactic Annotation of Multiple Languages
J. Granfeldt, P. Nugues, M. Ågren, J. Thulin, E. Persson, S. Schlyter CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Session O21-TEW Structuring Terminology :  
C. Jouis Hierarchical Relationships "is-a": Distinguishing Belonging, Inclusion and Part/of Relationships
O. Ferret Building a network of topical relations from a corpus
A. Meur, M. Derouin Lemma-oriented dictionaries, concept-oriented terminology and translation memories
Y. Chou, C. Huang Hantology - A Linguistic Resource for Chinese Language Processing and Studying
J. Couturier, S. Neuvel, P. Drouin Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
B. Alex, M. Nissim, C. Grover The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Session O22-GW Infrastructures, Methodologies and Standards :  
B. Hughes Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community
D. Broeder, A. Claus, F. Offenga, R. Skiba, P. Trilsbeek, P. Wittenburg LAMUS - the Language Archive Management and Upload System
M. Gavrilidou, P. Labropoulou, S. Piperidis, V. Giouli, N. Calzolari, M. Monachini, C. Soria, K. Choukri Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
F. Sasaki Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
N. Ide, K. Suderman Integrating Linguistic Resources: The American National Corpus Model
P. Wittenburg, D. Broeder, W. Klein, S. Levinson, L. Romary Foundations of Modern Language Resource Archives
Session O23-SG Speech Corpora & Annotation :  
A. Bies, S. Strassel, H. Lee, K. Maeda, S. Kulick, Y. Liu, M. Harper, M. Lease Linguistic Resources for Speech Parsing
K. Uchimoto, R. Hamabe, T. Maruyama, K. Takanashi, T. Kawahara, H. Isahara Dependency-structure Annotation to Corpus of Spontaneous Japanese
O. Baude, M. Jacobson, A. Tchobanov, R. Walter Interoperability of audio corpora: the case of the French corpora
M. Voghera, F. Cutugno An observatory on Spoken Italian linguistic resources and descriptive standards
A. Sandrelli, C. Bendazzoli Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)
K. Maeda, C. Cieri, K. Walker Low-cost Customized Speech Corpus Creation for Speech Technology Applications
Session O24-EW Evaluation of Information Retrieval :  
A. Abdelsapor, N. Adly, K. Darwish, O. Emam, W. Magdy, M. Nagi Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents
H. Dalianis, B. Jongejan Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser
L. Ku, Y. Liang, H. Chen Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks
A. Fujii, M. Iwayama, N. Kando Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop
C. Peters The Impact of Evaluation on Multilingual Information Retrieval System Development
S. Tongchim, P. Srichaivattana, V. Sornlertlamvanich, H. Isahara Blind Evaluation for Thai Search Engines
Session O25-WE Machine Translation and Evaluation :  
J. Giménez, E. Amigó IQMT: A Framework for Automatic Machine Translation Evaluation
A. Popescu-belis, P. Estrella, M. King, N. Underwood A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
D. Vilar, J. Xu, L. D'haro, H. Ney Error Analysis of Statistical Machine Translation Output
K. Uchimoto, N. Hayashida, T. Ishida, H. Isahara Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
A. Patry, F. Gotti, P. Langlais MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
A. Mauser, E. Matusov, H. Ney Training a Statistical Machine Translation System without GIZA++
Session O26-W Question Answering :  
G. Fliedner Towards Natural Interactive Question Answering
D. Buscaldi, P. Rosso Mining Knowledge from Wikipedia for the Question Answering task
V. Moriceau Language Challenges for Data Fusion in Question-Answering
L. Zhou, C. Lin, E. Hovy Summarizing Answers for Complicated Questions
S. Harabagiu, C. Human An Answer Bank for Temporal Inference
G. Marton, B. Katz Using Semantic Overlap Scoring in Answering TREC Relationship Questions
Session O27-G Large Programs, Data Centers and International Cooperation :  
S. Itahashi, C. Tseng, S. Nakamura Oriental COCOSDA: Past, Present and Future
B. Maegaard, J. Fenstad, L. Ahrenberg, K. Kvale, K. Mühlenbock, B. Heid KUNSTI - Knowledge Generation for Norwegian Language Technology
E. D'halleweyn, J. Odijk, L. Teunissen, C. Cucchiarini The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources
S. Chaudiron, J. Mariani Techno-langue: The French National Initiative for Human Language Technologies (HLT)
B. Maegaard, S. Krauwer, K. Choukri, L. Jørgensen The BLARK concept and BLARK for Arabic
C. Cieri, M. Liberman More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Session O28-SE ASR, Tools and Evaluation :  
M. Rayner, P. Bouillon, B. Hockey, N. Chatzichrisafis REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
T. Kato, T. Toda, H. Saruwatari, K. Shikano Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria
J. Mauclair, Y. Estève, S. Petit-renaud, P. Deléglise Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
D. Gibbon, S. Tseng Discourse functions of duration in Mandarin: resource design and implementation
J. Fiscus, J. Ajot, N. Radde, C. Laprun Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech
S. Burger, Z. Sloane, J. Yang Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages
Session O29-E Evaluation Semantics and Senses :  
R. Johansson, P. Nugues Construction of a FrameNet Labeler for Swedish Text
M. Sahlgren Towards pertinent evaluation methodologies for word-space models
S. Schulte Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
R. Passonneau Measuring Agreement on Set-valued Items (MASI) for Semantic and Pragmatic Annotation
A. Rumshisky, J. Pustejovsky Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora
R. Navigli Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
Session O30-W Parallel Corpora & MT :  
A. Eisele Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries
Y. Nitta, M. Saraki, S. Ikehara Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
S. Hasan, A. Isbihani, H. Ney Creating a Large-Scale Arabic to French Statistical Machine Translation System
X. Ma, C. Cieri Corpus Support for Machine Translation at LDC
Session O31-W Corpus Construction & Annotation :  
O. Medelyan, S. Schulz, J. Paetzold, M. Poprat, K. Markó Language Specific and Topic Focused Web Crawling
D. Tufis, E. Irimia RoCo-News: A Hand Validated Journalistic Corpus of Romanian
C. Grover, R. Tobin Rule-Based Chunking and Reusability
E. Hajicova, P. Sgall Corpus Annotation as a Test of a Linguistic Theory
Session O32-EW Coreference, Summarisation and Evaluation :  
R. Catizone, A. Dalli, Y. Wilks Evaluating Automatically Generated Timelines from the Web
O. Postolache, D. Cristea, C. Orasan Transferring Coreference Chains through Word Alignment
O. Uryupina Coreference Resolution with and without Linguistic Knowledge
E. Hovy, C. Lin, L. Zhou, J. Fukumoto Automated Summarization Evaluation with Basic Elements
Session O33-EMT Evaluation of Multimodality & Dialogue :  
J. Soler, P. Cerezo, C. Ávila, D. Merino Usability evaluation of 3G multimodal services in Telefónica Móviles España
H. Dybkjær, L. Dybkjær Act-Topic Patterns for Automatically Checking Dialogue Models
D. Mostefa, M. Garcia, K. Choukri Evaluation of multimodal components within CHIL
H. Bunt Dimensions in Dialogue Act Annotation
Session O34-WE Morphology & Tagging :  
A. Novák Morphological Tools for Six Small Uralic Languages
J. Vaneyghen, G. Pauw, D. Compernolle, W. Daelemans A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
M. Mieskes, M. Strube Part-of-Speech Tagging of Transcribed Speech
B. Hughes, D. Gibbon, T. Trippel Feature-Based Encoding and Querying Language Resources with Character Semantics
Session O35-T Terminology & Knowledge Acquisition :  
W. Mustafa el Hadi, I. Timimi, M. Dabbadie, K. Choukri, O. Hamon, Y. Chiao Terminological Resources Acquisition Tools: Towards a User-Oriented Evaluation Model
T. Charoenporn, C. Kruengkrai, T. Theeramunkong, V. Sornlertlamvanich, H. Isahara Word Knowledge Acquisition for Computational Lexicon Construction
M. Lafourcade Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
N. Okazaki, S. Ananiadou Clustering acronyms in biomedical text for disambiguation
Session O36-W Semantically Annotated Corpora :  
B. Magnini, E. Pianta, C. Girardi, M. Negri, L. Romano, M. Speranza, V. Lenzi, R. Sprugnoli I-CAB: the Italian Content Annotation Bank
A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado The SALSA Corpus: a German corpus resource for lexical semantics
H. Papageorgiou, E. Desipri, M. Koutsombogera, K. Pouli, P. Prokopidis Adding multi-layer semantics to the Greek Dependency Treebank
E. Agirre, I. Aldezabal, J. Etxeberria, E. Pociello A Preliminary Study for Building the Basque PropBank
Session O37-S Lexicon and Pronunciation :  
J. Gros, V. Cvetko-orešnik, P. Jakopin, A. Mihelic SI-PRON: a Pronunciation Lexicon for Slovenian
S. Schaden, U. Jekosch “Casselberveetovallarga” and Other Unpronounceable Places: The CrossTowns Corpus
D. Graff, T. Buckwalter, M. Maamouri, H. Jin Lexicon Development for Varieties of Spoken Colloquial Arabic
T. Pellegrini, L. Lamel Experimental detection of vowel pronunciation variants in Amharic
Session O38-W Lexicon Syntax and Semantics :  
M. Slavcheva Semantic Descriptors: The Case of Reflexive Verbs
A. Korhonen, Y. Krymolowski, T. Briscoe A Large Subcategorization Lexicon for Natural Language Processing Applications
P. Saint-dizier PrepNet: a Multilingual Lexical Description of Prepositions
K. Kipper, A. Korhonen, N. Ryant, M. Palmer Extending VerbNet with Novel Verb Classes
Session O39-W Lexicons, Semantics, Clustering Tools :  
M. Islam, D. Inkpen Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
D. Widdows, A. Toumouh, B. Dorow, A. Lehireche Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain
M. Maragoudakis, K. Kermanidis, A. Garbis, N. Fakotakis Dealing with Imbalanced Data using Bayesian Techniques
B. Medlock An Introduction to NLP-based Textual Anonymisation
R. Marinelli, R. Bindi Proper Names and Linguistic Dynamics
V. Rus, A. Graesser The Look and Feel of a Confident Entailer
Session O40-T Creating Terminologies :  
M. L’homme, H. Bae A Methodology for Developing Multilingual Resources for Terminology
G. Nenadic, N. Okazaki, S. Ananiadou Towards a terminological resource for biomedical text mining
B. Dobrov, N. Loukachevitch Development of Linguistic Ontology on Natural Sciences and Technology
R. Melz, P. Ryu, K. Choi Compiling large language resources using lexical similarity metrics for domain taxonomy learning
W. Peters, M. Sagri, D. Tiscornia, S. Castagnoli The LOIS Project
S. Anstein, G. Kremer, U. Reyle Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Session O41-M Emotions :  
C. Clavel, I. Vasilescu, L. Devillers, T. Ehrette, G. Richard Fear-type emotions of the SAFE Corpus: annotation issues
L. Devillers, R. Cowie, J. Martin, E. Douglas-cowie, S. Abrilian, M. Mcrorie Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
K. Laskowski, S. Burger Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
D. Reidsma, D. Heylen, R. Ordelman Annotating Emotion in Meetings
T. Vogt, E. André Improving Automatic Emotion Recognition from Speech via Gender Differentiation
J. Martin, G. Caridakis, L. Devillers, K. Karpouzis, S. Abrilian Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Session O42-EW Question Answering Evaluation :  
L. Gillard, P. Bellot, M. El-bèze Question Answering Evaluation Survey
J. Ko, L. Hiyakumoto, E. Nyberg Exploiting Multiple Semantic Resources for Answer Selection
H. Shima, M. Wang, F. Lin, T. Mitamura Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering
F. Lacatusu, A. Hickl, S. Harabagiu Impact of Question Decomposition on the Quality of Answer Summaries
B. Magnini, D. Giampicollo, L. Aunimo, C. Ayache, P. Osenova, A. Peñas, M. Rijke, B. Sacaleanu, D. Santos, R. Sutcliffe The Multilingual Question Answering Track at CLEF
C. Ayache, B. Grau, A. Vilnat EQueR: the French Evaluation campaign of Question-Answering Systems
Session P1-W : Anaphora & Coreference  
A. Nøklestad, Ø. Reigem, C. Johansson Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
L. Hasler, C. Orasan, K. Naumann NPs for Events: Experiments in Coreference Annotation
T. Caselli, I. Prodanof Annotating Bridging Anaphors in Italian: in Search of Reliability
D. Goecke, A. Witt Exploiting logical document structure for anaphora resolution
H. Suzuki, G. Kacmarcik RefRef: A Tool for Viewing and Exploring Coreference Space
W. Gegg-harrison, D. Byron PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
M. Poesio, M. Kabadjov, P. Goux, U. Kruschwitz, E. Bishop, L. Corti An Anaphora Resolution-Based Anonymization Module
G. Puscasu, R. Mitkov If it were then, then when was it? Establishing the anaphoric role of then
Session P2-W : Corpora & LR Infrastructures  
D. Kokkinakis Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience
N. Oostdijk, L. Boves User requirements analysis for the design of a reference corpus of written Dutch
C. Onelli, D. Poietti, C. Seidenari, F. Tamburini The DiaCORIS project: a diachronic corpus of written Italian
D. Santos, S. Inácio Annotating COMPARA, a Grammar-aware Parallel Corpus
D. Guthrie, B. Allison, W. Liu, L. Guthrie, Y. Wilks A Closer Look at Skip-gram Modelling
A. Chalamandaris, A. Protopapas, P. Tsiakoulis, S. Rapti All Greek to me! An automatic Greeklish to Greek transliteration system
Session P3-W : Machine Translation  
A. Patel, D. Radev Lexical similarity can distinguish between automatic and manual translations
O. Bojar, M. Prokopova Czech-English Word Alignment
L. Cyrus Building a resource for studying translation shifts
A. Wu, K. Lowery A Hebrew Tree Bank Based on Cantillation Marks
S. Oepen, J. Lønning Discriminant-Based MRS Banking
I. Kruijff-korbayová, K. Chvatalova, O. Postolache Annotation Guidelines for Czech-English Word Alignment
H. Kaji, M. Watanabe Automatic Construction of Japanese WordNet
R. Rapp, C. Vide Example-Based Machine Translation Using a Dictionary of Word Pairs
B. Schrader Non-probabilistic alignment of rare German and English nominal expressions
M. Popovic, H. Ney POS-based Word Reorderings for Statistical Machine Translation
V. Vandeghinste, I. Schuurman, M. Carl, S. Markantonatou, T. Badia METIS-II: Machine Translation for Low Resource Languages
R. Ion, A. Ceauşu, D. Tufiş Dependency-Based Phrase Alignment
M. Miháltz, G. Pohl Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation
A. Ceauşu, D. Ştefănescu, D. Tufiş Acquis Communautaire Sentence Alignment using Support Vector Machines
Session P4-W : Question Answering, Summarisation, Classification  
J. Civera, A. Juan Bilingual Machine-Aided Indexing
S. Verberne, L. Boves, N. Oostdijk, P. Coppen Data for question answering: The case of why
H. Saggion Multilingual Multidocument Summarization Tools and Evaluation
H. Saggion, R. Gaizauskas Language Resources for Background Gathering
Wei. Li, Wen. Li, Q. Lu Mining Implicit Entities in Queries
F. Bertagna Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons
R. Sauri, M. Verhagen, J. Pustejovsky SlinkET: A Partial Modal Parser for Events
Session P5-W : Learning and Acquisition  
V. Nováček, P. Smrž, J. Pomikálek Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
D. Kawahara, S. Kurohashi Case Frame Compilation from the Web using High-Performance Computing
B. Sagot, L. Clément, E. Villemonte de la Clergerie, P. Boullier The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
K. Kanzaki, Q. Ma, E. Yamamoto, H. Isahara Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives
B. Wellner, M. Vilain Leveraging Machine Readable Dictionaries in Discriminative Sequence Models
N. Bel, S. Espeja, M. Marimon New tools for the encoding of lexical data extracted from corpus
Y. Ohishi, K. Itou, K. Takeda, A. Fujii Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Session P6-WT : Corpora: Creation, Annotation  
M. Pazienza, M. Pennacchiotti, F. Zanzotto Mixing WordNet, VerbNet and PropBank for studying verb relations
J. Apresjan, I. Boguslavsky, B. Iomdin, L. Iomdin, A. Sannikov, V. Sizov A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
N. Xue Annotating the Predicate-Argument Structure of Chinese Nominalizations
S. Džeroski, T. Erjavec, N. Ledinek, P. Pajas, Z. Žabokrtsky, A. Žele Towards a Slovene Dependency Treebank
J. Nivre, J. Nilsson, J. Hall Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation
R. Bernardi, A. Bolognesi, C. Seidenari, F. Tamburini POS tagset design for Italian
T. Ohta, Y. Tateisi, J. Kim, A. Yakushiji, J. Tsujii Linguistic and Biological Annotations of Biological Interaction Events
N. Areta, A. Gurrutxaga, I. Leturia, Z. Polin, R. Saiz, I. Alegria, X. Artola, A. Diaz de Ilarraza, N. Ezeiza, A. Sologaistoa, A. Soroa, A. Valverde Structure, Annotation and Tools in the Basque ZT Corpus
N. Masaki, I. Hiroshi, H. Taiichi, T. Takenobu A new approach to syntactic annotation
Y. Matsumoto, M. Asahara, K. Hashimoto, Y. Tono, A. Ohtani, T. Morita An Annotated Corpus Management Tool: ChaKi
Y. Xia, K. Wong, W. Li Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
J. Jung, M. Miyake, H. Akam Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network
V. Hoste, G. Pauw KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
F. Barreto, A. Branco, E. Ferreira, A. Mendes, M. Nascimento, F. Nunes, J. Silva Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
H. Bunt, A. Schiffrin Methodological Aspects of Semantic Annotation
Session P7-T : Terminology & Ontologies  
M. Lin, H. Chen Constructing a Named Entity Ontology from Web Corpora
K. Ahmad, M. Musacchio, G. Palumbo Ontological and Terminological Commitments and the Discourse of Specialist Communities
L. Lesmo, L. Robaldo From Natural Language to Databases via Ontologies
A. Simões, J. Almeida T2O - Recycling Thesauri into a Multilingual Ontology
L. Bordoni, T. Mazzoli Towards an Ontology for Art and Colours
M. Lundälv, K. Mühlenbock, B. Farre, A. Brännström SYMBERED - a Symbol-Concept Editing Tool
Y. Kiyota, H. Nakagawa A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
H. Brugman, V. Malaisé, L. Gazendam A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
T. Declerck, A. Pérez, O. Vela, Z. Gantner, D. Manzano-macho Multilingual Lexical Semantic Resources for Ontology Translation
A. Estellés, A. Alcina, V. Soler Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology
L. Sarmento, B. Maia, D. Santos, A. Pinto, L. Cabral Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Y. Senda, Y. Sinohara, M. Okumura Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Session P8-E : Evaluation of Question Answering & Summarisation  
A. Moschitti, R. Basili A Tree Kernel approach to Question and Answer Classification in Question Answering Systems
I. Cramer, J. Leidner, D. Klakow Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
L. Costa, L. Sarmento Component Evaluation in a Question Answering System
B. Grau, A. Ligozat, I. Robba, A. Vilnat, L. Monceau FRASQUES : A Question Answering system in the EQueR evaluation campaign
T. Akiba Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue
M. Bilotti, E. Nyberg Evaluation for Scenario Question Answering Systems
M. Hassel, J. Sjöbergh Towards Holistic Summarization - Selecting Summaries, Not Sentences
C. Orasan, L. Hasler Computer-aided summarisation - what the user really wants
Session P9-M : Multimodal Tools  
P. Berck, A. Russel ANNEX - a web-based Framework for Exploiting Annotated Media Resources
P. Wittenburg, H. Brugman, A. Russel, A. Klassmann, H. Sloetjes ELAN: a Professional Framework for Multimodality Research
A. Popescu-belis, M. Georgescul TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
F. Behrens, J. Milde The Eclipse Annotator: an extensible system for multimodal corpus creation
K. Maeda, H. Lee, J. Medero, S. Strassel A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
K. Ahmad, C. Bennett, T. Oliver Visual Surveillance and Video Annotation and Description
Session P10-S : Speech Corpora & Annotation  
N. Grønnum DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus
Y. Irie, S. Matsubara, N. Kawaguchi, Y. Yamaguchi, Y. Inagaki Layered Speech-Act Annotation for Spoken Dialogue Corpus
T. Ohno, S. Matsubara, H. Kashioka, N. Kato, Y. Inagaki A Syntactically Annotated Corpus of Japanese Spoken Monologue
I. Chiari Slips and errors in spoken data transcription
D. Gibbon, F. Fernandes, T. Trippel A BLARK extension for temporal annotation mining
P. Paggio Annotating Information Structure in a Corpus of Spoken Danish
T. Tan, L. Besacier A French Non-Native Corpus for Automatic Speech Recognition
A. Bogacka, K. Dziubalska-kołaczyk, G. Krynicki, D. Pietrala, M. Wypych General and Task-Specific Corpus Resources for Polish Adult Learners of English
A. Zgank, D. Verdonik, A. Markus, Z. Kacic SINOD - Slovenian non-native speech database
V. Alabau, C. Martinez Bilingual speech corpus in two phonetically similar languages
M. Kaiser, H. Mögele, F. Schiel Bikers Accessing the Web: The SmartWeb Motorbike Corpus
A. Moreno, A. Febrer, L. Márquez Generation of Language Resources for the Development of Speech Technologies in Catalan
J. Benedí, E. Lleida, A. Varona, M. Castro, I. Galiano, R. Justo, I. López de Letona, A. Miguel Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
S. Rosset, S. Petel The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus
H. Boril, T. Boril, P. Pollák Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
E. Westerhout, P. Monachesi A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)
R. Savy, F. Cutugno, C. Crocco Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb)
Session P11-WME : Computational Lexicons  
F. Vriend, L. Boves, H. Heuvel, R. Hout, J. Kruijsen, J. Swanenberg A Unified Structure for Dutch Dialect Dictionary Data
M. Mangeot, A. Chalvin Dictionary Building with the Jibiki Platform: the GDEF case
V. Trón, P. Halácsy, P. Rebrus, A. Rung, P. Vajda, E. Simon Hungarian lexical database and morphological grammar
B. Cartoni Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes
T. Erjavec, T. Fišer Building Slovene WordNet
S. Schulz, K. Markó, P. Daumke, U. Hahn, S. Hanser, P. Nohama, R. Andrade, E. Pacheco, M. Romacker Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MORPHOSAURUS Subword Lexicon
S. Nimb LEXADV - a multilingual semantic Lexicon for Adverbs
C. Krstev, R. Stanković, D. Vitas, I. Obradović WS4LR: A Workstation for Lexical Resources
A. Geyken, N. Schrader LexikoNet - a lexical database based on type and role hierarchies
J. Pustejovsky, C. Havasi, J. Littman, A. Rumshisky, M. Verhagen Towards a Generative Lexical Resource: The Brandeis Semantic Ontology
T. Kanamaru, M. Murata, H. Isahara Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning
A. Roventini Linking Verbal Entries of Different Lexical Resources
N. Ruimy Merging two Ontology-based Lexical Resources
S. Bosch, L. Pretorius, J. Jones Towards machine-readable lexicons for South African Bantu languages
M. Lopatkova, Z. Zabokrtsky, K. Skwarska Valency Lexicon of Czech Verbs: Alternation-Based Model
G. Vetulani, Z. Vetulani, T. Obrębski Syntactic Lexicon of Polish Predicative Nouns
R. Manurung, D. O’mara, H. Pain, G. Ritchie, A. Waller Building a Lexical Database for an Interactive Joke-Generator
Session P12-W : Named Entity  
C. Sporleder, M. Erp, T. Porcelijn, A. Bosch, P. Arntzen Identifying Named Entities in Text Databases from the Natural History Domain
P. Lucas, M. Balasubramanya, D. Widdows, M. Higgins The Information Commons Gazetteer: A Public Resource of Populated Places and Worldwide Administrative Divisions
P. Mazur, R. Dale Named Entity Extraction with Conjunction Disambiguation
U. Schäfer OntoNERdIE - Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources
Session P13-W : Corpora  
V. Tablan, W. Peters, D. Maynard, H. Cunningham Creating Tools for Morphological Analysis of Sumerian
C. Benzmüller, H. Horacek, H. Lesourd, I. Kruijff-korbayova, M. Schiller, M. Wolska A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
C. Bosco, V. Lombardo Comparing linguistic information in treebank annotations
I. Langkilde-geary, J. Betteridge A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation
A. Nazarenko, E. Alphonse, J. Derivière, T. Hamon, G. Vauvert, D. Weissenbacher The ALVIS Format for Linguistically Annotated Documents
L. Sarmento BACO - A large database of text and co-occurrences
M. Nascimento, J. Gonçalves, L. Pereira, A. Estrela, A. Pereira, R. Santos, S. Oliveira The African Varieties of Portuguese: Compiling Comparable Corpora and Analysing Data-derived Lexicon
L. Dinu, A. Dinu On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
U. Quasthoff, M. Richter, C. Biemann Corpus Portal for Search in Monolingual Corpora
I. Berlocher, K. Huh, E. Laporte, J. Nam Morphological annotation of Korean with Directly Maintainable Resources
A. Bosch, I. Schuurman, V. Vandeghinste Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
G. Noord, I. Schuurman, V. Vandeghinste Syntactic Annotation of Large Corpora in STEVIN
M. Wong Skeleton Parsing in Chinese: Annotation Scheme and Guidelines
K. Tenfjord, P. Meurer, K. Hofland The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language
A. Cole Corpus Development and Publication
M. Sousa, T. Trippel Building a historical corpus for Classical Portuguese: some technological aspects
T. Obrebski, M. Stolarski UAM Text Tools - a flexible NLP architecture
Session P14-GW : General Issues and Large Programs  
R. Ivanovska-naskova Development of the First LRs for Macedonian: Current Projects
T. Wandmacher, J. Antoine Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
N. Cucurullo, S. Montemagni, M. Paoli, E. Picchi, E. Sassolini Dialectal resources on-line: the ALT-Web experience
M. Monachini, N. Calzolari, K. Choukri, J. Friedrich, G. Maltese, M. Mammini, J. Odijk, M. Ulivieri Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
F. Calzolari, E. Sassolini, M. Sassi, S. Cucurullo, E. Picchi, F. Bertagna, A. Enea, M. Monachini, C. Soria, N. Calzolari Next Generation Language Resources using Grid
M. Kemps-snijders, M. Nederhof, P. Wittenburg LEXUS, a web-based tool for manipulating lexical resources
F. Offenga, D. Broeder, P. Wittenburg, J. Ducret, L. Romary Metadata Profile in the ISO Data Category Registry
H. Uszkoreit, F. Xu, J. Steffen, I. Aslan The pragmatic combination of different cross-lingual resources for multilingual information services
I. Maks, B. Boelhouwer Exploring opportunities for comparability and enrichment by linking lexical databases
Session P15-W : Multiwords & Collocations  
R. Xu, Q. Lu, S. Li The Design and Construction of A Chinese Collocation Bank
L. Wanner, M. Ramos Local Document Relevance Clustering in IR Using Collocation Information
S. Cinkova, P. Pecina, P. Podvesky, P. Schlesinger Semi-automatic Building of Swedish Collocation Lexicon
N. Grégoire Elaborating the parameterized Equivalence Class Method for Dutch
A. Mendes, S. Antunes, M. Nascimento, J. Casteleiro, L. Pereira, T. Sá COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
W. Anderson, P. Kotzé Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho
Session P16-T : Terminology Multiwords & Collocations  
M. Brekke, K. Innselset, M. Kristiansen, K. Øvsthus KB-N: Automatic Term Extraction from Knowledge Bank of Economics
P. Alessandro, F. Marco, M. Massimo Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
H. Hjelm Extraction of Cross Language Term Correspondences
J. Ritz, U. Heid Extraction tools for collocations and their morphosyntactic specificities
R. Kongkachandra, K. Chamnongthai Semantic-Based Keyword Recovery Function for Keyword Extraction System
Session P17-E : Evaluation: annotation, acquisition, alignment, recognition  
P. Wagacha, G. Pauw, P. Githinji A Grapheme-Based Approach for Accent Restoration in Gikuyu
J. Semecký On Automatic Assignment of Verb Valency Frames in Czech
K. Kageura, G. Kikui A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)
R. Passonneau, N. Habash, O. Rambow Inter-annotator Agreement on a Multilingual Semantic Annotation Task
G. Szarvas, R. Farkas, L. Felföldi, A. Kocsor, J. Csirik A highly accurate Named Entity corpus for Hungarian
E. Bick Turning a Dependency Treebank into a PSG-style Constituent Treebank
M. Ueyama Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval
D. Kokkinakis, D. Dannélls Recognizing Acronyms and their Definitions in Swedish Medical Texts
Y. Chiao, O. Kraif, D. Laurent, T. Nguyen, N. Semmar, F. Stuck, J. Véronis, W. Zaghouani Evaluation of multilingual text alignment systems: the ARCADE II project
M. Salmenkivi Finding representative sets of dialect words for geographical regions
D. Santos, N. Seco, N. Cardoso, R. Vilela HAREM: An Advanced NER Evaluation Contest for Portuguese
Session P18-M : Multimodal Corpora and Annotation  
J. Maas, B. Wrede BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction
J. Segouat, A. Braffort, E. Martin Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data
J. Bungeroth, D. Stein, P. Dreuw, M. Zahedi, H. Ney A German Sign Language Corpus of the Domain Weather Report
S. Abrilian, L. Devillers, J. Martin Annotation of Emotions in Real-Life Video Interviews: Variability between Coders
G. Lechenadec, V. Maffiolo, N. Chateau, J. Colletta Creation of a Corpus of Multimodal Spontaneous Expressions of Emotions in Human-Machine Interaction
P. Strauß, H. Hoffman, W. Minker, H. Neumann, G. Palm, S. Scherer, F. Schwenker, H. Traue, W. Walter, U. Weidenbacher Wizard-of-Oz Data Collection for Perception and Interaction in Multi-User Environments
I. Kruijff-korbayova, T. Becker, N. Blaylock, C. Gerstenberger, M. Kaisser, P. Poller, V. Rieser, J. Schehl The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Session P19-E : Evaluation : Speech  
J. Ko, F. Murase, T. Mitamura, E. Nyberg, M. Tateishi, I. Akahori Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
A. Raake, B. Katz SUS-based Method for Speech Reception Threshold Measurement in French
P. Mareüil, C. D'Alessandro, A. Raake, G. Bailly, M. Garcia, M. Morel A joint intelligibility evaluation of French text-to-speech systems: the EvaSy SUS/ACR campaign
M. Przybocki, G. Sanders, A. Le Edit Distance: A Metric for Machine Translation Evaluation
G. Hodász Evaluation Methods of a Linguistically Enriched Translation Memory System
J. Laoudi, C. Tate, C. Voss Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding
H. Bonneau-maynard, C. Ayache, F. Bechet, A. Denis, A. Kuhn, F. Lefevre, D. Mostefa, M. Quignard, S. Rosset, C. Servan, J. Villaneau Results of the French Evalda-Media evaluation campaign for literal understanding
B. Andrassy, H. Hoege Human and machine recognition as a function of SNR
J. Hoidekr, J. Psutka, A. Pražák, J. Psutka Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries
N. Bernsen, T. Hansen, S. Kiilerich, T. Madsen Field Evaluation of a Single-Word Pronunciation Training System
R. Banchs, A. Bonafonte, J. Pérez Acceptance Testing of a Spoken Language Translation System
E. Coussé, S. Gillis Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
D. Buehler, W. Minker Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
J. Soler, P. Cerezo, D. Merino, A. Sánchez MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España
Session P20-S : Speech Corpora for TTS Systems and Emotion and Speech tools  
D. Braga, L. Coelho, J. Teixeira, D. Freitas ProGmatica: a Prosodic and Pragmatic Database for European Portuguese
I. Amdal, T. Svendsen FonDat1: A Speech Synthesis Corpus for Norwegian
M. Umbert, A. Moreno, P. Agüero, A. Bonafonte Spanish Synthesis Corpora
H. Mögele, M. Kaiser, F. Schiel SmartWeb UMTS Speech Data Collection - The SmartWeb Handheld Corpus
C. Draxler, K. Jänsch Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation
Z. Huang, L. Chen, M. Harper An Open Source Prosodic Feature Extraction Tool
B. Tsou, T. Lai, K. Sin, L. Cheung Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort
I. Saratxaga, E. Navas, I. Hernáez, I. Aholab Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Session P21-W : Corpora, Lexicons & Multilinguality  
B. Megyesi, A. Hein, É. Johanson Building a Swedish-Turkish Parallel Corpus
T. Erjavec The English-Slovene ACQUIS corpus
R. Steinberger, B. Pouliquen, A. Widiger, C. Ignat, T. Erjavec, D. Tufiş, D. Varga The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
B. Maegaard, L. Offersgaard, L. Henriksen, H. Jansen, X. Lepetit, C. Navarretta, C. Povlsen The MULINCO corpus and corpus platform
J. Tiedemann ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts
E. Guevara, S. Scalise, A. Bisetto, C. Melloni MORBO/COMP: a multilingual database of compound words
A. Sansò Documenting variation across Europe and the Mediterranean: The Pavia Typological Database
S. Cinková From PropBank to EngValLex: Adapting PropBank-Lexicon to the Valency Theory of the Functional Generative Description
D. Samy, A. Sandoval, J. Guirao, E. Alfonseca Building a Parallel Multilingual Corpus (Arabic-Spanish-English)
W. Ma, C. Huang Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
M. Geilfuss, J. Milde SAM - an annotation editor for parallel texts
Session P22-W : Tools for Corpora and Lexicons  
J. Kinoshita, L. Salvador, C. Menezes CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus
P. Nemec Tree Searching/Rewriting Formalism
P. Le, T. Nguyen, L. Romary, A. Roussanaly A Lexicalized Tree-Adjoining Grammar for Vietnamese
C. Rohrer, M. Forst Improving coverage and parsing quality of a large-scale LFG for German
T. Baldwin, S. Awab Open Source Corpus Analysis Tools for Malay
J. Nivre, J. Hall, J. Nilsson MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
B. Sagot, P. Boullier Deep non-probabilistic parsing of large corpora
S. Frota, M. Vigário, F. Martin FreP: An Electronic Tool for Extracting Frequency Information of Phonological Units from Portuguese Written Text
R. Levy, G. Andrew Tregex and Tsurgeon: tools for querying and manipulating tree data structures
U. Heid, E. Taljard, D. Prinsloo Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho
E. Dhonnchadha, J. Genabith A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation
P. Halácsy, A. Kornai, C. Oravecz, V. Trón, D. Varga Using a morphological analyzer in high precision POS tagging of Hungarian
M. Verhagen, R. Knippen, I. Mani, J. Pustejovsky Annotation of Temporal Relations with Tango
C. Havasi, J. Pustejovsky, M. Verhagen BULB: A Unified Lexical Browser
B. Waldron, A. Copestake, U. Schäfer, B. Kiefer Preprocessing and Tokenisation Standards in DELPH-IN Tools
Y. Kato, S. Matsubara, Y. Inagaki A Corpus Search System Utilizing Lexical Dependency Structure
C. Fairon, S. Paumier A framework for real-time dictionary updating
E. Alfonseca, A. Moreno-sandoval, J. Guirao, M. Ruiz-casado The wraetlic NLP suite
J. Atserias, B. Casas, E. Comelles, M. González, L. Padró, M. Padró FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Session P23-GW : Web services and Digital Archives and Libraries  
R. Bernardi, D. Calvanese, L. Dini, V. Tomaso, E. Frasnelli, U. Kugler, B. Plank Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
D. Broeder, F. Offenga, P. Wittenburg, P. Kamp, D. Nathan, S. Strömqvist Technologies for a Federation of Language Resource Archives
P. Berck, H. Bibiko, M. Kemps-snijders, A. Russel, P. Wittenburg Ontology-based Language Archive Utilization
M. Kemps-snijders, J. Ducret, L. Romary, P. Wittenburg An API for accessing the Data Category Registry
M. Boekestein, G. Depoorter, R. Veenendaal Functioning of the Centre for Dutch Language and Speech Technology
E. Suzuki, T. Suzuki, K. Kakihana On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language
Session P24-WM : Semantics, Ontologies, Semantic Web  
A. Oltramari LexiPass methodology: a conceptual path from frames to senses and back
P. Morărescu Principles for annotating and reasoning with spatial information
P. Buitelaar, P. Cimiano, S. Racioppa, M. Siegel Ontology-based Information Extraction with SOBA
E. Yamamoto, K. Kanzaki, H. Isahara Detection of inconsistencies in concept classifications in a large dictionary — Toward an improvement of the EDR electronic dictionary —
D. Dutoit Alexandria: A Powerful Multilingual Resource For web
A. Kawtrakul, C. Pechsiri, T. Permpool, D. Thamvijit, P. Sornprasert, C. Yingsaeree, M. Suktarachan Ontology Driven K-Portal Construction and K-Service Provision
P. Bouquet, L. Serafini, S. Zanobini The role of lexical resources in matching classification schemas
K. Simov, P. Osenova Shallow Semantic Annotation of Bulgarian
T. Declerck, M. Vela Generic NLP Tools for Supporting Shallow Ontology Building
D. Beermann, L. Hellan Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars
P. Filipe, N. Mamede A Framework to Integrate Ubiquitous Knowledge Modeling
Session P25-T : Terminology Lexicons & Tools  
H. Ozaku, A. Abe, K. Sagara, N. Kuwahara, K. Kogure Features of Terms in Actual Nursing Activities
A. Storrer, S. Wellinghoff Automated detection and annotation of term definitions in German text corpora
J. Feliu, J. Vivaldi, M. Cabré SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
C. Yirong, L. Qin, L. Wenjie, S. Zhifang, J. Luning A Study on Terminology Extraction Based on Classified Corpora
A. Mulloni, V. Pekar Automatic Detection of Orthographic Cues for Cognate Recognition
B. Tsou, O. Kwong Toward a Pan-Chinese Thesaurus
G. Pardelli, M. Sassi, S. Goggi, P. Orsolini Natural Language Processing: A Terminological And Statistical Approach
M. Itagaki, A. Aue, T. Aikawa Detecting Inter-domain Semantic Shift using Syntactic Similarity
Session P26-EW : Evaluation: Lexicons, Tools  
K. Smaïli, C. Lavecchia, J. Haton Linguistic features modeling based on Partial New Cache
N. Ruimy Structuring a Domain Vocabulary in a General Knowledge Environment
R. Passonneau, R. Blitz, D. Elson, A. Giral, J. Klavans CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project
C. Kruengkrai, V. Sornlertlamvanich, H. Isahara A Conditional Random Field Framework for Thai Morphological Analysis
K. Kuroda, M. Utiyama, H. Isahara Getting Deeper Semantics than Berkeley FrameNet with MSFA
B. Alex Integrating Language Knowledge Resources to Extend the English Inclusion Classifier to a New Language
M. Kouylekov, B. Magnini Building a Large-Scale Repository of Textual Entailment Rules
S. Schaden Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
Ž. Agić, M. Tadić Evaluating Morphosyntactic Tagging of Croatian Texts
U. Petersen Querying Both Parallel And Treebank Corpora: Evaluation Of A Corpus Query System
J. Medero, K. Maeda, S. Strassel, C. Walker An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
A. Sinopalnikova, P. Smrž Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
H. Halpin Automatic Evaluation and Composition of NLP Pipelines with Web Services
A. Barbu, E. Ionescu, V. Mititelu Romanian Valence Dictionary in XML Format
Session P27-E : Evaluation methodologies & Evaluation of Language Technologies  
A. Lisowska, N. Underwood Rote: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics
M. King, N. Underwood Evaluating Symbiotic Systems: the challenge
N. Underwood, A. Lisowska The Evolution of an Evaluation Framework for a Text Mining System
F. Ibekwe-sanjuan A task-oriented framework for evaluating theme detection systems: A discussion paper
L. Sitbon, P. Bellot Tools and methods for objective or contextual evaluation of topic segmentation
F. Zou, F. Wang, X. Deng, S. Han Evaluation of Stop Word Lists in Chinese Language
A. Shehata, F. Zanzotto A Dependency-based Algorithm for Grammar Conversion
N. Semmar, M. Laib, C. Fluhr A Deep Linguistic Analysis for Cross-language Information Retrieval
W. Abramowicz, A. Filipowska, J. Piskorski, K. Węcel, K. Wieloc Linguistic Suite for Polish Cadastral System
K. Ahmad, L. Gillam, D. Cheng Sentiments on a Grid: Analysis of Streaming News and Views
B. Pedersen Query Expansion on Compounds
D. Rojas, T. Aikawa Predicting MT Quality as a Function of the Source Language
Session P28-M : Multimodal: Methodologies & Natural Interaction  
M. Rajman, M. Ailomaa, A. Lisowska, M. Melichar, S. Armstrong Extending the Wizard of Oz Methodology for Language-enabled Multimodal Systems
S. Raidt, G. Bailly, F. Elisei Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?
R. Taib, N. Ruiz Tangible Objects for the Acquisition of Multimodal Interaction Patterns
Session P29-S : S2S Translation  
D. Verdonik, M. Rojc Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems
J. Pérez, A. Bonafonte GAIA: Common Framework for the Development of Speech Translation Technologies
H. Tohyama, S. Matsubara Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus
H. Heuvel, K. Choukri, C. Gollan, A. Moreno, D. Mostefa TC-STAR: New language resources for ASR and SLT purposes
Session P30-S : Speech: TTS & ASR  
B. Williams, R. Jones, I. Uemlianin Tools and resources for speech synthesis arising from a Welsh TTS project
H. Somers, G. Evans, Z. Mohamed Developing Speech Synthesis for Under-Resourced Languages by “Faking it”: An Experiment with Somali
D. Athanassia-lida, A. Chalamandaris Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations
H. Kawanami, T. Kitamura, K. Shikano Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
M. Bahrani, H. Sameti, N. Hafezi, H. Movassagh Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems
J. Talley Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
P. Ircing, J. Hoidekr, J. Psutka Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech