4 Nina Grønnum DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus 6 Andi Wu Kirk Lowery A Hebrew Tree Bank Based on Cantillation Marks 8 Stéphane CHAUDIRON and Joseph MARIANI Technolangue : The French National Initiative for Human Language Technologies (HLT) 9 Manny Rayner, Pierrette Bouillon, Beth Ann Hockey and Nikos Chatzichrisafis REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications 10 Anders Berglund, Richard Johansson, Pierre Nugues Extraction of Temporal Information from Texts in Swedish 11 Hlavacova Jaroslava New Approach to Frequency Dictionaries - Czech Example 12 Flora Ramírez Bustamante, Alfredo Arnaiz, Mar Ginés A Spell Checker for a World Language 14 Elliott Macklovitch TransType2: The last word 19 Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Iker Luengo Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque 23 Dimitrios Kokkinakis Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience 29 Noguchi Masaki, Ichikawa Hiroshi, Hashimoto Taiichi and Tokunaga Takenobu A new approach to syntactic annotation 33 Tien-Ping Tan, Laurent Besacier A French Non-Native Corpus for Automatic Speech Recognition 35 Andrea Sansò Documenting variation across Europe and the Mediterranean: The Pavia Typological Database 36 H. Movasagh, H. Sameti, N. Hafezi, M. Bahrani Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems 37 Shisanu Tongchim, Prapass Srichaivattana, Virach Sornlertlamvanich, Hitoshi Isahara Blind Evaluation for Thai Search Engines 39 Yuji Matsumoto, Masayuki Asahara, Kiyota Hashimoto, Yukio Tono, Akira Ohtani, and Toshio Morita An Annotated Corpus Management Tool: ChaKi 43 Christophe Jouis Hierarchical relation " is-a ": distinguishing Belonging, inclusion and part/of relationships. 44 Ivan Berlocher, Hyun-gue Huh, Eric Laporte, Jee-sun Nam Morphological annotation of Korean with Directly Maintainable Resources 45 Patrick Saint-Dizier PrepNet: a multilingual lexical description of prepositions 48 Marie-Claude L' Homme and Hee Sook Bae A Methodology for Developing Multilingual Resources for Terminology 49 Stephan Raidt, Gérard Beilly, Frédéric Elisei Does a virtual talking face generate proper multimodal cues to draw the user's attention to points of interest? 50 May Lai-Yin Wong Skeleton parsing in written Chinese: annotation scheme and guidelines 52 Constantin Orasan and Laura Hasler Computer-aided summarisation -- what the user really wants 54 Nianwen Xue Predicate-Argument structure annotation of nominalized verbs in Chinese 55 Kyo Kageura and Gen'ichiro Kikui A self-referring quantitative evaluation of the ATR basic travel expression corpus BTEC 58 Hiromi itoh Ozaku, Akinori Abe, Kaoru Sagara, Noriaki Kuwahara, Kiyoshi Kogure Features of terms in actual nursing activities 59 Nuno Seco, Diana Santos, Nuno Cardoso and Rui Vilela HAREM: An Advanced NER Evaluation Contest for Portuguese 60 Rafael Banchs, Antonio Bonafonte, Javier Pérez Acceptance Testing of a Spoken Language Translation System 64 Valentin Tablan, Wim Peters, Diana Maynard Creating Tools for Morphological Analysis of Sumerian 66 Ingunn Amdal and Torbjørn Svendsen FonDat1: A Speech Synthesis Corpus for Norwegian 67 Alon Itai, Shuly Wintner and Shlomo Yona A Computational Lexicon of Contemporary Hebrew 68 Ruska Ivanovska- Naskova Development of the first LRs for Macedonian: current projects 74 Reinhard Rapp, Carlos Martin Vide Example-based Machine Translation Using a Dictionary of Word Pairs 75 Widad MUSTAFA EL HADI, Ismail TIMIMI, Marianne DABBADIE, Khalid CHOUKRI, Yun-Chuang CHIAO, Olivier HAMON Terminological Resources Acquisition Tools: Towards a User-Oriented Evaluation Model 76 Núria Bel , Sergio Espeja , Montserrat Marimon New tools for the encoding of lexical data extracted from corpus 77 Daniela Braga, Luis Coelho, Joao Paulo Teixeira, Diamantino Freitas Progmatica: A Prosodic atabase for European Portuguese 79 Enrique Amigó, Jesús Giménez IQ T: A Framework for Machine Translation Evaluation 80 Tommaso Caselli & Irina Prodanof Annotating bridging anaphors in ItaliaN:in 3earch of reliability. 81 Henk van den Heuvel,Khalid Choukri, Christian Gollan, Asuncion Moreno, Djamel MosteFa TC-STAR: New language resources for ASR and SST purposes 86 Yunqing Xia, Kam-Fai Wong and Wenjie Li Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach 87 Baden Hughes Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community 88 Yasunori OHISHI, Katunobu ITOU, Kazuya TAKEDA, and Atsushi FUJII Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus 89 Catherine Havasi, James Pustejovsky, and Marc Verhagen BULB: A Unified Lexical Browser 91 Ulrich Schäfer, Daniel Beck Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components 92 Elena Grishina Spoken Russian in Russian National Corpus (RNC) 93 Paul Buitelaar, Philipp Cimiano, Stefania Racioppa, Melanie Siegel Ontology-based Information Extraction with SOBA 94 Dominique Dutoit Alexandria: a web 2.0 interface displaying linguistic data for the web semantic 96 Jan Frederik Maas, Britta Wrede BITT: A corpus for Topic Tracking evaluation on multimodal Human-Robot-Interaction 97 Hercules Dalianis, Bart Jongejan Hand-crafted versus machine-learned inflectional rules: The Euroling-SiteSeeker stemmer and CST's lemmatiser 99 Christian Rohrer & Martin Forst Improving coverage and parsing quality of a large-scale LFG for German 100 Bettina Schrader Non-probabilistic alignment of rare German and English nominal expressions 101 Paula Chesley, Susanne Salmon-Alt Automatic extraction of subcategorization frames for French 103 Yuki Irie, Shigeki Matsubara, Nobuo Kawaguchi, Yukiko Yamaguchi, Yasuyoshi Inagaki Layered Speech-Act Annotation for Spoken Dialogue Corpus 104 Khurshid Ahmad, Craig Bennett and Tim Oliver Visual Surveillance Annotation and Description 105 Mathieu MANGEOT and Antoine CHALVIN Dictionary Building with the Papillon Platform: the GDEF case 106 Tomohiro OHNO, Shigeki MATSUBARA, Hideki KASHIOKA, Naoto KATO, Yasuyoshi INAGAKI A Syntactically Annotated Corpus of Japanese Spoken Monologue 107 Špela Vintar, Gaël Dias, Tomaž Erjavec, Darja Fišer, Jerneja Žganec Gros Bilingual lexicon enrichment for a speech-to-speech translation system 111 Jerneja Žganec Gros, Varja Cvetko-Orešnik, Primož Jakopin, Aleš Mihelic SI-PRON: A Reference Pronunciation Lexicon for Slovenian 112 Silvie Cinková From PropBank to EngValLex: Adapting PropBank-Lexicon to the Valency Theory of Functional Generative Description 113 Tonio WANDMACHER, Jean-Yves ANTOINE Training language models without appropriate language resources : experiments with an augmentative and alternative communication system for disabled people 116 Brian Roark, Mary Harper, Eugene Charniak, Bonnie Dorr, Mark Johnson, Jeremy Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna Krasnyanskaya, Matt Lease, Izhak Shafran, Matt Snover, Robin Stewart, Lisa Yung SParseval: Evaluation Metrics for Parsing Speech 118 Ming-Shun Lin and Hsin-Hsi Chen Constructing a Named Entity Ontology from Web Corpora 119 Flora Ramírez Bustamante, Enrique López Díaz Spelling Error Patterns in Spanish for Word Processing Applications 123 Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano Non-audible murmur (NAM) recognition in clean and noisy environments using non-linear and human perception-based features 124 Christer Johansson, Anders Nøklestad, Øysten Reigem Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of co-reference chains. 125 Hitomi Tohyama, Shigeki Matsubara Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus 127 Shuichi ITAHASHI, Chiu-yu TSENG, Satoshi NAKAMURA Oriental COCOSDA: Past, Present and Future 128 Angelika Storrer, Sandra Wellinghoff Automated detection and annotation of term definitions in German text corpora 131 M.Yaseen, M. Atteya, K.Choukri, B.Haddad, B.Maegaard, N.Paulsson, S.Krauwer, C.Mokbel, H Fersøe, A Ali, C Bendahman Building Annotated Written Arabic Corpus Resources in NEMLAR Project 133 Sašo Džeroski, Tomaž Erjavec, Nina Ledinek, Petr Pajas, Zdenek Žabokrtsky, Andreja Žele Towards a Slovene Dependency Treebank 137 Canasai Kruengkrai, Virach Sornlertlamvanich, Hitoshi Isahara A Conditional Random Field Framework for Thai Morphological Analysis 138 Yoshihide Kato, Shigeki Matsubara and Yasuyoshi Inagaki A Corpus Search System Utilizing Lexical Dependency Structure 139 Peter Berck, Albert Russel ANNEX - a web-based framework for exploiting annotated media resources 140 Tomaž Erjavec The English-Slovene ACQUIS corpus 141 Daan Broeder, Andreas Claus, Freddy Offenga, Romuald Skiba, Paul Trilsbeek, Peter Wittenburg LAMUS - the Language Archive Management and Upload System 142 Daan Broeder, Freddy Offenga, Peter Wittenburg, Peter van der Kamp, David Nathan, Sven Strömqvist Technologies for a Federation of Language Resource Archives 146 Marc Kemps-Snijders, Julien Ducret, Laurent Romary, Peter Wittenburg An API for Accessing the ISO Data Category Registry 147 Peter Wittenburg, Daan Broeder, Wolfgang Klein, Stephen Levinson, Laurent Romary Foundations of Modern Language Resource Archives 148 Freddy Offenga, Daan Broeder, Peter WIttenburg, Julien Ducret, Laurent Romary The Metadata Profile in the ISO Data Category Registry 149 Marc Kemps-Snijders, Peter Wittenburg LEXUS - a web-based tool for manipulating lexical resources 150 Tomaž Erjavec, Darja Fišer Building Slovene Wordnet 151 Luciana Bordoni, Tiziana Mazzoli Towards an ontology for art and colours 152 Richard Johansson and Pierre Nugues Construction of a FrameNet Labeler for Swedish Text 153 Han Sloetjes, Albert Russel, Alex Klassmann, Peter Wittenburg, Hennie Brugman ELAN V2.4: a professional framework for multimodality research 154 Peter Berck, Hans-Jörg Bibiko, Marc Kemps-Snijders, Albert Russel, Peter Wittenburg Ontology-based Language Archive Utilization 162 Joakim Nivre, Johan Hall and Jens Nilsson MaltParser: A Data-Driven Parser-Generator for Dependency Parsing 163 Andrej Zgank, Darinka Verdonik, Aleksandra Zogling Markus, Zdravko Kacic SINOD - Slovenian non-native speech database 165 Mark van Assem, Aldo Gangemi, and Guus Schreiber Conversion of WordNet to a standard RDF/OWL representation 167 Antal van den Bosch, Ineke Schuurman, Vincent Vandeghinste Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development 168 Mark Przybocki, Gregory Sanders, Audrey Le Edit-Distance - A Metric for Machine Translation Evaluation 169 Niels Ole Bernsen, Laila Dybkjær and Svend Kiilerich H. C. Andersen Conversation Corpus 170 Magnus Sahlgren Towards a pertinent evaluation methodology for word-space models 171 Andrei Popescu-Belis, Paula Estrella, Margaret King, Nancy Underwood A model for context-based evaluation of language processing systems and its application to machine translation evaluation 172 Martin Forst & Ronald M. Kaplan The importance of precise tokenizing for deep grammars 174 Annalisa Sandrelli & Claudio Bendazzoli Tagging a corpus of interpreted speeches: the European Parliament Interpreting Corpus (EPIC) 178 Hisami Suzuki, Gary Kacmarcik RefRef: a Tool for Viewing and Exploring Coreference Space 179 Bernt Andrassy and Harald Hoege Evaluation of in-car ASR using an Articulation Index based WER(SNR)-measure 180 R. Manurung, D. O'Mara, H. Pain, G. Ritchie, A. Waller Building a Lexical Database for an Interactive Joke-Generator 182 Bruno Cartoni Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes. 184 Serge Sharoff A uniform interface to large-scale linguistic resources 186 Carlo Strapparava, Alessandro Valitutti, Oliviero Stock The Affective Weight of Lexicon 187 Agnes Lisowska, Nancy Underwood ROTE: a tool to support users in defining the relative importance of quality characteristics 190 Serge Sharoff, Bogdan Babych, Anthony Hartley Using collocations from comparable corpora to find translation equivalents 192 Folkert de Vriend, Jos Swanenberg, Joep Kruijsen, Henk van den Heuvel, Lou Boves & Roeland van Hout A Unified Structure for Dutch Dialect Dictionary Data 193 E. Uí Dhonnchadha & J. Van Genabith A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation 194 Véronique Moriceau Language Challenges for Data Fusion in Question-Answering 195 Luís Sarmento, Luís Cabral BACO - A large database of text and co-occurrences 196 Ulrich Schäfer OntoNERdIE -- Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources 197 Jonathan Fiscus, Jerome Ajot, Nicolas Radde, and Christophe Laprun Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech 198 J.Atserias, B.Casas, E.Comelles, M.Gonzàlez, L.Padró, M.Padró FreeLing 1.3: Syntactic and semantic services in an open-source NLP library 199 Jiri Semecky On Automatic Assignment of Verb Valency Frames in Czech 200 Ben Medlock An Introduction to NLP-based Textual Anonymisation 201 Hennie Brugman, Luit Gazendam, Véronique Malaisé A web based general thesaurus browser to support indexing of television and radio programs 203 Judit Feliu; Jorge Vivaldi; M. Teresa Cabré SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations 206 Irene Cramer, Jochen L. Leidner, Dietrich Klakow Building An Evaluation Corpus for German Question Answering by Harvesting Wikipedia 209 Roald Eiselen Suggestions on the evaluation of suggestion adequacy for spelling checkers 210 Gerhard Fliedner Towards Natural Interactive Question Answering 214 B. Waldron, A. Copestake, U. Schaefer, B. Kiefer Preprocessing and Tokenisation Standards in DELPH-IN Tools 216 Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, Victor Sizov A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects 218 Martin Hassel, Jonas Sjöbergh Towards Holistic Summarization - Selecting Summaries, Not Sentences 219 Stefan Schaden, Ute Jekosch Casselberveetovallarga and other unpronounceable places: The CrossTowns Corpus released 220 Dimitrios Kokkinakis, Dana Dannélls Recognizing Acronyms and their Definitions in Swedish Medical Texts 222 Lun-Wei Ku, Yu-Ting Liang and Hsin-Hsi Chen Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks 223 Joakim Nivre, Jens Nilsson and Johan Hall Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation 224 Oana Postolache and Dan Cristea and Constantin Orasan Transferring coreference chains through word alignment. 225 G.Tummarello, C.Morbidoni, F. Kepler, F. Piazza, P.Puliti Semantic Web based Textual Markup: a preliminary evaluation 227 Anna Bogacka A task-specific corpus resource for Polish adult learners of English 228 Olena Medelyan, Stefan Schulz, Jan Paetzold, Michael Poprat, Kornél Marcó Language Specific and Topic Focused Web Crawling 229 Martin Reynaert Corpus-Induced Corpus Clean-up 233 Andrei Popescu-Belis and Maria Georgescul TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface 234 Feng Pan, Rutu Mulkar, and Jerry R. Hobbs An Annotated Corpus of Typical Durations of Events 235 Agam Patel and Dragomir Radev Lexical similarity can distinguish between automatic and manual translations 236 Nick Campbell, Toshiyuki Sadanobu, Damien Douxchamps, Masataka Imura, Suzuki Noriko and Mayumi Bono A multimedia database of meetings and informal interactions for tracking participant involvement and discourse flow 237 Fu Lee Wang, Xiaotie Deng, Feng Zou Towards Unified Chinese Segmentation Algorithm 238 Doaa Samy, Antonio Moreno Sandoval, José M. Guirao, Enrique Alfonseca Building and Assessment of a Parallel Multilingual Corpus (Arabic-Spanish-English) 240 Doaa Samy and Antonio Moreno Sandoval XML-Based Arabic Part of Speech Tagger 241 Donna K. Byron and Eric Fosler-Lussier The OSU Quake 2004 corpus of two-party situated problem-solving dialogs 242 Md. Aminul Islam and Diana Inkpen Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words 243 Yoji Kiyota, Hiroshi Nakagawa A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames 244 Ronnie Taib, Natalie Ruiz Tangible Objects for the Acquisition of Multimodal Interaction Patterns 245 Václav Novák and Jan Hajic Perspectives of Turning Prague Dependency Treebank into a Knowledge Base 246 Jonas Granfeldt, Pierre Nugues, Emil Persson, Jonas Thulin, Malin Ågren, Suzanne Schlyter CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling 247 Hui Tan, Claire Cardie Annotating a multi-entity cross-document noun phrase coreference corpus: issues and preliminary results 248 Qian Yang, Jean-Pierre Martens, Nanneke Konings, Henk van den Heuvel Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names 249 Jaeyoung Jung, Maki Miyake, Hiroyuki Akama Recurrent Markov Cluster(RMCL)Algorithm for the Refinement of the Semantic Network 254 Catia Cucchiarini, Hugo Van hamme, Felix Smits JASMIN-CGN: Extension of the Spoken Dutch Corpus with speech of elderly people, children and non-natives in the human-machine interaction modality. 255 Cédrick Fairon, Sébastien Paumier A framework for real time dictionary updating 256 Vicente Alabau and Carlos D. Martínez Bilingual speech corpus in two phonetically similar languages 258 Vincent Vandeghinste, Ineke Schuurman, Michael Carl, Stella Markantonatou, Toni Badia METIS-II: Machine Translation for Low Resource Languages 259 Elisabeth D'Halleweyn, Jan Odijk, Lisanne Teunissen, Catia Cucchiarini The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources 261 Nuria GALA and André Valli A computational lexicon of French verbal subcategorization frames 262 Emiko Suzuki and Kyoko Kakihana On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language 266 Daniela Goecke, Andreas Witt Exploiting logical document structure and text type structure for anaphora resolution 269 Catalina Hallett, Donia Scott and Richard Power Textual rendering of complex post-coordinated clinical concepts from a medical ontology 270 Cédrick Fairon, Sébastien Paumier A translated corpus of 30,000 French SMS 273 Feng Zou, Fu Lee Wang, Xiaotie Deng, Song Han Evaluation of Stop Word List in Chinese Language 275 Anna Sinopalnikova and Pavel Smrz Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements 277 Florian Schiel, Moritz Kaiser, Hannes Mögele German SmartWeb UMTS Speech Data Collection 278 Žabokrtský, Z., Lopatková, M., Skwarska, K. Valency Lexicon of Czech Verbs: Alternation-Based Model 279 Darinka Verdonik, Matej Rojc Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems 280 Florian Schiel, Hannes Mögele, Moritz Kaiser Bikers Accessing the Web: The SmartWeb Motorbike Corpus 281 Asanee Kawtrakul Ontology Driven Knowledge Portal Construction 282 Dan Cristea, Corina Forascu, Ionut Cristian Pistol Temporality in relation with discourse structure 283 Eva Hajicová, Petr Sgall Corpus Annotation As a Test of a Linguistic Theory 285 Ondrej Bojar and Magdalena Prokopova Czech-English Word Alignment 286 Sergio Scalise, Antonietta Bisetto, Emiliano Guevara, Chiara Melloni MORBO/COMP: a multilingual database of compound words 288 Daniel Sonntag and Massimo Romanelli A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications 292 Nicole Grégoire A Standard Lexical Representation of Multiword Expressions 293 Anders Green, Helge Hüttenrauch, Elin Anna Topp, Kerstin Severinson Eklundh Developing a Contextualized Multimodal Corpus for Human-Robot Interaction 294 Wei-Yun Ma, Chu-Ren Huang Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus 295 Ana-Maria Barbu, Emil Ionescu, Verginica Barbu Mititelu Romanian Valence Dictionary in XML Format 296 Niels Ole Bernsen, Thomas K. Hansen, Svend Kiilerich and Torben Kruchov Madsen Field Evaluation of a Single-Word Pronunciation Training System 297 Wei Li, Wenjie Li, Qin Lu Mining the Implicit Entities in the Queries 298 Kiyotaka Uchimoto, Ryoji Hamabe, Takehiko Maruyama, Katsuya Takanashi, Tatsuya Kawahara, and Hitoshi Isahara Dependency-structure Annotation to Corpus of Spontaneous Japanese 299 N. Areta, A. Gurrutxaga, I. Leturia, Z. Polin, R. Saiz (Elhuyar) I. Alegria, X. Artola, A. Diaz de Ilarraza, N. Ezeiza, A. Sologaistoa, A. Soroa, A. Valverde (IXA) STRUCTURE, ANNOTATION AND TOOLS IN THE BASQUE ZT CORPUS 301 Joseph Polifroni and Marilyn Walker Learning Database Content for Spoken Dialogue System Design 304 Jorge Civera and Alfons Juan Bilingual Machine-Aided Indexing 305 Panunzi Alessandro, Fabbri Marco, Moneglia Massimo INTEGRATING METHODS AND LRs FOR AUTOMATIC KEYWORD EXTRACTION FROM OPEN DOMAIN TEXTS 306 Luís Fernando Costa Component evaluation in a question answering system 307 Stefan Breuer, Sven Bergmann, Ralf Dragon, Sebastian Möller Set-up of a Unit-Selection Synthesis with a Prominent Voice 308 Nasredine Semmar, Meriama Laib and Christian Fluhr Using a Deep Linguistic Analysis in Cross-language Information Retrieval 309 Susana Inácio and Diana Santos Annotating COMPARA, a grammar-aware parallel corpus 310 Lina Henriksen, Claus Povlsen, Andrejs Vasiljevs EuroTermBank - a terminology resource based on best practice 311 Florbela Barreto, António Branco, Eduardo Ferreira, Amália Mendes, Maria Fernanda Bacelar do Nascimento, Filipe Nunes, João Ricardo Silva A Large High Quality Linguistically Interpreted Open Corpus of Portuguese: the TagShare Corpus 312 Luís Sarmento, Belinda Maia, Diana Santos, Luís Cabral, Ana Sofia Pinto Corpógrafo V3 From simple word concordance to semi-automatic knowledge engineering 313 Liviu P. Dinu On the data base of Romanian syllables 314 Alessandro Bahgat, Fabio Massimo Zanzotto A Dependency-based Algorithm for Grammar Conversion 315 Khurshid Ahmad, Maria Teresa Musacchio and Giuseppe Palumbo Ontological Terminological Commitments and the Discourse of Specialist Communities 318 Alessandro Oltramari FrameNet as a conceptual window to WordNet 319 Chloe Clavel, Ioana Vasilescu, Laurence Devillers, Thibaut Ehrette, Gaël Richard The SAFE Corpus: fear-type emotions detection for surveillance applications 320 Arne Mauser, Evgeny Matusov, Hermann Ney Training a Statistical Machine Translation System without GIZA++ 323 Evie Coussé and Steven Gillis Regional Bias in the Broad Phonetic Transcription of the Spoken Dutch Corpus 324 Atsushi Fujii, Makoto Iwayama, Noriko Kando Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop 326 Željko Agic, Marko Tadic Evaluating Morphosyntactic Tagging of Croatian Texts 328 Martin Rajman, Marita Ailomaa, Agnes Lisowska, Mirek Melichar, Susan Armstrong Extending the Wizard of Oz Technique for Multimodal Language-enabled Systems 329 Gertjan van Noord, Ineke Schuurman, Vincent Vandeghinste Syntactic Annotation of Large Corpora in STEVIN 332 Davide Buscaldi, Paolo Rosso Mining Knowledge from Wikipedia for the Question Answering Task 333 Sabine Schulte im Walde Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet 336 Peter W. Wagacha, Guy De Pauw and Pauline W. Githinji Automatic Diacritic Placemenent using Memory-Based Learning: a Gikuyu Language Case Study 338 Juan José Rodríguez Soler, Pedro Concejero Cerezo, Daniel Tapias Merino, Alberto José Sánchez MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España 339 Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado, Manfred Pinkal The SALSA corpus: a German corpus resource for lexical semantics 340 Ralf Steinberger, Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaz Erjavec, Dan Tufis, Dániel Varga The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages 341 Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado SALTO -- A Versatile Multi-Level Annotation Tool 342 Veronique Hoste and Guy De Pauw KNACK-2002: a Richly Annotated Corpus of Dutch Written Text 343 Philipp Cimiano, Matthias Hartung and Esther Ratsch Finding the appropriate generalization level for binary ontological relations extracted from the Genia corpus 344 Tomoyuki Kato, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria 345 Margot Mieskes, Michael Strube Part-of-Speech Tagging of Transcribed Speech 346 Branimir Boguraev and Rie Kubota Ando Analysis of TimeBank as a resource for TimeML Parsing 347 Daisuke Kawahara and Sadao Kurohashi Case Frame Compilation from the Web using High-Performance Computing 348 Patrick Paroubek, Anne Vilnat, Isabelle Robba, Christelle Ayache, Louis-Gabriel Pouillot EASY: Evaluation Campaign of Syntactic Parsers of French 349 Toshiyuki Kanamaru, Masaki Murata, Hitoshi Isahara Creating a Japanese Adverb Dictionary That Includes Information About Speaker's Communicative Intention Using Machine Learning 350 Gregory Grefenstette, Fathi Debili, Christian Fluhr, Svitlana Zinger Exploiting text for extracting image processing resources 351 Naoaki Okazaki and Sophia Ananiadou Clustering acronyms in biomedical text for disambiguation 352 Goran Nenadic, Naoki Okazaki, Sophia Ananiadou Towards a terminological resource for biomedical text mining 356 Hans Hjelm Extraction of Cross Language Term Correspondences 357 David Guthrie, Ben Allison, Wei Liu, Louise Guthrie, Yorick Wilks A Closer Look at Skip-gram Modelling 359 Dobrov B., Loukachevitch N. Development of Linguistic Ontology on Natural Sciences and Technology 360 Matthew W. Bilotti and Eric Nyberg Evaluation for Scenario Question Answering Systems 362 Loukachevitch N., Dobrov B. Information Significance as a Factor of Inclusion of Multiword Expressions to Computational Resources 363 Dirk Bühler, Wolfgang Minker STOCHASTIC SPOKEN NATURAL LANGUAGE PARSING IN THE FRAMEWORK OF THE FRENCH MEDIA EVALUATION CAMPAIGN 364 Stephan Oepen and Jan Tore Lønning Discriminant-Based MRS Banking 365 György Szarvas, Richárd Farkas, László Felföldi, András Kocsor Highly accurate Named Entity corpus for Hungarian 366 Thierry Declerck and Mihaela Vela A Generic NLP Tool for Supporting Shallow Ontology Building 367 Julia Ritz, Ulrich Heid Extraction tools for collocations and their morphosyntactic specificities 368 Luke Nezda, Andrew Hickl, John Lehmann, and Sarmad Fayyaz What in the world is a Shahab? Wide Coverage Named Entity Recognition for Arabic 372 Massimo Poesio, Philippe Goux, and Mijail A. Kabadjov AN ANAPHORA RESOLUTION-BASED ANONYMIZATION MODULE 375 Manfred Sailer and Beata Trawinski The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research 376 Danie J. Prinsloo, Elsabé Taljard, Ulrich Heid Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho 378 Maria Clara Paixão de Sousa and Thorsten Trippel Building a historical corpus for Classical Portuguese: some technological aspects 379 Maria Teresa Pazienza, Marco Pennacchiotti, Fabio Massimo Zanzotto Mixing WordNet, Verbnet and PropBank for studying verb relations 381 Leo Wanner and Margarita Alonso Ramos Local Document Relevance Clustering in IR Using Collocation Information 384 Andrea Esuli and Fabrizio Sebastiani SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining 386 Alexandre Denis, Matthieu Quignard, Guillaume Pitel A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation 388 Felix Sasaki Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources 389 Margaret King and Nancy Underwood Evaluating Symbiotic Systems: the challenge 390 A. Chalamandaris, A. Protopapas, P. Tsiakoulis, S. Raptis All Greek to me! An automatic Greeklish to Greek transliteration system 392 Thurid Vogt, Elisabeth Andre Improving Automatic Emotion Recognition from Speech via Gender Differentiation 394 Khurshid Ahmad, Lee Gillam and David Cheng Sentiments on a Grid: Analysis of Streaming News and Views 395 Briony Williams and Ivan Uemlianin A procedure for facilitating the development of manual letter-to-sound rules for the Festival speech synthesis framework 397 Thierry Declerck, Ovidiu Vela, Asunción Gómez Pérez, Zeno Gantner,and David Manzano-Macho Multilingual Lexical Resources for Ontology translation 398 Beatrice Alex, Malvina Nissim and Claire Grover The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text 404 Ben Wellner and Marc Vilain Leveraging Machine Readable Dictionaries in Discriminative Sequence Models 405 Sasa Hasan, Anas El Isbihani, Hermann Ney Creating a Large-Scale Arabic to French Statistical Machine Translation System 407 Chen Yirong,Lu Qin,Li Wenjie,Sui Zhifang A Study on Terminology Extraction Based on Classified Corpora 408 Anna Estelles Palanca, María Amparo Alcina Caudet, Victoria Soler Puertes Retrieving terminological data from the TxtCeram tagged domain corpus: first step on a terminological ontology. 409 Violetta Cavalli-Sforza, Abdelhadi Soudi IMORPHE: An Inheritance and Equivalence Based Morphology Description Compiler 410 Laurianne Sitbon and Patrice Bellot Tools and methods for topic segmentation of texts and contextual evaluation 411 L. Devillers, R. Cowie, J-C. Martin, E. Douglas-Cowie Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches 412 Maja Popovic, Hermann Ney POS-based Word Reorderings for Statistical Machine Translation 413 David Vilar, Jia Xu, Hermann Ney, Luis Fernando D'Haro Error Analysis of Statistical Machine Translation Output 414 Castellón, Irene; Fernández-Montraveta, Ana; Vázquez, Gloria; Alonso, Laura; Capilla, Joan Antonio The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level 415 Javier Pérez, Antonio Bonafonte GAIA: A Common Framework for the Development of Speech Translation Techonologies 416 Attila Novák Morphological Tools for Six Small Uralic Languages 417 Javier Pérez, Antonio Bonafonte, Horst-Udo Hain, Eric Keller, Stefan Breuer and Jilei Tian ECESS Inter-Module Interface Specification for Speech Synthesis 418 Petr Nemec Tree Searching/Rewriting Interpreter Tool 420 Maite Taboada, Caroline Anthony and Kimberly Voll Methods for creating semantic orientation dictionaries 421 Mahmoud Shokrollahi-Far,Antal van den Bosch, Harry Bunt, Walter Daelemans, Ayaz Isazadeh Bootstrapping POS Tags on the Vocalism of Classical Arabic… 422 Hans C. Boas CoLangDoc: A web-based tool for collaborative research on endangered languages 423 Masaki Itagaki, Anthony Aue, and Takako Aikawa Detecting Inter-domain Semantic Shift using Syntactic Similarity 424 Pablo D. Agüero, Jordi Adell, Mónica Caballero, Antonio Bonafonte and Asunción Moreno Database pruning and unit recovering for unit selection speech synthesis 427 Hynek BORIL, Tomáš BORIL, Petr POLLÁK Methodology of Lombard Speech Database Acquisition: Experiences with CLSD 428 Harry Bunt Design Principles for Multidimensional Dialogue Act Annotation Schemes 429 Gabriel Murray and Ben Hachey Improved Automatic Evaluation of Summaries Using a Large Semantic Space 430 Olivier Baude, Michel Jacobson, Atanas Tchobanov & Richard Walter Interoperability of the sound corpora: the case of the French corpora 431 Pavel Pecina, Pavel Schlesinger Evaluation of Classification Methods for Collocation Extraction 433 R. Bernardi, D. Calvanese, L. Dini, V. Di Tommaso, E. Frasnelli, B. Plank Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano 435 Irene Langkilde-Geary and Justin Betteridge A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation 436 Jim Talley Bootstrapping new language ASR capabilities: Achieving best letter-to-sound performance under resource constraints 438 Eduard Hovy, Chin-Yew Lin, Liang Zhou, and Junichi Fukumoto Automated Summarization Evaluation with Basic Elements 439 Hiroyuki Kaji and Mariko Watanabe Automatic Construction of Japanese WordNet 440 Marie-Catherine de Marneffe, Bill MacCartney, Christopher D. Manning Generating Typed Dependency Parses from Phrase Structure Parses 441 Sónia Frota, Marina Vigário & Fernando Martins FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text 442 Ulrik Petersen Querying both parallel and treebank corpora: Evaluation of a corpus query system 443 Liang Zhou, Chin-Yew Lin, and Eduard Hovy Summarizing Answers for Complicated Questions 445 Monica Monachini, Nicoletta Calzolari, Khalid Choukri, Jochen Friedrich, Michele Mammini, Jan Odijk, Marisa Ulivieri Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian 446 Ronny Melz, Pum-Mo Ryu, Key-Sun Choi Compiling a large Korean IT-related language resource by language independent methods for domain-specific taxonomy learning 448 Felix Pîrvan, Dan Tufis Tagsets Mapping and Statistical Training Data Cleaning-up 451 Dan Tufis, Elena Irimia RoCo-News: A Hand Validated Journalistic Corpus of Romanian 452 Eckhard Bick Turning a Dependency Treebank into a PSG-style Constituent Treebank 453 Dan Stefanescu, Dan Tufis Aligning multilingual thesauri 454 Radu Ion, Alexandru Ceausu, Dan Tufis Dependency-Based Phrase Alignment 456 Alexandru Ceausu, Dan Stefanescu, Dan Tufis Acquis Communautaire sentence alignment using Support Vector Machines 457 Claire Grover and Richard Tobin Rule-Based Chunking and Reusability 459 Baden Hughes, Timothy Baldwin, Steven Bird, Jeremy Nicholson and Andrew MacKinlay Reconsidering Language Identification for Written Language Resources 460 Sun-Mee Bae and Key-Sun Choi A Comparison of Conceptual Maps Between Korean and Chinese Words in CoreNet 461 Yasuko SENDA, Yasusi SINOHARA, Manabu OKUMURA Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing 463 Lundälv, Mats, Mühlenbock, Katarina, Farre, Bengt and Brännström, Annika SYMBERED - a symbol-concept editing tool 464 Yi Zhang, Valia Kordoni Automated Deep Lexical Acquisition for Robust Open Texts Processing 465 Martin, J.-C., Caridakis, G., Devillers, L., Karpouzis, K., Abrilian, S. Manual Annotation and Image Processing of Multimodal Emotional Behaviours in TV Interviews 467 Cvetana Krstev, Ranka Stankovic, Duško Vitas, and Ivan Obradovic WS4LR: A Workstation for Lexical Resources 468 Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer Extending VerbNet with novel verb classes 469 James Pustejovsky, Catherine Havasi, Roser Sauri, Patrick Hanks, Jessica Littman, Anna Rumshisky, Jose Castano, Marc Verhagen Towards a Generative Lexical Resource: The Brandeis Semantic Ontology 471 Hans Dybkjær and Laila Dybkjær Act-Topic Patterns for Automatically Checking Dialogue Models 472 Rojas, David M. and Takako Aikawa Predicting MT quality as a function of the source language 473 Pawel Mazur and Robert Dale Named Entity Extraction with Conjunction Disambiguation 477 Beatrice Alex Integrating Language Knowledge Resources to Extend the English Inclusion Classifier to a New Language 478 Griet Depoorter, Remco van Veenendaal Functioning of the Centre for Dutch Language and Speech Technology (TST Centre) 480 Bente Maegaard, Lene Offersgaard, Lina Henriksen, Costanza Navarretta, Claus Povlsen, Hanne Jansen, Xavier Lepetit The MULINCO corpus and corpus platform 481 Claudia Soria, Maurizio Tesconi, Francesca Bertagna, Nicoletta Calzolari, Andrea Marchetti, Monica Monachini Moving to dynamic computational lexicons with LeXFlow 482 Caroline Sporleder, Marieke van Erp, Tijn Porcelijn, Antal van den Bosch and Pim Arntzen Identifying Named Entities in Text Databases from the Natural History Domain 483 Harold Somers, Gareth Evans and Zeinab Mohamed Developing Speech Synthesis for Under-Resourced Languages by ‘Faking it' : An Experiment with Somali 484 Dragos CIOBANU, Anthony HARTLEY, Serge SHAROFF Using richly annotated trilingual language resources for acquiring reading skills in a foreign language 485 Guido Boella, Alessandro Mazzei, Piercarlo Rossi, Marco Martin A Development Tool For Multilingual Ontology-based Conceptual Dictionaries 487 Jens-Erik Fenstad, Lars Ahrenberg, Knut Kvale, Bente Maegaard, Katarina Mühlenbock, Bernt-Erik Heid KUNSTI - Knowledge Generation for Norwegian Language Technology 488 Péter Halácsy, Dániel Varga, Csaba, Oravecz, András Kornai, Viktor Trón Using a morphological analyzer in high precision POS tagging of Hungarian 489 Toumouh Adil, Widdows Dominic, Lehireche Ahmed, Malki Mimoun Adapting WordNet to the Medical Domain using Lexicosyntactic Patterns in the Ohsumed Corpus 492 Renata Savy, Francesco Cutugno, Claudia Crocco Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb). 493 Baden Hughes, Dafydd Gibbon and Thorsten Trippel Feature-Based Encoding and Querying Language Resources with Character Semantics 495 Rajen Subba, Barbara Di Eugenio, Elena Terenzi Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations 496 Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida, and Hitoshi Isahara Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence 497 Isa Maks and Michel Boekestein Exploring opportunities for enrichment by linking lexical databases 498 Horacio Saggion Multilingual Multidocument Summarization Tools and Evaluation 500 Olivier Ferret Building a network of topical relations from a corpus 501 P. Bouquet, L. Serafini, S. Zanobini The role of lexical resources in matching classification schemas 503 Manolis Maragoudakis, Katia Kermanidis, Aristogiannis Garbis and Nikos Fakotakis Dealing with imbalanced data using Bayesian techniques 504 J.M.Benedí, E.Lleida, A.Varona, M.J.Castro, I.Galiano, R.Justo, I.López and A.Miguel Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA 505 Rainer Osswald, Hermann Helbig, Sven Hartrumpf The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon 506 Jean Véronis, Yun-Chuang Chiao The ARCADE II project : Evaluation of multilingual text alignment systems 508 Francesca Bertagna Representation and Inference for Open-Domain QA: strength and limits of two Italian Semantic Lexicons 509 Abdelrahim Abdelsapor, Noha Adly, Kareem Darwish, Ossama Emam, Magdi Nagi Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents 510 Saturnino Luz, Matt-Mouley Bouamrane, Masood Masoodian Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction 511 A. L. Dimou, A. Chalamandaris Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations 513 Roger Levy and Galen Andrew Tregex and Tsurgeon: tools for querying and manipulating tree data structures 515 L. Gillard, P. Bellot, M. El-Bèze EVALUATIONS OF QUESTION ANSWERING AND EVALUATIONS OF THE EVALUATION 518 B. Magnini, E. Pianta, C. Girardi, M. Negri, L. Romano, M. Speranza, and R. Sprugnoli I-CAB: the Italian Content Annotation Bank 521 Bente Maegaard, Steven Krauwer, Khalid Choukri, Lise Damsgaard Jørgensen The BLARK concept and BLARK for Arabic 524 Gabriella Pardelli, Manuela Sassi, Sara Goggi, Paola Orsolini Natural Language Processing: A Terminological and Statistical Approach 525 Suzan Verberne, Lou Boves, Nelleke Oostdijk Data for question answering: the case of 'why' 527 Kiril Simov and Petya Osenova Shallow Semantic Annotation of Bulgarian 530 Christopher Cieri, Walt Andrews, Joseph P. Campbell, George Doddington, Jack Godfrey, Shudong Huang, Mark Liberman, Alvin Martin, Hirotaka Nakasone, Mark Przybocki, Kevin Walker The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research 531 Laurence Devillers, Jean-Claude Martin, Sarkis Abrilian Annotation of Emotions in Real-Life Video Interviews: Variability between Coders 532 Ya-Min Chou, Chu-Ren Huang Hantology-A Linguistic Resource for Chinese Language Processing and Studying 534 Voula Giouli, Penny Labropoulou, Maria Gavrilidou, Stelios Piperidis, Monica Monachini, Claudia Soria, Nicoletta Calzolari, Khalid Choukri Language resources production models: the case of INTERA multilingual corpus and terminology 535 Kyoko Kanzaki,Qing Ma,Eiko Yamamoto and Hitoshi Isahara Semantic Analysis of Abstract Nouns to compile a thesaurus of Adjectives 537 Katrin Erk and Sebastian Pado FRED and ROSY -- A Flexible Toolbox For Semantic Role Assignment 538 Valentin Tablan, Tamara Polajnar, Hamish Cunningham, Kalina Bontcheva User-friendly ontology authoring using a controlled language 539 Laura Hasler and Constantin Orasan Towards cross-document event annotation 540 Amália Mendes, Maria Fernanda Bacelar do Nascimento, Sandra Antunes, Luísa Pereira COMBINA-PT: a large corpus-extracted and hand-checked lexical database of Portuguese multiword expressions 541 David Graff, Tim Buckwalter, Mohamed Maamouri, Hubert Jin Lexicon Development for Varieties of Spoken Dialectal Arabic 542 Alexandre Patry, Fabrizio Gotti, Philippe Langlais MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation 543 Mohamed Maamouri, Ann Bies, Tim Buckwalter, Mona Diab, Nizar Habash, Owen Rambow, Dalila Tabessi Developing and Using a Pilot Dialectal Arabic Treebank 544 Beata Bandmann Megyesi, Eva Csato Johansson, Anna Sågvall Hein Building a Swedish-Turkish Parallel Corpus 545 Horacio Saggion, Robert Gaizauskas Language Resources for Background Gathering 547 Jennifer Doyon, Keith Miller, John Henderson, Linda Van Guilder, Steven Van Guilder, Kristian Concepcion, Florence Reeder Document Exploitation Improvement by Component Evaluation 550 Julie Medero, Kazuaki Maeda, Stephanie Strassel, Christopher Walker An Efficient Approach to Gold Standard Annotation: Decision Points for Complex Tasks 552 Stefan Klatt A Corpus-Based Approach to the Interpretation of Unknown Words with an Application to German 553 Sophie Rosset, Sandra Petel The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus 554 Anna Feldman, Jirka Hana, Chris Brew A cross-language approach to rapid creation of new morpho-syntactically annotated resources 557 Konstantinos I. Diamantaras, Ionas Michailidis, Spyros Vasileiadis, Yannick Frère Greek Named Entities Recognition using Support Vector Machines, Maximum Entropy Models and Onetime 558 Anna Korhonen, Yuval Krymolowski, Ted Briscoe A Large Subcategorization Lexicon for Natural Language Processing Applications 560 Nancy Ide, Keith Suderman An Open Linguistic Infrastructure for American English 562 Nancy Ide, Laurent Romary Representing Linguistic Corpora and Their Annotations 565 Zhongqiang Huang, Lei Chen, Mary Harper An Open Source Prosodic Feature Extraction Tool 566 Alina Andreevskaia and Sabine Bergler SEMANTIC Tag Extraction from WordNet Glosses 567 Kow Kuroda and Hitoshi Isahara Getting Deeper Semantics than Berkeley FrameNet with MSFA 569 Enrique Alfonseca, Antonio Moreno-Sandoval, Jose Maria Guirao and Maria Ruiz-Casado The wraetlic NLP suite 570 Tomoko Ohta, Yuka Tateisi, Jin-Dong Kim, Akane Yakushiji, Jun-ichi Tsujii Linguistic and Biological Annotations of Biological Interaction Events 573 Knut Hofland, Paul Meurer, Kari Tenfjord The ASK corpus - a language learner corpus of Norwegian as a second language 575 Ivana Kruijff-Korbayova, Klara Chvatalova, Oana Postolache Annotation Guidelines for Czech-English Word Alignment 576 Jérémie Segouat, Annelies Braffort and Emilie Martin Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data 577 Gil Francopoulo, Monte George, Nicoletta Calzolari, Monica Monachini, Nuria Bel, Mandy Pet, Claudia Soria Lexical Markup Framework (LMF) 578 Bruno Pouliquen, Marco Kimler, Ralf Steinberger, Camelia Ignat, Tamara Oellinger, Ken Blackler, Flavio Fluart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best Geocoding multilingual texts: recognition, disambiguation and visualisation 580 Bolette Sandford Pedersen Search on Danish compounds using shallow linguistic techniques 581 Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich, and Hitoshi Isahara Word Knowledge Acquisition for Computational Lexicon Construction 582 Wim Peters, Maria Teresa Sagri, Daniela Tiscornia, Sara Castagnoli The LOIS Project 583 Joris Vaneyghen, Dirk Van Compernolle A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora 584 Adriana Roventini - Nilda Ruimy Linking Verbal Entries of Different Lexical Resources Linking Verbal Entries of Different Lexical Resources 586 K Choukri, M Dabbadie, O Hamon, A Hartley, W Mustafa El Hadi, A Popescu-Belis, M Rajman, S Surcin, I Timimi CESTA Machine Translation Evaluation Campaign: towards a reliable, reusable protocol 589 Marie-Jeanne Derouin, André Le Meur Lemma-oriented dictionaries, concept-oriented terminology and translation memories 590 Martí Umbert, Asunción Moreno, Pablo Aguero, Antonio Bonafonte Spanish Synthesis Corpora 591 Pavel Ircing, Jan Hoidekr, Josef Psutka Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech 593 Milena Slavcheva Semantic Descriptors: The Case of Reflexive Verbs 595 Guenter Neumann and Berthold Crysmann Exploring HPSG-based Treebanks for Probabilistic Parsing 596 Rita Marinelli, Remo Bindi Proper Names and Linguistic Dynamics 597 Sonja Bosch, Jackie Jones, Laurette Pretorius Towards machine-readable lexicons for South African Bantu languages 599 Le Chenadec G., Maffiolo V., Chateau N. and Colletta J.M. Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction 600 Yoshihiko Hayashi, Toru Ishida A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons 602 Rita Marinelli, Adriana Roventini Using Core Ontology for Domain Lexicon Structuring 603 W. Abramowicz, A. Filipowska, K. Wecel, K. Wieloch, J. Piskorski Linguistic Suite for Polish Cadastral System 604 Hiromichi Kawanami, Takahiro Kitamura and Kiyohiro Shikano Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech 606 Pavel Smrz, Vit Novacek Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator 608 Bernardi R., Bolognesi A., Seidenari C., Tamburini F. POS tagset design for Italian 610 Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita and Yuji Matsumoto Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences 611 Onelli C., Proietti D., Seidenari C., Tamburini F. The DiaCORIS project: a diachronic corpus of written Italian 612 Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Pociello A preliminary study for building the Basque PropBank 613 Eiko Yamamoto, Kyoko Kanzaki, and Hitoshi Isahara Detection of inconsistencies in concept classifications in a large dictionary -- Toward an improvement of the EDR electronic dictionary -- 614 Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Izagirre, Karmele Mendizabal, Eli Pociello and Mikel Quintian A methodology for the joint development of the Basque WordNet and Semcor 615 Jan Hoidekr, J.V. Psutka, Aleš Pražák, Josef Psutka Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries 616 Javier Cuétara-Priede, Luis Villaseñor-Pineda, Manuel Montes-y-Gómez, Manuel Alberto Pérez-Coutiño TranscríbEMex: An automatic phonetic transcriber for the Mexican Spanish 617 Stefanie Anstein, Gerhard Kremer, Uwe Reyle Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology 618 Mathieu Lafourcade Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence 619 Mariona Taulé, Roser Morante, Juan Aparicio, M Antònia Martí 3LB-LEX and 3LB SEM lexical resources for Spanish and Catalan 620 Ernesto William De Luca and Andreas Nürnberger Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods. 621 Pavel Smrz, Vit Novacek Automatic Acquisition of Semantics-Extraction Patterns from Parallel Texts 622 Lea Cyrus Building a resource for studying translation shifts 623 Christoph Draxler Speech Recordings in Public Schools in Germany -- the Perfect Show Case for Web-based Recordings and Annotation 624 B K Tsou, B Y Lai, K K Sin, L Y L Cheung Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort 625 Lars Hellan & Dorothee Beermann Word Sense Disambiguation and Semantic Construction Analysis for Deep Processing Grammars 626 Dennis Reidsma, Dirk Heylen, Roeland Ordelman Annotating emotion in meetings 627 H. Bonneau-Maynard, C. Ayache, F. Bechet, A. Denis, , A. Kuhn, F. Lefevre, D. Mostefa, M. Quignard, S. Rosset, C. Servan, J. Villaneau Results of the French Evalda-Media evaluation campaign for literal understanding 629 Jonas Kuhn, Michael Jellinghaus Multilingual parallel treebanking: a lean and flexible approach 630 Julie Mauclair, Yannick Estève, Simon Petit-Renaud, Paul Deléglise Automatic detection of well recognized words in automatic speech transcription 631 Francesca Bertagna, Federico Calzolari, Nicoletta Calzolari, Sebastiana Cucurullo, Alessandro Enea, Monica Monachini, Eugenio Picchi, Manuela Sassi, Eva Sassolini, Claudia Soria Next Generation Language Resources using GRID 632 Sanni Nimb LEXADV - a multilingual semantic lexicon for adverbs 633 Voula Giouli, Alexis Konstadinidis, Elina Desipri, Harris Papageorgiou, Stelios Piperidis Multi-domain Multi-lingual Named Entity Recognition: Re-visiting & grounding the resources issue 634 Rebecca Passonneau and Nizar Habash and Owen Rambow Inter-annotator agreement on a multilingual semantic annotation task 636 Rebecca Passonneau Measuring Agreement on Set-Valued Items (MASI) for Semantic and Pragmatic Annotation 637 Tomoyosi Akiba Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue 638 Marc Verhagen, Robert Knippen, Inderjeet Mani, and James Pustejovsky Annotation of TimeML Relations with Tango 639 Patrizia Paggio Information structure and pauses in a corpus of spoken Danish 641 Quasthoff, Uwe; Biemann, Chris; Richter, Matthias Corpus Portal for Search in 16 Monolingual Corpora 642 Tristan Vanrullen, Philippe Blache & Jean-Marie Balfourier Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy 643 Andreas Eisele Parallel corpora and phrase-based statistical MT for new language pairs via multiple intermediaries 644 Jae-Ho Kim, Sun-Mee Bae and Key-Sun Choi Automatic Generation of Medical Term Definition from Large Corpora 645 Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J. Miller, Teruko Mitamura, Florence Reeder, Advaith Siddharthan Parallel Syntactic Annotation of Multiple Languages 646 Sylvain Galliano, Edouard Geoffrois, Jean-François Bonastre, Guillaume Gravier, Djamel Mostefa, Khalid Choukri. Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News 647 Juan José Rodríguez Soler, Lucía Palacios Fernández, Pedro Concejero Cerezo, Carlos Lázaro Ávila, Daniel Tapias Merino Usability evaluation of 3G multimodal services in Telefónica Móviles España 649 Márton Miháltz, Gábor Pohl Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation 650 Porfírio Filipe and Nuno Mamede A FRAMEWORK TO INTEGRATE UBIQUITOUS KNOWLEDGE MODELING 651 Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli Searching Treebanks for Functional Constraints: Crosslingual Experiments in Grammatical Relations Assignment 652 Declerck Thierry SynAF: Towards a standard for Syntactic Annotation 653 Christelle Ayache, Brigitte Grau, Anne Vilnat Evaluation of question-answering systems : The French EQueR-EVALDA Evaluation Campaign 654 Maria Fernanda Bacelar do Nascimento, José Bettencourt Gonçalves, Luísa Pereira, Antónia Estrela, Afonso Pereira The African Varieties of Portuguese: Compiling Comparable Corpora and analysing data-derived lexicons 655 Richard Evans and Viktor Pekar BiRD: A prototype system to build and exploit resource databases for researchers 656 Benjamin K. Tsou and Oi Yee Kwong Toward a Pan-Chinese Thesaurus 657 Nelleke Oostdijk, Lou Boves User requirements analysis for the design of a reference corpus of written Dutch 658 Brigitte Grau, Anne-Laure Ligozat, Isabelle Robba, Anne Vilnat FRASQUES : a Question Answering system in the EQueR evaluation campaign 661 Gábor Hodász Evaluation Methods of a Linguistically Enriched Translation Memory System 664 José João Almeida, Alberto Simões T2O - Recycling Thesauri into a Multilingual Ontology 666 Saba Amsalu Data-driven Amharic-English Bilingual Lexicon Acquisition 667 Jörg Tiedemann ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts 669 Wolfgang Minker, Heiko Neumann, Günther Palm, Petra-Maria Strauß, Harald Traue, Ulrich Weidenbacher WIZARD-OF-OZ DATA COLLECTION FOR PERCEPTION AND INTERACTION IN MULTI-USER ENVIRONMENTS 670 Nancy Underwood, Agnes Lisowska The evolution of an evaluation framework for a text mining system 672 Eline Westerhout, Paola Monachesi A pilot study for a Corpus of Dutch Aphasic Speech (CODAS) 673 Jan Bungeroth, Daniel Stein, Philippe Dreuw, Morteza Zahedi and Hermann Ney A German Sign Language Corpus of the Domain Weather Report 674 Roberto Bartolini, Caterina Caracciolo, Emiliano Giovannetti, Alessandro Lenci, Simone Marchi, Vito Pirrelli, Chiara Renso, Laura Spinsanti Creation and Use of Lexicons and Ontologies for Natural Language Interface to Databases 676 MULLONI, Andrea; PEKAR, Viktor AUTOMATIC DETECTION OF ORTHOGRAPHIC CUES FOR COGNATES RECOGNITION 677 Timothy Baldwin and Su'ad Awab Open Source Corpus Analysis Tools for Malay 678 Fidelia Ibekwe-SanJuan A task-oriented framework for evaluating theme detection systems. 679 Asunción Moreno, Albert Febrer, Lluis Márquez Generation of Language Resources for the Development of Speech Technologies in Catalan 680 Georgiana Puscasu and Ruslan Mitkov If it were then, then when was it? Establishing the anaphoric role of “then” 683 Viktor Trón, Péter Halácsy, Péter Rebrus, András Rung, Péter Vajda, Eszter Simon Morphdb.hu: Hungarian lexical database and morphological grammar 685 Hong Phuong Le, Thi Minh Huyen Nguyen, Laurent Romary, Azim Roussanaly A Lexicalized Tree Adjoining Grammar for Vietnamese 687 Silvie Cinkova, Pavel Pecina, Petr Podvesky, Pavel Schlesinger Semi-automatic Building of Swedish Collocation Lexicon 688 Dominika Oliver and Krzysztof Szklanny Creation and analysis of a Polish speech database for use in unit selection speech synthesis 689 Jorge Kinoshita Laís do Nascimento Salvador Carlos Eduardo Dantas de Menezes CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus 691 Stefan Schaden Evaluation of automatically generated transcriptions of non-native pronunciations using a phonetic distance measure 692 Isabella Chiari Slips and errors in spoken data transcription 697 Motoko Ueyama Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval 700 José Iria, Christopher Brewster, Fabio Ciravegna and Yorick Wilks AN INCREMENTAL TRI-PARTITE APPROACH TO ONTOLOGY LEARNING 701 Thomas Pellegrini and Lori Lamel Experimental detection of vowel prononciation variants in Amharic 702 R. Catizone, A. Dalli and Y. WIlks A Methodology for Evaluating Timelines 704 Ivana Kruijff-Korbayova, Tilman Becker, Nate Blaylock, Ciprian Gerstenberger, Michael Kaisser, Peter Poller, Verena Rieser, Jan Schehl The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player 705 Rebecca Passonneau, Roberta Blitz, David Elson, Angela Giral, and Judith Klavans CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project 706 Anna Rumshisky, James Pustejovsky Inducing sense-discriminating context patterns from sense-tagged corpora 707 Milen Kouylekov and Bernardo Magnini Building a Large-Scale Repository of Textual Entailment Rules 708 Alessandro Moschitti and Roberto Basili A Tree Kernel approach to Question and Answer Classification in Question Answering Systems 709 C. d' Alessandro, P. Boula de Mareüil, G. Bailly, M.-N. Garcia, M. Morel A joint intelligibility evaluation of French text to speech systems: the EVASY/SUS campaign 710 Winston Anderson and Petro Kotzé Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho 711 Couturier, Jean-François; Drouin, Patrick; Neuvel, Sylvain Applying lexical constraints on morpho-syntactic patterns for the identification of conceptual relational content in specialized texts. 712 Katerina Pastra Beyond Multimedia Integration Mechanisms: in need of extending multimedia annotation schemes 715 Yoshihiko Nitta, Masashi Saraki, Satoru Ikehara Building Carefully Tagged Bi-lingual Corpora to Cope with Linguistic Idiosyncrasy 716 Roser Saurí, Marc Verhagen, James Pustejovsky SlinkET: A Partial Modal Parser for Events 717 Christopher Cieri, Mark Liberman More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities 719 A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. van den Heuvel, H.U. Hain, X. S. Wang, M. N. Garcia TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis 720 Miriam Voghera, Francesco Cutugno An Observatory on Spoken Italian Linguistic Resources and Descriptive Standards 721 Kamel Smaïli, Caroline Lavecchia and J.P. Haton Linguistic Features Modeling Based on Partial New Cache Model 723 Pum-Mo Ryu, Key-Sun Choi, and Ronny Melz Taxonomy Learning using Term Specificity and Similarity 724 Stefan Schulz, Kornél Markó, Percy Nohama, Roosewelt Leite de Andrade, Edson José Pacheco, Udo Hahn, Martin Romacker The MORPHOSAURUS Medical Subword Lexicon 725 Marko Salmenkivi Finding representative sets of dialect words for geographical regions 726 Olga Uryupina Coreference resolution with and without linguistic knowledge 727 Keith Miller and Michelle Vanni Modern Standard Arabic v. Dialect: Comparing Arabic-English MT Performance 729 O Hamon, M Rajman X-Score : Automatic Evaluation of Machine Translation Grammaticality 730 Andrew Roberts, Latifa Al-Sulaiti and Eric Atwell A comparison of concordancers for Arabic: WordSmith, Mono-Conc, Xaira and aConCorde 733 Roberto Navigli Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory 735 Dafydd Gibbon, Flaviane Romani Fernandes, Thorsten Trippel A BLARK extension for temporal annotation mining 736 Michael Schiehlen and Kristina Spranger The Mass-Count Distinction: Acquisition and Disambiguation 737 Lars Nygaard & Stephan Oepen Identifying Transparent Compounds in a Large Computational Lexicon 738 Andrew W. Cole Corpus Development and Publication 739 Shu-Chuan Tseng, Dafydd Gibbon Discourse functions of duration in Mandarin 740 Leonardo Lesmo Livio Robaldo From Natural Language to Databases via Ontologies 742 Adeline Nazarenko, Erick Alphonse, Julien Derivière, Thierry Hamon, Guillaume Vauvert, Davy Weissenbacher The ALVIS Format for Linguistically Annotated Documents 744 Alexander Raake and Brian FG Katz SUS-based Method for Speech Reception Threshold Measurement in French 745 Stephanie Strassel, Christopher Cieri, Andrew Cole, Denise DiPersio, Mark Liberman, Xiaoyi Ma, Mohamed Maamouri Integrated Linguistic Resources for Language Exploitation Technologies 746 Xiaoyi Ma Champollion: A Robust Parallel Text Sentence Aligner 748 Fabian Behrens & Jan-Torsten Milde The Eclipse Annotator: an extensible system for multimodal corpus creation 749 Harris Papageorgiou, Elina Desipri, Maria Koutsombogera, Kanella Pouli, Andromaxi Griva, Prokopis Prokopidis Adding multi-layer semantics to the Greek Dependency Treebank 750 Cristina BOSCO, Vincenzo LOMBARDO Comparing linguistic information in treebank annotations 752 Nella Cucurullo, Simonetta Montemagni, Matilde Paoli, Eugenio Picchi, Eva Sassolini Dialectal resources on-line: the ALT-Web experience 754 Xiaoyi Ma, Chris Cieri Corpus Support for Machine Translation at LDC 755 Ann Bies, Stephanie Strassel, Haejoong Lee, Kazuaki Maeda, Seth Kulick, Yang Liu, Mary Harper, Matthew Lease Linguistic Resources for Speech Parsing 756 Tomasz Obrebski, Michal Stolarski UAM Text Tools - a flexible NLP architecture 757 Markus Geilfuss & Jan-Torsten Milde SAM - an annotation editor for parallel texts 758 Hans Uszkoreit, Feiyu Xu, Joerg Steffen, Ilhan Aslan The pragmatic combination of different crosslingual resources for multilingual information services 759 Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh and Paraic Sheridan Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval 760 Grazyna Vetulani, Zygmunt Vetulani, Tomasz Obrebski Syntactic Lexicon of Polish Predicative Nouns 762 Alonge Antonietta The Italian Metaphor Database 763 Georgiana Puscasu How to identify temporal clauses? 765 Jose Iria, Fabio Ciravegna A Methodology and Tool for Representing Language Resources for Information Extraction 767 Harry Halpin Automatic Evaluation and Compositon of NLP Pipelines with Web Services 768 C. de Loupy The OuRAL Project 769 Harry Bunt and Amanda Schiffrin Methodological aspects of semantic annotation 770 Whitney Gegg-Harrison, Donna Byron PYCOT: An OT-Based Pronoun Resolution Tool 772 Emma Barker, Ryuichiro Higashinaka, François Mairesse, Robert Gaizauskas, Marilyn Walker, Jonathan Foster Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives 774 Jeongwoo Ko, Laurie Hiyakumoto, Eric Nyberg Exploiting Multiple Semantic Resources for Answer Selection 776 Kazuaki Maeda, Christopher Cieri and Kevin Walker Low-cost Customized Speech Corpus Creation for Speech Technology Applications 777 John Niekrasz and Alexander Gruenstein NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Discourse 778 Christoph Benzmueller, Helmut Horacek, Henri Lesourd, Ivana Kruijff-Korbayova, Marvin Schiller, Magdalena Wolska A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material. 779 J.Laoudi, C. Tate, C. Voss Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding 780 Kazuaki Maeda, Haejoong Lee, Julie Medero, Stephanie Strassel A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit 782 Hideki Shima, Mengqiu Wang, Frank Lin, Teruko Mitamura Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering 783 Jeongwoo Ko, Fumihiko Murase, Teruko Mitamura, Eric Nyberg, Masahiko Tateishi, and Ichiro Akahori Analyzing the Effects of Spoken Dialog Systems on Driving Behavior 787 Magesh Balasubramanya, Peter Lucas, Michael Higgins and Dominic Widdows Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources 788 Vasile Rus, Art Graesser The Look and Feel of a Textual Entailer 789 R.Gaizauskas, M. Hepple and H.Harkema Indirect Evaluation of Parsers Using Dependency Relations to Assess Semantic Output 790 Gregory Marton and Boris Katz Automatic TREC Relationship Answering and Nugget Evaluation 792 Finley Lacatusu, Andrew Hickl, Sanda Harabagiu Impact of Question Decomposition on the Quality of Answer Summaries 793 Sanda M. Harabagiu, Cristina Nicolae, Gabriel Nicolae Coreference by Consolidation 794 Sanda Harabagiu and Cosmin Adrian Bejan An Answer Bank for Temporal Inference 795 Paul C Morarescu Principles for annotating and reasoning with spatial information 796 Li Sujian, Lu Qin, Li Wenjie, Xu Ruifeng Interact between Lexical Base and Ontology with Formal Concept Analysis 798 Rachada Kongkachandra and Kosin Chamnongthai SEMANTIC-BASED KEYWORD RECOVERY FUNCTION FOR KEYWORD EXTRACTION SYSTEM 799 Ruifeng Xu, Qin Lu and Sujian Li The Design and Construction of Chinese Collocation Bank 800 Nilda Ruimy, Adriana Roventini Merging two Ontology-based Lexical Resources 801 Nimaan Abdillahi, Nocera Pascal Towards automatic transcription of Somali language 802 Susanne Burger, Zachary A. Sloane, Jie Yang Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages 803 Kornel Laskowski and Susanne Burger Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus 805 W. Black, S. Elkateb, H. Rodriguez, M. Alkhalifa, P. Vossen, A. Pease, C. Fellbaum Introducing the Arabic WordNet Project 806 Benoît Sagot and Pierre Boullier Deep non-probabilistic parsing of large corpora 807 Magnar Brekke, Kai Innselset, Marita Kristiansen & Kari Øvsthus KB-N: Automatic Term Extraction from Knowledge Bank of Economics 808 Alexander Klassmann, Freddy Offenga, Daan Broeder, Romuald Skiba, Peter Wittenburg Comparison of Resource Discovery Methods 809 Peter Lucas, Magesh Balasubramanya, and Dominic Widdows The Information Commons Gazetteer: A Public Resource of Populated Places and Worldwide Administrative Divisions 810 Benoît Sagot, Lionel Clément, Eric Villemonte de La Clergerie and Pierre Boullier The Lefff 2 syntactic lexicon for French: architecture, acquisition, use 811 Nilda Ruimy Structuring a Domain Vocabulary in a General Knowledge Environment 812 Alexander Geyken, Norbert Schrader LexikoNet - a lexical database based on type and role hierarchies 813 Djamel Mostefa ; Olivier Hamon, Khalid CHOUKRI et. al. TC-STAR: Evaluation of speech to speech components 814 Djamel Mostefa ; Marie-Neige Garcia, Olivier Hamon, Khalid CHOUKRI et. al. CHIL: Evaluation of multimodal technology components 815 Carol Peters The Impact of Evaluation on Multilingual Information Retrieval System Development 816 Carol Peters , Bernardo Magnini, Christelle Ayache, et. al. CLEF Q-A special track 817 Marie-Neige Garcia , Christophe d'Alessandro , Philippe Boula de Mareüil A joint prosody evaluation of French text-to-speech systems: the EVASY Prosody campaign