9th LREC 2014: Reykjavik, Iceland
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asunción Moreno, Jan Odijk, Stelios Piperidis:
Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, Reykjavik, Iceland, May 26-31, 2014. European Language Resources Association (ELRA) 2014
- Steve Cassidy, Dominique Estival, Timothy Jones, Denis Burnham, Jared Burghold:
The Alveo Virtual Laboratory: A Web Based Repository API. 1-7
- Xabier Artola, Zuhaitz Beloki, Aitor Soroa:
A stream computing approach towards scalable NLP. 8-13
- Hao Wu, Zhiye Fei, Aaron Dai, Mark Sammons, Dan Roth, Stephen Mayhew:
ILLINOISCLOUDNLP: Text Analytics Services in the Cloud. 14-21
- Nancy Ide, James Pustejovsky, Christopher Cieri, Eric Nyberg, Di Wang, Keith Suderman, Marc Verhagen, Jonathan Wright:
The Language Application Grid. 22-30
- George Christodoulides:
Praaline: Integrating Tools for Speech Corpus Research. 31-34
- Anabela Barreiro, Johanna Monti, Brigitte Orliac, Susanne Preuß, Kutz Arrieta, Wang Ling, Fernando Batista, Isabel Trancoso:
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate. 35-40
- Sandipan Dandapat, Declan Groves:
MTWatch: A Tool for the Analysis of Noisy Parallel Data. 41-45
- Thierry Etchegoyhen, Lindsay Bywood, Mark Fishel, Panayota Georgakopoulou, Jie Jiang, Gerard van Loenhout, Arantza del Pozo, Mirjam Sepesy Maucec, Anja Turner, Martin Volk:
Machine Translation for Subtitling: A Large-Scale Evaluation. 46-53
- Joaquim Moré, Salvador Climent:
Machine Translationness: Machine-likeness in Machine Translation Evaluation. 54-61
- Joke Daems, Lieve Macken, Sonia Vandepitte:
On the origin of errors: A fine-grained analysis of MT and PE errors and their relationship. 62-66
- Fei Cheng, Kevin Duh, Yuji Matsumoto:
Parsing Chinese Synthetic Words with a Character-based Dependency Model. 67-72
- Tomás Jelínek:
Improvements to Dependency Parsing Using Automatic Simplification of Data. 73-77
- Jasmijn Bastings, Khalil Sima'an:
All Fragments Count in Parser Evaluation. 78-82
- Maria Simi, Cristina Bosco, Simonetta Montemagni:
Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies. 83-90
- Anton Karl Ingason, Hrafn Loftsson, Eiríkur Rögnvaldsson, Einar Freyr Sigurðsson, Joel Wallenberg:
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian. 91-95
- Clarissa Castellã Xavier, Vera Lúcia Strube de Lima:
Boosting Open Information Extraction with Noun-Based Relations. 96-100
- Travis R. Goodwin, Sanda M. Harabagiu:
Clinical Data-Driven Probabilistic Graph Processing. 101-108
- Gongye Jin, Daisuke Kawahara, Sadao Kurohashi:
A Framework for Compiling High Quality Knowledge Resources From Raw Corpora. 109-114
- Clément de Groc, Xavier Tannier, Claude de Loupy:
Thematic Cohesion: measuring terms discriminatory power toward themes. 115-119
- Simon Scerri, Behrang Q. Zadeh, Maciej Dabrowski, Ismael Rivera:
Extracting Information for Context-aware Meeting Preparation. 120-124
- AiTi Aw, Sharifah Aljunied Mahani, Nattadaporn Lertcheva, Sasiwimon Kalunsima:
TaLAPi ― A Thai Linguistically Annotated Corpus for Language Processing. 125-132
- Guiyao Ke, Pierre-François Marteau, Gildas Ménier:
Variations on quantitative comparability measures and their evaluations on synthetic French-English comparable corpora. 133-139
- Paul Felt, Eric K. Ringger, Kevin D. Seppi, Kristian Heal:
Using Transfer Learning to Assist Exploratory Corpus Annotation. 140-145
- Miguel B. Almeida, Mariana S. C. Almeida, André F. T. Martins, Helena Figueira, Pedro Mendes, Cláudia Pinto:
Priberam Compressive Summarization Corpus: A New Multi-Document Summarization Corpus for European Portuguese. 146-152
- Patrick Schone, Heath E. Nielson, Mark Ward:
Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records. 153-159
- Milena Hnátková, Michal Kren, Pavel Procházka, Hana Skoumalová:
The SYN-series corpora of written Czech. 160-164
- Karel Kucera, Martin Stluka:
Corpus of 19th-century Czech Texts: Problems and Solutions. 165-168
- Maik Stührenberg:
Extending standoff annotation. 169-174
- Stefan Höfler, Kyoko Sugisaki:
Constructing and exploiting an automatically annotated resource of legislative texts. 175-180
- Yuan Luo, Thomas F. Boucher, Tolga Oral, David Osofsky, Sara Weber:
A Study on Expert Sourcing Enterprise Question Collection and Classification. 181-188
- Balamurali A. R.:
Can the Crowd be Controlled?: A Case Study on Crowd Sourcing and Automatic Validation of Completed Tasks based on User Modeling. 189-195
- Mitesh M. Khapra, Ananthakrishnan Ramanathan, Anoop Kunchukuttan, Karthik Visweswariah, Pushpak Bhattacharyya:
When Transliteration Met Crowdsourcing: An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control. 196-202
- Manjira Sinha, Tirthankar Dasgupta, Anupam Basu:
Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages. 203-210
- Martin Benjamin:
Collaboration in the Production of a Massively Multilingual Lexicon. 211-215
- Marco Marelli, Stefano Menini, Marco Baroni, Luisa Bentivogli, Raffaella Bernardi, Roberto Zamparelli:
A SICK cure for the evaluation of compositional distributional semantic models. 216-223
- Wajdi Zaghouani, Kais Dukes:
Can Crowdsourcing be used for Effective Annotation of Arabic? 224-228
- Héctor Martínez Alonso, Lauren Romeo:
Crowdsourcing as a preprocessing for complex semantic annotation tasks. 229-234
- Christoph Draxler:
Online experiments with the Percy software framework - experiences and some early results. 235-240
- Ryan Cotterell, Chris Callison-Burch:
A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic. 241-245
- Stefan Ultes, Hüseyin Dikme, Wolfgang Minker:
First Insight into Quality-Adaptive Dialogue. 246-251
- Volha Petukhova, Martin Gropp, Dietrich Klakow, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlícek, Blaise Potard, John Dines, Olivier Deroo, Ronny Egeler, Uwe Meinz, Steffen Liersch, Anna Schmidt:
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues. 252-258
- Dietmar F. Rösner, Rafael Friesen, Stephan Günther, Rico Andrich:
Modeling and evaluating dialog success in the LAST MINUTE corpus. 259-265
- Layla El Asri, Rémi Lemonnier, Romain Laroche, Olivier Pietquin, Hatim Khouzaimi:
NASTIA: Negotiating Appointment Setting Interface. 266-271
- Layla El Asri, Romain Laroche, Olivier Pietquin:
DINASTI: Dialogues with a Negotiating Appointment Setting Interface. 272-278
- Thomas Pellegrini, Vahid Hedayati, Ângela Costa:
El-WOZ: a client-server wizard-of-oz interface. 279-282
- Claire Brierley, Majdi Sawalha, Eric Atwell:
Tools for Arabic Natural Language Processing: a case study in qalqalah prosody. 283-287
- Johann-Mattis List, Jelena Prokic:
A Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology. 288-294
- Anne Lacheret, Sylvain Kahane, Julie Beliao, Anne Dister, Kim Gerdes, Jean-Philippe Goldman, Nicolas Obin, Paola Pietrandrea, Atanas Tchobanov:
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French. 295-301
- Jean-Philippe Goldman, Tea Prsir, Antoine Auchlin:
C-PhonoGenre: a 7-hours corpus of 7 speaking styles in French: relations between situational features and prosodic properties. 302-305
- Abir Masmoudi, Mariem Ellouze Khmekhem, Yannick Estève, Lamia Hadrich Belguith, Nizar Habash:
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition. 306-310
- Yuichi Ishimoto, Tomoyuki Tsuchiya, Hanae Koiso, Yasuharu Den:
Towards Automatic Transformation between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features. 311-315
- Tiberiu Boros, Adriana Stan, Oliver Watts, Stefan Daniel Dumitrescu:
RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus. 316-320
- Klim Peshkov, Laurent Prévot:
Segmentation evaluation metrics, a comparison grounded on prosodic and discourse units. 321-325
- Bistra Andreeva, William J. Barry, Jacques C. Koreman:
A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence. 326-330
- Liviu P. Dinu, Alina Maria Ciobanu, Ioana Chitoran, Vlad Niculae:
Using a machine learning model to assess the complexity of stress systems. 331-336
- Tanja Schultz, Tim Schlippe:
GlobalPhone: Pronunciation Dictionaries in 20 Languages. 337-341
- Juan Rafael Orozco-Arroyave, Julián David Arias-Londoño, Jesús Francisco Vargas-Bonilla, María Claudia Gonzalez-Rátiva, Elmar Nöth:
New Spanish speech corpus database for the analysis of people suffering from Parkinson's disease. 342-347
- François Salmon, Félicien Vallet:
An Effortless Way To Create Large-Scale Datasets For Famous Speakers. 348-352
- Florian Schiel, Thomas Kisler:
German Alcohol Language Corpus - the Question of Dialect. 353-356
- Jetske Klatter, Roeland van Hout, Henk van den Heuvel, Paula Fikkert, Anne Baker, Jan De Jong, Frank Wijnen, Eric Sanders, Paul Trilsbeek:
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive. 357-364
- Mirjam Ernestus, Lucie Kocková-Amortová, Petr Pollák:
The Nijmegen Corpus of Casual Czech. 365-370
- Carlos Daniel Hernandez Mena, Abel Herrera Camacho:
CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus. 371-375
- Marie Koprivová, Hana Golánová, Petra Klimesová, David Lukes:
MAPPING DIATOPIC AND DIACHRONIC VARIATION IN SPOKEN CZECH: THE ORTOFON AND DIALEKT CORPORA. 376-382
- Thomas Schmidt:
The Research and Teaching Corpus of Spoken German ― FOLK. 383-387
- Niklas Vanhainen, Giampiero Salvi:
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish. 388-392
- Marta Villegas, Maite Melero, Núria Bel:
Metadata as Linked Open Data: mapping disparate XML metadata registries into one RDF/OWL registry. 393-400
- Maud Ehrmann, Francesco Cecconi, Daniele Vannella, John Philip McCrae, Philipp Cimiano, Roberto Navigli:
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0. 401-408
- Jorge Gracia, Elena Montiel-Ponsoda, Daniel Vila-Suero, Guadalupe Aguado de Cea:
Enabling Language Resources to Expose Translations as Linked Data on the Web. 409-413
- Thierry Declerck, Karlheinz Mörth, Eveline Wandl-Vogt:
A SKOS-based Schema for TEI encoded Dictionaries at ICLTT. 414-417
- Anindya Roy, Camille Guinaudeau, Hervé Bredin, Claude Barras:
TVD: A Reproducible and Multiply Aligned TV Series Dataset. 418-425
- James Pustejovsky, Zachary Yocum:
Image Annotation with ISO-Space: Distinguishing Content from Structure. 426-431
- Arantza del Pozo, Carlo Aliprandi, Aitor Álvarez, Carlos Mendes, Joao P. Neto, Sérgio Paulo, Nicola Piccinini, Matteo Raffaelli:
SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling. 432-436
- Pablo Ruiz Fabo, Aitor Álvarez, Haritz Arzelus:
Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling. 437-442
- Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel:
KALAKA-3: a database for the recognition of spoken European languages on YouTube audios. 443-449
- Dilek Küçük, Guillaume Jacquet, Ralf Steinberger:
Named Entity Recognition on Turkish Tweets. 450-454
- Nathan Schneider, Spencer Onuffer, Nora Kazour, Emily Danchik, Michael T. Mordowanec, Henrietta Conrad, Noah A. Smith:
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus. 455-461
- Clare Llewellyn, Claire Grover, Jon Oberlander, Ewan Klein:
Re-using an Argument Corpus to Aid in the Curation of Social Media Collections. 462-468
- Marc T. Tomlinson, David B. Bracewell, Wayne Krug, David Hinote:
#mygoal: Finding Motivations on Twitter. 469-474
- Andrew Yates, Jon Parker, Nazli Goharian, Ophir Frieder:
A Framework for Public Health Surveillance. 475-482
- Ahmet Aker, Monica Lestari Paramita, Emma Barker, Robert J. Gaizauskas:
Bootstrapping Term Extractors for Multiple Languages. 483-489
- Els Lefever, Marjan Van de Kauter, Véronique Hoste:
Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch. 490-497
- Lori S. Levin, Teruko Mitamura, Brian MacWhinney, Davida Fromm, Jaime G. Carbonell, Weston Feely, Robert E. Frederking, Anatole Gershman, Carlos Ramírez:
Resources for the Detection of Conventionalized Metaphors in Four Languages. 498-501
- Yves Scherrer, Benoît Sagot:
A language-independent and fully unsupervised approach to lexicon induction and part-of-speech tagging for closely related languages. 502-508
- Stefan Bott, Sabine Schulte im Walde:
Optimizing a Distributional Semantic Model for the Prediction of German Particle Verb Compositionality. 509-516
- Kristiina Jokinen:
Open-domain Interaction and Online Content in the Sami Language. 517-522
- Tjerk Hagemeijer, Michel Généreux, Iris Hendrickx, Amália Mendes, Abigail Tiny, Armando Zamora:
The Gulf of Guinea Creole Corpora. 523-529
- Dagmar Jung, Katarzyna Klessa, Zsuzsa Duray, Beatrix Oszkó, Mária Sipos, Sándor Szeverényi, Zsuzsa Várnai, Paul Trilsbeek, Tamás Váradi:
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages. 530-535
- José Pedro Ferreira, Cristiano Chesi, Daan Baldewijns, Fernando Miguel Pinto, Margarita Correia, Daniela Braga, Hyongsil Cho, Amadeu Ferreira, Miguel Sales Dias:
Casa de la Lhéngua: a set of language resources and natural language processing tools for Mirandese. 536-540
- Christian Curtis:
A finite-state morphological analyzer for a Lakota HPSG grammar. 541-544
- Adam Kilgarriff, Pavel Rychlý, Milos Jakubícek, Vojtech Kovár, Vít Baisa, Lucia Kocincová:
Extrinsic Corpus Evaluation with a Collocation Dictionary Task. 545-552
- Nancy L. Underwood, Bartolomé Mesa-Lao, Mercedes García-Martínez, Michael Carl, Vicent Alabau, Jesús González-Rubio, Luis A. Leiva, Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Francisco Casacuberta:
Evaluating the effects of interactivity in a post-editing workbench. 553-559
- Bogdan Ludusan, Maarten Versteegh, Aren Jansen, Guillaume Gravier, Xuan-Nga Cao, Mark Johnson, Emmanuel Dupoux:
Bridging the gap between speech technology and natural language processing: an evaluation toolbox for term discovery systems. 560-567
- Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo:
Introducing a Framework for the Evaluation of Music Detection Tools. 568-572
- Bartosz Broda, Bartlomiej Niton, Wlodzimierz Gruszczynski, Maciej Ogrodniczuk:
Measuring Readability of Polish Texts: Baseline Experiments. 573-580
- Jason Utt, Sylvia Springorum, Maximilian Köper, Sabine Schulte im Walde:
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data. 581-587
- Andrea Horbach, Alexis Palmer, Magdalena Wolska:
Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers. 588-595
- Petra Barancíková, Rudolf Rosa, Ales Tamchyna:
Improving Evaluation of English-Czech MT through Paraphrasing. 596-601
- Chi-kiu Lo, Dekai Wu:
On the reliability and inter-annotator agreement of human semantic MT evaluation via HMEANT. 602-607
- Nelleke Oostdijk, Henk van den Heuvel:
The evolving infrastructure for language resources and the role for data scientists. 608-612
- Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen:
Using TEI, CMDI and ISOcat in CLARIN-DK. 613-618
- Jonathan Chevelu, Gwénolé Lecorvé, Damien Lolive:
ROOTS: a toolkit for easy, fast and consistent processing of large sequential annotated data collections. 619-626
- Matteo Abrate, Angelo Mario Del Grosso, Emiliano Giovannetti, Angelica Lo Duca, Damiana Luzzi, Lorenzo Mancini, Andrea Marchetti, Irene Pedretti, Silvia Piccini:
Sharing Cultural Heritage: the Clavius on the Web Project. 627-634
- Verena Lyding, Lionel Nicolas, Egon Stemle:
'interHist' - an interactive visual interface for corpus exploration. 635-641
- Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi:
Constructing a Chinese―Japanese Parallel Corpus from Wikipedia. 642-647
- Lise Rebout, Philippe Langlais:
An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus. 648-655
- Dan Tufis:
Large SMT data-sets extracted from Wikipedia. 656-663
- Juan Luo, Yves Lepage:
Production of Phrase Tables in 11 European Languages using an Improved Sub-sentential Aligner. 664-669
- Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Collection of a Simultaneous Translation Corpus for Comparative Analysis. 670-673
- Sharid Loáiciga, Thomas Meyer, Andrei Popescu-Belis:
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling. 674-681
- Bushra Jawaid, Ondrej Bojar:
Two-Step Machine Translation with Lattices. 682-686
- Matej Durco, Menzo Windhouwer:
The CMD Cloud. 687-690
- Fritz Kliche, André Blessing, Ulrich Heid, Jonathan Sonntag:
The eIdentity Text Exploration Workbench. 691-697
- Damir Cavar, Malgorzata Cavar:
Visualization of Language Relations and Families: MultiTree. 698-701
- Pierre André Ménard, Caroline Barrière:
Linked Open Data and Web Corpus Data for noun compound bracketing. 702-709
- Anita Rácz, István Nagy T., Veronika Vincze:
4FX: Light Verb Constructions in a Multilingual Parallel Corpus. 710-715
- Wan Yu Ho, Christine Kng, Shan Wang, Francis Bond:
Identifying Idioms in Chinese Translations. 716-721
- Kara Warburton:
NARROWING THE GAP BETWEEN TERMBASES AND CORPORA IN COMMERCIAL ENVIRONMENTS. 722-727
- Rodrigo Boos, Kassius Prestes, Aline Villavicencio:
Identification of Multiword Expressions in the brWaC. 728-735
- Lis Pereira, Elga Strafella, Yuji Matsumoto:
Collocation or Free Combination? ― Applying Machine Translation Techniques to identify collocations in Japanese. 736-739
- Irina P. Temnikova, Andrea Varga, Dogan Biyikli:
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests. 740-747
- Riyaz Ahmad Bhat, Shahid Mushtaq Bhat, Dipti Misra Sharma:
Towards building a Kashmiri Treebank: Setting up the Annotation Pipeline. 748-752
- Shinsuke Mori, Hideki Ogura, Tetsuro Sasada:
A Japanese Word Dependency Corpus. 753-758
- Chris Culy, Marco Passarotti, Ulla König-Cardanobile:
A Compact Interactive Visualization of Dependency Treebank Query Results. 759-766