


Language Resources and Evaluation, Volume 59
Volume 59, Number 1, March 2025
- Jesin James, Isabella Shields, Vithya Yogarajan, Peter J. Keegan, Catherine I. Watson, Peter-Lucas Jones, Keoni Mahelona:
The development of a labelled te reo Māori-English bilingual database for language technology. 1-26
- Jia Hoong Ong, Florence Yik Nam Leung, Fang Liu:
The Reading Everyday Emotion Database (REED): a set of audio-visual recordings of emotions in music and language. 27-49
- Emna Fsih, Rahma Boujelbane, Lamia Hadrich Belguith:
Resources building for sentiment analysis of content disseminated by Tunisian medias in social networks. 51-76
- Gábor Simon, Tímea Borbála Bajzát, Júlia Ballagó, Zsuzsanna Havasi, Emese K. Molnár, Eszter Szlávich:
When MIPVU goes to no man's land: a new language resource for hybrid, morpheme-based metaphor identification in Hungarian. 77-108
- Colin Swaelens, Ilse De Vos, Els Lefever:
Linguistic annotation of Byzantine book epigrams. 109-134
- Kerenza Doxolodeo, Adila Alfa Krisnadhi:
AC-IQuAD: Automatically Constructed Indonesian Question Answering Dataset by Leveraging Wikidata. 135-160
- Kiran Babu Nelatoori, Hima Bindu Kommanti:
Toxic comment classification and rationale extraction in code-mixed text leveraging co-attentive multi-task learning. 161-190
- Paras Tiwari, Sawan Rai, C. Ravindranath Chowdary:
Large scale annotated dataset for code-mix abusive short noisy text. 191-218
- Merve Bayrak, Deniz Dal:
A new methodology for automatic creation of concept maps of Turkish texts. 219-256
- Jenna Kanerva, Hanna Kitti, Li-Hsin Chang, Teemu Vahtola, Mathias Creutz, Filip Ginter:
Semantic search as extractive paraphrase span detection. 257-276
- François Delon, Gabriel Bédubourg, Léo Bouscarrat, Jean-Baptiste Meynard, Aude Valois, Benjamin Queyriaux, Carlos Ramisch, Marc Tanti:
Infectious risk events and their novelty in event-based surveillance: new definitions and annotated corpus. 277-295
- Tharindu Ranasinghe, Isuri Anuradha, Damith Premasiri, Kanishka Silva, Hansi Hettiarachchi, Lasitha Uyangodage, Marcos Zampieri:
SOLD: Sinhala offensive language dataset. 297-337
- Gerardo Sierra, Gemma Bel-Enguix, Ameyali Díaz-Velasco, Natalia Guerrero-Cerón, Núria Bel:
An aligned corpus of Spanish bibles. 339-369
- Béatrice Biancardi, Mathieu Chollet, Chloé Clavel:
Introducing the 3MT_French dataset to investigate the timing of public speaking judgements. 371-390
- Federica Beccaria, Angela Cristiano, Flavio Pisciotta, Noemi Usardi, Elisa Borgogni, Filippo Prayer Galletti, Giulia Corsi, Lorenzo Gregori, Gloria Gagliardi:
DILLo: an Italian lexical database for speech-language pathologists. 391-411
- Anna Chromá, Jakub Sláma, Klára Matiasovitsová, Jolana Treichelová:
A morphologically annotated longitudinal corpus of spoken Czech child-adult interactions. 413-436
- Mojca Brglez, Omnia Zayed, Paul Buitelaar:
TCMeta: a multilingual dataset of COVID tweets for relation-level metaphor analysis. 437-475
- Shankar Biradar, Sunil Saumya, Arun Chauhan:
Faux Hate: unravelling the web of fake narratives in spreading hateful stories: a multi-label and multi-class dataset in cross-lingual Hindi-English code-mixed text. 477-508
- Rasha Obeidat, Yara Alharahsheh, Mahmoud Al-Ayyoub, Maram Gharaibeh:
ArEntail: manually-curated Arabic natural language inference dataset from news headlines. 509-535
- Taja Kuzman, Nikola Ljubesic:
Automatic genre identification: a survey. 537-570
- Steven Coats:
A new corpus of geolocated ASR transcripts from Germany. 571-589
- Omaima Abboud, Batia Laufer, Noam Ordan, Uliana Sentsova, Shuly Wintner:
A corpus of English learners with Arabic and Hebrew backgrounds. 591-599
- Soran Badawi, Arefeh Kazemi, Vali Rezaie:
KurdiSent: a corpus for kurdish sentiment analysis. 601-620
- Borja Herce, Bogdan Pricop:
VeLeRo: an inflected verbal lexicon of standard Romanian and a quantitative analysis of morphological predictability. 621-637
Volume 59, Number 2, June 2025
- Abubakr H. Ombabi, Wael Ouarda, Adel M. Alimi:
Improving Arabic sentiment analysis across context-aware attention deep model based on natural language processing. 639-663
- Brayan Stiven Lancheros, Gloria Corpas Pastor, Ruslan Mitkov:
Data augmentation and transfer learning for cross-lingual Named Entity Recognition in the biomedical domain. 665-684
- Chen Gafni, Livnat Herzig Sheinfux, Hadar Klunover, Anat Bar Siman Tov, Anat Prior, Shuly Wintner:
Analyzing learner language: the case of the Hebrew Learner Essay Corpus. 685-726
- Ida Szubert, Omri Abend, Nathan Schneider, Samuel Gibbon, Louis Mahon, Sharon Goldwater, Mark Steedman:
Cross-linguistically consistent semantic and syntactic annotation of child-directed speech. 727-776
- Ida Szubert, Omri Abend, Nathan Schneider, Samuel Gibbon, Louis Mahon, Sharon Goldwater, Mark Steedman:
Correction: Cross-linguistically consistent semantic and syntactic annotation of child-directed speech. 777-778
- Jaroslav Reichel, Lubomír Benko:
Preservation of sentiment in machine translation of low-resource languages: a case study on Slovak movie subtitles. 779-805
- Nur Azmina Mohamad Zamani, Norhaslinda Kamaruddin, Ahmad Muhyiddin B. Yusof:
Dataset on sentiment-based cryptocurrency-related news and tweets in English and Malay language. 807-842
- Jihye Park, Hye Jin Lee, Sungzoon Cho:
Automatic construction of direction-aware sentiment lexicon using direction-dependent words. 843-869
- Manoel Fernando Alonso Gadi, Miguel-Ángel Sicilia:
A sentiment corpus for the cryptocurrency financial domain: the CryptoLin corpus. 871-889
- Katja Meden, Tomaz Erjavec, Andrej Pancur:
Slovenian parliamentary corpus siParl. 891-911
- Clement Levallois:
Umigon-lexicon: rule-based model for interpretable sentiment analysis and factuality categorization. 913-930
- Michalis Mountantonakis, Loukas Mertzanis, Michalis Bastakis, Yannis Tzitzikas:
A comparative evaluation for question answering over Greek texts by using machine translation and BERT. 931-957
- Ringki Das, Thoudam Doren Singh:
Which words are important?: an empirical study of Assamese sentiment analysis. 959-982
- Johnatan Estiven Bonilla:
Spoken Spanish PoS tagging: gold standard dataset. 983-1012
- Pablo Báez, Leonardo Campillos-Llanos, Fredy Núñez, Jocelyn Dunstan:
Entity normalization in a Spanish medical corpus using a UMLS-based lexicon: findings and limitations. 1013-1041
- Gábor Recski, Eszter Iklódi, Björn Lellmann, Ádám Kovács, Allan Hanbury:
BRISE-plandok: a German legal corpus of building regulations. 1043-1082
- Minni Jain, Rajni Jindal, Amita Jain:
DoSLex: automatic generation of all domain semantically rich sentiment lexicon. 1083-1110
- Xue Li, Paul Groth:
How different is different? Systematically identifying distribution shifts and their impacts in NER datasets. 1111-1150
- Spela Arhar Holdt, Iztok Kosem:
Šolar, the developmental corpus of Slovene. 1151-1177
- Mohammad Abdous, Poorya Piroozfar, Behrouz Minaei-Bidgoli:
PESTS: Persian_English cross lingual corpus for semantic textual similarity. 1179-1199
- Ibtissam Touahri, Azzeddine Mazroui:
Annotation and evaluation of a dialectal Arabic sentiment corpus against benchmark datasets using transformers. 1201-1233
- Shujun Wan, Peter Bourgonje, Hongling Xiao, Clara Wan Ching Ho:
Chinese-DiMLex: a lexicon of Chinese discourse connectives. 1235-1256
- Douglas Vitório, Ellen Souza, Lucas Martins, Nádia Félix F. da Silva, André Carlos Ponce de Leon Ferreira de Carvalho, Adriano Lorena Inácio de Oliveira, Francisco Edmundo de Andrade:
Building a relevance feedback corpus for legal information retrieval in the real-case scenario of the Brazilian Chamber of Deputies. 1257-1277
- Sriram Krishnan, Amba Kulkarni, Gérard Huet:
Normalized dataset for Sanskrit word segmentation and morphological parsing. 1279-1330
- Najet Hadj Mohamed, Chérifa Ben Khelil, Agata Savary, Iskander Keskes, Jean-Yves Antoine, Lamia Hadrich Belguith:
PARSEME-AR: Arabic reference corpus for multiword expressions using PARSEME annotation guidelines. 1331-1361
- Francesco Periti, Sergio Picascia, Stefano Montanelli, Alfio Ferrara, Nina Tahmasebi:
Studying word meaning evolution through incremental semantic shift detection. 1363-1399
- Mouad Jbel, Mourad Jabrane, Imad Hafidi, Abdelmoutalib Metrane:
Sentiment analysis dataset in Moroccan dialect: bridging the gap between Arabic and Latin scripted dialect. 1401-1430
- Dominik Schlechtweg, Frank D. Zamora-Reina, Felipe Bravo-Marquez, Nikolay Arefyev:
Sense through time: diachronic word sense annotations for word sense induction and Lexical Semantic Change Detection. 1431-1465
- Sven Laur, Siim Orasmaa, Sandra Eiche, Dage Särg:
Automatic dependency parsing of Estonian: what linguistic features to include? 1467-1494
- Fanny Ducel, Aurélie Névéol, Karën Fort:
"You'll be a nurse, my son!" Automatically assessing gender biases in autoregressive language models in French and Italian. 1495-1523
- Claire Bonial, Stephanie M. Lukin, Mitchell Abrams, Anthony Baker, Lucia Donatelli, Ashley Foots, Cory J. Hayes, Cassidy Henry, Taylor Hudson, Matthew Marge, Kimberly A. Pollard, Ron Artstein, David R. Traum, Clare R. Voss:
Human-robot dialogue annotation for multi-modal common ground. 1525-1575
- Johannes Sibeko, Menno van Zaanen:
Developing and testing syllabification systems for South African Sesotho. 1577-1592
- Rukayah Alhedayani:
The Najdi Arabic Corpus: a new corpus for an underrepresented Arabic dialect. 1593-1612
- Joseph Jessie S. Oñate, Tiffany Lyn O. Pandes:
Exploratory Analysis of Rinconada Bikol Language-Nabua Text Corpus. 1613-1629
- Pascual Julián Iranzo, Germán Rigau, Fernando Sáenz-Pérez, Pablo Velasco-Crespo:
Conversion of the Spanish WordNet databases into a Prolog-readable format. 1631-1657
- Chiara Alzetta, Simonetta Montemagni, Marta Sartor, Giulia Venturi:
Parlamint-it: an 18-karat UD treebank of Italian parliamentary speeches. 1659-1683
- Felipe Alves Siqueira, Douglas Vitório, Ellen Souza, José A. P. Santos, Hidelberg Oliveira Albuquerque, Márcio de Souza Dias, Nádia Félix F. da Silva, André C. P. L. F. de Carvalho, Adriano L. I. Oliveira, Carmelo J. A. Bastos Filho:
Ulysses Tesemõ: a new large corpus for Brazilian legal and governmental domain. 1685-1704
- Borja Herce:
VeLeSpa: An inflected verbal lexicon of Peninsular Spanish and a quantitative analysis of paradigmatic predictability. 1705-1718
- Simona Frenda, Gavin Abercrombie, Valerio Basile, Alessandro Pedrani, Raffaella Panizzon, Alessandra Teresa Cignarella, Cristina Marco, Davide Bernardi:
Perspectivist approaches to natural language processing: a survey. 1719-1746
- Rong Xiang, Emmanuele Chersoni, Yixia Li, Jing Li, Chu-Ren Huang, Yushan Pan, Yushi Li:
Cantonese natural language processing in the transformers era: a survey and current challenges. 1747-1773
- Zeyu Zhang, Steven Bethard:
A survey on geocoding: algorithms and datasets for toponym resolution. 1775-1796
- Tomás Freitas Osório, Henrique Lopes Cardoso:
Historical Portuguese corpora: a survey. 1797-1832
Volume 59, Number 3, September 2025
- Vandan Mujadia, Pruthwik Mishra, Dipti Misra Sharma:
Disfluency annotated corpora for Indian English in technical domains. 1833-1864
- Soma Das, Sanjay Chatterji:
Sanitization of septic news sentences through hybrid approach in English. 1865-1897
- Darinka Verdonik, Andreja Bizjak, Andrej Zgank, Mirjam Sepesy Maucec, Mitja Trojar, Jerneja Zganec-Gros, Marko Bajec, Iztok Lebar Bajec, Simon Dobrisek:
Strategies for managing time and costs in speech corpus creation: insights from the Slovenian ARTUR corpus. 1899-1924
- Francesca Carbone, Gilles Bouchet, Alain Ghio, Thierry Legou, Carine André, Muriel Lalain, Caterina Petrone, Antoine Giovanni:
Investigating droplet emission during speech interaction. 1925-1953
- Francesca Carbone, Gilles Bouchet, Alain Ghio, Thierry Legou, Carine André, Muriel Lalain, Caterina Petrone, Antoine Giovanni:
Correction to: Investigating droplet emission during speech interaction. 1955-1956
- Hyo-sun Ryu, Jae Kook Lee:
Detection of political hate speech in Korean language. 1957-1988
- Jakub Sido, Michal Seják, Ondrej Prazák, Miloslav Konopík, Václav Moravec:
Czech news dataset for semantic textual similarity. 1989-2006
- Alberto Benayas, Miguel-Ángel Sicilia, Marçal Mora Cantallops:
A comparative analysis of encoder only and decoder only models in intent classification and sentiment analysis: navigating the trade-offs in model size and performance. 2007-2030
- Wolfgang S. Schmeisser-Nieto, Alessandra Teresa Cignarella, Tom Bourgeade, Simona Frenda, Alejandro Ariza-Casabona, Mario Laurent, Paolo Giovanni Cicirelli, Andrea Marra, Giuseppe Corbelli, Farah Benamara, Cristina Bosco, Véronique Moriceau, Marinella Paciello, Viviana Patti, Mariona Taulé, Francesca D'Errico:
Stereohoax: a multilingual corpus of racial hoaxes and social media reactions annotated for stereotypes. 2031-2069
- Tomaz Erjavec, Matyás Kopp, Nikola Ljubesic, Taja Kuzman, Paul Rayson, Petya Osenova, Maciej Ogrodniczuk, Çagri Çöltekin, Danijel Korzinek, Katja Meden, Jure Skubic, Peter Rupnik, Tommaso Agnoloni, José Aires, Starkaður Barkarson, Roberto Bartolini, Núria Bel, María Calzada Pérez, Roberts Dargis, Sascha Diwersy, Maria Gavriilidou, Ruben van Heusden, Mikel Iruskieta, Neeme Kahusk, Anna Kryvenko, Noémi Ligeti-Nagy, Carmen Magariños, Martin Mölder, Costanza Navarretta, Kiril Simov, Lars Magne Tungland, Jouni Tuominen, John Edward Vidler, Adina Ioana Vladu, Tanja Wissik, Väinö Yrjänäinen, Darja Fiser:
ParlaMint II: advancing comparable parliamentary corpora across Europe. 2071-2102
- V. Jothi Prakash, S. Arul Antran Vijay:
An integrated framework for emotion and sentiment analysis in Tamil and Malayalam visual content. 2103-2141
- Lucie Barque, Richard Huyghe, Martial Foegel:
Exploring lexical factors in semantic annotation: insights from the classification of nouns in French. 2143-2167
- Mahadia Tunga, Davis David:
Introducing a Swahili social media sentiment analysis dataset for the telecom industry. 2169-2184
- Victor Gonçalves Lima, Denilson Alves Pereira:
UFLA-FORMS: an academic forms dataset for information extraction in the Portuguese language. 2185-2211
- Kozhin Muhealddin Awlla, Hadi Veisi, Abdulhady Abas Abdullah:
Sentiment analysis in low-resource contexts: BERT's impact on Central Kurdish. 2213-2243
- Majid Adibian, Hossein Zeinali, Soroush Barmaki:
DeepMine-multi-TTS: a Persian speech corpus for multi-speaker text-to-speech. 2245-2264
- Fengkai Liu, Tan Jin, John S. Y. Lee:
Automatic readability assessment for sentences: neural, hybrid and large language models. 2265-2296
- Leila Hazrati, Alireza Sokhandan, Leili Farzinvash:
Improving irony speech spreaders profiling on social networks using clustering & transformer based models. 2297-2327
- Yinglun Sun, Jose Zavala, Shuju Shi, Rachel Finegold, Roxana Girju, Jeffrey Moore:
MedicalCare: building and annotating an empathy-rich corpus. 2329-2364
- David Gimeno-Gómez, Carlos D. Martínez-Hinarejos:
Evaluation of end-to-end continuous spanish lipreading in different data conditions. 2365-2386
- Endang Wahyu Pamungkas, Patricia Chiril:
Ngalawan Ujaran Sengit: hate speech detection in indonesian code-mixed social media data. 2387-2414
- Serhii Zasiekin, Larysa Zasiekina, Emilie Altman, Mariia Hryntus, Victor Kuperman:
The narratives of war (NoW) corpus of written testimonies of the Russia-Ukraine war. 2415-2426
- Tomer Sagi, Moran Zaga, Sinai Rusinek, Marcell Fekete, Johannes Bjerva, Katja Hose:
Utilizing phonetic similarity for cross-source and cross-language toponym matching: a benchmark and prototype. 2427-2451
- Nankai Lin, Yingwen Fu, Xiaotian Lin, Ziyu Yang, Shengyi Jiang:
A new evaluation method: evaluation data and metrics for Chinese grammatical error correction. 2453-2468
- Muhammad Saad Amin, Xiao Zhang, Luca Anselma, Alessandro Mazzei, Johan Bos:
Semantic processing for Urdu: corpus creation, parsing, and generation. 2469-2500
- Hongjie Cai, Nan Song, Zengzhi Wang, Qiming Xie, Qiankun Zhao, Ke Li, Siwei Wu, Shijie Liu, Heqing Ma, Jianfei Yu, Rui Xia:
MEMD-ABSA: a multi-element multi-domain dataset for aspect-based sentiment analysis. 2501-2529
- Magdalena Gapsa:
"But why??" Evaluation of user-suggested synonyms in the Thesaurus of Modern Slovene. 2531-2563
- Hossein Mirzaee, Javad Peymanfard, Hamid Habibzadeh Moshtaghin, Hossein Zeinali:
ArmanEmo: a Persian dataset for text-based emotion detection. 2565-2587
- Mohammed Elsadiq Barmati, Bachir Said, Abdelghani Dahou:
Multi-task learning for multi-dialect Arabic sentiment classification and sarcasm detection. 2589-2612
- Mahendra Gupta, Maitreyee Dutta, Chandresh Kumar Maurya:
Benchmarking Hindi-to-English direct speech-to-speech translation with synthetic data. 2613-2651
- Vandan Mujadia, Pruthwik Mishra, Dipti Misra Sharma:
Disfluency processing for cascaded speech translation involving English and Indian languages. 2653-2686
- Pascal Moliner, Patrick Rateau, Anthony Piermattéo, Emma Claudinon, Enola Guegan:
Tropes and the EmotAix lexicon for evaluating the emotional tonality of French verbal association corpora in social representation studies. 2687-2703
- G. Bharathi Mohan, M. Gayathri, R. Prasanna Kumar:
Detoxifying language model outputs: combining multi-agent debates and reinforcement learning for improved summarization. 2705-2736
- Anxo Pérez, Marcos Fernández-Pichel, Javier Parapar, David E. Losada:
DepreSym: A Depression Symptom Annotated Corpus and the Role of Large Language Models as Assessors of Psychological Markers. 2737-2762
- Rachel M. Murphy, Dave A. Dongelmans, Nicolette F. de Keizer, Rosa J. Jongeneel, Christiaan H. Koster, Kitty J. Jager, Ameen Abu-Hanna, Iacer Calixto, Joanna E. Klopotowska:
Creation of a gold standard Dutch corpus of clinical notes for adverse drug event detection: the Dutch ADE corpus. 2763-2779
- Branko Zitko, Angelina Gaspar, Lucija Brocic, Daniel Vasic, Ani Grubisic:
Human-machine interaction in building an English reference dataset for natural language processing tasks. 2781-2809
- Sanjana Kavatagi, Rashmi Rachh:
HASTIKA: hate speech and target identification in Kannada-English code-mixed text. 2811-2856
- Quanqi Du, Sofie Labat, Thomas Demeester, Véronique Hoste:
UniC: a dataset for emotion analysis of videos with multimodal and unimodal labels. 2857-2892
- Shivani Tufchi, Ashima Yadav, Tanveer Ahmed:
AMTCF: an advanced multimodal transformer and ConvNext fusion for contextualized fake news detection in digital landscape. 2893-2927
- Joy Gorai, Dilip Kumar Shaw:
Attention and LoRA-based multimodal emotion detection system. 2929-2944
- Ananya Pandey, Dinesh Kumar Vishwakarma:
Aspect-based multimodal sentiment analysis via employing visual-to-emotional-caption translation network using visual-caption pairs. 2945-2972
- Gili Goldin, Nick Howell, Noam Ordan, Ella Rabinovich, Shuly Wintner:
The Knesset corpus: an annotated corpus of Hebrew parliamentary proceedings. 2973-3004
- Yunhao Zhang, Xiaohan Zhang, Chong Li, Shaonan Wang, Chengqing Zong:
MulCogBench: a multi-modal cognitive benchmark dataset for evaluating Chinese and English computational language models. 3005-3028
- Xiaolong Wu, Chaobo Song, Shanshan Xiang, Ronghe Cao, Chang Feng, Hankiz Yilahun, Mingxing Xu, Askar Hamdulla, Thomas Fang Zheng:
A Chinese natural speech complex emotion dataset based on emotion vector annotation method. 3029-3050
- Bharathi Raja Chakravarthi, Saranya Rajiakodi, Rahul Ponnusamy, Bhuvaneswari Sivagnanam, Sara Yogesh Thakare, Sathiyaraj Thangasamy:
Detecting caste and migration hate speech in low-resource Tamil language. 3051-3086
- Marco Casavantes, Manuel Montes-y-Gómez, Delia Irazú Hernández Farías, Luis Carlos González-Gurrola, Alberto Barrón-Cedeño:
PropitterX: a Twitter-based propaganda corpus extended with multiple contextual features. 3087-3115
- Hiuching Hung, Thorsten Piske, Paula Andrea Pérez-Toro, Tomás Arias-Vergara, Andreas Maier:
kidsNARRATE: a versatile corpus for studying Chinese-english bilingual L2 narrative skills in preschoolers. 3117-3138
- Aizihaierjiang Yusufu, Kamran Aziz, Aizierguli Yusufu, Abidan Ainiwaer, Fei Li, Donghong Ji:
Uzbek news corpus for named entity recognition. 3139-3152
- Sujit Kumar, Anant Shankhdhar, Divyam Singal, Bhuvan Aggarwal, Ahaan Sameer Malhotra, Sanasam Ranbir Singh:
Fake news article detection datasets for Hindi language. 3153-3188
- Teisovi Angami, Mimi Kevichüsa-Ezung, Sanasam Ranbir Singh, Themrichon Tuithung:
Evaluation of the morphological rules for the Tenyidie language: a low-resource language. 3189-3214
- Arash Ghafouri, Hassan Naderi, Mahdi Firouzmandi:
PinLID: a dataset for Pinglish language identification based on code-mixing sentence on unstructured resources. 3215-3241
- Ijazul Haq, Yingjie Zhang, Intakhab Alam Qadri:
POS tagging of low-resource Pashto language: annotated corpus and BERT-based model. 3243-3265
- Kyaw Htet Aung, Mark Dras:
Myanmar XNLI: building a dataset and exploring low-resource approaches to natural language inference with Myanmar. 3267-3310
- Sham K. Berhane, Simon M. Beyene, Yoel G. Teklit, Ibrahim A. Ibrahim, Natnael A. Teklu, Sirak A. Bereketeab, Fitsum Gaim:
Towards neural named entity recognition system in Tigrinya with large-scale dataset. 3311-3339
- Anqi Zhou, Qiuhong Li, Chao Wu:
The Mandarin Chinese speech database: a corpus of 18,820 auditory neutral nonsense sentences. 3341-3352
- Mu You, Jing Zhang, Derek F. Wong, Kaixin Lan:
Umplc: the first longitudinal learner corpus of Portuguese. 3353-3372
- Thanatkorn Chuenbanluesuk, Voramate Plodprong, Weerasak Karoon, Kotchakorn Rueangsri, Suthasinee Pojam, Thitirat Siriborvornratanakul:
Using contrastive language-image pre-training for Thai recipe recommendation. 3373-3383
- Gülsen Eryigit, Anna Golynskaia, Elif Sayar, Tolgahan Türker:
Error annotation: a review and faceted taxonomy. 3385-3409
