default search action
Carolina Scarton
Person information
- affiliation: University of Sheffield, Department of Computer Science
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j8]Matheus V. V. Berto, Breno L. Freitas, Carolina Scarton, João A. Machado-Neto, Tiago A. Almeida:
Accelerating discoveries in medicine using distributed vector representations of words. Expert Syst. Appl. 250: 123566 (2024) - [j7]Sidney Evaldo Leal, Magali Sanches Duran, Carolina Evaristo Scarton, Nathan Siegle Hartmann, Sandra Maria Aluísio:
NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese. Lang. Resour. Evaluation 58(1): 73-110 (2024) - [c73]Zhihao Zhang, Tomas Goldsack, Carolina Scarton, Chenghua Lin:
ATLAS: Improving Lay Summarisation with Attribute-based Control. ACL (Short Papers) 2024: 337-345 - [c72]Wei He, Marco Idiart, Carolina Scarton, Aline Villavicencio:
Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss. ACL (Findings) 2024: 12473-12485 - [c71]Tomas Goldsack, Carolina Scarton, Matthew Shardlow, Chenghua Lin:
Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of Biomedical Research Articles. BioNLP@ACL 2024: 122-131 - [c70]Yue Li, Carolina Scarton:
Can We Identify Stance without Target Arguments? A Study for Rumour Stance Classification. LREC/COLING 2024: 2844-2851 - [c69]Yida Mu, Ben P. Wu, William Thorne, Ambrose Robinson, Nikolaos Aletras, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science. LREC/COLING 2024: 12074-12086 - [c68]Sebastian T. Vincent, Rowanne Sumner, Alice Dowek, Charlotte Prescott, Emily Preston, Chris Bayliss, Chris Oakley, Carolina Scarton:
Reference-less Analysis of Context Specificity in Translation with Personalised Language Models. LREC/COLING 2024: 13769-13784 - [c67]Jake Vasilakes, Zhixue Zhao, Michal Gregor, Ivan Vykopal, Martin Hyben, Carolina Scarton:
ExU: AI Models for Examining Multilingual Disinformation Narratives and Understanding their Spread. EAMT (2) 2024: 39-40 - [c66]Brendan Spillane, Carolina Scarton, Róbert Móro, Petar Ivanov, Andrey Tagarev, Jakub Simko, Ibrahim Abu Farha, Gary Munnelly, Filip Uhlárik, Freddy Heppell:
Multilinguality in the VIGILANT project. EAMT (2) 2024: 41-42 - [c65]Sebastian T. Vincent, Charlotte Prescott, Chris Bayliss, Chris Oakley, Carolina Scarton:
A Case Study on Contextual Machine Translation in a Professional Scenario of Subtitling. EAMT (1) 2024: 561-572 - [e5]Carolina Scarton, Charlotte Prescott, Chris Bayliss, Chris Oakley, Joanna Wright, Stuart Wrigley, Xingyi Song, Edward Gow-Smith, Rachel Bawden, Víctor M. Sánchez-Cartagena, Patrick Cadwell, Ekaterina Lapshinova-Koltunski, Vera Cabarrão, Konstantinos Chatzitheodorou, Mary Nurminen, Diptesh Kanojia, Helena Moniz:
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), EAMT 2024, Sheffield, UK, June 24-27, 2024. European Association for Machine Translation (EAMT) 2024, ISBN 978-1-0686907-0-9 [contents] - [e4]Carolina Scarton, Charlotte Prescott, Chris Bayliss, Chris Oakley, Joanna Wright, Stuart Wrigley, Xingyi Song, Edward Gow-Smith, Mikel Forcada, Helena L. Moniz:
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), EAMT 2024, Sheffield, UK, June 24-27, 2024. European Association for Machine Translation (EAMT) 2024, ISBN 978-1-0686907-1-6 [contents] - [i42]Edward Gow-Smith, Dylan Phelps, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio:
Word Boundary Information Isn't Useful for Encoder Language Models. CoRR abs/2401.07923 (2024) - [i41]Zhihao Zhang, Tomas Goldsack, Carolina Scarton, Chenghua Lin:
ATLAS: Improving Lay Summarisation with Attribute-based Control. CoRR abs/2406.05625 (2024) - [i40]João A. Leite, Olesya Razuvayevskaya, Kalina Bontcheva, Carolina Scarton:
EUvsDisinfo: a Dataset for Multilingual Detection of Pro-Kremlin Disinformation in News Articles. CoRR abs/2406.12614 (2024) - [i39]Wei He, Marco Idiart, Carolina Scarton, Aline Villavicencio:
Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss. CoRR abs/2406.15175 (2024) - [i38]Jake Vasilakes, Zhixue Zhao, Ivan Vykopal, Michal Gregor, Martin Hyben, Carolina Scarton:
ExU: AI Models for Examining Multilingual Disinformation Narratives and Understanding their Spread. CoRR abs/2406.15443 (2024) - [i37]Sebastian T. Vincent, Charlotte Prescott, Chris Bayliss, Chris Oakley, Carolina Scarton:
A Case Study on Contextual Machine Translation in a Professional Scenario of Subtitling. CoRR abs/2407.00108 (2024) - [i36]Tomas Goldsack, Carolina Scarton, Matthew Shardlow, Chenghua Lin:
Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of Biomedical Research Articles. CoRR abs/2408.08566 (2024) - 2023
- [j6]Iknoor Singh, Carolina Scarton, Kalina Bontcheva:
UTDRM: unsupervised method for training debunked-narrative retrieval models. EPJ Data Sci. 12(1): 59 (2023) - [c64]Sebastian T. Vincent, Robert Flynn, Carolina Scarton:
MTCue: Learning Zero-Shot Control of Extra-Textual Attributes by Leveraging Unstructured Context in Neural Machine Translation. ACL (Findings) 2023: 8210-8226 - [c63]Tomas Goldsack, Zheheng Luo, Qianqian Xie, Carolina Scarton, Matthew Shardlow, Sophia Ananiadou, Chenghua Lin:
BioLaySumm 2023 Shared Task: Lay Summarisation of Biomedical Research Articles. BioNLP@ACL 2023: 468-477 - [c62]Tomas Goldsack, Zhihao Zhang, Chenghua Lin, Carolina Scarton:
Domain-Driven and Discourse-Guided Scientific Summarisation. ECIR (1) 2023: 361-376 - [c61]Ben Wu, Yue Li, Yida Mu, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
Don't waste a single annotation: improving single-label classifiers through soft labels. EMNLP (Findings) 2023: 5347-5355 - [c60]Freddy Heppell, Kalina Bontcheva, Carolina Scarton:
Analysing State-Backed Propaganda Websites: a New Dataset and Linguistic Study. EMNLP 2023: 5729-5741 - [c59]Tomas Goldsack, Zhihao Zhang, Chen Tang, Carolina Scarton, Chenghua Lin:
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs. EMNLP 2023: 8016-8032 - [c58]Yida Mu, Mali Jin, Charlie Grimshaw, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
VaxxHesitancy: A Dataset for Studying Hesitancy towards COVID-19 Vaccination on Twitter. ICWSM 2023: 1052-1062 - [c57]Ye Jiang, Xingyi Song, Carolina Scarton, Iknoor Singh, Ahmet Aker, Kalina Bontcheva:
Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of the COVID-19 Infodemic. RANLP 2023: 556-567 - [c56]João Augusto Leite, Carolina Scarton, Diego F. Silva:
Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks. RANLP 2023: 631-640 - [c55]Yue Li, Carolina Scarton, Xingyi Song, Kalina Bontcheva:
Classifying COVID-19 Vaccine Narratives. RANLP 2023: 648-657 - [c54]Ben Wu, Olesya Razuvayevskaya, Freddy Heppell, João Augusto Leite, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
SheffieldVeraAI at SemEval-2023 Task 3: Mono and Multilingual Approaches for News Genre, Topic and Persuasion Technique Classification. SemEval@ACL 2023: 1995-2008 - [e3]Mary Nurminen, Judith Brenner, Maarit Koponen, Sirkku Latomaa, Mikhail Mikhailov, Frederike Schierl, Tharindu Ranasinghe, Eva Vanmassenhove, Sergi Alvarez Vidal, Nora Aranberri, Mara Nunziatini, Carla Parra Escartín, Mikel L. Forcada, Maja Popovic, Carolina Scarton, Helena Moniz:
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, EAMT 2023, Tampere, Finland, 12-15 June 2023. European Association for Machine Translation 2023, ISBN 978-952-03-2947-1 [contents] - [i35]Yida Mu, Mali Jin, Charlie Grimshaw, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter. CoRR abs/2301.06660 (2023) - [i34]Ben Wu, Olesya Razuvayevskaya, Freddy Heppell, João Augusto Leite, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
Team SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification. CoRR abs/2303.09421 (2023) - [i33]Yue Li, Carolina Scarton:
Evaluating the Role of Target Arguments in Rumour Stance Classification. CoRR abs/2303.12665 (2023) - [i32]Sebastian T. Vincent, Rowanne Sumner, Alice Dowek, Charlotte Blundell, Emily Preston, Chris Bayliss, Chris Oakley, Carolina Scarton:
Personalised Language Modelling of Screen Characters Using Rich Metadata Annotations. CoRR abs/2303.16618 (2023) - [i31]Yida Mu, Ye Jiang, Freddy Heppell, Iknoor Singh, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
A Large-Scale Comparative Study of Accurate COVID-19 Information versus Misinformation. CoRR abs/2304.04811 (2023) - [i30]Yida Mu, Ben P. Wu, William Thorne, Ambrose Robinson, Nikolaos Aletras, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science. CoRR abs/2305.14310 (2023) - [i29]Sebastian T. Vincent, Robert Flynn, Carolina Scarton:
MTCue: Learning Zero-Shot Control of Extra-Textual Attributes by Leveraging Unstructured Context in Neural Machine Translation. CoRR abs/2305.15904 (2023) - [i28]João Augusto Leite, Carolina Scarton, Diego F. Silva:
Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks. CoRR abs/2307.16609 (2023) - [i27]Iknoor Singh, Carolina Scarton, Xingyi Song, Kalina Bontcheva:
Finding Already Debunked Narratives via Multistage Retrieval: Enabling Cross-Lingual, Cross-Dataset and Zero-Shot Learning. CoRR abs/2308.05680 (2023) - [i26]Olesya Razuvayevskaya, Ben Wu, João Augusto Leite, Freddy Heppell, Ivan Srba, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification. CoRR abs/2308.07282 (2023) - [i25]João Augusto Leite, Olesya Razuvayevskaya, Kalina Bontcheva, Carolina Scarton:
Detecting Misinformation with LLM-Predicted Credibility Signals and Weak Supervision. CoRR abs/2309.07601 (2023) - [i24]Tomas Goldsack, Zheheng Luo, Qianqian Xie, Carolina Scarton, Matthew Shardlow, Sophia Ananiadou, Chenghua Lin:
Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles. CoRR abs/2309.17332 (2023) - [i23]Freddy Heppell, Kalina Bontcheva, Carolina Scarton:
Analysing State-Backed Propaganda Websites: a New Dataset and Linguistic Study. CoRR abs/2310.14032 (2023) - [i22]Tomas Goldsack, Zhihao Zhang, Chen Tang, Carolina Scarton, Chenghua Lin:
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs. CoRR abs/2310.15702 (2023) - [i21]Ben Wu, Yue Li, Yida Mu, Carolina Scarton, Kalina Bontcheva, Xingyi Song:
Don't Waste a Single Annotation: Improving Single-Label Classifiers Through Soft Labels. CoRR abs/2311.05265 (2023) - 2022
- [c53]Sebastian T. Vincent, Loïc Barrault, Carolina Scarton:
Controlling Extra-Textual Attributes about Dialogue Participants: A Case Study of English-to-Polish Neural Machine Translation. EAMT 2022: 121-130 - [c52]Tomas Goldsack, Zhihao Zhang, Chenghua Lin, Carolina Scarton:
Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature. EMNLP 2022: 10589-10604 - [c51]Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio:
Improving Tokenisation by Alternative Treatment of Spaces. EMNLP 2022: 11430-11443 - [c50]Sebastian T. Vincent, Loïc Barrault, Carolina Scarton:
Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022. IWSLT@ACL 2022: 341-350 - [c49]Dylan Phelps, Xuan-Rui Fan, Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio:
Sample Efficient Approaches for Idiomaticity Detection. MWE@LREC2022 2022: 105-111 - [c48]Harish Tayyar Madabushi, Edward Gow-Smith, Marcos García, Carolina Scarton, Marco Idiart, Aline Villavicencio:
SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. SemEval@NAACL 2022: 107-121 - [c47]Iknoor Singh, Yue Li, Melissa Thong, Carolina Scarton:
GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity. SemEval@NAACL 2022: 1121-1128 - [c46]Iknoor Singh, Kalina Bontcheva, Xingyi Song, Carolina Scarton:
Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation. SocInfo 2022: 128-143 - [e2]Helena Moniz, Lieve Macken, Andrew Rufener, Loïc Barrault, Marta R. Costa-jussà, Christophe Declercq, Maarit Koponen, Ellie Kemp, Spyridon Pilos, Mikel L. Forcada, Carolina Scarton, Joachim Van den Bogaert, Joke Daems, Arda Tezcan, Bram Vanroy, Margot Fonteyne:
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, EAMT 2022, Ghent, Belgium, June 1-3, 2022. European Association for Machine Translation 2022, ISBN 9789464597622 [contents] - [e1]Vládia Pinheiro, Pablo Gamallo, Raquel Amaro, Carolina Scarton, Fernando Batista, Diego Furtado Silva, Catarina Magro, Hugo Pinto:
Computational Processing of the Portuguese Language - 15th International Conference, PROPOR 2022, Fortaleza, Brazil, March 21-23, 2022, Proceedings. Lecture Notes in Computer Science 13208, Springer 2022, ISBN 978-3-030-98304-8 [contents] - [i20]Sidney Evaldo Leal, Magali Sanches Duran, Carolina Evaristo Scarton, Nathan Siegle Hartmann, Sandra Maria Aluísio:
NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese. CoRR abs/2201.03445 (2022) - [i19]Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio:
Improving Tokenisation by Alternative Treatment of Spaces. CoRR abs/2204.04058 (2022) - [i18]Harish Tayyar Madabushi, Edward Gow-Smith, Marcos García, Carolina Scarton, Marco Idiart, Aline Villavicencio:
SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. CoRR abs/2204.10050 (2022) - [i17]Sebastian T. Vincent, Loïc Barrault, Carolina Scarton:
Controlling Extra-Textual Attributes about Dialogue Participants - A Case Study of English-to-Polish Neural Machine Translation. CoRR abs/2205.04747 (2022) - [i16]Sebastian T. Vincent, Loïc Barrault, Carolina Scarton:
Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022. CoRR abs/2205.05990 (2022) - [i15]Dylan Phelps, Xuan-Rui Fan, Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio:
Sample Efficient Approaches for Idiomaticity Detection. CoRR abs/2205.11306 (2022) - [i14]Iknoor Singh, Yue Li, Melissa Thong, Carolina Scarton:
GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity. CoRR abs/2205.15812 (2022) - [i13]Yue Li, Carolina Scarton, Xingyi Song, Kalina Bontcheva:
Classifying COVID-19 vaccine narratives. CoRR abs/2207.08522 (2022) - [i12]Tomas Goldsack, Zhihao Zhang, Chenghua Lin, Carolina Scarton:
Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature. CoRR abs/2210.09932 (2022) - [i11]Iknoor Singh, Kalina Bontcheva, Xingyi Song, Carolina Scarton:
Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation. CoRR abs/2212.07457 (2022) - 2021
- [j5]Fernando Alva-Manchego, Carolina Scarton, Lucia Specia:
The (Un)Suitability of Automatic Evaluation Metrics for Text Simplification. Comput. Linguistics 47(4): 861-889 (2021) - [j4]Yelena Mejova, Marinella Petrocchi, Carolina Scarton:
Special Issue on Disinformation, Hoaxes and Propaganda within Online Social Networks and Media. Online Soc. Networks Media 23: 100132 (2021) - [c45]Marcos García, Tiago Kramer Vieira, Carolina Scarton, Marco Idiart, Aline Villavicencio:
Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels. ACL/IJCNLP (1) 2021: 2730-2741 - [c44]Marcos García, Tiago Kramer Vieira, Carolina Scarton, Marco Idiart, Aline Villavicencio:
Probing for idiomaticity in vector space models. EACL 2021: 3551-3564 - [c43]Harish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton, Aline Villavicencio:
AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models. EMNLP (Findings) 2021: 3464-3477 - [c42]Carolina Scarton, Yue Li:
Cross-lingual Rumour Stance Classification: a First Study with BERT and Machine Translation. TTO 2021: 50-59 - [i10]Iknoor Singh, Carolina Scarton, Kalina Bontcheva:
Multistage BiCross Encoder: Team GATE Entry for MLIA Multilingual Semantic Search Task 2. CoRR abs/2101.03013 (2021) - [i9]Ye Jiang, Xingyi Song, Carolina Scarton, Ahmet Aker, Kalina Bontcheva:
Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of COVID-19 Infodemic. CoRR abs/2106.11702 (2021) - [i8]Iknoor Singh, Kalina Bontcheva, Carolina Scarton:
The False COVID-19 Narratives That Keep Being Debunked: A Spatiotemporal Analysis. CoRR abs/2107.12303 (2021) - [i7]Harish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton, Aline Villavicencio:
AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models. CoRR abs/2109.04413 (2021) - 2020
- [j3]Fernando Alva-Manchego, Carolina Scarton, Lucia Specia:
Data-Driven Sentence Simplification: Survey and Benchmark. Comput. Linguistics 46(1): 135-187 (2020) - [j2]Carolina Scarton:
Horacio Saggion, Automatic Text Simplification. Synthesis lectures on human language technologies, April 2017. Nat. Lang. Eng. 26(4): 489-492 (2020) - [c41]Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia:
ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations. ACL 2020: 4668-4679 - [c40]Carolina Scarton, Pranava Madhyastha, Lucia Specia:
Deciding When, How and for Whom to Simplify. ECAI 2020: 2172-2179 - [c39]João Augusto Leite, Diego F. Silva, Kalina Bontcheva, Carolina Scarton:
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis. AACL/IJCNLP 2020: 914-924 - [c38]Carolina Scarton, Diego F. Silva, Kalina Bontcheva:
Measuring What Counts: The Case of Rumour Stance Classification. AACL/IJCNLP 2020: 925-932 - [c37]Roney L. S. Santos, Gabriela Wick-Pedro, Sidney Evaldo Leal, Oto A. Vale, Thiago A. S. Pardo, Kalina Bontcheva, Carolina Scarton:
Measuring the Impact of Readability Features in Fake News Detection. LREC 2020: 1404-1413 - [c36]Gabriela Wick-Pedro, Roney L. S. Santos, Oto A. Vale, Thiago A. S. Pardo, Kalina Bontcheva, Carolina Scarton:
Linguistic Analysis Model for Monitoring User Reaction on Satirical News for Brazilian Portuguese. PROPOR 2020: 313-320 - [i6]Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia:
ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations. CoRR abs/2005.00481 (2020) - [i5]Carolina Scarton, Diego F. Silva, Kalina Bontcheva:
Measuring What Counts: The case of Rumour Stance Classification. CoRR abs/2010.04532 (2020) - [i4]João Augusto Leite, Diego F. Silva, Kalina Bontcheva, Carolina Scarton:
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis. CoRR abs/2010.04543 (2020)
2010 – 2019
- 2019
- [c35]Fernando Alva-Manchego, Carolina Scarton, Lucia Specia:
Cross-Sentence Transformations in Text Simplification. WNLP@ACL 2019: 181-184 - [c34]Fernando Alva-Manchego, Louis Martin, Carolina Scarton, Lucia Specia:
EASSE: Easier Automatic Sentence Simplification Evaluation. EMNLP/IJCNLP (3) 2019: 49-54 - [c33]Carolina Scarton, Mikel L. Forcada, Miquel Esplà-Gomis, Lucia Specia:
Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality. IWSLT 2019 - [i3]Fernando Alva-Manchego, Louis Martin, Carolina Scarton, Lucia Specia:
EASSE: Easier Automatic Sentence Simplification Evaluation. CoRR abs/1908.04567 (2019) - [i2]Carolina Scarton, Mikel L. Forcada, Miquel Esplà-Gomis, Lucia Specia:
Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality. CoRR abs/1910.06204 (2019) - 2018
- [b2]Lucia Specia, Carolina Scarton, Gustavo Henrique Paetzold:
Quality Estimation for Machine Translation. Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers 2018, ISBN 978-3-031-01040-8 - [c32]Carolina Scarton, Lucia Specia:
Learning Simplifications for Specific Target Audiences. ACL (2) 2018: 712-718 - [c31]Carolina Scarton, Gustavo Paetzold, Lucia Specia:
Text Simplification from Professionally Produced Corpora. LREC 2018 - [c30]Carolina Scarton, Gustavo Paetzold, Lucia Specia:
SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain. LREC 2018 - [c29]Mikel L. Forcada, Carolina Scarton, Lucia Specia, Barry Haddow, Alexandra Birch:
Exploring gap filling as a cheaper alternative to reading comprehension questionnaires when evaluating machine translation for gisting. WMT 2018: 192-203 - [c28]Chiraag Lala, Pranava Swaroop Madhyastha, Carolina Scarton, Lucia Specia:
Sheffield Submissions for WMT18 Multimodal Translation Shared Task. WMT (shared task) 2018: 624-631 - [c27]Julia Ive, Carolina Scarton, Frédéric Blain, Lucia Specia:
Sheffield Submissions for the WMT18 Quality Estimation Shared Task. WMT (shared task) 2018: 794-800 - [i1]Mikel L. Forcada, Carolina Scarton, Lucia Specia, Barry Haddow, Alexandra Birch:
Exploring Gap Filling as a Cheaper Alternative to Reading Comprehension Questionnaires when Evaluating Machine Translation for Gisting. CoRR abs/1809.00315 (2018) - 2017
- [c26]Yvette Graham, Qingsong Ma, Timothy Baldwin, Qun Liu, Carla Parra Escartín, Carolina Scarton:
Improving Evaluation of Document-level Machine Translation Quality Estimation. EACL (2) 2017: 356-361 - [c25]Carolina Scarton, Alessio Palmero Aprosio, Sara Tonelli, Tamara Martín-Wanton, Lucia Specia:
MUSST: A Multilingual Syntactic Simplification Tool. IJCNLP (System Demonstrations) 2017: 25-28 - [c24]Fernando Alva-Manchego, Joachim Bingel, Gustavo Paetzold, Carolina Scarton, Lucia Specia:
Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs. IJCNLP(1) 2017: 295-305 - [c23]Frédéric Blain, Carolina Scarton, Lucia Specia:
Bilexical Embeddings for Quality Estimation. WMT 2017: 545-550 - 2016
- [b1]Carolina Scarton:
Document-level machine translation quality estimation. University of Sheffield, UK, 2016 - [c22]Carolina Scarton, Gustavo Paetzold, Lucia Specia:
Quality Estimation for Language Output Applications. COLING (Tutorials) 2016: 14-17 - [c21]Carolina Scarton, Lucia Specia:
A Reading Comprehension Corpus for Machine Translation Evaluation. LREC 2016 - [c20]Sandra M. Aluísio, Andre Cunha, Carolina Scarton:
Evaluating Progression of Alzheimer's Disease by Regression and Classification Methods in a Narrative Language Test in Portuguese. PROPOR 2016: 109-114 - [c19]Liling Tan, Carolina Scarton, Lucia Specia, Josef van Genabith:
SAARSHEFF at SemEval-2016 Task 1: Semantic Textual Similarity with Machine Translation Evaluation Metrics and (eXtreme) Boosted Tree Ensembles. SemEval@NAACL-HLT 2016: 628-633 - [c18]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri:
Findings of the 2016 Conference on Machine Translation. WMT 2016: 131-198 - [c17]Carolina Scarton, Daniel Beck, Kashif Shah, Karin Sim Smith, Lucia Specia:
Word embeddings and discourse information for Quality Estimation. WMT 2016: 831-837 - 2015
- [c16]Lucia Specia, Gustavo Paetzold, Carolina Scarton:
Multi-level Translation Quality Prediction with QuEst++. ACL (System Demonstrations) 2015: 115-120 - [c15]Carolina Scarton, Marcos Zampieri, Mihaela Vela, Josef van Genabith, Lucia Specia:
Searching for Context: a Study on Document-Level Labels for Translation Quality Estimation. EAMT 2015 - [c14]Carolina Scarton:
Discourse and Document-level Information for Evaluating Language Output Tasks. HLT-NAACL 2015: 118-125 - [c13]Liling Tan, Carolina Scarton, Lucia Specia, Josef van Genabith:
USAAR-SHEFFIELD: Semantic Textual Similarity with Deep Regression and Machine Translation Evaluation Metrics. SemEval@NAACL-HLT 2015: 85-89 - [c12]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Barry Haddow, Matthias Huck, Chris Hokamp, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Carolina Scarton, Lucia Specia, Marco Turchi:
Findings of the 2015 Workshop on Statistical Machine Translation. WMT@EMNLP 2015: 1-46 - [c11]Carolina Scarton, Liling Tan, Lucia Specia:
USHEF and USAAR-USHEF participation in the WMT15 QE shared task. WMT@EMNLP 2015: 336-341 - 2014
- [c10]Carolina Scarton, Lin Sun, Karin Kipper Schuler, Magali Sanches Duran, Martha Palmer, Anna Korhonen:
Verb Clustering for Brazilian Portuguese. CICLing (1) 2014: 25-39 - [c9]Carolina Scarton, Lucia Specia:
Document-level translation quality estimation: exploring discourse and pseudo-references. EAMT 2014: 101-108 - [c8]Carolina Scarton, Magali Sanches Duran, Sandra Maria Aluísio:
Using Cross-Linguistic Knowledge to Build VerbNet-Style Lexicons: Results for a (Brazilian) Portuguese VerbNet. PROPOR 2014: 149-160 - [c7]Carolina Scarton, Lucia Specia:
Exploring Consensus in Machine Translation for Quality Estimation. WMT@ACL 2014: 342-347 - 2013
- [c6]Magali Sanches Duran, Carolina Evaristo Scarton, Sandra Maria Aluísio, Carlos Ramisch:
Identifying Pronominal Verbs: Towards Automatic Disambiguation of the Clitic 'se' in Portuguese. MWE@NAACL-HLT 2013: 93-100 - 2011
- [c5]Maria José Bocorny Finatto, Carolina Evaristo Scarton, Amanda Rocha, Sandra M. Aluísio:
Características do jornalismo popular: avaliação da inteligibilidade e auxílio à descrição do gênero (Characteristics of Popular News: the Evaluation of Intelligibility and Support to the Genre Description) [in Portuguese]. STIL 2011 - [c4]Bianca Franco Pasqualini, Carolina Evaristo Scarton, Maria José Bocorny Finatto:
Comparando Avaliações de Inteligibilidade Textual entre Originais e Traduções de Textos Literários (Comparing Textual Intelligibility Evaluations among Literary Source Texts and their Translations) [in Portuguese]. STIL 2011 - [c3]Carolina Evaristo Scarton:
VerbNet.Br: construção semiautomática de um léxico computacional de verbos para o português do Brasil (VerbNet.Br: semiautomatic construction of a computational verb lexicon for Brazilian Portuguese) [in Portuguese]. STIL 2011 - 2010
- [j1]Carolina Evaristo Scarton, Sandra Maria Aluísio:
Análise da Inteligibilidade de textos via ferramentas de Processamento de Língua Natural: adaptando as métricas do Coh-Metrix para o Português. Linguamática 2(1): 45-61 (2010) - [c2]Carolina Scarton, Caroline Gasperin, Sandra M. Aluísio:
Revisiting the Readability Assessment of Texts in Portuguese. IBERAMIA 2010: 306-315 - [c1]Carolina Scarton, Matheus de Oliveira, Arnaldo Cândido Júnior, Caroline Gasperin, Sandra M. Aluísio:
SIMPLIFICA: a tool for authoring simplified texts in Brazilian Portuguese guided by readability assessments. NAACL (Demos) 2010: 41-44
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-05 22:03 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint