


Carolina Scarton
Person information

- affiliation: University of Sheffield, Department of Computer Science
2020 – today
- 2023
- [c53] Tomas Goldsack, Zhihao Zhang, Chenghua Lin, Carolina Scarton: Domain-Driven and Discourse-Guided Scientific Summarisation. ECIR (1) 2023: 361-376
- [i22] Yida Mu, Mali Jin, Charlie Grimshaw, Carolina Scarton, Kalina Bontcheva, Xingyi Song: VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter. CoRR abs/2301.06660 (2023)
- [i21] Ben Wu, Olesya Razuvayevskaya, Freddy Heppell, João Augusto Leite, Carolina Scarton, Kalina Bontcheva, Xingyi Song: Team SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification. CoRR abs/2303.09421 (2023)
- 2022
- [c52] Sebastian T. Vincent, Loïc Barrault, Carolina Scarton: Controlling Extra-Textual Attributes about Dialogue Participants: A Case Study of English-to-Polish Neural Machine Translation. EAMT 2022: 121-130
- [c51] Tomas Goldsack, Zhihao Zhang, Chenghua Lin, Carolina Scarton: Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature. EMNLP 2022: 10589-10604
- [c50] Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio: Improving Tokenisation by Alternative Treatment of Spaces. EMNLP 2022: 11430-11443
- [c49] Sebastian T. Vincent, Loïc Barrault, Carolina Scarton: Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022. IWSLT@ACL 2022: 341-350
- [c48] Harish Tayyar Madabushi, Edward Gow-Smith, Marcos García, Carolina Scarton, Marco Idiart, Aline Villavicencio: SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. SemEval@NAACL 2022: 107-121
- [c47] Iknoor Singh, Yue Li, Melissa Thong, Carolina Scarton: GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity. SemEval@NAACL 2022: 1121-1128
- [c46] Iknoor Singh, Kalina Bontcheva, Xingyi Song, Carolina Scarton: Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation. SocInfo 2022: 128-143
- [e2] Helena Moniz, Lieve Macken, Andrew Rufener, Loïc Barrault, Marta R. Costa-jussà, Christophe Declercq, Maarit Koponen, Ellie Kemp, Spyridon Pilos, Mikel L. Forcada, Carolina Scarton, Joachim Van den Bogaert, Joke Daems, Arda Tezcan, Bram Vanroy, Margot Fonteyne: Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, EAMT 2022, Ghent, Belgium, June 1-3, 2022. European Association for Machine Translation 2022, ISBN 9789464597622
- [e1] Vládia Pinheiro, Pablo Gamallo, Raquel Amaro, Carolina Scarton, Fernando Batista, Diego Furtado Silva, Catarina Magro, Hugo Pinto: Computational Processing of the Portuguese Language - 15th International Conference, PROPOR 2022, Fortaleza, Brazil, March 21-23, 2022, Proceedings. Lecture Notes in Computer Science 13208, Springer 2022, ISBN 978-3-030-98304-8
- [i20] Sidney Evaldo Leal, Magali Sanches Duran, Carolina Evaristo Scarton, Nathan Siegle Hartmann, Sandra Maria Aluísio: NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese. CoRR abs/2201.03445 (2022)
- [i19] Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio: Improving Tokenisation by Alternative Treatment of Spaces. CoRR abs/2204.04058 (2022)
- [i18] Harish Tayyar Madabushi, Edward Gow-Smith, Marcos García, Carolina Scarton, Marco Idiart, Aline Villavicencio: SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. CoRR abs/2204.10050 (2022)
- [i17] Sebastian T. Vincent, Loïc Barrault, Carolina Scarton: Controlling Extra-Textual Attributes about Dialogue Participants - A Case Study of English-to-Polish Neural Machine Translation. CoRR abs/2205.04747 (2022)
- [i16] Sebastian T. Vincent, Loïc Barrault, Carolina Scarton: Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022. CoRR abs/2205.05990 (2022)
- [i15] Dylan Phelps, Xuan-Rui Fan, Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio: Sample Efficient Approaches for Idiomaticity Detection. CoRR abs/2205.11306 (2022)
- [i14] Iknoor Singh, Yue Li, Melissa Thong, Carolina Scarton: GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity. CoRR abs/2205.15812 (2022)
- [i13] Yue Li, Carolina Scarton, Xingyi Song, Kalina Bontcheva: Classifying COVID-19 vaccine narratives. CoRR abs/2207.08522 (2022)
- [i12] Tomas Goldsack, Zhihao Zhang, Chenghua Lin, Carolina Scarton: Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature. CoRR abs/2210.09932 (2022)
- [i11] Iknoor Singh, Kalina Bontcheva, Xingyi Song, Carolina Scarton: Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation. CoRR abs/2212.07457 (2022)
- 2021
- [j5] Fernando Alva-Manchego, Carolina Scarton, Lucia Specia: The (Un)Suitability of Automatic Evaluation Metrics for Text Simplification. Comput. Linguistics 47(4): 861-889 (2021)
- [j4] Yelena Mejova, Marinella Petrocchi, Carolina Scarton: Special Issue on Disinformation, Hoaxes and Propaganda within Online Social Networks and Media. Online Soc. Networks Media 23: 100132 (2021)
- [c45] Marcos García, Tiago Kramer Vieira, Carolina Scarton, Marco Idiart, Aline Villavicencio: Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels. ACL/IJCNLP (1) 2021: 2730-2741
- [c44] Marcos García, Tiago Kramer Vieira, Carolina Scarton, Marco Idiart, Aline Villavicencio: Probing for idiomaticity in vector space models. EACL 2021: 3551-3564
- [c43] Harish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton, Aline Villavicencio: AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models. EMNLP (Findings) 2021: 3464-3477
- [c42] Carolina Scarton, Yue Li: Cross-lingual Rumour Stance Classification: a First Study with BERT and Machine Translation. TTO 2021: 50-59
- [i10] Iknoor Singh, Carolina Scarton, Kalina Bontcheva: Multistage BiCross Encoder: Team GATE Entry for MLIA Multilingual Semantic Search Task 2. CoRR abs/2101.03013 (2021)
- [i9] Ye Jiang, Xingyi Song, Carolina Scarton, Ahmet Aker, Kalina Bontcheva: Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of COVID-19 Infodemic. CoRR abs/2106.11702 (2021)
- [i8] Iknoor Singh, Kalina Bontcheva, Carolina Scarton: The False COVID-19 Narratives That Keep Being Debunked: A Spatiotemporal Analysis. CoRR abs/2107.12303 (2021)
- [i7] Harish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton, Aline Villavicencio: AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models. CoRR abs/2109.04413 (2021)
- 2020
- [j3] Fernando Alva-Manchego, Carolina Scarton, Lucia Specia: Data-Driven Sentence Simplification: Survey and Benchmark. Comput. Linguistics 46(1): 135-187 (2020)
- [j2] Carolina Scarton: Horacio Saggion, Automatic Text Simplification. Synthesis lectures on human language technologies, April 2017. Nat. Lang. Eng. 26(4): 489-492 (2020)
- [c41] Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia: ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations. ACL 2020: 4668-4679
- [c40] Carolina Scarton, Pranava Madhyastha, Lucia Specia: Deciding When, How and for Whom to Simplify. ECAI 2020: 2172-2179
- [c39] João Augusto Leite, Diego F. Silva, Kalina Bontcheva, Carolina Scarton: Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis. AACL/IJCNLP 2020: 914-924
- [c38] Carolina Scarton, Diego F. Silva, Kalina Bontcheva: Measuring What Counts: The Case of Rumour Stance Classification. AACL/IJCNLP 2020: 925-932
- [c37] Roney L. S. Santos, Gabriela Wick-Pedro, Sidney Evaldo Leal, Oto A. Vale, Thiago A. S. Pardo, Kalina Bontcheva, Carolina Scarton: Measuring the Impact of Readability Features in Fake News Detection. LREC 2020: 1404-1413
- [c36] Gabriela Wick-Pedro, Roney L. S. Santos, Oto A. Vale, Thiago A. S. Pardo, Kalina Bontcheva, Carolina Scarton: Linguistic Analysis Model for Monitoring User Reaction on Satirical News for Brazilian Portuguese. PROPOR 2020: 313-320
- [i6] Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia: ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations. CoRR abs/2005.00481 (2020)
- [i5] Carolina Scarton, Diego F. Silva, Kalina Bontcheva: Measuring What Counts: The case of Rumour Stance Classification. CoRR abs/2010.04532 (2020)
- [i4] João Augusto Leite, Diego F. Silva, Kalina Bontcheva, Carolina Scarton: Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis. CoRR abs/2010.04543 (2020)
2010 – 2019
- 2019
- [c35] Fernando Alva-Manchego, Carolina Scarton, Lucia Specia: Cross-Sentence Transformations in Text Simplification. WNLP@ACL 2019: 181-184
- [c34] Fernando Alva-Manchego, Louis Martin, Carolina Scarton, Lucia Specia: EASSE: Easier Automatic Sentence Simplification Evaluation. EMNLP/IJCNLP (3) 2019: 49-54
- [c33] Carolina Scarton, Mikel L. Forcada, Miquel Esplà-Gomis, Lucia Specia: Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality. IWSLT 2019
- [i3] Fernando Alva-Manchego, Louis Martin, Carolina Scarton, Lucia Specia: EASSE: Easier Automatic Sentence Simplification Evaluation. CoRR abs/1908.04567 (2019)
- [i2] Carolina Scarton, Mikel L. Forcada, Miquel Esplà-Gomis, Lucia Specia: Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality. CoRR abs/1910.06204 (2019)
- 2018
- [b2] Lucia Specia, Carolina Scarton, Gustavo Henrique Paetzold: Quality Estimation for Machine Translation. Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers 2018
- [c32] Carolina Scarton, Lucia Specia: Learning Simplifications for Specific Target Audiences. ACL (2) 2018: 712-718
- [c31] Carolina Scarton, Gustavo Paetzold, Lucia Specia: Text Simplification from Professionally Produced Corpora. LREC 2018
- [c30] Carolina Scarton, Gustavo Paetzold, Lucia Specia: SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain. LREC 2018
- [c29] Mikel L. Forcada, Carolina Scarton, Lucia Specia, Barry Haddow, Alexandra Birch: Exploring gap filling as a cheaper alternative to reading comprehension questionnaires when evaluating machine translation for gisting. WMT 2018: 192-203
- [c28] Chiraag Lala, Pranava Swaroop Madhyastha, Carolina Scarton, Lucia Specia: Sheffield Submissions for WMT18 Multimodal Translation Shared Task. WMT (shared task) 2018: 624-631
- [c27] Julia Ive, Carolina Scarton, Frédéric Blain, Lucia Specia: Sheffield Submissions for the WMT18 Quality Estimation Shared Task. WMT (shared task) 2018: 794-800
- [i1] Mikel L. Forcada, Carolina Scarton, Lucia Specia, Barry Haddow, Alexandra Birch: Exploring Gap Filling as a Cheaper Alternative to Reading Comprehension Questionnaires when Evaluating Machine Translation for Gisting. CoRR abs/1809.00315 (2018)
- 2017
- [c26] Yvette Graham, Qingsong Ma, Timothy Baldwin, Qun Liu, Carla Parra Escartín, Carolina Scarton: Improving Evaluation of Document-level Machine Translation Quality Estimation. EACL (2) 2017: 356-361
- [c25] Carolina Scarton, Alessio Palmero Aprosio, Sara Tonelli, Tamara Martín-Wanton, Lucia Specia: MUSST: A Multilingual Syntactic Simplification Tool. IJCNLP (System Demonstrations) 2017: 25-28
- [c24] Fernando Alva-Manchego, Joachim Bingel, Gustavo Paetzold, Carolina Scarton, Lucia Specia: Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs. IJCNLP (1) 2017: 295-305
- [c23] Frédéric Blain, Carolina Scarton, Lucia Specia: Bilexical Embeddings for Quality Estimation. WMT 2017: 545-550
- 2016
- [b1] Carolina Scarton: Document-level machine translation quality estimation. University of Sheffield, UK, 2016
- [c22] Carolina Scarton, Gustavo Paetzold, Lucia Specia: Quality Estimation for Language Output Applications. COLING (Tutorials) 2016: 14-17
- [c21] Carolina Scarton, Lucia Specia: A Reading Comprehension Corpus for Machine Translation Evaluation. LREC 2016
- [c20] Sandra M. Aluísio, Andre Cunha, Carolina Scarton: Evaluating Progression of Alzheimer's Disease by Regression and Classification Methods in a Narrative Language Test in Portuguese. PROPOR 2016: 109-114
- [c19] Liling Tan, Carolina Scarton, Lucia Specia, Josef van Genabith: SAARSHEFF at SemEval-2016 Task 1: Semantic Textual Similarity with Machine Translation Evaluation Metrics and (eXtreme) Boosted Tree Ensembles. SemEval@NAACL-HLT 2016: 628-633
- [c18] Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri: Findings of the 2016 Conference on Machine Translation. WMT 2016: 131-198
- [c17] Carolina Scarton, Daniel Beck, Kashif Shah, Karin Sim Smith, Lucia Specia: Word embeddings and discourse information for Quality Estimation. WMT 2016: 831-837
- 2015
- [c16] Lucia Specia, Gustavo Paetzold, Carolina Scarton: Multi-level Translation Quality Prediction with QuEst++. ACL (System Demonstrations) 2015: 115-120
- [c15] Carolina Scarton, Marcos Zampieri, Mihaela Vela, Josef van Genabith, Lucia Specia: Searching for Context: a Study on Document-Level Labels for Translation Quality Estimation. EAMT 2015
- [c14] Carolina Scarton: Discourse and Document-level Information for Evaluating Language Output Tasks. HLT-NAACL 2015: 118-125
- [c13] Liling Tan, Carolina Scarton, Lucia Specia, Josef van Genabith: USAAR-SHEFFIELD: Semantic Textual Similarity with Deep Regression and Machine Translation Evaluation Metrics. SemEval@NAACL-HLT 2015: 85-89
- [c12] Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Barry Haddow, Matthias Huck, Chris Hokamp, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Carolina Scarton, Lucia Specia, Marco Turchi: Findings of the 2015 Workshop on Statistical Machine Translation. WMT@EMNLP 2015: 1-46
- [c11] Carolina Scarton, Liling Tan, Lucia Specia: USHEF and USAAR-USHEF participation in the WMT15 QE shared task. WMT@EMNLP 2015: 336-341
- 2014
- [c10] Carolina Scarton, Lin Sun, Karin Kipper Schuler, Magali Sanches Duran, Martha Palmer, Anna Korhonen: Verb Clustering for Brazilian Portuguese. CICLing (1) 2014: 25-39
- [c9] Carolina Scarton, Lucia Specia: Document-level translation quality estimation: exploring discourse and pseudo-references. EAMT 2014: 101-108
- [c8] Carolina Scarton, Magali Sanches Duran, Sandra Maria Aluísio: Using Cross-Linguistic Knowledge to Build VerbNet-Style Lexicons: Results for a (Brazilian) Portuguese VerbNet. PROPOR 2014: 149-160
- [c7] Carolina Scarton, Lucia Specia: Exploring Consensus in Machine Translation for Quality Estimation. WMT@ACL 2014: 342-347
- 2013
- [c6] Magali Sanches Duran, Carolina Evaristo Scarton, Sandra Maria Aluísio, Carlos Ramisch: Identifying Pronominal Verbs: Towards Automatic Disambiguation of the Clitic 'se' in Portuguese. MWE@NAACL-HLT 2013: 93-100
- 2011
- [c5] Maria José Bocorny Finatto, Carolina Evaristo Scarton, Amanda Rocha, Sandra M. Aluísio: Características do jornalismo popular: avaliação da inteligibilidade e auxílio à descrição do gênero (Characteristics of Popular News: the Evaluation of Intelligibility and Support to the Genre Description) [in Portuguese]. STIL 2011
- [c4] Bianca Franco Pasqualini, Carolina Evaristo Scarton, Maria José Bocorny Finatto: Comparando Avaliações de Inteligibilidade Textual entre Originais e Traduções de Textos Literários (Comparing Textual Intelligibility Evaluations among Literary Source Texts and their Translations) [in Portuguese]. STIL 2011
- [c3] Carolina Evaristo Scarton: VerbNet.Br: construção semiautomática de um léxico computacional de verbos para o português do Brasil (VerbNet.Br: semiautomatic construction of a computational verb lexicon for Brazilian Portuguese) [in Portuguese]. STIL 2011
- 2010
- [j1] Carolina Evaristo Scarton, Sandra Maria Aluísio: Análise da Inteligibilidade de textos via ferramentas de Processamento de Língua Natural: adaptando as métricas do Coh-Metrix para o Português. Linguamática 2(1): 45-61 (2010)
- [c2] Carolina Scarton, Caroline Gasperin, Sandra M. Aluísio: Revisiting the Readability Assessment of Texts in Portuguese. IBERAMIA 2010: 306-315
- [c1] Carolina Scarton, Matheus de Oliveira, Arnaldo Cândido Júnior, Caroline Gasperin, Sandra M. Aluísio: SIMPLIFICA: a tool for authoring simplified texts in Brazilian Portuguese guided by readability assessments. NAACL (Demos) 2010: 41-44
last updated on 2023-03-22 23:54 CET by the dblp team
all metadata released as open data under CC0 1.0 license