Showing 1–14 of 14 results for author: Briakou, E

  1. arXiv:2407.10456  [pdf, other]

    cs.CL

    Don't Throw Away Data: Better Sequence Knowledge Distillation

    Authors: Jun Wang, Eleftheria Briakou, Hamid Dadkhahi, Rishabh Agarwal, Colin Cherry, Trevor Cohn

    Abstract: A critical component in knowledge distillation is the means of coupling the teacher and student. The predominant sequence knowledge distillation method involves supervised learning of the student against teacher-decoded outputs, and is exemplified by the current state of the art, which incorporates minimum Bayes risk (MBR) decoding. In this paper we seek to integrate MBR more tightly in distillati…

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2312.01582  [pdf, other]

    cs.CL

    Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences

    Authors: Eleftheria Briakou, Navita Goyal, Marine Carpuat

    Abstract: Explainable NLP techniques primarily explain by answering "Which tokens in the input are responsible for this prediction?". We argue that for NLP models that make predictions by comparing two input texts, it is more useful to explain by answering "What differences between the two inputs explain this prediction?". We introduce a technique to generate contrastive highlights that explain the predic…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023

  3. arXiv:2311.09828  [pdf, other]

    cs.CL

    AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages

    Authors: Jiayi Wang, David Ifeoluwa Adelani, Sweta Agrawal, Marek Masiak, Ricardo Rei, Eleftheria Briakou, Marine Carpuat, Xuanli He, Sofia Bourhim, Andiswa Bukula, Muhidin Mohamed, Temitayo Olatoye, Tosin Adewumi, Hamam Mokayed, Christine Mwase, Wangui Kimotho, Foutse Yuehgoh, Anuoluwapo Aremu, Jessica Ojo, Shamsuddeen Hassan Muhammad, Salomey Osei, Abdul-Hakeem Omotayo, Chiamaka Chukwuneke, Perez Ogayo, Oumaima Hourrane, et al. (33 additional authors not shown)

    Abstract: Despite the recent progress on scaling multilingual machine translation (MT) to several under-resourced African languages, accurately measuring this progress remains challenging, since evaluation is often performed on n-gram matching metrics such as BLEU, which typically show a weaker correlation with human judgments. Learned metrics such as COMET have higher correlation; however, the lack of eval…

    Submitted 23 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted by NAACL 2024

  4. arXiv:2305.14331  [pdf, other]

    cs.CL cs.AI

    What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems

    Authors: Navita Goyal, Eleftheria Briakou, Amanda Liu, Connor Baumler, Claire Bonial, Jeffrey Micher, Clare R. Voss, Marine Carpuat, Hal Daumé III

    Abstract: NLP systems have shown impressive performance at answering questions by retrieving relevant context. However, with the increasingly large models, it is impossible and often undesirable to constrain models' knowledge or reasoning to only the retrieved context. This leads to a mismatch between the information that the models access to derive the answer and the information that is available to the us…

    Submitted 25 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  5. arXiv:2305.10266  [pdf, other]

    cs.CL

    Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability

    Authors: Eleftheria Briakou, Colin Cherry, George Foster

    Abstract: Large, multilingual language models exhibit surprisingly good zero- or few-shot machine translation capabilities, despite having never seen the intentionally-included translation examples provided to typical neural translation systems. We investigate the role of incidental bilingualism -- the unintentional consumption of bilingual signals, including translation examples -- in explaining the transl…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

  6. arXiv:2301.07779  [pdf, other]

    cs.CL

    Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection

    Authors: Weijia Xu, Sweta Agrawal, Eleftheria Briakou, Marianna J. Martindale, Marine Carpuat

    Abstract: Neural sequence generation models are known to "hallucinate", by producing outputs that are unrelated to the source text. These hallucinations are potentially harmful, yet it remains unclear in what conditions they arise and how to mitigate their impact. In this work, we first identify internal model symptoms of hallucinations by analyzing the relative token contributions to the generation in cont…

    Submitted 24 February, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Accepted at TACL

  7. arXiv:2203.07643  [pdf, other]

    cs.CL

    Can Synthetic Translations Improve Bitext Quality?

    Authors: Eleftheria Briakou, Marine Carpuat

    Abstract: Synthetic translations have been used for a wide range of NLP tasks primarily as a means of data augmentation. This work explores, instead, how synthetic translations can be used to revise potentially imperfect reference translations in mined bitext. We find that synthetic samples can improve bitext quality without any additional bilingual supervision when they replace the originals based on a sem…

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  8. arXiv:2111.06787  [pdf, other]

    cs.CL

    BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation

    Authors: Eleftheria Briakou, Sida I. Wang, Luke Zettlemoyer, Marjan Ghazvininejad

    Abstract: Mined bitexts can contain imperfect translations that yield unreliable training signals for Neural Machine Translation (NMT). While filtering such pairs out is known to improve final model quality, we argue that it is suboptimal in low-resource conditions where even mined data can be limited. In our work, we propose instead, to refine the mined bitexts via automatic editing: given a sentence in a…

    Submitted 30 May, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

  9. arXiv:2110.10668  [pdf, other]

    cs.CL

    Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

    Authors: Eleftheria Briakou, Sweta Agrawal, Joel Tetreault, Marine Carpuat

    Abstract: While the field of style transfer (ST) has been growing rapidly, it has been hampered by a lack of standardized practices for automatic evaluation. In this paper, we evaluate leading ST automatic metrics on the oft-researched task of formality style transfer. Unlike previous evaluations, which focus solely on English, we expand our focus to Brazilian-Portuguese, French, and Italian, making this wo…

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021

  10. arXiv:2106.04747  [pdf, other]

    cs.CL

    A Review of Human Evaluation for Style Transfer

    Authors: Eleftheria Briakou, Sweta Agrawal, Ke Zhang, Joel Tetreault, Marine Carpuat

    Abstract: This paper reviews and summarizes human evaluation practices described in 97 style transfer papers with respect to three main evaluation aspects: style transfer, meaning preservation, and fluency. In principle, evaluations by human raters should be the most reliable. However, in style transfer papers, we find that protocols for human evaluations are often underspecified and not standardized, which…

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: GEM 2021

  11. arXiv:2105.15087  [pdf, other]

    cs.CL

    Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation

    Authors: Eleftheria Briakou, Marine Carpuat

    Abstract: While it has been shown that Neural Machine Translation (NMT) is highly sensitive to noisy parallel training samples, prior work treats all types of mismatches between source and target as noise. As a result, it remains unclear how samples that are mostly equivalent but contain a small number of semantically divergent tokens impact NMT training. To close this gap, we analyze the impact of differen…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  12. arXiv:2104.04108  [pdf, other]

    cs.CL cs.AI

    XFORMAL: A Benchmark for Multilingual Formality Style Transfer

    Authors: Eleftheria Briakou, Di Lu, Ke Zhang, Joel Tetreault

    Abstract: We take the first step towards multilingual style transfer by creating and releasing XFORMAL, a benchmark of multiple formal reformulations of informal text in Brazilian Portuguese, French, and Italian. Results on XFORMAL suggest that state-of-the-art style transfer approaches perform close to simple baselines, indicating that style transfer is even more challenging when moving multilingual.

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: NAACL 2021

  13. arXiv:2010.03662  [pdf, other]

    cs.CL

    Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank

    Authors: Eleftheria Briakou, Marine Carpuat

    Abstract: Detecting fine-grained differences in content conveyed in different languages matters for cross-lingual NLP and multilingual corpora analysis, but it is a challenging machine learning problem since annotation is expensive and hard to scale. This work improves the prediction and annotation of fine-grained semantic divergences. We introduce a training strategy for multilingual BERT models by learnin…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  14. arXiv:1904.05674  [pdf, other]

    cs.CL cs.LG

    Cross-topic distributional semantic representations via unsupervised mappings

    Authors: Eleftheria Briakou, Nikos Athanasiou, Alexandros Potamianos

    Abstract: In traditional Distributional Semantic Models (DSMs) the multiple senses of a polysemous word are conflated into a single vector space representation. In this work, we propose a DSM that learns multiple distributional representations of a word based on different topics. First, a separate DSM is trained for each topic and then each of the topic-based DSMs is aligned to a common vector space. Our un…

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: NAACL-HLT 2019