Showing 1–41 of 41 results for author: Carpuat, M

  1. arXiv:2406.06608  [pdf, other]

    cs.CL cs.AI

    The Prompt Report: A Systematic Survey of Prompting Techniques

    Authors: Sander Schulhoff, Michael Ilie, Nishant Balepur, Konstantine Kahadze, Amanda Liu, Chenglei Si, Yinheng Li, Aayush Gupta, HyoJung Han, Sevien Schulhoff, Pranav Sandeep Dulepet, Saurav Vidyadhara, Dayeon Ki, Sweta Agrawal, Chau Pham, Gerson Kroiz, Feileen Li, Hudson Tao, Ashay Srivastava, Hevander Da Costa, Saloni Gupta, Megan L. Rogers, Inna Goncearenco, Giuseppe Sarli, Igor Galynker, et al. (6 additional authors not shown)

    Abstract: Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a p…

    Submitted 14 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2405.20512  [pdf, other]

    cs.CL cs.LG

    How Multilingual Are Large Language Models Fine-Tuned for Translation?

    Authors: Aquia Richburg, Marine Carpuat

    Abstract: A new paradigm for machine translation has recently emerged: fine-tuning large language models (LLM) on parallel text has been shown to outperform dedicated translation systems trained in a supervised fashion on much larger amounts of parallel data (Xu et al., 2024a; Alves et al., 2024). However, it remains unclear whether this paradigm can enable massively multilingual machine translation or whet…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.10260  [pdf, other]

    cs.CL cs.AI

    Keep It Private: Unsupervised Privatization of Online Text

    Authors: Calvin Bao, Marine Carpuat

    Abstract: Authorship obfuscation techniques hold the promise of helping people protect their privacy in online communications by automatically rewriting text to hide the identity of the original author. However, obfuscation has been evaluated in narrow settings in the NLP literature and has primarily been addressed with superficial edit operations that can lead to unnatural outputs. In this work, we introdu…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 17 pages, 6 figures

  4. arXiv:2404.11717  [pdf, other]

    cs.CL

    How often are errors in natural language reasoning due to paraphrastic variability?

    Authors: Neha Srikanth, Marine Carpuat, Rachel Rudinger

    Abstract: Large language models have been shown to behave inconsistently in response to meaning-preserving paraphrastic inputs. At the same time, researchers evaluate the knowledge and reasoning abilities of these models with test evaluations that do not disaggregate the effect of paraphrastic variability on performance. We propose a metric for evaluating the paraphrastic consistency of natural language rea…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: accepted to TACL 2024 (pre-MIT Press publication version)

  5. arXiv:2404.07851  [pdf, other]

    cs.CL cs.AI

    Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

    Authors: Dayeon Ki, Marine Carpuat

    Abstract: Machine Translation (MT) remains one of the last NLP tasks where large language models (LLMs) have not yet replaced dedicated supervised systems. This work exploits the complementary strengths of LLMs and supervised MT by guiding LLMs to automatically post-edit MT with external feedback on its quality, derived from Multidimensional Quality Metric (MQM) annotations. Working with LLaMA-2 models, we…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 21 pages, 8 figures

    Journal ref: NAACL 2024 Findings

  6. arXiv:2403.14402  [pdf, other]

    cs.SD cs.CL eess.AS

    XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception

    Authors: HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi, Changhan Wang

    Abstract: Speech recognition and translation systems perform poorly on noisy inputs, which are frequent in realistic environments. Augmenting these systems with visual signals has the potential to improve robustness to noise. However, audio-visual (AV) data is only available in limited amounts and for fewer languages than audio-only resources. To address this gap, we present XLAVS-R, a cross-lingual audio-v…

    Submitted 21 March, 2024; originally announced March 2024.

  7. arXiv:2312.10126  [pdf, other]

    cs.CL

    Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension

    Authors: Sweta Agrawal, Marine Carpuat

    Abstract: Automatic text simplification (TS) aims to automate the process of rewriting text to make it easier for people to read. A pre-requisite for TS to be useful is that it should convey information that is consistent with the meaning of the original text. However, current TS evaluation protocols assess system outputs for simplicity and meaning preservation without regard for the document context in whi…

    Submitted 28 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at TACL (a pre-MIT Press publication version)

  8. arXiv:2312.01582  [pdf, other]

    cs.CL

    Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences

    Authors: Eleftheria Briakou, Navita Goyal, Marine Carpuat

    Abstract: Explainable NLP techniques primarily explain by answering "Which tokens in the input are responsible for this prediction?". We argue that for NLP models that make predictions by comparing two input texts, it is more useful to explain by answering "What differences between the two inputs explain this prediction?". We introduce a technique to generate contrastive highlights that explain the predic…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023

  9. arXiv:2312.01308  [pdf, other]

    cs.CL

    Bridging Background Knowledge Gaps in Translation with Automatic Explicitation

    Authors: HyoJung Han, Jordan Lee Boyd-Graber, Marine Carpuat

    Abstract: Translations help people understand content written in another language. However, even correct literal translations do not fulfill that goal when people lack the necessary background to understand them. Professional translators incorporate explicitations to explain the missing context by considering cultural differences between source and target audiences. Despite its potential to help users, NLP…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023

  10. arXiv:2311.15507  [pdf, other]

    cs.CL cs.AI

    Improving Word Sense Disambiguation in Neural Machine Translation with Salient Document Context

    Authors: Elijah Rippeth, Marine Carpuat, Kevin Duh, Matt Post

    Abstract: Lexical ambiguity is a challenging and pervasive problem in machine translation (MT). We introduce a simple and scalable approach to resolve translation ambiguity by incorporating a small amount of extra-sentential context in neural MT. Our approach requires no sense annotation and no change to standard model architectures. Since actual document context is not available for the vast majority of…

    Submitted 26 November, 2023; originally announced November 2023.

  11. arXiv:2311.09828  [pdf, other]

    cs.CL

    AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages

    Authors: Jiayi Wang, David Ifeoluwa Adelani, Sweta Agrawal, Marek Masiak, Ricardo Rei, Eleftheria Briakou, Marine Carpuat, Xuanli He, Sofia Bourhim, Andiswa Bukula, Muhidin Mohamed, Temitayo Olatoye, Tosin Adewumi, Hamam Mokayed, Christine Mwase, Wangui Kimotho, Foutse Yuehgoh, Anuoluwapo Aremu, Jessica Ojo, Shamsuddeen Hassan Muhammad, Salomey Osei, Abdul-Hakeem Omotayo, Chiamaka Chukwuneke, Perez Ogayo, Oumaima Hourrane, et al. (33 additional authors not shown)

    Abstract: Despite the recent progress on scaling multilingual machine translation (MT) to several under-resourced African languages, accurately measuring this progress remains challenging, since evaluation is often performed on n-gram matching metrics such as BLEU, which typically show a weaker correlation with human judgments. Learned metrics such as COMET have higher correlation; however, the lack of eval…

    Submitted 23 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted by NAACL 2024

  12. arXiv:2310.16924  [pdf, other]

    cs.CL cs.HC

    Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors

    Authors: Nikita Mehandru, Sweta Agrawal, Yimin Xiao, Elaine C Khoong, Ge Gao, Marine Carpuat, Niloufar Salehi

    Abstract: A major challenge in the practical use of Machine Translation (MT) is that users lack guidance to make informed decisions about when to rely on outputs. Progress in quality estimation research provides techniques to automatically assess MT quality, but these techniques have primarily been evaluated in vitro by comparison against human judgments outside of a specific context of use. This paper eval…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  13. arXiv:2310.15055  [pdf, other]

    cs.CL cs.AI cs.HC

    Towards Conceptualization of "Fair Explanation": Disparate Impacts of anti-Asian Hate Speech Explanations on Content Moderators

    Authors: Tin Nguyen, Jiannan Xu, Aayushi Roy, Hal Daumé III, Marine Carpuat

    Abstract: Recent research at the intersection of AI explainability and fairness has focused on how explanations can improve human-plus-AI task performance as assessed by fairness measures. We propose to characterize what constitutes an explanation that is itself "fair" -- an explanation that does not adversely impact specific populations. We formulate a novel evaluation method of "fair explanations" using n…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference (Long Paper)

  14. arXiv:2305.14993  [pdf, other]

    cs.CL

    Controlling Pre-trained Language Models for Grade-Specific Text Simplification

    Authors: Sweta Agrawal, Marine Carpuat

    Abstract: Text simplification (TS) systems rewrite text to make it more readable while preserving its content. However, what makes a text easy to read depends on the intended readers. Recent work has shown that pre-trained language models can simplify text using a wealth of techniques to control output simplicity, ranging from specifying only the desired reading grade level, to directly specifying low-level…

    Submitted 30 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  15. arXiv:2305.14331  [pdf, other]

    cs.CL cs.AI

    What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems

    Authors: Navita Goyal, Eleftheria Briakou, Amanda Liu, Connor Baumler, Claire Bonial, Jeffrey Micher, Clare R. Voss, Marine Carpuat, Hal Daumé III

    Abstract: NLP systems have shown impressive performance at answering questions by retrieving relevant context. However, with the increasingly large models, it is impossible and often undesirable to constrain models' knowledge or reasoning to only the retrieved context. This leads to a mismatch between the information that the models access to derive the answer and the information that is available to the us…

    Submitted 25 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  16. arXiv:2301.07779  [pdf, other]

    cs.CL

    Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection

    Authors: Weijia Xu, Sweta Agrawal, Eleftheria Briakou, Marianna J. Martindale, Marine Carpuat

    Abstract: Neural sequence generation models are known to "hallucinate", by producing outputs that are unrelated to the source text. These hallucinations are potentially harmful, yet it remains unclear in what conditions they arise and how to mitigate their impact. In this work, we first identify internal model symptoms of hallucinations by analyzing the relative token contributions to the generation in cont…

    Submitted 24 February, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Accepted at TACL

  17. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  18. arXiv:2209.02906  [pdf]

    cs.CL cs.HC

    Facilitating Global Team Meetings Between Language-Based Subgroups: When and How Can Machine Translation Help?

    Authors: Yongle Zhang, Dennis Asamoah Owusu, Marine Carpuat, Ge Gao

    Abstract: Global teams frequently consist of language-based subgroups who put together complementary information to achieve common goals. Previous research outlines a two-step work communication flow in these teams. There are team meetings using a required common language (i.e., English); in preparation for those meetings, people have subgroup conversations in their native languages. Work communication at t…

    Submitted 27 September, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Journal ref: PACM on Human-Computer Interaction, Vol. 6, No. CSCW1, Article 90, Publication date: April 2022

  19. arXiv:2205.06644  [pdf, other]

    cs.CL

    Controlling Translation Formality Using Pre-trained Multilingual Language Models

    Authors: Elijah Rippeth, Sweta Agrawal, Marine Carpuat

    Abstract: This paper describes the University of Maryland's submission to the Special Task on Formality Control for Spoken Language Translation at IWSLT, which evaluates translation from English into 6 languages with diverse grammatical formality markers. We investigate to what extent this problem can be addressed with a single multilingual model, simultaneously controlling its output for target l…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: 9 pages, 2 figures, IWSLT22 camera-ready (system paper @ ACL-IWSLT Shared Task on Formality Control for Spoken Language Translation)

  20. arXiv:2203.09486  [pdf, other]

    cs.CL

    An Imitation Learning Curriculum for Text Editing with Non-Autoregressive Models

    Authors: Sweta Agrawal, Marine Carpuat

    Abstract: We propose a framework for training non-autoregressive sequence-to-sequence models for editing tasks, where the original input sequence is iteratively edited to produce the output. We show that the imitation learning algorithms designed to train such models for machine translation introduce mismatches between training and inference that lead to undertraining and poor generalization in editing sce…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: To appear in ACL 2022

  21. arXiv:2203.07643  [pdf, other]

    cs.CL

    Can Synthetic Translations Improve Bitext Quality?

    Authors: Eleftheria Briakou, Marine Carpuat

    Abstract: Synthetic translations have been used for a wide range of NLP tasks primarily as a means of data augmentation. This work explores, instead, how synthetic translations can be used to revise potentially imperfect reference translations in mined bitext. We find that synthetic samples can improve bitext quality without any additional bilingual supervision when they replace the originals based on a sem…

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  22. arXiv:2110.10668  [pdf, other]

    cs.CL

    Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

    Authors: Eleftheria Briakou, Sweta Agrawal, Joel Tetreault, Marine Carpuat

    Abstract: While the field of style transfer (ST) has been growing rapidly, it has been hampered by a lack of standardized practices for automatic evaluation. In this paper, we evaluate leading ST automatic metrics on the oft-researched task of formality style transfer. Unlike previous evaluations, which focus solely on English, we expand our focus to Brazilian-Portuguese, French, and Italian, making this wo…

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021

  23. arXiv:2109.04620  [pdf, other]

    cs.CL

    Rule-based Morphological Inflection Improves Neural Terminology Translation

    Authors: Weijia Xu, Marine Carpuat

    Abstract: Current approaches to incorporating terminology constraints in machine translation (MT) typically assume that the constraint terms are provided in their correct morphological forms. This limits their application to real-world scenarios where constraint terms are provided as lemmas. In this paper, we introduce a modular framework for incorporating lemma constraints in neural MT (NMT) in which lingu…

    Submitted 7 October, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  24. arXiv:2106.04747  [pdf, other]

    cs.CL

    A Review of Human Evaluation for Style Transfer

    Authors: Eleftheria Briakou, Sweta Agrawal, Ke Zhang, Joel Tetreault, Marine Carpuat

    Abstract: This paper reviews and summarizes human evaluation practices described in 97 style transfer papers with respect to three main evaluation aspects: style transfer, meaning preservation, and fluency. In principle, evaluations by human raters should be the most reliable. However, in style transfer papers, we find that protocols for human evaluations are often underspecified and not standardized, which…

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: GEM 2021

  25. arXiv:2105.15087  [pdf, other]

    cs.CL

    Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation

    Authors: Eleftheria Briakou, Marine Carpuat

    Abstract: While it has been shown that Neural Machine Translation (NMT) is highly sensitive to noisy parallel training samples, prior work treats all types of mismatches between source and target as noise. As a result, it remains unclear how samples that are mostly equivalent but contain a small number of semantically divergent tokens impact NMT training. To close this gap, we analyze the impact of differen…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  26. arXiv:2105.12900  [pdf, other]

    cs.CL

    How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation?

    Authors: Weijia Xu, Shuming Ma, Dongdong Zhang, Marine Carpuat

    Abstract: While non-autoregressive (NAR) models are showing great promise for machine translation, their use is limited by their dependence on knowledge distillation from autoregressive models. To address this issue, we seek to understand why distillation is so effective. Prior work suggests that distilled training data is less complex than manual translations. Based on experiments with the Levenshtein Tran…

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: Findings of ACL 2021

  27. arXiv:2011.06868  [pdf, other]

    cs.CL cs.LG

    EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints

    Authors: Weijia Xu, Marine Carpuat

    Abstract: We introduce an Edit-Based Transformer with Repositioning (EDITOR), which makes sequence generation flexible by seamlessly allowing users to specify preferences in output lexical choice. Building on recent models for non-autoregressive sequence generation (Gu et al., 2019), EDITOR generates new sequences by iteratively editing hypotheses. It relies on a novel reposition operation designed to disen…

    Submitted 19 January, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: TACL 2021

  28. arXiv:2010.09608  [pdf, other]

    cs.CL

    Incorporating Terminology Constraints in Automatic Post-Editing

    Authors: David Wan, Chris Kedzie, Faisal Ladhak, Marine Carpuat, Kathleen McKeown

    Abstract: Users of machine translation (MT) may want to ensure the use of specific lexical terminologies. While there exist techniques for incorporating terminology constraints during inference for MT, current APE approaches cannot ensure that they will appear in the final translation. In this paper, we present both autoregressive and non-autoregressive models for lexically constrained APE, demonstrating th…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: To appear in WMT, 2020

  29. arXiv:2010.03662  [pdf, other]

    cs.CL

    Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank

    Authors: Eleftheria Briakou, Marine Carpuat

    Abstract: Detecting fine-grained differences in content conveyed in different languages matters for cross-lingual NLP and multilingual corpora analysis, but it is a challenging machine learning problem since annotation is expensive and hard to scale. This work improves the prediction and annotation of fine-grained semantic divergences. We introduce a training strategy for multilingual BERT models by learnin…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  30. arXiv:2010.03412  [pdf, other]

    cs.CL cs.LG

    Dual Reconstruction: a Unifying Objective for Semi-Supervised Neural Machine Translation

    Authors: Weijia Xu, Xing Niu, Marine Carpuat

    Abstract: While Iterative Back-Translation and Dual Learning effectively incorporate monolingual training data in neural machine translation, they use different objectives and heuristic gradient approximation strategies, and have not been extensively compared. We introduce a novel dual reconstruction objective that provides a unified view of Iterative Back-Translation and Dual Learning. It motivates a theor…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted at Findings of EMNLP 2020

  31. arXiv:1911.08706  [pdf, other]

    cs.CL

    Controlling Neural Machine Translation Formality with Synthetic Supervision

    Authors: Xing Niu, Marine Carpuat

    Abstract: This work aims to produce translations that convey source language content at a formality level that is appropriate for a particular audience. Framing this problem as a neural sequence-to-sequence task ideally requires training triplets consisting of a bilingual sentence pair labeled with target language formality. However, in practice, available training examples are limited to English sentence p…

    Submitted 27 November, 2019; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted at AAAI 2020

  32. arXiv:1911.00835  [pdf, other]

    cs.CL

    Controlling Text Complexity in Neural Machine Translation

    Authors: Sweta Agrawal, Marine Carpuat

    Abstract: This work introduces a machine translation task where the output is aimed at audiences of different levels of target language proficiency. We collect a high quality dataset of news articles available in English and Spanish, written for diverse grade levels and propose a method to align segments across comparable bilingual articles. The resulting dataset makes it possible to train multi-task sequen…

    Submitted 3 November, 2019; originally announced November 2019.

    Comments: Accepted to EMNLP-IJCNLP 2019

  33. arXiv:1905.05816  [pdf, other]

    cs.CL

    Curriculum Learning for Domain Adaptation in Neural Machine Translation

    Authors: Xuan Zhang, Pamela Shapiro, Gaurav Kumar, Paul McNamee, Marine Carpuat, Kevin Duh

    Abstract: We introduce a curriculum learning approach to adapt generic neural machine translation models to a specific domain. Samples are grouped by their similarities to the domain of interest and each group is fed to the training algorithm with a particular schedule. This approach is simple to implement on top of any neural framework or architecture, and consistently outperforms both unadapted and adapte…

    Submitted 14 May, 2019; originally announced May 2019.

  34. arXiv:1904.04079  [pdf, other]

    cs.CL stat.ML

    Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation

    Authors: Weijia Xu, Xing Niu, Marine Carpuat

    Abstract: Despite some empirical success at correcting exposure bias in machine translation, scheduled sampling algorithms suffer from a major drawback: they incorrectly assume that words in the reference translations and in sampled sequences are aligned at each time step. Our new differentiable sampling algorithm addresses this issue by optimizing the probability that the reference can be aligned with the…

    Submitted 4 May, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

    Comments: Accepted at NAACL 2019

  35. arXiv:1811.01116  [pdf, other]

    cs.CL

    Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation

    Authors: Xing Niu, Weijia Xu, Marine Carpuat

    Abstract: We aim to better exploit the limited amounts of parallel text available in low-resource settings by introducing a differentiable reconstruction loss for neural machine translation (NMT). This loss compares original inputs to reconstructed inputs, obtained by back-translating translation hypotheses into the input language. We leverage differentiable sampling and bi-directional NMT to train models e…

    Submitted 3 April, 2019; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: Accepted at NAACL 2019

  36. arXiv:1811.00739  [pdf, other]

    cs.CL cs.LG

    An Empirical Exploration of Curriculum Learning for Neural Machine Translation

    Authors: Xuan Zhang, Gaurav Kumar, Huda Khayrallah, Kenton Murray, Jeremy Gwinnup, Marianna J Martindale, Paul McNamee, Kevin Duh, Marine Carpuat

    Abstract: Machine translation systems based on deep neural networks are expensive to train. Curriculum learning aims to address this issue by choosing the order in which samples are presented during training to help train better models faster. We adopt a probabilistic view of curriculum learning, which lets us flexibly evaluate the impact of curricula design, and perform an extensive exploration on a German…

    Submitted 2 November, 2018; originally announced November 2018.

  37. arXiv:1806.04357  [pdf, other]

    cs.CL

    Multi-Task Neural Models for Translating Between Styles Within and Across Languages

    Authors: Xing Niu, Sudha Rao, Marine Carpuat

    Abstract: Generating natural language requires conveying content in an appropriate style. We explore two related tasks on generating text of varying formality: monolingual formality transfer and formality-sensitive machine translation. We propose to solve these tasks jointly using multi-task learning, and show that our models achieve state-of-the-art performance for formality transfer and are able to perfor…

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: Accepted at the 27th International Conference on Computational Linguistics (COLING 2018)

  38. arXiv:1805.11213  [pdf, other]

    cs.CL

    Bi-Directional Neural Machine Translation with Synthetic Parallel Data

    Authors: Xing Niu, Michael Denkowski, Marine Carpuat

    Abstract: Despite impressive progress in high-resource settings, Neural Machine Translation (NMT) still struggles in low-resource and out-of-domain scenarios, often failing to match the quality of phrase-based translation. We propose a novel technique that combines back-translation and multilingual NMT to improve performance in these difficult cases. Our technique trains a single model for both directions o…

    Submitted 30 May, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Accepted at the 2nd Workshop on Neural Machine Translation and Generation (WNMT 2018)

  39. arXiv:1803.11291  [pdf, other]

    cs.CL

    Robust Cross-lingual Hypernymy Detection using Dependency Context

    Authors: Shyam Upadhyay, Yogarshi Vyas, Marine Carpuat, Dan Roth

    Abstract: Cross-lingual Hypernymy Detection involves determining if a word in one language ("fruit") is a hypernym of a word in another language ("pomme" i.e. apple in French). The ability to detect hypernymy cross-lingually can aid in solving cross-lingual versions of tasks such as textual entailment and event coreference. We propose BISPARSE-DEP, a family of unsupervised approaches for cross-lingual hyper…

    Submitted 29 March, 2018; originally announced March 2018.

    Comments: NAACL 2018. SU and YV contributed equally

  40. arXiv:1803.11112  [pdf, other]

    cs.CL

    Identifying Semantic Divergences in Parallel Text without Annotations

    Authors: Yogarshi Vyas, Xing Niu, Marine Carpuat

    Abstract: Recognizing that even correct translations are not always semantically equivalent, we automatically detect meaning divergences in parallel sentence pairs with a deep neural model of bilingual semantic similarity which can be trained for any parallel corpus without any manual annotation. We show that our semantic model detects divergences more accurately than models based on surface features derive…

    Submitted 29 March, 2018; originally announced March 2018.

    Comments: Accepted as a full paper to NAACL 2018

  41. arXiv:1802.06041  [pdf, other]

    cs.CL

    Fluency Over Adequacy: A Pilot Study in Measuring User Trust in Imperfect MT

    Authors: Marianna J. Martindale, Marine Carpuat

    Abstract: Although measuring intrinsic quality has been a key factor in the advancement of Machine Translation (MT), successfully deploying MT requires considering not just intrinsic quality but also the user experience, including aspects such as trust. This work introduces a method of studying how users modulate their trust in an MT system after seeing errorful (disfluent or inadequate) output amidst good…

    Submitted 16 February, 2018; originally announced February 2018.

    Comments: To appear at AMTA 2018