Skip to main content

Showing 1–8 of 8 results for author: Raganato, A

  1. arXiv:2403.07726  [pdf, other

    cs.CL

    SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

    Authors: Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki

    Abstract: This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate. Such cases of overgeneration put in jeopardy many NLG applications, where correctness is often mission-critical. The shared task was conducted with a newly constructed dataset of 4000 model outputs labeled by 5 ann… ▽ More

    Submitted 29 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: SemEval 2024 shared task. Pre-review version

  2. arXiv:2403.07544  [pdf, other

    cs.CL

    MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki

    Authors: Timothee Mickus, Stig-Arne Grönroos, Joseph Attieh, Michele Boggia, Ona De Gibert, Shaoxiong Ji, Niki Andreas Lopi, Alessandro Raganato, Raúl Vázquez, Jörg Tiedemann

    Abstract: NLP in the age of monolithic large language models is approaching its limits in terms of size and information that can be handled. The trend goes to modularization, a necessary step into the direction of designing smaller sub-networks and components with specialized functionality. In this paper, we present the MAMMOTH toolkit: a framework designed for training massively multilingual modular machin… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Presented as a demo at EACL 2024

  3. arXiv:2212.01936  [pdf, other

    cs.CL

    Democratizing Neural Machine Translation with OPUS-MT

    Authors: Jörg Tiedemann, Mikko Aulamo, Daria Bakshandaeva, Michele Boggia, Stig-Arne Grönroos, Tommi Nieminen, Alessandro Raganato, Yves Scherrer, Raul Vazquez, Sami Virpioja

    Abstract: This paper presents the OPUS ecosystem with a focus on the development of open machine translation models and tools, and their integration into end-user applications, development platforms and professional workflows. We discuss our on-going mission of increasing language coverage and translation quality, and also describe on-going work on the development of modular translation models and speed-opt… ▽ More

    Submitted 4 July, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

  4. arXiv:2010.06478  [pdf, other

    cs.CL

    XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization

    Authors: Alessandro Raganato, Tommaso Pasini, Jose Camacho-Collados, Mohammad Taher Pilehvar

    Abstract: The ability to correctly model distinct meanings of a word is crucial for the effectiveness of semantic representation techniques. However, most existing evaluation benchmarks for assessing this criterion are tied to sense inventories (usually WordNet), restricting their usage to a small subset of knowledge-based representation techniques. The Word-in-Context dataset (WiC) addresses the dependence… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: EMNLP2020

  5. arXiv:2002.10260  [pdf, other

    cs.CL

    Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation

    Authors: Alessandro Raganato, Yves Scherrer, Jörg Tiedemann

    Abstract: Transformer-based models have brought a radical change to neural machine translation. A key feature of the Transformer architecture is the so-called multi-head attention mechanism, which allows the model to focus simultaneously on different parts of the input. However, recent works have shown that most attention heads learn simple, and often redundant, positional patterns. In this paper, we propos… ▽ More

    Submitted 5 October, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted to Findings of EMNLP 2020

  6. arXiv:1906.04040  [pdf, other

    cs.CL

    The University of Helsinki submissions to the WMT19 news translation task

    Authors: Aarne Talman, Umut Sulubacak, Raúl Vázquez, Yves Scherrer, Sami Virpioja, Alessandro Raganato, Arvi Hurskainen, Jörg Tiedemann

    Abstract: In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained both senten… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: To appear in WMT19

  7. Multilingual NMT with a language-independent attention bridge

    Authors: Raúl Vázquez, Alessandro Raganato, Jörg Tiedemann, Mathias Creutz

    Abstract: In this paper, we propose a multilingual encoder-decoder architecture capable of obtaining multilingual sentence representations by means of incorporating an intermediate {\em attention bridge} that is shared across all languages. That is, we train the model with language-specific encoders and decoders that are connected via self-attention with a shared layer that we call attention bridge. This la… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Journal ref: Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019) Pages 33-39

  8. arXiv:1608.06718  [pdf, other

    cs.CL

    A Large-Scale Multilingual Disambiguation of Glosses

    Authors: José Camacho Collados, Claudio Delli Bovi, Alessandro Raganato, Roberto Navigli

    Abstract: Linking concepts and named entities to knowledge bases has become a crucial Natural Language Understanding task. In this respect, recent works have shown the key advantage of exploiting textual definitions in various Natural Language Processing applications. However, to date there are no reliable large-scale corpora of sense-annotated textual definitions available to the research community. In thi… ▽ More

    Submitted 24 August, 2016; originally announced August 2016.

    Comments: Accepted in LREC 2016

    Journal ref: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), 2016, pages 1701-1708, Portoroz, Slovenia