Skip to main content

Showing 1–12 of 12 results for author: Dessì, R

  1. arXiv:2306.14209  [pdf, other

    cs.CV cs.AI

    Deep image prior inpainting of ancient frescoes in the Mediterranean Alpine arc

    Authors: Fabio Merizzi, Perrine Saillard, Oceane Acquier, Elena Morotti, Elena Loli Piccolomini, Luca Calatroni, Rosa Maria Dessì

    Abstract: The unprecedented success of image reconstruction approaches based on deep neural networks has revolutionised both the processing and the analysis paradigms in several applied disciplines. In the field of digital humanities, the task of digital reconstruction of ancient frescoes is particularly challenging due to the scarce amount of available training data caused by ageing, wear, tear and retouch… ▽ More

    Submitted 11 December, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: 26 pages

  2. arXiv:2304.01662  [pdf, other

    cs.CV cs.AI cs.CL

    Cross-Domain Image Captioning with Discriminative Finetuning

    Authors: Roberto Dessì, Michele Bevilacqua, Eleonora Gualdoni, Nathanael Carraz Rakotonirina, Francesca Franzon, Marco Baroni

    Abstract: Neural captioners are typically trained to mimic human-generated references without optimizing for any specific communication goal, leading to problems such as the generation of vague captions. In this paper, we show that fine-tuning an out-of-the-box neural captioner with a self-supervised discriminative communication objective helps to recover a plain, visually descriptive language that is more… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  3. arXiv:2302.09865  [pdf, other

    cs.CL cs.AI cs.LG

    Can discrete information extraction prompts generalize across language models?

    Authors: Nathanaël Carraz Rakotonirina, Roberto Dessì, Fabio Petroni, Sebastian Riedel, Marco Baroni

    Abstract: We study whether automatically-induced prompts that effectively extract information from a language model can also be used, out-of-the-box, to probe other language models for the same information. After confirming that discrete prompts induced with the AutoPrompt algorithm outperform manual and semi-manual prompts on the slot-filling task, we demonstrate a drop in performance for AutoPrompt prompt… ▽ More

    Submitted 7 March, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Published as conference paper at ICLR 2023

  4. arXiv:2302.08913  [pdf, other

    cs.CV cs.AI cs.LG

    Referential communication in heterogeneous communities of pre-trained visual deep networks

    Authors: Matéo Mahaut, Francesca Franzon, Roberto Dessì, Marco Baroni

    Abstract: As large pre-trained image-processing neural networks are being embedded in autonomous agents such as self-driving cars or robots, the question arises of how such systems can communicate with each other about the surrounding world, despite their different architectures and training regimes. As a first step in this direction, we systematically explore the task of \textit{referential communication}… ▽ More

    Submitted 13 March, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

  5. arXiv:2302.07842  [pdf, ps, other

    cs.CL

    Augmented Language Models: a Survey

    Authors: Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann LeCun, Thomas Scialom

    Abstract: This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools. The former is defined as decomposing a potentially complex task into simpler subtasks while the latter consists in calling external modules such as a code interpreter. LMs can leverage these augmentations separately or in combination via heuristics, or learn to do so from demo… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  6. arXiv:2302.04761  [pdf, other

    cs.CL

    Toolformer: Language Models Can Teach Themselves to Use Tools

    Authors: Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom

    Abstract: Language models (LMs) exhibit remarkable abilities to solve new tasks from just a few examples or textual instructions, especially at scale. They also, paradoxically, struggle with basic functionality, such as arithmetic or factual lookup, where much simpler and smaller models excel. In this paper, we show that LMs can teach themselves to use external tools via simple APIs and achieve the best of… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  7. arXiv:2210.11512  [pdf, other

    cs.CL

    Communication breakdown: On the low mutual intelligibility between human and neural captioning

    Authors: Roberto Dessì, Eleonora Gualdoni, Francesca Franzon, Gemma Boleda, Marco Baroni

    Abstract: We compare the 0-shot performance of a neural caption-based image retriever when given as input either human-produced captions or captions generated by a neural captioner. We conduct this comparison on the recently introduced ImageCoDe data-set (Krojer et al., 2022) which contains hard distractors nearly identical to the images to be retrieved. We find that the neural retriever has much higher per… ▽ More

    Submitted 27 April, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted as a short paper at EMNLP 2022

  8. arXiv:2107.01366  [pdf, other

    cs.CL cs.AI cs.LG

    Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN

    Authors: Rahma Chaabouni, Roberto Dessì, Eugene Kharitonov

    Abstract: Despite their practical success, modern seq2seq architectures are unable to generalize systematically on several SCAN tasks. Hence, it is not clear if SCAN-style compositional generalization is useful in realistic NLP tasks. In this work, we study the benefit that such compositionality brings about to several machine translation tasks. We present several focused modifications of Transformer that g… ▽ More

    Submitted 16 September, 2021; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: BlackboxNLP workshop, EMNLP 2021

  9. arXiv:2106.04258  [pdf, other

    cs.CL cs.AI cs.LG cs.MA

    Interpretable agent communication from scratch (with a generic visual processor emerging on the side)

    Authors: Roberto Dessì, Eugene Kharitonov, Marco Baroni

    Abstract: As deep networks begin to be deployed as autonomous agents, the issue of how they can communicate with each other becomes important. Here, we train two deep nets from scratch to perform realistic referent identification through unsupervised emergent communication. We show that the largely interpretable emergent protocol allows the nets to successfully communicate even about object types they did n… ▽ More

    Submitted 15 October, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS 2021

  10. arXiv:1911.01892  [pdf, ps, other

    cs.CL cs.AI

    Focus on What's Informative and Ignore What's not: Communication Strategies in a Referential Game

    Authors: Roberto Dessì, Diane Bouchacourt, Davide Crepaldi, Marco Baroni

    Abstract: Research in multi-agent cooperation has shown that artificial agents are able to learn to play a simple referential game while developing a shared lexicon. This lexicon is not easy to analyze, as it does not show many properties of a natural language. In a simple referential game with two neural network-based agents, we analyze the object-symbol mapping trying to understand what kind of strategy w… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 3rd NeurIPS Workshop on Emergent Communication

  11. arXiv:1905.08527  [pdf, other

    cs.CL cs.AI cs.LG

    CNNs found to jump around more skillfully than RNNs: Compositional generalization in seq2seq convolutional networks

    Authors: Roberto Dessì, Marco Baroni

    Abstract: Lake and Baroni (2018) introduced the SCAN dataset probing the ability of seq2seq models to capture compositional generalizations, such as inferring the meaning of "jump around" 0-shot from the component words. Recurrent networks (RNNs) were found to completely fail the most challenging generalization cases. We test here a convolutional network (CNN) on these tasks, reporting hugely improved perfo… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

    Comments: accepted as a short paper at ACL 2019

  12. arXiv:1810.07652  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Fine-tuning on Clean Data for End-to-End Speech Translation: FBK @ IWSLT 2018

    Authors: Mattia Antonino Di Gangi, Roberto Dessì, Roldano Cattoni, Matteo Negri, Marco Turchi

    Abstract: This paper describes FBK's submission to the end-to-end English-German speech translation task at IWSLT 2018. Our system relies on a state-of-the-art model based on LSTMs and CNNs, where the CNNs are used to reduce the temporal dimension of the audio input, which is in general much higher than machine translation input. Our model was trained only on the audio-to-text parallel data released for the… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: 6 pages, 2 figures, system description at the 15th International Workshop on Spoken Language Translation (IWSLT) 2018