Skip to main content

Showing 1–33 of 33 results for author: Staiano, J

  1. arXiv:2404.03623  [pdf, other

    cs.CL cs.AI cs.CY

    Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph

    Authors: Marco Bronzini, Carlo Nicolini, Bruno Lepri, Jacopo Staiano, Andrea Passerini

    Abstract: Large Language Models (LLMs) demonstrate an impressive capacity to recall a vast range of common factual knowledge information. However, unravelling the underlying reasoning of LLMs and explaining their internal mechanisms of exploiting this factual knowledge remain active areas of investigation. Our work analyzes the factual knowledge encoded in the latent representation of LLMs when prompted to… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Preprint. Under review. 10 pages, 7 figures

  2. arXiv:2403.20288  [pdf, other

    cs.CL cs.AI

    Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain

    Authors: Burcu Sayin, Pasquale Minervini, Jacopo Staiano, Andrea Passerini

    Abstract: We explore the potential of Large Language Models (LLMs) to assist and potentially correct physicians in medical decision-making tasks. We evaluate several LLMs, including Meditron, Llama2, and Mistral, to analyze the ability of these models to interact effectively with physicians across different scenarios. We consider questions from PubMedQA and several tasks, ranging from binary (yes/no) respon… ▽ More

    Submitted 6 May, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted for oral presentation at NAACL 2024, The 6th Clinical Natural Language Processing Workshop

  3. arXiv:2403.08739  [pdf, other

    cs.CL cond-mat.dis-nn cond-mat.stat-mech cs.AI

    The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models

    Authors: Carlo Nicolini, Jacopo Staiano, Bruno Lepri, Raffaele Marino

    Abstract: A substantial gap persists in understanding the reasons behind the exceptional performance of the Transformer architecture in NLP. A particularly unexplored area involves the mechanistic description of how the distribution of parameters evolves over time during training. In this work we suggest that looking at the time evolution of the statistic distribution of model parameters, and specifically a… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 15 pages

  4. arXiv:2311.10587  [pdf, other

    cs.CL

    Countering Misinformation via Emotional Response Generation

    Authors: Daniel Russo, Shane Peter Kaszefski-Yaschuk, Jacopo Staiano, Marco Guerini

    Abstract: The proliferation of misinformation on social media platforms (SMPs) poses a significant danger to public health, social cohesion and ultimately democracy. Previous research has shown how social correction can be an effective way to curb misinformation, by engaging directly in a constructive dialogue with users who spread -- often in good faith -- misleading messages. Although professional fact-ch… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 main conference

  5. arXiv:2310.05628  [pdf, other

    cs.CL cs.CE cs.CY

    Glitter or Gold? Deriving Structured Insights from Sustainability Reports via Large Language Models

    Authors: Marco Bronzini, Carlo Nicolini, Bruno Lepri, Andrea Passerini, Jacopo Staiano

    Abstract: Over the last decade, several regulatory bodies have started requiring the disclosure of non-financial information from publicly listed companies, in light of the investors' increasing attention to Environmental, Social, and Governance (ESG) issues. Publicly released information on sustainability practices is often disclosed in diverse, unstructured, and multi-modal documentation. This poses a cha… ▽ More

    Submitted 16 January, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  6. arXiv:2307.09416  [pdf, other

    cs.CV cs.CL

    Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation

    Authors: Federico Betti, Jacopo Staiano, Lorenzo Baraldi, Lorenzo Baraldi, Rita Cucchiara, Nicu Sebe

    Abstract: Research in Image Generation has recently made significant progress, particularly boosted by the introduction of Vision-Language models which are able to produce high-quality visual content based on textual inputs. Despite ongoing advancements in terms of generation quality and realism, no methodical frameworks have been defined yet to quantitatively measure the quality of the generated content an… ▽ More

    Submitted 19 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted as oral at ACM MultiMedia 2023 (Brave New Ideas track)

  7. arXiv:2301.11312  [pdf, other

    cs.CL

    LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization

    Authors: Laura Nguyen, Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Text Summarization is a popular task and an active area of research for the Natural Language Processing community. By definition, it requires to account for long input texts, a characteristic which poses computational challenges for neural models. Moreover, real-world documents come in a variety of complex, visually-rich, layouts. This information is of great relevance, whether to highlight salien… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: To be published in EACL 2023

  8. Which Discriminator for Cooperative Text Generation?

    Authors: Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, Ewa Kijak, Vincent Claveau

    Abstract: Language models generate texts by successively predicting probability distributions for next tokens given past ones. A growing field of interest tries to leverage external information in the decoding process so that the generated texts have desired properties, such as being more natural, non toxic, faithful, or having a specific writing style. A solution is to use a classifier at each generation s… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 6 pages, 2 figures, accepted to SIGIR 2022

  9. arXiv:2201.12320  [pdf, other

    cs.LG cs.CL

    Generative Cooperative Networks for Natural Language Generation

    Authors: Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski

    Abstract: Generative Adversarial Networks (GANs) have known a tremendous success for many continuous generation tasks, especially in the field of image generation. However, for discrete outputs such as language, optimizing GANs remains an open problem with many instabilities, as no gradient can be properly back-propagated from the discriminator output to the generator parameters. An alternative is to learn… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  10. arXiv:2109.01078  [pdf, other

    cs.CL

    Skim-Attention: Learning to Focus via Document Layout

    Authors: Laura Nguyen, Thomas Scialom, Jacopo Staiano, Benjamin Piwowarski

    Abstract: Transformer-based pre-training techniques of text and layout have proven effective in a number of document understanding tasks. Despite this success, multimodal pre-training models suffer from very high computational and memory costs. Motivated by human reading strategies, this paper presents Skim-Attention, a new attention mechanism that takes advantage of the structure of the document and its la… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: 15 pages, 6 figures, to be published in EMNLP 2021 Findings

  11. arXiv:2106.06363  [pdf, other

    cs.CL cs.AI

    To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods. This is a much harder setting than for continuous tasks, which enjoy gradient flows from discriminators to generators, usually leading to dramatic learning instabilities. However, we claim that this can be solved by making discriminator and… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  12. arXiv:2104.07560  [pdf, other

    cs.CL

    Rethinking Automatic Evaluation in Sentence Simplification

    Authors: Thomas Scialom, Louis Martin, Jacopo Staiano, Éric Villemonte de la Clergerie, Benoît Sagot

    Abstract: Automatic evaluation remains an open research question in Natural Language Generation. In the context of Sentence Simplification, this is particularly challenging: the task requires by nature to replace complex words with simpler ones that shares the same meaning. This limits the effectiveness of n-gram based metrics like BLEU. Going hand in hand with the recent advances in NLG, new metrics have b… ▽ More

    Submitted 16 April, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: updated affiliation and link to data

  13. arXiv:2104.07555  [pdf, other

    cs.CL

    Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic Evaluation

    Authors: Clément Rebuffel, Thomas Scialom, Laure Soulier, Benjamin Piwowarski, Sylvain Lamprier, Jacopo Staiano, Geoffrey Scoutheeten, Patrick Gallinari

    Abstract: QuestEval is a reference-less metric used in text-to-text tasks, that compares the generated summaries directly to the source text, by automatically asking and answering questions. Its adaptation to Data-to-Text tasks is not straightforward, as it requires multimodal Question Generation and Answering systems on the considered tasks, which are seldom available. To this purpose, we propose a method… ▽ More

    Submitted 7 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021

  14. arXiv:2103.12693  [pdf, other

    cs.CL

    QuestEval: Summarization Asks for Fact-based Evaluation

    Authors: Thomas Scialom, Paul-Alexis Dray, Patrick Gallinari, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano, Alex Wang

    Abstract: Summarization evaluation remains an open research problem: current metrics such as ROUGE are known to be limited and to correlate poorly with human judgments. To alleviate this issue, recent work has proposed evaluation metrics which rely on question answering models to assess whether a summary contains all the relevant information in its source document. Though promising, the proposed approaches… ▽ More

    Submitted 9 April, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: project page: https://github.com/recitalAI/QuestEval

  15. arXiv:2010.12643  [pdf, other

    cs.CL

    Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering

    Authors: Arij Riabi, Thomas Scialom, Rachel Keraron, Benoît Sagot, Djamé Seddah, Jacopo Staiano

    Abstract: Coupled with the availability of large scale datasets, deep learning architectures have enabled rapid progress on the Question Answering task. However, most of those datasets are in English, and the performances of state-of-the-art multilingual models are significantly lower when evaluated on non-English data. Due to high data collection costs, it is not realistic to obtain annotated data for each… ▽ More

    Submitted 14 October, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: 7 pages

  16. arXiv:2010.03369  [pdf, other

    cs.CL cs.AI

    Toward Stance-based Personas for Opinionated Dialogues

    Authors: Thomas Scialom, Serra Sinem Tekiroglu, Jacopo Staiano, Marco Guerini

    Abstract: In the context of chit-chat dialogues it has been shown that endowing systems with a persona profile is important to produce more coherent and meaningful conversations. Still, the representation of such personas has thus far been limited to a fact-based representation (e.g. "I have two cats."). We argue that these representations remain superficial w.r.t. the complexity of human personality. In th… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted at Findings of EMNLP 2020

  17. arXiv:2007.00968  [pdf, other

    cs.CL

    Project PIAF: Building a Native French Question-Answering Dataset

    Authors: Rachel Keraron, Guillaume Lancrenon, Mathilde Bras, Frédéric Allary, Gilles Moyse, Thomas Scialom, Edmundo-Pavel Soriano-Morales, Jacopo Staiano

    Abstract: Motivated by the lack of data for non-English languages, in particular for the evaluation of downstream tasks such as Question Answering, we present a participatory effort to collect a native French Question Answering Dataset. Furthermore, we describe and publicly release the annotation tool developed for our collection effort, along with the data obtained and preliminary baselines.

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: LREC 2020

  18. arXiv:2006.04643  [pdf, other

    cs.CL cs.LG

    ColdGANs: Taming Language GANs with Cautious Sampling Strategies

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Training regimes based on Maximum Likelihood Estimation (MLE) suffer from known limitations, often leading to poorly generated text sequences. At the root of these limitations is the mismatch between training and inference, i.e. the so-called exposure bias, exacerbated by considering only the reference texts as correct, while in practice several alternative formulations could be as good. Generativ… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

  19. arXiv:2004.14900  [pdf, other

    cs.CL

    MLSUM: The Multilingual Summarization Corpus

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: We present MLSUM, the first large-scale MultiLingual SUMmarization dataset. Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five different languages -- namely, French, German, Spanish, Russian, Turkish. Together with English newspapers from the popular CNN/Daily mail dataset, the collected data form a large scale multilingual dataset which can enable new research direct… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

  20. arXiv:2002.10832  [pdf, other

    cs.CL cs.CV cs.LG

    What BERT Sees: Cross-Modal Transfer for Visual Question Generation

    Authors: Thomas Scialom, Patrick Bordes, Paul-Alexis Dray, Jacopo Staiano, Patrick Gallinari

    Abstract: Pre-trained language models have recently contributed to significant advances in NLP tasks. Recently, multi-modal versions of BERT have been developed, using heavy pre-training relying on vast corpora of aligned textual and image data, primarily applied to classification tasks such as VQA. In this paper, we are interested in evaluating the visual capabilities of BERT out-of-the-box, by avoiding pr… ▽ More

    Submitted 16 December, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: INLG 2020

  21. arXiv:2002.10375  [pdf, other

    cs.CL cs.LG

    Discriminative Adversarial Search for Abstractive Summarization

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: We introduce a novel approach for sequence decoding, Discriminative Adversarial Search (DAS), which has the desirable properties of alleviating the effects of exposure bias without requiring external metrics. Inspired by Generative Adversarial Networks (GANs), wherein a discriminator is used to improve the generator, our method differs from GANs in that the generator parameters are not updated at… ▽ More

    Submitted 30 August, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  22. arXiv:1911.03350  [pdf, other

    cs.CL cs.AI

    Ask to Learn: A Study on Curiosity-driven Question Generation

    Authors: Thomas Scialom, Jacopo Staiano

    Abstract: We propose a novel text generation task, namely Curiosity-driven Question Generation. We start from the observation that the Question Generation task has traditionally been considered as the dual problem of Question Answering, hence tackling the problem of generating a question given the text that contains its answer. Such questions can be used to evaluate machine reading comprehension. However, i… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 13 pages, 3 figures

  23. arXiv:1909.01610  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Answers Unite! Unsupervised Metrics for Reinforced Summarization Models

    Authors: Thomas Scialom, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Abstractive summarization approaches based on Reinforcement Learning (RL) have recently been proposed to overcome classical likelihood maximization. RL enables to consider complex, possibly non-differentiable, metrics that globally assess the quality and relevance of the generated outputs. ROUGE, the most used summarization metric, is known to suffer from bias towards lexical similarity as well as… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted at EMNLP 2019

  24. arXiv:1810.03660  [pdf, other

    cs.CL cs.CY

    DepecheMood++: a Bilingual Emotion Lexicon Built Through Simple Yet Powerful Techniques

    Authors: Oscar Araque, Lorenzo Gatti, Jacopo Staiano, Marco Guerini

    Abstract: Several lexica for sentiment analysis have been developed and made available in the NLP community. While most of these come with word polarity annotations (e.g. positive/negative), attempts at building lexica for finer-grained emotion analysis (e.g. happiness, sadness) have recently attracted significant attention. Such lexica are often exploited as a building block in the process of developing le… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 12 pages, 2 figures

  25. arXiv:1704.00939  [pdf, other

    cs.CL cs.CY

    Fortia-FBK at SemEval-2017 Task 5: Bullish or Bearish? Inferring Sentiment towards Brands from Financial News Headlines

    Authors: Youness Mansar, Lorenzo Gatti, Sira Ferradans, Marco Guerini, Jacopo Staiano

    Abstract: In this paper, we describe a methodology to infer Bullish or Bearish sentiment towards companies/brands. More specifically, our approach leverages affective lexica and word embeddings in combination with convolutional neural networks to infer the sentiment of financial news headlines towards a target company. Such architecture was used and evaluated in the context of the SemEval 2017 challenge (ta… ▽ More

    Submitted 4 April, 2017; originally announced April 2017.

    Comments: 6 pages, 1 figure; accepted for publication at the International Workshop on Semantic Evaluation (SemEval-2017) to be held in conjunction with ACL 2017

  26. arXiv:1612.00323  [pdf, other

    cs.CY physics.soc-ph

    The Tyranny of Data? The Bright and Dark Sides of Data-Driven Decision-Making for Social Good

    Authors: Bruno Lepri, Jacopo Staiano, David Sangokoya, Emmanuel Letouzé, Nuria Oliver

    Abstract: The unprecedented availability of large-scale human behavioral data is profoundly changing the world we live in. Researchers, companies, governments, financial institutions, non-governmental organizations and also citizen groups are actively experimenting, innovating and adapting algorithmic decision-making tools to understand global patterns of human behavior and provide decision support to tackl… ▽ More

    Submitted 2 December, 2016; v1 submitted 1 December, 2016; originally announced December 2016.

    Comments: preprint version; book chapter to appear in "Transparent Data Mining for Big and Small Data", Studies in Big Data Series, Springer

  27. arXiv:1603.04012  [pdf, other

    cs.CY cs.SI physics.soc-ph

    The Death and Life of Great Italian Cities: A Mobile Phone Data Perspective

    Authors: Marco De Nadai, Jacopo Staiano, Roberto Larcher, Nicu Sebe, Daniele Quercia, Bruno Lepri

    Abstract: The Death and Life of Great American Cities was written in 1961 and is now one of the most influential book in city planning. In it, Jane Jacobs proposed four conditions that promote life in a city. However, these conditions have not been empirically tested until recently. This is mainly because it is hard to collect data about "city life". The city of Seoul recently collected pedestrian activity… ▽ More

    Submitted 13 March, 2016; originally announced March 2016.

    Comments: In Proceedings of the 26th International ACM Conference on World Wide Web (WWW), 2016

  28. arXiv:1506.06882  [pdf, other

    cs.CV

    SALSA: A Novel Dataset for Multimodal Group Behavior Analysis

    Authors: Xavier Alameda-Pineda, Jacopo Staiano, Ramanathan Subramanian, Ligia Batrinca, Elisa Ricci, Bruno Lepri, Oswald Lanz, Nicu Sebe

    Abstract: Studying free-standing conversational groups (FCGs) in unstructured social settings (e.g., cocktail party ) is gratifying due to the wealth of information available at the group (mining social networks) and individual (recognizing native behavioral and personality traits) levels. However, analyzing social scenes involving FCGs is also highly challenging due to the difficulty in extracting behavior… ▽ More

    Submitted 23 June, 2015; originally announced June 2015.

    Comments: 14 pages, 11 figures

  29. arXiv:1503.04723  [pdf, other

    cs.SI cs.CL cs.CY

    Deep Feelings: A Massive Cross-Lingual Study on the Relation between Emotions and Virality

    Authors: Marco Guerini, Jacopo Staiano

    Abstract: This article provides a comprehensive investigation on the relations between virality of news articles and the emotions they are found to evoke. Virality, in our view, is a phenomenon with many facets, i.e. under this generic term several different effects of persuasive communication are comprised. By exploiting a high-coverage and bilingual corpus of documents containing metrics of their spread o… ▽ More

    Submitted 16 March, 2015; originally announced March 2015.

    Comments: preprint version of WWW 2015 'Web Science Track' paper

  30. arXiv:1409.2983  [pdf, other

    cs.CY cs.SI physics.soc-ph

    Once Upon a Crime: Towards Crime Prediction from Demographics and Mobile Data

    Authors: Andrey Bogomolov, Bruno Lepri, Jacopo Staiano, Nuria Oliver, Fabio Pianesi, Alex Pentland

    Abstract: In this paper, we present a novel approach to predict crime in a geographic space from multiple data sources, in particular mobile phone and demographic data. The main contribution of the proposed approach lies in using aggregated and anonymized human behavioral data derived from mobile network activity to tackle the crime prediction problem. While previous research efforts have used either backgr… ▽ More

    Submitted 10 September, 2014; originally announced September 2014.

    Comments: 10 pages, 3 figures. To appear in ACM International Conference on Multimodal Interaction (ICMI 2014)

  31. Money Walks: A Human-Centric Study on the Economics of Personal Mobile Data

    Authors: Jacopo Staiano, Nuria Oliver, Bruno Lepri, Rodrigo de Oliveira, Michele Caraviello, Nicu Sebe

    Abstract: In the context of a myriad of mobile apps which collect personally identifiable information (PII) and a prospective market place of personal data, we investigate a user-centric monetary valuation of mobile PII. During a 6-week long user study in a living lab deployment with 60 participants, we collected their daily valuations of 4 categories of mobile PII (communication, e.g. phonecalls made/recei… ▽ More

    Submitted 10 July, 2014; v1 submitted 2 July, 2014; originally announced July 2014.

    Comments: 15 pages, 2 figures. To appear in ACM International Joint Conference on Pervasive and Ubiquitous Computing (Ubicomp 2014)

    ACM Class: K.6.0

  32. arXiv:1405.1605  [pdf, ps, other

    cs.CL cs.CY

    DepecheMood: a Lexicon for Emotion Analysis from Crowd-Annotated News

    Authors: Jacopo Staiano, Marco Guerini

    Abstract: While many lexica annotated with words polarity are available for sentiment analysis, very few tackle the harder task of emotion analysis and are usually quite limited in coverage. In this paper, we present a novel approach for extracting - in a totally automated way - a high-coverage and high-precision lexicon of roughly 37 thousand terms annotated with emotion scores, called DepecheMood. Our app… ▽ More

    Submitted 7 May, 2014; originally announced May 2014.

    Comments: To appear at ACL 2014. 7 pages

  33. arXiv:1309.3908  [pdf, ps, other

    cs.SI cs.CY cs.MM physics.soc-ph

    Exploring Image Virality in Google Plus

    Authors: Marco Guerini, Jacopo Staiano, Davide Albanese

    Abstract: Reactions to posts in an online social network show different dynamics depending on several textual features of the corresponding content. Do similar dynamics exist when images are posted? Exploiting a novel dataset of posts, gathered from the most popular Google+ users, we try to give an answer to such a question. We describe several virality phenomena that emerge when taking into account visual… ▽ More

    Submitted 16 September, 2013; originally announced September 2013.

    Comments: 8 pages, 8 figures. IEEE/ASE SocialCom 2013