Showing 1–33 of 33 results for author: Lecue, F

  1. arXiv:2407.10794  [pdf, other]

    cs.CL cs.AI

    Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: Knowledge graphs (KGs) are crucial in the field of artificial intelligence and are widely applied in downstream tasks, such as enhancing Question Answering (QA) systems. The construction of KGs typically requires significant effort from domain experts. Recently, Large Language Models (LLMs) have been used for knowledge graph construction (KGC); however, most existing approaches focus on a local pe…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 24 pages, 11 figures, 13 tables. arXiv admin note: substantial text overlap with arXiv:2402.14293

  2. arXiv:2407.04173  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Quantifying Prediction Consistency Under Model Multiplicity in Tabular LLMs

    Authors: Faisal Hamman, Pasan Dissanayake, Saumitra Mishra, Freddy Lecue, Sanghamitra Dutta

    Abstract: Fine-tuning large language models (LLMs) on limited tabular data for classification tasks can lead to fine-tuning multiplicity, where equally well-performing models make conflicting predictions on the same inputs due to variations in the training process (i.e., seed, random weight initialization, retraining on additional or deleted samples). This raises critical concerns about the robustn…

    Submitted 4 July, 2024; originally announced July 2024.

  3. arXiv:2406.13427  [pdf, other]

    cs.LG

    Are Logistic Models Really Interpretable?

    Authors: Danial Dervovic, Freddy Lécué, Nicolás Marchesotti, Daniele Magazzeni

    Abstract: The demand for open and trustworthy AI models points towards widespread publishing of model weights. Consumers of these model weights must be able to act accordingly with the information provided. That said, one of the simplest AI classification models, Logistic Regression (LR), has an unwieldy interpretation of its model weights, with greater difficulties when extending LR to generalised additive…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 36 pages, 5 Figures. Extended version of paper accepted to IJCAI 2024. arXiv admin note: substantial text overlap with arXiv:2211.06360

  4. arXiv:2406.02625  [pdf, other]

    cs.LG cs.AI stat.ML

    Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions

    Authors: Sanjay Kariyappa, Freddy Lécué, Saumitra Mishra, Christopher Pond, Daniele Magazzeni, Manuela Veloso

    Abstract: This paper proposes Progressive Inference - a framework to compute input attributions to explain the predictions of decoder-only sequence classification models. Our work is based on the insight that the classification head of a decoder-only Transformer model can be used to make intermediate predictions by evaluating them at different points in the input sequence. Due to the causal attention mechan…

    Submitted 3 June, 2024; originally announced June 2024.

  5. arXiv:2405.06890  [pdf, other]

    cs.CL cs.AI

    TacoERE: Cluster-aware Compression for Event Relation Extraction

    Authors: Yong Guan, Xiaozhi Wang, Lei Hou, Juanzi Li, Jeff Pan, Jiaoyan Chen, Freddy Lecue

    Abstract: Event relation extraction (ERE) is a critical and fundamental challenge for natural language processing. Existing work mainly focuses on directly modeling the entire document, which cannot effectively handle long-range dependencies and information redundancy. To address these issues, we propose a cluster-aware compression method for improving event relation extraction (TacoERE), which explores a c…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted to LREC-COLING 2024

  6. arXiv:2404.02935  [pdf, other]

    cs.CL cs.AI cs.LG

    KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking

    Authors: Jiawei Zhang, Chejian Xu, Yu Gai, Freddy Lecue, Dawn Song, Bo Li

    Abstract: This paper introduces KnowHalu, a novel approach for detecting hallucinations in text generated by large language models (LLMs), utilizing step-wise reasoning, multi-formulation query, multi-form knowledge for factual checking, and fusion-based detection mechanism. As LLMs are increasingly applied across various domains, ensuring that their outputs are not hallucinated is critical. Recognizing the…

    Submitted 2 April, 2024; originally announced April 2024.

  7. REFRESH: Responsible and Efficient Feature Reselection Guided by SHAP Values

    Authors: Shubham Sharma, Sanghamitra Dutta, Emanuele Albini, Freddy Lecue, Daniele Magazzeni, Manuela Veloso

    Abstract: Feature selection is a crucial step in building machine learning models. This process is often achieved with accuracy as an objective, and can be cumbersome and computationally expensive for large-scale datasets. Several additional model performance characteristics such as fairness and robustness are of importance for model development. As regulations are driving the need for more trustworthy mode…

    Submitted 13 March, 2024; originally announced March 2024.

  8. arXiv:2402.14293  [pdf, other]

    cs.CL

    Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: In the domain of Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated promise in text-generation tasks. However, their educational applications, particularly for domain-specific queries, remain underexplored. This study investigates LLMs' capabilities in educational scenarios, focusing on concept graph recovery and question-answering (QA). We assess LLMs' zero-shot per…

    Submitted 22 February, 2024; originally announced February 2024.

  9. arXiv:2402.08290  [pdf, other]

    cs.LG cs.AI

    The Effect of Data Poisoning on Counterfactual Explanations

    Authors: André Artelt, Shubham Sharma, Freddy Lecué, Barbara Hammer

    Abstract: Counterfactual explanations provide a popular method for analyzing the predictions of black-box systems, and they can offer the opportunity for computational recourse by suggesting actionable changes on how to change the input to obtain a different (i.e., more favorable) system output. However, recent work highlighted their vulnerability to different types of manipulations. This work studies the…

    Submitted 21 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  10. arXiv:2401.15820  [pdf, other]

    cs.CV cs.AI

    Knowledge-Aware Neuron Interpretation for Scene Classification

    Authors: Yong Guan, Freddy Lecue, Jiaoyan Chen, Ru Li, Jeff Z. Pan

    Abstract: Although neural models have achieved remarkable performance, they still encounter doubts due to their lack of transparency. To this end, model prediction explanation is attracting more and more attention. However, current methods rarely incorporate external knowledge and still suffer from three limitations: (1) Neglecting concept completeness. Merely selecting concepts may not be sufficient for prediction.…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI2024

  11. arXiv:2401.09972  [pdf, other]

    cs.CL

    Better Explain Transformers by Illuminating Important Information

    Authors: Linxin Song, Yan Cui, Ao Luo, Freddy Lecue, Irene Li

    Abstract: Transformer-based models excel in various natural language processing (NLP) tasks, attracting countless efforts to explain their inner workings. Prior methods explain Transformers by focusing on the raw gradient and attention as token attribution scores, where non-relevant information is often considered during explanation computation, resulting in confusing results. In this work, we propose highl…

    Submitted 26 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  12. arXiv:2311.14137  [pdf, other]

    cs.LG cs.CR

    Privacy-Preserving Algorithmic Recourse

    Authors: Sikha Pentyala, Shubham Sharma, Sanjay Kariyappa, Freddy Lecue, Daniele Magazzeni

    Abstract: When individuals are subject to adverse outcomes from machine learning models, providing a recourse path to help achieve a positive outcome is desirable. Recent work has shown that counterfactual explanations - which can be used as a means of single-step recourse - are vulnerable to privacy issues, putting an individual's privacy at risk. Providing a sequential multi-step path for recourse can amp…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted at 3rd International Workshop on Explainable AI in Finance, ICAIF 2023

  13. arXiv:2311.05436  [pdf, other]

    stat.ML cs.CY cs.LG

    Fair Wasserstein Coresets

    Authors: Zikai Xiong, Niccolò Dalmasso, Shubham Sharma, Freddy Lecue, Daniele Magazzeni, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

    Abstract: Data distillation and coresets have emerged as popular approaches to generate a smaller representative set of samples for downstream learning tasks to handle large-scale datasets. At the same time, machine learning is being increasingly applied to decision-making processes at a societal level, making it imperative for modelers to address inherent biases towards subgroups present in the data. While…

    Submitted 4 June, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 28 pages, 7 figures, 7 tables

  14. Explainable Artificial Intelligence (XAI) 2.0: A Manifesto of Open Challenges and Interdisciplinary Research Directions

    Authors: Luca Longo, Mario Brcic, Federico Cabitza, Jaesik Choi, Roberto Confalonieri, Javier Del Ser, Riccardo Guidotti, Yoichi Hayashi, Francisco Herrera, Andreas Holzinger, Richard Jiang, Hassan Khosravi, Freddy Lecue, Gianclaudio Malgieri, Andrés Páez, Wojciech Samek, Johannes Schneider, Timo Speith, Simone Stumpf

    Abstract: As systems based on opaque Artificial Intelligence (AI) continue to flourish in diverse real-world applications, understanding these black box models has become paramount. In response, Explainable AI (XAI) has emerged as a field of research with practical and ethical benefits across various domains. This paper not only highlights the advancements in XAI and its application in real-world scenarios…

    Submitted 30 October, 2023; originally announced October 2023.

    ACM Class: F.2.0; H.1.2; I.2; I.2.6; K.4; K.5

    Journal ref: Information Fusion 2024

  15. arXiv:2307.04850  [pdf, other]

    cs.LG cs.AI

    SHAP@k:Efficient and Probably Approximately Correct (PAC) Identification of Top-k Features

    Authors: Sanjay Kariyappa, Leonidas Tsepenekas, Freddy Lécué, Daniele Magazzeni

    Abstract: The SHAP framework provides a principled method to explain the predictions of a model by computing feature importance. Motivated by applications in finance, we introduce the Top-k Identification Problem (TkIP), where the objective is to identify the k features with the highest SHAP values. While any method to compute SHAP values with uncertainty estimates (such as KernelSHAP and SamplingSHAP) can…

    Submitted 10 July, 2023; originally announced July 2023.

  16. arXiv:2305.08950  [pdf, other]

    cs.LG stat.ME

    Causal Analysis for Robust Interpretability of Neural Networks

    Authors: Ola Ahmad, Nicolas Bereux, Loïc Baret, Vahid Hashemi, Freddy Lecue

    Abstract: Interpreting the inner function of neural networks is crucial for the trustworthy development and deployment of these black-box models. Prior interpretability methods focus on correlation-based measures to attribute model decisions to individual examples. However, these measures are susceptible to noise and spurious correlations encoded in the model during the training phase (e.g., biased inputs,…

    Submitted 20 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  17. arXiv:2304.06832  [pdf, other]

    cs.LG

    Task Adaptive Feature Transformation for One-Shot Learning

    Authors: Imtiaz Masud Ziko, Freddy Lecue, Ismail Ben Ayed

    Abstract: We introduce a simple non-linear embedding adaptation layer, which is fine-tuned on top of fixed pre-trained features for one-shot tasks, improving significantly transductive entropy-based inference for low-shot regimes. Our norm-induced transformation could be understood as a re-parametrization of the feature space to disentangle the representations of different classes in a task specific manner.…

    Submitted 13 April, 2023; originally announced April 2023.

  18. arXiv:2211.06360  [pdf, ps, other]

    cs.LG

    Rethinking Log Odds: Linear Probability Modelling and Expert Advice in Interpretable Machine Learning

    Authors: Danial Dervovic, Nicolas Marchesotti, Freddy Lecue, Daniele Magazzeni

    Abstract: We introduce a family of interpretable machine learning models, with two broad additions: Linearised Additive Models (LAMs) which replace the ubiquitous logistic link function in General Additive Models (GAMs); and SubscaleHedge, an expert advice algorithm for combining base models trained on subsets of features called subscales. LAMs can augment any additive binary classification model equipped w…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 33 pages, 2 figures. Comments welcome

  19. arXiv:2209.15438  [pdf, other]

    cs.SE cs.LG

    Empowering the trustworthiness of ML-based critical systems through engineering activities

    Authors: Juliette Mattioli, Agnes Delaborde, Souhaiel Khalfaoui, Freddy Lecue, Henri Sohier, Frederic Jurie

    Abstract: This paper reviews the entire engineering process of trustworthy Machine Learning (ML) algorithms designed to equip critical systems with advanced analytics and decision functions. We start from the fundamental principles of ML and describe the core elements conditioning its trust, particularly through its design: namely domain specification, data engineering, design of the ML algorithms, their im…

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: This work has been supported by the French government under the "France 2030" program, as part of the SystemX Technological Research Institute

  20. arXiv:2208.12731  [pdf, other]

    cs.LG cs.AI

    Comparing Apples to Oranges: Learning Similarity Functions for Data Produced by Different Distributions

    Authors: Leonidas Tsepenekas, Ivan Brugere, Freddy Lecue, Daniele Magazzeni

    Abstract: Similarity functions measure how comparable pairs of elements are, and play a key role in a wide variety of applications, e.g., notions of Individual Fairness abiding by the seminal paradigm of Dwork et al., as well as Clustering problems. However, access to an accurate similarity function should not always be considered guaranteed, and this point was even raised by Dwork et al. For instance, it i…

    Submitted 23 October, 2023; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: Accepted at NeurIPS 2023

  21. arXiv:2208.04955  [pdf, other]

    cs.LG cs.AI math.OC

    Explainable prediction of Qcodes for NOTAMs using column generation

    Authors: Krunal Kishor Patel, Guy Desaulniers, Andrea Lodi, Freddy Lecue

    Abstract: A NOtice To AirMen (NOTAM) contains important flight route related information. To search and filter them, NOTAMs are grouped into categories called QCodes. In this paper, we develop a tool to predict, with some explanations, a Qcode for a NOTAM. We present a way to extend the interpretable binary classification using column generation proposed in Dash, Gunluk, and Wei (2018) to a multiclass text…

    Submitted 20 January, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

  22. arXiv:2203.07255  [pdf, other]

    cs.CV

    FisheyeHDK: Hyperbolic Deformable Kernel Learning for Ultra-Wide Field-of-View Image Recognition

    Authors: Ola Ahmad, Freddy Lecue

    Abstract: Conventional convolution neural networks (CNNs) trained on narrow Field-of-View (FoV) images are the state-of-the-art approaches for object recognition tasks. Some methods proposed the adaptation of CNNs to ultra-wide FoV images by learning deformable kernels. However, they are limited by the Euclidean geometry and their accuracy degrades under strong distortions caused by fisheye projections. In…

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted at AAAI22

  23. arXiv:2112.11230  [pdf, other]

    cs.LG cs.AI

    Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions

    Authors: Tom Bewley, Freddy Lecue

    Abstract: The potential of reinforcement learning (RL) to deliver aligned and performant agents is partially bottlenecked by the reward engineering problem. One alternative to heuristic trial-and-error is preference-based RL (PbRL), where a reward function is inferred from sparse human feedback. However, prior PbRL methods lack interpretability of the learned reward structure, which hampers the ability to a…

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022)

  24. arXiv:2102.10191  [pdf, other]

    cs.CV

    Adaptable Deformable Convolutions for Semantic Segmentation of Fisheye Images in Autonomous Driving Systems

    Authors: Clément Playout, Ola Ahmad, Freddy Lecue, Farida Cheriet

    Abstract: Advanced Driver-Assistance Systems rely heavily on perception tasks such as semantic segmentation where images are captured from large field of view (FoV) cameras. State-of-the-art works have made considerable progress toward applying Convolutional Neural Network (CNN) to standard (rectilinear) images. However, the large FoV cameras used in autonomous vehicles produce fisheye images characterized…

    Submitted 19 February, 2021; originally announced February 2021.

    Comments: 10 pages, 4 figures

  25. arXiv:2009.14260  [pdf, other]

    cs.LG cs.CV cs.NE

    Trustworthy Convolutional Neural Networks: A Gradient Penalized-based Approach

    Authors: Nicholas Halliwell, Freddy Lecue

    Abstract: Convolutional neural networks (CNNs) are commonly used for image classification. Saliency methods are examples of approaches that can be used to interpret CNNs post hoc, identifying the most relevant pixels for a prediction following the gradients flow. Even though CNNs can correctly classify images, the underlying saliency maps could be erroneous in many cases. This can result in skepticism as to…

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: 13 pages

  26. arXiv:2006.16917  [pdf, other]

    cs.AI cs.LG

    Ontology-guided Semantic Composition for Zero-Shot Learning

    Authors: Jiaoyan Chen, Freddy Lecue, Yuxia Geng, Jeff Z. Pan, Huajun Chen

    Abstract: Zero-shot learning (ZSL) is a popular research problem that aims at predicting for those classes that have never appeared in the training stage by utilizing the inter-class relationship with some side information. In this study, we propose to model the compositional and expressive semantics of class labels by an OWL (Web Ontology Language) ontology, and further develop a new ZSL framework with ont…

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Accepted by KR 2020 - 17th International Conference on Principles of Knowledge Representation and Reasoning

  27. arXiv:2006.14248  [pdf, other]

    cs.LG stat.ML

    Background Knowledge Injection for Interpretable Sequence Classification

    Authors: Severin Gsponer, Luca Costabello, Chan Le Van, Sumit Pai, Christophe Gueret, Georgiana Ifrim, Freddy Lecue

    Abstract: Sequence classification is the supervised learning task of building models that predict class labels of unseen sequences of symbols. Although accuracy is paramount, in certain scenarios interpretability is a must. Unfortunately, such trade-off is often hard to achieve since we lack human-independent interpretability metrics. We introduce a novel sequence learning algorithm, that combines (i) linea…

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: New Frontiers in Mining Complex Patterns 2019 @ ECML 2019, 12 pages

  28. arXiv:1908.04839  [pdf, other]

    cs.LG cs.AI

    Local Score Dependent Model Explanation for Time Dependent Covariates

    Authors: Xochitl Watts, Freddy Lecue

    Abstract: The use of deep neural networks to make high risk decisions creates a need for global and local explanations so that users and experts have confidence in the modeling algorithms. We introduce a novel technique to find global and local explanations for time series data used in binary classification machine learning systems. We identify the most salient of the original features used by a black box m…

    Submitted 13 August, 2019; originally announced August 2019.

    Comments: Work accepted as full paper for presentation at XAI (Explainable AI) workshop at Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI) 2019 in Macao, China - August 10-16, 2019

  29. arXiv:1905.13672  [pdf, other]

    cs.LG cs.AI stat.ML

    Augmenting Transfer Learning with Semantic Reasoning

    Authors: Freddy Lecue, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen

    Abstract: Transfer learning aims at building robust prediction models by transferring knowledge gained from one problem to another. In the semantic Web, learning tasks are enhanced with semantic representations. We exploit their semantics to augment transfer learning by dealing with when to transfer with semantic measurements and what to transfer with semantic embeddings. We further present a general framew…

    Submitted 23 June, 2019; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: 7 pages

    Journal ref: IJCAI 2019

  30. arXiv:1811.05245  [pdf, ps, other]

    cs.AI

    Interpretable Credit Application Predictions With Counterfactual Explanations

    Authors: Rory Mc Grath, Luca Costabello, Chan Le Van, Paul Sweeney, Farbod Kamiab, Zhao Shen, Freddy Lecue

    Abstract: We predict credit applications with off-the-shelf, interchangeable black-box classifiers and we explain single predictions with counterfactual explanations. Counterfactual explanations expose the minimal changes required on the input data to obtain a different result e.g., approved vs rejected application. Despite their effectiveness, counterfactuals are mainly designed for changing an undesired o…

    Submitted 16 November, 2018; v1 submitted 13 November, 2018; originally announced November 2018.

  31. arXiv:1807.08372  [pdf, other]

    cs.AI cs.LG

    Knowledge-based Transfer Learning Explanation

    Authors: Jiaoyan Chen, Freddy Lecue, Jeff Z. Pan, Ian Horrocks, Huajun Chen

    Abstract: Machine learning explanation can significantly boost machine learning's application in decision making, but the usability of current methods is limited in human-centric explanation, especially for transfer learning, an important machine learning branch that aims at utilizing knowledge from one learning domain (i.e., a pair of dataset and prediction task) to enhance prediction model training in ano…

    Submitted 22 July, 2018; originally announced July 2018.

    Comments: Accepted by International Conference on Principles of Knowledge Representation and Reasoning, 2018

  32. arXiv:1805.10587  [pdf, other]

    cs.AI

    Semantic Explanations of Predictions

    Authors: Freddy Lecue, Jiewen Wu

    Abstract: The main objective of explanations is to transmit knowledge to humans. This work proposes to construct informative explanations for predictions made from machine learning models. Motivated by the observations from social sciences, our approach selects data points from the training sample that exhibit special characteristics crucial for explanation, for instance, ones contrastive to the classificat…

    Submitted 27 May, 2018; originally announced May 2018.

  33. arXiv:1704.07466  [pdf, other]

    cs.AI

    Learning from Ontology Streams with Semantic Concept Drift

    Authors: Freddy Lecue, Jiaoyan Chen, Jeff Pan, Huajun Chen

    Abstract: Data stream learning has been largely studied for extracting knowledge structures from continuous and rapid data records. In the semantic Web, data is interpreted in ontologies and its ordered sequence is represented as an ontology stream. Our work exploits the semantics of such streams to tackle the problem of concept drift i.e., unexpected changes in data distribution, causing most of models to…

    Submitted 24 April, 2017; originally announced April 2017.