Skip to main content

Showing 1–48 of 48 results for author: Lapuschkin, S

  1. arXiv:2405.20331  [pdf, other

    cs.LG cs.AI cs.CL

    CoSy: Evaluating Textual Explanations of Neurons

    Authors: Laura Kopf, Philine Lou Bommer, Anna Hedström, Sebastian Lapuschkin, Marina M. -C. Höhne, Kirill Bykov

    Abstract: A crucial aspect of understanding the complex nature of Deep Neural Networks (DNNs) is the ability to explain learned concepts within their latent representations. While various methods exist to connect neurons to textual descriptions of human-understandable concepts, evaluating the quality of these explanation methods presents a major challenge in the field due to a lack of unified, general-purpo… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 figures

  2. arXiv:2405.02383  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    A Fresh Look at Sanity Checks for Saliency Maps

    Authors: Anna Hedström, Leander Weber, Sebastian Lapuschkin, Marina Höhne

    Abstract: The Model Parameter Randomisation Test (MPRT) is highly recognised in the eXplainable Artificial Intelligence (XAI) community due to its fundamental evaluative criterion: explanations should be sensitive to the parameters of the model they seek to explain. However, recent studies have raised several methodological concerns for the empirical interpretation of MPRT. In response, we propose two modif… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.06465

  3. arXiv:2404.10433  [pdf, other

    cs.CV cs.AI cs.LG

    Explainable concept mappings of MRI: Revealing the mechanisms underlying deep learning-based brain disease classification

    Authors: Christian Tinauer, Anna Damulina, Maximilian Sackl, Martin Soellradl, Reduan Achtibat, Maximilian Dreyer, Frederik Pahde, Sebastian Lapuschkin, Reinhold Schmidt, Stefan Ropele, Wojciech Samek, Christian Langkammer

    Abstract: Motivation. While recent studies show high accuracy in the classification of Alzheimer's disease using deep neural networks, the underlying learned concepts have not been investigated. Goals. To systematically identify changes in brain regions through concepts learned by the deep neural network for model validation. Approach. Using quantitative R2* maps we separated Alzheimer's patients (n=117… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  4. arXiv:2404.09601  [pdf, other

    cs.LG cs.AI cs.CV

    Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression

    Authors: Dilyara Bareeva, Maximilian Dreyer, Frederik Pahde, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Deep Neural Networks are prone to learning and relying on spurious correlations in the training data, which, for high-risk applications, can have fatal consequences. Various approaches to suppress model reliance on harmful features have been proposed that can be applied post-hoc without additional training. Whereas those methods can be applied with efficiency, they also tend to harm model performa… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  5. arXiv:2404.06453  [pdf, other

    cs.CV cs.AI cs.LG

    PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits

    Authors: Maximilian Dreyer, Erblina Purelku, Johanna Vielhaben, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The field of mechanistic interpretability aims to study the role of individual neurons in Deep Neural Networks. Single neurons, however, have the capability to act polysemantically and encode for multiple (unrelated) features, which renders their interpretation difficult. We present a method for disentangling polysemanticity of any Deep Neural Network by decomposing a polysemantic neuron into mult… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 14 pages (4 pages manuscript, 2 pages references, 8 pages appendix)

  6. arXiv:2402.12118  [pdf, other

    cs.LG cs.AI

    DualView: Data Attribution from the Dual Perspective

    Authors: Galip Ümit Yolcu, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Local data attribution (or influence estimation) techniques aim at estimating the impact that individual data points seen during training have on particular predictions of an already trained Machine Learning model during test time. Previous methods either do not perform well consistently across different evaluation criteria from literature, are characterized by a high computational demand, or suff… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  7. arXiv:2402.05602  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers

    Authors: Reduan Achtibat, Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Aakriti Jain, Thomas Wiegand, Sebastian Lapuschkin, Wojciech Samek

    Abstract: Large Language Models are prone to biased predictions and hallucinations, underlining the paramount importance of understanding their model-internal reasoning process. However, achieving faithful attributions for the entirety of a black-box transformer model and maintaining computational efficiency is an unsolved challenge. By extending the Layer-wise Relevance Propagation attribution method to ha… ▽ More

    Submitted 10 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  8. arXiv:2401.17441  [pdf, other

    cs.LG cs.AI stat.ML

    Explaining Predictive Uncertainty by Exposing Second-Order Effects

    Authors: Florian Bley, Sebastian Lapuschkin, Wojciech Samek, Grégoire Montavon

    Abstract: Explainable AI has brought transparency into complex ML blackboxes, enabling, in particular, to identify which features these models use for their predictions. So far, the question of explaining predictive uncertainty, i.e. why a model 'doubts', has been scarcely studied. Our investigation reveals that predictive uncertainty is dominated by second-order effects, involving single features or produc… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 12 pages + supplement

  9. arXiv:2401.06465  [pdf, other

    cs.AI cs.LG stat.ME

    Sanity Checks Revisited: An Exploration to Repair the Model Parameter Randomisation Test

    Authors: Anna Hedström, Leander Weber, Sebastian Lapuschkin, Marina MC Höhne

    Abstract: The Model Parameter Randomisation Test (MPRT) is widely acknowledged in the eXplainable Artificial Intelligence (XAI) community for its well-motivated evaluative principle: that the explanation function should be sensitive to changes in the parameters of the model function. However, recent works have identified several methodological caveats for the empirical interpretation of MPRT. To address the… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 19 pages, 12 figures, NeurIPS XAIA 2023

  10. arXiv:2311.16681  [pdf, other

    cs.CV cs.AI

    Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations

    Authors: Maximilian Dreyer, Reduan Achtibat, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Ensuring both transparency and safety is critical when deploying Deep Neural Networks (DNNs) in high-risk applications, such as medicine. The field of explainable AI (XAI) has proposed various methods to comprehend the decision-making processes of opaque DNNs. However, only few XAI methods are suitable of ensuring safety in practice as they heavily rely on repeated labor-intensive and possibly bia… ▽ More

    Submitted 29 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 39 pages (8 pages manuscript, 3 pages references, 28 pages appendix)

  11. arXiv:2310.17638  [pdf, other

    cs.LG stat.ML

    Generative Fractional Diffusion Models

    Authors: Gabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek

    Abstract: We introduce the first continuous-time score-based generative model that leverages fractional diffusion processes for its underlying dynamics. Although diffusion models have excelled at capturing data distributions, they still suffer from various limitations such as slow convergence, mode-collapse on imbalanced data, and lack of diversity. These issues are partially linked to the use of light-tail… ▽ More

    Submitted 24 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    ACM Class: I.2.4; F.4.1; G.3

  12. Human-Centered Evaluation of XAI Methods

    Authors: Karam Dawoud, Wojciech Samek, Peter Eisert, Sebastian Lapuschkin, Sebastian Bosse

    Abstract: In the ever-evolving field of Artificial Intelligence, a critical challenge has been to decipher the decision-making processes within the so-called "black boxes" in deep learning. Over recent years, a plethora of methods have emerged, dedicated to explaining decisions across diverse tasks. Particularly in tasks like image classification, these methods typically identify and emphasize the pivotal p… ▽ More

    Submitted 16 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Journal ref: ICDMW (2023) 912-921

  13. arXiv:2308.12053  [pdf, other

    cs.LG cs.AI cs.NE

    Layer-wise Feedback Propagation

    Authors: Leander Weber, Jim Berend, Alexander Binder, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: In this paper, we present Layer-wise Feedback Propagation (LFP), a novel training approach for neural-network-like predictors that utilizes explainability, specifically Layer-wise Relevance Propagation(LRP), to assign rewards to individual connections based on their respective contributions to solving a given task. This differs from traditional gradient descent, which updates parameters towards an… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    MSC Class: 68T05

  14. arXiv:2308.09437  [pdf, other

    cs.LG cs.AI cs.CV cs.CY

    From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space

    Authors: Maximilian Dreyer, Frederik Pahde, Christopher J. Anders, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Deep Neural Networks are prone to learning spurious correlations embedded in the training data, leading to potentially biased predictions. This poses risks when deploying these models for high-stake decision-making, such as in medical applications. Current methods for post-hoc model correction either require input-level annotations which are only possible for spatially localized biases, or augment… ▽ More

    Submitted 18 December, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 35 pages (9 pages manuscript, 2 pages references, 24 pages appendix)

  15. arXiv:2304.14019  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    XAI-based Comparison of Input Representations for Audio Event Classification

    Authors: Annika Frommholz, Fabian Seipel, Sebastian Lapuschkin, Wojciech Samek, Johanna Vielhaben

    Abstract: Deep neural networks are a promising tool for Audio Event Classification. In contrast to other data like natural images, there are many sensible and non-obvious representations for audio data, which could serve as input to these models. Due to their black-box nature, the effect of different input representations has so far mostly been investigated by measuring classification performance. In this w… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 figures

  16. Bridging the Gap: Gaze Events as Interpretable Concepts to Explain Deep Neural Sequence Models

    Authors: Daniel G. Krakowczyk, Paul Prasse, David R. Reich, Sebastian Lapuschkin, Tobias Scheffer, Lena A. Jäger

    Abstract: Recent work in XAI for eye tracking data has evaluated the suitability of feature attribution methods to explain the output of deep neural sequence models for the task of oculomotric biometric identification. These methods provide saliency maps to highlight important input features of a specific eye gaze sequence. However, to date, its localization analysis has been lacking a quantitative approach… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Preprint for ETRA '23: 2023 Symposium on Eye Tracking Research and Applications

  17. arXiv:2303.12641  [pdf, other

    cs.CV cs.AI

    Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models

    Authors: Frederik Pahde, Maximilian Dreyer, Wojciech Samek, Sebastian Lapuschkin

    Abstract: State-of-the-art machine learning models often learn spurious correlations embedded in the training data. This poses risks when deploying these models for high-stake decision-making, such as in medical applications like skin cancer detection. To tackle this problem, we propose Reveal to Revise (R2R), a framework entailing the entire eXplainable Artificial Intelligence (XAI) life cycle, enabling pr… ▽ More

    Submitted 27 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  18. arXiv:2303.06365  [pdf, other

    cs.LG cs.AI cs.CV

    Explainable AI for Time Series via Virtual Inspection Layers

    Authors: Johanna Vielhaben, Sebastian Lapuschkin, Grégoire Montavon, Wojciech Samek

    Abstract: The field of eXplainable Artificial Intelligence (XAI) has greatly advanced in recent years, but progress has mainly been made in computer vision and natural language processing. For time series, where the input is often not interpretable, only limited research on XAI is available. In this work, we put forward a virtual inspection layer, that transforms the time series to an interpretable represen… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 13 pages, 7 figures

  19. arXiv:2302.07265  [pdf, other

    cs.LG cs.AI

    The Meta-Evaluation Problem in Explainable AI: Identifying Reliable Estimators with MetaQuantus

    Authors: Anna Hedström, Philine Bommer, Kristoffer K. Wickstrøm, Wojciech Samek, Sebastian Lapuschkin, Marina M. -C. Höhne

    Abstract: One of the unsolved challenges in the field of Explainable AI (XAI) is determining how to most reliably estimate the quality of an explanation method in the absence of ground truth explanation labels. Resolving this issue is of utmost importance as the evaluation outcomes generated by competing evaluation methods (or ''quality estimators''), which aim at measuring the same property of an explanati… ▽ More

    Submitted 19 July, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 35 pages, 15 figures, 5 tables

    Journal ref: Transactions on Machine Learning Research, Volume 2023, (2023), ISSN: 2835-8856

  20. arXiv:2211.17174  [pdf, other

    cs.CV cs.AI cs.LG

    Optimizing Explanations by Network Canonization and Hyperparameter Search

    Authors: Frederik Pahde, Galip Ümit Yolcu, Alexander Binder, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Explainable AI (XAI) is slowly becoming a key component for many AI applications. Rule-based and modified backpropagation XAI approaches however often face challenges when being applied to modern model architectures including innovative layer building blocks, which is caused by two reasons. Firstly, the high flexibility of rule-based XAI methods leads to numerous potential parameterizations. Secon… ▽ More

    Submitted 27 March, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

  21. Explaining machine learning models for age classification in human gait analysis

    Authors: Djordje Slijepcevic, Fabian Horst, Marvin Simak, Sebastian Lapuschkin, Anna-Maria Raberger, Wojciech Samek, Christian Breiteneder, Wolfgang I. Schöllhorn, Matthias Zeppelzauer, Brian Horsak

    Abstract: Machine learning (ML) models have proven effective in classifying gait analysis data, e.g., binary classification of young vs. older adults. ML models, however, lack in providing human understandable explanations for their predictions. This "black-box" behavior impedes the understanding of which input features the model predictions are based on. We investigated an Explainable Artificial Intelligen… ▽ More

    Submitted 16 October, 2022; originally announced November 2022.

    Comments: 3 pages, 1 figure

    Journal ref: Gait & Posture 97 (Supplement 1) (2022) 252-253

  22. Explaining automated gender classification of human gait

    Authors: Fabian Horst, Djordje Slijepcevic, Matthias Zeppelzauer, Anna-Maria Raberger, Sebastian Lapuschkin, Wojciech Samek, Wolfgang I. Schöllhorn, Christian Breiteneder, Brian Horsak

    Abstract: State-of-the-art machine learning (ML) models are highly effective in classifying gait analysis data, however, they lack in providing explanations for their predictions. This "black-box" characteristic makes it impossible to understand on which input patterns, ML models base their predictions. The present study investigates whether Explainable Artificial Intelligence methods, i.e., Layer-wise Rele… ▽ More

    Submitted 16 October, 2022; originally announced November 2022.

    Comments: 3 pages, 1 figure

    Journal ref: Gait & Posture 81 (Supplement 1) (2020) 159-160

  23. arXiv:2211.12486  [pdf, other

    cs.LG cs.CV

    Shortcomings of Top-Down Randomization-Based Sanity Checks for Evaluations of Deep Neural Network Explanations

    Authors: Alexander Binder, Leander Weber, Sebastian Lapuschkin, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek

    Abstract: While the evaluation of explanations is an important step towards trustworthy models, it needs to be done carefully, and the employed metrics need to be well-understood. Specifically model randomization testing is often overestimated and regarded as a sole criterion for selecting or discarding certain explanation methods. To address shortcomings of this test, we start by observing an experimental… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 23 pages

  24. arXiv:2211.11426  [pdf, other

    cs.CV cs.AI cs.LG

    Revealing Hidden Context Bias in Segmentation and Object Detection through Concept-specific Explanations

    Authors: Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Applying traditional post-hoc attribution methods to segmentation or object detection predictors offers only limited insights, as the obtained feature attribution maps at input level typically resemble the models' predicted segmentation mask or bounding box. In this work, we address the need for more informative explanations for these predictors by proposing the post-hoc eXplainable Artificial Int… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  25. From Attribution Maps to Human-Understandable Explanations through Concept Relevance Propagation

    Authors: Reduan Achtibat, Maximilian Dreyer, Ilona Eisenbraun, Sebastian Bosse, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The field of eXplainable Artificial Intelligence (XAI) aims to bring transparency to today's powerful but opaque deep learning models. While local XAI methods explain individual predictions in form of attribution maps, thereby identifying where important features occur (but not providing information about what they represent), global explanation techniques visualize what concepts a model has gener… ▽ More

    Submitted 6 January, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 87 pages (13 pages manuscript, 8 pages references, 66 pages appendix) 63 figures (6 in manuscript, 57 in appendix) 3 tables (in appendix)

    Journal ref: Nature Machine Intelligence (year 2023, volume 5, pages 1006-1019)

  26. arXiv:2205.01929  [pdf, other

    cs.LG

    Explain to Not Forget: Defending Against Catastrophic Forgetting with XAI

    Authors: Sami Ede, Serop Baghdadlian, Leander Weber, An Nguyen, Dario Zanca, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The ability to continuously process and retain new information like we do naturally as humans is a feat that is highly sought after when training neural networks. Unfortunately, the traditional optimization algorithms often require large amounts of data available during training time and updates wrt. new data are difficult after the training process has been completed. In fact, when new data or ta… ▽ More

    Submitted 22 June, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: 14 pages including appendix, 5 figures, 2 tables, 1 algorithm listing. v2 update increases figure readability, updates Fig 5 caption, adds our collaborators Dario and An as co-authors v3 brings the preprint in line with the final version accepted for peer-reviewed publication at CD-MAKE 2022. v4 metadata update

  27. arXiv:2203.10087  [pdf, other

    cs.LG

    But that's not why: Inference adjustment by interactive prototype revision

    Authors: Michael Gerstenberger, Sebastian Lapuschkin, Peter Eisert, Sebastian Bosse

    Abstract: Despite significant advances in machine learning, decision-making of artificial agents is still not perfect and often requires post-hoc human interventions. If the prediction of a model relies on unreasonable factors it is desirable to remove their effect. Deep interactive prototype adjustment enables the user to give hints and correct the model's reasoning. In this paper, we demonstrate that prot… ▽ More

    Submitted 9 October, 2023; v1 submitted 18 March, 2022; originally announced March 2022.

  28. arXiv:2203.08008  [pdf, other

    cs.LG

    Beyond Explaining: Opportunities and Challenges of XAI-Based Model Improvement

    Authors: Leander Weber, Sebastian Lapuschkin, Alexander Binder, Wojciech Samek

    Abstract: Explainable Artificial Intelligence (XAI) is an emerging research field bringing transparency to highly complex and opaque machine learning (ML) models. Despite the development of a multitude of methods to explain the decisions of black-box classifiers in recent years, these tools are seldomly used beyond visualization purposes. Only recently, researchers have started to employ explanations in pra… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  29. arXiv:2202.06861  [pdf, other

    cs.LG

    Quantus: An Explainable AI Toolkit for Responsible Evaluation of Neural Network Explanations and Beyond

    Authors: Anna Hedström, Leander Weber, Dilyara Bareeva, Daniel Krakowczyk, Franz Motzkus, Wojciech Samek, Sebastian Lapuschkin, Marina M. -C. Höhne

    Abstract: The evaluation of explanation methods is a research topic that has not yet been explored deeply, however, since explainability is supposed to strengthen trust in artificial intelligence, it is necessary to systematically review and compare explanation methods in order to confirm their correctness. Until now, no tool with focus on XAI evaluation exists that exhaustively and speedily allows research… ▽ More

    Submitted 27 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 4 pages, 1 figure, 1 table

    Journal ref: Journal of Machine Learning Research, Vol. 24 (2023) 1-11

  30. arXiv:2202.06621  [pdf, other

    cs.LG cs.AI

    Measurably Stronger Explanation Reliability via Model Canonization

    Authors: Franz Motzkus, Leander Weber, Sebastian Lapuschkin

    Abstract: While rule-based attribution methods have proven useful for providing local explanations for Deep Neural Networks, explaining modern and more varied network architectures yields new challenges in generating trustworthy explanations, since the established rule sets might not be sufficient or applicable to novel network structures. As an elegant solution to the above issue, network canonization has… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 5 pages, 4 figures

  31. arXiv:2202.03482  [pdf, other

    cs.CV cs.AI cs.LG

    Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

    Authors: Frederik Pahde, Maximilian Dreyer, Leander Weber, Moritz Weckbecker, Christopher J. Anders, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: With a growing interest in understanding neural network prediction strategies, Concept Activation Vectors (CAVs) have emerged as a popular tool for modeling human-understandable concepts in the latent space. Commonly, CAVs are computed by leveraging linear classifiers optimizing the separability of latent representations of samples with and without a given concept. However, in this paper we show t… ▽ More

    Submitted 5 February, 2024; v1 submitted 7 February, 2022; originally announced February 2022.

  32. ECQ$^{\text{x}}$: Explainability-Driven Quantization for Low-Bit and Sparse DNNs

    Authors: Daniel Becking, Maximilian Dreyer, Wojciech Samek, Karsten Müller, Sebastian Lapuschkin

    Abstract: The remarkable success of deep neural networks (DNNs) in various applications is accompanied by a significant increase in network parameters and arithmetic operations. Such increases in memory and computational demands make deep learning prohibitive for resource-constrained hardware platforms such as mobile devices. Recent efforts aim to reduce these overheads, while preserving model performance a… ▽ More

    Submitted 16 February, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: 22 pages, 10 figures, 1 table

    Journal ref: xxAI - Beyond Explainable AI, Lecture Notes in Computer Science (LNAI Vol. 13200), Springer International Publishing, 2022

  33. arXiv:2106.13200  [pdf, other

    cs.LG

    Software for Dataset-wide XAI: From Local Explanations to Global Insights with Zennit, CoRelAy, and ViRelAy

    Authors: Christopher J. Anders, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

    Abstract: Deep Neural Networks (DNNs) are known to be strong predictors, but their prediction strategies can rarely be understood. With recent advances in Explainable Artificial Intelligence (XAI), approaches are available to explore the reasoning behind those complex models' predictions. Among post-hoc attribution methods, Layer-wise Relevance Propagation (LRP) shows high performance. For deeper quantitati… ▽ More

    Submitted 28 February, 2023; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: 20 pages, 6 figures, 2 listings, 1 table

  34. arXiv:2007.08790  [pdf, other

    cs.CV cs.LG

    Explanation-Guided Training for Cross-Domain Few-Shot Classification

    Authors: Jiamei Sun, Sebastian Lapuschkin, Wojciech Samek, Yunqing Zhao, Ngai-Man Cheung, Alexander Binder

    Abstract: Cross-domain few-shot classification task (CD-FSC) combines few-shot classification with the requirement to generalize across domains represented by datasets. This setup faces challenges originating from the limited labeled data in each class and, additionally, from the domain shift between training and test sets. In this paper, we introduce a novel training approach for existing FSC models. It le… ▽ More

    Submitted 9 December, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Journal ref: Proceedings of the 25th International Conference on Pattern Recognition 2021

  35. Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

    Authors: Gary S. W. Goh, Sebastian Lapuschkin, Leander Weber, Wojciech Samek, Alexander Binder

    Abstract: Integrated Gradients as an attribution method for deep neural network models offers simple implementability. However, it suffers from noisiness of explanations which affects the ease of interpretability. The SmoothGrad technique is proposed to solve the noisiness issue and smoothen the attribution maps of any gradient-based attribution method. In this paper, we present SmoothTaylor as a novel theo… ▽ More

    Submitted 2 September, 2021; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: 8 pages, 3 figures. Accepted in 25th International Conference on Pattern Recognition, (ICPR) 2020. In Proceedings: pp. 4949-4956

  36. arXiv:2003.07631  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications

    Authors: Wojciech Samek, Grégoire Montavon, Sebastian Lapuschkin, Christopher J. Anders, Klaus-Robert Müller

    Abstract: With the broader and highly successful usage of machine learning in industry and the sciences, there has been a growing demand for Explainable AI. Interpretability and explanation methods for gaining a better understanding about the problem solving abilities and strategies of nonlinear Machine Learning, in particular, deep neural networks, are therefore receiving increased attention. In this work… ▽ More

    Submitted 25 February, 2021; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: 30 pages, 20 figures

  37. arXiv:2001.01037  [pdf, other

    cs.CV cs.CL cs.LG

    Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning Models

    Authors: Jiamei Sun, Sebastian Lapuschkin, Wojciech Samek, Alexander Binder

    Abstract: This paper analyzes the predictions of image captioning models with attention mechanisms beyond visualizing the attention itself. We develop variants of layer-wise relevance propagation (LRP) and gradient-based explanation methods, tailored to image captioning models with attention mechanisms. We compare the interpretability of attention heatmaps systematically against the explanations provided by… ▽ More

    Submitted 1 August, 2021; v1 submitted 4 January, 2020; originally announced January 2020.

  38. arXiv:1912.11425  [pdf, other

    cs.CV cs.LG cs.NE eess.IV

    Finding and Removing Clever Hans: Using Explanation Methods to Debug and Improve Deep Models

    Authors: Christopher J. Anders, Leander Weber, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

    Abstract: Contemporary learning models for computer vision are typically trained on very large (benchmark) datasets with millions of samples. These may, however, contain biases, artifacts, or errors that have gone unnoticed and are exploitable by the model. In the worst case, the trained model does not learn a valid and generalizable strategy to solve the problem it was trained for, and becomes a 'Clever-Ha… ▽ More

    Submitted 18 December, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

    Comments: 47 pages, 21 figures

  39. Pruning by Explaining: A Novel Criterion for Deep Neural Network Pruning

    Authors: Seul-Ki Yeom, Philipp Seegerer, Sebastian Lapuschkin, Alexander Binder, Simon Wiedemann, Klaus-Robert Müller, Wojciech Samek

    Abstract: The success of convolutional neural networks (CNNs) in various applications is accompanied by a significant increase in computation and parameter storage costs. Recent efforts to reduce these overheads involve pruning and compressing the weights of various layers while at the same time aiming to not sacrifice performance. In this paper, we propose a novel criterion for CNN pruning inspired by neur… ▽ More

    Submitted 12 March, 2021; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: 25 pages + 5 supplementary pages, 13 figures, 6 tables

    Journal ref: Pattern Recognition, Volume 115, pp.107899, 2021

  40. arXiv:1912.07737  [pdf, other

    cs.LG stat.ML

    On the Explanation of Machine Learning Predictions in Clinical Gait Analysis

    Authors: Djordje Slijepcevic, Fabian Horst, Sebastian Lapuschkin, Anna-Maria Raberger, Matthias Zeppelzauer, Wojciech Samek, Christian Breiteneder, Wolfgang I. Schöllhorn, Brian Horsak

    Abstract: Machine learning (ML) is increasingly used to support decision-making in the healthcare sector. While ML approaches provide promising results with regard to their classification performance, most share a central limitation, namely their black-box character. Motivated by the interest to understand the functioning of ML models, methods from the field of Explainable Artificial Intelligence (XAI) have… ▽ More

    Submitted 19 August, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: 37 pages, 7 figures, 2 tables, 24 supplementary figures, 1 supplementary table

  41. arXiv:1910.09840  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Towards Best Practice in Explaining Neural Network Decisions with LRP

    Authors: Maximilian Kohlbrenner, Alexander Bauer, Shinichi Nakajima, Alexander Binder, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Within the last decade, neural network based predictors have demonstrated impressive - and at times super-human - capabilities. This performance is often paid for with an intransparent prediction process and thus has sparked numerous contributions in the novel field of explainable artificial intelligence (XAI). In this paper, we focus on a popular and widely used method of XAI, the Layer-wise Rele… ▽ More

    Submitted 13 July, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 7 pages, 4 figures, 1 table. fixed table row compared to v2. Presented virtually at IJCNN 2020

  42. arXiv:1908.06943  [pdf, other

    eess.IV cs.CV q-bio.QM

    Resolving challenges in deep learning-based analyses of histopathological images using explanation methods

    Authors: Miriam Hägele, Philipp Seegerer, Sebastian Lapuschkin, Michael Bockmayr, Wojciech Samek, Frederick Klauschen, Klaus-Robert Müller, Alexander Binder

    Abstract: Deep learning has recently gained popularity in digital pathology due to its high prediction quality. However, the medical domain requires explanation and insight for a better understanding beyond standard quantitative performance evaluation. Recently, explanation methods have emerged, which are so far still rarely used in medicine. This work shows their application to generate heatmaps that allow… ▽ More

    Submitted 24 April, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

    Journal ref: Sci Rep 10, 6423 (2020)

  43. arXiv:1902.10178  [pdf, other

    cs.AI cs.CV cs.LG cs.NE stat.ML

    Unmasking Clever Hans Predictors and Assessing What Machines Really Learn

    Authors: Sebastian Lapuschkin, Stephan Wäldchen, Alexander Binder, Grégoire Montavon, Wojciech Samek, Klaus-Robert Müller

    Abstract: Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly "intelligent" behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighte… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: Accepted for publication in Nature Communications

  44. Explaining the Unique Nature of Individual Gait Patterns with Deep Learning

    Authors: Fabian Horst, Sebastian Lapuschkin, Wojciech Samek, Klaus-Robert Müller, Wolfgang I. Schöllhorn

    Abstract: Machine learning (ML) techniques such as (deep) artificial neural networks (DNN) are solving very successfully a plethora of tasks and provide new predictive models for complex physical, chemical, biological and social systems. However, in most cases this comes with the disadvantage of acting as a black box, rarely providing information about what made them arrive at a particular prediction. This… ▽ More

    Submitted 27 February, 2019; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: 17 pages (23 pages including references, 24 pages including references and auxiliary statements, 33 pages including references, auxiliary statements and and supplementary material). 5 figures, 3 tables, 4 supplementary figures, 9 supplementary tables. Accepted for publication at Scientific Reports: https://doi.org/10.1038/s41598-019-38748-8

  45. arXiv:1808.04260  [pdf, other

    cs.LG stat.ML

    iNNvestigate neural networks!

    Authors: Maximilian Alber, Sebastian Lapuschkin, Philipp Seegerer, Miriam Hägele, Kristof T. Schütt, Grégoire Montavon, Wojciech Samek, Klaus-Robert Müller, Sven Dähne, Pieter-Jan Kindermans

    Abstract: In recent years, deep neural networks have revolutionized many application domains of machine learning and are key components of many critical decision or predictive processes. Therefore, it is crucial that domain specialists can understand and analyze actions and pre- dictions, even of the most complex neural network architectures. Despite these arguments neural networks are often treated as blac… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

  46. arXiv:1807.03418  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    AudioMNIST: Exploring Explainable Artificial Intelligence for Audio Analysis on a Simple Benchmark

    Authors: Sören Becker, Johanna Vielhaben, Marcel Ackermann, Klaus-Robert Müller, Sebastian Lapuschkin, Wojciech Samek

    Abstract: Explainable Artificial Intelligence (XAI) is targeted at understanding how models perform feature selection and derive their classification decisions. This paper explores post-hoc explanations for deep neural networks in the audio domain. Notably, we present a novel Open Source audio dataset consisting of 30,000 audio samples of English spoken digits which we use for classification tasks on spoken… ▽ More

    Submitted 27 November, 2023; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: 10 pages, 5 figures, 1 table

  47. arXiv:1708.07689  [pdf, other

    stat.ML cs.AI cs.CV cs.IR cs.LG

    Understanding and Comparing Deep Neural Networks for Age and Gender Classification

    Authors: Sebastian Lapuschkin, Alexander Binder, Klaus-Robert Müller, Wojciech Samek

    Abstract: Recently, deep neural networks have demonstrated excellent performances in recognizing the age and gender on human face images. However, these models were applied in a black-box manner with no information provided about which facial features are actually used for prediction and how these features depend on image preprocessing, model initialization and architecture choice. We present a study invest… ▽ More

    Submitted 25 August, 2017; originally announced August 2017.

    Comments: 8 pages, 5 figures, 5 tables. Presented at ICCV 2017 Workshop: 7th IEEE International Workshop on Analysis and Modeling of Faces and Gestures

    MSC Class: 68

  48. arXiv:1611.08191  [pdf, other

    stat.ML cs.LG

    Interpreting the Predictions of Complex ML Models by Layer-wise Relevance Propagation

    Authors: Wojciech Samek, Grégoire Montavon, Alexander Binder, Sebastian Lapuschkin, Klaus-Robert Müller

    Abstract: Complex nonlinear models such as deep neural network (DNNs) have become an important tool for image classification, speech recognition, natural language processing, and many other fields of application. These models however lack transparency due to their complex nonlinear structure and to the complex data distributions to which they typically apply. As a result, it is difficult to fully characteri… ▽ More

    Submitted 24 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems