Skip to main content

Showing 1–50 of 90 results for author: Tabor, J

  1. arXiv:2406.18739  [pdf, other

    cs.LG

    RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets

    Authors: Piotr Gaiński, Michał Koziarski, Krzysztof Maziarz, Marwin Segler, Jacek Tabor, Marek Śmieja

    Abstract: Single-step retrosynthesis aims to predict a set of reactions that lead to the creation of a target molecule, which is a crucial task in molecular discovery. Although a target molecule can often be synthesized with multiple different reactions, it is not clear how to verify the feasibility of a reaction, because the available datasets cover only a tiny fraction of the possible solutions. Consequen… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.11443  [pdf, other

    cs.CV cs.LG

    PrAViC: Probabilistic Adaptation Framework for Real-Time Video Classification

    Authors: Magdalena Trędowicz, Łukasz Struski, Marcin Mazur, Szymon Janusz, Arkadiusz Lewicki, Jacek Tabor

    Abstract: Video processing is generally divided into two main categories: processing of the entire video, which typically yields optimal classification outcomes, and real-time processing, where the objective is to make a decision as promptly as possible. The latter is often driven by the need to identify rapidly potential critical or dangerous situations. These could include machine failure, traffic acciden… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.01755  [pdf, other

    cs.LG cs.AI

    Sparser, Better, Deeper, Stronger: Improving Sparse Training with Exact Orthogonal Initialization

    Authors: Aleksandra Irena Nowak, Łukasz Gniecki, Filip Szatkowski, Jacek Tabor

    Abstract: Static sparse training aims to train sparse models from scratch, achieving remarkable results in recent years. A key design choice is given by the sparse initialization, which determines the trainable sub-network through a binary mask. Existing methods mainly select such mask based on a predefined dense initialization. Such an approach may not efficiently leverage the mask's potential impact on th… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  4. arXiv:2405.18163  [pdf, other

    cs.GR

    NegGS: Negative Gaussian Splatting

    Authors: Artur Kasymov, Bartosz Czekaj, Marcin Mazur, Jacek Tabor, Przemysław Spurek

    Abstract: One of the key advantages of 3D rendering is its ability to simulate intricate scenes accurately. One of the most widely used methods for this purpose is Gaussian Splatting, a novel approach that is known for its rapid training and inference capabilities. In essence, Gaussian Splatting involves incorporating data about the 3D objects of interest into a series of Gaussian distributions, each of whi… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2405.17604  [pdf, other

    cs.LG cs.AI cs.CL

    LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters

    Authors: Klaudia Bałazy, Mohammadreza Banaei, Karl Aberer, Jacek Tabor

    Abstract: The recent trend in scaling language models has led to a growing demand for parameter-efficient tuning (PEFT) methods such as LoRA (Low-Rank Adaptation). LoRA consistently matches or surpasses the full fine-tuning baseline with fewer parameters. However, handling numerous task-specific or user-specific LoRA modules on top of a base model still presents significant storage challenges. To address th… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.14331  [pdf, other

    cs.CV cs.AI cs.LG

    LucidPPN: Unambiguous Prototypical Parts Network for User-centric Interpretable Computer Vision

    Authors: Mateusz Pach, Dawid Rymarczyk, Koryna Lewandowska, Jacek Tabor, Bartosz Zieliński

    Abstract: Prototypical parts networks combine the power of deep learning with the explainability of case-based reasoning to make accurate, interpretable decisions. They follow the this looks like that reasoning, representing each prototypical part with patches from training images. However, a single image patch comprises multiple visual features, such as color, shape, and texture, making it difficult for us… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Work in the review process. The code will be available upon acceptance

  7. arXiv:2403.07603  [pdf, other

    cs.LG

    ProPML: Probability Partial Multi-label Learning

    Authors: Łukasz Struski, Adam Pardyl, Jacek Tabor, Bartosz Zieliński

    Abstract: Partial Multi-label Learning (PML) is a type of weakly supervised learning where each training instance corresponds to a set of candidate labels, among which only some are true. In this paper, we introduce \our{}, a novel probabilistic approach to this problem that extends the binary cross entropy to the PML setup. In contrast to existing methods, it does not require suboptimal disambiguation and,… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted to the International Conference on Data Science and Advanced Analytics (DSAA 2023)

  8. arXiv:2402.06390  [pdf, other

    cs.CV

    ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting

    Authors: Georgii Stanishevskii, Jakub Steczkiewicz, Tomasz Szczepanik, Sławomir Tadeja, Jacek Tabor, Przemysław Spurek

    Abstract: Numerous emerging deep-learning techniques have had a substantial impact on computer graphics. Among the most promising breakthroughs are the recent rise of Neural Radiance Fields (NeRFs) and Gaussian Splatting (GS). NeRFs encode the object's shape and color in neural network weights using a handful of images with known camera positions to generate novel views. In contrast, GS provides accelerated… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  9. arXiv:2402.01459  [pdf, other

    cs.CV

    GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting

    Authors: Joanna Waczyńska, Piotr Borycki, Sławomir Tadeja, Jacek Tabor, Przemysław Spurek

    Abstract: Recently, a range of neural network-based methods for image rendering have been introduced. One such widely-researched neural radiance field (NeRF) relies on a neural network to represent 3D scenes, allowing for realistic view synthesis from a small number of 2D images. However, most NeRF models are constrained by long training and inference times. In comparison, Gaussian Splatting (GS) is a novel… ▽ More

    Submitted 15 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  10. arXiv:2312.13729  [pdf, other

    cs.CV

    Gaussian Splatting with NeRF-based Color and Opacity

    Authors: Dawid Malarz, Weronika Smolak, Jacek Tabor, Sławomir Tadeja, Przemysław Spurek

    Abstract: Neural Radiance Fields (NeRFs) have demonstrated the remarkable potential of neural networks to capture the intricacies of 3D objects. By encoding the shape and color information within neural network weights, NeRFs excel at producing strikingly sharp novel views of 3D objects. Recently, numerous generalizations of NeRFs utilizing generative models have emerged, expanding its versatility. In contr… ▽ More

    Submitted 12 June, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  11. arXiv:2311.03884  [pdf, other

    eess.IV cs.CV

    MeVGAN: GAN-based Plugin Model for Video Generation with Applications in Colonoscopy

    Authors: Łukasz Struski, Tomasz Urbańczyk, Krzysztof Bucki, Bartłomiej Cupiał, Aneta Kaczyńska, Przemysław Spurek, Jacek Tabor

    Abstract: Video generation is important, especially in medicine, as much data is given in this form. However, video generation of high-resolution data is a very demanding task for generative models, due to the large need for memory. In this paper, we propose Memory Efficient Video GAN (MeVGAN) - a Generative Adversarial Network (GAN) which uses plugin-type architecture. We use a pre-trained 2D-image GAN and… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  12. arXiv:2309.12033  [pdf, other

    cs.CV cs.LG

    Face Identity-Aware Disentanglement in StyleGAN

    Authors: Adrian Suwała, Bartosz Wójcik, Magdalena Proszewska, Jacek Tabor, Przemysław Spurek, Marek Śmieja

    Abstract: Conditional GANs are frequently used for manipulating the attributes of face images, such as expression, hairstyle, pose, or age. Even though the state-of-the-art models successfully modify the requested attributes, they simultaneously modify other important characteristics of the image, such as a person's identity. In this paper, we focus on solving this problem by introducing PluGeN4Faces, a plu… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  13. arXiv:2308.08162  [pdf, other

    cs.CV cs.AI cs.LG

    Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

    Authors: Mikołaj Sacha, Bartosz Jura, Dawid Rymarczyk, Łukasz Struski, Jacek Tabor, Bartosz Zieliński

    Abstract: Prototypical parts-based networks are becoming increasingly popular due to their faithful self-explanations. However, their similarity maps are calculated in the penultimate network layer. Therefore, the receptive field of the prototype activation region often depends on parts of the image outside this region, which can lead to misleading interpretations. We name this undesired behavior a spatial… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: Under review. Code will be release upon acceptance

  14. arXiv:2307.02198  [pdf, other

    cs.LG q-bio.QM

    ChiENN: Embracing Molecular Chirality with Graph Neural Networks

    Authors: Piotr Gaiński, Michał Koziarski, Jacek Tabor, Marek Śmieja

    Abstract: Graph Neural Networks (GNNs) play a fundamental role in many deep learning problems, in particular in cheminformatics. However, typical GNNs cannot capture the concept of chirality, which means they do not distinguish between the 3D graph of a chemical compound and its mirror image (enantiomer). The ability to distinguish between enantiomers is important especially in drug discovery because enanti… ▽ More

    Submitted 10 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  15. arXiv:2306.12230  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Fantastic Weights and How to Find Them: Where to Prune in Dynamic Sparse Training

    Authors: Aleksandra I. Nowak, Bram Grooten, Decebal Constantin Mocanu, Jacek Tabor

    Abstract: Dynamic Sparse Training (DST) is a rapidly evolving area of research that seeks to optimize the sparse initialization of a neural network by adapting its topology during training. It has been shown that under specific conditions, DST is able to outperform dense models. The key components of this framework are the pruning and growing criteria, which are repeatedly applied during the training proces… ▽ More

    Submitted 29 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  16. arXiv:2306.10535  [pdf, other

    eess.IV cs.CV cs.LG

    ProMIL: Probabilistic Multiple Instance Learning for Medical Imaging

    Authors: Łukasz Struski, Dawid Rymarczyk, Arkadiusz Lewicki, Robert Sabiniewicz, Jacek Tabor, Bartosz Zieliński

    Abstract: Multiple Instance Learning (MIL) is a weakly-supervised problem in which one label is assigned to the whole bag of instances. An important class of MIL models is instance-based, where we first classify instances and then aggregate those predictions to obtain a bag label. The most common MIL model is when we consider a bag as positive if at least one of its instances has a positive label. However,… ▽ More

    Submitted 12 March, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: Accepted Paper to European Conference on Artificial Intelligence (ECAI 2023)

  17. arXiv:2306.06082  [pdf, other

    cs.CV cs.LG

    Augmentation-aware Self-supervised Learning with Conditioned Projector

    Authors: Marcin Przewięźlikowski, Mateusz Pyla, Bartosz Zieliński, Bartłomiej Twardowski, Jacek Tabor, Marek Śmieja

    Abstract: Self-supervised learning (SSL) is a powerful technique for learning robust representations from unlabeled data. By learning to remain invariant to applied data augmentations, methods such as SimCLR and MoCo are able to reach quality on par with supervised approaches. However, this invariance may be harmful to solving some downstream tasks which depend on traits affected by augmentations used durin… ▽ More

    Submitted 2 December, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: Prepint under review. Code: https://github.com/gmum/CASSLE

  18. arXiv:2305.10579  [pdf, other

    cs.CV

    MultiPlaneNeRF: Neural Radiance Field with Non-Trainable Representation

    Authors: Dominik Zimny, Artur Kasymov, Adam Kania, Jacek Tabor, Maciej Zięba, Przemysław Spurek

    Abstract: NeRF is a popular model that efficiently represents 3D objects from 2D images. However, vanilla NeRF has some important limitations. NeRF must be trained on each object separately. The training time is long since we encode the object's shape and color in neural network weights. Moreover, NeRF does not generalize well to unseen data. In this paper, we present MultiPlaneNeRF -- a model that simultan… ▽ More

    Submitted 28 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  19. arXiv:2304.05243  [pdf, other

    cs.LG

    r-softmax: Generalized Softmax with Controllable Sparsity Rate

    Authors: Klaudia Bałazy, Łukasz Struski, Marek Śmieja, Jacek Tabor

    Abstract: Nowadays artificial neural network models achieve remarkable results in many disciplines. Functions mapping the representation provided by the model to the probability distribution are the inseparable aspect of deep learning solutions. Although softmax is a commonly accepted probability mapping function in the machine learning community, it cannot return sparse outputs and always spreads the posit… ▽ More

    Submitted 21 April, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

  20. arXiv:2304.01367  [pdf, other

    cs.CV

    Gaussian model for closed curves

    Authors: Krzysztof Byrski, Przemysław Spurek, Jacek Tabor

    Abstract: Gaussian Mixture Models (GMM) do not adapt well to curved and strongly nonlinear data. However, we can use Gaussians in the curvilinear coordinate systems to solve this problem. Moreover, such a solution allows for the adaptation of clusters to the complicated shapes defined by the family of functions. But still, it is challenging to model clusters as closed curves (e.g., circles, ellipses, etc.).… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  21. arXiv:2303.06226  [pdf, other

    cs.CV

    NeRFlame: FLAME-based conditioning of NeRF for 3D face rendering

    Authors: Wojciech Zając, Joanna Waczyńska, Piotr Borycki, Jacek Tabor, Maciej Zięba, Przemysław Spurek

    Abstract: Traditional 3D face models are based on mesh representations with texture. One of the most important models is FLAME (Faces Learned with an Articulated Model and Expressions), which produces meshes of human faces that are fully controllable. Unfortunately, such models have problems with capturing geometric and appearance details. In contrast to mesh representation, the neural radiance field (NeRF)… ▽ More

    Submitted 27 November, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  22. arXiv:2303.03389  [pdf, other

    cs.LG cs.AI

    Contrastive Hierarchical Clustering

    Authors: Michał Znaleźniak, Przemysław Rola, Patryk Kaszuba, Jacek Tabor, Marek Śmieja

    Abstract: Deep clustering has been dominated by flat models, which split a dataset into a predefined number of groups. Although recent methods achieve an extremely high similarity with the ground truth on popular benchmarks, the information contained in the flat partition is limited. In this paper, we introduce CoHiClust, a Contrastive Hierarchical Clustering model based on deep neural networks, which can b… ▽ More

    Submitted 21 June, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  23. arXiv:2302.04959  [pdf, other

    cs.LG cs.AI cs.SD eess.AS

    Hypernetworks build Implicit Neural Representations of Sounds

    Authors: Filip Szatkowski, Karol J. Piczak, Przemysław Spurek, Jacek Tabor, Tomasz Trzciński

    Abstract: Implicit Neural Representations (INRs) are nowadays used to represent multimedia signals across various real-life applications, including image super-resolution, image compression, or 3D rendering. Existing methods that leverage INRs are predominantly focused on visual data, as their application to other modalities, such as audio, is nontrivial due to the inductive biases present in architectural… ▽ More

    Submitted 17 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: ECML2023

  24. arXiv:2302.04045  [pdf, other

    cs.CL

    Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models

    Authors: Mohammadreza Banaei, Klaudia Bałazy, Artur Kasymov, Rémi Lebret, Jacek Tabor, Karl Aberer

    Abstract: Recent transformer language models achieve outstanding results in many natural language processing (NLP) tasks. However, their enormous size often makes them impractical on memory-constrained devices, requiring practitioners to compress them to smaller networks. In this paper, we explore offline compression methods, meaning computationally-cheap approaches that do not require further fine-tuning o… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  25. arXiv:2301.12276  [pdf, other

    cs.CV

    ProtoSeg: Interpretable Semantic Segmentation with Prototypical Parts

    Authors: Mikołaj Sacha, Dawid Rymarczyk, Łukasz Struski, Jacek Tabor, Bartosz Zieliński

    Abstract: We introduce ProtoSeg, a novel model for interpretable semantic image segmentation, which constructs its predictions using similar patches from the training set. To achieve accuracy comparable to baseline methods, we adapt the mechanism of prototypical parts and introduce a diversity loss function that increases the variety of prototypes within each class. We show that ProtoSeg discovers semantic… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 1481-1492

  26. arXiv:2211.01839  [pdf, other

    cs.SD cs.AI cs.LG cs.NE eess.AS

    HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks

    Authors: Filip Szatkowski, Karol J. Piczak, Przemysław Spurek, Jacek Tabor, Tomasz Trzciński

    Abstract: Implicit neural representations (INRs) are a rapidly growing research field, which provides alternative ways to represent multimedia signals. Recent applications of INRs include image super-resolution, compression of high-dimensional signals, or 3D rendering. However, these solutions usually focus on visual data, and adapting them to the audio domain is not trivial. Moreover, it requires a separat… ▽ More

    Submitted 25 January, 2024; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 MetaLearn workshop

  27. arXiv:2210.02796  [pdf, other

    cs.LG cs.AI

    Hypernetwork approach to Bayesian MAML

    Authors: Piotr Borycki, Piotr Kubacki, Marcin Przewięźlikowski, Tomasz Kuśmierczyk, Jacek Tabor, Przemysław Spurek

    Abstract: The main goal of Few-Shot learning algorithms is to enable learning from small amounts of data. One of the most popular and elegant Few-Shot learning approaches is Model-Agnostic Meta-Learning (MAML). The main idea behind this method is to learn the shared universal weights of a meta-model, which are then adapted for specific tasks. However, the method suffers from over-fitting and poorly quantifi… ▽ More

    Submitted 30 August, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2205.15745

  28. arXiv:2208.09931  [pdf, other

    cs.LG cs.AI

    ProPaLL: Probabilistic Partial Label Learning

    Authors: Łukasz Struski, Jacek Tabor, Bartosz Zieliński

    Abstract: Partial label learning is a type of weakly supervised learning, where each training instance corresponds to a set of candidate labels, among which only one is true. In this paper, we introduce ProPaLL, a novel probabilistic approach to this problem, which has at least three advantages compared to the existing approaches: it simplifies the training process, improves performance, and can be applied… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

  29. arXiv:2206.14882  [pdf, other

    stat.ML cs.LG

    LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood

    Authors: Piotr Tempczyk, Rafał Michaluk, Łukasz Garncarek, Przemysław Spurek, Jacek Tabor, Adam Goliński

    Abstract: Most of the existing methods for estimating the local intrinsic dimension of a data distribution do not scale well to high-dimensional data. Many of them rely on a non-parametric nearest neighbors approach which suffers from the curse of dimensionality. We attempt to address that challenge by proposing a novel approach to the problem: Local Intrinsic Dimension estimation using approximate Likeliho… ▽ More

    Submitted 11 July, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  30. arXiv:2206.13923  [pdf, other

    cs.LG

    SLOVA: Uncertainty Estimation Using Single Label One-Vs-All Classifier

    Authors: Bartosz Wójcik, Jacek Grela, Marek Śmieja, Krzysztof Misztal, Jacek Tabor

    Abstract: Deep neural networks present impressive performance, yet they cannot reliably estimate their predictive confidence, limiting their applicability in high-risk domains. We show that applying a multi-label one-vs-all loss reveals classification ambiguity and reduces model overconfidence. The introduced SLOVA (Single Label One-Vs-All) model redefines typical one-vs-all predictive probabilities to a si… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  31. arXiv:2206.09453  [pdf, other

    cs.LG cs.AI stat.ML

    Bounding Evidence and Estimating Log-Likelihood in VAE

    Authors: Łukasz Struski, Marcin Mazur, Paweł Batorski, Przemysław Spurek, Jacek Tabor

    Abstract: Many crucial problems in deep learning and statistics are caused by a variational gap, i.e., a difference between evidence and evidence lower bound (ELBO). As a consequence, in the classical VAE model, we obtain only the lower bound on the log-likelihood since ELBO is used as a cost function, and therefore we cannot compare log-likelihood between models. In this paper, we present a general and eff… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

  32. arXiv:2206.07996  [pdf, other

    cs.LG

    Continual Learning with Guarantees via Weight Interval Constraints

    Authors: Maciej Wołczyk, Karol J. Piczak, Bartosz Wójcik, Łukasz Pustelnik, Paweł Morawiecki, Jacek Tabor, Tomasz Trzciński, Przemysław Spurek

    Abstract: We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of data, while reducing the negative impact of catastrophic forgetting, yet they do not provide any firm guarantees that network performance will not deteriorate unco… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Short presentation at ICML 2022

  33. arXiv:2205.15745  [pdf, other

    cs.LG cs.AI

    HyperMAML: Few-Shot Adaptation of Deep Models with Hypernetworks

    Authors: M. Przewięźlikowski, P. Przybysz, J. Tabor, M. Zięba, P. Spurek

    Abstract: The aim of Few-Shot learning methods is to train models which can easily adapt to previously unseen tasks, based on small amounts of data. One of the most popular and elegant Few-Shot learning approaches is Model-Agnostic Meta-Learning (MAML). The main idea behind this method is to learn the general weights of the meta-model, which are further adapted to specific problems in a small number of grad… ▽ More

    Submitted 8 July, 2024; v1 submitted 31 May, 2022; originally announced May 2022.

  34. arXiv:2203.11378  [pdf, other

    cs.LG cs.AI cs.CV

    HyperShot: Few-Shot Learning by Kernel HyperNetworks

    Authors: Marcin Sendera, Marcin Przewięźlikowski, Konrad Karanowski, Maciej Zięba, Jacek Tabor, Przemysław Spurek

    Abstract: Few-shot models aim at making predictions using a minimal number of labeled examples from a given task. The main challenge in this area is the one-shot setting where only one element represents each class. We propose HyperShot - the fusion of kernels and hypernetwork paradigm. Compared to reference approaches that apply a gradient-based adjustment of the parameters, our model aims to switch the cl… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  35. arXiv:2112.02902  [pdf, other

    cs.CV cs.AI cs.LG

    Interpretable Image Classification with Differentiable Prototypes Assignment

    Authors: Dawid Rymarczyk, Łukasz Struski, Michał Górszczak, Koryna Lewandowska, Jacek Tabor, Bartosz Zieliński

    Abstract: We introduce ProtoPool, an interpretable image classification model with a pool of prototypes shared by the classes. The training is more straightforward than in the existing methods because it does not require the pruning stage. It is obtained by introducing a fully differentiable assignment of prototypes to particular classes. Moreover, we introduce a novel focal similarity function to focus the… ▽ More

    Submitted 5 September, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to ECCV 2022

  36. arXiv:2110.14010  [pdf, other

    cs.LG cs.CV

    MisConv: Convolutional Neural Networks for Missing Data

    Authors: Marcin Przewięźlikowski, Marek Śmieja, Łukasz Struski, Jacek Tabor

    Abstract: Processing of missing data by modern neural networks, such as CNNs, remains a fundamental, yet unsolved challenge, which naturally arises in many practical applications, like image inpainting or autonomous vehicles and robots. While imputation-based techniques are still one of the most popular solutions, they frequently introduce unreliable information to the data and do not take into account the… ▽ More

    Submitted 29 October, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted for publication at WACV 2022 Conference

  37. arXiv:2110.13561  [pdf, other

    cs.LG

    Non-Gaussian Gaussian Processes for Few-Shot Regression

    Authors: Marcin Sendera, Jacek Tabor, Aleksandra Nowak, Andrzej Bedychaj, Massimiliano Patacchiola, Tomasz Trzciński, Przemysław Spurek, Maciej Zięba

    Abstract: Gaussian Processes (GPs) have been widely used in machine learning to model distributions over functions, with applications including multi-modal regression, time-series prediction, and few-shot learning. GPs are particularly useful in the last application since they rely on Normal distributions and enable closed-form computation of the posterior probability function. Unfortunately, because the re… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  38. arXiv:2110.05841  [pdf, other

    cs.LG cs.AI

    Relative Molecule Self-Attention Transformer

    Authors: Łukasz Maziarka, Dawid Majchrowski, Tomasz Danel, Piotr Gaiński, Jacek Tabor, Igor Podolak, Paweł Morkisz, Stanisław Jastrzębski

    Abstract: Self-supervised learning holds promise to revolutionize molecule property prediction - a central task to drug discovery and many more industries - by enabling data efficient learning from scarce experimental data. Despite significant progress, non-pretrained methods can be still competitive in certain settings. We reason that architecture might be a key bottleneck. In particular, enriching the bac… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

  39. arXiv:2108.04907  [pdf, other

    cs.LG

    Flow-based SVDD for anomaly detection

    Authors: Marcin Sendera, Marek Śmieja, Łukasz Maziarka, Łukasz Struski, Przemysław Spurek, Jacek Tabor

    Abstract: We propose FlowSVDD -- a flow-based one-class classifier for anomaly/outliers detection that realizes a well-known SVDD principle using deep learning tools. Contrary to other approaches to deep SVDD, the proposed model is instantiated using flow-based models, which naturally prevents from collapsing of bounding hypersphere into a single point. Experiments show that FlowSVDD achieves comparable res… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.03002

  40. arXiv:2107.13214  [pdf, other

    cs.LG cs.AI

    SONG: Self-Organizing Neural Graphs

    Authors: Łukasz Struski, Tomasz Danel, Marek Śmieja, Jacek Tabor, Bartosz Zieliński

    Abstract: Recent years have seen a surge in research on deep interpretable neural networks with decision trees as one of the most commonly incorporated tools. There are at least three advantages of using decision trees over logistic regression classification models: they are easy to interpret since they are based on binary decisions, they can make decisions faster, and they provide a hierarchy of classes. H… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  41. Direction is what you need: Improving Word Embedding Compression in Large Language Models

    Authors: Klaudia Bałazy, Mohammadreza Banaei, Rémi Lebret, Jacek Tabor, Karl Aberer

    Abstract: The adoption of Transformer-based models in natural language processing (NLP) has led to great success using a massive number of parameters. However, due to deployment constraints in edge devices, there has been a rising interest in the compression of these models to improve their inference time and memory footprint. This paper presents a novel loss objective to compress token embeddings in the Tr… ▽ More

    Submitted 3 August, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  42. arXiv:2106.05409  [pdf, other

    cs.LG

    Zero Time Waste: Recycling Predictions in Early Exit Neural Networks

    Authors: Maciej Wołczyk, Bartosz Wójcik, Klaudia Bałazy, Igor Podolak, Jacek Tabor, Marek Śmieja, Tomasz Trzciński

    Abstract: The problem of reducing processing time of large deep learning models is a fundamental challenge in many real-world applications. Early exit methods strive towards this goal by attaching additional Internal Classifiers (ICs) to intermediate layers of a neural network. ICs can quickly return predictions for easy examples and, as a result, reduce the average inference time of the whole model. Howeve… ▽ More

    Submitted 5 December, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS 2021

  43. arXiv:2102.05973  [pdf, other

    cs.CV

    HyperPocket: Generative Point Cloud Completion

    Authors: Przemysław Spurek, Artur Kasymov, Marcin Mazur, Diana Janik, Sławomir Tadeja, Łukasz Struski, Jacek Tabor, Tomasz Trzciński

    Abstract: Scanning real-life scenes with modern registration devices typically give incomplete point cloud representations, mostly due to the limitations of the scanning process and 3D occlusions. Therefore, completing such partial representations remains a fundamental challenge of many computer vision applications. Most of the existing approaches aim to solve this problem by learning to reconstruct individ… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  44. arXiv:2011.14620  [pdf, other

    cs.LG cs.AI stat.ML

    RegFlow: Probabilistic Flow-based Regression for Future Prediction

    Authors: Maciej Zięba, Marcin Przewięźlikowski, Marek Śmieja, Jacek Tabor, Tomasz Trzcinski, Przemysław Spurek

    Abstract: Predicting future states or actions of a given system remains a fundamental, yet unsolved challenge of intelligence, especially in the scope of complex and non-deterministic scenarios, such as modeling behavior of humans. Existing approaches provide results under strong assumptions concerning unimodality of future states, or, at best, assuming specific probability distributions that often poorly f… ▽ More

    Submitted 30 November, 2020; originally announced November 2020.

  45. arXiv:2011.14340  [pdf, other

    cs.CV cs.AI cs.LG

    ProtoPShare: Prototype Sharing for Interpretable Image Classification and Similarity Discovery

    Authors: Dawid Rymarczyk, Łukasz Struski, Jacek Tabor, Bartosz Zieliński

    Abstract: In this paper, we introduce ProtoPShare, a self-explained method that incorporates the paradigm of prototypical parts to explain its predictions. The main novelty of the ProtoPShare is its ability to efficiently share prototypical parts between the classes thanks to our data-dependent merge-pruning. Moreover, the prototypes are more consistent and the model is more robust to image perturbations th… ▽ More

    Submitted 29 November, 2020; originally announced November 2020.

  46. OneFlow: One-class flow for anomaly detection based on a minimal volume region

    Authors: Łukasz Maziarka, Marek Śmieja, Marcin Sendera, Łukasz Struski, Jacek Tabor, Przemysław Spurek

    Abstract: We propose OneFlow - a flow-based one-class classifier for anomaly (outlier) detection that finds a minimal volume bounding region. Contrary to density-based methods, OneFlow is constructed in such a way that its result typically does not depend on the structure of outliers. This is caused by the fact that during training the gradient of the cost function is propagated only over the points located… ▽ More

    Submitted 22 September, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Journal ref: 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  47. arXiv:2009.07327  [pdf, other

    cs.LG cs.CV stat.ML

    Generative models with kernel distance in data space

    Authors: Szymon Knop, Marcin Mazur, Przemysław Spurek, Jacek Tabor, Igor Podolak

    Abstract: Generative models dealing with modeling a~joint data distribution are generally either autoencoder or GAN based. Both have their pros and cons, generating blurry images or being unstable in training or prone to mode collapse phenomenon, respectively. The objective of this paper is to construct a~model situated between above architectures, one that does not inherit their main weaknesses. The propos… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  48. arXiv:2006.10013  [pdf, other

    cs.LG cs.CR stat.ML

    Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

    Authors: Bartosz Wójcik, Paweł Morawiecki, Marek Śmieja, Tomasz Krzyżek, Przemysław Spurek, Jacek Tabor

    Abstract: We present a mechanism for detecting adversarial examples based on data representations taken from the hidden layers of the target network. For this purpose, we train individual autoencoders at intermediate layers of the target network. This allows us to describe the manifold of true data and, in consequence, decide whether a given example has the same characteristics as true data. It also gives u… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  49. arXiv:2006.08710  [pdf, other

    cs.CV cs.LG eess.IV

    HyperFlow: Representing 3D Objects as Surfaces

    Authors: Przemysław Spurek, Maciej Zięba, Jacek Tabor, Tomasz Trzciński

    Abstract: In this work, we present HyperFlow - a novel generative model that leverages hypernetworks to create continuous 3D object representations in a form of lightweight surfaces (meshes), directly out of point clouds. Efficient object representations are essential for many computer vision applications, including robotic manipulation and autonomous driving. However, creating those representations is ofte… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  50. arXiv:2005.12991  [pdf, other

    cs.LG cs.CV stat.ML

    Kernel Self-Attention in Deep Multiple Instance Learning

    Authors: Dawid Rymarczyk, Adriana Borowa, Jacek Tabor, Bartosz Zieliński

    Abstract: Not all supervised learning problems are described by a pair of a fixed-size input tensor and a label. In some cases, especially in medical image analysis, a label corresponds to a bag of instances (e.g. image patches), and to classify such bag, aggregation of information from all of the instances is needed. There have been several attempts to create a model working with a bag of instances, howeve… ▽ More

    Submitted 5 March, 2021; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: https://openaccess.thecvf.com/content/WACV2021/papers/Rymarczyk_Kernel_Self-Attention_for_Weakly-Supervised_Image_Classification_Using_Deep_Multiple_Instance_WACV_2021_paper.pdf