Showing 1–9 of 9 results for author: Feofanov, V

  1. arXiv:2406.10327  [pdf, other]

    stat.ML cs.LG

    Analysing Multi-Task Regression via Random Matrix Theory with Application to Time Series Forecasting

    Authors: Romain Ilbert, Malik Tiomoko, Cosme Louart, Ambroise Odonnat, Vasilii Feofanov, Themis Palpanas, Ievgen Redko

    Abstract: In this paper, we introduce a novel theoretical framework for multi-task regression, applying random matrix theory to provide precise performance estimations, under high-dimensional, non-Gaussian data distributions. We formulate a multi-task optimization problem as a regularization technique to enable single-task models to leverage multi-task learning information. We derive a closed-form solution…

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2405.18979  [pdf, other]

    cs.LG stat.ML

    MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts

    Authors: Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An

    Abstract: Leveraging the models' outputs, specifically the logits, is a common approach to estimating the test accuracy of a pre-trained neural network on out-of-distribution (OOD) samples without requiring access to the corresponding ground truth labels. Despite their ease of implementation and computational efficiency, current logit-based methods are vulnerable to overconfidence issues, leading to predict…

    Submitted 24 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: The three first authors contributed equally

  3. arXiv:2402.10198  [pdf, other]

    cs.LG stat.ML

    SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention

    Authors: Romain Ilbert, Ambroise Odonnat, Vasilii Feofanov, Aladin Virmaux, Giuseppe Paolo, Themis Palpanas, Ievgen Redko

    Abstract: Transformer-based architectures achieved breakthrough performance in natural language processing and computer vision, yet they remain inferior to simpler linear baselines in multivariate long-term forecasting. To better understand this phenomenon, we start by studying a toy linear forecasting problem for which we show that transformers are incapable of converging to their true solution despite the…

    Submitted 3 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted as an Oral at ICML 2024, Vienna. The first two authors contributed equally

  4. arXiv:2401.08909  [pdf, other]

    cs.LG

    Leveraging Gradients for Unsupervised Accuracy Estimation under Distribution Shift

    Authors: Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Ievgen Redko, Jianfeng Zhang, Bo An

    Abstract: Estimating test accuracy without access to the ground-truth test labels under varying test environments is a challenging, yet extremely important problem in the safe deployment of machine learning algorithms. Existing works rely on the information from either the outputs or the extracted features of neural networks to formulate an estimation score correlating with the ground-truth test accuracy. I…

    Submitted 1 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  5. arXiv:2310.14814  [pdf, other]

    cs.LG cs.AI stat.ML

    Leveraging Ensemble Diversity for Robust Self-Training in the Presence of Sample Selection Bias

    Authors: Ambroise Odonnat, Vasilii Feofanov, Ievgen Redko

    Abstract: Self-training is a well-known approach for semi-supervised learning. It consists of iteratively assigning pseudo-labels to unlabeled data for which the model is confident and treating them as labeled examples. For neural networks, softmax prediction probabilities are often used as a confidence measure, although they are known to be overconfident, even for wrong predictions. This phenomenon is part…

    Submitted 3 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at AISTATS 2024, Valencia, Spain

  6. arXiv:2310.13434  [pdf, other]

    cs.LG cs.AI stat.ML

    Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption

    Authors: Vasilii Feofanov, Malik Tiomoko, Aladin Virmaux

    Abstract: We propose a theoretical framework to analyze semi-supervised classification under the low density separation assumption in a high-dimensional regime. In particular, we introduce QLDS, a linear classification model, where the low density separation assumption is implemented via quadratic margin maximization. The algorithm has an explicit solution with rich theoretical properties, and we show that…

    Submitted 20 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:10008-10033, 2023

  7. arXiv:2202.12040  [pdf, other]

    cs.LG

    Self-Training: A Survey

    Authors: Massih-Reza Amini, Vasilii Feofanov, Loic Pauletto, Lies Hadjadj, Emilie Devijver, Yury Maximov

    Abstract: Semi-supervised algorithms aim to learn prediction functions from a small set of labeled observations and a large set of unlabeled observations. Because this framework is relevant in many applications, they have received a lot of interest in both academia and industry. Among the existing techniques, self-training methods have undoubtedly attracted greater attention in recent years. These models ar…

    Submitted 27 May, 2024; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 36 pages, 1 figure

  8. arXiv:2109.14422  [pdf, other]

    cs.LG

    Multi-class Probabilistic Bounds for Self-learning

    Authors: Vasilii Feofanov, Emilie Devijver, Massih-Reza Amini

    Abstract: Self-learning is a classical approach for learning with both labeled and unlabeled observations which consists in giving pseudo-labels to unlabeled training instances with a confidence score over a predetermined threshold. At the same time, the pseudo-labeling technique is prone to error and runs the risk of adding noisy labels into unlabeled training data. In this paper, we present a probabilisti…

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: 46 pages, 4 figures

  9. arXiv:1911.04841  [pdf, other]

    cs.LG stat.ML

    Semi-supervised Wrapper Feature Selection by Modeling Imperfect Labels

    Authors: Vasilii Feofanov, Emilie Devijver, Massih-Reza Amini

    Abstract: In this paper, we propose a new wrapper feature selection approach with partially labeled training examples where unlabeled observations are pseudo-labeled using the predictions of an initial classifier trained on the labeled training set. The wrapper is composed of a genetic algorithm for proposing new feature subsets, and an evaluation measure for scoring the different feature subsets. The selec…

    Submitted 10 March, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: 18 pages, 1 figure