Skip to main content

Showing 1–19 of 19 results for author: Aminian, G

  1. arXiv:2405.00454  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Robust Semi-supervised Learning via $f$-Divergence and $α$-Rényi Divergence

    Authors: Gholamali Aminian, Amirhossien Bagheri, Mahyar JafariNodeh, Radmehr Karimian, Mohammad-Hossein Yassaee

    Abstract: This paper investigates a range of empirical risk functions and regularization methods suitable for self-training methods in semi-supervised learning. These approaches draw inspiration from various divergence measures, such as $f$-divergences and $α$-Rényi divergences. Inspired by the theoretical foundations rooted in divergences, i.e., $f$-divergences and $α$-Rényi divergence, we also provide val… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted in ISIT 2024

  2. arXiv:2402.07025  [pdf, other

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen… ▽ More

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  3. arXiv:2306.11623  [pdf, ps, other

    stat.ML cs.LG math.ST

    Mean-field Analysis of Generalization Errors

    Authors: Gholamali Aminian, Samuel N. Cohen, Łukasz Szpruch

    Abstract: We propose a novel framework for exploring weak and $L_2$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$, is… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 49 pages

    MSC Class: 62B10; 60F99; 49N80; 46N30

  4. arXiv:2304.14332  [pdf, other

    cs.LG cs.IT

    On the Generalization Error of Meta Learning for the Gibbs Algorithm

    Authors: Yuheng Bu, Harsha Vardhan Tetali, Gholamali Aminian, Miguel Rodrigues, Gregory Wornell

    Abstract: We analyze the generalization ability of joint-training meta learning algorithms via the Gibbs algorithm. Our exact characterization of the expected meta generalization error for the meta Gibbs algorithm is based on symmetrized KL information, which measures the dependence between all meta-training datasets and the output parameters, including task-specific and meta parameters. Additionally, we de… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: Accepted at ISIT 2023

  5. arXiv:2210.09864  [pdf, ps, other

    cs.IT

    Information-theoretic Characterizations of Generalization Error for the Gibbs Algorithm

    Authors: Gholamali Aminian, Yuheng Bu, Laura Toni, Miguel R. D. Rodrigues, Gregory W. Wornell

    Abstract: Various approaches have been developed to upper bound the generalization error of a supervised learning algorithm. However, existing bounds are often loose and even vacuous when evaluated in practice. As a result, they may fail to characterize the exact generalization ability of a learning algorithm. Our main contributions are exact characterizations of the expected generalization error of the wel… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: under review. arXiv admin note: text overlap with arXiv:2107.13656, arXiv:2111.01635

  6. arXiv:2210.08188  [pdf, ps, other

    cs.IT cs.LG

    How Does Pseudo-Labeling Affect the Generalization Error of the Semi-Supervised Gibbs Algorithm?

    Authors: Haiyun He, Gholamali Aminian, Yuheng Bu, Miguel Rodrigues, Vincent Y. F. Tan

    Abstract: We provide an exact characterization of the expected generalization error (gen-error) for semi-supervised learning (SSL) with pseudo-labeling via the Gibbs algorithm. The gen-error is expressed in terms of the symmetrized KL information between the output hypothesis, the pseudo-labeled dataset, and the labeled dataset. Distribution-free upper and lower bounds on the gen-error can also be obtained.… ▽ More

    Submitted 15 June, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: 30 pages, 4 figures

  7. arXiv:2210.00483  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Learning Algorithm Generalization Error Bounds via Auxiliary Distributions

    Authors: Gholamali Aminian, Saeed Masiha, Laura Toni, Miguel R. D. Rodrigues

    Abstract: Generalization error bounds are essential for comprehending how well machine learning models work. In this work, we suggest a novel method, i.e., the Auxiliary Distribution Method, that leads to new upper bounds on expected generalization errors that are appropriate for supervised learning scenarios. We show that our general upper bounds can be specialized under some conditions to new bounds invol… ▽ More

    Submitted 16 April, 2024; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE Journal on Selected Areas in Information Theory

  8. arXiv:2209.07148  [pdf, ps, other

    cs.LG cs.AI cs.IT

    Semi-supervised Batch Learning From Logged Data

    Authors: Gholamali Aminian, Armin Behnamnia, Roberto Vega, Laura Toni, Chengchun Shi, Hamid R. Rabiee, Omar Rivasplata, Miguel R. D. Rodrigues

    Abstract: Off-policy learning methods are intended to learn a policy from logged data, which includes context, action, and feedback (cost or reward) for each sample point. In this work, we build on the counterfactual risk minimization framework, which also assumes access to propensity scores. We propose learning methods for problems where feedback is missing for some samples, so there are samples with feedb… ▽ More

    Submitted 18 February, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: 46 pages,

  9. arXiv:2202.12150  [pdf, ps, other

    cs.IT cs.LG

    Tighter Expected Generalization Error Bounds via Convexity of Information Measures

    Authors: Gholamali Aminian, Yuheng Bu, Gregory Wornell, Miguel Rodrigues

    Abstract: Generalization error bounds are essential to understanding machine learning algorithms. This paper presents novel expected generalization error upper bounds based on the average joint distribution between the output hypothesis and each input training sample. Multiple generalization error upper bounds based on different information measures are provided, including Wasserstein distance, total variat… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 10 pages, 1 figure

  10. arXiv:2202.12123  [pdf, ps, other

    cs.IT stat.ML

    An Information-theoretical Approach to Semi-supervised Learning under Covariate-shift

    Authors: Gholamali Aminian, Mahed Abroshan, Mohammad Mahdi Khalili, Laura Toni, Miguel R. D. Rodrigues

    Abstract: A common assumption in semi-supervised learning is that the labeled, unlabeled, and test data are drawn from the same distribution. However, this assumption is not satisfied in many applications. In many scenarios, the data is collected sequentially (e.g., healthcare) and the distribution of the data may change over time often exhibiting so-called covariate shifts. In this paper, we propose an app… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted at AISTATS 2022

  11. arXiv:2111.01635  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

    Authors: Yuheng Bu, Gholamali Aminian, Laura Toni, Miguel Rodrigues, Gregory Wornell

    Abstract: We provide an information-theoretic analysis of the generalization ability of Gibbs-based transfer learning algorithms by focusing on two popular transfer learning approaches, $α$-weighted-ERM and two-stage-ERM. Our key result is an exact characterization of the generalization behaviour using the conditional symmetrized KL information between the output hypothesis and the target training samples g… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  12. arXiv:2107.13656  [pdf, ps, other

    cs.LG cs.IT math.ST stat.ML

    Characterizing the Generalization Error of Gibbs Algorithm with Symmetrized KL information

    Authors: Gholamali Aminian, Yuheng Bu, Laura Toni, Miguel R. D. Rodrigues, Gregory Wornell

    Abstract: Bounding the generalization error of a supervised learning algorithm is one of the most important problems in learning theory, and various approaches have been developed. However, existing bounds are often loose and lack of guarantees. As a result, they may fail to characterize the exact generalization ability of a learning algorithm. Our main contribution is an exact characterization of the expec… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: The first and second author have contributed equally to the paper. This paper is accepted in the ICML-21 Workshop on Information-Theoretic Methods for Rigorous, Responsible, and Reliable Machine Learning: https://sites.google.com/view/itr3/schedule

  13. arXiv:2102.02016  [pdf, ps, other

    cs.IT cs.LG stat.ML

    Information-Theoretic Bounds on the Moments of the Generalization Error of Learning Algorithms

    Authors: Gholamali Aminian, Laura Toni, Miguel R. D. Rodrigues

    Abstract: Generalization error bounds are critical to understanding the performance of machine learning models. In this work, building upon a new bound of the expected value of an arbitrary function of the population and empirical risk of a learning algorithm, we offer a more refined analysis of the generalization behaviour of a machine learning models based on a characterization of (bounds) to their genera… ▽ More

    Submitted 5 May, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: 7 pages, 3 figures, to be published in ISIT 2021. Some typos are fixed in the new version. The Re'yni divergence results are added in the new version

  14. arXiv:2010.12664  [pdf, ps, other

    cs.IT math.ST stat.ML

    Jensen-Shannon Information Based Characterization of the Generalization Error of Learning Algorithms

    Authors: Gholamali Aminian, Laura Toni, Miguel R. D. Rodrigues

    Abstract: Generalization error bounds are critical to understanding the performance of machine learning models. In this work, we propose a new information-theoretic based generalization error upper bound applicable to supervised learning scenarios. We show that our general bound can specialize in various previous bounds. We also show that our general bound can be specialized under some conditions to a new b… ▽ More

    Submitted 8 January, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted in ITW 2020 conference

  15. arXiv:1702.03590  [pdf, other

    cs.IT

    On the Capacity of a Class of Signal-Dependent Noise Channels

    Authors: Hamid Ghourchian, Gholamali Aminian, Amin Gohari, Mahtab Mirmohseni, Masoumeh Nasiri-Kenari

    Abstract: In some applications, the variance of additive measurement noise depends on the signal that we aim to measure. For instance, additive Gaussian signal-dependent noise (AGSDN) channel models are used in molecular and optical communication. Herein we provide lower and upper bounds on the capacity of additive signal-dependent noise (ASDN) channels. The idea of the first lower bound is the extension of… ▽ More

    Submitted 2 June, 2017; v1 submitted 12 February, 2017; originally announced February 2017.

    Comments: 34 pages, 3 figures

  16. arXiv:1604.05680  [pdf, ps, other

    cs.IT

    On Medium Chemical Reaction in Diffusion-Based Molecular Communication: a Two-Way Relaying Example

    Authors: Maryam Farahnak-Ghazani, Gholamali Aminian, Mahtab Mirmohseni, Amin Gohari, Masoumeh Nasiri-Kenari

    Abstract: Chemical reactions are a prominent feature of molecular communication (MC) systems, with no direct parallels in wireless communications. While chemical reactions may be used inside the transmitter nodes, receiver nodes or the communication medium, we focus on its utility in the medium in this paper. Such chemical reactions can be used to perform computation over the medium as molecules diffuse and… ▽ More

    Submitted 24 April, 2018; v1 submitted 19 April, 2016; originally announced April 2016.

    Comments: 32 pages, 6 figures

  17. arXiv:1509.05877  [pdf, other

    cs.IT q-bio.MN

    On the Capacity of Point-to-Point and Multiple-Access Molecular Communications with Ligand-Receptors

    Authors: Gholamali Aminian, Maryam Farahnak Ghazani, Mahtab Mirmohseni, Masoumeh Nasiri Kenari, Faramarz Fekri

    Abstract: In this paper, we consider the bacterial point-to-point and multiple-access molecular communications with ligand-receptors. For the point-to-point communication, we investigate common signaling methods, namely the Level Scenario (LS), which uses one type of a molecule with different concentration levels, and the Type Scenario (TS), which employs multiple types of molecules with a single concentrat… ▽ More

    Submitted 19 September, 2015; originally announced September 2015.

  18. arXiv:1504.04322  [pdf, other

    cs.IT

    On the Capacity of Level and Type Modulation in Molecular Communication with Ligand Receptors

    Authors: Gholamali Aminian, Mahtab Mirmohseni, Masoumeh Nasiri Kenari, Faramarz Fekri

    Abstract: In this paper, we consider the bacterial point-to-point communication problem with one transmitter and one receiver by considering the ligand receptor binding process. The most commonly investigated signalling model, referred to as the Level Scenario (LS), uses one type of a molecule with different concentration levels for signaling. An alternative approach is to employ multiple types of molecules… ▽ More

    Submitted 16 April, 2015; originally announced April 2015.

    Comments: 18 pages, Accepted at ISIT conference

  19. arXiv:1410.3988  [pdf, other

    cs.IT

    Capacity of Diffusion based Molecular Communication Networks over LTI-Poisson Channels

    Authors: Hamidreza Arjmandi, Gholamali Aminian, Amin Gohari, Masoumeh Nasiri Kenari, Urbashi Mitra

    Abstract: In this paper, the capacity of a diffusion based molecular communication network under the model of a Linear Time Invarient-Poisson (LTI-Poisson) channel is studied. Introduced in the context of molecular communication, the LTI-Poisson model is a natural extension of the conventional memoryless Poisson channel to include memory. Exploiting prior art on linear ISI channels, a computable finite-lett… ▽ More

    Submitted 16 October, 2014; v1 submitted 15 October, 2014; originally announced October 2014.