Computer Science > Machine Learning

arXiv:2106.04765 (cs)

[Submitted on 9 Jun 2021 (v1), last revised 27 Oct 2021 (this version, v2)]

Title:Predicting Deep Neural Network Generalization with Perturbation Response Curves

Authors:Yair Schiff, Brian Quanz, Payel Das, Pin-Yu Chen

View PDF

Abstract:The field of Deep Learning is rich with empirical evidence of human-like performance on a variety of prediction tasks. However, despite these successes, the recent Predicting Generalization in Deep Learning (PGDL) NeurIPS 2020 competition suggests that there is a need for more robust and efficient measures of network generalization. In this work, we propose a new framework for evaluating the generalization capabilities of trained networks. We use perturbation response (PR) curves that capture the accuracy change of a given network as a function of varying levels of training sample perturbation. From these PR curves, we derive novel statistics that capture generalization capability. Specifically, we introduce two new measures for accurately predicting generalization gaps: the Gi-score and Pal-score, which are inspired by the Gini coefficient and Palma ratio (measures of income inequality), that accurately predict generalization gaps. Using our framework applied to intra and inter-class sample mixup, we attain better predictive scores than the current state-of-the-art measures on a majority of tasks in the PGDL competition. In addition, we show that our framework and the proposed statistics can be used to capture to what extent a trained network is invariant to a given parametric input transformation, such as rotation or translation. Therefore, these generalization gap prediction statistics also provide a useful means for selecting optimal network architectures and hyperparameters that are invariant to a certain perturbation.

Comments:	NeurIPS 2021
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2106.04765 [cs.LG]
	(or arXiv:2106.04765v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.04765

Submission history

From: Yair Schiff [view email]
[v1] Wed, 9 Jun 2021 01:37:36 UTC (2,182 KB)
[v2] Wed, 27 Oct 2021 01:19:08 UTC (2,401 KB)

Computer Science > Machine Learning

Title:Predicting Deep Neural Network Generalization with Perturbation Response Curves

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Predicting Deep Neural Network Generalization with Perturbation Response Curves

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators