Adapting Conformal Prediction to Distribution Shifts Without Labels

Authors: Kevin Kasa, Zhiyu Zhang, Heng Yang, Graham W. Taylor

Abstract: Conformal prediction (CP) enables machine learning models to output prediction sets with a guaranteed coverage rate, assuming exchangeable data. Unfortunately, the exchangeability assumption is frequently violated in practice due to distribution shifts, and the challenge is often compounded by the lack of ground-truth labels at test time. Focusing on classification in this paper, our goal is to improve the quality of CP-generated prediction sets using only unlabeled data from the test domain. This is achieved by two new methods, ECP and EACP, that adjust the score function in CP according to the base model's uncertainty on the unlabeled test data. Through extensive experiments on a number of large-scale datasets and neural network architectures, we show that our methods provide consistent improvement over existing baselines and nearly match the performance of supervised algorithms.

Submitted 3 June, 2024; originally announced June 2024.
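
For readers unfamiliar with the setting in the abstract above, the following is a minimal sketch of standard split conformal prediction for classification, the baseline that relies on exchangeability. It is not the ECP or EACP method, whose details are not given in this listing. The function names, the score `1 - p(true label)`, and the toy inputs are illustrative assumptions.

```python
import numpy as np

def calibrate_threshold(cal_probs, cal_labels, alpha=0.1):
    """Conformal threshold from a labeled calibration set.

    Score (an assumed, common choice): 1 - predicted probability of the true class.
    Returns the ceil((n + 1) * (1 - alpha))-th smallest calibration score.
    """
    cal_probs = np.asarray(cal_probs, dtype=float)
    cal_labels = np.asarray(cal_labels, dtype=int)
    n = len(cal_labels)
    scores = np.sort(1.0 - cal_probs[np.arange(n), cal_labels])
    k = int(np.ceil((n + 1) * (1 - alpha)))
    if k > n:
        # Too few calibration points for this alpha: include every class.
        return np.inf
    return scores[k - 1]

def prediction_sets(test_probs, qhat):
    """Boolean mask of shape (n_test, n_classes); True means the class is in the set."""
    return (1.0 - np.asarray(test_probs, dtype=float)) <= qhat

# Toy usage with hypothetical softmax outputs.
cal_probs = [[0.9, 0.1], [0.8, 0.2], [0.3, 0.7], [0.6, 0.4]]
cal_labels = [0, 0, 1, 0]
qhat = calibrate_threshold(cal_probs, cal_labels, alpha=0.5)
sets = prediction_sets([[0.75, 0.25], [0.5, 0.5]], qhat)
```

Under exchangeability, sets built this way contain the true label with probability at least `1 - alpha`; a distribution shift between calibration and test data breaks that guarantee, which is the problem the abstract addresses.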

arXiv:2402.10229 [pdf, other]

Mixture-Models: a one-stop Python Library for Model-based Clustering using various Mixture Models

Authors: Siva Rajesh Kasa, Hu Yijie, Santhosh Kumar Kasa, Vaibhav Rajan

Abstract: Mixture-Models is an open-source Python library for fitting Gaussian Mixture Models (GMMs) and their variants, such as Parsimonious GMMs, Mixtures of Factor Analyzers, MClust models, and Mixtures of Student's t distributions. It streamlines the implementation and analysis of these models using various first- and second-order optimization routines, such as Gradient Descent and Newton-CG, through automatic differentiation (AD) tools. This helps in extending these models to high-dimensional data, which is the first of its kind among Python libraries. The library provides user-friendly model evaluation tools, such as BIC, AIC, and log-likelihood estimation. The source code is licensed under the MIT license and can be accessed at https://github.com/kasakh/Mixture-Models. The package is highly extensible, allowing users to incorporate new distributions and optimization techniques with ease. We conduct a large-scale simulation to compare the performance of various gradient-based approaches against Expectation Maximization on a wide range of settings and identify the best-suited approach for each.

Submitted 8 February, 2024; originally announced February 2024.

arXiv:2312.16549 [pdf, other]

How Robust are LLMs to In-Context Majority Label Bias?

Authors: Karan Gupta, Sumegh Roychowdhury, Siva Rajesh Kasa, Santhosh Kumar Kasa, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

Abstract: In the In-Context Learning (ICL) setup, various forms of label bias can manifest. One such manifestation is majority label bias, which arises when the distribution of labeled examples in the in-context samples is skewed towards one or more specific classes, making Large Language Models (LLMs) more prone to predict those labels. Such discrepancies can arise from various factors, including logistical constraints, inherent biases in data collection methods, and limited access to diverse data sources, which are unavoidable in a real-world industry setup. In this work, we study the robustness of in-context learning in LLMs to shifts that occur due to majority label bias within the purview of text classification tasks. Prior works have shown that in-context learning with LLMs is susceptible to such biases. In our study, we go one level deeper and show that the robustness boundary varies widely for different models and tasks, with certain LLMs being highly robust (~90%) to majority label bias. Our findings also highlight the impact of model size and the richness of instructional prompts on model robustness. We restrict our study to publicly available open-source models to ensure transparency and reproducibility.

Submitted 27 December, 2023; originally announced December 2023.

Comments: 6 pages, 3 figures, 2 tables. Accepted at Workshop on Responsible Language Modeling, AAAI 2024 (www.aaai.org)

arXiv:2307.01088 [pdf, other]

Empirically Validating Conformal Prediction on Modern Vision Architectures Under Distribution Shift and Long-tailed Data

Authors: Kevin Kasa, Graham W. Taylor

Abstract: Conformal prediction has emerged as a rigorous means of providing deep learning models with reliable uncertainty estimates and safety guarantees. Yet, its performance is known to degrade under distribution shift and long-tailed class distributions, which are often present in real-world applications. Here, we characterize the performance of several post-hoc and training-based conformal prediction methods under these settings, providing the first empirical evaluation on large-scale datasets and models. We show that across numerous conformal methods and neural network families, performance degrades greatly under distribution shifts, violating safety guarantees. Similarly, we show that in long-tailed settings the guarantees are frequently violated on many classes. Understanding the limitations of these methods is necessary for deployment in real-world and safety-critical applications.

Submitted 3 July, 2023; originally announced July 2023.

arXiv:0708.0752 [pdf, ps, other]

doi 10.1103/PhysRevA.77.012719

Two-color photoassociation spectroscopy of ytterbium atoms and the precise determinations of s-wave scattering lengths

Authors: Masaaki Kitagawa, Katsunari Enomoto, Kentaro Kasa, Yoshiro Takahashi, Roman Ciurylo, Pascal Naidon, Paul S. Julienne

Abstract: By performing high-resolution two-color photoassociation spectroscopy, we have successfully determined the binding energies of several of the last bound states of the homonuclear dimers of six different isotopes of ytterbium. These spectroscopic data are in excellent agreement with theoretical calculations based on a simple model potential, which very precisely predicts the s-wave scattering lengths of all 28 pairs of the seven stable isotopes. The s-wave scattering lengths for collision of two atoms of the same isotopic species are 13.33(18) nm for $^{168}$Yb, 3.38(11) nm for $^{170}$Yb, -0.15(19) nm for $^{171}$Yb, -31.7(3.4) nm for $^{172}$Yb, 10.55(11) nm for $^{173}$Yb, 5.55(8) nm for $^{174}$Yb, and -1.28(23) nm for $^{176}$Yb. The coefficient of the lead term of the long-range van der Waals potential of the Yb$_2$ molecule is $C_6 = 1932(30)$ atomic units ($E_h a_0^6 \approx 9.573\times 10^{-26}$ J nm$^6$).

Submitted 6 August, 2007; originally announced August 2007.

Comments: 9 pages, 7 figures
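
The unit conversion quoted at the end of the abstract above can be checked with a few lines of arithmetic. This is a small sketch, assuming CODATA 2018 values for the Hartree energy and Bohr radius; the variable names are illustrative and the constants are not taken from the listing itself.

```python
# Assumed CODATA 2018 constants (not stated in the abstract).
HARTREE_J = 4.3597447222e-18   # Hartree energy E_h in joules
BOHR_NM = 0.0529177210903      # Bohr radius a_0 in nanometres

# One atomic unit of C_6 is E_h * a_0^6, expressed here in J nm^6.
c6_atomic_unit = HARTREE_J * BOHR_NM**6

# The abstract's central value C_6 = 1932 atomic units, converted to J nm^6.
c6_yb2 = 1932 * c6_atomic_unit

print(f"1 a.u. of C_6 = {c6_atomic_unit:.4e} J nm^6")
print(f"C_6(Yb_2)     = {c6_yb2:.4e} J nm^6")
```

The first value agrees with the factor $9.573\times 10^{-26}$ J nm$^6$ given in the abstract; the stated uncertainty of 30 atomic units scales by the same factor.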

Showing 1–5 of 5 results for author: Kasa, K