Skip to main content

Showing 1–12 of 12 results for author: Pabbaraju, C

  1. arXiv:2406.15916  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Credit Attribution and Stable Compression

    Authors: Roi Livni, Shay Moran, Kobbi Nissim, Chirag Pabbaraju

    Abstract: Credit attribution is crucial across various fields. In academic research, proper citation acknowledges prior work and establishes original contributions. Similarly, in generative models, such as those trained on existing artworks or music, it is important to ensure that any generated content influenced by these works appropriately credits the original creators. We study credit attribution by ma… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 15 pages, 1 figure

  2. arXiv:2405.15116  [pdf, other

    cs.LG cs.AI

    Quantifying the Gain in Weak-to-Strong Generalization

    Authors: Moses Charikar, Chirag Pabbaraju, Kirankumar Shiragur

    Abstract: Recent advances in large language models have shown capabilities that are extraordinary and near-superhuman. These models operate with such complexity that reliably evaluating and aligning them proves challenging for humans. This leads to the natural question: can guidance from weak models (like humans) adequately direct the capabilities of strong models? In a recent and somewhat surprising work,… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2312.02435  [pdf, other

    cs.DS

    Embedding Probability Distributions into Low Dimensional $\ell_1$: Tree Ising Models via Truncated Metrics

    Authors: Moses Charikar, Spencer Compton, Chirag Pabbaraju

    Abstract: Given an arbitrary set of high dimensional points in $\ell_1$, there are known negative results that preclude the possibility of always mapping them to a low dimensional $\ell_1$ space while preserving distances with small multiplicative distortion. This is in stark contrast with dimension reduction in Euclidean space ($\ell_2$) where such mappings are always possible. While the first non-trivial… ▽ More

    Submitted 5 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  4. arXiv:2311.11194  [pdf, other

    cs.DS cs.IT cs.LG stat.ML

    Testing with Non-identically Distributed Samples

    Authors: Shivam Garg, Chirag Pabbaraju, Kirankumar Shiragur, Gregory Valiant

    Abstract: We examine the extent to which sublinear-sample property testing and estimation applies to settings where samples are independently but not identically distributed. Specifically, we consider the following distributional property testing framework: Suppose there is a set of distributions over a discrete support of size $k$, $\textbf{p}_1, \textbf{p}_2,\ldots,\textbf{p}_T$, and we obtain $c$ indepen… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  5. arXiv:2310.01551  [pdf, other

    cs.LG cs.AI cs.DS

    Harnessing the Power of Choices in Decision Tree Learning

    Authors: Guy Blanc, Jane Lange, Chirag Pabbaraju, Colin Sullivan, Li-Yang Tan, Mo Tiwari

    Abstract: We propose a simple generalization of standard and empirically successful decision tree learning algorithms such as ID3, C4.5, and CART. These algorithms, which have been central to machine learning for decades, are greedy in nature: they grow a decision tree by iteratively splitting on the best attribute. Our algorithm, Top-$k$, considers the $k$ best attributes as possible splits instead of just… ▽ More

    Submitted 25 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

    ACM Class: I.2.0; I.2.m

  6. arXiv:2308.06424  [pdf, ps, other

    cs.LG stat.ML

    Multiclass Learnability Does Not Imply Sample Compression

    Authors: Chirag Pabbaraju

    Abstract: A hypothesis class admits a sample compression scheme, if for every sample labeled by a hypothesis from the class, it is possible to retain only a small subsample, using which the labels on the entire sample can be inferred. The size of the compression scheme is an upper bound on the size of the subsample produced. Every learnable binary hypothesis class (which must necessarily have finite VC dime… ▽ More

    Submitted 20 September, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

  7. arXiv:2306.01993  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Provable benefits of score matching

    Authors: Chirag Pabbaraju, Dhruv Rohatgi, Anish Sevekari, Holden Lee, Ankur Moitra, Andrej Risteski

    Abstract: Score matching is an alternative to maximum likelihood (ML) for estimating a probability distribution parametrized up to a constant of proportionality. By fitting the ''score'' of the distribution, it sidesteps the need to compute this constant of proportionality (which is often intractable). While score matching and variants thereof are popular in practice, precise theoretical understanding of th… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 25 Pages

  8. arXiv:2211.04956  [pdf, ps, other

    stat.ML cs.DS cs.LG

    A Characterization of List Learnability

    Authors: Moses Charikar, Chirag Pabbaraju

    Abstract: A classical result in learning theory shows the equivalence of PAC learnability of binary hypothesis classes and the finiteness of VC dimension. Extending this to the multiclass setting was an open problem, which was settled in a recent breakthrough result characterizing multiclass PAC learnability via the DS dimension introduced earlier by Daniely and Shalev-Shwartz. In this work we consider list… ▽ More

    Submitted 26 March, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

  9. arXiv:2210.00189  [pdf, other

    cs.LG stat.ML

    Pitfalls of Gaussians as a noise distribution in NCE

    Authors: Holden Lee, Chirag Pabbaraju, Anish Sevekari, Andrej Risteski

    Abstract: Noise Contrastive Estimation (NCE) is a popular approach for learning probability density functions parameterized up to a constant of proportionality. The main idea is to design a classification problem for distinguishing training data from samples from an easy-to-sample noise distribution $q$, in a manner that avoids having to calculate a partition function. It is well-known that the choice of… ▽ More

    Submitted 2 March, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 14 pages, 1 figure

  10. arXiv:2107.02951  [pdf, ps, other

    cs.LG stat.ML

    Universal Approximation for Log-concave Distributions using Well-conditioned Normalizing Flows

    Authors: Holden Lee, Chirag Pabbaraju, Anish Sevekari, Andrej Risteski

    Abstract: Normalizing flows are a widely used class of latent-variable generative models with a tractable likelihood. Affine-coupling (Dinh et al, 2014-16) models are a particularly common type of normalizing flows, for which the Jacobian of the latent-to-observable-variable transformation is triangular, allowing the likelihood to be computed in linear time. Despite the widespread usage of affine couplings,… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 40 pages, 0 figures

  11. arXiv:2012.02661  [pdf, other

    cs.LG stat.ML

    Efficient semidefinite-programming-based inference for binary and multi-class MRFs

    Authors: Chirag Pabbaraju, Po-Wei Wang, J. Zico Kolter

    Abstract: Probabilistic inference in pairwise Markov Random Fields (MRFs), i.e. computing the partition function or computing a MAP estimate of the variables, is a foundational problem in probabilistic graphical models. Semidefinite programming relaxations have long been a theoretically powerful tool for analyzing properties of probabilistic inference, but have not been practical owing to the high computati… ▽ More

    Submitted 30 April, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: Accepted at NeurIPS'20. The code can be found at https://github.com/locuslab/sdp_mrf

  12. arXiv:1907.05638  [pdf, other

    cs.LG stat.ML

    Learning Functions over Sets via Permutation Adversarial Networks

    Authors: Chirag Pabbaraju, Prateek Jain

    Abstract: In this paper, we consider the problem of learning functions over sets, i.e., functions that are invariant to permutations of input set items. Recent approaches of pooling individual element embeddings can necessitate extremely large embedding sizes for challenging functions. We address this challenge by allowing standard neural networks like LSTMs to succinctly capture the function over the set.… ▽ More

    Submitted 10 January, 2020; v1 submitted 12 July, 2019; originally announced July 2019.