Skip to main content

Showing 1–50 of 57 results for author: Valiant, G

  1. arXiv:2312.02417  [pdf, other

    math.ST cs.DS stat.ML

    Near-Optimal Mean Estimation with Unknown, Heteroskedastic Variances

    Authors: Spencer Compton, Gregory Valiant

    Abstract: Given data drawn from a collection of Gaussian variables with a common mean but different and unknown variances, what is the best algorithm for estimating their common mean? We present an intuitive and efficient algorithm for this task. As different closed-form guarantees can be hard to compare, the Subset-of-Signals model serves as a benchmark for heteroskedastic mean estimation: given $n$ Gaussi… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  2. arXiv:2311.16342  [pdf, other

    cs.CC cs.DS

    Matrix Multiplication in Quadratic Time and Energy? Towards a Fine-Grained Energy-Centric Church-Turing Thesis

    Authors: Gregory Valiant

    Abstract: We describe two algorithms for multiplying n x n matrices using time and energy n^2 polylog(n) under basic models of classical physics. The first algorithm is for multiplying integer-valued matrices, and the second, quite different algorithm, is for Boolean matrix multiplication. We hope this work inspires a deeper consideration of physically plausible/realizable models of computing that might all… ▽ More

    Submitted 12 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

  3. arXiv:2311.11194  [pdf, other

    cs.DS cs.IT cs.LG stat.ML

    Testing with Non-identically Distributed Samples

    Authors: Shivam Garg, Chirag Pabbaraju, Kirankumar Shiragur, Gregory Valiant

    Abstract: We examine the extent to which sublinear-sample property testing and estimation applies to settings where samples are independently but not identically distributed. Specifically, we consider the following distributional property testing framework: Suppose there is a set of distributions over a discrete support of size $k$, $\textbf{p}_1, \textbf{p}_2,\ldots,\textbf{p}_T$, and we obtain $c$ indepen… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  4. arXiv:2306.04049  [pdf, other

    cs.LG cs.DS stat.ML

    One-sided Matrix Completion from Two Observations Per Row

    Authors: Steven Cao, Percy Liang, Gregory Valiant

    Abstract: Given only a few observed entries from a low-rank matrix $X$, matrix completion is the problem of imputing the missing entries, and it formalizes a wide range of real-world settings that involve estimating missing data. However, when there are too few observed entries to complete the matrix, what other aspects of the underlying matrix can be reliably recovered? We study one such problem setting, t… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  5. arXiv:2305.16349  [pdf, other

    cs.CL cs.AI cs.LG

    Lexinvariant Language Models

    Authors: Qian Huang, Eric Zelikman, Sarah Li Chen, Yuhuai Wu, Gregory Valiant, Percy Liang

    Abstract: Token embeddings, a mapping from discrete lexical symbols to continuous vectors, are at the heart of any language model (LM). However, lexical symbol meanings can also be determined and even redefined by their structural role in a long context. In this paper, we ask: is it possible for a language model to be performant without \emph{any} fixed token embeddings? Such a language model would have to… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  6. arXiv:2210.00655  [pdf, other

    cs.DS cs.GT

    Online Pen Testing

    Authors: Mingda Qiao, Gregory Valiant

    Abstract: We study a "pen testing" problem, in which we are given $n$ pens with unknown amounts of ink $X_1, X_2, \ldots, X_n$, and we want to choose a pen with the maximum amount of remaining ink in it. The challenge is that we cannot access each $X_i$ directly; we only get to write with the $i$-th pen until either a certain amount of ink is used, or the pen runs out of ink. In both cases, this testing red… ▽ More

    Submitted 21 November, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: To appear at ITCS 2023; v2 added discussion on a closely related work of Awerbuch, Azar, Fiat, and Leighton (1996)

  7. arXiv:2208.01066  [pdf, other

    cs.CL cs.LG

    What Can Transformers Learn In-Context? A Case Study of Simple Function Classes

    Authors: Shivam Garg, Dimitris Tsipras, Percy Liang, Gregory Valiant

    Abstract: In-context learning refers to the ability of a model to condition on a prompt sequence consisting of in-context examples (input-output pairs corresponding to some task) along with a new query input, and generate the corresponding output. Crucially, in-context learning happens only at inference time without any parameter updates to the model. While large language models such as GPT-3 exhibit some a… ▽ More

    Submitted 11 August, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

  8. arXiv:2204.12615  [pdf, other

    cs.DC cs.NI

    From Sand to Flour: The Next Leap in Granular Computing with NanoSort

    Authors: Theo Jepsen, Stephen Ibanez, Gregory Valiant, Nick McKeown

    Abstract: The granularity of distributed computing is limited by communication time: there is no point in farming out smaller and smaller tasks if the communication overhead dominates the decrease in processing time due to the added parallelism. In this work, we leverage the low communication latency of a new NIC/CPU hardware design, the nanoPU, to explore a new extreme of granularity in distributed computa… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

  9. arXiv:2203.15260  [pdf, other

    cs.LG cs.CC cs.DS math.OC stat.ML

    Efficient Convex Optimization Requires Superlinear Memory

    Authors: Annie Marsden, Vatsal Sharan, Aaron Sidford, Gregory Valiant

    Abstract: We show that any memory-constrained, first-order algorithm which minimizes $d$-dimensional, $1$-Lipschitz convex functions over the unit ball to $1/\mathrm{poly}(d)$ accuracy using at most $d^{1.25 - δ}$ bits of memory must make at least $\tildeΩ(d^{1 + (4/3)δ})$ first-order queries (for any constant $δ\in [0, 1/4]$). Consequently, the performance of such memory-constrained algorithms are a polyno… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 33 pages, 1 figure

  10. arXiv:2201.04315  [pdf, ps, other

    math.ST cs.DS cs.IT cs.LG

    On the Statistical Complexity of Sample Amplification

    Authors: Brian Axelrod, Shivam Garg, Yanjun Han, Vatsal Sharan, Gregory Valiant

    Abstract: Given $n$ i.i.d. samples drawn from an unknown distribution $P$, when is it possible to produce a larger set of $n+m$ samples which cannot be distinguished from $n+m$ i.i.d. samples drawn from $P$? (Axelrod et al. 2019) formalized this question as the sample amplification problem, and gave optimal amplification procedures for discrete distributions and Gaussian location models. However, these proc… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

  11. arXiv:2111.03137  [pdf, other

    math.OC cs.DS cs.LG stat.ML

    Big-Step-Little-Step: Efficient Gradient Methods for Objectives with Multiple Scales

    Authors: Jonathan Kelner, Annie Marsden, Vatsal Sharan, Aaron Sidford, Gregory Valiant, Honglin Yuan

    Abstract: We provide new gradient-based methods for efficiently solving a broad class of ill-conditioned optimization problems. We consider the problem of minimizing a function $f : \mathbb{R}^d \rightarrow \mathbb{R}$ which is implicitly decomposable as the sum of $m$ unknown non-interacting smooth, strongly convex functions and provide a method which solves this problem with a number of gradient evaluatio… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 95 pages, 4 figures; authors are listed in alphabetical order

  12. arXiv:2106.15662  [pdf, other

    cs.LG cs.DS stat.ML

    Exponential Weights Algorithms for Selective Learning

    Authors: Mingda Qiao, Gregory Valiant

    Abstract: We study the selective learning problem introduced by Qiao and Valiant (2019), in which the learner observes $n$ labeled data points one at a time. At a time of its choosing, the learner selects a window length $w$ and a model $\hat\ell$ from the model class $\mathcal{L}$, and then labels the next $w$ data points using $\hat\ell$. The excess risk incurred by the learner is defined as the differenc… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: To appear in COLT 2021

  13. arXiv:2102.08622  [pdf, other

    cs.LG stat.ML

    Sinkhorn Label Allocation: Semi-Supervised Classification via Annealed Self-Training

    Authors: Kai Sheng Tai, Peter Bailis, Gregory Valiant

    Abstract: Self-training is a standard approach to semi-supervised learning where the learner's own predictions on unlabeled data are used as supervision during training. In this paper, we reinterpret this label assignment process as an optimal transportation problem between examples and classes, wherein the cost of assigning an example to a class is mediated by the current predictions of the classifier. Thi… ▽ More

    Submitted 11 June, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: ICML 2021 camera ready version

  14. arXiv:2101.05234  [pdf, other

    stat.ML cs.LG

    On Misspecification in Prediction Problems and Robustness via Improper Learning

    Authors: Annie Marsden, John Duchi, Gregory Valiant

    Abstract: We study probabilistic prediction games when the underlying model is misspecified, investigating the consequences of predicting using an incorrect parametric model. We show that for a broad class of loss functions and parametric families of distributions, the regret of playing a "proper" predictor -- one from the putative model class -- relative to the best predictor in the same model class has lo… ▽ More

    Submitted 29 January, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 28 pages, 6 figures

  15. arXiv:2012.03454  [pdf, other

    cs.LG cs.DS stat.ML

    Stronger Calibration Lower Bounds via Sidestepping

    Authors: Mingda Qiao, Gregory Valiant

    Abstract: We consider an online binary prediction setting where a forecaster observes a sequence of $T$ bits one by one. Before each bit is revealed, the forecaster predicts the probability that the bit is $1$. The forecaster is called well-calibrated if for each $p \in [0, 1]$, among the $n_p$ bits for which the forecaster predicts probability $p$, the actual number of ones, $m_p$, is indeed equal to… ▽ More

    Submitted 6 October, 2023; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: STOC 2021; v3 fixed a typo in the statement of Theorem 1

  16. arXiv:2005.00695  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    On the Generalization Effects of Linear Transformations in Data Augmentation

    Authors: Sen Wu, Hongyang R. Zhang, Gregory Valiant, Christopher Ré

    Abstract: Data augmentation is a powerful technique to improve performance in applications such as image and text classification tasks. Yet, there is little rigorous understanding of why and how various augmentations work. In this work, we consider a family of linear transformations and study their effects on the ridge estimator in an over-parametrized linear regression setting. First, we show that transfor… ▽ More

    Submitted 26 July, 2023; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: 22 pages. Appeared in ICML 2020

  17. arXiv:1912.06111  [pdf, other

    cs.LG stat.ML

    Sublinear Optimal Policy Value Estimation in Contextual Bandits

    Authors: Weihao Kong, Gregory Valiant, Emma Brunskill

    Abstract: We study the problem of estimating the expected reward of the optimal policy in the stochastic disjoint linear bandit setting. We prove that for certain settings it is possible to obtain an accurate estimate of the optimal policy value even with a number of samples that is sublinear in the number that would be required to \emph{find} a policy that realizes a value close to this optima. We establis… ▽ More

    Submitted 13 December, 2019; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: Extended to the mixture of Gaussians setting

  18. arXiv:1911.03605  [pdf, other

    cs.DS cs.LG stat.ML

    Worst-Case Analysis for Randomly Collected Data

    Authors: Justin Y. Chen, Gregory Valiant, Paul Valiant

    Abstract: We introduce a framework for statistical estimation that leverages knowledge of how samples are collected but makes no distributional assumptions on the data values. Specifically, we consider a population of elements $[n]={1,\ldots,n}$ with corresponding data values $x_1,\ldots,x_n$. We observe the values for a "sample" set $A \subset [n]$ and wish to estimate some statistic of the values for a "t… ▽ More

    Submitted 26 October, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

  19. arXiv:1907.08306  [pdf, other

    cs.DS stat.CO

    A Polynomial Time Algorithm for Log-Concave Maximum Likelihood via Locally Exponential Families

    Authors: Brian Axelrod, Ilias Diakonikolas, Anastasios Sidiropoulos, Alistair Stewart, Gregory Valiant

    Abstract: We consider the problem of computing the maximum likelihood multivariate log-concave distribution for a set of points. Specifically, we present an algorithm which, given $n$ points in $\mathbb{R}^d$ and an accuracy parameter $ε>0$, runs in time $poly(n,d,1/ε),$ and returns a log-concave distribution which, with high probability, has the property that the likelihood of the $n$ points under the retu… ▽ More

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: The present paper is a merger of two independent works arXiv:1811.03204 and arXiv:1812.05524, proposing essentially the same algorithm to compute the log-concave MLE

  20. arXiv:1907.05012  [pdf, other

    cs.LG stat.ML

    Making AI Forget You: Data Deletion in Machine Learning

    Authors: Antonio Ginart, Melody Y. Guan, Gregory Valiant, James Zou

    Abstract: Intense recent discussions have focused on how to provide individuals with control over when their data can and cannot be used --- the EU's Right To Be Forgotten regulation is an example of this effort. In this paper we initiate a framework studying what to do when it is no longer permissible to deploy models derivative from specific user data. In particular, we formulate the problem of efficientl… ▽ More

    Submitted 4 November, 2019; v1 submitted 11 July, 2019; originally announced July 2019.

    Comments: To appear in NeurIPS 2019

  21. arXiv:1906.01040  [pdf, other

    cs.SD cs.CL cs.LG eess.AS stat.ML

    A Surprising Density of Illusionable Natural Speech

    Authors: Melody Y. Guan, Gregory Valiant

    Abstract: Recent work on adversarial examples has demonstrated that most natural inputs can be perturbed to fool even state-of-the-art machine learning systems. But does this happen for humans as well? In this work, we investigate: what fraction of natural instances of speech can be turned into "illusions" which either alter humans' perception or result in different people having significantly different per… ▽ More

    Submitted 19 August, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: CogSci 2019

  22. arXiv:1904.12053  [pdf, other

    cs.LG math.ST stat.ML

    Sample Amplification: Increasing Dataset Size even when Learning is Impossible

    Authors: Brian Axelrod, Shivam Garg, Vatsal Sharan, Gregory Valiant

    Abstract: Given data drawn from an unknown distribution, $D$, to what extent is it possible to ``amplify'' this dataset and output an even larger set of samples that appear to have been drawn from $D$? We formalize this question as follows: an $(n,m)$ $\text{amplification procedure}$ takes as input $n$ independent draws from an unknown distribution $D$, and outputs a set of $m > n$ ``samples''. An amplifica… ▽ More

    Submitted 2 December, 2019; v1 submitted 26 April, 2019; originally announced April 2019.

    Comments: Added discussion about potential applications

  23. arXiv:1904.09080  [pdf, other

    cs.LG stat.ML

    Implicit regularization for deep neural networks driven by an Ornstein-Uhlenbeck like process

    Authors: Guy Blanc, Neha Gupta, Gregory Valiant, Paul Valiant

    Abstract: We consider networks, trained via stochastic gradient descent to minimize $\ell_2$ loss, with the training labels perturbed by independent noise at each iteration. We characterize the behavior of the training dynamics near any parameter vector that achieves zero training error, in terms of an implicit regularization term corresponding to the sum over the data points, of the squared $\ell_2$ norm o… ▽ More

    Submitted 22 July, 2020; v1 submitted 19 April, 2019; originally announced April 2019.

  24. arXiv:1904.08544  [pdf, ps, other

    cs.LG stat.ML

    Memory-Sample Tradeoffs for Linear Regression with Small Error

    Authors: Vatsal Sharan, Aaron Sidford, Gregory Valiant

    Abstract: We consider the problem of performing linear regression over a stream of $d$-dimensional examples, and show that any algorithm that uses a subquadratic amount of memory exhibits a slower rate of convergence than can be achieved without memory constraints. Specifically, consider a sequence of labeled examples $(a_1,b_1), (a_2,b_2)\ldots,$ with $a_i$ drawn independently from a $d$-dimensional isotro… ▽ More

    Submitted 10 October, 2020; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: A few minor edits over previous version

  25. arXiv:1902.04553  [pdf, ps, other

    math.ST cs.LG stat.ML

    Maximum Likelihood Estimation for Learning Populations of Parameters

    Authors: Ramya Korlakai Vinayak, Weihao Kong, Gregory Valiant, Sham M. Kakade

    Abstract: Consider a setting with $N$ independent individuals, each with an unknown parameter, $p_i \in [0, 1]$ drawn from some unknown distribution $P^\star$. After observing the outcomes of $t$ independent Bernoulli trials, i.e., $X_i \sim \text{Binomial}(t, p_i)$ per individual, our objective is to accurately estimate $P^\star$. This problem arises in numerous domains, including the social sciences, psyc… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  26. arXiv:1902.04256  [pdf, other

    cs.LG cs.DS stat.ML

    A Theory of Selective Prediction

    Authors: Mingda Qiao, Gregory Valiant

    Abstract: We consider a model of selective prediction, where the prediction algorithm is given a data sequence in an online fashion and asked to predict a pre-specified statistic of the upcoming data points. The algorithm is allowed to choose when to make the prediction as well as the length of the prediction window, possibly depending on the observations so far. We prove that, even without any distribution… ▽ More

    Submitted 28 May, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: COLT 2019

  27. arXiv:1901.11399  [pdf, other

    cs.CV cs.LG stat.ML

    Equivariant Transformer Networks

    Authors: Kai Sheng Tai, Peter Bailis, Gregory Valiant

    Abstract: How can prior knowledge on the transformation invariances of a domain be incorporated into the architecture of a neural network? We propose Equivariant Transformers (ETs), a family of differentiable image-to-image mappings that improve the robustness of models towards pre-defined continuous transformation groups. Through the use of specially-derived canonical coordinate systems, ETs incorporate fu… ▽ More

    Submitted 24 May, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: ICML 2019

  28. arXiv:1811.06609  [pdf, other

    cs.LG stat.ML

    A Spectral View of Adversarially Robust Features

    Authors: Shivam Garg, Vatsal Sharan, Brian Hu Zhang, Gregory Valiant

    Abstract: Given the apparent difficulty of learning models that are robust to adversarial perturbations, we propose tackling the simpler problem of developing adversarially robust features. Specifically, given a dataset and metric of interest, the goal is to return a function (or multiple functions) that 1) is robust to adversarial perturbations, and 2) has significant variation across the datapoints. We es… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

    Comments: To appear at NIPS 2018

  29. arXiv:1811.03204  [pdf, other

    cs.DS stat.CO

    An Efficient Algorithm for High-Dimensional Log-Concave Maximum Likelihood

    Authors: Brian Axelrod, Gregory Valiant

    Abstract: The log-concave maximum likelihood estimator (MLE) problem answers: for a set of points $X_1,...X_n \in \mathbb R^d$, which log-concave density maximizes their likelihood? We present a characterization of the log-concave MLE that leads to an algorithm with runtime $poly(n,d, \frac 1 ε,r)$ to compute a log-concave distribution whose log-likelihood is at most $ε$ less than that of the MLE, and $r$ i… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

  30. arXiv:1805.01626  [pdf, other

    cs.LG stat.ML

    Estimating Learnability in the Sublinear Data Regime

    Authors: Weihao Kong, Gregory Valiant

    Abstract: We consider the problem of estimating how well a model class is capable of fitting a distribution of labeled data. We show that it is often possible to accurately estimate this "learnability" even when given an amount of data that is too small to reliably learn any accurate model. Our first result applies to the setting where the data is drawn from a $d$-dimensional distribution with isotropic cov… ▽ More

    Submitted 25 March, 2019; v1 submitted 4 May, 2018; originally announced May 2018.

  31. arXiv:1712.01725  [pdf, other

    cs.DS

    Approximating the Spectrum of a Graph

    Authors: David Cohen-Steiner, Weihao Kong, Christian Sohler, Gregory Valiant

    Abstract: The spectrum of a network or graph $G=(V,E)$ with adjacency matrix $A$, consists of the eigenvalues of the normalized Laplacian $L= I - D^{-1/2} A D^{-1/2}$. This set of eigenvalues encapsulates many aspects of the structure of the graph, including the extent to which the graph posses community structures at multiple scales. We study the problem of approximating the spectrum… ▽ More

    Submitted 5 December, 2017; originally announced December 2017.

  32. arXiv:1711.08113  [pdf, ps, other

    cs.LG

    Learning Discrete Distributions from Untrusted Batches

    Authors: Mingda Qiao, Gregory Valiant

    Abstract: We consider the problem of learning a discrete distribution in the presence of an $ε$ fraction of malicious data sources. Specifically, we consider the setting where there is some underlying distribution, $p$, and each data source provides a batch of $\ge k$ samples, with the guarantee that at least a $(1-ε)$ fraction of the sources draw their samples from a distribution with total variation dista… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

  33. arXiv:1711.02309  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Overcomplete HMMs

    Authors: Vatsal Sharan, Sham Kakade, Percy Liang, Gregory Valiant

    Abstract: We study the problem of learning overcomplete HMMs---those that have many hidden states but a small output alphabet. Despite having significant practical importance, such HMMs are poorly understood with no known positive or negative results for efficient learning. In this paper, we present several new results---both positive and negative---which help define the boundaries between the tractable and… ▽ More

    Submitted 27 June, 2018; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: Added acknowledgements

  34. arXiv:1711.02305  [pdf, other

    cs.LG cs.DS stat.ML

    Sketching Linear Classifiers over Data Streams

    Authors: Kai Sheng Tai, Vatsal Sharan, Peter Bailis, Gregory Valiant

    Abstract: We introduce a new sub-linear space sketch---the Weight-Median Sketch---for learning compressed linear classifiers over data streams while supporting the efficient recovery of large-magnitude weights in the model. This enables memory-limited execution of several statistical analyses over streams, including online feature selection, streaming data explanation, relative deltoid detection, and stream… ▽ More

    Submitted 6 April, 2018; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: Full version of paper appearing at SIGMOD 2018 with more detailed proofs of theoretical results. Code available at https://github.com/stanford-futuredata/wmsketch

  35. arXiv:1709.02707  [pdf, other

    cs.LG

    Learning Populations of Parameters

    Authors: Kevin Tian, Weihao Kong, Gregory Valiant

    Abstract: Consider the following estimation problem: there are $n$ entities, each with an unknown parameter $p_i \in [0,1]$, and we observe $n$ independent random variables, $X_1,\ldots,X_n$, with $X_i \sim $ Binomial$(t, p_i)$. How accurately can one recover the "histogram" (i.e. cumulative density function) of the $p_i$'s? While the empirical estimates would recover the histogram to earth mover distance… ▽ More

    Submitted 22 November, 2017; v1 submitted 8 September, 2017; originally announced September 2017.

  36. arXiv:1708.02740  [pdf, ps, other

    cs.LG cs.IT

    A Data Prism: Semi-Verified Learning in the Small-Alpha Regime

    Authors: Michela Meister, Gregory Valiant

    Abstract: We consider a model of unreliable or crowdsourced data where there is an underlying set of $n$ binary variables, each evaluator contributes a (possibly unreliable or adversarial) estimate of the values of some subset of $r$ of the variables, and the learner is given the true value of a constant number of variables. We show that, provided an $α$-fraction of the evaluators are "good" (either correct… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

  37. arXiv:1707.03854  [pdf, other

    cs.LG stat.ML

    Estimating the unseen from multiple populations

    Authors: Aditi Raghunathan, Greg Valiant, James Zou

    Abstract: Given samples from a distribution, how many new elements should we expect to find if we continue sampling this distribution? This is an important and actively studied problem, with many applications ranging from unseen species estimation to genomics. We generalize this extrapolation and related unseen estimation problems to the multiple population setting, where population $j$ has an unknown distr… ▽ More

    Submitted 12 July, 2017; originally announced July 2017.

    Comments: 13 pages, 3 figures, appearing at the International Conference on Machine Learning 2017 (ICML 2017)

  38. arXiv:1706.08146  [pdf, other

    cs.LG cs.AI stat.ML

    Compressed Factorization: Fast and Accurate Low-Rank Factorization of Compressively-Sensed Data

    Authors: Vatsal Sharan, Kai Sheng Tai, Peter Bailis, Gregory Valiant

    Abstract: What learning algorithms can be run directly on compressively-sensed data? In this work, we consider the question of accurately and efficiently computing low-rank matrix or tensor factorizations given data compressed via random projections. We examine the approach of first performing factorization in the compressed domain, and then reconstructing the original high-dimensional factors from the reco… ▽ More

    Submitted 27 May, 2019; v1 submitted 25 June, 2017; originally announced June 2017.

    Comments: Updates for ICML'19 camera-ready

  39. arXiv:1703.04940  [pdf, other

    cs.LG cs.AI cs.CC cs.CR stat.ML

    Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers

    Authors: Jacob Steinhardt, Moses Charikar, Gregory Valiant

    Abstract: We introduce a criterion, resilience, which allows properties of a dataset (such as its mean or best low rank approximation) to be robustly computed, even in the presence of a large fraction of arbitrary additional data. Resilience is a weaker condition than most other properties considered so far in the literature, and yet enables robust estimation in a broader variety of settings. We provide new… ▽ More

    Submitted 26 November, 2017; v1 submitted 15 March, 2017; originally announced March 2017.

    Comments: 32 pages, full version of ITCS2018 paper (minor citation edit from v2)

  40. arXiv:1703.01804  [pdf, other

    cs.LG stat.ML

    Orthogonalized ALS: A Theoretically Principled Tensor Decomposition Algorithm for Practical Use

    Authors: Vatsal Sharan, Gregory Valiant

    Abstract: The popular Alternating Least Squares (ALS) algorithm for tensor decomposition is efficient and easy to implement, but often converges to poor local optima---particularly when the weights of the factors are non-uniform. We propose a modification of the ALS approach that is as efficient as standard ALS, but provably recovers the true factors with random initialization under standard incoherence ass… ▽ More

    Submitted 23 September, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: Minor updates to presentation. Appears in ICML'17

  41. arXiv:1612.02526  [pdf, other

    cs.LG cs.AI cs.CC stat.ML

    Prediction with a Short Memory

    Authors: Vatsal Sharan, Sham Kakade, Percy Liang, Gregory Valiant

    Abstract: We consider the problem of predicting the next observation given a sequence of past observations, and consider the extent to which accurate prediction requires complex algorithms that explicitly leverage long-range dependencies. Perhaps surprisingly, our positive results show that for a broad class of sequences, there is an algorithm that predicts well on average, and bases its predictions only on… ▽ More

    Submitted 27 June, 2018; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: Updates for STOC camera ready

  42. arXiv:1611.02315  [pdf, ps, other

    cs.LG cs.AI cs.CC cs.CR math.ST

    Learning from Untrusted Data

    Authors: Moses Charikar, Jacob Steinhardt, Gregory Valiant

    Abstract: The vast majority of theoretical results in machine learning and statistics assume that the available training data is a reasonably reliable reflection of the phenomena to be learned or estimated. Similarly, the majority of machine learning and statistical techniques used in practice are brittle to the presence of large amounts of biased or malicious data. In this work we consider two frameworks i… ▽ More

    Submitted 11 June, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: Updated based on STOC camera-ready

  43. arXiv:1606.05374  [pdf, ps, other

    cs.HC cs.CR cs.DS cs.GT cs.LG

    Avoiding Imposters and Delinquents: Adversarial Crowdsourcing and Peer Prediction

    Authors: Jacob Steinhardt, Gregory Valiant, Moses Charikar

    Abstract: We consider a crowdsourcing model in which $n$ workers are asked to rate the quality of $n$ items previously generated by other workers. An unknown set of $αn$ workers generate reliable ratings, while the remaining workers may behave arbitrarily and possibly adversarially. The manager of the experiment can also manually evaluate the quality of a small number of items, and wishes to curate together… ▽ More

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: 18 pages

  44. arXiv:1605.02646  [pdf, other

    cs.CR

    Information Theoretically Secure Databases

    Authors: Gregory Valiant, Paul Valiant

    Abstract: We introduce the notion of a database system that is information theoretically "Secure In Between Accesses"--a database system with the properties that 1) users can efficiently access their data, and 2) while a user is not accessing their data, the user's information is information theoretically secure to malicious agents, provided that certain requirements on the maintenance of the database are r… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 16 pages, 2 figures

  45. arXiv:1602.06586  [pdf, ps, other

    cs.LG

    Recovering Structured Probability Matrices

    Authors: Qingqing Huang, Sham M. Kakade, Weihao Kong, Gregory Valiant

    Abstract: We consider the problem of accurately recovering a matrix B of size M by M , which represents a probability distribution over M2 outcomes, given access to an observed matrix of "counts" generated by taking independent samples from the distribution B. How can structural properties of the underlying matrix B be leveraged to yield computationally efficient and information theoretically optimal recons… ▽ More

    Submitted 5 February, 2018; v1 submitted 21 February, 2016; originally announced February 2016.

  46. arXiv:1602.00061  [pdf, other

    cs.LG stat.ML

    Spectrum Estimation from Samples

    Authors: Weihao Kong, Gregory Valiant

    Abstract: We consider the problem of approximating the set of eigenvalues of the covariance matrix of a multivariate distribution (equivalently, the problem of approximating the "population spectrum"), given access to samples drawn from the distribution. The eigenvalues of the covariance of a distribution contain basic information about the distribution, including the presence or lack of structure in the di… ▽ More

    Submitted 16 July, 2017; v1 submitted 29 January, 2016; originally announced February 2016.

    MSC Class: 62H12; 62H10

  47. arXiv:1504.05321  [pdf, ps, other

    cs.LG

    Instance Optimal Learning

    Authors: Gregory Valiant, Paul Valiant

    Abstract: We consider the following basic learning task: given independent draws from an unknown distribution over a discrete support, output an approximation of the distribution that is as accurate as possible in $\ell_1$ distance (i.e. total variation or statistical distance). Perhaps surprisingly, it is often possible to "de-noise" the empirical distribution of the samples to return an approximation of t… ▽ More

    Submitted 11 November, 2015; v1 submitted 21 April, 2015; originally announced April 2015.

  48. arXiv:1504.04599  [pdf, other

    cs.LG cs.IT math.ST stat.ML

    Testing Closeness With Unequal Sized Samples

    Authors: Bhaswar B. Bhattacharya, Gregory Valiant

    Abstract: We consider the problem of closeness testing for two discrete distributions in the practically relevant setting of \emph{unequal} sized samples drawn from each of them. Specifically, given a target error parameter $\varepsilon > 0$, $m_1$ independent draws from an unknown distribution $p,$ and $m_2$ draws from an unknown distribution $q$, we describe a test for distinguishing the case that $p=q$ f… ▽ More

    Submitted 17 April, 2015; originally announced April 2015.

    Comments: 27 pages, 3 figures

  49. arXiv:1312.1983  [pdf, ps, other

    cs.CC q-bio.PE

    Satisfiability and Evolution

    Authors: Adi Livnat, Christos Papadimitriou, Aviad Rubinstein, Gregory Valiant, Andrew Wan

    Abstract: We show that, if truth assignments on $n$ variables reproduce through recombination so that satisfaction of a particular Boolean function confers a small evolutionary advantage, then a polynomially large population over polynomially many generations (polynomial in $n$ and the inverse of the initial satisfaction probability) will end up almost certainly consisting exclusively of satisfying truth as… ▽ More

    Submitted 11 August, 2014; v1 submitted 6 December, 2013; originally announced December 2013.

    MSC Class: 92D15 ACM Class: F.0

  50. arXiv:1310.1949  [pdf, other

    cs.LG stat.ML

    Least Squares Revisited: Scalable Approaches for Multi-class Prediction

    Authors: Alekh Agarwal, Sham M. Kakade, Nikos Karampatziakis, Le Song, Gregory Valiant

    Abstract: This work provides simple algorithms for multi-class (and multi-label) prediction in settings where both the number of examples n and the data dimension d are relatively large. These robust and parameter free algorithms are essentially iterative least-squares updates and very versatile both in theory and in practice. On the theoretical front, we present several variants with convergence guarantees… ▽ More

    Submitted 21 October, 2013; v1 submitted 7 October, 2013; originally announced October 2013.