Skip to main content

Showing 1–50 of 185 results for author: Woodruff, D P

  1. arXiv:2407.11959  [pdf, other

    cs.DS

    Faster Algorithms for Schatten-p Low Rank Approximation

    Authors: Praneeth Kacham, David P. Woodruff

    Abstract: We study algorithms for the Schatten-$p$ Low Rank Approximation (LRA) problem. First, we show that by using fast rectangular matrix multiplication algorithms and different block sizes, we can improve the running time of the algorithms in the recent work of Bakshi, Clarkson and Woodruff (STOC 2022). We then show that by carefully combining our new algorithm with the algorithm of Li and Woodruff (IC… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.03262  [pdf, ps, other

    cs.DS cs.LG

    Nearly Linear Sparsification of $\ell_p$ Subspace Approximation

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: The $\ell_p$ subspace approximation problem is an NP-hard low rank approximation problem that generalizes the median hyperplane problem ($p = 1$), principal component analysis ($p = 2$), and the center hyperplane problem ($p = \infty$). A popular approach to cope with the NP-hardness of this problem is to compute a strong coreset, which is a small weighted subset of the input points which simultan… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2406.13231  [pdf, other

    cs.DS

    Tight Lower Bounds for Directed Cut Sparsification and Distributed Min-Cut

    Authors: Yu Cheng, Max Li, Honghao Lin, Zi-Yi Tai, David P. Woodruff, Jason Zhang

    Abstract: In this paper, we consider two fundamental cut approximation problems on large graphs. We prove new lower bounds for both problems that are optimal up to logarithmic factors. The first problem is to approximate cuts in balanced directed graphs. In this problem, the goal is to build a data structure that $(1 \pm ε)$-approximates cut values in graphs with $n$ vertices. For arbitrary directed graph… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.06821  [pdf, other

    cs.DS

    Streaming Algorithms with Few State Changes

    Authors: Rajesh Jayaram, David P. Woodruff, Samson Zhou

    Abstract: In this paper, we study streaming algorithms that minimize the number of changes made to their internal state (i.e., memory contents). While the design of streaming algorithms typically focuses on minimizing space and update time, these metrics fail to capture the asymmetric costs, inherent in modern hardware and database systems, of reading versus writing to memory. In fact, most streaming algori… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: PODS 2024

  5. arXiv:2406.06808  [pdf, ps, other

    cs.DS cs.LG

    Fast White-Box Adversarial Streaming Without a Random Oracle

    Authors: Ying Feng, Aayush Jain, David P. Woodruff

    Abstract: Recently, the question of adversarially robust streaming, where the stream is allowed to depend on the randomness of the streaming algorithm, has gained a lot of attention. In this work, we consider a strong white-box adversarial model (Ajtai et al. PODS 2022), in which the adversary has access to all past random coins and the parameters used by the streaming algorithm. We focus on the sparse reco… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  6. arXiv:2406.06735  [pdf, ps, other

    cs.DS

    Fast Sampling Based Sketches for Tensors

    Authors: William Swartworth, David P. Woodruff

    Abstract: We introduce a new approach for applying sampling-based sketches to two and three mode tensors. We illustrate our technique to construct sketches for the classical problems of $\ell_0$ sampling and producing $\ell_1$ embeddings. In both settings we achieve sketches that can be applied to a rank one tensor in $(\mathbb{R}^d)^{\otimes q}$ (for $q=2,3$) in time scaling with $d$ rather than $d^2$ or… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  7. arXiv:2406.02910  [pdf, other

    cs.DS

    High-Dimensional Geometric Streaming for Nearly Low Rank Data

    Authors: Hossein Esfandiari, Vahab Mirrokni, Praneeth Kacham, David P. Woodruff, Peilin Zhong

    Abstract: We study streaming algorithms for the $\ell_p$ subspace approximation problem. Given points $a_1, \ldots, a_n$ as an insertion-only stream and a rank parameter $k$, the $\ell_p$ subspace approximation problem is to find a $k$-dimensional subspace $V$ such that $(\sum_{i=1}^n d(a_i, V)^p)^{1/p}$ is minimized, where $d(a, V)$ denotes the Euclidean distance between $a$ and $V$ defined as… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  8. arXiv:2406.02432  [pdf, other

    cs.DS cs.LG stat.ML

    Coresets for Multiple $\ell_p$ Regression

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: A coreset of a dataset with $n$ examples and $d$ features is a weighted subset of examples that is sufficient for solving downstream data analytic tasks. Nearly optimal constructions of coresets for least squares and $\ell_p$ linear regression with a single response are known in prior work. However, for multiple $\ell_p$ regression where there can be $m$ responses, there are no known constructions… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  9. arXiv:2406.02431  [pdf, other

    cs.DS cs.LG stat.ML

    Reweighted Solutions for Weighted Low Rank Approximation

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: Weighted low rank approximation (WLRA) is an important yet computationally challenging primitive with applications ranging from statistical analysis, model compression, and signal processing. To cope with the NP-hardness of this problem, prior work considers heuristics, bicriteria, or fixed parameter tractable algorithms to solve this problem. In this work, we introduce a new relaxed solution to W… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  10. arXiv:2403.20307  [pdf, ps, other

    cs.DS

    Optimal Communication for Classic Functions in the Coordinator Model and Beyond

    Authors: Hossein Esfandiari, Praneeth Kacham, Vahab Mirrokni, David P. Woodruff, Peilin Zhong

    Abstract: In the coordinator model of communication with $s$ servers, given an arbitrary non-negative function $f$, we study the problem of approximating the sum $\sum_{i \in [n]}f(x_i)$ up to a $1 \pm \varepsilon$ factor. Here the vector $x \in R^n$ is defined to be $x = x(1) + \cdots + x(s)$, where $x(j) \ge 0$ denotes the non-negative vector held by the $j$-th server. A special case of the problem is whe… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 52 pages. To appear in STOC 2024

  11. arXiv:2403.20283  [pdf, ps, other

    cs.CC cs.DS

    A New Information Complexity Measure for Multi-pass Streaming with Applications

    Authors: Mark Braverman, Sumegha Garg, Qian Li, Shuo Wang, David P. Woodruff, Jiapeng Zhang

    Abstract: We introduce a new notion of information complexity for multi-pass streaming problems and use it to resolve several important questions in data streams. In the coin problem, one sees a stream of $n$ i.i.d. uniform bits and one would like to compute the majority with constant advantage. We show that any constant pass algorithm must use $Ω(\log n)$ bits of memory, significantly extending an earlie… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: To appear in STOC 2024

  12. arXiv:2311.17868  [pdf, ps, other

    cs.DS

    Space-Optimal Profile Estimation in Data Streams with Applications to Symmetric Functions

    Authors: Justin Y. Chen, Piotr Indyk, David P. Woodruff

    Abstract: We revisit the problem of estimating the profile (also known as the rarity) in the data stream model. Given a sequence of $m$ elements from a universe of size $n$, its profile is a vector $φ$ whose $i$-th entry $φ_i$ represents the number of distinct elements that appear in the stream exactly $i$ times. A classic paper by Datar and Muthukrishan from 2002 gave an algorithm which estimates any entry… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: To appear in ITCS 2024

  13. arXiv:2311.17281  [pdf, ps, other

    cs.DS cs.IT

    Lower Bounds on Adaptive Sensing for Matrix Recovery

    Authors: Praneeth Kacham, David P Woodruff

    Abstract: We study lower bounds on adaptive sensing algorithms for recovering low rank matrices using linear measurements. Given an $n \times n$ matrix $A$, a general linear measurement $S(A)$, for an $n \times n$ matrix $S$, is just the inner product of $S$ and $A$, each treated as $n^2$-dimensional vectors. By performing as few linear measurements as possible on a rank-$r$ matrix $A$, we hope to construct… ▽ More

    Submitted 19 February, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Fixed minor typos

  14. arXiv:2311.04158  [pdf, other

    cs.LG cs.DS stat.ML

    Computing Approximate $\ell_p$ Sensitivities

    Authors: Swati Padmanabhan, David P. Woodruff, Qiuyi Zhang

    Abstract: Recent works in dimensionality reduction for regression tasks have introduced the notion of sensitivity, an estimate of the importance of a specific datapoint in a dataset, offering provable guarantees on the quality of the approximation after removing low-sensitivity datapoints via subsampling. However, fast algorithms for approximating $\ell_p$ sensitivities, which we show is equivalent to appro… ▽ More

    Submitted 21 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

  15. arXiv:2311.00642  [pdf, other

    cs.DS

    Near-Optimal $k$-Clustering in the Sliding Window Model

    Authors: David P. Woodruff, Peilin Zhong, Samson Zhou

    Abstract: Clustering is an important technique for identifying structural information in large-scale data analysis, where the underlying dataset may be too large to store. In many applications, recent data can provide more accurate information and thus older data past a certain time is expired. The sliding window model captures these desired properties and thus there has been substantial interest in cluster… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  16. arXiv:2310.19068  [pdf, ps, other

    cs.DS cs.LG

    Sketching Algorithms for Sparse Dictionary Learning: PTAS and Turnstile Streaming

    Authors: Gregory Dexter, Petros Drineas, David P. Woodruff, Taisuke Yasuda

    Abstract: Sketching algorithms have recently proven to be a powerful approach both for designing low-space streaming algorithms as well as fast polynomial time approximation schemes (PTAS). In this work, we develop new techniques to extend the applicability of sketching-based approaches to the sparse dictionary learning and the Euclidean $k$-means clustering problems. In particular, we initiate the study of… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: To appear in NeurIPS 2023

  17. arXiv:2310.05869  [pdf, other

    cs.LG cs.AI

    HyperAttention: Long-context Attention in Near-Linear Time

    Authors: Insu Han, Rajesh Jayaram, Amin Karbasi, Vahab Mirrokni, David P. Woodruff, Amir Zandieh

    Abstract: We present an approximate attention mechanism named HyperAttention to address the computational challenges posed by the growing complexity of long contexts used in Large Language Models (LLMs). Recent work suggests that in the worst-case scenario, quadratic time is necessary unless the entries of the attention matrix are bounded or the matrix has low stable rank. We introduce two parameters which… ▽ More

    Submitted 1 December, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  18. arXiv:2310.02882  [pdf, other

    cs.DS

    Streaming Euclidean $k$-median and $k$-means with $o(\log n)$ Space

    Authors: Vincent Cohen-Addad, David P. Woodruff, Samson Zhou

    Abstract: We consider the classic Euclidean $k$-median and $k$-means objective on data streams, where the goal is to provide a $(1+\varepsilon)$-approximation to the optimal $k$-median or $k$-means solution, while using as little memory as possible. Over the last 20 years, clustering in data streams has received a tremendous amount of attention and has been the test-bed for a large variety of new techniques… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: To appear at FOCS 2023

  19. arXiv:2308.15772  [pdf, other

    cs.CL

    Task-Based MoE for Multitask Multilingual Machine Translation

    Authors: Hai Pham, Young Jin Kim, Subhabrata Mukherjee, David P. Woodruff, Barnabas Poczos, Hany Hassan Awadalla

    Abstract: Mixture-of-experts (MoE) architecture has been proven a powerful method for diverse tasks in training deep models in many applications. However, current MoE implementations are task agnostic, treating all tokens from different tasks in the same manner. In this work, we instead design a novel method that incorporates task information into MoE models at different granular levels with shared dynamic… ▽ More

    Submitted 24 October, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  20. arXiv:2307.05117  [pdf, ps, other

    cs.DS cs.DC cs.LG

    $\ell_p$-Regression in the Arbitrary Partition Model of Communication

    Authors: Yi Li, Honghao Lin, David P. Woodruff

    Abstract: We consider the randomized communication complexity of the distributed $\ell_p$-regression problem in the coordinator model, for $p\in (0,2]$. In this problem, there is a coordinator and $s$ servers. The $i$-th server receives $A^i\in\{-M, -M+1, \ldots, M\}^{n\times d}$ and $b^i\in\{-M, -M+1, \ldots, M\}^n$ and the coordinator would like to find a $(1+ε)$-approximate solution to… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  21. arXiv:2307.03529  [pdf, other

    cs.DS

    Improved Algorithms for White-Box Adversarial Streams

    Authors: Ying Feng, David P. Woodruff

    Abstract: We study streaming algorithms in the white-box adversarial stream model, where the internal state of the streaming algorithm is revealed to an adversary who adaptively generates the stream updates, but the algorithm obtains fresh randomness unknown to the adversary at each time step. We incorporate cryptographic assumptions to construct robust algorithms against such adversaries. We propose effici… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: ICML 2023

  22. arXiv:2306.06611  [pdf, other

    cs.LG cs.DS

    Learning the Positions in CountSketch

    Authors: Yi Li, Honghao Lin, Simin Liu, Ali Vakilian, David P. Woodruff

    Abstract: We consider sketching algorithms which first compress data by multiplication with a random sketch matrix, and then apply the sketch to quickly solve an optimization problem, e.g., low-rank approximation and regression. In the learning-based sketching paradigm proposed by~\cite{indyk2019learning}, the sketch matrix is found by choosing a random sparse matrix, e.g., CountSketch, and then the values… ▽ More

    Submitted 10 April, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: Corrected the proof of Theorem 5.1. arXiv admin note: text overlap with arXiv:2007.09890

  23. arXiv:2306.01869  [pdf, other

    cs.DS cs.LG

    Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization

    Authors: Ameya Velingker, Maximilian Vötsch, David P. Woodruff, Samson Zhou

    Abstract: We introduce efficient $(1+\varepsilon)$-approximation algorithms for the binary matrix factorization (BMF) problem, where the inputs are a matrix $\mathbf{A}\in\{0,1\}^{n\times d}$, a rank parameter $k>0$, as well as an accuracy parameter $\varepsilon>0$, and the goal is to approximate $\mathbf{A}$ as a product of low-rank factors $\mathbf{U}\in\{0,1\}^{n\times k}$ and… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  24. arXiv:2306.00732  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Sharper Bounds for $\ell_p$ Sensitivity Sampling

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: In large scale machine learning, random sampling is a popular way to approximate datasets by a small representative subset of examples. In particular, sensitivity sampling is an intensely studied technique which provides provable guarantees on the quality of approximation, while reducing the number of examples to the product of the VC dimension $d$ and the total sensitivity $\mathfrak S$ in remark… ▽ More

    Submitted 3 January, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: To appear in ICML 2023; added discussion of prior work

  25. arXiv:2305.05826  [pdf, ps, other

    cs.DS math.NA

    Universal Matrix Sparsifiers and Fast Deterministic Algorithms for Linear Algebra

    Authors: Rajarshi Bhattacharjee, Gregory Dexter, Cameron Musco, Archan Ray, Sushant Sachdeva, David P Woodruff

    Abstract: Let $\mathbf S \in \mathbb R^{n \times n}$ satisfy $\|\mathbf 1-\mathbf S\|_2\leεn$, where $\mathbf 1$ is the all ones matrix and $\|\cdot\|_2$ is the spectral norm. It is well-known that there exists such an $\mathbf S$ with just $O(n/ε^2)$ non-zero entries: we can let $\mathbf S$ be the scaled adjacency matrix of a Ramanujan expander graph. We show that such an $\mathbf S$ yields a $universal$… ▽ More

    Submitted 12 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 41 pages

    ACM Class: F.2.1; G.1.3; G.1.2; G.4; I.1.2

  26. arXiv:2304.09281  [pdf, ps, other

    cs.DS

    Optimal Eigenvalue Approximation via Sketching

    Authors: William Swartworth, David P. Woodruff

    Abstract: Given a symmetric matrix $A$, we show from the simple sketch $GAG^T$, where $G$ is a Gaussian matrix with $k = O(1/ε^2)$ rows, that there is a procedure for approximating all eigenvalues of $A$ simultaneously to within $ε\|A\|_F$ additive error with large probability. Unlike the work of (Andoni, Nguyen, SODA, 2013), we do not require that $A$ is positive semidefinite and therefore we can recover s… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  27. arXiv:2304.09217  [pdf, ps, other

    cs.DS

    New Subset Selection Algorithms for Low Rank Approximation: Offline and Online

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: Subset selection for the rank $k$ approximation of an $n\times d$ matrix $A$ offers improvements in the interpretability of matrices, as well as a variety of computational savings. This problem is well-understood when the error measure is the Frobenius norm, with various tight algorithms known even in challenging models such as the online model, where an algorithm must select the column subset irr… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: To appear in STOC 2023; abstract shortened

  28. arXiv:2304.07413  [pdf, other

    cs.DS

    Robust Algorithms on Adaptive Inputs from Bounded Adversaries

    Authors: Yeshwanth Cherapanamjeri, Sandeep Silwal, David P. Woodruff, Fred Zhang, Qiuyi Zhang, Samson Zhou

    Abstract: We study dynamic algorithms robust to adaptive input generated from sources with bounded capabilities, such as sparsity or limited interaction. For example, we consider robust linear algebraic algorithms when the updates to the input are sparse but given by an adversary with access to a query oracle. We also study robust algorithms in the standard centralized setting, where an adversary queries an… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  29. arXiv:2304.06853  [pdf, ps, other

    cs.DS

    Pseudorandom Hashing for Space-bounded Computation with Applications in Streaming

    Authors: Praneeth Kacham, Rasmus Pagh, Mikkel Thorup, David P. Woodruff

    Abstract: We revisit Nisan's classical pseudorandom generator (PRG) for space-bounded computation (STOC 1990) and its applications in streaming algorithms. We describe a new generator, HashPRG, that can be thought of as a symmetric version of Nisan's generator over larger alphabets. Our generator allows a trade-off between seed length and the time needed to compute a given block of the generator's output. H… ▽ More

    Submitted 4 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Minor writing improvements

  30. arXiv:2304.02261  [pdf, other

    cs.DS cs.LG stat.ML

    Optimal Sketching Bounds for Sparse Linear Regression

    Authors: Tung Mai, Alexander Munteanu, Cameron Musco, Anup B. Rao, Chris Schwiegelshohn, David P. Woodruff

    Abstract: We study oblivious sketching for $k$-sparse linear regression under various loss functions such as an $\ell_p$ norm, or from a broad class of hinge-like loss functions, which includes the logistic and ReLU losses. We show that for sparse $\ell_2$ norm regression, there is a distribution over oblivious sketches with $Θ(k\log(d/k)/\varepsilon^2)$ rows, which is tight up to a constant factor. This ex… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: AISTATS 2023

  31. arXiv:2303.01709  [pdf, other

    cs.DS cs.LG

    Streaming Algorithms for Learning with Experts: Deterministic Versus Robust

    Authors: David P. Woodruff, Fred Zhang, Samson Zhou

    Abstract: In the online learning with experts problem, an algorithm must make a prediction about an outcome on each of $T$ days (or times), given a set of $n$ experts who make predictions on each day (or time). The algorithm is given feedback on the outcomes of each day, including the cost of its prediction and the cost of the expert predictions, and the goal is to make a prediction with the minimum cost, s… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  32. arXiv:2302.05707  [pdf, ps, other

    cs.CR cs.DS cs.LG

    On Differential Privacy and Adaptive Data Analysis with Bounded Space

    Authors: Itai Dinur, Uri Stemmer, David P. Woodruff, Samson Zhou

    Abstract: We study the space complexity of the two related fields of differential privacy and adaptive data analysis. Specifically, (1) Under standard cryptographic assumptions, we show that there exists a problem P that requires exponentially more space to be solved efficiently with differential privacy, compared to the space needed without privacy. To the best of our knowledge, this is the first separat… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

  33. arXiv:2211.09964  [pdf, ps, other

    cs.DS

    Optimal Algorithms for Linear Algebra in the Current Matrix Multiplication Time

    Authors: Yeshwanth Cherapanamjeri, Sandeep Silwal, David P. Woodruff, Samson Zhou

    Abstract: We study fundamental problems in linear algebra, such as finding a maximal linearly independent subset of rows or columns (a basis), solving linear regression, or computing a subspace embedding. For these problems, we consider input matrices $\mathbf{A}\in\mathbb{R}^{n\times d}$ with $n > d$. The input can be read in $\text{nnz}(\mathbf{A})$ time, which denotes the number of nonzero entries of… ▽ More

    Submitted 19 January, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: SODA 2023

  34. arXiv:2211.07132  [pdf, ps, other

    cs.DS

    The $\ell_p$-Subspace Sketch Problem in Small Dimensions with Applications to Support Vector Machines

    Authors: Yi Li, Honghao Lin, David P. Woodruff

    Abstract: In the $\ell_p$-subspace sketch problem, we are given an $n\times d$ matrix $A$ with $n>d$, and asked to build a small memory data structure $Q(A,ε)$ so that, for any query vector $x\in\mathbb{R}^d$, we can output a number in $(1\pmε)\|Ax\|_p^p$ given only $Q(A,ε)$. This problem is known to require $\tildeΩ(dε^{-2})$ bits of memory for $d=Ω(\log(1/ε))$. However, for $d=o(\log(1/ε))$, no data struc… ▽ More

    Submitted 15 February, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: Corrected the citation for Lemma 3.3 and adjusted the constants in the proof accordingly

  35. arXiv:2211.06790  [pdf, other

    cs.DS

    Near-Linear Sample Complexity for $L_p$ Polynomial Regression

    Authors: Raphael A. Meyer, Cameron Musco, Christopher Musco, David P. Woodruff, Samson Zhou

    Abstract: We study $L_p$ polynomial regression. Given query access to a function $f:[-1,1] \rightarrow \mathbb{R}$, the goal is to find a degree $d$ polynomial $\hat{q}$ such that, for a given parameter $\varepsilon > 0$, $$ \|\hat{q}-f\|_p\le (1+\varepsilon) \cdot \min_{q:\text{deg}(q)\le d}\|q-f\|_p. $$ Here $\|\cdot\|_p$ is the $L_p$ norm, $\|g\|_p = (\int_{-1}^1 |g(t)|^p dt)^{1/p}$. We show that queryin… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: 68 pages, to be presented at SODA 2023

  36. arXiv:2209.15219  [pdf, other

    cs.DS cs.LG

    Optimal Query Complexities for Dynamic Trace Estimation

    Authors: David P. Woodruff, Fred Zhang, Qiuyi Zhang

    Abstract: We consider the problem of minimizing the number of matrix-vector queries needed for accurate trace estimation in the dynamic setting where our underlying matrix is changing slowly, such as during an optimization process. Specifically, for any $m$ matrices $A_1,...,A_m$ with consecutive differences bounded in Schatten-$1$ norm by $α$, we provide a novel binary tree summation procedure that simulta… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 30 pages

  37. Recovery from Non-Decomposable Distance Oracles

    Authors: Zhuangfei Hu, Xinda Li, David P. Woodruff, Hongyang Zhang, Shufan Zhang

    Abstract: A line of work has looked at the problem of recovering an input from distance queries. In this setting, there is an unknown sequence $s \in \{0,1\}^{\leq n}$, and one chooses a set of queries $y \in \{0,1\}^{\mathcal{O}(n)}$ and receives $d(s,y)$ for a distance function $d$. The goal is to make as few queries as possible to recover $s$. Although this problem is well-studied for decomposable distan… ▽ More

    Submitted 13 August, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: This work has been presented at conference The 14th Innovations in Theoretical Computer Science (ITCS 2023) and accepted for publishing in the journal IEEE Transactions on Information Theory

  38. arXiv:2207.08268  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Online Lewis Weight Sampling

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: The seminal work of Cohen and Peng introduced Lewis weight sampling to the theoretical computer science community, yielding fast row sampling algorithms for approximating $d$-dimensional subspaces of $\ell_p$ up to $(1+ε)$ error. Several works have extended this important primitive to other settings, including the online coreset and sliding window models. However, these results are only for… ▽ More

    Submitted 17 December, 2022; v1 submitted 17 July, 2022; originally announced July 2022.

    Comments: To appear in SODA 2023; Removed result on adversarial streaming

  39. arXiv:2207.08075  [pdf, ps, other

    cs.DS

    Streaming Algorithms with Large Approximation Factors

    Authors: Yi Li, Honghao Lin, David P. Woodruff, Yuheng Zhang

    Abstract: We initiate a broad study of classical problems in the streaming model with insertions and deletions in the setting where we allow the approximation factor $α$ to be much larger than $1$. Such algorithms can use significantly less memory than the usual setting for which $α= 1+ε$ for an $ε\in (0,1)$. We study large approximations for a number of problems in sketching and streaming and the following… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

  40. arXiv:2207.07822  [pdf, ps, other

    cs.LG cs.DS

    Adaptive Sketches for Robust Regression with Importance Sampling

    Authors: Sepideh Mahabadi, David P. Woodruff, Samson Zhou

    Abstract: We introduce data structures for solving robust regression through stochastic gradient descent (SGD) by sampling gradients with probability proportional to their norm, i.e., importance sampling. Although SGD is widely used for large scale machine learning, it is well-known for possibly experiencing slow convergence rates due to the high variance from uniform sampling. On the other hand, importance… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: RANDOM 2022

  41. arXiv:2207.07417  [pdf, other

    cs.DS cs.LG

    Near-Linear Time and Fixed-Parameter Tractable Algorithms for Tensor Decompositions

    Authors: Arvind V. Mahankali, David P. Woodruff, Ziyu Zhang

    Abstract: We study low rank approximation of tensors, focusing on the tensor train and Tucker decompositions, as well as approximations with tree tensor networks and more general tensor networks. For tensor train decomposition, we give a bicriteria $(1 + \eps)$-approximation algorithm with a small bicriteria rank and $O(q \cdot \nnz(A))$ running time, up to lower order terms, which improves over the additiv… ▽ More

    Submitted 26 November, 2023; v1 submitted 15 July, 2022; originally announced July 2022.

  42. arXiv:2206.12802  [pdf, other

    cs.LG cs.DS stat.ML

    Bounding the Width of Neural Networks via Coupled Initialization -- A Worst Case Analysis

    Authors: Alexander Munteanu, Simon Omlor, Zhao Song, David P. Woodruff

    Abstract: A common method in training neural networks is to initialize all the weights to be independent Gaussian vectors. We observe that by instead initializing the weights into independent pairs, where each pair consists of two identical Gaussian vectors, we can significantly improve the convergence analysis. While a similar technique has been studied for random inputs [Daniely, NeurIPS 2020], it has not… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  43. arXiv:2206.12110  [pdf, other

    cs.DS

    Learning Augmented Binary Search Trees

    Authors: Honghao Lin, Tian Luo, David P. Woodruff

    Abstract: A treap is a classic randomized binary search tree data structure that is easy to implement and supports O(\log n) expected time access. However, classic treaps do not take advantage of the input distribution or patterns in the input. Given recent advances in algorithms with predictions, we propose pairing treaps with machine advice to form a learning-augmented treap. We are the first to propose a… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

  44. arXiv:2204.09837  [pdf, ps, other

    cs.DS cs.LG

    Memory Bounds for the Experts Problem

    Authors: Vaidehi Srinivas, David P. Woodruff, Ziyu Xu, Samson Zhou

    Abstract: Online learning with expert advice is a fundamental problem of sequential prediction. In this problem, the algorithm has access to a set of $n$ "experts" who make predictions on each day. The goal on each day is to process these predictions, and make a prediction with the minimum cost. After making a prediction, the algorithm sees the actual outcome on that day, updates its state, and then moves o… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 32 pages, 1 figure, to appear in the 54th ACM Symposium on Theory of Computing (STOC 2022)

  45. arXiv:2204.09136  [pdf, other

    cs.DS

    The White-Box Adversarial Data Stream Model

    Authors: Miklos Ajtai, Vladimir Braverman, T. S. Jayram, Sandeep Silwal, Alec Sun, David P. Woodruff, Samson Zhou

    Abstract: We study streaming algorithms in the white-box adversarial model, where the stream is chosen adaptively by an adversary who observes the entire internal state of the algorithm at each time step. We show that nontrivial algorithms are still possible. We first give a randomized algorithm for the $L_1$-heavy hitters problem that outperforms the optimal deterministic Misra-Gries algorithm on long stre… ▽ More

    Submitted 23 July, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: PODS 2022

  46. arXiv:2204.06653  [pdf, other

    cs.DS cs.LG

    Sketching Algorithms and Lower Bounds for Ridge Regression

    Authors: Praneeth Kacham, David P. Woodruff

    Abstract: We give a sketching-based iterative algorithm that computes a $1+\varepsilon$ approximate solution for the ridge regression problem $\min_x \|Ax-b\|_2^2 +λ\|x\|_2^2$ where $A \in R^{n \times d}$ with $d \ge n$. Our algorithm, for a constant number of iterations (requiring a constant number of passes over the input), improves upon earlier work (Chowdhury et al.) by requiring that the sketching matr… ▽ More

    Submitted 16 June, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: To appear at ICML 2022

  47. arXiv:2204.03790  [pdf, ps, other

    cs.DS cs.CG math.FA

    High-Dimensional Geometric Streaming in Polynomial Space

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: Many existing algorithms for streaming geometric data analysis have been plagued by exponential dependencies in the space complexity, which are undesirable for processing high-dimensional data sets. In particular, once $d\geq\log n$, there are no known non-trivial streaming algorithms for problems such as maintaining convex hulls and Löwner-John ellipsoids of $n$ points, despite a long line of wor… ▽ More

    Submitted 26 September, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Abstract shortened to meet arXiv limits; v2 fix statements concerning online condition number; v3 to appear in FOCS 2022; v4 minor fixes

  48. arXiv:2204.03782  [pdf, ps, other

    cs.DS math.NA

    Testing Positive Semidefiniteness Using Linear Measurements

    Authors: Deanna Needell, William Swartworth, David P. Woodruff

    Abstract: We study the problem of testing whether a symmetric $d \times d$ input matrix $A$ is symmetric positive semidefinite (PSD), or is $ε$-far from the PSD cone, meaning that $λ_{\min}(A) \leq - ε\|A\|_p$, where $\|A\|_p$ is the Schatten-$p$ norm of $A$. In applications one often needs to quickly tell if an input matrix is PSD, and a small distance from the PSD cone may be tolerable. We consider two we… ▽ More

    Submitted 25 October, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    ACM Class: F.2.1

  49. arXiv:2203.09572  [pdf, other

    cs.DS cs.LG

    Triangle and Four Cycle Counting with Predictions in Graph Streams

    Authors: Justin Y. Chen, Talya Eden, Piotr Indyk, Honghao Lin, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner, David P. Woodruff, Michael Zhang

    Abstract: We propose data-driven one-pass streaming algorithms for estimating the number of triangles and four cycles, two fundamental problems in graph analytics that are widely studied in the graph data stream literature. Recently, (Hsu 2018) and (Jiang 2020) applied machine learning techniques in other data stream problems, using a trained oracle that can predict certain properties of the stream elements… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: To be presented at ICLR 2022

  50. arXiv:2203.07557  [pdf, other

    cs.DS

    Fast Regression for Structured Inputs

    Authors: Raphael A. Meyer, Cameron Musco, Christopher Musco, David P. Woodruff, Samson Zhou

    Abstract: We study the $\ell_p$ regression problem, which requires finding $\mathbf{x}\in\mathbb R^{d}$ that minimizes $\|\mathbf{A}\mathbf{x}-\mathbf{b}\|_p$ for a matrix $\mathbf{A}\in\mathbb R^{n \times d}$ and response vector $\mathbf{b}\in\mathbb R^{n}$. There has been recent interest in developing subsampling methods for this problem that can outperform standard techniques when $n$ is very large. Howe… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.