Skip to main content

Showing 1–13 of 13 results for author: Kiani, B T

  1. arXiv:2406.01461  [pdf, other

    cs.LG math.DG stat.ML

    Hardness of Learning Neural Networks under the Manifold Hypothesis

    Authors: Bobak T. Kiani, Jason Wang, Melanie Weber

    Abstract: The manifold hypothesis presumes that high-dimensional data lies on or near a low-dimensional manifold. While the utility of encoding geometric structure has been demonstrated empirically, rigorous analysis of its impact on the learnability of neural networks is largely missing. Several recent results have established hardness results for learning feedforward and equivariant neural networks under… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2401.01869  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    On the hardness of learning under symmetries

    Authors: Bobak T. Kiani, Thien Le, Hannah Lawrence, Stefanie Jegelka, Melanie Weber

    Abstract: We study the problem of learning equivariant neural networks via gradient descent. The incorporation of known symmetries ("equivariance") into neural nets has empirically improved the performance of learning pipelines, in domains ranging from biology to computer vision. However, a rich yet separate line of learning theoretic research has demonstrated that actually learning shallow, fully-connected… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 52 pages, 4 figures

  3. arXiv:2307.05432  [pdf, other

    cs.LG math.NA

    Self-Supervised Learning with Lie Symmetries for Partial Differential Equations

    Authors: Grégoire Mialon, Quentin Garrido, Hannah Lawrence, Danyal Rehman, Yann LeCun, Bobak T. Kiani

    Abstract: Machine learning for differential equations paves the way for computationally efficient alternatives to numerical solvers, with potentially broad impacts in science and engineering. Though current algorithms typically require simulated training data tailored to a given setting, one may instead wish to learn useful information from heterogeneous sources, or from real dynamical systems observations… ▽ More

    Submitted 14 February, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  4. arXiv:2302.11556  [pdf, other

    cs.LG cs.AI

    Equivariant Polynomials for Graph Neural Networks

    Authors: Omri Puny, Derek Lim, Bobak T. Kiani, Haggai Maron, Yaron Lipman

    Abstract: Graph Neural Networks (GNN) are inherently limited in their expressive power. Recent seminal works (Xu et al., 2019; Morris et al., 2019b) introduced the Weisfeiler-Lehman (WL) hierarchy as a measure of expressive power. Although this hierarchy has propelled significant advances in GNN analysis and architecture developments, it suffers from several significant limitations. These include a complex… ▽ More

    Submitted 4 June, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

  5. arXiv:2302.02774  [pdf, other

    stat.ML cs.AI cs.LG math.ST

    The SSL Interplay: Augmentations, Inductive Bias, and Generalization

    Authors: Vivien Cabannes, Bobak T. Kiani, Randall Balestriero, Yann LeCun, Alberto Bietti

    Abstract: Self-supervised learning (SSL) has emerged as a powerful framework to learn representations from raw data without supervision. Yet in practice, engineers face issues such as instability in tuning optimizers and collapse of representations during training. Such challenges motivate the need for a theory to shed light on the complex interplay between the choice of data augmentation, network architect… ▽ More

    Submitted 1 June, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    MSC Class: 68Q32 ACM Class: G.3

    Journal ref: Proceedings of the 40 th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023

  6. arXiv:2209.14884  [pdf, other

    cs.LG cs.AI stat.ML

    Joint Embedding Self-Supervised Learning in the Kernel Regime

    Authors: Bobak T. Kiani, Randall Balestriero, Yubei Chen, Seth Lloyd, Yann LeCun

    Abstract: The fundamental goal of self-supervised learning (SSL) is to produce useful representations of data without access to any labels for classifying the data. Modern methods in SSL, which form representations based on known or constructed relationships between samples, have been particularly effective at this task. Here, we aim to extend this framework to incorporate algorithms based on kernel methods… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  7. arXiv:2110.06084  [pdf, other

    cs.LG cs.AI

    Implicit Bias of Linear Equivariant Networks

    Authors: Hannah Lawrence, Kristian Georgiev, Andrew Dienes, Bobak T. Kiani

    Abstract: Group equivariant convolutional neural networks (G-CNNs) are generalizations of convolutional neural networks (CNNs) which excel in a wide range of technical applications by explicitly encoding symmetries, such as rotations and permutations, in their architectures. Although the success of G-CNNs is driven by their \emph{explicit} symmetry bias, a recent line of work has proposed that the \emph{imp… ▽ More

    Submitted 12 September, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: 21 pages, 19 figures

    Journal ref: ICML 2022: 12096-12125

  8. arXiv:2109.11330  [pdf, other

    quant-ph cs.DS cs.LG math-ph

    Quantum algorithms for group convolution, cross-correlation, and equivariant transformations

    Authors: Grecia Castelazo, Quynh T. Nguyen, Giacomo De Palma, Dirk Englund, Seth Lloyd, Bobak T. Kiani

    Abstract: Group convolutions and cross-correlations, which are equivariant to the actions of group elements, are commonly used in mathematics to analyze or take advantage of symmetries inherent in a given problem setting. Here, we provide efficient quantum algorithms for performing linear group convolutions and cross-correlations on data stored as quantum states. Runtimes for our algorithms are logarithmic… ▽ More

    Submitted 6 September, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

    Journal ref: Phys. Rev. A, 106, 032402 (2022)

  9. arXiv:2101.03037  [pdf, other

    quant-ph cs.AI cs.LG stat.ML

    Learning quantum data with the quantum Earth Mover's distance

    Authors: Bobak Toussi Kiani, Giacomo De Palma, Milad Marvian, Zi-Wen Liu, Seth Lloyd

    Abstract: Quantifying how far the output of a learning algorithm is from its target is an essential task in machine learning. However, in quantum settings, the loss landscapes of commonly used distance metrics often produce undesirable outcomes such as poor local minima and exponentially decaying gradients. To overcome these obstacles, we consider here the recently proposed quantum earth mover's (EM) or Was… ▽ More

    Submitted 16 May, 2022; v1 submitted 8 January, 2021; originally announced January 2021.

    Journal ref: Quantum Science and Technology 7(4), 045002 (2022)

  10. arXiv:2010.15776  [pdf, other

    quant-ph cs.DS math-ph math.NA

    Quantum advantage for differential equation analysis

    Authors: Bobak T. Kiani, Giacomo De Palma, Dirk Englund, William Kaminsky, Milad Marvian, Seth Lloyd

    Abstract: Quantum algorithms for both differential equation solving and for machine learning potentially offer an exponential speedup over all known classical algorithms. However, there also exist obstacles to obtaining this potential speedup in useful problem instances. The essential obstacle for quantum differential equation solving is that outputting useful information may require difficult post-processi… ▽ More

    Submitted 26 April, 2022; v1 submitted 29 October, 2020; originally announced October 2020.

    Journal ref: Physical Review A 105, 022415 (2022)

  11. arXiv:2004.05923  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math-ph quant-ph

    Adversarial Robustness Guarantees for Random Deep Neural Networks

    Authors: Giacomo De Palma, Bobak T. Kiani, Seth Lloyd

    Abstract: The reliability of deep learning algorithms is fundamentally challenged by the existence of adversarial examples, which are incorrectly classified inputs that are extremely close to a correctly classified input. We explore the properties of adversarial examples for deep neural networks with random weights and biases, and prove that for any $p\ge1$, the $\ell^p$ distance of any given input from the… ▽ More

    Submitted 22 July, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:2522-2534, 2021

  12. arXiv:2001.11897  [pdf, other

    quant-ph cs.LG math-ph

    Learning Unitaries by Gradient Descent

    Authors: Bobak Toussi Kiani, Seth Lloyd, Reevu Maity

    Abstract: We study the hardness of learning unitary transformations in $U(d)$ via gradient descent on time parameters of alternating operator sequences. We provide numerical evidence that, despite the non-convex nature of the loss landscape, gradient descent always converges to the target unitary when the sequence contains $d^2$ or more parameters. Rates of convergence indicate a "computational phase transi… ▽ More

    Submitted 18 February, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

  13. arXiv:1812.10156  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math-ph quant-ph

    Random deep neural networks are biased towards simple functions

    Authors: Giacomo De Palma, Bobak Toussi Kiani, Seth Lloyd

    Abstract: We prove that the binary classifiers of bit strings generated by random wide deep neural networks with ReLU activation function are biased towards simple functions. The simplicity is captured by the following two properties. For any given input bit string, the average Hamming distance of the closest input bit string with a different classification is at least sqrt(n / (2π log n)), where n is the l… ▽ More

    Submitted 23 October, 2019; v1 submitted 25 December, 2018; originally announced December 2018.

    Journal ref: Advances in Neural Information Processing Systems 32, 1962-1974 (2019)