Skip to main content

Showing 1–10 of 10 results for author: Boopathy, A

  1. arXiv:2406.15941  [pdf, other

    cs.LG stat.ML

    Towards Exact Computation of Inductive Bias

    Authors: Akhilan Boopathy, William Yue, Jaedong Hwang, Abhiram Iyer, Ila Fiete

    Abstract: Much research in machine learning involves finding appropriate inductive biases (e.g. convolutional neural networks, momentum-based optimizers, transformers) to promote generalization on tasks. However, quantification of the amount of inductive bias associated with these architectures and hyperparameters has been limited. We propose a novel method for efficiently computing the inductive bias requi… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Published at IJCAI 2024

  2. arXiv:2404.13698  [pdf, other

    cs.RO cs.LG stat.ML

    Resampling-free Particle Filters in High-dimensions

    Authors: Akhilan Boopathy, Aneesh Muppidi, Peggy Yang, Abhiram Iyer, William Yue, Ila Fiete

    Abstract: State estimation is crucial for the performance and safety of numerous robotic applications. Among the suite of estimation techniques, particle filters have been identified as a powerful solution due to their non-parametric nature. Yet, in high-dimensional state spaces, these filters face challenges such as 'particle deprivation' which hinders accurate representation of the true posterior distribu… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Published at ICRA 2024, 7 pages, 5 figures

  3. arXiv:2310.17537  [pdf, other

    cs.AI cs.LG

    Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity

    Authors: Jaedong Hwang, Zhang-Wei Hong, Eric Chen, Akhilan Boopathy, Pulkit Agrawal, Ila Fiete

    Abstract: Deep reinforcement learning methods exhibit impressive performance on a range of tasks but still struggle on hard exploration tasks in large environments with sparse rewards. To address this, intrinsic rewards can be generated using forward model prediction errors that decrease as the environment becomes known, and incentivize an agent to explore novel states. While prediction-based intrinsic rewa… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Workshop - Intrinsically Motivated Open-ended Learning

  4. arXiv:2307.05793  [pdf, other

    cs.AI cs.RO

    Grid Cell-Inspired Fragmentation and Recall for Efficient Map Building

    Authors: Jaedong Hwang, Zhang-Wei Hong, Eric Chen, Akhilan Boopathy, Pulkit Agrawal, Ila Fiete

    Abstract: Animals and robots navigate through environments by building and refining maps of space. These maps enable functions including navigation back to home, planning, search and foraging. Here, we use observations from neuroscience, specifically the observed fragmentation of grid cell map in compartmentalized spaces, to propose and apply the concept of Fragmentation-and-Recall (FARMap) in the mapping o… ▽ More

    Submitted 8 July, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: TMLR (Featured Certification)

  5. arXiv:2305.01034  [pdf, other

    cs.LG cs.AI stat.ML

    Model-agnostic Measure of Generalization Difficulty

    Authors: Akhilan Boopathy, Kevin Liu, Jaedong Hwang, Shu Ge, Asaad Mohammedsaleh, Ila Fiete

    Abstract: The measure of a machine learning algorithm is the difficulty of the tasks it can perform, and sufficiently difficult tasks are critical drivers of strong machine learning models. However, quantifying the generalization difficulty of machine learning benchmarks has remained challenging. We propose what is to our knowledge the first model-agnostic measure of the inherent generalization difficulty o… ▽ More

    Submitted 2 June, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Published at ICML 2023, 28 pages, 6 figures

  6. arXiv:2303.14151  [pdf, other

    cs.LG stat.ML

    Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

    Authors: Rylan Schaeffer, Mikail Khona, Zachary Robertson, Akhilan Boopathy, Kateryna Pistunova, Jason W. Rocks, Ila Rani Fiete, Oluwasanmi Koyejo

    Abstract: Double descent is a surprising phenomenon in machine learning, in which as the number of model parameters grows relative to the number of data, test error drops as models grow ever larger into the highly overparameterized (data undersampled) regime. This drop in test error flies against classical learning theory on overfitting and has arguably underpinned the success of large models in machine lea… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  7. arXiv:2106.08453  [pdf, other

    cs.LG stat.ML

    How to Train Your Wide Neural Network Without Backprop: An Input-Weight Alignment Perspective

    Authors: Akhilan Boopathy, Ila Fiete

    Abstract: Recent works have examined theoretical and empirical properties of wide neural networks trained in the Neural Tangent Kernel (NTK) regime. Given that biological neural networks are much wider than their artificial counterparts, we consider NTK regime wide neural networks as a possible model of biological neural networks. Leveraging NTK theory, we show theoretically that gradient descent drives lay… ▽ More

    Submitted 13 July, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Published at ICML 2022, 28 pages, 9 figures

  8. arXiv:2102.01208  [pdf, ps, other

    cs.LG stat.ML

    Fast Training of Provably Robust Neural Networks by SingleProp

    Authors: Akhilan Boopathy, Tsui-Wei Weng, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, Luca Daniel

    Abstract: Recent works have developed several methods of defending neural networks against adversarial attacks with certified guarantees. However, these techniques can be computationally costly due to the use of certification during training. We develop a new regularizer that is both more efficient than existing certified defenses, requiring only one additional forward propagation through a network, and can… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Published at AAAI 2021

  9. arXiv:2006.14748  [pdf, other

    cs.LG stat.ML

    Proper Network Interpretability Helps Adversarial Robustness in Classification

    Authors: Akhilan Boopathy, Sijia Liu, Gaoyuan Zhang, Cynthia Liu, Pin-Yu Chen, Shiyu Chang, Luca Daniel

    Abstract: Recent works have empirically shown that there exist adversarial examples that can be hidden from neural network interpretability (namely, making network interpretation maps visually similar), or interpretability is itself susceptible to adversarial attacks. In this paper, we theoretically show that with a proper measurement of interpretation, it is actually difficult to prevent prediction-evasion… ▽ More

    Submitted 21 October, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 22 pages, 9 figures, Published at ICML 2020

  10. arXiv:1811.12395  [pdf, other

    stat.ML cs.CR cs.LG

    CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks

    Authors: Akhilan Boopathy, Tsui-Wei Weng, Pin-Yu Chen, Sijia Liu, Luca Daniel

    Abstract: Verifying robustness of neural network classifiers has attracted great interests and attention due to the success of deep neural networks and their unexpected vulnerability to adversarial perturbations. Although finding minimum adversarial distortion of neural networks (with ReLU activations) has been shown to be an NP-complete problem, obtaining a non-trivial lower bound of minimum distortion as… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: Accepted by AAAI 2019