Showing 1–13 of 13 results for author: Miconi, T

  1. arXiv:2305.11252  [pdf, other]

    cs.NE cs.AI cs.LG q-bio.NC

    Brain-inspired learning in artificial neural networks: a review

    Authors: Samuel Schmidgall, Jascha Achterberg, Thomas Miconi, Louis Kirsch, Rojin Ziaei, S. Pardis Hajiseyedrazi, Jason Eshraghian

    Abstract: Artificial neural networks (ANNs) have emerged as an essential tool in machine learning, achieving remarkable success across diverse domains, including image and speech generation, game playing, and robotics. However, there exist fundamental differences between ANNs' operating mechanisms and those of the biological brain, particularly concerning learning processes. This paper presents a comprehens…

    Submitted 18 May, 2023; originally announced May 2023.

  2. arXiv:2302.05583  [pdf, other]

    cs.LG cs.AI

    Procedural generation of meta-reinforcement learning tasks

    Authors: Thomas Miconi

    Abstract: Open-endedness stands to benefit from the ability to generate an infinite variety of diverse, challenging environments. One particularly interesting type of challenge is meta-learning ("learning-to-learn"), a hallmark of intelligent behavior. However, the number of meta-learning environments in the literature is limited. Here we describe a parametrized space for simple meta-reinforcement learning…

    Submitted 8 December, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: Agent Learning in Open-Endedness (ALOE) Workshop at NeurIPS 2023

  3. arXiv:2112.08588  [pdf, other]

    cs.NE cs.AI

    Learning to acquire novel cognitive tasks with evolution, plasticity and meta-meta-learning

    Authors: Thomas Miconi

    Abstract: A hallmark of intelligence is the ability to autonomously learn new flexible, cognitive behaviors - that is, behaviors where the appropriate action depends not just on immediate stimuli (as in simple reflexive stimulus-response associations), but on contextual information that must be adequately acquired, stored and processed. While many meta-learning algorithms can design agents that autonomously…

    Submitted 26 May, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

    Journal ref: 40th International Conference on Machine Learning (ICML 2023)

  4. arXiv:2107.01729  [pdf, other]

    cs.NE

    Hebbian learning with gradients: Hebbian convolutional neural networks with modern deep learning frameworks

    Authors: Thomas Miconi

    Abstract: Deep learning networks generally use non-biological learning methods. By contrast, networks based on more biologically plausible learning, such as Hebbian learning, show comparatively poor performance and difficulties of implementation. Here we show that Hebbian learning in hierarchical, convolutional neural networks can be implemented almost trivially with modern deep learning frameworks, by usin…

    Submitted 1 November, 2021; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: All code available at https://github.com/ThomasMiconi/HebbianCNNPyTorch

  5. arXiv:2006.16558  [pdf, other]

    cs.LG cs.NE stat.ML

    Enabling Continual Learning with Differentiable Hebbian Plasticity

    Authors: Vithursan Thangarasa, Thomas Miconi, Graham W. Taylor

    Abstract: Continual learning is the problem of sequentially learning new tasks or knowledge while protecting previously acquired knowledge. However, catastrophic forgetting poses a grand challenge for neural networks performing such learning process. Thus, neural networks that are deployed in the real world often struggle in scenarios where the data distribution is non-stationary (concept drift), imbalanced…

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Published as a conference paper at IJCNN 2020

  6. arXiv:2002.10585  [pdf, other]

    cs.NE

    Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity

    Authors: Thomas Miconi, Aditya Rawal, Jeff Clune, Kenneth O. Stanley

    Abstract: The impressive lifelong learning in animal brains is primarily enabled by plastic changes in synaptic connectivity. Importantly, these changes are not passive, but are actively controlled by neuromodulation, which is itself under the control of the brain. The resulting self-modifying abilities of the brain play an important role in learning and adaptation, and are a major basis for biological rein…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Presented at the 7th International Conference on Learning Representations (ICLR 2019)

    Journal ref: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019

  7. arXiv:2002.09571  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Learning to Continually Learn

    Authors: Shawn Beaulieu, Lapo Frati, Thomas Miconi, Joel Lehman, Kenneth O. Stanley, Jeff Clune, Nick Cheney

    Abstract: Continual lifelong learning requires an agent or model to learn many sequentially ordered tasks, building on previous knowledge without catastrophically forgetting it. Much work has gone towards preventing the default tendency of machine learning models to catastrophically forget, yet virtually all such work involves manually-designed solutions to the problem. We instead advocate meta-learning a s…

    Submitted 3 March, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

  8. arXiv:2002.09505  [pdf, other]

    cs.LG cs.AI stat.ML

    Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

    Authors: Ashley D. Edwards, Himanshu Sahni, Rosanne Liu, Jane Hung, Ankit Jain, Rui Wang, Adrien Ecoffet, Thomas Miconi, Charles Isbell, Jason Yosinski

    Abstract: In this paper, we introduce a novel form of value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter. In order to derive an optimal policy, we develop a forward dynamics model that learns to make next-state predictions that maximize this value. This formulation decouples actions from values while still…

    Submitted 25 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: Accepted into ICML 2020

  9. arXiv:1910.08461  [pdf, other]

    cs.LG stat.ML

    First-Order Preconditioning via Hypergradient Descent

    Authors: Ted Moskovitz, Rui Wang, Janice Lan, Sanyam Kapoor, Thomas Miconi, Jason Yosinski, Aditya Rawal

    Abstract: Standard gradient descent methods are susceptible to a range of issues that can impede training, such as high correlations and different scaling in parameter space. These difficulties can be addressed by second-order approaches that apply a pre-conditioning matrix to the gradient to improve convergence. Unfortunately, such algorithms typically struggle to scale to high-dimensional problems, in part…

    Submitted 27 April, 2020; v1 submitted 18 October, 2019; originally announced October 2019.

  10. arXiv:1804.02464  [pdf, other]

    cs.NE cs.LG stat.ML

    Differentiable plasticity: training plastic neural networks with backpropagation

    Authors: Thomas Miconi, Jeff Clune, Kenneth O. Stanley

    Abstract: How can we build agents that keep learning from experience, quickly and efficiently, after their initial training? Here we take inspiration from the main mechanism of learning in biological brains: synaptic plasticity, carefully tuned by evolution to produce efficient lifelong learning. We show that plasticity, just like connection weights, can be optimized by gradient descent in large (millions o…

    Submitted 31 July, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: Presented at ICML 2018

    Journal ref: Proceedings of the 35th International Conference on Machine Learning (ICML2018), Stockholm, Sweden, PMLR 80, 2018

  11. arXiv:1707.01195  [pdf, ps, other]

    stat.AP cs.AI stat.ML

    The impossibility of "fairness": a generalized impossibility result for decisions

    Authors: Thomas Miconi

    Abstract: Various measures can be used to estimate bias or unfairness in a predictor. Previous work has already established that some of these measures are incompatible with each other. Here we show that, when groups differ in prevalence of the predicted event, several intuitive, reasonable measures of fairness (probability of positive prediction given occurrence or non-occurrence; probability of occurrence…

    Submitted 11 September, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

  12. arXiv:1609.02228  [pdf, other]

    cs.NE cs.AI cs.LG q-bio.NC

    Learning to learn with backpropagation of Hebbian plasticity

    Authors: Thomas Miconi

    Abstract: Hebbian plasticity is a powerful principle that allows biological brains to learn from their lifetime experience. By contrast, artificial neural networks trained with backpropagation generally have fixed connection weights that do not change once training is complete. While recent methods can endow neural networks with long-term memories, Hebbian plasticity is currently not amenable to gradient de…

    Submitted 19 October, 2016; v1 submitted 7 September, 2016; originally announced September 2016.

    ACM Class: I.2.6; I.5

  13. arXiv:1606.06216  [pdf, other]

    cs.NE

    Neural networks with differentiable structure

    Authors: Thomas Miconi

    Abstract: While gradient descent has proven highly successful in learning connection weights for neural networks, the actual structure of these networks is usually determined by hand, or by other optimization algorithms. Here we describe a simple method to make network structure differentiable, and therefore accessible to gradient descent. We test this method on recurrent neural networks applied to simple s…

    Submitted 6 August, 2016; v1 submitted 20 June, 2016; originally announced June 2016.