Skip to main content

Showing 1–38 of 38 results for author: Lehman, J

  1. arXiv:2404.16244  [pdf, other

    cs.CY

    The Ethics of Advanced AI Assistants

    Authors: Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks, Verena Rieser, Hasan Iqbal, Nenad Tomašev, Ira Ktena, Zachary Kenton, Mikel Rodriguez, Seliem El-Sayed, Sasha Brown, Canfer Akbulut, Andrew Trask, Edward Hughes, A. Stevie Bergman, Renee Shelby, Nahema Marchal, Conor Griffin, Juan Mateos-Garcia, Laura Weidinger, Winnie Street, Benjamin Lange, Alex Ingerman, Alison Lentz , et al. (32 additional authors not shown)

    Abstract: This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, pro… ▽ More

    Submitted 28 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  2. arXiv:2310.13032  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Quality-Diversity through AI Feedback

    Authors: Herbie Bradley, Andrew Dai, Hannah Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Grégory Schott, Joel Lehman

    Abstract: In many text-generation problems, users may prefer not only a single response, but a diverse range of high-quality outputs from which to choose. Quality-diversity (QD) search algorithms aim at such outcomes, by continually improving and diversifying a population of candidates. However, the applicability of QD to qualitative domains, like creative writing, has been limited by the difficulty of algo… ▽ More

    Submitted 7 December, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: minor additions to supplementary results

  3. arXiv:2310.12103  [pdf, other

    cs.AI cs.NE

    Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization

    Authors: Li Ding, Jenny Zhang, Jeff Clune, Lee Spector, Joel Lehman

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has shown potential in qualitative tasks where easily defined performance measures are lacking. However, there are drawbacks when RLHF is commonly used to optimize for average human preferences, especially in generative tasks that demand diverse model responses. Meanwhile, Quality Diversity (QD) algorithms excel at identifying diverse and high-qual… ▽ More

    Submitted 4 June, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  4. arXiv:2306.01711  [pdf, other

    cs.AI cs.LG

    OMNI: Open-endedness via Models of human Notions of Interestingness

    Authors: Jenny Zhang, Joel Lehman, Kenneth Stanley, Jeff Clune

    Abstract: Open-ended algorithms aim to learn new, interesting behaviors forever. That requires a vast environment search space, but there are thus infinitely many possible tasks. Even after filtering for tasks the current agent can learn (i.e., learning progress), countless learnable yet uninteresting tasks remain (e.g., minor variations of previously learned tasks). An Achilles Heel of open-endedness resea… ▽ More

    Submitted 14 February, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 47 pages, 33 figures

  5. arXiv:2302.12170  [pdf, other

    cs.NE

    Language Model Crossover: Variation through Few-Shot Prompting

    Authors: Elliot Meyerson, Mark J. Nelson, Herbie Bradley, Adam Gaier, Arash Moradi, Amy K. Hoover, Joel Lehman

    Abstract: This paper pursues the insight that language models naturally enable an intelligent variation operator similar in spirit to evolutionary crossover. In particular, language models of sufficient scale demonstrate in-context learning, i.e. they can learn from associations between a small number of input patterns to generate outputs incorporating such associations (also called few-shot prompting). Thi… ▽ More

    Submitted 13 May, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

  6. arXiv:2302.09248  [pdf, other

    cs.AI cs.CY cs.LG cs.NE

    Machine Love

    Authors: Joel Lehman

    Abstract: While ML generates much economic value, many of us have problematic relationships with social media and other ML-powered applications. One reason is that ML often optimizes for what we want in the moment, which is easy to quantify but at odds with what is known scientifically about human flourishing. Thus, through its impoverished models of us, ML currently falls far short of its exciting potentia… ▽ More

    Submitted 22 February, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

  7. arXiv:2211.10551  [pdf, other

    cs.CV

    A Practical Stereo Depth System for Smart Glasses

    Authors: Jialiang Wang, Daniel Scharstein, Akash Bapat, Kevin Blackburn-Matzen, Matthew Yu, Jonathan Lehman, Suhib Alsisan, Yanghan Wang, Sam Tsai, Jan-Michael Frahm, Zijian He, Peter Vajda, Michael F. Cohen, Matt Uyttendaele

    Abstract: We present the design of a productionized end-to-end stereo depth sensing system that does pre-processing, online stereo rectification, and stereo depth estimation with a fallback to monocular depth estimation when rectification is unreliable. The output of our depth sensing system is then used in a novel view generation pipeline to create 3D computational photography effects using point-of-view i… ▽ More

    Submitted 31 March, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: Accepted at CVPR2023

  8. arXiv:2208.03569  [pdf, other

    eess.IV cs.CV cs.LG

    Constrained self-supervised method with temporal ensembling for fiber bundle detection on anatomic tracing data

    Authors: Vaanathi Sundaresan, Julia F. Lehman, Sean Fitzgibbon, Saad Jbabdi, Suzanne N. Haber, Anastasia Yendiki

    Abstract: Anatomic tracing data provides detailed information on brain circuitry essential for addressing some of the common errors in diffusion MRI tractography. However, automated detection of fiber bundles on tracing data is challenging due to sectioning distortions, presence of noise and artifacts and intensity/contrast variations. In this work, we propose a deep learning method with a self-supervised l… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: Accepted in 1st International Workshop on Medical Optical Imaging and Virtual Microscopy Image Analysis (MOVI 2022)

  9. arXiv:2206.08896  [pdf, other

    cs.NE

    Evolution through Large Models

    Authors: Joel Lehman, Jonathan Gordon, Shawn Jain, Kamal Ndousse, Cathy Yeh, Kenneth O. Stanley

    Abstract: This paper pursues the insight that large language models (LLMs) trained to generate code can vastly improve the effectiveness of mutation operators applied to programs in genetic programming (GP). Because such LLMs benefit from training data that includes sequential changes and modifications, they can approximate likely changes that humans would make. To highlight the breadth of implications of s… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  10. arXiv:2111.01340  [pdf, other

    cs.CL

    Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

    Authors: Aakanksha Naik, Jill Lehman, Carolyn Rose

    Abstract: Natural language understanding (NLU) has made massive progress driven by large benchmarks, but benchmarks often leave a long tail of infrequent phenomena underrepresented. We reflect on the question: have transfer learning methods sufficiently addressed the poor performance of benchmark-trained models on the long tail? We conceptualize the long tail using macro-level dimensions (e.g., underreprese… ▽ More

    Submitted 3 June, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: To appear in TACL 2022. This is a pre-MIT Press publication version

  11. arXiv:2106.06555  [pdf, other

    cs.LG

    Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network

    Authors: Justin Lovelace, Denis Newman-Griffis, Shikhar Vashishth, Jill Fain Lehman, Carolyn Penstein Rosé

    Abstract: Knowledge Graph (KG) completion research usually focuses on densely connected benchmark datasets that are not representative of real KGs. We curate two KG datasets that include biomedical and encyclopedic knowledge and use an existing commonsense KG dataset to explore KG completion in the more realistic setting where dense connectivity is not guaranteed. We develop a deep convolutional network tha… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

  12. arXiv:2104.07874  [pdf, other

    cs.CL cs.AI

    Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research

    Authors: Denis Newman-Griffis, Jill Fain Lehman, Carolyn Rosé, Harry Hochheiser

    Abstract: Natural language processing (NLP) research combines the study of universal principles, through basic science, with applied science targeting specific use cases and settings. However, the process of exchange between basic NLP and applications is often assumed to emerge naturally, resulting in many innovations going unapplied and many important questions left unstudied. We describe a new paradigm of… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to NAACL-HLT 2021

  13. arXiv:2010.02246  [pdf, other

    cs.CL cs.LG

    MedFilter: Improving Extraction of Task-relevant Utterances from Doctor-Patient Conversations through Integration of Discourse Structure and Ontological Knowledge

    Authors: Sopan Khosla, Shikhar Vashishth, Jill Fain Lehman, Carolyn Rose

    Abstract: Information extraction from conversational data is particularly challenging because the task-centric nature of conversation allows for effective communication of implicit information by humans, but is challenging for machines. The challenges may differ between utterances depending on the role of the speaker within the conversation, especially when relevant expertise is distributed asymmetrically a… ▽ More

    Submitted 21 June, 2022; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted as Long Paper to EMNLP 2020

  14. arXiv:2008.09266  [pdf, other

    cs.CL

    Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

    Authors: Aakanksha Naik, Jill Lehman, Carolyn Rose

    Abstract: We tackle the task of adapting event extractors to new domains without labeled data, by aligning the marginal distributions of source and target domains. As a testbed, we create two new event extraction datasets using English texts from two medical domains: (i) clinical notes, and (ii) doctor-patient conversations. We test the efficacy of three marginal alignment techniques: (i) adversarial domain… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

  15. arXiv:2007.10546  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Ideas for Improving the Field of Machine Learning: Summarizing Discussion from the NeurIPS 2019 Retrospectives Workshop

    Authors: Shagun Sodhani, Mayoore S. Jaiswal, Lauren Baker, Koustuv Sinha, Carl Shneider, Peter Henderson, Joel Lehman, Ryan Lowe

    Abstract: This report documents ideas for improving the field of machine learning, which arose from discussions at the ML Retrospectives workshop at NeurIPS 2019. The goal of the report is to disseminate these ideas more broadly, and in turn encourage continuing discussion about how the field could improve along these axes. We focus on topics that were most discussed at the workshop: incentives for encourag… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  16. arXiv:2006.07495  [pdf, other

    cs.NE

    Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity

    Authors: Adrien Ecoffet, Jeff Clune, Joel Lehman

    Abstract: Artificial life originated and has long studied the topic of open-ended evolution, which seeks the principles underlying artificial systems that innovate continually, inspired by biological evolution. Recently, interest has grown within the broader field of AI in a generalization of open-ended evolution, here called open-ended search, wherein such questions of open-endedness are explored for advan… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  17. arXiv:2006.04734  [pdf, other

    cs.AI

    Reinforcement Learning Under Moral Uncertainty

    Authors: Adrien Ecoffet, Joel Lehman

    Abstract: An ambitious goal for machine learning is to create agents that behave ethically: The capacity to abide by human moral norms would greatly expand the context in which autonomous agents could be practically and safely deployed, e.g. fully autonomous vehicles will encounter charged moral decisions that complicate their deployment. While ethical agents could be trained by rewarding correct behavior u… ▽ More

    Submitted 19 July, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 28 pages, 18 figures; update adds discussion of a possible flaw of Nash voting, discussion of further possible research into MEC, as well as a few more references; updated to ICML version

  18. arXiv:2005.13092  [pdf, other

    cs.LG stat.ML

    Synthetic Petri Dish: A Novel Surrogate Model for Rapid Architecture Search

    Authors: Aditya Rawal, Joel Lehman, Felipe Petroski Such, Jeff Clune, Kenneth O. Stanley

    Abstract: Neural Architecture Search (NAS) explores a large space of architectural motifs -- a compute-intensive process that often involves ground-truth evaluation of each motif by instantiating it within a large network, and training and evaluating the network with thousands of domain-specific data samples. Inspired by how biological motifs such as cells are sometimes extracted from their natural environm… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

  19. First return, then explore

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: The promise of reinforcement learning is to solve complex sequential decision problems autonomously by specifying a high-level reward function only. However, reinforcement learning algorithms struggle when, as is often the case, simple and intuitive rewards provide sparse and deceptive feedback. Avoiding these pitfalls requires thoroughly exploring the environment, but creating algorithms that can… ▽ More

    Submitted 16 September, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: 47 pages, 14 figures, 4 tables; reorganized sections and modified SI text extensively; added reference to the published version, changed title to published title; added reference to published unformatted pdf

    Journal ref: Nature 590, 580-586 (2021)

  20. arXiv:2003.08536  [pdf, other

    cs.NE

    Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

    Authors: Rui Wang, Joel Lehman, Aditya Rawal, Jiale Zhi, Yulun Li, Jeff Clune, Kenneth O. Stanley

    Abstract: Creating open-ended algorithms, which generate their own never-ending stream of novel and appropriately challenging learning opportunities, could help to automate and accelerate progress in machine learning. A recent step in this direction is the Paired Open-Ended Trailblazer (POET), an algorithm that generates and solves its own challenges, and allows solutions to goal-switch between challenges t… ▽ More

    Submitted 13 April, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: 23 pages, 14 figures

  21. arXiv:2002.09571  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Learning to Continually Learn

    Authors: Shawn Beaulieu, Lapo Frati, Thomas Miconi, Joel Lehman, Kenneth O. Stanley, Jeff Clune, Nick Cheney

    Abstract: Continual lifelong learning requires an agent or model to learn many sequentially ordered tasks, building on previous knowledge without catastrophically forgetting it. Much work has gone towards preventing the default tendency of machine learning models to catastrophically forget, yet virtually all such work involves manually-designed solutions to the problem. We instead advocate meta-learning a s… ▽ More

    Submitted 3 March, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

  22. arXiv:2001.10560  [pdf, other

    cs.LG cs.AI stat.ML

    The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability

    Authors: Mehdi Ali, Hajira Jabeen, Charles Tapley Hoyt, Jens Lehman

    Abstract: There is an emerging trend of embedding knowledge graphs (KGs) in continuous vector spaces in order to use those for machine learning tasks. Recently, many knowledge graph embedding (KGE) models have been proposed that learn low dimensional representations while trying to maintain the structural properties of the KGs such as the similarity of nodes depending on their edges to other nodes. KGEs can… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

  23. arXiv:1912.07768  [pdf, other

    cs.LG stat.ML

    Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data

    Authors: Felipe Petroski Such, Aditya Rawal, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: This paper investigates the intriguing question of whether we can create learning algorithms that automatically generate training data, learning environments, and curricula in order to help AI agents rapidly learn. We show that such algorithms are possible via Generative Teaching Networks (GTNs), a general approach that is, in theory, applicable to supervised, unsupervised, and reinforcement learn… ▽ More

    Submitted 16 December, 2019; originally announced December 2019.

  24. arXiv:1907.06077  [pdf, other

    cs.NE

    Evolvability ES: Scalable and Direct Optimization of Evolvability

    Authors: Alexander Gajewski, Jeff Clune, Kenneth O. Stanley, Joel Lehman

    Abstract: Designing evolutionary algorithms capable of uncovering highly evolvable representations is an open challenge; such evolvability is important because it accelerates evolution and enables fast adaptation to changing circumstances. This paper introduces evolvability ES, an evolutionary algorithm designed to explicitly and efficiently optimize for evolvability, i.e. the ability to further adapt. The… ▽ More

    Submitted 13 July, 2019; originally announced July 2019.

    Comments: Published in GECCO 2019

  25. arXiv:1906.10918  [pdf, other

    cs.LG cs.AI cs.NE

    Towards Empathic Deep Q-Learning

    Authors: Bart Bussmann, Jacqueline Heinerman, Joel Lehman

    Abstract: As reinforcement learning (RL) scales to solve increasingly complex tasks, interest continues to grow in the fields of AI safety and machine ethics. As a contribution to these fields, this paper introduces an extension to Deep Q-Networks (DQNs), called Empathic DQN, that is loosely inspired both by empathy and the golden rule ("Do unto others as you would have them do unto you"). Empathic DQN aims… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: To be presented as a poster at the IJCAI-19 AI Safety Workshop

  26. arXiv:1906.10189  [pdf, other

    cs.NE cs.AI

    Evolutionary Computation and AI Safety: Research Problems Impeding Routine and Safe Real-world Application of Evolution

    Authors: Joel Lehman

    Abstract: Recent developments in artificial intelligence and machine learning have spurred interest in the growing field of AI safety, which studies how to prevent human-harming accidents when deploying AI systems. This paper thus explores the intersection of AI safety with evolutionary computation, to show how safety issues arise in evolutionary computation and how understanding from evolutionary computati… ▽ More

    Submitted 4 October, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

  27. arXiv:1906.09510  [pdf, other

    cs.LG stat.ML

    Learning Belief Representations for Imitation Learning in POMDPs

    Authors: Tanmay Gangwani, Joel Lehman, Qiang Liu, Jian Peng

    Abstract: We consider the problem of imitation learning from expert demonstrations in partially observable Markov decision processes (POMDPs). Belief representations, which characterize the distribution over the latent states in a POMDP, have been modeled using recurrent neural networks and probabilistic latent variable models, and shown to be effective for reinforcement learning in POMDPs. In this work, we… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI 2019)

  28. arXiv:1901.10995  [pdf, other

    cs.LG cs.AI stat.ML

    Go-Explore: a New Approach for Hard-Exploration Problems

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: A grand challenge in reinforcement learning is intelligent exploration, especially when rewards are sparse or deceptive. Two Atari games serve as benchmarks for such hard-exploration domains: Montezuma's Revenge and Pitfall. On both games, current RL algorithms perform poorly, even those with intrinsic motivation, which is the dominant method to improve performance on hard-exploration domains. To… ▽ More

    Submitted 26 February, 2021; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 37 pages, 14 figures; added references to Goyal et al. and Oh et al., updated reference to Colas et al; updated author emails; point readers to updated paper

  29. arXiv:1901.01753  [pdf, other

    cs.NE

    Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions

    Authors: Rui Wang, Joel Lehman, Jeff Clune, Kenneth O. Stanley

    Abstract: While the history of machine learning so far largely encompasses a series of problems posed by researchers and algorithms that learn their solutions, an important question is whether the problems themselves can be generated by the algorithm at the same time as they are being solved. Such a process would in effect build its own diverse and expanding curricula, and the solutions to problems at vario… ▽ More

    Submitted 20 February, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: 28 pages, 9 figures

  30. arXiv:1812.07069  [pdf, other

    cs.NE

    An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents

    Authors: Felipe Petroski Such, Vashisht Madhavan, Rosanne Liu, Rui Wang, Pablo Samuel Castro, Yulun Li, Jiale Zhi, Ludwig Schubert, Marc G. Bellemare, Jeff Clune, Joel Lehman

    Abstract: Much human and computational effort has aimed to improve how deep reinforcement learning algorithms perform on benchmarks such as the Atari Learning Environment. Comparatively less effort has focused on understanding what has been learned by such methods, and investigating and comparing the representations learned by different families of reinforcement learning (RL) algorithms. Sources of friction… ▽ More

    Submitted 29 May, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

  31. arXiv:1807.03247  [pdf, other

    cs.CV cs.LG stat.ML

    An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution

    Authors: Rosanne Liu, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric Frank, Alex Sergeev, Jason Yosinski

    Abstract: Few ideas have enjoyed as large an impact on deep learning as convolution. For any problem involving pixels or spatial representations, common intuition holds that convolutional neural networks may be appropriate. In this paper we show a striking counterexample to this intuition via the seemingly trivial coordinate transform problem, which simply requires learning a mapping between coordinates in… ▽ More

    Submitted 3 December, 2018; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: Published in NeurIPS 2018

  32. arXiv:1803.03453  [pdf, other

    cs.NE

    The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities

    Authors: Joel Lehman, Jeff Clune, Dusan Misevic, Christoph Adami, Lee Altenberg, Julie Beaulieu, Peter J. Bentley, Samuel Bernard, Guillaume Beslon, David M. Bryson, Patryk Chrabaszcz, Nick Cheney, Antoine Cully, Stephane Doncieux, Fred C. Dyer, Kai Olav Ellefsen, Robert Feldt, Stephan Fischer, Stephanie Forrest, Antoine Frénoy, Christian Gagné, Leni Le Goff, Laura M. Grabowski, Babak Hodjat, Frank Hutter , et al. (28 additional authors not shown)

    Abstract: Biological evolution provides a creative fount of complex and subtle adaptations, often surprising the scientists who discover them. However, because evolution is an algorithmic process that transcends the substrate in which it occurs, evolution's creativity is not limited to nature. Indeed, many researchers in the field of digital evolution have observed their evolving algorithms and organisms su… ▽ More

    Submitted 21 November, 2019; v1 submitted 9 March, 2018; originally announced March 2018.

  33. arXiv:1712.06568  [pdf, other

    cs.NE cs.AI

    ES Is More Than Just a Traditional Finite-Difference Approximator

    Authors: Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

    Abstract: An evolution strategy (ES) variant based on a simplification of a natural evolution strategy recently attracted attention because it performs surprisingly well in challenging deep reinforcement learning domains. It searches for neural network parameters by generating perturbations to the current set of parameters, checking their performance, and moving in the aggregate direction of higher reward.… ▽ More

    Submitted 1 May, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  34. arXiv:1712.06567  [pdf, other

    cs.NE cs.LG

    Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

    Authors: Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on challenging deep reinforcement learning (RL) problems. However, ES can be considered a gradient-based algorithm because it performs stochastic gradient descent via an ope… ▽ More

    Submitted 20 April, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  35. arXiv:1712.06563  [pdf, other

    cs.NE cs.AI

    Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

    Authors: Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

    Abstract: While neuroevolution (evolving neural networks) has a successful track record across a variety of domains from reinforcement learning to artificial life, it is rarely applied to large, deep neural networks. A central reason is that while random mutation generally works in low dimensions, a random perturbation of thousands or millions of weights is likely to break existing functionality, providing… ▽ More

    Submitted 1 May, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  36. arXiv:1712.06560  [pdf, other

    cs.AI

    Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents

    Authors: Edoardo Conti, Vashisht Madhavan, Felipe Petroski Such, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Evolution strategies (ES) are a family of black-box optimization algorithms able to train deep neural networks roughly as well as Q-learning and policy gradient methods on challenging deep reinforcement learning (RL) problems, but are much faster (e.g. hours vs. days) because they parallelize better. However, many RL problems require directed exploration because they have reward functions that are… ▽ More

    Submitted 29 October, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  37. arXiv:1604.07806  [pdf, other

    cs.AI cs.NE

    Using Indirect Encoding of Multiple Brains to Produce Multimodal Behavior

    Authors: Jacob Schrum, Joel Lehman, Sebastian Risi

    Abstract: An important challenge in neuroevolution is to evolve complex neural networks with multiple modes of behavior. Indirect encodings can potentially answer this challenge. Yet in practice, indirect encodings do not yield effective multimodal controllers. Thus, this paper introduces novel multimodal extensions to HyperNEAT, a popular indirect encoding. A previous multimodal HyperNEAT approach called s… ▽ More

    Submitted 26 April, 2016; originally announced April 2016.

  38. Evolvability Is Inevitable: Increasing Evolvability Without the Pressure to Adapt

    Authors: Joel Lehman, Kenneth O. Stanley

    Abstract: Why evolvability appears to have increased over evolutionary time is an important unresolved biological question. Unlike most candidate explanations, this paper proposes that increasing evolvability can result without any pressure to adapt. The insight is that if evolvability is heritable, then an unbiased drifting process across genotypes can still create a distribution of phenotypes biased towar… ▽ More

    Submitted 5 February, 2013; originally announced February 2013.