Skip to main content

Showing 1–15 of 15 results for author: Nye, M

  1. arXiv:2402.01913  [pdf, other

    cs.RO

    TartanDrive 2.0: More Modalities and Better Infrastructure to Further Self-Supervised Learning Research in Off-Road Driving Tasks

    Authors: Matthew Sivaprakasam, Parv Maheshwari, Mateo Guaman Castro, Samuel Triest, Micah Nye, Steve Willits, Andrew Saba, Wenshan Wang, Sebastian Scherer

    Abstract: We present TartanDrive 2.0, a large-scale off-road driving dataset for self-supervised learning tasks. In 2021 we released TartanDrive 1.0, which is one of the largest datasets for off-road terrain. As a follow-up to our original dataset, we collected seven hours of data at speeds of up to 15m/s with the addition of three new LiDAR sensors alongside the original camera, inertial, GPS, and proprioc… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  2. arXiv:2212.10012  [pdf, other

    cs.CL

    Language Modeling with Latent Situations

    Authors: Belinda Z. Li, Maxwell Nye, Jacob Andreas

    Abstract: Language models (LMs) often generate incoherent outputs: they refer to events and entity states that are incompatible with the state of the world described in their inputs. We introduce SituationSupervision, a family of approaches for improving coherence in LMs by training them to construct and condition on explicit representations of entities and their states. SituationSupervision has two compone… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 13 pages, 3 figures, 7 tables

  3. arXiv:2112.00114  [pdf, other

    cs.LG cs.NE

    Show Your Work: Scratchpads for Intermediate Computation with Language Models

    Authors: Maxwell Nye, Anders Johan Andreassen, Guy Gur-Ari, Henryk Michalewski, Jacob Austin, David Bieber, David Dohan, Aitor Lewkowycz, Maarten Bosma, David Luan, Charles Sutton, Augustus Odena

    Abstract: Large pre-trained language models perform remarkably well on tasks that can be done "in one pass", such as generating realistic text or synthesizing computer programs. However, they struggle with tasks that require unbounded multi-step computation, such as adding integers or executing programs. Surprisingly, we find that these same models are able to perform complex multi-step computations -- even… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

  4. arXiv:2108.07732  [pdf, other

    cs.PL cs.LG

    Program Synthesis with Large Language Models

    Authors: Jacob Austin, Augustus Odena, Maxwell Nye, Maarten Bosma, Henryk Michalewski, David Dohan, Ellen Jiang, Carrie Cai, Michael Terry, Quoc Le, Charles Sutton

    Abstract: This paper explores the limits of the current generation of large language models for program synthesis in general purpose programming languages. We evaluate a collection of such models (with between 244M and 137B parameters) on two new benchmarks, MBPP and MathQA-Python, in both the few-shot and fine-tuning regimes. Our benchmarks are designed to measure the ability of these models to synthesize… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: Jacob and Augustus contributed equally

  5. arXiv:2107.02794  [pdf, other

    cs.AI cs.CL cs.LG

    Improving Coherence and Consistency in Neural Sequence Models with Dual-System, Neuro-Symbolic Reasoning

    Authors: Maxwell Nye, Michael Henry Tessler, Joshua B. Tenenbaum, Brenden M. Lake

    Abstract: Human reasoning can often be understood as an interplay between two systems: the intuitive and associative ("System 1") and the deliberative and logical ("System 2"). Neural sequence models -- which have been increasingly successful at performing complex, structured tasks -- exhibit the advantages and failure modes of System 1: they are fast and learn patterns from data, but are often inconsistent… ▽ More

    Submitted 15 December, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021

  6. arXiv:2106.07824  [pdf, other

    cs.AI

    Communicating Natural Programs to Humans and Machines

    Authors: Samuel Acquaviva, Yewen Pu, Marta Kryven, Theodoros Sechopoulos, Catherine Wong, Gabrielle E Ecanow, Maxwell Nye, Michael Henry Tessler, Joshua B. Tenenbaum

    Abstract: The Abstraction and Reasoning Corpus (ARC) is a set of procedural tasks that tests an agent's ability to flexibly solve novel problems. While most ARC tasks are easy for humans, they are challenging for state-of-the-art AI. What makes building intelligent systems that can generalize to novel situations such as ARC difficult? We posit that the answer might be found by studying the difference of \em… ▽ More

    Submitted 19 May, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: equal contributions: (author 1,2) and (author 3,4,5). 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks

  7. arXiv:2106.00737  [pdf, other

    cs.CL

    Implicit Representations of Meaning in Neural Language Models

    Authors: Belinda Z. Li, Maxwell Nye, Jacob Andreas

    Abstract: Does the effectiveness of neural language models derive entirely from accurate modeling of surface word co-occurrence statistics, or do these models represent and reason about the world they describe? In BART and T5 transformer language models, we identify contextual word representations that function as models of entities and situations as they evolve throughout a discourse. These neural represen… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 15 pages, 6 figures; accepted to ACL 2021

  8. arXiv:2012.12964  [pdf, other

    cs.PL cs.AI cs.LG

    Representing Partial Programs with Blended Abstract Semantics

    Authors: Maxwell Nye, Yewen Pu, Matthew Bowers, Jacob Andreas, Joshua B. Tenenbaum, Armando Solar-Lezama

    Abstract: Synthesizing programs from examples requires searching over a vast, combinatorial space of possible programs. In this search process, a key challenge is representing the behavior of a partially written program before it can be executed, to judge if it is on the right track and predict where to search next. We introduce a general technique for representing partially written programs in a program sy… ▽ More

    Submitted 19 April, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: ICLR 2021

  9. arXiv:2006.08381  [pdf, other

    cs.AI cs.LG

    DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

    Authors: Kevin Ellis, Catherine Wong, Maxwell Nye, Mathias Sable-Meyer, Luc Cary, Lucas Morales, Luke Hewitt, Armando Solar-Lezama, Joshua B. Tenenbaum

    Abstract: Expert problem-solving is driven by powerful languages for thinking about problems and their solutions. Acquiring expertise means learning these languages -- systems of concepts, alongside the skills to use them. We present DreamCoder, a system that learns to solve problems by writing programs. It builds expertise by creating programming languages for expressing domain concepts, together with neur… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  10. arXiv:2003.13004  [pdf, other

    math.PR cs.IT math.DG q-bio.PE stat.ME

    Information geometry for phylogenetic trees

    Authors: Maryam K. Garba, Tom M. W. Nye, Jonas Lueg, Stephan F. Huckemann

    Abstract: We propose a new space of phylogenetic trees which we call wald space. The motivation is to develop a space suitable for statistical analysis of phylogenies, but with a geometry based on more biologically principled assumptions than existing spaces: in wald space, trees are close if they induce similar distributions on genetic sequence data. As a point set, wald space contains the previously devel… ▽ More

    Submitted 17 September, 2020; v1 submitted 29 March, 2020; originally announced March 2020.

    MSC Class: 92D15; 53A35; 94A17

  11. arXiv:2003.05562  [pdf, other

    cs.AI cs.LG

    Learning Compositional Rules via Neural Program Synthesis

    Authors: Maxwell I. Nye, Armando Solar-Lezama, Joshua B. Tenenbaum, Brenden M. Lake

    Abstract: Many aspects of human reasoning, including language, require learning rules from very little data. Humans can do this, often learning systematic rules from very few examples, and combining these rules to form compositional rule-based systems. Current neural architectures, on the other hand, often fail to generalize in a compositional manner, especially when evaluated in ways that vary systematical… ▽ More

    Submitted 22 October, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: NeurIPS 2020. Code can be found at https://github.com/mtensor/rulesynthesis

  12. arXiv:1906.04604  [pdf, other

    cs.PL cs.AI cs.LG cs.SE

    Write, Execute, Assess: Program Synthesis with a REPL

    Authors: Kevin Ellis, Maxwell Nye, Yewen Pu, Felix Sosa, Josh Tenenbaum, Armando Solar-Lezama

    Abstract: We present a neural program synthesis approach integrating components which write, execute, and assess code to navigate the search space of possible programs. We equip the search process with an interpreter or a read-eval-print-loop (REPL), which immediately executes partially written programs, exposing their semantics. The REPL addresses a basic challenge of program synthesis: tiny changes in syn… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: The first four authors contributed equally to this work

  13. arXiv:1902.06349  [pdf, other

    cs.AI cs.LG

    Learning to Infer Program Sketches

    Authors: Maxwell Nye, Luke Hewitt, Joshua Tenenbaum, Armando Solar-Lezama

    Abstract: Our goal is to build systems which write code automatically from the kinds of specifications humans can most easily provide, such as examples and natural language instruction. The key idea of this work is that a flexible combination of pattern recognition and explicit reasoning can be used to solve these complex programming problems. We propose a method for dynamically integrating these types of i… ▽ More

    Submitted 4 June, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

    Comments: Accepted to ICML 2019

  14. arXiv:1807.08919  [pdf, other

    cs.LG cs.AI stat.ML

    The Variational Homoencoder: Learning to learn high capacity generative models from few examples

    Authors: Luke B. Hewitt, Maxwell I. Nye, Andreea Gane, Tommi Jaakkola, Joshua B. Tenenbaum

    Abstract: Hierarchical Bayesian methods can unify many related tasks (e.g. k-shot classification, conditional and unconditional generation) as inference within a single generative model. However, when this generative model is expressed as a powerful neural network such as a PixelCNN, we show that existing learning techniques typically fail to effectively use latent variables. To address this, we develop a m… ▽ More

    Submitted 24 July, 2018; originally announced July 2018.

    Comments: UAI 2018 oral presentation

  15. arXiv:1807.06399  [pdf, other

    cs.LG cs.NE stat.ML

    Are Efficient Deep Representations Learnable?

    Authors: Maxwell Nye, Andrew Saxe

    Abstract: Many theories of deep learning have shown that a deep network can require dramatically fewer resources to represent a given function compared to a shallow network. But a question remains: can these efficient representations be learned using current deep learning techniques? In this work, we test whether standard deep learning methods can in fact find the efficient representations posited by severa… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: Presented at ICLR 2018 Workshop