Showing 1–14 of 14 results for author: Gershman, S J

  1. arXiv:2402.06590  [pdf, other]

    cs.AI cs.LG

    Predictive representations: building blocks of intelligence

    Authors: Wilka Carvalho, Momchil S. Tomov, William de Cothi, Caswell Barry, Samuel J. Gershman

    Abstract: Adaptive behavior often requires predicting future events. The theory of reinforcement learning prescribes what kinds of predictive representations are useful and how to compute them. This paper integrates these theoretical ideas with work on cognition and neuroscience. We pay special attention to the successor representation (SR) and its generalizations, which have been widely applied both as eng…

    Submitted 11 July, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: accepted to Neural Computation

  2. arXiv:2312.08519  [pdf]

    q-bio.NC cs.AI

    Reconciling Shared versus Context-Specific Information in a Neural Network Model of Latent Causes

    Authors: Qihong Lu, Tan T. Nguyen, Qiong Zhang, Uri Hasson, Thomas L. Griffiths, Jeffrey M. Zacks, Samuel J. Gershman, Kenneth A. Norman

    Abstract: It has been proposed that, when processing a stream of events, humans divide their experiences in terms of inferred latent causes (LCs) to support context-dependent learning. However, when shared structure is present across contexts, it is still unclear how the "splitting" of LCs and learning of shared structure can be simultaneously achieved. Here, we present the Latent Cause Network (LCNet), a n…

    Submitted 6 June, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  3. arXiv:2312.03759  [pdf, ps, other]

    cs.CL cs.AI cs.CY cs.DL

    How should the advent of large language models affect the practice of science?

    Authors: Marcel Binz, Stephan Alaniz, Adina Roskies, Balazs Aczel, Carl T. Bergstrom, Colin Allen, Daniel Schad, Dirk Wulff, Jevin D. West, Qiong Zhang, Richard M. Shiffrin, Samuel J. Gershman, Ven Popov, Emily M. Bender, Marco Marelli, Matthew M. Botvinick, Zeynep Akata, Eric Schulz

    Abstract: Large language models (LLMs) are being increasingly incorporated into scientific workflows. However, we have yet to fully grasp the implications of this integration. How should the advent of large language models affect the practice of science? For this opinion piece, we have invited four diverse groups of scientists to reflect on this query, sharing their perspectives and engaging in debate. Schu…

    Submitted 5 December, 2023; originally announced December 2023.

  4. arXiv:2310.06110  [pdf, other]

    stat.ML cond-mat.dis-nn cs.LG

    Grokking as the Transition from Lazy to Rich Training Dynamics

    Authors: Tanishq Kumar, Blake Bordelon, Samuel J. Gershman, Cengiz Pehlevan

    Abstract: We propose that the grokking phenomenon, where the train loss of a neural network decreases much earlier than its test loss, can arise due to a neural network transitioning from lazy training dynamics to a rich, feature learning regime. To illustrate this mechanism, we study the simple setting of vanilla gradient descent on a polynomial regression problem with a two layer neural network which exhi…

    Submitted 11 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Adding new experiments on higher degree Hermite polynomials, multi-index targets, removed DMFT analysis from this version

  5. arXiv:2305.15277  [pdf, other]

    cs.LG

    Successor-Predecessor Intrinsic Exploration

    Authors: Changmin Yu, Neil Burgess, Maneesh Sahani, Samuel J. Gershman

    Abstract: Exploration is essential in reinforcement learning, particularly in environments where external rewards are sparse. Here we focus on exploration with intrinsic rewards, where the agent transiently augments the external rewards with self-generated intrinsic rewards. Although the study of intrinsic rewards has a long history, existing methods focus on composing the intrinsic reward based on measures…

    Submitted 25 January, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  6. arXiv:2107.12544  [pdf, other]

    cs.AI

    Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning

    Authors: Pedro A. Tsividis, Joao Loula, Jake Burga, Nathan Foss, Andres Campero, Thomas Pouncy, Samuel J. Gershman, Joshua B. Tenenbaum

    Abstract: Reinforcement learning (RL) studies how an agent comes to achieve reward in an environment through interactions over time. Recent advances in machine RL have surpassed human expertise at the world's oldest board games and many classic video games, but they require vast quantities of experience to learn successfully -- none of today's algorithms account for the human ability to learn so many differ…

    Submitted 26 July, 2021; originally announced July 2021.

  7. arXiv:2107.06393  [pdf, other]

    cs.CV cs.AI cs.LG

    Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

    Authors: Tuan Anh Le, Katherine M. Collins, Luke Hewitt, Kevin Ellis, N. Siddharth, Samuel J. Gershman, Joshua B. Tenenbaum

    Abstract: Modeling complex phenomena typically involves the use of both discrete and continuous variables. Such a setting applies across a wide range of problems, from identifying trends in time-series data to performing effective compositional scene understanding in images. Here, we propose Hybrid Memoised Wake-Sleep (HMWS), an algorithm for effective inference in such hybrid discrete-continuous models. Pr…

    Submitted 20 April, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

    Journal ref: ICLR 2022

  8. arXiv:2012.15814  [pdf, other]

    cs.LG cs.CL cs.CV stat.ML

    Language-Mediated, Object-Centric Representation Learning

    Authors: Ruocheng Wang, Jiayuan Mao, Samuel J. Gershman, Jiajun Wu

    Abstract: We present Language-mediated, Object-centric Representation Learning (LORL), a paradigm for learning disentangled, object-centric scene representations from vision and language. LORL builds upon recent advances in unsupervised object discovery and segmentation, notably MONet and Slot Attention. While these algorithms learn an object-centric representation just by reconstructing the input image, LO…

    Submitted 8 June, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: ACL 2021 Findings. First two authors contributed equally; last two authors contributed equally. Project page: https://lang-orl.github.io/

  9. arXiv:1909.05885  [pdf, other]

    cs.CL cs.LG stat.ML

    Analyzing machine-learned representations: A natural language case study

    Authors: Ishita Dasgupta, Demi Guo, Samuel J. Gershman, Noah D. Goodman

    Abstract: As modern deep networks become more complex, and get closer to human-like capabilities in certain domains, the question arises of how the representations and decision rules they learn compare to the ones in humans. In this work, we study representations of sentences in one such artificial system for natural language processing. We first present a diagnostic test dataset to examine the degree of ab…

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: This article supersedes a previous article arXiv:1802.04302

  10. arXiv:1805.11571  [pdf, other]

    stat.ML cs.LG

    Human-in-the-Loop Interpretability Prior

    Authors: Isaac Lage, Andrew Slavin Ross, Been Kim, Samuel J. Gershman, Finale Doshi-Velez

    Abstract: We often desire our models to be interpretable as well as accurate. Prior work on optimizing models for interpretability has relied on easy-to-quantify proxies for interpretability, such as sparsity or the number of operations required. In this work, we optimize for interpretability by directly including humans in the optimization loop. We develop an algorithm that minimizes the number of user stu…

    Submitted 30 October, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: To appear at NIPS 2018, selected for a spotlight. 13 pages (incl references and appendix)

  11. arXiv:1802.06426  [pdf, other]

    cs.AI q-bio.NC

    Estimating scale-invariant future in continuous time

    Authors: Zoran Tiganj, Samuel J. Gershman, Per B. Sederberg, Marc W. Howard

    Abstract: Natural learners must compute an estimate of future outcomes that follow from a stimulus in continuous time. Widely used reinforcement learning algorithms discretize continuous time and estimate either transition functions from one step to the next (model-based algorithms) or a scalar value of exponentially-discounted future reward using the Bellman equation (model-free algorithms). An important d…

    Submitted 26 October, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

    Comments: 25 pages, 10 figures

  12. arXiv:1802.04302  [pdf, other]

    cs.CL stat.ML

    Evaluating Compositionality in Sentence Embeddings

    Authors: Ishita Dasgupta, Demi Guo, Andreas Stuhlmüller, Samuel J. Gershman, Noah D. Goodman

    Abstract: An important challenge for human-like AI is compositional semantics. Recent research has attempted to address this by using deep neural networks to learn vector space embeddings of sentences, which then serve as input to other tasks. We present a new dataset for one such task, 'natural language inference' (NLI), that cannot be solved using only word-level knowledge and requires some compositionali…

    Submitted 17 May, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

  13. arXiv:1606.02396  [pdf, other]

    stat.ML cs.AI cs.LG cs.NE

    Deep Successor Reinforcement Learning

    Authors: Tejas D. Kulkarni, Ardavan Saeedi, Simanta Gautam, Samuel J. Gershman

    Abstract: Learning robust value functions given raw observations and rewards is now possible with model-free and model-based deep reinforcement learning algorithms. There is a third alternative, called Successor Representations (SR), which decomposes the value function into two components -- a reward predictor and a successor map. The successor map represents the expected future state occupancy from any giv…

    Submitted 8 June, 2016; originally announced June 2016.

    Comments: 10 pages, 6 figures

  14. arXiv:1604.00289  [pdf, other]

    cs.AI cs.CV cs.LG cs.NE stat.ML

    Building Machines That Learn and Think Like People

    Authors: Brenden M. Lake, Tomer D. Ullman, Joshua B. Tenenbaum, Samuel J. Gershman

    Abstract: Recent progress in artificial intelligence (AI) has renewed interest in building systems that learn and think like people. Many advances have come from using deep neural networks trained end-to-end in tasks such as object recognition, video games, and board games, achieving performance that equals or even beats humans in some respects. Despite their biological inspiration and performance achieveme…

    Submitted 2 November, 2016; v1 submitted 1 April, 2016; originally announced April 2016.

    Comments: In press at Behavioral and Brain Sciences. Open call for commentary proposals (until Nov. 22, 2016). https://www.cambridge.org/core/journals/behavioral-and-brain-sciences/information/calls-for-commentary/open-calls-for-commentary