Skip to main content

Showing 1–15 of 15 results for author: DiCarlo, J

  1. arXiv:2401.06005  [pdf, other

    q-bio.NC cs.AI cs.CV cs.LG

    How does the primate brain combine generative and discriminative computations in vision?

    Authors: Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

    Abstract: Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remo… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  2. arXiv:2312.14285  [pdf, other

    q-bio.NC cs.LG cs.NE

    Probing Biological and Artificial Neural Networks with Task-dependent Neural Manifolds

    Authors: Michael Kuoch, Chi-Ning Chou, Nikhil Parthasarathy, Joel Dapello, James J. DiCarlo, Haim Sompolinsky, SueYeon Chung

    Abstract: Recently, growth in our understanding of the computations performed in both biological and artificial neural networks has largely been driven by either low-level mechanistic studies or global normative approaches. However, concrete methodologies for bridging the gap between these levels of abstraction remain elusive. In this work, we investigate the internal mechanisms of neural networks through t… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the Conference on Parsimony and Learning (CPAL) 2024

  3. arXiv:2308.06887  [pdf, other

    cs.CV cs.AI q-bio.NC

    Robustified ANNs Reveal Wormholes Between Human Category Percepts

    Authors: Guy Gaziv, Michael J. Lee, James J. DiCarlo

    Abstract: The visual object category reports of artificial neural networks (ANNs) are notoriously sensitive to tiny, adversarial image perturbations. Because human category reports (aka human percepts) are thought to be insensitive to those same small-norm perturbations -- and locally stable in general -- this argues that ANNs are incomplete scientific models of human visual perception. Consistent with this… ▽ More

    Submitted 4 October, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

    Comments: In NeurIPS 2023. Code: https://github.com/ggaziv/Wormholes Project Webpage: https://himjl.github.io/pwormholes

    Journal ref: https://neurips.cc/virtual/2023/poster/72812

  4. arXiv:2210.08974  [pdf

    cs.CY

    Coordinated Science Laboratory 70th Anniversary Symposium: The Future of Computing

    Authors: Klara Nahrstedt, Naresh Shanbhag, Vikram Adve, Nancy Amato, Romit Roy Choudhury, Carl Gunter, Nam Sung Kim, Olgica Milenkovic, Sayan Mitra, Lav Varshney, Yurii Vlasov, Sarita Adve, Rashid Bashir, Andreas Cangellaris, James DiCarlo, Katie Driggs-Campbell, Nick Feamster, Mattia Gazzola, Karrie Karahalios, Sanmi Koyejo, Paul Kwiat, Bo Li, Negar Mehr, Ravish Mehra, Andrew Miller , et al. (3 additional authors not shown)

    Abstract: In 2021, the Coordinated Science Laboratory CSL, an Interdisciplinary Research Unit at the University of Illinois Urbana-Champaign, hosted the Future of Computing Symposium to celebrate its 70th anniversary. CSL's research covers the full computing stack, computing's impact on society and the resulting need for social responsibility. In this white paper, we summarize the major technological points… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

  5. arXiv:2210.08340  [pdf

    cs.AI q-bio.NC

    Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution

    Authors: Anthony Zador, Sean Escola, Blake Richards, Bence Ölveczky, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann LeCun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo , et al. (2 additional authors not shown)

    Abstract: Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts… ▽ More

    Submitted 22 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: White paper, 10 pages + 8 pages of references, 1 figures

  6. arXiv:2206.11228  [pdf, other

    q-bio.NC cs.LG

    Adversarially trained neural representations may already be as robust as corresponding biological neural representations

    Authors: Chong Guo, Michael J. Lee, Guillaume Leclerc, Joel Dapello, Yug Rao, Aleksander Madry, James J. DiCarlo

    Abstract: Visual systems of primates are the gold standard of robust perception. There is thus a general belief that mimicking the neural representations that underlie those systems will yield artificial visual systems that are adversarially robust. In this work, we develop a method for performing adversarial visual attacks directly on primate brain activity. We then leverage this method to demonstrate that… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 10 pages, 6 figures, ICML2022

  7. arXiv:2111.06979  [pdf, other

    q-bio.NC cs.LG cs.NE

    Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception

    Authors: Joel Dapello, Jenelle Feather, Hang Le, Tiago Marques, David D. Cox, Josh H. McDermott, James J. DiCarlo, SueYeon Chung

    Abstract: Adversarial examples are often cited by neuroscientists and machine learning researchers as an example of how computational models diverge from biological sensory systems. Recent work has proposed adding biologically-inspired components to visual neural networks as a way to improve their adversarial robustness. One surprisingly effective component for reducing adversarial vulnerability is response… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  8. arXiv:2110.10645  [pdf, other

    eess.IV cs.CV q-bio.NC

    Combining Different V1 Brain Model Variants to Improve Robustness to Image Corruptions in CNNs

    Authors: Avinash Baidya, Joel Dapello, James J. DiCarlo, Tiago Marques

    Abstract: While some convolutional neural networks (CNNs) have surpassed human visual abilities in object classification, they often struggle to recognize objects in images corrupted with different types of common noise patterns, highlighting a major limitation of this family of models. Recently, it has been shown that simulating a primary visual cortex (V1) at the front of CNNs leads to small improvements… ▽ More

    Submitted 7 December, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 15 pages with supplementary material, 3 main figures, 2 supplementary figures, 4 supplementary tables

    Journal ref: Workshop on Shared Visual Representations in Human and Machine Intelligence 2021

  9. arXiv:2103.14025  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark for Physically Realistic Embodied AI

    Authors: Chuang Gan, Siyuan Zhou, Jeremy Schwartz, Seth Alter, Abhishek Bhandwaldar, Dan Gutfreund, Daniel L. K. Yamins, James J DiCarlo, Josh McDermott, Antonio Torralba, Joshua B. Tenenbaum

    Abstract: We introduce a visually-guided and physics-driven task-and-motion planning benchmark, which we call the ThreeDWorld Transport Challenge. In this challenge, an embodied agent equipped with two 9-DOF articulated arms is spawned randomly in a simulated physical home environment. The agent is required to find a small set of objects scattered around the house, pick them up, and transport them to a desi… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: Project page: http://tdw-transport.csail.mit.edu/

  10. arXiv:2007.04954  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

    Authors: Chuang Gan, Jeremy Schwartz, Seth Alter, Damian Mrowca, Martin Schrimpf, James Traer, Julian De Freitas, Jonas Kubilius, Abhishek Bhandwaldar, Nick Haber, Megumi Sano, Kuno Kim, Elias Wang, Michael Lingelbach, Aidan Curtis, Kevin Feigelis, Daniel M. Bear, Dan Gutfreund, David Cox, Antonio Torralba, James J. DiCarlo, Joshua B. Tenenbaum, Josh H. McDermott, Daniel L. K. Yamins

    Abstract: We introduce ThreeDWorld (TDW), a platform for interactive multi-modal physical simulation. TDW enables simulation of high-fidelity sensory data and physical interactions between mobile agents and objects in rich 3D environments. Unique properties include: real-time near-photo-realistic image rendering; a library of objects and environments, and routines for their customization; generative procedu… ▽ More

    Submitted 28 December, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Oral Presentation at NeurIPS 21 Datasets and Benchmarks Track. Project page: http://www.threedworld.org

  11. arXiv:1909.06161  [pdf, other

    cs.CV cs.LG cs.NE eess.IV q-bio.NC

    Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs

    Authors: Jonas Kubilius, Martin Schrimpf, Kohitij Kar, Ha Hong, Najib J. Majaj, Rishi Rajalingham, Elias B. Issa, Pouya Bashivan, Jonathan Prescott-Roy, Kailyn Schmidt, Aran Nayebi, Daniel Bear, Daniel L. K. Yamins, James J. DiCarlo

    Abstract: Deep convolutional artificial neural networks (ANNs) are the leading class of candidate models of the mechanisms of visual processing in the primate ventral stream. While initially inspired by brain anatomy, over the past years, these ANNs have evolved from a simple eight-layer architecture in AlexNet to extremely deep and branching architectures, demonstrating increasingly better object categoriz… ▽ More

    Submitted 28 October, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: NeurIPS 2019 (Oral). Code available at https://github.com/dicarlolab/neurips2019

  12. arXiv:1808.01405  [pdf, other

    cs.CV

    Teacher Guided Architecture Search

    Authors: Pouya Bashivan, Mark Tensen, James J DiCarlo

    Abstract: Much of the recent improvement in neural networks for computer vision has resulted from discovery of new networks architectures. Most prior work has used the performance of candidate models following limited training to automatically guide the search in a feasible way. Could further gains in computational efficiency be achieved by guiding the search via measurements of a high performing network wi… ▽ More

    Submitted 6 September, 2019; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: Accepted to ICCV 2019

  13. arXiv:1807.00053  [pdf, other

    q-bio.NC cs.AI cs.CV cs.LG cs.NE

    Task-Driven Convolutional Recurrent Models of the Visual System

    Authors: Aran Nayebi, Daniel Bear, Jonas Kubilius, Kohitij Kar, Surya Ganguli, David Sussillo, James J. DiCarlo, Daniel L. K. Yamins

    Abstract: Feed-forward convolutional neural networks (CNNs) are currently state-of-the-art for object classification tasks such as ImageNet. Further, they are quantitatively accurate models of temporally-averaged responses of neurons in the primate brain's visual system. However, biological visual systems have two ubiquitous architectural features not shared with typical CNNs: local recurrence within cortic… ▽ More

    Submitted 26 October, 2018; v1 submitted 20 June, 2018; originally announced July 2018.

    Comments: NIPS 2018 Camera Ready Version, 16 pages including supplementary information, 6 figures

  14. Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition

    Authors: Charles F. Cadieu, Ha Hong, Daniel L. K. Yamins, Nicolas Pinto, Diego Ardila, Ethan A. Solomon, Najib J. Majaj, James J. DiCarlo

    Abstract: The primate visual system achieves remarkable visual object recognition performance even in brief presentations and under changes to object exemplar, geometric transformations, and background variation (a.k.a. core visual object recognition). This remarkable performance is mediated by the representation formed in inferior temporal (IT) cortex. In parallel, recent advances in machine learning have… ▽ More

    Submitted 12 June, 2014; originally announced June 2014.

    Comments: 35 pages, 12 figures, extends and expands upon arXiv:1301.3530

  15. arXiv:1301.3530  [pdf, other

    cs.NE cs.CV cs.LG q-bio.NC

    The Neural Representation Benchmark and its Evaluation on Brain and Machine

    Authors: Charles F. Cadieu, Ha Hong, Dan Yamins, Nicolas Pinto, Najib J. Majaj, James J. DiCarlo

    Abstract: A key requirement for the development of effective learning representations is their evaluation and comparison to representations we know to be effective. In natural sensory domains, the community has viewed the brain as a source of inspiration and as an implicit benchmark for success. However, it has not been possible to directly test representational learning algorithms directly against the repr… ▽ More

    Submitted 25 January, 2013; v1 submitted 15 January, 2013; originally announced January 2013.

    Comments: The v1 version contained incorrectly computed kernel analysis curves and KA-AUC values for V4, IT, and the HT-L3 models. They have been corrected in this version