Showing 1–16 of 16 results for author: DiCarlo, J

Searching in archive q-bio.
  1. arXiv:2506.05633  [pdf, ps, other]

    q-bio.NC cs.CV cs.NE

    Noninvasive precision modulation of high-level neural population activity via natural vision perturbations

    Authors: Guy Gaziv, Sarah Goulding, Ani Ayvazian-Hancock, Yoon Bai, James J. DiCarlo

    Abstract: Precise control of neural activity -- modulating target neurons deep in the brain while leaving nearby neurons unaffected -- is an outstanding challenge in neuroscience, generally approached using invasive techniques. This study investigates the possibility of precisely and noninvasively modulating neural activity in the high-level primate ventral visual stream via perturbations on one's natural v…

    Submitted 9 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2412.09115  [pdf, other]

    q-bio.NC cs.CV cs.LG cs.NE

    Vision CNNs trained to estimate spatial latents learned similar ventral-stream-aligned representations

    Authors: Yudi Xie, Weichen Huang, Esther Alter, Jeremy Schwartz, Joshua B. Tenenbaum, James J. DiCarlo

    Abstract: Studies of the functional role of the primate ventral visual stream have traditionally focused on object categorization, often ignoring -- despite much prior evidence -- its role in estimating "spatial" latents such as object position and pose. Most leading ventral stream models are derived by optimizing networks for object categorization, which seems to imply that the ventral stream is also deriv…

    Submitted 17 February, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

    Comments: 30 pages, 21 figures, ICLR 2025

  3. arXiv:2401.06005  [pdf, other]

    q-bio.NC cs.AI cs.CV cs.LG

    How does the primate brain combine generative and discriminative computations in vision?

    Authors: Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

    Abstract: Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remo…

    Submitted 11 January, 2024; originally announced January 2024.

  4. arXiv:2312.14285  [pdf, other]

    q-bio.NC cs.LG cs.NE

    Probing Biological and Artificial Neural Networks with Task-dependent Neural Manifolds

    Authors: Michael Kuoch, Chi-Ning Chou, Nikhil Parthasarathy, Joel Dapello, James J. DiCarlo, Haim Sompolinsky, SueYeon Chung

    Abstract: Recently, growth in our understanding of the computations performed in both biological and artificial neural networks has largely been driven by either low-level mechanistic studies or global normative approaches. However, concrete methodologies for bridging the gap between these levels of abstraction remain elusive. In this work, we investigate the internal mechanisms of neural networks through t…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the Conference on Parsimony and Learning (CPAL) 2024

  5. arXiv:2312.05956  [pdf, other]

    q-bio.NC

    The Quest for an Integrated Set of Neural Mechanisms Underlying Object Recognition in Primates

    Authors: Kohitij Kar, James J DiCarlo

    Abstract: Visual object recognition -- the behavioral ability to rapidly and accurately categorize many visually encountered objects -- is core to primate cognition. This behavioral capability is algorithmically impressive because of the myriad identity-preserving viewpoints and scenes that dramatically change the visual image produced by the same object. Until recently, the brain mechanisms that support th…

    Submitted 10 December, 2023; originally announced December 2023.

  6. arXiv:2308.06887  [pdf, other]

    cs.CV cs.AI q-bio.NC

    Robustified ANNs Reveal Wormholes Between Human Category Percepts

    Authors: Guy Gaziv, Michael J. Lee, James J. DiCarlo

    Abstract: The visual object category reports of artificial neural networks (ANNs) are notoriously sensitive to tiny, adversarial image perturbations. Because human category reports (aka human percepts) are thought to be insensitive to those same small-norm perturbations -- and locally stable in general -- this argues that ANNs are incomplete scientific models of human visual perception. Consistent with this…

    Submitted 4 October, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

    Comments: In NeurIPS 2023. Code: https://github.com/ggaziv/Wormholes Project Webpage: https://himjl.github.io/pwormholes

    Journal ref: https://neurips.cc/virtual/2023/poster/72812

  7. arXiv:2210.08340  [pdf]

    cs.AI q-bio.NC

    Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution

    Authors: Anthony Zador, Sean Escola, Blake Richards, Bence Ölveczky, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann LeCun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo, et al. (2 additional authors not shown)

    Abstract: Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts…

    Submitted 22 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: White paper, 10 pages + 8 pages of references, 1 figure

  8. arXiv:2206.11228  [pdf, other]

    q-bio.NC cs.LG

    Adversarially trained neural representations may already be as robust as corresponding biological neural representations

    Authors: Chong Guo, Michael J. Lee, Guillaume Leclerc, Joel Dapello, Yug Rao, Aleksander Madry, James J. DiCarlo

    Abstract: Visual systems of primates are the gold standard of robust perception. There is thus a general belief that mimicking the neural representations that underlie those systems will yield artificial visual systems that are adversarially robust. In this work, we develop a method for performing adversarial visual attacks directly on primate brain activity. We then leverage this method to demonstrate that…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 10 pages, 6 figures, ICML2022

  9. arXiv:2111.06979  [pdf, other]

    q-bio.NC cs.LG cs.NE

    Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception

    Authors: Joel Dapello, Jenelle Feather, Hang Le, Tiago Marques, David D. Cox, Josh H. McDermott, James J. DiCarlo, SueYeon Chung

    Abstract: Adversarial examples are often cited by neuroscientists and machine learning researchers as an example of how computational models diverge from biological sensory systems. Recent work has proposed adding biologically-inspired components to visual neural networks as a way to improve their adversarial robustness. One surprisingly effective component for reducing adversarial vulnerability is response…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  10. arXiv:2110.10645  [pdf, other]

    eess.IV cs.CV q-bio.NC

    Combining Different V1 Brain Model Variants to Improve Robustness to Image Corruptions in CNNs

    Authors: Avinash Baidya, Joel Dapello, James J. DiCarlo, Tiago Marques

    Abstract: While some convolutional neural networks (CNNs) have surpassed human visual abilities in object classification, they often struggle to recognize objects in images corrupted with different types of common noise patterns, highlighting a major limitation of this family of models. Recently, it has been shown that simulating a primary visual cortex (V1) at the front of CNNs leads to small improvements…

    Submitted 7 December, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 15 pages with supplementary material, 3 main figures, 2 supplementary figures, 4 supplementary tables

    Journal ref: Workshop on Shared Visual Representations in Human and Machine Intelligence 2021

  11. Topographic Deep Artificial Neural Networks (TDANNs) predict face selectivity topography in primate inferior temporal (IT) cortex

    Authors: Hyodong Lee, James J. DiCarlo

    Abstract: Deep convolutional neural networks are biologically driven models that resemble the hierarchical structure of primate visual cortex and are the current best predictors of the neural responses measured along the ventral stream. However, the networks lack topographic properties that are present in the visual cortex, such as orientation maps in primary visual cortex and category-selective maps in inf…

    Submitted 21 September, 2019; originally announced September 2019.

    Comments: 2018 Conference on Cognitive Computational Neuroscience

  12. arXiv:1909.06161  [pdf, other]

    cs.CV cs.LG cs.NE eess.IV q-bio.NC

    Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs

    Authors: Jonas Kubilius, Martin Schrimpf, Kohitij Kar, Ha Hong, Najib J. Majaj, Rishi Rajalingham, Elias B. Issa, Pouya Bashivan, Jonathan Prescott-Roy, Kailyn Schmidt, Aran Nayebi, Daniel Bear, Daniel L. K. Yamins, James J. DiCarlo

    Abstract: Deep convolutional artificial neural networks (ANNs) are the leading class of candidate models of the mechanisms of visual processing in the primate ventral stream. While initially inspired by brain anatomy, over the past years, these ANNs have evolved from a simple eight-layer architecture in AlexNet to extremely deep and branching architectures, demonstrating increasingly better object categoriz…

    Submitted 28 October, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: NeurIPS 2019 (Oral). Code available at https://github.com/dicarlolab/neurips2019

  13. arXiv:1807.00053  [pdf, other]

    q-bio.NC cs.AI cs.CV cs.LG cs.NE

    Task-Driven Convolutional Recurrent Models of the Visual System

    Authors: Aran Nayebi, Daniel Bear, Jonas Kubilius, Kohitij Kar, Surya Ganguli, David Sussillo, James J. DiCarlo, Daniel L. K. Yamins

    Abstract: Feed-forward convolutional neural networks (CNNs) are currently state-of-the-art for object classification tasks such as ImageNet. Further, they are quantitatively accurate models of temporally-averaged responses of neurons in the primate brain's visual system. However, biological visual systems have two ubiquitous architectural features not shared with typical CNNs: local recurrence within cortic…

    Submitted 26 October, 2018; v1 submitted 20 June, 2018; originally announced July 2018.

    Comments: NIPS 2018 Camera Ready Version, 16 pages including supplementary information, 6 figures

  14. arXiv:1703.07633  [pdf]

    physics.bio-ph physics.med-ph physics.optics q-bio.NC

    Deep brain fluorescence imaging with minimally invasive ultra-thin optical fibers

    Authors: Shay Ohayon, Antonio Miguel Caravaca-Aguirre, Rafael Piestun, James J. DiCarlo

    Abstract: A major open challenge in neuroscience is the ability to measure and perturb neural activity in vivo from well-defined neural sub-populations at cellular resolution anywhere in the brain. However, limitations posed by scattering and absorption prohibit non-invasive (surface) multiphoton approaches for deep (>2mm) structures, while Gradient Refractive Index (GRIN) endoscopes are thick and cause si…

    Submitted 9 November, 2017; v1 submitted 3 March, 2017; originally announced March 2017.

  15. Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition

    Authors: Charles F. Cadieu, Ha Hong, Daniel L. K. Yamins, Nicolas Pinto, Diego Ardila, Ethan A. Solomon, Najib J. Majaj, James J. DiCarlo

    Abstract: The primate visual system achieves remarkable visual object recognition performance even in brief presentations and under changes to object exemplar, geometric transformations, and background variation (a.k.a. core visual object recognition). This remarkable performance is mediated by the representation formed in inferior temporal (IT) cortex. In parallel, recent advances in machine learning have…

    Submitted 12 June, 2014; originally announced June 2014.

    Comments: 35 pages, 12 figures, extends and expands upon arXiv:1301.3530

  16. arXiv:1301.3530  [pdf, other]

    cs.NE cs.CV cs.LG q-bio.NC

    The Neural Representation Benchmark and its Evaluation on Brain and Machine

    Authors: Charles F. Cadieu, Ha Hong, Dan Yamins, Nicolas Pinto, Najib J. Majaj, James J. DiCarlo

    Abstract: A key requirement for the development of effective learning representations is their evaluation and comparison to representations we know to be effective. In natural sensory domains, the community has viewed the brain as a source of inspiration and as an implicit benchmark for success. However, it has not been possible to directly test representational learning algorithms directly against the repr…

    Submitted 25 January, 2013; v1 submitted 15 January, 2013; originally announced January 2013.

    Comments: The v1 version contained incorrectly computed kernel analysis curves and KA-AUC values for V4, IT, and the HT-L3 models. They have been corrected in this version.