A separability-based approach to quantifying generalization: which layer is best?

Authors: Luciano Dyballa, Evan Gerritz, Steven W. Zucker

Abstract: Generalization to unseen data remains poorly understood for deep learning classification and foundation models, especially in the open set scenario. How can one assess the ability of networks to adapt to new or extended versions of their input space in the spirit of few-shot learning, out-of-distribution generalization, domain adaptation, and category discovery? Which layers of a network are likely to generalize best? We provide a new method for evaluating the capacity of networks to represent a sampled domain, regardless of whether the network has been trained on all classes in that domain. Our approach is the following: after fine-tuning state-of-the-art pre-trained models for visual classification on a particular domain, we assess their performance on data from related but distinct variations in that domain. Generalization power is quantified as a function of the latent embeddings of unseen data from intermediate layers for both unsupervised and supervised settings. Working throughout all stages of the network, we find that (i) high classification accuracy does not imply high generalizability; and (ii) deeper layers in a model do not always generalize the best, which has implications for pruning. Since the trends observed across datasets are largely consistent, we conclude that our approach reveals (a function of) the intrinsic capacity of the different layers of a model to generalize. Our code is available at https://github.com/dyballa/generalization

Submitted 2 November, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

Comments: 7 pages, 6 figures

ACM Class: I.5.1; I.2.6; I.4.10
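
The abstract above says that generalization is quantified from the latent embeddings of unseen data at intermediate layers, but it does not spell out the measure. As a purely illustrative stand-in (not the authors' method), one could score how separable unseen classes are at a given layer with leave-one-out 1-nearest-neighbor accuracy. The function name `loo_1nn_accuracy` and the toy embeddings below are hypothetical.

```python
import math

def loo_1nn_accuracy(embeddings, labels):
    """Leave-one-out 1-nearest-neighbor accuracy of labeled embedding vectors."""
    correct = 0
    for i, (x, y) in enumerate(zip(embeddings, labels)):
        best_dist, best_label = math.inf, None
        for j, (z, w) in enumerate(zip(embeddings, labels)):
            if i == j:
                continue  # never match a sample against itself
            d = math.dist(x, z)
            if d < best_dist:
                best_dist, best_label = d, w
        correct += best_label == y
    return correct / len(labels)

# Hypothetical embeddings of the same four unseen-class samples at two layers.
labels = ["a", "a", "b", "b"]
layer_x = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]  # classes well separated
layer_y = [(0.0, 0.0), (1.0, 0.0), (0.1, 0.0), (1.1, 0.0)]  # classes interleaved

scores = {
    "layer_x": loo_1nn_accuracy(layer_x, labels),
    "layer_y": loo_1nn_accuracy(layer_y, labels),
}
print(scores)
```

Comparing such a score across layers is one way to ask "which layer is best?" on data the network was not trained on. Any separability statistic could be swapped in; the toy numbers carry no empirical meaning.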

arXiv:2402.14102

Learning dynamic representations of the functional connectome in neurobiological networks

Authors: Luciano Dyballa, Samuel Lang, Alexandra Haslund-Gourley, Eviatar Yemini, Steven W. Zucker

Abstract: The static synaptic connectivity of neuronal circuits stands in direct contrast to the dynamics of their function. As in changing community interactions, different neurons can participate actively in various combinations to effect behaviors at different times. We introduce an unsupervised approach to learn the dynamic affinities between neurons in live, behaving animals, and to reveal which communities form among neurons at different times. The inference occurs in two major steps. First, pairwise non-linear affinities between neuronal traces from brain-wide calcium activity are organized by non-negative tensor factorization (NTF). Each factor specifies which groups of neurons are most likely interacting for an inferred interval in time, and for which animals. Finally, a generative model that allows for weighted community detection is applied to the functional motifs produced by NTF to reveal a dynamic functional connectome. Since time codes the different experimental variables (e.g., application of chemical stimuli), this provides an atlas of neural motifs active during separate stages of an experiment (e.g., stimulus application or spontaneous behaviors). Results from our analysis are experimentally validated, confirming that our method is able to robustly predict causal interactions between neurons to generate behavior. Code is available at https://github.com/dyballa/dynamic-connectomes.

Submitted 27 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

Comments: Accepted at ICLR 24
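
The first step named in the abstract above, non-negative tensor factorization, can be sketched generically. The following is a minimal illustration using standard multiplicative updates for a rank-R non-negative CP model; it is not the authors' pipeline. The tensor layout (neuron x neuron x time), the planted toy communities, and all function names are assumptions made for the example.

```python
import random

def rotate(X):
    """Cyclically permute tensor axes: Y[j][k][i] = X[i][j][k]."""
    I, J, K = len(X), len(X[0]), len(X[0][0])
    return [[[X[i][j][k] for i in range(I)] for k in range(K)] for j in range(J)]

def gram(M):
    """Return M^T M for a matrix stored as a list of rows."""
    R = len(M[0])
    return [[sum(row[p] * row[q] for row in M) for q in range(R)] for p in range(R)]

def update(X, A, B, C, eps=1e-12):
    """Multiplicative update of A in place, where X[i][j][k] ~ sum_r A[i][r]*B[j][r]*C[k][r]."""
    R = len(A[0])
    GB, GC = gram(B), gram(C)
    G = [[GB[p][q] * GC[p][q] for q in range(R)] for p in range(R)]
    for i, a in enumerate(A):
        num = [sum(X[i][j][k] * B[j][r] * C[k][r]
                   for j in range(len(B)) for k in range(len(C)))
               for r in range(R)]
        den = [sum(a[q] * G[q][r] for q in range(R)) for r in range(R)]
        A[i] = [a[r] * num[r] / (den[r] + eps) for r in range(R)]

def sq_error(X, A, B, C):
    """Squared Frobenius error between X and its rank-R reconstruction."""
    R = len(A[0])
    return sum((X[i][j][k] - sum(A[i][r] * B[j][r] * C[k][r] for r in range(R))) ** 2
               for i in range(len(A)) for j in range(len(B)) for k in range(len(C)))

def ntf(X, rank, iters=50, seed=0):
    """Factor a non-negative 3-way tensor; returns factors and the error history."""
    rng = random.Random(seed)
    def init(n):
        return [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    A, B, C = init(len(X)), init(len(X[0])), init(len(X[0][0]))
    X1 = rotate(X)   # indexed [j][k][i]
    X2 = rotate(X1)  # indexed [k][i][j]
    errors = [sq_error(X, A, B, C)]
    for _ in range(iters):
        update(X, A, B, C)
        update(X1, B, C, A)
        update(X2, C, A, B)
        errors.append(sq_error(X, A, B, C))
    return A, B, C, errors

# Toy affinity tensor (assumed layout): neurons 0-1 interact during times 0-2,
# neurons 2-3 interact during times 3-5.
u = [[1, 0], [1, 0], [0, 1], [0, 1]]
t = [[1, 0]] * 3 + [[0, 1]] * 3
X = [[[sum(u[i][r] * u[j][r] * t[k][r] for r in range(2)) for k in range(6)]
      for j in range(4)] for i in range(4)]

A, B, C, errors = ntf(X, rank=2)
print(errors[0], errors[-1])
```

In this layout each component pairs two neuron-loading vectors with one temporal profile, which is the sense in which a factor says which neurons interact and when. The community-detection step described in the abstract is not covered here.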

arXiv:2402.14095

Zero-shot generalization across architectures for visual classification

Authors: Evan Gerritz, Luciano Dyballa, Steven W. Zucker

Abstract: Generalization to unseen data is a key desideratum for deep networks, but its relation to classification accuracy is unclear. Using a minimalist vision dataset and a measure of generalizability, we show that popular networks, from deep convolutional networks (CNNs) to transformers, vary in their power to extrapolate to unseen classes both across layers and across architectures. Accuracy is not a good predictor of generalizability, and generalization varies non-monotonically with layer depth.

Submitted 3 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

Comments: Accepted as a Tiny Paper at ICLR 2024. Code available at https://github.com/dyballa/generalization/tree/ICLR2024TinyPaper

ACM Class: I.2.6; I.5.1; I.4.10

arXiv:2208.09123

doi 10.1162/neco_a_01566

IAN: Iterated Adaptive Neighborhoods for manifold learning and dimensionality estimation

Authors: Luciano Dyballa, Steven W. Zucker

Abstract: Invoking the manifold assumption in machine learning requires knowledge of the manifold's geometry and dimension, and theory dictates how many samples are required. However, in applications data are limited, sampling may not be uniform, and manifold properties are unknown and (possibly) non-pure; this implies that neighborhoods must adapt to the local structure. We introduce an algorithm for inferring adaptive neighborhoods for data given by a similarity kernel. Starting with a locally-conservative neighborhood (Gabriel) graph, we sparsify it iteratively according to a weighted counterpart. In each step, a linear program yields minimal neighborhoods globally and a volumetric statistic reveals neighbor outliers likely to violate manifold geometry. We apply our adaptive neighborhoods to non-linear dimensionality reduction, geodesic computation and dimension estimation. A comparison against standard algorithms using, e.g., k-nearest neighbors, demonstrates their usefulness. Code for our algorithm will be available at https://github.com/dyballa/IAN

Submitted 6 January, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

Comments: 69 pages, 35 figures

ACM Class: I.2.6; I.5.0

Journal ref: Neural Comput 2023, 35 (3), 453-524
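
The IAN abstract above names the Gabriel graph as its starting neighborhood graph. As a minimal sketch of that starting point only (the iterative sparsification, linear program, and volumetric statistic are not shown), a brute-force construction might look like the following. The function names and the sample points are hypothetical, and keeping edges when a third point lies exactly on the boundary is a convention chosen for this example.

```python
from itertools import combinations

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def gabriel_graph(points):
    """Return the set of Gabriel edges (i, j) with i < j, by brute force in O(n^3)."""
    n = len(points)
    edges = set()
    for i, j in combinations(range(n), 2):
        d_ij = sq_dist(points[i], points[j])
        # A third point k lies strictly inside the ball with diameter ij
        # exactly when d(i,k)^2 + d(j,k)^2 < d(i,j)^2; any such k removes the edge.
        if all(sq_dist(points[i], points[k]) + sq_dist(points[j], points[k]) >= d_ij
               for k in range(n) if k not in (i, j)):
            edges.add((i, j))
    return edges

# Hypothetical sample: point 2 sits between points 0 and 1, point 3 lies above.
points = [(0.0, 0.0), (2.0, 0.0), (1.0, 0.5), (1.0, 3.0)]
edges = gabriel_graph(points)
print(sorted(edges))
```

Unlike a k-nearest-neighbor graph, this construction has no neighborhood-size parameter: each point's degree follows from the local arrangement of the data.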

arXiv:1411.0431

doi 10.1017/nws.2015.19

Further insights into the interareal connectivity of a cortical network

Authors: Luciano Dyballa, Valmir C. Barbosa

Abstract: Over the past years, network science has proven invaluable as a means to better understand many of the processes taking place in the brain. Recently, interareal connectivity data of the macaque cortex was made available with great richness of detail. We explore new aspects of this dataset, such as a correlation between connection weights and cortical hierarchy. We also look at the link-community structure that emerges from the data to uncover the major communication pathways in the network, and moreover investigate its reciprocal connections, showing that they share similar properties.

Submitted 3 November, 2014; originally announced November 2014.

Journal ref: Network Science 3 (2015), 526-550

Showing 1–5 of 5 results for author: Dyballa, L