Search | arXiv e-print repository

Bayesian Mixed Multidimensional Scaling for Auditory Processing

Authors: Giovanni Rebaudo, Fernando Llanos, Bharath Chandrasekaran, Abhra Sarkar

Abstract: The human brain distinguishes speech sound categories by representing acoustic signals in a latent multidimensional auditory-perceptual space. This space can be statistically constructed using multidimensional scaling, a technique that can compute lower-dimensional latent features representing the speech signals in such a way that their pairwise distances in the latent space closely resemble the c… ▽ More The human brain distinguishes speech sound categories by representing acoustic signals in a latent multidimensional auditory-perceptual space. This space can be statistically constructed using multidimensional scaling, a technique that can compute lower-dimensional latent features representing the speech signals in such a way that their pairwise distances in the latent space closely resemble the corresponding distances in the observation space. The inter-individual and inter-population (e.g., native versus non-native listeners) heterogeneity in such representations is however not well understood. These questions have often been examined using joint analyses that ignore individual heterogeneity or using separate analyses that cannot characterize human similarities. Neither extreme, therefore, allows for principled comparisons between populations and individuals. The focus of the current literature has also often been on inference on latent distances between the categories and not on the latent features themselves, which are crucial for our applications, that make up these distances. Motivated by these problems, we develop a novel Bayesian mixed multidimensional scaling method, taking into account the heterogeneity across populations and subjects. We design a Markov chain Monte Carlo algorithm for posterior computation. We then recover the latent features using a post-processing scheme applied to the posterior samples. We evaluate the method's empirical performances through synthetic experiments. Applied to a motivating auditory neuroscience study, the method provides novel insights into how biologically interpretable lower-dimensional latent features reconstruct the observed distances between the stimuli and vary between individuals and their native language experiences. △ Less

Submitted 1 December, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

arXiv:2206.10757 [pdf, other]

Bayesian Tensor Factorized Vector Autoregressive Models for Inferring Granger Causality Patterns from High-Dimensional Multi-subject Panel Neuroimaging Data

Authors: Jingjing Fan, Kevin Sitek, Bharath Chandrasekaran, Abhra Sarkar

Abstract: Understanding the dynamics of functional brain connectivity patterns using noninvasive neuroimaging techniques is an important focus in human neuroscience. Vector autoregressive (VAR) processes and Granger causality analysis (GCA) have been extensively used for this purpose. While high-resolution multi-subject neuroimaging data are routinely collected now-a-days, the statistics literature on VAR m… ▽ More Understanding the dynamics of functional brain connectivity patterns using noninvasive neuroimaging techniques is an important focus in human neuroscience. Vector autoregressive (VAR) processes and Granger causality analysis (GCA) have been extensively used for this purpose. While high-resolution multi-subject neuroimaging data are routinely collected now-a-days, the statistics literature on VAR models has remained heavily focused on small-to-moderate dimensional problems and single-subject data. Motivated by these issues, we develop a novel Bayesian random effects panel VAR model for multi-subject high-dimensional neuroimaging data. We begin with a single-subject model that structures the VAR coefficients as a three-way tensor, then reduces the dimensions by applying a Tucker tensor decomposition. A novel sparsity-inducing shrinkage prior allows data-adaptive rank and lag selection. We then extend the approach to a novel random effects model for multi-subject data that carefully avoids the dimensions getting exploded with the number of subjects but also flexibly accommodates subject-specific heterogeneity. We design a Markov chain Monte Carlo algorithm for posterior computation. Finally, GCA with posterior false discovery control is performed on the posterior samples. The method shows excellent empirical performance in simulation experiments. Applied to our motivating functional magnetic resonance imaging study, the approach allows the directional connectivity of human brain networks to be studied in fine detail, revealing meaningful but previously unsubstantiated cortical connectivity patterns. △ Less

Submitted 14 September, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

arXiv:2112.04626 [pdf, other]

doi 10.1007/s11336-024-09947-8

Bayesian Semiparametric Longitudinal Inverse-Probit Mixed Models for Category Learning

Authors: Minerva Mukhopadhyay, Jacie R. McHaney, Bharath Chandrasekaran, Abhra Sarkar

Abstract: Understanding how the adult human brain learns novel categories is an important problem in neuroscience. Drift-diffusion models are popular in such contexts for their ability to mimic the underlying neural mechanisms. One such model for gradual longitudinal learning was recently developed by Paulon et al. (2021). Fitting conventional drift-diffusion models, however, requires data on both category… ▽ More Understanding how the adult human brain learns novel categories is an important problem in neuroscience. Drift-diffusion models are popular in such contexts for their ability to mimic the underlying neural mechanisms. One such model for gradual longitudinal learning was recently developed by Paulon et al. (2021). Fitting conventional drift-diffusion models, however, requires data on both category responses and associated response times. In practice, category response accuracies are often the only reliable measure recorded by behavioral scientists to describe human learning. However, To our knowledge, drift-diffusion models for such scenarios have never been considered in the literature. To address this gap, in this article, we build carefully on Paulon et al. (2021), but now with latent response times integrated out, to derive a novel biologically interpretable class of `inverse-probit' categorical probability models for observed categories alone. However, this new marginal model presents significant identifiability and inferential challenges not encountered originally for the joint model by Paulon et al. (2021). We address these new challenges using a novel projection-based approach with a symmetry-preserving identifiability constraint that allows us to work with conjugate priors in an unconstrained space. We adapt the model for group and individual-level inference in longitudinal settings. Building again on the model's latent variable representation, we design an efficient Markov chain Monte Carlo algorithm for posterior computation. We evaluate the empirical performance of the method through simulation experiments. The practical efficacy of the method is illustrated in applications to longitudinal tone learning studies. △ Less

Submitted 21 July, 2024; v1 submitted 8 December, 2021; originally announced December 2021.

Comments: arXiv admin note: text overlap with arXiv:1912.02774

Journal ref: Psychometrika 89 (2024) 461-485

arXiv:1912.02774 [pdf, other]

Bayesian Semiparametric Longitudinal Drift-Diffusion Mixed Models for Tone Learning in Adults

Authors: Giorgio Paulon, Fernando Llanos, Bharath Chandrasekaran, Abhra Sarkar

Abstract: Understanding how adult humans learn non-native speech categories such as tone information has shed novel insights into the mechanisms underlying experience-dependent brain plasticity. Scientists have traditionally examined these questions using longitudinal learning experiments under a multi-category decision making paradigm. Drift-diffusion processes are popular in such contexts for their abilit… ▽ More Understanding how adult humans learn non-native speech categories such as tone information has shed novel insights into the mechanisms underlying experience-dependent brain plasticity. Scientists have traditionally examined these questions using longitudinal learning experiments under a multi-category decision making paradigm. Drift-diffusion processes are popular in such contexts for their ability to mimic underlying neural mechanisms. Motivated by these problems, we develop a novel Bayesian semiparametric inverse Gaussian drift-diffusion mixed model for multi-alternative decision making in longitudinal settings. We design a Markov chain Monte Carlo algorithm for posterior computation. We evaluate the method's empirical performances through synthetic experiments. Applied to our motivating longitudinal tone learning study, the method provides novel insights into how the biologically interpretable model parameters evolve with learning, differ between input-response tone combinations, and differ between well and poorly performing adults. △ Less

Submitted 15 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

arXiv:1506.02344 [pdf, other]

Stay on path: PCA along graph paths

Authors: Megasthenis Asteris, Anastasios Kyrillidis, Alexandros G. Dimakis, Han-Gyol Yi and, Bharath Chandrasekaran

Abstract: We introduce a variant of (sparse) PCA in which the set of feasible support sets is determined by a graph. In particular, we consider the following setting: given a directed acyclic graph $G$ on $p$ vertices corresponding to variables, the non-zero entries of the extracted principal component must coincide with vertices lying along a path in $G$. From a statistical perspective, information on th… ▽ More We introduce a variant of (sparse) PCA in which the set of feasible support sets is determined by a graph. In particular, we consider the following setting: given a directed acyclic graph $G$ on $p$ vertices corresponding to variables, the non-zero entries of the extracted principal component must coincide with vertices lying along a path in $G$. From a statistical perspective, information on the underlying network may potentially reduce the number of observations required to recover the population principal component. We consider the canonical estimator which optimally exploits the prior knowledge by solving a non-convex quadratic maximization on the empirical covariance. We introduce a simple network and analyze the estimator under the spiked covariance model. We show that side information potentially improves the statistical complexity. We propose two algorithms to approximate the solution of the constrained quadratic maximization, and recover a component with the desired properties. We empirically evaluate our schemes on synthetic and real datasets. △ Less

Submitted 18 June, 2015; v1 submitted 7 June, 2015; originally announced June 2015.

Comments: 12 pages, 5 figures, In Proceedings of International Conference on Machine Learning (ICML) 2015

Showing 1–5 of 5 results for author: Chandrasekaran, B