Showing 1–22 of 22 results for author: Tolooshams, B

  1. arXiv:2506.05239  [pdf, other]

    cs.LG

    Evaluating Sparse Autoencoders: From Shallow Design to Matching Pursuit

    Authors: Valérie Costa, Thomas Fel, Ekdeep Singh Lubana, Bahareh Tolooshams, Demba Ba

    Abstract: Sparse autoencoders (SAEs) have recently become central tools for interpretability, leveraging dictionary learning principles to extract sparse, interpretable features from neural representations whose underlying structure is typically unknown. This paper evaluates SAEs in a controlled setting using MNIST, which reveals that current shallow architectures implicitly rely on a quasi-orthogonality as…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Complementary work to arXiv:2506.03093

  2. arXiv:2506.04536  [pdf, ps, other]

    cs.LG cs.AI q-bio.NC

    NOBLE -- Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models

    Authors: Luca Ghafourpour, Valentin Duruisseaux, Bahareh Tolooshams, Philip H. Wong, Costas A. Anastassiou, Anima Anandkumar

    Abstract: Characterizing the diverse computational properties of human neurons via multimodal electrophysiological, transcriptomic, and morphological data provides the foundation for constructing and validating bio-realistic neuron models that can advance our understanding of fundamental mechanisms underlying brain function. However, current modeling approaches remain constrained by the limited availability…

    Submitted 12 June, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

  3. arXiv:2506.03093  [pdf, ps, other]

    cs.LG

    From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit

    Authors: Valérie Costa, Thomas Fel, Ekdeep Singh Lubana, Bahareh Tolooshams, Demba Ba

    Abstract: Motivated by the hypothesis that neural network representations encode abstract, interpretable features as linearly accessible, approximately orthogonal directions, sparse autoencoders (SAEs) have become a popular tool in interpretability. However, recent work has demonstrated phenomenology of model representations that lies outside the scope of this hypothesis, showing signatures of hierarchical,…

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: Preprint

  4. arXiv:2505.22973  [pdf, ps, other]

    cs.LG cs.AI

    EquiReg: Equivariance Regularized Diffusion for Inverse Problems

    Authors: Bahareh Tolooshams, Aditi Chandrashekar, Rayhan Zirvi, Abbas Mammadov, Jiachen Yao, Chuwei Wang, Anima Anandkumar

    Abstract: Diffusion models represent the state-of-the-art for solving inverse problems such as image restoration tasks. In the Bayesian framework, diffusion-based inverse solvers incorporate a likelihood term to guide the prior sampling process, generating data consistent with the posterior distribution. However, due to the intractability of the likelihood term, many current methods rely on isotropic Gaussi…

    Submitted 28 May, 2025; originally announced May 2025.

  5. arXiv:2501.01157  [pdf, other]

    eess.IV cs.LG physics.med-ph

    Ultrasound Lung Aeration Map via Physics-Aware Neural Operators

    Authors: Jiayun Wang, Oleksii Ostras, Masashi Sode, Bahareh Tolooshams, Zongyi Li, Kamyar Azizzadenesheli, Gianmarco Pinton, Anima Anandkumar

    Abstract: Lung ultrasound is a growing modality in clinics for diagnosing and monitoring acute and chronic lung diseases due to its low cost and accessibility. Lung ultrasound works by emitting diagnostic pulses, receiving pressure waves and converting them into radio frequency (RF) data, which are then processed into B-mode images with beamformers for radiologists to interpret. However, unlike conventional…

    Submitted 2 January, 2025; originally announced January 2025.

  6. arXiv:2410.16290  [pdf, other]

    eess.IV cs.CV

    A Unified Model for Compressed Sensing MRI Across Undersampling Patterns

    Authors: Armeet Singh Jatyani, Jiayun Wang, Aditi Chandrashekar, Zihui Wu, Miguel Liu-Schiaffini, Bahareh Tolooshams, Anima Anandkumar

    Abstract: Compressed Sensing MRI reconstructs images of the body's internal anatomy from undersampled measurements, thereby reducing scan time. Recently, deep learning has shown great potential for reconstructing high-fidelity images from highly undersampled measurements. However, one needs to train multiple models for different undersampling patterns and desired output image resolutions, since most network…

    Submitted 3 April, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted at 2025 Conference on Computer Vision and Pattern Recognition

  7. arXiv:2410.03463  [pdf, other]

    cs.LG cs.AI cs.CV

    Diffusion State-Guided Projected Gradient for Inverse Problems

    Authors: Rayhan Zirvi, Bahareh Tolooshams, Anima Anandkumar

    Abstract: Recent advancements in diffusion models have been effective in learning data priors for solving inverse problems. They leverage diffusion sampling steps for inducing a data prior while using a measurement guidance gradient at each step to impose data consistency. For general inverse problems, approximations are needed when an unconditionally trained diffusion model is used since the measurement li…

    Submitted 1 April, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at ICLR 2025. RZ and BT have equal contributions

  8. arXiv:2409.03302  [pdf, other]

    quant-ph cs.LG

    Fourier Neural Operators for Learning Dynamics in Quantum Spin Systems

    Authors: Freya Shah, Taylor L. Patti, Julius Berner, Bahareh Tolooshams, Jean Kossaifi, Anima Anandkumar

    Abstract: Fourier Neural Operators (FNOs) excel on tasks using functional data, such as those originating from partial differential equations. Such characteristics render them an effective approach for simulating the time evolution of quantum wavefunctions, which is a computationally challenging, yet coveted task for understanding quantum systems. In this manuscript, we use FNOs to model the evolution of ra…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 9 pages, 4 figures

  9. arXiv:2306.03249  [pdf, other]

    cs.LG eess.SP stat.CO

    Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models

    Authors: Alexander Lin, Bahareh Tolooshams, Yves Atchadé, Demba Ba

    Abstract: Latent Gaussian models have a rich history in statistics and machine learning, with applications ranging from factor analysis to compressed sensing to time series analysis. The classical method for maximizing the likelihood of these models is the expectation-maximization (EM) algorithm. For problems with high-dimensional latent variables and large datasets, EM scales poorly because it needs to inv…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 29 pages, 4 figures

    Journal ref: International Conference on Machine Learning, 2023

  10. Unrolled Compressed Blind-Deconvolution

    Authors: Bahareh Tolooshams, Satish Mulleti, Demba Ba, Yonina C. Eldar

    Abstract: The problem of sparse multichannel blind deconvolution (S-MBD) arises frequently in many engineering applications such as radar/sonar/ultrasound imaging. To reduce its computational and implementation cost, we propose a compression method that enables blind recovery from much fewer measurements with respect to the full received signal in time. The proposed compression measures the signal through a…

    Submitted 18 May, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: Accepted to IEEE TSP

  11. arXiv:2112.04939  [pdf, other]

    eess.AS cs.LG cs.SD

    A Training Framework for Stereo-Aware Speech Enhancement using Deep Neural Networks

    Authors: Bahareh Tolooshams, Kazuhito Koishida

    Abstract: Deep learning-based speech enhancement has shown unprecedented performance in recent years. The most popular mono speech enhancement frameworks are end-to-end networks mapping the noisy mixture into an estimate of the clean speech. With growing computational power and availability of multichannel microphone recordings, prior works have aimed to incorporate spatial statistics along with spectral in…

    Submitted 31 January, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted to the IEEE 47th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

  12. arXiv:2106.00058  [pdf, other]

    cs.LG eess.SP stat.ML

    Stable and Interpretable Unrolled Dictionary Learning

    Authors: Bahareh Tolooshams, Demba Ba

    Abstract: The dictionary learning problem, representing data as a combination of a few atoms, has long stood as a popular method for learning representations in statistics and signal processing. The most popular dictionary learning algorithm alternates between sparse coding and dictionary update steps, and a rich literature has studied its theoretical convergence. The success of dictionary learning relies o…

    Submitted 2 August, 2022; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: Published in Transactions on Machine Learning Research (TMLR) (08/2022)

  13. arXiv:2104.00530  [pdf, other]

    cs.LG stat.AP stat.ML

    Gaussian Process Convolutional Dictionary Learning

    Authors: Andrew H. Song, Bahareh Tolooshams, Demba Ba

    Abstract: Convolutional dictionary learning (CDL), the problem of estimating shift-invariant templates from data, is typically conducted in the absence of a prior/structure on the templates. In data-scarce or low signal-to-noise ratio (SNR) regimes, learned templates overfit the data and lack smoothness, which can affect the predictive performance of downstream tasks. To address this limitation, we propose…

    Submitted 24 November, 2021; v1 submitted 28 March, 2021; originally announced April 2021.

    Comments: IEEE Signal Processing Letters (2021)

  14. arXiv:2102.07003  [pdf, other]

    cs.LG

    On the convergence of group-sparse autoencoders

    Authors: Emmanouil Theodosis, Bahareh Tolooshams, Pranay Tankala, Abiy Tasissa, Demba Ba

    Abstract: Recent approaches in the theoretical analysis of model-based deep learning architectures have studied the convergence of gradient descent in shallow ReLU networks that arise from generative models whose hidden layers are sparse. Motivated by the success of architectures that impose structured forms of sparsity, we introduce and study a group-sparse autoencoder that accounts for a variety of genera…

    Submitted 21 January, 2022; v1 submitted 13 February, 2021; originally announced February 2021.

  15. arXiv:2010.11391  [pdf, ps, other]

    eess.SP cs.LG

    Unfolding Neural Networks for Compressive Multichannel Blind Deconvolution

    Authors: Bahareh Tolooshams, Satish Mulleti, Demba Ba, Yonina C. Eldar

    Abstract: We propose a learned-structured unfolding neural network for the problem of compressive sparse multichannel blind-deconvolution. In this problem, each channel's measurements are given as convolution of a common source signal and sparse filter. Unlike prior works where the compression is achieved either through random projections or by applying a fixed structured compression matrix, this paper prop…

    Submitted 11 February, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted to 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)

  16. arXiv:2006.09534  [pdf, other]

    cs.IT cs.LG eess.SP

    Towards improving discriminative reconstruction via simultaneous dense and sparse coding

    Authors: Abiy Tasissa, Emmanouil Theodosis, Bahareh Tolooshams, Demba Ba

    Abstract: Discriminative features extracted from the sparse coding model have been shown to perform well for classification. Recent deep learning architectures have further improved reconstruction in inverse problems by considering new dense priors learned from data. We propose a novel dense and sparse coding model that integrates both representation capability and discriminative features. The model studies…

    Submitted 13 December, 2022; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 24 pages

  17. arXiv:2001.11542  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Channel-Attention Dense U-Net for Multichannel Speech Enhancement

    Authors: Bahareh Tolooshams, Ritwik Giri, Andrew H. Song, Umut Isik, Arvindh Krishnaswamy

    Abstract: Supervised deep learning has gained significant attention for speech enhancement recently. The state-of-the-art deep learning methods perform the task by learning a ratio/binary mask that is applied to the mixture in the time-frequency domain to produce the clean speech. Despite the great performance in the single-channel setting, these frameworks lag in performance in the multichannel setting as…

    Submitted 30 January, 2020; originally announced January 2020.

  18. arXiv:1908.09258  [pdf, other]

    cs.LG stat.ML

    RandNet: deep learning with compressed measurements of images

    Authors: Thomas Chang, Bahareh Tolooshams, Demba Ba

    Abstract: Principal component analysis, dictionary learning, and auto-encoders are all unsupervised methods for learning representations from a large amount of training data. In all these methods, the higher the dimensions of the input data, the longer it takes to learn. We introduce a class of neural networks, termed RandNet, for learning representations using compressed random measurements of data of inte…

    Submitted 25 August, 2019; originally announced August 2019.

    Comments: The first two authors contributed equally to this work

  19. arXiv:1907.09881  [pdf, other]

    cs.LG stat.ML

    Convolutional Dictionary Learning in Hierarchical Networks

    Authors: Javier Zazo, Bahareh Tolooshams, Demba Ba

    Abstract: Filter banks are a popular tool for the analysis of piecewise smooth signals such as natural images. Motivated by the empirically observed properties of scale and detail coefficients of images in the wavelet domain, we propose a hierarchical deep generative model of piecewise smooth signals that is a recursion across scales: the low pass scale coefficients at one layer are obtained by filtering th…

    Submitted 23 July, 2019; originally announced July 2019.

  20. arXiv:1907.03211  [pdf, other]

    cs.LG stat.AP stat.ML

    Convolutional dictionary learning based auto-encoders for natural exponential-family distributions

    Authors: Bahareh Tolooshams, Andrew H. Song, Simona Temereanca, Demba Ba

    Abstract: We introduce a class of auto-encoder neural networks tailored to data from the natural exponential family (e.g., count data). The architectures are inspired by the problem of learning the filters in a convolutional generative model with sparsity constraints, often referred to as convolutional dictionary learning (CDL). Our work is the first to combine ideas from convolutional generative models and…

    Submitted 28 June, 2020; v1 submitted 6 July, 2019; originally announced July 2019.

    Journal ref: International Conference on Machine Learning (ICML) 2020

  21. Deep Residual Autoencoders for Expectation Maximization-inspired Dictionary Learning

    Authors: Bahareh Tolooshams, Sourav Dey, Demba Ba

    Abstract: We introduce a neural-network architecture, termed the constrained recurrent sparse autoencoder (CRsAE), that solves convolutional dictionary learning problems, thus establishing a link between dictionary learning and neural networks. Specifically, we leverage the interpretation of the alternating-minimization algorithm for dictionary learning as an approximate Expectation-Maximization algorithm t…

    Submitted 18 October, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

    Journal ref: in IEEE Transactions on Neural Networks and Learning Systems, pp. 1-15, 2020

  22. arXiv:1807.04734  [pdf, other]

    cs.LG stat.ML

    Scalable Convolutional Dictionary Learning with Constrained Recurrent Sparse Auto-encoders

    Authors: Bahareh Tolooshams, Sourav Dey, Demba Ba

    Abstract: Given a convolutional dictionary underlying a set of observed signals, can a carefully designed auto-encoder recover the dictionary in the presence of noise? We introduce an auto-encoder architecture, termed constrained recurrent sparse auto-encoder (CRsAE), that answers this question in the affirmative. Given an input signal and an approximate dictionary, the encoder finds a sparse approximation…

    Submitted 12 July, 2018; originally announced July 2018.