Showing 1–50 of 115 results for author: Tarokh, V

  1. arXiv:2503.02138  [pdf, other]

    cs.LG cs.AI stat.ML

    Elliptic Loss Regularization

    Authors: Ali Hasan, Haoming Yang, Yuting Ng, Vahid Tarokh

    Abstract: Regularizing neural networks is important for anticipating model behavior in regions of the data space that are not well represented. In this work, we propose a regularization technique for enforcing a level of smoothness in the mapping between the data input space and the loss value. We specify the level of regularity by requiring that the loss of the network satisfies an elliptic operator over t…

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: ICLR 2025

  2. arXiv:2503.02117  [pdf, ps, other]

    cs.LG cs.AI cs.CV

    Parabolic Continual Learning

    Authors: Haoming Yang, Ali Hasan, Vahid Tarokh

    Abstract: Regularizing continual learning techniques is important for anticipating algorithmic behavior under new realizations of data. We introduce a new approach to continual learning by imposing the properties of a parabolic partial differential equation (PDE) to regularize the expected behavior of the loss over time. This class of parabolic PDEs has a number of favorable properties that allow us to anal…

    Submitted 3 March, 2025; originally announced March 2025.

  3. arXiv:2502.11362  [pdf, other]

    cs.LG

    Teleportation With Null Space Gradient Projection for Optimization Acceleration

    Authors: Zihao Wu, Juncheng Dong, Ahmed Aloui, Vahid Tarokh

    Abstract: Optimization techniques have become increasingly critical due to the ever-growing model complexity and data scale. In particular, teleportation has emerged as a promising approach, which accelerates convergence of gradient descent-based methods by navigating within the loss invariant level set to identify parameters with advantageous geometric properties. Existing teleportation algorithms have pri…

    Submitted 16 February, 2025; originally announced February 2025.

  4. arXiv:2502.11340  [pdf, other]

    cs.LG

    S2TX: Cross-Attention Multi-Scale State-Space Transformer for Time Series Forecasting

    Authors: Zihao Wu, Juncheng Dong, Haoming Yang, Vahid Tarokh

    Abstract: Time series forecasting has recently achieved significant progress with multi-scale models to address the heterogeneity between long and short range patterns. Despite their state-of-the-art performance, we identify two potential areas for improvement. First, the variates of the multivariate time series are processed independently. Moreover, the multi-scale (long and short range) representations ar…

    Submitted 16 February, 2025; originally announced February 2025.

  5. arXiv:2501.00467  [pdf, other]

    cs.LG stat.CO

    Score-Based Metropolis-Hastings Algorithms

    Authors: Ahmed Aloui, Ali Hasan, Juncheng Dong, Zihao Wu, Vahid Tarokh

    Abstract: In this paper, we introduce a new approach for integrating score-based models with the Metropolis-Hastings algorithm. While traditional score-based diffusion models excel in accurately learning the score function from data points, they lack an energy function, making the Metropolis-Hastings adjustment step inaccessible. Consequently, the unadjusted Langevin algorithm is often used for sampling usi…

    Submitted 31 March, 2025; v1 submitted 31 December, 2024; originally announced January 2025.

  6. arXiv:2412.16482  [pdf, other]

    cs.LG stat.ML

    Learn2Mix: Training Neural Networks Using Adaptive Data Integration

    Authors: Shyam Venkatasubramanian, Vahid Tarokh

    Abstract: Accelerating model convergence within resource-constrained environments is critical to ensure fast and efficient neural network training. This work presents learn2mix, a novel training strategy that adaptively adjusts class proportions within batches, focusing on classes with higher error rates. Unlike classical training methods that use static class proportions, learn2mix continually adapts class…

    Submitted 13 February, 2025; v1 submitted 20 December, 2024; originally announced December 2024.

  7. arXiv:2412.02089  [pdf, other]

    cs.LG

    Offline Stochastic Optimization of Black-Box Objective Functions

    Authors: Juncheng Dong, Zihao Wu, Hamid Jafarkhani, Ali Pezeshki, Vahid Tarokh

    Abstract: Many challenges in science and engineering, such as drug discovery and communication network design, involve optimizing complex and expensive black-box functions across vast search spaces. Thus, it is essential to leverage existing data to avoid costly active queries of these black-box functions. To this end, while Offline Black-Box Optimization (BBO) is effective for deterministic problems, it ma…

    Submitted 2 December, 2024; originally announced December 2024.

  8. arXiv:2411.14351  [pdf, other]

    stat.ML cs.CR cs.LG stat.AP

    Indiscriminate Disruption of Conditional Inference on Multivariate Gaussians

    Authors: William N. Caballero, Matthew LaRosa, Alexander Fisher, Vahid Tarokh

    Abstract: The multivariate Gaussian distribution underpins myriad operations-research, decision-analytic, and machine-learning models (e.g., Bayesian optimization, Gaussian influence diagrams, and variational autoencoders). However, despite recent advances in adversarial machine learning (AML), inference for Gaussian models in the presence of an adversary is notably understudied. Therefore, we consider a se…

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 30 pages, 6 figures; 4 tables

  9. arXiv:2410.14615  [pdf, other]

    stat.ML cs.AI cs.IT cs.LG eess.SP

    Asymptotically Optimal Change Detection for Unnormalized Pre- and Post-Change Distributions

    Authors: Arman Adibi, Sanjeev Kulkarni, H. Vincent Poor, Taposh Banerjee, Vahid Tarokh

    Abstract: This paper addresses the problem of detecting changes when only unnormalized pre- and post-change distributions are accessible. This situation happens in many scenarios in physics such as in ferromagnetism, crystallography, magneto-hydrodynamics, and thermodynamics, where the energy models are difficult to normalize. Our approach is based on the estimation of the Cumulative Sum (CUSUM) statistic…

    Submitted 11 February, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

  10. arXiv:2409.10075  [pdf, other]

    cs.LG

    Steinmetz Neural Networks for Complex-Valued Data

    Authors: Shyam Venkatasubramanian, Ali Pezeshki, Vahid Tarokh

    Abstract: We introduce a new approach to processing complex-valued data using DNNs consisting of parallel real-valued subnetworks with coupled outputs. Our proposed class of architectures, referred to as Steinmetz Neural Networks, incorporates multi-view learning to construct more interpretable representations in the latent space. Moreover, we present the Analytic Neural Network, which incorporates a consis…

    Submitted 13 February, 2025; v1 submitted 16 September, 2024; originally announced September 2024.

  11. arXiv:2409.04986  [pdf, other]

    cs.LG

    DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

    Authors: Qi Le, Enmao Diao, Xinran Wang, Vahid Tarokh, Jie Ding, Ali Anwar

    Abstract: Federated Learning (FL) is a collaborative machine learning framework that allows multiple users to train models utilizing their local data in a distributed manner. However, considerable statistical heterogeneity in local data across devices often leads to suboptimal model performance compared with independently and identically distributed (IID) data scenarios. In this paper, we introduce DynamicF…

    Submitted 8 September, 2024; originally announced September 2024.

  12. arXiv:2408.00131  [pdf, other]

    stat.ML cs.AI cs.LG q-fin.RM

    Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

    Authors: Patrick Kuiper, Ali Hasan, Wenhao Yang, Yuting Ng, Hoda Bidkhori, Jose Blanchet, Vahid Tarokh

    Abstract: The goal of this paper is to develop distributionally robust optimization (DRO) estimators, specifically for multidimensional Extreme Value Theory (EVT) statistics. EVT supports using semi-parametric models called max-stable distributions built from spatial Poisson point processes. While powerful, these models are only asymptotically valid for large samples. However, since extreme data is by defin…

    Submitted 31 July, 2024; originally announced August 2024.

  13. arXiv:2407.17654  [pdf, other]

    cs.LG stat.ML

    Generative Learning for Simulation of Vehicle Faults

    Authors: Patrick Kuiper, Sirui Lin, Jose Blanchet, Vahid Tarokh

    Abstract: We develop a novel generative model to simulate vehicle health and forecast faults, conditioned on practical operational considerations. The model, trained on data from the US Army's Predictive Logistics program, aims to support predictive maintenance. It forecasts faults far enough in advance to execute a maintenance intervention before a breakdown occurs. The model incorporates real-world factor…

    Submitted 30 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

  14. arXiv:2407.12234  [pdf, other]

    cs.LG cs.CE math.OC stat.ML

    Base Models for Parabolic Partial Differential Equations

    Authors: Xingzi Xu, Ali Hasan, Jie Ding, Vahid Tarokh

    Abstract: Parabolic partial differential equations (PDEs) appear in many disciplines to model the evolution of various mathematical objects, such as probability flows, value functions in control theory, and derivative prices in finance. It is often necessary to compute the solutions or a function of the solutions to a parametric PDE in multiple scenarios corresponding to different parameters of this PDE. Th…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Appears in UAI 2024

  15. arXiv:2406.09638  [pdf, other]

    cs.LG eess.SP

    RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

    Authors: Shyam Venkatasubramanian, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: We present a large-scale dataset for radar adaptive signal processing (RASP) applications to support the development of data-driven models within the adaptive radar community. The dataset, RASPNet, exceeds 16 TB in size and comprises 100 realistic scenarios compiled over a variety of topographies and land types from across the contiguous United States. For each scenario, RASPNet consists of 10,000…

    Submitted 14 February, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

  16. arXiv:2404.13844  [pdf, other]

    cs.LG cs.AI

    ColA: Collaborative Adaptation with Gradient Learning

    Authors: Enmao Diao, Qi Le, Suya Wu, Xinran Wang, Ali Anwar, Jie Ding, Vahid Tarokh

    Abstract: A primary function of back-propagation is to compute both the gradient of hidden representations and parameters for optimization with gradient descent. Training large models requires high computational costs due to their vast parameter sizes. While Parameter-Efficient Fine-Tuning (PEFT) methods aim to train smaller auxiliary models to save computational space, they still present computational over…

    Submitted 21 April, 2024; originally announced April 2024.

  17. arXiv:2404.09402  [pdf, other]

    cs.LG cs.AI stat.ML

    Neural McKean-Vlasov Processes: Distributional Dependence in Diffusion Processes

    Authors: Haoming Yang, Ali Hasan, Yuting Ng, Vahid Tarokh

    Abstract: McKean-Vlasov stochastic differential equations (MV-SDEs) provide a mathematical description of the behavior of an infinite number of interacting particles by imposing a dependence on the particle density. As such, we study the influence of explicitly including distributional information in the parameterization of the SDE. We propose a series of semi-parametric methods for representing MV-SDEs, an…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Appears in AISTATS 2024

  18. arXiv:2311.12356  [pdf, other]

    cs.LG

    Random Linear Projections Loss for Hyperplane-Based Optimization in Neural Networks

    Authors: Shyam Venkatasubramanian, Ahmed Aloui, Vahid Tarokh

    Abstract: Advancing loss function design is pivotal for optimizing neural network training and performance. This work introduces Random Linear Projections (RLP) loss, a novel approach that enhances training efficiency by leveraging geometric relationships within the data. Distinct from traditional loss functions that target minimizing pointwise errors, RLP loss operates by minimizing the distance between se…

    Submitted 30 May, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  19. arXiv:2311.03630  [pdf, other]

    cs.LG stat.ME stat.ML

    Counterfactual Data Augmentation with Contrastive Learning

    Authors: Ahmed Aloui, Juncheng Dong, Cat P. Le, Vahid Tarokh

    Abstract: Statistical disparity between distinct treatment groups is one of the most significant challenges for estimating Conditional Average Treatment Effects (CATE). To address this, we introduce a model-agnostic data augmentation method that imputes the counterfactual outcomes for a selected subset of individuals. Specifically, we utilize contrastive learning to learn a representation space and a simila…

    Submitted 6 November, 2023; originally announced November 2023.

  20. arXiv:2310.07123  [pdf, other]

    cs.LG cs.AI

    Off-Policy Evaluation for Human Feedback

    Authors: Qitong Gao, Ge Gao, Juncheng Dong, Vahid Tarokh, Min Chi, Miroslav Pajic

    Abstract: Off-policy evaluation (OPE) is important for closing the gap between offline training and evaluation of reinforcement learning (RL), by estimating performance and/or rank of target (evaluation) policies using offline trajectories only. It can improve the safety and efficiency of data collection and policy testing procedures in situations where online deployments are expensive, such as healthcare.…

    Submitted 14 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  21. arXiv:2310.01720  [pdf, other]

    cs.LG cs.AI

    Perceiver-based CDF Modeling for Time Series Forecasting

    Authors: Cat P. Le, Chris Cannella, Ali Hasan, Yuting Ng, Vahid Tarokh

    Abstract: Transformers have demonstrated remarkable efficacy in forecasting time series data. However, their extensive dependence on self-attention mechanisms demands significant computational resources, thereby limiting their practical applicability across diverse tasks, especially in multimodal problems. In this work, we propose a new architecture, called perceiver-CDF, for modeling cumulative distributio…

    Submitted 24 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted in Winter Simulation Conference 2024

  22. arXiv:2306.11697  [pdf, other]

    stat.ME cs.LG stat.ML

    Treatment Effects in Extreme Regimes

    Authors: Ahmed Aloui, Ali Hasan, Yuting Ng, Miroslav Pajic, Vahid Tarokh

    Abstract: Understanding treatment effects in extreme regimes is important for characterizing risks associated with different interventions. This is hindered by the unavailability of counterfactual outcomes and the rarity and difficulty of collecting extreme data in practice. To address this issue, we propose a new framework based on extreme value theory for estimating treatment effects in extreme regimes. W…

    Submitted 22 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  23. arXiv:2306.07918  [pdf, other]

    cs.LG stat.ML

    Causal Mediation Analysis with Multi-dimensional and Indirectly Observed Mediators

    Authors: Ziyang Jiang, Yiling Liu, Michael H. Klein, Ahmed Aloui, Yiman Ren, Keyu Li, Vahid Tarokh, David Carlson

    Abstract: Causal mediation analysis (CMA) is a powerful method to dissect the total effect of a treatment into direct and mediated effects within the potential outcome framework. This is important in many scientific applications to identify the underlying mechanisms of a treatment effect. However, in many scientific applications the mediator is unobserved, but there may exist related measurements. For examp…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 16 pages, 4 figures, 5 tables

  24. arXiv:2306.07408  [pdf, other]

    cs.LG cs.AI cs.RO

    Robust Reinforcement Learning through Efficient Adversarial Herding

    Authors: Juncheng Dong, Hao-Lun Hsu, Qitong Gao, Vahid Tarokh, Miroslav Pajic

    Abstract: Although reinforcement learning (RL) is considered the gold standard for policy design, it may not always provide a robust solution in various scenarios. This can result in severe performance degradation when the environment is exposed to potential disturbances. Adversarial training using a two-player max-min game has been proven effective in enhancing the robustness of RL agents. In this work, we…

    Submitted 12 June, 2023; originally announced June 2023.

  25. arXiv:2306.02925  [pdf, other]

    cs.CE physics.comp-ph

    Deep Generalized Green's Functions

    Authors: Rixi Peng, Juncheng Dong, Jordan Malof, Willie J. Padilla, Vahid Tarokh

    Abstract: In this study, we address the challenge of obtaining a Green's function operator for linear partial differential equations (PDEs). The Green's function is well-sought after due to its ability to directly map inputs to solutions, bypassing the need for common numerical methods such as finite difference and finite elements methods. However, obtaining an explicit form of the Green's function kernel f…

    Submitted 5 June, 2023; originally announced June 2023.

  26. arXiv:2305.11400  [pdf, other]

    cs.LG stat.ML

    Mode-Aware Continual Learning for Conditional Generative Adversarial Networks

    Authors: Cat P. Le, Juncheng Dong, Ahmed Aloui, Vahid Tarokh

    Abstract: The main challenge in continual learning for generative models is to effectively learn new target modes with limited samples while preserving previously learned ones. To this end, we introduce a new continual learning approach for conditional generative adversarial networks by leveraging a mode-affinity score specifically designed for generative modeling. First, the generator produces samples of e…

    Submitted 23 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  27. arXiv:2305.00003  [pdf, other]

    cs.CE cond-mat.mtrl-sci cs.LG

    Neural Network Accelerated Process Design of Polycrystalline Microstructures

    Authors: Junrong Lin, Mahmudul Hasan, Pinar Acar, Jose Blanchet, Vahid Tarokh

    Abstract: Computational experiments are exploited in finding a well-designed processing path to optimize material structures for desired properties. This requires understanding the interplay between the processing-(micro)structure-property linkages using a multi-scale approach that connects the macro-scale (process parameters) to meso (homogenized properties) and micro (crystallographic texture) scales. Due…

    Submitted 3 May, 2023; v1 submitted 11 April, 2023; originally announced May 2023.

  28. arXiv:2303.08241  [pdf, other]

    cs.CV eess.SP

    Subspace Perturbation Analysis for Data-Driven Radar Target Localization

    Authors: Shyam Venkatasubramanian, Sandeep Gogineni, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: Recent works exploring data-driven approaches to classical problems in adaptive radar have demonstrated promising results pertaining to the task of radar target localization. Via the use of space-time adaptive processing (STAP) techniques and convolutional neural networks, these data-driven approaches to target localization have helped benchmark the performance of neural networks for matched scena…

    Submitted 21 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 6 pages, 3 figures. Submitted to 2023 IEEE Radar Conference (RadarConf). Extension of arXiv:2209.02890

  29. arXiv:2302.05601  [pdf, other]

    cs.LG

    Pruning Deep Neural Networks from a Sparsity Perspective

    Authors: Enmao Diao, Ganghua Wang, Jiawei Zhan, Yuhong Yang, Jie Ding, Vahid Tarokh

    Abstract: In recent years, deep network pruning has attracted significant attention in order to enable the rapid deployment of AI into small devices with computation and memory constraints. Pruning is often achieved by dropping redundant weights, neurons, or layers of a deep network while attempting to retain a comparable test performance. Many deep pruning algorithms have been proposed with impressive empi…

    Submitted 23 August, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: ICLR 2023

  30. arXiv:2302.03821  [pdf, other]

    cs.LG math.OC stat.ME stat.ML

    PASTA: Pessimistic Assortment Optimization

    Authors: Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan X. Fang, Vahid Tarokh

    Abstract: We consider a class of assortment optimization problems in an offline data-driven setting. A firm does not know the underlying customer choice model but has access to an offline dataset consisting of the historically offered assortment set, customer choice, and revenue. The objective is to use the offline dataset to find an optimal assortment. Due to the combinatorial nature of assortment optimiza…

    Submitted 7 February, 2023; originally announced February 2023.

  31. arXiv:2302.02009  [pdf, other]

    cs.LG stat.ML

    Domain Adaptation via Rebalanced Sub-domain Alignment

    Authors: Yiling Liu, Juncheng Dong, Ziyang Jiang, Ahmed Aloui, Keyu Li, Hunter Klein, Vahid Tarokh, David Carlson

    Abstract: Unsupervised domain adaptation (UDA) is a technique used to transfer knowledge from a labeled source domain to a different but related unlabeled target domain. While many UDA methods have shown success in the past, they often assume that the source and target domains must have identical class label distributions, which can limit their effectiveness in real-world scenarios. To address this limitati…

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 20 pages, 6 figures, 4 tables

  32. arXiv:2302.00250  [pdf, other]

    stat.ML cs.LG

    Quickest Change Detection for Unnormalized Statistical Models

    Authors: Suya Wu, Enmao Diao, Taposh Banerjee, Jie Ding, Vahid Tarokh

    Abstract: Classical quickest change detection algorithms require modeling pre-change and post-change distributions. Such an approach may not be feasible for various machine learning models because of the complexity of computing the explicit distributions. Additionally, these methods may suffer from a lack of robustness to model mismatch and noise. This paper develops a new variant of the classical Cumulativ…

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: A version of this paper has been accepted by the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  33. arXiv:2212.08779  [pdf, other]

    cs.IR

    Personalized Federated Recommender Systems with Private and Partially Federated AutoEncoders

    Authors: Qi Le, Enmao Diao, Xinran Wang, Ali Anwar, Vahid Tarokh, Jie Ding

    Abstract: Recommender Systems (RSs) have become increasingly important in many application domains, such as digital marketing. Conventional RSs often need to collect users' data, centralize them on the server-side, and form a global model to generate reliable recommendations. However, they suffer from two critical limitations: the personalization problem that the RSs trained traditionally may not be customi…

    Submitted 16 December, 2022; originally announced December 2022.

  34. arXiv:2210.00380  [pdf, other]

    cs.LG stat.ME stat.ML

    Transfer Learning for Individual Treatment Effect Estimation

    Authors: Ahmed Aloui, Juncheng Dong, Cat P. Le, Vahid Tarokh

    Abstract: This work considers the problem of transferring causal knowledge between tasks for Individual Treatment Effect (ITE) estimation. To this end, we theoretically assess the feasibility of transferring ITE knowledge and present a practical framework for efficient transfer. A lower bound is introduced on the ITE error of the target task to demonstrate that ITE knowledge transfer is challenging due to t…

    Submitted 5 June, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

  35. arXiv:2209.02890  [pdf, other]

    cs.CV eess.SP

    Data-Driven Target Localization Using Adaptive Radar Processing and Convolutional Neural Networks

    Authors: Shyam Venkatasubramanian, Sandeep Gogineni, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: Leveraging the advanced functionalities of modern radio frequency (RF) modeling and simulation tools, specifically designed for adaptive radar processing applications, this paper presents a data-driven approach to improve accuracy in radar target localization post adaptive radar detection. To this end, we generate a large number of radar returns by randomly placing targets of variable strengths in…

    Submitted 9 July, 2024; v1 submitted 6 September, 2022; originally announced September 2022.

  36. arXiv:2205.14025  [pdf, other]

    stat.ME cs.LG stat.ML

    Inference and Sampling for Archimax Copulas

    Authors: Yuting Ng, Ali Hasan, Vahid Tarokh

    Abstract: Understanding multivariate dependencies in both the bulk and the tails of a distribution is an important problem for many applications, such as ensuring algorithms are robust to observations that are infrequent but have devastating effects. Archimax copulas are a family of distributions endowed with a precise representation that allows simultaneous modeling of the bulk and the tails of a distribut…

    Submitted 20 September, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Yuting Ng and Ali Hasan contributed equally to this work. This work has been accepted at NeurIPS 2022

  37. arXiv:2201.11209  [pdf, other]

    cs.LG eess.IV

    On The Energy Statistics of Feature Maps in Pruning of Neural Networks with Skip-Connections

    Authors: Mohammadreza Soltani, Suya Wu, Yuerong Li, Jie Ding, Vahid Tarokh

    Abstract: We propose a new structured pruning framework for compressing Deep Neural Networks (DNNs) with skip connections, based on measuring the statistical dependency of hidden layers and predicted outputs. The dependence measure defined by the energy statistics of hidden layers serves as a model-free measure of information between the feature maps and the output of the network. The estimated dependence m…

    Submitted 26 January, 2022; originally announced January 2022.

  38. Toward Data-Driven STAP Radar

    Authors: Shyam Venkatasubramanian, Chayut Wongkamthong, Mohammadreza Soltani, Bosung Kang, Sandeep Gogineni, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: Using an amalgamation of techniques from classical radar, computer vision, and deep learning, we characterize our ongoing data-driven approach to space-time adaptive processing (STAP) radar. We generate a rich example dataset of received radar signals by randomly placing targets of variable strengths in a predetermined region using RFView, a site-specific radio frequency modeling and simulation to…

    Submitted 9 March, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: 5 pages, 4 figures. Submitted to 2022 IEEE Radar Conference (RadarConf)

  39. arXiv:2201.09149  [pdf, other]

    cs.MA cs.IT cs.LG

    Multi-Agent Adversarial Attacks for Multi-Channel Communications

    Authors: Juncheng Dong, Suya Wu, Mohammadreza Soltani, Vahid Tarokh

    Abstract: Recently Reinforcement Learning (RL) has been applied as an anti-adversarial remedy in wireless communication networks. However, studying the RL-based approaches from the adversary's perspective has received little attention. Additionally, RL-based approaches in an anti-adversary or adversarial paradigm mostly consider single-channel communication (either channel selection or single channel power…

    Submitted 27 January, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

  40. arXiv:2201.03617  [pdf, other]

    physics.flu-dyn cs.LG

    A Physics-Informed Vector Quantized Autoencoder for Data Compression of Turbulent Flow

    Authors: Mohammadreza Momenifar, Enmao Diao, Vahid Tarokh, Andrew D. Bragg

    Abstract: Analyzing large-scale data from simulations of turbulent flows is memory intensive, requiring significant resources. This major challenge highlights the need for data compression techniques. In this study, we apply a physics-informed Deep Learning technique based on vector quantization to generate a discrete, low-dimensional representation of data from simulations of three-dimensional turbulent fl…

    Submitted 11 January, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: this article is a conference version of arXiv:2103.01074

  41. arXiv:2112.03469  [pdf, other]

    physics.flu-dyn cs.LG

    Emulating Spatio-Temporal Realizations of Three-Dimensional Isotropic Turbulence via Deep Sequence Learning Models

    Authors: Mohammadreza Momenifar, Enmao Diao, Vahid Tarokh, Andrew D. Bragg

    Abstract: We use a data-driven approach to model a three-dimensional turbulent flow using cutting-edge Deep Learning techniques. The deep learning framework incorporates physical constraints on the flow, such as preserving incompressibility and global statistical invariants of velocity gradient tensor. The accuracy of the model is assessed using statistical and physics-based metrics. The data set comes from…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: AI2ASE: AAAI Workshop on AI to Accelerate Science and Engineering, 2022

  42. arXiv:2111.13311  [pdf, other]

    cs.LG cs.CE

    Blaschke Product Neural Networks (BPNN): A Physics-Infused Neural Network for Phase Retrieval of Meromorphic Functions

    Authors: Juncheng Dong, Simiao Ren, Yang Deng, Omar Khatib, Jordan Malof, Mohammadreza Soltani, Willie Padilla, Vahid Tarokh

    Abstract: Numerous physical systems are described by ordinary or partial differential equations whose solutions are given by holomorphic or meromorphic functions in the complex domain. In many cases, only the magnitude of these functions are observed on various points on the purely imaginary jw-axis since coherent measurement of their phases is often expensive. However, it is desirable to retrieve the lost…

    Submitted 25 November, 2021; originally announced November 2021.

  43. arXiv:2111.13207  [pdf, other]

    cs.LG

    Characteristic Neural Ordinary Differential Equations

    Authors: Xingzi Xu, Ali Hasan, Khalil Elkhalil, Jie Ding, Vahid Tarokh

    Abstract: We propose Characteristic-Neural Ordinary Differential Equations (C-NODEs), a framework for extending Neural Ordinary Differential Equations (NODEs) beyond ODEs. While NODEs model the evolution of a latent variables as the solution to an ODE, C-NODE models the evolution of the latent variables as the solution of a family of first-order quasi-linear partial differential equations (PDEs) along curve…

    Submitted 9 November, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

  44. arXiv:2110.13340  [pdf, other]

    cs.IR cs.LG

    Decentralized Multi-Target Cross-Domain Recommendation for Multi-Organization Collaborations

    Authors: Enmao Diao, Vahid Tarokh, Jie Ding

    Abstract: Recommender Systems (RSs) are operated locally by different organizations in many realistic scenarios. If various organizations can fully share their data and perform computation in a centralized manner, they may significantly improve the accuracy of recommendations. However, collaborations among multiple organizations in enhancing the performance of recommendations are primarily limited due to th…

    Submitted 6 November, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

  45. arXiv:2110.02399  [pdf, other]

    cs.LG cs.CV

    Task Affinity with Maximum Bipartite Matching in Few-Shot Learning

    Authors: Cat P. Le, Juncheng Dong, Mohammadreza Soltani, Vahid Tarokh

    Abstract: We propose an asymmetric affinity score for representing the complexity of utilizing the knowledge of one task for learning another one. Our method is based on the maximum bipartite matching algorithm and utilizes the Fisher Information matrix. We provide theoretical analyses demonstrating that the proposed score is mathematically well-defined, and subsequently use the affinity score to propose a…

    Submitted 21 January, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper at ICLR 2022

  46. arXiv:2106.02104  [pdf, other]

    cs.LG

    Semi-Empirical Objective Functions for MCMC Proposal Optimization

    Authors: Chris Cannella, Vahid Tarokh

    Abstract: Current objective functions used for training neural MCMC proposal distributions implicitly rely on architectural restrictions to yield sensible optimization results, which hampers the development of highly expressive neural MCMC proposal architectures. In this work, we introduce and demonstrate a semi-empirical procedure for determining approximate objective functions suitable for optimizing arbi…

    Submitted 9 April, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: 41 pages, 21 tables, 22 figures

  47. arXiv:2106.01432  [pdf, other]

    cs.LG

    SemiFL: Semi-Supervised Federated Learning for Unlabeled Clients with Alternate Training

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Federated Learning allows the training of machine learning models by using the computation and private data resources of many distributed clients. Most existing results on Federated Learning (FL) assume the clients have ground-truth labels. However, in many practical scenarios, clients may be unable to label task-specific data due to a lack of expertise or resources. We propose SemiFL to address th…

    Submitted 11 October, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  48. arXiv:2106.01425  [pdf, other]

    cs.LG

    GAL: Gradient Assisted Learning for Decentralized Multi-Organization Collaborations

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Collaborations among multiple organizations, such as financial institutions, medical centers, and retail markets, in decentralized settings are crucial to providing improved service and performance. However, the underlying organizations may have little interest in sharing their local data, models, and objective functions. These requirements have created new challenges for multi-organization collabo…

    Submitted 11 October, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  49. arXiv:2106.00110  [pdf, other]

    cs.SD cs.LG eess.AS

    A Methodology for Exploring Deep Convolutional Features in Relation to Hand-Crafted Features with an Application to Music Audio Modeling

    Authors: Anna K. Yanchenko, Mohammadreza Soltani, Robert J. Ravier, Sayan Mukherjee, Vahid Tarokh

    Abstract: Understanding the features learned by deep models is important from a model trust perspective, especially as deep systems are deployed in the real world. Most recent approaches for deep feature understanding or model explanation focus on highlighting input data features that are relevant for classification decisions. In this work, we instead take the perspective of relating deep features to well-s…

    Submitted 9 October, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: Code available at https://github.com/aky4wn/convolutions-for-music-audio

  50. arXiv:2103.12827  [pdf, other]

    cs.LG eess.IV stat.ML

    Fisher Task Distance and Its Application in Neural Architecture Search

    Authors: Cat P. Le, Mohammadreza Soltani, Juncheng Dong, Vahid Tarokh

    Abstract: We formulate an asymmetric (or non-commutative) distance between tasks based on Fisher Information Matrices, called Fisher task distance. This distance represents the complexity of transferring the knowledge from one task to another. We provide a proof of consistency for our distance through theorems and experiments on various classification tasks from MNIST, CIFAR-10, CIFAR-100, ImageNet, and Tas…

    Submitted 30 April, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Published in IEEE Access, Volume 10, 2022