
Showing 51–100 of 143 results for author: Tarokh, V

  1. arXiv:2110.02399  [pdf, other]

    cs.LG cs.CV

    Task Affinity with Maximum Bipartite Matching in Few-Shot Learning

    Authors: Cat P. Le, Juncheng Dong, Mohammadreza Soltani, Vahid Tarokh

    Abstract: We propose an asymmetric affinity score for representing the complexity of utilizing the knowledge of one task for learning another one. Our method is based on the maximum bipartite matching algorithm and utilizes the Fisher Information matrix. We provide theoretical analyses demonstrating that the proposed score is mathematically well-defined, and subsequently use the affinity score to propose a…

    Submitted 21 January, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper at ICLR 2022

  2. arXiv:2106.02104  [pdf, other]

    cs.LG

    Semi-Empirical Objective Functions for MCMC Proposal Optimization

    Authors: Chris Cannella, Vahid Tarokh

    Abstract: Current objective functions used for training neural MCMC proposal distributions implicitly rely on architectural restrictions to yield sensible optimization results, which hampers the development of highly expressive neural MCMC proposal architectures. In this work, we introduce and demonstrate a semi-empirical procedure for determining approximate objective functions suitable for optimizing arbi…

    Submitted 9 April, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: 41 pages, 21 tables, 22 figures

  3. arXiv:2106.01432  [pdf, other]

    cs.LG

    SemiFL: Semi-Supervised Federated Learning for Unlabeled Clients with Alternate Training

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Federated Learning allows the training of machine learning models by using the computation and private data resources of many distributed clients. Most existing results on Federated Learning (FL) assume the clients have ground-truth labels. However, in many practical scenarios, clients may be unable to label task-specific data due to a lack of expertise or resource. We propose SemiFL to address th…

    Submitted 11 October, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  4. arXiv:2106.01425  [pdf, other]

    cs.LG

    GAL: Gradient Assisted Learning for Decentralized Multi-Organization Collaborations

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Collaborations among multiple organizations, such as financial institutions, medical centers, and retail markets in decentralized settings are crucial to providing improved service and performance. However, the underlying organizations may have little interest in sharing their local data, models, and objective functions. These requirements have created new challenges for multi-organization collabo…

    Submitted 11 October, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  5. arXiv:2106.00110  [pdf, other]

    cs.SD cs.LG eess.AS

    A Methodology for Exploring Deep Convolutional Features in Relation to Hand-Crafted Features with an Application to Music Audio Modeling

    Authors: Anna K. Yanchenko, Mohammadreza Soltani, Robert J. Ravier, Sayan Mukherjee, Vahid Tarokh

    Abstract: Understanding the features learned by deep models is important from a model trust perspective, especially as deep systems are deployed in the real world. Most recent approaches for deep feature understanding or model explanation focus on highlighting input data features that are relevant for classification decisions. In this work, we instead take the perspective of relating deep features to well-s…

    Submitted 9 October, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: Code available at https://github.com/aky4wn/convolutions-for-music-audio

  6. arXiv:2103.12827  [pdf, other]

    cs.LG eess.IV stat.ML

    Fisher Task Distance and Its Application in Neural Architecture Search

    Authors: Cat P. Le, Mohammadreza Soltani, Juncheng Dong, Vahid Tarokh

    Abstract: We formulate an asymmetric (or non-commutative) distance between tasks based on Fisher Information Matrices, called Fisher task distance. This distance represents the complexity of transferring the knowledge from one task to another. We provide a proof of consistency for our distance through theorems and experiments on various classification tasks from MNIST, CIFAR-10, CIFAR-100, ImageNet, and Tas…

    Submitted 30 April, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Published in IEEE Access, Volume 10, 2022

  7. arXiv:2103.02260  [pdf, other]

    cs.CR cs.CY cs.DC cs.NI

    Talaria: A Framework for Simulation of Permissioned Blockchains for Logistics and Beyond

    Authors: Jiali Xing, David Fischer, Nitya Labh, Ryan Piersma, Benjamin C. Lee, Yu Amy Xia, Tuhin Sahai, Vahid Tarokh

    Abstract: In this paper, we present Talaria, a novel permissioned blockchain simulator that supports numerous protocols and use cases, most notably in supply chain management. Talaria extends the capability of BlockSim, an existing blockchain simulator, to include permissioned blockchains and serves as a foundation for further private blockchain assessment. Talaria is designed with both practical Byzantine…

    Submitted 30 March, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  8. Dimension Reduced Turbulent Flow Data From Deep Vector Quantizers

    Authors: Mohammadreza Momenifar, Enmao Diao, Vahid Tarokh, Andrew D. Bragg

    Abstract: Analyzing large-scale data from simulations of turbulent flows is memory intensive, requiring significant resources. This major challenge highlights the need for data compression techniques. In this study, we apply a physics-informed Deep Learning technique based on vector quantization to generate a discrete, low-dimensional representation of data from simulations of three-dimensional turbulent fl…

    Submitted 24 May, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Journal ref: Journal of Turbulence 2022

  9. arXiv:2103.00241  [pdf, other]

    cs.LG cs.CV

    Improved Automated Machine Learning from Transfer Learning

    Authors: Cat P. Le, Mohammadreza Soltani, Robert Ravier, Vahid Tarokh

    Abstract: In this paper, we propose a neural architecture search framework based on a similarity measure between some baseline tasks and a target task. We first define the notion of the task similarity based on the log-determinant of the Fisher Information matrix. Next, we compute the task similarity from each of the baseline tasks to the target task. By utilizing the relation between a target and a set of…

    Submitted 29 January, 2022; v1 submitted 27 February, 2021; originally announced March 2021.

  10. arXiv:2102.11351  [pdf, other]

    cs.LG stat.ML

    Generative Archimedean Copulas

    Authors: Yuting Ng, Ali Hasan, Khalil Elkhalil, Vahid Tarokh

    Abstract: We propose a new generative modeling technique for learning multidimensional cumulative distribution functions (CDFs) in the form of copulas. Specifically, we consider certain classes of copulas known as Archimedean and hierarchical Archimedean copulas, popular for their parsimonious representation and ability to model different tail dependencies. We consider their representation as mixture models…

    Submitted 10 June, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: UAI 2021

  11. arXiv:2102.09042  [pdf, other]

    stat.ML cs.LG stat.CO

    Modeling Extremes with d-max-decreasing Neural Networks

    Authors: Ali Hasan, Khalil Elkhalil, Yuting Ng, Joao M. Pereira, Sina Farsiu, Jose H. Blanchet, Vahid Tarokh

    Abstract: We propose a novel neural network architecture that enables non-parametric calibration and generation of multivariate extreme value distributions (MEVs). MEVs arise from Extreme Value Theory (EVT) as the necessary class of models when extrapolating a distributional fit over large spatial and temporal scales based on data observed in intermediate scales. In turn, EVT dictates that $d$-max-decreasin…

    Submitted 1 March, 2022; v1 submitted 17 February, 2021; originally announced February 2021.

  12. arXiv:2012.13307  [pdf, other]

    math.ST cs.IT cs.LG eess.SP physics.data-an

    On Statistical Efficiency in Learning

    Authors: Jie Ding, Enmao Diao, Jiawei Zhou, Vahid Tarokh

    Abstract: A central issue of many statistical learning problems is to select an appropriate model from a set of candidate models. Large models tend to inflate the variance (or overfitting), while small models tend to cause biases (or underfitting) for a given fixed dataset. In this work, we address the critical challenge of model selection to strike a balance between model fitting and model complexity, thus…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: to be published by the IEEE Transactions on Information Theory

  13. arXiv:2010.13962  [pdf, ps, other]

    cs.LG cs.AI

    Task-Aware Neural Architecture Search

    Authors: Cat P. Le, Mohammadreza Soltani, Robert Ravier, Vahid Tarokh

    Abstract: The design of handcrafted neural networks requires a lot of time and resources. Recent techniques in Neural Architecture Search (NAS) have proven to be competitive or better than traditional handcrafted design, although they require domain knowledge and have generally used limited search spaces. In this paper, we propose a novel framework for neural architecture search, utilizing a dictionary of m…

    Submitted 15 March, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

  14. arXiv:2010.01264  [pdf, other]

    cs.LG stat.ML

    HeteroFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Federated Learning (FL) is a method of training machine learning models on private data distributed over a large number of possibly heterogeneous clients such as mobile phones and IoT devices. In this work, we propose a new federated learning framework named HeteroFL to address heterogeneous clients equipped with very different computation and communication capabilities. Our solution can enable th…

    Submitted 13 December, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: ICLR 2021

  15. arXiv:2007.06682  [pdf, other]

    cs.LG cs.CV stat.ML

    GeoStat Representations of Time Series for Fast Classification

    Authors: Robert J. Ravier, Mohammadreza Soltani, Miguel Simões, Denis Garagic, Vahid Tarokh

    Abstract: Recent advances in time series classification have largely focused on methods that either employ deep learning or utilize other machine learning models for feature extraction. Though successful, their power often comes at the requirement of computational complexity. In this paper, we introduce GeoStat representations for time series. GeoStat representations are based off of a generalization of rec…

    Submitted 11 January, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 28 pages, 8 tables, 5 figures

  16. arXiv:2007.06407  [pdf, other]

    cs.NE q-bio.NC

    Deep Cross-Subject Mapping of Neural Activity

    Authors: Marko Angjelichinoski, Bijan Pesaran, Vahid Tarokh

    Abstract: Objective. In this paper, we consider the problem of cross-subject decoding, where neural activity data collected from the prefrontal cortex of a given subject (destination) is used to decode motor intentions from the neural activity of a different subject (source). Approach. We cast the problem of neural activity mapping in a probabilistic framework where we adopt deep generative modelling. Our p…

    Submitted 21 February, 2022; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: More details on hyper-parameter tuning and some additional results

  17. arXiv:2007.06140  [pdf, other]

    cs.LG stat.ML

    Projected Latent Markov Chain Monte Carlo: Conditional Sampling of Normalizing Flows

    Authors: Chris Cannella, Mohammadreza Soltani, Vahid Tarokh

    Abstract: We introduce Projected Latent Markov Chain Monte Carlo (PL-MCMC), a technique for sampling from the high-dimensional conditional distributions learned by a normalizing flow. We prove that a Metropolis-Hastings implementation of PL-MCMC asymptotically samples from the exact conditional distributions associated with a normalizing flow. As a conditional sampling method, PL-MCMC enables Monte Carlo Ex…

    Submitted 26 February, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: 27 pages, 22 figures, 4 tables

  18. arXiv:2007.06120  [pdf, other]

    stat.ML cs.LG

    Fisher Auto-Encoders

    Authors: Khalil Elkhalil, Ali Hasan, Jie Ding, Sina Farsiu, Vahid Tarokh

    Abstract: It has been conjectured that the Fisher divergence is more robust to model uncertainty than the conventional Kullback-Leibler (KL) divergence. This motivates the design of a new class of robust generative auto-encoders (AE) referred to as Fisher auto-encoders. Our approach is to design Fisher AEs by minimizing the Fisher divergence between the intractable joint distribution of observed data and la…

    Submitted 23 October, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

  19. arXiv:2007.06075  [pdf, other]

    stat.ML cs.LG

    Identifying Latent Stochastic Differential Equations

    Authors: Ali Hasan, João M. Pereira, Sina Farsiu, Vahid Tarokh

    Abstract: We present a method for learning latent stochastic differential equations (SDEs) from high-dimensional time series data. Given a high-dimensional time series generated from a lower dimensional latent unknown Itô process, the proposed method learns the mapping from ambient to latent space, and the underlying SDE coefficients, through a self-supervised learning approach. Using the framework of varia…

    Submitted 26 November, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: 20 pages, 8 figures, to be published in IEEE Transactions of Signal Processing

  20. arXiv:2005.07342  [pdf, other]

    stat.ME stat.ML

    Model Linkage Selection for Cooperative Learning

    Authors: Jiaying Zhou, Jie Ding, Kean Ming Tan, Vahid Tarokh

    Abstract: We consider a distributed learning setting where each agent/learner holds a specific parametric model and data source. The goal is to integrate information across a set of learners to enhance the prediction accuracy of a given learner. A natural way to integrate information is to build a joint model across a group of learners that shares common parameters of interest. However, the underlying param…

    Submitted 20 September, 2021; v1 submitted 14 May, 2020; originally announced May 2020.

  21. arXiv:2002.11582  [pdf, other]

    math.OC cs.LG

    Proximal Gradient Algorithm with Momentum and Flexible Parameter Restart for Nonconvex Optimization

    Authors: Yi Zhou, Zhe Wang, Kaiyi Ji, Yingbin Liang, Vahid Tarokh

    Abstract: Various types of parameter restart schemes have been proposed for accelerated gradient algorithms to facilitate their practical convergence in convex optimization. However, the convergence properties of accelerated gradient algorithms under parameter restart remain obscure in nonconvex optimization. In this paper, we propose a novel accelerated proximal gradient algorithm with parameter restart (n…

    Submitted 27 April, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

  22. Multimodal Controller for Generative Models

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Class-conditional generative models are crucial tools for data generation from user-specified class labels. Existing approaches for class-conditional generative models require nontrivial modifications of backbone generative architectures to model conditional information fed into the model. This paper introduces a plug-and-play module named `multimodal controller' to generate multimodal data withou…

    Submitted 3 August, 2022; v1 submitted 6 February, 2020; originally announced February 2020.

  23. arXiv:2001.00564  [pdf, other]

    cs.LG stat.ML

    Robust Marine Buoy Placement for Ship Detection Using Dropout K-Means

    Authors: Yuting Ng, João M. Pereira, Denis Garagic, Vahid Tarokh

    Abstract: Marine buoys aid in the battle against Illegal, Unreported and Unregulated (IUU) fishing by detecting fishing vessels in their vicinity. Marine buoys, however, may be disrupted by natural causes and buoy vandalism. In this paper, we formulate marine buoy placement as a clustering problem, and propose dropout k-means and dropout k-median to improve placement robustness to buoy disruption. We simu…

    Submitted 20 February, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: ICASSP 2020

  24. arXiv:1911.05127  [pdf, other]

    math.OC

    Distributed Online Convex Optimization with Improved Dynamic Regret

    Authors: Yan Zhang, Robert J. Ravier, Vahid Tarokh, Michael M. Zavlanos

    Abstract: In this paper, we consider the problem of distributed online convex optimization, where a group of agents collaborate to track the global minimizers of a sum of time-varying objective functions in an online manner. Specifically, we propose a novel distributed online gradient descent algorithm that relies on an online adaptation of the gradient tracking technique used in static optimization. We sho…

    Submitted 13 October, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

  25. arXiv:1911.05050  [pdf, ps, other]

    math.OC cs.LG

    A Distributed Online Convex Optimization Algorithm with Improved Dynamic Regret

    Authors: Yan Zhang, Robert J. Ravier, Michael M. Zavlanos, Vahid Tarokh

    Abstract: In this paper, we consider the problem of distributed online convex optimization, where a network of local agents aim to jointly optimize a convex function over a period of multiple time steps. The agents do not have any information about the future. Existing algorithms have established dynamic regret bounds that have explicit dependence on the number of time steps. In this work, we show that we c…

    Submitted 12 November, 2019; originally announced November 2019.

  26. arXiv:1911.03540  [pdf, ps, other]

    cs.NE eess.SP

    Cross-subject Decoding of Eye Movement Goals from Local Field Potentials

    Authors: Marko Angjelichinoski, John Choi, Taposh Banerjee, Bijan Pesaran, Vahid Tarokh

    Abstract: Objective. We consider the cross-subject decoding problem from local field potential (LFP) signals, where training data collected from the prefrontal cortex (PFC) of a source subject is used to decode intended motor actions in a destination subject. Approach. We propose a novel supervised transfer learning technique, referred to as data centering, which is used to adapt the feature space of the so…

    Submitted 6 January, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

  27. Supervised Encoding for Discrete Representation Learning

    Authors: Cat P. Le, Yi Zhou, Jie Ding, Vahid Tarokh

    Abstract: Classical supervised classification tasks search for a nonlinear mapping that maps each encoded feature directly to a probability mass over the labels. Such a learning framework typically lacks the intuition that encoded features from the same class tend to be similar and thus has little interpretability for the learned features. In this paper, we propose a novel supervised learning model named Su…

    Submitted 14 October, 2019; originally announced October 2019.

  28. arXiv:1910.10341  [pdf, other]

    eess.IV cs.LG stat.ML

    Deep Clustering of Compressed Variational Embeddings

    Authors: Suya Wu, Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Motivated by the ever-increasing demands for limited communication bandwidth and low-power consumption, we propose a new methodology, named joint Variational Autoencoders with Bernoulli mixture models (VAB), for performing clustering in the compressed data domain. The idea is to reduce the data dimension by Variational Autoencoders (VAEs) and group data representations by Bernoulli mixture models…

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: Submitted to the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, May 2020

  29. arXiv:1910.10262  [pdf, other]

    cs.LG stat.ML

    Learning Partial Differential Equations from Data Using Neural Networks

    Authors: Ali Hasan, João M. Pereira, Robert Ravier, Sina Farsiu, Vahid Tarokh

    Abstract: We develop a framework for estimating unknown partial differential equations from noisy data, using a deep learning approach. Given noisy samples of a solution to an unknown PDE, our method interpolates the samples using a neural network, and extracts the PDE by equating derivatives of the neural network approximation. Our method applies to PDEs which are linear combinations of user-defined dictio…

    Submitted 22 October, 2019; originally announced October 2019.

  30. arXiv:1910.09122  [pdf, other]

    cs.LG cs.CV stat.ML

    Perception-Distortion Trade-off with Restricted Boltzmann Machines

    Authors: Chris Cannella, Jie Ding, Mohammadreza Soltani, Vahid Tarokh

    Abstract: In this work, we introduce a new procedure for applying Restricted Boltzmann Machines (RBMs) to missing data inference tasks, based on linearization of the effective energy function governing the distribution of observations. We compare the performance of our proposed procedure with those obtained using existing reconstruction procedures trained on incomplete data. We place these performance compa…

    Submitted 20 October, 2019; originally announced October 2019.

    Comments: 5 pages, 1 figure

  31. Speech Emotion Recognition with Dual-Sequence LSTM Architecture

    Authors: Jianyou Wang, Michael Xue, Ryan Culhane, Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Speech Emotion Recognition (SER) has emerged as a critical component of the next generation human-machine interfacing technologies. In this work, we propose a new dual-level model that predicts emotions based on both MFCC features and mel-spectrograms produced from raw audio signals. Each utterance is preprocessed into MFCC features and two mel-spectrograms at different time-frequency resolutions.…

    Submitted 12 February, 2020; v1 submitted 19 October, 2019; originally announced October 2019.

    Comments: Accepted by ICASSP 2020

  32. Restricted Recurrent Neural Networks

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Recurrent Neural Network (RNN) and its variations such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU), have become standard building blocks for learning online data of sequential nature in many research areas, including natural language processing and speech data analysis. In this paper, we present a new methodology to significantly reduce the number of parameters in RNNs while ma…

    Submitted 14 November, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

  33. DRASIC: Distributed Recurrent Autoencoder for Scalable Image Compression

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: We propose a new architecture for distributed image compression from a group of distributed data sources. The work is motivated by practical needs of data-driven codec design, low power consumption, robustness, and data privacy. The proposed architecture, which we refer to as Distributed Recurrent Autoencoder for Scalable Image Compression (DRASIC), is able to train distributed encoders and one jo…

    Submitted 27 December, 2019; v1 submitted 23 March, 2019; originally announced March 2019.

  34. Convergence Rate of Empirical Spectral Distribution of Random Matrices from Linear Codes

    Authors: Chin Hei Chan, Vahid Tarokh, Maosheng Xiong

    Abstract: It is known that the empirical spectral distribution of random matrices obtained from linear codes of increasing length converges to the well-known Marchenko-Pastur law, if the Hamming distance of the dual codes is at least 5. In this paper, we prove that the convergence in probability is at least of the order $n^{-1/4}$ where $n$ is the length of the code.

    Submitted 12 May, 2020; v1 submitted 22 February, 2019; originally announced February 2019.

    MSC Class: 60B20; 94B05

    Journal ref: IEEE Trans. Inform. Theory 67 (2021), no. 2, 1080-1087

  35. arXiv:1902.02715

    math.OC cs.LG

    Momentum Schemes with Stochastic Variance Reduction for Nonconvex Composite Optimization

    Authors: Yi Zhou, Zhe Wang, Kaiyi Ji, Yingbin Liang, Vahid Tarokh

    Abstract: Two new stochastic variance-reduced algorithms named SARAH and SPIDER have been recently proposed, and SPIDER has been shown to achieve a near-optimal gradient oracle complexity for nonconvex optimization. However, the theoretical advantage of SPIDER does not lead to substantial improvement of practical performance over SVRG. To address this issue, momentum technique can be a good candidate to imp…

    Submitted 15 May, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: We are merging the results of this paper with another paper at arXiv:1810.10690. Therefore, we want to withdraw this paper

  36. arXiv:1901.11500  [pdf, other]

    math.OC

    Prediction in Online Convex Optimization for Parametrizable Objective Functions

    Authors: Robert Ravier, Vahid Tarokh

    Abstract: Many techniques for online optimization problems involve making decisions based solely on presently available information: fewer works take advantage of potential predictions. In this paper, we discuss the problem of online convex optimization for parametrizable objectives, i.e. optimization problems that depend solely on the value of a parameter at a given time. We introduce a new regularity for…

    Submitted 31 January, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: 10 pages, 3 figures

  37. arXiv:1901.10397  [pdf, ps, other]

    cs.NE q-bio.NC

    Minimax-optimal decoding of movement goals from local field potentials using complex spectral features

    Authors: Marko Angjelichinoski, Taposh Banerjee, John Choi, Bijan Pesaran, Vahid Tarokh

    Abstract: We consider the problem of predicting eye movement goals from local field potentials (LFP) recorded through a multielectrode array in the macaque prefrontal cortex. The monkey is tasked with performing memory-guided saccades to one of eight targets during which LFP activity is recorded and used to train a decoder. Previous reports have mainly relied on the spectral amplitude of the LFPs as a featu…

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: Under review

  38. arXiv:1901.00451  [pdf, ps, other]

    cs.LG stat.ML

    SGD Converges to Global Minimum in Deep Learning via Star-convex Path

    Authors: Yi Zhou, Junjie Yang, Huishuai Zhang, Yingbin Liang, Vahid Tarokh

    Abstract: Stochastic gradient descent (SGD) has been found to be surprisingly effective in training a variety of deep neural networks. However, there is still a lack of understanding on how and why SGD can train these complex networks towards a global minimum. In this study, we establish the convergence of SGD to a global minimum for nonconvex optimization problems that are commonly encountered in neural ne…

    Submitted 2 January, 2019; originally announced January 2019.

    Comments: ICLR2019

  39. arXiv:1810.10690  [pdf, other]

    math.OC cs.LG stat.ML

    SpiderBoost and Momentum: Faster Stochastic Variance Reduction Algorithms

    Authors: Zhe Wang, Kaiyi Ji, Yi Zhou, Yingbin Liang, Vahid Tarokh

    Abstract: SARAH and SPIDER are two recently developed stochastic variance-reduced algorithms, and SPIDER has been shown to achieve a near-optimal first-order oracle complexity in smooth nonconvex optimization. However, SPIDER uses an accuracy-dependent stepsize that slows down the convergence in practice, and cannot handle objective functions that involve nonsmooth regularizers. In this paper, we propose Sp…

    Submitted 15 May, 2020; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: Appear in NeurIPS 2019

  40. arXiv:1810.09583  [pdf, other]

    stat.ML cs.IT cs.LG econ.EM physics.app-ph

    Model Selection Techniques -- An Overview

    Authors: Jie Ding, Vahid Tarokh, Yuhong Yang

    Abstract: In the era of big data, analysts usually explore various statistical models or machine learning methods for observed data in order to facilitate scientific discoveries or gain predictive power. Whatever data and fitting procedures are employed, a crucial step is to select the most appropriate model or method from a set of candidates. Model selection is a key ingredient in data analysis for reliabl…

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: accepted by IEEE SIGNAL PROCESSING MAGAZINE

  41. arXiv:1810.03817  [pdf, ps, other]

    cs.LG stat.ML

    Learning Bounds for Greedy Approximation with Explicit Feature Maps from Multiple Kernels

    Authors: Shahin Shahrampour, Vahid Tarokh

    Abstract: Nonlinear kernels can be approximated using finite-dimensional feature maps for efficient risk minimization. Due to the inherent trade-off between the dimension of the (mapped) feature space and the approximation accuracy, the key problem is to identify promising (explicit) features leading to a satisfactory out-of-sample performance. In this work, we tackle this problem by efficiently choosing su…

    Submitted 9 October, 2018; originally announced October 2018.

    Comments: Proc. of 2018 Advances in Neural Information Processing Systems (NIPS 2018)

  42. arXiv:1809.00408  [pdf, other]

    math.PR math.CO

    Asymptotically Pseudo-Independent Matrices

    Authors: Ilya Soloveychik, Vahid Tarokh

    Abstract: We show that the family of pseudo-random matrices recently discovered by Soloveychik, Xiang, and Tarokh in their work `Symmetric Pseudo-Random Matrices' exhibits asymptotic independence. More specifically, any two sequences of matrices of matching sizes from that construction generated using sequences of different non-reciprocal primitive polynomials are asymptotically independent.

    Submitted 29 October, 2018; v1 submitted 2 September, 2018; originally announced September 2018.

    Comments: arXiv admin note: text overlap with arXiv:1702.04086

  43. arXiv:1809.00358  [pdf, other]

    eess.SP q-bio.NC stat.AP stat.ME

    Sequential Detection of Regime Changes in Neural Data

    Authors: Taposh Banerjee, Stephen Allsop, Kay M. Tye, Demba Ba, Vahid Tarokh

    Abstract: The problem of detecting changes in firing patterns in neural data is studied. The problem is formulated as a quickest change detection problem. Important algorithms from the literature are reviewed. A new algorithmic technique is discussed to detect deviations from learned baseline behavior. The algorithms studied can be applied to both spike and local field potential data. The algorithms are app…

    Submitted 2 September, 2018; originally announced September 2018.

  44. arXiv:1807.06945  [pdf, other]

    eess.SP cs.LG stat.ME stat.ML

    Cyclostationary Statistical Models and Algorithms for Anomaly Detection Using Multi-Modal Data

    Authors: Taposh Banerjee, Gene Whipps, Prudhvi Gurram, Vahid Tarokh

    Abstract: A framework is proposed to detect anomalies in multi-modal data. A deep neural network-based object detector is employed to extract counts of objects and sub-events from the data. A cyclostationary model is proposed to model regular patterns of behavior in the count sequences. The anomaly detection problem is formulated as a problem of detecting deviations from learned cyclostationary behavior. Se…

    Submitted 2 July, 2018; originally announced July 2018.

  45. arXiv:1806.03571  [pdf, other]

    stat.ML cs.LG

    Stationary Geometric Graphical Model Selection

    Authors: Ilya Soloveychik, Vahid Tarokh

    Abstract: We consider the problem of model selection in Gaussian Markov fields in the sample deficient scenario. In many practically important cases, the underlying networks are embedded into Euclidean spaces. Using the natural geometric structure, we introduce the notion of spatially stationary distributions over geometric graphs. This directly generalizes the notion of stationary time series to the multid…

    Submitted 29 October, 2018; v1 submitted 9 June, 2018; originally announced June 2018.

    Comments: arXiv admin note: text overlap with arXiv:1802.03848

  46. arXiv:1803.08947  [pdf, other]

    stat.AP cs.IT

    Sequential Event Detection Using Multimodal Data in Nonstationary Environments

    Authors: Taposh Banerjee, Gene Whipps, Prudhvi Gurram, Vahid Tarokh

    Abstract: The problem of sequential detection of anomalies in multimodal data is considered. The objective is to observe physical sensor data from CCTV cameras, and social media data from Twitter and Instagram to detect anomalous behaviors or events. Data from each modality is transformed to discrete time count data by using an artificial neural network to obtain counts of objects in CCTV images and by coun…

    Submitted 23 March, 2018; originally announced March 2018.

  47. Estimation of the Evolutionary Spectra with Application to Stationarity Test

    Authors: Yu Xiang, Jie Ding, Vahid Tarokh

    Abstract: In this work, we propose a new inference procedure for understanding non-stationary processes, under the framework of evolutionary spectra developed by Priestley. Among various frameworks of modeling non-stationary processes, the distinguishing feature of the evolutionary spectra is its focus on the physical meaning of frequency. The classical estimate of the evolutionary spectral density is based…

    Submitted 17 January, 2019; v1 submitted 25 February, 2018; originally announced February 2018.

    Comments: To appear in IEEE Transactions on Signal Processing. A short version of this work appeared in ICASSP 2018

  48. arXiv:1802.03849  [pdf, other]

    math.PR

    Large Deviations of Convex Polyominoes

    Authors: Ilya Soloveychik, Vahid Tarokh

    Abstract: Enumeration of various types of lattice polygons and in particular polyominoes is of primary importance in many machine learning, pattern recognition, and geometric analysis problems. In this work, we develop a large deviation principle for convex polyominoes under different restrictions, such as fixed area and/or perimeter.

    Submitted 18 April, 2018; v1 submitted 11 February, 2018; originally announced February 2018.

  49. arXiv:1802.03848  [pdf, other]

    stat.ML

    Region Detection in Markov Random Fields: Gaussian Case

    Authors: Ilya Soloveychik, Vahid Tarokh

    Abstract: We consider the problem of model selection in Gaussian Markov fields in the sample deficient scenario. The benchmark information-theoretic results in the case of d-regular graphs require the number of samples to be at least proportional to the logarithm of the number of vertices to allow consistent graph recovery. When the number of samples is less than this amount, reliable detection of all edges…

    Submitted 28 March, 2018; v1 submitted 11 February, 2018; originally announced February 2018.

  50. arXiv:1712.07102  [pdf, other]

    stat.ML cs.LG

    On Data-Dependent Random Features for Improved Generalization in Supervised Learning

    Authors: Shahin Shahrampour, Ahmad Beirami, Vahid Tarokh

    Abstract: The randomized-feature approach has been successfully employed in large-scale kernel approximation and supervised learning. The distribution from which the random features are drawn impacts the number of features required to efficiently perform a learning task. Recently, it has been shown that employing data-dependent randomization improves the performance in terms of the required number of random…

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: 12 pages; (pages 1-8) to appear in Proc. of AAAI Conference on Artificial Intelligence (AAAI), 2018