Showing 1–26 of 26 results for author: Bajwa, W U

  1. arXiv:2502.07977  [pdf, other]

    cs.LG math.OC stat.ML

    RESIST: Resilient Decentralized Learning Using Consensus Gradient Descent

    Authors: Cheng Fang, Rishabh Dixit, Waheed U. Bajwa, Mert Gurbuzbalaban

    Abstract: Empirical risk minimization (ERM) is a cornerstone of modern machine learning (ML), supported by advances in optimization theory that ensure efficient solutions with provable algorithmic convergence rates, which measure the speed at which optimization algorithms approach a solution, and statistical learning rates, which characterize how well the solution generalizes to unseen data. Privacy, memory…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: preprint of a journal paper; 100 pages and 17 figures

  2. arXiv:2308.02922  [pdf, other]

    stat.ML cs.LG eess.SP math.ST

    Structured Low-Rank Tensors for Generalized Linear Models

    Authors: Batoul Taki, Anand D. Sarwate, Waheed U. Bajwa

    Abstract: Recent works have shown that imposing tensor structures on the coefficient tensor in regression problems can lead to more reliable parameter estimation and lower sample complexity compared to vector-based methods. This work investigates a new low-rank tensor model, called Low Separation Rank (LSR), in Generalized Linear Model (GLM) problems. The LSR model -- which generalizes the well-known Tucker…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: 43 pages; published in Transactions on Machine Learning Research (08/2023)

    Journal ref: Transactions on Machine Learning Research, Aug. 2023 (https://openreview.net/forum?id=qUxBs3Ln41)

  3. arXiv:2105.14673  [pdf, ps, other]

    cs.LG eess.SP math.ST stat.ML

    A Minimax Lower Bound for Low-Rank Matrix-Variate Logistic Regression

    Authors: Batoul Taki, Mohsen Ghassemi, Anand D. Sarwate, Waheed U. Bajwa

    Abstract: This paper considers the problem of matrix-variate logistic regression. It derives the fundamental error threshold on estimating low-rank coefficient matrices in the logistic regression problem by obtaining a lower bound on the minimax risk. The bound depends explicitly on the dimension and distribution of the covariates, the rank and energy of the coefficient matrix, and the number of samples. Th…

    Submitted 28 January, 2022; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: 8 pages; published in Proc. 55th Asilomar Conf. Signals, Systems, and Computers, Pacific Grove, CA, Oct. 31-Nov. 3, 2021

  4. arXiv:2101.01300  [pdf, other]

    cs.LG cs.DC cs.MA eess.SP stat.ML

    A Linearly Convergent Algorithm for Distributed Principal Component Analysis

    Authors: Arpita Gang, Waheed U. Bajwa

    Abstract: Principal Component Analysis (PCA) is the workhorse tool for dimensionality reduction in this era of big data. While often overlooked, the purpose of PCA is not only to reduce data dimensionality, but also to yield features that are uncorrelated. Furthermore, the ever-increasing volume of data in the modern world often requires storage of data samples across multiple machines, which precludes the…

    Submitted 28 November, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

    Comments: 34 pages; final version of journal paper accepted for publication in a special issue of EURASIP J. Signal Processing

  5. arXiv:2005.08854  [pdf, other]

    cs.LG cs.DC eess.SP math.OC stat.ML

    Scaling-up Distributed Processing of Data Streams for Machine Learning

    Authors: Matthew Nokleby, Haroon Raja, Waheed U. Bajwa

    Abstract: Emerging applications of machine learning in numerous areas involve continuous gathering of and learning from streams of data. Real-time incorporation of streaming data into the learned models is essential for improved inference in these applications. Further, these applications often involve data that are either inherently gathered at geographically distributed entities or that are intentionally…

    Submitted 31 August, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 45 pages, 9 figures; preprint of a journal paper published in Proceedings of the IEEE (Special Issue on Optimization for Data-driven Learning and Control)

    Journal ref: Proc. of the IEEE, vol. 108, no. 11, pp. 1984-2012, Nov. 2020

  6. arXiv:2001.01017  [pdf, ps, other]

    cs.LG cs.CV eess.SP math.OC stat.ML

    Distributed Stochastic Algorithms for High-rate Streaming Principal Component Analysis

    Authors: Haroon Raja, Waheed U. Bajwa

    Abstract: This paper considers the problem of estimating the principal eigenvector of a covariance matrix from independent and identically distributed data samples in streaming settings. The streaming rate of data in many contemporary applications can be high enough that a single processor cannot finish an iteration of existing methods for eigenvector estimation before a new sample arrives. This paper formu…

    Submitted 3 January, 2020; originally announced January 2020.

    Comments: 37 pages, 11 figures; preprint of a journal submission

  7. arXiv:1911.03725  [pdf, other]

    cs.LG eess.SP math.ST stat.ML

    Tensor Regression Using Low-rank and Sparse Tucker Decompositions

    Authors: Talal Ahmed, Haroon Raja, Waheed U. Bajwa

    Abstract: This paper studies a tensor-structured linear regression model with a scalar response variable and tensor-structured predictors, such that the regression parameters form a tensor of order $d$ (i.e., a $d$-fold multiway array) in $\mathbb{R}^{n_1 \times n_2 \times \cdots \times n_d}$. It focuses on the task of estimating the regression tensor from $m$ realizations of the response variable and the p…

    Submitted 20 July, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: 28 pages, 5 figures, 2 tables; preprint of a journal paper published in SIAM Journal on Mathematics of Data Science

    MSC Class: 41A52; 41A63; 62F10; 62J05

    Journal ref: SIAM J. Math. Data Science, vol. 2, no. 4, pp. 944-966, 2020

  8. arXiv:1908.08649  [pdf, other]

    stat.ML cs.CR cs.DC cs.LG eess.SP

    Adversary-resilient Distributed and Decentralized Statistical Inference and Machine Learning: An Overview of Recent Advances Under the Byzantine Threat Model

    Authors: Zhixiong Yang, Arpita Gang, Waheed U. Bajwa

    Abstract: While the last few decades have witnessed a huge body of work devoted to inference and learning in distributed and decentralized setups, much of this work assumes a non-adversarial setting in which individual nodes---apart from occasional statistical failures---operate as intended within the algorithmic framework. In recent years, however, cybersecurity threats from malicious non-state actors and…

    Submitted 1 June, 2020; v1 submitted 22 August, 2019; originally announced August 2019.

    Comments: 24 pages, 6 figures, 2 tables; Published in IEEE Signal Processing Magazine, May 2020 (Special Issue on "Machine Learning From Distributed, Streaming Data")

    Journal ref: IEEE Signal Processing Mag., vol. 37, no. 3, pp. 146-159, May 2020

  9. arXiv:1908.08098  [pdf, other]

    stat.ML cs.DC cs.LG cs.MA eess.SP

    BRIDGE: Byzantine-resilient Decentralized Gradient Descent

    Authors: Cheng Fang, Zhixiong Yang, Waheed U. Bajwa

    Abstract: Machine learning has begun to play a central role in many applications. A multitude of these applications typically also involve datasets that are distributed across multiple computing devices/machines due to either design constraints (e.g., multiagent systems) or computational/privacy reasons (e.g., learning on smartphone data). Such applications often require the learning tasks to be carried out…

    Submitted 14 June, 2022; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: 20 pages, 10 figures, 2 tables; some expanded discussion as well as additional numerical experiments using the CIFAR-10 dataset

  10. arXiv:1908.00195  [pdf, other]

    cs.LG cs.CR eess.SP stat.ML

    Learning-Aided Physical Layer Attacks Against Multicarrier Communications in IoT

    Authors: Alireza Nooraiepour, Waheed U. Bajwa, Narayan B. Mandayam

    Abstract: Internet-of-Things (IoT) devices that are limited in power and processing are susceptible to physical layer (PHY) spoofing (signal exploitation) attacks owing to their inability to implement a full-blown protocol stack for security. The overwhelming adoption of multicarrier techniques such as orthogonal frequency division multiplexing (OFDM) for the PHY layer makes IoT devices further vulnerable t…

    Submitted 4 July, 2020; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: 15 pages; 20 figures; 3 tables; preprint of a paper accepted for publication in IEEE Trans. Cognitive Commun. Netw

    Journal ref: IEEE Trans. Cognitive Commun. Netw., vol. 7, no. 1, pp. 239-254, Mar. 2021

  11. arXiv:1903.09284  [pdf, other]

    cs.LG cs.IT eess.SP stat.ML

    Learning Mixtures of Separable Dictionaries for Tensor Data: Analysis and Algorithms

    Authors: Mohsen Ghassemi, Zahra Shakeri, Anand D. Sarwate, Waheed U. Bajwa

    Abstract: This work addresses the problem of learning sparse representations of tensor data using structured dictionary learning. It proposes learning a mixture of separable dictionaries to better capture the structure of tensor data by generalizing the separable dictionary learning model. Two different approaches for learning mixture of separable dictionaries are explored and sufficient conditions for loca…

    Submitted 13 June, 2020; v1 submitted 21 March, 2019; originally announced March 2019.

    Comments: 18 pages, 4 figures, 3 tables; Published in IEEE Trans. Signal Processing

    Journal ref: IEEE Trans. Signal Processing, vol. 68, pp. 33-48, 2020

  12. Identifiability of Kronecker-structured Dictionaries for Tensor Data

    Authors: Zahra Shakeri, Anand D. Sarwate, Waheed U. Bajwa

    Abstract: This paper derives sufficient conditions for local recovery of coordinate dictionaries comprising a Kronecker-structured dictionary that is used for representing $K$th-order tensor data. Tensor observations are assumed to be generated from a Kronecker-structured dictionary multiplied by sparse coefficient tensors that follow the separable sparsity model. This work provides sufficient conditions on…

    Submitted 25 May, 2018; v1 submitted 10 December, 2017; originally announced December 2017.

    Comments: 16 pages, to appear in IEEE Journal of Selected Topics in Signal Processing

    Journal ref: IEEE J. Sel. Topics Signal Processing, vol. 12, no. 5, pp. 1047-1062, Oct. 2018

  13. arXiv:1711.04887  [pdf, other]

    stat.ML cs.LG

    STARK: Structured Dictionary Learning Through Rank-one Tensor Recovery

    Authors: Mohsen Ghassemi, Zahra Shakeri, Anand D. Sarwate, Waheed U. Bajwa

    Abstract: In recent years, a class of dictionaries has been proposed for multidimensional (tensor) data representation that exploit the structure of tensor data by imposing a Kronecker structure on the dictionary underlying the data. In this work, a novel algorithm called "STARK" is provided to learn Kronecker structured dictionaries that can represent tensors of any order. By establishing that the Kroneck…

    Submitted 13 November, 2017; originally announced November 2017.

  14. arXiv:1708.08155  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    ByRDiE: Byzantine-resilient distributed coordinate descent for decentralized learning

    Authors: Zhixiong Yang, Waheed U. Bajwa

    Abstract: Distributed machine learning algorithms enable learning of models from datasets that are distributed over a network without gathering the data at a centralized location. While efficient distributed algorithms have been developed under the assumption of faultless networks, failures that can render these algorithms nonfunctional occur frequently in the real world. This paper focuses on the problem o…

    Submitted 5 July, 2019; v1 submitted 27 August, 2017; originally announced August 2017.

    Comments: Preprint of a paper accepted into IEEE Transactions on Signal and Information Processing Over Networks; 16 pages, 5 figures, and 1 table

    Journal ref: IEEE Trans. Signal Inform. Proc. over Netw., vol. 5, no. 4, pp. 611-627, Dec. 2019

  15. arXiv:1708.06077  [pdf, other]

    math.ST cs.IT stat.ML

    ExSIS: Extended Sure Independence Screening for Ultrahigh-dimensional Linear Models

    Authors: Talal Ahmed, Waheed U. Bajwa

    Abstract: Statistical inference can be computationally prohibitive in ultrahigh-dimensional linear models. Correlation-based variable screening, in which one leverages marginal correlations for removal of irrelevant variables from the model prior to statistical inference, can be used to overcome this challenge. Prior works on correlation-based variable screening either impose statistical priors on the linea…

    Submitted 4 July, 2020; v1 submitted 20 August, 2017; originally announced August 2017.

    Comments: 30 pages; 3 figures and 1 table; preprint of a journal publication

    Journal ref: EURASIP J. Signal Processing, vol. 159, pp. 33-48, Jun. 2019

  16. Stochastic Optimization from Distributed, Streaming Data in Rate-limited Networks

    Authors: Matthew Nokleby, Waheed U. Bajwa

    Abstract: Motivated by machine learning applications in networks of sensors, internet-of-things (IoT) devices, and autonomous agents, we propose techniques for distributed stochastic convex learning from high-rate data streams. The setup involves a network of nodes---each one of which has a stream of data arriving at a constant rate---that solve a stochastic convex optimization problem by collaborating with…

    Submitted 6 August, 2018; v1 submitted 25 April, 2017; originally announced April 2017.

    Comments: 16 pages, 6 figures; Accepted for publication in IEEE Transactions on Signal and Information Processing over Networks

    Journal ref: Published in IEEE Trans. Signal Inform. Proc. over Netw., vol. 5, no. 1, pp. 152-167, Mar. 2019

  17. Human Action Attribute Learning From Video Data Using Low-Rank Representations

    Authors: Tong Wu, Prudhvi Gurram, Raghuveer M. Rao, Waheed U. Bajwa

    Abstract: Representation of human actions as a sequence of human body movements or action attributes enables the development of models for human activity recognition and summarization. We present an extension of the low-rank representation (LRR) model, termed the clustering-aware structure-constrained low-rank representation (CS-LRR) model, for unsupervised learning of human action attributes from video dat…

    Submitted 4 July, 2020; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: 26 pages; 8 figures; 2 tables; Rutgers University Technical Report #2020-07-001

    Report number: Rutgers University Technical Report #2020-07-001

  18. arXiv:1605.05284  [pdf, other]

    cs.IT cs.LG stat.ML

    Minimax Lower Bounds for Kronecker-Structured Dictionary Learning

    Authors: Zahra Shakeri, Waheed U. Bajwa, Anand D. Sarwate

    Abstract: Dictionary learning is the problem of estimating the collection of atomic elements that provide a sparse representation of measured/collected signals or data. This paper finds fundamental limits on the sample complexity of estimating dictionaries for tensor data by proving a lower bound on the minimax risk. This lower bound depends on the dimensions of the tensor and parameters of the generative m…

    Submitted 17 May, 2016; originally announced May 2016.

    Comments: 5 pages, 1 figure. To appear in 2016 IEEE International Symposium on Information Theory

    Journal ref: Proc. IEEE Intl. Symp. Information Theory, Barcelona, Spain, Jul. 10-15, 2016, pp. 1148-1152

  19. arXiv:1412.7839  [pdf, other]

    cs.LG cs.IT stat.ML

    Cloud K-SVD: A Collaborative Dictionary Learning Algorithm for Big, Distributed Data

    Authors: Haroon Raja, Waheed U. Bajwa

    Abstract: This paper studies the problem of data-adaptive representations for big, distributed data. It is assumed that a number of geographically-distributed, interconnected sites have massive local data and they are interested in collaboratively learning a low-dimensional geometric structure underlying these data. In contrast to previous works on subspace-based data representations, this paper focuses on…

    Submitted 17 August, 2015; v1 submitted 25 December, 2014; originally announced December 2014.

    Comments: Accepted for Publication in IEEE Trans. Signal Processing (2015); 16 pages, 3 figures

    Journal ref: IEEE Trans. Signal Processing, vol. 64, no. 1, pp. 173-188, Jan. 2016

  20. arXiv:1412.6808  [pdf, other]

    stat.ML cs.CV cs.LG

    Learning the nonlinear geometry of high-dimensional data: Models and algorithms

    Authors: Tong Wu, Waheed U. Bajwa

    Abstract: Modern information processing relies on the axiom that high-dimensional data lie near low-dimensional geometric structures. This paper revisits the problem of data-driven learning of these geometric structures and puts forth two new nonlinear geometric models for data describing "related" objects/phenomena. The first one of these models straddles the two extremes of the subspace model and the unio…

    Submitted 9 August, 2015; v1 submitted 21 December, 2014; originally announced December 2014.

    Comments: Extended version of the journal paper accepted for publication in IEEE Trans. Signal Processing (20 pages, 7 figures, 4 tables)

    Journal ref: IEEE Trans. Signal Processing, vol. 63, no. 23, pp. 6229-6244, Dec. 2015

  21. MIMO-MC Radar: A MIMO Radar Approach Based on Matrix Completion

    Authors: Shunqiao Sun, Waheed U. Bajwa, Athina P. Petropulu

    Abstract: In a typical MIMO radar scenario, transmit nodes transmit orthogonal waveforms, while each receive node performs matched filtering with the known set of transmit waveforms, and forwards the results to the fusion center. Based on the data it receives from multiple antennas, the fusion center formulates a matrix, which, in conjunction with standard array processing schemes, such as MUSIC, leads to t…

    Submitted 13 September, 2014; originally announced September 2014.

    Comments: 29 pages, 13 figures, IEEE Trans. on Aerospace and Electronic Systems

    Journal ref: IEEE Trans. Aerosp. Electron. Syst., vol. 51, no. 3, pp. 1839-1852, Jul. 2015

  22. Target Estimation in Colocated MIMO Radar via Matrix Completion

    Authors: Shunqiao Sun, Athina P. Petropulu, Waheed U. Bajwa

    Abstract: We consider a colocated MIMO radar scenario, in which the receive antennas forward their measurements to a fusion center. Based on the received data, the fusion center formulates a matrix which is then used for target parameter estimation. When the receive antennas sample the target returns at Nyquist rate, and assuming that there are more receive antennas than targets, the data matrix at the fusi…

    Submitted 25 March, 2013; v1 submitted 17 February, 2013; originally announced February 2013.

    Comments: 5 pages, ICASSP 2013

    Journal ref: Proc. IEEE Intl. Conf. Acoustics, Speech, and Signal Processing, Vancouver, Canada, May 26-31, 2013, pp. 4144-4148

  23. arXiv:1210.2440  [pdf, ps, other]

    math.ST cs.IT stat.ML

    Group Model Selection Using Marginal Correlations: The Good, the Bad and the Ugly

    Authors: Waheed U. Bajwa, Dustin G. Mixon

    Abstract: Group model selection is the problem of determining a small subset of groups of predictors (e.g., the expression data of genes) that are responsible for the majority of the variation in a response variable (e.g., the malignancy of a tumor). This paper focuses on group model selection in high-dimensional linear models, in which the number of predictors far exceeds the number of samples of the response…

    Submitted 8 October, 2012; originally announced October 2012.

    Comments: Accepted for publication in Proc. 50th Annu. Allerton Conf. Communication, Control, and Computing, Monticello, IL, Oct. 1-5, 2012; 8 pages and 4 figures

    Journal ref: Proc. 50th Annu. Allerton Conf. Communication, Control, and Computing, Monticello, IL, Oct. 1-5, 2012, pp. 494-501

  24. Level set estimation from projection measurements: Performance guarantees and fast computation

    Authors: Kalyani Krishnamurthy, Waheed U. Bajwa, Rebecca Willett

    Abstract: Estimation of the level set of a function (i.e., regions where the function exceeds some value) is an important problem with applications in digital elevation mapping, medical imaging, astronomy, etc. In many applications, the function of interest is not observed directly. Rather, it is acquired through (linear) projection measurements, such as tomographic projections, interferometric measurements…

    Submitted 2 May, 2013; v1 submitted 18 September, 2012; originally announced September 2012.

    Comments: 23 pages, 20 figures

    MSC Class: 62; 68

    Journal ref: SIAM J. Imaging Sciences, vol. 6, no. 4, pp. 2047-2074, Oct. 2013

  25. arXiv:1104.4135  [pdf, ps, other]

    stat.ME math.ST

    Posterior consistency in linear models under shrinkage priors

    Authors: Artin Armagan, David B. Dunson, Jaeyong Lee, Waheed U. Bajwa, Nate Strawn

    Abstract: We investigate the asymptotic behavior of posterior distributions of regression coefficients in high-dimensional linear models as the number of dimensions grows with the number of observations. We show that the posterior distribution concentrates in neighborhoods of the true parameter under simple sufficient conditions. These conditions hold under popular shrinkage priors given some sparsity assum…

    Submitted 19 May, 2013; v1 submitted 20 April, 2011; originally announced April 2011.

    Comments: To appear in Biometrika

    Journal ref: Biometrika, vol. 100, no. 4, pp. 1011-1018, Dec. 2013

  26. arXiv:1006.0719  [pdf, ps, other]

    math.ST cs.IT stat.ML

    Why Gabor Frames? Two Fundamental Measures of Coherence and Their Role in Model Selection

    Authors: Waheed U. Bajwa, Robert Calderbank, Sina Jafarpour

    Abstract: This paper studies non-asymptotic model selection for the general case of arbitrary design matrices and arbitrary nonzero entries of the signal. In this regard, it generalizes the notion of incoherence in the existing literature on model selection and introduces two fundamental measures of coherence---termed as the worst-case coherence and the average coherence---among the columns of a design matr…

    Submitted 2 July, 2010; v1 submitted 3 June, 2010; originally announced June 2010.

    Comments: 31 pages, 4 figures; This paper is a full-length journal version of a shorter paper that was presented at the IEEE International Symposium on Information Theory, Austin, TX, June 2010

    Journal ref: J. Commun. Netw., vol. 12, no. 4, pp. 289-307, Aug. 2010