Skip to main content

Showing 1–50 of 61 results for author: Balakrishnan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.04317  [pdf, ps, other

    eess.IV cs.AI cs.CV cs.LG

    CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning

    Authors: Fatmaelzahraa Ali Ahmed, Muhammad Arsalan, Abdulaziz Al-Ali, Khalid Al-Jalham, Shidin Balakrishnan

    Abstract: Understanding surgical scenes can provide better healthcare quality for patients, especially with the vast amount of video data that is generated during MIS. Processing these videos generates valuable assets for training sophisticated models. In this paper, we introduce CLIP-RL, a novel contrastive language-image pre-training model tailored for semantic segmentation for surgical scenes. CLIP-RL pr… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  2. arXiv:2507.04304  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Surg-SegFormer: A Dual Transformer-Based Model for Holistic Surgical Scene Segmentation

    Authors: Fatimaelzahraa Ahmed, Muraam Abdel-Ghani, Muhammad Arsalan, Mahmoud Ali, Abdulaziz Al-Ali, Shidin Balakrishnan

    Abstract: Holistic surgical scene segmentation in robot-assisted surgery (RAS) enables surgical residents to identify various anatomical tissues, articulated tools, and critical structures, such as veins and vessels. Given the firm intraoperative time constraints, it is challenging for surgeons to provide detailed real-time explanations of the operative field for trainees. This challenge is compounded by th… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: Accepted in IEEE Case 2025

  3. arXiv:2506.19025  [pdf, ps, other

    math.ST cs.AI cs.LG stat.ME stat.ML

    Statistical Inference for Optimal Transport Maps: Recent Advances and Perspectives

    Authors: Sivaraman Balakrishnan, Tudor Manole, Larry Wasserman

    Abstract: In many applications of optimal transport (OT), the object of primary interest is the optimal transport map. This map rearranges mass from one probability distribution to another in the most efficient way possible by minimizing a specified cost. In this paper we review recent advances in estimating and developing limit theorems for the OT map, using samples from the underlying distributions. We al… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 36 pages, 1 figure

  4. arXiv:2506.18474  [pdf, ps, other

    eess.IV cs.AI cs.CV cs.LG

    A Deep Convolutional Neural Network-Based Novel Class Balancing for Imbalance Data Segmentation

    Authors: Atifa Kalsoom, M. A. Iftikhar, Amjad Ali, Zubair Shah, Shidin Balakrishnan, Hazrat Ali

    Abstract: Retinal fundus images provide valuable insights into the human eye's interior structure and crucial features, such as blood vessels, optic disk, macula, and fovea. However, accurate segmentation of retinal blood vessels can be challenging due to imbalanced data distribution and varying vessel thickness. In this paper, we propose BLCB-CNN, a novel pipeline based on deep learning and bi-level class… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: This is preprint of the paper submitted to Scientific Reports journal

  5. arXiv:2502.13417  [pdf, other

    cs.CL cs.AI cs.LG

    RLTHF: Targeted Human Feedback for LLM Alignment

    Authors: Yifei Xu, Tusher Chakraborty, Emre Kıcıman, Bibek Aryal, Eduardo Rodrigues, Srinagesh Sharma, Roberto Estevao, Maria Angels de Luis Balaguer, Jessica Wolk, Rafael Padilha, Leonardo Nunes, Shobana Balakrishnan, Songwu Lu, Ranveer Chandra

    Abstract: Fine-tuning large language models (LLMs) to align with user preferences is challenging due to the high cost of quality human annotations in Reinforcement Learning from Human Feedback (RLHF) and the generalizability limitations of AI Feedback. To address these challenges, we propose RLTHF, a human-AI hybrid framework that combines LLM-based initial alignment with selective human annotations to achi… ▽ More

    Submitted 20 February, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

  6. arXiv:2502.12326  [pdf, ps, other

    math.ST cs.LG stat.ME stat.ML

    Stability Bounds for Smooth Optimal Transport Maps and their Statistical Implications

    Authors: Sivaraman Balakrishnan, Tudor Manole

    Abstract: We study estimators of the optimal transport (OT) map between two probability distributions. We focus on plugin estimators derived from the OT map between estimates of the underlying distributions. We develop novel stability bounds for OT maps which generalize those in past work, and allow us to reduce the problem of optimally estimating the transport map to that of optimally estimating densities… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 26 pages, 1 figure

  7. arXiv:2410.07269  [pdf

    eess.IV cs.AI cs.CV

    Deep Learning for Surgical Instrument Recognition and Segmentation in Robotic-Assisted Surgeries: A Systematic Review

    Authors: Fatimaelzahraa Ali Ahmed, Mahmoud Yousef, Mariam Ali Ahmed, Hasan Omar Ali, Anns Mahboob, Hazrat Ali, Zubair Shah, Omar Aboumarzouk, Abdulla Al Ansari, Shidin Balakrishnan

    Abstract: Applying deep learning (DL) for annotating surgical instruments in robot-assisted minimally invasive surgeries (MIS) represents a significant advancement in surgical technology. This systematic review examines 48 studies that and advanced DL methods and architectures. These sophisticated DL models have shown notable improvements in the precision and efficiency of detecting and segmenting surgical… ▽ More

    Submitted 7 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: 57 pages, 9 figures, Published in Artificial Intelligence Reviews journal <https://link.springer.com/journal/10462>

  8. arXiv:2406.19730  [pdf, other

    quant-ph cs.CR

    Quantum-Enhanced Secure Approval Voting Protocol

    Authors: Saiyam Sakhuja, S. Balakrishnan

    Abstract: In a world where elections touch every aspect of society, the need for secure voting is paramount. Traditional safeguards, based on classical cryptography, rely on complex math problems like factoring large numbers. However, quantum computing is changing the game. Recent advances in quantum technology suggest that classical cryptographic methods may not be as secure as we thought. This paper intro… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  9. arXiv:2312.03318  [pdf, other

    cs.LG cs.CV stat.ML

    Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift

    Authors: Saurabh Garg, Amrith Setlur, Zachary Chase Lipton, Sivaraman Balakrishnan, Virginia Smith, Aditi Raghunathan

    Abstract: Self-training and contrastive learning have emerged as leading techniques for incorporating unlabeled data, both under distribution shift (unsupervised domain adaptation) and when it is absent (semi-supervised learning). However, despite the popularity and compatibility of these techniques, their efficacy in combination remains unexplored. In this paper, we undertake a systematic empirical investi… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  10. arXiv:2305.19570  [pdf, other

    stat.ML cs.LG

    Online Label Shift: Optimal Dynamic Regret meets Practical Algorithms

    Authors: Dheeraj Baby, Saurabh Garg, Tzu-Ching Yen, Sivaraman Balakrishnan, Zachary Chase Lipton, Yu-Xiang Wang

    Abstract: This paper focuses on supervised and unsupervised online label shift, where the class marginals $Q(y)$ varies but the class-conditionals $Q(x|y)$ remain invariant. In the unsupervised setting, our goal is to adapt a learner, trained on some offline labeled data, to changing label distributions given unlabeled online data. In the supervised setting, we must both learn a classifier and adapt to the… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: First three authors contributed equally

  11. arXiv:2302.03020  [pdf, other

    cs.LG cs.CV stat.ML

    RLSbench: Domain Adaptation Under Relaxed Label Shift

    Authors: Saurabh Garg, Nick Erickson, James Sharpnack, Alex Smola, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: Despite the emergence of principled methods for domain adaptation under label shift, their sensitivity to shifts in class conditional distributions is precariously under explored. Meanwhile, popular deep domain adaptation heuristics tend to falter when faced with label proportions shifts. While several papers modify these heuristics in attempts to handle label proportions shifts, inconsistencies i… ▽ More

    Submitted 5 June, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at ICML 2023. Paper website: https://sites.google.com/view/rlsbench/

  12. arXiv:2212.07365  [pdf, other

    eess.SY cs.LG

    Learning Invariant Subspaces of Koopman Operators--Part 2: Heterogeneous Dictionary Mixing to Approximate Subspace Invariance

    Authors: Charles A. Johnson, Shara Balakrishnan, Enoch Yeung

    Abstract: This work builds on the models and concepts presented in part 1 to learn approximate dictionary representations of Koopman operators from data. Part I of this paper presented a methodology for arguing the subspace invariance of a Koopman dictionary. This methodology was demonstrated on the state-inclusive logistic lifting (SILL) basis. This is an affine basis augmented with conjunctive logistic fu… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 16 pages, 6 figures. arXiv admin note: substantial text overlap with arXiv:2206.13585

  13. arXiv:2212.07358  [pdf, other

    eess.SY cs.LG

    Learning Invariant Subspaces of Koopman Operators--Part 1: A Methodology for Demonstrating a Dictionary's Approximate Subspace Invariance

    Authors: Charles A. Johnson, Shara Balakrishnan, Enoch Yeung

    Abstract: Koopman operators model nonlinear dynamics as a linear dynamic system acting on a nonlinear function as the state. This nonstandard state is often called a Koopman observable and is usually approximated numerically by a superposition of functions drawn from a dictionary. In a widely used algorithm, Extended Dynamic Mode Decomposition, the dictionary functions are drawn from a fixed class of functi… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 13 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:2206.13585

  14. arXiv:2211.05584  [pdf, other

    cs.CL cs.AI cs.CE cs.LG

    Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis

    Authors: Sudhandar Balakrishnan, Yihao Fang, Xioadan Zhu

    Abstract: The invention of transformer-based models such as BERT, GPT, and RoBERTa has enabled researchers and financial companies to finetune these powerful models and use them in different downstream tasks to achieve state-of-the-art performance. Recently, a lightweight alternative (approximately 0.1% - 3% of the original model parameters) to fine-tuning, known as prefix tuning has been introduced. This m… ▽ More

    Submitted 25 October, 2022; originally announced November 2022.

    Comments: Accepted at the FinNLP workshop part of the EMNLP 2022 conference

  15. arXiv:2211.02093  [pdf, other

    cs.LG stat.ML

    Domain Adaptation under Missingness Shift

    Authors: Helen Zhou, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: Rates of missing data often depend on record-keeping policies and thus may change across times and locations, even when the underlying features are comparatively stable. In this paper, we introduce the problem of Domain Adaptation under Missingness Shift (DAMS). Here, (labeled) source data and (unlabeled) target data would be exchangeable but for different missing data mechanisms. We show that if… ▽ More

    Submitted 3 May, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  16. arXiv:2209.10860  [pdf, other

    cs.LG cs.AI cs.CY

    SCALES: From Fairness Principles to Constrained Decision-Making

    Authors: Sreejith Balakrishnan, Jianxin Bi, Harold Soh

    Abstract: This paper proposes SCALES, a general framework that translates well-established fairness principles into a common representation based on the Constraint Markov Decision Process (CMDP). With the help of causal language, our framework can place constraints on both the procedure of decision making (procedural fairness) as well as the outcomes resulting from decisions (outcome fairness). Specifically… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: Accepted to the 2022 AAAI/ACM Conference on AI, Ethics, and Society (AIES '22), Updated version with additional citations, 14 pages

  17. arXiv:2207.13048  [pdf, other

    cs.LG

    Domain Adaptation under Open Set Label Shift

    Authors: Saurabh Garg, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: We introduce the problem of domain adaptation under Open Set Label Shift (OSLS) where the label distribution can change arbitrarily and a new class may arrive during deployment, but the class-conditional distributions p(x|y) are domain-invariant. OSLS subsumes domain adaptation under label shift and Positive-Unlabeled (PU) learning. The learner's goals here are two-fold: (a) estimate the target la… ▽ More

    Submitted 16 October, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted at NeurIPS 2022

  18. arXiv:2206.13585  [pdf, other

    eess.SY cs.LG

    Heterogeneous mixtures of dictionary functions to approximate subspace invariance in Koopman operators

    Authors: Charles A. Johnson, Shara Balakrishnan, Enoch Yeung

    Abstract: Koopman operators model nonlinear dynamics as a linear dynamic system acting on a nonlinear function as the state. This nonstandard state is often called a Koopman observable and is usually approximated numerically by a superposition of functions drawn from a \textit{dictionary}. A widely used algorithm, is \textit{Extended Dynamic Mode Decomposition}, where the dictionary functions are drawn from… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: 14 pages, 7 figures, journal paper

  19. arXiv:2205.03316  [pdf, other

    cs.LG eess.SY

    Application of Clustering Algorithms for Dimensionality Reduction in Infrastructure Resilience Prediction Models

    Authors: Srijith Balakrishnan, Beatrice Cassottana, Arun Verma

    Abstract: Recent studies increasingly adopt simulation-based machine learning (ML) models to analyze critical infrastructure system resilience. For realistic applications, these ML models consider the component-level characteristics that influence the network response during emergencies. However, such an approach could result in a large number of features and cause ML models to suffer from the `curse of dim… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  20. arXiv:2201.04234  [pdf, other

    cs.LG stat.ML

    Leveraging Unlabeled Data to Predict Out-of-Distribution Performance

    Authors: Saurabh Garg, Sivaraman Balakrishnan, Zachary C. Lipton, Behnam Neyshabur, Hanie Sedghi

    Abstract: Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions that may cause performance drops. In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data. We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on… ▽ More

    Submitted 14 October, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: Accepted at ICLR 2022

  21. arXiv:2111.00980  [pdf, other

    cs.LG stat.ML

    Mixture Proportion Estimation and PU Learning: A Modern Approach

    Authors: Saurabh Garg, Yifan Wu, Alex Smola, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: Given only positive examples and unlabeled examples (from both positive and negative classes), we might hope nevertheless to estimate an accurate positive-versus-negative classifier. Formally, this task is broken down into two subtasks: (i) Mixture Proportion Estimation (MPE) -- determining the fraction of positive examples in the unlabeled data; and (ii) PU-learning -- given such an estimate, lea… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: Spotlight at NeurIPS 2021

  22. arXiv:2108.11483  [pdf, other

    cs.LG math.OC stat.ML

    Heavy-tailed Streaming Statistical Estimation

    Authors: Che-Ping Tsai, Adarsh Prasad, Sivaraman Balakrishnan, Pradeep Ravikumar

    Abstract: We consider the task of heavy-tailed statistical estimation given streaming $p$-dimensional samples. This could also be viewed as stochastic optimization under heavy-tailed distributions, with an additional $O(p)$ space complexity constraint. We design a clipped stochastic gradient descent algorithm and provide an improved analysis, under a more nuanced condition on the noise of the stochastic gra… ▽ More

    Submitted 25 February, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

  23. arXiv:2107.10383  [pdf, ps, other

    eess.SY cs.LG cs.RO

    Online-Learning Deep Neuro-Adaptive Dynamic Inversion Controller for Model Free Control

    Authors: Nathan Lutes, K. Krishnamurthy, Venkata Sriram Siddhardh Nadendla, S. N. Balakrishnan

    Abstract: Adaptive methods are popular within the control literature due to the flexibility and forgiveness they offer in the area of modelling. Neural network adaptive control is favorable specifically for the powerful nature of the machine learning algorithm to approximate unknown functions and for the ability to relax certain constraints within traditional adaptive control. Deep neural networks are large… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: 8 pages, 4 fugures, manuscript under review for CDC'2021

  24. arXiv:2107.09888  [pdf, other

    eess.SY cs.AI cs.GT cs.MA

    Strategic Mitigation of Agent Inattention in Drivers with Open-Quantum Cognition Models

    Authors: Qizi Zhang, Venkata Sriram Siddhardh Nadendla, S. N. Balakrishnan, Jerome Busemeyer

    Abstract: State-of-the-art driver-assist systems have failed to effectively mitigate driver inattention and had minimal impacts on the ever-growing number of road mishaps (e.g. life loss, physical injuries due to accidents caused by various factors that lead to driver inattention). This is because traditional human-machine interaction settings are modeled in classical and behavioral game-theoretic domains w… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: 12 pages, 4 figures, submitted to IEEE Transactions on Human-Machine Systems

  25. arXiv:2105.00303  [pdf, other

    cs.LG stat.ML

    RATT: Leveraging Unlabeled Data to Guarantee Generalization

    Authors: Saurabh Garg, Sivaraman Balakrishnan, J. Zico Kolter, Zachary C. Lipton

    Abstract: To assess generalization, machine learning scientists typically either (i) bound the generalization gap and then (after training) plug in the empirical risk to obtain a bound on the true risk; or (ii) validate empirically on holdout data. However, (i) typically yields vacuous guarantees for overparameterized models. Furthermore, (ii) shrinks the training set and its guarantee erodes with each re-u… ▽ More

    Submitted 6 November, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: ICML 2021 (Long Talk)

  26. arXiv:2102.10264  [pdf, other

    cs.LG cs.RO stat.ML

    On Proximal Policy Optimization's Heavy-tailed Gradients

    Authors: Saurabh Garg, Joshua Zhanson, Emilio Parisotto, Adarsh Prasad, J. Zico Kolter, Zachary C. Lipton, Sivaraman Balakrishnan, Ruslan Salakhutdinov, Pradeep Ravikumar

    Abstract: Modern policy gradient algorithms such as Proximal Policy Optimization (PPO) rely on an arsenal of heuristics, including loss clipping and gradient clipping, to ensure successful learning. These heuristics are reminiscent of techniques from robust statistics, commonly used for estimation in outlier-rich (``heavy-tailed'') regimes. In this paper, we present a detailed empirical study to characteriz… ▽ More

    Submitted 12 July, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  27. arXiv:2012.14978  [pdf, other

    cs.CL cs.IR cs.LG

    Few-Shot Named Entity Recognition: A Comprehensive Study

    Authors: Jiaxin Huang, Chunyuan Li, Krishan Subudhi, Damien Jose, Shobana Balakrishnan, Weizhu Chen, Baolin Peng, Jianfeng Gao, Jiawei Han

    Abstract: This paper presents a comprehensive study to efficiently build named entity recognition (NER) systems when a small number of in-domain labeled data is available. Based upon recent Transformer-based self-supervised pre-trained language models (PLMs), we investigate three orthogonal schemes to improve the model generalization ability for few-shot settings: (1) meta-learning to construct prototypes f… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

  28. arXiv:2011.10947  [pdf, other

    cs.CR eess.SP

    Who is in Control? Practical Physical Layer Attack and Defense for mmWave based Sensing in Autonomous Vehicles

    Authors: Zhi Sun, Sarankumar Balakrishnan, Lu Su, Arupjyoti Bhuyan, Pu Wang, Chunming Qiao

    Abstract: With the wide bandwidths in millimeter wave (mmWave) frequency band that results in unprecedented accuracy, mmWave sensing has become vital for many applications, especially in autonomous vehicles (AVs). In addition, mmWave sensing has superior reliability compared to other sensing counterparts such as camera and LiDAR, which is essential for safety-critical driving. Therefore, it is critical to u… ▽ More

    Submitted 22 November, 2020; originally announced November 2020.

  29. arXiv:2011.08541  [pdf, other

    cs.LG cs.AI cs.RO

    Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

    Authors: Sreejith Balakrishnan, Quoc Phong Nguyen, Bryan Kian Hsiang Low, Harold Soh

    Abstract: The problem of inverse reinforcement learning (IRL) is relevant to a variety of tasks including value alignment and robot learning from demonstration. Despite significant algorithmic contributions in recent years, IRL remains an ill-posed problem at its core; multiple reward functions coincide with the observed behavior and the actual reward function is not identifiable without prior knowledge or… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Accepted to 34th Conference on Neural Information Processing Systems (NeurIPS 2020). Includes Appendix. 21 pages

  30. arXiv:2006.11909  [pdf, other

    stat.ML cs.IT cs.LG

    Two-Sample Testing on Ranked Preference Data and the Role of Modeling Assumptions

    Authors: Charvi Rastogi, Sivaraman Balakrishnan, Nihar B. Shah, Aarti Singh

    Abstract: A number of applications require two-sample testing on ranked preference data. For instance, in crowdsourcing, there is a long-standing question of whether pairwise comparison data provided by people is distributed similar to ratings-converted-to-comparisons. Other examples include sports data analysis and peer grading. In this paper, we design two-sample tests for pairwise comparison data and ran… ▽ More

    Submitted 18 November, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: 40 pages, 4 figures

  31. arXiv:2003.07554  [pdf, other

    cs.LG stat.ML

    A Unified View of Label Shift Estimation

    Authors: Saurabh Garg, Yifan Wu, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: Under label shift, the label distribution p(y) might change but the class-conditional distributions p(x|y) do not. There are two dominant approaches for estimating the label marginal. BBSE, a moment-matching approach based on confusion matrices, is provably consistent and provides interpretable error bounds. However, a maximum likelihood estimation approach, which we call MLLS, dominates empirical… ▽ More

    Submitted 16 October, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: Accepted at Neurips 2020

  32. Quasi-deterministic secure quantum communication using non-maximally entangled states

    Authors: Sujan Vijayaraj, S. Balakrishnan, K. Senthilnathan

    Abstract: Quantum communication in general helps deter potential eavesdropping in the course of transmission of bits to enable secure communication between two or more parties. In this paper, we propose a novel quasi-deterministic secure quantum communication scheme using non-maximally entangled states. The proposed scheme follows a simple procedure, and cases where the entanglement required can be signific… ▽ More

    Submitted 5 March, 2021; v1 submitted 7 December, 2019; originally announced December 2019.

    Journal ref: Int J Theor Phys, 60, 164 (2021)

  33. arXiv:1908.01089  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Path Length Bounds for Gradient Descent and Flow

    Authors: Chirag Gupta, Sivaraman Balakrishnan, Aaditya Ramdas

    Abstract: We derive bounds on the path length $ζ$ of gradient descent (GD) and gradient flow (GF) curves for various classes of smooth convex and nonconvex functions. Among other results, we prove that: (a) if the iterates are linearly convergent with factor $(1-c)$, then $ζ$ is at most $\mathcal{O}(1/c)$; (b) under the Polyak-Kurdyka-Lojasiewicz (PKL) condition, $ζ$ is at most $\mathcal{O}(\sqrtκ)$, where… ▽ More

    Submitted 19 March, 2021; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: 55 pages. Accepted for publication at the Journal of Machine Learning Research (JMLR, 2021)

  34. arXiv:1907.00927  [pdf, ps, other

    stat.ML cs.AI cs.LG

    A Unified Approach to Robust Mean Estimation

    Authors: Adarsh Prasad, Sivaraman Balakrishnan, Pradeep Ravikumar

    Abstract: In this paper, we develop connections between two seemingly disparate, but central, models in robust statistics: Huber's epsilon-contamination model and the heavy-tailed noise model. We provide conditions under which this connection provides near-statistically-optimal estimators. Building on this connection, we provide a simple variant of recent computationally-efficient algorithms for mean estima… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 51 pages, 6 figures

  35. arXiv:1902.03649  [pdf, other

    cs.CR cs.NI

    Physical Layer Identification based on Spatial-temporal Beam Features for Millimeter Wave Wireless Networks

    Authors: Sarankumar Balakrishnan, Shreya Gupta, Arupjyoti Bhuyan, Pu Wang, Dimitrios Koutsonikolas, Zhi Sun

    Abstract: With millimeter wave (mmWave) wireless communication envisioned to be the key enabler of next generation high data rate wireless networks, security is of paramount importance. While conventional security measures in wireless networks operate at a higher layer of the protocol stack, physical layer security utilizes unique device dependent hardware features to identify and authenticate legitimate de… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

    Comments: 14 pages, 30 figures

  36. arXiv:1806.03286  [pdf, other

    stat.ML cs.LG

    Regression with Comparisons: Escaping the Curse of Dimensionality with Ordinal Information

    Authors: Yichong Xu, Sivaraman Balakrishnan, Aarti Singh, Artur Dubrawski

    Abstract: In supervised learning, we typically leverage a fully labeled dataset to design methods for function estimation or prediction. In many practical situations, we are able to obtain alternative feedback, possibly at a low cost. A broad goal is to understand the usefulness of, and to design algorithms to exploit, this alternative feedback. In this paper, we consider a semi-supervised regression settin… ▽ More

    Submitted 6 November, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: 52 pages, 11 figures; Preliminary version in International Conference on Machine Learning 2018

    Journal ref: Journal of Machine Learning Research 21 (2020) 1-54

  37. arXiv:1805.10406  [pdf, other

    math.ST cs.DS cs.LG stat.ME stat.ML

    Robust Nonparametric Regression under Huber's $ε$-contamination Model

    Authors: Simon S. Du, Yining Wang, Sivaraman Balakrishnan, Pradeep Ravikumar, Aarti Singh

    Abstract: We consider the non-parametric regression problem under Huber's $ε$-contamination model, in which an $ε$ fraction of observations are subject to arbitrary adversarial noise. We first show that a simple local binning median step can effectively remove the adversary noise and this median estimator is minimax optimal up to absolute constants over the Hölder function class with smoothness parameters s… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

  38. arXiv:1805.07883  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    How Many Samples are Needed to Estimate a Convolutional or Recurrent Neural Network?

    Authors: Simon S. Du, Yining Wang, Xiyu Zhai, Sivaraman Balakrishnan, Ruslan Salakhutdinov, Aarti Singh

    Abstract: It is widely believed that the practical success of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) owes to the fact that CNNs and RNNs use a more compact parametric representation than their Fully-Connected Neural Network (FNN) counterparts, and consequently require fewer training examples to accurately estimate their parameters. We initiate the study of rigorously chara… ▽ More

    Submitted 29 June, 2019; v1 submitted 20 May, 2018; originally announced May 2018.

    Comments: Revised version, with new results on recurrent neural networks. Preliminary version in NeurIPS 2018

  39. arXiv:1803.08586  [pdf, ps, other

    stat.ML cs.LG math.ST

    Optimization of Smooth Functions with Noisy Observations: Local Minimax Rates

    Authors: Yining Wang, Sivaraman Balakrishnan, Aarti Singh

    Abstract: We consider the problem of global optimization of an unknown non-convex smooth function with zeroth-order feedback. In this setup, an algorithm is allowed to adaptively query the underlying function at different locations and receives noisy evaluations of function values at the queried points (i.e. the algorithm has access to zeroth-order information). Optimization performance is evaluated by the… ▽ More

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: 29 pages, 1 figure

  40. arXiv:1802.06485  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Robust Estimation via Robust Gradient Estimation

    Authors: Adarsh Prasad, Arun Sai Suggala, Sivaraman Balakrishnan, Pradeep Ravikumar

    Abstract: We provide a new computationally-efficient class of estimators for risk minimization. We show that these estimators are robust for general statistical models: in the classical Huber epsilon-contamination model and in heavy-tailed settings. Our workhorse is a novel robust variant of gradient descent, and we provide conditions under which our gradient descent variant provides accurate estimators in… ▽ More

    Submitted 20 April, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

    Comments: 48 pages, 5 figures

  41. arXiv:1712.06120  [pdf, other

    stat.ML cs.IT stat.ME

    Hypothesis Testing for High-Dimensional Multinomials: A Selective Review

    Authors: Sivaraman Balakrishnan, Larry Wasserman

    Abstract: The statistical analysis of discrete data has been the subject of extensive statistical research dating back to the work of Pearson. In this survey we review some recently developed methods for testing hypotheses about high-dimensional multinomials. Traditional tests like the $χ^2$ test and the likelihood ratio test can have poor power in the high-dimensional setting. Much of the research in this… ▽ More

    Submitted 17 December, 2017; originally announced December 2017.

    Comments: 19 pages, 6 figures. Written in memory of Stephen E. Fienberg

  42. arXiv:1710.10551  [pdf, other

    stat.ML cs.LG

    Stochastic Zeroth-order Optimization in High Dimensions

    Authors: Yining Wang, Simon Du, Sivaraman Balakrishnan, Aarti Singh

    Abstract: We consider the problem of optimizing a high-dimensional convex function using stochastic zeroth-order queries. Under sparsity assumptions on the gradients or function values, we present two algorithms: a successive component/feature selection algorithm and a noisy mirror descent algorithm using Lasso gradient estimates, and show that both algorithms have convergence rates that de- pend only logar… ▽ More

    Submitted 25 February, 2018; v1 submitted 28 October, 2017; originally announced October 2017.

    Comments: Camera-ready version at AISTATS 2018

  43. arXiv:1709.00127  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Low Permutation-rank Matrices: Structural Properties and Noisy Completion

    Authors: Nihar B. Shah, Sivaraman Balakrishnan, Martin J. Wainwright

    Abstract: We consider the problem of noisy matrix completion, in which the goal is to reconstruct a structured matrix whose entries are partially observed in noise. Standard approaches to this underdetermined inverse problem are based on assuming that the underlying matrix has low rank, or is well-approximated by a low rank matrix. In this paper, we propose a richer model based on what we term the "permutat… ▽ More

    Submitted 31 August, 2017; originally announced September 2017.

  44. arXiv:1706.10003  [pdf, other

    math.ST cs.IT cs.LG stat.ML

    Hypothesis Testing For Densities and High-Dimensional Multinomials: Sharp Local Minimax Rates

    Authors: Sivaraman Balakrishnan, Larry Wasserman

    Abstract: We consider the goodness-of-fit testing problem of distinguishing whether the data are drawn from a specified distribution, versus a composite alternative separated from the null in the total variation metric. In the discrete case, we consider goodness-of-fit testing when the null distribution has a possibly growing or unbounded number of categories. In the continuous case, we consider testing a L… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

    Comments: 60 pages, 6 figures

  45. arXiv:1702.07709  [pdf, ps, other

    stat.ML cs.DS cs.LG

    Computationally Efficient Robust Estimation of Sparse Functionals

    Authors: Simon S. Du, Sivaraman Balakrishnan, Aarti Singh

    Abstract: Many conventional statistical procedures are extremely sensitive to seemingly minor deviations from modeling assumptions. This problem is exacerbated in modern high-dimensional settings, where the problem dimension can grow with and possibly exceed the sample size. We consider the problem of robust estimation of sparse functionals, and provide a computationally and statistically efficient algorith… ▽ More

    Submitted 24 February, 2017; originally announced February 2017.

  46. arXiv:1702.02686  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Rate Optimal Estimation and Confidence Intervals for High-dimensional Regression with Missing Covariates

    Authors: Yining Wang, Jialei Wang, Sivaraman Balakrishnan, Aarti Singh

    Abstract: Although a majority of the theoretical literature in high-dimensional statistics has focused on settings which involve fully-observed data, settings with missing values and corruptions are common in practice. We consider the problems of estimation and of constructing component-wise confidence intervals in a sparse high-dimensional linear regression model when some covariates of the design matrix a… ▽ More

    Submitted 2 November, 2017; v1 submitted 8 February, 2017; originally announced February 2017.

    Comments: 41 pages, 1 figure, 3 tables

  47. arXiv:1609.00978  [pdf, ps, other

    stat.ML cs.LG math.OC

    Local Maxima in the Likelihood of Gaussian Mixture Models: Structural Results and Algorithmic Consequences

    Authors: Chi Jin, Yuchen Zhang, Sivaraman Balakrishnan, Martin J. Wainwright, Michael Jordan

    Abstract: We provide two fundamental results on the population (infinite-sample) likelihood function of Gaussian mixture models with $M \geq 3$ components. Our first main result shows that the population likelihood function has bad local maxima even in the special case of equally-weighted mixtures of well-separated and spherical Gaussians. We prove that the log-likelihood value of these bad local maxima can… ▽ More

    Submitted 4 September, 2016; originally announced September 2016.

    Comments: Neural Information Processing Systems (NIPS) 2016

  48. arXiv:1606.09632  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    A Permutation-based Model for Crowd Labeling: Optimal Estimation and Robustness

    Authors: Nihar B. Shah, Sivaraman Balakrishnan, Martin J. Wainwright

    Abstract: The task of aggregating and denoising crowd-labeled data has gained increased significance with the advent of crowdsourcing platforms and massive datasets. We propose a permutation-based model for crowd labeled data that is a significant generalization of the classical Dawid-Skene model, and introduce a new error metric by which to compare different estimators. We derive global minimax rates for t… ▽ More

    Submitted 10 January, 2021; v1 submitted 30 June, 2016; originally announced June 2016.

    Comments: in IEEE Transactions on Information Theory (online), 2020

  49. Arbitrage-Free Combinatorial Market Making via Integer Programming

    Authors: Christian Kroer, Miroslav Dudík, Sébastien Lahaie, Sivaraman Balakrishnan

    Abstract: We present a new combinatorial market maker that operates arbitrage-free combinatorial prediction markets specified by integer programs. Although the problem of arbitrage-free pricing, while maintaining a bound on the subsidy provided by the market maker, is #P-hard in the worst case, we posit that the typical case might be amenable to modern integer programming (IP) solvers. At the crux of our me… ▽ More

    Submitted 10 June, 2016; v1 submitted 9 June, 2016; originally announced June 2016.

  50. arXiv:1603.06881  [pdf, ps, other

    cs.LG cs.AI cs.IT stat.ML

    Feeling the Bern: Adaptive Estimators for Bernoulli Probabilities of Pairwise Comparisons

    Authors: Nihar B. Shah, Sivaraman Balakrishnan, Martin J. Wainwright

    Abstract: We study methods for aggregating pairwise comparison data in order to estimate outcome probabilities for future comparisons among a collection of n items. Working within a flexible framework that imposes only a form of strong stochastic transitivity (SST), we introduce an adaptivity index defined by the indifference sets of the pairwise comparison probabilities. In addition to measuring the usual… ▽ More

    Submitted 22 March, 2016; originally announced March 2016.