Skip to main content

Showing 1–50 of 67 results for author: Lei, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.02665  [pdf, ps, other

    cs.LG

    Beyond Invisibility: Learning Robust Visible Watermarks for Stronger Copyright Protection

    Authors: Tianci Liu, Tong Yang, Quan Zhang, Qi Lei

    Abstract: As AI advances, copyrighted content faces growing risk of unauthorized use, whether through model training or direct misuse. Building upon invisible adversarial perturbation, recent works developed copyright protections against specific AI techniques such as unauthorized personalization through DreamBooth that are misused. However, these methods offer only short-term security, as they require retr… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: UAI 2025

  2. arXiv:2505.24097  [pdf, ps, other

    stat.ML cs.LG

    Performative Risk Control: Calibrating Models for Reliable Deployment under Performativity

    Authors: Victor Li, Baiting Chen, Yuzhen Mao, Qi Lei, Zhun Deng

    Abstract: Calibrating blackbox machine learning models to achieve risk control is crucial to ensure reliable decision-making. A rich line of literature has been studying how to calibrate a model so that its predictions satisfy explicit finite-sample statistical guarantees under a fixed, static, and unknown data-generating distribution. However, prediction-supported decisions may influence the outcome they a… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  3. arXiv:2505.22829  [pdf, ps, other

    cs.LG cs.AI

    Bridging Distribution Shift and AI Safety: Conceptual and Methodological Synergies

    Authors: Chenruo Liu, Kenan Tang, Yao Qin, Qi Lei

    Abstract: This paper bridges distribution shift and AI safety through a comprehensive analysis of their conceptual and methodological synergies. While prior discussions often focus on narrow cases or informal analogies, we establish two types connections between specific causes of distribution shift and fine-grained AI safety issues: (1) methods addressing a specific shift type can help achieve correspondin… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 35 pages

  4. arXiv:2502.13954  [pdf, other

    cs.CL cs.LG

    Latent Distribution Decoupling: A Probabilistic Framework for Uncertainty-Aware Multimodal Emotion Recognition

    Authors: Jingwang Huang, Jiang Zhong, Qin Lei, Jinpeng Gao, Yuming Yang, Sirui Wang, Peiguang Li, Kaiwen Wei

    Abstract: Multimodal multi-label emotion recognition (MMER) aims to identify the concurrent presence of multiple emotions in multimodal data. Existing studies primarily focus on improving fusion strategies and modeling modality-to-label dependencies. However, they often overlook the impact of \textbf{aleatoric uncertainty}, which is the inherent noise in the multimodal data and hinders the effectiveness of… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  5. arXiv:2502.09850  [pdf, other

    cs.LG

    Elastic Representation: Mitigating Spurious Correlations for Group Robustness

    Authors: Tao Wen, Zihan Wang, Quan Zhang, Qi Lei

    Abstract: Deep learning models can suffer from severe performance degradation when relying on spurious correlations between input features and labels, making the models perform well on training data but have poor prediction accuracy for minority groups. This problem arises especially when training data are limited or imbalanced. While most prior work focuses on learning invariant features (with consistent c… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: Accepted at AISTATS 2025

  6. arXiv:2502.08505  [pdf, other

    cs.LG

    Bridging Domain Adaptation and Graph Neural Networks: A Tensor-Based Framework for Effective Label Propagation

    Authors: Tao Wen, Elynn Chen, Yuzhou Chen, Qi Lei

    Abstract: Graph Neural Networks (GNNs) have recently become the predominant tools for studying graph data. Despite state-of-the-art performance on graph classification tasks, GNNs are overwhelmingly trained in a single domain under supervision, thus necessitating a prohibitively high demand for labels and resulting in poorly transferable representations. To address this challenge, we propose the Label-Propa… ▽ More

    Submitted 15 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  7. arXiv:2502.05075  [pdf, ps, other

    cs.LG math.NA stat.ML

    Discrepancies are Virtue: Weak-to-Strong Generalization through Lens of Intrinsic Dimension

    Authors: Yijun Dong, Yicheng Li, Yunai Li, Jason D. Lee, Qi Lei

    Abstract: Weak-to-strong (W2S) generalization is a type of finetuning (FT) where a strong (large) student model is trained on pseudo-labels generated by a weak teacher. Surprisingly, W2S FT often outperforms the weak teacher. We seek to understand this phenomenon through the observation that FT often occurs in intrinsically low-dimensional spaces. Leveraging the low intrinsic dimensionality of FT, we analyz… ▽ More

    Submitted 20 June, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: ICML 2025

  8. arXiv:2412.03593  [pdf, other

    cs.CL cs.AI cs.LG

    CovidLLM: A Robust Large Language Model with Missing Value Adaptation and Multi-Objective Learning Strategy for Predicting Disease Severity and Clinical Outcomes in COVID-19 Patients

    Authors: Shengjun Zhu, Siyu Liu, Yang Li, Qing Lei, Hongyan Hou, Hewei Jiang, Shujuan Guo, Feng Wang, Rongshang Chen, Xionglin Fan, Shengce Tao, Jiaxin Cai

    Abstract: Coronavirus Disease 2019 (COVID-19), which emerged in 2019, has caused millions of deaths worldwide. Although effective vaccines have been developed to mitigate severe symptoms, certain populations, particularly the elderly and those with comorbidities, remain at high risk for severe outcomes and increased mortality. Consequently, early identification of the severity and clinical outcomes of the d… ▽ More

    Submitted 28 November, 2024; originally announced December 2024.

  9. arXiv:2411.15583  [pdf, other

    cs.HC

    Exploring Viewing Modalities in Cinematic Virtual Reality: A Systematic Review and Meta-Analysis of Challenges in Evaluating User Experience

    Authors: Yawen Zhang, Han Zhou, Zhoumingju Jiang, Zilu Tang, Tao Luo, Qinyuan Lei

    Abstract: Cinematic Virtual Reality (CVR) is a narrative-driven VR experience that uses head-mounted displays with a 360-degree field of view. Previous research has explored different viewing modalities to enhance viewers' CVR experience. This study conducted a systematic review and meta-analysis focusing on how different viewing modalities, including intervened rotation, avatar assistance, guidance cues, a… ▽ More

    Submitted 23 November, 2024; originally announced November 2024.

    Comments: 29 pages, recommend for acceptance by CSCW

  10. arXiv:2411.14086  [pdf, other

    cs.RO

    Path-Tracking Hybrid A* and Hierarchical MPC Framework for Autonomous Agricultural Vehicles

    Authors: Mingke Lu, Han Gao, Haijie Dai, Qianli Lei, Chang Liu

    Abstract: We propose a Path-Tracking Hybrid A* planner coupled with a hierarchical Model Predictive Control (MPC) framework for path smoothing in agricultural vehicles. The goal is to minimize deviation from reference paths during cross-furrow operations, thereby optimizing operational efficiency, preventing crop and soil damage, while also enforcing curvature constraints and ensuring full-body collision av… ▽ More

    Submitted 17 May, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

  11. arXiv:2411.03746  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Optimal Defenses Against Gradient Reconstruction Attacks

    Authors: Yuxiao Chen, Gamze Gürsoy, Qi Lei

    Abstract: Federated Learning (FL) is designed to prevent data leakage through collaborative model training without centralized data storage. However, it remains vulnerable to gradient reconstruction attacks that recover original training data from shared gradients. To optimize the trade-off between data leakage and utility loss, we first derive a theoretical lower bound of reconstruction error (among all at… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: The code for this project is available at https://github.com/cyx78/Optimal_Defenses_Against_Gradient_Reconstruction_Attacks

  12. arXiv:2410.23904  [pdf, other

    cs.CV

    EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection

    Authors: Qinqian Lei, Bo Wang, Robby T. Tan

    Abstract: Detecting Human-Object Interactions (HOI) in zero-shot settings, where models must handle unseen classes, poses significant challenges. Existing methods that rely on aligning visual encoders with large Vision-Language Models (VLMs) to tap into the extensive knowledge of VLMs, require large, computationally expensive models and encounter training difficulties. Adapting VLMs with prompt learning off… ▽ More

    Submitted 18 December, 2024; v1 submitted 31 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024

  13. arXiv:2410.21331  [pdf, other

    cs.LG cs.AI

    Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness

    Authors: Qi Zhang, Yifei Wang, Jingyi Cui, Xiang Pan, Qi Lei, Stefanie Jegelka, Yisen Wang

    Abstract: Deep learning models often suffer from a lack of interpretability due to polysemanticity, where individual neurons are activated by multiple unrelated semantics, resulting in unclear attributions of model behavior. Recent advances in monosemanticity, where neurons correspond to consistent and distinct semantics, have significantly improved interpretability but are commonly believed to compromise a… ▽ More

    Submitted 27 October, 2024; originally announced October 2024.

  14. arXiv:2407.19126  [pdf, other

    cs.AI

    Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining

    Authors: Jianwei Li, Yijun Dong, Qi Lei

    Abstract: To remove redundant components of large language models (LLMs) without incurring significant computational costs, this work focuses on single-shot pruning without a retraining phase. We simplify the pruning process for Transformer-based LLMs by identifying a depth-2 pruning structure that functions independently. Additionally, we propose two inference-aware pruning criteria derived from the optimi… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  15. arXiv:2407.08209  [pdf, other

    cs.CV

    Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets

    Authors: Qin Lei, Jiang Zhong, Qizhu Dai

    Abstract: Curvilinear object segmentation plays a crucial role across various applications, yet datasets in this domain often suffer from small scale due to the high costs associated with data acquisition and annotation. To address these challenges, this paper introduces a novel approach for expanding curvilinear object segmentation datasets, focusing on enhancing the informativeness of generated data and t… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  16. arXiv:2407.06120  [pdf, other

    cs.LG math.NA stat.ML

    Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning

    Authors: Yijun Dong, Hoang Phan, Xiang Pan, Qi Lei

    Abstract: We revisit data selection in a modern context of finetuning from a fundamental perspective. Extending the classical wisdom of variance minimization in low dimensions to high-dimensional finetuning, our generalization analysis unveils the importance of additionally reducing bias induced by low-rank approximation. Inspired by the variance-bias tradeoff in high dimensions from the theory, we introduc… ▽ More

    Submitted 16 November, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: NeurIPS 2024

  17. arXiv:2406.19617  [pdf, ps, other

    cs.LG cs.IT math.OC

    Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity

    Authors: Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee

    Abstract: Optimization of convex functions under stochastic zeroth-order feedback has been a major and challenging question in online learning. In this work, we consider the problem of optimizing second-order smooth and strongly convex functions where the algorithm is only accessible to noisy evaluations of the objective function it queries. We provide the first tight characterization for the rate of the mi… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  18. arXiv:2403.09164  [pdf, other

    cs.CL stat.AP

    Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge

    Authors: Li Yizhen, Huang Shaohan, Qi Jiaxing, Quan Lei, Han Dongran, Luan Zhongzhi

    Abstract: No previous work has studied the performance of Large Language Models (LLMs) in the context of Traditional Chinese Medicine (TCM), an essential and distinct branch of medical knowledge with a rich history. To bridge this gap, we present a TCM question dataset named TCM-QA, which comprises three question types: single choice, multiple choice, and true or false, to examine the LLM's capacity for kno… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  19. arXiv:2403.06424  [pdf, other

    stat.ML cs.CV cs.LG

    Bridging Domains with Approximately Shared Features

    Authors: Ziliang Samuel Zhong, Xiang Pan, Qi Lei

    Abstract: Multi-source domain adaptation aims to reduce performance degradation when applying machine learning models to unseen domains. A fundamental challenge is devising the optimal strategy for feature selection. Existing literature is somewhat paradoxical: some advocate for learning invariant features from source domains, while others favor more diverse features. To address the challenge, we propose a… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  20. arXiv:2403.02695  [pdf, other

    cs.LG

    Controllable Prompt Tuning For Balancing Group Distributional Robustness

    Authors: Hoang Phan, Andrew Gordon Wilson, Qi Lei

    Abstract: Models trained on data composed of different groups or domains can suffer from severe performance degradation under distribution shifts. While recent methods have largely focused on optimizing the worst-group objective, this often comes at the expense of good performance on other groups. To address this problem, we introduce an optimization scheme to achieve good performance across groups and find… ▽ More

    Submitted 4 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning

  21. arXiv:2402.09478  [pdf, other

    cs.CR cs.LG

    Data Reconstruction Attacks and Defenses: A Systematic Evaluation

    Authors: Sheng Liu, Zihan Wang, Yuxiao Chen, Qi Lei

    Abstract: Reconstruction attacks and defenses are essential in understanding the data leakage problem in machine learning. However, prior work has centered around empirical observations of gradient inversion attacks, lacks theoretical grounding, and cannot disentangle the usefulness of defending methods from the computational limitation of attacking methods. In this work, we propose to view the problem as a… ▽ More

    Submitted 21 March, 2025; v1 submitted 13 February, 2024; originally announced February 2024.

  22. arXiv:2401.15530  [pdf, ps, other

    cs.LG cs.IT

    An Information-Theoretic Analysis of In-Context Learning

    Authors: Hong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy

    Abstract: Previous theoretical results pertaining to meta-learning on sequences build on contrived assumptions and are somewhat convoluted. We introduce new information-theoretic tools that lead to an elegant and very general decomposition of error into three components: irreducible error, meta-learning error, and intra-task error. These tools unify analyses across many meta-learning challenges. To illustra… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  23. arXiv:2312.10586  [pdf, other

    cs.CV

    Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOI

    Authors: Qinqian Lei, Bo Wang, Robby T. Tan

    Abstract: Detecting human-object interactions (HOI) in a few-shot setting remains a challenge. Existing meta-learning methods struggle to extract representative features for classification due to the limited data, while existing few-shot HOI models rely on HOI text labels for classification. Moreover, some query images may display visual similarity to those outside their class, such as similar backgrounds b… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 9 pages, 4 figures

  24. arXiv:2312.05720  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    Beyond Gradient and Priors in Privacy Attacks: Leveraging Pooler Layer Inputs of Language Models in Federated Learning

    Authors: Jianwei Li, Sheng Liu, Qi Lei

    Abstract: Language models trained via federated learning (FL) demonstrate impressive capabilities in handling complex tasks while protecting user privacy. Recent studies indicate that leveraging gradient information and prior knowledge can potentially reveal training samples within FL setting. However, these investigations have overlooked the potential privacy risks tied to the intrinsic architecture of the… ▽ More

    Submitted 15 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

  25. arXiv:2310.13191  [pdf, other

    cs.CL cs.AI

    Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models

    Authors: Jianwei Li, Qi Lei, Wei Cheng, Dongkuan Xu

    Abstract: The pruning objective has recently extended beyond accuracy and sparsity to robustness in language models. Despite this, existing methods struggle to enhance robustness against adversarial attacks when continually increasing model sparsity and require a retraining process. As humans step into the era of large language models, these issues become increasingly prominent. This paper proposes that the… ▽ More

    Submitted 10 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  26. arXiv:2310.13183  [pdf, other

    cs.CV cs.CL

    Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection

    Authors: Jianwei Li, Weizhi Gao, Qi Lei, Dongkuan Xu

    Abstract: It is widely acknowledged that large and sparse models have higher accuracy than small and dense models under the same model size constraints. This motivates us to train a large model and then remove its redundant neurons or weights by pruning. Most existing works pruned the networks in a deterministic way, the performance of which solely depends on a single pruning criterion and thus lacks variet… ▽ More

    Submitted 10 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  27. arXiv:2307.11030  [pdf, other

    stat.ML cs.LG

    Cluster-aware Semi-supervised Learning: Relational Knowledge Distillation Provably Learns Clustering

    Authors: Yijun Dong, Kevin Miller, Qi Lei, Rachel Ward

    Abstract: Despite the empirical success and practical significance of (relational) knowledge distillation that matches (the relations of) features between teacher and student models, the corresponding theoretical interpretations remain limited for various knowledge distillation paradigms. In this work, we take an initial step toward a theoretical understanding of relational knowledge distillation (RKD), wit… ▽ More

    Submitted 23 October, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  28. arXiv:2306.12383  [pdf, ps, other

    cs.LG stat.ML

    Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms

    Authors: Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee

    Abstract: In stochastic zeroth-order optimization, a problem of practical relevance is understanding how to fully exploit the local geometry of the underlying objective function. We consider a fundamental setting in which the objective function is quadratic, and provide the first tight characterization of the optimal Hessian-dependent sample complexity. Our contribution is twofold. First, from an informatio… ▽ More

    Submitted 25 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  29. arXiv:2212.03714  [pdf, other

    cs.LG cs.CR stat.ML

    Reconstructing Training Data from Model Gradient, Provably

    Authors: Zihan Wang, Jason D. Lee, Qi Lei

    Abstract: Understanding when and how much a model gradient leaks information about the training sample is an important question in privacy. In this paper, we present a surprising result: even without training or memorizing the data, we can fully reconstruct the training samples from a single gradient query at a randomly chosen parameter value. We prove the identifiability of the training data under mild con… ▽ More

    Submitted 10 June, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

  30. arXiv:2211.13723  [pdf, other

    cs.LG cs.AI

    Improving Multi-task Learning via Seeking Task-based Flat Regions

    Authors: Hoang Phan, Lam Tran, Quyen Tran, Ngoc N. Tran, Tuan Truong, Qi Lei, Nhat Ho, Dinh Phung, Trung Le

    Abstract: Multi-Task Learning (MTL) is a widely-used and powerful learning paradigm for training deep neural networks that allows learning more than one objective by a single backbone. Compared to training tasks separately, MTL significantly reduces computational costs, improves data efficiency, and potentially enhances model performance by leveraging knowledge across tasks. Hence, it has been adopted in a… ▽ More

    Submitted 23 May, 2025; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: 35 pages, 17 figures, 7 tables

  31. arXiv:2210.13983  [pdf, other

    cs.LG

    Optimization for Amortized Inverse Problems

    Authors: Tianci Liu, Tong Yang, Quan Zhang, Qi Lei

    Abstract: Incorporating a deep generative model as the prior distribution in inverse problems has established substantial success in reconstructing images from corrupted observations. Notwithstanding, the existing optimization approaches use gradient descent largely without adapting to the non-convex nature of the problem and can be sensitive to initial values, impeding further performance improvement. In t… ▽ More

    Submitted 28 January, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

  32. arXiv:2209.14434  [pdf, other

    cs.CV cs.AI

    Efficient Medical Image Assessment via Self-supervised Learning

    Authors: Chun-Yin Huang, Qi Lei, Xiaoxiao Li

    Abstract: High-performance deep learning methods typically rely on large annotated training datasets, which are difficult to obtain in many clinical applications due to the high cost of medical image labeling. Existing data assessment methods commonly require knowing the labels in advance, which are not feasible to achieve our goal of 'knowing which data to label.' To this end, we formulate and propose a no… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  33. arXiv:2205.05236  [pdf, other

    cs.SI cs.DB

    Reconnecting the Estranged Relationships: Optimizing the Influence Propagation in Evolving Networks

    Authors: Taotao Cai, Qi Lei, Quan Z. Sheng, Shuiqiao Yang, Jian Yang, Wei Emma Zhang

    Abstract: Influence Maximization (IM), which aims to select a set of users from a social network to maximize the expected number of influenced users, has recently received significant attention for mass communication and commercial marketing. Existing research efforts dedicated to the IM problem depend on a strong assumption: the selected seed users are willing to spread the information after receiving bene… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  34. arXiv:2203.15664  [pdf, other

    cs.LG stat.ML

    Nearly Minimax Algorithms for Linear Bandits with Shared Representation

    Authors: Jiaqi Yang, Qi Lei, Jason D. Lee, Simon S. Du

    Abstract: We give novel algorithms for multi-task and lifelong linear bandits with shared representation. Specifically, we consider the setting where we play $M$ linear bandits with dimension $d$, each for $T$ rounds, and these $M$ bandit tasks share a common $k(\ll d)$ dimensional linear representation. For both the multi-task setting where we play the tasks concurrently, and the lifelong setting where we… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 19 pages, 3 figures

  35. arXiv:2202.12230  [pdf, other

    cs.LG

    Sample Efficiency of Data Augmentation Consistency Regularization

    Authors: Shuo Yang, Yijun Dong, Rachel Ward, Inderjit S. Dhillon, Sujay Sanghavi, Qi Lei

    Abstract: Data augmentation is popular in the training of large neural networks; currently, however, there is no clear theoretical comparison between different algorithmic choices on how to use augmented data. In this paper, we take a step in this direction - we first present a simple and novel analysis for linear regression with label invariant augmentations, demonstrating that data augmentation consistenc… ▽ More

    Submitted 16 June, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

  36. arXiv:2201.09020  [pdf, other

    cs.LG cs.CY

    Bi-CLKT: Bi-Graph Contrastive Learning based Knowledge Tracing

    Authors: Xiangyu Song, Jianxin Li, Qi Lei, Wei Zhao, Yunliang Chen, Ajmal Mian

    Abstract: The goal of Knowledge Tracing (KT) is to estimate how well students have mastered a concept based on their historical learning of related exercises. The benefit of knowledge tracing is that students' learning plans can be better organised and adjusted, and interventions can be made when necessary. With the recent rise of deep learning, Deep Knowledge Tracing (DKT) has utilised Recurrent Neural Net… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: 12pages, 2 figures

  37. Origami-inspired soft twisting actuator

    Authors: Diancheng Li, Dongliang Fan, Renjie Zhu, Qiaozhi Lei, Yuxuan Liao, Xin Yang, Yang Pan, Zheng Wang, Yang Wu, Sicong Liu, Hongqiang Wang

    Abstract: Soft actuators have shown great advantages in compliance and morphology matched for manipulation of delicate objects and inspection in a confined space. There is an unmet need for a soft actuator that can provide torsional motion to e.g. enlarge working space and increase degrees of freedom. Towards this goal, we present origami-inspired soft pneumatic actuators (OSPAs) made from silicone. The pro… ▽ More

    Submitted 2 November, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 9 figures. Soft Robotics (2022)

  38. arXiv:2110.09507  [pdf, other

    cs.LG stat.ML

    Provable Hierarchy-Based Meta-Reinforcement Learning

    Authors: Kurtland Chua, Qi Lei, Jason D. Lee

    Abstract: Hierarchical reinforcement learning (HRL) has seen widespread interest as an approach to tractable learning of complex modular behaviors. However, existing work either assume access to expert-constructed hierarchies, or use hierarchy-learning heuristics with no provable guarantees. To address this gap, we analyze HRL in the meta-RL setting, where a learner learns latent hierarchical structure duri… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  39. arXiv:2107.06466  [pdf, other

    cs.LG stat.ML

    Going Beyond Linear RL: Sample Efficient Neural Function Approximation

    Authors: Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

    Abstract: Deep Reinforcement Learning (RL) powered by neural net approximation of the Q function has had enormous empirical success. While the theory of RL has traditionally focused on linear function approximation (or eluder dimension) approaches, little is known about nonlinear RL with neural net approximations of the Q functions. This is the focus of this work, where we study function approximation with… ▽ More

    Submitted 25 December, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  40. arXiv:2107.04518  [pdf, ps, other

    cs.LG stat.ML

    Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

    Authors: Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

    Abstract: Bandit problems with linear or concave reward have been extensively studied, but relatively few works have studied bandits with non-concave reward. This work considers a large family of bandit problems where the unknown underlying reward function is non-concave, including the low-rank generalized linear bandit problems and two-layer neural network with polynomial activation bandit problem. For the… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

  41. arXiv:2107.02377  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    A Short Note on the Relationship of Information Gain and Eluder Dimension

    Authors: Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei

    Abstract: Eluder dimension and information gain are two widely used methods of complexity measures in bandit and reinforcement learning. Eluder dimension was originally proposed as a general complexity measure of function classes, but the common examples of where it is known to be small are function spaces (vector spaces). In these cases, the primary tool to upper bound the eluder dimension is the elliptic… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  42. arXiv:2106.12108  [pdf, other

    cs.LG stat.ML

    Near-Optimal Linear Regression under Distribution Shift

    Authors: Qi Lei, Wei Hu, Jason D. Lee

    Abstract: Transfer learning is essential when sufficient data comes from the source domain, with scarce labeled data from the target domain. We develop estimators that achieve minimax linear risk for linear regression problems under distribution shift. Our algorithms cover different transfer learning settings including covariate shift and model shift. We also consider when data are generated from either lin… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  43. arXiv:2105.02221  [pdf, other

    cs.LG stat.ML

    How Fine-Tuning Allows for Effective Meta-Learning

    Authors: Kurtland Chua, Qi Lei, Jason D. Lee

    Abstract: Representation learning has been widely studied in the context of meta-learning, enabling rapid learning of new tasks through shared representations. Recent works such as MAML have explored using fine-tuning-based metrics, which measure the ease by which fine-tuning can achieve good performance, as proxies for obtaining representations. We present a theoretical framework for analyzing representati… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

  44. arXiv:2102.11203  [pdf, other

    cs.LG cs.AI stat.ML

    A Theory of Label Propagation for Subpopulation Shift

    Authors: Tianle Cai, Ruiqi Gao, Jason D. Lee, Qi Lei

    Abstract: One of the central problems in machine learning is domain adaptation. Unlike past theoretical work, we consider a new model for subpopulation shift in the input or representation space. In this work, we propose a provably effective framework for domain adaptation based on label propagation. In our analysis, we use a simple but realistic expansion assumption, proposed in \citet{wei2021theoretical}.… ▽ More

    Submitted 19 July, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  45. arXiv:2010.05263  [pdf, other

    cs.LG stat.ML

    Fast Convergence of Langevin Dynamics on Manifold: Geodesics meet Log-Sobolev

    Authors: Xiao Wang, Qi Lei, Ioannis Panageas

    Abstract: Sampling is a fundamental and arguably very important task with numerous applications in Machine Learning. One approach to sample from a high dimensional distribution $e^{-f}$ for some function $f$ is the Langevin Algorithm (LA). Recently, there has been a lot of progress in showing fast convergence of LA even in cases where $f$ is non-convex, notably [53], [39] in which the former paper focuses o… ▽ More

    Submitted 6 December, 2020; v1 submitted 11 October, 2020; originally announced October 2020.

  46. arXiv:2008.01064  [pdf, other

    cs.LG stat.ML

    Predicting What You Already Know Helps: Provable Self-Supervised Learning

    Authors: Jason D. Lee, Qi Lei, Nikunj Saunshi, Jiacheng Zhuo

    Abstract: Self-supervised representation learning solves auxiliary prediction tasks (known as pretext tasks) without requiring labeled data to learn useful semantic representations. These pretext tasks are created solely using the input features, such as predicting a missing image patch, recovering the color channels of an image from context, or predicting missing words in text; yet predicting this \textit{… ▽ More

    Submitted 13 November, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: NeurIPS 2021

  47. arXiv:2007.07244  [pdf, ps, other

    cs.SD cs.MM eess.AS

    Transformer-XL Based Music Generation with Multiple Sequences of Time-valued Notes

    Authors: Xianchao Wu, Chengyuan Wang, Qinying Lei

    Abstract: Current state-of-the-art AI based classical music creation algorithms such as Music Transformer are trained by employing single sequence of notes with time-shifts. The major drawback of absolute time interval expression is the difficulty of similarity computing of notes that share the same note value yet different tempos, in one or among MIDI files. In addition, the usage of single sequence restri… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: 9 pages, 7 figures

  48. arXiv:2003.10392  [pdf, other

    cs.LG stat.ML

    Steepest Descent Neural Architecture Optimization: Escaping Local Optimum with Signed Neural Splitting

    Authors: Lemeng Wu, Mao Ye, Qi Lei, Jason D. Lee, Qiang Liu

    Abstract: Developing efficient and principled neural architecture optimization methods is a critical challenge of modern deep learning. Recently, Liu et al.[19] proposed a splitting steepest descent (S2D) method that jointly optimizes the neural parameters and architectures based on progressively growing network structures by splitting neurons into multiple copies in a steepest descent fashion. However, S2D… ▽ More

    Submitted 20 June, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

  49. arXiv:2003.08089  [pdf, other

    cs.LG cs.IT stat.ML

    Solving Inverse Problems with a Flow-based Noise Model

    Authors: Jay Whang, Qi Lei, Alexandros G. Dimakis

    Abstract: We study image inverse problems with a normalizing flow prior. Our formulation views the solution as the maximum a posteriori estimate of the image conditioned on the measurements. This formulation allows us to use noise models with arbitrary dependencies as well as non-linear forward operators. We empirically validate the efficacy of our method on various inverse problems, including compressed se… ▽ More

    Submitted 1 July, 2021; v1 submitted 18 March, 2020; originally announced March 2020.

  50. arXiv:2002.09434  [pdf, ps, other

    cs.LG math.OC stat.ML

    Few-Shot Learning via Learning the Representation, Provably

    Authors: Simon S. Du, Wei Hu, Sham M. Kakade, Jason D. Lee, Qi Lei

    Abstract: This paper studies few-shot learning via representation learning, where one uses $T$ source tasks with $n_1$ data per task to learn a representation in order to reduce the sample complexity of a target task for which there is only $n_2 (\ll n_1)$ data. Specifically, we focus on the setting where there exists a good \emph{common representation} between source and target, and our goal is to understa… ▽ More

    Submitted 30 March, 2021; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: ICLR2021