Skip to main content

Showing 1–22 of 22 results for author: Sun, W W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.15338  [pdf, other

    cs.GT cs.LG stat.ML

    Fairness-aware Contextual Dynamic Pricing with Strategic Buyers

    Authors: Pangpang Liu, Will Wei Sun

    Abstract: Contextual pricing strategies are prevalent in online retailing, where the seller adjusts prices based on products' attributes and buyers' characteristics. Although such strategies can enhance seller's profits, they raise concerns about fairness when significant price disparities emerge among specific groups, such as gender or race. These disparities can lead to adverse perceptions of fairness amo… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

  2. arXiv:2412.19436  [pdf, other

    stat.ML cs.LG

    Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback

    Authors: Seong Jin Lee, Will Wei Sun, Yufeng Liu

    Abstract: Reinforcement learning from human feedback (RLHF) has become a cornerstone for aligning large language models with human preferences. However, the heterogeneity of human feedback, driven by diverse individual contexts and preferences, poses significant challenges for reward learning. To address this, we propose a Low-rank Contextual RLHF (LoCo-RLHF) framework that integrates contextual information… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

  3. arXiv:2410.22488  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Privacy-Preserving Dynamic Assortment Selection

    Authors: Young Hyun Cho, Will Wei Sun

    Abstract: With the growing demand for personalized assortment recommendations, concerns over data privacy have intensified, highlighting the urgent need for effective privacy-preserving strategies. This paper presents a novel framework for privacy-preserving dynamic assortment selection using the multinomial logit (MNL) bandits model. Our approach employs a perturbed upper confidence bound method, integrati… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

  4. arXiv:2410.02504  [pdf, other

    stat.ML cs.LG

    Dual Active Learning for Reinforcement Learning from Human Feedback

    Authors: Pangpang Liu, Chengchun Shi, Will Wei Sun

    Abstract: Aligning large language models (LLMs) with human preferences is critical to recent advances in generative artificial intelligence. Reinforcement learning from human feedback (RLHF) is widely applied to achieve this objective. A key step in RLHF is to learn the reward function from human feedback. However, human feedback is costly and time-consuming, making it essential to collect high-quality conv… ▽ More

    Submitted 30 December, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

  5. arXiv:2406.14784  [pdf, other

    cs.LG stat.OT

    Active Learning for Fair and Stable Online Allocations

    Authors: Riddhiman Bhattacharya, Thanh Nguyen, Will Wei Sun, Mohit Tawarmalani

    Abstract: We explore an active learning approach for dynamic fair resource allocation problems. Unlike previous work that assumes full feedback from all agents on their allocations, we consider feedback from a select subset of agents at each epoch of the online resource allocation process. Despite this restriction, our proposed algorithms provide regret bounds that are sub-linear in number of time-periods f… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2404.17592  [pdf, other

    cs.IR cs.LG stat.ML

    Low-Rank Online Dynamic Assortment with Dual Contextual Information

    Authors: Seong Jin Lee, Will Wei Sun, Yufeng Liu

    Abstract: As e-commerce expands, delivering real-time personalized recommendations from vast catalogs poses a critical challenge for retail platforms. Maximizing revenue requires careful consideration of both individual customer characteristics and available item features to optimize assortments over time. In this paper, we consider the dynamic assortment problem with dual contexts -- user and item features… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  7. arXiv:2403.11841  [pdf, other

    stat.ML cs.AI cs.LG

    Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data

    Authors: Danyang Wang, Chengchun Shi, Shikai Luo, Will Wei Sun

    Abstract: In real-world scenarios, datasets collected from randomized experiments are often constrained by size, due to limitations in time and budget. As a result, leveraging large observational datasets becomes a more attractive option for achieving high-quality policy learning. However, most existing offline reinforcement learning (RL) methods depend on two key assumptions--unconfoundedness and positivit… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  8. arXiv:2402.16792  [pdf, other

    stat.ML cs.CR cs.LG

    Rate-Optimal Rank Aggregation with Private Pairwise Rankings

    Authors: Shirong Xu, Will Wei Sun, Guang Cheng

    Abstract: In various real-world scenarios, such as recommender systems and political surveys, pairwise rankings are commonly collected and utilized for rank aggregation to derive an overall ranking of items. However, preference rankings can reveal individuals' personal preferences, highlighting the need to protect them from exposure in downstream analysis. In this paper, we address the challenge of preservi… ▽ More

    Submitted 2 April, 2025; v1 submitted 26 February, 2024; originally announced February 2024.

  9. arXiv:2312.17111  [pdf, other

    stat.ML cs.LG stat.ME

    Online Tensor Inference

    Authors: Xin Wen, Will Wei Sun, Yichen Zhang

    Abstract: Recent technological advances have led to contemporary applications that demand real-time processing and analysis of sequentially arriving tensor data. Traditional offline learning, involving the storage and utilization of all data in each computational iteration, becomes impractical for high-dimensional tensor data due to its voluminous size. Furthermore, existing low-rank tensor methods lack the… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  10. arXiv:2307.04055  [pdf, other

    stat.ML cs.AI cs.GT cs.LG

    Contextual Dynamic Pricing with Strategic Buyers

    Authors: Pangpang Liu, Zhuoran Yang, Zhaoran Wang, Will Wei Sun

    Abstract: Personalized pricing, which involves tailoring prices based on individual characteristics, is commonly used by firms to implement a consumer-specific pricing policy. In this process, buyers can also strategically manipulate their feature data to obtain a lower price, incurring certain manipulation costs. Such strategic behavior can hinder firms from maximizing their profits. In this paper, we stud… ▽ More

    Submitted 25 June, 2024; v1 submitted 8 July, 2023; originally announced July 2023.

    Comments: The paper has been accepted by JASA

  11. arXiv:2305.10015  [pdf, other

    stat.ML cs.LG

    Utility Theory of Synthetic Data Generation

    Authors: Shirong Xu, Will Wei Sun, Guang Cheng

    Abstract: Synthetic data algorithms are widely employed in industries to generate artificial data for downstream learning tasks. While existing research primarily focuses on empirically evaluating utility of synthetic data, its theoretical understanding is largely lacking. This paper bridges the practice-theory gap by establishing relevant utility theory in a statistical learning framework. It considers two… ▽ More

    Submitted 2 April, 2025; v1 submitted 17 May, 2023; originally announced May 2023.

  12. arXiv:2301.00841  [pdf, other

    stat.ML cs.CR cs.LG math.ST

    Ranking Differential Privacy

    Authors: Shirong Xu, Will Wei Sun, Guang Cheng

    Abstract: Rankings are widely collected in various real-life scenarios, leading to the leakage of personal information such as users' preferences on videos or news. To protect rankings, existing works mainly develop privacy protection on a single ranking within a set of ranking or pairwise comparisons of a ranking under the $ε$-differential privacy. This paper proposes a novel notion called $ε$-ranking diff… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: 59 pages, 8 figures

    MSC Class: 62F07

  13. arXiv:2212.11385  [pdf, other

    stat.ML cs.LG math.ST

    Online Statistical Inference in Decision-Making with Matrix Context

    Authors: Qiyu Han, Will Wei Sun, Yichen Zhang

    Abstract: The study of online decision-making problems that leverage contextual information has drawn notable attention due to their significant applications in fields ranging from healthcare to autonomous systems. In modern applications, contextual information can be rich and is often represented as a matrix. Moreover, while existing online decision algorithms mainly focus on reward maximization, less atte… ▽ More

    Submitted 18 April, 2025; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: The paper has been accepted by the Annals of Statistics

  14. arXiv:2205.03699  [pdf, other

    cs.LG cs.GT cs.MA stat.ML

    Dynamic Matching Bandit For Two-Sided Online Markets

    Authors: Yuantong Li, Chi-hua Wang, Guang Cheng, Will Wei Sun

    Abstract: Two-sided online matching platforms are employed in various markets. However, agents' preferences in the current market are usually implicit and unknown, thus needing to be learned from data. With the growing availability of dynamic side information involved in the decision process, modern online matching methodology demands the capability to track shifting preferences for agents based on contextu… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 May, 2022; originally announced May 2022.

  15. arXiv:2109.07340  [pdf, other

    stat.ML cs.LG math.ST

    Distribution-free Contextual Dynamic Pricing

    Authors: Yiyun Luo, Will Wei Sun, and Yufeng Liu

    Abstract: Contextual dynamic pricing aims to set personalized prices based on sequential interactions with customers. At each time period, a customer who is interested in purchasing a product comes to the platform. The customer's valuation for the product is a linear function of contexts, including product and customer features, plus some random market noise. The seller does not observe the customer's true… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  16. arXiv:2108.03706  [pdf, other

    stat.ML cs.AI cs.LG math.ST

    Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning

    Authors: Pratik Ramprasad, Yuantong Li, Zhuoran Yang, Zhaoran Wang, Will Wei Sun, Guang Cheng

    Abstract: The recent emergence of reinforcement learning has created a demand for robust statistical inference methods for the parameter estimates computed using these algorithms. Existing methods for statistical inference in online learning are restricted to settings involving independently sampled observations, while existing statistical inference methods in reinforcement learning (RL) are limited to the… ▽ More

    Submitted 28 June, 2022; v1 submitted 8 August, 2021; originally announced August 2021.

    Comments: To Appear in Journal of the American Statistical Association

  17. arXiv:2103.06428  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Covariate-assisted Sparse Tensor Completion

    Authors: Hilda S Ibriga, Will Wei Sun

    Abstract: We aim to provably complete a sparse and highly-missing tensor in the presence of covariate information along tensor modes. Our motivation comes from online advertising where users click-through-rates (CTR) on ads over various devices form a CTR tensor that has about 96% missing entries and has many zeros on non-missing entries, which makes the standalone tensor completion method unsatisfactory. B… ▽ More

    Submitted 7 April, 2022; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: To Appear in Journal of the American Statistical Association

  18. arXiv:2007.15788  [pdf, other

    stat.ML cs.LG

    Stochastic Low-rank Tensor Bandits for Multi-dimensional Online Decision Making

    Authors: Jie Zhou, Botao Hao, Zheng Wen, Jingfei Zhang, Will Wei Sun

    Abstract: Multi-dimensional online decision making plays a crucial role in many real applications such as online recommendation and digital marketing. In these problems, a decision at each time is a combination of choices from different types of entities. To solve it, we introduce stochastic low-rank tensor bandits, a class of bandits whose mean rewards can be represented as a low-rank tensor. We consider t… ▽ More

    Submitted 13 February, 2024; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Accepted by Journal of the American Statistical Association

  19. arXiv:2007.02470  [pdf, other

    stat.ML cs.LG

    Online Regularization towards Always-Valid High-Dimensional Dynamic Pricing

    Authors: Chi-Hua Wang, Zhanyu Wang, Will Wei Sun, Guang Cheng

    Abstract: Devising dynamic pricing policy with always valid online statistical learning procedure is an important and as yet unresolved problem. Most existing dynamic pricing policy, which focus on the faithfulness of adopted customer choice models, exhibit a limited capability for adapting the online uncertainty of learned statistical model during pricing process. In this paper, we propose a novel approach… ▽ More

    Submitted 20 November, 2023; v1 submitted 5 July, 2020; originally announced July 2020.

    Comments: The following article has been accepted by JASA; see https://www.tandfonline.com/doi/full/10.1080/01621459.2023.2284979

  20. arXiv:2002.09735  [pdf, other

    stat.ML cs.LG math.ST

    Partially Observed Dynamic Tensor Response Regression

    Authors: Jie Zhou, Will Wei Sun, Jingfei Zhang, Lexin Li

    Abstract: In modern data science, dynamic tensor data is prevailing in numerous applications. An important task is to characterize the relationship between such dynamic tensor and external covariates. However, the tensor data is often only partially observed, rendering many existing methods inapplicable. In this article, we develop a regression model with partially observed dynamic tensor as the response an… ▽ More

    Submitted 13 May, 2021; v1 submitted 22 February, 2020; originally announced February 2020.

    Comments: Improved lower bound on observation probability (Assumptions 2,6); Improved sample complexity conditions (Assumptions 5,10); Improved final statistical error rate in Theorems 1-2; add a new initialization section; extend to sub-Gaussian error tensor

  21. arXiv:1904.00479  [pdf, other

    stat.ML cs.LG

    Sparse Tensor Additive Regression

    Authors: Botao Hao, Boxiang Wang, Pengyuan Wang, Jingfei Zhang, Jian Yang, Will Wei Sun

    Abstract: Tensors are becoming prevalent in modern applications such as medical imaging and digital marketing. In this paper, we propose a sparse tensor additive regression (STAR) that models a scalar response as a flexible nonparametric function of tensor covariates. The proposed model effectively exploits the sparse and low-rank structures in the tensor additive regression. We formulate the parameter esti… ▽ More

    Submitted 5 March, 2021; v1 submitted 31 March, 2019; originally announced April 2019.

    Comments: Accepted by Journal of Machine Learning Research

  22. arXiv:1601.04586  [pdf, other

    stat.ME cs.LG stat.ML

    Sparse Convex Clustering

    Authors: Binhuan Wang, Yilong Zhang, Will Wei Sun, Yixin Fang

    Abstract: Convex clustering, a convex relaxation of k-means clustering and hierarchical clustering, has drawn recent attentions since it nicely addresses the instability issue of traditional nonconvex clustering methods. Although its computational and statistical properties have been recently studied, the performance of convex clustering has not yet been investigated in the high-dimensional clustering scena… ▽ More

    Submitted 10 February, 2017; v1 submitted 18 January, 2016; originally announced January 2016.