Skip to main content

Showing 1–11 of 11 results for author: Ge, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.06671  [pdf, other

    math.ST stat.ML

    Covariates-Adjusted Mixed-Membership Estimation: A Novel Network Model with Optimal Guarantees

    Authors: Jianqing Fan, Jiawei Ge, Jikai Hou

    Abstract: This paper addresses the problem of mixed-membership estimation in networks, where the goal is to efficiently estimate the latent mixed-membership structure from the observed network. Recognizing the widespread availability and valuable information carried by node covariates, we propose a novel network model that incorporates both community information, as represented by the Degree-Corrected Mixed… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  2. arXiv:2409.14049  [pdf, other

    stat.ME stat.AP stat.OT

    Adaptive radar detection of subspace-based distributed target in power heterogeneous clutter

    Authors: Daipeng Xiao, Weijian Liu, Jun Liu, Lingyan Dai, Xueli Fang, Jianjun Ge

    Abstract: This paper investigates the problem of adaptive detection of distributed targets in power heterogeneous clutter. In the considered scenario, all the data share the identical structure of clutter covariance matrix, but with varying and unknown power mismatches. To address this problem, we iteratively estimate all the unknowns, including the coordinate matrix of the target, the clutter covariance ma… ▽ More

    Submitted 9 October, 2024; v1 submitted 21 September, 2024; originally announced September 2024.

    Comments: 9 pages, 11 figures. This manuscript is accepted in IEEE Sensors Journal

    Report number: Manuscript No. Sensors-77503-2024.R1

  3. arXiv:2406.04201  [pdf, ps, other

    cs.LG cs.MA math.OC stat.ML

    Securing Equal Share: A Principled Approach for Learning Multiplayer Symmetric Games

    Authors: Jiawei Ge, Yuanhao Wang, Wenzhe Li, Chi Jin

    Abstract: This paper examines multiplayer symmetric constant-sum games with more than two players in a competitive setting, including examples like Mahjong, Poker, and various board and video games. In contrast to two-player zero-sum games, equilibria in multiplayer games are neither unique nor non-exploitable, failing to provide meaningful guarantees when competing against opponents who play different equi… ▽ More

    Submitted 2 October, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2405.10302  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Optimal Aggregation of Prediction Intervals under Unsupervised Domain Shift

    Authors: Jiawei Ge, Debarghya Mukherjee, Jianqing Fan

    Abstract: As machine learning models are increasingly deployed in dynamic environments, it becomes paramount to assess and quantify uncertainties associated with distribution shifts. A distribution shift occurs when the underlying data-generating process changes, leading to a deviation in the model's performance. The prediction interval, which captures the range of likely outcomes for a given prediction, se… ▽ More

    Submitted 7 October, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  5. arXiv:2311.15961  [pdf, ps, other

    stat.ML cs.LG math.ST

    Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

    Authors: Jiawei Ge, Shange Tang, Jianqing Fan, Cong Ma, Chi Jin

    Abstract: A key challenge of modern machine learning systems is to achieve Out-of-Distribution (OOD) generalization -- generalizing to target data whose distribution differs from that of source data. Despite its significant importance, the fundamental question of ``what are the most effective algorithms for OOD generalization'' remains open even under the standard setting of covariate shift. This paper addr… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  6. arXiv:2306.16549  [pdf, other

    stat.ME math.ST stat.ML

    UTOPIA: Universally Trainable Optimal Prediction Intervals Aggregation

    Authors: Jianqing Fan, Jiawei Ge, Debarghya Mukherjee

    Abstract: Uncertainty quantification in prediction presents a compelling challenge with vast applications across various domains, including biomedical science, economics, and weather forecasting. There exists a wide array of methods for constructing prediction intervals, such as quantile regression and conformal prediction. However, practitioners often face the challenge of selecting the most suitable metho… ▽ More

    Submitted 13 July, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  7. arXiv:2303.01566  [pdf, ps, other

    stat.ML cs.LG math.ST

    On the Provable Advantage of Unsupervised Pretraining

    Authors: Jiawei Ge, Shange Tang, Jianqing Fan, Chi Jin

    Abstract: Unsupervised pretraining, which learns a useful representation using a large amount of unlabeled data to facilitate the learning of downstream tasks, is a critical component of modern large-scale machine learning systems. Despite its tremendous empirical success, the rigorous theoretical understanding of why unsupervised pretraining generally helps remains rather limited -- most existing results a… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  8. arXiv:2212.05367  [pdf, other

    stat.ME

    Weakest link pruning of a dendrogram

    Authors: Jiacheng Ge, Robert Tibshirani

    Abstract: Hierarchical clustering is a popular method for identifying distinct groups in a dataset. The most commonly used method for pruning a dendrogram is via a single horizontal cut. In this paper, we propose a new technique "weakest link optimal pruning". We prove its superiority over horizontal pruning and provide some examples illustrating how the two methods can behave quite differently.

    Submitted 18 January, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

  9. arXiv:2007.07085  [pdf, other

    cs.IR cs.LG stat.ML

    Semi-supervised Collaborative Filtering by Text-enhanced Domain Adaptation

    Authors: Wenhui Yu, Xiao Lin, Junfeng Ge, Wenwu Ou, Zheng Qin

    Abstract: Data sparsity is an inherent challenge in the recommender systems, where most of the data is collected from the implicit feedbacks of users. This causes two difficulties in designing effective algorithms: first, the majority of users only have a few interactions with the system and there is no enough data for learning; second, there are no negative samples in the implicit feedbacks and it is a com… ▽ More

    Submitted 28 June, 2020; originally announced July 2020.

    Comments: KDD 2020 paper

  10. arXiv:2006.15261  [pdf, other

    stat.ML cs.LG math.OC

    Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python

    Authors: Jason Ge, Xingguo Li, Haoming Jiang, Han Liu, Tong Zhang, Mengdi Wang, Tuo Zhao

    Abstract: We describe a new library named picasso, which implements a unified framework of pathwise coordinate optimization for a variety of sparse learning problems (e.g., sparse linear regression, sparse logistic regression, sparse Poisson regression and scaled sparse linear regression) combined with efficient active set selection strategies. Besides, the library allows users to choose different sparsity-… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Journal ref: Journal of Machine Learning Research 20 (2019): 44-1

  11. arXiv:1706.06066  [pdf, other

    stat.ML cs.LG math.OC

    On Quadratic Convergence of DC Proximal Newton Algorithm for Nonconvex Sparse Learning in High Dimensions

    Authors: Xingguo Li, Lin F. Yang, Jason Ge, Jarvis Haupt, Tong Zhang, Tuo Zhao

    Abstract: We propose a DC proximal Newton algorithm for solving nonconvex regularized sparse learning problems in high dimensions. Our proposed algorithm integrates the proximal Newton algorithm with multi-stage convex relaxation based on the difference of convex (DC) programming, and enjoys both strong computational and statistical guarantees. Specifically, by leveraging a sophisticated characterization of… ▽ More

    Submitted 15 February, 2018; v1 submitted 19 June, 2017; originally announced June 2017.

    Comments: 36 pages, 5 figures, 1 table, Accepted at NIPS 2017