Skip to main content

Showing 1–22 of 22 results for author: Ge, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.03279  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Conformal Information Pursuit for Interactively Guiding Large Language Models

    Authors: Kwan Ho Ryan Chan, Yuyan Ge, Edgar Dobriban, Hamed Hassani, René Vidal

    Abstract: A significant use case of instruction-finetuned Large Language Models (LLMs) is to solve question-answering tasks interactively. In this setting, an LLM agent is tasked with making a prediction by sequentially querying relevant information from the user, as opposed to a single-turn conversation. This paper explores sequential querying strategies that aim to minimize the expected number of queries.… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

  2. arXiv:2505.06601  [pdf, ps, other

    stat.ML cs.LG

    Learning Guarantee of Reward Modeling Using Deep Neural Networks

    Authors: Yuanhang Luo, Yeheng Ge, Ruijian Han, Guohao Shen

    Abstract: In this work, we study the learning theory of reward modeling with pairwise comparison data using deep neural networks. We establish a novel non-asymptotic regret bound for deep reward estimators in a non-parametric setting, which depends explicitly on the network architecture. Furthermore, to underscore the critical importance of clear human beliefs, we introduce a margin-type condition that assu… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  3. arXiv:2502.20414  [pdf, other

    stat.ML cs.LG

    Transfer Learning through Enhanced Sufficient Representation: Enriching Source Domain Knowledge with Target Data

    Authors: Yeheng Ge, Xueyu Zhou, Jian Huang

    Abstract: Transfer learning is an important approach for addressing the challenges posed by limited data availability in various applications. It accomplishes this by transferring knowledge from well-established source domains to a less familiar target domain. However, traditional transfer learning methods often face difficulties due to rigid model assumptions and the need for a high degree of similarity be… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

    Comments: 44 pages

    MSC Class: 62G05; 68T07

  4. arXiv:2308.00894  [pdf, other

    cs.IR cs.LG stat.ME

    User-Controllable Recommendation via Counterfactual Retrospective and Prospective Explanations

    Authors: Juntao Tan, Yingqiang Ge, Yan Zhu, Yinglong Xia, Jiebo Luo, Jianchao Ji, Yongfeng Zhang

    Abstract: Modern recommender systems utilize users' historical behaviors to generate personalized recommendations. However, these systems often lack user controllability, leading to diminished user satisfaction and trust in the systems. Acknowledging the recent advancements in explainable recommender systems that enhance users' understanding of recommendation mechanisms, we propose leveraging these advancem… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted for presentation at 26th European Conference on Artificial Intelligence (ECAI2023)

  5. arXiv:2206.06469  [pdf

    cs.LG stat.ML

    Invariant Structure Learning for Better Generalization and Causal Explainability

    Authors: Yunhao Ge, Sercan Ö. Arik, Jinsung Yoon, Ao Xu, Laurent Itti, Tomas Pfister

    Abstract: Learning the causal structure behind data is invaluable for improving generalization and obtaining high-quality explanations. We propose a novel framework, Invariant Structure Learning (ISL), that is designed to improve causal structure discovery by utilizing generalization as an indication. ISL splits the data into different environments, and learns a structure that is invariant to the target acr… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 16 pages (including Appendix), 4 figures

  6. Bayesian spectral density approach for identification and uncertainty quantification of bridge section's flutter derivatives operated in turbulent flow

    Authors: Xiaolei Chu, Wei Cui, Peng Liu, Lin Zhao, Yaojun Ge

    Abstract: This study presents a Bayesian spectral density approach for identification and uncertainty quantification of flutter derivatives of bridge sections utilizing buffeting displacement responses, where the wind tunnel test is conducted in turbulent flow. Different from traditional time-domain approaches (e.g., least square method and stochastic subspace identification), the newly-proposed approach is… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  7. arXiv:2106.10694  [pdf, other

    stat.CO stat.AP

    Life-cycle assessment for flutter probability of a long-span suspension bridge based on field monitoring data

    Authors: Xiaolei Chu, Hung Nguyen Sinh, Wei Cui, Lin Zhao, Yaojun Ge

    Abstract: Assessment of structural safety status is of paramount importance for existing bridges, where accurate evaluation of flutter probability is essential for long-span bridges. In current engineering practice, at the design stage, flutter critical wind speed is usually estimated by the wind tunnel test, which is sensitive to modal frequencies and damping ratios. After construction, structural properti… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

  8. arXiv:2010.05125  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Task-aware Robust Deep Learning Systems

    Authors: Keji Han, Yun Li, Xianzhong Long, Yao Ge

    Abstract: Many works demonstrate that deep learning system is vulnerable to adversarial attack. A deep learning system consists of two parts: the deep learning task and the deep model. Nowadays, most existing works investigate the impact of the deep model on robustness of deep learning systems, ignoring the impact of the learning task. In this paper, we adopt the binary and interval label encoding strategy… ▽ More

    Submitted 1 December, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: 9 Pages

  9. arXiv:2010.02506  [pdf, other

    cs.LG stat.ML

    Interactive Reinforcement Learning for Feature Selection with Decision Tree in the Loop

    Authors: Wei Fan, Kunpeng Liu, Hao Liu, Yong Ge, Hui Xiong, Yanjie Fu

    Abstract: We study the problem of balancing effectiveness and efficiency in automated feature selection. After exploring many feature selection methods, we observe a computational dilemma: 1) traditional feature selection is mostly efficient, but difficult to identify the best subset; 2) the emerging reinforced feature selection automatically navigates to the best subset, but is usually inefficient. Can we… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2008.12001

  10. arXiv:2009.12933  [pdf, other

    cs.GR stat.CO

    A grammar of graphics framework for generalized parallel coordinate plots

    Authors: Yawei Ge, Heike Hofmann

    Abstract: Parallel coordinate plots (PCP) are a useful tool in exploratory data analysis of high-dimensional numerical data. The use of PCPs is limited when working with categorical variables or a mix of categorical and continuous variables. In this paper, we propose generalized parallel coordinate plots (GPCP) to extend the ability of PCPs from just numeric variables to dealing seamlessly with a mix of cat… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: 26 pages, 14 figures. For the implementation in R, see: https://github.com/yaweige/ggpcp

  11. arXiv:2008.12001  [pdf, other

    cs.LG cs.AI stat.ML

    AutoFS: Automated Feature Selection via Diversity-aware Interactive Reinforcement Learning

    Authors: Wei Fan, Kunpeng Liu, Hao Liu, Pengyang Wang, Yong Ge, Yanjie Fu

    Abstract: In this paper, we study the problem of balancing effectiveness and efficiency in automated feature selection. Feature selection is a fundamental intelligence for machine learning and predictive analysis. After exploring many feature selection methods, we observe a computational dilemma: 1) traditional feature selection methods (e.g., mRMR) are mostly efficient, but difficult to identify the best s… ▽ More

    Submitted 16 September, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Accepted by ICDM 2020. In this version, we revised some typos or mistakes for camera-ready

  12. arXiv:2008.09316  [pdf, other

    cs.LG stat.ML

    Explainable Recommender Systems via Resolving Learning Representations

    Authors: Ninghao Liu, Yong Ge, Li Li, Xia Hu, Rui Chen, Soo-Hyun Choi

    Abstract: Recommender systems play a fundamental role in web applications in filtering massive information and matching user interests. While many efforts have been devoted to developing more effective models in various scenarios, the exploration on the explainability of recommender systems is running behind. Explanations could help improve user experience and discover system defects. In this paper, after f… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

  13. arXiv:2005.06546  [pdf

    cs.LG stat.ML

    Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

    Authors: Forrest Sheng Bao, Youbiao He, Jie Liu, Yuanfang Chen, Qian Li, Christina R. Zhang, Lei Han, Baoli Zhu, Yaorong Ge, Shi Chen, Ming Xu, Liu Ouyang

    Abstract: The COVID-19 is sweeping the world with deadly consequences. Its contagious nature and clinical similarity to other pneumonias make separating subjects contracted with COVID-19 and non-COVID-19 viral pneumonia a priority and a challenge. However, COVID-19 testing has been greatly limited by the availability and cost of existing methods, even in developed countries like the US. Intrigued by the wid… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    ACM Class: I.5.4

  14. arXiv:2003.00009  [pdf

    q-bio.QM cs.LG stat.ML

    Cell Mechanics Based Computational Classification of Red Blood Cells Via Machine Intelligence Applied to Morpho-Rheological Markers

    Authors: Yan Ge, Philipp Rosendahl, Claudio Durán, Nicole Töpfner, Sara Ciucci, Jochen Guck, Carlo Vittorio Cannistraci

    Abstract: Despite fluorescent cell-labelling being widely employed in biomedical studies, some of its drawbacks are inevitable, with unsuitable fluorescent probes or probes inducing a functional change being the main limitations. Consequently, the demand for and development of label-free methodologies to classify cells is strong and its impact on precision medicine is relevant. Towards this end, high-throug… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 13 pages, 3 figures, 4 tables

    MSC Class: 68T99

    Journal ref: IEEE/ACM Trans. Comput. Biol. Bioinform (2019)

  15. arXiv:2002.00844  [pdf, other

    cs.SI cs.IR cs.LG stat.ML

    DiffNet++: A Neural Influence and Interest Diffusion Network for Social Recommendation

    Authors: Le Wu, Junwei Li, Peijie Sun, Richang Hong, Yong Ge, Meng Wang

    Abstract: Social recommendation has emerged to leverage social connections among users for predicting users' unknown preferences, which could alleviate the data sparsity issue in collaborative filtering based recommendation. Early approaches relied on utilizing each user's first-order social neighbors' interests for better user modeling and failed to model the social influence diffusion process from the glo… ▽ More

    Submitted 5 January, 2021; v1 submitted 15 January, 2020; originally announced February 2020.

    Comments: This paper has been accepted by IEEE TKDE

  16. arXiv:2001.09595  [pdf, other

    cs.LG cs.IR stat.ML

    Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning

    Authors: Xi Liu, Li Li, Ping-Chun Hsieh, Muhe Xie, Yong Ge, Rui Chen

    Abstract: With the explosive growth of online products and content, recommendation techniques have been considered as an effective tool to overcome information overload, improve user experience, and boost business revenue. In recent years, we have observed a new desideratum of considering long-term rewards of multiple related recommendation tasks simultaneously. The consideration of long-term rewards is str… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  17. arXiv:1908.11051  [pdf

    stat.ML cs.LG physics.ao-ph

    Data-based wind disaster climate identification algorithm and extreme wind speed prediction

    Authors: Wei Cui, Teng Ma, Lin Zhao, Yaojun Ge

    Abstract: An extreme wind speed estimation method that considers wind hazard climate types is critical for design wind load calculation for building structures affected by mixed climates. However, it is very difficult to obtain wind hazard climate types from meteorological data records, because they restrict the application of extreme wind speed estimation in mixed climates. This paper first proposes a wind… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

  18. arXiv:1906.08143  [pdf

    cs.CV eess.SP math.OC physics.med-ph stat.ML

    Model-based Deep Medical Imaging: the roadmap of generalizing iterative reconstruction model using deep learning

    Authors: Jing Cheng, Haifeng Wang, Yanjie Zhu, Qiegen Liu, Qiyang Zhang, Ting Su, Jianwei Chen, Yongshuai Ge, Zhanli Hu, Xin Liu, Hairong Zheng, Leslie Ying, Dong Liang

    Abstract: Medical imaging is playing a more and more important role in clinics. However, there are several issues in different imaging modalities such as slow imaging speed in MRI, radiation injury in CT and PET. Therefore, accelerating MRI, reducing radiation dose in CT and PET have been ongoing research topics since their invention. Usually, acquiring less data is a direct but important strategy to addres… ▽ More

    Submitted 5 September, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: part of the preliminary work will be presented at MICCAI2019

  19. arXiv:1901.06247  [pdf, other

    cs.LG stat.ML

    Micro- and Macro-Level Churn Analysis of Large-Scale Mobile Games

    Authors: Xi Liu, Muhe Xie, Xidao Wen, Rui Chen, Yong Ge, Nick Duffield, Na Wang

    Abstract: As mobile devices become more and more popular, mobile gaming has emerged as a promising market with billion-dollar revenues. A variety of mobile game platforms and services have been developed around the world. A critical challenge for these platforms and services is to understand the churn behavior in mobile games, which usually involves churn at micro level (between an app and a specific user)… ▽ More

    Submitted 14 January, 2019; originally announced January 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1808.06573

  20. arXiv:1901.00615  [pdf, other

    stat.ML cs.LG

    Structure learning via unstructured kernel-based M-regression

    Authors: Xin He, Yeheng Ge, Xingdong Feng

    Abstract: In statistical learning, identifying underlying structures of true target functions based on observed data plays a crucial role to facilitate subsequent modeling and analysis. Unlike most of those existing methods that focus on some specific settings under certain model assumptions, this paper proposes a general and novel framework for recovering true structures of target functions by using unstru… ▽ More

    Submitted 3 May, 2021; v1 submitted 3 January, 2019; originally announced January 2019.

  21. arXiv:1812.10140  [pdf, other

    cs.LG stat.ML

    Mixed-Order Spectral Clustering for Networks

    Authors: Yan Ge, Haiping Lu, Pan Peng

    Abstract: Clustering is fundamental for gaining insights from complex networks, and spectral clustering (SC) is a popular approach. Conventional SC focuses on second-order structures (e.g., edges connecting two nodes) without direct consideration of higher-order structures (e.g., triangles and cliques). This has motivated SC extensions that directly consider higher-order structures. However, both approaches… ▽ More

    Submitted 25 December, 2018; originally announced December 2018.

    Comments: 12 pages

  22. arXiv:1808.06573  [pdf, other

    cs.LG stat.ML

    A Semi-Supervised and Inductive Embedding Model for Churn Prediction of Large-Scale Mobile Games

    Authors: Xi Liu, Muhe Xie, Xidao Wen, Rui Chen, Yong Ge, Nick Duffield, Na Wang

    Abstract: Mobile gaming has emerged as a promising market with billion-dollar revenues. A variety of mobile game platforms and services have been developed around the world. One critical challenge for these platforms and services is to understand user churn behavior in mobile games. Accurate churn prediction will benefit many stakeholders such as game developers, advertisers, and platform operators. In this… ▽ More

    Submitted 10 October, 2018; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: to appear in ICDM 2018