Showing 1–22 of 22 results for author: He, Q

Searching in archive stat.
  1. arXiv:2410.16540  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration

    Authors: Yingqian Cui, Pengfei He, Xianfeng Tang, Qi He, Chen Luo, Jiliang Tang, Yue Xing

    Abstract: Few-shot Chain-of-Thought (CoT) prompting has demonstrated strong performance in improving the reasoning capabilities of large language models (LLMs). While theoretical investigations have been conducted to understand CoT, the underlying transformer used in these studies isolates the CoT reasoning process into separated in-context learning steps (Stepwise ICL). In this work, we theoretically show…

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2409.13198  [pdf, other]

    cs.CL cs.LG stat.ML

    Exploring Scaling Laws for Local SGD in Large Language Model Training

    Authors: Qiaozhi He, Xiaomin Zhuang, Zhihua Wu

    Abstract: This paper investigates scaling laws for local SGD in LLM training, a distributed optimization algorithm that facilitates training on loosely connected devices. Through extensive experiments, we show that local SGD achieves competitive results compared to conventional methods, given equivalent model parameters, datasets, and computational resources. Furthermore, we explore the application of local…

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: Technical Report

  3. Nonlinear Regression Analysis

    Authors: Hsin-Hsiung Huang, Qing He

    Abstract: Nonlinear regression analysis is a popular and important tool for scientists and engineers. In this article, we introduce theories and methods of nonlinear regression and its statistical inferences using the frequentist and Bayesian statistical modeling and computation. Least squares with the Gauss-Newton method is the most widely used approach to parameter estimation. Under the assumption of nor…

    Submitted 7 February, 2024; originally announced February 2024.

  4. A Framework of Zero-Inflated Bayesian Negative Binomial Regression Models For Spatiotemporal Data

    Authors: Qing He, Hsin-Hsiung Huang

    Abstract: Spatiotemporal data analysis with massive zeros is widely used in many areas such as epidemiology and public health. We use a Bayesian framework to fit zero-inflated negative binomial models and employ a set of latent variables from Pólya-Gamma distributions to derive an efficient Gibbs sampler. The proposed model accommodates varying spatial and temporal random effects through Gaussian process pr…

    Submitted 6 February, 2024; originally announced February 2024.

    Journal ref: Journal of Statistical Planning and Inference (2024). 229, 106098

  5. arXiv:2304.03476  [pdf, other]

    stat.ME

    Generalizing the intention-to-treat effect of an active control against placebo from historical placebo-controlled trials to an active-controlled trial: A case study of the efficacy of daily oral TDF/FTC in the HPTN 084 study

    Authors: Qijia He, Fei Gao, Oliver Dukes, Sinead Delany-Moretlwe, Bo Zhang

    Abstract: In many clinical settings, an active-controlled trial design (e.g., a non-inferiority or superiority design) is often used to compare an experimental medicine to an active control (e.g., an FDA-approved, standard therapy). One prominent example is a recent phase 3 efficacy trial, HIV Prevention Trials Network Study 084 (HPTN 084), comparing long-acting cabotegravir, a new HIV pre-exposure prophyla…

    Submitted 29 December, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

  6. arXiv:2209.10642  [pdf]

    physics.soc-ph cs.DL stat.AP

    Caught in the Crossfire: Fears of Chinese-American Scientists

    Authors: Yu Xie, Xihong Lin, Ju Li, Qian He, Junming Huang

    Abstract: The US leadership in science and technology has greatly benefitted from immigrants from other countries, most notably from China in recent decades. However, feeling the pressure of potential federal investigation since the 2018 launch of the China Initiative under the Trump administration, Chinese-origin scientists in the US now face higher incentives to leave the US and lower incentives to ap…

    Submitted 23 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: 16 pages, 2 figures

    ACM Class: J.4

  7. arXiv:2109.01993  [pdf, ps, other]

    stat.AP

    Statistical computation methods for microbiome compositional data network inference

    Authors: Liang Chen, Qiuyan He, Hui Wan, Shun He, Minghua Deng

    Abstract: Microbes can affect processes from food production to human health. Such microbes are not isolated, but rather interact with each other and establish connections with their living environments. Understanding these interactions is essential to an understanding of the organization and complex interplay of microbial communities, as well as the structure and dynamics of various ecosystems. A common an…

    Submitted 5 September, 2021; originally announced September 2021.

  8. arXiv:2103.15036  [pdf, other]

    stat.AP

    External Correlates of Adult Digital Problem-Solving Behavior: Log Data Analysis of a Large-Scale Assessment

    Authors: Susu Zhang, Xueying Tang, Qiwei He, Jingchen Liu, Zhiliang Ying

    Abstract: Using the action sequence data (i.e., log data) from the problem-solving in technology-rich environments assessment on the 2012 Programme for the International Assessment of Adult Competencies survey, the current study examines the associations between adult digital problem-solving behavior and several demographic and cognitive variables. Action sequence features extracted using multidimensional s…

    Submitted 27 March, 2021; originally announced March 2021.

  9. arXiv:2003.00911  [pdf, other]

    cs.IR cs.LG stat.ML

    A Survey on Knowledge Graph-Based Recommender Systems

    Authors: Qingyu Guo, Fuzhen Zhuang, Chuan Qin, Hengshu Zhu, Xing Xie, Hui Xiong, Qing He

    Abstract: To solve the information explosion problem and enhance user experience in various online applications, recommender systems have been developed to model users' preferences. Although numerous efforts have been made toward more personalized recommendations, recommender systems still suffer from several challenges, such as data sparsity and cold start. In recent years, generating recommendations with t…

    Submitted 27 February, 2020; originally announced March 2020.

    Comments: 17 pages, 1 figure

  10. arXiv:1912.02968  [pdf, other]

    cs.LG physics.comp-ph stat.ML

    Physics-Informed Neural Networks for Multiphysics Data Assimilation with Application to Subsurface Transport

    Authors: QiZhi He, David Brajas-Solano, Guzel Tartakovsky, Alexandre M. Tartakovsky

    Abstract: Data assimilation for parameter and state estimation in subsurface transport problems remains a significant challenge due to the sparsity of measurements, the heterogeneity of porous media, and the high computational cost of forward numerical models. We present a physics-informed deep neural networks (DNNs) machine learning method for estimating space-dependent hydraulic conductivity, hydraulic he…

    Submitted 5 December, 2019; originally announced December 2019.

  11. arXiv:1911.08967  [pdf, ps, other]

    cs.LG stat.ML

    Transfer Learning Toolkit: Primers and Benchmarks

    Authors: Fuzhen Zhuang, Keyu Duan, Tongjia Guo, Yongchun Zhu, Dongbo Xi, Zhiyuan Qi, Qing He

    Abstract: The transfer learning toolkit wraps the codes of 17 transfer learning models and provides integrated interfaces, allowing users to use those models by calling a simple function. It is easy for primary researchers to use this toolkit and to choose proper models for real-world applications. The toolkit is written in Python and distributed under MIT open source license. In this paper, the current sta…

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: A Transfer Learning Toolkit

  12. arXiv:1911.02685  [pdf, ps, other]

    cs.LG stat.ML

    A Comprehensive Survey on Transfer Learning

    Authors: Fuzhen Zhuang, Zhiyuan Qi, Keyu Duan, Dongbo Xi, Yongchun Zhu, Hengshu Zhu, Hui Xiong, Qing He

    Abstract: Transfer learning aims at improving the performance of target learners on target domains by transferring the knowledge contained in different but related source domains. In this way, the dependence on a large number of target domain data can be reduced for constructing target learners. Due to the wide application prospects, transfer learning has become a popular and promising area in machine learn…

    Submitted 23 June, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: 31 pages, 7 figures

  13. arXiv:1910.05250  [pdf, other]

    cs.LG stat.ML

    Efficient and Adaptive Kernelization for Nonlinear Max-margin Multi-view Learning

    Authors: Changying Du, Jia He, Changde Du, Fuzhen Zhuang, Qing He, Guoping Long

    Abstract: Existing multi-view learning methods based on kernel function either require the user to select and tune a single predefined kernel or have to compute and store many Gram matrices to perform multiple kernel learning. Apart from the huge consumption of manpower, computation and memory resources, most of these models seek point estimation of their parameters, and are prone to overfitting to small tr…

    Submitted 11 October, 2019; originally announced October 2019.

    Comments: Multi-view learning, Adaptive kernel, Maximum margin learning, Linear scalability, Dirichlet process Gaussian mixtures, Bayesian inference, Data augmentation, Hamiltonian Monte Carlo

  14. arXiv:1910.04420  [pdf, other]

    cs.LG stat.ML

    Learning beyond Predefined Label Space via Bayesian Nonparametric Topic Modelling

    Authors: Changying Du, Fuzhen Zhuang, Jia He, Qing He, Guoping Long

    Abstract: In real world machine learning applications, testing data may contain some meaningful new categories that have not been seen in labeled training data. To simultaneously recognize new data categories and assign most appropriate category labels to the data actually from known categories, existing models assume the number of unknown new categories is pre-specified, though it is difficult to determine…

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: Learning beyond predefined labels; Generalized zero-shot learning; Semi-supervised learning; Generative model; Nonparametric Bayesian learning; Hierarchical Dirichlet process; Topic modelling; Collapsed Gibbs sampling

  15. arXiv:1907.11377  [pdf]

    eess.SP cs.AI cs.LG stat.ML

    Deep Learning Detection of Inaccurate Smart Electricity Meters: A Case Study

    Authors: Ming Liu, Dongpeng Liu, Guangyu Sun, Yi Zhao, Duolin Wang, Fangxing Liu, Xiang Fang, Qing He, Dong Xu

    Abstract: Detecting inaccurate smart meters and targeting them for replacement can save significant resources. For this purpose, a novel deep-learning method was developed based on long short-term memory (LSTM) and a modified convolutional neural network (CNN) to predict electricity usage trajectories based on historical data. From the significant difference between the predicted trajectory and the observed…

    Submitted 7 August, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

  16. Field-aware Calibration: A Simple and Empirically Strong Method for Reliable Probabilistic Predictions

    Authors: Feiyang Pan, Xiang Ao, Pingzhong Tang, Min Lu, Dapeng Liu, Lei Xiao, Qing He

    Abstract: It is often observed that the probabilistic predictions given by a machine learning model can disagree with averaged actual outcomes on specific subsets of data, which is also known as the issue of miscalibration. It is responsible for the unreliability of practical machine learning systems. For example, in online advertising, an ad can receive a click-through rate prediction of 0.1 over some popu…

    Submitted 27 January, 2020; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: WWW 2020

  17. arXiv:1904.11547  [pdf, other]

    cs.LG cs.IR stat.ML

    Warm Up Cold-start Advertisements: Improving CTR Predictions via Learning to Learn ID Embeddings

    Authors: Feiyang Pan, Shuokai Li, Xiang Ao, Pingzhong Tang, Qing He

    Abstract: Click-through rate (CTR) prediction has been one of the most central problems in computational advertising. Lately, embedding techniques that produce low-dimensional representations of ad IDs drastically improve CTR prediction accuracies. However, such learning techniques are data demanding and work poorly on new ads with little logging data, which is known as the cold-start problem. In this pap…

    Submitted 25 April, 2019; originally announced April 2019.

    Comments: Accepted at SIGIR 2019

  18. Latent Feature Extraction for Process Data via Multidimensional Scaling

    Authors: Xueying Tang, Zhi Wang, Qiwei He, Jingchen Liu, Zhiliang Ying

    Abstract: Computer-based interactive items have become prevalent in recent educational assessments. In such items, the entire human-computer interactive process is recorded in a log file and is known as the response process. This paper aims at extracting useful information from response processes. In particular, we consider an exploratory latent variable analysis for process data. Latent variables are extra…

    Submitted 21 April, 2019; originally announced April 2019.

    Comments: 26 pages, 11 figures

    Journal ref: Psychometrika 85 (2020) 378-397

  19. arXiv:1903.09367  [pdf, other]

    math.ST stat.CO stat.ML

    High-Dimensional Linear Regression via Implicit Regularization

    Authors: Peng Zhao, Yun Yang, Qiao-Chu He

    Abstract: Many statistical estimators for high-dimensional linear regression are M-estimators, formed through minimizing a data-dependent square loss function plus a regularizer. This work considers a new class of estimators implicitly defined through a discretized gradient dynamic system under overparameterization. We show that under suitable restricted isometry conditions, overparameterization leads to im…

    Submitted 12 February, 2022; v1 submitted 22 March, 2019; originally announced March 2019.

    Comments: Accepted by Biometrika

  20. arXiv:1811.07350  [pdf, other]

    cs.LG stat.ML

    Policy Optimization with Model-based Explorations

    Authors: Feiyang Pan, Qingpeng Cai, An-Xiang Zeng, Chun-Xiang Pan, Qing Da, Hualin He, Qing He, Pingzhong Tang

    Abstract: Model-free reinforcement learning methods such as the Proximal Policy Optimization algorithm (PPO) have been successfully applied in complex decision-making problems such as Atari games. However, these methods suffer from high variances and high sample complexity. On the other hand, model-based reinforcement learning methods that learn the transition dynamics are more sample efficient, but they often s…

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: Accepted at AAAI-19

  21. arXiv:1511.02552  [pdf, other]

    stat.ME

    Estimation for bivariate quantile varying coefficient model

    Authors: Linglong Kong, Haoxu Shu, Giseon Heo, Qianchuan Chad He

    Abstract: We propose a bivariate quantile regression method for the bivariate varying coefficient model through a directional approach. The varying coefficients are approximated by the B-spline basis and an $L_{2}$ type penalty is imposed to achieve desired smoothness. We develop a multistage estimation procedure based on the Propagation-Separation (PS) approach to borrow information from nearby directions. Th…

    Submitted 8 November, 2015; originally announced November 2015.

  22. A multi-functional analyzer uses parameter constraints to improve the efficiency of model-based gene-set analysis

    Authors: Zhishi Wang, Qiuling He, Bret Larget, Michael A. Newton

    Abstract: We develop a model-based methodology for integrating gene-set information with an experimentally-derived gene list. The methodology uses a previously reported sampling model, but takes advantage of natural constraints in the high-dimensional discrete parameter space in order to work from a more structured prior distribution than is currently available. We show how the natural constraints are expre…

    Submitted 1 June, 2015; v1 submitted 23 October, 2013; originally announced October 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS777 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS777

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 225-246