Search | arXiv e-print repository

Correlated Bayesian Additive Regression Trees with Gaussian Process for Regression Analysis of Dependent Data

Authors: Xuetao Lu a, Robert E. McCulloch

Abstract: Bayesian Additive Regression Trees (BART) has gained widespread popularity, prompting the development of various extensions for different applications. However, limited attention has been given to analyzing dependent data. Based on a general correlated error assumption and an innovative dummy representation, we introduces a novel extension of BART, called Correlated BART (CBART), designed to handl… ▽ More Bayesian Additive Regression Trees (BART) has gained widespread popularity, prompting the development of various extensions for different applications. However, limited attention has been given to analyzing dependent data. Based on a general correlated error assumption and an innovative dummy representation, we introduces a novel extension of BART, called Correlated BART (CBART), designed to handle correlated errors. By integrating CBART with a Gaussian process (GP), we propose the CBART-GP model, in which the CBART and GP components are loosely coupled, allowing them to be estimated and applied independently. CBART captures the covariate mean function E[y|x]=f(x), while the Gaussian process models the dependency structure in the response $y$. We also developed a computationally efficient approach, named two-stage analysis of variance with weighted residuals, for the estimation of CBART-GP. Simulation studies demonstrate the superiority of CBART-GP over other models, and a real-world application illustrates its practical applicability. △ Less

Submitted 12 September, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

arXiv:2204.10525 [pdf]

Research on spatial information transmission efficiency and capability of safe evacuation signs

Authors: Ruiwen Fan, Zhangyin Dai, Shixiang Tian, Ting Xia a, Hui Zhou, Congbao Huang

Abstract: As an indispensable spatial direction information indicator for emergency evacuation, the spatial relationship between safety evacuation signs and evacuees will affect the response time of evacuees and the evacuation efficiency. This paper takes 2 kinds of common safety evacuation signs, hangtag-type and embedded, as the research object and designs space direction information transmission efficien… ▽ More As an indispensable spatial direction information indicator for emergency evacuation, the spatial relationship between safety evacuation signs and evacuees will affect the response time of evacuees and the evacuation efficiency. This paper takes 2 kinds of common safety evacuation signs, hangtag-type and embedded, as the research object and designs space direction information transmission efficiency and capability simulation experiment and fire drill, the efficiency and capability of spatial direction information transmission of safety evacuation signs are studied. The results show that the space angle of the hangtag-type safety evacuation sign is inversely proportional to the information transmission efficiency and capability of the space direction, and the fire drill also confirms this conclusion. When the spatial angle of the embedded safety evacuation sign is 5°, the spatial direction information transmission efficiency and capability increase. Simultaneously, the average escape time of the participants in the fire drill was lower, and the percentage of choosing unfamiliarity exports increased. The evolution of spatial angle has no significant effect on the intention of the response of subjects of different genders; when choosing the direction, males are more easily affected by the change of spatial angle than females; the confidence level of females' choice is more easily affected by spatial angle. In addition, according to the research results, the corresponding three-dimensional structure safety evacuation signs are designed. The functional structure of the safety evacuation signs is perfected, which can effectively improve the efficiency of fire emergency evacuation. △ Less

Submitted 22 April, 2022; originally announced April 2022.

MSC Class: 14J26 ACM Class: I.2.7

arXiv:1905.10713 [pdf, other]

doi 10.1145/3366423.3380154

Field-aware Calibration: A Simple and Empirically Strong Method for Reliable Probabilistic Predictions

Authors: Feiyang Pan, Xiang Ao, Pingzhong Tang, Min Lu, Dapeng Liu, Lei Xiao, Qing He

Abstract: It is often observed that the probabilistic predictions given by a machine learning model can disagree with averaged actual outcomes on specific subsets of data, which is also known as the issue of miscalibration. It is responsible for the unreliability of practical machine learning systems. For example, in online advertising, an ad can receive a click-through rate prediction of 0.1 over some popu… ▽ More It is often observed that the probabilistic predictions given by a machine learning model can disagree with averaged actual outcomes on specific subsets of data, which is also known as the issue of miscalibration. It is responsible for the unreliability of practical machine learning systems. For example, in online advertising, an ad can receive a click-through rate prediction of 0.1 over some population of users where its actual click rate is 0.15. In such cases, the probabilistic predictions have to be fixed before the system can be deployed. In this paper, we first introduce a new evaluation metric named field-level calibration error that measures the bias in predictions over the sensitive input field that the decision-maker concerns. We show that existing post-hoc calibration methods have limited improvements in the new field-level metric and other non-calibration metrics such as the AUC score. To this end, we propose Neural Calibration, a simple yet powerful post-hoc calibration method that learns to calibrate by making full use of the field-aware information over the validation set. We present extensive experiments on five large-scale datasets. The results showed that Neural Calibration significantly improves against uncalibrated predictions in common metrics such as the negative log-likelihood, Brier score and AUC, as well as the proposed field-level calibration error. △ Less

Submitted 27 January, 2020; v1 submitted 25 May, 2019; originally announced May 2019.

Comments: WWW 2020

arXiv:1904.11547 [pdf, other]

Warm Up Cold-start Advertisements: Improving CTR Predictions via Learning to Learn ID Embeddings

Authors: Feiyang Pan, Shuokai Li, Xiang Ao, Pingzhong Tang, Qing He

Abstract: Click-through rate (CTR) prediction has been one of the most central problems in computational advertising. Lately, embedding techniques that produce low-dimensional representations of ad IDs drastically improve CTR prediction accuracies. However, such learning techniques are data demanding and work poorly on new ads with little logging data, which is known as the cold-start problem. In this pap… ▽ More Click-through rate (CTR) prediction has been one of the most central problems in computational advertising. Lately, embedding techniques that produce low-dimensional representations of ad IDs drastically improve CTR prediction accuracies. However, such learning techniques are data demanding and work poorly on new ads with little logging data, which is known as the cold-start problem. In this paper, we aim to improve CTR predictions during both the cold-start phase and the warm-up phase when a new ad is added to the candidate pool. We propose Meta-Embedding, a meta-learning-based approach that learns to generate desirable initial embeddings for new ad IDs. The proposed method trains an embedding generator for new ad IDs by making use of previously learned ads through gradient-based meta-learning. In other words, our method learns how to learn better embeddings. When a new ad comes, the trained generator initializes the embedding of its ID by feeding its contents and attributes. Next, the generated embedding can speed up the model fitting during the warm-up phase when a few labeled examples are available, compared to the existing initialization methods. Experimental results on three real-world datasets showed that Meta-Embedding can significantly improve both the cold-start and warm-up performances for six existing CTR prediction models, ranging from lightweight models such as Factorization Machines to complicated deep models such as PNN and DeepFM. All of the above apply to conversion rate (CVR) predictions as well. △ Less

Submitted 25 April, 2019; originally announced April 2019.

Comments: Accepted at SIGIR 2019

Showing 1–4 of 4 results for author: Ao, X