Showing 1–5 of 5 results for author: Gan, G

  1. arXiv:2112.14865

    stat.AP

    Compositional Data Regression in Insurance with Exponential Family PCA

    Authors: Guojun Gan, Emiliano A. Valdez

    Abstract: Compositional data are multivariate observations that carry only relative information between components. Applying standard multivariate statistical methodology directly to analyze compositional data can lead to paradoxes and misinterpretations. Compositional data also frequently appear in insurance, especially with telematics information. However, such type of data does not receive deserved speci…

    Submitted 29 December, 2021; originally announced December 2021.

    Comments: 21 pages, 5 figures, 10 tables

    MSC Class: 62P05

  2. arXiv:2101.10896

    stat.AP

    Applications of Clustering with Mixed Type Data in Life Insurance

    Authors: Shuang Yin, Guojun Gan, Emiliano A. Valdez, Jeyaraj Vadiveloo

    Abstract: Death benefits are generally the largest cash flow item that affects financial statements of life insurers where some still do not have a systematic process to track and monitor death claims experience. In this article, we explore data clustering to examine and understand how actual death claims differ from expected, an early stage of developing a monitoring system crucial for risk management. We…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 25 pages, 6 figures, 5 tables

    MSC Class: 62P05

  3. arXiv:2008.00048

    stat.AP

    Analysis of Prescription Drug Utilization with Beta Regression Models

    Authors: Guojun Gan, Emiliano A. Valdez

    Abstract: The healthcare sector in the U.S. is complex and is also a large sector that generates about 20% of the country's gross domestic product. Healthcare analytics has been used by researchers and practitioners to better understand the industry. In this paper, we examine and demonstrate the use of Beta regression models to study the utilization of brand name drugs in the U.S. to understand the variabil…

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: 26 pages, 10 Figures, 11 Tables

    MSC Class: 91G05

  4. arXiv:2007.15172

    stat.AP

    Skewed link regression models for imbalanced binary response with applications to life insurance

    Authors: Shuang Yin, Dipak K. Dey, Emiliano A. Valdez, Guojun Gan, Jeyaraj Vadiveloo

    Abstract: For a portfolio of life insurance policies observed for a stated period of time, e.g., one year, mortality is typically a rare event. When we examine the outcome of dying or not from such portfolios, we have an imbalanced binary response. The popular logistic and probit regression models can be inappropriate for imbalanced binary response as model estimates may be biased, and if not addressed prop…

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: 25 pages, 7 Tables, 2 Figures

    MSC Class: 62P05

  5. arXiv:2006.05617

    stat.AP

    Hybrid Tree-based Models for Insurance Claims

    Authors: Zhiyu Quan, Zhiguo Wang, Guojun Gan, Emiliano A. Valdez

    Abstract: Two-part models and Tweedie generalized linear models (GLMs) have been used to model loss costs for short-term insurance contract. For most portfolios of insurance claims, there is typically a large proportion of zero claims that leads to imbalances resulting in inferior prediction accuracy of these traditional approaches. This article proposes the use of tree-based models with a hybrid structure…

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: 24 pages, 6 figures

    MSC Class: 62P05