Skip to main content

Showing 1–17 of 17 results for author: Lee, S X

Searching in archive stat. Search in all archives.
.
  1. arXiv:2005.06848  [pdf, ps, other

    stat.CO

    Multi-Node EM Algorithm for Finite Mixture Models

    Authors: Sharon X. Lee, Geoffrey J. McLachlan, Kaleb L. Leemaqz

    Abstract: Finite mixture models are powerful tools for modelling and analyzing heterogeneous data. Parameter estimation is typically carried out using maximum likelihood estimation via the Expectation-Maximization (EM) algorithm. Recently, the adoption of flexible distributions as component densities has become increasingly popular. Often, the EM algorithm for these models involves complicated expressions t… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: 12 Pages,1 figure

  2. arXiv:1904.12057  [pdf, ps, other

    stat.ME

    Comment on "Hidden truncation hyperbolic distributions, finite mixtures thereof and their application for clustering" Murray, Browne, and \McNicholas

    Authors: Geoffrey J. McLachlan, Sharon X. Lee

    Abstract: We comment on the paper of Murray, Browne, and McNicholas (2017), who proposed mixtures of skew distributions, which they termed hidden truncation hyperbolic (HTH). They recently made a clarification (Murray, Browne, McNicholas, 2019) concerning their claim that the so-called CFUST distribution is a special case of the HTH distribution. There are also some other matters in the original version of… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: 7 pages

  3. arXiv:1810.04842  [pdf, ps, other

    stat.ME

    On formulations of skew factor models: skew errors versus skew factors

    Authors: Sharon X. Lee, Geoffrey J. McLachlan

    Abstract: In the past few years, there have been a number of proposals for generalizing the factor analysis (FA) model and its mixture version (known as mixtures of factor analyzers (MFA)) using non-normal and asymmetric distributions. These models adopt various types of skew densities for either the factors or the errors. While the relationships between various choices of skew distributions have been discu… ▽ More

    Submitted 20 November, 2018; v1 submitted 11 October, 2018; originally announced October 2018.

  4. arXiv:1802.02467  [pdf, other

    stat.ME

    Mixtures of Factor Analyzers with Fundamental Skew Symmetric Distributions

    Authors: Sharon X. Lee, Tsung-I Lin, Geoffrey J. McLachlan

    Abstract: Mixtures of factor analyzers (MFA) provide a powerful tool for modelling high-dimensional datasets. In recent years, several generalizations of MFA have been developed where the normality assumption of the factors and/or of the errors was relaxed to allow for skewness in the data. However, due to the form of the adopted component densities, the distribution of the factors/errors in most of these m… ▽ More

    Submitted 26 October, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

  5. arXiv:1608.02797  [pdf, other

    stat.CO cs.DC

    A block EM algorithm for multivariate skew normal and skew t-mixture models

    Authors: Sharon X Lee, Kaleb L Leemaqz, Geoffrey J McLachlan

    Abstract: Finite mixtures of skew distributions provide a flexible tool for modelling heterogeneous data with asymmetric distributional features. However, parameter estimation via the Expectation-Maximization (EM) algorithm can become very time-consuming due to the complicated expressions involved in the E-step that are numerically expensive to evaluate. A more time-efficient implementation of the EM algori… ▽ More

    Submitted 9 August, 2016; originally announced August 2016.

  6. arXiv:1606.02054  [pdf, other

    stat.CO

    A simple multithreaded implementation of the EM algorithm for mixture models

    Authors: Sharon X Lee, Kaleb L Lee, Geoffrey J McLachlan

    Abstract: Finite mixture models have been widely used for the modelling and analysis of data from heterogeneous populations. Maximum likelihood estimation of the parameters is typically carried out via the Expectation-Maximization (EM) algorithm. The complexity of the implementation of the algorithm depends on the parametric distribution that is adopted as the component densities of the mixture model. In th… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

  7. arXiv:1601.00773  [pdf, other

    math.ST stat.CO

    Comment on "On Nomenclature, and the Relative Merits of Two Formulations of Skew Distributions" by A. Azzalini, R. Browne, M. Genton, and P. McNicholas

    Authors: Geoffrey J. McLachlan, Sharon X. Lee

    Abstract: We comment on the recent paper by Azzalini et al. (2015) on two different distributions proposed in the literature for the modelling of data that have asymmetric and possibly long-tailed clusters. They are referred to as the restricted and unrestricted skew normal and skew t-distributions by Lee and McLachlan (2013a). We clarify an apparent misunderstanding in Azzalini et al.(2015) of this nomencl… ▽ More

    Submitted 5 January, 2016; originally announced January 2016.

  8. arXiv:1509.02069  [pdf, other

    stat.CO stat.ME

    EMMIXcskew: an R Package for the Fitting of a Mixture of Canonical Fundamental Skew t-Distributions

    Authors: Sharon X. Lee, Geoffrey J. McLachlan

    Abstract: This paper presents an R package EMMIXcskew for the fitting of the canonical fundamental skew t-distribution (CFUST) and finite mixtures of this distribution (FM-CFUST) via maximum likelihood (ML). The CFUST distribution provides a flexible family of models to handle non-normal data, with parameters for capturing skewness and heavy-tails in the data. It formally encompasses the normal, t, and skew… ▽ More

    Submitted 9 February, 2017; v1 submitted 7 September, 2015; originally announced September 2015.

  9. arXiv:1411.2820  [pdf, other

    q-bio.QM stat.ME stat.ML

    Supervised Classification of Flow Cytometric Samples via the Joint Clustering and Matching (JCM) Procedure

    Authors: Sharon X. Lee, Geoffrey J. McLachlan, Saumyadipta Pyne

    Abstract: We consider the use of the Joint Clustering and Matching (JCM) procedure for the supervised classification of a flow cytometric sample with respect to a number of predefined classes of such samples. The JCM procedure has been proposed as a method for the unsupervised classification of cells within a sample into a number of clusters and in the case of multiple samples, the matching of these cluster… ▽ More

    Submitted 11 November, 2014; originally announced November 2014.

  10. arXiv:1405.0685  [pdf, other

    stat.ME

    Finite Mixtures of Canonical Fundamental Skew t-Distributions

    Authors: Sharon X. Lee, Geoffrey J. McLachlan

    Abstract: This is an extended version of the paper Lee and McLachlan (2014b) with simulations and applications added. This paper introduces a finite mixture of canonical fundamental skew t (CFUST) distributions for a model-based approach to clustering where the clusters are asymmetric and possibly long-tailed (Lee and McLachlan, 2014b). The family of CFUST distributions includes the restricted multivariate… ▽ More

    Submitted 4 May, 2014; originally announced May 2014.

    Comments: This is an extended version of the paper Lee and McLachlan (2014b) with simulations and applications added

  11. arXiv:1404.1733  [pdf, other

    stat.ME

    Comment on "Comparing two formulations of skew distributions with special reference to model-based clustering" by A. Azzalini, R. Browne, M. Genton, and P. McNicholas

    Authors: Geoffrey J. McLachlan, Sharon X. Lee

    Abstract: In this paper, we comment on the recent comparison in Azzalini et al. (2014) of two different distributions proposed in the literature for the modelling of data that have asymmetric and possibly long-tailed clusters. They are referred to as the restricted and unrestricted skew t-distributions by Lee and McLachlan (2013a). Firstly, we wish to point out that in Lee and McLachlan (2014b), which prece… ▽ More

    Submitted 7 April, 2014; originally announced April 2014.

  12. arXiv:1401.8182  [pdf, other

    stat.ME

    Maximum Likelihood Estimation for Finite Mixtures of Canonical Fundamental Skew t-Distributions: the Unification of the Unrestricted and Restricted Skew t-Mixture Models

    Authors: Sharon X. Lee, Geoffrey J. McLachlan

    Abstract: In this paper, we present an algorithm for the fitting of a location-scale variant of the canonical fundamental skew t (CFUST) distribution, a superclass of the restricted and unrestricted skew t-distributions. In recent years, a few versions of the multivariate skew $t$ (MST) model have been put forward, together with various EM-type algorithms for parameter estimation. These formulations adopted… ▽ More

    Submitted 31 January, 2014; originally announced January 2014.

  13. arXiv:1310.5336  [pdf, other

    stat.ME

    The skew-t factor analysis model

    Authors: Tsung-I Lin, Pal H. Wu, Geoffrey J. McLachlan, Sharon X. Lee

    Abstract: Factor analysis is a classical data reduction technique that seeks a potentially lower number of unobserved variables that can account for the correlations among the observed variables. This paper presents an extension of the factor analysis model by assuming jointly a restricted version of multivariate skew t distribution for the latent factors and unobservable errors, called the skew-t factor an… ▽ More

    Submitted 3 December, 2013; v1 submitted 20 October, 2013; originally announced October 2013.

  14. arXiv:1307.1748  [pdf, other

    stat.ME

    Extending mixtures of factor models using the restricted multivariate skew-normal distribution

    Authors: Tsung-I Lin, Geoffrey J. McLachlan, Sharon X. Lee

    Abstract: The mixture of factor analyzers (MFA) model provides a powerful tool for analyzing high-dimensional data as it can reduce the number of free parameters through its factor-analytic representation of the component covariance matrices. This paper extends the MFA model to incorporate a restricted version of the multivariate skew-normal distribution to model the distribution of the latent component fac… ▽ More

    Submitted 6 July, 2013; originally announced July 2013.

  15. arXiv:1211.5290  [pdf, ps, other

    stat.CO stat.ME

    EMMIX-uskew: An R Package for Fitting Mixtures of Multivariate Skew t-distributions via the EM Algorithm

    Authors: Sharon X. Lee, Geoffrey J. McLachlan

    Abstract: This paper describes an algorithm for fitting finite mixtures of unrestricted Multivariate Skew t (FM-uMST) distributions. The package EMMIX-uskew implements a closed-form expectation-maximization (EM) algorithm for computing the maximum likelihood (ML) estimates of the parameters for the (unrestricted) FM-MST model in R. EMMIX-uskew also supports visualization of fitted contours in two and three… ▽ More

    Submitted 27 March, 2013; v1 submitted 22 November, 2012; originally announced November 2012.

  16. On Mixtures of Skew Normal and Skew t-Distributions

    Authors: Sharon X. Lee, Geoffrey J. McLachlan

    Abstract: Finite mixture of skew distributions have emerged as an effective tool in modelling heterogeneous data with asymmetric features. With various proposals appearing rapidly in the recent years, which are similar but not identical, the connections between them and their relative performance becomes rather unclear. This paper aims to provide a concise overview of these developments by presenting a syst… ▽ More

    Submitted 28 May, 2013; v1 submitted 15 November, 2012; originally announced November 2012.

    Journal ref: Advances in Data Analysis and Classification 2013

  17. arXiv:1109.4706  [pdf, ps, other

    stat.ME

    On the fitting of mixtures of multivariate skew t-distributions via the EM algorithm

    Authors: S. X. Lee, G. J. McLachlan

    Abstract: We show how the expectation-maximization (EM) algorithm can be applied exactly for the fitting of mixtures of general multivariate skew t (MST) distributions, eliminating the need for computationally expensive Monte Carlo estimation. Finite mixtures of MST distributions have proven to be useful in modelling heterogeneous data with asymmetric and heavy tail behaviour. Recently, they have been explo… ▽ More

    Submitted 5 September, 2012; v1 submitted 22 September, 2011; originally announced September 2011.