Skip to main content

Showing 1–6 of 6 results for author: Zhang, H S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2501.04296  [pdf, other

    stat.ME

    Inside Out: Externalizing Assumptions in Data Analysis as Validation Checks

    Authors: H. Sherry Zhang, Roger D. Peng

    Abstract: In data analysis, unexpected results often prompt researchers to revisit their procedures to identify potential issues. While some researchers may struggle to identify the root causes, experienced researchers can often quickly diagnose problems by checking a few key assumptions. These checked assumptions, or expectations, are typically informal, difficult to trace, and rarely discussed in publicat… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

  2. arXiv:2407.13663  [pdf, other

    stat.CO cs.NE

    Squintability and Other Metrics for Assessing Projection Pursuit Indexes, and Guiding Optimization Choices

    Authors: H. Sherry Zhang, Dianne Cook, Nicolas Langrené, Jessica Wai Yin Leung

    Abstract: The projection pursuit (PP) guided tour optimizes a criterion function, known as the PP index, to gradually reveal projections of interest from high-dimensional data through animation. Optimization of some PP indexes can be non-trivial, if they are non-smooth functions, or when the optimum has a small "squint angle", detectable only from close proximity. Here, measures for calculating the smoothne… ▽ More

    Submitted 8 March, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

  3. arXiv:2401.05812  [pdf, other

    stat.CO

    A Tidy Framework and Infrastructure to Systematically Assemble Spatio-temporal Indexes from Multivariate Data

    Authors: H. Sherry Zhang, Dianne Cook, Ursula Laa, Nicolas Langrené, Patricia Menéndez

    Abstract: Indexes are useful for summarizing multivariate information into single metrics for monitoring, communicating, and decision-making. While most work has focused on defining new indexes for specific purposes, more attention needs to be directed towards making it possible to understand index behavior in different data conditions, and to determine how their structure affects their values and variation… ▽ More

    Submitted 13 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  4. arXiv:2205.00259  [pdf, other

    stat.CO stat.ME

    cubble: An R Package for Organizing and Wrangling Multivariate Spatio-temporal Data

    Authors: H. Sherry Zhang, Dianne Cook, Ursula Laa, Nicolas Langrené, Patricia Menéndez

    Abstract: Multivariate spatio-temporal data refers to multiple measurements taken across space and time. For many analyses, spatial and time components can be separately studied: for example, to explore the temporal trend of one variable for a single spatial location, or to model the spatial distribution of one variable at a given time. However for some studies, it is important to analyse different aspects… ▽ More

    Submitted 10 January, 2024; v1 submitted 30 April, 2022; originally announced May 2022.

  5. arXiv:2104.08016  [pdf, other

    cs.GR stat.OT

    A Review of the State-of-the-Art on Tours for Dynamic Visualization of High-dimensional Data

    Authors: Stuart Lee, Dianne Cook, Natalia da Silva, Ursula Laa, Earo Wang, Nick Spyrison, H. Sherry Zhang

    Abstract: This article discusses a high-dimensional visualization technique called the tour, which can be used to view data in more than three dimensions. We review the theory and history behind the technique, as well as modern software developments and applications of the tour that are being found across the sciences and machine learning.

    Submitted 19 April, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

  6. Visual Diagnostics for Constrained Optimisation with Application to Guided Tours

    Authors: H. Sherry Zhang, Dianne Cook, Ursula Laa, Nicolas Langrené, Patricia Menéndez

    Abstract: A guided tour helps to visualise high-dimensional data by showing low-dimensional projections along a projection pursuit optimisation path. Projection pursuit is a generalisation of principal component analysis, in the sense that different indexes are used to define the interestingness of the projected data. While much work has been done in developing new indexes in the literature, less has been d… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Journal ref: R Journal 13(2) 624-641 (2021)