Using tours to visually investigate properties of new projection pursuit indexes with application to problems in physics
Authors:
Ursula Laa,
Dianne Cook
Abstract:
Projection pursuit is used to find interesting low-dimensional projections of high-dimensional data by optimizing an index over all possible projections. Most indexes have been developed to detect departures from known distributions, such as normality, or to find separations between known groups. Here, we are interested in finding projections that reveal potentially complex bivariate patterns, using new indexes constructed from scagnostics and a maximum information coefficient, with the aim of detecting unusual relationships between model parameters describing physics phenomena. The performance of these indexes is examined with respect to ideal behaviour, using simulated data, and the indexes are then applied to problems from gravitational wave astronomy. The implementation builds upon the projection pursuit tools available in the R package tourr, with indexes constructed from code in the R packages scagnostics, minerva and mbgraphic.
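As a rough illustration of "optimizing an index over all possible projections", the following minimal Python sketch runs a random-search projection pursuit. It is not the authors' implementation and does not use tourr, scagnostics, minerva or mbgraphic. The names `random_basis`, `toy_index` and `pursue`, the synthetic `data`, and the kurtosis-based stand-in index are all assumptions made for this example only.

```python
import numpy as np


def random_basis(p, d, rng):
    """Return a p x d matrix with orthonormal columns (a random projection)."""
    q, _ = np.linalg.qr(rng.standard_normal((p, d)))
    return q


def toy_index(proj):
    """Stand-in index (an assumption, not one of the paper's indexes):
    mean absolute excess kurtosis of the projected coordinates."""
    z = (proj - proj.mean(axis=0)) / proj.std(axis=0)
    return float(np.mean(np.abs((z ** 4).mean(axis=0) - 3.0)))


def pursue(data, index, d=2, n_iter=300, step=0.3, seed=0):
    """Random-search optimizer: perturb the current basis, re-orthonormalize,
    and keep the candidate only if the index value improves."""
    rng = np.random.default_rng(seed)
    p = data.shape[1]
    best = random_basis(p, d, rng)
    best_val = index(data @ best)
    for _ in range(n_iter):
        cand, _ = np.linalg.qr(best + step * rng.standard_normal((p, d)))
        val = index(data @ cand)
        if val > best_val:
            best, best_val = cand, val
    return best, best_val


# Synthetic example data: five noise variables, one made heavy-tailed.
demo_rng = np.random.default_rng(1)
data = demo_rng.standard_normal((500, 5))
data[:, 0] = demo_rng.standard_t(df=3, size=500)

best, best_val = pursue(data, toy_index)
```

Any function that maps a projected data matrix to a number can be passed in place of `toy_index`; the search loop itself does not depend on the choice of index.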
Submitted 13 January, 2020; v1 submitted 31 January, 2019; originally announced February 2019.
Dynamical projections for the visualization of PDFSense data
Authors:
Dianne Cook,
Ursula Laa,
German Valencia
Abstract:
A recent paper on visualizing the sensitivity of hadronic experiments to nucleon structure [1] introduces the tool PDFSense, which defines measures that allow the user to judge the sensitivity of PDF fits to a given experiment. The sensitivity is characterized by high-dimensional data residuals that are visualized in a 3-d subspace of the first 10 principal components or using t-SNE [2]. We show how a tour, a dynamic visualisation of high-dimensional data, can extend this tool beyond 3-d relationships. This approach enables resolving structure orthogonal to the 2-d viewing plane used so far, and hence a more finely tuned assessment of the sensitivity.
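The idea of touring the first 10 principal components can be sketched as follows in Python. This is a simplified illustration under stated assumptions, not the PDFSense or tourr code: the data are synthetic noise rather than real residuals, `pca_scores` and `tour_frames` are hypothetical helper names, and the frames are obtained by linear interpolation followed by re-orthonormalization, whereas tourr interpolates along geodesics between planes.

```python
import numpy as np


def pca_scores(x, k):
    """Project the centered data onto its first k principal components."""
    centered = x - x.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:k].T


def tour_frames(p, n_targets=3, steps=10, seed=0):
    """Yield p x 2 orthonormal frames moving between random target planes.
    Simplification: linear interpolation plus QR, not a geodesic path, so
    individual frames may flip sign relative to their neighbours."""
    rng = np.random.default_rng(seed)

    def rand_basis():
        q, _ = np.linalg.qr(rng.standard_normal((p, 2)))
        return q

    current = rand_basis()
    for _ in range(n_targets):
        target = rand_basis()
        for t in np.linspace(0.0, 1.0, steps, endpoint=False):
            frame, _ = np.linalg.qr((1.0 - t) * current + t * target)
            yield frame
        current = target


# Synthetic stand-in for high-dimensional residuals (200 points, 30 dims).
x = np.random.default_rng(2).standard_normal((200, 30))
scores = pca_scores(x, 10)

frames = list(tour_frames(10))
views = [scores @ f for f in frames]  # one 2-d view per animation frame
```

Each element of `views` is a 2-d scatterplot of the same 10-d point cloud from a different direction; showing them in sequence gives the moving view that a tour provides.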
Submitted 22 July, 2018; v1 submitted 25 June, 2018; originally announced June 2018.