-
PROFIT: Projection-based Test in Longitudinal Functional Data
Authors:
Salil Koner,
So Young Park,
Ana-Maria Staicu
Abstract:
In many modern applications, a dependent functional response is observed for each subject over repeated time, leading to longitudinal functional data. In this paper, we propose a novel statistical procedure to test whether the mean function varies over time. Our approach relies on reducing the dimension of the response using data-driven orthogonal projections and it employs a likelihood-based hypo…
▽ More
In many modern applications, a dependent functional response is observed for each subject over repeated time, leading to longitudinal functional data. In this paper, we propose a novel statistical procedure to test whether the mean function varies over time. Our approach relies on reducing the dimension of the response using data-driven orthogonal projections and it employs a likelihood-based hypothesis testing. We investigate the methodology theoretically and discuss a computationally efficient implementation. The proposed test maintains the type I error rate, and shows excellent power to detect departures from the null hypothesis in finite sample simulation studies. We apply our method to the longitudinal diffusion tensor imaging study of multiple sclerosis (MS) patients to formally assess whether the brain's health tissue, as summarized by fractional anisotropy (FA) profile, degrades over time during the study period.
△ Less
Submitted 2 October, 2023; v1 submitted 22 April, 2021;
originally announced April 2021.
-
Transparency Tools for Fairness in AI (Luskin)
Authors:
Mingliang Chen,
Aria Shahverdi,
Sarah Anderson,
Se Yong Park,
Justin Zhang,
Dana Dachman-Soled,
Kristin Lauter,
Min Wu
Abstract:
We propose new tools for policy-makers to use when assessing and correcting fairness and bias in AI algorithms. The three tools are:
- A new definition of fairness called "controlled fairness" with respect to choices of protected features and filters. The definition provides a simple test of fairness of an algorithm with respect to a dataset. This notion of fairness is suitable in cases where fa…
▽ More
We propose new tools for policy-makers to use when assessing and correcting fairness and bias in AI algorithms. The three tools are:
- A new definition of fairness called "controlled fairness" with respect to choices of protected features and filters. The definition provides a simple test of fairness of an algorithm with respect to a dataset. This notion of fairness is suitable in cases where fairness is prioritized over accuracy, such as in cases where there is no "ground truth" data, only data labeled with past decisions (which may have been biased).
- Algorithms for retraining a given classifier to achieve "controlled fairness" with respect to a choice of features and filters. Two algorithms are presented, implemented and tested. These algorithms require training two different models in two stages. We experiment with combinations of various types of models for the first and second stage and report on which combinations perform best in terms of fairness and accuracy.
- Algorithms for adjusting model parameters to achieve a notion of fairness called "classification parity". This notion of fairness is suitable in cases where accuracy is prioritized. Two algorithms are presented, one which assumes that protected features are accessible to the model during testing, and one which assumes protected features are not accessible during testing.
We evaluate our tools on three different publicly available datasets. We find that the tools are useful for understanding various dimensions of bias, and that in practice the algorithms are effective in starkly reducing a given observed bias when tested on new data.
△ Less
Submitted 8 July, 2020;
originally announced July 2020.
-
Simple fixed-effects inference for complex functional models
Authors:
So Young Park,
Ana-Maria Staicu,
Luo Xiao,
Ciprian Crainiceanu
Abstract:
We propose simple inferential approaches for the fixed effects in complex functional mixed effects models. We estimate the fixed effects under the independence of functional residuals assumption and then bootstrap independent units (e.g. subjects) to estimate the variability of and conduct inference in the form of hypothesis testing on the fixed effects parameters. Simulations show excellent cover…
▽ More
We propose simple inferential approaches for the fixed effects in complex functional mixed effects models. We estimate the fixed effects under the independence of functional residuals assumption and then bootstrap independent units (e.g. subjects) to estimate the variability of and conduct inference in the form of hypothesis testing on the fixed effects parameters. Simulations show excellent coverage probability of the confidence intervals and size of tests. Methods are motivated by and applied to the Baltimore Longitudinal Study of Aging (BLSA), though they are applicable to other studies that collect correlated functional data.
△ Less
Submitted 4 July, 2016;
originally announced July 2016.
-
Conditional analysis for mixed covariates, with application to feed intake of lactating sows
Authors:
So Young Park,
Cai Li,
Santa-Maria Mendoza,
Eric van Heugten,
Ana-Maria Staicu
Abstract:
We propose a novel modeling framework to study the effect of covariates of various types on the conditional distribution of the response. The methodology accommodates flexible model structure, allows for joint estimation of the quantiles at all levels, and involves a computationally efficient estimation algorithm. Extensive numerical investigation confirms good performance of the proposed method.…
▽ More
We propose a novel modeling framework to study the effect of covariates of various types on the conditional distribution of the response. The methodology accommodates flexible model structure, allows for joint estimation of the quantiles at all levels, and involves a computationally efficient estimation algorithm. Extensive numerical investigation confirms good performance of the proposed method. The methodology is motivated by and applied to a lactating sow study, where the primary interest is to understand how the dynamic change of minute-by-minute temperature in the farrowing rooms within a day (functional covariate) is associated with low quantiles of feed intake of lactating sows, while accounting for other sow-specific information (vector covariate).
△ Less
Submitted 30 May, 2019; v1 submitted 18 May, 2016;
originally announced May 2016.
-
Interactive graphics for functional data analyses
Authors:
Julia Wrobel,
So Young Park,
Ana Maria Staicu,
Jeff Goldsmith
Abstract:
Although there are established graphics that accompany the most common functional data analyses, generating these graphics for each dataset and analysis can be cumbersome and time consuming. Often, the barriers to visualization inhibit useful exploratory data analyses and prevent the development of intuition for a method and its application to a particular dataset. The refund.shiny package was dev…
▽ More
Although there are established graphics that accompany the most common functional data analyses, generating these graphics for each dataset and analysis can be cumbersome and time consuming. Often, the barriers to visualization inhibit useful exploratory data analyses and prevent the development of intuition for a method and its application to a particular dataset. The refund.shiny package was developed to address these issues for several of the most common functional data analyses. After conducting an analysis, the plot_shiny() function is used to generate an interactive visualization environment that contains several distinct graphics, many of which are updated in response to user input. These visualizations reduce the burden of exploratory analyses and can serve as a useful tool for the communication of results to non-statisticians.
△ Less
Submitted 12 February, 2016;
originally announced February 2016.
-
Longitudinal Functional Data Analysis
Authors:
So Young Park,
Ana-Maria Staicu
Abstract:
We consider analysis of dependent functional data that are correlated because of a longitudinal-based design: each subject is observed at repeated time visits and for each visit we record a functional variable. We propose a novel parsimonious modeling framework for the repeatedly observed functional variables that allows to extract low dimensional features. The proposed methodology accounts for th…
▽ More
We consider analysis of dependent functional data that are correlated because of a longitudinal-based design: each subject is observed at repeated time visits and for each visit we record a functional variable. We propose a novel parsimonious modeling framework for the repeatedly observed functional variables that allows to extract low dimensional features. The proposed methodology accounts for the longitudinal design, is designed for the study of the dynamic behavior of the underlying process, and is computationally fast. Theoretical properties of this framework are studied and numerical investigation confirms excellent behavior in finite samples. The proposed method is motivated by and applied to a diffusion tensor imaging study of multiple sclerosis. Using Shiny (Chang et al., 2015) we implement interactive plots to help visualize longitudinal functional data as well as the various components and prediction obtained using the proposed method.
△ Less
Submitted 29 June, 2015;
originally announced June 2015.