Skip to main content

Showing 1–10 of 10 results for author: Fernández, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2209.00124  [pdf, other

    math.ST stat.ME stat.ML

    A general framework for the analysis of kernel-based tests

    Authors: Tamara Fernández, Nicolás Rivera

    Abstract: Kernel-based tests provide a simple yet effective framework that use the theory of reproducing kernel Hilbert spaces to design non-parametric testing procedures. In this paper we propose new theoretical tools that can be used to study the asymptotic behaviour of kernel-based tests in several data scenarios, and in many different testing problems. Unlike current approaches, our methods avoid using… ▽ More

    Submitted 31 August, 2022; originally announced September 2022.

  2. arXiv:2206.07239  [pdf, other

    stat.ME cs.LG

    A Multiple kernel testing procedure for non-proportional hazards in factorial designs

    Authors: Marc Ditzhaus, Tamara Fernández, Nicolás Rivera

    Abstract: In this paper we propose a Multiple kernel testing procedure to infer survival data when several factors (e.g. different treatment groups, gender, medical history) and their interaction are of interest simultaneously. Our method is able to deal with complex data and can be seen as an alternative to the omnipresent Cox model when assumptions such as proportionality cannot be justified. Our methodol… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  3. arXiv:2111.10275  [pdf, other

    stat.ML cs.LG stat.ME

    Composite Goodness-of-fit Tests with Kernels

    Authors: Oscar Key, Arthur Gretton, François-Xavier Briol, Tamara Fernandez

    Abstract: Model misspecification can create significant challenges for the implementation of probabilistic models, and this has led to development of a range of robust methods which directly account for this issue. However, whether these more involved methods are required will depend on whether the model is really misspecified, and there is a lack of generally applicable methods to answer this question. In… ▽ More

    Submitted 19 April, 2025; v1 submitted 19 November, 2021; originally announced November 2021.

    Journal ref: Journal of Machine Learning Research 26(51):1-60 2025

  4. arXiv:2011.08991  [pdf, other

    stat.ME stat.ML

    A kernel test for quasi-independence

    Authors: Tamara Fernández, Wenkai Xu, Marc Ditzhaus, Arthur Gretton

    Abstract: We consider settings in which the data of interest correspond to pairs of ordered times, e.g, the birth times of the first and second child, the times at which a new user creates an account and makes the first purchase on a website, and the entry and survival times of patients in a clinical trial. In these settings, the two times are not independent (the second occurs after the first), yet it is s… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  5. arXiv:2008.08397  [pdf, other

    stat.ML cs.LG stat.ME

    Kernelized Stein Discrepancy Tests of Goodness-of-fit for Time-to-Event Data

    Authors: Tamara Fernandez, Nicolas Rivera, Wenkai Xu, Arthur Gretton

    Abstract: Survival Analysis and Reliability Theory are concerned with the analysis of time-to-event data, in which observations correspond to waiting times until an event of interest such as death from a particular disease or failure of a component in a mechanical system. This type of data is unique due to the presence of censoring, a type of missing data that occurs when we do not observe the actual time o… ▽ More

    Submitted 26 August, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: Proceedings of the International Conference on Machine Learning, 2020

  6. arXiv:1912.03784  [pdf, other

    stat.ME stat.ML

    A kernel log-rank test of independence for right-censored data

    Authors: Tamara Fernandez, Arthur Gretton, David Rindt, Dino Sejdinovic

    Abstract: We introduce a general non-parametric independence test between right-censored survival times and covariates, which may be multivariate. Our test statistic has a dual interpretation, first in terms of the supremum of a potentially infinite collection of weight-indexed log-rank tests, with weight functions belonging to a reproducing kernel Hilbert space (RKHS) of functions; and second, as the norm… ▽ More

    Submitted 19 November, 2021; v1 submitted 8 December, 2019; originally announced December 2019.

  7. arXiv:1904.05187  [pdf, other

    stat.ME stat.ML

    A Reproducing Kernel Hilbert Space log-rank test for the two-sample problem

    Authors: Tamara Fernandez, Nicolas Rivera

    Abstract: Weighted log-rank tests are arguably the most widely used tests by practitioners for the two-sample problem in the context of right-censored data. Many approaches have been considered to make weighted log-rank tests more robust against a broader family of alternatives, among them, considering linear combinations of weighted log-rank tests, and taking the maximum among a finite collection of them.… ▽ More

    Submitted 29 April, 2020; v1 submitted 10 April, 2019; originally announced April 2019.

  8. arXiv:1810.04806  [pdf, other

    math.ST stat.ME

    Kaplan-Meier V- and U-statistics

    Authors: Tamara Fernández, Nicolás Rivera

    Abstract: In this paper, we study Kaplan-Meier V- and U-statistics respectively defined as $θ(\widehat{F}_n)=\sum_{i,j}K(X_{[i:n]},X_{[j:n]})W_iW_j$ and $θ_U(\widehat{F}_n)=\sum_{i\neq j}K(X_{[i:n]},X_{[j:n]})W_iW_j/\sum_{i\neq j}W_iW_j$, where $\widehat{F}_n$ is the Kaplan-Meier estimator, $\{W_1,\ldots,W_n\}$ are the Kaplan-Meier weights and $K:(0,\infty)^2\to\mathbb R$ is a symmetric kernel. As in the ca… ▽ More

    Submitted 12 March, 2020; v1 submitted 10 October, 2018; originally announced October 2018.

  9. arXiv:1810.04286  [pdf, other

    stat.ME

    A maximum-mean-discrepancy goodness-of-fit test for censored data

    Authors: Tamara Fernández, Arthur Gretton

    Abstract: We introduce a kernel-based goodness-of-fit test for censored data, where observations may be missing in random time intervals: a common occurrence in clinical trials and industrial life-testing. The test statistic is straightforward to compute, as is the test threshold, and we establish consistency under the null. Unlike earlier approaches such as the Log-rank test, we make no assumptions as to h… ▽ More

    Submitted 9 October, 2018; originally announced October 2018.

  10. arXiv:1611.00817  [pdf, other

    stat.ML

    Gaussian Processes for Survival Analysis

    Authors: Tamara Fernández, Nicolás Rivera, Yee Whye Teh

    Abstract: We introduce a semi-parametric Bayesian model for survival analysis. The model is centred on a parametric baseline hazard, and uses a Gaussian process to model variations away from it nonparametrically, as well as dependence on covariates. As opposed to many other methods in survival analysis, our framework does not impose unnecessary constraints in the hazard rate or in the survival function. Fur… ▽ More

    Submitted 2 November, 2016; originally announced November 2016.

    Comments: To appear in NIPS 2016