Skip to main content

Showing 1–10 of 10 results for author: Eriksen, P S

Searching in archive stat. Search in all archives.
.
  1. Shotgun DNA sequencing for human identification: Dynamic SNP selection and likelihood ratio calculations accounting for errors

    Authors: Mikkel Meyer Andersen, Marie-Louise Kampmann, Alberte Honoré Jepsen, Niels Morling, Poul Svante Eriksen, Claus Børsting, Jeppe Dyrberg Andersen

    Abstract: In forensic genetics, short tandem repeats (STRs) are used for human identification (HID). Degraded biological trace samples with low amounts of short DNA fragments (low-quality DNA samples) pose a challenge for STR typing. Predefined single nucleotide polymorphisms (SNPs) can be amplified on short PCR fragments and used to generate SNP profiles from low-quality DNA samples. However, the stochasti… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 25 pages, 9 figures

  2. arXiv:2201.08659  [pdf, other

    cs.LG stat.CO

    Unity Smoothing for Handling Inconsistent Evidence in Bayesian Networks and Unity Propagation for Faster Inference

    Authors: Mads Lindskou, Torben Tvedebrink, Poul Svante Eriksen, Søren Højsgaard, Niels Morling

    Abstract: We propose Unity Smoothing (US) for handling inconsistencies between a Bayesian network model and new unseen observations. We show that prediction accuracy, using the junction tree algorithm with US is comparable to that of Laplace smoothing. Moreover, in applications were sparsity of the data structures is utilized, US outperforms Laplace smoothing in terms of memory usage. Furthermore, we detail… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

  3. arXiv:2103.03647  [pdf, other

    stat.CO

    sparta: Sparse Tables and their Algebra with a View Towards High Dimensional Graphical Models

    Authors: Mads Lindskou, Søren Højsgaard, Poul Svante Eriksen, Torben Tvedebrink

    Abstract: A graphical model is a multivariate (potentially very high dimensional) probabilistic model, which is formed by combining lower dimensional components. Inference (computation of conditional probabilities) is based on message passing algorithms that utilize conditional independence structures. In graphical models for discrete variables with finite state spaces, there is a fundamental problem in hig… ▽ More

    Submitted 2 June, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

  4. arXiv:2103.02366  [pdf, other

    math.ST stat.CO

    Detecting Outliers in High-dimensional Data with Mixed Variable Types using Conditional Gaussian Regression Models

    Authors: Mads Lindskou, Torben Tvedebrink, Poul Svante Eriksen, Niels Morling

    Abstract: Outlier detection has gained increasing interest in recent years, due to newly emerging technologies and the huge amount of high-dimensional data that are now available. Outlier detection can help practitioners to identify unwanted noise and/or locate interesting abnormal observations. To address this, we developed a novel method for outlier detection for use in, possibly high-dimensional, dataset… ▽ More

    Submitted 19 May, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  5. arXiv:2012.00513  [pdf, other

    stat.CO stat.AP stat.ML

    DNA mixture deconvolution using an evolutionary algorithm with multiple populations, hill-climbing, and guided mutation

    Authors: Søren B. Vilsen, Torben Tvedebrink, Poul Svante Eriksen

    Abstract: DNA samples crime cases analysed in forensic genetics, frequently contain DNA from multiple contributors. These occur as convolutions of the DNA profiles of the individual contributors to the DNA sample. Thus, in cases where one or more of the contributors were unknown, an objective of interest would be the separation, often called deconvolution, of these unknown profiles. In order to obtain decon… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  6. arXiv:1509.07982  [pdf, other

    stat.ME q-bio.MN stat.ML

    Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

    Authors: Anders Ellern Bilgrau, Carel F. W. Peeters, Poul Svante Eriksen, Martin Bøgsted, Wessel N. van Wieringen

    Abstract: We consider the problem of jointly estimating multiple inverse covariance matrices from high-dimensional data consisting of distinct classes. An $\ell_2$-penalized maximum likelihood approach is employed. The suggested approach is flexible and generic, incorporating several other $\ell_2$-penalized estimators as special cases. In addition, the approach allows specification of target matrices throu… ▽ More

    Submitted 26 March, 2020; v1 submitted 26 September, 2015; originally announced September 2015.

    Comments: 52 pages, 11 figures

    Journal ref: Journal of Machine Learning Research, 21(26):1--52, 2020

  7. arXiv:1503.07990  [pdf, other

    stat.ML q-bio.GN stat.ME

    Estimating a common covariance matrix for network meta-analysis of gene expression datasets in diffuse large B-cell lymphoma

    Authors: Anders Ellern Bilgrau, Rasmus Froberg Brøndum, Poul Svante Eriksen, Karen Dybkær, Martin Bøgsted

    Abstract: The estimation of covariance matrices of gene expressions has many applications in cancer systems biology. Many gene expression studies, however, are hampered by low sample size and it has therefore become popular to increase sample size by collecting gene expression data across studies. Motivated by the traditional meta-analysis using random effects models, we present a hierarchical random covari… ▽ More

    Submitted 21 August, 2017; v1 submitted 27 March, 2015; originally announced March 2015.

    Comments: 18 pages, 4 figures

  8. arXiv:1406.6508  [pdf, other

    stat.AP math.PR

    The multivariate Dirichlet-multinomial distribution and its application in forensic genetics to adjust for sub-population effects using the θ-correction

    Authors: Torben Tvedebrink, Poul Svante Eriksen, Niels Morling

    Abstract: In this paper, we discuss the construction of a multivariate generalisation of the Dirichlet-multinomial distribution. An example from forensic genetics in the statistical analysis of DNA mixtures motivates the study of this multivariate extension. In forensic genetics, adjustment of the match probabilities due to remote ancestry in the population is often done using the so-called θ-correction.… ▽ More

    Submitted 4 November, 2014; v1 submitted 25 June, 2014; originally announced June 2014.

    Comments: 11 pages, 4 figures

  9. arXiv:1304.2129  [pdf, other

    stat.AP stat.CO

    A gentle introduction to the discrete Laplace method for estimating Y-STR haplotype frequencies

    Authors: Mikkel Meyer Andersen, Poul Svante Eriksen, Niels Morling

    Abstract: Y-STR data simulated under a Fisher-Wright model of evolution with a single-step mutation model turns out to be well predicted by a method using discrete Laplace distributions.

    Submitted 16 October, 2013; v1 submitted 8 April, 2013; originally announced April 2013.

    Comments: 18 pages, 5 figures

  10. arXiv:1210.1773  [pdf, other

    stat.CO q-bio.PE stat.OT

    Efficient Forward Simulation of Fisher-Wright Populations with Stochastic Population Size and Neutral Single Step Mutations in Haplotypes

    Authors: Mikkel Meyer Andersen, Poul Svante Eriksen

    Abstract: In both population genetics and forensic genetics it is important to know how haplotypes are distributed in a population. Simulation of population dynamics helps facilitating research on the distribution of haplotypes. In forensic genetics, the haplotypes can for example consist of lineage markers such as short tandem repeat loci on the Y chromosome (Y-STR). A dominating model for describing popul… ▽ More

    Submitted 5 October, 2012; originally announced October 2012.

    Comments: 17 pages, 6 figures

    MSC Class: 62-04 ACM Class: G.3