Skip to main content

Showing 1–3 of 3 results for author: Ostrouchov, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.00962  [pdf, ps, other

    stat.CO stat.AP

    clustra: A multi-platform k-means clustering algorithm for analysis of longitudinal trajectories in large electronic health records data

    Authors: Nimish Adhikari, Hanna Gerlovin, George Ostrouchov, Rachel Ehrbar, Alyssa B. Dufour, Brian R. Ferolito, Serkalem Demissie, Lauren Costa, Yuk-Lam Ho, Laura Tarko, Edmon Begoli, Kelly Cho, David R. Gagnon

    Abstract: Background and Objective: Variables collected over time, or longitudinally, such as biologic measurements in electronic health records data, are not simple to summarize with a single time-point, and thus can be more holistically conceptualized as trajectories over time. Cluster analysis with longitudinal data further allows for clinical representation of groups of subjects with similar trajectorie… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 15 pages, 11 figures, clustra package available in https://cran.r-project.org/web/packages/clustra/index.html, SAS macros available in https://github.com/MVP-CHAMPION/clustra-SAS

  2. arXiv:2303.16369  [pdf, other

    stat.AP cs.DC

    A Spatially Correlated Competing Risks Time-to-Event Model for Supercomputer GPU Failure Data

    Authors: Jie Min, Yili Hong, William Q. Meeker, George Ostrouchov

    Abstract: Graphics processing units (GPUs) are widely used in many high-performance computing (HPC) applications such as imaging/video processing and training deep-learning models in artificial intelligence. GPUs installed in HPC systems are often heavily used, and GPU failures occur during HPC system operations. Thus, the reliability of GPUs is of interest for the overall reliability of HPC systems. The Cr… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 45 pages, 25 figures

  3. arXiv:1709.01195  [pdf, other

    stat.CO

    Parallel Statistical Computing with R: An Illustration on Two Architectures

    Authors: George Ostrouchov, Wei-Chen Chen, Drew Schmidt

    Abstract: To harness the full benefit of new computing platforms, it is necessary to develop software with parallel computing capabilities. This is no less true for statisticians than for astrophysicists. The R programming language, which is perhaps the most popular software environment for statisticians today, has many packages available for parallel computing. Their diversity in approach can be difficult… ▽ More

    Submitted 6 September, 2017; v1 submitted 4 September, 2017; originally announced September 2017.

    Comments: Presented at: International Statistical Institute 61st World Statistics Congress, Marrakech, Morocco, July 16-21, 2017