Showing 1–8 of 8 results for author: Park, J Y

  1. arXiv:2411.10957  [pdf, other]

    cs.LG cs.AI stat.ML

    IMPaCT GNN: Imposing invariance with Message Passing in Chronological split Temporal Graphs

    Authors: Sejun Park, Joo Young Park, Hyunwoo Park

    Abstract: This paper addresses domain adaptation challenges in graph data resulting from chronological splits. In a transductive graph learning setting, where each node is associated with a timestamp, we focus on the task of Semi-Supervised Node Classification (SSNC), aiming to classify recent nodes using labels of past nodes. Temporal dependencies in node connections create domain shifts, causing significa…

    Submitted 16 November, 2024; originally announced November 2024.

    Comments: 11 pages (without appendix), 35 pages (with appendix), 14 figures

  2. arXiv:2309.10284  [pdf, other]

    stat.ME math.ST stat.AP

    Rank-adaptive covariance testing with applications to genomics and neuroimaging

    Authors: David Veitch, Yinqiu He, Jun Young Park

    Abstract: In biomedical studies, testing for differences in covariance offers scientific insights beyond mean differences, especially when differences are driven by complex joint behavior between features. However, when differences in joint behavior are weakly dispersed across many dimensions and arise from differences in low-rank structures within the data, as is often the case in genomics and neuroimaging…

    Submitted 21 November, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

  3. arXiv:2303.04745  [pdf, other]

    cs.LG stat.ML

    A General Theory of Correct, Incorrect, and Extrinsic Equivariance

    Authors: Dian Wang, Xupeng Zhu, Jung Yeon Park, Mingxi Jia, Guanang Su, Robert Platt, Robin Walters

    Abstract: Although equivariant machine learning has proven effective at many tasks, success depends heavily on the assumption that the ground truth function is symmetric over the entire domain matching the symmetry in an equivariant neural network. A missing piece in the equivariant learning literature is the analysis of equivariant networks when symmetry exists only partially in the domain. In this work, w…

    Submitted 28 October, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Published at NeurIPS 2023

  4. The Building Data Genome Project 2, energy meter data from the ASHRAE Great Energy Predictor III competition

    Authors: Clayton Miller, Anjukan Kathirgamanathan, Bianca Picchetti, Pandarasamy Arjunan, June Young Park, Zoltan Nagy, Paul Raftery, Brodie W. Hobson, Zixiao Shi, Forrest Meggers

    Abstract: This paper describes an open data set of 3,053 energy meters from 1,636 non-residential buildings with a range of two full years (2016 and 2017) at an hourly frequency (17,544 measurements per meter resulting in approximately 53.6 million measurements). These meters were collected from 19 sites across North America and Europe, with one or more meters per building measuring whole building electrica…

    Submitted 16 August, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Journal ref: Scientific Data volume 7, Article number: 368 (2020)

  5. arXiv:2002.05578  [pdf, other]

    cs.LG stat.ML

    Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis

    Authors: Jung Yeon Park, Kenneth Theo Carr, Stephan Zheng, Yisong Yue, Rose Yu

    Abstract: Efficient and interpretable spatial analysis is crucial in many fields such as geology, sports, and climate science. Tensor latent factor models can describe higher-order correlations for spatial data. However, they are computationally expensive to train and are sensitive to initialization, leading to spatially incoherent, uninterpretable results. We develop a novel Multiresolution Tensor Learning…

    Submitted 14 August, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  6. arXiv:2002.02601  [pdf, other]

    stat.ML cs.LG q-bio.QM stat.AP stat.ME

    Bidimensional linked matrix factorization for pan-omics pan-cancer analysis

    Authors: Eric F. Lock, Jun Young Park, Katherine A. Hoadley

    Abstract: Several modern applications require the integration of multiple large data matrices that have shared rows and/or columns. For example, cancer studies that integrate multiple omics platforms across multiple types of cancer, pan-omics pan-cancer analysis, have extended our knowledge of molecular heterogeneity beyond what was observed in single tumor and single platform studies. However, these studies…

    Submitted 7 April, 2022; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: 26 pages, 5 figures

    Journal ref: Annals of Applied Statistics 2022, Vol. 16, No. 1, 193-215

  7. arXiv:1906.03722  [pdf, other]

    stat.ML cs.LG q-bio.QM stat.ME

    Integrative Factorization of Bidimensionally Linked Matrices

    Authors: Jun Young Park, Eric F. Lock

    Abstract: Advances in molecular "omics" technologies have motivated new methodology for the integration of multiple sources of high-content biomedical data. However, most statistical methods for integrating multiple data matrices only consider data shared vertically (one cohort on multiple platforms) or horizontally (different cohorts on a single platform). This is limiting for data that take the form of b…

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 27 pages, 4 figures

    Journal ref: Biometrics, 2019

  8. arXiv:1411.5732  [pdf]

    cs.CL cs.IR cs.LG stat.ML

    A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems

    Authors: Suleyman Cetintas, Luo Si, Yan Ping Xin, Dake Zhang, Joo Young Park, Ron Tzur

    Abstract: Estimating the difficulty level of math word problems is an important task for many educational applications. Identification of relevant and irrelevant sentences in math word problems is an important step for calculating the difficulty levels of such problems. This paper addresses a novel application of text categorization to identify two types of sentences in mathematical word problems, namely re…

    Submitted 20 November, 2014; originally announced November 2014.

    Comments: Appears in Journal of Educational Data Mining (JEDM, 2010)