Skip to main content

Showing 1–32 of 32 results for author: Chung, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.13404  [pdf, other

    cs.AI cs.LG stat.ML

    Fed-Joint: Joint Modeling of Nonlinear Degradation Signals and Failure Events for Remaining Useful Life Prediction using Federated Learning

    Authors: Cheoljoon Jeong, Xubo Yue, Seokhyun Chung

    Abstract: Many failure mechanisms of machinery are closely related to the behavior of condition monitoring (CM) signals. To achieve a cost-effective preventive maintenance strategy, accurate remaining useful life (RUL) prediction based on the signals is of paramount importance. However, the CM signals are often recorded at different factories and production lines, with limited amounts of data. Unfortunately… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  2. arXiv:2502.15104  [pdf, other

    q-bio.NC stat.ML

    Estimating Neural Representation Alignment from Sparsely Sampled Inputs and Features

    Authors: Chanwoo Chun, Abdulkadir Canatar, SueYeon Chung, Daniel D. Lee

    Abstract: In both artificial and biological systems, the centered kernel alignment (CKA) has become a widely used tool for quantifying neural representation similarity. While current CKA estimators typically correct for the effects of finite stimuli sampling, the effects of sampling a subset of neurons are overlooked, introducing notable bias in standard experimental scenarios. Here, we provide a theoretica… ▽ More

    Submitted 24 February, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  3. arXiv:2412.05439  [pdf, other

    cond-mat.dis-nn q-bio.NC stat.ML

    Statistical Mechanics of Support Vector Regression

    Authors: Abdulkadir Canatar, SueYeon Chung

    Abstract: A key problem in deep learning and computational neuroscience is relating the geometrical properties of neural representations to task performance. Here, we consider this problem for continuous decoding tasks where neural variability may affect task precision. Using methods from statistical mechanics, we study the average-case learning curves for $\varepsilon$-insensitive Support Vector Regression… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

  4. arXiv:2410.17998  [pdf, other

    cs.LG math.SP math.ST stat.ML

    Estimating the Spectral Moments of the Kernel Integral Operator from Finite Sample Matrices

    Authors: Chanwoo Chun, SueYeon Chung, Daniel D. Lee

    Abstract: Analyzing the structure of sampled features from an input data distribution is challenging when constrained by limited measurements in both the number of inputs and features. Traditional approaches often rely on the eigenvalue spectrum of the sample covariance matrix derived from finite measurement matrices; however, these spectra are sensitive to the size of the measurement matrix, leading to bia… ▽ More

    Submitted 8 February, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: Accepted for publication in the Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025

  5. arXiv:2407.16935  [pdf, other

    stat.ML cs.LG

    Federated Automatic Latent Variable Selection in Multi-output Gaussian Processes

    Authors: Jingyi Gao, Seokhyun Chung

    Abstract: This paper explores a federated learning approach that automatically selects the number of latent processes in multi-output Gaussian processes (MGPs). The MGP has seen great success as a transfer learning tool when data is generated from multiple sources/units/entities. A common approach in MGPs to transfer knowledge across units involves gathering all data from each unit to a central server and e… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  6. arXiv:2405.06851  [pdf, other

    q-bio.NC cond-mat.dis-nn cond-mat.stat-mech cs.NE stat.ML

    Nonlinear classification of neural manifolds with contextual information

    Authors: Francesca Mignacco, Chi-Ning Chou, SueYeon Chung

    Abstract: Understanding how neural systems efficiently process information through distributed representations is a fundamental challenge at the interface of neuroscience and machine learning. Recent approaches analyze the statistical and geometrical attributes of neural representations as population-level mechanistic descriptors of task implementation. In particular, manifold capacity has emerged as a prom… ▽ More

    Submitted 30 March, 2025; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: 7 pages, 7 figures

  7. arXiv:2403.16377  [pdf, other

    cs.LG eess.SY stat.ML

    Real-time Adaptation for Condition Monitoring Signal Prediction using Label-aware Neural Processes

    Authors: Seokhyun Chung, Raed Al Kontar

    Abstract: Building a predictive model that rapidly adapts to real-time condition monitoring (CM) signals is critical for engineering systems/units. Unfortunately, many current methods suffer from a trade-off between representation power and agility in online settings. For instance, parametric methods that assume an underlying functional form for CM signals facilitate efficient online prediction updates. How… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  8. arXiv:2312.10072  [pdf, other

    cs.HC cs.AI cs.LG stat.AP

    Assessing the Usability of GutGPT: A Simulation Study of an AI Clinical Decision Support System for Gastrointestinal Bleeding Risk

    Authors: Colleen Chan, Kisung You, Sunny Chung, Mauro Giuffrè, Theo Saarinen, Niroop Rajashekar, Yuan Pu, Yeo Eun Shin, Loren Laine, Ambrose Wong, René Kizilcec, Jasjeet Sekhon, Dennis Shung

    Abstract: Applications of large language models (LLMs) like ChatGPT have potential to enhance clinical decision support through conversational interfaces. However, challenges of human-algorithmic interaction and clinician trust are poorly understood. GutGPT, a LLM for gastrointestinal (GI) bleeding risk prediction and management guidance, was deployed in clinical simulation scenarios alongside the electroni… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10, 2023, New Orleans, United States, 11 pages

  9. arXiv:2311.02838  [pdf, other

    stat.ML cs.LG eess.SP

    Barron Space for Graph Convolution Neural Networks

    Authors: Seok-Young Chung, Qiyu Sun

    Abstract: Graph convolutional neural network (GCNN) operates on graph domain and it has achieved a superior performance to accomplish a wide range of tasks. In this paper, we introduce a Barron space of functions on a compact domain of graph signals. We prove that the proposed Barron space is a reproducing kernel Banach space, it can be decomposed into the union of a family of reproducing kernel Hilbert spa… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  10. arXiv:2310.17571  [pdf, other

    econ.EM stat.ML

    Inside the black box: Neural network-based real-time prediction of US recessions

    Authors: Seulki Chung

    Abstract: Long short-term memory (LSTM) and gated recurrent unit (GRU) are used to model US recessions from 1967 to 2021. Their predictive performances are compared to those of the traditional linear models. The out-of-sample performance suggests the application of LSTM and GRU in recession forecasting, especially for longer-term forecasts. The Shapley additive explanations (SHAP) method is applied to both… ▽ More

    Submitted 23 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  11. arXiv:2211.14961  [pdf, ps, other

    q-bio.NC cond-mat.dis-nn cond-mat.stat-mech cs.NE stat.ML

    Linear Classification of Neural Manifolds with Correlated Variability

    Authors: Albert J. Wakhloo, Tamara J. Sussman, SueYeon Chung

    Abstract: Understanding how the statistical and geometric properties of neural activity relate to performance is a key problem in theoretical neuroscience and deep learning. Here, we calculate how correlations between object representations affect the capacity, a measure of linear separability. We show that for spherical object manifolds, introducing correlations between centroids effectively pushes the sph… ▽ More

    Submitted 13 July, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: 6 pages and 5 figures in main text. 13 pages and 1 figure in supplementary material

    Journal ref: Phys. Rev. Lett. 131, 027301 (2023)

  12. arXiv:2211.14827  [pdf, other

    cs.LG cs.AI stat.ML

    Domain Generalization for Robust Model-Based Offline Reinforcement Learning

    Authors: Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger

    Abstract: Existing offline reinforcement learning (RL) algorithms typically assume that training data is either: 1) generated by a known policy, or 2) of entirely unknown origin. We consider multi-demonstrator offline RL, a middle ground where we know which demonstrators generated each dataset, but make no assumptions about the underlying policies of the demonstrators. This is the most natural setting when… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted to the NeurIPS 2022 Workshops on Distribution Shifts and Offline Reinforcement Learning

  13. arXiv:2202.02649  [pdf, other

    stat.ML cs.LG q-bio.NC

    The Implicit Bias of Gradient Descent on Generalized Gated Linear Networks

    Authors: Samuel Lippl, L. F. Abbott, SueYeon Chung

    Abstract: Understanding the asymptotic behavior of gradient-descent training of deep neural networks is essential for revealing inductive biases and improving network performance. We derive the infinite-time training limit of a mathematically tractable class of deep nonlinear neural networks, gated linear networks (GLNs), and generalize these results to gated networks described by general homogeneous polyno… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: 23 pages, 5 figures

  14. arXiv:2105.14602  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    On the geometry of generalization and memorization in deep neural networks

    Authors: Cory Stephenson, Suchismita Padhy, Abhinav Ganesh, Yue Hui, Hanlin Tang, SueYeon Chung

    Abstract: Understanding how large neural networks avoid memorizing training data is key to explaining their high generalization performance. To examine the structure of when and where memorization occurs in a deep network, we use a recently developed replica-based mean field theoretic geometric analysis method. We find that all layers preferentially learn from examples which share features, and link this be… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: ICLR 2021

  15. arXiv:2008.13044  [pdf, other

    cs.LG cs.AI stat.ML

    Reinforcement Learning with Feedback-modulated TD-STDP

    Authors: Stephen Chung, Robert Kozma

    Abstract: Spiking neuron networks have been used successfully to solve simple reinforcement learning tasks with continuous action set applying learning rules based on spike-timing-dependent plasticity (STDP). However, most of these models cannot be applied to reinforcement learning tasks with discrete action set since they assume that the selected action is a deterministic function of firing rate of neurons… ▽ More

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: 17 pages, 4 figures

    ACM Class: I.2.8

  16. Weakly-supervised Multi-output Regression via Correlated Gaussian Processes

    Authors: Seokhyun Chung, Raed Al Kontar, Zhenke Wu

    Abstract: Multi-output regression seeks to borrow strength and leverage commonalities across different but related outputs in order to enhance learning and prediction accuracy. A fundamental assumption is that the output/group membership labels for all observations are known. This assumption is often violated in real applications. For instance, in healthcare datasets, sensitive attributes such as ethnicity… ▽ More

    Submitted 23 May, 2022; v1 submitted 19 February, 2020; originally announced February 2020.

    Journal ref: Informs Journal on Data Science, 2022

  17. arXiv:2002.05318  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Online Optimization with Memory and Competitive Control

    Authors: Guanya Shi, Yiheng Lin, Soon-Jo Chung, Yisong Yue, Adam Wierman

    Abstract: This paper presents competitive algorithms for a novel class of online optimization problems with memory. We consider a setting where the learner seeks to minimize the sum of a hitting cost and a switching cost that depends on the previous $p$ decisions. This setting generalizes Smoothed Online Convex Optimization. The proposed approach, Optimistic Regularized Online Balanced Descent, achieves a c… ▽ More

    Submitted 8 January, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Neural Information Processing Systems (NeurIPS 2020)

  18. arXiv:1912.02522  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge

    Authors: Joon Son Chung, Arsha Nagrani, Ernesto Coto, Weidi Xie, Mitchell McLaren, Douglas A Reynolds, Andrew Zisserman

    Abstract: The VoxCeleb Speaker Recognition Challenge 2019 aimed to assess how well current speaker recognition technology is able to identify speakers in unconstrained or `in the wild' data. It consisted of: (i) a publicly available speaker recognition dataset from YouTube videos together with ground truth annotation and standardised evaluation software; and (ii) a public challenge and workshop held at Inte… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: ISCA Archive

  19. arXiv:1911.11943  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Novelty Detection Via Blurring

    Authors: Sungik Choi, Sae-Young Chung

    Abstract: Conventional out-of-distribution (OOD) detection schemes based on variational autoencoder or Random Network Distillation (RND) have been observed to assign lower uncertainty to the OOD than the target distribution. In this work, we discover that such conventional novelty detection schemes are also vulnerable to the blurred images. Based on the observation, we construct a novel RND-based OOD detect… ▽ More

    Submitted 3 March, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: ICLR 2020

  20. arXiv:1911.09816  [pdf, other

    eess.IV cs.CV stat.AP

    Two-stage dimension reduction for noisy high-dimensional images and application to Cryogenic Electron Microscopy

    Authors: Szu-Chi Chung, Shao-Hsuan Wang, Po-Yao Niu, Su-Yun Huang, Wei-Hau Chang, I-Ping Tu

    Abstract: Principal component analysis (PCA) is arguably the most widely used dimension-reduction method for vector-type data. When applied to a sample of images, PCA requires vectorization of the image data, which in turn entails solving an eigenvalue problem for the sample covariance matrix. We propose herein a two-stage dimension reduction (2SDR) method for image reconstruction from high-dimensional nois… ▽ More

    Submitted 27 February, 2021; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: 29 pages, 8 figures and 3 tables

    Journal ref: Annals of Mathematical Sciences and Applications. Volume 5, Number 2, 283-316, 2020

  21. arXiv:1910.09792  [pdf, other

    cs.LG cs.CV stat.ML

    Robust Training with Ensemble Consensus

    Authors: Jisoo Lee, Sae-Young Chung

    Abstract: Since deep neural networks are over-parameterized, they can memorize noisy examples. We address such a memorization issue in the presence of label noise. From the fact that deep neural networks cannot generalize to neighborhoods of memorized features, we hypothesize that noisy examples do not consistently incur small losses on the network under a certain perturbation. Based on this, we propose a n… ▽ More

    Submitted 11 November, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: ICLR 2020

  22. arXiv:1908.04752  [pdf, other

    eess.IV cs.LG q-bio.QM stat.ML

    Identification of relevant diffusion MRI metrics impacting cognitive functions using a novel feature selection method

    Authors: Tongda Xu, Xiyan Cai, Yao Wang, Xiuyuan Wang, Sohae Chung, Els Fieremans, Joseph Rath, Steven Flanagan, Yvonne W Lui

    Abstract: Mild Traumatic Brain Injury (mTBI) is a significant public health problem. The most troubling symptoms after mTBI are cognitive complaints. Studies show measurable differences between patients with mTBI and healthy controls with respect to tissue microstructure using diffusion MRI. However, it remains unclear which diffusion measures are the most informative with regard to cognitive functions in b… ▽ More

    Submitted 11 November, 2019; v1 submitted 10 August, 2019; originally announced August 2019.

  23. arXiv:1906.05819  [pdf, other

    cs.LG eess.SY stat.ML

    Robust Regression for Safe Exploration in Control

    Authors: Anqi Liu, Guanya Shi, Soon-Jo Chung, Anima Anandkumar, Yisong Yue

    Abstract: We study the problem of safe learning and exploration in sequential control problems. The goal is to safely collect data samples from operating in an environment, in order to learn to achieve a challenging control goal (e.g., an agile maneuver close to a boundary). A central challenge in this setting is how to quantify uncertainty in order to choose provably-safe actions that allow us to collect i… ▽ More

    Submitted 26 June, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: 2nd Annual Conference on Learning for Dynamics and Control

  24. Fourier Phase Retrieval with Extended Support Estimation via Deep Neural Network

    Authors: Kyung-Su Kim, Sae-Young Chung

    Abstract: We consider the problem of sparse phase retrieval from Fourier transform magnitudes to recover the $k$-sparse signal vector and its support $\mathcal{T}$. We exploit extended support estimate $\mathcal{E}$ with size larger than $k$ satisfying $\mathcal{E} \supseteq \mathcal{T}$ and obtained by a trained deep neural network (DNN). To make the DNN learnable, it provides $\mathcal{E}$ as the union of… ▽ More

    Submitted 13 August, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

  25. arXiv:1904.00864  [pdf, other

    cs.LG eess.SP stat.ML

    Tree Search Network for Sparse Regression

    Authors: Kyung-Su Kim, Sae-Young Chung

    Abstract: We consider the classical sparse regression problem of recovering a sparse signal $x_0$ given a measurement vector $y = Φx_0+w$. We propose a tree search algorithm driven by the deep neural network for sparse regression (TSN). TSN improves the signal reconstruction performance of the deep neural network designed for sparse regression by performing a tree search with pruning. It is observed in both… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

  26. arXiv:1903.08297  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening

    Authors: Nan Wu, Jason Phang, Jungkyu Park, Yiqiu Shen, Zhe Huang, Masha Zorin, Stanisław Jastrzębski, Thibault Févry, Joe Katsnelson, Eric Kim, Stacey Wolfson, Ujas Parikh, Sushma Gaddam, Leng Leng Young Lin, Kara Ho, Joshua D. Weinstein, Beatriu Reig, Yiming Gao, Hildegard Toth, Kristine Pysarenko, Alana Lewin, Jiyon Lee, Krystal Airola, Eralda Mema, Stephanie Chung , et al. (7 additional authors not shown)

    Abstract: We present a deep convolutional neural network for breast cancer screening exam classification, trained and evaluated on over 200,000 exams (over 1,000,000 images). Our network achieves an AUC of 0.895 in predicting whether there is a cancer in the breast, when tested on the screening population. We attribute the high accuracy of our model to a two-stage training procedure, which allows us to use… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: MIDL 2019 [arXiv:1907.08612]

    Report number: MIDL/2019/ExtendedAbstract/SkxYez76FE

  27. Functional Principal Component Analysis for Extrapolating Multi-stream Longitudinal Data

    Authors: Seokhyun Chung, Raed Kontar

    Abstract: The advance of modern sensor technologies enables collection of multi-stream longitudinal data where multiple signals from different units are collected in real-time. In this article, we present a non-parametric approach to predict the evolution of multi-stream longitudinal data for an in-service unit through borrowing strength from other historical units. Our approach first decomposes each stream… ▽ More

    Submitted 9 March, 2019; originally announced March 2019.

    Report number: Volume: 70, Issue: 4, December 2021

    Journal ref: IEEE Transactions on Reliability, 2020

  28. arXiv:1805.12375  [pdf, other

    cs.LG stat.ML

    Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

    Authors: Su Young Lee, Sungik Choi, Sae-Young Chung

    Abstract: We propose Episodic Backward Update (EBU) - a novel deep reinforcement learning algorithm with a direct value propagation. In contrast to the conventional use of the experience replay with uniform random sampling, our agent samples a whole episode and successively propagates the value of a state to its previous states. Our computationally efficient recursive algorithm allows sparse and delayed rew… ▽ More

    Submitted 11 November, 2019; v1 submitted 31 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2019

  29. A Mathematical Programming Approach for Integrated Multiple Linear Regression Subset Selection and Validation

    Authors: Seokhyun Chung, Young Woong Park, Taesu Cheong

    Abstract: Subset selection for multiple linear regression aims to construct a regression model that minimizes errors by selecting a small number of explanatory variables. Once a model is built, various statistical tests and diagnostics are conducted to validate the model and to determine whether the regression assumptions are met. Most traditional approaches require human decisions at this step. For example… ▽ More

    Submitted 26 July, 2020; v1 submitted 12 December, 2017; originally announced December 2017.

    Journal ref: Pattern Recognition 108(2020): 107565

  30. arXiv:1710.06487  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.NE q-bio.NC stat.ML

    Classification and Geometry of General Perceptual Manifolds

    Authors: SueYeon Chung, Daniel D. Lee, Haim Sompolinsky

    Abstract: Perceptual manifolds arise when a neural population responds to an ensemble of sensory signals associated with different physical features (e.g., orientation, pose, scale, location, and intensity) of the same perceptual object. Object recognition and discrimination requires classifying the manifolds in a manner that is insensitive to variability within a manifold. How neuronal systems give rise to… ▽ More

    Submitted 24 June, 2018; v1 submitted 17 October, 2017; originally announced October 2017.

    Comments: 24 pages, 12 figures, Supplementary Materials

    Journal ref: Phys. Rev. X 8, 031003 (2018)

  31. Learning Data Manifolds with a Cutting Plane Method

    Authors: SueYeon Chung, Uri Cohen, Haim Sompolinsky, Daniel D. Lee

    Abstract: We consider the problem of classifying data manifolds where each manifold represents invariances that are parameterized by continuous degrees of freedom. Conventional data augmentation methods rely upon sampling large numbers of training examples from these manifolds; instead, we propose an iterative algorithm called M_{CP} based upon a cutting-plane approach that efficiently solves a quadratic se… ▽ More

    Submitted 28 May, 2017; originally announced May 2017.

    Journal ref: Neural Computation. Volume:30, Issue:10, (2018) pp.2593-2615

  32. arXiv:1512.01834  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.NE q-bio.NC stat.ML

    Linear Readout of Object Manifolds

    Authors: SueYeon Chung, Daniel D. Lee, Haim Sompolinsky

    Abstract: Objects are represented in sensory systems by continuous manifolds due to sensitivity of neuronal responses to changes in physical features such as location, orientation, and intensity. What makes certain sensory representations better suited for invariant decoding of objects by downstream networks? We present a theory that characterizes the ability of a linear readout network, the perceptron, to… ▽ More

    Submitted 21 August, 2016; v1 submitted 6 December, 2015; originally announced December 2015.

    Comments: 5 pages, 3 figures, accepted in Physical Review E as Rapid Communication on 14th May. 2016

    Journal ref: Phys. Rev. E 93, 060301 (R) (2016)