Skip to main content

Showing 1–19 of 19 results for author: Yi, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2409.06960  [pdf, other

    stat.ML cs.LG physics.data-an stat.AP

    Toward Model-Agnostic Detection of New Physics Using Data-Driven Signal Regions

    Authors: Soheun Yi, John Alison, Mikael Kuusela

    Abstract: In the search for new particles in high-energy physics, it is crucial to select the Signal Region (SR) in such a way that it is enriched with signal events if they are present. While most existing search methods set the region relying on prior domain knowledge, it may be unavailable for a completely novel particle that falls outside the current scope of understanding. We address this issue by prop… ▽ More

    Submitted 10 December, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 5 pages, 2 figures

  2. arXiv:2404.13321  [pdf

    stat.AP eess.SY

    Accelerated System-Reliability-based Disaster Resilience Analysis for Structural Systems

    Authors: Taeyong Kim, Sang-ri Yi

    Abstract: Resilience has emerged as a crucial concept for evaluating structural performance under disasters because of its ability to extend beyond traditional risk assessments, accounting for a system's ability to minimize disruptions and maintain functionality during recovery. To facilitate the holistic understanding of resilience performance in structural systems, a system-reliability-based disaster resi… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 25 pages, 18 figures

  3. arXiv:2403.11429  [pdf, other

    stat.AP

    Long-range Ising model for regional-scale seismic risk analysis

    Authors: Sebin Oh, Sang-ri Yi, Ziqi Wang

    Abstract: This study introduces the long-range Ising model from statistical mechanics to the Performance-Based Earthquake Engineering (PBEE) framework for regional seismic damage analysis. The application of the PBEE framework at a regional scale involves estimating the damage states of numerous structures, typically performed using fragility function-based stochastic simulations. However, these simulations… ▽ More

    Submitted 23 May, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  4. arXiv:2402.04582  [pdf, other

    stat.AP stat.ML

    Dimensionality reduction can be used as a surrogate model for high-dimensional forward uncertainty quantification

    Authors: Jungho Kim, Sang-ri Yi, Ziqi Wang

    Abstract: We introduce a method to construct a stochastic surrogate model from the results of dimensionality reduction in forward uncertainty quantification. The hypothesis is that the high-dimensional input augmented by the output of a computational model admits a low-dimensional representation. This assumption can be met by numerous uncertainty quantification applications with physics-based computational… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  5. arXiv:2306.02106  [pdf, other

    stat.AP

    Impacts of Innovation School System in Korea: A Latent Space Item Response Model with Neyman-Scott Point Process

    Authors: Seorim Yi, Minkyu Kim, Jaewoo Park, Minjeong Jeon, Ick Hoon Jin

    Abstract: South Korea's educational system has faced criticism for its lack of focus on critical thinking and creativity, resulting in high levels of stress and anxiety among students. As part of the government's effort to improve the educational system, the innovation school system was introduced in 2009, which aims to develop students' creativity as well as their non-cognitive skills. To better understand… ▽ More

    Submitted 27 May, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

  6. arXiv:2207.06587  [pdf, other

    stat.ME stat.AP

    A Spatio-Temporal Dirichlet Process Mixture Model for Coronavirus Disease-19

    Authors: Jaewoo Park, Seorim Yi, Won Chang, Jorge Mateu

    Abstract: Understanding the spatio-temporal patterns of the coronavirus disease 2019 (COVID-19) is essential to construct public health interventions. Spatially referenced data can provide richer opportunities to understand the mechanism of the disease spread compared to the more often encountered aggregated count data. We propose a spatio-temporal Dirichlet process mixture model to analyze confirmed cases… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: 26 pages, 10 figures

  7. arXiv:2206.12891  [pdf, other

    stat.ME

    Hierarchical nuclear norm penalization for multi-view data

    Authors: Sangyoon Yi, Raymond K. W. Wong, Irina Gaynanova

    Abstract: The prevalence of data collected on the same set of samples from multiple sources (i.e., multi-view data) has prompted significant development of data integration methods based on low-rank matrix factorizations. These methods decompose signal matrices from each view into the sum of shared and individual structures, which are further used for dimension reduction, exploratory analyses, and quantifyi… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: 39 pages, 10 figures, 3 tables

  8. arXiv:2201.00459  [pdf, ps, other

    stat.AP

    A sampling scheme for estimating the prevalence of a pandemic

    Authors: Ze Liu, Siyu Yi, Jianghu, Dong, Min-Qian Liu, Yongdao Zhou

    Abstract: The spread of COVID-19 makes it essential to investigate its prevalence. In such investigation research, as far as we know, the widely-used sampling methods didn't use the information sufficiently about the numbers of the previously diagnosed cases, which provides a priori information about the true numbers of infections. This motivates us to develop a new, two-stage sampling method in this paper,… ▽ More

    Submitted 2 January, 2022; originally announced January 2022.

  9. arXiv:2011.08753  [pdf, other

    stat.ML cs.LG

    Confounding Feature Acquisition for Causal Effect Estimation

    Authors: Shirly Wang, Seung Eun Yi, Shalmali Joshi, Marzyeh Ghassemi

    Abstract: Reliable treatment effect estimation from observational data depends on the availability of all confounding information. While much work has targeted treatment effect estimation from observational data, there is relatively little work in the setting of confounding variable missingness, where collecting more information on confounders is often costly or time-consuming. In this work, we frame this c… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  10. arXiv:2011.04868  [pdf, other

    cs.LG math.OC stat.ML

    Neural Network Compression Via Sparse Optimization

    Authors: Tianyi Chen, Bo Ji, Yixin Shi, Tianyu Ding, Biyi Fang, Sheng Yi, Xiao Tu

    Abstract: The compression of deep neural networks (DNNs) to reduce inference cost becomes increasingly important to meet realistic deployment requirements of various applications. There have been a significant amount of work regarding network compression, while most of them are heuristic rule-based or typically not friendly to be incorporated into varying scenarios. On the other hand, sparse optimization yi… ▽ More

    Submitted 11 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

  11. arXiv:2007.10740  [pdf, other

    cs.LG cs.CV stat.ML

    Balanced Meta-Softmax for Long-Tailed Visual Recognition

    Authors: Jiawei Ren, Cunjun Yu, Shunan Sheng, Xiao Ma, Haiyu Zhao, Shuai Yi, Hongsheng Li

    Abstract: Deep classifiers have achieved great success in visual recognition. However, real-world data is long-tailed by nature, leading to the mismatch between training and testing distributions. In this paper, we show that the Softmax function, though used in most classification tasks, gives a biased gradient estimation under the long-tailed setup. This paper presents Balanced Softmax, an elegant unbiased… ▽ More

    Submitted 22 November, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020 camera-ready; Code available at https://github.com/jiawei-ren/BalancedMetaSoftmax

  12. arXiv:2007.05181  [pdf, other

    cs.LG stat.ML

    Sample-based Regularization: A Transfer Learning Strategy Toward Better Generalization

    Authors: Yunho Jeon, Yongseok Choi, Jaesun Park, Subin Yi, Dongyeon Cho, Jiwon Kim

    Abstract: Training a deep neural network with a small amount of data is a challenging problem as it is vulnerable to overfitting. However, one of the practical difficulties that we often face is to collect many samples. Transfer learning is a cost-effective solution to this problem. By using the source model trained with a large-scale dataset, the target model can alleviate the overfitting originated from t… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

  13. arXiv:2004.03639  [pdf, other

    math.OC cs.LG stat.ML

    Orthant Based Proximal Stochastic Gradient Method for $\ell_1$-Regularized Optimization

    Authors: Tianyi Chen, Tianyu Ding, Bo Ji, Guanyi Wang, Jing Tian, Yixin Shi, Sheng Yi, Xiao Tu, Zhihui Zhu

    Abstract: Sparsity-inducing regularization problems are ubiquitous in machine learning applications, ranging from feature selection to model compression. In this paper, we present a novel stochastic method -- Orthant Based Proximal Stochastic Gradient Method (OBProx-SG) -- to solve perhaps the most popular instance, i.e., the l1-regularized problem. The OBProx-SG method contains two steps: (i) a proximal st… ▽ More

    Submitted 23 July, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted by ECML 2020

  14. arXiv:1910.07632  [pdf, other

    cs.LG stat.ML

    Adaptive Transfer Learning of Multi-View Time Series Classification

    Authors: Donglin Zhan, Shiyu Yi, Dongli Xu, Xiao Yu, Denglin Jiang, Siqi Yu, Haoting Zhang, Wenfang Shangguan, Weihua Zhang

    Abstract: Time Series Classification (TSC) has been an important and challenging task in data mining, especially on multivariate time series and multi-view time series data sets. Meanwhile, transfer learning has been widely applied in computer vision and natural language processing applications to improve deep neural network's generalization capabilities. However, very few previous works applied transfer le… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: 12 pages, 5 figures

  15. arXiv:1910.02519  [pdf, other

    cs.LG stat.ML

    FIS-GAN: GAN with Flow-based Importance Sampling

    Authors: Shiyu Yi, Donglin Zhan, Wenqing Zhang, Denglin Jiang, Kang An, Hao Wang

    Abstract: Generative Adversarial Networks (GAN) training process, in most cases, apply Uniform or Gaussian sampling methods in the latent space, which probably spends most of the computation on examples that can be properly handled and easy to generate. Theoretically, importance sampling speeds up stochastic optimization in supervised learning by prioritizing training examples. In this paper, we explore the… ▽ More

    Submitted 16 December, 2022; v1 submitted 6 October, 2019; originally announced October 2019.

  16. arXiv:1909.04999  [pdf, other

    cs.LG cs.CV stat.ML

    Domain-Agnostic Few-Shot Classification by Learning Disparate Modulators

    Authors: Yongseok Choi, Junyoung Park, Subin Yi, Dong-Yeon Cho

    Abstract: Although few-shot learning research has advanced rapidly with the help of meta-learning, its practical usefulness is still limited because most of them assumed that all meta-training and meta-testing examples came from a single domain. We propose a simple but effective way for few-shot classification in which a task distribution spans multiple domains including ones never seen during meta-training… ▽ More

    Submitted 17 September, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: Presented at NeurIPS 2019 Workshop on Meta-Learning (MetaLearn 2019)

  17. arXiv:1906.01819  [pdf, other

    cs.LG stat.ML

    Discriminative Few-Shot Learning Based on Directional Statistics

    Authors: Junyoung Park, Subin Yi, Yongseok Choi, Dong-Yeon Cho, Jiwon Kim

    Abstract: Metric-based few-shot learning methods try to overcome the difficulty due to the lack of training examples by learning embedding to make comparison easy. We propose a novel algorithm to generate class representatives for few-shot classification tasks. As a probabilistic model for learned features of inputs, we consider a mixture of von Mises-Fisher distributions which is known to be more expressiv… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

  18. arXiv:1812.07699  [pdf, other

    cs.LG stat.ML

    A Comparison of LSTMs and Attention Mechanisms for Forecasting Financial Time Series

    Authors: Thomas Hollis, Antoine Viscardi, Seung Eun Yi

    Abstract: While LSTMs show increasingly promising results for forecasting Financial Time Series (FTS), this paper seeks to assess if attention mechanisms can further improve performance. The hypothesis is that attention can help prevent long-term dependencies experienced by LSTM models. To test this hypothesis, the main contribution of this paper is the implementation of an LSTM with attention. Both the ben… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

  19. arXiv:1805.00215  [pdf, other

    cs.LG cs.AI stat.ML

    Internal node bagging

    Authors: Shun Yi

    Abstract: We introduce a novel view to understand how dropout works as an inexplicit ensemble learning method, which doesn't point out how many and which nodes to learn a certain feature. We propose a new training method named internal node bagging, it explicitly forces a group of nodes to learn a certain feature in training time, and combine those nodes to be one node in inference time. It means we can use… ▽ More

    Submitted 20 September, 2018; v1 submitted 1 May, 2018; originally announced May 2018.