Skip to main content

Showing 1–19 of 19 results for author: Chung, H W

Searching in archive stat. Search in all archives.
.
  1. arXiv:2501.02851  [pdf, other

    cs.SI cs.IT stat.ML

    Exact Matching in Correlated Networks with Node Attributes for Improved Community Recovery

    Authors: Joonhyuk Yang, Hye Won Chung

    Abstract: We study community detection in multiple networks whose nodes and edges are jointly correlated. This setting arises naturally in applications such as social platforms, where a shared set of users may exhibit both correlated friendship patterns and correlated attributes across different platforms. Extending the classical Stochastic Block Model (SBM) and its contextual counterpart (CSBM), we introdu… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: 30 pages, 3 figures

  2. arXiv:2406.03057  [pdf, other

    cs.LG stat.ML

    BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges

    Authors: Hoyong Choi, Nohyun Ki, Hye Won Chung

    Abstract: Data subset selection aims to find a smaller yet informative subset of a large dataset that can approximate the full-dataset training, addressing challenges associated with training neural networks on large-scale datasets. However, existing methods tend to specialize in either high or low selection ratio regimes, lacking a universal approach that consistently achieves competitive performance acros… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  3. arXiv:2402.10482  [pdf, other

    cs.LG stat.ML

    Rethinking Self-Distillation: Label Averaging and Enhanced Soft Label Refinement with Partial Labels

    Authors: Hyeonsu Jeong, Hye Won Chung

    Abstract: We investigate the mechanisms of self-distillation in multi-class classification, particularly in the context of linear probing with fixed feature extractors where traditional feature learning explanations do not apply. Our theoretical analysis reveals that multi-round self-distillation effectively performs label averaging among instances with high feature correlations, governed by the eigenvector… ▽ More

    Submitted 19 February, 2025; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ICLR 2025

  4. arXiv:2305.19666  [pdf, other

    cs.DS cs.LG cs.SI stat.ML

    Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation

    Authors: Joonhyuk Yang, Dongpil Shin, Hye Won Chung

    Abstract: We consider the problem of graph matching, or learning vertex correspondence, between two correlated stochastic block models (SBMs). The graph matching problem arises in various fields, including computer vision, natural language processing and bioinformatics, and in particular, matching graphs with inherent community structure has significance related to de-anonymization of correlated social netw… ▽ More

    Submitted 2 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  5. arXiv:2301.05331  [pdf, other

    math.ST cs.LG math.PR stat.ML

    Detection problems in the spiked matrix models

    Authors: Ji Hyung Jung, Hye Won Chung, Ji Oon Lee

    Abstract: We study the statistical decision process of detecting the low-rank signal from various signal-plus-noise type data matrices, known as the spiked random matrix models. We first show that the principal component analysis can be improved by entrywise pre-transforming the data matrix if the noise is non-Gaussian, generalizing the known results for the spiked random matrix models with rank-1 signals.… ▽ More

    Submitted 16 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: 80 pages, 6 figures. arXiv admin note: text overlap with arXiv:2104.13517

    MSC Class: 62H25; 62H15; 60B20

  6. arXiv:2301.00006  [pdf, other

    cs.HC cs.IT cs.LG stat.ML

    Recovering Top-Two Answers and Confusion Probability in Multi-Choice Crowdsourcing

    Authors: Hyeonsu Jeong, Hye Won Chung

    Abstract: Crowdsourcing has emerged as an effective platform for labeling large amounts of data in a cost- and time-efficient manner. Most previous work has focused on designing an efficient algorithm to recover only the ground-truth labels of the data. In this paper, we consider multi-choice crowdsourcing tasks with the goal of recovering not only the ground truth, but also the most confusing answer and th… ▽ More

    Submitted 31 May, 2023; v1 submitted 29 December, 2022; originally announced January 2023.

    Comments: ICML 2023

  7. arXiv:2212.09396  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Rank-1 Matrix Completion with Gradient Descent and Small Random Initialization

    Authors: Daesung Kim, Hye Won Chung

    Abstract: The nonconvex formulation of the matrix completion problem has received significant attention in recent years due to its affordable complexity compared to the convex formulation. Gradient Descent (GD) is a simple yet efficient baseline algorithm for solving nonconvex optimization problems. The success of GD has been witnessed in many different problems in both theory and practice when it is combin… ▽ More

    Submitted 2 July, 2025; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023

  8. arXiv:2204.05832  [pdf, other

    cs.CL cs.LG stat.ML

    What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?

    Authors: Thomas Wang, Adam Roberts, Daniel Hesslow, Teven Le Scao, Hyung Won Chung, Iz Beltagy, Julien Launay, Colin Raffel

    Abstract: Large pretrained Transformer language models have been shown to exhibit zero-shot generalization, i.e. they can perform a wide variety of tasks that they were not explicitly trained on. However, the architectures and pretraining objectives used across state-of-the-art models differ significantly, and there has been limited systematic comparison of these factors. In this work, we present a large-sc… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  9. arXiv:2203.00821  [pdf, other

    math.ST math.PR stat.ML

    Asymptotic Normality of Log Likelihood Ratio and Fundamental Limit of the Weak Detection for Spiked Wigner Matrices

    Authors: Hye Won Chung, Jiho Lee, Ji Oon Lee

    Abstract: We consider the problem of detecting the presence of a signal in a rank-one spiked Wigner model. For general non-Gaussian noise, assuming that the signal is drawn from the Rademacher prior, we prove that the log likelihood ratio (LR) of the spiked model against the null model converges to a Gaussian when the signal-to-noise ratio is below a certain threshold. The threshold is optimal in the sense… ▽ More

    Submitted 18 December, 2024; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 33 pages, 2 figures

    MSC Class: 62H15; 60F05; 82B44

  10. arXiv:2111.12550  [pdf, other

    cs.HC cs.IT cs.LG stat.ML

    A Worker-Task Specialization Model for Crowdsourcing: Efficient Inference and Fundamental Limits

    Authors: Doyeon Kim, Jeonghwan Lee, Hye Won Chung

    Abstract: Crowdsourcing system has emerged as an effective platform for labeling data with relatively low cost by using non-expert workers. Inferring correct labels from multiple noisy answers on data, however, has been a challenging problem, since the quality of the answers varies widely across tasks and workers. Many existing works have assumed that there is a fixed ordering of workers in terms of their s… ▽ More

    Submitted 13 September, 2023; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: To appear at IEEE Transactions on Information Theory

  11. arXiv:2104.13517  [pdf, other

    math.ST cs.LG math.PR stat.ML

    Detection of Signal in the Spiked Rectangular Models

    Authors: Ji Hyung Jung, Hye Won Chung, Ji Oon Lee

    Abstract: We consider the problem of detecting signals in the rank-one signal-plus-noise data matrix models that generalize the spiked Wishart matrices. We show that the principal component analysis can be improved by pre-transforming the matrix entries if the noise is non-Gaussian. As an intermediate step, we prove a sharp phase transition of the largest eigenvalues of spiked rectangular matrices, which ex… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: 38 pages, 6 figures

    MSC Class: 62H25; 62H15; 60B20

  12. arXiv:2008.06808  [pdf, other

    cs.LG stat.ML

    Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition

    Authors: Henry Tsai, Jayden Ooi, Chun-Sung Ferng, Hyung Won Chung, Jason Riesa

    Abstract: Transformer-based models have achieved stateof-the-art results in many tasks in natural language processing. However, such models are usually slow at inference time, making deployment difficult. In this paper, we develop an efficient algorithm to search for fast models while maintaining model quality. We describe a novel approach to decompose the Transformer architecture into smaller components, a… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

  13. arXiv:2004.00101  [pdf, ps, other

    cs.HC cs.LG stat.ML

    Crowdsourced Labeling for Worker-Task Specialization Model

    Authors: Doyeon Kim, Hye Won Chung

    Abstract: We consider crowdsourced labeling under a $d$-type worker-task specialization model, where each worker and task is associated with one particular type among a finite set of types and a worker provides a more reliable answer to tasks of the matched type than to tasks of unmatched types. We design an inference algorithm that recovers binary task labels (up to any given recovery accuracy) by using wo… ▽ More

    Submitted 9 June, 2021; v1 submitted 21 March, 2020; originally announced April 2020.

    Comments: To appear at IEEE International Symposium on Information Theory (ISIT) 2021

  14. arXiv:2003.10038  [pdf, other

    stat.ML cs.IT cs.LG

    Robust Hypergraph Clustering via Convex Relaxation of Truncated MLE

    Authors: Jeonghwan Lee, Daesung Kim, Hye Won Chung

    Abstract: We study hypergraph clustering in the weighted $d$-uniform hypergraph stochastic block model ($d$\textsf{-WHSBM}), where each edge consisting of $d$ nodes from the same community has higher expected weight than the edges consisting of nodes from different communities. We propose a new hypergraph clustering algorithm, called \textsf{CRTMLE}, and provide its performance guarantee under the $d$\texts… ▽ More

    Submitted 15 November, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: 20 pages, 4 figure

    Journal ref: Published at IEEE Journal on Selected Areas in Information Theory (JSAIT), Issue 3, 2020

  15. arXiv:2001.11775  [pdf, other

    cs.IT cs.LG stat.ML

    Binary Classification with XOR Queries: Fundamental Limits and An Efficient Algorithm

    Authors: Daesung Kim, Hye Won Chung

    Abstract: We consider a query-based data acquisition problem for binary classification of unknown labels, which has diverse applications in communications, crowdsourcing, recommender systems and active learning. To ensure reliable recovery of unknown labels with as few number of queries as possible, we consider an effective query type that asks "group attribute" of a chosen subset of objects. In particular,… ▽ More

    Submitted 30 April, 2021; v1 submitted 31 January, 2020; originally announced January 2020.

    Comments: Accepted to IEEE Transactions on Information Theory. 37 pages, 9 figures

  16. arXiv:2001.05676  [pdf, other

    math.ST cs.LG math.PR stat.ML

    Weak Detection in the Spiked Wigner Model with General Rank

    Authors: Ji Hyung Jung, Hye Won Chung, Ji Oon Lee

    Abstract: We study the statistical decision process of detecting the signal from a `signal+noise' type matrix model with an additive Wigner noise. We propose a hypothesis test based on the linear spectral statistics of the data matrix, which does not depend on the distribution of the signal or the noise. The test is optimal under the Gaussian noise if the signal-to-noise ratio is small, as it minimizes the… ▽ More

    Submitted 4 March, 2021; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: 35 pages, 3 figures

    MSC Class: 62H15; 60B20

  17. arXiv:1904.09109  [pdf, other

    cs.LG cs.IT stat.ML

    Shallow Neural Network can Perfectly Classify an Object following Separable Probability Distribution

    Authors: Youngjae Min, Hye Won Chung

    Abstract: Guiding the design of neural networks is of great importance to save enormous resources consumed on empirical decisions of architectural parameters. This paper constructs shallow sigmoid-type neural networks that achieve 100% accuracy in classification for datasets following a linear separability condition. The separability condition in this work is more relaxed than the widely used linear separab… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

    Comments: 5 pages. To be presented at the 2019 IEEE International Symposium on Information Theory (ISIT)

  18. arXiv:1809.10827  [pdf, other

    math.ST cs.LG math.PR stat.ML

    Weak detection in the spiked Wigner model

    Authors: Hye Won Chung, Ji Oon Lee

    Abstract: We consider the weak detection problem in a rank-one spiked Wigner data matrix where the signal-to-noise ratio is small so that reliable detection is impossible. We propose a hypothesis test on the presence of the signal by utilizing the linear spectral statistics of the data matrix. The test is data-driven and does not require prior knowledge about the distribution of the signal or the noise. Whe… ▽ More

    Submitted 10 November, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: 45 pages, 5 figures

    MSC Class: 62H15; 60B20

  19. arXiv:1804.05296  [pdf, other

    cs.CR cs.CY cs.LG stat.ML

    Adversarial Attacks Against Medical Deep Learning Systems

    Authors: Samuel G. Finlayson, Hyung Won Chung, Isaac S. Kohane, Andrew L. Beam

    Abstract: The discovery of adversarial examples has raised concerns about the practical deployment of deep learning systems. In this paper, we demonstrate that adversarial examples are capable of manipulating deep learning systems across three clinical domains. For each of our representative medical deep learning classifiers, both white and black box attacks were highly successful. Our models are representa… ▽ More

    Submitted 4 February, 2019; v1 submitted 14 April, 2018; originally announced April 2018.