Showing 101–150 of 515 results for author: Koo, J

  1. arXiv:2403.07189  [pdf, ps, other]

    cs.IT cond-mat.dis-nn math-ph math.ST

    A multiscale cavity method for sublinear-rank symmetric matrix factorization

    Authors: Jean Barbier, Justin Ko, Anas A. Rahman

    Abstract: We consider a statistical model for symmetric matrix factorization with additive Gaussian noise in the high-dimensional regime where the rank $M$ of the signal matrix to infer scales with its size $N$ as $M={\rm o}(\sqrt{\ln N})$. Allowing for an $N$-dependent rank offers new challenges and requires new methods. Working in the Bayes-optimal setting, we show that whenever the signal has i.i.d. entr…

    Submitted 20 March, 2025; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 64 pages. Filled out proof details, with one step being more involved than initially thought and resulting in changes to the main theorem

  2. arXiv:2403.04234  [pdf, other]

    stat.ML cs.LG

    Fundamental limits of Non-Linear Low-Rank Matrix Estimation

    Authors: Pierre Mergny, Justin Ko, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider the task of estimating a low-rank matrix from non-linear and noisy observations. We prove a strong universality result showing that Bayes-optimal performances are characterized by an equivalent Gaussian model with an effective prior, whose parameters are entirely determined by an expansion of the non-linear function. In particular, we show that to reconstruct the signal accurately, one…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 42 pages, 2 figures

  3. arXiv:2403.04134  [pdf, other]

    cs.RO

    An Adaptable, Safe, and Portable Robot-Assisted Feeding System

    Authors: Ethan Kroll Gordon, Rajat Kumar Jenamani, Amal Nanavati, Ziang Liu, Haya Bolotski, Raida Karim, Daniel Stabile, Atharva Kashyap, Bernie Hao Zhu, Xilai Dai, Tyler Schrenk, Jonathan Ko, Taylor Kessler Faulkner, Tapomayukh Bhattacharjee, Siddhartha Srinivasa

    Abstract: We demonstrate a robot-assisted feeding system that enables people with mobility impairments to feed themselves. Our system design embodies Safety, Portability, and User Control, with comprehensive full-stack safety checks, the ability to be mounted on and powered by any powered wheelchair, and a custom web-app allowing care-recipients to leverage their own assistive devices for robot control. For…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: HRI 2024 Demo; Corrected inaccurate author ordering in ACM DL which occurred due to formatting issues

  4. arXiv:2403.03695  [pdf, other]

    stat.ML cond-mat.dis-nn cs.LG math.PR math.ST

    Spectral Phase Transition and Optimal PCA in Block-Structured Spiked models

    Authors: Pierre Mergny, Justin Ko, Florent Krzakala

    Abstract: We discuss the inhomogeneous spiked Wigner model, a theoretical framework recently introduced to study structured noise in various learning scenarios, through the prism of random matrix theory, with a specific focus on its spectral properties. Our primary objective is to find an optimal spectral method and to extend the celebrated Baik-Ben Arous-Péché (BBP) phase transition criterion -- well-known in the ho…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 26 pages, 2 figures

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:35470-35491, 2024

  5. Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset

    Authors: Abbi Ward, Jimmy Li, Julie Wang, Sriram Lakshminarasimhan, Ashley Carrick, Bilson Campana, Jay Hartford, Pradeep Kumar S, Tiya Tiyasirichokchai, Sunny Virmani, Renee Wong, Yossi Matias, Greg S. Corrado, Dale R. Webster, Dawn Siegel, Steven Lin, Justin Ko, Alan Karthikesalingam, Christopher Semturs, Pooja Rao

    Abstract: Background: Health datasets from clinical sources do not reflect the breadth and diversity of disease in the real world, impacting research, medical education, and artificial intelligence (AI) tool development. Dermatology is a suitable area to develop and test a new and scalable method to create representative health datasets. Methods: We used Google Search advertisements to invite contribution…

    Submitted 28 February, 2024; originally announced February 2024.

    Journal ref: JAMA Network Open (2024)

  6. arXiv:2402.18293  [pdf, other]

    cs.CV

    Continuous Memory Representation for Anomaly Detection

    Authors: Joo Chan Lee, Taejune Kim, Eunbyung Park, Simon S. Woo, Jong Hwan Ko

    Abstract: There have been significant advancements in anomaly detection in an unsupervised manner, where only normal images are available for training. Several recent methods aim to detect anomalies based on a memory, comparing or reconstructing the input with directly stored normal features (or trained features with normal images). However, such memory-based approaches operate on a discrete feature space i…

    Submitted 24 July, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Project page: https://tae-mo.github.io/crad/

  7. arXiv:2402.17958  [pdf, other]

    astro-ph.CO astro-ph.GA

    Spatial Distribution of Intracluster Light versus Dark Matter in Horizon Run 5

    Authors: Jaewon Yoo, Changbom Park, Cristiano G. Sabiu, Ankit Singh, Jongwan Ko, Jaehyun Lee, Christophe Pichon, M. James Jee, Brad K. Gibson, Owain Snaith, Juhan Kim, Jihye Shin, Yonghwi Kim, Hyowon Kim

    Abstract: One intriguing approach for studying the dynamical evolution of galaxy clusters is to compare the spatial distributions among various components, such as dark matter, member galaxies, gas, and intracluster light (ICL). Utilizing the recently introduced Weighted Overlap Coefficient (WOC) \citep{2022ApJS..261...28Y}, we analyze the spatial distributions of components within 174 galaxy clusters (…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 23 pages, 12 figures, accepted for publication in ApJ

  8. Waveform Simulation for Scintillation Characteristics of NaI(Tl) Crystal

    Authors: J. J. Choi, C. Ha, E. J. Jeon, K. W. Kim, S. K. Kim, Y. D. Kim, Y. J. Ko, B. C. Koh, H. S. Lee, S. H. Lee, S. M. Lee, B. J. Park, G. H. Yu

    Abstract: The lowering of the energy threshold in the NaI detector is crucial not only for comprehensive validation of DAMA/LIBRA but also for exploring new possibilities in the search for low-mass dark matter and observing coherent elastic scattering between neutrino and nucleus. Alongside hardware enhancements, extensive efforts have focused on refining event selection to discern noise, achieved through p…

    Submitted 17 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Journal ref: NIM A 1065, 169489 (2024)

  9. arXiv:2402.16506  [pdf, other]

    cs.CV

    Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis

    Authors: Juyeon Ko, Inho Kong, Dogyun Park, Hyunwoo J. Kim

    Abstract: Semantic image synthesis (SIS) is a task to generate realistic images corresponding to semantic maps (labels). However, in real-world applications, SIS often encounters noisy user inputs. To address this, we propose Stochastic Conditional Diffusion Model (SCDM), which is a robust conditional diffusion model that features novel forward and generation processes tailored for SIS with noisy labels. It…

    Submitted 3 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  10. arXiv:2402.15566  [pdf]

    eess.IV cs.CV cs.LG

    Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings

    Authors: Rajeev V. Rikhye, Aaron Loh, Grace Eunhae Hong, Preeti Singh, Margaret Ann Smith, Vijaytha Muralidharan, Doris Wong, Rory Sayres, Michelle Phung, Nicolas Betancourt, Bradley Fong, Rachna Sahasrabudhe, Khoban Nasim, Alec Eschholz, Basil Mustafa, Jan Freyberg, Terry Spitz, Yossi Matias, Greg S. Corrado, Katherine Chou, Dale R. Webster, Peggy Bui, Yuan Liu, Yun Liu, Justin Ko, et al. (1 additional author not shown)

    Abstract: Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generali…

    Submitted 23 February, 2024; originally announced February 2024.

  11. arXiv:2402.15122  [pdf, other]

    hep-ex physics.ins-det

    Measurements of low-energy nuclear recoil quenching factors for Na and I recoils in the NaI(Tl) scintillator

    Authors: S. H. Lee, H. W. Joo, H. J. Kim, K. W. Kim, S. K. Kim, Y. D. Kim, Y. J. Ko, H. S. Lee, J. Y. Lee, H. S. Park, Y. S. Yoon

    Abstract: Elastic scattering off nuclei in target detectors, involving interactions with dark matter and coherent elastic neutrino nuclear recoil (CE$ν$NS), results in the deposition of low energy within the nuclei, dissipating rapidly through a combination of heat and ionization. The primary energy loss mechanism for nuclear recoil is heat, leading to consistently smaller measurable scintillation signals c…

    Submitted 8 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  12. arXiv:2402.14196  [pdf, other]

    cs.CV cs.GR

    Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields

    Authors: Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park

    Abstract: Despite the remarkable achievements of neural radiance fields (NeRF) in representing 3D scenes and generating novel view images, the aliasing issue, rendering "jaggies" or "blurry" images at varying camera distances, remains unresolved in most existing approaches. The recently proposed mip-NeRF has addressed this challenge by rendering conical frustums instead of rays. However, it relies on MLP ar…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to NeurIPS 2023

  13. arXiv:2402.03898  [pdf, other]

    cs.CL cs.AI cs.LG

    DistiLLM: Towards Streamlined Distillation for Large Language Models

    Authors: Jongwoo Ko, Sungnyun Kim, Tianyi Chen, Se-Young Yun

    Abstract: Knowledge distillation (KD) is widely used for compressing a teacher model to a smaller student model, reducing its inference cost and memory footprint while preserving model capabilities. However, current KD methods for auto-regressive sequence models (e.g., large language models) suffer from missing a standardized objective function. Moreover, the recent use of student-generated outputs to addre…

    Submitted 3 July, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML 2024; Code is available at https://github.com/jongwooko/distillm

  14. arXiv:2401.15894  [pdf, other]

    cs.LG cs.AI

    Enhancing Topological Dependencies in Spatio-Temporal Graphs with Cycle Message Passing Blocks

    Authors: Minho Lee, Yun Young Choi, Sun Woo Park, Seunghwan Lee, Joohwan Ko, Jaeyoung Hong

    Abstract: Graph Neural Networks (GNNs) and Transformer-based models have been increasingly adopted to learn the complex vector representations of spatio-temporal graphs, capturing intricate spatio-temporal dependencies crucial for applications such as traffic datasets. Although many existing methods utilize multi-head attention mechanisms and message-passing neural networks (MPNNs) to capture both spatial a…

    Submitted 5 December, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Proceedings of the Third Learning on Graphs Conference (LoG 2024)

  15. νOscillation: a software package for computation and simulation of neutrino propagation and interaction

    Authors: Seonghyeok Jang, Eunju Jeon, Eunil Won, Young Ju Ko, Kyungmin Lee

    Abstract: The behavior of neutrinos is the only phenomenon that cannot be explained by the standard model of particle physics. Because of these mysterious neutrino interactions observed in nature, at present, there is growing interest in this field and ongoing or planned neutrino experiments are seeking solutions to this mystery very actively. The design of neutrino experiments and the analysis of neutrino…

    Submitted 4 October, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 15 pages, 8 figures

    Journal ref: J. Korean Phys. Soc. 85 (2024) 381-388

  16. arXiv:2401.10989  [pdf, other]

    stat.ML cs.LG stat.CO

    Provably Scalable Black-Box Variational Inference with Structured Variational Families

    Authors: Joohwan Ko, Kyurae Kim, Woo Chang Kim, Jacob R. Gardner

    Abstract: Variational families with full-rank covariance approximations are known not to work well in black-box variational inference (BBVI), both empirically and theoretically. In fact, recent computational complexity results for BBVI have established that full-rank variational families scale poorly with the dimensionality of the problem compared to e.g. mean-field families. This is particularly critical t…

    Submitted 30 November, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted to ICML'24; v3: fixed typos

  17. arXiv:2401.09986  [pdf, other]

    cs.LG cs.AI

    Improving Local Training in Federated Learning via Temperature Scaling

    Authors: Kichang Lee, Songkuk Kim, JeongGil Ko

    Abstract: Federated learning is inherently hampered by data heterogeneity: non-i.i.d. training data over local clients. We propose a novel model training approach for federated learning, FLex&Chill, which exploits the Logit Chilling method. Through extensive evaluations, we demonstrate that, in the presence of non-i.i.d. data characteristics inherent in federated learning systems, this approach can expedite…

    Submitted 26 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 24 pages

    MSC Class: 68

    ACM Class: I.2.11

  18. arXiv:2401.09678  [pdf, other]

    cs.SE cs.FL cs.LO eess.SY

    Integrating Graceful Degradation and Recovery through Requirement-driven Adaptation

    Authors: Simon Chu, Justin Koe, David Garlan, Eunsuk Kang

    Abstract: Cyber-physical systems (CPS) are subject to environmental uncertainties such as adverse operating conditions, malicious attacks, and hardware degradation. These uncertainties may lead to failures that put the system in a sub-optimal or unsafe state. Systems that are resilient to such uncertainties rely on two types of operations: (1) graceful degradation, to ensure that the system maintains an acc…

    Submitted 8 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Pre-print for the SEAMS '24 conference (Software Engineering for Adaptive and Self-Managing Systems Conference)

  19. arXiv:2401.07476  [pdf, other]

    nucl-ex hep-ex

    Background study of the AMoRE-pilot experiment

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Yu. M. Gavrilyuk, A. M. Gezhaev, O. Gileva, et al. (83 additional authors not shown)

    Abstract: We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay ($0\nu\beta\beta$) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of CaMoO$_4$ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental conf…

    Submitted 7 April, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  20. arXiv:2401.07462  [pdf, other]

    hep-ex physics.ins-det

    Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments

    Authors: S. M. Lee, G. Adhikari, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. França, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, et al. (37 additional authors not shown)

    Abstract: We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced…

    Submitted 10 May, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures

    Journal ref: Eur. Phys. J. C 84 (2024) 484

  21. Transfer-Learning-Based Autotuning Using Gaussian Copula

    Authors: Thomas Randall, Jaehoon Koo, Brice Videau, Michael Kruse, Xingfu Wu, Paul Hovland, Mary Hall, Rong Ge, Prasanna Balaprakash

    Abstract: As diverse high-performance computing (HPC) systems are built, many opportunities arise for applications to solve larger problems than ever before. Given the significantly increased complexity of these HPC systems and application tuning, empirical performance tuning, such as autotuning, has emerged as a promising approach in recent years. Despite its effectiveness, autotuning is often a computatio…

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 13 pages, 5 figures, 7 tables, the definitive version of this work is published in the Proceedings of the ACM International Conference on Supercomputing 2023, available at https://dl.acm.org/doi/10.1145/3577193.3593712

    ACM Class: I.2.4; G.3; D.2.8

    Journal ref: Proceedings of the 37th International Conference on Supercomputing (2023) 37-49

  22. arXiv:2401.03650  [pdf, other]

    eess.AS cs.SD eess.SP

    DDD: A Perceptually Superior Low-Response-Time DNN-based Declipper

    Authors: Jayeon Yi, Junghyun Koo, Kyogu Lee

    Abstract: Clipping is a common nonlinear distortion that occurs whenever the input or output of an audio system exceeds the supported range. This phenomenon undermines not only the perception of speech quality but also downstream processes utilizing the disrupted signal. Therefore, a real-time-capable, robust, and low-response-time method for speech declipping (SD) is desired. In this work, we introduce DDD…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: To appear, ICASSP 2024. Demo samples at https://stet-stet.github.io/DDD, repo at https://github.com/stet-stet/DDD

  23. Universality on thermodynamic relation with corrections in de Sitter black holes

    Authors: Junbeom Ko, Bogeun Gwak

    Abstract: We herein investigate the universal relation proposed by Goon and Penco in de Sitter black holes with electric charge or angular momentum. Our analysis focuses on the cosmological horizon, which only exists in de Sitter and Nariai spacetimes. Because the relation is given in a general case, the overall relationship may be valid. However, we elucidate the details of the relation, highlighting disti…

    Submitted 14 March, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 19 pages, published in JHEP

  24. arXiv:2312.08847  [pdf, other]

    cs.AI cs.LG cs.NE stat.ML

    Knowledge-Driven Modulation of Neural Networks with Attention Mechanism for Next Activity Prediction

    Authors: Ivan Donadello, Jonghyeon Ko, Fabrizio Maria Maggi, Jan Mendling, Francesco Riva, Matthias Weidlich

    Abstract: Predictive Process Monitoring (PPM) aims at leveraging historic process execution data to predict how ongoing executions will continue up to their completion. In recent years, PPM techniques for the prediction of the next activities have matured significantly, mainly thanks to the use of Neural Networks (NNs) as a predictor. While their performance is difficult to beat in the general case, there a…

    Submitted 14 December, 2023; originally announced December 2023.

    MSC Class: 68T20 (Primary) 68T01; 68T05; 68T37 (Secondary)

    ACM Class: I.2.6; I.2.8; I.2.m

  25. arXiv:2312.07957  [pdf, other]

    physics.ins-det hep-ex

    Scintillation characteristics of an undoped CsI crystal at low-temperature for dark matter search

    Authors: W. K. Kim, H. Y. Lee, K. W. Kim, Y. J. Ko, J. A. Jeon, H. J. Kim, H. S. Lee

    Abstract: The scintillation characteristics of 1 g undoped CsI crystal were studied by directly coupling two silicon photomultipliers (SiPMs) over a temperature range from room temperature to 86 K. The scintillation decay time and light output were measured using x-ray and gamma-ray peaks from a $^{109}$Cd radioactive source. An increase in decay time was observed as the temperature decreased from room temp…

    Submitted 15 July, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  26. arXiv:2311.15569  [pdf, other]

    cs.CV cs.AI

    Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models

    Authors: Yongjin Yang, Jongwoo Ko, Se-Young Yun

    Abstract: Vision-language models (VLMs) like CLIP have demonstrated remarkable applicability across a variety of downstream tasks, including zero-shot image classification. Recently, the use of prompts or adapters for efficient transfer learning (ETL) has gained significant attention for effectively adapting to downstream tasks. However, previous studies have overlooked the challenge of varying transfer dif…

    Submitted 11 October, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: EMNLP 2024; code available at: https://github.com/YangYongJin/APEX

  27. arXiv:2311.14993  [pdf, other]

    cs.CV

    Coordinate-Aware Modulation for Neural Fields

    Authors: Joo Chan Lee, Daniel Rho, Seungtae Nam, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural fields, mapping low-dimensional input coordinates to corresponding signals, have shown promising results in representing various signals. Numerous methodologies have been proposed, and techniques employing MLPs and grid representations have achieved substantial success. MLPs allow compact and high expressibility, yet often suffer from spectral bias and slow convergence speed. On the other h…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Project page: http://maincold2.github.io/cam/

  28. arXiv:2311.13831  [pdf, other]

    cs.CV

    Posterior Distillation Sampling

    Authors: Juil Koo, Chanho Park, Minhyuk Sung

    Abstract: We introduce Posterior Distillation Sampling (PDS), a novel optimization method for parametric image editing based on diffusion models. Existing optimization-based methods, which leverage the powerful 2D prior of diffusion models to handle various parametric images, have mainly focused on generation. Unlike generation, editing requires a balance between conforming to the target attribute and prese…

    Submitted 31 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Project page: https://posterior-distillation-sampling.github.io/

  29. arXiv:2311.13681  [pdf, other]

    cs.CV cs.GR

    Compact 3D Gaussian Representation for Radiance Field

    Authors: Joo Chan Lee, Daniel Rho, Xiangyu Sun, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRFs) have demonstrated remarkable potential in capturing complex 3D scenes with high fidelity. However, one persistent challenge that hinders the widespread adoption of NeRFs is the computational bottleneck due to the volumetric rendering. On the other hand, 3D Gaussian splatting (3DGS) has recently emerged as an alternative representation that leverages a 3D Gaussian-ba…

    Submitted 15 February, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Project page: http://maincold2.github.io/c3dgs/

  30. arXiv:2311.09585  [pdf, other]

    cs.CL

    LifeTox: Unveiling Implicit Toxicity in Life Advice

    Authors: Minbeom Kim, Jahyun Koo, Hwanhee Lee, Joonsuk Park, Hwaran Lee, Kyomin Jung

    Abstract: As large language models become increasingly integrated into daily life, detecting implicit toxicity across diverse contexts is crucial. To this end, we introduce LifeTox, a dataset designed for identifying implicit toxicity within a broad range of advice-seeking scenarios. Unlike existing safety datasets, LifeTox comprises diverse contexts derived from personal experiences through open-ended ques…

    Submitted 18 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, NAACL 2024

  31. arXiv:2311.07837  [pdf, ps, other]

    math.NT

    Gauss's form class groups and Shimura's canonical models

    Authors: Ja Kyung Koo, Dong Hwa Shin, Dong Sung Yoon

    Abstract: Let $N$ be a positive integer and $Γ$ be a subgroup of $\mathrm{SL}_2(\mathbb{Z})$ containing $Γ_1(N)$. Let $K$ be an imaginary quadratic field and $\mathcal{O}$ be an order of discriminant $D_\mathcal{O}$ in $K$. Under some assumptions, we show that $Γ$ induces a form class group of discriminant $D_\mathcal{O}$ (or of order $\mathcal{O}$) and level $N$ if and only if there is a certain canonical…

    Submitted 7 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 18 pages, The title has been changed

    MSC Class: Primary 11R37; Secondary 11E12; 11R65

  32. arXiv:2311.07607  [pdf, other]

    cs.AI cs.LG

    Modeling Choice via Self-Attention

    Authors: Joohwan Ko, Andrew A. Li

    Abstract: Models of choice are a fundamental input to many now-canonical optimization problems in the field of Operations Management, including assortment, inventory, and price optimization. Naturally, accurate estimation of these models from data is a critical step in the application of these optimization problems in practice. Concurrently, recent advancements in deep learning have sparked interest in inte…

    Submitted 8 February, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

  33. arXiv:2311.06320  [pdf, other]

    physics.soc-ph q-bio.PE

    Pursuing equitable access to vaccines for the next epidemic

    Authors: Hsin-Ju Chou, Jing-Yuan Ko, Sung-Po Chao

    Abstract: To mitigate the pandemic stemming from COVID-19, numerous nations have initiated extensive vaccination campaigns for their citizens since late 2020. While affluent countries have predominantly received vaccine allocations, fewer doses have been dispatched to nations with lower average incomes. This unequal distribution not only widens the disparity between wealthy and impoverished regions but also…

    Submitted 28 March, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: v.2: 14 pages, 7 sets of figures (English rephrasing, figures replotted, reference added)

  34. arXiv:2311.05010  [pdf, other]

    astro-ph.IM physics.ins-det

    Alpha backgrounds in NaI(Tl) crystals of COSINE-100

    Authors: G. Adhikari, N. Carlin, D. F. F. S. Cavalcante, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Franca, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, et al. (38 additional authors not shown)

    Abstract: COSINE-100 is a dark matter direct detection experiment with 106 kg NaI(Tl) as the target material. 210Pb and daughter isotopes are a dominant background in the WIMP region of interest and are detected via beta decay and alpha decay. Analysis of the alpha channel complements the background model as observed in the beta/gamma channel. We present the measurement of the quenching factors and Monte Ca…

    Submitted 30 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  35. arXiv:2310.20258  [pdf, other]

    cs.LG

    Advancing Bayesian Optimization via Learning Correlated Latent Space

    Authors: Seunghun Lee, Jaewon Chu, Sihyeon Kim, Juyeon Ko, Hyunwoo J. Kim

    Abstract: Bayesian optimization is a powerful method for optimizing black-box functions with limited function evaluations. Recent works have shown that optimization in a latent space through deep generative models such as variational autoencoders leads to effective and efficient Bayesian optimization for structured or discrete data. However, as the optimization does not take place in the input space, it lea…

    Submitted 19 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

  36. arXiv:2310.17668  [pdf, other]

    cs.LG

    Fine tuning Pre trained Models for Robustness Under Noisy Labels

    Authors: Sumyeong Ahn, Sihyeon Kim, Jongwoo Ko, Se-Young Yun

    Abstract: The presence of noisy labels in a training dataset can significantly impact the performance of machine learning models. To tackle this issue, researchers have explored methods for Learning with Noisy Labels to identify clean samples and reduce the influence of noisy labels. However, constraining the influence of a certain portion of the training dataset can result in a reduction in overall general…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 10 pages (17 pages including supplementary)

    MSC Class: Computer Science; Artificial Intelligence

  37. arXiv:2310.15668  [pdf, other]

    cs.SI cs.DB

    Hypergraph Motifs and Their Extensions Beyond Binary

    Authors: Geon Lee, Seokbum Yoon, Jihoon Ko, Hyunju Kim, Kijung Shin

    Abstract: Hypergraphs naturally represent group interactions, which are omnipresent in many domains: collaborations of researchers, co-purchases of items, and joint interactions of proteins, to name a few. In this work, we propose tools for answering the following questions: (Q1) what are the structural design principles of real-world hypergraphs? (Q2) how can we compare local structures of hypergraphs of d…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Extended version of VLDB 2020 paper arXiv:2003.01853

  38. arXiv:2310.14055  [pdf, other]

    math.PR

    Spectral Phase Transitions in Non-Linear Wigner Spiked Models

    Authors: Alice Guionnet, Justin Ko, Florent Krzakala, Pierre Mergny, Lenka Zdeborová

    Abstract: We study the asymptotic behavior of the spectrum of a random matrix where a non-linearity is applied entry-wise to a Wigner matrix perturbed by a rank-one spike with independent and identically distributed entries. In this setting, we show that when the signal-to-noise ratio scales as $N^{\frac{1}{2} (1-1/k_\star)}$, where $k_\star$ is the first non-zero generalized information coefficient of the f…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 27 pages

    MSC Class: 60B20

  39. arXiv:2310.10054  [pdf, other]

    cs.CL cs.AI cs.LG

    NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models

    Authors: Jongwoo Ko, Seungjoon Park, Yujin Kim, Sumyeong Ahn, Du-Seong Chang, Euijai Ahn, Se-Young Yun

    Abstract: Structured pruning methods have proven effective in reducing the model size and accelerating inference speed in various network architectures such as Transformers. Despite the versatility of encoder-decoder models in numerous NLP tasks, the structured pruning methods on such models are relatively less explored compared to encoder-only models. In this study, we investigate the behavior of the struc…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Findings of the Association for Computational Linguistics: EMNLP 2023

  40. arXiv:2310.07498  [pdf, other]

    astro-ph.GA

    Low-mass Quiescent Galaxies Are Small in Isolated Environments: Environmental Dependence of the Mass-Size Relation of Low-mass Quiescent Galaxies

    Authors: Yongmin Yoon, Jae-Woo Kim, Jongwan Ko

    Abstract: We study the mass-size relation of quiescent galaxies across various environments, with a particular focus on its environmental dependence at the low-mass part of $\log(M_\mathrm{star}/M_{\odot})\lesssim10.0$. Our sample consists of 13,667 quiescent galaxies with $\log(M_\mathrm{star}/M_{\odot})\ge9.4$ and $0.01<z<0.04$ from the Sloan Digital Sky Survey. We find that the mass-size relation of low-…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 20 pages, 14 figures, 1 table, accepted for publication in the ApJ

  41. arXiv:2310.06511  [pdf, other]

    cs.LG

    Self-Supervised Dataset Distillation for Transfer Learning

    Authors: Dong Bok Lee, Seanie Lee, Joonho Ko, Kenji Kawaguchi, Juho Lee, Sung Ju Hwang

    Abstract: Dataset distillation methods have achieved remarkable success in distilling a large dataset into a small set of representative samples. However, they are not designed to produce a distilled dataset that can be effectively used for facilitating self-supervised pre-training. To this end, we propose a novel problem of distilling an unlabeled dataset into a set of small synthetic samples for efficient…

    Submitted 11 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  42. arXiv:2310.05424  [pdf, other]

    cs.CL

    Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding

    Authors: Sangmin Bae, Jongwoo Ko, Hwanjun Song, Se-Young Yun

    Abstract: To tackle the high inference latency exhibited by autoregressive language models, previous studies have proposed an early-exiting framework that allocates adaptive computation paths for each token based on the complexity of generating the subsequent token. However, we observed several shortcomings, including performance degradation caused by a state copying mechanism or numerous exit paths, and se…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (Long)

  43. arXiv:2310.02823  [pdf, other]

    cs.LG stat.ML

    Learning to Scale Logits for Temperature-Conditional GFlowNets

    Authors: Minsu Kim, Joohwan Ko, Taeyoung Yun, Dinghuai Zhang, Ling Pan, Woochang Kim, Jinkyoo Park, Emmanuel Bengio, Yoshua Bengio

    Abstract: GFlowNets are probabilistic models that sequentially generate compositional structures through a stochastic policy. Among GFlowNets, temperature-conditional GFlowNets can introduce temperature-based controllability for exploration and exploitation. We propose \textit{Logit-scaling GFlowNets} (Logit-GFN), a novel architectural design that greatly accelerates the training of temperature-conditional…

    Submitted 2 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICML 2024, 23 pages, 21 figures

  44. arXiv:2310.00109  [pdf, other]

    cs.LG cs.DC cs.DL

    FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things

    Authors: Samiul Alam, Tuo Zhang, Tiantian Feng, Hui Shen, Zhichao Cao, Dong Zhao, JeongGil Ko, Kiran Somasundaram, Shrikanth S. Narayanan, Salman Avestimehr, Mi Zhang

    Abstract: There is a significant relevance of federated learning (FL) in the realm of Artificial Intelligence of Things (AIoT). However, most existing FL works do not use datasets collected from authentic IoT devices and thus do not capture unique modalities and inherent challenges of IoT data. To fill this critical gap, in this work, we introduce FedAIoT, an FL benchmark for AIoT. FedAIoT includes eight da…

    Submitted 21 August, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: Camera-ready version of the Journal of Data-centric Machine Learning Research (DMLR)

  45. arXiv:2309.10310  [pdf, other]

    cs.LG

    TensorCodec: Compact Lossy Compression of Tensors without Strong Data Assumptions

    Authors: Taehyung Kwon, Jihoon Ko, Jinhong Jung, Kijung Shin

    Abstract: Many real-world datasets are represented as tensors, i.e., multi-dimensional arrays of numerical values. Storing them without compression often requires substantial space, which grows exponentially with the order. While many tensor compression algorithms are available, many of them rely on strong data assumptions regarding its order, sparsity, rank, and smoothness. In this work, we propose TENSORC…

    Submitted 20 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted to ICDM 2023 - IEEE International Conference on Data Mining 2023

  46. arXiv:2309.10069  [pdf]

    q-bio.NC cs.AI

    Sex-based Disparities in Brain Aging: A Focus on Parkinson's Disease

    Authors: Iman Beheshti, Samuel Booth, Ji Hyun Ko

    Abstract: PD is linked to faster brain aging. Sex is recognized as an important factor in PD, such that males are twice as likely as females to have the disease and have more severe symptoms and a faster progression rate. Despite previous research, there remains a significant gap in understanding the function of sex in the process of brain aging in PD patients. The T1-weighted MRI-driven brain-predicted age…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 35 pages, 5 figures

  47. arXiv:2309.07471  [pdf, other]

    cs.CV

    EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization

    Authors: Minjung Kim, Junseo Koo, Gunhee Kim

    Abstract: Visual localization is the task of estimating a 6-DoF camera pose of a query image within a provided 3D reference map. Thanks to recent advances in various 3D sensors, 3D point clouds are becoming a more accurate and affordable option for building the reference map, but research to match the points of 3D point clouds with pixels in 2D images for visual localization remains challenging. Existing ap…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV 2023

  48. arXiv:2308.12599  [pdf, other]

    cs.SD cs.LG eess.AS

    Exploiting Time-Frequency Conformers for Music Audio Enhancement

    Authors: Yunkee Chae, Junghyun Koo, Sungho Lee, Kyogu Lee

    Abstract: With the proliferation of video platforms on the internet, recording musical performances by mobile devices has become commonplace. However, these recordings often suffer from degradation such as noise and reverberation, which negatively impact the listening experience. Consequently, the necessity for music audio enhancement (referred to as music enhancement from this point onward), involving the…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM Multimedia 2023

  49. arXiv:2308.11916  [pdf, other]

    cs.CV

    Semantic-Aware Implicit Template Learning via Part Deformation Consistency

    Authors: Sihyeon Kim, Minseok Joo, Jaewon Lee, Juyeon Ko, Juhan Cha, Hyunwoo J. Kim

    Abstract: Learning implicit templates as neural fields has recently shown impressive performance in unsupervised shape correspondence. Despite the success, we observe current approaches, which solely rely on geometric information, often learn suboptimal deformation across generic object shapes, which have high structural variability. In this paper, we highlight the importance of part deformation consistency…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: ICCV camera-ready version

  50. arXiv:2308.11250  [pdf, ps, other]

    math.NT

    Class fields and form class groups for solving certain quadratic Diophantine equations

    Authors: Ho Yun Jung, Ja Kyung Koo, Dong Hwa Shin, Dong Sung Yoon

    Abstract: Let $K$ be an imaginary quadratic field and $\mathcal{O}$ be an order in $K$. We construct class fields associated with form class groups which are isomorphic to certain $\mathcal{O}$-ideal class groups in terms of the theory of canonical models due to Shimura. As its applications, by using such class fields, for a positive integer $n$ we first find primes of the form $x^2+ny^2$ with additional co…

    Submitted 24 February, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 30 pages, The title has been changed

    MSC Class: Primary 11R37; Secondary 11E12; 11R65