Showing 51–100 of 731 results for author: Yun, S

  1. arXiv:2501.02199  [pdf, other]

    math.NA cs.AI

    Can ChatGPT implement finite element models for geotechnical engineering applications?

    Authors: Taegu Kim, Tae Sup Yun, Hyoung Suk Suh

    Abstract: This study assesses the capability of ChatGPT to generate finite element code for geotechnical engineering applications from a set of prompts. We tested three different initial boundary value problems using a hydro-mechanically coupled formulation for unsaturated soils, including the dissipation of excess pore water pressure through fluid mass diffusion in one-dimensional space, time-dependent dif…

    Submitted 4 January, 2025; originally announced January 2025.

  2. arXiv:2501.01550  [pdf]

    physics.optics cond-mat.mes-hall

    Dynamic realization of emergent high-dimensional optical vortices

    Authors: Dongha Kim, Geonhyeong Park, Yun-Seok Choi, Arthur Baucour, Jisung Hwang, Sanghyeok Park, Hee Seong Yun, Jonghwa Shin, Haiwen Wang, Shanhui Fan, Dong Ki Yoon, Min-Kyo Seo

    Abstract: The dimensionality of vortical structures has recently been extended beyond two dimensions, providing higher-order topological characteristics and robustness for high-capacity information processing and turbulence control. The generation of high-dimensional vortical structures has mostly been demonstrated in classical systems through the complex interference of fluidic, acoustic, or electromagneti…

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: 21 pages, 5 figures

  3. arXiv:2412.20303  [pdf]

    physics.optics

    Controllable Thermo-Stimulated Luminescence in Niobate Persistent Phosphor by Constructing the Photovoltaic/Electrolytic Cell for Remote Intelligent Anti-Counterfeiting

    Authors: Yuanyuan Hu, Dangli Gao, Xiangyu Zhang, Sining Yun

    Abstract: Persistent luminescence (PersL) carrying remote key information plays a crucial role for intelligent anti-counterfeiting applications. However, the weak PersL intensity accompanied by uncontrollability limits their practical application. Here we develop LiNbO3 (LNO):Pr,Bi phosphor with enhanced red PersL by trace doping Sm3+. The LNO:Pr,Bi,Sm phosphor exhibits quadruplet luminescence, including po…

    Submitted 28 December, 2024; originally announced December 2024.

  4. arXiv:2412.13262  [pdf]

    physics.app-ph physics.med-ph physics.optics

    Optical Coherence Elastography Measures Mechanical Tension in the Lens and Capsule in situ

    Authors: Xu Feng, Guo-yang Li, Yuxuan Jiang, Owen Shortt-Nguyen, Seok-Hyun Yun

    Abstract: Lens tension is essential for accommodative vision but remains challenging to measure with precision. Here, we present an optical coherence elastography (OCE) technique that quantifies both the tension and elastic modulus of lens tissue and capsule. This method derives mechanical parameters from surface wave dispersion across a critical frequency range of 1-30 kHz. Using isolated lenses from six-m…

    Submitted 17 December, 2024; originally announced December 2024.

  5. arXiv:2412.04077  [pdf, other]

    cs.CV

    SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning

    Authors: Seokju Yun, Seunghye Chae, Dongheon Lee, Youngmin Ro

    Abstract: Domain generalization (DG) aims to adapt a model using one or multiple source domains to ensure robust performance in unseen target domains. Recently, Parameter-Efficient Fine-Tuning (PEFT) of foundation models has shown promising results in the context of the DG problem. Nevertheless, existing PEFT methods still struggle to strike a balance between preserving generalizable components of the pre-train…

    Submitted 21 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: CVPR 2025. Project page: https://ysj9909.github.io/SoRA.github.io/

  6. arXiv:2412.03093  [pdf, other]

    cs.CV

    Expanding Event Modality Applications through a Robust CLIP-Based Encoder

    Authors: Sungheon Jeong, Hanning Chen, Sanggeon Yun, Suhyeon Cho, Wenjun Huang, Xiangjian Liu, Mohsen Imani

    Abstract: This paper introduces a powerful encoder that transfers CLIP's capabilities to event-based data, enhancing its utility and expanding its applicability across diverse domains. While large-scale datasets have significantly advanced image-based models, the scarcity of comprehensive event datasets has limited performance potential in event modality. To address this challenge, we adapt CLIP's architect…

    Submitted 8 May, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

  7. arXiv:2411.19503  [pdf, other]

    physics.chem-ph

    Hierarchical Framework for Retrosynthesis Prediction with Enhanced Reaction Center Localization

    Authors: Seongeun Yun, Won Bo Lee

    Abstract: Retrosynthesis is essential for designing synthetic pathways for complex molecules and can be revolutionized by AI to automate and accelerate chemical synthesis planning for drug discovery and materials science. Here, we propose a hierarchical framework for retrosynthesis prediction that systematically integrates reaction center identification, action prediction, and termination decision into a un…

    Submitted 29 November, 2024; originally announced November 2024.

  8. arXiv:2411.17900  [pdf, other]

    q-fin.CP

    Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading

    Authors: Suyeol Yun

    Abstract: Developing effective quantitative trading strategies using reinforcement learning (RL) is challenging due to the high risks associated with online interaction with live financial markets. Consequently, offline RL, which leverages historical market data without additional exploration, becomes essential. However, existing offline RL methods often struggle to capture the complex temporal dependencies…

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: Accepted for presentation at the ICAIF 2024 Workshop on LLMs and Generative AI for Finance (poster session)

  9. arXiv:2411.17702  [pdf, other]

    eess.SP cs.LG

    Finding "Good Views" of Electrocardiogram Signals for Inferring Abnormalities in Cardiac Condition

    Authors: Hyewon Jeong, Suyeol Yun, Hammaad Adam

    Abstract: Electrocardiograms (ECGs) are an established technique to screen for abnormal cardiac signals. Recent work has established that it is possible to detect arrhythmia directly from the ECG signal using deep learning algorithms. While a few prior approaches with contrastive learning have been successful, the best way to define a positive sample remains an open question. In this project, we investigate…

    Submitted 11 November, 2024; originally announced November 2024.

  10. Noise-Aware Ensemble Learning for Efficient Radar Modulation Recognition

    Authors: Do-Hyun Park, Min-Wook Jeon, Jinwoo Jeong, Isaac Sim, Sangbom Yun, Junghyun Seo, Hyoung-Nam Kim

    Abstract: Electronic warfare support (ES) systems intercept adversary radar signals and estimate various types of signal information, including modulation schemes. The accurate and rapid identification of modulation schemes under conditions of very low signal power remains a significant challenge for ES systems. This paper proposes a recognition model based on a noise-aware ensemble learning (NAEL) framewor…

    Submitted 14 May, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: 13 pages, 11 figures

  11. arXiv:2411.14612  [pdf, other]

    cs.LG cs.AI

    Exploiting Boosting in Hyperdimensional Computing for Enhanced Reliability in Healthcare

    Authors: SungHeon Jeong, Hamza Errahmouni Barkam, Sanggeon Yun, Yeseong Kim, Shaahin Angizi, Mohsen Imani

    Abstract: Hyperdimensional computing (HDC) enables efficient data encoding and processing in high-dimensional space, benefiting machine learning and data analysis. However, underutilization of these spaces can lead to overfitting and reduced model reliability, especially in data-limited systems, a critical issue in sectors like healthcare that demand robustness and consistent performance. We introduce BoostH…

    Submitted 13 January, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

    Comments: Accepted to DATE 2025

  12. arXiv:2411.11712  [pdf]

    physics.optics physics.bio-ph physics.ins-det

    Consensus Statement on Brillouin Light Scattering Microscopy of Biological Materials

    Authors: Pierre Bouvet, Carlo Bevilacqua, Yogeshwari Ambekar, Giuseppe Antonacci, Joshua Au, Silvia Caponi, Sophie Chagnon-Lessard, Juergen Czarske, Thomas Dehoux, Daniele Fioretto, Yujian Fu, Jochen Guck, Thorsten Hamann, Dag Heinemann, Torsten Jähnke, Hubert Jean-Ruel, Irina Kabakova, Kristie Koski, Nektarios Koukourakis, David Krause, Salvatore La Cavera III, Timm Landes, Jinhao Li, Jeremie Margueritat, Maurizio Mattarelli, et al. (19 additional authors not shown)

    Abstract: Brillouin Light Scattering (BLS) spectroscopy is a non-invasive, non-contact, label-free optical technique that can provide information on the mechanical properties of a material on the sub-micron scale. Over the last decade it has seen increased applications in the life sciences, driven by the observed significance of mechanical properties in biological processes, the realization of more sensitiv…

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: Main Text & Supplementary Text: 56 pages, 3 Figures, 2 Supplementary Figures, 1 Supplementary Table

  13. arXiv:2411.09809  [pdf, other]

    cs.DC

    Scalable Readability Evaluation for Graph Layouts: 2D Geometric Distributed Algorithms

    Authors: Sanggeon Yun

    Abstract: Graphs, consisting of vertices and edges, are vital for representing complex relationships in fields like social networks, finance, and blockchain. Visualizing these graphs helps analysts identify structural patterns, with readability metrics, such as node occlusion and edge crossing, assessing layout clarity. However, calculating these metrics is computationally intensive, making scalability a chal…

    Submitted 14 November, 2024; originally announced November 2024.

  14. arXiv:2411.09072  [pdf, other]

    cs.LG

    Continuous GNN-based Anomaly Detection on Edge using Efficient Adaptive Knowledge Graph Learning

    Authors: Sanggeon Yun, Ryozo Masukawa, William Youngwoo Chung, Minhyoung Na, Nathaniel Bastian, Mohsen Imani

    Abstract: The increasing demand for robust security solutions across various industries has made Video Anomaly Detection (VAD) a critical task in applications such as intelligent surveillance, evidence investigation, and violence detection. Traditional approaches to VAD often rely on finetuning large pre-trained models, which can be computationally expensive and impractical for real-time or resource-constra…

    Submitted 13 January, 2025; v1 submitted 13 November, 2024; originally announced November 2024.

    Comments: Accepted to DATE 2025

  15. arXiv:2411.02460  [pdf, other]

    cs.CL cs.AI cs.LG

    Code-Switching Curriculum Learning for Multilingual Transfer in LLMs

    Authors: Haneul Yoo, Cheonbok Park, Sangdoo Yun, Alice Oh, Hwaran Lee

    Abstract: Large language models (LLMs) now exhibit near human-level performance in various tasks, but their performance drops drastically after a handful of high-resource languages due to the imbalance in pre-training data. Inspired by the human process of second language acquisition, particularly code-switching (the practice of language alternation in a conversation), we propose code-switching curriculum l…

    Submitted 4 November, 2024; originally announced November 2024.

  16. arXiv:2411.01179  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models

    Authors: Wonguk Cho, Seokeon Choi, Debasmit Das, Matthias Reisser, Taesup Kim, Sungrack Yun, Fatih Porikli

    Abstract: Recent advancements in text-to-image diffusion models have enabled the personalization of these models to generate custom images from textual prompts. This paper presents an efficient LoRA-based personalization approach for on-device subject-driven generation, where pre-trained diffusion models are fine-tuned with user-specific data on resource-constrained devices. Our method, termed Hollowed Net,…

    Submitted 2 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  17. arXiv:2411.00551  [pdf, other]

    cs.LG cs.AI

    Conditional Synthesis of 3D Molecules with Time Correction Sampler

    Authors: Hojung Jung, Youngrok Park, Laura Schmid, Jaehyeong Jo, Dongkyu Lee, Bongsang Kim, Se-Young Yun, Jinwoo Shin

    Abstract: Diffusion models have demonstrated remarkable success in various domains, including molecular generation. However, conditional molecular generation remains a fundamental challenge due to an intrinsic trade-off between targeting specific chemical properties and generating meaningful samples from the data distribution. In this work, we present Time-Aware Conditional Synthesis (TACS), a novel approac…

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  18. arXiv:2411.00154  [pdf, other]

    cs.CL cs.AI cs.LG

    Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models

    Authors: Haritz Puerto, Martin Gubri, Sangdoo Yun, Seong Joon Oh

    Abstract: Membership inference attacks (MIA) attempt to verify the membership of a given data sample in the training set for a model. MIA has become relevant in recent years, following the rapid development of large language models (LLM). Many are concerned about the usage of copyrighted materials for training them and call for methods for detecting such usage. However, recent research has largely concluded…

    Submitted 3 February, 2025; v1 submitted 31 October, 2024; originally announced November 2024.

    Comments: Findings of NAACL 2025. Our code is available at https://github.com/parameterlab/mia-scaling

  19. arXiv:2410.22623  [pdf, other]

    cs.CV

    PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation

    Authors: Ryozo Masukawa, Sanggeon Yun, Yoshiki Yamaguchi, Mohsen Imani

    Abstract: Video crime detection is a significant application of computer vision and artificial intelligence. However, existing datasets primarily focus on detecting severe crimes by analyzing entire video clips, often neglecting the precursor activities (i.e., privacy violations) that could potentially prevent these crimes. To address this limitation, we present PV-VTT (Privacy Violation Video To Text), a u…

    Submitted 4 December, 2024; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: Accepted to WACV 2025. Dataset available here: https://ryozomasukawa.github.io/PV-VTT.github.io/

  20. arXiv:2410.18857  [pdf, other]

    cs.CV cs.LG

    Probabilistic Language-Image Pre-Training

    Authors: Sanghyuk Chun, Wonjae Kim, Song Park, Sangdoo Yun

    Abstract: Vision-language models (VLMs) embed aligned image-text pairs into a joint space but often rely on deterministic embeddings, assuming a one-to-one correspondence between images and texts. This oversimplifies real-world relationships, which are inherently many-to-many, with multiple captions describing a single image and vice versa. We introduce Probabilistic Language-Image Pre-training (ProLIP), th…

    Submitted 12 March, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: Code: https://github.com/naver-ai/prolip; HuggingFace Hub: https://huggingface.co/collections/SanghyukChun/prolip-6712595dfc87fd8597350291; 33 pages, 4.8 MB; LongProLIP paper: arXiv:2503.08048

  21. arXiv:2410.18652  [pdf, other]

    cs.LG cs.AI cs.CL

    $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation

    Authors: Woosung Koh, Jang Han Yoon, MinHyung Lee, Youngjin Song, Jaegwan Cho, Jaehyun Kang, Taehyeon Kim, Se-Young Yun, Youngjae Yu, Bongshin Lee

    Abstract: Generating high-quality charts with Large Language Models (LLMs) presents significant challenges due to limited data and the high cost of scaling through human curation. $\langle \text{instruction}, \text{data}, \text{code} \rangle$ triplets are scarce and expensive to manually curate as their creation demands technical expertise. To address this scalability challenge, we introduce a reference-fre…

    Submitted 12 February, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: NAACL 2025 Main (Long)

  22. arXiv:2410.15876  [pdf, other]

    cs.LG cs.AI cs.MA

    FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL

    Authors: Woosung Koh, Wonbeen Oh, Siyeol Kim, Suhin Shin, Hyeongjin Kim, Jaein Jang, Junghyun Lee, Se-Young Yun

    Abstract: Multi-agent reinforcement learning has demonstrated significant potential in addressing complex cooperative tasks across various real-world applications. However, existing MARL approaches often rely on the restrictive assumption that the number of entities (e.g., agents, obstacles) remains constant between training and inference. This overlooks scenarios where entities are dynamically removed or a…

    Submitted 3 December, 2024; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: NeurIPS '24 Open-World Agents Workshop

  23. arXiv:2410.13621  [pdf, other]

    cs.CV

    EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything

    Authors: Joonhyeon Song, Seohwan Yun, Seongho Yoon, Joohyeok Kim, Sangmin Lee

    Abstract: This work proposes a novel approach beyond supervised learning for effective pathological image analysis, addressing the challenge of limited robust labeled data. Pathological diagnosis of diseases like cancer has conventionally relied on the evaluation of morphological features by physicians and pathologists. However, recent advancements in computer-aided diagnosis (CAD) systems are gaining signif…

    Submitted 21 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: 10 pages, 7 figures

  24. arXiv:2410.10870  [pdf, other]

    cs.CL cs.AI cs.LG

    PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches

    Authors: Rana Muhammad Shahroz Khan, Pingzhi Li, Sukwon Yun, Zhenyu Wang, Shahriar Nirjon, Chau-Wai Wong, Tianlong Chen

    Abstract: As large language models (LLMs) increasingly shape the AI landscape, fine-tuning pretrained models has become more popular than in the pre-LLM era for achieving optimal performance in domain-specific tasks. However, pretrained LLMs such as ChatGPT are periodically evolved (i.e., model parameters are frequently updated), making it challenging for downstream users with limited resources to keep up w…

    Submitted 28 March, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

  25. arXiv:2410.10166  [pdf, other]

    cs.LG cs.AI

    Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models

    Authors: Yongjin Yang, Sihyeon Kim, Hojung Jung, Sangmin Bae, SangMook Kim, Se-Young Yun, Kimin Lee

    Abstract: Fine-tuning text-to-image diffusion models with human feedback is an effective method for aligning model behavior with human intentions. However, this alignment process often suffers from slow convergence due to the large size and noise present in human feedback datasets. In this work, we propose FiFA, a novel automated data filtering algorithm designed to enhance the fine-tuning of diffusion mode…

    Submitted 2 April, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: ICLR 2025; Project page available at: https://sprain02.github.io/FiFA/

  26. arXiv:2410.08245  [pdf, other]

    cs.LG cs.AI

    Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts

    Authors: Sukwon Yun, Inyoung Choi, Jie Peng, Yangfan Wu, Jingxuan Bao, Qiyiwen Zhang, Jiayi Xin, Qi Long, Tianlong Chen

    Abstract: Multimodal learning has gained increasing importance across various fields, offering the ability to integrate data from diverse sources such as images, text, and personalized records, which are frequently observed in medical domains. However, in scenarios where some modalities are missing, many existing frameworks struggle to accommodate arbitrary modality combinations, often relying heavily on a…

    Submitted 31 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024 Spotlight

  27. arXiv:2410.05628  [pdf, other]

    cs.AI

    A Unified Framework for Motion Reasoning and Generation in Human Interaction

    Authors: Jeongeun Park, Sungjoon Choi, Sangdoo Yun

    Abstract: Recent advancements in large language models (LLMs) have significantly improved their ability to generate natural and contextually relevant text, enabling more human-like AI interactions. However, generating and understanding interactive human-like motion, where multiple individuals engage in coordinated movements, remains challenging due to the complexity of modeling these interactions. Additiona…

    Submitted 12 March, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: https://vim-motion-language.github.io/

  28. arXiv:2410.03782  [pdf, other]

    cs.LG cs.CV

    DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation

    Authors: Changdae Oh, Yixuan Li, Kyungwoo Song, Sangdoo Yun, Dongyoon Han

    Abstract: Adapting a pre-trained foundation model on downstream tasks should ensure robustness against distribution shifts without the need to retrain the whole model. Although existing weight interpolation methods are simple yet effective, we argue their static nature limits downstream performance while achieving efficiency. In this work, we propose DaWin, a training-free dynamic weight interpolation metho…

    Submitted 13 March, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: ICLR 2025 camera-ready

  29. arXiv:2410.02506  [pdf, other]

    cs.MA cs.LG

    Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems

    Authors: Guibin Zhang, Yanwei Yue, Zhixun Li, Sukwon Yun, Guancheng Wan, Kun Wang, Dawei Cheng, Jeffrey Xu Yu, Tianlong Chen

    Abstract: Recent advancements in large language model (LLM)-powered agents have shown that collective intelligence can significantly outperform individual capabilities, largely attributed to the meticulously designed inter-agent communication topologies. Though impressive in performance, existing multi-agent pipelines inherently introduce substantial token overhead, as well as increased economic costs, whic…

    Submitted 3 October, 2024; originally announced October 2024.

  30. arXiv:2409.15889  [pdf, other]

    cs.CV

    CAD: Memory Efficient Convolutional Adapter for Segment Anything

    Authors: Joohyeok Kim, Joonhyeon Song, Seohwan Yun, Seongho Yoon, Sangmin Lee

    Abstract: The foundation model for image segmentation, Segment Anything (SAM), has been actively researched in various fields since its proposal. Various studies have sought to adapt SAM to specific domains, with one notable approach involving the addition and training of lightweight adapter modules. While adapter-based fine-tuning approaches have reported parameter efficiency and significant perf…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 14 pages

  31. arXiv:2409.09882  [pdf, other]

    eess.SY cs.RO

    Safe Control of Quadruped in Varying Dynamics via Safety Index Adaptation

    Authors: Kai S. Yun, Rui Chen, Chase Dunaway, John M. Dolan, Changliu Liu

    Abstract: Varying dynamics pose a fundamental difficulty when deploying safe control laws in the real world. Safety Index Synthesis (SIS) deeply relies on the system dynamics and once the dynamics change, the previously synthesized safety index becomes invalid. In this work, we show the real-time efficacy of Safety Index Adaptation (SIA) in varying dynamics. SIA enables real-time adaptation to the changing…

    Submitted 15 September, 2024; originally announced September 2024.

  32. arXiv:2409.07808  [pdf, other]

    cs.LG

    FedHide: Federated Learning by Hiding in the Neighbors

    Authors: Hyunsin Park, Sungrack Yun

    Abstract: We propose a prototype-based federated learning method designed for embedding networks in classification or verification tasks. Our focus is on scenarios where each client has data from a single class. The main challenge is to develop an embedding network that can distinguish between different classes while adhering to privacy constraints. Sharing true class prototypes with the server or other cli…

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: ECCV 2024

  33. arXiv:2409.07787  [pdf, other]

    cs.CL

    Stable Language Model Pre-training by Reducing Embedding Variability

    Authors: Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun

    Abstract: Stable pre-training is essential for achieving better-performing language models. However, tracking pre-training stability by calculating gradient variance at every step is impractical due to the significant computational costs. We explore Token Embedding Variability (TEV) as a simple and efficient proxy for assessing pre-training stability in language models with pre-layer normalization, given th…

    Submitted 12 September, 2024; originally announced September 2024.

  34. arXiv:2409.01141  [pdf, other]

    cs.AR cs.LG

    Duplex: A Device for Large Language Models with Mixture of Experts, Grouped Query Attention, and Continuous Batching

    Authors: Sungmin Yun, Kwanhee Kyung, Juhwan Cho, Jaewan Choi, Jongmin Kim, Byeongho Kim, Sukhan Lee, Kyomin Sohn, Jung Ho Ahn

    Abstract: Large language models (LLMs) have emerged due to their capability to generate high-quality content across diverse contexts. To reduce their explosively increasing demands for computing resources, a mixture of experts (MoE) has emerged. The MoE layer enables exploiting a huge number of parameters with less computation. Applying state-of-the-art continuous batching increases throughput; however, it…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 15 pages, 16 figures, accepted at MICRO 2024

  35. arXiv:2408.16218  [pdf, other]

    cs.LG stat.ML

    Large-Scale Targeted Cause Discovery with Data-Driven Learning

    Authors: Jang-Hyun Kim, Claudia Skok Gibbs, Sangdoo Yun, Hyun Oh Song, Kyunghyun Cho

    Abstract: We propose a novel machine learning approach for inferring causal variables of a target variable from observations. Our focus is on directly inferring a set of causal factors without requiring full causal graph reconstruction, which is computationally challenging in large-scale systems. The identified causal set consists of all potential regulators of the target variable under experimental setting…

    Submitted 7 April, 2025; v1 submitted 28 August, 2024; originally announced August 2024.

    Comments: v2: add intervention analysis

  36. arXiv:2408.13092  [pdf, other]

    cs.LG

    Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning

    Authors: Jihwan Oh, Sungnyun Kim, Gahee Kim, Sunghwan Kim, Se-Young Yun

    Abstract: Offline multi-agent reinforcement learning (MARL) is increasingly recognized as crucial for effectively deploying RL algorithms in environments where real-time interaction is impractical, risky, or costly. In the offline setting, learning from a static dataset of past interactions allows for the development of robust and safe policies without the need for live data collection, which can be fraught…

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: Accepted by SPIGM Workshop at ICML 2024 (Structured Probabilistic Inference & Generative Modeling)

  37. Inching toward the QCD Axions with Axion Magnetic Resonance in Helioscopes

    Authors: Hyeonseok Seong, Chen Sun, Seokhoon Yun

    Abstract: Utilizing a helical magnet profile to enhance axion-photon conversion showed great promise in laboratory searches for high axion masses. We extend the mechanism, known as the axion-magnetic resonance (AMR), from laser experiments to axion helioscopes and demonstrate its potential in covering QCD axion parameter space. Specifically, we apply AMR to the CAST experiment legacy, make projections for t…

    Submitted 23 March, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: 24 pages, 7 figures, 1 table, v2: references added, experimental relevance discussed, matches published version

    Report number: DESY-24-124, LA-UR-24-28820, CTPU-PTC-24-25

    Journal ref: JHEP 03 (2025) 071

  38. arXiv:2408.11063  [pdf, other]

    cs.CL cs.AI cs.LG

    Tabular Transfer Learning via Prompting LLMs

    Authors: Jaehyun Nam, Woomin Song, Seong Hyeon Park, Jihoon Tack, Sukmin Yun, Jaehyung Kim, Kyu Hwan Oh, Jinwoo Shin

    Abstract: Learning with a limited amount of labeled data is a central problem in real-world applications of machine learning, as it is often expensive to obtain annotations. To deal with the scarcity of labeled data, transfer learning is a conventional approach; it suggests learning transferable knowledge by training a neural network from multiple other sources. In this paper, we investigate transfer lear…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: COLM 2024

  39. arXiv:2408.09674  [pdf, other]

    cs.CV

    Implicit Grid Convolution for Multi-Scale Image Super-Resolution

    Authors: Dongheon Lee, Seokju Yun, Youngmin Ro

    Abstract: For Image Super-Resolution (SR), it is common to train and evaluate scale-specific models composed of an encoder and upsampler for each targeted scale. Consequently, many SR studies encounter substantial training times and complex deployment requirements. In this paper, we address this limitation by training and evaluating multiple scales simultaneously. Notably, we observe that encoder features a…

    Submitted 15 November, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

  40. arXiv:2408.08163  [pdf, ps, other]

    math.AP

    Ill-posedness of the Boltzmann-BGK model in the exponential class

    Authors: Donghyun Lee, Sungbin Park, Seok-Bae Yun

    Abstract: The BGK (Bhatnagar-Gross-Krook) model is a relaxation-type model of the Boltzmann equation, which is popularly used in place of the Boltzmann equation in physics and engineering. In this paper, we address the ill-posedness problem for the BGK model, in which the solution instantly escapes the initial solution space. For this, we propose two ill-posedness scenarios, namely, the homogeneous and the inho…

    Submitted 22 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: 79 pages, 1 figure; remark 1.5 and acknowledgement modified, references added for section 1

    MSC Class: 82C40; 35Q20; 82C40; 35A01; 35A02

  41. arXiv:2408.07327  [pdf, other]

    cs.LG cs.AI

    An Offline Meta Black-box Optimization Framework for Adaptive Design of Urban Traffic Light Management Systems

    Authors: Taeyoung Yun, Kanghoon Lee, Sujin Yun, Ilmyung Kim, Won-Woo Jung, Min-Cheol Kwon, Kyujin Choi, Yoohyeon Lee, Jinkyoo Park

    Abstract: Complex urban road networks with high vehicle occupancy frequently face severe traffic congestion. Designing an effective strategy for managing multiple traffic lights plays a crucial role in managing congestion. However, most current traffic light management systems rely on human-crafted decisions, which may not adapt well to diverse traffic patterns. In this paper, we delve into two pivotal desi…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 12 pages, 7 figures, 10 tables

  42. arXiv:2408.05337  [pdf, other]

    cs.CV cs.AI

    VACoDe: Visual Augmented Contrastive Decoding

    Authors: Sihyeon Kim, Boryeong Cho, Sangmin Bae, Sumyeong Ahn, Se-Young Yun

    Abstract: Despite the astonishing performance of recent Large Vision-Language Models (LVLMs), these models often generate inaccurate responses. To address this issue, previous studies have focused on mitigating hallucinations by employing contrastive decoding (CD) with augmented images, which amplifies the contrast with the original image. However, these methods have limitations, including reliance on a sin…

    Submitted 26 July, 2024; originally announced August 2024.

    Comments: 10 pages, 7 figures

    MSC Class: 68T01 ACM Class: I.2.0

  43. arXiv:2407.21035  [pdf, other]

    cs.CV

    Direct Unlearning Optimization for Robust and Safe Text-to-Image Models

    Authors: Yong-Hyun Park, Sangdoo Yun, Jin-Hwa Kim, Junho Kim, Geonhui Jang, Yonghyun Jeong, Junghyo Jo, Gayoung Lee

    Abstract: Recent advancements in text-to-image (T2I) models have unlocked a wide range of applications but also present significant risks, particularly in their potential to generate unsafe content. To mitigate this issue, researchers have developed unlearning techniques to remove the model's ability to generate potentially harmful content. However, these methods are easily bypassed by adversarial attacks,…

    Submitted 16 January, 2025; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted for NeurIPS 2024

  44. arXiv:2407.17857  [pdf, other]

    cs.CV cs.AI

    Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network

    Authors: Sukwon Yun, Jie Peng, Alexandro E. Trevino, Chanyoung Park, Tianlong Chen

    Abstract: Recent advancements in graph-based approaches for multiplexed immunofluorescence (mIF) images have significantly propelled the field forward, offering deeper insights into patient-level phenotyping. However, current graph-based methodologies encounter two primary challenges: (1) Cellular Heterogeneity, where existing approaches fail to adequately address the inductive biases inherent in graphs, pa…

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  45. arXiv:2407.13977  [pdf, other]

    stat.ML cs.LG

    A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits

    Authors: Junghyun Lee, Se-Young Yun, Kwang-Sung Jun

    Abstract: We present a unified likelihood ratio-based confidence sequence (CS) for any (self-concordant) generalized linear model (GLM) that is guaranteed to be convex and numerically tight. We show that this is on par or improves upon known CSs for various GLMs, including Gaussian, Bernoulli, and Poisson. In particular, for the first time, our CS for Bernoulli has a $\mathrm{poly}(S)$-free radius where…

    Submitted 15 January, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 39 pages, 2 figures, 2 tables; Accepted to the 38th Conference on Neural Information Processing Systems (NeurIPS 2024) (ver3: minor revisions, code refactoring; ver2: major revision, including new experiments, reorganization, fixing typos in the proofs of ver1, etc)

  46. arXiv:2407.13078  [pdf, other]

    cs.CV cs.AI

    Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism

    Authors: Sangyoun Lee, Juho Jung, Changdae Oh, Sunghee Yun

    Abstract: Temporal Action Localization (TAL) is a critical task in video analysis, identifying precise start and end times of actions. Existing methods like CNNs, RNNs, GCNs, and Transformers have limitations in capturing long-range dependencies and temporal causality. To address these challenges, we propose a novel TAL architecture leveraging the Selective State Space Model (S6). Our approach integrates th…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures, Preprint

  47. arXiv:2407.08245  [pdf, other]

    cs.LG cs.CV

    Feature Diversification and Adaptation for Federated Domain Generalization

    Authors: Seunghan Yang, Seokeon Choi, Hyunsin Park, Sungha Choi, Simyung Chang, Sungrack Yun

    Abstract: Federated learning, a distributed learning paradigm, utilizes multiple clients to build a robust global model. In real-world applications, local clients often operate within their limited domains, leading to a 'domain shift' across clients. Privacy concerns limit each client's learning to its own domain data, which increases the risk of overfitting. Moreover, the process of aggregating models train…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  48. arXiv:2407.06123  [pdf, other]

    cs.HC

    Investigating User Perceptions of Collaborative Agenda Setting in Virtual Health Counseling Session

    Authors: Mina Fallah, Farnaz Nouraei, Hye Sun Yun, Timothy Bickmore

    Abstract: Virtual health counselors offer the potential to provide users with information and counseling in complex areas such as disease management and health education. However, ensuring user engagement is challenging, particularly when the volume of information and length of counseling sessions increase. Agenda setting, a clinical counseling technique where a patient and clinician collaboratively decide o…

    Submitted 8 July, 2024; originally announced July 2024.

  49. arXiv:2407.03563  [pdf, other]

    eess.AS cs.CL cs.LG eess.IV

    Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition

    Authors: Sungnyun Kim, Kangwook Jang, Sangmin Bae, Hoirin Kim, Se-Young Yun

    Abstract: Audio-visual speech recognition (AVSR) aims to transcribe human speech using both audio and video modalities. In practical environments with noise-corrupted audio, the role of video information becomes crucial. However, prior works have primarily focused on enhancing audio features in AVSR, overlooking the importance of video features. In this study, we strengthen the video features by learning th…

    Submitted 14 October, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted at SLT 2024 Main Conference; Code is available at https://github.com/sungnyun/avsr-temporal-dynamics

  50. arXiv:2407.01639  [pdf, other]

    cs.LG cs.SE

    ModelVerification.jl: a Comprehensive Toolbox for Formally Verifying Deep Neural Networks

    Authors: Tianhao Wei, Luca Marzari, Kai S. Yun, Hanjiang Hu, Peizhi Niu, Xusheng Luo, Changliu Liu

    Abstract: Deep Neural Networks (DNN) are crucial in approximating nonlinear functions across diverse applications, ranging from image classification to control. Verifying specific input-output properties can be a highly challenging task due to the lack of a single, self-contained framework that allows a complete range of verification types. To this end, we present ModelVerification.jl (MV), the fir…

    Submitted 30 June, 2024; originally announced July 2024.