Skip to main content

Showing 1–50 of 76 results for author: Yoon, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03014  [pdf, ps, other

    cs.CR cs.CL cs.LG

    Intrinsic Fingerprint of LLMs: Continue Training is NOT All You Need to Steal A Model!

    Authors: Do-hyeon Yoon, Minsoo Chun, Thomas Allen, Hans Müller, Min Wang, Rajesh Sharma

    Abstract: Large language models (LLMs) face significant copyright and intellectual property challenges as the cost of training increases and model reuse becomes prevalent. While watermarking techniques have been proposed to protect model ownership, they may not be robust to continue training and development, posing serious threats to model attribution and copyright protection. This work introduces a simple… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: This paper flags a potential case of model plagiarism, copyright violation, and information fabrication in arXiv:2505.21411

  2. arXiv:2506.20112  [pdf

    cs.CL

    A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection

    Authors: Songsoo Kim, Seungtae Lee, See Young Lee, Joonho Kim, Keechan Kan, Dukyong Yoon

    Abstract: Background: The positive predictive value (PPV) of large language model (LLM)-based proofreading for radiology reports is limited due to the low error prevalence. Purpose: To assess whether a three-pass LLM framework enhances PPV and reduces operational costs compared with baseline approaches. Materials and Methods: A retrospective analysis was performed on 1,000 consecutive radiology reports (250… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 29 pages, 5 figures, 4 tables. Code available at https://github.com/radssk/mp-rred

    ACM Class: I.2.7

  3. arXiv:2506.11417  [pdf, ps, other

    cs.CV cs.AI

    Stop learning it all to mitigate visual hallucination, Focus on the hallucination target

    Authors: Dokyoon Yoon, Youngsook Song, Woomyong Park

    Abstract: Multimodal Large Language Models (MLLMs) frequently suffer from hallucination issues, generating information about objects that are not present in input images during vision-language tasks. These hallucinations particularly undermine model reliability in practical applications requiring accurate object identification. To address this challenge, we propose \mymethod,\ a preference learning approach… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Accepted to CVPR 2025

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

  4. arXiv:2506.08956  [pdf, ps, other

    cs.CV cs.LG

    Data Augmentation For Small Object using Fast AutoAugment

    Authors: DaeEun Yoon, Semin Kim, SangWook Yoo, Jongha Lee

    Abstract: In recent years, there has been tremendous progress in object detection performance. However, despite these advances, the detection performance for small objects is significantly inferior to that of large objects. Detecting small objects is one of the most challenging and important problems in computer vision. To improve the detection performance for small objects, we propose an optimal data augme… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Accepted and published in the USB Proceedings of the 20th International Conference on Modeling Decisions for Artificial Intelligence (MDAI 2023), Umeå, Sweden, June 19--22, 2023, ISBN 978-91-527-7293-5, pp.\ 12--21

  5. arXiv:2506.08423  [pdf

    cond-mat.mtrl-sci cs.LG physics.ins-det

    Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy

    Authors: Utkarsh Pratiush, Austin Houston, Kamyar Barakati, Aditya Raghavan, Dasol Yoon, Harikrishnan KP, Zhaslan Baraissov, Desheng Ma, Samuel S. Welborn, Mikolaj Jakowski, Shawn-Patrick Barhorst, Alexander J. Pattison, Panayotis Manganaris, Sita Sirisha Madugula, Sai Venkata Gayathri Ayyagari, Vishal Kennedy, Ralph Bulanadi, Michelle Wang, Kieran J. Pang, Ian Addison-Smith, Willy Menacho, Horacio V. Guzman, Alexander Kiefer, Nicholas Furth, Nikola L. Kolev , et al. (48 additional authors not shown)

    Abstract: Microscopy is a primary source of information on materials structure and functionality at nanometer and atomic scales. The data generated is often well-structured, enriched with metadata and sample histories, though not always consistent in detail or format. The adoption of Data Management Plans (DMPs) by major funding agencies promotes preservation and access. However, deriving insights remains d… ▽ More

    Submitted 27 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  6. arXiv:2505.24195  [pdf, ps, other

    cs.HC cs.CL

    WikiGap: Promoting Epistemic Equity by Surfacing Knowledge Gaps Between English Wikipedia and other Language Editions

    Authors: Zining Wang, Yuxuan Zhang, Dongwook Yoon, Nicholas Vincent, Farhan Samir, Vered Shwartz

    Abstract: With more than 11 times as many pageviews as the next, English Wikipedia dominates global knowledge access relative to other language editions. Readers are prone to assuming English Wikipedia as a superset of all language editions, leading many to prefer it even when their primary language is not English. Other language editions, however, comprise complementary facts rooted in their respective cul… ▽ More

    Submitted 4 June, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

  7. arXiv:2505.14489  [pdf, ps, other

    cs.AI cs.CL

    Reasoning Models Better Express Their Confidence

    Authors: Dongkeun Yoon, Seungone Kim, Sohee Yang, Sunkyoung Kim, Soyeon Kim, Yongil Kim, Eunbi Choi, Yireun Kim, Minjoon Seo

    Abstract: Despite their strengths, large language models (LLMs) often fail to communicate their confidence accurately, making it difficult to assess when they might be wrong and limiting their reliability. In this work, we demonstrate that reasoning models-LLMs that engage in extended chain-of-thought (CoT) reasoning-exhibit superior performance not only in problem-solving but also in accurately expressing… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Work in progress

  8. arXiv:2505.11709  [pdf, ps, other

    cs.CV cs.LG cs.RO

    EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

    Authors: Ryan Hoque, Peide Huang, David J. Yoon, Mouli Sivapurapu, Jian Zhang

    Abstract: Imitation learning for manipulation has a well-known data scarcity problem. Unlike natural language and 2D computer vision, there is no Internet-scale corpus of data for dexterous manipulation. One appealing option is egocentric human video, a passively scalable data source. However, existing large-scale datasets such as Ego4D do not have native hand pose annotations and do not focus on object man… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  9. arXiv:2504.19634  [pdf, ps, other

    cs.CV

    NSegment : Label-specific Deformations for Remote Sensing Image Segmentation

    Authors: Yechan Kim, DongHo Yoon, SooYeon Kim, Moongu Jeon

    Abstract: Labeling errors in remote sensing (RS) image segmentation datasets often remain implicit and subtle due to ambiguous class boundaries, mixed pixels, shadows, complex terrain features, and subjective annotator bias. Furthermore, the scarcity of annotated RS data due to high image acquisition and labeling costs complicates training noise-robust models. While sophisticated mechanisms such as label se… ▽ More

    Submitted 27 June, 2025; v1 submitted 28 April, 2025; originally announced April 2025.

    Comments: Preprint

  10. arXiv:2504.16112  [pdf, other

    cs.AR cs.AI cs.CL cs.DC

    HPU: High-Bandwidth Processing Unit for Scalable, Cost-effective LLM Inference via GPU Co-processing

    Authors: Myunghyun Rhee, Joonseop Sim, Taeyoung Ahn, Seungyong Lee, Daegun Yoon, Euiseok Kim, Kyoung Park, Youngpyo Joo, Hosik Kim

    Abstract: The attention layer, a core component of Transformer-based LLMs, brings out inefficiencies in current GPU systems due to its low operational intensity and the substantial memory requirements of KV caches. We propose a High-bandwidth Processing Unit (HPU), a memoryintensive co-processor that enhances GPU resource utilization during large-batched LLM inference. By offloading memory-bound operations,… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 6 pages

  11. arXiv:2504.04953  [pdf, other

    cs.CL cs.AI

    M-Prometheus: A Suite of Open Multilingual LLM Judges

    Authors: José Pombal, Dongkeun Yoon, Patrick Fernandes, Ian Wu, Seungone Kim, Ricardo Rei, Graham Neubig, André F. T. Martins

    Abstract: The use of language models for automatically evaluating long-form text (LLM-as-a-judge) is becoming increasingly common, yet most LLM judges are optimized exclusively for English, with strategies for enhancing their multilingual evaluation capabilities remaining largely unexplored in the current literature. This has created a disparity in the quality of automatic evaluation methods for non-English… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  12. arXiv:2503.13441  [pdf, other

    cs.RO cs.AI cs.CV

    Humanoid Policy ~ Human Policy

    Authors: Ri-Zhao Qiu, Shiqi Yang, Xuxin Cheng, Chaitanya Chawla, Jialong Li, Tairan He, Ge Yan, David J. Yoon, Ryan Hoque, Lars Paulsen, Ge Yang, Jian Zhang, Sha Yi, Guanya Shi, Xiaolong Wang

    Abstract: Training manipulation policies for humanoid robots with diverse data enhances their robustness and generalization across tasks and platforms. However, learning solely from robot demonstrations is labor-intensive, requiring expensive tele-operated data collection which is difficult to scale. This paper investigates a more scalable data source, egocentric human demonstrations, to serve as cross-embo… ▽ More

    Submitted 24 March, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: Code and data: https://human-as-robot.github.io/

  13. arXiv:2503.02107  [pdf, other

    cs.RO

    Balancing Act: Trading Off Doppler Odometry and Map Registration for Efficient Lidar Localization

    Authors: Katya M. Papais, Daniil Lisus, David J. Yoon, Andrew Lambert, Keith Y. K. Leung, Timothy D. Barfoot

    Abstract: Most autonomous vehicles rely on accurate and efficient localization, which is achieved by comparing live sensor data to a preexisting map, to navigate their environment. Balancing the accuracy of localization with computational efficiency remains a significant challenge, as high-accuracy methods often come with higher computational costs. In this paper, we present two ways of improving lidar loca… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 8 pages, 3 figures, 2 tables, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2025

  14. arXiv:2503.00825  [pdf, other

    cs.HC

    Who Reaps All the Superchats? A Large-Scale Analysis of Income Inequality in Virtual YouTuber Livestreaming

    Authors: Ruijing Zhao, Brian Diep, Jiaxin Pei, Dongwook Yoon, David Jurgens, Jian Zhu

    Abstract: The explosive growth of Virtual YouTubers (VTubers)-streamers who perform behind virtual anime avatars-has created a unique digital economy with profound implications for content creators, platforms, and viewers. Understanding the economic landscape of VTubers is crucial for designing equitable platforms, supporting content creator livelihoods, and fostering sustainable digital communities. To thi… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: This paper has been conditionally accepted to ACM CHI 2025

  15. arXiv:2503.00790  [pdf

    cs.SD cs.ET eess.AS

    Acoustic Anomaly Detection on UAM Propeller Defect with Acoustic dataset for Crack of drone Propeller (ADCP)

    Authors: Juho Lee, Donghyun Yoon, Gumoon Jeong, Hyeoncheol Kim

    Abstract: The imminent commercialization of UAM requires stable, AI-based maintenance systems to ensure safety for both passengers and pedestrians. This paper presents a methodology for non-destructively detecting cracks in UAM propellers using drone propeller sound datasets. Normal operating sounds were recorded, and abnormal sounds (categorized as ripped and broken) were differentiated by varying the micr… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: 25 pages

  16. arXiv:2502.00399  [pdf, other

    cs.CY

    Integrating Urban Air Mobility with Highway Infrastructure: A Strategic Approach for Vertiport Location Selection in the Seoul Metropolitan Area

    Authors: Donghyun Yoon, Minwoo Jeong, Jinyong Lee, Seyun Kim, Yoonjin Yoon

    Abstract: This study focuses on identifying suitable locations for highway-transfer Vertiports to integrate Urban Air Mobility (UAM) with existing highway infrastructure. UAM offers an effective solution for enhancing transportation accessibility in the Seoul Metropolitan Area, where conventional transportation often struggle to connect suburban employment zones such as industrial parks. By integrating UAM… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: 24 pages

    Journal ref: 104th Transportation Research Board Annual Meeting (2025)

  17. arXiv:2412.03736  [pdf, other

    cs.CL

    Domain-specific Question Answering with Hybrid Search

    Authors: Dewang Sultania, Zhaoyu Lu, Twisha Naik, Franck Dernoncourt, David Seunghyun Yoon, Sanat Sharma, Trung Bui, Ashok Gupta, Tushar Vatsa, Suhas Suresha, Ishita Verma, Vibha Belavadi, Cheng Chen, Michael Friedrich

    Abstract: Domain specific question answering is an evolving field that requires specialized solutions to address unique challenges. In this paper, we show that a hybrid approach combining a fine-tuned dense retriever with keyword based sparse search methods significantly enhances performance. Our system leverages a linear combination of relevance signals, including cosine similarity from dense retrieval, BM… ▽ More

    Submitted 21 December, 2024; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: AAAI-25 Workshop on Document Understanding and Intelligence

  18. arXiv:2412.01340  [pdf, other

    cs.CL

    A 2-step Framework for Automated Literary Translation Evaluation: Its Promises and Pitfalls

    Authors: Sheikh Shafayat, Dongkeun Yoon, Woori Jang, Jiwoo Choi, Alice Oh, Seohyon Jung

    Abstract: In this work, we propose and evaluate the feasibility of a two-stage pipeline to evaluate literary machine translation, in a fine-grained manner, from English to Korean. The results show that our framework provides fine-grained, interpretable metrics suited for literary translation and obtains a higher correlation with human judgment than traditional machine translation metrics. Nonetheless, it st… ▽ More

    Submitted 1 January, 2025; v1 submitted 2 December, 2024; originally announced December 2024.

  19. arXiv:2411.19121  [pdf, ps, other

    cs.CV cs.AI

    MSG score: A Comprehensive Evaluation for Multi-Scene Video Generation

    Authors: Daewon Yoon, Hyungsuk Lee, Wonsik Shin

    Abstract: This paper addresses the metrics required for generating multi-scene videos based on a continuous scenario, as opposed to traditional short video generation. Scenario-based videos require a comprehensive evaluation that considers multiple factors such as character consistency, artistic coherence, aesthetic quality, and the alignment of the generated content with the intended prompt. Additionally,… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

  20. arXiv:2411.00003  [pdf, other

    cs.AI cs.LG math.OC

    Unsupervised Training of Diffusion Models for Feasible Solution Generation in Neural Combinatorial Optimization

    Authors: Seong-Hyun Hong, Hyun-Sung Kim, Zian Jang, Deunsol Yoon, Hyungseok Song, Byung-Jun Lee

    Abstract: Recent advancements in neural combinatorial optimization (NCO) methods have shown promising results in generating near-optimal solutions without the need for expert-crafted heuristics. However, high performance of these approaches often rely on problem-specific human-expertise-based search after generating candidate solutions, limiting their applicability to commonly solved CO problems such as Tra… ▽ More

    Submitted 12 February, 2025; v1 submitted 15 October, 2024; originally announced November 2024.

  21. arXiv:2410.17578  [pdf, other

    cs.CL

    MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

    Authors: Guijin Son, Dongkeun Yoon, Juyoung Suk, Javier Aula-Blasco, Mano Aslan, Vu Trong Kim, Shayekh Bin Islam, Jaume Prats-Cristià, Lucía Tormo-Bañuelos, Seungone Kim

    Abstract: As Large Language Models (LLMs) are now capable of producing fluent and coherent content in languages other than English, it is not imperative to precisely evaluate these non-English outputs. However, when assessing the outputs from mutlilingual LLMs, prior works often employed LLM based evaluators that excel at assessing English outputs, without a thorough examination of whether these evaluators… ▽ More

    Submitted 29 March, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: work in progress

  22. arXiv:2409.19989  [pdf, other

    cs.CV cs.GR

    RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models

    Authors: Jangyeong Kim, Donggoo Kang, Junyoung Choi, Jeonga Wi, Junho Gwon, Jiun Bae, Dumim Yoon, Junghyun Han

    Abstract: Text-to-texture generation has recently attracted increasing attention, but existing methods often suffer from the problems of view inconsistencies, apparent seams, and misalignment between textures and the underlying mesh. In this paper, we propose a robust text-to-texture method for generating consistent and seamless textures that are well aligned with the mesh. Our method leverages state-of-the… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: 11 pages, 13 figures

  23. arXiv:2406.05761  [pdf, other

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang , et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 25 March, 2025; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: NAACL 2025 (Main Conference)

  24. arXiv:2404.14760  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Retrieval Augmented Generation for Domain-specific Question Answering

    Authors: Sanat Sharma, David Seunghyun Yoon, Franck Dernoncourt, Dewang Sultania, Karishma Bagga, Mengjiao Zhang, Trung Bui, Varun Kotte

    Abstract: Question answering (QA) has become an important application in the advanced development of large language models. General pre-trained large language models for question-answering are not trained to properly understand the knowledge or terminology for a specific domain, such as finance, healthcare, education, and customer service for a product. To better cater to domain-specific understanding, we b… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: AAAI 2024 (Association for the Advancement of Artificial Intelligence) Scientific Document Understanding Workshop

  25. arXiv:2404.01537  [pdf, other

    cs.RO

    Are Doppler Velocity Measurements Useful for Spinning Radar Odometry?

    Authors: Daniil Lisus, Keenan Burnett, David J. Yoon, Richard Poulton, John Marshall, Timothy D. Barfoot

    Abstract: Spinning, frequency-modulated continuous-wave (FMCW) radars with 360 degree coverage have been gaining popularity for autonomous-vehicle navigation. However, unlike `fixed' automotive radar, commercially available spinning radar systems typically do not produce radial velocities due to the lack of repeated measurements in the same direction and the fundamental hardware setup. To make these radial… ▽ More

    Submitted 5 December, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 8 pages, 7 figures, 2 tables, accepted to Robotics and Automation Letters (RA-L)

    Journal ref: IEEE Robotics and Automation Letters, vol. 10, no. 1, pp. 224-231, Jan. 2025

  26. arXiv:2402.13781  [pdf, other

    cs.LG cs.DC

    Preserving Near-Optimal Gradient Sparsification Cost for Scalable Distributed Deep Learning

    Authors: Daegun Yoon, Sangyoon Oh

    Abstract: Communication overhead is a major obstacle to scaling distributed training systems. Gradient sparsification is a potential optimization approach to reduce the communication volume without significant loss of model fidelity. However, existing gradient sparsification methods have low scalability owing to inefficient design of their algorithms, which raises the communication overhead significantly. I… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 24th IEEE/ACM International Symposium on Cluster, Cloud, and Internet Computing (CCGrid 2024). Code: https://github.com/kljp/exdyna

  27. arXiv:2401.10695  [pdf, other

    cs.CL

    LangBridge: Multilingual Reasoning Without Multilingual Supervision

    Authors: Dongkeun Yoon, Joel Jang, Sungdong Kim, Seungone Kim, Sheikh Shafayat, Minjoon Seo

    Abstract: We introduce LangBridge, a zero-shot approach to adapt language models for multilingual reasoning tasks without multilingual supervision. LangBridge operates by bridging two models, each specialized in different aspects: (1) one specialized in understanding multiple languages (e.g., mT5 encoder) and (2) one specialized in reasoning (e.g., MetaMath). LangBridge connects the two models by introducin… ▽ More

    Submitted 3 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: ACL 2024 Main

  28. arXiv:2312.02819  [pdf, other

    cs.CV

    Deterministic Guidance Diffusion Model for Probabilistic Weather Forecasting

    Authors: Donggeun Yoon, Minseok Seo, Doyi Kim, Yeji Choi, Donghyeon Cho

    Abstract: Weather forecasting requires not only accuracy but also the ability to perform probabilistic prediction. However, deterministic weather forecasting methods do not support probabilistic predictions, and conversely, probabilistic models tend to be less accurate. To address these challenges, in this paper, we introduce the \textbf{\textit{D}}eterministic \textbf{\textit{G}}uidance \textbf{\textit{D}}… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 16 pages

  29. arXiv:2310.00967  [pdf, other

    cs.LG cs.DC

    MiCRO: Near-Zero Cost Gradient Sparsification for Scaling and Accelerating Distributed DNN Training

    Authors: Daegun Yoon, Sangyoon Oh

    Abstract: Gradient sparsification is a communication optimisation technique for scaling and accelerating distributed deep neural network (DNN) training. It reduces the increasing communication traffic for gradient aggregation. However, existing sparsifiers have poor scalability because of the high computational cost of gradient selection and/or increase in communication traffic. In particular, an increase i… ▽ More

    Submitted 20 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 30th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2023). Code: https://github.com/kljp/micro

  30. arXiv:2309.08872  [pdf, other

    cs.CL cs.AI cs.LG

    PDFTriage: Question Answering over Long, Structured Documents

    Authors: Jon Saad-Falcon, Joe Barrow, Alexa Siu, Ani Nenkova, David Seunghyun Yoon, Ryan A. Rossi, Franck Dernoncourt

    Abstract: Large Language Models (LLMs) have issues with document question answering (QA) in situations where the document is unable to fit in the small context length of an LLM. To overcome this issue, most existing works focus on retrieving the relevant context from the document, representing them as plain text. However, documents such as PDFs, web pages, and presentations are naturally structured with dif… ▽ More

    Submitted 8 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

  31. DEFT: Exploiting Gradient Norm Difference between Model Layers for Scalable Gradient Sparsification

    Authors: Daegun Yoon, Sangyoon Oh

    Abstract: Gradient sparsification is a widely adopted solution for reducing the excessive communication traffic in distributed deep learning. However, most existing gradient sparsifiers have relatively poor scalability because of considerable computational cost of gradient selection and/or increased communication traffic owing to gradient build-up. To address these challenges, we propose a novel gradient sp… ▽ More

    Submitted 13 July, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: International Conference on Parallel Processing (ICPP) 2023. Code: https://github.com/kljp/deft

  32. arXiv:2306.07052  [pdf, other

    cs.CL cs.AI

    Gradient Ascent Post-training Enhances Language Model Generalization

    Authors: Dongkeun Yoon, Joel Jang, Sungdong Kim, Minjoon Seo

    Abstract: In this work, we empirically show that updating pretrained LMs (350M, 1.3B, 2.7B) with just a few steps of Gradient Ascent Post-training (GAP) on random, unlabeled text corpora enhances its zero-shot generalization capabilities across diverse NLP tasks. Specifically, we show that GAP can allow LMs to become comparable to 2-3x times larger LMs across 12 different NLP tasks. We also show that applyi… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Main Conference (Short Paper)

  33. arXiv:2305.09248  [pdf, other

    cs.CG

    Maximum-Width Rainbow-Bisecting Empty Annulus

    Authors: Sang Won Bae, Sandip Banerjee, Arpita Baral, Priya Ranjan Sinha Mahapatra, Sang Duk Yoon

    Abstract: Given a set of $n$ colored points with $k$ colors in the plane, we study the problem of computing a maximum-width rainbow-bisecting empty annulus (of objects specifically axis-parallel square, axis-parallel rectangle and circle) problem. We call a region rainbow if it contains at least one point of each color. The maximum-width rainbow-bisecting empty annulus problem asks to find an annulus $A$ of… ▽ More

    Submitted 26 March, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: A preliminary version is accepted in EuroCG 2021 and the expanded version is accepted in the journal Computational Geometry: Theory and Applications

  34. arXiv:2304.13215  [pdf, other

    cs.AR

    PROBE3.0: A Systematic Framework for Design-Technology Pathfinding with Improved Design Enablement

    Authors: Suhyeong Choi, Jinwook Jung, Andrew B. Kahng, Minsoo Kim, Chul-Hong Park, Bodhisatta Pramanik, Dooseok Yoon

    Abstract: We propose a systematic framework to conduct design-technology pathfinding for PPAC in advanced nodes. Our goal is to provide configurable, scalable generation of process design kit (PDK) and standard-cell library, spanning key scaling boosters (backside PDN and buried power rail), to explore PPAC across given technology and design parameters. We build on PROBE2.0, which addressed only area and co… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 14 pages, 17 figures, submitted to IEEE Trans. on CAD

  35. arXiv:2304.10805  [pdf, ps, other

    cs.AI cs.LG

    RPLKG: Robust Prompt Learning with Knowledge Graph

    Authors: YongTaek Lim, Yewon Kim, Suho Kang, Dokyung Yoon, KyungWoo Song

    Abstract: Large-scale pre-trained models surpass in transferability and robust generalization across diverse datasets. The emergence of multimodal pre-trained models like CLIP has significantly boosted performance in various experiments. However, generalizing to new datasets or domains remains challenging, especially with limited labeled data. Also, existing methods often lack interpretability and impose hi… ▽ More

    Submitted 21 June, 2025; v1 submitted 21 April, 2023; originally announced April 2023.

  36. arXiv:2304.03456  [pdf, other

    cs.CV cs.LG

    Rethinking Evaluation Protocols of Visual Representations Learned via Self-supervised Learning

    Authors: Jae-Hun Lee, Doyoung Yoon, ByeongMoon Ji, Kyungyul Kim, Sangheum Hwang

    Abstract: Linear probing (LP) (and $k$-NN) on the upstream dataset with labels (e.g., ImageNet) and transfer learning (TL) to various downstream datasets are commonly employed to evaluate the quality of visual representations learned via self-supervised learning (SSL). Although existing SSL methods have shown good performances under those evaluation protocols, we observe that the performances are very sensi… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  37. arXiv:2303.06511  [pdf, other

    cs.RO

    Need for Speed: Fast Correspondence-Free Lidar-Inertial Odometry Using Doppler Velocity

    Authors: David J. Yoon, Keenan Burnett, Johann Laconte, Yi Chen, Heethesh Vhavle, Soeren Kammel, James Reuther, Timothy D. Barfoot

    Abstract: In this paper, we present a fast, lightweight odometry method that uses the Doppler velocity measurements from a Frequency-Modulated Continuous-Wave (FMCW) lidar without data association. FMCW lidar is a recently emerging technology that enables per-return relative radial velocity measurements via the Doppler effect. Since the Doppler measurement model is linear with respect to the 6-degrees-of-fr… ▽ More

    Submitted 29 September, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: Accepted and presented at IROS 2023

  38. arXiv:2303.06507  [pdf, other

    cs.RO

    Towards Consistent Batch State Estimation Using a Time-Correlated Measurement Noise Model

    Authors: David J. Yoon, Timothy D. Barfoot

    Abstract: In this paper, we present an algorithm for learning time-correlated measurement covariances for application in batch state estimation. We parameterize the inverse measurement covariance matrix to be block-banded, which conveniently factorizes and results in a computationally efficient approach for correlating measurements across the entire trajectory. We train our covariance model through supervis… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: ICRA 2023

  39. DIFAI: Diverse Facial Inpainting using StyleGAN Inversion

    Authors: Dongsik Yoon, Jeong-gi Kwak, Yuanming Li, David Han, Hanseok Ko

    Abstract: Image inpainting is an old problem in computer vision that restores occluded regions and completes damaged images. In the case of facial image inpainting, most of the methods generate only one result for each masked image, even though there are other reasonable possibilities. To prevent any potential biases and unnatural constraints stemming from generating only one image, we propose a novel frame… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: ICIP 2022

  40. arXiv:2301.08044  [pdf, other

    cs.CV

    Reference Guided Image Inpainting using Facial Attributes

    Authors: Dongsik Yoon, Jeonggi Kwak, Yuanming Li, David Han, Youngsaeng Jin, Hanseok Ko

    Abstract: Image inpainting is a technique of completing missing pixels such as occluded region restoration, distracting objects removal, and facial completion. Among these inpainting tasks, facial completion algorithm performs face inpainting according to the user direction. Existing approaches require delicate and well controlled input by the user, thus it is difficult for an average user to provide the gu… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: BMVC 2021

  41. arXiv:2301.00310  [pdf, other

    cs.SI cs.DB

    Graphlets over Time: A New Lens for Temporal Network Analysis

    Authors: Deukryeol Yoon, Dongjin Lee, Minyoung Choe, Kijung Shin

    Abstract: Graphs are widely used for modeling various types of interactions, such as email communications and online discussions. Many of such real-world graphs are temporal, and specifically, they grow over time with new nodes and edges. Counting the instances of each graphlet (i.e., an induced subgraph isomorphism class) has been successful in characterizing local structures of graphs, with many applica… ▽ More

    Submitted 3 January, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

    Comments: 13 pages, 7 figures

  42. arXiv:2212.02059  [pdf, other

    cs.CV

    Region-Conditioned Orthogonal 3D U-Net for Weather4Cast Competition

    Authors: Taehyeon Kim, Shinhwan Kang, Hyeonjeong Shin, Deukryeol Yoon, Seongha Eom, Kijung Shin, Se-Young Yun

    Abstract: The Weather4Cast competition (hosted by NeurIPS 2022) required competitors to predict super-resolution rain movies in various regions of Europe when low-resolution satellite contexts covering wider regions are given. In this paper, we show that a general baseline 3D U-Net can be significantly improved with region-conditioned layers as well as orthogonality regularizations on 1x1x1 convolutional la… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: workshop at NeurIPS 2022 Competition Track on Weather4Cast

  43. arXiv:2211.14807  [pdf, other

    cs.CG

    Universal convex covering problems under translation and discrete rotations

    Authors: Mook Kwon Jung, Sang Duk Yoon, Hee-Kap Ahn, Takeshi Tokuyama

    Abstract: We consider the smallest-area universal covering of planar objects of perimeter 2 (or equivalently closed curves of length 2) allowing translation and discrete rotations. In particular, we show that the solution is an equilateral triangle of height 1 when translation and discrete rotation of $π$ are allowed. Our proof is purely geometric and elementary. We also give convex coverings of closed curv… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    MSC Class: 52C15; 05B40 ACM Class: F.0; G.0

  44. arXiv:2210.07760  [pdf, other

    cs.CV

    Lightweight Alpha Matting Network Using Distillation-Based Channel Pruning

    Authors: Donggeun Yoon, Jinsun Park, Donghyeon Cho

    Abstract: Recently, alpha matting has received a lot of attention because of its usefulness in mobile applications such as selfies. Therefore, there has been a demand for a lightweight alpha matting model due to the limited computational resources of commercial portable devices. To this end, we suggest a distillation-based channel pruning method for the alpha matting networks. In the pruning step, we remove… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by ACCV2022

  45. arXiv:2210.01504  [pdf, other

    cs.CL

    Knowledge Unlearning for Mitigating Privacy Risks in Language Models

    Authors: Joel Jang, Dongkeun Yoon, Sohee Yang, Sungmin Cha, Moontae Lee, Lajanugen Logeswaran, Minjoon Seo

    Abstract: Pretrained Language Models (LMs) memorize a vast amount of knowledge during initial pretraining, including information that may violate the privacy of personal lives and identities. Previous work addressing privacy issues for language models has mostly focused on data preprocessing and differential privacy methods, both requiring re-training the underlying LM. We propose knowledge unlearning as an… ▽ More

    Submitted 19 December, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

  46. arXiv:2209.08497  [pdf, other

    cs.LG cs.AI cs.DC

    Empirical Analysis on Top-k Gradient Sparsification for Distributed Deep Learning in a Supercomputing Environment

    Authors: Daegun Yoon, Sangyoon Oh

    Abstract: To train deep learning models faster, distributed training on multiple GPUs is the very popular scheme in recent years. However, the communication bandwidth is still a major bottleneck of training performance. To improve overall training performance, recent works have proposed gradient sparsification methods that reduce the communication traffic significantly. Most of them require gradient sorting… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 4 pages, 4 figures, The 8th International Conference on Next Generation Computing (ICNGC) 2022

  47. arXiv:2209.03304  [pdf, other

    cs.RO

    Picking Up Speed: Continuous-Time Lidar-Only Odometry using Doppler Velocity Measurements

    Authors: Yuchen Wu, David J. Yoon, Keenan Burnett, Soeren Kammel, Yi Chen, Heethesh Vhavle, Timothy D. Barfoot

    Abstract: Frequency-Modulated Continuous-Wave (FMCW) lidar is a recently emerging technology that additionally enables per-return instantaneous relative radial velocity measurements via the Doppler effect. In this letter, we present the first continuous-time lidar-only odometry algorithm using these Doppler velocity measurements from an FMCW lidar to aid odometry in geometrically degenerate environments. We… ▽ More

    Submitted 3 December, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: RA-L & ICRA2023

  48. arXiv:2208.12392  [pdf, other

    cs.AR cs.AI cs.CR cs.LG

    DiVa: An Accelerator for Differentially Private Machine Learning

    Authors: Beomsik Park, Ranggi Hwang, Dongho Yoon, Yoonhyuk Choi, Minsoo Rhu

    Abstract: The widespread deployment of machine learning (ML) is raising serious concerns on protecting the privacy of users who contributed to the collection of training data. Differential privacy (DP) is rapidly gaining momentum in the industry as a practical standard for privacy protection. Despite DP's importance, however, little has been explored within the computer systems community regarding the impli… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at the 55th IEEE/ACM International Symposium on Microarchitecture (MICRO-55), 2022

  49. arXiv:2207.10257  [pdf, other

    cs.CV cs.GR

    Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis

    Authors: Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David Han, Hanseok Ko

    Abstract: Over the years, 2D GANs have achieved great successes in photorealistic portrait generation. However, they lack 3D understanding in the generation process, thus they suffer from multi-view inconsistency problem. To alleviate the issue, many 3D-aware GANs have been proposed and shown notable results, but 3D GANs struggle with editing semantic attributes. The controllability and interpretability of… ▽ More

    Submitted 26 July, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ECCV 2022, project page: https://jgkwak95.github.io/surfgan/

  50. arXiv:2206.04386  [pdf

    cs.HC cs.CY cs.ET

    Interaction Design for VR Applications: Understanding Needs for University Curricula

    Authors: Oloff C. Biermann, Daniel Ajisafe, Dongwook Yoon

    Abstract: As virtual reality (VR) is emerging in the tech sector, developers and designers are under pressure to create immersive experiences for their products. However, the current curricula from top institutions focus primarily on technical considerations for building VR applications, missing out on concerns and usability problems specific to VR interaction design. To better understand current needs, we… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 7 pages, 2 figures, published to CHI EA. For the associated presentation, see https://dl.acm.org/action/downloadSupplement?doi=10.1145%2F3491101.3519859&file=3491101.3519859-talk-video.mp4

    ACM Class: K.3.2; H.5.1

    Journal ref: CHI '22 Extended Abstracts. ACM, New York, NY, USA, 7 pages (2022)