Skip to main content

Showing 1–23 of 23 results for author: Jeon, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.07345  [pdf, other

    cs.CL cs.AI cs.IR

    QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines

    Authors: Ohjoon Kwon, Changsu Lee, Jihye Back, Lim Sun Suk, Inho Kang, Donghyeon Jeon

    Abstract: Large language models (LLMs) have been widely used for relevance assessment in information retrieval. However, our study demonstrates that combining two distinct small language models (SLMs) with different architectures can outperform LLMs in this task. Our approach -- QUPID -- integrates a generative SLM with an embedding-based SLM, achieving higher relevance judgment accuracy while reducing comp… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Journal ref: ACL 2025 Industry Track

  2. arXiv:2503.16459  [pdf

    cs.HC cs.RO

    The Realization of Virtual Environments in the Lower Limb Exoskeletal Robot

    Authors: Minsu Chang, Doyoung Jeon

    Abstract: This study proposes the realization of various virtual environments using a lower limb exoskeletal robot for futuristic gait rehabilitation. The proposed method allows the user to feel virtual gravity, buoyancy, and drag while actively walking. The virtual environments include four fluidic conditions: Water, Olive oil, Honey, and Peanut Butter, and four gravitational conditions consisting of the E… ▽ More

    Submitted 23 February, 2025; originally announced March 2025.

    Comments: 8 pages, 6 figures, have submitted to IEEE

    MSC Class: 68T40 (Robotics); 93C85 (Automated Systems) ACM Class: I.2.9; I.5.4

  3. arXiv:2503.01905  [pdf, other

    cs.LG cs.AI

    PaCA: Partial Connection Adaptation for Efficient Fine-Tuning

    Authors: Sunghyeon Woo, Sol Namkung, Sunwoo Lee, Inho Jeong, Beomseok Kim, Dongsuk Jeon

    Abstract: Prior parameter-efficient fine-tuning (PEFT) algorithms reduce memory usage and computational costs of fine-tuning large neural network models by training only a few additional adapter parameters, rather than the entire model. However, the reduction in computational costs due to PEFT does not necessarily translate to a reduction in training time; although the computational costs of the adapter lay… ▽ More

    Submitted 11 March, 2025; v1 submitted 28 February, 2025; originally announced March 2025.

  4. arXiv:2502.16457  [pdf, other

    cs.CL

    Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

    Authors: Heegyu Kim, Taeyang Jeon, Seungtaek Choi, Ji Hoon Hong, Dong Won Jeon, Ga-Yeon Baek, Gyeong-Won Kwak, Dong-Hee Lee, Jisu Bae, Chihoon Lee, Yunseo Kim, Seon-Jin Choi, Jin-Seong Park, Sung Beom Cho, Hyunsouk Cho

    Abstract: Materials synthesis is vital for innovations such as energy storage, catalysis, electronics, and biomedical devices. Yet, the process relies heavily on empirical, trial-and-error methods guided by expert intuition. Our work aims to support the materials science community by providing a practical, data-driven resource. We have curated a comprehensive dataset of 17K expert-verified synthesis recipes… ▽ More

    Submitted 19 March, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

    Comments: under review

  5. arXiv:2501.05703  [pdf

    cs.HC

    Visualization Tool: Exploring COVID-19 Data

    Authors: Dong Hyun Jeon, Jong Kwan Lee, Prabal Dhaubhadel, Aaron Kuhlman

    Abstract: The ability to effectively visualize data is crucial in the contemporary world where information is often voluminous and complex. Visualizations, such as charts, graphs, and maps, provide an intuitive and easily understandable means to interpret, analyze, and communicate patterns, trends, and insights hidden within large datasets. These graphical representations can help researchers, policymakers,… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: Published in ISIITA 2024

  6. arXiv:2501.04161  [pdf, other

    cs.LG cs.IR

    KGIF: Optimizing Relation-Aware Recommendations with Knowledge Graph Information Fusion

    Authors: Dong Hyun Jeon, Wenbo Sun, Houbing Herbert Song, Dongfang Liu, Velasquez Alvaro, Yixin Chloe Xie, Shuteng Niu

    Abstract: While deep-learning-enabled recommender systems demonstrate strong performance benchmarks, many struggle to adapt effectively in real-world environments due to limited use of user-item relationship data and insufficient transparency in recommendation generation. Traditional collaborative filtering approaches fail to integrate multifaceted item attributes, and although Factorization Machines accoun… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

    Comments: Published at IEEE Big Data 2024

  7. arXiv:2412.04140  [pdf, other

    cs.LG cs.AI

    Understanding Memorization in Generative Models via Sharpness in Probability Landscapes

    Authors: Dongjae Jeon, Dueun Kim, Albert No

    Abstract: In this paper, we introduce a geometric framework to analyze memorization in diffusion models through the sharpness of the log probability density. We mathematically justify a previously proposed score-difference-based memorization metric by demonstrating its effectiveness in quantifying sharpness. Additionally, we propose a novel memorization metric that captures sharpness at the initial stage of… ▽ More

    Submitted 1 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  8. arXiv:2410.17519  [pdf, other

    cs.CL

    Large Language Models Still Exhibit Bias in Long Text

    Authors: Wonje Jeung, Dongjae Jeon, Ashkan Yousefpour, Jonghyun Choi

    Abstract: Existing fairness benchmarks for large language models (LLMs) primarily focus on simple tasks, such as multiple-choice questions, overlooking biases that may arise in more complex scenarios like long-text generation. To address this gap, we introduce the Long Text Fairness Test (LTF-TEST), a framework that evaluates biases in LLMs through essay-style prompts. LTF-TEST covers 14 topics and 10 demog… ▽ More

    Submitted 25 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

    Comments: 22 page, 38 figures, Neurips (SoLaR Workshop)

  9. SLM as Guardian: Pioneering AI Safety with Small Language Models

    Authors: Ohjoon Kwon, Donghyeon Jeon, Nayoung Choi, Gyu-Hwung Cho, Changbong Kim, Hyunwoo Lee, Inho Kang, Sun Kim, Taiwoo Park

    Abstract: Most prior safety research of large language models (LLMs) has focused on enhancing the alignment of LLMs to better suit the safety requirements of humans. However, internalizing such safeguard features into larger models brought challenges of higher training cost and unintended degradation of helpfulness. To overcome such challenges, a modular approach employing a smaller LLM to detect harmful us… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  10. arXiv:2405.17878  [pdf, other

    cs.LG cs.AI

    An Information Theoretic Evaluation Metric For Strong Unlearning

    Authors: Dongjae Jeon, Wonje Jeung, Taeheon Kim, Albert No, Jonghyun Choi

    Abstract: Machine unlearning (MU) aims to remove the influence of specific data from trained models, addressing privacy concerns and ensuring compliance with regulations such as the "right to be forgotten." Evaluating strong unlearning, where the unlearned model is indistinguishable from one retrained without the forgetting data, remains a significant challenge in deep neural networks (DNNs). Common black-b… ▽ More

    Submitted 19 October, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  11. arXiv:2404.08672  [pdf, other

    cs.IR cs.AI cs.CL cs.CY cs.LG

    Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

    Authors: Hwiyeol Jo, Taiwoo Park, Hyunwoo Lee, Nayoung Choi, Changbong Kim, Ohjoon Kwon, Donghyeon Jeon, Eui-Hyeon Lee, Kyoungho Shin, Sun Suk Lim, Kyungmi Kim, Jihye Lee, Sun Kim

    Abstract: Although there has been a growing interest among industries in integrating generative LLMs into their services, limited experience and scarcity of resources act as a barrier in launching and servicing large-scale LLM-based services. In this paper, we share our experiences in developing and operating generative AI models within a national-scale search engine, with a specific focus on the sensitiven… ▽ More

    Submitted 16 April, 2025; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: NAACL2025(Findings), corrected typo in co-corresponding authors

  12. arXiv:2402.17812  [pdf, other

    cs.LG cs.CL

    DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation

    Authors: Sunghyeon Woo, Baeseong Park, Byeongwook Kim, Minjung Jo, Se Jung Kwon, Dongsuk Jeon, Dongsoo Lee

    Abstract: Large language models (LLMs) have achieved significant success across various domains. However, training these LLMs typically involves substantial memory and computational costs during both forward and backward propagation. While parameter-efficient fine-tuning (PEFT) considerably reduces the training memory associated with parameters, it does not address the significant computational costs and ac… ▽ More

    Submitted 28 February, 2025; v1 submitted 27 February, 2024; originally announced February 2024.

  13. arXiv:2308.11199  [pdf, other

    cs.CV cs.AI cs.LG

    ConcatPlexer: Additional Dim1 Batching for Faster ViTs

    Authors: Donghoon Han, Seunghyeon Seo, Donghyeon Jeon, Jiho Jang, Chaerin Kong, Nojun Kwak

    Abstract: Transformers have demonstrated tremendous success not only in the natural language processing (NLP) domain but also the field of computer vision, igniting various creative approaches and applications. Yet, the superior performance and modeling flexibility of transformers came with a severe increase in computation costs, and hence several works have proposed methods to reduce this burden. Inspired… ▽ More

    Submitted 31 January, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

  14. arXiv:2306.17618  [pdf, other

    cs.CV

    Polarimetric iToF: Measuring High-Fidelity Depth through Scattering Media

    Authors: Daniel S. Jeon, Andreas Meuleman, Seung-Hwan Baek, Min H. Kim

    Abstract: Indirect time-of-flight (iToF) imaging allows us to capture dense depth information at a low cost. However, iToF imaging often suffers from multipath interference (MPI) artifacts in the presence of scattering media, resulting in severe depth-accuracy degradation. For instance, iToF cameras cannot measure depth accurately through fog because ToF active illumination scatters back to the sensor befor… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 12353-12362

  15. arXiv:2305.04001  [pdf, other

    cs.CV cs.AI

    AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion

    Authors: Seungwoo Lee, Chaerin Kong, Donghyeon Jeon, Nojun Kwak

    Abstract: Recent advances in diffusion models have showcased promising results in the text-to-video (T2V) synthesis task. However, as these T2V models solely employ text as the guidance, they tend to struggle in modeling detailed temporal dynamics. In this paper, we introduce a novel T2V framework that additionally employ audio signals to control the temporal dynamics, empowering an off-the-shelf T2I diffus… ▽ More

    Submitted 23 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: CVPR2023 Workshop on AI for Content Creation. Project Page: https://lifrary.github.io/AADiff/

  16. arXiv:2211.11153  [pdf, other

    cs.LG cs.CL cs.CV

    Unifying Vision-Language Representation Space with Single-tower Transformer

    Authors: Jiho Jang, Chaerin Kong, Donghyeon Jeon, Seonhoon Kim, Nojun Kwak

    Abstract: Contrastive learning is a form of distance learning that aims to learn invariant features from two related representations. In this paper, we explore the bold hypothesis that an image and its caption can be simply regarded as two different views of the underlying mutual information, and train a model to learn a unified vision-language representation space that encodes both modalities at once in a… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: AAAI 2023, 11 pages

  17. arXiv:2210.05872  [pdf, other

    cs.CV

    Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation

    Authors: Chaerin Kong, DongHyeon Jeon, Ohjoon Kwon, Nojun Kwak

    Abstract: Fashion attribute editing is a task that aims to convert the semantic attributes of a given fashion image while preserving the irrelevant regions. Previous works typically employ conditional GANs where the generator explicitly learns the target attributes and directly execute the conversion. These approaches, however, are neither scalable nor generic as they operate only with few limited attribute… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  18. Sparse Ellipsometry: Portable Acquisition of Polarimetric SVBRDF and Shape with Unstructured Flash Photography

    Authors: Inseung Hwang, Daniel S. Jeon, Adolfo Muñoz, Diego Gutierrez, Xin Tong, Min H. Kim

    Abstract: Ellipsometry techniques allow to measure polarization information of materials, requiring precise rotations of optical components with different configurations of lights and sensors. This results in cumbersome capture devices, carefully calibrated in lab conditions, and in very long acquisition times, usually in the order of a few days per object. Recent techniques allow to capture polarimetric sp… ▽ More

    Submitted 8 February, 2023; v1 submitted 9 July, 2022; originally announced July 2022.

    Journal ref: ACM Transactions on Graphics 41, 4, Article 133 (July 2022)

  19. arXiv:2205.05300  [pdf, other

    cs.CL

    User Guide for KOTE: Korean Online Comments Emotions Dataset

    Authors: Duyoung Jeon, Junho Lee, Cheongtag Kim

    Abstract: Sentiment analysis that classifies data into positive or negative has been dominantly used to recognize emotional aspects of texts, despite the deficit of thorough examination of emotional meanings. Recently, corpora labeled with more than just valence are built to exceed this limit. However, most Korean emotion corpora are small in the number of instances and cover a limited range of emotions. We… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: 16 pages, 4 figures

  20. arXiv:2109.04650  [pdf, other

    cs.CL

    What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers

    Authors: Boseop Kim, HyoungSeok Kim, Sang-Woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-Hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park , et al. (12 additional authors not shown)

    Abstract: GPT-3 shows remarkable in-context learning ability of large-scale language models (LMs) trained on hundreds of billion scale data. Here we address some remaining issues less reported by the GPT-3 paper, such as a non-English LM, the performances of different sized models, and the effect of recently introduced prompt optimization on in-context learning. To achieve this, we introduce HyperCLOVA, a K… ▽ More

    Submitted 28 November, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP2021 as a long paper. Fixed some typos

  21. arXiv:2102.03207  [pdf, other

    cs.SD cs.AI eess.AS

    Real-time Denoising and Dereverberation with Tiny Recurrent U-Net

    Authors: Hyeong-Seok Choi, Sungjin Park, Jie Hwan Lee, Hoon Heo, Dongsuk Jeon, Kyogu Lee

    Abstract: Modern deep learning-based models have seen outstanding performance improvement with speech enhancement tasks. The number of parameters of state-of-the-art models, however, is often too large to be deployed on devices for real-world applications. To this end, we propose Tiny Recurrent U-Net (TRU-Net), a lightweight online inference model that matches the performance of current state-of-the-art mod… ▽ More

    Submitted 22 June, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: 5 pages, 2 figures, 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). arXiv admin note: text overlap with arXiv:2006.00687

  22. arXiv:2009.00463  [pdf, other

    eess.IV cs.CV

    Single-shot Hyperspectral-Depth Imaging with Learned Diffractive Optics

    Authors: Seung-Hwan Baek, Hayato Ikoma, Daniel S. Jeon, Yuqi Li, Wolfgang Heidrich, Gordon Wetzstein, Min H. Kim

    Abstract: Imaging depth and spectrum have been extensively studied in isolation from each other for decades. Recently, hyperspectral-depth (HS-D) imaging emerges to capture both information simultaneously by combining two different imaging systems; one for depth, the other for spectrum. While being accurate, this combinational approach induces increased form factor, cost, capture time, and alignment/registr… ▽ More

    Submitted 15 August, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

    ACM Class: I.2.10; I.4.1; I.5

    Journal ref: International Conference on Computer Vision (ICCV) 2021

  23. arXiv:2006.14317  [pdf

    cs.AR

    A Fast Finite Field Multiplier for SIKE

    Authors: Yeonsoo Jeon, Dongsuk Jeon

    Abstract: Various post-quantum cryptography algorithms have been recently proposed. Supersingluar isogeny Diffie-Hellman key exchange (SIKE) is one of the most promising candidates due to its small key size. However, the SIKE scheme requires numerous finite field multiplications for its isogeny computation, and hence suffers from slow encryption and decryption process. In this paper, we propose a fast finit… ▽ More

    Submitted 26 November, 2020; v1 submitted 25 June, 2020; originally announced June 2020.