Showing 1–14 of 14 results for author: Seo, G

Searching in archive cs.
  1. arXiv:2504.02882  [pdf, other]

    cs.CL cs.LG

    DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

    Authors: Sunghee Jung, Donghun Lee, Shinbok Lee, Gaeun Seo, Daniel Lee, Byeongil Ko, Junrae Cho, Kihyun Kim, Eunggyun Kim, Myeongcheol Shin

    Abstract: Tool-Augmented Large Language Models (TA-LLMs) have shown promise in real-world applications, but face challenges in handling incomplete queries and out-of-scope requests. While existing approaches rely mainly on Supervised Fine-Tuning with expert trajectories, we propose DiaTool-DPO, a novel method that enhances TA-LLM's dialogue capabilities through Direct Preference Optimization. We model TA-L…

    Submitted 2 April, 2025; originally announced April 2025.

  2. arXiv:2503.00564  [pdf, other]

    cs.CL

    ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models

    Authors: Jeonghoon Shim, Gyuhyeon Seo, Cheongsu Lim, Yohan Jo

    Abstract: Tool-Augmented Language Models (TALMs) leverage external APIs to answer user queries across various domains. However, existing benchmark datasets for TALM research often feature simplistic dialogues that do not reflect real-world scenarios, such as the need for models to ask clarifying questions or proactively call additional APIs when essential information is missing. To address these limitations…

    Submitted 1 March, 2025; originally announced March 2025.

    Comments: Accepted to ICLR 2025

  3. arXiv:2502.18934  [pdf, other]

    cs.CL cs.LG

    Kanana: Compute-efficient Bilingual Language Models

    Authors: Kanana LLM Team, Yunju Bak, Hojin Lee, Minho Ryu, Jiyeon Ham, Seungjae Jung, Daniel Wontae Nam, Taegyeong Eo, Donghun Lee, Doohae Jung, Boseop Kim, Nayeon Kim, Jaesun Park, Hyunho Kim, Hyunwoong Ko, Changmin Lee, Kyoung-Woon On, Seulye Baeg, Junrae Cho, Sunghee Jung, Jieun Kang, EungGyun Kim, Eunhwa Kim, Byeongil Ko, Daniel Lee, et al. (4 additional authors not shown)

    Abstract: We introduce Kanana, a series of bilingual language models that demonstrate exceeding performance in Korean and competitive performance in English. The computational cost of Kanana is significantly lower than that of state-of-the-art models of similar size. The report details the techniques employed during pre-training to achieve compute-efficient yet competitive models, including high quality dat…

    Submitted 28 February, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 40 pages, 15 figures

  4. arXiv:2501.12194  [pdf]

    cs.SD cs.AI cs.LG eess.AS

    An End-to-End Approach for Korean Wakeword Systems with Speaker Authentication

    Authors: Geonwoo Seo

    Abstract: Wakeword detection plays a critical role in enabling AI assistants to listen to user voices and interact effectively. However, for languages other than English, there is a significant lack of pre-trained wakeword models. Additionally, systems that merely determine the presence of a wakeword can pose serious privacy concerns. In this paper, we propose an end-to-end approach that trains wakewords fo…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: 19 pages, 10 figures, implementation code available at https://github.com/gws8820/securewakeword-model, https://github.com/gws8820/wyoming-securewakeword, demo video at https://www.youtube.com/watch?v=F3AXUbL-i-o

    ACM Class: I.2.7; I.5.4

  5. arXiv:2411.14054  [pdf, other]

    cs.CL cs.AI

    FunctionChat-Bench: Comprehensive Evaluation of Language Models' Generative Capabilities in Korean Tool-use Dialogs

    Authors: Shinbok Lee, Gaeun Seo, Daniel Lee, Byeongil Ko, Sunghee Jung, Myeongcheol Shin

    Abstract: This study investigates language models' generative capabilities in tool-use dialogs. We categorize the models' outputs in tool-use dialogs into four distinct types: Tool Call, Answer Completion, Slot Question, and Relevance Detection, which serve as aspects for evaluation. We introduce FunctionChat-Bench, comprising 700 evaluation items and automated assessment programs. Using this benchmark, we…

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 8 pages

  6. arXiv:2411.05357  [pdf, other]

    cs.CV

    Enhancing Visual Classification using Comparative Descriptors

    Authors: Hankyeol Lee, Gawon Seo, Wonseok Choi, Geunyoung Jung, Kyungwoo Song, Jiyoung Jung

    Abstract: The performance of vision-language models (VLMs), such as CLIP, in visual classification tasks, has been enhanced by leveraging semantic knowledge from large language models (LLMs), including GPT. Recent studies have shown that in zero-shot classification tasks, descriptors incorporating additional cues, high-level concepts, or even random characters often outperform those using only the category…

    Submitted 10 November, 2024; v1 submitted 8 November, 2024; originally announced November 2024.

    Comments: Accepted by WACV 2025

  7. arXiv:2407.17491  [pdf, ps, other]

    cs.CV cs.LG

    Robust Adaptation of Foundation Models with Black-Box Visual Prompting

    Authors: Changdae Oh, Gyeongdeok Seo, Geunyoung Jung, Zhi-Qi Cheng, Hosik Choi, Jiyoung Jung, Kyungwoo Song

    Abstract: With a surge of large-scale pre-trained models, parameter-efficient transfer learning (PETL) of large models has garnered significant attention. While promising, they commonly rely on two optimistic assumptions: 1) full access to the parameters of a PTM, and 2) sufficient memory capacity to cache all intermediate activations for gradient computation. However, in most real-world applications, PTMs…

    Submitted 31 May, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: Extended work from the CVPR'23 paper: arxiv:2303.14773; This paper has been submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) for possible publication

  8. arXiv:2404.12168  [pdf, other]

    cs.CV cs.AI

    Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization

    Authors: Insoo Kim, Jae Seok Choi, Geonseok Seo, Kinam Kwon, Jinwoo Shin, Hyong-Euk Lee

    Abstract: As recent advances in mobile camera technology have enabled the capability to capture high-resolution images, such as 4K images, the demand for an efficient deblurring model handling large motion has increased. In this paper, we discover that the image residual errors, i.e., blur-sharp pixel differences, can be grouped into some categories according to their motion blur type and how complex their…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: CVPR2024 Camera-Ready

  9. arXiv:2404.10355  [pdf, other]

    cs.AR

    AERO: Adaptive Erase Operation for Improving Lifetime and Performance of Modern NAND Flash-Based SSDs

    Authors: Sungjun Cho, Beomjun Kim, Hyunuk Cho, Gyeongseob Seo, Onur Mutlu, Myungsuk Kim, Jisung Park

    Abstract: This work investigates a new erase scheme in NAND flash memory to improve the lifetime and performance of modern solid-state drives (SSDs). In NAND flash memory, an erase operation applies a high voltage (e.g., > 20 V) to flash cells for a long time (e.g., > 3.5 ms), which degrades cell endurance and potentially delays user I/O requests. While a large body of prior work has proposed various techni…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted for publication at Proceedings of the 29th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2024

  10. arXiv:2202.10846  [pdf, ps, other]

    cs.CC math.AC math.AG math.RA math.RT

    P-class is a proper subclass of NP-class; and more

    Authors: JongJin Kim, GwangJin Kim, JongPyo Lee, ShuanHong Wang, Ki-Bong Nam, GyungSig Seo, InSu Kim, YangGon Kim

    Abstract: We may give rise to some questions related to the mathematical structures of $P$-class and $NP$-class. We have seen that one is a proper subclass of the other. Here we disclose more that $P$-class turns out to be the proper distributive sublattice of the $NP$-class.

    Submitted 9 July, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2003.05321, arXiv:1912.10849

    MSC Class: Primary 17B10; 17B50; Secondary 68Q15; 68Q17

    Journal ref: Journal of Applied Algebra and Discrete Structures, Vol. 2 (2004), No. 1, pp. 1-26, SAS international publications, URL: www.sasip.net

  11. Deep Context- and Relation-Aware Learning for Aspect-based Sentiment Analysis

    Authors: Shinhyeok Oh, Dongyub Lee, Taesun Whang, IlNam Park, Gaeun Seo, EungGyun Kim, Harksoo Kim

    Abstract: Existing works for aspect-based sentiment analysis (ABSA) have adopted a unified approach, which allows the interactive relations among subtasks. However, we observe that these methods tend to predict polarities based on the literal meaning of aspect and opinion terms and mainly consider relations implicitly among subtasks at the word level. In addition, identifying multiple aspect-opinion pairs w…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL-IJCNLP 2021

  12. arXiv:2005.11220  [pdf, other]

    cs.CV

    KL-Divergence-Based Region Proposal Network for Object Detection

    Authors: Geonseok Seo, Jaeyoung Yoo, Jaeseok Choi, Nojun Kwak

    Abstract: The learning of the region proposal in object detection using the deep neural networks (DNN) is divided into two tasks: binary classification and bounding box regression task. However, traditional RPN (Region Proposal Network) defines these two tasks as different problems, and they are trained independently. In this paper, we propose a new region proposal learning method that considers the boundin…

    Submitted 22 May, 2020; originally announced May 2020.

    Comments: 5 pages, 3 figures, Accepted to ICIP 2020

  13. arXiv:1911.12721  [pdf, other]

    cs.CV

    Training Multi-Object Detector by Estimating Bounding Box Distribution for Input Image

    Authors: Jaeyoung Yoo, Hojun Lee, Inseop Chung, Geonseok Seo, Nojun Kwak

    Abstract: In multi-object detection using neural networks, the fundamental problem is, "How should the network learn a variable number of bounding boxes in different input images?". Previous methods train a multi-object detection network through a procedure that directly assigns the ground truth bounding boxes to the specific locations of the network's output. However, this procedure makes the training of a…

    Submitted 5 September, 2021; v1 submitted 28 November, 2019; originally announced November 2019.

    Comments: 10 pages, 7 figures

  14. arXiv:1812.04920  [pdf, other]

    cs.CV

    C3: Concentrated-Comprehensive Convolution and its application to semantic segmentation

    Authors: Hyojin Park, Youngjoon Yoo, Geonseok Seo, Dongyoon Han, Sangdoo Yun, Nojun Kwak

    Abstract: One of the practical choices for making a lightweight semantic segmentation model is to combine a depth-wise separable convolution with a dilated convolution. However, the simple combination of these two methods results in an over-simplified operation which causes severe performance degradation due to loss of information contained in the feature map. To resolve this problem, we propose a new block…

    Submitted 28 July, 2019; v1 submitted 12 December, 2018; originally announced December 2018.