Skip to main content

Showing 1–28 of 28 results for author: Nam, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.13235  [pdf, ps, other

    cs.CV cs.LG

    WriteViT: Handwritten Text Generation with Vision Transformer

    Authors: Dang Hoai Nam, Huynh Tong Dang Khoa, Vo Nguyen Le Duy

    Abstract: Humans can quickly generalize handwriting styles from a single example by intuitively separating content from style. Machines, however, struggle with this task, especially in low-data settings, often missing subtle spatial and stylistic cues. Motivated by this gap, we introduce WriteViT, a one-shot handwritten text synthesis framework that incorporates Vision Transformers (ViT), a family of models… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2504.20196  [pdf, other

    cs.SE cs.AI cs.HC

    Prompting LLMs for Code Editing: Struggles and Remedies

    Authors: Daye Nam, Ahmed Omran, Ambar Murillo, Saksham Thakur, Abner Araujo, Marcel Blistein, Alexander Frömmgen, Vincent Hellendoorn, Satish Chandra

    Abstract: Large Language Models (LLMs) are rapidly transforming software engineering, with coding assistants embedded in an IDE becoming increasingly prevalent. While research has focused on improving the tools and understanding developer perceptions, a critical gap exists in understanding how developers actually use these tools in their daily workflows, and, crucially, where they struggle. This paper addre… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  3. arXiv:2504.18691  [pdf, other

    cs.HC cs.AI cs.SE

    From Prompts to Propositions: A Logic-Based Lens on Student-LLM Interactions

    Authors: Ali Alfageeh, Sadegh AlMahdi Kazemi Zarkouei, Daye Nam, Daniel Prol, Matin Amoozadeh, Souti Chattopadhyay, James Prather, Paul Denny, Juho Leinonen, Michael Hilton, Sruti Srinivasa Ragavan, Mohammad Amin Alipour

    Abstract: Background and Context. The increasing integration of large language models (LLMs) in computing education presents an emerging challenge in understanding how students use LLMs and craft prompts to solve computational tasks. Prior research has used both qualitative and quantitative methods to analyze prompting behavior, but these approaches lack scalability or fail to effectively capture the semant… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

  4. arXiv:2502.18934  [pdf, other

    cs.CL cs.LG

    Kanana: Compute-efficient Bilingual Language Models

    Authors: Kanana LLM Team, Yunju Bak, Hojin Lee, Minho Ryu, Jiyeon Ham, Seungjae Jung, Daniel Wontae Nam, Taegyeong Eo, Donghun Lee, Doohae Jung, Boseop Kim, Nayeon Kim, Jaesun Park, Hyunho Kim, Hyunwoong Ko, Changmin Lee, Kyoung-Woon On, Seulye Baeg, Junrae Cho, Sunghee Jung, Jieun Kang, EungGyun Kim, Eunhwa Kim, Byeongil Ko, Daniel Lee , et al. (4 additional authors not shown)

    Abstract: We introduce Kanana, a series of bilingual language models that demonstrate exceeding performance in Korean and competitive performance in English. The computational cost of Kanana is significantly lower than that of state-of-the-art models of similar size. The report details the techniques employed during pre-training to achieve compute-efficient yet competitive models, including high quality dat… ▽ More

    Submitted 28 February, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 40 pages, 15 figures

  5. arXiv:2502.05500  [pdf

    cs.RO cs.AI

    Vision-Ultrasound Robotic System based on Deep Learning for Gas and Arc Hazard Detection in Manufacturing

    Authors: Jin-Hee Lee, Dahyun Nam, Robin Inho Kee, YoungKey Kim, Seok-Jun Buu

    Abstract: Gas leaks and arc discharges present significant risks in industrial environments, requiring robust detection systems to ensure safety and operational efficiency. Inspired by human protocols that combine visual identification with acoustic verification, this study proposes a deep learning-based robotic system for autonomously detecting and classifying gas leaks and arc discharges in manufacturing… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: Submitted to Engineering Applications of Artificial Intelligence

    ACM Class: I.2.1; I.2.9

  6. arXiv:2410.12944  [pdf, other

    cs.SE cs.HC

    How much does AI impact development speed? An enterprise-based randomized controlled trial

    Authors: Elise Paradis, Kate Grey, Quinn Madison, Daye Nam, Andrew Macvean, Vahid Meimand, Nan Zhang, Ben Ferrari-Church, Satish Chandra

    Abstract: How much does AI assistance impact developer productivity? To date, the software engineering literature has provided a range of answers, targeting a diversity of outcomes: from perceived productivity to speed on task and developer throughput. Our randomized controlled trial with 96 full-time Google software engineers contributes to this literature by sharing an estimate of the impact of three AI f… ▽ More

    Submitted 11 November, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: 12 pages, 7 figures, 3 tables

    ACM Class: C.4; D.2.8; D.2.6; H.5.2; I.2.1; I.2.m

  7. arXiv:2410.00414  [pdf, ps, other

    cs.CL

    Semantic Parsing with Candidate Expressions for Knowledge Base Question Answering

    Authors: Daehwan Nam, Gary Geunbae Lee

    Abstract: Semantic parsers convert natural language to logical forms, which can be evaluated on knowledge bases (KBs) to produce denotations. Recent semantic parsers have been developed with sequence-to-sequence (seq2seq) pre-trained language models (PLMs) or large language models, where the models treat logical forms as sequences of tokens. For syntactic and semantic validity, the semantic parsers use gram… ▽ More

    Submitted 7 April, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

  8. arXiv:2409.15784  [pdf

    physics.app-ph cond-mat.mtrl-sci cs.LG physics.optics

    Deep-learning real-time phase retrieval of imperfect diffraction patterns from X-ray free-electron lasers

    Authors: Sung Yun Lee, Do Hyung Cho, Chulho Jung, Daeho Sung, Daewoong Nam, Sangsoo Kim, Changyong Song

    Abstract: Machine learning is attracting surging interest across nearly all scientific areas by enabling the analysis of large datasets and the extraction of scientific information from incomplete data. Data-driven science is rapidly growing, especially in X-ray methodologies, where advanced light sources and detection technologies accumulate vast amounts of data that exceed meticulous human inspection capa… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    MSC Class: 68T07 ACM Class: J.2

  9. arXiv:2409.12521  [pdf, other

    cs.RO eess.SY

    GraspSAM: When Segment Anything Model Meets Grasp Detection

    Authors: Sangjun Noh, Jongwon Kim, Dongwoo Nam, Seunghyeok Back, Raeyoung Kang, Kyoobin Lee

    Abstract: Grasp detection requires flexibility to handle objects of various shapes without relying on prior knowledge of the object, while also offering intuitive, user-guided control. This paper introduces GraspSAM, an innovative extension of the Segment Anything Model (SAM), designed for prompt-driven and category-agnostic grasp detection. Unlike previous methods, which are often limited by small-scale tr… ▽ More

    Submitted 23 September, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

    Comments: 6 pages (main), 1 page (references)

  10. arXiv:2407.16574  [pdf, other

    cs.CL

    TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback

    Authors: Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark A. Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo

    Abstract: Reinforcement Learning from Human Feedback (RLHF) leverages human preference data to train language models to align more closely with human essence. These human preference data, however, are labeled at the sequence level, creating a mismatch between sequence-level preference labels and tokens, which are autoregressively generated from the language model. Although several recent approaches have tri… ▽ More

    Submitted 8 December, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: ACL2024 Findings

  11. arXiv:2407.00305  [pdf, other

    cs.HC

    Student-AI Interaction: A Case Study of CS1 students

    Authors: Matin Amoozadeh, Daye Nam, Daniel Prol, Ali Alfageeh, James Prather, Michael Hilton, Sruti Srinivasa Ragavan, Mohammad Amin Alipour

    Abstract: The new capabilities of generative artificial intelligence tools Generative AI, such as ChatGPT, allow users to interact with the system in intuitive ways, such as simple conversations, and receive (mostly) good-quality answers. These systems can support students' learning objectives by providing accessible explanations and examples even with vague queries. At the same time, they can encourage und… ▽ More

    Submitted 10 October, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Koli Calling 2024

  12. arXiv:2404.04656  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Binary Classifier Optimization for Large Language Model Alignment

    Authors: Seungjae Jung, Gunsoo Han, Daniel Wontae Nam, Kyoung-Woon On

    Abstract: In real-world services such as ChatGPT, aligning models based on user feedback is crucial for improving model performance. However, due to the simplicity and convenience of providing feedback, users typically offer only basic binary signals, such as 'thumbs-up' or 'thumbs-down'. Most existing alignment research, on the other hand, relies on preference-based approaches that require both positive an… ▽ More

    Submitted 9 June, 2025; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: ACL 2025 main

  13. arXiv:2401.07059  [pdf

    cs.CY

    Classifying Proposals of Decentralized Autonomous Organizations Using Large Language Models

    Authors: Christian Ziegler, Marcos Miranda, Guangye Cao, Gustav Arentoft, Doo Wan Nam

    Abstract: Our study demonstrates the effective use of Large Language Models (LLMs) for automating the classification of complex datasets. We specifically target proposals of Decentralized Autonomous Organizations (DAOs), as the clas-sification of this data requires the understanding of context and, therefore, depends on human expertise, leading to high costs associated with the task. The study applies an it… ▽ More

    Submitted 3 July, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

    Report number: Dawo/2024/01 ACM Class: H.0

  14. arXiv:2310.10817  [pdf, other

    cs.SE cs.HC

    Understanding Documentation Use Through Log Analysis: An Exploratory Case Study of Four Cloud Services

    Authors: Daye Nam, Andrew Macvean, Brad Myers, Bogdan Vasilescu

    Abstract: Almost no modern software system is written from scratch, and developers are required to effectively learn to use third-party libraries or software services. Thus, many practitioners and researchers have looked for ways to create effective documentation that supports developers' learning. However, few efforts have focused on how people actually use the documentation. In this paper, we report on an… ▽ More

    Submitted 29 February, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  15. arXiv:2310.06404  [pdf, other

    cs.CL cs.AI cs.LG

    Hexa: Self-Improving for Knowledge-Grounded Dialogue System

    Authors: Daejin Jo, Daniel Wontae Nam, Gunsoo Han, Kyoung-Woon On, Taehwan Kwon, Seungeun Rho, Sungwoong Kim

    Abstract: A common practice in knowledge-grounded dialogue generation is to explicitly utilize intermediate steps (e.g., web-search, memory retrieval) with modular approaches. However, data for such steps are often inaccessible compared to those of dialogue responses as they are unobservable in an ordinary dialogue. To fill in the absence of these data, we develop a self-improving method to improve the gene… ▽ More

    Submitted 2 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication

  16. arXiv:2310.04631  [pdf, other

    cs.HC

    Trust in Generative AI among students: An Exploratory Study

    Authors: Matin Amoozadeh, David Daniels, Daye Nam, Aayush Kumar, Stella Chen, Michael Hilton, Sruti Srinivasa Ragavan, Mohammad Amin Alipour

    Abstract: Generative artificial systems (GenAI) have experienced exponential growth in the past couple of years. These systems offer exciting capabilities, such as generating programs, that students can well utilize for their learning. Among many dimensions that might affect the effective adoption of GenAI, in this paper, we investigate students' \textit{trust}. Trust in GenAI influences the extent to which… ▽ More

    Submitted 1 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted at SIGCSE 2024

  17. arXiv:2307.08177  [pdf, other

    cs.SE cs.AI cs.HC

    Using an LLM to Help With Code Understanding

    Authors: Daye Nam, Andrew Macvean, Vincent Hellendoorn, Bogdan Vasilescu, Brad Myers

    Abstract: Understanding code is challenging, especially when working in new and complex development environments. Code comments and documentation can help, but are typically scarce or hard to navigate. Large language models (LLMs) are revolutionizing the process of writing code. Can they do the same for helping understand it? In this study, we provide a first investigation of an LLM-based conversational UI… ▽ More

    Submitted 16 January, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

  18. arXiv:2305.13973  [pdf, other

    cs.CL

    Effortless Integration of Memory Management into Open-Domain Conversation Systems

    Authors: Eunbi Choi, Kyoung-Woon On, Gunsoo Han, Sungwoong Kim, Daniel Wontae Nam, Daejin Jo, Seung Eun Rho, Taehwan Kwon, Minjoon Seo

    Abstract: Open-domain conversation systems integrate multiple conversation skills into a single system through a modular approach. One of the limitations of the system, however, is the absence of management capability for external memory. In this paper, we propose a simple method to improve BlenderBot3 by integrating memory management ability into it. Since no training data exists for this purpose, we propo… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  19. arXiv:2305.00630  [pdf, other

    cs.CV

    TRACE: Table Reconstruction Aligned to Corner and Edges

    Authors: Youngmin Baek, Daehyun Nam, Jaeheung Surh, Seung Shin, Seonghyeon Kim

    Abstract: A table is an object that captures structured and informative content within a document, and recognizing a table in an image is challenging due to the complexity and variety of table layouts. Many previous works typically adopt a two-stage approach; (1) Table detection(TD) localizes the table region in an image and (2) Table Structure Recognition(TSR) identifies row- and column-wise adjacency rela… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 18 pages, 7 figures, Accepted by ICDAR 2023

  20. arXiv:2301.11403  [pdf, other

    cs.SI cs.CL cs.LG

    Detecting Pump&Dump Stock Market Manipulation from Online Forums

    Authors: D. Nam, D. B. Skillicorn

    Abstract: The intersection of social media, low-cost trading platforms, and naive investors has created an ideal situation for information-based market manipulations, especially pump&dumps. Manipulators accumulate small-cap stocks, disseminate false information on social media to inflate their price, and sell at the peak. We collect a dataset of stocks whose price and volume profiles have the characteristic… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  21. arXiv:2210.05409  [pdf, other

    cs.LG cs.AI

    LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

    Authors: Daejin Jo, Sungwoong Kim, Daniel Wontae Nam, Taehwan Kwon, Seungeun Rho, Jongmin Kim, Donghoon Lee

    Abstract: Episodic count has been widely used to design a simple yet effective intrinsic motivation for reinforcement learning with a sparse reward. However, the use of episodic count in a high-dimensional state space as well as over a long episode time requires a thorough state compression and fast hashing, which hinders rigorous exploitation of it in such hard and complex exploration environments. Moreove… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  22. arXiv:2201.03758  [pdf, other

    cs.SE

    Predictive Synthesis of API-Centric Code

    Authors: Daye Nam, Baishakhi Ray, Seohyun Kim, Xianshan Qu, Satish Chandra

    Abstract: Today's programmers, especially data science practitioners, make heavy use of data-processing libraries (APIs) such as PyTorch, Tensorflow, NumPy, Pandas, and the like. Program synthesizers can provide significant coding assistance to this community of users; however program synthesis also can be slow due to enormous search spaces. In this work, we examine ways in which machine learning can be use… ▽ More

    Submitted 17 May, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  23. arXiv:2112.00152  [pdf, ps, other

    math.PR cs.DM math-ph

    One-step replica symmetry breaking of random regular NAE-SAT II

    Authors: Danny Nam, Allan Sly, Youngtak Sohn

    Abstract: Continuing our earlier work in \cite{nss20a}, we study the random regular k-NAE-SAT model in the condensation regime. In \cite{nss20a}, the 1RSB properties of the model were established with positive probability. In this paper, we improve the result to probability arbitrarily close to one. To do so, we introduce a new framework which is the synthesis of two approaches: the small subgraph condition… ▽ More

    Submitted 17 December, 2023; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: 57 pages, 1 figure. Accepted to Communications in Mathematical Physics. arXiv admin note: text overlap with arXiv:2011.14270

    MSC Class: 60G15; 60K35; 82B44; 82D30

  24. arXiv:2108.04539  [pdf, other

    cs.CL

    BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

    Authors: Teakgyu Hong, Donghyun Kim, Mingi Ji, Wonseok Hwang, Daehyun Nam, Sungrae Park

    Abstract: Key information extraction (KIE) from document images requires understanding the contextual and spatial semantics of texts in two-dimensional (2D) space. Many recent studies try to solve the task by developing pre-trained language models focusing on combining visual features from document images with texts and their layout. On the other hand, this paper tackles the problem by going back to the bas… ▽ More

    Submitted 5 April, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: AAAI 2022 - Main Technical Track

  25. arXiv:2105.11366  [pdf, other

    cs.LG

    GMAC: A Distributional Perspective on Actor-Critic Framework

    Authors: Daniel Wontae Nam, Younghoon Kim, Chan Y. Park

    Abstract: In this paper, we devise a distributional framework on actor-critic as a solution to distributional instability, action type restriction, and conflation between samples and statistics. We propose a new method that minimizes the Cramér distance with the multi-step Bellman target distribution generated from a novel Sample-Replacement algorithm denoted SR($λ$), which learns the correct value distribu… ▽ More

    Submitted 15 July, 2021; v1 submitted 24 May, 2021; originally announced May 2021.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:7927-7936, 2021

  26. arXiv:2007.09629  [pdf, other

    cs.CV

    Character Region Attention For Text Spotting

    Authors: Youngmin Baek, Seung Shin, Jeonghun Baek, Sungrae Park, Junyeop Lee, Daehyun Nam, Hwalsuk Lee

    Abstract: A scene text spotter is composed of text detection and recognition modules. Many studies have been conducted to unify these modules into an end-to-end trainable model to achieve better performance. A typical architecture places detection and recognition modules into separate branches, and a RoI pooling is commonly used to let the branches share a visual feature. However, there still exists a chanc… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: 17 pages, 9 figures, Accepted by ECCV 2020

  27. arXiv:2006.06244  [pdf, other

    cs.CV

    CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

    Authors: Youngmin Baek, Daehyun Nam, Sungrae Park, Junyeop Lee, Seung Shin, Jeonghun Baek, Chae Young Lee, Hwalsuk Lee

    Abstract: Despite the recent success of text detection and recognition methods, existing evaluation metrics fail to provide a fair and reliable comparison among those methods. In addition, there exists no end-to-end evaluation metric that takes characteristics of OCR tasks into account. Previous end-to-end metric contains cascaded errors from the binary scoring process applied in both detection and recognit… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 12 pages, 8 figures

  28. 3D Display Calibration by Visual Pattern Analysis

    Authors: Hyoseok Hwang, Hyun Sung Chang, Dongkyung Nam, In So Kweon

    Abstract: Nearly all 3D displays need calibration for correct rendering. More often than not, the optical elements in a 3D display are misaligned from the designed parameter setting. As a result, 3D magic does not perform well as intended. The observed images tend to get distorted. In this paper, we propose a novel display calibration method to fix the situation. In our method, a pattern image is displayed… ▽ More

    Submitted 22 June, 2016; originally announced June 2016.

    Comments: 13 pages, 10 figures.submitted to IEEE Transactions on Image Processing