Showing 1–50 of 138 results for author: Park, N

Searching in archive cs.
  1. arXiv:2506.13295  [pdf, ps, other]

    eess.AS cs.SD

    Instance-Specific Test-Time Training for Speech Editing in the Wild

    Authors: Taewoo Kim, Uijong Lee, Hayoung Park, Choongsang Cho, Nam In Park, Young Han Lee

    Abstract: Speech editing systems aim to naturally modify speech content while preserving acoustic consistency and speaker identity. However, previous studies often struggle to adapt to unseen and diverse acoustic conditions, resulting in degraded editing performance in real-world scenarios. To address this, we propose an instance-specific test-time training method for speech editing in the wild. Our approac…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Submitted to IEEE Signal Processing Letters

  2. arXiv:2506.12790  [pdf, ps, other]

    cs.LG math.NA physics.comp-ph

    PDEfuncta: Spectrally-Aware Neural Representation for PDE Solution Modeling

    Authors: Minju Jo, Woojin Cho, Uvini Balasuriya Mudiyanselage, Seungjun Lee, Noseong Park, Kookjin Lee

    Abstract: Scientific machine learning often involves representing complex solution fields that exhibit high-frequency features such as sharp transitions, fine-scale oscillations, and localized structures. While implicit neural representations (INRs) have shown promise for continuous function modeling, capturing such high-frequency behavior remains a challenge, especially when modeling multiple solution field…

    Submitted 15 June, 2025; originally announced June 2025.

  3. arXiv:2506.09526  [pdf, ps, other]

    cs.LG cs.AI

    Neural Functions for Learning Periodic Signal

    Authors: Woojin Cho, Minju Jo, Kookjin Lee, Noseong Park

    Abstract: As function approximators, deep neural networks have served as an effective tool to represent various signal types. Recent approaches utilize multi-layer perceptrons (MLPs) to learn a nonlinear mapping from a coordinate to its corresponding signal, facilitating the learning of continuous neural representations from discrete data points. Despite notable successes in learning diverse signal types, c…

    Submitted 11 June, 2025; originally announced June 2025.

  4. arXiv:2505.08516  [pdf, ps, other]

    cs.LG cs.AI

    Learning Advanced Self-Attention for Linear Transformers in the Singular Value Domain

    Authors: Hyowon Wi, Jeongwhan Choi, Noseong Park

    Abstract: Transformers have demonstrated remarkable performance across diverse domains. The key component of Transformers is self-attention, which learns the relationship between any two tokens in the input sequence. Recent studies have revealed that the self-attention can be understood as a normalized adjacency matrix of a graph. Notably, from the perspective of graph signal processing (GSP), the self-atte…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: IJCAI25 Accepted

  5. arXiv:2504.11623  [pdf, other]

    cs.LG cs.AI

    Possibility for Proactive Anomaly Detection

    Authors: Jinsung Jeon, Jaehyeon Park, Sewon Park, Jeongwhan Choi, Minjung Kim, Noseong Park

    Abstract: Time-series anomaly detection, which detects errors and failures in a workflow, is one of the most important topics in real-world applications. The purpose of time-series anomaly detection is to reduce potential damages or losses. However, existing anomaly detection models detect anomalies through the error between the model output and the ground truth (observed) value, which makes them impractica…

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: Accepted at ICLR 2025 I Can't Believe It's Not Better: Challenges in Applied Deep Learning Workshop (ICBINB)

  6. arXiv:2504.04052  [pdf, other]

    cs.LG cs.AI

    PIORF: Physics-Informed Ollivier-Ricci Flow for Long-Range Interactions in Mesh Graph Neural Networks

    Authors: Youn-Yeol Yu, Jeongwhan Choi, Jaehyeon Park, Kookjin Lee, Noseong Park

    Abstract: Recently, data-driven simulators based on graph neural networks have gained attention in modeling physical systems on unstructured meshes. However, they struggle with long-range dependencies in fluid flows, particularly in refined mesh regions. This challenge, known as the 'over-squashing' problem, hinders information propagation. While existing graph rewiring methods address this issue to some ex…

    Submitted 5 April, 2025; originally announced April 2025.

    Comments: Accepted to ICLR 2025. Youn-Yeol Yu and Jeongwhan Choi contributed equally to this work

  7. arXiv:2503.21166  [pdf, other]

    cs.LG

    Unveiling the Potential of Superexpressive Networks in Implicit Neural Representations

    Authors: Uvini Balasuriya Mudiyanselage, Woojin Cho, Minju Jo, Noseong Park, Kookjin Lee

    Abstract: In this study, we examine the potential of one of the "superexpressive" networks in the context of learning neural functions for representing complex signals and performing machine learning downstream tasks. Our focus is on evaluating their performance on computer vision and scientific machine learning tasks including signal representation/inverse problems and solutions of partial differential e…

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: Accepted at ICLR 2025 Workshop on Neural Network Weights as a New Data Modality

  8. arXiv:2502.19759  [pdf, other]

    cs.SD eess.AS

    Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models

    Authors: Heeseung Kim, Che Hyun Lee, Sangkwon Park, Jiheum Yeom, Nohil Park, Sangwon Yu, Sungroh Yoon

    Abstract: Recent advancements in multi-turn voice interaction models have improved user-model communication. However, while closed-source models effectively retain and recall past utterances, whether open-source models share this ability remains unexplored. To fill this gap, we systematically evaluate how well open-source interaction models utilize past utterances using ContextDialog, a benchmark we propose…

    Submitted 23 May, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: ACL 2025 Findings, Project Page: https://contextdialog.github.io/

  9. arXiv:2502.19629  [pdf]

    cs.AI

    Agentic Mixture-of-Workflows for Multi-Modal Chemical Search

    Authors: Tiffany J. Callahan, Nathaniel H. Park, Sara Capponi

    Abstract: The vast and complex materials design space demands innovative strategies to integrate multidisciplinary scientific knowledge and optimize materials discovery. While large language models (LLMs) have demonstrated promising reasoning and automation capabilities across various domains, their application in materials science remains limited due to a lack of benchmarking standards and practical implem…

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: PDF includes supplemental material

  10. arXiv:2502.11767  [pdf, ps, other]

    cs.LG cs.CL

    From Selection to Generation: A Survey of LLM-based Active Learning

    Authors: Yu Xia, Subhojyoti Mukherjee, Zhouhang Xie, Junda Wu, Xintong Li, Ryan Aponte, Hanjia Lyu, Joe Barrow, Hongjie Chen, Franck Dernoncourt, Branislav Kveton, Tong Yu, Ruiyi Zhang, Jiuxiang Gu, Nesreen K. Ahmed, Yu Wang, Xiang Chen, Hanieh Deilamsalehy, Sungchul Kim, Zhengmian Hu, Yue Zhao, Nedim Lipka, Seunghyun Yoon, Ting-Hao Kenneth Huang, Zichao Wang, et al. (9 additional authors not shown)

    Abstract: Active Learning (AL) has been a powerful paradigm for improving model efficiency and performance by selecting the most informative data points for labeling and training. In recent active learning frameworks, Large Language Models (LLMs) have been employed not only for selection but also for generating entirely new data instances and providing more cost-effective annotations. Motivated by the incre…

    Submitted 31 May, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: ACL 2025

  11. arXiv:2501.18824  [pdf, other]

    cs.CL cs.LG

    Memory-Efficient Fine-Tuning of Transformers via Token Selection

    Authors: Antoine Simoulin, Namyong Park, Xiaoyi Liu, Grey Yang

    Abstract: Fine-tuning provides an effective means to specialize pre-trained models for various downstream tasks. However, fine-tuning often incurs high memory overhead, especially for large transformer-based models, such as LLMs. While existing methods may reduce certain parts of the memory required for fine-tuning, they still require caching all intermediate activations computed in the forward pass to upda…

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: EMNLP 2024

  12. arXiv:2501.14249  [pdf, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  13. arXiv:2501.04304  [pdf, other]

    cs.CV cs.LG

    DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models

    Authors: Hyogon Ryu, NaHyeon Park, Hyunjung Shim

    Abstract: Despite the widespread use of text-to-image diffusion models across various tasks, their computational and memory demands limit practical applications. To mitigate this issue, quantization of diffusion models has been explored. It reduces memory usage and computational costs by compressing weights and activations into lower-bit formats. However, existing methods often struggle to preserve both ima…

    Submitted 12 February, 2025; v1 submitted 8 January, 2025; originally announced January 2025.

    Comments: Accepted ICLR 2025. Project page: https://ugonfor.kr/DGQ

  14. arXiv:2501.02157  [pdf, ps, other]

    cs.CL

    Personalized Graph-Based Retrieval for Large Language Models

    Authors: Steven Au, Cameron J. Dimacali, Ojasmitha Pedirappagari, Namyong Park, Franck Dernoncourt, Yu Wang, Nikos Kanakaris, Hanieh Deilamsalehy, Ryan A. Rossi, Nesreen K. Ahmed

    Abstract: As large language models (LLMs) evolve, their ability to deliver personalized and context-aware responses offers transformative potential for improving user experiences. Existing personalization approaches, however, often rely solely on user history to augment the prompt, limiting their effectiveness in generating tailored outputs, especially in cold-start scenarios with sparse data. To address th…

    Submitted 31 May, 2025; v1 submitted 3 January, 2025; originally announced January 2025.

  15. arXiv:2412.13501  [pdf, other]

    cs.AI cs.HC

    GUI Agents: A Survey

    Authors: Dang Nguyen, Jian Chen, Yu Wang, Gang Wu, Namyong Park, Zhengmian Hu, Hanjia Lyu, Junda Wu, Ryan Aponte, Yu Xia, Xintong Li, Jing Shi, Hongjie Chen, Viet Dac Lai, Zhouhang Xie, Sungchul Kim, Ruiyi Zhang, Tong Yu, Mehrab Tanjim, Nesreen K. Ahmed, Puneet Mathur, Seunghyun Yoon, Lina Yao, Branislav Kveton, Thien Huu Nguyen, et al. (4 additional authors not shown)

    Abstract: Graphical User Interface (GUI) agents, powered by Large Foundation Models, have emerged as a transformative approach to automating human-computer interaction. These agents autonomously interact with digital systems or software applications via GUIs, emulating human actions such as clicking, typing, and navigating visual elements across diverse platforms. Motivated by the growing interest and funda…

    Submitted 17 December, 2024; originally announced December 2024.

  16. arXiv:2412.12732  [pdf]

    cs.HC

    Using LLM-Generated Draft Replies to Support Human Experts in Responding to Stakeholder Inquiries in Maritime Industry: A Real-World Case Study of Industrial AI

    Authors: Tita Alissa Bach, Aleksandar Babic, Narae Park, Tor Sporsem, Rasmus Ulfsnes, Henrik Smith-Meyer, Torkel Skeie

    Abstract: The maritime industry requires effective communication among diverse stakeholders to address complex, safety-critical challenges. Industrial AI, including Large Language Models (LLMs), has the potential to augment human experts' workflows in this specialized domain. Our case study investigated the utility of LLMs in drafting replies to stakeholder inquiries and supporting case handlers. We conduct…

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: These authors share the first authorship: Tita Alissa Bach (1), Aleksandar Babic (1), Narae Park (1)

  17. arXiv:2412.02142  [pdf, other]

    cs.CV cs.AI cs.CL cs.IR

    Personalized Multimodal Large Language Models: A Survey

    Authors: Junda Wu, Hanjia Lyu, Yu Xia, Zhehao Zhang, Joe Barrow, Ishita Kumar, Mehrnoosh Mirtaheri, Hongjie Chen, Ryan A. Rossi, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, Jiuxiang Gu, Nesreen K. Ahmed, Yu Wang, Xiang Chen, Hanieh Deilamsalehy, Namyong Park, Sungchul Kim, Huanrui Yang, Subrata Mitra, Zhengmian Hu, Nedim Lipka, Dang Nguyen, Yue Zhao, et al. (2 additional authors not shown)

    Abstract: Multimodal Large Language Models (MLLMs) have become increasingly important due to their state-of-the-art performance and ability to integrate multiple data modalities, such as text, images, and audio, to perform complex tasks with high accuracy. This paper presents a comprehensive survey on personalized multimodal large language models, focusing on their architecture, training methods, and applic…

    Submitted 2 December, 2024; originally announced December 2024.

  18. arXiv:2411.16079  [pdf, other]

    cs.CV cs.AI

    Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

    Authors: Donggeun Ko, Dongjun Lee, Namjun Park, Wonkyeong Shim, Jaekwang Kim

    Abstract: Neural networks struggle with image classification when biases are learned and misleads correlations, affecting their generalization and performance. Previous methods require attribute labels (e.g. background, color) or utilize Generative Adversarial Networks (GANs) to mitigate biases. We introduce DiffuBias, a novel pipeline for text-to-image generation that enhances classifier robustness by gen…

    Submitted 24 November, 2024; originally announced November 2024.

    Comments: 8 pages + Appendix

  19. arXiv:2411.00369  [pdf, other]

    cs.CL

    GRS-QA -- Graph Reasoning-Structured Question Answering Dataset

    Authors: Anish Pahilajani, Devasha Trivedi, Jincen Shuai, Khin S. Yone, Samyak Rajesh Jain, Namyong Park, Ryan A. Rossi, Nesreen K. Ahmed, Franck Dernoncourt, Yu Wang

    Abstract: Large Language Models (LLMs) have excelled in multi-hop question-answering (M-QA) due to their advanced reasoning abilities. However, the impact of the inherent reasoning structures on LLM M-QA performance remains unclear, largely due to the absence of QA datasets that provide fine-grained reasoning structures. To address this gap, we introduce the Graph Reasoning-Structured Question Answering Dat…

    Submitted 7 November, 2024; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: 15 pages, 24 figures, 10 tables

  20. arXiv:2410.20011  [pdf, other]

    cs.CL

    A Survey of Small Language Models

    Authors: Chien Van Nguyen, Xuan Shen, Ryan Aponte, Yu Xia, Samyadeep Basu, Zhengmian Hu, Jian Chen, Mihir Parmar, Sasidhar Kunapuli, Joe Barrow, Junda Wu, Ashish Singh, Yu Wang, Jiuxiang Gu, Franck Dernoncourt, Nesreen K. Ahmed, Nedim Lipka, Ruiyi Zhang, Xiang Chen, Tong Yu, Sungchul Kim, Hanieh Deilamsalehy, Namyong Park, Mike Rimer, Zhehao Zhang, et al. (3 additional authors not shown)

    Abstract: Small Language Models (SLMs) have become increasingly important due to their efficiency and performance to perform various language tasks with minimal computational resources, making them ideal for various settings including on-device, mobile, edge devices, among many others. In this article, we present a comprehensive survey on SLMs, focusing on their architectures, training techniques, and model…

    Submitted 25 October, 2024; originally announced October 2024.

  21. arXiv:2410.18955  [pdf, other]

    cs.CL

    BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning

    Authors: Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Namu Park, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Large language models (LLMs) such as ChatGPT are fine-tuned on large and diverse instruction-following corpora, and can generalize to new tasks. However, those instruction-tuned LLMs often perform poorly in specialized medical natural language understanding (NLU) tasks that require domain knowledge, granular text comprehension, and structured data extraction. To bridge the gap, we: (1) propose a u…

    Submitted 9 March, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: 3 figures and 5 tables; Accepted by AMIA 2025 Informatics Summit

  22. arXiv:2410.06442  [pdf, other]

    cs.LG cs.AI

    MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data

    Authors: Mingu Kang, Dongseok Lee, Woojin Cho, Jaehyeon Park, Kookjin Lee, Anthony Gruber, Youngjoon Hong, Noseong Park

    Abstract: Large language models (LLMs), like ChatGPT, have shown that even when trained with noisy prior data, they can generalize effectively to new tasks through in-context learning (ICL) and pre-training techniques. Motivated by this, we explore whether a similar approach can be applied to scientific foundation models (SFMs). Our methodology is structured as follows: (i) we collect low-cost physics-informed n…

    Submitted 8 October, 2024; originally announced October 2024.

  23. arXiv:2410.04001  [pdf, other]

    cs.LG cs.AI math.NA

    FastLRNR and Sparse Physics Informed Backpropagation

    Authors: Woojin Cho, Kookjin Lee, Noseong Park, Donsub Rim, Gerrit Welper

    Abstract: We introduce Sparse Physics Informed Backpropagation (SPInProp), a new class of methods for accelerating backpropagation for a specialized neural network architecture called Low Rank Neural Representation (LRNR). The approach exploits the low rank structure within LRNR and constructs a reduced neural network approximation that is much smaller in size. We call the smaller network FastLRNR. We show…

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 10 pages, 3 figures

    MSC Class: 68T07; 65D25; 65M22

  24. arXiv:2409.15760  [pdf, other]

    cs.SD eess.AS

    NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers

    Authors: Nohil Park, Heeseung Kim, Che Hyun Lee, Jooyoung Choi, Jiheum Yeom, Sungroh Yoon

    Abstract: We present NanoVoice, a personalized text-to-speech model that efficiently constructs voice adapters for multiple speakers simultaneously. NanoVoice introduces a batch-wise speaker adaptation technique capable of fine-tuning multiple references in parallel, significantly reducing training time. Beyond building separate adapters for each speaker, we also propose a parameter sharing technique that r…

    Submitted 20 December, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025, Demo Page: https://nanovoice.github.io/

  25. arXiv:2409.15759  [pdf, other]

    cs.SD eess.AS

    VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance

    Authors: Jiheum Yeom, Heeseung Kim, Jooyoung Choi, Che Hyun Lee, Nohil Park, Sungroh Yoon

    Abstract: When applying parameter-efficient finetuning via LoRA onto speaker adaptive text-to-speech models, adaptation performance may decline compared to full-finetuned counterparts, especially for out-of-domain speakers. Here, we propose VoiceGuider, a parameter-efficient speaker adaptive text-to-speech system reinforced with autoguidance to enhance the speaker adaptation performance, reducing the gap ag…

    Submitted 20 December, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025, Demo Page: https://voiceguider.github.io/

  26. arXiv:2409.12539  [pdf]

    cs.CV

    Improving Cone-Beam CT Image Quality with Knowledge Distillation-Enhanced Diffusion Model in Imbalanced Data Settings

    Authors: Joonil Hwang, Sangjoon Park, NaHyeon Park, Seungryong Cho, Jin Sung Kim

    Abstract: In radiation therapy (RT), the reliance on pre-treatment computed tomography (CT) images encounters challenges due to anatomical changes, necessitating adaptive planning. Daily cone-beam CT (CBCT) imaging, pivotal for therapy adjustment, falls short in tissue density accuracy. To address this, our innovative approach integrates diffusion models for CT image generation, offering precise control over…

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: MICCAI 2024

  27. arXiv:2409.08732  [pdf, other]

    cs.LG cs.AI

    Bridging Dynamic Factor Models and Neural Controlled Differential Equations for Nowcasting GDP

    Authors: Seonkyu Lim, Jeongwhan Choi, Noseong Park, Sang-Ha Yoon, ShinHyuck Kang, Young-Min Kim, Hyunjoong Kang

    Abstract: Gross domestic product (GDP) nowcasting is crucial for policy-making as GDP growth is a key indicator of economic conditions. Dynamic factor models (DFMs) have been widely adopted by government agencies for GDP nowcasting due to their ability to handle irregular or missing macroeconomic indicators and their interpretability. However, DFMs face two main challenges: i) the lack of capturing economic…

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: Accepted at CIKM 2024. Seonkyu Lim and Jeongwhan Choi are co-first authors with equal contributions

  28. arXiv:2409.08248  [pdf, other]

    cs.CV

    TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder

    Authors: NaHyeon Park, Kunhee Kim, Hyunjung Shim

    Abstract: Recent breakthroughs in text-to-image models have opened up promising research avenues in personalized image generation, enabling users to create diverse images of a specific subject using natural language prompts. However, existing methods often suffer from performance degradation when given only a single reference image. They tend to overfit the input, producing highly similar outputs regardless…

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: Project page: https://textboost.github.io

  29. arXiv:2408.11793  [pdf]

    cs.AI

    Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design

    Authors: Nathaniel H. Park, Tiffany J. Callahan, James L. Hedrick, Tim Erdmann, Sara Capponi

    Abstract: Molecular property prediction and generative design via deep learning models has been the subject of intense research given its potential to accelerate development of new, high-performance materials. More recently, these workflows have been significantly augmented with the advent of large language models (LLMs) and systems of autonomous agents capable of utilizing pre-trained models to make predic…

    Submitted 12 December, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

  30. arXiv:2408.09446  [pdf, other]

    cs.LG math.NA physics.comp-ph

    Parameterized Physics-informed Neural Networks for Parameterized PDEs

    Authors: Woojin Cho, Minju Jo, Haksoo Lim, Kookjin Lee, Dongeun Lee, Sanghyun Hong, Noseong Park

    Abstract: Complex physical systems are often described by partial differential equations (PDEs) that depend on parameters such as the Reynolds number in fluid mechanics. In applications such as design optimization or uncertainty quantification, solutions of those PDEs need to be evaluated at numerous points in the parameter space. While physics-informed neural networks (PINNs) have emerged as a new strong c…

    Submitted 18 August, 2024; originally announced August 2024.

  31. MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity

    Authors: Kanghyun Choi, Hye Yoon Lee, Dain Kwon, SunJong Park, Kyuyeun Kim, Noseong Park, Jonghyun Choi, Jinho Lee

    Abstract: Data-free quantization (DFQ) is a technique that creates a lightweight network from its full-precision counterpart without the original training data, often through a synthetic dataset. Although several DFQ methods have been proposed for vision transformer (ViT) architectures, they fail to achieve efficacy in low-bit settings. Examining the existing methods, we observe that their synthetic data pr…

    Submitted 14 April, 2025; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: Published to AAAI 2025

  32. arXiv:2407.12374  [pdf, other]

    cs.IR cs.AI

    Graph Signal Processing for Cross-Domain Recommendation

    Authors: Jeongeun Lee, Seongku Kang, Won-Yong Shin, Jeongwhan Choi, Noseong Park, Dongha Lee

    Abstract: Cross-domain recommendation (CDR) extends conventional recommender systems by leveraging user-item interactions from dense domains to mitigate data sparsity and the cold start problem. While CDR offers substantial potential for enhancing recommendation performance, most existing CDR methods suffer from sensitivity to the ratio of overlapping users and intrinsic discrepancy between source and targe…

    Submitted 22 July, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  33. Addressing Prediction Delays in Time Series Forecasting: A Continuous GRU Approach with Derivative Regularization

    Authors: Sheo Yon Jhin, Seojin Kim, Noseong Park

    Abstract: Time series forecasting has been an essential field in many different application areas, including economic analysis, meteorology, and so forth. The majority of time series forecasting models are trained using the mean squared error (MSE). However, this training based on MSE causes a limitation known as prediction delay. The prediction delay, which implies the ground-truth precedes the prediction,…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: KDD 2024 accepted paper

  34. arXiv:2406.15635  [pdf, other]

    cs.LG cs.CR cs.CV

    DataFreeShield: Defending Adversarial Attacks without Training Data

    Authors: Hyeyoon Lee, Kanghyun Choi, Dain Kwon, Sunjong Park, Mayoore Selvarasa Jaiswal, Noseong Park, Jonghyun Choi, Jinho Lee

    Abstract: Recent advances in adversarial robustness rely on an abundant set of training data, where using external or additional datasets has become a common setting. However, in real life, the training data is often kept private for security and privacy issues, while only the pretrained weight is available to the public. In such scenarios, existing methods that assume accessibility to the original data bec…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  35. arXiv:2406.06134  [pdf, other]

    cs.CV cs.AI cs.LG

    DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection

    Authors: Donggeun Ko, Sangwoo Jo, Dongjun Lee, Namjun Park, Jaekwang Kim

    Abstract: Dataset bias is a significant challenge in machine learning, where specific attributes, such as texture or color of the images are unintentionally learned resulting in detrimental performance. To address this, previous efforts have focused on debiasing models either by developing novel debiasing algorithms or by generating synthetic data to mitigate the prevalent dataset biases. However, generativ…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 10 pages (including supplementary), 3 figures, SynData4CV@CVPR 24 (Workshop)

  36. arXiv:2406.05109  [pdf, other]

    cs.LG

    Large Generative Graph Models

    Authors: Yu Wang, Ryan A. Rossi, Namyong Park, Huiyuan Chen, Nesreen K. Ahmed, Puja Trivedi, Franck Dernoncourt, Danai Koutra, Tyler Derr

    Abstract: Large Generative Models (LGMs) such as GPT, Stable Diffusion, Sora, and Suno are trained on a huge amount of language corpus, images, videos, and audio that are extremely diverse from numerous domains. This training paradigm over diverse well-curated data lies at the heart of generating creative and sensible content. However, all previous graph generative models (e.g., GraphRNN, MDVAE, MoFlow, GDS…

    Submitted 7 June, 2024; originally announced June 2024.

  37. arXiv:2406.03671  [pdf, other]

    cs.LG cs.AI

    PANDA: Expanded Width-Aware Message Passing Beyond Rewiring

    Authors: Jeongwhan Choi, Sumin Park, Hyowon Wi, Sung-Bae Cho, Noseong Park

    Abstract: Recent research in the field of graph neural network (GNN) has identified a critical issue known as "over-squashing," resulting from the bottleneck phenomenon in graph structures, which impedes the propagation of long-range information. Prior works have proposed a variety of graph rewiring concepts that aim at optimizing the spatial or spectral properties of graphs to promote the signal propagatio…

    Submitted 19 July, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024

  38. arXiv:2405.16305  [pdf, other]

    cs.LG

    Efficiently Parameterized Neural Metriplectic Systems

    Authors: Anthony Gruber, Kookjin Lee, Haksoo Lim, Noseong Park, Nathaniel Trask

    Abstract: Metriplectic systems are learned from data in a way that scales quadratically in both the size of the state and the rank of the metriplectic data. Besides being provably energy conserving and entropy stable, the proposed approach comes with approximation results demonstrating its ability to accurately learn metriplectic dynamics from data as well as an error estimate indicating its potential for g…

    Submitted 26 January, 2025; v1 submitted 25 May, 2024; originally announced May 2024.

  39. arXiv:2405.04746  [pdf, other]

    cs.IR cs.AI cs.LG

    SVD-AE: Simple Autoencoders for Collaborative Filtering

    Authors: Seoyoung Hong, Jeongwhan Choi, Yeon-Chang Lee, Srijan Kumar, Noseong Park

    Abstract: Collaborative filtering (CF) methods for recommendation systems have been extensively researched, ranging from matrix factorization and autoencoder-based to graph filtering-based methods. Recently, lightweight methods that require almost no training have been proposed to reduce overall computation. However, existing methods still have room to improve the trade-offs among accuracy, efficie…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024

  40. arXiv:2405.00287  [pdf, other]

    cs.IR cs.AI cs.LG

    SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation

    Authors: Chaejeong Lee, Jeongwhan Choi, Hyowon Wi, Sung-Bae Cho, Noseong Park

    Abstract: Graph-based collaborative filtering (CF) has emerged as a promising approach in recommender systems. Despite its achievements, graph-based CF models face challenges due to data sparsity and negative sampling. In this paper, we propose a novel Stochastic sampling for i) COntrastive views and ii) hard NEgative samples (SCONE) to overcome these issues. SCONE generates dynamic augmented views and dive…

    Submitted 19 December, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

    Comments: Accepted to WSDM 2025. Chaejeong Lee and Jeongwhan Choi are co-first authors with equal contributions

  41. arXiv:2404.02072  [pdf, other]

    cs.CV cs.LG

    EGTR: Extracting Graph from Transformer for Scene Graph Generation

    Authors: Jinbae Im, JeongYeon Nam, Nokyung Park, Hyungmin Lee, Seunghyun Park

    Abstract: Scene Graph Generation (SGG) is a challenging task of detecting objects and predicting relationships between objects. After DETR was developed, one-stage SGG models based on a one-stage object detector have been actively studied. However, complex modeling is used to predict the relationship between objects, and the inherent relationship between object queries learned in the multi-head self-attenti…

    Submitted 24 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 (Best paper award candidate)

  42. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  43. arXiv:2404.01578  [pdf, other]

    cs.LG cs.SI

    GLEMOS: Benchmark for Instantaneous Graph Learning Model Selection

    Authors: Namyong Park, Ryan Rossi, Xing Wang, Antoine Simoulin, Nesreen Ahmed, Christos Faloutsos

    Abstract: The choice of a graph learning (GL) model (i.e., a GL algorithm and its hyperparameter settings) has a significant impact on the performance of downstream tasks. However, selecting the right GL model becomes increasingly difficult and time consuming as more and more GL models are developed. Accordingly, it is of great significance and practical value to equip users of GL with the ability to perfor…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: NeurIPS 2023

  44. arXiv:2404.00826  [pdf, other]

    cs.CL

    Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods

    Authors: Yujuan Fu, Giridhar Kaushik Ramachandran, Nicholas J Dobbins, Namu Park, Michael Leu, Abby R. Rosenberg, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Social determinants of health (SDoH) play a critical role in shaping health outcomes, particularly in pediatric populations where interventions can have long-term implications. SDoH are frequently studied in the Electronic Health Record (EHR), which provides a rich repository for diverse patient data. In this work, we present a novel annotated corpus, the Pediatric Social History Annotation Corpus…

    Submitted 4 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 12 pages, 2 figures and 3 tables. Accepted by LREC-COLING 2024

  45. arXiv:2403.18975  [pdf, other]

    cs.CL

    A Novel Corpus of Annotated Medical Imaging Reports and Information Extraction Results Using BERT-based Language Models

    Authors: Namu Park, Kevin Lybarger, Giridhar Kaushik Ramachandran, Spencer Lewis, Aashka Damani, Ozlem Uzuner, Martin Gunn, Meliha Yetisgen

    Abstract: Medical imaging is critical to the diagnosis, surveillance, and treatment of many health conditions, including oncological, neurological, cardiovascular, and musculoskeletal disorders, among others. Radiologists interpret these complex, unstructured images and articulate their assessments through narrative reports that remain largely unstructured. This unstructured narrative must be converted into…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  46. arXiv:2403.11004  [pdf, other]

    cs.LG cs.SI

    Forward Learning of Graph Neural Networks

    Authors: Namyong Park, Xing Wang, Antoine Simoulin, Shuai Yang, Grey Yang, Ryan Rossi, Puja Trivedi, Nesreen Ahmed

    Abstract: Graph neural networks (GNNs) have achieved remarkable success across a wide range of applications, such as recommendation, drug discovery, and question answering. Behind the success of GNNs lies the backpropagation (BP) algorithm, which is the de facto standard for training deep neural networks (NNs). However, despite its effectiveness, BP imposes several constraints, which are not only biological…

    Submitted 12 April, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  47. arXiv:2403.08244  [pdf]

    cs.CE

    Evaluating the Efficiency and Cost-effectiveness of RPB-based CO2 Capture: A Comprehensive Approach to Simultaneous Design and Operating Condition Optimization

    Authors: Howoun Jung, Nohjin Park, Jay H. Lee

    Abstract: Despite ongoing global initiatives to reduce CO2 emissions, implementing large-scale CO2 capture using amine solvents is fraught with economic uncertainties and technical hurdles. The Rotating Packed Bed (RPB) presents a promising alternative to traditional packed towers, offering compact design and adaptability. Nonetheless, scaling RPB processes to an industrial level is challenging due to the n…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 44 pages, 11 figures, 6 tables

  48. arXiv:2402.15162  [pdf, other]

    cs.CL cs.AI cs.LG

    Entity-level Factual Adaptiveness of Fine-tuning based Abstractive Summarization Models

    Authors: Jongyoon Song, Nohil Park, Bongkyu Hwang, Jaewoong Yun, Seongho Joe, Youngjune L. Gwon, Sungroh Yoon

    Abstract: Abstractive summarization models often generate factually inconsistent content particularly when the parametric knowledge of the model conflicts with the knowledge in the input document. In this paper, we analyze the robustness of fine-tuning based summarization models to the knowledge conflict, which we call factual adaptiveness. We utilize pre-trained language models to construct evaluation sets…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  49. arXiv:2402.12721  [pdf, other]

    cs.CV cs.AI

    PAC-FNO: Parallel-Structured All-Component Fourier Neural Operators for Recognizing Low-Quality Images

    Authors: Jinsung Jeon, Hyundong Jin, Jonghyun Choi, Sanghyun Hong, Dongeun Lee, Kookjin Lee, Noseong Park

    Abstract: A standard practice in developing image recognition models is to train a model on a specific image resolution and then deploy it. However, in real-world inference, models often encounter images different from the training sets in resolution and/or subject to natural variations such as weather changes, noise types and compression artifacts. While traditional solutions involve training multiple mode…

    Submitted 14 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024

  50. arXiv:2312.16581  [pdf, other]

    cs.LG cs.IR

    Continuous-time Autoencoders for Regular and Irregular Time Series Imputation

    Authors: Hyowon Wi, Yehjin Shin, Noseong Park

    Abstract: Time series imputation is one of the most fundamental tasks for time series. Real-world time series datasets are frequently incomplete (or irregular with missing observations), in which case imputation is strongly required. Many different time series imputation methods have been proposed. Recent self-attention-based methods show the state-of-the-art imputation performance. However, it has been ove…

    Submitted 24 June, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: Published as a WSDM'24 full paper (oral presentation)