Skip to main content

Showing 1–50 of 140 results for author: Chung, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.09486  [pdf, other

    cs.LG cs.AI

    Preserving Plasticity in Continual Learning with Adaptive Linearity Injection

    Authors: Seyed Roozbeh Razavi Rohani, Khashayar Khajavi, Wesley Chung, Mo Chen, Sharan Vaswani

    Abstract: Loss of plasticity in deep neural networks is the gradual reduction in a model's capacity to incrementally learn and has been identified as a key obstacle to learning in non-stationary problem settings. Recent work has shown that deep linear networks tend to be resilient towards loss of plasticity. Motivated by this observation, we propose Adaptive Linearization (AdaLin), a general approach that d… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: Accepted in 4th Conference on Lifelong Learning Agents (CoLLAs), 2025

  2. arXiv:2505.08388  [pdf, ps, other

    cs.RO

    MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM

    Authors: Saqi Hussain Kalan, Boon Giin Lee, Wan-Young Chung

    Abstract: Indoor localization faces persistent challenges in achieving high accuracy, particularly in GPS-deprived environments. This study unveils a cutting-edge handheld indoor localization system that integrates 2D LiDAR and IMU sensors, delivering enhanced high-velocity precision mapping, computational efficiency, and real-time adaptability. Unlike 3D LiDAR systems, it excels with rapid processing, low-… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  3. arXiv:2505.07271  [pdf, ps, other

    cs.CL cs.AI cs.LG

    On the Robustness of Reward Models for Language Model Alignment

    Authors: Jiwoo Hong, Noah Lee, Eunki Kim, Guijin Son, Woojin Chung, Aman Gupta, Shao Tang, James Thorne

    Abstract: The Bradley-Terry (BT) model is widely practiced in reward modeling for reinforcement learning with human feedback (RLHF). Despite its effectiveness, reward models (RMs) trained with BT model loss are prone to over-optimization, losing generalizability to unseen input distributions. In this paper, we study the cause of over-optimization in RM training and its downstream effects on the RLHF procedu… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: ICML 2025

  4. arXiv:2505.03164  [pdf, other

    cs.HC

    InfoVids: Reimagining the Viewer Experience with Alternative Visualization-Presenter Relationships

    Authors: Ji Won Chung, Tongyu Zhou, Ivy Chen, Kevin Hsu, Ryan A. Rossi, Alexa Siu, Shunan Guo, Franck Dernoncourt, James Tompkin, Jeff Huang

    Abstract: Traditional data presentations typically separate the presenter and visualization into two separate spaces--the 3D world and a 2D screen--enforcing visualization-centric stories. To create a more human-centric viewing experience, we establish a more equitable relationship between the visualization and the presenter through our InfoVids. These infographics-inspired informational videos are crafted… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  5. arXiv:2504.14915  [pdf, other

    eess.AS cs.AI

    StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models

    Authors: Yeona Hong, Hyewon Han, Woo-jin Chung, Hong-Goo Kang

    Abstract: In this paper, we propose StableQuant, a novel adaptive post-training quantization (PTQ) algorithm for widely used speech foundation models (SFMs). While PTQ has been successfully employed for compressing large language models (LLMs) due to its ability to bypass additional fine-tuning, directly applying these techniques to SFMs may not yield optimal results, as SFMs utilize distinct network archit… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Accepted at ICASSP 2025

  6. arXiv:2504.12516  [pdf, ps, other

    cs.CL

    BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents

    Authors: Jason Wei, Zhiqing Sun, Spencer Papay, Scott McKinney, Jeffrey Han, Isa Fulford, Hyung Won Chung, Alex Tachard Passos, William Fedus, Amelia Glaese

    Abstract: We present BrowseComp, a simple yet challenging benchmark for measuring the ability for agents to browse the web. BrowseComp comprises 1,266 questions that require persistently navigating the internet in search of hard-to-find, entangled information. Despite the difficulty of the questions, BrowseComp is simple and easy-to-use, as predicted answers are short and easily verifiable against reference… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

  7. arXiv:2502.10822  [pdf, other

    eess.AS cs.AI cs.SD

    NeuroAMP: A Novel End-to-end General Purpose Deep Neural Amplifier for Personalized Hearing Aids

    Authors: Shafique Ahmed, Ryandhimas E. Zezario, Hui-Guan Yuan, Amir Hussain, Hsin-Min Wang, Wei-Ho Chung, Yu Tsao

    Abstract: The prevalence of hearing aids is increasing. However, optimizing the amplification processes of hearing aids remains challenging due to the complexity of integrating multiple modular components in traditional methods. To address this challenge, we present NeuroAMP, a novel deep neural network designed for end-to-end, personalized amplification in hearing aids. NeuroAMP leverages both spectral fea… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

  8. arXiv:2502.01634  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data

    Authors: Huawei Lin, Jun Woo Chung, Yingjie Lao, Weijie Zhao

    Abstract: Gradient Boosting Decision Tree (GBDT) is one of the most popular machine learning models in various applications. However, in the traditional settings, all data should be simultaneously accessed in the training procedure: it does not allow to add or delete any data instances after training. In this paper, we propose an efficient online learning framework for GBDT supporting both incremental and d… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 25 pages, 11 figures, 16 tables. Keywords: Decremental Learning, Incremental Learning, Machine Unlearning, Online Learning, Gradient Boosting Decision Trees, GBDTs

  9. arXiv:2501.02851  [pdf, other

    cs.SI cs.IT stat.ML

    Exact Matching in Correlated Networks with Node Attributes for Improved Community Recovery

    Authors: Joonhyuk Yang, Hye Won Chung

    Abstract: We study community detection in multiple networks whose nodes and edges are jointly correlated. This setting arises naturally in applications such as social platforms, where a shared set of users may exhibit both correlated friendship patterns and correlated attributes across different platforms. Extending the classical Stochastic Block Model (SBM) and its contextual counterpart (CSBM), we introdu… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: 30 pages, 3 figures

  10. arXiv:2412.16720  [pdf, other

    cs.AI

    OpenAI o1 System Card

    Authors: OpenAI, :, Aaron Jaech, Adam Kalai, Adam Lerer, Adam Richardson, Ahmed El-Kishky, Aiden Low, Alec Helyar, Aleksander Madry, Alex Beutel, Alex Carney, Alex Iftimie, Alex Karpenko, Alex Tachard Passos, Alexander Neitz, Alexander Prokofiev, Alexander Wei, Allison Tam, Ally Bennett, Ananya Kumar, Andre Saraiva, Andrea Vallone, Andrew Duberstein, Andrew Kondrich , et al. (238 additional authors not shown)

    Abstract: The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. These advanced reasoning capabilities provide new avenues for improving the safety and robustness of our models. In particular, our models can reason about our safety policies in context when responding to potentially unsafe prompts, through deliberative alignment. This leads to state-of-the-ar… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  11. arXiv:2412.16339  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Deliberative Alignment: Reasoning Enables Safer Language Models

    Authors: Melody Y. Guan, Manas Joglekar, Eric Wallace, Saachi Jain, Boaz Barak, Alec Helyar, Rachel Dias, Andrea Vallone, Hongyu Ren, Jason Wei, Hyung Won Chung, Sam Toyer, Johannes Heidecke, Alex Beutel, Amelia Glaese

    Abstract: As large-scale language models increasingly impact safety-critical domains, ensuring their reliable adherence to well-defined principles remains a fundamental challenge. We introduce Deliberative Alignment, a new paradigm that directly teaches the model safety specifications and trains it to explicitly recall and accurately reason over the specifications before answering. We used this approach to… ▽ More

    Submitted 8 January, 2025; v1 submitted 20 December, 2024; originally announced December 2024.

    Comments: 24 pages

  12. arXiv:2412.15299  [pdf, other

    cs.CL cs.SD eess.AS

    LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration

    Authors: Sangmin Lee, Woo-Jin Chung, Hong-Goo Kang

    Abstract: Building a universal multilingual automatic speech recognition (ASR) model that performs equitably across languages has long been a challenge due to its inherent difficulties. To address this task we introduce a Language-Agnostic Multilingual ASR pipeline through orthography Unification and language-specific Transliteration (LAMA-UT). LAMA-UT operates without any language-specific modules while ma… ▽ More

    Submitted 22 December, 2024; v1 submitted 19 December, 2024; originally announced December 2024.

  13. arXiv:2412.10945  [pdf, other

    cs.LG physics.ao-ph

    A Staged Deep Learning Approach to Spatial Refinement in 3D Temporal Atmospheric Transport

    Authors: M. Giselle Fernández-Godino, Wai Tong Chung, Akshay A. Gowardhan, Matthias Ihme, Qingkai Kong, Donald D. Lucas, Stephen C. Myers

    Abstract: High-resolution spatiotemporal simulations effectively capture the complexities of atmospheric plume dispersion in complex terrain. However, their high computational cost makes them impractical for applications requiring rapid responses or iterative processes, such as optimization, uncertainty quantification, or inverse modeling. To address this challenge, this work introduces the Dual-Stage Tempo… ▽ More

    Submitted 18 December, 2024; v1 submitted 14 December, 2024; originally announced December 2024.

    Comments: 12 pages, 10 figures

    Report number: LLNL-JRNL-2001564 MSC Class: 68T07; 86A10; 93A30; 62M20 ACM Class: I.2.6; I.6.5; J.2

  14. arXiv:2412.07224  [pdf, other

    cs.LG cs.AI

    Parseval Regularization for Continual Reinforcement Learning

    Authors: Wesley Chung, Lynn Cherif, David Meger, Doina Precup

    Abstract: Loss of plasticity, trainability loss, and primacy bias have been identified as issues arising when training deep neural networks on sequences of tasks -- all referring to the increased difficulty in training on new tasks. We propose to use Parseval regularization, which maintains orthogonality of weight matrices, to preserve useful optimization properties and improve training in a continual reinf… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

  15. arXiv:2411.15204  [pdf, other

    cs.LG cs.AI cs.CV

    Label Distribution Shift-Aware Prediction Refinement for Test-Time Adaptation

    Authors: Minguk Jang, Hye Won Chung

    Abstract: Test-time adaptation (TTA) is an effective approach to mitigate performance degradation of trained models when encountering input distribution shifts at test time. However, existing TTA methods often suffer significant performance drops when facing additional class distribution shifts. We first analyze TTA methods under label distribution shifts and identify the presence of class-wise confusion pa… ▽ More

    Submitted 4 February, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

  16. arXiv:2411.09838  [pdf, other

    eess.IV cs.CV

    OneNet: A Channel-Wise 1D Convolutional U-Net

    Authors: Sanghyun Byun, Kayvan Shah, Ayushi Gang, Christopher Apton, Jacob Song, Woo Seong Chung

    Abstract: Many state-of-the-art computer vision architectures leverage U-Net for its adaptability and efficient feature extraction. However, the multi-resolution convolutional design often leads to significant computational demands, limiting deployment on edge devices. We present a streamlined alternative: a 1D convolutional encoder that retains accuracy while enhancing its suitability for edge applications… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

  17. arXiv:2411.09072  [pdf, other

    cs.LG

    Continuous GNN-based Anomaly Detection on Edge using Efficient Adaptive Knowledge Graph Learning

    Authors: Sanggeon Yun, Ryozo Masukawa, William Youngwoo Chung, Minhyoung Na, Nathaniel Bastian, Mohsen Imani

    Abstract: The increasing demand for robust security solutions across various industries has made Video Anomaly Detection (VAD) a critical task in applications such as intelligent surveillance, evidence investigation, and violence detection. Traditional approaches to VAD often rely on finetuning large pre-trained models, which can be computationally expensive and impractical for real-time or resource-constra… ▽ More

    Submitted 13 January, 2025; v1 submitted 13 November, 2024; originally announced November 2024.

    Comments: Accepted to DATE 2025

  18. arXiv:2411.04368  [pdf, other

    cs.CL

    Measuring short-form factuality in large language models

    Authors: Jason Wei, Nguyen Karina, Hyung Won Chung, Yunxin Joy Jiao, Spencer Papay, Amelia Glaese, John Schulman, William Fedus

    Abstract: We present SimpleQA, a benchmark that evaluates the ability of language models to answer short, fact-seeking questions. We prioritized two properties in designing this eval. First, SimpleQA is challenging, as it is adversarially collected against GPT-4 responses. Second, responses are easy to grade, because questions are created such that there exists only a single, indisputable answer. Each answe… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: Blog post: https://openai.com/index/introducing-simpleqa/

  19. arXiv:2411.01048  [pdf, other

    cs.CV

    MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes

    Authors: Sanghyun Byun, Jacob Song, Woo Seong Chung

    Abstract: Monocular metric depth estimation (MMDE) is a crucial task to solve for indoor scene reconstruction on edge devices. Despite this importance, existing models are sensitive to factors such as boundary frequency of objects in the scene and scene complexity, failing to fully capture many indoor scenes. In this work, we propose to close this gap through the task of monocular metric depth refinement (M… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  20. arXiv:2410.21276  [pdf, other

    cs.CL cs.AI cs.CV cs.CY cs.LG cs.SD eess.AS

    GPT-4o System Card

    Authors: OpenAI, :, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander Mądry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis , et al. (395 additional authors not shown)

    Abstract: GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 mil… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  21. arXiv:2410.13952  [pdf, other

    cs.CV

    Satellite Streaming Video QoE Prediction: A Real-World Subjective Database and Network-Level Prediction Models

    Authors: Bowen Chen, Zaixi Shang, Jae Won Chung, David Lerner, Werner Robitza, Rakesh Rao Ramachandra Rao, Alexander Raake, Alan C. Bovik

    Abstract: Demand for streaming services, including satellite, continues to exhibit unprecedented growth. Internet Service Providers find themselves at the crossroads of technological advancements and rising customer expectations. To stay relevant and competitive, these ISPs must ensure their networks deliver optimal video streaming quality, a key determinant of user satisfaction. Towards this end, it is imp… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  22. arXiv:2410.05572  [pdf, other

    cs.LG cs.AI math.DS

    Improved deep learning of chaotic dynamical systems with multistep penalty losses

    Authors: Dibyajyoti Chakraborty, Seung Whan Chung, Ashesh Chattopadhyay, Romit Maulik

    Abstract: Predicting the long-term behavior of chaotic systems remains a formidable challenge due to their extreme sensitivity to initial conditions and the inherent limitations of traditional data-driven modeling approaches. This paper introduces a novel framework that addresses these challenges by leveraging the recently proposed multi-step penalty (MP) optimization technique. Our approach extends the app… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 7 pages, 5 Figures, Submitted to CASML2024

  23. arXiv:2409.10587  [pdf, other

    cs.CV

    SoccerNet 2024 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Victor Joos, Floriane Magera, Jan Held, Seyed Abolfazl Ghasemzadeh, Xin Zhou, Karolina Seweryn, Mateusz Kowalczyk, Zuzanna Mróz, Szymon Łukasik, Michał Hałoń, Hassan Mkhallati, Adrien Deliège, Carlos Hinojosa, Karen Sanchez, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Adam Gorski , et al. (59 additional authors not shown)

    Abstract: The SoccerNet 2024 challenges represent the fourth annual video understanding challenges organized by the SoccerNet team. These challenges aim to advance research across multiple themes in football, including broadcast video understanding, field understanding, and player understanding. This year, the challenges encompass four vision-based tasks. (1) Ball Action Spotting, focusing on precisely loca… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 7 pages, 1 figure

  24. arXiv:2409.07787  [pdf, other

    cs.CL

    Stable Language Model Pre-training by Reducing Embedding Variability

    Authors: Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun

    Abstract: Stable pre-training is essential for achieving better-performing language models. However, tracking pre-training stability by calculating gradient variance at every step is impractical due to the significant computational costs. We explore Token Embedding Variability (TEV) as a simple and efficient proxy for assessing pre-training stability in language models with pre-layer normalization, given th… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  25. arXiv:2408.10676  [pdf, other

    cs.LG

    Representation Norm Amplification for Out-of-Distribution Detection in Long-Tail Learning

    Authors: Dong Geun Shin, Hye Won Chung

    Abstract: Detecting out-of-distribution (OOD) samples is a critical task for reliable machine learning. However, it becomes particularly challenging when the models are trained on long-tailed datasets, as the models often struggle to distinguish tail-class in-distribution samples from OOD samples. We examine the main challenges in this problem by identifying the trade-offs between OOD detection and in-distr… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 30 pages, 8 figures, 17 tables

  26. arXiv:2407.12604  [pdf, ps, other

    cs.IT cs.DS cs.SI

    Exact Graph Matching in Correlated Gaussian-Attributed Erdős-Rényi Model

    Authors: Joonhyuk Yang, Hye Won Chung

    Abstract: Graph matching problem aims to identify node correspondence between two or more correlated graphs. Previous studies have primarily focused on models where only edge information is provided. However, in many social networks, not only the relationships between users, represented by edges, but also their personal information, represented by features, are present. In this paper, we address the challen… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: IEEE International Symposium on Information Theory (ISIT) 2024

  27. arXiv:2407.08991  [pdf

    eess.AS cs.AI cs.CC

    Optimization of DNN-based speaker verification model through efficient quantization technique

    Authors: Yeona Hong, Woo-Jin Chung, Hong-Goo Kang

    Abstract: As Deep Neural Networks (DNNs) rapidly advance in various fields, including speech verification, they typically involve high computational costs and substantial memory consumption, which can be challenging to manage on mobile systems. Quantization of deep models offers a means to reduce both computational and memory expenses. Our research proposes an optimization framework for the quantization of… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: in Korean language, Accepted at Society of Electronic Engineers of Korea Conference 2024

  28. arXiv:2407.00568  [pdf, other

    cs.LG cs.AI

    Divide And Conquer: Learning Chaotic Dynamical Systems With Multistep Penalty Neural Ordinary Differential Equations

    Authors: Dibyajyoti Chakraborty, Seung Whan Chung, Troy Arcomano, Romit Maulik

    Abstract: Forecasting high-dimensional dynamical systems is a fundamental challenge in various fields, such as geosciences and engineering. Neural Ordinary Differential Equations (NODEs), which combine the power of neural networks and numerical solvers, have emerged as a promising algorithm for forecasting complex nonlinear dynamical systems. However, classical techniques used for NODE training are ineffect… ▽ More

    Submitted 15 October, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: 25 pages, 17 Figures, submitted to Computer Methods in Applied Mechanics and Engineering

  29. arXiv:2406.18561  [pdf, other

    cs.CV cs.LG

    SelMatch: Effectively Scaling Up Dataset Distillation via Selection-Based Initialization and Partial Updates by Trajectory Matching

    Authors: Yongmin Lee, Hye Won Chung

    Abstract: Dataset distillation aims to synthesize a small number of images per class (IPC) from a large dataset to approximate full dataset training with minimal performance loss. While effective in very small IPC ranges, many distillation methods become less effective, even underperforming random sample selection, as IPC increases. Our examination of state-of-the-art trajectory-matching based distillation… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

    Comments: ICML 2024

  30. arXiv:2406.17329  [pdf, other

    eess.SP cs.SD eess.AS physics.bio-ph

    Speaker-Independent Acoustic-to-Articulatory Inversion through Multi-Channel Attention Discriminator

    Authors: Woo-Jin Chung, Hong-Goo Kang

    Abstract: We present a novel speaker-independent acoustic-to-articulatory inversion (AAI) model, overcoming the limitations observed in conventional AAI models that rely on acoustic features derived from restricted datasets. To address these challenges, we leverage representations from a pre-trained self-supervised learning (SSL) model to more effectively estimate the global, local, and kinematic pattern in… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024

  31. arXiv:2406.03057  [pdf, other

    cs.LG stat.ML

    BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges

    Authors: Hoyong Choi, Nohyun Ki, Hye Won Chung

    Abstract: Data subset selection aims to find a smaller yet informative subset of a large dataset that can approximate the full-dataset training, addressing challenges associated with training neural networks on large-scale datasets. However, existing methods tend to specialize in either high or low selection ratio regimes, lacking a universal approach that consistently achieves competitive performance acros… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  32. arXiv:2403.19180  [pdf

    eess.SP cs.ET

    A Robust UWOC-assisted Multi-hop Topology for Underwater Sensor Network Nodes

    Authors: Maaz Salman, Javad Bolboli, Wan-Young Chung

    Abstract: Underwater environment is substantially less explored territory as compared to earth surface due to lack of robust underwater communication infrastructure. For Internet of Underwater things connectivity, underwater wireless optical communication can play a vital role, compared to conventional radio frequency communication, due to longer range, high data rate, low latency, and unregulated bandwidth… ▽ More

    Submitted 31 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  33. An Interpretable Generalization Mechanism for Accurately Detecting Anomaly and Identifying Networking Intrusion Techniques

    Authors: Hao-Ting Pai, Yu-Hsuan Kang, Wen-Cheng Chung

    Abstract: Recent advancements in Intrusion Detection Systems (IDS), integrating Explainable AI (XAI) methodologies, have led to notable improvements in system performance via precise feature selection. However, a thorough understanding of cyber-attacks requires inherently explainable decision-making processes within IDS. In this paper, we present the Interpretable Generalization Mechanism (IG), poised to re… ▽ More

    Submitted 5 November, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Journal ref: IEEE Transactions on Information Forensics and Security, 2024

  34. arXiv:2402.11223  [pdf, other

    cs.LG

    HEAL: Brain-inspired Hyperdimensional Efficient Active Learning

    Authors: Yang Ni, Zhuowen Zou, Wenjun Huang, Hanning Chen, William Youngwoo Chung, Samuel Cho, Ranganath Krishnan, Pietro Mercati, Mohsen Imani

    Abstract: Drawing inspiration from the outstanding learning capability of our human brains, Hyperdimensional Computing (HDC) emerges as a novel computing paradigm, and it leverages high-dimensional vector presentation and operations for brain-like lightweight Machine Learning (ML). Practical deployments of HDC have significantly enhanced the learning efficiency compared to current deep ML methods on a broad… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  35. arXiv:2402.10482  [pdf, other

    cs.LG stat.ML

    Rethinking Self-Distillation: Label Averaging and Enhanced Soft Label Refinement with Partial Labels

    Authors: Hyeonsu Jeong, Hye Won Chung

    Abstract: We investigate the mechanisms of self-distillation in multi-class classification, particularly in the context of linear probing with fixed feature extractors where traditional feature learning explanations do not apply. Our theoretical analysis reveals that multi-round self-distillation effectively performs label averaging among instances with high feature correlations, governed by the eigenvector… ▽ More

    Submitted 19 February, 2025; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ICLR 2025

  36. arXiv:2401.10245  [pdf, other

    cs.CE physics.flu-dyn

    Train Small, Model Big: Scalable Physics Simulators via Reduced Order Modeling and Domain Decomposition

    Authors: Seung Whan Chung, Youngsoo Choi, Pratanu Roy, Thomas Moore, Thomas Roy, Tiras Y. Lin, Du Y. Nguyen, Christopher Hahn, Eric B. Duoss, Sarah E. Baker

    Abstract: Numerous cutting-edge scientific technologies originate at the laboratory scale, but transitioning them to practical industry applications is a formidable challenge. Traditional pilot projects at intermediate scales are costly and time-consuming. An alternative, the E-pilot, relies on high-fidelity numerical simulations, but even these simulations can be computationally prohibitive at larger scale… ▽ More

    Submitted 5 December, 2023; originally announced January 2024.

    Comments: 40 pages, 12 figures. Submitted to Computer Methods in Applied Mechanics and Engineering

    Report number: LLNL-JRNL-857774 MSC Class: 65F55; 65N55 (primary) 76D07 (secondary)

  37. arXiv:2312.15320  [pdf

    q-bio.QM cs.CV cs.LG cs.MM q-bio.GN

    GestaltMML: Enhancing Rare Genetic Disease Diagnosis through Multimodal Machine Learning Combining Facial Images and Clinical Texts

    Authors: Da Wu, Jingye Yang, Cong Liu, Tzung-Chien Hsieh, Elaine Marchi, Justin Blair, Peter Krawitz, Chunhua Weng, Wendy Chung, Gholson J. Lyon, Ian D. Krantz, Jennifer M. Kalish, Kai Wang

    Abstract: Individuals with suspected rare genetic disorders often undergo multiple clinical evaluations, imaging studies, laboratory tests and genetic tests, to find a possible answer over a prolonged period of time. Addressing this "diagnostic odyssey" thus has substantial clinical, psychosocial, and economic benefits. Many rare genetic diseases have distinctive facial features, which can be used by artifi… ▽ More

    Submitted 21 April, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: Significant revisions

  38. arXiv:2311.10327  [pdf, other

    cs.RO eess.SY

    Dimensionality Reduction of Dynamics on Lie Manifolds via Structure-Aware Canonical Correlation Analysis

    Authors: Wooyoung Chung, Daniel Polani, Stas Tiomkin

    Abstract: Incorporating prior knowledge into a data-driven modeling problem can drastically improve performance, reliability, and generalization outside of the training sample. The stronger the structural properties, the more effective these improvements become. Manifolds are a powerful nonlinear generalization of Euclidean space for modeling finite dimensions. Structural impositions in constrained systems… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  39. arXiv:2311.00364  [pdf, other

    eess.AS cs.SD physics.bio-ph

    C2C: Cough to COVID-19 Detection in BHI 2023 Data Challenge

    Authors: Woo-Jin Chung, Miseul Kim, Hong-Goo Kang

    Abstract: This report describes our submission to BHI 2023 Data Competition: Sensor challenge. Our Audio Alchemists team designed an acoustic-based COVID-19 diagnosis system, Cough to COVID-19 (C2C), and won the 1st place in the challenge. C2C involves three key contributions: pre-processing of input signals, cough-related representation extraction leveraging Wav2vec2.0, and data augmentation. Through exper… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 1st place winning paper from the BHI 2023 Data Challenge Competition: Sensor Informatics

  40. arXiv:2310.12467  [pdf, other

    cs.CL

    Contrastive Learning for Inference in Dialogue

    Authors: Etsuko Ishii, Yan Xu, Bryan Wilie, Ziwei Ji, Holy Lovenia, Willy Chung, Pascale Fung

    Abstract: Inference, especially those derived from inductive processes, is a crucial component in our conversation to complement the information implicitly or explicitly conveyed by a speaker. While recent large language models show remarkable advances in inference tasks, their performance in inductive reasoning, where not all information is present in the context, is far behind deductive reasoning. In this… ▽ More

    Submitted 12 November, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP2023

  41. arXiv:2310.08885  [pdf, other

    cs.CL

    InstructTODS: Large Language Models for End-to-End Task-Oriented Dialogue Systems

    Authors: Willy Chung, Samuel Cahyawijaya, Bryan Wilie, Holy Lovenia, Pascale Fung

    Abstract: Large language models (LLMs) have been used for diverse tasks in natural language processing (NLP), yet remain under-explored for task-oriented dialogue systems (TODS), especially for end-to-end TODS. We present InstructTODS, a novel off-the-shelf framework for zero-shot end-to-end task-oriented dialogue systems that can adapt to diverse domains without fine-tuning. By leveraging LLMs, InstructTOD… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  42. arXiv:2309.13457  [pdf, other

    cs.LG cs.CV physics.comp-ph physics.flu-dyn

    Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data

    Authors: Wai Tong Chung, Bassem Akoush, Pushan Sharma, Alex Tamkin, Ki Sung Jung, Jacqueline H. Chen, Jack Guo, Davy Brouzet, Mohsen Talei, Bruno Savard, Alexei Y. Poludnenko, Matthias Ihme

    Abstract: Analysis of compressible turbulent flows is essential for applications related to propulsion, energy generation, and the environment. Here, we present BLASTNet 2.0, a 2.2 TB network-of-datasets containing 744 full-domain samples from 34 high-fidelity direct numerical simulations, which addresses the current limited availability of 3D high-fidelity reacting and non-reacting compressible turbulent f… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted in Adv. in Neural Information Processing Systems 36 (NeurIPS 2023). Link: https://nips.cc/virtual/2023/poster/73433 . 55 pages, 21 figures. Keywords: Super-resolution, 3D, Neural Scaling, Physics-informed Loss, Computational Fluid Dynamics, Partial Differential Equations, Turbulent Reacting Flows, Direct Numerical Simulation, Fluid Mechanics, Combustion, Computer Vision

  43. arXiv:2309.10413  [pdf, other

    cs.CL

    PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems

    Authors: Bryan Wilie, Yan Xu, Willy Chung, Samuel Cahyawijaya, Holy Lovenia, Pascale Fung

    Abstract: Grounding dialogue response generation on external knowledge is proposed to produce informative and engaging responses. However, current knowledge-grounded dialogue (KGD) systems often fail to align the generated responses with human-preferred qualities due to several issues like hallucination and the lack of coherence. Upon analyzing multiple language model generations, we observe the presence of… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  44. arXiv:2309.05182  [pdf, ps, other

    cs.IT cs.DS

    Graph Matching in Correlated Stochastic Block Models for Improved Graph Clustering

    Authors: Joonhyuk Yang, Hye Won Chung

    Abstract: We consider community detection from multiple correlated graphs sharing the same community structure. The correlated graphs are generated by independent subsampling of a parent graph sampled from the stochastic block model. The vertex correspondence between the correlated graphs is assumed to be unknown. We consider the two-step procedure where the vertex correspondence between the correlated grap… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Allerton Conference 2023

  45. arXiv:2306.14517  [pdf, other

    cs.CL cs.SD eess.AS

    Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition

    Authors: Samuel Cahyawijaya, Holy Lovenia, Willy Chung, Rita Frieske, Zihan Liu, Pascale Fung

    Abstract: Speech emotion recognition plays a crucial role in human-computer interactions. However, most speech emotion recognition research is biased toward English-speaking adults, which hinders its applicability to other demographic groups in different languages and age groups. In this work, we analyze the transferability of emotion recognition across three different languages--English, Mandarin Chinese,… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted in INTERSPEECH 2023

  46. arXiv:2306.01859  [pdf, other

    cs.CV cs.AI

    Spatially Resolved Gene Expression Prediction from H&E Histology Images via Bi-modal Contrastive Learning

    Authors: Ronald Xie, Kuan Pang, Sai W. Chung, Catia T. Perciani, Sonya A. MacParland, Bo Wang, Gary D. Bader

    Abstract: Histology imaging is an important tool in medical diagnosis and research, enabling the examination of tissue structure and composition at the microscopic level. Understanding the underlying molecular mechanisms of tissue architecture is critical in uncovering disease mechanisms and developing effective treatments. Gene expression profiling provides insight into the molecular processes underlying t… ▽ More

    Submitted 27 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  47. arXiv:2305.19666  [pdf, other

    cs.DS cs.LG cs.SI stat.ML

    Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation

    Authors: Joonhyuk Yang, Dongpil Shin, Hye Won Chung

    Abstract: We consider the problem of graph matching, or learning vertex correspondence, between two correlated stochastic block models (SBMs). The graph matching problem arises in various fields, including computer vision, natural language processing and bioinformatics, and in particular, matching graphs with inherent community structure has significance related to de-anonymization of correlated social netw… ▽ More

    Submitted 2 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  48. arXiv:2305.14705  [pdf, other

    cs.CL

    Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models

    Authors: Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou

    Abstract: Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnable parameters to Large Language Models (LLMs) without increasing inference cost. Instruction tuning is a technique for training LLMs to follow instructions. We advocate combining these two approaches, as we find that MoE models benefit more from instruction tuning than dense models. In particular, we… ▽ More

    Submitted 5 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Preprint

  49. arXiv:2305.13627  [pdf, other

    cs.CL cs.AI

    InstructAlign: High-and-Low Resource Language Alignment via Continual Crosslingual Instruction Tuning

    Authors: Samuel Cahyawijaya, Holy Lovenia, Tiezheng Yu, Willy Chung, Pascale Fung

    Abstract: Large language models (LLMs) that are tuned with instructions have demonstrated remarkable capabilities in various tasks and languages. However, their ability to generalize to underrepresented languages is limited due to the scarcity of available data. Additionally, directly adapting new languages to instruction-tuned LLMs can result in catastrophic forgetting, which leads to the loss of multitask… ▽ More

    Submitted 24 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  50. arXiv:2304.11220  [pdf, other

    cs.CL

    Learn What NOT to Learn: Towards Generative Safety in Chatbots

    Authors: Leila Khalatbari, Yejin Bang, Dan Su, Willy Chung, Saeed Ghadimi, Hossein Sameti, Pascale Fung

    Abstract: Conversational models that are generative and open-domain are particularly susceptible to generating unsafe content since they are trained on web-based social data. Prior approaches to mitigating this issue have drawbacks, such as disrupting the flow of conversation, limited generalization to unseen toxic input contexts, and sacrificing the quality of the dialogue for the sake of safety. In this p… ▽ More

    Submitted 25 April, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: 9 pages, 3 tables, 3 figures