Skip to main content

Showing 1–17 of 17 results for author: Paik, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.20892  [pdf, other

    cs.LG

    One-Time Soft Alignment Enables Resilient Learning without Weight Transport

    Authors: Jeonghwan Cheon, Jaehyuk Bae, Se-Bum Paik

    Abstract: Backpropagation is the cornerstone of deep learning, but its reliance on symmetric weight transport and global synchronization makes it computationally expensive and biologically implausible. Feedback alignment offers a promising alternative by approximating error gradients through fixed random feedback, thereby avoiding symmetric weight transport. However, this approach often struggles with poor… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 28 pages

  2. arXiv:2503.02857  [pdf, other

    cs.CV cs.AI cs.CY

    Deepfake-Eval-2024: A Multi-Modal In-the-Wild Benchmark of Deepfakes Circulated in 2024

    Authors: Nuria Alina Chandra, Ryan Murtfeldt, Lin Qiu, Arnab Karmakar, Hannah Lee, Emmanuel Tanumihardja, Kevin Farhat, Ben Caffee, Sejin Paik, Changyeon Lee, Jongwook Choi, Aerin Kim, Oren Etzioni

    Abstract: In the age of increasingly realistic generative AI, robust deepfake detection is essential for mitigating fraud and disinformation. While many deepfake detectors report high accuracy on academic datasets, we show that these academic benchmarks are out of date and not representative of real-world deepfakes. We introduce Deepfake-Eval-2024, a new deepfake detection benchmark consisting of in-the-wil… ▽ More

    Submitted 27 May, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  3. arXiv:2501.18642  [pdf, other

    cs.CV cs.AI cs.GR cs.HC cs.LG

    DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model

    Authors: Sarah Bonna, Yu-Cheng Huang, Ekaterina Novozhilova, Sejin Paik, Zhengyang Shan, Michelle Yilin Feng, Ge Gao, Yonish Tayal, Rushil Kulkarni, Jialin Yu, Nupur Divekar, Deepti Ghadiyaram, Derry Wijaya, Margrit Betke

    Abstract: Ethical intervention prompting has emerged as a tool to counter demographic biases of text-to-image generative AI models. Existing solutions either require to retrain the model or struggle to generate images that reflect desired distributions on gender and race. We propose an inference-time process called DebiasPI for Debiasing-by-Prompt-Iteration that provides prompt intervention by enabling the… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

    Comments: This work was presented at The European Conference on Computer Vision (ECCV) 2024 Workshop "Fairness and ethics towards transparent AI: facing the chalLEnge through model Debiasing" (FAILED), Milano, Italy, on September 29, 2024, https://failed-workshop-eccv-2024.github.io

  4. arXiv:2412.17411  [pdf, other

    cs.LG cs.AI cs.NE

    Pretraining with random noise for uncertainty calibration

    Authors: Jeonghwan Cheon, Se-Bum Paik

    Abstract: Uncertainty calibration is crucial for various machine learning applications, yet it remains challenging. Many models exhibit hallucinations - confident yet inaccurate responses - due to miscalibrated confidence. Here, we show that the common practice of random initialization in deep learning, often considered a standard technique, is an underlying cause of this miscalibration, leading to excessiv… ▽ More

    Submitted 27 March, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

  5. arXiv:2411.13100  [pdf, ps, other

    cs.CL cs.AI

    Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control

    Authors: Yunkee Chae, Eunsik Shin, Suntae Hwang, Seungryeol Paik, Kyogu Lee

    Abstract: Lyrics generation presents unique challenges, particularly in achieving precise syllable control while adhering to song form structures such as verses and choruses. Conventional line-by-line approaches often lead to unnatural phrasing, underscoring the need for more granular syllable management. We propose a framework for lyrics generation that enables multi-level syllable control at the word, phr… ▽ More

    Submitted 23 June, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: Accepted to Interspeech 2025

  6. arXiv:2407.10091  [pdf, other

    cs.CL

    Enhancing Emotion Prediction in News Headlines: Insights from ChatGPT and Seq2Seq Models for Free-Text Generation

    Authors: Ge Gao, Jongin Kim, Sejin Paik, Ekaterina Novozhilova, Yi Liu, Sarah T. Bonna, Margrit Betke, Derry Tanti Wijaya

    Abstract: Predicting emotions elicited by news headlines can be challenging as the task is largely influenced by the varying nature of people's interpretations and backgrounds. Previous works have explored classifying discrete emotions directly from news headlines. We provide a different approach to tackling this problem by utilizing people's explanations of their emotion, written in free-text, on how they… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: published at LREC-COLING 2024

    ACM Class: I.2.7

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) 5944-5955

  7. arXiv:2407.07133  [pdf

    cs.NE cs.AI cs.CV cs.LG

    Neuromimetic metaplasticity for adaptive continual learning

    Authors: Suhee Cho, Hyeonsu Lee, Seungdae Baek, Se-Bum Paik

    Abstract: Conventional intelligent systems based on deep neural network (DNN) models encounter challenges in achieving human-like continual learning due to catastrophic forgetting. Here, we propose a metaplasticity model inspired by human working memory, enabling DNNs to perform catastrophic forgetting-free continual learning without any pre- or post-processing. A key aspect of our approach involves impleme… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 25 pages, 5 figures, 1 table, 4 supplementary figures

  8. arXiv:2405.16731  [pdf, other

    cs.LG cs.NE

    Pretraining with Random Noise for Fast and Robust Learning without Weight Transport

    Authors: Jeonghwan Cheon, Sang Wan Lee, Se-Bum Paik

    Abstract: The brain prepares for learning even before interacting with the environment, by refining and optimizing its structures through spontaneous neural activity that resembles random noise. However, the mechanism of such a process has yet to be thoroughly understood, and it is unclear whether this process can benefit the algorithm of machine learning. Here, we study this issue using a neural network wi… ▽ More

    Submitted 9 May, 2025; v1 submitted 26 May, 2024; originally announced May 2024.

    Journal ref: Advances in Neural Information Processing Systems 37, 13748-13768, 2024

  9. arXiv:2311.06707  [pdf, other

    cs.SD cs.LG eess.AS

    Transfer Learning to Detect COVID-19 Coughs with Incremental Addition of Patient Coughs to Healthy People's Cough Detection Models

    Authors: Sudip Vhaduri, Seungyeon Paik, Jessica E Huber

    Abstract: Millions of people have died worldwide from COVID-19. In addition to its high death toll, COVID-19 has led to unbearable suffering for individuals and a huge global burden to the healthcare sector. Therefore, researchers have been trying to develop tools to detect symptoms of this human-transmissible disease remotely to control its rapid spread. Coughing is one of the common symptoms that research… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted to publish at EAI International Conference on Wireless Mobile Communication and Healthcare (MobiHealth'23)

  10. arXiv:2309.02422  [pdf, other

    stat.ML cs.LG stat.ME

    Integral Probability Metrics Meet Neural Networks: The Radon-Kolmogorov-Smirnov Test

    Authors: Seunghoon Paik, Michael Celentano, Alden Green, Ryan J. Tibshirani

    Abstract: Integral probability metrics (IPMs) constitute a general class of nonparametric two-sample tests that are based on maximizing the mean difference between samples from one distribution $P$ versus another $Q$, over all choices of data transformations $f$ living in some function space $\mathcal{F}$. Inspired by recent work that connects what are known as functions of… ▽ More

    Submitted 12 January, 2025; v1 submitted 5 September, 2023; originally announced September 2023.

  11. arXiv:2303.08610  [pdf, other

    cs.SD eess.AS

    Blind Estimation of Audio Processing Graph

    Authors: Sungho Lee, Jaehyun Park, Seungryeol Paik, Kyogu Lee

    Abstract: Musicians and audio engineers sculpt and transform their sounds by connecting multiple processors, forming an audio processing graph. However, most deep-learning methods overlook this real-world practice and assume fixed graph settings. To bridge this gap, we develop a system that reconstructs the entire graph from a given reference audio. We first generate a realistic graph-reference pair dataset… ▽ More

    Submitted 7 May, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023

  12. arXiv:2205.11605  [pdf, other

    cs.CL cs.CY

    On Measuring Social Biases in Prompt-Based Multi-Task Learning

    Authors: Afra Feyza Akyürek, Sejin Paik, Muhammed Yusuf Kocyigit, Seda Akbiyik, Şerife Leman Runyun, Derry Wijaya

    Abstract: Large language models trained on a mixture of NLP tasks that are converted into a text-to-text format using prompts, can generalize into novel forms of language and handle novel tasks. A large body of work within prompt engineering attempts to understand the effects of input forms and prompts in achieving superior performance. We consider an alternative measure and inquire whether the way in which… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Findings of NAACL 2022

  13. arXiv:2205.11601  [pdf, other

    cs.CL cs.CY

    Challenges in Measuring Bias via Open-Ended Language Generation

    Authors: Afra Feyza Akyürek, Muhammed Yusuf Kocyigit, Sejin Paik, Derry Wijaya

    Abstract: Researchers have devised numerous ways to quantify social biases vested in pretrained language models. As some language models are capable of generating coherent completions given a set of textual prompts, several prompting datasets have been proposed to measure biases between social groups -- posing language generation as a way of identifying biases. In this opinion paper, we analyze how specific… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 4th Workshop on Gender Bias in Natural Language Processing. NAACL, 2022

  14. arXiv:2205.08295  [pdf, other

    stat.ML cs.LG

    Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization

    Authors: Young-Geun Choi, Gi-Soo Kim, Seunghoon Paik, Myunghee Cho Paik

    Abstract: Non-stationarity is ubiquitous in human behavior and addressing it in the contextual bandits is challenging. Several works have addressed the problem by investigating semi-parametric contextual bandits and warned that ignoring non-stationarity could harm performances. Another prevalent human behavior is social interaction which has become available in a form of a social network or graph structure.… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  15. arXiv:2202.08520  [pdf, other

    eess.AS cs.LG cs.SD

    End-to-end Music Remastering System Using Self-supervised and Adversarial Training

    Authors: Junghyun Koo, Seungryeol Paik, Kyogu Lee

    Abstract: Mastering is an essential step in music production, but it is also a challenging task that has to go through the hands of experienced audio engineers, where they adjust tone, space, and volume of a song. Remastering follows the same technical process, in which the context lies in mastering a song for the times. As these tasks have high entry barriers, we aim to lower the barriers by proposing an e… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022

  16. arXiv:2108.01812  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation

    Authors: Seongmin Park, Dongchan Shin, Sangyoun Paik, Subong Choi, Alena Kazakova, Jihwa Lee

    Abstract: Fine-tuning pretrained language models (LMs) is a popular approach to automatic speech recognition (ASR) error detection during post-processing. While error detection systems often take advantage of statistical language archetypes captured by LMs, at times the pretrained knowledge can hinder error detection performance. For instance, presence of speech disfluencies might confuse the post-processin… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  17. arXiv:2103.02147  [pdf, other

    eess.AS cs.LG cs.SD

    Reverb Conversion of Mixed Vocal Tracks Using an End-to-end Convolutional Deep Neural Network

    Authors: Junghyun Koo, Seungryeol Paik, Kyogu Lee

    Abstract: Reverb plays a critical role in music production, where it provides listeners with spatial realization, timbre, and texture of the music. Yet, it is challenging to reproduce the musical reverb of a reference music track even by skilled engineers. In response, we propose an end-to-end system capable of switching the musical reverb factor of two different mixed vocal tracks. This method enables us t… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: To appear in ICASSP 2021