Showing 1–9 of 9 results for author: Taejoong

  1. arXiv:2502.04580  [pdf, other]

    cs.LG cs.AI

    Technical Debt in In-Context Learning: Diminishing Efficiency in Long Context

    Authors: Taejong Joo, Diego Klabjan

    Abstract: Transformers have demonstrated remarkable in-context learning (ICL) capabilities, adapting to new tasks by simply conditioning on demonstrations without parameter updates. Compelling empirical and theoretical evidence suggests that ICL, as a general-purpose learner, could outperform task-specific models. However, it remains unclear to what extent the transformers optimally learn in-context compare…

    Submitted 6 February, 2025; originally announced February 2025.

  2. arXiv:2411.00586  [pdf, other]

    cs.LG

    Improving self-training under distribution shifts via anchored confidence with theoretical guarantees

    Authors: Taejong Joo, Diego Klabjan

    Abstract: Self-training often falls short under distribution shifts due to an increased discrepancy between prediction confidence and actual accuracy. This typically necessitates computationally demanding methods such as neighborhood or ensemble-based label corrections. Drawing inspiration from insights on early learning regularization, we develop a principled method to improve self-training under distribut…

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  3. arXiv:2404.16212  [pdf, other]

    cs.CR cs.CV cs.LG

    An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape

    Authors: Sifat Muhammad Abdullah, Aravind Cheruvu, Shravya Kanchi, Taejoong Chung, Peng Gao, Murtuza Jadliwala, Bimal Viswanath

    Abstract: Deepfake or synthetic images produced using deep generative models pose serious risks to online platforms. This has triggered several research efforts to accurately detect deepfake images, achieving excellent performance on publicly available deepfake datasets. In this work, we study 8 state-of-the-art detectors and argue that they are far from being ready for deployment due to two recent developm…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted to IEEE S&P 2024; 19 pages, 10 figures

  4. arXiv:2310.10611  [pdf, other]

    cs.LG stat.ML

    IW-GAE: Importance Weighted Group Accuracy Estimation for Improved Calibration and Model Selection in Unsupervised Domain Adaptation

    Authors: Taejong Joo, Diego Klabjan

    Abstract: Distribution shifts pose significant challenges for model calibration and model selection tasks in the unsupervised domain adaptation problem -- a scenario where the goal is to perform well in a distribution shifted domain without labels. In this work, we tackle difficulties coming from distribution shifts by developing a novel importance weighted group accuracy estimator. Specifically, we present…

    Submitted 17 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  5. Privacy Guarantees of BLE Contact Tracing: A Case Study on COVIDWISE

    Authors: Salman Ahmed, Ya Xiao, Taejoong Chung, Carol Fung, Moti Yung, Danfeng Yao

    Abstract: Google and Apple jointly introduced a digital contact tracing technology and an API called "exposure notification," to help health organizations and governments with contact tracing. The technology and its interplay with security and privacy constraints require investigation. In this study, we examine and analyze the security, privacy, and reliability of the technology with actual and typical scen…

    Submitted 16 December, 2021; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

    Journal ref: IEEE Computer 2021

  6. arXiv:2006.06399  [pdf, other]

    cs.LG stat.ML

    Revisiting Explicit Regularization in Neural Networks for Well-Calibrated Predictive Uncertainty

    Authors: Taejong Joo, Uijung Chung

    Abstract: From the statistical learning perspective, complexity control via explicit regularization is a necessity for improving the generalization of over-parameterized models. However, the impressive generalization performance of neural networks with only implicit regularization may be at odds with this conventional wisdom. In this work, we revisit the importance of explicit regularization for obtaining w…

    Submitted 6 February, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  7. arXiv:2002.07965  [pdf, other]

    cs.LG stat.ML

    Being Bayesian about Categorical Probability

    Authors: Taejong Joo, Uijung Chung, Min-Gwan Seo

    Abstract: Neural networks utilize the softmax as a building block in classification tasks, which contains an overconfidence problem and lacks an uncertainty representation ability. As a Bayesian alternative to the softmax, we consider a random variable of a categorical probability over class labels. In this framework, the prior distribution explicitly models the presumed noise inherent in the observed label…

    Submitted 29 June, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  8. arXiv:2002.05366  [pdf, other]

    cs.LG stat.ML

    Regularizing activations in neural networks via distribution matching with the Wasserstein metric

    Authors: Taejong Joo, Donggu Kang, Byunghoon Kim

    Abstract: Regularization and normalization have become indispensable components in training deep neural networks, resulting in faster training and improved generalization performance. We propose the projected error function regularization loss (PER) that encourages activations to follow the standard normal distribution. PER randomly projects activations onto one-dimensional space and computes the regulariza…

    Submitted 26 April, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: ICLR 2020

  9. arXiv:1008.2574  [pdf, ps, other]

    cs.NI

    An Empirical Study on Content Bundling in BitTorrent Swarming System

    Authors: Jinyoung Han, Taejoong Chung, Seungbae Kim, Hyun-chul Kim, Ted "Taekyoung" Kwon, Yanghee Choi

    Abstract: Despite the tremendous success of BitTorrent, its swarming system suffers from a fundamental limitation: lower or no availability of unpopular contents. Recently, Menasche et al. has shown that bundling is a promising solution to mitigate this availability problem; it improves the availability and reduces download times for unpopular contents by combining multiple files into a single swarm. There…

    Submitted 16 August, 2010; originally announced August 2010.