Showing 1–50 of 55 results for author: Kim, T

  1. arXiv:2505.20688  [pdf, ps, other]

    stat.ML cs.CV cs.LG stat.ME

    A False Discovery Rate Control Method Using a Fully Connected Hidden Markov Random Field for Neuroimaging Data

    Authors: Taehyo Kim, Qiran Jia, Mony J. de Leon, Hai Shu

    Abstract: False discovery rate (FDR) control methods are essential for voxel-wise multiple testing in neuroimaging data analysis, where hundreds of thousands or even millions of tests are conducted to detect brain regions associated with disease-related changes. Classical FDR control methods (e.g., BH, q-value, and LocalFDR) assume independence among tests and often lead to high false non-discovery rates (F…

    Submitted 29 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2503.06913  [pdf, ps, other]

    stat.ME math.OC stat.ML

    Data-Driven Sequential Sampling for Tail Risk Mitigation

    Authors: Dohyun Ahn, Taeho Kim

    Abstract: Given a finite collection of stochastic alternatives, we study the problem of sequentially allocating a fixed sampling budget to identify the optimal alternative with a high probability, where the optimal alternative is defined as the one with the smallest value of extreme tail risk. We particularly consider a situation where these alternatives generate heavy-tailed losses whose probability distri…

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 52 pages, 5 figures

  3. arXiv:2503.02645  [pdf, other]

    cs.LG stat.ML stat.OT

    A Generalized Theory of Mixup for Structure-Preserving Synthetic Data

    Authors: Chungpa Lee, Jongho Im, Joseph H. T. Kim

    Abstract: Mixup is a widely adopted data augmentation technique known for enhancing the generalization of machine learning models by interpolating between data points. Despite its success and popularity, limited attention has been given to understanding the statistical properties of the synthetic data it generates. In this paper, we delve into the theoretical underpinnings of mixup, specifically its effects…

    Submitted 3 March, 2025; originally announced March 2025.

    Journal ref: Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025

  4. arXiv:2503.01882  [pdf, other]

    cs.LG physics.geo-ph stat.AP stat.ML

    Constructing balanced datasets for predicting failure modes in structural systems under seismic hazards

    Authors: Jungho Kim, Taeyong Kim

    Abstract: Accurate prediction of structural failure modes under seismic excitations is essential for seismic risk and resilience assessment. Traditional simulation-based approaches often result in imbalanced datasets dominated by non-failure or frequently observed failure scenarios, limiting the effectiveness in machine learning-based prediction. To address this challenge, this study proposes a framework fo…

    Submitted 26 February, 2025; originally announced March 2025.

  5. arXiv:2502.16659  [pdf, other]

    stat.ME math.OC stat.ML

    Optimizing Input Data Collection for Ranking and Selection

    Authors: Eunhye Song, Taeho Kim

    Abstract: We study a ranking and selection (R&S) problem when all solutions share common parametric Bayesian input models updated with the data collected from multiple independent data-generating sources. Our objective is to identify the best system by designing a sequential sampling algorithm that collects input and simulation data given a budget. We adopt the most probable best (MPB) as the estimator of t…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: 48 pages, 6 figures

  6. arXiv:2501.00628  [pdf, other]

    cs.LG stat.ML

    Matrix factorization and prediction for high dimensional co-occurrence count data via shared parameter alternating zero inflated Gamma model

    Authors: Taejoon Kim, Haiyan Wang

    Abstract: High-dimensional sparse matrix data frequently arise in various applications. A notable example is the weighted word-word co-occurrence count data, which summarizes the weighted frequency of word pairs appearing within the same context window. This type of data typically contains highly skewed non-negative values with an abundance of zeros. Another example is the co-occurrence of item-item or user…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: 39 pages, 5 figures

  7. arXiv:2501.00623  [pdf, other]

    cs.LG stat.ML

    Global dense vector representations for words or items using shared parameter alternating Tweedie model

    Authors: Taejoon Kim, Haiyan Wang

    Abstract: In this article, we present a model for analyzing co-occurrence count data derived from practical fields, such as user-item or item-item data from an online shopping platform or co-occurring word-word pairs in sequences of texts. Such data contain important information for developing recommender systems or studying relevance of items or words from non-numerical sources. Different from traditional regr…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: 43 pages, 12 figures

  8. arXiv:2411.02776  [pdf, other]

    cs.LG stat.AP

    Deep learning-based modularized loading protocol for parameter estimation of Bouc-Wen class models

    Authors: Sebin Oh, Junho Song, Taeyong Kim

    Abstract: This study proposes a modularized deep learning-based loading protocol for optimal parameter estimation of Bouc-Wen (BW) class models. The protocol consists of two key components: optimal loading history construction and CNN-based rapid parameter estimation. Each component is decomposed into independent sub-modules tailored to distinct hysteretic behaviors: basic hysteresis, structural degradation,…

    Submitted 4 November, 2024; originally announced November 2024.

  9. arXiv:2408.13751  [pdf, other]

    stat.ML cs.LG math.OC

    Improved identification of breakpoints in piecewise regression and its applications

    Authors: Taehyeong Kim, Hyungu Lee, Hayoung Choi

    Abstract: Identifying breakpoints in piecewise regression is critical in enhancing the reliability and interpretability of data fitting. In this paper, we propose novel algorithms based on the greedy algorithm to accurately and efficiently identify breakpoints in piecewise polynomial regression. The algorithm updates the breakpoints to minimize the error by exploring the neighborhood of each breakpoint. It…

    Submitted 27 August, 2024; v1 submitted 25 August, 2024; originally announced August 2024.

    Comments: 13 pages, 6 figures

  10. arXiv:2407.10784  [pdf, other]

    cs.LG cs.AI stat.ML

    AdapTable: Test-Time Adaptation for Tabular Data via Shift-Aware Uncertainty Calibrator and Label Distribution Handler

    Authors: Changhun Kim, Taewon Kim, Seungyeon Woo, June Yong Yang, Eunho Yang

    Abstract: In real-world scenarios, tabular data often suffer from distribution shifts that threaten the performance of machine learning models. Despite its prevalence and importance, handling distribution shifts in the tabular domain remains underexplored due to the inherent challenges within the tabular data itself. In this sense, test-time adaptation (TTA) offers a promising solution by adapting models to…

    Submitted 12 February, 2025; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: NeurIPS Workshop on Table Representation Learning (NeurIPSW-TRL), 2024

  11. arXiv:2404.13321  [pdf]

    stat.AP eess.SY

    Accelerated System-Reliability-based Disaster Resilience Analysis for Structural Systems

    Authors: Taeyong Kim, Sang-ri Yi

    Abstract: Resilience has emerged as a crucial concept for evaluating structural performance under disasters because of its ability to extend beyond traditional risk assessments, accounting for a system's ability to minimize disruptions and maintain functionality during recovery. To facilitate the holistic understanding of resilience performance in structural systems, a system-reliability-based disaster resi…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 25 pages, 18 figures

  12. arXiv:2312.03386  [pdf, other]

    cs.LG stat.ML

    An Infinite-Width Analysis on the Jacobian-Regularised Training of a Neural Network

    Authors: Taeyoung Kim, Hongseok Yang

    Abstract: The recent theoretical analysis of deep neural networks in their infinite-width limits has deepened our understanding of initialisation, feature learning, and training of those networks, and brought new practical techniques for finding appropriate hyperparameters, learning network weights, and performing inference. In this paper, we broaden this line of research by showing that this infinite-width…

    Submitted 21 August, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted at ICML 2024. 74 pages, 18 figures

  13. arXiv:2310.13349  [pdf, other]

    stat.ML cs.CV cs.LG

    DeepFDR: A Deep Learning-based False Discovery Rate Control Method for Neuroimaging Data

    Authors: Taehyo Kim, Hai Shu, Qiran Jia, Mony J. de Leon

    Abstract: Voxel-based multiple testing is widely used in neuroimaging data analysis. Traditional false discovery rate (FDR) control methods often ignore the spatial dependence among the voxel-based tests and thus suffer from substantial loss of testing power. While recent spatial FDR control methods have emerged, their validity and optimality remain questionable when handling the complex spatial dependencie…

    Submitted 10 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024), PMLR 238:946-954, 2024

  14. arXiv:2307.09254  [pdf, other]

    cs.LG cs.CL stat.ML

    Selective Generation for Controllable Language Models

    Authors: Minjae Lee, Kyungmin Kim, Taesoo Kim, Sangdon Park

    Abstract: Trustworthiness of generative language models (GLMs) is crucial in their deployment to critical decision making systems. Hence, certified risk control methods such as selective prediction and conformal prediction have been applied to mitigating the hallucination problem in various supervised downstream tasks. However, the lack of appropriate correctness metric hinders applying such principled meth…

    Submitted 27 January, 2025; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2024 (spotlight)

  15. arXiv:2307.08150  [pdf, other]

    stat.ME

    Efficient Treatment Effect Estimation with Out-of-bag Post-stratification

    Authors: Taebin Kim, Lili Wang, Randy Lai, Sangho Yoon

    Abstract: Post-stratification is often used to estimate treatment effects with higher efficiency. However, the majority of existing post-stratification frameworks depend on prior knowledge of the distributions of covariates and assume that the units are classified into post-strata without error. We propose a novel method to determine a proper stratification rule by mapping the covariates into a post-stratif…

    Submitted 12 September, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

  16. arXiv:2304.04221  [pdf, other]

    stat.ME

    Maximum Agreement Linear Prediction via the Concordance Correlation Coefficient

    Authors: Taeho Kim, George Luta, Matteo Bottai, Pierre Chausse, Gheorghe Doros, Edsel A. Pena

    Abstract: This paper examines distributional properties and predictive performance of the estimated maximum agreement linear predictor (MALP) introduced in Bottai, Kim, Lieberman, Luta, and Pena (2022) paper in The American Statistician, which is the linear predictor maximizing Lin's concordance correlation coefficient (CCC) between the predictor and the predictand. It is compared and contrasted, theoretica…

    Submitted 10 February, 2024; v1 submitted 9 April, 2023; originally announced April 2023.

    MSC Class: 62J99; 62H20; 62F99

  17. arXiv:2303.15833  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Complementary Domain Adaptation and Generalization for Unsupervised Continual Domain Shift Learning

    Authors: Wonguk Cho, Jinha Park, Taesup Kim

    Abstract: Continual domain shift poses a significant challenge in real-world applications, particularly in situations where labeled data is not available for new domains. The challenge of acquiring knowledge in this problem setting is referred to as unsupervised continual domain shift learning. Existing methods for domain adaptation and generalization have limitations in addressing this issue, as they focus…

    Submitted 13 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  18. arXiv:2210.13533  [pdf, other]

    cs.LG cs.AI stat.ML

    Sufficient Invariant Learning for Distribution Shift

    Authors: Taero Kim, Subeen Park, Sungjun Lim, Yonghan Jung, Krikamol Muandet, Kyungwoo Song

    Abstract: Learning robust models under distribution shifts between training and test datasets is a fundamental challenge in machine learning. While learning invariant features across environments is a popular approach, it often assumes that these features are fully observed in both training and test sets, a condition frequently violated in practice. When models rely on invariant features absent in the test s…

    Submitted 18 November, 2024; v1 submitted 24 October, 2022; originally announced October 2022.

  19. arXiv:2209.05150  [pdf, other]

    cs.LG stat.ML

    Bounding the Rademacher Complexity of Fourier neural operators

    Authors: Taeyoung Kim, Myungjoo Kang

    Abstract: A Fourier neural operator (FNO) is one of the physics-inspired machine learning methods. In particular, it is a neural operator. In recent times, several types of neural operators have been developed, e.g., deep operator networks, Graph neural operator (GNO), and Multiwavelet-based operator (MWTO). Compared with other models, the FNO is computationally efficient and can learn nonlinear operators b…

    Submitted 26 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 21 pages, 19 figures

  20. arXiv:2207.07533  [pdf, ps, other]

    stat.ME cs.LG stat.ML

    Selection of the Most Probable Best

    Authors: Taeho Kim, Kyoung-kuk Kim, Eunhye Song

    Abstract: We consider an expected-value ranking and selection (R&S) problem where all k solutions' simulation outputs depend on a common parameter whose uncertainty can be modeled by a distribution. We define the most probable best (MPB) to be the solution that has the largest probability of being optimal with respect to the distribution and design an efficient sequential sampling algorithm to learn the MPB…

    Submitted 20 April, 2024; v1 submitted 15 July, 2022; originally announced July 2022.

  21. arXiv:2104.14695  [pdf, other]

    stat.ME stat.AP

    Dynamic Gene Coexpression Analysis with Correlation Modeling

    Authors: Tae Hyun Kim, Dan Nicolae

    Abstract: In many transcriptomic studies, the correlation of genes might fluctuate with quantitative factors such as genetic ancestry. We propose a method that models the covariance between two variables to vary against a continuous covariate. For the bivariate case, the proposed score test statistic is computationally simple and robust to model misspecification of the covariance term. Subsequently, the met…

    Submitted 29 April, 2021; originally announced April 2021.

  22. arXiv:2103.00083  [pdf, other]

    stat.ML cs.LG

    Flexible Model Aggregation for Quantile Regression

    Authors: Rasool Fakoor, Taesup Kim, Jonas Mueller, Alexander J. Smola, Ryan J. Tibshirani

    Abstract: Quantile regression is a fundamental problem in statistical learning motivated by a need to quantify uncertainty in predictions, or to model a diverse population without being overly reductive. For instance, epidemiological forecasts, cost estimates, and revenue predictions all benefit from being able to quantify the range of possible values accurately. As such, many models have been developed for…

    Submitted 15 April, 2023; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Accepted at JMLR 2023

  23. arXiv:2101.02491  [pdf, ps, other]

    math.ST stat.ME

    Density Deconvolution with Non-Standard Error Distributions: Rates of Convergence and Adaptive Estimation

    Authors: Alexander Goldenshluger, Taeho Kim

    Abstract: It is a typical standard assumption in the density deconvolution problem that the characteristic function of the measurement error distribution is non-zero on the real line. While this condition is assumed in the majority of existing works on the topic, there are many problem instances of interest where it is violated. In this paper we focus on non-standard settings where the characteristic funct…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 32 pages

    MSC Class: 62G07; 62G20

  24. arXiv:2012.03501  [pdf, other]

    cs.LG stat.ML

    Adaptive Local Bayesian Optimization Over Multiple Discrete Variables

    Authors: Taehyeon Kim, Jaeyeon Ahn, Nakyil Kim, Seyoung Yun

    Abstract: In machine learning algorithms, the choice of hyperparameters is often more an art than a science, requiring labor-intensive search with expert experience. Therefore, automating hyperparameter optimization to exclude human intervention has great appeal, especially for black-box functions. Recently, there have been increasing demands of solving such concealed tasks for better general…

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: workshop at NeurIPS 2020 Competition Track on Black-Box Optimization Challenge

  25. arXiv:2010.01792  [pdf, other]

    cs.LG cs.CV cs.MA stat.ML

    Can we Generalize and Distribute Private Representation Learning?

    Authors: Sheikh Shams Azam, Taejin Kim, Seyyedali Hosseinalipour, Carlee Joe-Wong, Saurabh Bagchi, Christopher Brinton

    Abstract: We study the problem of learning representations that are private yet informative, i.e., provide information about intended "ally" targets while hiding sensitive "adversary" attributes. We propose Exclusion-Inclusion Generative Adversarial Network (EIGAN), a generalized private representation learning (PRL) architecture that accounts for multiple ally and adversary attributes unlike existing PRL s…

    Submitted 30 January, 2022; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

  26. arXiv:2007.02105  [pdf, other]

    stat.ME stat.AP

    Prediction Regions for Poisson and Over-Dispersed Poisson Regression Models with Applications to Forecasting Number of Deaths during the COVID-19 Pandemic

    Authors: T. Kim, B. Lieberman, G. Luta, E. Pena

    Abstract: Motivated by the current Coronavirus Disease (COVID-19) pandemic, which is due to the SARS-CoV-2 virus, and the important problem of forecasting daily deaths and cumulative deaths, this paper examines the construction of prediction regions or intervals under the Poisson regression model and for an over-dispersed Poisson regression model. For the Poisson regression model, several prediction regions…

    Submitted 6 July, 2020; v1 submitted 4 July, 2020; originally announced July 2020.

    Comments: There are 16 figures, some containing one to four plot panels. The appendix sections are supplementary materials. Without these supplementary materials, there are 35 pages in this manuscript

    MSC Class: Primary: 62J02; 62P99; Secondary: 62F99; 62M10

  27. arXiv:2006.09679  [pdf, other]

    cs.LG cs.CV stat.ML

    FrostNet: Towards Quantization-Aware Network Architecture Search

    Authors: Taehoon Kim, YoungJoon Yoo, Jihoon Yang

    Abstract: INT8 quantization has become one of the standard techniques for deploying convolutional neural networks (CNNs) on edge devices to reduce the memory and computational resource usages. By analyzing quantized performances of existing mobile-target network architectures, we can raise an issue regarding the importance of network architecture for optimal INT8 quantization. In this paper, we present a ne…

    Submitted 30 November, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  28. arXiv:2003.01860  [pdf, ps, other]

    stat.AP

    Designing a Bonus-Malus system reflecting the claim size under the dependent frequency-severity model

    Authors: Rosy Oh, Joseph H. T. Kim, Jae Youn Ahn

    Abstract: In auto insurance, a Bonus-Malus System (BMS) is commonly used as a posteriori risk classification mechanism to set the premium for the next contract period based on a policyholder's claim history. Even though recent literature reports evidence of a significant dependence between frequency and severity, the current BMS practice is to use a frequency-based transition rule while ignoring severity in…

    Submitted 3 March, 2020; originally announced March 2020.

  29. arXiv:2002.11903  [pdf, other]

    cs.LG stat.ML

    Acceleration of Actor-Critic Deep Reinforcement Learning for Visual Grasping in Clutter by State Representation Learning Based on Disentanglement of a Raw Input Image

    Authors: Taewon Kim, Yeseong Park, Youngbin Park, Il Hong Suh

    Abstract: For a robotic grasping task in which diverse unseen target objects exist in a cluttered environment, some deep learning-based methods have achieved state-of-the-art results using visual input directly. In contrast, actor-critic deep reinforcement learning (RL) methods typically perform very poorly when grasping diverse objects, especially when learning from raw images and sparse rewards. To make t…

    Submitted 26 February, 2020; originally announced February 2020.

  30. arXiv:1912.13366  [pdf, other]

    cs.LG cs.AI stat.ML

    Fast and Accurate Transferability Measurement for Heterogeneous Multivariate Data

    Authors: Seungcheol Park, Huiwen Xu, Taehun Kim, Inhwan Hwang, Kyung-Jun Kim, U Kang

    Abstract: Given a set of heterogeneous source datasets with their classifiers, how can we quickly find the most useful source dataset for a specific target task? We address the problem of measuring transferability between source and target datasets, where the source and the target have different feature spaces and distributions. We propose Transmeter, a fast and accurate method to estimate the transferabili…

    Submitted 29 January, 2021; v1 submitted 23 December, 2019; originally announced December 2019.

  31. arXiv:1912.04871  [pdf, other]

    cs.LG stat.ML

    Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients

    Authors: Brenden K. Petersen, Mikel Landajuela, T. Nathan Mundhenk, Claudio P. Santiago, Soo K. Kim, Joanne T. Kim

    Abstract: Discovering the underlying mathematical expressions describing a dataset is a core challenge for artificial intelligence. This is the problem of $\textit{symbolic regression}$. Despite recent advances in training neural networks to solve complex tasks, deep learning approaches to symbolic regression are underexplored. We propose a framework that leverages deep learning for symbolic regression via…

    Submitted 5 April, 2021; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Published at International Conference on Learning Representations, 2021

    Report number: LLNL-CONF-790457

    Journal ref: International Conference on Learning Representations, 2021

  32. arXiv:1912.03756  [pdf, other]

    stat.ME

    Improved Multiple Confidence Intervals via Thresholding Informed by Prior Information

    Authors: Taeho Kim, Edsel A. Pena

    Abstract: Consider a statistical problem where a set of parameters are of interest to a researcher. Then multiple confidence intervals can be constructed to infer the set of parameters simultaneously. The constructed multiple confidence intervals are the realization of a multiple interval estimator (MIE), the main focus of this study. In particular, a thresholding approach is introduced to improve the perfo…

    Submitted 8 December, 2019; originally announced December 2019.

    Comments: 34 pages and 7 figures

    MSC Class: 62F25; 62H12; 62H15

  33. arXiv:1910.00775  [pdf, other]

    cs.LG cs.AI stat.ML

    Variational Temporal Abstraction

    Authors: Taesup Kim, Sungjin Ahn, Yoshua Bengio

    Abstract: We introduce a variational approach to learning and inference of temporally hierarchical structure and representation for sequential data. We propose the Variational Temporal Abstraction (VTA), a hierarchical recurrent state space model that can infer the latent temporal structure and thus perform the stochastic state transition hierarchically. We also propose to apply this model to implement the…

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted in NeurIPS 2019

  34. arXiv:1906.05956  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Scalable Neural Architecture Search for 3D Medical Image Segmentation

    Authors: Sungwoong Kim, Ildoo Kim, Sungbin Lim, Woonhyuk Baek, Chiheon Kim, Hyungjoo Cho, Boogeon Yoon, Taesup Kim

    Abstract: In this paper, a neural architecture search (NAS) framework is proposed for 3D medical image segmentation, to automatically optimize a neural architecture from a large design space. Our NAS framework searches the structure of each layer including neural connectivities and operation types in both of the encoder and decoder. Since optimizing over a large discrete architecture space is difficult due…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: 9 pages, 3 figures

  35. arXiv:1906.04691  [pdf, other]

    cs.LG cs.CV stat.ML

    On Single Source Robustness in Deep Fusion Models

    Authors: Taewan Kim, Joydeep Ghosh

    Abstract: Algorithms that fuse multiple input sources benefit from both complementary and shared information. Shared information may provide robustness against faulty or noisy inputs, which is indispensable for safety-critical applications like self-driving cars. We investigate learning fusion algorithms that are robust against noise added to a single source. We first demonstrate that robustness against sin…

    Submitted 16 October, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted to NeurIPS 2019

  36. arXiv:1905.13536  [pdf, other]

    cs.CV cs.LG cs.PF eess.IV stat.ML

    Scaling Video Analytics on Constrained Edge Nodes

    Authors: Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor

    Abstract: As video camera deployments continue to grow, the need to process large volumes of real-time data strains wide area network infrastructure. When per-camera bandwidth is limited, it is infeasible for applications such as traffic monitoring and pedestrian tracking to offload high-quality video streams to a datacenter. This paper presents FilterForward, a new edge-to-cloud system that enables datacen…

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: This paper is an extended version of a paper with the same title published in the 2nd SysML Conference, SysML '19 (Canel et al., 2019)

  37. arXiv:1905.00397  [pdf, other]

    cs.LG cs.CV stat.ML

    Fast AutoAugment

    Authors: Sungbin Lim, Ildoo Kim, Taesup Kim, Chiheon Kim, Sungwoong Kim

    Abstract: Data augmentation is an essential technique for improving generalization ability of deep learning models. Recently, AutoAugment has been proposed as an algorithm to automatically search for augmentation policies from a dataset and has significantly enhanced performances on many image recognition tasks. However, its search method requires thousands of GPU hours even for a relatively small dataset.…

    Submitted 25 May, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: 8 pages, 2 figures

    Report number: NeurIPS/2019/12

  38. arXiv:1902.06562  [pdf, other]

    cs.LG eess.SP stat.ML

    Intra- and Inter-epoch Temporal Context Network (IITNet) Using Sub-epoch Features for Automatic Sleep Scoring on Raw Single-channel EEG

    Authors: Hogeon Seo, Seunghyeok Back, Seongju Lee, Deokhwan Park, Tae Kim, Kyoobin Lee

    Abstract: A deep learning model, named IITNet, is proposed to learn intra- and inter-epoch temporal contexts from raw single-channel EEG for automatic sleep scoring. To classify the sleep stage from half-minute EEG, called an epoch, sleep experts investigate sleep-related events and consider the transition rules between the found events. Similarly, IITNet extracts representative features at a sub-epoch leve…

    Submitted 10 June, 2020; v1 submitted 18 February, 2019; originally announced February 2019.

    Comments: First three authors contributed equally to this work; Accepted manuscript for Biomedical Signal Processing and Control (BSPC); 12 pages, 6 figures

  39. arXiv:1902.04224  [pdf, other]

    cs.LG stat.ML

    Effective Network Compression Using Simulation-Guided Iterative Pruning

    Authors: Dae-Woong Jeong, Jaehun Kim, Youngseok Kim, Tae-Ho Kim, Myungsu Chae

    Abstract: Existing high-performance deep learning models require very intensive computing. For this reason, it is difficult to embed a deep learning model into a system with limited resources. In this paper, we propose the novel idea of the network compression as a method to solve this limitation. The principle of this idea is to make iterative pruning more effective and sophisticated by simulating the redu…

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: Submitted to NIPS 2018 MLPCD2

    MSC Class: 68T05

  40. arXiv:1812.08997  [pdf, other]

    cs.LG stat.ML

    Stochastic Doubly Robust Gradient

    Authors: Kanghoon Lee, Jihye Choi, Moonsu Cha, Jung-Kwon Lee, Taeyoon Kim

    Abstract: When training a machine learning model with observational data, it is often encountered that some values are systemically missing. Learning from the incomplete data in which the missingness depends on some covariates may lead to biased estimation of parameters and even harm the fairness of decision outcome. This paper proposes how to adjust the causal effect of covariates on the missingness when t…

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: 9 pages, 2 figures

  41. arXiv:1812.02341  [pdf, other]

    cs.LG stat.ML

    Quantifying Generalization in Reinforcement Learning

    Authors: Karl Cobbe, Oleg Klimov, Chris Hesse, Taehoon Kim, John Schulman

    Abstract: In this paper, we investigate the problem of overfitting in deep reinforcement learning. Among the most common benchmarks in RL, it is customary to use the same environments for both training and testing. This practice offers relatively little insight into an agent's ability to generalize. We address this issue by using procedurally generated environments to construct distinct training and test se…

    Submitted 14 July, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

  42. arXiv:1810.02358  [pdf, other]

    cs.LG cs.CL cs.CV stat.ML

    Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

    Authors: Hyeonwoo Noh, Taehoon Kim, Jonghwan Mun, Bohyung Han

    Abstract: We study how to leverage off-the-shelf visual and linguistic data to cope with out-of-vocabulary answers in visual question answering task. Existing large-scale visual datasets with annotations such as image class labels, bounding boxes and region descriptions are good sources for learning rich and diverse visual concepts. However, it is not straightforward how the visual concepts can be captured…

    Submitted 7 April, 2019; v1 submitted 3 October, 2018; originally announced October 2018.

    Comments: CVPR 2019

  43. arXiv:1809.00758  [pdf]

    cs.LG cs.CV cs.SD eess.AS stat.ML

    End-to-end Multimodal Emotion and Gender Recognition with Dynamic Joint Loss Weights

    Authors: Myungsu Chae, Tae-Ho Kim, Young Hoon Shin, June-Woo Kim, Soo-Young Lee

    Abstract: Multi-task learning is a method for improving the generalizability of multiple tasks. In order to perform multiple classification tasks with one neural network model, the losses of each task should be combined. Previous studies have mostly focused on multiple prediction tasks using joint loss with static weights for training models, choosing the weights between tasks without making sufficient cons…

    Submitted 2 October, 2018; v1 submitted 3 September, 2018; originally announced September 2018.

    Comments: IROS 2018 Workshop on Crossmodal Learning for Intelligent Robotics

    MSC Class: 68T05

  44. arXiv:1806.03836  [pdf, other]

    cs.LG stat.ML

    Bayesian Model-Agnostic Meta-Learning

    Authors: Taesup Kim, Jaesik Yoon, Ousmane Dia, Sungwoong Kim, Yoshua Bengio, Sungjin Ahn

    Abstract: Learning to infer Bayesian posterior from a few-shot dataset is an important step towards robust meta-learning due to the model uncertainty inherent in the problem. In this paper, we propose a novel Bayesian model-agnostic meta-learning method. The proposed method combines scalable gradient-based meta-learning with nonparametric variational inference in a principled probabilistic framework. During…

    Submitted 18 November, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

    Comments: First two authors contributed equally. 15 pages with appendix including experimental details. Accepted in NIPS 2018

  45. arXiv:1806.02071  [pdf, other]

    cs.LG cs.GR physics.comp-ph physics.flu-dyn stat.ML

    Deep Fluids: A Generative Network for Parameterized Fluid Simulations

    Authors: Byungsoo Kim, Vinicius C. Azevedo, Nils Thuerey, Theodore Kim, Markus Gross, Barbara Solenthaler

    Abstract: This paper presents a novel generative model to synthesize fluid simulations from a set of reduced parameters. A convolutional neural network is trained on a collection of discrete, parameterizable fluid simulation velocity fields. Due to the capability of deep learning architectures to learn representative features of the data, our generative model is able to accurately approximate the training d…

    Submitted 1 February, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Computer Graphics Forum (Proceedings of EUROGRAPHICS 2019), additional materials: http://www.byungsoo.me/project/deep-fluids/

    Journal ref: Computer Graphics Forum (Proc. Eurographics), 38, 2 (2019), 59-70

  46. arXiv:1805.10724  [pdf, other]

    cs.LG cs.HC stat.ML

    RetainVis: Visual Analytics with Interpretable and Interactive Recurrent Neural Networks on Electronic Medical Records

    Authors: Bum Chul Kwon, Min-Je Choi, Joanne Taery Kim, Edward Choi, Young Bin Kim, Soonwook Kwon, Jimeng Sun, Jaegul Choo

    Abstract: We have recently seen many successful applications of recurrent neural networks (RNNs) on electronic medical records (EMRs), which contain histories of patients' diagnoses, medications, and other various events, in order to predict the current and future states of patients. Despite the strong performance of RNNs, it is often challenging for users to understand why the model makes a particular pred…

    Submitted 23 October, 2018; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: Accepted at IEEE VIS 2018. To appear in IEEE Transactions on Visualization and Computer Graphics in January 2019

  47. arXiv:1801.06700  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot (Short Version)

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including neural network and template-based…

    Submitted 20 January, 2018; originally announced January 2018.

    Comments: 9 pages, 1 figure, 2 tables; presented at NIPS 2017, Conversational AI: "Today's Practice and Tomorrow's Potential" Workshop

    ACM Class: I.5.1; I.2.7

  48. arXiv:1711.07433  [pdf, other]

    stat.ML cs.LG

    Relaxed Oracles for Semi-Supervised Clustering

    Authors: Taewan Kim, Joydeep Ghosh

    Abstract: Pairwise "same-cluster" queries are one of the most widely used forms of supervision in semi-supervised clustering. However, it is impractical to ask human oracles to answer every query correctly. In this paper, we study the influence of allowing "not-sure" answers from a weak oracle and propose an effective algorithm to handle such uncertainties in query responses. Two realistic weak oracle model…

    Submitted 20 November, 2017; originally announced November 2017.

    Comments: NIPS 2017 Workshop: Learning with Limited Labeled Data (LLD 2017)

  49. arXiv:1709.03202  [pdf, other]

    stat.ML cs.LG

    Semi-Supervised Active Clustering with Weak Oracles

    Authors: Taewan Kim, Joydeep Ghosh

    Abstract: Semi-supervised active clustering (SSAC) utilizes the knowledge of a domain expert to cluster data points by interactively making pairwise "same-cluster" queries. However, it is impractical to ask human oracles to answer every pairwise query. In this paper, we study the influence of allowing "not-sure" answers from a weak oracle and propose algorithms to efficiently handle uncertainties. Different…

    Submitted 10 September, 2017; originally announced September 2017.

  50. arXiv:1709.02349  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeshwar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including template-based models, bag-of-wor…

    Submitted 5 November, 2017; v1 submitted 7 September, 2017; originally announced September 2017.

    Comments: 40 pages, 9 figures, 11 tables

    ACM Class: I.5.1; I.2.7