Skip to main content

Showing 1–25 of 25 results for author: Kumagai, A

.
  1. arXiv:2506.10259  [pdf, ps, other

    cs.LG stat.ML

    Meta-learning Representations for Learning from Multiple Annotators

    Authors: Atsutoshi Kumagai, Tomoharu Iwata, Taishi Nishiyama, Yasutoshi Ida, Yasuhiro Fujiwara

    Abstract: We propose a meta-learning method for learning from multiple noisy annotators. In many applications such as crowdsourcing services, labels for supervised learning are given by multiple annotators. Since the annotators have different skills or biases, given labels can be noisy. To learn accurate classifiers, existing methods require many noisy annotated data. However, sufficient data might be unava… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 24 pages

  2. arXiv:2503.03789  [pdf, other

    cs.LG cs.AI stat.ML

    Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation

    Authors: Hiroshi Takahashi, Tomoharu Iwata, Atsutoshi Kumagai, Yuuki Yamanaka, Tomoya Yamashita

    Abstract: Diffusion models are powerful generative models but often generate sensitive data that are unwanted by users, mainly because the unlabeled training data frequently contain such sensitive data. Since labeling all sensitive data in the large-scale unlabeled training data is impractical, we address this problem by using a small amount of labeled sensitive data. In this paper, we propose positive-unla… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Accepted at ICLR2025. Code is available at https://github.com/takahashihiroshi/pudm

  3. arXiv:2410.03263  [pdf, other

    cs.LG cs.AI

    Test-time Adaptation for Regression by Subspace Alignment

    Authors: Kazuki Adachi, Shin'ya Yamaguchi, Atsutoshi Kumagai, Tomoki Hamagami

    Abstract: This paper investigates test-time adaptation (TTA) for regression, where a regression model pre-trained in a source domain is adapted to an unknown target distribution with unlabeled target data. Although regression is one of the fundamental tasks in machine learning, most of the existing TTA methods have classification-specific designs, which assume that models output class-categorical prediction… ▽ More

    Submitted 22 January, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR 2025

  4. arXiv:2406.03680  [pdf, other

    cs.LG stat.ML

    Meta-learning for Positive-unlabeled Classification

    Authors: Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

    Abstract: We propose a meta-learning method for positive and unlabeled (PU) classification, which improves the performance of binary classifiers obtained from only PU data in unseen target tasks. PU learning is an important problem since PU data naturally arise in real-world applications such as outlier detection and information retrieval. Existing PU learning methods require many PU data, but sufficient da… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 21 pages

  5. arXiv:2405.18929  [pdf, other

    stat.ML cs.AI cs.LG

    Deep Positive-Unlabeled Anomaly Detection for Contaminated Unlabeled Data

    Authors: Hiroshi Takahashi, Tomoharu Iwata, Atsutoshi Kumagai, Yuuki Yamanaka

    Abstract: Semi-supervised anomaly detection, which aims to improve the anomaly detection performance by using a small amount of labeled anomaly data in addition to unlabeled data, has attracted attention. Existing semi-supervised approaches assume that most unlabeled data are normal, and train anomaly detectors by minimizing the anomaly scores for the unlabeled data while maximizing those for the labeled an… ▽ More

    Submitted 8 February, 2025; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Under revirew. Code is available at https://github.com/takahashihiroshi/pusvdd

  6. arXiv:2402.04051  [pdf, other

    cs.LG

    Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods

    Authors: Akira Ito, Masanori Yamada, Atsutoshi Kumagai

    Abstract: Recently, Ainsworth et al. showed that using weight matching (WM) to minimize the $L^2$ distance in a permutation search of model parameters effectively identifies permutations that satisfy linear mode connectivity (LMC), where the loss along a linear path between two independently trained models with different seeds remains nearly constant. This paper analyzes LMC using WM, which is useful for un… ▽ More

    Submitted 7 April, 2025; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR 2025)

  7. arXiv:2312.07952  [pdf, other

    stat.ML cs.AI cs.LG

    Meta-learning to Calibrate Gaussian Processes with Deep Kernels for Regression Uncertainty Estimation

    Authors: Tomoharu Iwata, Atsutoshi Kumagai

    Abstract: Although Gaussian processes (GPs) with deep kernels have been successfully used for meta-learning in regression tasks, its uncertainty estimation performance can be poor. We propose a meta-learning method for calibrating deep kernel GPs for improving regression uncertainty estimation performance with a limited number of training data. The proposed method meta-learns how to calibrate uncertainty us… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  8. arXiv:2311.05088  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-learning of semi-supervised learning from tasks with heterogeneous attribute spaces

    Authors: Tomoharu Iwata, Atsutoshi Kumagai

    Abstract: We propose a meta-learning method for semi-supervised learning that learns from multiple tasks with heterogeneous attribute spaces. The existing semi-supervised meta-learning methods assume that all tasks share the same attribute space, which prevents us from learning with a wide variety of tasks. With the proposed method, the expected test performance on tasks with a small amount of labeled data… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  9. arXiv:2307.13899  [pdf, other

    cs.LG cs.AI cs.CV

    Regularizing Neural Networks with Meta-Learning Generative Models

    Authors: Shin'ya Yamaguchi, Daiki Chijiwa, Sekitoshi Kanai, Atsutoshi Kumagai, Hisashi Kashima

    Abstract: This paper investigates methods for improving generative data augmentation for deep learning. Generative data augmentation leverages the synthetic samples produced by generative models as an additional dataset for classification with small dataset settings. A key challenge of generative data augmentation is that the synthetic data contain uninformative samples that degrade accuracy. This is becaus… ▽ More

    Submitted 23 October, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2023

  10. arXiv:2303.07597  [pdf, other

    cs.LG stat.ML

    Fast Regularized Discrete Optimal Transport with Group-Sparse Regularizers

    Authors: Yasutoshi Ida, Sekitoshi Kanai, Kazuki Adachi, Atsutoshi Kumagai, Yasuhiro Fujiwara

    Abstract: Regularized discrete optimal transport (OT) is a powerful tool to measure the distance between two discrete distributions that have been constructed from data samples on two different domains. While it has a wide range of applications in machine learning, in some cases the sampled data from only one of the domains will have class labels such as unsupervised domain adaptation. In this kind of probl… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: This is an extended version of the paper accepted by the 37th AAAI Conference on Artificial Intelligence (AAAI 2023)

  11. arXiv:2206.09543  [pdf, other

    stat.ML cs.AI cs.LG

    Meta-learning for Out-of-Distribution Detection via Density Estimation in Latent Space

    Authors: Tomoharu Iwata, Atsutoshi Kumagai

    Abstract: Many neural network-based out-of-distribution (OoD) detection methods have been proposed. However, they require many training data for each target task. We propose a simple yet effective meta-learning method to detect OoD with small in-distribution data in a target task. With the proposed method, the OoD detection is performed by density estimation in a latent space. A neural network shared among… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

  12. arXiv:2205.15619  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Atsutoshi Kumagai, Yasutoshi Ida

    Abstract: Few-shot learning for neural networks (NNs) is an important problem that aims to train NNs with a few data. The main challenge is how to avoid overfitting since over-parameterized NNs can easily overfit to such small dataset. Previous work (e.g. MAML by Finn et al. 2017) tackles this challenge by meta-learning, which learns how to learn from a few data by using various tasks. On the other hand, on… ▽ More

    Submitted 9 February, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  13. arXiv:2204.13263  [pdf, other

    cs.LG

    Covariance-aware Feature Alignment with Pre-computed Source Statistics for Test-time Adaptation to Multiple Image Corruptions

    Authors: Kazuki Adachi, Shin'ya Yamaguchi, Atsutoshi Kumagai

    Abstract: Real-world image recognition systems often face corrupted input images, which cause distribution shifts and degrade the performance of models. These systems often use a single prediction model in a central server and process images sent from various environments, such as cameras distributed in cities or cars. Such single models face images corrupted in heterogeneous ways in test time. Thus, they r… ▽ More

    Submitted 29 June, 2023; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Extended version of the paper accepted to ICIP 2023

  14. arXiv:2204.12833  [pdf, other

    cs.LG cs.AI stat.ML

    Transfer Learning with Pre-trained Conditional Generative Models

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Atsutoshi Kumagai, Daiki Chijiwa, Hisashi Kashima

    Abstract: Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and (iii) target network architectures are consistent with source ones. However, holding these assumptions is difficult in practical settings because the target task ra… ▽ More

    Submitted 20 February, 2025; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Accepted by Machine Learning

    Journal ref: Machine Learning 114, 96 (2025)

  15. arXiv:2111.03282  [pdf, other

    stat.ML cs.LG

    Recurrent Neural Networks for Learning Long-term Temporal Dependencies with Reanalysis of Time Scale Representation

    Authors: Kentaro Ohno, Atsutoshi Kumagai

    Abstract: Recurrent neural networks with a gating mechanism such as an LSTM or GRU are powerful tools to model sequential data. In the mechanism, a forget gate, which was introduced to control information flow in a hidden state in the RNN, has recently been re-interpreted as a representative of the time scale of the state, i.e., a measure how long the RNN retains information on inputs. On the basis of this… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 8 pages, 5 figures, IEEE ICBK 2021

    ACM Class: I.2.6

  16. arXiv:2107.02087  [pdf, other

    physics.app-ph

    Experimental demonstration of a mmWave passive access point extender based on a binary reconfigurable intelligent surface

    Authors: Vladislav Popov, Mikhail Odit, Jean-Baptiste Gros, Vladimir Lenets, Akira Kumagai, Mathias Fink, Kotaro Enomoto, Geoffroy Lerosey

    Abstract: As data rates demands are exploding, 5G will soon rely on mmWaves that offer much higher bandwidths. Yet at these frequencies, attenuation and diffraction of waves require point to point communications with beamforming base stations that are complex and power greedy. Furthermore, since any obstacle at these frequencies completely blocks the waves, the networks must be extremely dense, resulting in… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

  17. arXiv:2107.00816  [pdf, other

    cs.LG stat.ML

    Few-shot Learning for Unsupervised Feature Selection

    Authors: Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

    Abstract: We propose a few-shot learning method for unsupervised feature selection, which is a task to select a subset of relevant features in unlabeled data. Existing methods usually require many instances for feature selection. However, sufficient instances are often unavailable in practice. The proposed method can select a subset of relevant features in a target task given a few unlabeled target instance… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 20 pages

  18. arXiv:2107.00801  [pdf, other

    stat.ML cs.LG

    Meta-Learning for Relative Density-Ratio Estimation

    Authors: Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

    Abstract: The ratio of two probability densities, called a density-ratio, is a vital quantity in machine learning. In particular, a relative density-ratio, which is a bounded extension of the density-ratio, has received much attention due to its stability and has been used in various applications such as outlier detection and dataset comparison. Existing methods for (relative) density-ratio estimation (DRE)… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 17 pages

  19. arXiv:2103.00684  [pdf, other

    stat.ML cs.LG

    Meta-learning One-class Classifiers with Eigenvalue Solvers for Supervised Anomaly Detection

    Authors: Tomoharu Iwata, Atsutoshi Kumagai

    Abstract: Neural network-based anomaly detection methods have shown to achieve high performance. However, they require a large amount of training data for each task. We propose a neural network-based meta-learning method for supervised anomaly detection. The proposed method improves the anomaly detection performance on unseen tasks, which contains a few labeled normal and anomalous instances, by meta-traini… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

  20. arXiv:2102.02950  [pdf, other

    stat.ML cs.AI cs.LG

    Adversarial Training Makes Weight Loss Landscape Sharper in Logistic Regression

    Authors: Masanori Yamada, Sekitoshi Kanai, Tomoharu Iwata, Tomokatsu Takahashi, Yuki Yamanaka, Hiroshi Takahashi, Atsutoshi Kumagai

    Abstract: Adversarial training is actively studied for learning robust models against adversarial examples. A recent study finds that adversarially trained models degenerate generalization performance on adversarial examples when their weight loss landscape, which is loss changes with respect to weights, is sharp. Unfortunately, it has been experimentally shown that adversarial training sharpens the weight… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 9 pages, 5 figures

  21. arXiv:2009.14379  [pdf, other

    stat.ML cs.LG

    Few-shot Learning for Time-series Forecasting

    Authors: Tomoharu Iwata, Atsutoshi Kumagai

    Abstract: Time-series forecasting is important for many applications. Forecasting models are usually trained using time-series data in a specific target task. However, sufficient data in the target task might be unavailable, which leads to performance degradation. In this paper, we propose a few-shot learning method that forecasts a future value of a time-series in a target task given a few time-series in t… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  22. arXiv:2002.12011  [pdf, other

    stat.ML cs.LG

    Semi-supervised Anomaly Detection on Attributed Graphs

    Authors: Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

    Abstract: We propose a simple yet effective method for detecting anomalous instances on an attribute graph with label information of a small number of instances. Although with standard anomaly detection methods it is usually assumed that instances are independent and identically distributed, in many real-world applications, instances are often explicitly connected with each other, resulting in so-called att… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 10 pages

  23. arXiv:1807.02927  [pdf, other

    stat.ML cs.LG

    Zero-shot Domain Adaptation without Domain Semantic Descriptors

    Authors: Atsutoshi Kumagai, Tomoharu Iwata

    Abstract: We propose a method to infer domain-specific models such as classifiers for unseen domains, from which no data are given in the training phase, without domain semantic descriptors. When training and test distributions are different, standard supervised learning methods perform poorly. Zero-shot domain adaptation attempts to alleviate this problem by inferring models that generalize well to unseen… ▽ More

    Submitted 8 July, 2018; originally announced July 2018.

    Comments: 10 pages, 10 figures

  24. arXiv:cond-mat/0006097  [pdf, ps, other

    cond-mat.supr-con cond-mat.str-el

    Cooper Pair Formation in U(1) Gauge Theory of High Temperature Superconductivity

    Authors: Atsuya Kumagai, Masahiko Hayashi, Hiromichi Ebisawa

    Abstract: We study the two-dimensional spin-charge separated Ginzburg-Landau theory containing U(1) gauge interactions as a semi-phenomenological model describing fluctuating condensates in high temperature superconductivity. Transforming the original GL action, we abstract the effective action of Cooper pair. Especially, we clarify how Cooper pair correlation evolves in the normal state from the point of… ▽ More

    Submitted 7 June, 2000; originally announced June 2000.

    Comments: 4 pages, 5 figures included, submitted to J. Phys. Soc. Jpn

    Journal ref: J.Phys.Soc.Jap. 70 (2001) 509-512

  25. Dynamical Induction of s-wave Component in d-wave Superconductor Driven by Thermal Fluctuations

    Authors: Atsuya Kumagai, Hiromichi Ebisawa

    Abstract: We investigated the mutual induction effects between the d-wave and the s-wave components of order parameters due to superconducting fluctuation above the critical temperatures and calculated its contributions to paraconductivity and excess Hall conductivity based on the two-component stochastic TDGL equation. It is shown that the coupling of two components increases paraconductivity while it de… ▽ More

    Submitted 13 June, 1998; originally announced June 1998.

    Comments: 10 pages, 4 figures included, submitted to J.Phys.Soc.Jpn