Showing 1–50 of 155 results for author: Yamaguchi, S

  1. arXiv:2505.12912  [pdf, ps, other]

    cs.CV

    Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image Corruption

    Authors: Kazuki Adachi, Shin'ya Yamaguchi, Tomoki Hamagami

    Abstract: Pre-trained vision-language models such as contrastive language-image pre-training (CLIP) have demonstrated a remarkable generalizability, which has enabled a wide range of applications represented by zero-shot classification. However, vision-language models still suffer when they face datasets with large gaps from training ones, i.e., distribution shifts. We found that CLIP is especially vulnerab…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Code is available at https://github.com/kzkadc/uninfo

  2. arXiv:2504.17562  [pdf, other]

    cs.CL cs.LG

    When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars

    Authors: Rei Higuchi, Ryotaro Kawata, Naoki Nishikawa, Kazusato Oko, Shoichiro Yamaguchi, Sosuke Kobayashi, Seiya Tokui, Kohei Hayashi, Daisuke Okanohara, Taiji Suzuki

    Abstract: The ability to acquire latent semantics is one of the key properties that determines the performance of language models. One convenient approach to invoke this ability is to prepend metadata (e.g. URLs, domains, and styles) at the beginning of texts in the pre-training data, making it easier for the model to access latent semantics before observing the entire text. Previous studies have reported t…

    Submitted 24 April, 2025; originally announced April 2025.

  3. arXiv:2504.12717  [pdf, other]

    cs.CV cs.AI cs.LG

    Post-pre-training for Modality Alignment in Vision-Language Foundation Models

    Authors: Shin'ya Yamaguchi, Dewei Feng, Sekitoshi Kanai, Kazuki Adachi, Daiki Chijiwa

    Abstract: Contrastive language image pre-training (CLIP) is an essential component of building modern vision-language foundation models. While CLIP demonstrates remarkable zero-shot performance on downstream tasks, the multi-modal feature spaces still suffer from a modality gap, which is a gap between image and text feature clusters and limits downstream task performance. Although existing works attempt to…

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: Accepted to CVPR 2025; Code: https://github.com/yshinya6/clip-refine

  4. arXiv:2504.08375  [pdf, other]

    hep-th cond-mat.str-el nlin.SI

    Boundary Scattering and Non-invertible Symmetries in 1+1 Dimensions

    Authors: Soichiro Shimamori, Satoshi Yamaguchi

    Abstract: Recent studies by Copetti, Córdova and Komatsu have revealed that when non-invertible symmetries are spontaneously broken, the conventional crossing relation of the S-matrix is modified by the effects of the corresponding topological quantum field theory (TQFT). In this paper, we extend these considerations to $(1+1)$-dimensional quantum field theories (QFTs) with boundaries. In the presence of a…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 28 pages

    Report number: OU-HET 1270

  5. arXiv:2503.23921  [pdf, other]

    hep-th hep-lat math.KT

    $K$-theoretic computation of the Atiyah(-Patodi)-Singer index of lattice Dirac operators

    Authors: Shoto Aoki, Hidenori Fukaya, Mikio Furuta, Shinichiroh Matsuo, Tetsuya Onogi, Satoshi Yamaguchi

    Abstract: We show that the Wilson Dirac operator in lattice gauge theory can be identified as a mathematical object in $K$-theory and that its associated spectral flow is equal to the index. In comparison to the standard lattice Dirac operator index, our formulation does not require the Ginsparg-Wilson relation and has broader applicability to systems with boundaries and to the mod-two version of the indice…

    Submitted 15 April, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

    Comments: 19 pages, 6 figures, minor corrections and references added

    Report number: OU-HET-1267

  6. arXiv:2503.23373  [pdf, ps, other]

    math.AG

    On the fundamental group of the regular part of Fujiki's compact Kahler symplectic orbifolds

    Authors: Shun Yamaguchi

    Abstract: We calculate the fundamental group of the regular part of certain compact Kahler symplectic orbifolds constructed by Fujiki, called Fujiki's examples. We determine which one is an irreducible symplectic orbifold among Fujiki's examples. This answers a question posed by A. Perego.

    Submitted 30 March, 2025; originally announced March 2025.

  7. arXiv:2502.16013  [pdf, ps, other]

    cond-mat.str-el cond-mat.supr-con

    Proximity-Induced Nodal Metal in an Extremely Underdoped CuO$_2$ Plane in Triple-Layer Cuprates

    Authors: Shin-ichiro Ideta, Shintaro Adachi, Takashi Noji, Shunpei Yamaguchi, Nae Sasaki, Shigeyuki Ishida, Shin-ichi Uchida, Takenori Fujii, Takao Watanabe, Wen O. Wang, Brian Moritz, Thomas P. Devereaux, Masashi Arita, Chung-Yu Mou, Teppei Yoshida, Kiyohisa Tanaka, Ting-Kuo Lee, Atsushi Fujimori

    Abstract: ARPES studies have established that the high-$T_c$ cuprates with single and double CuO$_2$ layers evolve from the Mott insulator to the pseudogap state with a Fermi arc, on which the superconducting (SC) gap opens. In four- to six-layer cuprates, on the other hand, small hole Fermi pockets are formed in the innermost CuO$_2$ planes, indicating antiferromagnetism. Here, we performed ARPES studies o…

    Submitted 21 February, 2025; originally announced February 2025.

  8. arXiv:2502.09018  [pdf, other]

    cs.LG cs.AI cs.CV

    Zero-shot Concept Bottleneck Models

    Authors: Shin'ya Yamaguchi, Kosuke Nishida, Daiki Chijiwa, Yasutoshi Ida

    Abstract: Concept bottleneck models (CBMs) are inherently interpretable and intervenable neural network models, which explain their final label prediction by the intermediate prediction of high-level semantic concepts. However, they require target task training to learn input-to-concept and concept-to-label mappings, incurring target dataset collections and training resources. In this paper, we present \tex…

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 14 pages, 8 figures

  9. arXiv:2501.17620  [pdf, ps, other]

    math.FA

    The distance in Morrey spaces to $C^{\infty}_{\mathrm{comp}}$

    Authors: Satoshi Yamaguchi

    Abstract: In this paper we characterize the distance between the function $f$ and the set $C^{\infty}_{\mathrm{comp}}(\mathbb{R}^d)$ in generalized Morrey spaces $L_{p,φ}(\mathbb{R}^d)$ with variable growth condition. We also prove that the bi-dual of $\overline{C^{\infty}_{\mathrm{comp}}(\mathbb{R}^d)}^{L_{p,φ}(\mathbb{R}^d)}$ is $L_{p,φ}(\mathbb{R}^d)$. As an application of the characterization of the dis…

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: 25 pages

    MSC Class: 42B35 (Primary) 46E30; 46B10; 42B20 (Secondary)

  10. arXiv:2501.11014  [pdf]

    eess.IV cs.CV

    Transfer Learning Strategies for Pathological Foundation Models: A Systematic Evaluation in Brain Tumor Classification

    Authors: Ken Enda, Yoshitaka Oda, Zen-ichi Tanei, Kenichi Satoh, Hiroaki Motegi, Terasaka Shunsuke, Shigeru Yamaguchi, Takahiro Ogawa, Wang Lei, Masumi Tsuda, Shinya Tanaka

    Abstract: Foundation models pretrained on large-scale pathology datasets have shown promising results across various diagnostic tasks. Here, we present a systematic evaluation of transfer learning strategies for brain tumor classification using these models. We analyzed 254 cases comprising five major tumor types: glioblastoma, astrocytoma, oligodendroglioma, primary central nervous system lymphoma, and met…

    Submitted 7 April, 2025; v1 submitted 19 January, 2025; originally announced January 2025.

    Comments: 25 pages, 7 figures

    MSC Class: 62M45; 62P10; 68T07

    ACM Class: I.2.6; I.5.4; J.3

  11. arXiv:2501.07935  [pdf, ps, other]

    hep-th cond-mat.str-el

    Exotic massive fermionic systems with huge vacuum degeneracy at boundaries

    Authors: Hiroki Kawakami, Satoshi Yamaguchi

    Abstract: We investigate a massive non-relativistic fermionic system exhibiting exotic features. When the mass parameter is set to zero, the system acquires the fermionic subsystem symmetry. Introducing the mass term explicitly breaks this symmetry, resulting in a trivially gapped system in the absence of boundaries. We demonstrate that with the introduction of boundaries, the system remains gapped, but it…

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 13 pages, no figures

    Report number: OU-HET 1259

  12. arXiv:2501.02873  [pdf, other]

    hep-lat cond-mat.str-el hep-th math.KT

    $η$ invariant of massive Wilson Dirac operator and the index

    Authors: Shoto Aoki, Hidenori Fukaya, Mikio Furuta, Shinichiroh Matsuo, Tetsuya Onogi, Satoshi Yamaguchi

    Abstract: We revisit the lattice index theorem from the perspective of $K$-theory. The standard definition given by the overlap Dirac operator equals the $η$ invariant of the Wilson Dirac operator with a negative mass. This equality is not coincidental but reflects a mathematically profound significance known as the suspension isomorphism of $K$-groups. Specifically, we identify the Wilson Dirac operator a…

    Submitted 27 January, 2025; v1 submitted 6 January, 2025; originally announced January 2025.

    Comments: 10 pages, 2 figures, Contribution to the 41st International Symposium on Lattice Field Theory (LATTICE2024), 28 July - 3 August 2024, Liverpool, UK, minor corrections

    Report number: OU-HET-1257

  13. arXiv:2412.08343  [pdf, other]

    cs.GR cs.SD eess.AS

    SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering

    Authors: Hiroki Nishizawa, Keitaro Tanaka, Asuka Hirata, Shugo Yamaguchi, Qi Feng, Masatoshi Hamanaka, Shigeo Morishima

    Abstract: Automatically generating realistic musical performance motion can greatly enhance digital media production, often involving collaboration between professionals and musicians. However, capturing the intricate body, hand, and finger movements required for accurate musical performances is challenging. Existing methods often fall short due to the complex mapping between audio and motion, typically req…

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: 10 pages, 7 figures, 6 tables, WACV 2025

  14. arXiv:2412.08056  [pdf, ps, other]

    hep-th

    Anomalies and D-branes in the Dabholkar-Park background

    Authors: Hiroki Wada, Satoshi Yamaguchi

    Abstract: We consider D-branes in the Dabholkar-Park (DP) background, a $9$d orientifold theory obtained by gauging symmetry in the type IIB string theory compactified on a circle. Using anomalies in the world-sheet theory, we provide physical insights into the classification of stable D-branes by relative KR-theory. The nature, such as stability, of D-branes wrapping along the compactified circle can be ex…

    Submitted 29 April, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: 23 pages

    Report number: OU-HET-1252

  15. arXiv:2411.00381  [pdf, other]

    cs.HC

    Tappy Plugin for Figma: Predicting Tap Success Rates of User-Interface Elements under Development for Smartphones

    Authors: Shota Yamanaka, Hiroki Usuba, Junichi Sato, Naomi Sasaya, Fumiya Yamashita, Shuji Yamaguchi

    Abstract: Tapping buttons and hyperlinks on smartphones is a fundamental operation, but users sometimes fail to tap user-interface (UI) elements. Such mistakes degrade usability, and thus it is important for designers to configure UI elements so that users can accurately select them. To support designers in setting a UI element with an intended tap success rate, we developed a plugin for Figma, which is mod…

    Submitted 1 November, 2024; originally announced November 2024.

  16. arXiv:2410.03263  [pdf, other]

    cs.LG cs.AI

    Test-time Adaptation for Regression by Subspace Alignment

    Authors: Kazuki Adachi, Shin'ya Yamaguchi, Atsutoshi Kumagai, Tomoki Hamagami

    Abstract: This paper investigates test-time adaptation (TTA) for regression, where a regression model pre-trained in a source domain is adapted to an unknown target distribution with unlabeled target data. Although regression is one of the fundamental tasks in machine learning, most of the existing TTA methods have classification-specific designs, which assume that models output class-categorical prediction…

    Submitted 22 January, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR 2025

  17. arXiv:2409.17663  [pdf, other]

    cs.AI cs.CV cs.LG

    Explanation Bottleneck Models

    Authors: Shin'ya Yamaguchi, Kosuke Nishida

    Abstract: Recent concept-based interpretable models have succeeded in providing meaningful explanations by pre-defined concept sets. However, the dependency on the pre-defined concepts restricts the application because of the limited number of concepts for explanations. This paper proposes a novel interpretable deep neural network called explanation bottleneck models (XBMs). XBMs generate a text explanation…

    Submitted 18 February, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted to AAAI 2025 (Oral)

  18. arXiv:2409.07102  [pdf, other]

    cs.SI

    DisasterNeedFinder: Understanding the Information Needs in the 2024 Noto Earthquake (Comprehensive Explanation)

    Authors: Kota Tsubouchi, Shuji Yamaguchi, Keijirou Saitou, Akihisa Soemori, Masato Morita, Shigeki Asou

    Abstract: We propose and demonstrate the DisasterNeedFinder framework in order to provide appropriate information support for the Noto Peninsula Earthquake. In the event of a large-scale disaster, it is essential to accurately capture the ever-changing information needs. However, it is difficult to obtain appropriate information from the chaotic situation on the ground. Therefore, as a data-driven approach,…

    Submitted 11 September, 2024; originally announced September 2024.

  19. arXiv:2408.16261  [pdf, other]

    cs.LG cs.AI

    Evaluating Time-Series Training Dataset through Lens of Spectrum in Deep State Space Models

    Authors: Sekitoshi Kanai, Yasutoshi Ida, Kazuki Adachi, Mihiro Uchida, Tsukasa Yoshida, Shin'ya Yamaguchi

    Abstract: This study investigates a method to evaluate time-series datasets in terms of the performance of deep neural networks (DNNs) with state space models (deep SSMs) trained on the dataset. SSMs have attracted attention as components inside DNNs to address time-series data. Since deep SSMs have powerful representation capacities, training datasets play a crucial role in solving a new task. However, the…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 11 pages, 5 figures

  20. arXiv:2408.03080  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Correlation between $T_{\mathrm{c}}$ and the Pseudogap Observed in the Optical Spectra of High $T_{\mathrm{c}}$ Superconducting Cuprates

    Authors: Setsuko Tajima, Yuhta Itoh, Katsuya Mizutamari, Shigeki Miyasaka, Masamichi Nakajima, Nae Sasaki, Shunpei Yamaguchi, Kei-ichi Harada, Takao Watanabe

    Abstract: We studied the temperature dependences of the optical spectra for optimally and underdoped Bi$_2$Sr$_2$Ca$_2$Cu$_3$O$_{10+z}$ single crystals. Similarly to the other cuprates' cases, a gap-like conductivity suppression was observed with reducing the temperature from above $T_{\mathrm{c}}$, creating a peak in the conductivity spectrum. The conductivity peak energy was insensitive to the doping leve…

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 12 pages, 5 figures

    Journal ref: J. Phys. Soc. Jpn. 93, 103701 (2024)

  21. arXiv:2407.17708  [pdf, ps, other]

    math.KT cond-mat.str-el hep-lat hep-th math.DG

    The index of lattice Dirac operators and $K$-theory

    Authors: Shoto Aoki, Hidenori Fukaya, Mikio Furuta, Shinichiroh Matsuo, Tetsuya Onogi, Satoshi Yamaguchi

    Abstract: We mathematically show an equality between the index of a Dirac operator on a flat continuum torus and the $η$ invariant of a lattice Dirac operator known as the Wilson Dirac operator with a negative mass when the lattice spacing is sufficiently small. Unlike the standard approach, our formulation using $K$-theory does not require modified chiral symmetry on the lattice. We prove that a one-parame…

    Submitted 2 June, 2025; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: 52 pages, 3 figures, some refinement in introduction, minor corrections about mathematical subtleties in sec.2 and sec.3 with additional references

    Report number: OH-HET-1236

  22. Effects of vortex and antivortex excitations in underdoped Bi$_2$Sr$_2$Ca$_2$Cu$_3$O$_{10+δ}$ bulk single crystals

    Authors: Takao Watanabe, Kenta Kosugi, Nae Sasaki, Shunpei Yamaguchi, Takenori Fujii, Ken Hayama, Itsuhiro Kakeya, Toshimitsu Ito

    Abstract: The observance of vortex and anti-vortex effects in bulk crystals can prove the existence of phase-disordered superconductivity in the bulk. To gain insights into the mechanisms that govern superconducting transition in copper oxide high-transition temperature ($T_c$) superconductors, this study investigated the transport properties of underdoped Bi$_2$Sr$_2$Ca$_2$Cu$_3$O$_{10+δ}$ (Bi-2223) bulk s…

    Submitted 10 October, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: 14 pages, 11 figures, revised according to the referees' comments

    Journal ref: Phys. Rev. B 110, 134509 (2024)

  23. arXiv:2403.17423  [pdf, other]

    cs.CV stat.ML

    Test-time Adaptation Meets Image Enhancement: Improving Accuracy via Uncertainty-aware Logit Switching

    Authors: Shohei Enomoto, Naoya Hasegawa, Kazuki Adachi, Taku Sasaki, Shin'ya Yamaguchi, Satoshi Suzuki, Takeharu Eda

    Abstract: Deep neural networks have achieved remarkable success in a variety of computer vision applications. However, there is a problem of degrading accuracy when the data distribution shifts between training and testing. As a solution to this problem, Test-time Adaptation (TTA) has been well studied because of its practicality. Although TTA methods increase accuracy under distribution shift by updating t…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to IJCNN2024

  24. arXiv:2403.14114  [pdf, other]

    cs.CV

    Test-time Similarity Modification for Person Re-identification toward Temporal Distribution Shift

    Authors: Kazuki Adachi, Shohei Enomoto, Taku Sasaki, Shin'ya Yamaguchi

    Abstract: Person re-identification (re-id), which aims to retrieve images of the same person in a given image from a database, is one of the most practical image recognition applications. In the real world, however, the environments that the images are taken from change over time. This causes a distribution shift between training and testing and degrades the performance of re-id. To maintain re-id performan…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted to IJCNN2024

  25. arXiv:2403.10097  [pdf, other]

    cs.LG cs.AI cs.CV

    Adaptive Random Feature Regularization on Fine-tuning Deep Neural Networks

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Kazuki Adachi, Daiki Chijiwa

    Abstract: While fine-tuning is a de facto standard method for training deep neural networks, it still suffers from overfitting when using small target datasets. Previous methods improve fine-tuning performance by maintaining knowledge of the source datasets or introducing regularization terms such as contrastive loss. However, these methods require auxiliary source information (e.g., source labels or datase…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  26. arXiv:2311.13090  [pdf, other]

    cs.AI cs.CV

    On the Limitation of Diffusion Models for Synthesizing Training Datasets

    Authors: Shin'ya Yamaguchi, Takuma Fukuda

    Abstract: Synthetic samples from diffusion models are promising for leveraging in training discriminative models as replications of real training datasets. However, we found that the synthetic datasets degrade classification performance over real datasets even when using state-of-the-art diffusion models. This means that modern diffusion models do not perfectly represent the data distribution for the purpos…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023 SyntheticData4ML Workshop

  27. arXiv:2310.03913  [pdf, other]

    cs.RO

    TRAIL Team Description Paper for RoboCup@Home 2023

    Authors: Chikaha Tsuji, Dai Komukai, Mimo Shirasaka, Hikaru Wada, Tsunekazu Omija, Aoi Horo, Daiki Furuta, Saki Yamaguchi, So Ikoma, Soshi Tsunashima, Masato Kobayashi, Koki Ishimoto, Yuya Ikeda, Tatsuya Matsushima, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Our team, TRAIL, consists of AI/ML laboratory members from The University of Tokyo. We leverage our extensive research experience in state-of-the-art machine learning to build general-purpose in-home service robots. We previously participated in two competitions using Human Support Robot (HSR): RoboCup@Home Japan Open 2020 (DSPL) and World Robot Summit 2020, equivalent to RoboCup World Tournament.…

    Submitted 5 October, 2023; originally announced October 2023.

  28. arXiv:2309.16143  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples

    Authors: Shin'ya Yamaguchi

    Abstract: Semi-supervised learning (SSL) is a promising approach for training deep classification models using labeled and unlabeled datasets. However, existing SSL methods rely on a large unlabeled dataset, which may not always be available in many real-world applications due to legal constraints (e.g., GDPR). In this paper, we investigate the research question: Can we train SSL models without real unlabel…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML2023); a preprint of the camera-ready version

  29. arXiv:2308.16454  [pdf, other]

    cs.CV cs.LG

    Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff

    Authors: Satoshi Suzuki, Shin'ya Yamaguchi, Shoichiro Takeda, Sekitoshi Kanai, Naoki Makishima, Atsushi Ando, Ryo Masumura

    Abstract: This paper addresses the tradeoff between standard accuracy on clean examples and robustness against adversarial examples in deep neural networks (DNNs). Although adversarial training (AT) improves robustness, it degrades the standard accuracy, thus yielding the tradeoff. To mitigate this tradeoff, we propose a novel AT method called ARREST, which comprises three components: (i) adversarial finetu…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by International Conference on Computer Vision (ICCV) 2023

  30. arXiv:2307.13899  [pdf, other]

    cs.LG cs.AI cs.CV

    Regularizing Neural Networks with Meta-Learning Generative Models

    Authors: Shin'ya Yamaguchi, Daiki Chijiwa, Sekitoshi Kanai, Atsutoshi Kumagai, Hisashi Kashima

    Abstract: This paper investigates methods for improving generative data augmentation for deep learning. Generative data augmentation leverages the synthetic samples produced by generative models as an additional dataset for classification with small dataset settings. A key challenge of generative data augmentation is that the synthetic data contain uninformative samples that degrade accuracy. This is becaus…

    Submitted 23 October, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2023

  31. arXiv:2306.10656  [pdf, other]

    cs.LG cs.AI stat.ML

    Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics

    Authors: Kenta Oono, Nontawat Charoenphakdee, Kotatsu Bito, Zhengyan Gao, Hideyoshi Igata, Masashi Yoshikawa, Yoshiaki Ota, Hiroki Okui, Kei Akita, Shoichiro Yamaguchi, Yohei Sugawara, Shin-ichi Maeda, Kunihiko Miyoshi, Yuki Saito, Koki Tsuda, Hiroshi Maruyama, Kohei Hayashi

    Abstract: Identifying the relationship between healthcare attributes, lifestyles, and personality is vital for understanding and improving physical and mental well-being. Machine learning approaches are promising for modeling their relationships and offering actionable suggestions. In this paper, we propose the Virtual Human Generative Model (VHGM), a novel deep generative model capable of estimating over 2…

    Submitted 29 January, 2025; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 19 pages, 4 figures

  32. arXiv:2306.05641  [pdf, other]

    cs.LG cs.AI

    Toward Data Efficient Model Merging between Different Datasets without Performance Degradation

    Authors: Masanori Yamada, Tomoya Yamashita, Shin'ya Yamaguchi, Daiki Chijiwa

    Abstract: Model merging is attracting attention as a novel method for creating a new model by combining the weights of different trained models. While previous studies reported that model merging works well for models trained on a single dataset with different random seeds, model merging between different datasets remains unsolved. In this paper, we attempt to reveal the difficulty in merging such models tr…

    Submitted 20 September, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 29 pages; comments are welcome, accepted at ACML 2024

  33. arXiv:2304.12304  [pdf]

    cs.AI cs.CY cs.LG

    A Survey on Multi-Resident Activity Recognition in Smart Environments

    Authors: Farhad MortezaPour Shiri, Thinagaran Perumal, Norwati Mustapha, Raihani Mohamed, Mohd Anuaruddin Bin Ahmadon, Shingo Yamaguchi

    Abstract: Human activity recognition (HAR) is a rapidly growing field that utilizes smart devices, sensors, and algorithms to automatically classify and identify the actions of individuals within a given environment. These systems have a wide range of applications, including assisting with caring tasks, increasing security, and improving energy efficiency. However, there are several challenges that must be…

    Submitted 20 April, 2025; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 16 pages, to appear in Evolution of Information, Communication and Computing Systems (EICCS) Book Series

  34. arXiv:2304.01550  [pdf, other]

    hep-th cond-mat.stat-mech hep-lat

    Non-invertible symmetries and boundaries in four dimensions

    Authors: Masataka Koide, Yuta Nagoya, Satoshi Yamaguchi

    Abstract: We study quantum field theories with boundary by utilizing non-invertible symmetries. We consider three kinds of boundary conditions of the four dimensional $\mathbb{Z}_2$ lattice gauge theory at the critical point as examples. The weights of the elements on the boundary are determined so that these boundary conditions are related by the Kramers-Wannier-Wegner (KWW) duality. In other words, it is r…

    Submitted 25 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: 18 pages, 19 figures: v2: references and comments added

    Report number: OU-HET 1180

    Journal ref: Phys. Rev. D 108, 065009 (2023)

  35. Phase structure of linear quiver gauge theories from anomaly matching

    Authors: Okuto Morikawa, Hiroki Wada, Satoshi Yamaguchi

    Abstract: We consider the phase structure of the linear quiver gauge theory, using the 't Hooft anomaly matching condition. This theory is characterized by the length $K$ of the quiver diagram. When $K$ is even, the symmetry and its anomaly are the same as those of massless QCD. Therefore, one can expect that the spontaneous symmetry breaking similar to the chiral symmetry breaking occurs. On the other hand…

    Submitted 6 February, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 20 pages, 6 figures

    Report number: OU-HET 1157

  36. arXiv:2210.05268  [pdf, other]

    cs.LG

    Component-Wise Natural Gradient Descent -- An Efficient Neural Network Optimization

    Authors: Tran Van Sang, Mhd Irvan, Rie Shigetomi Yamaguchi, Toshiyuki Nakata

    Abstract: Natural Gradient Descent (NGD) is a second-order neural network training that preconditions the gradient descent with the inverse of the Fisher Information Matrix (FIM). Although NGD provides an efficient preconditioner, it is not practicable due to the expensive computation required when inverting the FIM. This paper proposes a new NGD variant algorithm named Component-Wise Natural Gradient Desce…

    Submitted 11 October, 2022; originally announced October 2022.

  37. arXiv:2208.13193  [pdf, ps, other]

    hep-th cond-mat.str-el

    SL$(2,\mathbb{Z})$ action on quantum field theories with U(1) subsystem symmetry

    Authors: Satoshi Yamaguchi

    Abstract: We consider SL$(2,\mathbb{Z})$ action on quantum field theories with U(1) subsystem symmetry in five dimensions. This is an analog of the SL$(2,\mathbb{Z})$ action considered in arXiv:hep-th/0307041. We show that the exotic level 1 BF theory and the exotic level 1 Chern-Simons theories are trivial and almost trivial, respectively. By using this fact, we define S operation and T operation. These op…

    Submitted 13 January, 2023; v1 submitted 28 August, 2022; originally announced August 2022.

    Comments: 20 pages. v2: typos corrected. v3: typos corrected, comments added

    Report number: OU-HET 1153, YITP-22-86

  38. arXiv:2207.10283  [pdf, other]

    cs.LG cs.AI stat.ML

    One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training

    Authors: Sekitoshi Kanai, Shin'ya Yamaguchi, Masanori Yamada, Hiroshi Takahashi, Kentaro Ohno, Yasutoshi Ida

    Abstract: This paper proposes a new loss function for adversarial training. Since adversarial training has difficulties, e.g., necessity of high model capacity, focusing on important data points by weighting cross-entropy loss has attracted much attention. However, they are vulnerable to sophisticated attacks, e.g., Auto-Attack. This paper experimentally reveals that the cause of their vulnerability is thei…

    Submitted 26 April, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ICML2023, 26 pages, 19 figures

  39. arXiv:2205.15619  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Atsutoshi Kumagai, Yasutoshi Ida

    Abstract: Few-shot learning for neural networks (NNs) is an important problem that aims to train NNs with a few data. The main challenge is how to avoid overfitting since over-parameterized NNs can easily overfit to such small dataset. Previous work (e.g. MAML by Finn et al. 2017) tackles this challenge by meta-learning, which learns how to learn from a few data by using various tasks. On the other hand, on…

    Submitted 9 February, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  40. arXiv:2204.13263  [pdf, other]

    cs.LG

    Covariance-aware Feature Alignment with Pre-computed Source Statistics for Test-time Adaptation to Multiple Image Corruptions

    Authors: Kazuki Adachi, Shin'ya Yamaguchi, Atsutoshi Kumagai

    Abstract: Real-world image recognition systems often face corrupted input images, which cause distribution shifts and degrade the performance of models. These systems often use a single prediction model in a central server and process images sent from various environments, such as cameras distributed in cities or cars. Such single models face images corrupted in heterogeneous ways in test time. Thus, they r…

    Submitted 29 June, 2023; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Extended version of the paper accepted to ICIP 2023

  41. arXiv:2204.12833  [pdf, other]

    cs.LG cs.AI stat.ML

    Transfer Learning with Pre-trained Conditional Generative Models

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Atsutoshi Kumagai, Daiki Chijiwa, Hisashi Kashima

    Abstract: Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and (iii) target network architectures are consistent with source ones. However, holding these assumptions is difficult in practical settings because the target task ra…

    Submitted 20 February, 2025; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Accepted by Machine Learning

    Journal ref: Machine Learning 114, 96 (2025)

  42. arXiv:2202.13062  [pdf, other]

    cs.RO

    Learning-based Collision-free Planning on Arbitrary Optimization Criteria in the Latent Space through cGANs

    Authors: Tomoki Ando, Hiroto Iino, Hiroki Mori, Ryota Torishima, Kuniyuki Takahashi, Shoichiro Yamaguchi, Daisuke Okanohara, Tetsuya Ogata

    Abstract: We propose a new method for collision-free planning using Conditional Generative Adversarial Networks (cGANs) to transform between the robot's joint space and a latent space that captures only collision-free areas of the joint space, conditioned by an obstacle map. Generating multiple plausible trajectories is convenient in applications such as the manipulation of a robot arm by enabling the selec…

    Submitted 5 February, 2023; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: 19 pages, 7 figures. An accompanying video is available at https://www.youtube.com/watch?v=IJUxdmaSwy0. arXiv admin note: text overlap with arXiv:2202.07203

  43. arXiv:2202.07203  [pdf, other]

    cs.RO

    Collision-free Path Planning in the Latent Space through cGANs

    Authors: Tomoki Ando, Hiroki Mori, Ryota Torishima, Kuniyuki Takahashi, Shoichiro Yamaguchi, Daisuke Okanohara, Tetsuya Ogata

    Abstract: We show a new method for collision-free path planning by cGANs by mapping its latent space to only the collision-free areas of the robot joint space. Our method simply provides this collision-free latent space after which any planner, using any optimization conditions, can be used to generate the most suitable paths on the fly. We successfully verified this method with a simulated two-link robot a…

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 10 pages, 9 figures

  44. arXiv:2202.04237  [pdf, other]

    cs.CV cs.AI cs.LG

    Learning Robust Convolutional Neural Networks with Relevant Feature Focusing via Explanations

    Authors: Kazuki Adachi, Shin'ya Yamaguchi

    Abstract: Existing image recognition techniques based on convolutional neural networks (CNNs) basically assume that the training and test datasets are sampled from i.i.d. distributions. However, this assumption is easily broken in the real world because of the distribution shift that occurs when the co-occurrence relations between objects and backgrounds in input images change. Under this type of distributio…

    Submitted 23 March, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted by ICME 2022

  45. On elementary moves of singular Legendrian knots

    Authors: Sara Yamaguchi, Noboru Ito

    Abstract: We have two results. First, we give 96 generating sets of oriented singular Reidemeister moves; it is an answer to a question by Bataineh, Khaled, Elhamdadi, and Hajij who give a generating set of oriented singular Reidemeister moves using their computation. Second, in the theory of plane curves and Legendrian knots introduced by V. I. Arnold, we select which moves survive as those of Legendrian singu…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: 6 pages

  46. arXiv:2111.11040  [pdf, ps, other]

    hep-th hep-lat math-ph

    A physicist-friendly reformulation of the mod-two Atiyah-Patodi-Singer index

    Authors: Hidenori Fukaya, Mikio Furuta, Yoshiyuki Matsuki, Shinichiroh Matsuo, Tetsuya Onogi, Satoshi Yamaguchi, Mayuko Yamashita

    Abstract: Gauge anomaly in 4-dimensions can be viewed as a current inflow into an extra-dimension, where the total phase of the fermion partition function is given in a gauge invariant way by the Atiyah-Patodi-Singer (APS) eta-invariant of a 5-dimensional Dirac operator. However, this formalism requires a non-local boundary condition, with which the physical roles of edge/bulk modes are unclear and how the…

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 10 pages, 1 figure, talk presented at the 38th International Symposium on Lattice Field Theory (LATTICE2021); to appear in Proceedings of Science, PoS(LATTICE2021)617

    Report number: OU-HET-1120

  47. arXiv:2110.12861  [pdf, ps, other]

    hep-th cond-mat.str-el

    Gapless edge modes in (4+1)-dimensional topologically massive tensor gauge theory and anomaly inflow for subsystem symmetry

    Authors: Satoshi Yamaguchi

    Abstract: We consider (4+1)-dimensional topologically massive tensor gauge theory. This theory is an analog of the (2+1)-dimensional topologically massive Maxwell-Chern-Simons theory. If the space has a boundary, we find that a (3+1)-dimensional gapless theory appears at the boundary. This gapless theory is a chiral version of the (3+1)-dimensional $\varphi$ theory. This gapless theory is protected by the a…

    Submitted 14 February, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: 13 pages. v1: typos corrected, references added. v2: comments on the Chern-Simons level added, typos corrected, the appendix improved

    Report number: OU-HET 1111

    Journal ref: Prog Theor Exp Phys (2022)

  48. Gauge Kinetic Mixing and Dark Topological Defects

    Authors: Takashi Hiramatsu, Masahiro Ibe, Motoo Suzuki, Soma Yamaguchi

    Abstract: We discuss how the topological defects in the dark sector affect the Standard Model sector when the dark photon has a kinetic mixing with the QED photon. In particular, we consider the dark photon appearing in the successive gauge symmetry breaking, $\mathrm{SU}(2)\to \mathrm{U}(1) \to \mathbb{Z}_2$, where the remaining $\mathbb{Z}_2$ is the center of $\mathrm{SU(2)}$. In this model, the monopole…

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 39 pages, 8 figures

    Report number: RUP-21-17

  49. arXiv:2109.05992  [pdf, other]

    hep-th cond-mat.stat-mech hep-lat

    Non-invertible topological defects in 4-dimensional $\mathbb{Z}_2$ pure lattice gauge theory

    Authors: Masataka Koide, Yuta Nagoya, Satoshi Yamaguchi

    Abstract: We explore topological defects in the 4-dimensional pure $\mathbb{Z}_2$ lattice gauge theory. This theory has 1-form $\mathbb{Z}_{2}$ center symmetry as well as the Kramers-Wannier-Wegner (KWW) duality. We construct the KWW duality topological defects in the similar way to that constructed by Aasen, Mong, Fendley arXiv:1601.07185 for the 2-dimensional Ising model. These duality defects turn out to…

    Submitted 1 November, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: 24 pages, 37 figures, v2: minor correction. v3: references and comments added

    Report number: OU-HET 1105

  50. arXiv:2106.09269  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Pruning Randomly Initialized Neural Networks with Iterative Randomization

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Yasutoshi Ida, Kenji Umakoshi, Tomohiro Inoue

    Abstract: Pruning the weights of randomly initialized neural networks plays an important role in the context of lottery ticket hypothesis. Ramanujan et al. (2020) empirically showed that only pruning the weights can achieve remarkable performance instead of optimizing the weight values. However, to achieve the same level of performance as the weight optimization, the pruning approach requires more parameter…

    Submitted 5 April, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021); Selected for a spotlight presentation