Skip to main content

Showing 1–6 of 6 results for author: Nakayama, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2410.04380  [pdf, other

    eess.AS cs.SD

    HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis

    Authors: Yuto Nishimura, Takumi Hirose, Masanari Ohi, Hideki Nakayama, Nakamasa Inoue

    Abstract: Recently, Text-to-speech (TTS) models based on large language models (LLMs) that translate natural language text into sequences of discrete audio tokens have gained great research attention, with advances in neural audio codec (NAC) models using residual vector quantization (RVQ). However, long-form speech synthesis remains a significant challenge due to the high frame rate, which increases the le… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

  2. arXiv:2007.13559  [pdf, other

    cs.CV cs.LG eess.IV

    MADGAN: unsupervised Medical Anomaly Detection GAN using multiple adjacent brain MRI slice reconstruction

    Authors: Changhee Han, Leonardo Rundo, Kohei Murao, Tomoyuki Noguchi, Yuki Shimahara, Zoltan Adam Milacski, Saori Koshino, Evis Sala, Hideki Nakayama, Shinichi Satoh

    Abstract: Unsupervised learning can discover various unseen abnormalities, relying on large-scale unannotated medical images of healthy subjects. Towards this, unsupervised methods reconstruct a 2D/3D single medical image to detect outliers either in the learned feature space or from high reconstruction loss. However, without considering continuity between multiple adjacent slices, they cannot directly disc… ▽ More

    Submitted 12 October, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: 23 pages, 11 figures, submitted to BMC Bioinformatics. Extended version of arXiv:1906.06114

  3. arXiv:2001.03923  [pdf, ps, other

    cs.CV cs.LG eess.IV

    Bridging the gap between AI and Healthcare sides: towards developing clinically relevant AI-powered diagnosis systems

    Authors: Changhee Han, Leonardo Rundo, Kohei Murao, Takafumi Nemoto, Hideki Nakayama

    Abstract: Despite the success of Convolutional Neural Network-based Computer-Aided Diagnosis research, its clinical applications remain challenging. Accordingly, developing medical Artificial Intelligence (AI) fitting into a clinical environment requires identifying/bridging the gap between AI and Healthcare sides. Since the biggest problem in Medical Imaging lies in data paucity, confirming the clinical re… ▽ More

    Submitted 6 April, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

    Comments: 13 pages, 2 figure, accepted to AIAI 2020

  4. arXiv:1906.06114  [pdf, other

    eess.IV cs.CV

    GAN-based Multiple Adjacent Brain MRI Slice Reconstruction for Unsupervised Alzheimer's Disease Diagnosis

    Authors: Changhee Han, Leonardo Rundo, Kohei Murao, Zoltán Ádám Milacski, Kazuki Umemoto, Evis Sala, Hideki Nakayama, Shin'ichi Satoh

    Abstract: Unsupervised learning can discover various unseen diseases, relying on large-scale unannotated medical images of healthy subjects. Towards this, unsupervised methods reconstruct a single medical image to detect outliers either in the learned feature space or from high reconstruction loss. However, without considering continuity between multiple adjacent slices, they cannot directly discriminate di… ▽ More

    Submitted 16 March, 2020; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: 10 pages, 4 figures, Accepted to Lecture Notes in Bioinformatics (LNBI) as a volume in the Springer series

  5. arXiv:1906.04962  [pdf, other

    cs.CV eess.IV

    Synthesizing Diverse Lung Nodules Wherever Massively: 3D Multi-Conditional GAN-based CT Image Augmentation for Object Detection

    Authors: Changhee Han, Yoshiro Kitamura, Akira Kudo, Akimichi Ichinose, Leonardo Rundo, Yujiro Furukawa, Kazuki Umemoto, Yuanzhong Li, Hideki Nakayama

    Abstract: Accurate Computer-Assisted Diagnosis, relying on large-scale annotated pathological images, can alleviate the risk of overlooking the diagnosis. Unfortunately, in medical imaging, most available datasets are small/fragmented. To tackle this, as a Data Augmentation (DA) method, 3D conditional Generative Adversarial Networks (GANs) can synthesize desired realistic/diverse 3D images as additional tra… ▽ More

    Submitted 12 August, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: 9 pages, 6 figures, accepted to 3DV 2019

  6. arXiv:1905.13456  [pdf, other

    eess.IV cs.AI cs.CV

    Combining Noise-to-Image and Image-to-Image GANs: Brain MR Image Augmentation for Tumor Detection

    Authors: Changhee Han, Leonardo Rundo, Ryosuke Araki, Yudai Nagano, Yujiro Furukawa, Giancarlo Mauri, Hideki Nakayama, Hideaki Hayashi

    Abstract: Convolutional Neural Networks (CNNs) achieve excellent computer-assisted diagnosis with sufficient annotated training data. However, most medical imaging datasets are small and fragmented. In this context, Generative Adversarial Networks (GANs) can synthesize realistic/diverse additional training images to fill the data lack in the real image distribution; researchers have improved classification… ▽ More

    Submitted 9 October, 2019; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: 12 pages, 7 figures, accepted to IEEE ACCESS