Skip to main content

Showing 1–23 of 23 results for author: Yamashita, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.06201  [pdf, other

    cs.CV cs.GR

    Size-Variable Virtual Try-On with Physical Clothes Size

    Authors: Yohei Yamashita, Chihiro Nakatani, Norimichi Ukita

    Abstract: This paper addresses a new virtual try-on problem of fitting any size of clothes to a reference person in the image domain. While previous image-based virtual try-on methods can produce highly natural try-on images, these methods fit the clothes on the person without considering the relative relationship between the physical sizes of the clothes and the person. Different from these methods, our me… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

  2. arXiv:2410.20735  [pdf

    q-bio.NC cs.AI physics.bio-ph

    Murine AI excels at cats and cheese: Structural differences between human and mouse neurons and their implementation in generative AIs

    Authors: Rino Saiga, Kaede Shiga, Yo Maruta, Chie Inomoto, Hiroshi Kajiwara, Naoya Nakamura, Yu Kakimoto, Yoshiro Yamamoto, Masahiro Yasutake, Masayuki Uesugi, Akihisa Takeuchi, Kentaro Uesugi, Yasuko Terada, Yoshio Suzuki, Viktor Nikitin, Vincent De Andrade, Francesco De Carlo, Yuichi Yamashita, Masanari Itokawa, Soichiro Ide, Kazutaka Ikeda, Ryuta Mizutani

    Abstract: Mouse and human brains have different functions that depend on their neuronal networks. In this study, we analyzed nanometer-scale three-dimensional structures of brain tissues of the mouse medial prefrontal cortex and compared them with structures of the human anterior cingulate cortex. The obtained results indicated that mouse neuronal somata are smaller and neurites are thinner than those of hu… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 41 pages, 4 figures

    Journal ref: Sci. Rep. 15, 25091 (2025)

  3. arXiv:2410.15532  [pdf, ps, other

    cs.SD eess.AS

    Construction and Analysis of Impression Caption Dataset for Environmental Sounds

    Authors: Yuki Okamoto, Ryotaro Nagase, Minami Okamoto, Yuki Saito, Keisuke Imoto, Takahiro Fukumori, Yoichi Yamashita

    Abstract: Some datasets with the described content and order of occurrence of sounds have been released for conversion between environmental sound and text. However, there are very few texts that include information on the impressions humans feel, such as "sharp" and "gorgeous," when they hear environmental sounds. In this study, we constructed a dataset with impression captions for environmental sounds tha… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  4. arXiv:2306.04143  [pdf, other

    cs.SD eess.AS

    RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction

    Authors: Takahiro Fukumori, Taito Ishida, Yoichi Yamashita

    Abstract: The detection of shouted speech is crucial in audio surveillance and monitoring. Although it is desirable for a security system to be able to identify emergencies, existing corpora provide only a binary label (i.e., shouted or normal) for each speech sample, making it difficult to predict the shout intensity. Furthermore, most corpora comprise only utterances typical of hazardous situations, meani… ▽ More

    Submitted 19 October, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted for publication in IEEE/ACM Transactions on Audio, Speech, and Language Processing. DOI: 10.1109/TASLP.2024.3473302

  5. arXiv:2305.00302  [pdf, ps, other

    cs.SD eess.AS

    Environmental sound synthesis from vocal imitations and sound event labels

    Authors: Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita

    Abstract: One way of expressing an environmental sound is using vocal imitations, which involve the process of replicating or mimicking the rhythm and pitch of sounds by voice. We can effectively express the features of environmental sounds, such as rhythm and pitch, using vocal imitations, which cannot be expressed by conventional input information, such as sound event labels, images, or texts, in an envir… ▽ More

    Submitted 14 September, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: Submitted to ICASSP2024

  6. arXiv:2208.07679  [pdf, ps, other

    cs.SD eess.AS

    How Should We Evaluate Synthesized Environmental Sounds

    Authors: Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Takahiro Fukumori, Yoichi Yamashita

    Abstract: Although several methods of environmental sound synthesis have been proposed, there has been no discussion on how synthesized environmental sounds should be evaluated. Only either subjective or objective evaluations have been conducted in conventional evaluations, and it is not clear what type of evaluation should be carried out. In this paper, we investigate how to evaluate synthesized environmen… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: Submitted APSIPA ASC 2022

  7. arXiv:2207.10106  [pdf, ps, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    World Robot Challenge 2020 -- Partner Robot: A Data-Driven Approach for Room Tidying with Mobile Manipulator

    Authors: Tatsuya Matsushima, Yuki Noguchi, Jumpei Arima, Toshiki Aoki, Yuki Okita, Yuya Ikeda, Koki Ishimoto, Shohei Taniguchi, Yuki Yamashita, Shoichi Seto, Shixiang Shane Gu, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Tidying up a household environment using a mobile manipulator poses various challenges in robotics, such as adaptation to large real-world environmental variations, and safe and robust deployment in the presence of humans.The Partner Robot Challenge in World Robot Challenge (WRC) 2020, a global competition held in September 2021, benchmarked tidying tasks in the real home environments, and importa… ▽ More

    Submitted 21 July, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

  8. arXiv:2111.02666  [pdf, ps, other

    q-bio.NC cs.AI cs.LG cs.RO

    Emergence of sensory attenuation based upon the free-energy principle

    Authors: Hayato Idei, Wataru Ohata, Yuichi Yamashita, Tetsuya Ogata, Jun Tani

    Abstract: The brain attenuates its responses to self-produced exteroceptions (e.g., we cannot tickle ourselves). Is this phenomenon, known as sensory attenuation, enabled innately, or acquired through learning? Here, our simulation study using a multimodal hierarchical recurrent neural network model, based on variational free-energy minimization, shows that a mechanism for sensory attenuation can develop th… ▽ More

    Submitted 12 August, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

  9. arXiv:2110.11866  [pdf, ps, other

    cs.DC eess.SP

    Morlet wavelet transform using attenuated sliding Fourier transform and kernel integral for graphic processing unit

    Authors: Yukihiko Yamashita, Toru Wakahara

    Abstract: Morlet or Gabor wavelet transforms as well as Gaussian smoothing, are widely used in signal processing and image processing. However, the computational complexity of their direct calculations is proportional not only to the number of data points in a signal but also to the smoothing size, which is the standard deviation in the Gaussian function in their transform functions. Thus, when the standard… ▽ More

    Submitted 24 June, 2024; v1 submitted 3 September, 2021; originally announced October 2021.

    Comments: 18 pages

    ACM Class: I.5.4

  10. arXiv:2110.03243  [pdf, ps, other

    cs.SD

    Sound Event Detection Guided by Semantic Contexts of Scenes

    Authors: Noriyuki Tonami, Keisuke Imoto, Ryotaro Nagase, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita

    Abstract: Some studies have revealed that contexts of scenes (e.g., "home," "office," and "cooking") are advantageous for sound event detection (SED). Mobile devices and sensing technologies give useful information on scenes for SED without the use of acoustic signals. However, conventional methods can employ pre-defined contexts in inference stages but not undefined contexts. This is because one-hot repres… ▽ More

    Submitted 17 February, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted to ICASSP 2022

  11. arXiv:2102.05872  [pdf, ps, other

    cs.SD eess.AS

    Onoma-to-wave: Environmental sound synthesis from onomatopoeic words

    Authors: Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, Yoichi Yamashita

    Abstract: In this paper, we propose a framework for environmental sound synthesis from onomatopoeic words. As one way of expressing an environmental sound, we can use an onomatopoeic word, which is a character sequence for phonetically imitating a sound. An onomatopoeic word is effective for describing diverse sound features. Therefore, using onomatopoeic words for environmental sound synthesis will enable… ▽ More

    Submitted 7 February, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted to APSIPA Transactions on Signal and Information Processing

  12. arXiv:2102.05288  [pdf, ps, other

    cs.SD

    Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events

    Authors: Noriyuki Tonami, Keisuke Imoto, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita

    Abstract: In conventional sound event detection (SED) models, two types of events, namely, those that are present and those that do not occur in an acoustic scene, are regarded as the same type of events. The conventional SED methods cannot effectively exploit the difference between the two types of events. All time frames of sound events that do not occur in an acoustic scene are easily regarded as inactiv… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021

  13. Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning

    Authors: Noriyuki Tonami, Keisuke Imoto, Ryosuke Yamanishi, Yoichi Yamashita

    Abstract: Sound event detection (SED) and acoustic scene classification (ASC) are important research topics in environmental sound analysis. Many research groups have addressed SED and ASC using neural-network-based methods, such as the convolutional neural network (CNN), recurrent neural network (RNN), and convolutional recurrent neural network (CRNN). The conventional methods address SED and ASC separatel… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: Accepted to IEICE Transactions on Information and Systems. arXiv admin note: text overlap with arXiv:1904.12146

  14. arXiv:2009.10887  [pdf

    eess.IV cs.AI physics.bio-ph

    Schizophrenia-mimicking layers outperform conventional neural network layers

    Authors: Ryuta Mizutani, Senta Noguchi, Rino Saiga, Yuichi Yamashita, Mitsuhiro Miyashita, Makoto Arai, Masanari Itokawa

    Abstract: We have reported nanometer-scale three-dimensional studies of brain networks of schizophrenia cases and found that their neurites are thin and tortuous compared to healthy controls. This suggests that connections between distal neurons are suppressed in microcircuits of schizophrenia cases. In this study, we applied these biological findings to the design of schizophrenia-mimicking artificial neur… ▽ More

    Submitted 1 April, 2022; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: 16 pages, 6 figures, and 1 table

    Journal ref: Frontiers Neurorobot 16, 851471 (2022)

  15. arXiv:2007.04719  [pdf, ps, other

    cs.SD eess.AS

    RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis

    Authors: Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, Yoichi Yamashita

    Abstract: Environmental sound synthesis is a technique for generating a natural environmental sound. Conventional work on environmental sound synthesis using sound event labels cannot finely control synthesized sounds, for example, the pitch and timbre. We consider that onomatopoeic words can be used for environmental sound synthesis. Onomatopoeic words are effective for explaining the feature of sounds. We… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: Submitted to DCASE2020 workshop

  16. arXiv:2006.15253  [pdf, ps, other

    cs.SD eess.AS

    Sound Event Detection Using Duration Robust Loss Function

    Authors: Daichi Akiyama, Keisuke Imoto, Noriyuki Tonami, Yuki Okamoto, Ryosuke Yamanishi, Takahiro Fukumori, Yoichi Yamashita

    Abstract: Many methods of sound event detection (SED) based on machine learning regard a segmented time frame as one data sample to model training. However, the sound durations of sound events vary greatly depending on the sound event class, e.g., the sound event ``fan'' has a long time duration, while the sound event ``mouse clicking'' is instantaneous. The difference in the time duration between sound eve… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: Submitted to DCASE2020 Workshop

  17. arXiv:2002.05848  [pdf, ps, other

    cs.SD eess.AS

    Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels

    Authors: Keisuke Imoto, Noriyuki Tonami, Yuma Koizumi, Masahiro Yasuda, Ryosuke Yamanishi, Yoichi Yamashita

    Abstract: Sound event detection (SED) and acoustic scene classification (ASC) are major tasks in environmental sound analysis. Considering that sound events and scenes are closely related to each other, some works have addressed joint analyses of sound events and acoustic scenes based on multitask learning (MTL), in which the knowledge of sound events and scenes can help in estimating them mutually. The con… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: Accepted to ICASSP 2020

  18. Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified

    Authors: Takuma Shibahara, Chisa Wada, Yasuho Yamashita, Kazuhiro Fujita, Masamichi Sato, Junichi Kuwata, Atsushi Okamoto, Yoshimasa Ono

    Abstract: Differentiating the intrinsic subtypes of breast cancer is crucial for deciding the best treatment strategy. Deep learning can predict the subtypes from genetic information more accurately than conventional statistical methods, but to date, deep learning has not been directly utilized to examine which genes are associated with which subtypes. To clarify the mechanisms embedded in the intrinsic sub… ▽ More

    Submitted 18 July, 2022; v1 submitted 20 January, 2020; originally announced January 2020.

    Comments: 25 pages, 5 figures

  19. arXiv:1908.10055  [pdf, ps, other

    cs.SD eess.AS

    Overview of Tasks and Investigation of Subjective Evaluation Methods in Environmental Sound Synthesis and Conversion

    Authors: Yuki Okamoto, Keisuke Imoto, Tatsuya Komatsu, Shinnosuke Takamichi, Takumi Yagyu, Ryosuke Yamanishi, Yoichi Yamashita

    Abstract: Synthesizing and converting environmental sounds have the potential for many applications such as supporting movie and game production, data augmentation for sound event detection and scene classification. Conventional works on synthesizing and converting environmental sounds are based on a physical modeling or concatenative approach. However, there are a limited number of works that have addresse… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

  20. arXiv:1906.10015  [pdf, other

    q-bio.NC cs.AI cs.NE

    A Review on Neural Network Models of Schizophrenia and Autism Spectrum Disorder

    Authors: Pablo Lanillos, Daniel Oliva, Anja Philippsen, Yuichi Yamashita, Yukie Nagai, Gordon Cheng

    Abstract: This survey presents the most relevant neural network models of autism spectrum disorder and schizophrenia, from the first connectionist models to recent deep network architectures. We analyzed and compared the most representative symptoms with its neural model counterpart, detailing the alteration introduced in the network that generates each of the symptoms, and identifying their strengths and w… ▽ More

    Submitted 23 October, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: Preprint submitted to Neural Networks. Research not referenced in the manuscript within the field of NN models of SZ and ASD are encouraged to contact the corresponding authors

    Journal ref: Neural Networks 122 (2020) 338-363

  21. arXiv:1904.12146  [pdf, ps, other

    cs.SD eess.AS

    Joint Analysis of Acoustic Events and Scenes Based on Multitask Learning

    Authors: Noriyuki Tonami, Keisuke Imoto, Masahiro Niitsuma, Ryosuke Yamanishi, Yoichi Yamashita

    Abstract: Acoustic event detection and scene classification are major research tasks in environmental sound analysis, and many methods based on neural networks have been proposed. Conventional methods have addressed these tasks separately; however, acoustic events and scenes are closely related to each other. For example, in the acoustic scene `office', the acoustic events `mouse clicking' and `keyboard typ… ▽ More

    Submitted 18 July, 2019; v1 submitted 27 April, 2019; originally announced April 2019.

    Comments: Accepted to WASPAA 2019

  22. arXiv:1708.01387  [pdf

    cs.DL

    Research Activity Classification based on Time Series Bibliometrics

    Authors: Takahiro Kawamura, Yasuhiro Yamashita, Katsuji Matsumura

    Abstract: Bibliometrics such as the number of papers and times cited are often used to compare researchers based on specific criteria. The criteria, however, are different in each research domain and are set by empirical laws. Moreover, there are arguments, such that the simple sum of metric values works to the advantage of elders. Therefore, this paper attempts to constitute features from time series data… ▽ More

    Submitted 4 August, 2017; originally announced August 2017.

    Journal ref: Proceedings of 21st International Conference on Science and Technology Indicators (STI 2016), pp. 1456-1460 (2016)

  23. arXiv:1605.03754  [pdf, other

    cs.MM

    Regression-based Intra-prediction for Image and Video Coding

    Authors: Carlo Noel Ochotorena, Yukihiko Yamashita

    Abstract: By utilizing previously known areas in an image, intra-prediction techniques can find a good estimate of the current block. This allows the encoder to store only the error between the original block and the generated estimate, thus leading to an improvement in coding efficiency. Standards such as AVC and HEVC describe expert-designed prediction modes operating in certain angular orientations along… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.