Showing 1–2 of 2 results for author: Port, A

  1. arXiv:2006.13469  [pdf, other]

    eess.AS cs.LG cs.SD

    Face-to-Music Translation Using a Distance-Preserving Generative Adversarial Network with an Auxiliary Discriminator

    Authors: Chelhwon Kim, Andrew Port, Mitesh Patel

    Abstract: Learning a mapping between two unrelated domains, such as image and audio, without any supervision is a challenging task. In this work, we propose a distance-preserving generative adversarial model to translate images of human faces into an audio domain. The audio domain is defined by a collection of musical note sounds recorded by 10 different instrument families (NSynth \cite{nsynth2017}) and a d…

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 15 pages, 3 figures

  2. arXiv:2005.13291  [pdf, other]

    eess.AS cs.CV cs.LG cs.SD stat.ML

    Deep Sensory Substitution: Noninvasively Enabling Biological Neural Networks to Receive Input from Artificial Neural Networks

    Authors: Andrew Port, Chelhwon Kim, Mitesh Patel

    Abstract: As is expressed in the adage "a picture is worth a thousand words", when using spoken language to communicate visual information, brevity can be a challenge. This work describes a novel technique for leveraging machine-learned feature embeddings to sonify visual (and other types of) information into a perceptual audio domain, allowing users to perceive this information using only their aural facul…

    Submitted 25 August, 2021; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: 9 pages, 3 figures