Showing 1–5 of 5 results for author: Binder, J

Searching in archive eess.
  1. arXiv:2503.09963  [pdf, other]

    eess.IV cs.CV

    Reference-Free 3D Reconstruction of Brain Dissection Photographs with Machine Learning

    Authors: Lin Tian, Sean I. Young, Jonathan Williams Ramirez, Dina Zemlyanker, Lucas Jacob Deden Binder, Rogeny Herisse, Theresa R. Connors, Derek H. Oakley, Bradley T. Hyman, Oula Puonti, Matthew S. Rosen, Juan Eugenio Iglesias

    Abstract: Correlation of neuropathology with MRI has the potential to transfer microscopic signatures of pathology to in vivo scans. Recently, a classical registration method has been proposed to build these correlations from 3D reconstructed stacks of dissection photographs, which are routinely taken at brain banks. These photographs bypass the need for ex vivo MRI, which is not widely accessible. However,…

    Submitted 12 March, 2025; originally announced March 2025.

  2. arXiv:2008.00620  [pdf, ps, other]

    eess.AS cs.CL cs.SD

    Audiovisual Speech Synthesis using Tacotron2

    Authors: Ahmed Hussen Abdelaziz, Anushree Prasanna Kumar, Chloe Seivwright, Gabriele Fanelli, Justin Binder, Yannis Stylianou, Sachin Kajarekar

    Abstract: Audiovisual speech synthesis is the problem of synthesizing a talking face while maximizing the coherency of the acoustic and visual speech. In this paper, we propose and compare two audiovisual speech synthesis systems for 3D face models. The first system is the AVTacotron2, which is an end-to-end text-to-audiovisual speech synthesizer based on the Tacotron2 architecture. AVTacotron2 converts a s…

    Submitted 29 August, 2021; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: This work has been submitted to the 23rd ACM International Conference on Multimodal Interaction for possible publication

  3. arXiv:2001.05814  [pdf, other]

    eess.SY

    Storage Placement and Sizing in a Distribution Grid with high PV-Generation

    Authors: Benjamin Matthiss, Arghavan Momenifarahani, Jann Binder

    Abstract: With the increasing penetration of renewable resources in the distribution grid, the demand for alternatives to grid reinforcement measures rises. One possible solution is the use of battery systems to balance the power flow at crucial locations in the grid. Here, the optimal location and size of the system have to be determined with regard to investment and grid-stabilizing effect. In this paper t…

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 6 pages, 6 tables, 7 figures

  4. arXiv:1907.07807  [pdf, other]

    eess.IV cs.CV cs.LG

    A fully 3D multi-path convolutional neural network with feature fusion and feature weighting for automatic lesion identification in brain MRI images

    Authors: Yunzhe Xue, Meiyan Xie, Fadi G. Farhat, Olga Boukrina, A. M. Barrett, Jeffrey R. Binder, Usman W. Roshan, William W. Graves

    Abstract: We propose a fully 3D multi-path convolutional network to predict stroke lesions from 3D brain MRI images. Our multi-path model has independent encoders for different modalities containing residual convolutional blocks, weighted multi-path feature fusion from different modalities, and weighted fusion modules to combine encoder and decoder features. Compared to existing 3D CNNs like DeepMedic, 3D U…

    Submitted 16 November, 2019; v1 submitted 17 July, 2019; originally announced July 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  5. arXiv:1905.06860  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models

    Authors: Ahmed Hussen Abdelaziz, Barry-John Theobald, Justin Binder, Gabriele Fanelli, Paul Dixon, Nicholas Apostoloff, Thibaut Weise, Sachin Kajarekar

    Abstract: Speech-driven visual speech synthesis involves mapping features extracted from acoustic speech to the corresponding lip animation controls for a face model. This mapping can take many forms, but a powerful approach is to use deep neural networks (DNNs). However, a limitation is the lack of synchronized audio, video, and depth data required to reliably train the DNNs, especially for speaker-indepen…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: 9 pages, 2 figures, 2 tables

    ACM Class: I.2.m; I.3.8