Skip to main content

Showing 1–3 of 3 results for author: Papademetris, X

Searching in archive cs. Search in all archives.
.
  1. arXiv:2102.11013  [pdf, other

    q-bio.NC cs.LG

    Multiple-shooting adjoint method for whole-brain dynamic causal modeling

    Authors: Juntang Zhuang, Nicha Dvornek, Sekhar Tatikonda, Xenophon Papademetris, Pamela Ventola, James Duncan

    Abstract: Dynamic causal modeling (DCM) is a Bayesian framework to infer directed connections between compartments, and has been used to describe the interactions between underlying neural populations based on functional neuroimaging data. DCM is typically analyzed with the expectation-maximization (EM) algorithm. However, because the inversion of a large-scale continuous system is difficult when noisy obse… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: 27th International Conference on Information Processing in Medical Imaging

  2. arXiv:2010.07468  [pdf, other

    cs.LG cs.CV stat.ML

    AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients

    Authors: Juntang Zhuang, Tommy Tang, Yifan Ding, Sekhar Tatikonda, Nicha Dvornek, Xenophon Papademetris, James S. Duncan

    Abstract: Most popular optimizers for deep learning can be broadly categorized as adaptive methods (e.g. Adam) and accelerated schemes (e.g. stochastic gradient descent (SGD) with momentum). For many models such as convolutional neural networks (CNNs), adaptive methods typically converge faster but generalize worse compared to SGD; for complex settings such as generative adversarial networks (GANs), adaptiv… ▽ More

    Submitted 20 December, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

    Journal ref: NeurIPS 2020

  3. arXiv:2006.02493  [pdf

    stat.ML cs.LG

    Adaptive Checkpoint Adjoint Method for Gradient Estimation in Neural ODE

    Authors: Juntang Zhuang, Nicha Dvornek, Xiaoxiao Li, Sekhar Tatikonda, Xenophon Papademetris, James Duncan

    Abstract: Neural ordinary differential equations (NODEs) have recently attracted increasing attention; however, their empirical performance on benchmark tasks (e.g. image classification) are significantly inferior to discrete-layer models. We demonstrate an explanation for their poorer performance is the inaccuracy of existing gradient estimation methods: the adjoint method has numerical errors in reverse-m… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

    Journal ref: https://proceedings.icml.cc/static/paper_files/icml/2020/917-Paper.pdf