
Showing 1–35 of 35 results for author: Carlson, D

  1. arXiv:2505.20697  [pdf, ps, other]

    cs.LG cs.AI stat.AP stat.ML

    Generating Hypotheses of Dynamic Causal Graphs in Neuroscience: Leveraging Generative Factor Models of Observed Time Series

    Authors: Zachary C. Brown, David Carlson

    Abstract: The field of hypothesis generation promises to reduce costs in neuroscience by narrowing the range of interventional studies needed to study various phenomena. Existing machine learning methods can generate scientific hypotheses from complex datasets, but many approaches assume causal relationships are static over time, limiting their applicability to systems with dynamic, state-dependent behavior…

    Submitted 2 July, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2505.18342  [pdf, ps, other]

    cs.CV cs.LG

    Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance

    Authors: Jack Goffinet, Youngjo Min, Carlo Tomasi, David E. Carlson

    Abstract: Accurate and scalable quantification of animal pose and appearance is crucial for studying behavior. Current 3D pose estimation techniques, such as keypoint- and mesh-based techniques, often face challenges including limited representational detail, labor-intensive annotation requirements, and expensive per-frame optimization. These limitations hinder the study of subtle movements and can make lar…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 19 pages, 13 figures

  3. arXiv:2412.04285  [pdf, other]

    cs.LG

    Deep Causal Inference for Point-referenced Spatial Data with Continuous Treatments

    Authors: Ziyang Jiang, Zach Calhoun, Yiling Liu, Lei Duan, David Carlson

    Abstract: Causal reasoning is often challenging with spatial data, particularly when handling high-dimensional inputs. To address this, we propose a neural network (NN) based framework integrated with an approximate Gaussian process to manage spatial interference and unobserved confounding. Additionally, we adopt a generalized propensity-score-based approach to address partially observed outcomes when estim…

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 16 pages, 4 figures, 5 tables

    ACM Class: J.2

  4. arXiv:2409.02327  [pdf, other]

    stat.ML cs.LG

    Generative Principal Component Regression via Variational Inference

    Authors: Austin Talbot, Corey J Keller, David E Carlson, Alex V Kotlar

    Abstract: The ability to manipulate complex systems, such as the brain, to modify specific outcomes has far-reaching implications, particularly in the treatment of psychiatric disorders. One approach to designing appropriate manipulations is to target key features of predictive models. While generative latent variable models, such as probabilistic principal component analysis (PPCA), are powerful tools for…

    Submitted 3 September, 2024; originally announced September 2024.

  5. arXiv:2408.16084  [pdf, other]

    cs.DC astro-ph.IM

    Benchmarking with Supernovae: A Performance Study of the FLASH Code

    Authors: Joshua Martin, Catherine Feldman, Eva Siegmann, Tony Curtis, David Carlson, Firat Coskun, Daniel Wood, Raul Gonzalez, Robert J. Harrison, Alan C. Calder

    Abstract: Astrophysical simulations are computation, memory, and thus energy intensive, thereby requiring new hardware advances for progress. Stony Brook University recently expanded its computing cluster "SeaWulf" with an addition of 94 new nodes featuring Intel Sapphire Rapids Xeon Max series CPUs. We present a performance and power efficiency study of this hardware performed with FLASH: a multi-scale, mu…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: Accepted to PEARC '24 (Practice and Experience in Advanced Research Computing)

    Journal ref: Practice and Experience in Advanced Research Computing 2024 Article no.8

  6. arXiv:2408.03616  [pdf, other]

    eess.IV cs.CV

    Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation

    Authors: Feng Zhou, Yanjie Zhou, Longjie Wang, Yun Peng, David E. Carlson, Liyun Tu

    Abstract: Traditional one-shot medical image segmentation (MIS) methods use registration networks to propagate labels from a reference atlas or rely on comprehensive sampling strategies to generate synthetic labeled data for training. However, these methods often struggle with registration errors and low-quality synthetic images, leading to poor performance and generalization. To overcome this, we introduce…

    Submitted 5 January, 2025; v1 submitted 7 August, 2024; originally announced August 2024.

  7. arXiv:2407.13778  [pdf, other]

    cs.CV cs.LG

    Assessing the Potential of PlanetScope Satellite Imagery to Estimate Particulate Matter Oxidative Potential

    Authors: Ian Hough, Loïc Argentier, Ziyang Jiang, Tongshu Zheng, Mike Bergin, David Carlson, Jean-Luc Jaffrezo, Jocelyn Chanussot, Gaëlle Uzu

    Abstract: Oxidative potential (OP), which measures particulate matter's (PM) capacity to induce oxidative stress in the lungs, is increasingly recognized as an indicator of PM toxicity. Since OP is not routinely monitored, it can be challenging to estimate exposure and health impacts. Remote sensing data are commonly used to estimate PM mass concentration, but have never been used to estimate OP. In this st…

    Submitted 1 July, 2024; originally announced July 2024.

  8. arXiv:2401.08061  [pdf, other]

    cs.LG cs.CV

    Augmenting Ground-Level PM2.5 Prediction via Kriging-Based Pseudo-Label Generation

    Authors: Lei Duan, Ziyang Jiang, David Carlson

    Abstract: Fusing abundant satellite data with sparse ground measurements constitutes a major challenge in climate modeling. To address this, we propose a strategy to augment the training dataset by introducing unlabeled satellite images paired with pseudo-labels generated through a spatial interpolation technique known as ordinary kriging, thereby making full use of the available satellite data resources. W…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 8 pages, 4 figures, NeurIPS 2023 Workshop: Tackling Climate Change with Machine Learning

  9. arXiv:2311.04259  [pdf, other]

    astro-ph.IM astro-ph.HE cs.DC

    Ookami: An A64FX Computing Resource

    Authors: A. C. Calder, E. Siegmann, C. Feldman, S. Chheda, D. C. Smolarski, F. D. Swesty, A. Curtis, J. Dey, D. Carlson, B. Michalowicz, R. J. Harrison

    Abstract: We present a look at Ookami, a project providing community access to a testbed supercomputer with the ARM-based A64FX processors developed by a collaboration between RIKEN and Fujitsu and deployed in the Japanese supercomputer Fugaku. We describe the project, provide details about the user base and education/training program, and present highlights from performance studies of two astrophysical sim…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 9 pages, 3 figures, submitted to the Proceedings of 15th International Conference on Numerical Modeling of Space Plasma Flows

  10. arXiv:2306.07918  [pdf, other]

    cs.LG stat.ML

    Causal Mediation Analysis with Multi-dimensional and Indirectly Observed Mediators

    Authors: Ziyang Jiang, Yiling Liu, Michael H. Klein, Ahmed Aloui, Yiman Ren, Keyu Li, Vahid Tarokh, David Carlson

    Abstract: Causal mediation analysis (CMA) is a powerful method to dissect the total effect of a treatment into direct and mediated effects within the potential outcome framework. This is important in many scientific applications to identify the underlying mechanisms of a treatment effect. However, in many scientific applications the mediator is unobserved, but there may exist related measurements. For examp…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 16 pages, 4 figures, 5 tables

  11. arXiv:2302.02009  [pdf, other]

    cs.LG stat.ML

    Domain Adaptation via Rebalanced Sub-domain Alignment

    Authors: Yiling Liu, Juncheng Dong, Ziyang Jiang, Ahmed Aloui, Keyu Li, Hunter Klein, Vahid Tarokh, David Carlson

    Abstract: Unsupervised domain adaptation (UDA) is a technique used to transfer knowledge from a labeled source domain to a different but related unlabeled target domain. While many UDA methods have shown success in the past, they often assume that the source and target domains must have identical class label distributions, which can limit their effectiveness in real-world scenarios. To address this limitati…

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 20 pages, 6 figures, 4 tables

  12. arXiv:2301.11351  [pdf, other]

    cs.LG stat.ML

    Estimating Causal Effects using a Multi-task Deep Ensemble

    Authors: Ziyang Jiang, Zhuoran Hou, Yiling Liu, Yiman Ren, Keyu Li, David Carlson

    Abstract: A number of methods have been proposed for causal effect estimation, yet few have demonstrated efficacy in handling data with complex structures, such as images. To fill this gap, we propose Causal Multi-task Deep Ensemble (CMDE), a novel framework that learns both shared and group-specific information from the study population. We provide proofs demonstrating equivalency of CMDE to a multi-task G…

    Submitted 27 May, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: 18 pages, 7 figures, 3 tables, published at the 40th International Conference on Machine Learning (ICML 2023)

  13. arXiv:2208.04351  [pdf, other]

    cs.SE cs.LG

    Learning to Learn to Predict Performance Regressions in Production at Meta

    Authors: Moritz Beller, Hongyu Li, Vivek Nair, Vijayaraghavan Murali, Imad Ahmad, Jürgen Cito, Drew Carlson, Ari Aye, Wes Dyer

    Abstract: Catching and attributing code change-induced performance regressions in production is hard; predicting them beforehand, even harder. A primer on automatically learning to predict performance regressions in software, this article gives an account of the experiences we gained when researching and deploying an ML-based regression prediction pipeline at Meta. In this paper, we report on a comparative…

    Submitted 22 May, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

  14. arXiv:2205.07384  [pdf, other]

    cs.LG cs.AI stat.ML

    Incorporating Prior Knowledge into Neural Networks through an Implicit Composite Kernel

    Authors: Ziyang Jiang, Tongshu Zheng, Yiling Liu, David Carlson

    Abstract: It is challenging to guide neural network (NN) learning with prior knowledge. In contrast, many known properties, such as spatial smoothness or seasonality, are straightforward to model by choosing an appropriate kernel in a Gaussian process (GP). Many deep learning applications could be enhanced by modeling such known properties. For example, convolutional neural networks (CNNs) are frequently us…

    Submitted 28 February, 2024; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: 27 pages, 13 figures, 5 tables, 3 algorithms, published in Transactions on Machine Learning Research (TMLR)

    ACM Class: I.5.1

  15. arXiv:2205.06791  [pdf, other]

    stat.ML cs.LG

    Multiple Domain Causal Networks

    Authors: Tianhui Zhou, William E. Carson IV, Michael Hunter Klein, David Carlson

    Abstract: Observational studies are regarded as economic alternatives to randomized trials, often used in their stead to investigate and determine treatment efficacy. Due to lack of sample size, observational studies commonly combine data from multiple sources or different sites/centers. Despite the benefits of an increased sample size, a naive combination of multicenter data may result in incongruities ste…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: 6 figures, 2 tables

  16. arXiv:2201.02547  [pdf, other]

    stat.ML cs.LG q-bio.GN

    AugmentedPCA: A Python Package of Supervised and Adversarial Linear Factor Models

    Authors: William E. Carson IV, Austin Talbot, David Carlson

    Abstract: Deep autoencoders are often extended with a supervised or adversarial loss to learn latent representations with desirable properties, such as greater predictivity of labels and outcomes or fairness with respect to a sensitive variable. Despite the ubiquity of supervised and adversarial deep latent factor models, these methods should demonstrate improvement over simpler linear approaches to be pre…

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: NeurIPS 2021 (Learning Meaningful Representations of Life Workshop)

  17. arXiv:2111.15347  [pdf, other]

    cs.LG

    Adversarial Factor Models for the Generation of Improved Autism Diagnostic Biomarkers

    Authors: William E. Carson IV, Dmitry Isaev, Samantha Major, Guillermo Sapiro, Geraldine Dawson, David Carlson

    Abstract: Discovering reliable measures that inform on autism spectrum disorder (ASD) diagnosis is critical for providing appropriate and timely treatment for this neurodevelopmental disorder. In this work we present applications of adversarial linear factor models in the creation of improved biomarkers for ASD diagnosis. First, we demonstrate that an adversarial linear factor model can be used to remove co…

    Submitted 24 September, 2021; originally announced November 2021.

    Comments: 5 pages, 3 figures

  18. arXiv:2111.09993  [pdf, other]

    cs.LG eess.IV physics.med-ph

    Esophageal virtual disease landscape using mechanics-informed machine learning

    Authors: Sourav Halder, Jun Yamasaki, Shashank Acharya, Wenjun Kou, Guy Elisha, Dustin A. Carlson, Peter J. Kahrilas, John E. Pandolfino, Neelesh A. Patankar

    Abstract: The pathogenesis of esophageal disorders is related to the esophageal wall mechanics. Therefore, to understand the underlying fundamental mechanisms behind various esophageal disorders, it is crucial to map the esophageal wall mechanics-based parameters onto physiological and pathophysiological conditions corresponding to altered bolus transit and supraphysiologic IBP. In this work, we present a h…

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: 26 pages, 17 figures

    Journal ref: Artificial Intelligence in Medicine. 134 (2022) 102435

  19. arXiv:2110.01664  [pdf, other]

    stat.ML cs.LG

    Estimating Potential Outcome Distributions with Collaborating Causal Networks

    Authors: Tianhui Zhou, William E Carson IV, David Carlson

    Abstract: Traditional causal inference approaches leverage observational study data to estimate the difference in observed and unobserved outcomes for a potential treatment, known as the Conditional Average Treatment Effect (CATE). However, CATE corresponds to the comparison on the first moment alone, and as such may be insufficient in reflecting the full picture of treatment effects. As an alternative, est…

    Submitted 20 September, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: https://openreview.net/forum?id=q1Fey9feu7

    Journal ref: Transactions on Machine Learning Research, 2022

  20. arXiv:2109.04561  [pdf, other]

    stat.ML cs.LG stat.AP

    Supervising the Decoder of Variational Autoencoders to Improve Scientific Utility

    Authors: Liyun Tu, Austin Talbot, Neil Gallagher, David Carlson

    Abstract: Probabilistic generative models are attractive for scientific modeling because their inferred parameters can be used to generate hypotheses and design experiments. This requires that the learned model provide an accurate representation of the input data and yield a latent space that effectively predicts outcomes relevant to the scientific question. Supervised Variational Autoencoders (SVAEs) have…

    Submitted 8 July, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  21. A multi-stage machine learning model on diagnosis of esophageal manometry

    Authors: Wenjun Kou, Dustin A. Carlson, Alexandra J. Baumann, Erica N. Donnan, Jacob M. Schauer, Mozziyar Etemadi, John E. Pandolfino

    Abstract: High-resolution manometry (HRM) is the primary procedure used to diagnose esophageal motility disorders. Its interpretation and classification includes an initial evaluation of swallow-level outcomes and then derivation of a study-level diagnosis based on Chicago Classification (CC), using a tree-like algorithm. This diagnostic approach on motility disorders using HRM was mirrored using a multi-s…

    Submitted 24 May, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

    Journal ref: Artificial Intelligence in Medicine, Volume 124, February 2022, 102233

  22. Ookami: Deployment and Initial Experiences

    Authors: Andrew Burford, Alan C. Calder, David Carlson, Barbara Chapman, Firat Coşkun, Tony Curtis, Catherine Feldman, Robert J. Harrison, Yan Kang, Benjamin Michalowicz, Eric Raut, Eva Siegmann, Daniel G. Wood, Robert L. Deleon, Mathew Jones, Nikolay A. Simakov, Joseph P. White, Dossay Oryspayev

    Abstract: Ookami is a computer technology testbed supported by the United States National Science Foundation. It provides researchers with access to the A64FX processor developed by Fujitsu in collaboration with RIKEN for the Japanese path to exascale computing, as deployed in Fugaku, the fastest computer in the world. By focusing on crucial architectural details, the ARM-based, multi-core, 512-bit SIMD-vec…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 14 pages, 7 figures, PEARC '21: Practice and Experience in Advanced Research Computing, July 18–22, 2021, Boston, MA, USA

  23. arXiv:2004.05209  [pdf, other]

    stat.ML cs.LG q-bio.NC

    Estimating a Brain Network Predictive of Stress and Genotype with Supervised Autoencoders

    Authors: Austin Talbot, David Dunson, Kafui Dzirasa, David Carlson

    Abstract: Targeted stimulation of the brain has the potential to treat mental illnesses. We propose an approach to help design the stimulation protocol by identifying electrical dynamics across many brain regions that relate to illness states. We model multi-region electrical activity as a superposition of activity from latent networks, where the weights on the latent networks relate to an outcome of intere…

    Submitted 7 March, 2023; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: 43 pages, 9 figures

  24. arXiv:2002.05212  [pdf, other]

    stat.ML cs.LG

    Estimating Uncertainty Intervals from Collaborating Networks

    Authors: Tianhui Zhou, Yitong Li, Yuan Wu, David Carlson

    Abstract: Effective decision making requires understanding the uncertainty inherent in a prediction. In regression, this uncertainty can be estimated by a variety of methods; however, many of these methods are laborious to tune, generate overconfident uncertainty intervals, or lack sharpness (give imprecise intervals). We address these challenges by proposing a novel method to capture predictive distributio…

    Submitted 12 November, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: This paper has been accepted for publication by the Journal of Machine Learning Research (JMLR)

    Journal ref: Journal of Machine Learning Research. 22 (2021) 1-47

  25. arXiv:1910.02187  [pdf, other]

    cs.LG cs.CL cs.SI stat.ML

    Dynamic Embedding on Textual Networks via a Gaussian Process

    Authors: Pengyu Cheng, Yitong Li, Xinyuan Zhang, Liqun Cheng, David Carlson, Lawrence Carin

    Abstract: Textual network embedding aims to learn low-dimensional representations of text-annotated nodes in a graph. Prior work in this area has typically focused on fixed graph structures; however, real-world networks are often dynamic. We address this challenge with a novel end-to-end node-embedding model, called Dynamic Embedding for Textual Networks with a Gaussian Process (DetGP). After training, DetG…

    Submitted 27 November, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: Accepted for presentation at the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20)

  26. arXiv:1906.01998  [pdf, other]

    cs.LG stat.ML

    The Secrets of Machine Learning: Ten Things You Wish You Had Known Earlier to be More Effective at Data Analysis

    Authors: Cynthia Rudin, David Carlson

    Abstract: Despite the widespread usage of machine learning throughout organizations, there are some key principles that are commonly missed. In particular: 1) There are at least four main families for supervised learning: logical modeling methods, linear combination methods, case-based reasoning methods, and iterative summarization methods. 2) For many application domains, almost all machine learning method…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: INFORMS TutORial 2019

  27. arXiv:1903.06336  [pdf, other]

    stat.ML cs.LG

    On Target Shift in Adversarial Domain Adaptation

    Authors: Yitong Li, Michael Murias, Samantha Major, Geraldine Dawson, David E. Carlson

    Abstract: Discrepancy between training and testing domains is a fundamental problem in the generalization of machine learning techniques. Recently, several approaches have been proposed to learn domain invariant feature representations through adversarial deep learning. However, label shift, where the percentage of data in each class is different between domains, has received less attention. Label shift nat…

    Submitted 14 March, 2019; originally announced March 2019.

  28. Phoenix: An Epidemic Approach to Time Reconstruction

    Authors: Jayant Gupchup, Douglas Carlson, Răzvan Musăloiu-E., Alex Szalay, Andreas Terzis

    Abstract: Harsh deployment environments and uncertain run-time conditions create numerous challenges for postmortem time reconstruction methods. For example, motes often reboot and thus lose their clock state, considering that the majority of mote platforms lack a real-time clock. While existing time reconstruction methods for long-term data gathering networks rely on a persistent basestation for assigning…

    Submitted 2 February, 2019; originally announced February 2019.

    Journal ref: EWSN 2010 Proceedings of the 7th European Conference on Wireless Sensor Networks

  29. arXiv:1812.02784  [pdf, other]

    cs.CV

    StoryGAN: A Sequential Conditional GAN for Story Visualization

    Authors: Yitong Li, Zhe Gan, Yelong Shen, Jingjing Liu, Yu Cheng, Yuexin Wu, Lawrence Carin, David Carlson, Jianfeng Gao

    Abstract: We propose a new task, called Story Visualization. Given a multi-sentence paragraph, the story is visualized by generating a sequence of images, one for each sentence. In contrast to video generation, story visualization focuses less on the continuity in generated images (frames), but more on the global consistency across dynamic scenes and characters -- a challenge that has not been addressed by…

    Submitted 18 April, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

  30. arXiv:1710.00421  [pdf, other]

    cs.MM

    Video Generation From Text

    Authors: Yitong Li, Martin Renqiang Min, Dinghan Shen, David Carlson, Lawrence Carin

    Abstract: Generating videos from text has proven to be a significant challenge for existing generative models. We tackle this problem by training a conditional generative model to extract both static and dynamic information from text. This is manifested in a hybrid framework, employing a Variational Autoencoder (VAE) and a Generative Adversarial Network (GAN). The static features, called "gist," are used to…

    Submitted 1 October, 2017; originally announced October 2017.

  31. arXiv:1704.08306  [pdf, other]

    q-bio.NC cs.NE stat.ML

    A Digital Neuromorphic Architecture Efficiently Facilitating Complex Synaptic Response Functions Applied to Liquid State Machines

    Authors: Michael R. Smith, Aaron J. Hill, Kristofor D. Carlson, Craig M. Vineyard, Jonathon Donaldson, David R. Follett, Pamela L. Follett, John H. Naegle, Conrad D. James, James B. Aimone

    Abstract: Information in neural networks is represented as weighted connections, or synapses, between neurons. This poses a problem as the primary computational bottleneck for neural networks is the vector-matrix multiply when inputs are multiplied by the neural network weights. Conventional processing architectures are not well suited for simulating neural networks, often requiring large amounts of energy…

    Submitted 21 March, 2017; originally announced April 2017.

    Comments: 8 pages, 4 Figures, Preprint of 2017 IJCNN

  32. arXiv:1612.03770  [pdf, other]

    cs.NE cs.LG stat.ML

    Neurogenesis Deep Learning

    Authors: Timothy J. Draelos, Nadine E. Miner, Christopher C. Lamb, Jonathan A. Cox, Craig M. Vineyard, Kristofor D. Carlson, William M. Severa, Conrad D. James, James B. Aimone

    Abstract: Neural machine learning methods, such as deep neural networks (DNN), have achieved remarkable success in a number of complex data processing tasks. These methods have arguably had their strongest impact on tasks such as image and audio processing - data processing domains in which humans have long held clear advantages over conventional algorithms. In contrast to biological neural systems, which a…

    Submitted 28 March, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: 8 pages, 8 figures, Accepted to 2017 International Joint Conference on Neural Networks (IJCNN 2017)

    Report number: SAND2017-2174 C

  33. arXiv:1512.07962  [pdf, other]

    stat.ML cs.LG

    Bridging the Gap between Stochastic Gradient MCMC and Stochastic Optimization

    Authors: Changyou Chen, David Carlson, Zhe Gan, Chunyuan Li, Lawrence Carin

    Abstract: Stochastic gradient Markov chain Monte Carlo (SG-MCMC) methods are Bayesian analogs to popular stochastic optimization methods; however, this connection is not well studied. We explore this relationship by applying simulated annealing to an SG-MCMC algorithm. Furthermore, we extend recent SG-MCMC methods with two key components: i) adaptive preconditioners (as in ADAgrad or RMSprop), and ii) adapti…

    Submitted 5 August, 2016; v1 submitted 25 December, 2015; originally announced December 2015.

    Comments: Merry Christmas from the Santa (algorithm). AISTATS 2016

  34. arXiv:1511.04156  [pdf, ps, other]

    stat.ML cs.LG q-bio.NC

    Neuroprosthetic decoder training as imitation learning

    Authors: Josh Merel, David Carlson, Liam Paninski, John P. Cunningham

    Abstract: Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user's intention is not directly observable, recent methods have demonstrated value in training the decoder agai…

    Submitted 14 March, 2016; v1 submitted 12 November, 2015; originally announced November 2015.

  35. arXiv:1509.07087  [pdf, other]

    stat.ML cs.LG

    Deep Temporal Sigmoid Belief Networks for Sequence Modeling

    Authors: Zhe Gan, Chunyuan Li, Ricardo Henao, David Carlson, Lawrence Carin

    Abstract: Deep dynamic generative models are developed to learn sequential dependencies in time-series data. The multi-layered model is designed by constructing a hierarchy of temporal sigmoid belief networks (TSBNs), defined as a sequential stack of sigmoid belief networks (SBNs). Each SBN has a contextual hidden state, inherited from the previous SBNs in the sequence, and is used to regulate its hidden bi…

    Submitted 23 September, 2015; originally announced September 2015.

    Comments: to appear in NIPS 2015