Showing 1–50 of 61 results for author: Scott, J G

  1. arXiv:2412.10937  [pdf, other]

    q-bio.PE physics.bio-ph

    Asymmetric Interactions Shape Survival During Population Range Expansions

    Authors: Jason M. Gray, Rowan J. Barker-Clarke, Jacob G. Scott, Michael Hinczewski

    Abstract: An organism that is newly introduced into an existing population has a survival probability that is dependent on both the population density of its environment and the competition it experiences with the members of that population. Expanding populations naturally form regions of high and low density, and simultaneously experience ecological interactions both internally and at the boundary of their…

    Submitted 14 December, 2024; originally announced December 2024.

    Comments: 21 pages, 5 figures, 3 tables

  2. arXiv:2410.19105  [pdf, other]

    stat.ML cs.AI cs.LG stat.AP

    Conditional diffusions for amortized neural posterior estimation

    Authors: Tianyu Chen, Vansh Bansal, James G. Scott

    Abstract: Neural posterior estimation (NPE), a simulation-based computational approach for Bayesian inference, has shown great success in approximating complex posterior distributions. Existing NPE methods typically rely on normalizing flows, which approximate a distribution by composing many simple, invertible transformations. But flow-based models, while state of the art for NPE, are known to suffer from…

    Submitted 12 March, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  3. Inferring Density-Dependent Population Dynamics Mechanisms through Rate Disambiguation for Logistic Birth-Death Processes

    Authors: Linh Huynh, Jacob G. Scott, Peter J. Thomas

    Abstract: Density dependence is important in the ecology and evolution of microbial and cancer cells. Typically, we can only measure net growth rates, but the underlying density-dependent mechanisms that give rise to the observed dynamics can manifest in birth processes, death processes, or both. Therefore, we utilize the mean and variance of cell number fluctuations to separately identify birth and death r…

    Submitted 10 May, 2022; originally announced May 2022.

  4. arXiv:2107.08481  [pdf, other]

    cs.DL

    Accessing United States Bulk Patent Data with patentpy and patentr

    Authors: James Yu, Hayley Beltz, Milind Y. Desai, Péter Érdi, Jacob G. Scott, Raoul R. Wadhwa

    Abstract: The United States Patent and Trademark Office (USPTO) provides publicly accessible bulk data files containing information for all patents from 1976 onward. However, the format of these files changes over time and is memory-inefficient, which can pose issues for individual researchers. Here, we introduce the patentpy and patentr packages for the Python and R programming languages. They allow users…

    Submitted 18 July, 2021; originally announced July 2021.

  5. arXiv:2010.15222  [pdf, other]

    cs.SI

    Exploring complex networks with the ICON R package

    Authors: Raoul R. Wadhwa, Jacob G. Scott

    Abstract: We introduce ICON, an R package that contains 1075 complex network datasets in a standard edgelist format. All provided datasets have associated citations and have been indexed by the Colorado Index of Complex Networks - also referred to as ICON. In addition to supplying a large and diverse corpus of useful real-world networks, ICON also implements an S3 generic to work with the network and ggnetw…

    Submitted 28 October, 2020; originally announced October 2020.

  6. arXiv:1912.06946  [pdf, other]

    stat.AP

    Monotone function estimation in the presence of extreme data coarsening: Analysis of preeclampsia and birth weight in urban Uganda

    Authors: Jennifer E. Starling, Catherine E. Aiken, Jared S. Murray, Annettee Nakimuli, James G. Scott

    Abstract: This paper proposes a Bayesian hierarchical model to characterize the relationship between birth weight and maternal pre-eclampsia across gestation at a large maternity hospital in urban Uganda. Key scientific questions we investigate include: 1) how pre-eclampsia compares to other maternal-fetal covariates as a predictor of birth weight; and 2) whether the impact of pre-eclampsia on birthweight v…

    Submitted 14 December, 2019; originally announced December 2019.

  7. arXiv:1912.03764  [pdf, other]

    cond-mat.stat-mech q-bio.PE

    Controlling the speed and trajectory of evolution with counterdiabatic driving

    Authors: Shamreen Iram, Emily Dolson, Joshua Chiel, Julia Pelesko, Nikhil Krishnan, Özenç Güngör, Benjamin Kuznets-Speck, Sebastian Deffner, Efe Ilker, Jacob G. Scott, Michael Hinczewski

    Abstract: The pace and unpredictability of evolution are critically relevant in a variety of modern challenges: combating drug resistance in pathogens and cancer, understanding how species respond to environmental perturbations like climate change, and developing artificial selection approaches for agriculture. Great progress has been made in quantitative modeling of evolution using fitness landscapes, allo…

    Submitted 3 June, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: Main text: 19 pages, 5 figures; SI: 14 pages, 5 figures

    Journal ref: Nat. Phys. (2020)

  8. arXiv:1911.08106  [pdf, other]

    stat.AP

    How Likely are Ride-share Drivers to Earn a Living Wage? Large-scale Spatio-temporal Density Smoothing with the Graph-fused Elastic Net

    Authors: Mauricio Tec, Natalia Zuniga-Garcia, Randy B. Machemehl, James G. Scott

    Abstract: Ride-sourcing or transportation network companies (TNCs) provide on-demand transportation service for compensation, connecting drivers of personal vehicles with passengers through smartphone applications. In this study, we consider the problem of estimating a spatiotemporally varying probability distribution for the productivity of a TNC driver, using data on more than 1.2 million TNC trips in Aus…

    Submitted 9 July, 2021; v1 submitted 19 November, 2019; originally announced November 2019.

  9. arXiv:1905.09405  [pdf, other]

    stat.AP

    Targeted Smooth Bayesian Causal Forests: An analysis of heterogeneous treatment effects for simultaneous versus interval medical abortion regimens over gestation

    Authors: Jennifer E. Starling, Jared S. Murray, Patricia A. Lohr, Abigail R. A. Aiken, Carlos M. Carvalho, James G. Scott

    Abstract: We introduce Targeted Smooth Bayesian Causal Forests (tsBCF), a nonparametric Bayesian approach for estimating heterogeneous treatment effects which vary smoothly over a single covariate in the observational data setting. The tsBCF method induces smoothness by parameterizing terminal tree nodes with smooth functions, and allows for separate regularization of treatment effects versus prognostic eff…

    Submitted 23 February, 2020; v1 submitted 22 May, 2019; originally announced May 2019.

  10. arXiv:1812.04567  [pdf, other]

    stat.AP

    A flat persistence diagram for improved visualization of persistent homology

    Authors: Raoul R. Wadhwa, Andrew Dhawan, Drew F. K. Williamson, Jacob G. Scott

    Abstract: Visualization in the emerging field of topological data analysis has progressed from persistence barcodes and persistence diagrams to display of two-parameter persistent homology. Although persistence barcodes and diagrams have permitted insight into the geometry underlying complex datasets, visualization of even single-parameter persistent homology has significant room for improvement. Here, we p…

    Submitted 5 January, 2019; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: 4 pages, 2 figures

  11. Optimizing adaptive cancer therapy: dynamic programming and evolutionary game theory

    Authors: Mark Gluzman, Jacob G. Scott, Alexander Vladimirsky

    Abstract: Recent clinical trials have shown that adaptive drug therapy can be more efficient than a standard MTD-based policy in the treatment of cancer patients. The adaptive therapy paradigm is not based on a preset schedule; instead, the doses are administered based on the current state of the tumor. But the adaptive treatment policies examined so far have been largely ad hoc. In this paper we propose a meth…

    Submitted 10 December, 2018; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: 22 pages, 10 figures

    MSC Class: 92C50; 49N90; 49Lxx

    Journal ref: Proceedings B (2020), 287:20192454

  12. Optimal post-selection inference for sparse signals: a nonparametric empirical-Bayes approach

    Authors: Spencer Woody, Oscar Hernan Madrid Padilla, James G. Scott

    Abstract: Many recently developed Bayesian methods have focused on sparse signal detection. However, much less work has been done addressing the natural follow-up question: how to make valid inferences for the magnitude of those signals after selection. Ordinary Bayesian credible intervals suffer from selection bias, owing to the fact that the target of inference is chosen adaptively. Existing Bayesian appr…

    Submitted 13 November, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

  13. arXiv:1809.10329  [pdf, other]

    stat.AP

    Evaluation of Ride-Sourcing Search Frictions and Driver Productivity: A Spatial Denoising Approach

    Authors: Natalia Zuniga-Garcia, Mauricio Tec, James G. Scott, Natalia Ruiz-Juri, Randy B. Machemehl

    Abstract: This paper considers the problem of measuring spatial and temporal variation in driver productivity on ride-sourcing trips. This variation is especially important from a driver's perspective: if a platform's drivers experience systematic disparities in earnings because of variation in their riders' destinations, they may perceive the pricing model as inequitable. This perception can exacerbate sea…

    Submitted 11 October, 2019; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: 34 pages

  14. arXiv:1805.07656  [pdf, other]

    stat.ME

    BART with Targeted Smoothing: An analysis of patient-specific stillbirth risk

    Authors: Jennifer E. Starling, Jared S. Murray, Carlos M. Carvalho, Radek K. Bukowski, James G. Scott

    Abstract: This article introduces BART with Targeted Smoothing, or tsBART, a new Bayesian tree-based model for nonparametric regression. The goal of tsBART is to introduce smoothness over a single target covariate t, while not necessarily requiring smoothness over other covariates x. TsBART is based on the Bayesian Additive Regression Trees (BART) model, an ensemble of regression trees. TsBART extends BART…

    Submitted 3 June, 2019; v1 submitted 19 May, 2018; originally announced May 2018.

  15. arXiv:1804.00327  [pdf, other]

    stat.AP q-bio.PE

    Socioeconomic bias in influenza surveillance

    Authors: Samuel V. Scarpino, James G. Scott, Rosalind M. Eggo, Bruce Clements, Nedialko B. Dimitrov, Lauren Ancel Meyers

    Abstract: Individuals in low socioeconomic brackets are considered at-risk for developing influenza-related complications and often exhibit higher than average influenza-related hospitalization rates. This disparity has been attributed to various factors, including restricted access to preventative and therapeutic health care, limited sick leave, and household structure. Adequate influenza surveillance in t…

    Submitted 1 April, 2018; originally announced April 2018.

  16. arXiv:1708.01947  [pdf, other]

    stat.ML

    Interpretable Low-Dimensional Regression via Data-Adaptive Smoothing

    Authors: Wesley Tansey, Jesse Thomason, James G. Scott

    Abstract: We consider the problem of estimating a regression function in the common situation where the number of features is small, where interpretability of the model is a high priority, and where simple linear or additive models fail to provide adequate performance. To address this problem, we present Maximum Variance Total Variation denoising (MVTV), an approach that is conceptually related both to CART…

    Submitted 6 August, 2017; originally announced August 2017.

    Comments: 4 pages, 1 figure; presented at the 2017 ICML Workshop on Human Interpretability in Machine Learning (WHI 2017), Sydney, NSW, Australia

  17. arXiv:1705.10879  [pdf, other]

    q-bio.PE

    Evolutionary dynamics of incubation periods

    Authors: Bertrand Ottino-Loffler, Jacob G. Scott, Steven H. Strogatz

    Abstract: The incubation period of a disease is the time between an initiating pathologic event and the onset of symptoms. For typhoid fever, polio, measles, leukemia and many other diseases, the incubation period is highly variable. Some affected people take much longer than average to show symptoms, leading to a distribution of incubation periods that is right skewed and often approximately lognormal. Alt…

    Submitted 30 May, 2017; originally announced May 2017.

    Comments: 24 pages, 8 figures, 1 table

  18. arXiv:1702.07405  [pdf, other]

    stat.ML

    GapTV: Accurate and Interpretable Low-Dimensional Regression and Classification

    Authors: Wesley Tansey, James G. Scott

    Abstract: We consider the problem of estimating a regression function in the common situation where the number of features is small, where interpretability of the model is a high priority, and where simple linear or additive models fail to provide adequate performance. To address this problem, we present GapTV, an approach that is conceptually related both to CART and to the more recent CRISP algorithm, a s…

    Submitted 23 February, 2017; originally announced February 2017.

  19. arXiv:1702.07398  [pdf, other]

    stat.ML

    Deep Nonparametric Estimation of Discrete Conditional Distributions via Smoothed Dyadic Partitioning

    Authors: Wesley Tansey, Karl Pichotta, James G. Scott

    Abstract: We present an approach to deep estimation of discrete conditional probability distributions. Such models have several applications, including generative modeling of audio, image, and video data. Our approach combines two main techniques: dyadic partitioning and graph-based smoothing of the discrete space. By recursively decomposing each dimension into a series of binary splits and smoothing over t…

    Submitted 28 February, 2017; v1 submitted 23 February, 2017; originally announced February 2017.

  20. Takeover times for a simple model of network infection

    Authors: Bertrand Ottino-Löffler, Jacob G. Scott, Steven H. Strogatz

    Abstract: We study a stochastic model of infection spreading on a network. At each time step a node is chosen at random, along with one of its neighbors. If the node is infected and the neighbor is susceptible, the neighbor becomes infected. How many time steps $T$ does it take to completely infect a network of $N$ nodes, starting from a single infected node? An analogy to the classic "coupon collector" pro…

    Submitted 2 February, 2017; originally announced February 2017.

    Comments: 19 pages, 10 figures

    Journal ref: Phys. Rev. E 96, 012313 (2017)
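The update rule described in the abstract above can be sketched as a short simulation. This is a minimal illustration built only from the sentences visible in this listing, not the authors' code: the function names `takeover_time` and `complete_graph`, the adjacency-dictionary representation, and the choice of a complete graph for the demo are all assumptions made here.

```python
import random


def takeover_time(neighbors, start=0, rng=None):
    """Count time steps until every node is infected.

    `neighbors` maps each node to a list of its neighbors. The graph is
    assumed (hypothetically, for this sketch) to be connected, with at
    least one neighbor per node, so that the loop terminates.
    """
    if rng is None:
        rng = random.Random()
    nodes = list(neighbors)
    infected = {start}
    steps = 0
    while len(infected) < len(nodes):
        steps += 1
        # Choose a node uniformly at random, then one of its neighbors.
        node = rng.choice(nodes)
        neighbor = rng.choice(neighbors[node])
        # Infection passes only from an infected node to a susceptible neighbor.
        if node in infected and neighbor not in infected:
            infected.add(neighbor)
    return steps


def complete_graph(n):
    """Adjacency dictionary for a complete graph on nodes 0..n-1."""
    return {i: [j for j in range(n) if j != i] for i in range(n)}


if __name__ == "__main__":
    rng = random.Random(0)
    graph = complete_graph(20)
    times = [takeover_time(graph, rng=rng) for _ in range(200)]
    print("mean takeover time over 200 runs:", sum(times) / len(times))
```

Because each step infects at most one new node, any run on $N$ nodes needs at least $N - 1$ steps; repeating the simulation gives an empirical distribution of $T$ for whichever graph is supplied.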

  21. arXiv:1612.07867  [pdf, other]

    stat.ME

    Sequential nonparametric tests for a change in distribution: an application to detecting radiological anomalies

    Authors: Oscar Hernan Madrid Padilla, Alex Athey, Alex Reinhart, James G. Scott

    Abstract: We propose a sequential nonparametric test for detecting a change in distribution, based on windowed Kolmogorov–Smirnov statistics. The approach is simple, robust, highly computationally efficient, easy to calibrate, and requires no parametric assumptions about the underlying null and alternative distributions. We show that both the false-alarm rate and the power of our procedure are amenable to…

    Submitted 22 December, 2016; originally announced December 2016.

  22. arXiv:1612.00388  [pdf, other]

    stat.ML cs.LG stat.AP

    Diet2Vec: Multi-scale analysis of massive dietary data

    Authors: Wesley Tansey, Edward W. Lowe Jr., James G. Scott

    Abstract: Smart phone apps that enable users to easily track their diets have become widespread in the last decade. This has created an opportunity to discover new insights into obesity and weight loss by analyzing the eating habits of the users of such apps. In this paper, we present diet2vec: an approach to modeling latent structure in a massive database of electronic diet journals. Through an iterative c…

    Submitted 1 December, 2016; originally announced December 2016.

    Comments: Accepted to the NIPS 2016 Workshop on Machine Learning for Health

  23. arXiv:1608.03384  [pdf, other]

    math.ST

    The DFS Fused Lasso: Linear-Time Denoising over General Graphs

    Authors: Oscar Hernan Madrid Padilla, James G. Scott, James Sharpnack, Ryan J. Tibshirani

    Abstract: The fused lasso, also known as (anisotropic) total variation denoising, is widely used for piecewise constant signal estimation with respect to a given undirected graph. The fused lasso estimate is highly nontrivial to compute when the underlying graph is large and has an arbitrary structure. But for a special graph structure, namely, the chain graph, the fused lasso---or simply, 1d fused lasso---…

    Submitted 1 March, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Journal ref: Journal of Machine Learning Research, Vol. 18, No. 176, 1-36, 2018

  24. arXiv:1608.00985  [pdf, other]

    q-bio.PE q-bio.TO

    Cancer treatment scheduling and dynamic heterogeneity in social dilemmas of tumour acidity and vasculature

    Authors: Artem Kaznatcheev, Robert Vander Velde, Jacob G. Scott, David Basanta

    Abstract: Background: Tumours are diverse ecosystems with persistent heterogeneity in various cancer hallmarks like self-sufficiency of growth factor production for angiogenesis and reprogramming of energy-metabolism for aerobic glycolysis. This heterogeneity has consequences for diagnosis, treatment, and disease progression. Methods: We introduce the double goods game to study the dynamics of these trait…

    Submitted 2 August, 2016; originally announced August 2016.

    Comments: 14 main pages (+10 pg appendix), 3 figures

    MSC Class: 92D25; 91A06; 91A22

  25. arXiv:1606.02321  [pdf, other]

    stat.ML

    Better Conditional Density Estimation for Neural Networks

    Authors: Wesley Tansey, Karl Pichotta, James G. Scott

    Abstract: The vast majority of the neural network literature focuses on predicting point values for a given set of response variables, conditioned on a feature vector. In many cases we need to model the full joint conditional distribution over the response variables rather than simply making point predictions. In this paper, we present two novel approaches to such conditional density estimation (CDE): Multi…

    Submitted 7 June, 2016; originally announced June 2016.

    Comments: 12 pages, 3 figures, code available soon

  26. arXiv:1511.06750  [pdf, other]

    stat.ME

    A deconvolution path for mixtures

    Authors: Oscar Hernan Madrid Padilla, Nicholas G. Polson, James G. Scott

    Abstract: We propose a class of estimators for deconvolution in mixture models based on a simple two-step "bin-and-smooth" procedure applied to histogram counts. The method is both statistically and computationally efficient: by exploiting recent advances in convex optimization, we are able to provide a full deconvolution path that shows the estimate for the mixing distribution across a range of plausible d…

    Submitted 25 May, 2017; v1 submitted 20 November, 2015; originally announced November 2015.

    Journal ref: Electronic Journal of Statistics Volume 12, Number 1 (2018), 1717-1751

  27. arXiv:1509.04348  [pdf, other]

    stat.ME

    Nonparametric density estimation by histogram trend filtering

    Authors: Oscar Hernan Madrid Padilla, James G. Scott

    Abstract: We propose a novel approach for density estimation called histogram trend filtering. Our estimator arises from looking at a surrogate Poisson model for counts of observations in a partition of the support of the data. We begin by showing consistency for a variational estimator for this density estimation problem. We then study a discrete estimator that can be efficiently found via convex optimizatio…

    Submitted 6 February, 2016; v1 submitted 14 September, 2015; originally announced September 2015.

  28. arXiv:1507.07271  [pdf, other]

    stat.ME physics.data-an stat.AP

    Multiscale spatial density smoothing: an application to large-scale radiological survey and anomaly detection

    Authors: Wesley Tansey, Alex Athey, Alex Reinhart, James G. Scott

    Abstract: We consider the problem of estimating a spatially varying density function, motivated by problems that arise in large-scale radiological survey and anomaly detection. In this context, the density functions to be estimated are the background gamma-ray energy spectra at sites spread across a large geographical area, such as nuclear production and waste-storage sites, military bases, medical faciliti…

    Submitted 16 September, 2016; v1 submitted 26 July, 2015; originally announced July 2015.

    Comments: 36 pages, 10 figures

    Journal ref: Journal of the American Statistical Association, vol. 112 no. 519 (2017), pp. 1047-1063

  29. arXiv:1505.06475  [pdf, other]

    stat.ML stat.CO

    A Fast and Flexible Algorithm for the Graph-Fused Lasso

    Authors: Wesley Tansey, James G. Scott

    Abstract: We propose a new algorithm for solving the graph-fused lasso (GFL), a method for parameter estimation that operates under the assumption that the signal tends to be locally constant over a predefined graph structure. Our key insight is to decompose the graph into a set of trails which can then each be solved efficiently using techniques for the ordinary (1D) fused lasso. We leverage these trails i…

    Submitted 1 June, 2015; v1 submitted 24 May, 2015; originally announced May 2015.

    Comments: 16 pages, 6 figures

  30. arXiv:1502.06930  [pdf, ps, other]

    stat.ME stat.CO stat.ML

    Tensor decomposition with generalized lasso penalties

    Authors: Oscar Hernan Madrid Padilla, James G. Scott

    Abstract: We present an approach for penalized tensor decomposition (PTD) that estimates smoothly varying latent factors in multi-way data. This generalizes existing work on sparse tensor decomposition and penalized matrix decompositions, in a manner parallel to the generalized lasso for regression and smoothing problems. Our approach presents many nontrivial challenges at the intersection of modeling and c…

    Submitted 12 May, 2016; v1 submitted 24 February, 2015; originally announced February 2015.

  31. arXiv:1502.03175  [pdf, other]

    stat.ML cs.LG stat.ME

    Proximal Algorithms in Statistics and Machine Learning

    Authors: Nicholas G. Polson, James G. Scott, Brandon T. Willard

    Abstract: In this paper we develop proximal methods for statistical learning. Proximal point algorithms are useful in statistics and machine learning for obtaining optimization solutions for composite functions. Our approach exploits closed-form solutions of proximal operators and envelope representations based on the Moreau, Forward-Backward, Douglas-Rachford and Half-Quadratic envelopes. Envelope represen…

    Submitted 30 May, 2015; v1 submitted 10 February, 2015; originally announced February 2015.

  32. arXiv:1411.6144  [pdf, other]

    stat.ME stat.AP stat.CO

    False discovery rate smoothing

    Authors: Wesley Tansey, Oluwasanmi Koyejo, Russell A. Poldrack, James G. Scott

    Abstract: We present false discovery rate smoothing, an empirical-Bayes method for exploiting spatial structure in large multiple-testing problems. FDR smoothing automatically finds spatially localized regions of significant test statistics. It then relaxes the threshold of statistical significance within these regions, and tightens it elsewhere, in a manner that controls the overall false-discovery rate at…

    Submitted 14 November, 2016; v1 submitted 22 November, 2014; originally announced November 2014.

    Comments: Added misspecification analysis, added pathological scenario discussions, additional comparisons, new graph fused lasso algorithm

  33. arXiv:1409.3601  [pdf, other]

    stat.CO math.ST

    Vertical-likelihood Monte Carlo

    Authors: Nicholas G. Polson, James G. Scott

    Abstract: In this review, we address the use of Monte Carlo methods for approximating definite integrals of the form $Z = \int L(x) d P(x)$, where $L$ is a target function (often a likelihood) and $P$ a finite measure. We present vertical-likelihood Monte Carlo, which is an approach for designing the importance function $g(x)$ used in importance sampling. Our approach exploits a duality between two random v…

    Submitted 23 June, 2015; v1 submitted 11 September, 2014; originally announced September 2014.
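The role of the importance function $g(x)$ named in the abstract above can be made concrete with the textbook importance-sampling identity. This is only the generic estimator, not the vertical-likelihood construction itself, and it rests on assumptions not stated in this listing: that $P$ has a density $p$, that $g$ is a probability density, and that $g(x) > 0$ wherever $L(x)\,p(x) \neq 0$.

```latex
Z = \int L(x)\, dP(x)
  = \int L(x)\, \frac{p(x)}{g(x)}\, g(x)\, dx
  \approx \frac{1}{n} \sum_{i=1}^{n} L(x_i)\, \frac{p(x_i)}{g(x_i)},
  \qquad x_1, \dots, x_n \overset{\text{iid}}{\sim} g .
```

The estimator is unbiased under these assumptions, and its variance depends on how closely $g$ tracks $L\,p$, which is why the design of $g$ matters.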

  34. arXiv:1406.0177  [pdf, other]

    stat.ME

    Mixtures, envelopes, and hierarchical duality

    Authors: Nicholas G. Polson, James G. Scott

    Abstract: We develop a connection between mixture and envelope representations of objective functions that arise frequently in statistics. We refer to this connection using the term "hierarchical duality." Our results suggest an interesting and previously under-exploited relationship between marginalization and profiling, or equivalently between the Fenchel–Moreau theorem for convex functions and the Berns…

    Submitted 22 February, 2015; v1 submitted 1 June, 2014; originally announced June 2014.

  35. arXiv:1405.0506  [pdf, other]

    stat.CO

    Sampling Polya-Gamma random variates: alternate and approximate techniques

    Authors: Jesse Windle, Nicholas G. Polson, James G. Scott

    Abstract: Efficiently sampling from the Pólya-Gamma distribution, ${PG}(b,z)$, is an essential element of Pólya-Gamma data augmentation. Polson et al. (2013) show how to efficiently sample from the ${PG}(1,z)$ distribution. We build two new samplers that offer improved performance when sampling from the ${PG}(b,z)$ distribution and $b$ is not unity.

    Submitted 2 May, 2014; originally announced May 2014.

  36. arXiv:1404.3331  [pdf, other]

    stat.ME stat.ML

    Priors for Random Count Matrices Derived from a Family of Negative Binomial Processes

    Authors: Mingyuan Zhou, Oscar Hernan Madrid Padilla, James G. Scott

    Abstract: We define a family of probability distributions for random count matrices with a potentially unbounded number of rows and columns. The three distributions we consider are derived from the gamma-Poisson, gamma-negative binomial, and beta-negative binomial processes. Because the models lead to closed-form Gibbs sampling update equations, they are natural candidates for nonparametric Bayesian priors…

    Submitted 13 July, 2015; v1 submitted 12 April, 2014; originally announced April 2014.

    Comments: To appear in Journal of the American Statistical Association (Theory and Methods). 31 pages + 11 page supplement, 5 figures

  37. arXiv:1309.5078  [pdf, other]

    q-bio.TO

    A filter-flow perspective of hematogenous metastasis offers a non-genetic paradigm for personalized cancer therapy

    Authors: Jacob G. Scott, Alexander G. Fletcher, Philip K. Maini, Alexander R. A. Anderson, Philip Gerlee

    Abstract: Research into mechanisms of hematogenous metastasis has largely become genetic in focus, attempting to understand the molecular basis of 'seed-soil' relationships. Preceding this biological mechanism is the physical process of dissemination of circulating tumour cells (CTCs). We utilize a 'filter-flow' paradigm to show that assumptions about CTC dynamics strongly affect metastatic efficiency: wit…

    Submitted 19 September, 2013; originally announced September 2013.

    Comments: pre-publication draft, 2 figures, 2 tables

  38. arXiv:1308.0774  [pdf, other]

    stat.CO

    Efficient Data Augmentation in Dynamic Models for Binary and Count Data

    Authors: Jesse Windle, Carlos M. Carvalho, James G. Scott, Liang Sun

    Abstract: Dynamic linear models with Gaussian observations and Gaussian states lead to closed-form formulas for posterior simulation. However, these closed-form formulas break down when the response or state evolution ceases to be Gaussian. Dynamic, generalized linear models exemplify a class of models for which this is the case, and include, amongst other models, dynamic binomial logistic regression and dy…

    Submitted 19 September, 2013; v1 submitted 3 August, 2013; originally announced August 2013.

    Comments: 22 pages, 1 figure, 1 table

  39. arXiv:1307.6914  [pdf, other]

    q-bio.PE

    Edge effects in game theoretic dynamics of spatially structured tumours

    Authors: Artem Kaznatcheev, Jacob G. Scott, David Basanta

    Abstract: Background: Analysing tumour architecture for metastatic potential usually focuses on phenotypic differences due to cellular morphology or specific genetic mutations, but often ignores the cell's position within the heterogeneous substructure. Similar disregard for local neighborhood structure is common in mathematical models. Methods: We view the dynamics of disease progression as an evolutionar…

    Submitted 21 January, 2015; v1 submitted 25 July, 2013; originally announced July 2013.

    Comments: 14 pages, 3 figures; restructured abstract, added histology to fig. 1, added fig. 3, discussion of EMT introduced and cancer biology expanded

    MSC Class: 92C50

  40. arXiv:1307.3495  [pdf, other]

    stat.ME stat.AP

    False discovery rate regression: an application to neural synchrony detection in primary visual cortex

    Authors: James G. Scott, Ryan C. Kelly, Matthew A. Smith, Pengcheng Zhou, Robert E. Kass

    Abstract: Many approaches for multiple testing begin with the assumption that all tests in a given study should be combined into a global false-discovery-rate analysis. But this may be inappropriate for many of today's large-scale screening problems, where auxiliary information about each test is often available, and where a combined analysis can lead to poorly calibrated error rates within different subset…

    Submitted 8 June, 2014; v1 submitted 12 July, 2013; originally announced July 2013.

  41. arXiv:1306.0040  [pdf, other]

    stat.CO math.ST stat.ML

    Expectation-maximization for logistic regression

    Authors: James G. Scott, Liang Sun

    Abstract: We present a family of expectation-maximization (EM) algorithms for binary and negative-binomial logistic regression, drawing a sharp connection with the variational-Bayes algorithm of Jaakkola and Jordan (2000). Indeed, our results allow a version of this variational-Bayes approach to be re-interpreted as a true EM algorithm. We study several interesting features of the algorithm, and of this pre…

    Submitted 31 May, 2013; originally announced June 2013.

  42. arXiv:1305.4622  [pdf, other]

    q-bio.TO

    Mathematical modeling of the metastatic process

    Authors: Jacob G. Scott, Philip Gerlee, David Basanta, Alexander G. Fletcher, Philip K. Maini, Alexander RA Anderson

    Abstract: Mathematical modeling in cancer has been growing in popularity and impact since its inception in 1932. The first theoretical mathematical modeling in cancer research was focused on understanding tumor growth laws and has grown to include the competition between healthy and normal tissue, carcinogenesis, therapy and metastasis. It is the latter topic, metastasis, on which we will focus this short r…

    Submitted 21 May, 2013; v1 submitted 20 May, 2013; originally announced May 2013.

    Comments: 24 pages, 6 figures, Review

  43. arXiv:1304.3378  [pdf, other]

    stat.ME math.ST

    Nonparametric Bayesian testing for monotonicity

    Authors: James G. Scott, Thomas S. Shively, Stephen G. Walker

    Abstract: This paper studies the problem of testing whether a function is monotone from a nonparametric Bayesian perspective. Two new families of tests are constructed. The first uses constrained smoothing splines, together with a hierarchical stochastic-process prior that explicitly controls the prior probability of monotonicity. The second uses regression splines, together with two proposals for the prior…

    Submitted 1 June, 2014; v1 submitted 11 April, 2013; originally announced April 2013.

  44. arXiv:1301.4193  [pdf, other]

    q-bio.PE math.PR

    A Markov chain model of evolution in asexually reproducing populations: insight and analytical tractability in the evolutionary process

    Authors: Daniel Nichol, Peter Jeavons, Robert Bonomo, Philip K. Maini, Jerome L. Paul, Robert A. Gatenby, Alexander R. A. Anderson, Jacob G. Scott

    Abstract: The evolutionary process has been modelled in many ways using both stochastic and deterministic models. We develop an algebraic model of evolution in a population of asexually reproducing organisms in which we represent a stochastic walk in phenotype space, constrained to the edges of an underlying graph representing the genotype, with a time-homogeneous Markov Chain. We show its equivalence to a…

    Submitted 17 January, 2013; originally announced January 2013.

    Comments: 12 pages, 3 figures

  45. arXiv:1301.3934  [pdf, other]

    q-bio.TO cs.CE

    Intrinsic cell factors that influence tumourigenicity in cancer stem cells - towards hallmarks of cancer stem cells

    Authors: Jacob G. Scott, Prakash Chinnaiyan, Alexander R. A. Anderson, Anita Hjelmeland, David Basanta

    Abstract: Since the discovery of a cancer initiating side population in solid tumours, studies focussing on the role of so-called cancer stem cells in cancer initiation and progression have abounded. The biological interrogation of these cells has yielded volumes of information about their behaviour, but there has, as of yet, not been many actionable generalised theoretical conclusions. To address this poin…

    Submitted 20 August, 2013; v1 submitted 16 January, 2013; originally announced January 2013.

    Comments: 8 pages, 4 figures

  46. arXiv:1205.5182  [pdf, other]

    q-bio.TO q-bio.PE

    A mathematical model of tumor self-seeding reveals secondary metastatic deposits as drivers of primary tumor growth

    Authors: Jacob G. Scott, David Basanta, Alexander R. A. Anderson, Philip Gerlee

    Abstract: Two models of circulating tumor cell (CTC) dynamics have been proposed to explain the phenomenon of tumor 'self-seeding', whereby CTCs repopulate the primary tumor and accelerate growth: Primary Seeding, where cells from a primary tumor shed into the vasculature and return back to the primary themselves; and Secondary Seeding, where cells from the primary first metastasize in a secondary tissue an…

    Submitted 25 February, 2013; v1 submitted 23 May, 2012; originally announced May 2012.

    Comments: 20 pages, 4 figures

  47. arXiv:1205.0310  [pdf, other]

    stat.ME stat.CO stat.ML

    Bayesian inference for logistic models using Polya-Gamma latent variables

    Authors: Nicholas G. Polson, James G. Scott, Jesse Windle

    Abstract: We propose a new data-augmentation strategy for fully Bayesian inference in models with binomial likelihoods. The approach appeals to a new class of Polya-Gamma distributions, which are constructed in detail. A variety of examples are presented to show the versatility of the method, including logistic regression, negative binomial regression, nonlinear mixed-effects models, and spatial models for…

    Submitted 22 July, 2013; v1 submitted 1 May, 2012; originally announced May 2012.

  48. arXiv:1111.0617  [pdf, other]

    stat.AP

    The partition problem: case studies in Bayesian screening for time-varying model structure

    Authors: Zesong Liu, Jesse Windle, James G. Scott

    Abstract: This paper presents two case studies of data sets where the main inferential goal is to characterize time-varying patterns in model structure. Both of these examples are seen to be general cases of the so-called "partition problem," where auxiliary information (in this case, time) defines a partition over sample space, and where different models hold for each element of the partition. In the first…

    Submitted 2 November, 2011; originally announced November 2011.

  49. arXiv:1110.5789  [pdf, other]

    q-fin.ST stat.AP

    An empirical test for Eurozone contagion using an asset-pricing model with heavy-tailed stochastic volatility

    Authors: Nicholas G. Polson, James G. Scott

    Abstract: This paper proposes an empirical test of financial contagion in European equity markets during the tumultuous period of 2008-2011. Our analysis shows that traditional GARCH and Gaussian stochastic-volatility models are unable to explain two key stylized features of global markets during presumptive contagion periods: shocks to aggregate market volatility can be sudden and explosive, and they are a…

    Submitted 26 March, 2012; v1 submitted 26 October, 2011; originally announced October 2011.

  50. arXiv:1109.4180  [pdf, other]

    stat.ME math.ST stat.CO

    Default Bayesian analysis for multi-way tables: a data-augmentation approach

    Authors: Nicholas G. Polson, James G. Scott

    Abstract: This paper proposes a strategy for regularized estimation in multi-way contingency tables, which are common in meta-analyses and multi-center clinical trials. Our approach is based on data augmentation, and appeals heavily to a novel class of Polya-Gamma distributions. Our main contributions are to build up the relevant distributional theory and to demonstrate three useful features of this data-au…

    Submitted 19 September, 2011; originally announced September 2011.