-
A New Convergence Analysis of Two Stochastic Frank-Wolfe Algorithms
Authors:
Natthawut Boonsiriphatthanajaroen,
Shane G. Henderson
Abstract:
We study the convergence properties of the original and away-step Frank-Wolfe algorithms for linearly constrained stochastic optimization assuming the availability of unbiased objective function gradient estimates. The objective function is not restricted to a finite summation form, like in previous analyses tailored to machine-learning applications. To enable the use of concentration inequalities…
▽ More
We study the convergence properties of the original and away-step Frank-Wolfe algorithms for linearly constrained stochastic optimization assuming the availability of unbiased objective function gradient estimates. The objective function is not restricted to a finite summation form, like in previous analyses tailored to machine-learning applications. To enable the use of concentration inequalities we assume either a uniform bound on the variance of gradient estimates or uniformly sub-Gaussian tails on gradient estimates. With one of these regularity assumptions along with sufficient sampling, we can ensure sufficiently accurate gradient estimates. We then use a Lyapunov argument to obtain the desired complexity bounds, relying on existing geometrical results for polytopes.
△ Less
Submitted 5 April, 2025;
originally announced April 2025.
-
An anatomically-informed correspondence initialisation method to improve learning-based registration for radiotherapy
Authors:
Edward G. A. Henderson,
Marcel van Herk,
Andrew F. Green,
Eliana M. Vasquez Osorio
Abstract:
We propose an anatomically-informed initialisation method for interpatient CT non-rigid registration (NRR), using a learning-based model to estimate correspondences between organ structures. A thin plate spline (TPS) deformation, set up using the correspondence predictions, is used to initialise the scans before a second NRR step. We compare two established NRR methods for the second step: a B-spl…
▽ More
We propose an anatomically-informed initialisation method for interpatient CT non-rigid registration (NRR), using a learning-based model to estimate correspondences between organ structures. A thin plate spline (TPS) deformation, set up using the correspondence predictions, is used to initialise the scans before a second NRR step. We compare two established NRR methods for the second step: a B-spline iterative optimisation-based algorithm and a deep learning-based approach. Registration performance is evaluated with and without the initialisation by assessing the similarity of propagated structures. Our proposed initialisation improved the registration performance of the learning-based method to more closely match the traditional iterative algorithm, with the mean distance-to-agreement reduced by 1.8mm for structures included in the TPS and 0.6mm for structures not included, while maintaining a substantial speed advantage (5 vs. 72 seconds).
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
Deterministic and Stochastic Frank-Wolfe Recursion on Probability Spaces
Authors:
Di Yu,
Shane G. Henderson,
Raghu Pasupathy
Abstract:
Motivated by applications in emergency response and experimental design, we consider smooth stochastic optimization problems over probability measures supported on compact subsets of the Euclidean space. With the influence function as the variational object, we construct a deterministic Frank-Wolfe (dFW) recursion for probability spaces, made especially possible by a lemma that identifies a ``clos…
▽ More
Motivated by applications in emergency response and experimental design, we consider smooth stochastic optimization problems over probability measures supported on compact subsets of the Euclidean space. With the influence function as the variational object, we construct a deterministic Frank-Wolfe (dFW) recursion for probability spaces, made especially possible by a lemma that identifies a ``closed-form'' solution to the infinite-dimensional Frank-Wolfe sub-problem. Each iterate in dFW is expressed as a convex combination of the incumbent iterate and a Dirac measure concentrating on the minimum of the influence function at the incumbent iterate. To address common application contexts that have access only to Monte Carlo observations of the objective and influence function, we construct a stochastic Frank-Wolfe (sFW) variation that generates a random sequence of probability measures constructed using minima of increasingly accurate estimates of the influence function. We demonstrate that sFW's optimality gap sequence exhibits $O(k^{-1})$ iteration complexity almost surely and in expectation for smooth convex objectives, and $O(k^{-1/2})$ (in Frank-Wolfe gap) for smooth non-convex objectives. Furthermore, we show that an easy-to-implement fixed-step, fixed-sample version of (sFW) exhibits exponential convergence to $\varepsilon$-optimality. We end with a central limit theorem on the observed objective values at the sequence of generated random measures. To further intuition, we include several illustrative examples with exact influence function calculations.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Generating Synthetic Computed Tomography for Radiotherapy: SynthRAD2023 Challenge Report
Authors:
Evi M. C. Huijben,
Maarten L. Terpstra,
Arthur Jr. Galapon,
Suraj Pai,
Adrian Thummerer,
Peter Koopmans,
Manya Afonso,
Maureen van Eijnatten,
Oliver Gurney-Champion,
Zeli Chen,
Yiwen Zhang,
Kaiyi Zheng,
Chuanpu Li,
Haowen Pang,
Chuyang Ye,
Runqi Wang,
Tao Song,
Fuxin Fan,
Jingna Qiu,
Yixing Huang,
Juhyung Ha,
Jong Sung Park,
Alexandra Alain-Beaudoin,
Silvain Bériault,
Pengxin Yu
, et al. (34 additional authors not shown)
Abstract:
Radiation therapy plays a crucial role in cancer treatment, necessitating precise delivery of radiation to tumors while sparing healthy tissues over multiple days. Computed tomography (CT) is integral for treatment planning, offering electron density data crucial for accurate dose calculations. However, accurately representing patient anatomy is challenging, especially in adaptive radiotherapy, wh…
▽ More
Radiation therapy plays a crucial role in cancer treatment, necessitating precise delivery of radiation to tumors while sparing healthy tissues over multiple days. Computed tomography (CT) is integral for treatment planning, offering electron density data crucial for accurate dose calculations. However, accurately representing patient anatomy is challenging, especially in adaptive radiotherapy, where CT is not acquired daily. Magnetic resonance imaging (MRI) provides superior soft-tissue contrast. Still, it lacks electron density information while cone beam CT (CBCT) lacks direct electron density calibration and is mainly used for patient positioning. Adopting MRI-only or CBCT-based adaptive radiotherapy eliminates the need for CT planning but presents challenges. Synthetic CT (sCT) generation techniques aim to address these challenges by using image synthesis to bridge the gap between MRI, CBCT, and CT. The SynthRAD2023 challenge was organized to compare synthetic CT generation methods using multi-center ground truth data from 1080 patients, divided into two tasks: 1) MRI-to-CT and 2) CBCT-to-CT. The evaluation included image similarity and dose-based metrics from proton and photon plans. The challenge attracted significant participation, with 617 registrations and 22/17 valid submissions for tasks 1/2. Top-performing teams achieved high structural similarity indices (>0.87/0.90) and gamma pass rates for photon (>98.1%/99.0%) and proton (>97.3%/97.0%) plans. However, no significant correlation was found between image similarity metrics and dose accuracy, emphasizing the need for dose evaluation when assessing the clinical applicability of sCT. SynthRAD2023 facilitated the investigation and benchmarking of sCT generation techniques, providing insights for developing MRI-only and CBCT-based adaptive radiotherapy.
△ Less
Submitted 11 June, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
A multi-channel cycleGAN for CBCT to CT synthesis
Authors:
Chelsea A. H. Sargeant,
Edward G. A. Henderson,
Dónal M. McSweeney,
Aaron G. Rankin,
Denis Page
Abstract:
Image synthesis is used to generate synthetic CTs (sCTs) from on-treatment cone-beam CTs (CBCTs) with a view to improving image quality and enabling accurate dose computation to facilitate a CBCT-based adaptive radiotherapy workflow. As this area of research gains momentum, developments in sCT generation methods are difficult to compare due to the lack of large public datasets and sizeable variati…
▽ More
Image synthesis is used to generate synthetic CTs (sCTs) from on-treatment cone-beam CTs (CBCTs) with a view to improving image quality and enabling accurate dose computation to facilitate a CBCT-based adaptive radiotherapy workflow. As this area of research gains momentum, developments in sCT generation methods are difficult to compare due to the lack of large public datasets and sizeable variation in training procedures. To compare and assess the latest advancements in sCT generation, the SynthRAD2023 challenge provides a public dataset and evaluation framework for both MR and CBCT to sCT synthesis. Our contribution focuses on the second task, CBCT-to-sCT synthesis. By leveraging a multi-channel input to emphasize specific image features, our approach effectively addresses some of the challenges inherent in CBCT imaging, whilst restoring the contrast necessary for accurate visualisation of patients' anatomy. Additionally, we introduce an auxiliary fusion network to further enhance the fidelity of generated sCT images.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Modeling the Risk of In-Person Instruction during the COVID-19 Pandemic
Authors:
Brian Liu,
Yujia Zhang,
Shane G. Henderson,
David B. Shmoys,
Peter I. Frazier
Abstract:
During the COVID-19 pandemic, safely implementing in-person indoor instruction was a high priority for universities nationwide. To support this effort at the University, we developed a mathematical model for estimating the risk of SARS-CoV-2 transmission in university classrooms. This model was used to evaluate combinations of feasible interventions for classrooms at the University during the pand…
▽ More
During the COVID-19 pandemic, safely implementing in-person indoor instruction was a high priority for universities nationwide. To support this effort at the University, we developed a mathematical model for estimating the risk of SARS-CoV-2 transmission in university classrooms. This model was used to evaluate combinations of feasible interventions for classrooms at the University during the pandemic and optimize the set of interventions that would allow higher occupancy levels, matching the pre-pandemic numbers of in-person courses. Importantly, we determined that requiring masking in dense classrooms with unrestricted seating with more than 90% of students vaccinated was easy to implement, incurred little logistical or financial cost, and allowed classes to be held at full capacity. A retrospective analysis at the end of the semester confirmed the model's assessment that the proposed classroom configuration would be safe. Our framework is generalizable and was used to support reopening decisions at Stanford University. In addition, our framework is flexible and applies to a wide range of indoor settings. It was repurposed for large university events and gatherings and could be used to support planning indoor space use to avoid transmission of infectious diseases across various industries, from secondary schools to movie theaters and restaurants.
△ Less
Submitted 19 February, 2024; v1 submitted 6 October, 2023;
originally announced October 2023.
-
Energy minimization of paired composite fermion wave functions in the spherical geometry
Authors:
Greg J. Henderson,
Gunnar Möller,
Steven H. Simon
Abstract:
We perform the energy minimization of the paired composite fermion (CF) wave functions, proposed by Möller and Simon (MS) [PRB 77, 075319 (2008)] and extended by Yutushui and Mross (YM) [PRB 102, 195153 (2020)], where the energy is minimized by varying the CF pairing function, in the case of an approximate model of the Coulomb interaction in the second Landau level for pairing channels…
▽ More
We perform the energy minimization of the paired composite fermion (CF) wave functions, proposed by Möller and Simon (MS) [PRB 77, 075319 (2008)] and extended by Yutushui and Mross (YM) [PRB 102, 195153 (2020)], where the energy is minimized by varying the CF pairing function, in the case of an approximate model of the Coulomb interaction in the second Landau level for pairing channels $\ell = -1, 3, 1$ which are expected to be in the Pfaffian, anti-Pfaffian and particle-hole symmetric (PH) Pfaffian phases respectively. It is found that the energy of the $\ell = -1$ MS wave function can be reduced substantially below that of the Moore-Read wave function at small system sizes, however, in the $\ell = 3$ case the energy cannot be reduced much below that of the YM trial wavefunction. Nonetheless, both our optimized and unoptimized wavefunctions with $\ell=-1,3$ extrapolate to roughly the same energy per particle in the thermodynamic limit. For the $\ell = 1$ case, the optimization makes no qualitative difference and these PH-Pfaffian wave functions are still energetically unfavourable. The effective CF pairing is analyzed in the resulting wave functions, where the effective pairing for the $\ell = -1, 3$ channels is found to be well approximated by a weak-pairing BCS ansatz and the $\ell = 1$ wave functions show no sign of emergent CF pairing.
△ Less
Submitted 29 September, 2023;
originally announced September 2023.
-
Unsupervised correspondence with combined geometric learning and imaging for radiotherapy applications
Authors:
Edward G. A. Henderson,
Marcel van Herk,
Andrew F. Green,
Eliana M. Vasquez Osorio
Abstract:
The aim of this study was to develop a model to accurately identify corresponding points between organ segmentations of different patients for radiotherapy applications. A model for simultaneous correspondence and interpolation estimation in 3D shapes was trained with head and neck organ segmentations from planning CT scans. We then extended the original model to incorporate imaging information us…
▽ More
The aim of this study was to develop a model to accurately identify corresponding points between organ segmentations of different patients for radiotherapy applications. A model for simultaneous correspondence and interpolation estimation in 3D shapes was trained with head and neck organ segmentations from planning CT scans. We then extended the original model to incorporate imaging information using two approaches: 1) extracting features directly from image patches, and 2) including the mean square error between patches as part of the loss function. The correspondence and interpolation performance were evaluated using the geodesic error, chamfer distance and conformal distortion metrics, as well as distances between anatomical landmarks. Each of the models produced significantly better correspondences than the baseline non-rigid registration approach. The original model performed similarly to the model with direct inclusion of image features. The best performing model configuration incorporated imaging information as part of the loss function which produced more anatomically plausible correspondences. We will use the best performing model to identify corresponding anatomical points on organs to improve spatial normalisation, an important step in outcome modelling, or as an initialisation for anatomically informed registrations. All our code is publicly available at https://github.com/rrr-uom-projects/Unsup-RT-Corr-Net
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Conformal field theory approach to parton fractional quantum Hall trial wave functions
Authors:
Greg J. Henderson,
G. J. Sreejith,
Steven H. Simon
Abstract:
We show that all lowest Landau level projected and unprojected chiral parton type fractional quantum Hall ground and edge state trial wave functions, which take the form of products of integer quantum Hall wave functions, can be expressed as conformal field theory (CFT) correlation functions, where we can associate a chiral algebra to each parton state such that the CFT defined by the algebra is t…
▽ More
We show that all lowest Landau level projected and unprojected chiral parton type fractional quantum Hall ground and edge state trial wave functions, which take the form of products of integer quantum Hall wave functions, can be expressed as conformal field theory (CFT) correlation functions, where we can associate a chiral algebra to each parton state such that the CFT defined by the algebra is the ``smallest'' such CFT that can generate the corresponding ground and edge state trial wave functions. A field-theoretic generalisation of Laughlin's plasma analogy, known as generalised screening, is formulated for these states. If this holds, we argue that the inner products of edge state trial wave functions, for parton states where the ``densest'' trial wave function is unique, can be expressed as matrix elements of an exponentiated local action operator of the CFT, generalising the result of Dubail et al. [PRB 85, 11531 (2012)], which implies the equality between edge state and entanglement level counting to state counting in the corresponding CFT. We numerically test this result in two specific cases. We discuss how Read's arguments [PRB 79, 045308 (2009)] still apply, implying that conformal blocks of the CFT defined by the corresponding chiral algebra are valid quasi-hole trial wave functions, with the adiabatic braiding statistics given by the monodromy of these functions, assuming the existence of a quasi-particle trapping Hamiltonian. Generalisations of these constructions are discussed. It is shown that all chiral composite fermion wave functions can be expressed as CFT correlation functions without explicit symmetrisation or anti-symmetrisation and that the ground, edge, and certain quasi-hole trial wave functions of the $φ_n^m$ parton states can be expressed as the conformal blocks of the $U(1) \otimes SU(n)_m$ WZW models.
△ Less
Submitted 9 June, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
The impact of training dataset size and ensemble inference strategies on head and neck auto-segmentation
Authors:
Edward G. A. Henderson,
Marcel van Herk,
Eliana M. Vasquez Osorio
Abstract:
Convolutional neural networks (CNNs) are increasingly being used to automate segmentation of organs-at-risk in radiotherapy. Since large sets of highly curated data are scarce, we investigated how much data is required to train accurate and robust head and neck auto-segmentation models. For this, an established 3D CNN was trained from scratch with different sized datasets (25-1000 scans) to segmen…
▽ More
Convolutional neural networks (CNNs) are increasingly being used to automate segmentation of organs-at-risk in radiotherapy. Since large sets of highly curated data are scarce, we investigated how much data is required to train accurate and robust head and neck auto-segmentation models. For this, an established 3D CNN was trained from scratch with different sized datasets (25-1000 scans) to segment the brainstem, parotid glands and spinal cord in CTs. Additionally, we evaluated multiple ensemble techniques to improve the performance of these models. The segmentations improved with training set size up to 250 scans and the ensemble methods significantly improved performance for all organs. The impact of the ensemble methods was most notable in the smallest datasets, demonstrating their potential for use in cases where large training datasets are difficult to obtain.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Generalised Automatic Anatomy Finder (GAAF): A general framework for 3D location-finding in CT scans
Authors:
Edward G. A. Henderson,
Eliana M. Vasquez Osorio,
Marcel van Herk,
Andrew F. Green
Abstract:
We present GAAF, a Generalised Automatic Anatomy Finder, for the identification of generic anatomical locations in 3D CT scans. GAAF is an end-to-end pipeline, with dedicated modules for data pre-processing, model training, and inference. At it's core, GAAF uses a custom a localisation convolutional neural network (CNN). The CNN model is small, lightweight and can be adjusted to suit the particula…
▽ More
We present GAAF, a Generalised Automatic Anatomy Finder, for the identification of generic anatomical locations in 3D CT scans. GAAF is an end-to-end pipeline, with dedicated modules for data pre-processing, model training, and inference. At it's core, GAAF uses a custom a localisation convolutional neural network (CNN). The CNN model is small, lightweight and can be adjusted to suit the particular application. The GAAF framework has so far been tested in the head and neck, and is able to find anatomical locations such as the centre-of-mass of the brainstem. GAAF was evaluated in an open-access dataset and is capable of accurate and robust localisation performance. All our code is open source and available at https://github.com/rrr-uom-projects/GAAF.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
COBRA: Cpu-Only aBdominal oRgan segmentAtion
Authors:
Edward G. A. Henderson,
Dónal M. McSweeney,
Andrew F. Green
Abstract:
Abdominal organ segmentation is a difficult and time-consuming task. To reduce the burden on clinical experts, fully-automated methods are highly desirable. Current approaches are dominated by Convolutional Neural Networks (CNNs) however the computational requirements and the need for large data sets limit their application in practice. By implementing a small and efficient custom 3D CNN, compilin…
▽ More
Abdominal organ segmentation is a difficult and time-consuming task. To reduce the burden on clinical experts, fully-automated methods are highly desirable. Current approaches are dominated by Convolutional Neural Networks (CNNs) however the computational requirements and the need for large data sets limit their application in practice. By implementing a small and efficient custom 3D CNN, compiling the trained model and optimizing the computational graph: our approach produces high accuracy segmentations (Dice Similarity Coefficient (%): Liver: 97.3$\pm$1.3, Kidneys: 94.8$\pm$3.6, Spleen: 96.4$\pm$3.0, Pancreas: 80.9$\pm$10.1) at a rate of 1.6 seconds per image. Crucially, we are able to perform segmentation inference solely on CPU (no GPU required), thereby facilitating easy and widespread deployment of the model without specialist hardware.
△ Less
Submitted 21 July, 2022;
originally announced July 2022.
-
Automatic identification of segmentation errors for radiotherapy using geometric learning
Authors:
Edward G. A. Henderson,
Andrew F. Green,
Marcel van Herk,
Eliana M. Vasquez Osorio
Abstract:
Automatic segmentation of organs-at-risk (OARs) in CT scans using convolutional neural networks (CNNs) is being introduced into the radiotherapy workflow. However, these segmentations still require manual editing and approval by clinicians prior to clinical use, which can be time consuming. The aim of this work was to develop a tool to automatically identify errors in 3D OAR segmentations without…
▽ More
Automatic segmentation of organs-at-risk (OARs) in CT scans using convolutional neural networks (CNNs) is being introduced into the radiotherapy workflow. However, these segmentations still require manual editing and approval by clinicians prior to clinical use, which can be time consuming. The aim of this work was to develop a tool to automatically identify errors in 3D OAR segmentations without a ground truth. Our tool uses a novel architecture combining a CNN and graph neural network (GNN) to leverage the segmentation's appearance and shape. The proposed model is trained using self-supervised learning using a synthetically-generated dataset of segmentations of the parotid and with realistic contouring errors. The effectiveness of our model is assessed with ablation tests, evaluating the efficacy of different portions of the architecture as well as the use of transfer learning from an unsupervised pretext task. Our best performing model predicted errors on the parotid gland with a precision of 85.0% & 89.7% for internal and external errors respectively, and recall of 66.5% & 68.6%. This offline QA tool could be used in the clinical pathway, potentially decreasing the time clinicians spend correcting contours by detecting regions which require their attention. All our code is publicly available at https://github.com/rrr-uom-projects/contour_auto_QATool.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Control of light-atom solitons and atomic transport by optical vortex beams propagating through a Bose-Einstein Condensate
Authors:
Grant Henderson,
Gordon R. M. Robb,
Gian-Luca Oppo,
Alison M. Yao
Abstract:
We model propagation of far-red-detuned optical vortex beams through a Bose-Einstein Condensate using nonlinear Schrödinger and Gross-Pitaevskii equations. We show the formation of coupled light/atomic solitons that rotate azimuthally before moving off tangentially, carrying angular momentum. The number, and velocity, of solitons, depends on the orbital angular momentum of the optical field. Using…
▽ More
We model propagation of far-red-detuned optical vortex beams through a Bose-Einstein Condensate using nonlinear Schrödinger and Gross-Pitaevskii equations. We show the formation of coupled light/atomic solitons that rotate azimuthally before moving off tangentially, carrying angular momentum. The number, and velocity, of solitons, depends on the orbital angular momentum of the optical field. Using a Bessel-Gauss beam increases radial confinement so that solitons can rotate with fixed azimuthal velocity. Our model provides a highly controllable method of channelling a BEC and atomic transport.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Entanglement Action for the Real-Space Entanglement Spectra of Composite Fermion Wave Functions
Authors:
Greg J. Henderson,
G J Sreejith,
Steven H. Simon
Abstract:
We argue and numerically substantiate that the real-space entanglement spectrum (RSES) of composite fermion quantum Hall states is given by the spectrum of a local boundary perturbation of a $(1+1)$d conformal field theory (CFT), which describes an effective edge dynamics along the real-space cut. The cut-and-glue approach suggests that the low-lying RSES is equivalent to the low-lying modes of so…
▽ More
We argue and numerically substantiate that the real-space entanglement spectrum (RSES) of composite fermion quantum Hall states is given by the spectrum of a local boundary perturbation of a $(1+1)$d conformal field theory (CFT), which describes an effective edge dynamics along the real-space cut. The cut-and-glue approach suggests that the low-lying RSES is equivalent to the low-lying modes of some effective edge action. The general structure of this action is deduced by mapping to a boundary critical problem, generalizing work of Dubail, Read, and Rezayi [PRB 85, 11531 (2012)]. Using trial wave functions we numerically test our model of the RSES for the $ν= 2/3$ bosonic composite fermion state.
△ Less
Submitted 21 August, 2021;
originally announced August 2021.
-
Variance Reduction in Simulation of Multiclass Processing Networks
Authors:
Shane G. Henderson,
Sean P. Meyn
Abstract:
We use simulation to estimate the steady-state performance of a stable multiclass queueing network. Standard estimators have been seen to perform poorly when the network is heavily loaded. We introduce two new simulation estimators. The first provides substantial variance reductions in moderately-loaded networks at very little additional computational cost. The second estimator provides substantia…
▽ More
We use simulation to estimate the steady-state performance of a stable multiclass queueing network. Standard estimators have been seen to perform poorly when the network is heavily loaded. We introduce two new simulation estimators. The first provides substantial variance reductions in moderately-loaded networks at very little additional computational cost. The second estimator provides substantial variance reductions in heavy traffic, again for a small additional computational cost. Both methods employ the variance reduction method of control variates, and differ in terms of how the control variates are constructed.
△ Less
Submitted 28 May, 2020;
originally announced May 2020.
-
Collisionally inhomogeneous Bose-Einstein condensates with a linear interaction gradient
Authors:
Andrea Di Carli,
Grant Henderson,
Stuart Flannigan,
Craig D. Colquhoun,
Matthew Mitchell,
Gian-Luca Oppo,
Andrew J. Daley,
Stefan Kuhr,
Elmar Haller
Abstract:
We study the evolution of a collisionally inhomogeneous matter wave in a spatial gradient of the interaction strength. Starting with a Bose-Einstein condensate with weak repulsive interactions in quasi-one-dimensional geometry, we monitor the evolution of a matter wave that simultaneously extends into spatial regions with attractive and repulsive interactions. We observe the formation and the deca…
▽ More
We study the evolution of a collisionally inhomogeneous matter wave in a spatial gradient of the interaction strength. Starting with a Bose-Einstein condensate with weak repulsive interactions in quasi-one-dimensional geometry, we monitor the evolution of a matter wave that simultaneously extends into spatial regions with attractive and repulsive interactions. We observe the formation and the decay of soliton-like density peaks, counter-propagating self-interfering wave packets, and the creation of cascades of solitons. The matter-wave dynamics is well reproduced in numerical simulations based on the nonpolynomial Schroedinger equation with three-body loss, allowing us to better understand the underlying behaviour based on a wavelet transformation. Our analysis provides new understanding of collapse processes for solitons, and opens interesting connections to other nonlinear instabilities.
△ Less
Submitted 19 May, 2020; v1 submitted 24 August, 2019;
originally announced August 2019.
-
Excitation modes of bright matter-wave solitons
Authors:
Andrea Di Carli,
Craig D. Colquhoun,
Grant Henderson,
Stuart Flannigan,
Gian-Luca Oppo,
Andrew J. Daley,
Stefan Kuhr,
Elmar Haller
Abstract:
We experimentally study the excitation modes of bright matter-wave solitons in a quasi-one-dimensional geometry. The solitons are created by quenching the interactions of a Bose-Einstein condensate of cesium atoms from repulsive to attractive in combination with a rapid reduction of the longitudinal confinement. A deliberate mismatch of quench parameters allows for the excitation of breathing mode…
▽ More
We experimentally study the excitation modes of bright matter-wave solitons in a quasi-one-dimensional geometry. The solitons are created by quenching the interactions of a Bose-Einstein condensate of cesium atoms from repulsive to attractive in combination with a rapid reduction of the longitudinal confinement. A deliberate mismatch of quench parameters allows for the excitation of breathing modes of the emerging soliton and for the determination of its breathing frequency as a function of atom number and confinement. In addition, we observe signatures of higher-order solitons and the splitting of the wave packet after the quench. Our experimental results are compared to analytical predictions and to numerical simulations of the one-dimensional Gross-Pitaevskii equation.
△ Less
Submitted 18 July, 2019; v1 submitted 9 May, 2019;
originally announced May 2019.
-
Comparing the Finite-Time Performance of Simulation-Optimization Algorithms
Authors:
Naijia Dong,
David J. Eckman,
Matthias Poloczek,
Xueqi Zhao,
Shane G. Henderson
Abstract:
We empirically evaluate the finite-time performance of several simulation-optimization algorithms on a testbed of problems with the goal of motivating further development of algorithms with strong finite-time performance. We investigate if the observed performance of the algorithms can be explained by properties of the problems, e.g., the number of decision variables, the topology of the objective…
▽ More
We empirically evaluate the finite-time performance of several simulation-optimization algorithms on a testbed of problems with the goal of motivating further development of algorithms with strong finite-time performance. We investigate if the observed performance of the algorithms can be explained by properties of the problems, e.g., the number of decision variables, the topology of the objective function, or the magnitude of the simulation error.
△ Less
Submitted 22 May, 2017;
originally announced May 2017.
-
Estimating the Probability that a Function Observed with Noise is Convex
Authors:
Nanjing Jian,
Shane G. Henderson
Abstract:
Consider a real-valued function that can only be observed with stochastic noise at a finite set of design points within a Euclidean space. We wish to determine whether there exists a convex function that goes through the true function values at the design points. We develop an asymptotically consistent Bayesian sequential sampling procedure that estimates the posterior probability of this being tr…
▽ More
Consider a real-valued function that can only be observed with stochastic noise at a finite set of design points within a Euclidean space. We wish to determine whether there exists a convex function that goes through the true function values at the design points. We develop an asymptotically consistent Bayesian sequential sampling procedure that estimates the posterior probability of this being true. In each iteration, the posterior probability is estimated using Monte Carlo simulation. We offer three variance reduction methods -- change of measure, acceptance-rejection, and conditional Monte Carlo. Numerical experiments suggest that the conditional Monte Carlo method should be preferred.
△ Less
Submitted 27 July, 2018; v1 submitted 12 March, 2017;
originally announced March 2017.
-
Bayes-Optimal Entropy Pursuit for Active Choice-Based Preference Learning
Authors:
Stephen N. Pallone,
Peter I. Frazier,
Shane G. Henderson
Abstract:
We analyze the problem of learning a single user's preferences in an active learning setting, sequentially and adaptively querying the user over a finite time horizon. Learning is conducted via choice-based queries, where the user selects her preferred option among a small subset of offered alternatives. These queries have been shown to be a robust and efficient way to learn an individual's prefer…
▽ More
We analyze the problem of learning a single user's preferences in an active learning setting, sequentially and adaptively querying the user over a finite time horizon. Learning is conducted via choice-based queries, where the user selects her preferred option among a small subset of offered alternatives. These queries have been shown to be a robust and efficient way to learn an individual's preferences. We take a parametric approach and model the user's preferences through a linear classifier, using a Bayesian prior to encode our current knowledge of this classifier. The rate at which we learn depends on the alternatives offered at every time epoch. Under certain noise assumptions, we show that the Bayes-optimal policy for maximally reducing entropy of the posterior distribution of this linear classifier is a greedy policy, and that this policy achieves a linear lower bound when alternatives can be constructed from the continuum. Further, we analyze a different metric called misclassification error, proving that the performance of the optimal policy that minimizes misclassification error is bounded below by a linear function of differential entropy. Lastly, we numerically compare the greedy entropy reduction policy with a knowledge gradient policy under a number of scenarios, examining their performance under both differential entropy and misclassification error.
△ Less
Submitted 24 February, 2017;
originally announced February 2017.
-
Probabilistic Bisection Converges Almost as Quickly as Stochastic Approximation
Authors:
Peter I. Frazier,
Shane G. Henderson,
Rolf Waeber
Abstract:
The probabilistic bisection algorithm (PBA) solves a class of stochastic root-finding problems in one dimension by successively updating a prior belief on the location of the root based on noisy responses to queries at chosen points. The responses indicate the direction of the root from the queried point, and are incorrect with a fixed probability. The fixed-probability assumption is problematic i…
▽ More
The probabilistic bisection algorithm (PBA) solves a class of stochastic root-finding problems in one dimension by successively updating a prior belief on the location of the root based on noisy responses to queries at chosen points. The responses indicate the direction of the root from the queried point, and are incorrect with a fixed probability. The fixed-probability assumption is problematic in applications, and so we extend the PBA to apply when this assumption is relaxed. The extension involves the use of a power-one test at each queried point. We explore the convergence behavior of the extended PBA, showing that it converges at a rate arbitrarily close to, but slower than, the canonical "square root" rate of stochastic approximation.
△ Less
Submitted 12 December, 2016;
originally announced December 2016.
-
Minimizing Multimodular Functions and Allocating Capacity in Bike-Sharing Systems
Authors:
Daniel Freund,
Shane G. Henderson,
David B. Shmoys
Abstract:
The growing popularity of bike-sharing systems around the world has motivated recent attention to models and algorithms for their effective operation. Most of this literature focuses on their daily operation for managing asymmetric demand. In this work, we consider the more strategic question of how to (re-)allocate dock-capacity in such systems. We develop mathematical formulations for variations…
▽ More
The growing popularity of bike-sharing systems around the world has motivated recent attention to models and algorithms for their effective operation. Most of this literature focuses on their daily operation for managing asymmetric demand. In this work, we consider the more strategic question of how to (re-)allocate dock-capacity in such systems. We develop mathematical formulations for variations of this problem (either for service performance over the course of one day or for a long-run-average) and exhibit discrete convex properties in associated optimization problems. This allows us to design a practically fast polynomial-time allocation algorithm to compute an optimal solution for this problem, which can also handle practically motivated constraints, such as a limit on the number of docks moved in the system.
We apply our algorithm to data sets from Boston, New York City, and Chicago to investigate how different dock allocations can yield better service in these systems. Recommendations based on our analysis have led to changes in the system design in Chicago and New York City. Beyond optimizing for improved quality of service through better allocations, our results also provide a metric to compare the impact of strategically reallocating docks and the rebalancing of bikes.
△ Less
Submitted 14 March, 2022; v1 submitted 28 November, 2016;
originally announced November 2016.
-
Efficient Ranking and Selection in Parallel Computing Environments
Authors:
Eric C. Ni,
Dragos F. Ciocan,
Shane G. Henderson,
Susan R. Hunter
Abstract:
The goal of ranking and selection (R&S) procedures is to identify the best stochastic system from among a finite set of competing alternatives. Such procedures require constructing estimates of each system's performance, which can be obtained simultaneously by running multiple independent replications on a parallel computing platform. However, nontrivial statistical and implementation issues arise…
▽ More
The goal of ranking and selection (R&S) procedures is to identify the best stochastic system from among a finite set of competing alternatives. Such procedures require constructing estimates of each system's performance, which can be obtained simultaneously by running multiple independent replications on a parallel computing platform. However, nontrivial statistical and implementation issues arise when designing R&S procedures for a parallel computing environment. Thus we propose several design principles for parallel R&S procedures that preserve statistical validity and maximize core utilization, especially when large numbers of alternatives or cores are involved. These principles are followed closely by our parallel Good Selection Procedure (GSP), which, under the assumption of normally distributed output, (i) guarantees to select a system in the indifference zone with high probability, (ii) runs efficiently on up to 1,024 parallel cores, and (iii) in an example uses smaller sample sizes compared to existing parallel procedures, particularly for large problems (over $10^6$ alternatives). In our computational study we discuss two methods for implementing GSP on parallel computers, namely the Message-Passing Interface (MPI) and Hadoop MapReduce and show that the latter provides good protection against core failures at the expense of a significant drop in utilization due to periodic unavoidable synchronization.
△ Less
Submitted 16 June, 2015;
originally announced June 2015.
-
A Spatio-Temporal Point Process Model for Ambulance Demand
Authors:
Zhengyi Zhou,
David S. Matteson,
Dawn B. Woodard,
Shane G. Henderson,
Athanasios C. Micheas
Abstract:
Ambulance demand estimation at fine time and location scales is critical for fleet management and dynamic deployment. We are motivated by the problem of estimating the spatial distribution of ambulance demand in Toronto, Canada, as it changes over discrete 2-hour intervals. This large-scale dataset is sparse at the desired temporal resolutions and exhibits location-specific serial dependence, dail…
▽ More
Ambulance demand estimation at fine time and location scales is critical for fleet management and dynamic deployment. We are motivated by the problem of estimating the spatial distribution of ambulance demand in Toronto, Canada, as it changes over discrete 2-hour intervals. This large-scale dataset is sparse at the desired temporal resolutions and exhibits location-specific serial dependence, daily and weekly seasonality. We address these challenges by introducing a novel characterization of time-varying Gaussian mixture models. We fix the mixture component distributions across all time periods to overcome data sparsity and accurately describe Toronto's spatial structure, while representing the complex spatio-temporal dynamics through time-varying mixture weights. We constrain the mixture weights to capture weekly seasonality, and apply a conditionally autoregressive prior on the mixture weights of each component to represent location-specific short-term serial dependence and daily seasonality. While estimation may be performed using a fixed number of mixture components, we also extend to estimate the number of components using birth-and-death Markov chain Monte Carlo. The proposed model is shown to give higher statistical predictive accuracy and to reduce the error in predicting EMS operational performance by as much as two-thirds compared to a typical industry practice.
△ Less
Submitted 27 May, 2014; v1 submitted 21 January, 2014;
originally announced January 2014.
-
Travel time estimation for ambulances using Bayesian data augmentation
Authors:
Bradford S. Westgate,
Dawn B. Woodard,
David S. Matteson,
Shane G. Henderson
Abstract:
We introduce a Bayesian model for estimating the distribution of ambulance travel times on each road segment in a city, using Global Positioning System (GPS) data. Due to sparseness and error in the GPS data, the exact ambulance paths and travel times on each road segment are unknown. We simultaneously estimate the paths, travel times, and parameters of each road segment travel time distribution u…
▽ More
We introduce a Bayesian model for estimating the distribution of ambulance travel times on each road segment in a city, using Global Positioning System (GPS) data. Due to sparseness and error in the GPS data, the exact ambulance paths and travel times on each road segment are unknown. We simultaneously estimate the paths, travel times, and parameters of each road segment travel time distribution using Bayesian data augmentation. To draw ambulance path samples, we use a novel reversible jump Metropolis-Hastings step. We also introduce two simpler estimation methods based on GPS speed data. We compare these methods to a recently published travel time estimation method, using simulated data and data from Toronto EMS. In both cases, out-of-sample point and interval estimates of ambulance trip times from the Bayesian method outperform estimates from the alternative methods. We also construct probability-of-coverage maps for ambulances. The Bayesian method gives more realistic maps than the recently published method. Finally, path estimates from the Bayesian method interpolate well between sparsely recorded GPS readings and are robust to GPS location errors.
△ Less
Submitted 6 December, 2013;
originally announced December 2013.
-
Forecasting emergency medical service call arrival rates
Authors:
David S. Matteson,
Mathew W. McLean,
Dawn B. Woodard,
Shane G. Henderson
Abstract:
We introduce a new method for forecasting emergency call arrival rates that combines integer-valued time series models with a dynamic latent factor structure. Covariate information is captured via simple constraints on the factor loadings. We directly model the count-valued arrivals per hour, rather than using an artificial assumption of normality. This is crucial for the emergency medical service…
▽ More
We introduce a new method for forecasting emergency call arrival rates that combines integer-valued time series models with a dynamic latent factor structure. Covariate information is captured via simple constraints on the factor loadings. We directly model the count-valued arrivals per hour, rather than using an artificial assumption of normality. This is crucial for the emergency medical service context, in which the volume of calls may be very low. Smoothing splines are used in estimating the factor levels and loadings to improve long-term forecasts. We impose time series structure at the hourly level, rather than at the daily level, capturing the fine-scale dependence in addition to the long-term structure. Our analysis considers all emergency priority calls received by Toronto EMS between January 2007 and December 2008 for which an ambulance was dispatched. Empirical results demonstrate significantly reduced error in forecasting call arrival volume. To quantify the impact of reduced forecast errors, we design a queueing model simulation that approximates the dynamics of an ambulance system. The results show better performance as the forecasting method improves. This notion of quantifying the operational impact of improved statistical procedures may be of independent interest.
△ Less
Submitted 25 July, 2011;
originally announced July 2011.
-
The structure of amorphous, crystalline and liquid GeO2
Authors:
M. Micoulaut,
L. Cormier,
G. S. Henderson
Abstract:
Germanium dioxide ($GeO_2$) is a chemical analogue of $SiO_2$. Furthermore, it is also to some extent a structural analogue, as the low and high-pressure short-range order (tetrahedral and octahedral) is the same. However, a number of differences exist. For example, the $GeO_2$ phase diagram exhibits a smaller number of polymorphs, and all three $GeO_2$ phases (crystalline, glass, liquid) have a…
▽ More
Germanium dioxide ($GeO_2$) is a chemical analogue of $SiO_2$. Furthermore, it is also to some extent a structural analogue, as the low and high-pressure short-range order (tetrahedral and octahedral) is the same. However, a number of differences exist. For example, the $GeO_2$ phase diagram exhibits a smaller number of polymorphs, and all three $GeO_2$ phases (crystalline, glass, liquid) have an increased sensitivity to pressure, undergoing pressure induced changes at much lower pressures than their equivalent $SiO_2$ analogues. In addition, differences exist in $GeO_2$ glass in the medium range order, resulting in the glass transition temperature of germania being much lower than for silica. This review highlights the structure of amorphous $GeO_2$ by different experimental (e.g., Raman and NMR spectroscopy, neutron and x-ray diffraction) and theoretical methods (e.g., classical molecular dynamics, ab initio calculations). It also addresses the structure of liquid and crystalline $GeO_2$ that have received much less attention. Furthermore, we compare and contrast the structural differences between $GeO_2$ and $SiO_2$, as well as, along the $GeO_2-SiO_2$ join. It is probably a very timely review as interest in this compound, that can be investigated in the liquid state at relatively low temperatures and pressures, continues to increase.
△ Less
Submitted 28 September, 2006;
originally announced September 2006.
-
Nanosurgery: Observation of Peptidoglycan Strands in Lactobacillus helveticus Cell Walls
Authors:
Max Firtel,
Grant Henderson,
Igor Sokolov
Abstract:
The internal cell wall structure of the bacterium Lactobacillus helveticus has been observed in situ in aqueous solution using an atomic force microscope (AFM). The AFM tip was used not only for imaging but presumably to remove mechanically large patches of the outer cell wall after appropriate chemical treatment, which typically leaves the bacteria alive. The surface exposed after such a surger…
▽ More
The internal cell wall structure of the bacterium Lactobacillus helveticus has been observed in situ in aqueous solution using an atomic force microscope (AFM). The AFM tip was used not only for imaging but presumably to remove mechanically large patches of the outer cell wall after appropriate chemical treatment, which typically leaves the bacteria alive. The surface exposed after such a surgery revealed ca. 26 nm thick twisted strands within the cell wall. The structure and location of the observed strands are consistent with the glycan backbone of peptidoglycan fibers that give strength to the cell wall. The found structural organization of these fibers has not been observed previously.
△ Less
Submitted 5 July, 2004;
originally announced July 2004.