-
Cryptogenic stroke and migraine: using probabilistic independence and machine learning to uncover latent sources of disease from the electronic health record
Authors:
Joshua W. Betts,
John M. Still,
Thomas A. Lasko
Abstract:
Migraine is a common but complex neurological disorder that doubles the lifetime risk of cryptogenic stroke (CS). However, this relationship remains poorly characterized, and few clinical guidelines exist to reduce this associated risk. We therefore propose a data-driven approach to extract probabilistically-independent sources from electronic health record (EHR) data and create a 10-year risk-predictive model for CS in migraine patients. These sources represent external latent variables acting on the causal graph constructed from the EHR data and approximate root causes of CS in our population. A random forest model trained on patient expressions of these sources demonstrated good accuracy (ROC 0.771) and identified the top 10 most predictive sources of CS in migraine patients. These sources revealed that pharmacologic interventions were the most important factor in minimizing CS risk in our population and identified a factor related to allergic rhinitis as a potential causative source of CS in migraine patients.
Submitted 22 April, 2025;
originally announced May 2025.
-
Predict+Optimize Problem in Renewable Energy Scheduling
Authors:
Christoph Bergmeir,
Frits de Nijs,
Evgenii Genov,
Abishek Sriramulu,
Mahdi Abolghasemi,
Richard Bean,
John Betts,
Quang Bui,
Nam Trong Dinh,
Nils Einecke,
Rasul Esmaeilbeigi,
Scott Ferraro,
Priya Galketiya,
Robert Glasgow,
Rakshitha Godahewa,
Yanfei Kang,
Steffen Limmer,
Luis Magdalena,
Pablo Montero-Manso,
Daniel Peralta,
Yogesh Pipada Sunil Kumar,
Alejandro Rosales-Pérez,
Julian Ruddick,
Akylas Stratigakos,
Peter Stuckey
, et al. (3 additional authors not shown)
Abstract:
Predict+Optimize frameworks integrate forecasting and optimization to address real-world challenges such as renewable energy scheduling, where variability and uncertainty are critical factors. This paper benchmarks solutions from the IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling, focusing on forecasting renewable production and demand and optimizing energy cost. The competition attracted 49 participants in total. The top-ranked method employed stochastic optimization using LightGBM ensembles, and achieved at least a 2% reduction in energy costs compared to deterministic approaches, demonstrating that the most accurate point forecast does not necessarily guarantee the best performance in downstream optimization. The published data and problem setting establish a benchmark for further research into integrated forecasting-optimization methods for energy systems, highlighting the importance of considering forecast uncertainty in optimization models to achieve cost-effective and reliable energy management. The novelty of this work lies in its comprehensive evaluation of Predict+Optimize methodologies applied to a real-world renewable energy scheduling problem, providing insights into the scalability, generalizability, and effectiveness of the proposed solutions. Potential applications extend beyond energy systems to any domain requiring integrated forecasting and optimization, such as supply chain management, transportation planning, and financial portfolio optimization.
Submitted 14 April, 2025; v1 submitted 20 December, 2022;
originally announced December 2022.
-
Adaptive Class Weight based Dual Focal Loss for Improved Semantic Segmentation
Authors:
Md Sazzad Hossain,
Andrew P Paplinski,
John M Betts
Abstract:
In this paper, we propose a Dual Focal Loss (DFL) function, as a replacement for the standard cross entropy (CE) function to achieve a better treatment of the unbalanced classes in a dataset. Our DFL method is an improvement on the recently reported Focal Loss (FL) cross-entropy function, which proposes a scaling method that puts more weight on the examples that are difficult to classify over those that are easy. However, the scaling parameter of FL is empirically set, which is problem-dependent. In addition, like other CE variants, FL only focuses on the loss of true classes. Therefore, no loss feedback is gained from the false classes. Although focusing only on true examples increases probability on true classes and correspondingly reduces probability on false classes due to the nature of the softmax function, it does not achieve the best convergence due to avoidance of the loss on false classes. Our DFL method improves on the simple FL in two ways. Firstly, it takes the idea of FL to focus more on difficult examples than the easy ones, but evaluates loss on both true and negative classes with equal importance. Secondly, the scaling parameter of DFL has been made learnable so that it can tune itself by backpropagation rather than being dependent on manual tuning. In this way, our proposed DFL method offers an auto-tunable loss function that can reduce the class imbalance effect as well as put more focus on both true difficult examples and negative easy examples.
Submitted 26 November, 2020; v1 submitted 26 September, 2019;
originally announced September 2019.
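As a rough illustration of the scaling idea that the abstract above attributes to Focal Loss, the sketch below compares plain cross entropy with the standard focal loss form, -(1 - p)^gamma * log(p), for a single example. It is not the Dual Focal Loss itself, whose formula the abstract does not give. The function names and the default `gamma = 2.0` are assumptions chosen for illustration.

```python
import math


def cross_entropy(p_true: float) -> float:
    """Cross entropy for one example, given the predicted probability of its true class."""
    return -math.log(p_true)


def focal_loss(p_true: float, gamma: float = 2.0) -> float:
    """Standard focal loss: cross entropy scaled by (1 - p_true) ** gamma.

    gamma is the empirically set scaling parameter; gamma = 0 recovers cross entropy.
    The value 2.0 is an illustrative default, not a value taken from the abstract.
    """
    return -((1.0 - p_true) ** gamma) * math.log(p_true)


if __name__ == "__main__":
    # An easy example (p = 0.9) is down-weighted far more than a hard one (p = 0.1).
    for p in (0.9, 0.1):
        print(p, cross_entropy(p), focal_loss(p))
```

With these definitions the ratio of focal loss to cross entropy is (1 - p)^gamma, so confidently classified examples contribute much less to the total loss than misclassified ones.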
-
Prostate Segmentation from Ultrasound Images using Residual Fully Convolutional Network
Authors:
M. S. Hossain,
A. P. Paplinski,
J. M. Betts
Abstract:
Medical-imaging-based prostate cancer diagnosis uses intra-operative transrectal ultrasound (TRUS) imaging to visualize the prostate shape and location to collect tissue samples. Correct tissue sampling from the prostate requires accurate prostate segmentation in TRUS images. To achieve this, this study uses a novel residual-connection-based fully convolutional network. The advantage of this segmentation technique is that it requires no pre-processing of TRUS images to perform the segmentation. Thus, it offers faster and more straightforward prostate segmentation from TRUS images. Results show that the proposed technique can achieve around 86% Dice Similarity accuracy using only a few TRUS datasets.
Submitted 20 March, 2019;
originally announced March 2019.
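The Dice Similarity figure quoted in the abstract above is usually computed as 2|A ∩ B| / (|A| + |B|) for a predicted mask A and a reference mask B. The following is a minimal sketch of that measure on small binary masks. The function name, the nested-list mask format, and the convention of returning 1.0 for two empty masks are assumptions for illustration, not details from the paper.

```python
def dice_coefficient(pred, target):
    """Dice similarity between two binary masks given as equally shaped nested lists."""
    # Collect the coordinates of foreground pixels in each mask.
    pred_fg = {(r, c) for r, row in enumerate(pred) for c, v in enumerate(row) if v}
    target_fg = {(r, c) for r, row in enumerate(target) for c, v in enumerate(row) if v}
    if not pred_fg and not target_fg:
        return 1.0  # assumed convention: two empty masks agree perfectly
    overlap = len(pred_fg & target_fg)
    return 2.0 * overlap / (len(pred_fg) + len(target_fg))


if __name__ == "__main__":
    predicted = [[1, 1, 0],
                 [0, 1, 0]]
    reference = [[1, 0, 0],
                 [0, 1, 1]]
    # 2 shared foreground pixels, 3 foreground pixels in each mask.
    print(dice_coefficient(predicted, reference))
```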
-
A Radix Representation for each van der Waerden number $W(r, k)$ with $r$ colors: Why $\log_{r}W(r, k) < k^{2}$ is true whenever $k$ is the number of terms in the arithmetic progression
Authors:
Robert J Betts
Abstract:
Here we show that by expressing a van der Waerden number $W(r, k)$ by its radix polynomial representation, it not only is possible to locate each proper subset on $\mathbb{R}$ in which the van der Waerden number lies, but also to show that conditions exist for which the logarithm of the van der Waerden number necessarily is bounded above by the square of the number of terms $k$ in the arithmetic progression. Furthermore we also use the method to find a mathematical expression or formula for the ratio of two "consecutive" van der Waerden numbers of the kind $W(r, k)$, $W(r, k + 1)$.
Submitted 7 May, 2016; v1 submitted 24 April, 2016;
originally announced April 2016.
-
How to find the least upper bound on the van der Waerden Number $W(r, k)$ that is some integer Power of the coloring Integer $r$
Authors:
Robert J Betts
Abstract:
What is a least integer upper bound on van der Waerden number $W(r, k)$ among the powers of the integer $r$? We show how this can be found by expanding the integer $W(r, k)$ into powers of $r$. Doing this enables us to find both a least upper bound and a greatest lower bound on $W(r, k)$ that are some powers of $r$ and where the greatest lower bound is equal to or smaller than $W(r, k)$. A finite series expansion of each $W(r, k)$ into integer powers of $r$ then helps us to find also a greatest real lower bound on any $k$ for which a conjecture posed by R. Graham is true, following immediately as a particular case of the overall result.
Submitted 26 January, 2016; v1 submitted 11 December, 2015;
originally announced December 2015.
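The idea in the abstract above of expanding $W(r, k)$ into powers of $r$ can be sketched for the known values $W(2, 3) = 9$ and $W(2, 4) = 35$. The helper names are hypothetical, and the bracketing convention used here (largest power of $r$ not exceeding $n$, smallest power of $r$ strictly greater than $n$) is an assumption about how the bounds are read, not the paper's own construction.

```python
def radix_digits(n: int, r: int) -> list[int]:
    """Digits of the positive integer n in base r, most significant first."""
    digits = []
    while n > 0:
        n, d = divmod(n, r)
        digits.append(d)
    return digits[::-1]


def bracketing_powers(n: int, r: int) -> tuple[int, int]:
    """Return (r**m, r**(m + 1)) with r**m <= n < r**(m + 1)."""
    m = len(radix_digits(n, r)) - 1  # index of the leading digit
    return r ** m, r ** (m + 1)


if __name__ == "__main__":
    # Known two-color van der Waerden numbers: W(2, 3) = 9 and W(2, 4) = 35.
    for k, w in ((3, 9), (4, 35)):
        print(k, w, radix_digits(w, 2), bracketing_powers(w, 2))
```

For these two values the upper power of two is 16 and 64 respectively, so $\log_2 W(2, k)$ stays well below $k^2$ in both cases.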
-
A nonconstructive Proof to show the Convergence of the $n^{th}$ root of diagonal Ramsey Number $r(n, n)$
Authors:
Robert J. Betts
Abstract:
Does the $n^{th}$ root of the diagonal Ramsey number converge to a finite limit? The answer is yes. A sequence can be shown to converge if it satisfies convergence conditions other than or besides monotonicity. We show such a property holds for which the sequence of $n^{th}$ roots does converge, even if one has no a priori knowledge as to whether the sequence is monotone or not. We show also the $n^{th}$ root of the diagonal Ramsey number can be expressed as a product of two factors, the first being a known convergent sequence and the second being an absolutely convergent infinite series. One also can express it where one product is convergent and the other has all its values from a uniformly convergent complex function holomorphic within the unit disc on the complex plane. Our motivation solely is to prove the conjecture as a problem in search of a solution, not to establish some deep theory about graphs. A second question is: If the limit exists what is it? At the time of this writing the understanding is the proofs sought need not be constructive. Here we show by nonconstructive proofs that the $n^{th}$ root of the diagonal Ramsey number converges to a finite limit. We also show that the limit of the $j^{th}$ root of the diagonal Ramsey number is two, where positive integer $j$ depends upon the Ramsey number.
Submitted 7 October, 2012; v1 submitted 22 August, 2012;
originally announced August 2012.