-
Tracking the Hidden Forces Behind Laos' 2022 Exchange Rate Crisis and Balance of Payments Instability
Authors:
Mariza Cooray,
Rolando Gonzales Martinez
Abstract:
This working paper uses a Dynamic Factor Model ('the model') to identify underlying factors contributing to the debt-induced economic crisis in the People's Democratic Republic of Laos ('Laos'). The analysis aims to use the latent macroeconomic insights to propose ways forward for forecasting. We focus on Laos's historic structural weaknesses to identify when a balance of payments crisis with eith…
▽ More
This working paper uses a Dynamic Factor Model ('the model') to identify underlying factors contributing to the debt-induced economic crisis in the People's Democratic Republic of Laos ('Laos'). The analysis aims to use the latent macroeconomic insights to propose ways forward for forecasting. We focus on Laos's historic structural weaknesses to identify when a balance of payments crisis with either a persistent current account imbalance or rapid capital outflows would occur. By extracting latent economic factors from macroeconomic indicators, the model provides a starting point for analyzing the structural vulnerabilities leading to the value of the kip in USD terms dropping and contributing to inflation in the country. This findings of this working paper contribute to the broader literature on exchange rate instability and external sector vulnerabilities in emerging economies, offering insights on what constitutes as 'signals' as opposed to plain 'noise' from a macroeconomic forecasting standpoint.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
Enhancing Poverty Targeting with Spatial Machine Learning: An application to Indonesia
Authors:
Rolando Gonzales Martinez,
Mariza Cooray
Abstract:
This study leverages spatial machine learning (SML) to enhance the accuracy of Proxy Means Testing (PMT) for poverty targeting in Indonesia. Conventional PMT methodologies are prone to exclusion and inclusion errors due to their inability to account for spatial dependencies and regional heterogeneity. By integrating spatial contiguity matrices, SML models mitigate these limitations, facilitating a…
▽ More
This study leverages spatial machine learning (SML) to enhance the accuracy of Proxy Means Testing (PMT) for poverty targeting in Indonesia. Conventional PMT methodologies are prone to exclusion and inclusion errors due to their inability to account for spatial dependencies and regional heterogeneity. By integrating spatial contiguity matrices, SML models mitigate these limitations, facilitating a more precise identification and comparison of geographical poverty clusters. Utilizing household survey data from the Social Welfare Integrated Data Survey (DTKS) for the periods 2016 to 2020 and 2016 to 2021, this study examines spatial patterns in income distribution and delineates poverty clusters at both provincial and district levels. Empirical findings indicate that the proposed SML approach reduces exclusion errors from 28% to 20% compared to standard machine learning models, underscoring the critical role of spatial analysis in refining machine learning-based poverty targeting. These results highlight the potential of SML to inform the design of more equitable and effective social protection policies, particularly in geographically diverse contexts. Future research can explore the applicability of spatiotemporal models and assess the generalizability of SML approaches across varying socio-economic settings.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
MICG-AI: A multidimensional index of child growth based on digital phenotyping with Bayesian artificial intelligence
Authors:
Rolando Gonzales Martinez,
Hinke Haisma
Abstract:
This document proposes an algorithm for a mobile application designed to monitor multidimensional child growth through digital phenotyping. Digital phenotyping offers a unique opportunity to collect and analyze high-frequency data in real time, capturing behavioral, psychological, and physiological states of children in naturalistic settings. Traditional models of child growth primarily focus on p…
▽ More
This document proposes an algorithm for a mobile application designed to monitor multidimensional child growth through digital phenotyping. Digital phenotyping offers a unique opportunity to collect and analyze high-frequency data in real time, capturing behavioral, psychological, and physiological states of children in naturalistic settings. Traditional models of child growth primarily focus on physical metrics, often overlooking multidimensional aspects such as emotional, social, and cognitive development. In this paper, we introduce a Bayesian artificial intelligence (AI) algorithm that leverages digital phenotyping to create a Multidimensional Index of Child Growth (MICG). This index integrates data from various dimensions of child development, including physical, emotional, cognitive, and environmental factors. By incorporating probabilistic modeling, the proposed algorithm dynamically updates its learning based on data collected by the mobile app used by mothers and children. The app also infers uncertainty from response times, adjusting the importance of each dimension of child growth accordingly. Our contribution applies state-of-the-art technology to track multidimensional child development, enabling families and healthcare providers to make more informed decisions in real time.
△ Less
Submitted 19 December, 2024;
originally announced December 2024.
-
Bayesian algorithmic perfumery: A Hierarchical Relevance Vector Machine for the Estimation of Personalized Fragrance Preferences based on Three Sensory Layers and Jungian Personality Archetypes
Authors:
Rolando Gonzales Martinez
Abstract:
This study explores a Bayesian algorithmic approach to personalized fragrance recommendation by integrating hierarchical Relevance Vector Machines (RVM) and Jungian personality archetypes. The paper proposes a structured model that links individual scent preferences for top, middle, and base notes to personality traits derived from Jungian archetypes, such as the Hero, Caregiver, and Explorer, amo…
▽ More
This study explores a Bayesian algorithmic approach to personalized fragrance recommendation by integrating hierarchical Relevance Vector Machines (RVM) and Jungian personality archetypes. The paper proposes a structured model that links individual scent preferences for top, middle, and base notes to personality traits derived from Jungian archetypes, such as the Hero, Caregiver, and Explorer, among others. The algorithm utilizes Bayesian updating to dynamically refine predictions as users interact with each fragrance note. This iterative process allows for the personalization of fragrance experiences based on prior data and personality assessments, leading to adaptive and interpretable recommendations. By combining psychological theory with Bayesian machine learning, this approach addresses the complexity of modeling individual preferences while capturing user-specific and population-level trends. The study highlights the potential of hierarchical Bayesian frameworks in creating customized olfactory experiences, informed by psychological and demographic factors, contributing to advancements in personalized product design and machine learning applications in sensory-based industries.
△ Less
Submitted 6 November, 2024;
originally announced November 2024.
-
Boundary Peeling: Outlier Detection Method Using One-Class Peeling
Authors:
Sheikh Arafat,
Na Sun,
Maria L. Weese,
Waldyn G. Martinez
Abstract:
Unsupervised outlier detection constitutes a crucial phase within data analysis and remains a dynamic realm of research. A good outlier detection algorithm should be computationally efficient, robust to tuning parameter selection, and perform consistently well across diverse underlying data distributions. We introduce One-Class Boundary Peeling, an unsupervised outlier detection algorithm. One-cla…
▽ More
Unsupervised outlier detection constitutes a crucial phase within data analysis and remains a dynamic realm of research. A good outlier detection algorithm should be computationally efficient, robust to tuning parameter selection, and perform consistently well across diverse underlying data distributions. We introduce One-Class Boundary Peeling, an unsupervised outlier detection algorithm. One-class Boundary Peeling uses the average signed distance from iteratively-peeled, flexible boundaries generated by one-class support vector machines. One-class Boundary Peeling has robust hyperparameter settings and, for increased flexibility, can be cast as an ensemble method. In synthetic data simulations One-Class Boundary Peeling outperforms all state of the art methods when no outliers are present while maintaining comparable or superior performance in the presence of outliers, as compared to benchmark methods. One-Class Boundary Peeling performs competitively in terms of correct classification, AUC, and processing time using common benchmark data sets.
△ Less
Submitted 20 September, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Pre-screening breast cancer with machine learning and deep learning
Authors:
Rolando Gonzales Martinez,
Daan-Max van Dongen
Abstract:
We suggest that deep learning can be used for pre-screening cancer by analyzing demographic and anthropometric information of patients, as well as biological markers obtained from routine blood samples and relative risks obtained from meta-analysis and international databases. We applied feature selection algorithms to a database of 116 women, including 52 healthy women and 64 women diagnosed with…
▽ More
We suggest that deep learning can be used for pre-screening cancer by analyzing demographic and anthropometric information of patients, as well as biological markers obtained from routine blood samples and relative risks obtained from meta-analysis and international databases. We applied feature selection algorithms to a database of 116 women, including 52 healthy women and 64 women diagnosed with breast cancer, to identify the best pre-screening predictors of cancer. We utilized the best predictors to perform k-fold Monte Carlo cross-validation experiments that compare deep learning against traditional machine learning algorithms. Our results indicate that a deep learning model with an input-layer architecture that is fine-tuned using feature selection can effectively distinguish between patients with and without cancer. Additionally, compared to machine learning, deep learning has the lowest uncertainty in its predictions. These findings suggest that deep learning algorithms applied to cancer pre-screening offer a radiation-free, non-invasive, and affordable complement to screening methods based on imagery. The implementation of deep learning algorithms in cancer pre-screening offer opportunities to identify individuals who may require imaging-based screening, can encourage self-examination, and decrease the psychological externalities associated with false positives in cancer screening. The integration of deep learning algorithms for both screening and pre-screening will ultimately lead to earlier detection of malignancy, reducing the healthcare and societal burden associated to cancer treatment.
△ Less
Submitted 5 February, 2023;
originally announced February 2023.
-
Small-time approximation of the transition density for diffusions with singularities. Application to the Wright-Fisher model
Authors:
Tania Roa,
María Inés Fariello,
Gerardo Martínez,
José León
Abstract:
The Wright-Fisher (W-F) diffusion model serves as a foundational framework for interpreting population evolution through allele frequency dynamics over time. Despite the known transition probability between consecutive generations, an exact analytical expression for the transition density at arbitrary time intervals remains elusive. Commonly utilized distributions such as Gaussian or Beta inadequa…
▽ More
The Wright-Fisher (W-F) diffusion model serves as a foundational framework for interpreting population evolution through allele frequency dynamics over time. Despite the known transition probability between consecutive generations, an exact analytical expression for the transition density at arbitrary time intervals remains elusive. Commonly utilized distributions such as Gaussian or Beta inadequately address the fixation issue at extreme allele frequencies (0 or 1), particularly for short periods. In this study, we introduce two alternative parametric functions, namely the Asymptotic Expansion (AE) and the Gaussian approximation (GaussA), derived through probabilistic methodologies, aiming to better approximate this density. The AE function provides a suitable density for allele frequency distributions, encompassing extreme values within the interval [0,1]. Additionally, we outline the range of validity for the GaussA approximation. While our primary focus is on W-F diffusion, we demonstrate how our findings extend to other diffusion models featuring singularities. Through simulations of allele frequencies under a W-F process and employing a recently developed adaptive density estimation method, we conduct a comparative analysis to assess the fit of the proposed densities against the Beta and Gaussian distributions.
△ Less
Submitted 21 June, 2024; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Comparison of statistical sampling methods with ScannerBit, the GAMBIT scanning module
Authors:
The GAMBIT Scanner Workgroup,
:,
Gregory D. Martinez,
James McKay,
Ben Farmer,
Pat Scott,
Elinore Roebber,
Antje Putze,
Jan Conrad
Abstract:
We introduce ScannerBit, the statistics and sampling module of the public, open-source global fitting framework GAMBIT. ScannerBit provides a standardised interface to different sampling algorithms, enabling the use and comparison of multiple computational methods for inferring profile likelihoods, Bayesian posteriors, and other statistical quantities. The current version offers random, grid, rast…
▽ More
We introduce ScannerBit, the statistics and sampling module of the public, open-source global fitting framework GAMBIT. ScannerBit provides a standardised interface to different sampling algorithms, enabling the use and comparison of multiple computational methods for inferring profile likelihoods, Bayesian posteriors, and other statistical quantities. The current version offers random, grid, raster, nested sampling, differential evolution, Markov Chain Monte Carlo (MCMC) and ensemble Monte Carlo samplers. We also announce the release of a new standalone differential evolution sampler, Diver, and describe its design, usage and interface to ScannerBit. We subject Diver and three other samplers (the nested sampler MultiNest, the MCMC GreAT, and the native ScannerBit implementation of the ensemble Monte Carlo algorithm T-Walk) to a battery of statistical tests. For this we use a realistic physical likelihood function, based on the scalar singlet model of dark matter. We examine the performance of each sampler as a function of its adjustable settings, and the dimensionality of the sampling problem. We evaluate performance on four metrics: optimality of the best fit found, completeness in exploring the best-fit region, number of likelihood evaluations, and total runtime. For Bayesian posterior estimation at high resolution, T-Walk provides the most accurate and timely mapping of the full parameter space. For profile likelihood analysis in less than about ten dimensions, we find that Diver and MultiNest score similarly in terms of best fit and speed, outperforming GreAT and T-Walk; in ten or more dimensions, Diver substantially outperforms the other three samplers on all metrics.
△ Less
Submitted 15 October, 2017; v1 submitted 22 May, 2017;
originally announced May 2017.
-
Use of multiple singular value decompositions to analyze complex intracellular calcium ion signals
Authors:
Josue G. Martinez,
Jianhua Z. Huang,
Robert C. Burghardt,
Rola Barhoumi,
Raymond J. Carroll
Abstract:
We compare calcium ion signaling ($\mathrm {Ca}^{2+}$) between two exposures; the data are present as movies, or, more prosaically, time series of images. This paper describes novel uses of singular value decompositions (SVD) and weighted versions of them (WSVD) to extract the signals from such movies, in a way that is semi-automatic and tuned closely to the actual data and their many complexities…
▽ More
We compare calcium ion signaling ($\mathrm {Ca}^{2+}$) between two exposures; the data are present as movies, or, more prosaically, time series of images. This paper describes novel uses of singular value decompositions (SVD) and weighted versions of them (WSVD) to extract the signals from such movies, in a way that is semi-automatic and tuned closely to the actual data and their many complexities. These complexities include the following. First, the images themselves are of no interest: all interest focuses on the behavior of individual cells across time, and thus, the cells need to be segmented in an automated manner. Second, the cells themselves have 100$+$ pixels, so that they form 100$+$ curves measured over time, so that data compression is required to extract the features of these curves. Third, some of the pixels in some of the cells are subject to image saturation due to bit depth limits, and this saturation needs to be accounted for if one is to normalize the images in a reasonably unbiased manner. Finally, the $\mathrm {Ca}^{2+}$ signals have oscillations or waves that vary with time and these signals need to be extracted. Thus, our aim is to show how to use multiple weighted and standard singular value decompositions to detect, extract and clarify the $\mathrm {Ca}^{2+}$ signals. Our signal extraction methods then lead to simple although finely focused statistical methods to compare $\mathrm {Ca}^{2+}$ signals across experimental conditions.
△ Less
Submitted 28 September, 2010;
originally announced September 2010.