Skip to main content

Showing 1–24 of 24 results for author: Sykulski, A M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.15423  [pdf, ps, other

    cs.LG econ.EM stat.AP stat.ME stat.ML

    SplitWise Regression: Stepwise Modeling with Adaptive Dummy Encoding

    Authors: Marcell T. Kurbucz, Nikolaos Tzivanakis, Nilufer Sari Aslam, Adam M. Sykulski

    Abstract: Capturing nonlinear relationships without sacrificing interpretability remains a persistent challenge in regression modeling. We introduce SplitWise, a novel framework that enhances stepwise regression. It adaptively transforms numeric predictors into threshold-based binary features using shallow decision trees, but only when such transformations improve model fit, as assessed by the Akaike Inform… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 15 pages, 1 figure, 3 tables

    MSC Class: 62H20; 62J05; 68T05 ACM Class: G.3; I.2.6; I.5.1; I.5.2

  2. arXiv:2412.01399  [pdf, ps, other

    stat.AP stat.ME

    Navigating Challenges in Spatio-temporal Modelling of Antarctic Krill Abundance: Addressing Zero-inflated Data and Misaligned Covariates

    Authors: André Victor Ribeiro Amaral, Adam M. Sykulski, Sophie Fielding, Emma Cavan

    Abstract: Antarctic krill (Euphausia superba) are among the most abundant species on our planet and serve as a vital food source for many marine predators in the Southern Ocean. In this paper, we utilise statistical spatio-temporal methods to combine data from various sources and resolutions, aiming to model krill abundance. Our focus lies in fitting the model to a dataset comprising acoustic measurements o… ▽ More

    Submitted 17 June, 2025; v1 submitted 2 December, 2024; originally announced December 2024.

  3. Isotropy testing in spatial point patterns: nonparametric versus parametric replication under misspecification

    Authors: Jakub J. Pypkowski, Adam M. Sykulski, James S. Martin

    Abstract: Several hypothesis testing methods have been proposed to validate the assumption of isotropy in spatial point patterns. A majority of these methods are characterised by an unknown distribution of the test statistic under the null hypothesis of isotropy. Parametric approaches to approximating the distribution involve simulation of patterns from a user-specified isotropic model. Alternatively, nonpa… ▽ More

    Submitted 8 April, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

    Comments: 24 pages, 13 figures, 3 tables

  4. arXiv:2312.13643  [pdf, other

    stat.ME math.ST

    Debiasing Welch's Method for Spectral Density Estimation

    Authors: Lachlan C. Astfalck, Adam M. Sykulski, Edward J. Cripps

    Abstract: Welch's method provides an estimator of the power spectral density that is statistically consistent. This is achieved by averaging over periodograms calculated from overlapping segments of a time series. For a finite length time series, while the variance of the estimator decreases as the number of segments increase, the magnitude of the estimator's bias increases: a bias-variance trade-off ensues… ▽ More

    Submitted 10 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Resubmitted to Biometrika

  5. arXiv:2204.06112  [pdf, other

    stat.AP

    Analysing and visualising bike-sharing demand with outliers

    Authors: Nicola Rennie, Catherine Cleophas, Adam M. Sykulski, Florian Dost

    Abstract: Bike-sharing is a popular component of sustainable urban mobility. It requires anticipatory planning, e.g. of station locations and inventory, to balance expected demand and capacity. However, external factors such as extreme weather or glitches in public transport, can cause demand to deviate from baseline levels. Identifying such outliers keeps historic data reliable and improves forecasts. In t… ▽ More

    Submitted 30 January, 2023; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: 32 pages

  6. A multivariate pseudo-likelihood approach to estimating directional ocean wave models

    Authors: Jake P. Grainger, Adam M. Sykulski, Kevin Ewans, Hans F. Hansen, Philip Jonathan

    Abstract: Ocean buoy data in the form of high frequency multivariate time series are routinely recorded at many locations in the world's oceans. Such data can be used to characterise the ocean wavefield, which is important for numerous socio-economic and scientific reasons. This characterisation is typically achieved by modelling the frequency-direction spectrum, which decomposes spatiotemporal variability… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  7. arXiv:2106.03823  [pdf, other

    stat.ML cs.LG stat.CO

    Multivariate Probabilistic Regression with Natural Gradient Boosting

    Authors: Michael O'Malley, Adam M. Sykulski, Rick Lumpkin, Alejandro Schuler

    Abstract: Many single-target regression problems require estimates of uncertainty along with the point predictions. Probabilistic regression algorithms are well-suited for these tasks. However, the options are much more limited when the prediction target is multivariate and a joint measure of uncertainty is required. For example, in predicting a 2D velocity vector a joint uncertainty would quantify the prob… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  8. arXiv:2104.04157  [pdf, other

    physics.soc-ph stat.AP stat.ME

    Outlier detection in network revenue management

    Authors: Nicola Rennie, Catherine Cleophas, Adam M. Sykulski, Florian Dost

    Abstract: This paper presents an automated approach for providing ranked lists of outliers in observed demand to support analysts in network revenue management. Such network revenue management, e.g. for railway itineraries, needs accurate demand forecasts. However, demand outliers across or in parts of a network complicate accurate demand forecasting, and the network structure makes such demand outliers har… ▽ More

    Submitted 24 February, 2023; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: 79 pages, re-structured and additional computational results

  9. arXiv:2012.00789  [pdf, ps, other

    physics.ao-ph eess.SP physics.data-an physics.flu-dyn stat.AP

    Separating Mesoscale and Submesoscale Flows from Clustered Drifter Trajectories

    Authors: Sarah Oscroft, Adam M. Sykulski, Jeffrey J. Early

    Abstract: Drifters deployed in close proximity collectively provide a unique observational data set with which to separate mesoscale and submesoscale flows. In this paper we provide a principled approach for doing so by fitting observed velocities to a local Taylor expansion of the velocity flow field. We demonstrate how to estimate mesoscale and submesoscale quantities that evolve slowly over time, as well… ▽ More

    Submitted 25 December, 2020; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: Accepted in Fluids

  10. arXiv:2008.10437  [pdf, other

    stat.AP physics.ao-ph physics.data-an stat.CO stat.ME

    Estimating the parameters of ocean wave spectra

    Authors: Jake P. Grainger, Adam M. Sykulski, Philip Jonathan, Kevin Ewans

    Abstract: Wind-generated waves are often treated as stochastic processes. There is particular interest in their spectral density functions, which are often expressed in some parametric form. Such spectral density functions are used as inputs when modelling structural response or other engineering concerns. Therefore, accurate and precise recovery of the parameters of such a form, from observed wave records,… ▽ More

    Submitted 25 March, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

  11. Nonparametric Time Series Summary Statistics for High-Frequency Accelerometry Data from Individuals with Advanced Dementia

    Authors: Keerati Suibkitwanchai, Adam M. Sykulski, Guillermo Perez Algorta, Daniel Waller, Catherine Walshe

    Abstract: Accelerometry data has been widely used to measure activity and the circadian rhythm of individuals across the health sciences, in particular with people with advanced dementia. Modern accelerometers can record continuous observations on a single individual for several days at a sampling frequency of the order of one hertz. Such rich and lengthy data sets provide new opportunities for statistical… ▽ More

    Submitted 29 September, 2020; v1 submitted 3 May, 2020; originally announced May 2020.

    Journal ref: PLoS ONE 15(9): e0239368 (2020)

  12. arXiv:2002.07774  [pdf, other

    stat.AP physics.ao-ph physics.data-an physics.flu-dyn stat.CO

    Estimating the travel time and the most likely path from Lagrangian drifters

    Authors: Michael O'Malley, Adam M. Sykulski, Romuald Laso-Jadart, Mohammed-Amin Madoui

    Abstract: We provide a novel methodology for computing the most likely path taken by drifters between arbitrary fixed locations in the ocean. We also provide an estimate of the travel time associated with this path. Lagrangian pathways and travel times are of practical value not just in understanding surface velocities, but also in modelling the transport of ocean-borne species such as planktonic organisms,… ▽ More

    Submitted 18 March, 2021; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: 27 pages, 10 figures in the main text. 13 pages, 8 figures in the supplemental material

  13. arXiv:2001.05965  [pdf, ps, other

    stat.ME astro-ph.EP eess.SP physics.data-an stat.AP

    The Elliptical Ornstein-Uhlenbeck Process

    Authors: Adam M. Sykulski, Sofia C. Olhede, Hanna M. Sykulska-Lawrence

    Abstract: We introduce the elliptical Ornstein-Uhlenbeck (OU) process, which is a generalisation of the well-known univariate OU process to bivariate time series. This process maps out elliptical stochastic oscillations over time in the complex plane, which are observed in many applications of coupled bivariate time series. The appeal of the model is that elliptical oscillations are generated using one simp… ▽ More

    Submitted 7 December, 2021; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: To appear in Statistics and Its Interface

  14. Identifying and Responding to Outlier Demand in Revenue Management

    Authors: Nicola Rennie, Catherine Cleophas, Adam M. Sykulski, Florian Dost

    Abstract: Revenue management strongly relies on accurate forecasts. Thus, when extraordinary events cause outlier demand, revenue management systems need to recognise this and adapt both forecast and controls. Many passenger transport service providers, such as railways and airlines, control the sale of tickets through revenue management. State-of-the-art systems in these industries rely on analyst expertis… ▽ More

    Submitted 5 October, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

  15. arXiv:1907.02447  [pdf, other

    stat.ME stat.AP stat.CO stat.ML

    The Debiased Spatial Whittle Likelihood

    Authors: Arthur P. Guillaumin, Adam M. Sykulski, Sofia C. Olhede, Frederik J. Simons

    Abstract: We provide a computationally and statistically efficient method for estimating the parameters of a stochastic covariance model observed on a regular spatial grid in any number of dimensions. Our proposed method, which we call the Debiased Spatial Whittle likelihood, makes important corrections to the well-known Whittle likelihood to account for large sources of bias caused by boundary effects and… ▽ More

    Submitted 26 April, 2022; v1 submitted 4 July, 2019; originally announced July 2019.

  16. arXiv:1904.12064  [pdf, ps, other

    stat.ME physics.data-an stat.AP stat.CO stat.ML

    Smoothing and Interpolating Noisy GPS Data with Smoothing Splines

    Authors: Jeffrey J. Early, Adam M. Sykulski

    Abstract: A comprehensive methodology is provided for smoothing noisy, irregularly sampled data with non-Gaussian noise using smoothing splines. We demonstrate how the spline order and tension parameter can be chosen a priori from physical reasoning. We also show how to allow for non-Gaussian noise and outliers which are typical in GPS signals. We demonstrate the effectiveness of our methods on GPS trajecto… ▽ More

    Submitted 26 June, 2019; v1 submitted 26 April, 2019; originally announced April 2019.

    Comments: 16 pages, 8 figures

  17. arXiv:1605.09107  [pdf, ps, other

    stat.ME physics.ao-ph physics.flu-dyn stat.AP

    Analysis of nonstationary modulated time series with applications to oceanographic flow measurements

    Authors: Arthur P. Guillaumin, Adam M. Sykulski, Sofia C. Olhede, Jeffrey J. Early, Jonathan M. Lilly

    Abstract: We propose a new class of univariate nonstationary time series models, using the framework of modulated time series, which is appropriate for the analysis of rapidly-evolving time series as well as time series observations with missing data. We extend our techniques to a class of bivariate time series that are isotropic. Exact inference is often not computationally viable for time series analysis,… ▽ More

    Submitted 24 January, 2017; v1 submitted 30 May, 2016; originally announced May 2016.

    Comments: 31 pages, 5 figures, 3 tables

  18. arXiv:1605.06718  [pdf, ps, other

    stat.ME math.ST stat.CO stat.ML

    The De-Biased Whittle Likelihood

    Authors: Adam M. Sykulski, Sofia C. Olhede, Arthur P. Guillaumin, Jonathan M. Lilly, Jeffrey J. Early

    Abstract: The Whittle likelihood is a widely used and computationally efficient pseudo-likelihood. However, it is known to produce biased parameter estimates for large classes of models. We propose a method for de-biasing Whittle estimates for second-order stationary stochastic processes. The de-biased Whittle likelihood can be computed in the same $\mathcal{O}(n\log n)$ operations as the standard approach.… ▽ More

    Submitted 12 September, 2018; v1 submitted 21 May, 2016; originally announced May 2016.

    Comments: To appear shortly in Biometrika. Full published version includes extensions of theory to non-Gaussian processes, and new simulation examples with an AR(4) and non-Gaussian process

  19. arXiv:1605.05278  [pdf, ps, other

    stat.ME stat.CO stat.ML

    Exact Simulation of Noncircular or Improper Complex-Valued Stationary Gaussian Processes using Circulant Embedding

    Authors: Adam M. Sykulski, Donald B. Percival

    Abstract: This paper provides an algorithm for simulating improper (or noncircular) complex-valued stationary Gaussian processes. The technique utilizes recently developed methods for multivariate Gaussian processes from the circulant embedding literature. The method can be performed in $\mathcal{O}(n\log_2 n)$ operations, where $n$ is the length of the desired sequence. The method is exact, except when eig… ▽ More

    Submitted 15 March, 2017; v1 submitted 17 May, 2016; originally announced May 2016.

    Comments: Link to published version: http://ieeexplore.ieee.org/document/7738840/

    Journal ref: 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP)

  20. arXiv:1605.01684  [pdf, other

    stat.ME physics.ao-ph physics.flu-dyn

    Fractional Brownian motion, the Matern process, and stochastic modeling of turbulent dispersion

    Authors: J. M. Lilly, A. M. Sykulski, J. J Early, S. C. Olhede

    Abstract: Stochastic process exhibiting power-law slopes in the frequency domain are frequently well modeled by fractional Brownian motion (fBm). In particular, the spectral slope at high frequencies is associated with the degree of small-scale roughness or fractal dimension. However, a broad class of real-world signals have a high-frequency slope, like fBm, but a plateau in the vicinity of zero frequency.… ▽ More

    Submitted 2 September, 2017; v1 submitted 5 May, 2016; originally announced May 2016.

    Journal ref: Nonlinear Processes in Geophysics, 24: 481-514 (2017)

  21. A Widely Linear Complex Autoregressive Process of Order One

    Authors: Adam M. Sykulski, Sofia C. Olhede, Jonathan M. Lilly

    Abstract: We propose a simple stochastic process for modeling improper or noncircular complex-valued signals. The process is a natural extension of a complex-valued autoregressive process, extended to include a widely linear autoregressive term. This process can then capture elliptical, as opposed to circular, stochastic oscillations in a bivariate signal. The process is order one and is more parsimonious t… ▽ More

    Submitted 15 March, 2017; v1 submitted 12 November, 2015; originally announced November 2015.

    Comments: Link to published version: http://ieeexplore.ieee.org/abstract/document/7539658/

    Journal ref: IEEE Transactions on Signal Processing, 64(23), 6200-6210, 2016

  22. arXiv:1508.05593  [pdf, other

    stat.ME

    A Power Variance Test for Nonstationarity in Complex-Valued Signals

    Authors: Thomas E. Bartlett, Adam M. Sykulski, Sofia C. Olhede, Jonathan M. Lilly, Jeffrey J. Early

    Abstract: We propose a novel algorithm for testing the hypothesis of nonstationarity in complex-valued signals. The implementation uses both the bootstrap and the Fast Fourier Transform such that the algorithm can be efficiently implemented in O(NlogN) time, where N is the length of the observed signal. The test procedure examines the second-order structure and contrasts the observed power variance - i.e. t… ▽ More

    Submitted 7 October, 2015; v1 submitted 23 August, 2015; originally announced August 2015.

  23. arXiv:1312.2923  [pdf, other

    stat.AP physics.ao-ph physics.flu-dyn stat.ME

    Lagrangian Time Series Models for Ocean Surface Drifter Trajectories

    Authors: Adam M. Sykulski, Sofia C. Olhede, Jonathan M. Lilly, Eric Danioux

    Abstract: This paper proposes stochastic models for the analysis of ocean surface trajectories obtained from freely-drifting satellite-tracked instruments. The proposed time series models are used to summarise large multivariate datasets and infer important physical parameters of inertial oscillations and other ocean processes. Nonstationary time series methods are employed to account for the spatiotemporal… ▽ More

    Submitted 21 April, 2015; v1 submitted 10 December, 2013; originally announced December 2013.

    Comments: 21 pages, 10 figures

    Journal ref: Journal of the Royal Statistical Society (Series C, Applied Statistics), 65(1), 29-50, 2016

  24. arXiv:1306.5993  [pdf, ps, other

    stat.ME stat.AP stat.CO stat.ML

    Frequency-Domain Stochastic Modeling of Stationary Bivariate or Complex-Valued Signals

    Authors: Adam M. Sykulski, Sofia C. Olhede, Jonathan M. Lilly, Jeffrey J. Early

    Abstract: There are three equivalent ways of representing two jointly observed real-valued signals: as a bivariate vector signal, as a single complex-valued signal, or as two analytic signals known as the rotary components. Each representation has unique advantages depending on the system of interest and the application goals. In this paper we provide a joint framework for all three representations in the c… ▽ More

    Submitted 15 March, 2017; v1 submitted 25 June, 2013; originally announced June 2013.

    Comments: To appear in IEEE Transactions on Signal Processing

    Journal ref: IEEE Transactions on Signal Processing, 2017