Showing 1–27 of 27 results for author: Cucuringu, M

Searching in archive q-fin.
  1. arXiv:2505.08180  [pdf, ps, other]

    q-fin.CP q-fin.ST

    Forecasting Intraday Volume in Equity Markets with Machine Learning

    Authors: Mihai Cucuringu, Kang Li, Chao Zhang

    Abstract: This study focuses on forecasting intraday trading volumes, a crucial component for portfolio implementation, especially in high-frequency (HF) trading environments. Given the current scarcity of flexible methods in this area, we employ a suite of machine learning (ML) models enriched with numerous HF predictors to enhance the predictability of intraday trading volumes. Our findings reveal that in…

    Submitted 12 May, 2025; originally announced May 2025.

  2. arXiv:2505.07078  [pdf, ps, other]

    q-fin.TR cs.AI cs.CE

    Can LLM-based Financial Investing Strategies Outperform the Market in Long Run?

    Authors: Weixian Waylon Li, Hyeonjun Kim, Mihai Cucuringu, Tiejun Ma

    Abstract: Large Language Models (LLMs) have recently been leveraged for asset pricing tasks and stock trading applications, enabling AI agents to generate investment decisions from unstructured financial data. However, most evaluations of LLM timing-based investing strategies are conducted on narrow timeframes and limited stock universes, overstating effectiveness due to survivorship and data-snooping biase…

    Submitted 20 May, 2025; v1 submitted 11 May, 2025; originally announced May 2025.

    Comments: 14 pages

  3. arXiv:2504.20349  [pdf, other]

    q-fin.TR

    ClusterLOB: Enhancing Trading Strategies by Clustering Orders in Limit Order Books

    Authors: Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren

    Abstract: In the rapidly evolving world of financial markets, understanding the dynamics of the limit order book (LOB) is crucial for unraveling market microstructure and participant behavior. We introduce ClusterLOB as a method to cluster individual market events in a stream of market-by-order (MBO) data into different groups. To do so, each market event is augmented with six time-dependent features. By applyi…

    Submitted 9 May, 2025; v1 submitted 28 April, 2025; originally announced April 2025.

  4. arXiv:2503.11499  [pdf, other]

    q-fin.PM

    Tactical Asset Allocation with Macroeconomic Regime Detection

    Authors: Daniel Cunha Oliveira, Dylan Sandfelder, André Fujita, Xiaowen Dong, Mihai Cucuringu

    Abstract: This paper extends the tactical asset allocation literature by incorporating regime modeling using techniques from machine learning. We propose a novel model that classifies current regimes, forecasts the distribution of future regimes, and integrates these forecasts with the historical performance of individual assets to optimize portfolio allocations. Utilizing a macroeconomic data set from the…

    Submitted 21 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  5. arXiv:2502.18625  [pdf, other]

    q-fin.TR

    To Make, or to Take, That Is the Question: Impact of LOB Mechanics on Natural Trading Strategies

    Authors: Jakob Albers, Mihai Cucuringu, Sam Howison, Alexander Y. Shestopaloff

    Abstract: Working at a very granular level, using data from a live trading experiment on the Binance linear Bitcoin perpetual (the most liquid crypto market worldwide), we examine the effects of (i) basic order book mechanics and (ii) the strong persistence of price changes from the immediate to the short timescale, revealing the interplay between returns, queue sizes, and orders' queue positions. For maker or…

    Submitted 25 February, 2025; originally announced February 2025.

  6. arXiv:2502.11310  [pdf, other]

    stat.ML cs.LG q-fin.ST

    Generalized Factor Neural Network Model for High-dimensional Regression

    Authors: Zichuan Guo, Mihai Cucuringu, Alexander Y. Shestopaloff

    Abstract: We tackle the challenges of modeling high-dimensional data sets, particularly those with latent low-dimensional structures hidden within complex, non-linear, and noisy relationships. Our approach enables a seamless integration of concepts from non-parametric regression, factor models, and neural networks for high-dimensional regression. Our approach introduces PCA and Soft PCA layers, which can be…

    Submitted 13 March, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

    Comments: 43 pages, 13 figures

    MSC Class: 62G08; 68T07

  7. arXiv:2408.09960  [pdf, other]

    q-fin.CP

    Causality-Inspired Models for Financial Time Series Forecasting

    Authors: Daniel Cunha Oliveira, Yutong Lu, Xi Lin, Mihai Cucuringu, Andre Fujita

    Abstract: We introduce a novel framework for financial time series forecasting that leverages causality-inspired models to balance the trade-off between invariance to distributional changes and minimization of prediction errors. To the best of our knowledge, this is the first study to conduct a comprehensive comparative analysis among state-of-the-art causal discovery algorithms, benchmarked against non-caus…

    Submitted 19 August, 2024; originally announced August 2024.

  8. arXiv:2408.05659  [pdf, other]

    q-fin.ST

    A GCN-LSTM Approach for ES-mini and VX Futures Forecasting

    Authors: Nikolas Michael, Mihai Cucuringu, Sam Howison

    Abstract: We propose a novel data-driven network framework for forecasting problems related to E-mini S&P 500 and CBOE Volatility Index futures, in which products with different expirations act as distinct nodes. We provide visual demonstrations of the correlation structures of these products in terms of their returns, realized volatility, and trading volume. The resulting networks offer insights into the…

    Submitted 10 August, 2024; originally announced August 2024.

  9. arXiv:2309.08800  [pdf, other]

    q-fin.ST

    Dynamic Time Warping for Lead-Lag Relationships in Lagged Multi-Factor Models

    Authors: Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren

    Abstract: In multivariate time series systems, lead-lag relationships reveal dependencies between time series when they are shifted in time relative to each other. Uncovering such relationships is valuable in downstream tasks, such as control, forecasting, and clustering. By understanding the temporal dependencies between different time series, one can better comprehend the complex interactions and patterns…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.06704

  10. arXiv:2308.01419  [pdf, other]

    q-fin.ST cs.LG q-fin.RM

    Graph Neural Networks for Forecasting Multivariate Realized Volatility with Spillover Effects

    Authors: Chao Zhang, Xingyue Pu, Mihai Cucuringu, Xiaowen Dong

    Abstract: We present a novel methodology for modeling and forecasting multivariate realized volatilities using customized graph neural networks to incorporate spillover effects across stocks. The proposed model offers the benefits of incorporating spillover effects from multi-hop neighbors, capturing nonlinear relationships, and flexible training with different loss functions. Our empirical findings provide…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 8 figures, 5 tables

  11. arXiv:2305.06704  [pdf, other]

    stat.ML cs.LG q-fin.CP q-fin.PM q-fin.ST q-fin.TR

    Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models

    Authors: Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren

    Abstract: In multivariate time series systems, key insights can be obtained by discovering lead-lag relationships inherent in the data, which refer to the dependence between two time series shifted in time relative to one another, and which can be leveraged for the purposes of control, forecasting or clustering. We develop a clustering-driven methodology for robust detection of lead-lag relationships in lag…

    Submitted 18 September, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  12. arXiv:2304.03877  [pdf, other]

    stat.ML cs.LG q-fin.ST stat.ME

    OFTER: An Online Pipeline for Time Series Forecasting

    Authors: Nikolas Michael, Mihai Cucuringu, Sam Howison

    Abstract: We introduce OFTER, a time series forecasting pipeline tailored for mid-sized multivariate time series. OFTER utilizes the non-parametric models of k-nearest neighbors and Generalized Regression Neural Networks, integrated with a dimensionality reduction component. To circumvent the curse of dimensionality, we employ a weighted norm based on a modified version of the maximal correlation coefficien…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 26 pages, 12 figures

  13. arXiv:2302.09382  [pdf, other]

    q-fin.TR q-fin.PM

    Co-trading networks for modeling dynamic interdependency structures and estimating high-dimensional covariances in US equity markets

    Authors: Yutong Lu, Gesine Reinert, Mihai Cucuringu

    Abstract: The time proximity of trades across stocks reveals interesting topological structures of the equity market in the United States. In this article, we investigate how such concurrent cross-stock trading behaviors, which we denote as co-trading, shape the market structures and affect stock price co-movements. By leveraging a co-trading-based pairwise similarity measure, we propose a novel method to c…

    Submitted 12 May, 2024; v1 submitted 18 February, 2023; originally announced February 2023.

  14. arXiv:2301.13009  [pdf, other]

    q-fin.TR physics.soc-ph q-fin.GN

    DeFi: data-driven characterisation of Uniswap v3 ecosystem & an ideal crypto law for liquidity pools

    Authors: Deborah Miori, Mihai Cucuringu

    Abstract: Uniswap is a Constant Product Market Maker built around liquidity pools, where pairs of tokens are exchanged subject to a fee that is proportional to the size of transactions. At the time of writing, there exist more than 6,000 pools associated with Uniswap v3, implying that empirical investigations on the full ecosystem can easily become computationally expensive. Thus, we propose a systematic wo…

    Submitted 31 January, 2023; v1 submitted 20 December, 2022; originally announced January 2023.

    Comments: 26 pages, 21 figures

  15. arXiv:2209.10334  [pdf, other]

    q-fin.TR q-fin.ST

    Trade Co-occurrence, Trade Flow Decomposition, and Conditional Order Imbalance in Equity Markets

    Authors: Yutong Lu, Gesine Reinert, Mihai Cucuringu

    Abstract: The time proximity of high-frequency trades can contain a salient signal. In this paper, we propose a method to classify every trade, based on its proximity with other trades in the market within a short period of time, into five types. By means of a suitably defined normalized order imbalance associated to each type of trade, which we denote as conditional order imbalance (COI), we investigate th…

    Submitted 13 March, 2024; v1 submitted 21 September, 2022; originally announced September 2022.

  16. arXiv:2209.08825  [pdf, other]

    q-fin.MF

    SEC Form 13F-HR: Statistical investigation of trading imbalances and profitability analysis

    Authors: Deborah Miori, Mihai Cucuringu

    Abstract: US institutions with more than $100 million in assets under management must disclose part of their long positions in SEC Form 13F-HR on a quarterly basis. We consider the number of variations in holdings between consecutive reporting periods, and compute imbalances in buying versus selling behaviour for the assets under consideration. A significant opportunity for profit arises if an external i…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 23 pages, 18 figures

  17. arXiv:2209.00268  [pdf, other]

    q-fin.MF q-fin.CP

    Returns-Driven Macro Regimes and Characteristic Lead-Lag Behaviour between Asset Classes

    Authors: Deborah Miori, Mihai Cucuringu

    Abstract: We define data-driven macroeconomic regimes by clustering the relative performance in time of indices belonging to different asset classes. We then investigate lead-lag relationships within the regimes identified. Our study unravels market features characteristic of different windows in time and leverages this knowledge to highlight market trends or risks that can be informative with respect to…

    Submitted 2 September, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: 9 pages, 8 figures

  18. arXiv:2203.15470  [pdf, other]

    stat.ML cs.LG q-fin.ST stat.AP stat.ME

    Graph similarity learning for change-point detection in dynamic networks

    Authors: Deborah Sulem, Henry Kenlay, Mihai Cucuringu, Xiaowen Dong

    Abstract: Dynamic networks are ubiquitous for modelling sequential graph-structured data, e.g., brain connectomes, population flows and message exchanges. In this work, we consider dynamic networks that are temporal sequences of graph snapshots, and aim at detecting abrupt changes in their structure. This task is often termed network change-point detection and has numerous applications, such as fraud detect…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 33 pages, 21 figures, 5 tables

  19. arXiv:2203.15009  [pdf]

    stat.ML cs.LG q-fin.ST stat.AP stat.ME

    DAMNETS: A Deep Autoregressive Model for Generating Markovian Network Time Series

    Authors: Jase Clarkson, Mihai Cucuringu, Andrew Elliott, Gesine Reinert

    Abstract: Generative models for network time series (also known as dynamic graphs) have tremendous potential in fields such as epidemiology, biology and economics, where complex graph-based dynamics are core objects of study. Designing flexible and scalable generative models is a very challenging task due to the high dimensionality of the data, as well as the need to represent temporal dependencies and marg…

    Submitted 31 October, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

  20. arXiv:2203.01664  [pdf, other]

    q-fin.RM

    Tail-GAN: Learning to Simulate Tail Risk Scenarios

    Authors: Rama Cont, Mihai Cucuringu, Renyuan Xu, Chao Zhang

    Abstract: The estimation of loss distributions for dynamic portfolios requires the simulation of scenarios representing realistic joint dynamics of their components. We propose a novel data-driven approach for simulating realistic, high-dimensional multi-asset scenarios, focusing on accurately representing tail risk for a class of static and dynamic trading strategies. We exploit the joint elicitability pro…

    Submitted 15 May, 2025; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Updated version for publication in Management Science

  21. arXiv:2202.08962  [pdf, ps, other]

    q-fin.ST q-fin.CP q-fin.RM

    Volatility forecasting with machine learning and intraday commonality

    Authors: Chao Zhang, Yihuang Zhang, Mihai Cucuringu, Zhongmin Qian

    Abstract: We apply machine learning models to forecast intraday realized volatility (RV), by exploiting commonality in intraday volatility via pooling stock data together, and by incorporating a proxy for the market volatility. Neural networks dominate linear regressions and tree-based models in terms of performance, due to their ability to uncover and model complex latent interactions among variables. Our…

    Submitted 24 February, 2023; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: 40 pages, 12 figures, 6 tables; to appear in Journal of Financial Econometrics

  22. arXiv:2201.09319  [pdf, other]

    q-fin.ST q-fin.TR stat.AP

    Option Volume Imbalance as a predictor for equity market returns

    Authors: Nikolas Michael, Mihai Cucuringu, Sam Howison

    Abstract: We investigate the use of the normalized imbalance between option volumes corresponding to positive and negative market views, as a predictor for directional price movements in the spot market. Via a nonlinear analysis, and using a decomposition of aggregated volumes into five distinct market participant classes, we find strong signs of predictability of excess market overnight returns. The strong…

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: 43 pages, 33 figures

  23. arXiv:2201.08283  [pdf, other]

    stat.ML cs.LG q-fin.ST stat.ME

    Lead-lag detection and network clustering for multivariate time series with an application to the US equity market

    Authors: Stefanos Bennett, Mihai Cucuringu, Gesine Reinert

    Abstract: In multivariate time series systems, it has been observed that certain groups of variables partially lead the evolution of the system, while other variables follow this evolution with a time delay; the result is a lead-lag structure amongst the time series variables. In this paper, we propose a method for the detection of lead-lag clusters of time series in multivariate systems. We demonstrate tha…

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 29 pages, 28 figures; preliminary version appeared at KDD 2021 - 7th SIGKDD Workshop on Mining and Learning from Time Series (MiLeTS)

  24. arXiv:2112.13213  [pdf, other]

    q-fin.TR q-fin.CP q-fin.ST

    Cross-Impact of Order Flow Imbalance in Equity Markets

    Authors: Rama Cont, Mihai Cucuringu, Chao Zhang

    Abstract: We investigate the impact of order flow imbalance (OFI) on price movements in equity markets in a multi-asset setting. First, we propose a systematic approach for combining OFIs at the top levels of the limit order book into an integrated OFI variable which better explains price impact, compared to the best-level OFI. We show that once the information from multiple levels is integrated into OFI, m…

    Submitted 13 June, 2023; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: 33 pages, 10 figures, 11 tables

  25. arXiv:2111.09170  [pdf, other]

    q-fin.PM

    A Universal End-to-End Approach to Portfolio Optimization via Deep Learning

    Authors: Chao Zhang, Zihao Zhang, Mihai Cucuringu, Stefan Zohren

    Abstract: We propose a universal end-to-end framework for portfolio optimization where asset distributions are directly obtained. The designed framework circumvents the traditional forecasting step and avoids the estimation of the covariance matrix, lifting the bottleneck for generalizing to a large number of instruments. Our framework has the flexibility of optimizing various objective functions including…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 12 pages

  26. arXiv:2108.09750  [pdf, other]

    q-fin.TR q-fin.GN q-fin.ST

    Fragmentation, Price Formation, and Cross-Impact in Bitcoin Markets

    Authors: Jakob Albers, Mihai Cucuringu, Sam Howison, Alexander Y. Shestopaloff

    Abstract: In light of micro-scale inefficiencies induced by the high degree of fragmentation of the Bitcoin trading landscape, we utilize a granular data set comprised of orderbook and trades data from the most liquid Bitcoin markets, in order to understand the price formation process at sub-1 second time scales. To achieve this goal, we construct a set of features that encapsulate relevant microstructural…

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: 62 pages, 34 figures, 24 tables

  27. arXiv:1909.04497  [pdf, other]

    cs.LG q-fin.ST

    Equity2Vec: End-to-end Deep Learning Framework for Cross-sectional Asset Pricing

    Authors: Qiong Wu, Christopher G. Brinton, Zheng Zhang, Andrea Pizzoferrato, Zhenming Liu, Mihai Cucuringu

    Abstract: Pricing assets has attracted significant attention from the financial technology community. We observe that the existing solutions overlook the cross-sectional effects and do not fully leverage the heterogeneous data sets, leading to sub-optimal performance. To this end, we propose an end-to-end deep learning framework to price the assets. Our framework possesses two main properties: 1) We propose…

    Submitted 26 October, 2021; v1 submitted 7 September, 2019; originally announced September 2019.

    Comments: 9 pages

    Journal ref: International Conference on AI in Finance, 2021