Search | arXiv e-print repository

Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes

Authors: Yifan Chen, Mark Goldstein, Mengjian Hua, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden

Abstract: We propose a framework for probabilistic forecasting of dynamical systems based on generative modeling. Given observations of the system state over time, we formulate the forecasting problem as sampling from the conditional distribution of the future system state given its current state. To this end, we leverage the framework of stochastic interpolants, which facilitates the construction of a gene… ▽ More We propose a framework for probabilistic forecasting of dynamical systems based on generative modeling. Given observations of the system state over time, we formulate the forecasting problem as sampling from the conditional distribution of the future system state given its current state. To this end, we leverage the framework of stochastic interpolants, which facilitates the construction of a generative model between an arbitrary base distribution and the target. We design a fictitious, non-physical stochastic dynamics that takes as initial condition the current system state and produces as output a sample from the target conditional distribution in finite time and without bias. This process therefore maps a point mass centered at the current state onto a probabilistic ensemble of forecasts. We prove that the drift coefficient entering the stochastic differential equation (SDE) achieving this task is non-singular, and that it can be learned efficiently by square loss regression over the time-series data. We show that the drift and the diffusion coefficients of this SDE can be adjusted after training, and that a specific choice that minimizes the impact of the estimation error gives a Föllmer process. We highlight the utility of our approach on several complex, high-dimensional forecasting problems, including stochastically forced Navier-Stokes and video prediction on the KTH and CLEVRER datasets. △ Less

Submitted 27 August, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

arXiv:2208.12119 [pdf]

The Sustainable Response Strategy to COVID-19: Pandemic Urban Zoning Based on Multimodal Transport Data

Authors: Yufei Wang, Mingzhuang Hua, Xuewu Chen, Wendong Chen, Long Cheng

Abstract: Since the outbreak of COVID-19, it has rapidly evolved into a sudden and major public health emergency globally. With the variants of COVID-19, the difficulty of pandemic control continues to increase, which has brought significant costs to the society. The existing pandemic control zoning method ignores the impact on residents'lives. In this study, we propose a refined and low-cost pandemic contr… ▽ More Since the outbreak of COVID-19, it has rapidly evolved into a sudden and major public health emergency globally. With the variants of COVID-19, the difficulty of pandemic control continues to increase, which has brought significant costs to the society. The existing pandemic control zoning method ignores the impact on residents'lives. In this study, we propose a refined and low-cost pandemic control method by scientifically delineating zoning areas. First, a spatial interaction network is built up based on the multimodal transport travel data in Nanjing, China, and an improved Leiden community detection method based on the gravity model is used to obtain a preliminary zoning scheme. Then, we use spatial constraints to correct the results with the discrete spatial distribution. Finally, reasonable zones for pandemic control are obtained. The modularity of the algorithm results is 0.4185, proving that the proposed method is suitable for pandemic control zoning. The proposed method is also demonstrated to be able to minimize traffic flows between pandemic control areas and only 24.8% of travel connections are cut off, thus reducing the impact of pandemic control on residents'daily life and reducing the cost of pandemic control. The findings can help to inform sustainable strategies and suggestions for the pandemic control. △ Less

Submitted 25 August, 2022; originally announced August 2022.

Comments: 20pages 7080words

arXiv:2204.08603 [pdf]

Minimizing Fleet Size and Improving Bike Allocation of Bike Sharing under Future Uncertainty

Authors: Mingzhuang Hua, Xuewu Chen, Jingxu Chen, Yu Jiang

Abstract: As a rapidly expanding service, bike sharing is facing severe problems of bike over-supply and demand fluctuation in many Chinese cities. This study develops a large-scale method to determine the minimum fleet size under uncertainty, based on the bike sharing data of millions of trips in Nanjing. It is found that the algorithm of minimizing fleet size under the incomplete-information scenario is e… ▽ More As a rapidly expanding service, bike sharing is facing severe problems of bike over-supply and demand fluctuation in many Chinese cities. This study develops a large-scale method to determine the minimum fleet size under uncertainty, based on the bike sharing data of millions of trips in Nanjing. It is found that the algorithm of minimizing fleet size under the incomplete-information scenario is effective in handling future uncertainty. For a dockless bike sharing system, supplying 14.5% of the original fleet could meet 96.8% of trip demands. Meanwhile, the results suggest that providing a integrated service platform that integrates multiple companies can significantly reduce the total fleet size by 44.6%. Moreover, in view of the COVID-19 pandemic, this study proposes a social distancing policy that maintains a suitable usage interval. These findings provide useful insights for improving the resource efficiency and operational service of bike sharing and shared mobility. △ Less

Submitted 18 April, 2022; originally announced April 2022.

Comments: 31 pages,10 figures

arXiv:2203.09279 [pdf]

Transfer learning for cross-modal demand prediction of bike-share and public transit

Authors: Mingzhuang Hua, Francisco Camara Pereira, Yu Jiang, Xuewu Chen

Abstract: The urban transportation system is a combination of multiple transport modes, and the interdependencies across those modes exist. This means that the travel demand across different travel modes could be correlated as one mode may receive demand from or create demand for another mode, not to mention natural correlations between different demand time series due to general demand flow patterns across… ▽ More The urban transportation system is a combination of multiple transport modes, and the interdependencies across those modes exist. This means that the travel demand across different travel modes could be correlated as one mode may receive demand from or create demand for another mode, not to mention natural correlations between different demand time series due to general demand flow patterns across the network. It is expectable that cross-modal ripple effects become more prevalent, with Mobility as a Service. Therefore, by propagating demand data across modes, a better demand prediction could be obtained. To this end, this study explores various machine learning models and transfer learning strategies for cross-modal demand prediction. The trip data of bike-share, metro, and taxi are processed as the station-level passenger flows, and then the proposed prediction method is tested in the large-scale case studies of Nanjing and Chicago. The results suggest that prediction models with transfer learning perform better than unimodal prediction models. Furthermore, stacked Long Short-Term Memory model performs particularly well in cross-modal demand prediction. These results verify our combined method's forecasting improvement over existing benchmarks and demonstrate the good transferability for cross-modal demand prediction in multiple cities. △ Less

Submitted 17 March, 2022; originally announced March 2022.

Comments: 27 pages, 4 figures

arXiv:2003.06321 [pdf, other]

Micro-supervised Disturbance Learning: A Perspective of Representation Probability Distribution

Authors: Jielei Chu, Jing Liu, Hongjun Wang, Meng Hua, Zhiguo Gong, Tianrui Li

Abstract: The instability is shown in the existing methods of representation learning based on Euclidean distance under a broad set of conditions. Furthermore, the scarcity and high cost of labels prompt us to explore more expressive representation learning methods which depends on the labels as few as possible. To address these issues, the small-perturbation ideology is firstly introduced on the representa… ▽ More The instability is shown in the existing methods of representation learning based on Euclidean distance under a broad set of conditions. Furthermore, the scarcity and high cost of labels prompt us to explore more expressive representation learning methods which depends on the labels as few as possible. To address these issues, the small-perturbation ideology is firstly introduced on the representation learning model based on the representation probability distribution. The positive small-perturbation information (SPI) which only depend on two labels of each cluster is used to stimulate the representation probability distribution and then two variant models are proposed to fine-tune the expected representation distribution of RBM, namely, Micro-supervised Disturbance GRBM (Micro-DGRBM) and Micro-supervised Disturbance RBM (Micro-DRBM) models. The Kullback-Leibler (KL) divergence of SPI is minimized in the same cluster to promote the representation probability distributions to become more similar in Contrastive Divergence(CD) learning. In contrast, the KL divergence of SPI is maximized in the different clusters to enforce the representation probability distributions to become more dissimilar in CD learning. To explore the representation learning capability under the continuous stimulation of the SPI, we present a deep Micro-supervised Disturbance Learning (Micro-DL) framework based on the Micro-DGRBM and Micro-DRBM models and compare it with a similar deep structure which has not any external stimulation. Experimental results demonstrate that the proposed deep Micro-DL architecture shows better performance in comparison to the baseline method, the most related shallow models and deep frameworks for clustering. △ Less

Submitted 6 October, 2021; v1 submitted 13 March, 2020; originally announced March 2020.

Comments: 14 pages

Showing 1–5 of 5 results for author: Hua, M