-
Arctic teleconnection on climate and ozone pollution in the polar jet stream path of eastern US
Authors:
K Shuvo Bakar,
Sourish Das,
Sudeep Shukla,
Anirban Chakraborti
Abstract:
Arctic sea ice is in reduction and has been a key significant indicator of climate change. In this paper, we explore Arctic Sea ice extent data to identify teleconnection with weather change in the polar and sub-tropical jet stream intersection in eastern United States (US) and hence the potential influence in ground level ozone pollution. Several statistical methods including Bayesian techniques…
▽ More
Arctic sea ice is in reduction and has been a key significant indicator of climate change. In this paper, we explore Arctic Sea ice extent data to identify teleconnection with weather change in the polar and sub-tropical jet stream intersection in eastern United States (US) and hence the potential influence in ground level ozone pollution. Several statistical methods including Bayesian techniques such as: spatio-temporal modelling and Bayesian network are implemented to identify the teleconnection and also validated based on theories in atmospheric science. We observe that the teleconnection is relatively strong in autumn, winter and spring seasons compared to the summer. Furthermore, the sudden decremental effect of Arctic sea-ice extent in mid-2000s has a shifting influence in ozone pollutions compared to the previous years. A similar downward shift in the Arctic sea-ice extent has been projected in 2030. These findings indicate to initiate further strategic policies for the Arctic influence, ozone concentrations together the seasonal and global changing patterns of climate.
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
An operational framework to automatically evaluate the quality of weather observations from third-party stations
Authors:
Quanxi Shao,
Ming Li,
Joel Janek Dabrowski,
Shuvo Bakar,
Ashfaqur Rahman,
Andrea Powell,
Brent Henderson
Abstract:
With increasing number of crowdsourced private automatic weather stations (called TPAWS) established to fill the gap of official network and obtain local weather information for various purposes, the data quality is a major concern in promoting their usage. Proper quality control and assessment are necessary to reach mutual agreement on the TPAWS observations. To derive near real-time assessment f…
▽ More
With increasing number of crowdsourced private automatic weather stations (called TPAWS) established to fill the gap of official network and obtain local weather information for various purposes, the data quality is a major concern in promoting their usage. Proper quality control and assessment are necessary to reach mutual agreement on the TPAWS observations. To derive near real-time assessment for operational system, we propose a simple, scalable and interpretable framework based on AI/Stats/ML models. The framework constructs separate models for individual data from official sources and then provides the final assessment by fusing the individual models. The performance of our proposed framework is evaluated by synthetic data and demonstrated by applying it to a re-al TPAWS network.
△ Less
Submitted 4 December, 2022;
originally announced December 2022.
-
Quality Control in Weather Monitoring with Dynamic Linear Models
Authors:
Joel Janek Dabrowski,
Ashfaqur Rahman,
Ming Li,
Quanxi Shao,
Shuvo Bakar,
Andrea Powell,
Brent Henderson
Abstract:
Decisions in agriculture are frequently based on weather. With an increase in the availability and affordability of off-the-shelf weather stations, farmers able to acquire localised weather information. However, with uncertainty in the sensor and installation quality, farmers are at risk of making poor decisions based on incorrect data. We present an automated approach to perform quality control o…
▽ More
Decisions in agriculture are frequently based on weather. With an increase in the availability and affordability of off-the-shelf weather stations, farmers able to acquire localised weather information. However, with uncertainty in the sensor and installation quality, farmers are at risk of making poor decisions based on incorrect data. We present an automated approach to perform quality control on weather sensors. Our approach uses time-series modelling and data fusion with Bayesian principles to provide predictions with uncertainty quantification. These predictions and uncertainty are used to estimate the validity of a sensor observation. We test on temperature, wind, and humidity data and achieve error hit rates above 80% and false negative rates below 11%.
△ Less
Submitted 2 March, 2023; v1 submitted 8 November, 2022;
originally announced November 2022.
-
Bayesian Gaussian models for interpolating large-dimensional data at misaligned areal units
Authors:
K. Shuvo Bakar
Abstract:
Areal level spatial data are often large, sparse and may appear with geographical shapes that are regular or irregular (e.g., postcode). Moreover, sometimes it is important to obtain predictive inference in regular or irregular areal shapes that is misaligned with the observed spatial areal geographical boundary. For example, in a survey the respondents were asked about their postcode, however for…
▽ More
Areal level spatial data are often large, sparse and may appear with geographical shapes that are regular or irregular (e.g., postcode). Moreover, sometimes it is important to obtain predictive inference in regular or irregular areal shapes that is misaligned with the observed spatial areal geographical boundary. For example, in a survey the respondents were asked about their postcode, however for policy making purposes, researchers are often interested to obtain information at the SA2. The statistical challenge is to obtain spatial prediction at the SA2s, where the SA2s may have overlapped geographical boundaries with postcodes. The study is motivated by a practical survey data obtained from the Australian National University (ANU) Poll. Here the main research question is to understand respondents' satisfaction level with the way Australia is heading. The data are observed at 1,944 postcodes among the 2,516 available postcodes across Australia, and prediction is obtained at the 2,196 SA2s. The proposed method also explored through a grid-based simulation study, where data have been observed in a regular grid and spatial prediction has been done in a regular grid that has a misaligned geographical boundary with the first regular grid-set. The real-life example with ANU Poll data addresses the situation of irregular geographical boundaries that are misaligned, i.e., model fitted with postcode data and hence obtained prediction at the SA2. A comparison study is also performed to validate the proposed method. In this paper, a Gaussian model is constructed under Bayesian hierarchy. The novelty lies in the development of the basis function that can address spatial sparsity and localised spatial structure. It can also address the large-dimensional spatial data modelling problem by constructing knot based reduced-dimensional basis functions.
△ Less
Submitted 9 November, 2017;
originally announced November 2017.
-
A Censored Bayesian Hierarchical Model For Precipitation
Authors:
Yang Liu,
Philip Kokic,
K. Shuvo Bakar
Abstract:
Modelling of precipitation, including extremes, is important for hydrological and agricultural applications. Traditionally, because of large sample properties for data over a large threshold value, generalised Pareto (GP) distributions are often used for modelling extreme rainfall. It can be shown that under certain conditions the generalised hyperbolic (GH) distributions can approximate the power…
▽ More
Modelling of precipitation, including extremes, is important for hydrological and agricultural applications. Traditionally, because of large sample properties for data over a large threshold value, generalised Pareto (GP) distributions are often used for modelling extreme rainfall. It can be shown that under certain conditions the generalised hyperbolic (GH) distributions can approximate the power law decay of the GP distribution in the tails. Given their flexible form, this raises the possibility that distributions from the GH family serve as a model for the entire rainfall distribution thus avoiding the need to select a threshold. In this paper, we use a flexible censored hierarchical model that leverages the GH distribution to accommodate data subject to heavy tails and an excessive number of zeros. The fitted model allows estimation of probabilities and return periods of the rainfall extremes, and it produces narrower credible intervals in the tails than the traditional GP method. The model not only fits the tails of the rainfall distribution, but fits the whole distribution very well. It also efficiently represents short-term dependencies in the data so it is suitable for evaluating duration over and below thresholds as well as duration of zero rainfall.
△ Less
Submitted 8 November, 2014;
originally announced November 2014.