-
Self-Directed Learning of Convex Labelings on Graphs
Authors:
Georgy Sokolov,
Maximilian Thiessen,
Margarita Akhmejanova,
Fabio Vitale,
Francesco Orabona
Abstract:
We study the problem of classifying the nodes of a given graph in the self-directed learning setup. This learning setting is a variant of online learning, where rather than an adversary determining the sequence in which nodes are presented, the learner autonomously and adaptively selects them. While self-directed learning of Euclidean halfspaces, linear functions, and general multiclass hypothesis…
▽ More
We study the problem of classifying the nodes of a given graph in the self-directed learning setup. This learning setting is a variant of online learning, where rather than an adversary determining the sequence in which nodes are presented, the learner autonomously and adaptively selects them. While self-directed learning of Euclidean halfspaces, linear functions, and general multiclass hypothesis classes was recently considered, no results previously existed specifically for self-directed node classification on graphs. In this paper, we address this problem developing efficient algorithms for it. More specifically, we focus on the case of (geodesically) convex clusters, i.e., for every two nodes sharing the same label, all nodes on every shortest path between them also share the same label. In particular, we devise an algorithm with runtime polynomial in $n$ that makes only $3(h(G)+1)^4 \ln n$ mistakes on graphs with two convex clusters, where $n$ is the total number of nodes and $h(G)$ is the Hadwiger number, i.e., the size of the largest clique minor of the graph $G$. We also show that our algorithm is robust to the case that clusters are slightly non-convex, still achieving a mistake bound logarithmic in $n$. Finally, we devise a simple and efficient algorithm for homophilic clusters, where strongly connected nodes tend to belong to the same class.
△ Less
Submitted 12 February, 2025; v1 submitted 2 September, 2024;
originally announced September 2024.
-
Detecting an Intermittent Change of Unknown Duration
Authors:
Grigory Sokolov,
Valentin S. Spivak,
Alexander G. Tartakovsky
Abstract:
Oftentimes in practice, the observed process changes statistical properties at an unknown point in time and the duration of a change is substantially finite, in which case one says that the change is intermittent or transient. We provide an overview of existing approaches for intermittent change detection and advocate in favor of a particular setting driven by the intermittent nature of the change…
▽ More
Oftentimes in practice, the observed process changes statistical properties at an unknown point in time and the duration of a change is substantially finite, in which case one says that the change is intermittent or transient. We provide an overview of existing approaches for intermittent change detection and advocate in favor of a particular setting driven by the intermittent nature of the change. We propose a novel optimization criterion that is more appropriate for many applied areas such as the detection of threats in physical-computer systems, near-Earth space informatics, epidemiology, pharmacokinetics, etc. We argue that controlling the local conditional probability of a false alarm, rather than the familiar average run length to a false alarm, and maximizing the local conditional probability of detection is a more reasonable approach versus a traditional quickest change detection approach that requires minimizing the expected delay to detection. We adopt the maximum likelihood (ML) approach with respect to the change duration and show that several commonly used detection rules (CUSUM, window-limited CUSUM, and FMA) are equivalent to the ML-based stopping times. We discuss how to choose design parameters for these rules and provide a comprehensive simulation study to corroborate intuitive expectations.
△ Less
Submitted 10 April, 2023; v1 submitted 31 October, 2022;
originally announced October 2022.
-
An Analytic Expression for the Distribution of the Generalized Shiryaev-Roberts Diffusion
Authors:
Aleksey S. Polunchenko,
Grigory Sokolov
Abstract:
We consider the quickest change-point detection problem where the aim is to detect the onset of a pre-specified drift in "live"-monitored standard Brownian motion; the change-point is assumed unknown (nonrandom). The topic of interest is the distribution of the Generalized Shryaev-Roberts (GSR) detection statistic set up to "sense" the presence of the drift. Specifically, we derive a closed-form f…
▽ More
We consider the quickest change-point detection problem where the aim is to detect the onset of a pre-specified drift in "live"-monitored standard Brownian motion; the change-point is assumed unknown (nonrandom). The topic of interest is the distribution of the Generalized Shryaev-Roberts (GSR) detection statistic set up to "sense" the presence of the drift. Specifically, we derive a closed-form formula for the transition probability density function (pdf) of the time-homogeneous Markov diffusion process generated by the GSR statistic when the Brownian motion under surveillance is "drift-free", i.e., in the pre-change regime; the GSR statistic's (deterministic) nonnegative headstart is assumed arbitrarily given. The transition pdf formula is found analytically, through direct solution of the respective Kolmogorov forward equation via the Fourier spectral method to achieve separation of the spacial and temporal variables. The obtained result generalizes the well-known formula for the (pre-change) stationary distribution of the GSR statistic: the latter's stationary distribution is the temporal limit of the distribution sought in this work. To conclude, we exploit the obtained formula numerically and briefly study the pre-change behavior of the GSR statistic versus three factors: (a) drift-shift magnitude, (b) time, and (c) the GSR statistic's headstart.
△ Less
Submitted 15 January, 2016;
originally announced January 2016.
-
On Robustness of the Shiryaev-Roberts Procedure for Quickest Change-Point Detection under Parameter Misspecification in the Post-Change Distribution
Authors:
Wenyu Du,
Aleksey S. Polunchenko,
Grigory Sokolov
Abstract:
The gist of the quickest change-point detection problem is to detect the presence of a change in the statistical behavior of a series of sequentially made observations, and do so in an optimal detection-speed-vs.-"false-positive"-risk manner. When optimality is understood either in the generalized Bayesian sense or as defined in Shiryaev's multi-cyclic setup, the so-called Shiryaev-Roberts (SR) de…
▽ More
The gist of the quickest change-point detection problem is to detect the presence of a change in the statistical behavior of a series of sequentially made observations, and do so in an optimal detection-speed-vs.-"false-positive"-risk manner. When optimality is understood either in the generalized Bayesian sense or as defined in Shiryaev's multi-cyclic setup, the so-called Shiryaev-Roberts (SR) detection procedure is known to be the "best one can do", provided, however, that the observations' pre- and post-change distributions are both fully specified. We consider a more realistic setup, viz. one where the post-change distribution is assumed known only up to a parameter, so that the latter may be "misspecified". The question of interest is the sensitivity (or robustness) of the otherwise "best" SR procedure with respect to a possible misspecification of the post-change distribution parameter. To answer this question, we provide a case study where, in a specific Gaussian scenario, we allow the SR procedure to be "out of tune" in the way of the post-change distribution parameter, and numerically assess the effect of the "mistuning" on Shiryaev's (multi-cyclic) Stationary Average Detection Delay delivered by the SR procedure. The comprehensive quantitative robustness characterization of the SR procedure obtained in the study can be used to develop the respective theory as well as to provide a rational for practical design of the SR procedure. The overall qualitative conclusion of the study is an expected one: the SR procedure is less (more) robust for less (more) contrast changes and for lower (higher) levels of the false alarm risk.
△ Less
Submitted 18 April, 2015;
originally announced April 2015.
-
Second-Order Asymptotic Optimality in Multisensor Sequential Change Detection
Authors:
Georgios Fellouris,
Grigory Sokolov
Abstract:
A generalized multisensor sequential change detection problem is considered, in which a number of (possibly correlated) sensors monitor an environment in real time, the joint distribution of their observations is determined by a global parameter vector, and at some unknown time there is a change in an unknown subset of components of this parameter vector. In this setup, we consider the problem of…
▽ More
A generalized multisensor sequential change detection problem is considered, in which a number of (possibly correlated) sensors monitor an environment in real time, the joint distribution of their observations is determined by a global parameter vector, and at some unknown time there is a change in an unknown subset of components of this parameter vector. In this setup, we consider the problem of detecting the time of the change as soon as possible, while controlling the rate of false alarms. We establish the second-order asymptotic optimality (with respect to Lorden's criterion) of various generalizations of the CUSUM rule; that is, we show that their additional expected worst-case detection delay (relative to the one that could be achieved if the affected subset was known) remains bounded as the rate of false alarm goes to 0, for any possible subset of affected components. This general framework incorporates the traditional multisensor setup in which only an unknown subset of sensors is affected by the change. The latter problem has a special structure which we exploit in order to obtain feasible representations of the proposed schemes. We present the results of a simulation study where we compare the proposed schemes with scalable detection rules that are only first-order asymptotically optimal. Finally, in the special case that the change affects exactly one sensor, we consider the scheme that runs in parallel the local CUSUM rules and study the problem of specifying the local thresholds.
△ Less
Submitted 10 January, 2016; v1 submitted 14 October, 2014;
originally announced October 2014.
-
An Exact Formula for the Average Run Length to False Alarm of the Generalized Shiryaev-Roberts Procedure for Change-Point Detection under Exponential Observations
Authors:
Wenyu Du,
Grigory Sokolov,
Aleksey S. Polunchenko
Abstract:
We derive analytically an exact closed-form formula for the standard minimax Average Run Length (ARL) to false alarm delivered by the Generalized Shiryaev-Roberts (GSR) change-point detection procedure devised to detect a shift in the baseline mean of a sequence of independent exponentially distributed observations. Specifically, the formula is found through direct solution of the respective integ…
▽ More
We derive analytically an exact closed-form formula for the standard minimax Average Run Length (ARL) to false alarm delivered by the Generalized Shiryaev-Roberts (GSR) change-point detection procedure devised to detect a shift in the baseline mean of a sequence of independent exponentially distributed observations. Specifically, the formula is found through direct solution of the respective integral (renewal) equation, and is a general result in that the GSR procedure's headstart is not restricted to a bounded range, nor is there a "ceiling" value for the detection threshold. Apart from the theoretical significance (in change-point detection, exact closed-form performance formulae are typically either difficult or impossible to get, especially for the GSR procedure), the obtained formula is also useful to a practitioner: in cases of practical interest, the formula is a function linear in both the detection threshold and the headstart, and, therefore, the ARL to false alarm of the GSR procedure can be easily computed.
△ Less
Submitted 21 October, 2014; v1 submitted 29 August, 2014;
originally announced August 2014.
-
Efficient Performance Evaluation of the Generalized Shiryaev--Roberts Detection Procedure in a Multi-Cyclic Setup
Authors:
Aleksey S. Polunchenko,
Grigory Sokolov,
Wenyu Du
Abstract:
We propose a numerical method to evaluate the performance of the emerging Generalized Shiryaev--Roberts (GSR) change-point detection procedure in a "minimax-ish" multi-cyclic setup where the procedure of choice is applied repetitively (cyclically) and the change is assumed to take place at an unknown time moment in a distant-future stationary regime. Specifically, the proposed method is based on t…
▽ More
We propose a numerical method to evaluate the performance of the emerging Generalized Shiryaev--Roberts (GSR) change-point detection procedure in a "minimax-ish" multi-cyclic setup where the procedure of choice is applied repetitively (cyclically) and the change is assumed to take place at an unknown time moment in a distant-future stationary regime. Specifically, the proposed method is based on the integral-equations approach and uses the collocation technique with the basis functions chosen so as to exploit a certain change-of-measure identity and the GSR detection statistic's unique martingale property. As a result, the method's accuracy and robustness improve, as does its efficiency since using the change-of-measure ploy the Average Run Length (ARL) to false alarm and the Stationary Average Detection Delay (STADD) are computed simultaneously. We show that the method's rate of convergence is quadratic and supply a tight upperbound on its error. We conclude with a case study and confirm experimentally that the proposed method's accuracy and rate of convergence are robust with respect to three factors: (a) partition fineness (coarse vs. fine), (b) change magnitude (faint vs. contrast), and (c) the level of the ARL to false alarm (low vs. high). Since the method is designed not restricted to a particular data distribution or to a specific value of the GSR detection statistic's headstart, this work may help gain greater insight into the characteristics of the GSR procedure and aid a practitioner to design the GSR procedure as needed while fully utilizing its potential.
△ Less
Submitted 17 December, 2013;
originally announced December 2013.
-
Optimal Design and Analysis of the Exponentially Weighted Moving Average Chart for Exponential Data
Authors:
Aleksey S. Polunchenko,
Grigory Sokolov,
Alexander G. Tartakovsky
Abstract:
We study optimal design of the Exponentially Weighted Moving Average (EWMA) chart by a proper choice of the smoothing factor and the initial value (headstart) of the decision statistic. The particular problem addressed is that of quickest detection of an abrupt change in the parameter of a discrete-time exponential model. Both pre- and post-change parameter values are assumed known, but the change…
▽ More
We study optimal design of the Exponentially Weighted Moving Average (EWMA) chart by a proper choice of the smoothing factor and the initial value (headstart) of the decision statistic. The particular problem addressed is that of quickest detection of an abrupt change in the parameter of a discrete-time exponential model. Both pre- and post-change parameter values are assumed known, but the change-point is not known. For this change-point detection scenario, we examine the performance of the conventional one-sided EWMA chart with respect to two optimality criteria: Pollak's minimax criterion associated with the maximal conditional expected delay to detection and Shiryaev's multi-cyclic setup associated with the stationary expected delay to detection. Using the integral-equations approach, we derive the exact closed-form formulae for all of the required performance measures. Based on these formulae we find the optimal smoothing factor and headstart by solving the corresponding two bivariate constraint optimization problems. Finally, the performance of the optimized EWMA chart is compared against that of the Shiryaev--Roberts--$r$ procedure in the minimax setting, and against that of the original Shiryaev--Roberts procedure in the multi-cyclic setting. The main conclusion is that the EWMA chart, when fully optimized, turns out to be a very competitive procedure, with performance nearly indistinguishable from that of the known-to-be-best Shiryaev--Roberts--$r$ and Shiryaev--Roberts procedures.
△ Less
Submitted 6 November, 2014; v1 submitted 26 July, 2013;
originally announced July 2013.
-
An Accurate Method for Determining the Pre-Change Run-Length Distribution of the Generalized Shiryaev--Roberts Detection Procedure
Authors:
Aleksey S. Polunchenko,
Grigory Sokolov,
Wenyu Du
Abstract:
Change-of-measure is a powerful technique used across statistics, probability and analysis. Particularly known as Wald's likelihood ratio identity, the technique enabled the proof of a number of exact and asymptotic optimality results pertaining to the problem of quickest change-point detection. Within the latter problem's context we apply the technique to develop a numerical method to compute the…
▽ More
Change-of-measure is a powerful technique used across statistics, probability and analysis. Particularly known as Wald's likelihood ratio identity, the technique enabled the proof of a number of exact and asymptotic optimality results pertaining to the problem of quickest change-point detection. Within the latter problem's context we apply the technique to develop a numerical method to compute the Generalized Shiryaev--Roberts (GSR) detection procedure's pre-change Run-Length distribution. Specifically, the method is based on the integral-equations approach and uses the collocation framework with the basis functions chosen so as to exploit a certain change-of-measure identity and a specific martingale property of the GSR procedure's detection statistic. As a result, the method's accuracy and robustness improve substantially, even though the method's theoretical rate of convergence is shown to be merely quadratic. A tight upper bound on the method's error is supplied as well. The method is not restricted to a particular data distribution or to a specific value of the GSR detection statistic's "headstart". To conclude, we offer a case study to demonstrate the proposed method at work, drawing particular attention to the method's accuracy and its robustness with respect to three factors: (a) partition size, (b) change magnitude, and (c) Average Run Length (ARL) to false alarm level. Specifically, assuming independent standard Gaussian observations undergoing a surge in the mean, we employ the method to study the GSR procedure's Run-Length's pre-change distribution, its average (i.e., the usual ARL to false alarm) and standard deviation. As expected from the theoretical analysis, the method's high accuracy and robustness with respect to the foregoing three factors are confirmed experimentally. We also comment on extending the method to handle other performance measures and other procedures.
△ Less
Submitted 14 October, 2013; v1 submitted 11 July, 2013;
originally announced July 2013.
-
Efficient Computer Network Anomaly Detection by Changepoint Detection Methods
Authors:
Alexander G. Tartakovsky,
Aleksey S. Polunchenko,
Grigory Sokolov
Abstract:
We consider the problem of efficient on-line anomaly detection in computer network traffic. The problem is approached statistically, as that of sequential (quickest) changepoint detection. A multi-cyclic setting of quickest change detection is a natural fit for this problem. We propose a novel score-based multi-cyclic detection algorithm. The algorithm is based on the so-called Shiryaev-Roberts pr…
▽ More
We consider the problem of efficient on-line anomaly detection in computer network traffic. The problem is approached statistically, as that of sequential (quickest) changepoint detection. A multi-cyclic setting of quickest change detection is a natural fit for this problem. We propose a novel score-based multi-cyclic detection algorithm. The algorithm is based on the so-called Shiryaev-Roberts procedure. This procedure is as easy to employ in practice and as computationally inexpensive as the popular Cumulative Sum chart and the Exponentially Weighted Moving Average scheme. The likelihood ratio based Shiryaev-Roberts procedure has appealing optimality properties, particularly it is exactly optimal in a multi-cyclic setting geared to detect a change occurring at a far time horizon. It is therefore expected that an intrusion detection algorithm based on the Shiryaev-Roberts procedure will perform better than other detection schemes. This is confirmed experimentally for real traces. We also discuss the possibility of complementing our anomaly detection algorithm with a spectral-signature intrusion detection system with false alarm filtering and true attack confirmation capability, so as to obtain a synergistic system.
△ Less
Submitted 8 December, 2012;
originally announced December 2012.