-
On Limiting Probability Distributions of Higher Order Markov Chains
Authors:
Lixing Han,
Jianhong Xu
Abstract:
The limiting (or stationary) probability distribution is one of the key characteristics of a Markov chain since it shows its long-term behavior. In this paper, for a higher order Markov chain, we establish a sufficient condition for the existence of its limiting probability distribution. This condition is built upon the regularity of its transition tensor. Our results extend the corresponding conclusions for first order chains. Besides, they complement the existing results concerning higher order chains that rely on approximation schemes or two-phase power iterations.
Submitted 10 June, 2025;
originally announced June 2025.
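The abstract above extends the classical first-order result, where a regular transition matrix guarantees a unique limiting distribution. As a minimal sketch of that first-order baseline only (not the tensor-based condition of the paper), the power iteration below approximates the limiting distribution of a small chain. The function name `limiting_distribution`, the tolerance, and the 2-state matrix `P` are illustrative assumptions.

```python
def limiting_distribution(P, tol=1e-12, max_iter=10_000):
    """Approximate the limiting distribution of a first-order chain.

    P is assumed row-stochastic: P[i][j] = Pr(next = j | current = i).
    Convergence is only expected when P is regular (some power of P
    has all entries strictly positive).
    """
    n = len(P)
    x = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(max_iter):
        # one step of x <- x P (row vector times matrix)
        nxt = [sum(x[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, x)) < tol:
            return nxt
        x = nxt
    raise RuntimeError("power iteration did not converge")


# Hypothetical 2-state regular chain used only for illustration.
P = [[0.9, 0.1],
     [0.5, 0.5]]
pi = limiting_distribution(P)
```

For this matrix the balance equation 0.1·π₀ = 0.5·π₁ gives π = (5/6, 1/6), which the iteration approaches regardless of the starting vector.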
-
Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning
Authors:
Nikhil Shivakumar Nayak,
Krishnateja Killamsetty,
Ligong Han,
Abhishek Bhandwaldar,
Prateek Chanda,
Kai Xu,
Hao Wang,
Aldo Pareja,
Oleg Silkin,
Mustafa Eyceoz,
Akash Srivastava
Abstract:
Continual learning in large language models (LLMs) is prone to catastrophic forgetting, where adapting to new tasks significantly degrades performance on previously learned ones. Existing methods typically rely on low-rank, parameter-efficient updates that limit the model's expressivity and introduce additional parameters per task, leading to scalability issues. To address these limitations, we propose a novel continual full fine-tuning approach leveraging adaptive singular value decomposition (SVD). Our method dynamically identifies task-specific low-rank parameter subspaces and constrains updates to be orthogonal to critical directions associated with prior tasks, thus effectively minimizing interference without additional parameter overhead or storing previous task gradients. We evaluate our approach extensively on standard continual learning benchmarks using both encoder-decoder (T5-Large) and decoder-only (LLaMA-2 7B) models, spanning diverse tasks including classification, generation, and reasoning. Empirically, our method achieves state-of-the-art results, up to 7% higher average accuracy than recent baselines like O-LoRA, and notably maintains the model's general linguistic capabilities, instruction-following accuracy, and safety throughout the continual learning process by reducing forgetting to near-negligible levels. Our adaptive SVD framework effectively balances model plasticity and knowledge retention, providing a practical, theoretically grounded, and computationally scalable solution for continual learning scenarios in large language models.
Submitted 9 April, 2025;
originally announced April 2025.
-
Targeted Data Fusion for Causal Survival Analysis Under Distribution Shift
Authors:
Yi Liu,
Alexander W. Levis,
Ke Zhu,
Shu Yang,
Peter B. Gilbert,
Larry Han
Abstract:
Causal inference across multiple data sources offers a promising avenue to enhance the generalizability and replicability of scientific findings. However, data integration methods for time-to-event outcomes, common in biomedical research, are underdeveloped. Existing approaches focus on binary or continuous outcomes but fail to address the unique challenges of survival analysis, such as censoring and the integration of discrete and continuous time. To bridge this gap, we propose two novel methods for estimating target site-specific causal effects in multi-source settings. First, we develop a semiparametric efficient estimator for settings where individual-level data can be shared across sites. Second, we introduce a federated learning framework designed for privacy-constrained environments, which dynamically reweights source-specific contributions to account for discrepancies with the target population. Both methods leverage flexible, nonparametric machine learning models to improve robustness and efficiency. We illustrate the utility of our approaches through simulation studies and an application to multi-site randomized trials of monoclonal neutralizing antibodies for HIV-1 prevention, conducted among cisgender men and transgender persons in the United States, Brazil, Peru, and Switzerland, as well as among women in sub-Saharan Africa. Our findings underscore the potential of these methods to enable efficient, privacy-preserving causal inference for time-to-event outcomes under distribution shift.
Submitted 14 May, 2025; v1 submitted 30 January, 2025;
originally announced January 2025.
-
Variable selection for partially linear single-index varying-coefficient model
Authors:
Lijuan Han,
Liugen Xue,
Junshan Xie
Abstract:
This paper focuses on variable selection for a partially linear single-index varying-coefficient model. A regularized variable selection procedure that combines basis function approximations with the SCAD penalty is proposed. It can simultaneously select significant variables in the parametric and nonparametric components and estimate the nonzero regression coefficients and coefficient functions. The consistency of the variable selection procedure and the oracle property of the penalized least-squares estimators for high-dimensional data are established. Simulations and a real data analysis are conducted to illustrate the finite-sample performance of the proposed method.
Submitted 17 December, 2024;
originally announced December 2024.
-
On the Role of Surrogates in Conformal Inference of Individual Causal Effects
Authors:
Chenyin Gao,
Peter B. Gilbert,
Larry Han
Abstract:
Learning the Individual Treatment Effect (ITE) is essential for personalized decision-making, yet causal inference has traditionally focused on aggregated treatment effects. While integrating conformal prediction with causal inference can provide valid uncertainty quantification for ITEs, the resulting prediction intervals are often excessively wide, limiting their practical utility. To address this limitation, we introduce Surrogate-assisted Conformal Inference for Efficient Individual Causal Effects (SCIENCE), a framework designed to construct more efficient prediction intervals for ITEs. SCIENCE accommodates the covariate shifts between source data and target data and applies to various data configurations, including semi-supervised and surrogate-assisted semi-supervised learning. Leveraging semi-parametric efficiency theory, SCIENCE produces rate double-robust prediction intervals under mild rate convergence conditions, permitting the use of flexible non-parametric models to estimate nuisance functions. We quantify efficiency gains by comparing semi-parametric efficiency bounds with and without the surrogates. Simulation studies demonstrate that our surrogate-assisted intervals offer substantial efficiency improvements over existing methods while maintaining valid group-conditional coverage. Applied to the phase 3 Moderna COVE COVID-19 vaccine trial, SCIENCE illustrates how multiple surrogate markers can be leveraged to generate more efficient prediction intervals.
Submitted 21 January, 2025; v1 submitted 16 December, 2024;
originally announced December 2024.
-
Computational Graph Representation of Equations System Constructors in Hierarchical Circuit Simulation
Authors:
Zichao Long,
Lin Li,
Lei Han,
Xianglong Meng,
Chongjun Ding,
Ruiyan Li,
Wu Jiang,
Fuchen Ding,
Jiaqing Yue,
Zhichao Li,
Yisheng Hu,
Ding Li,
Heng Liao
Abstract:
Equations system constructors of hierarchical circuits play a central role in device modeling, nonlinear equations solving, and circuit design automation. However, existing constructors present limitations in applications to different extents. For example, the costs of developing and reusing device models -- especially coarse-grained equivalent models of circuit modules -- remain high while parameter sensitivity analysis is complex and inefficient. Inspired by differentiable programming and leveraging the ecosystem benefits of open-source software, we propose an equations system constructor using the computational graph representation, along with its JSON format netlist, to address these limitations. This representation allows for runtime dependencies between signals and subcircuit/device parameters. The proposed method streamlines the model development process and facilitates end-to-end computation of gradients of equations remainders with respect to parameters. This paper discusses in detail the overarching concept of hierarchical subcircuit/device decomposition and nested invocation by drawing parallels to functions in programming languages, and introduces rules for parameters passing and gradient propagation across hierarchical circuit modules. The presented numerical examples, including (1) an uncoupled CMOS model representation using "equivalent circuit decomposition+dynamic parameters" and (2) operational amplifier (OpAmp) auto device sizing, have demonstrated that the proposed method supports circuit simulation and design and particularly subcircuit modeling with improved efficiency, simplicity, and decoupling compared to existing techniques.
Submitted 4 July, 2024;
originally announced July 2024.
-
Decreasing and complete monotonicity of two functions defined by three derivatives of a completely monotonic function involving the trigamma function
Authors:
Hong-Ping Yin,
Ling-Xiong Han,
Feng Qi
Abstract:
In the paper, by means of the convolution theorem for Laplace transforms, a monotonicity rule for the ratio of two Laplace transforms, Bernstein's theorem for completely monotonic functions, and other analytic techniques, the authors verify the decreasing property of a ratio between three derivatives of a function involving the trigamma function and find necessary and sufficient conditions for a function defined by three derivatives of a function involving the trigamma function to be completely monotonic. These results confirm previous guesses posed by Qi and generalize corresponding known conclusions.
Submitted 24 May, 2024;
originally announced May 2024.
-
Standing wave solutions and instability for the Logarithmic Klein-Gordon equation
Authors:
Lijia Han,
Yue Qiu,
Xiaohong Wang
Abstract:
In this paper, we study the standing wave solutions of the Klein-Gordon equation with logarithmic nonlinearity.
The existence of the standing wave solution related to the ground state $φ_0(x)$ is obtained. Further, we prove the instability of solutions around $φ_0(x)$.
Submitted 18 February, 2024;
originally announced February 2024.
-
Scattering for the magnetic Zakharov system in 3 dimensions
Authors:
Xiaohong Wang,
Lijia Han
Abstract:
We consider the global existence and scattering of solutions of the magnetic Zakharov system in three-dimensional space. When the initial data is small, we prove the existence of smooth global solutions and scattering results by combining the space-time resonance method, weighted Sobolev spaces and dispersive estimates. Moreover, the decay rates for the solutions are also obtained.
Submitted 16 February, 2024;
originally announced February 2024.
-
Multiply Robust Federated Estimation of Targeted Average Treatment Effects
Authors:
Larry Han,
Zhu Shen,
Jose Zubizarreta
Abstract:
Federated or multi-site studies have distinct advantages over single-site studies, including increased generalizability, the ability to study underrepresented populations, and the opportunity to study rare exposures and outcomes. However, these studies are challenging due to the need to preserve the privacy of each individual's data and the heterogeneity in their covariate distributions. We propose a novel federated approach to derive valid causal inferences for a target population using multi-site data. We adjust for covariate shift and covariate mismatch between sites by developing multiply-robust and privacy-preserving nuisance function estimation. Our methodology incorporates transfer learning to estimate ensemble weights to combine information from source sites. We show that these learned weights are efficient and optimal under different scenarios. We showcase the finite sample advantages of our approach in terms of efficiency and robustness compared to existing approaches.
Submitted 21 September, 2023;
originally announced September 2023.
-
Student's t-Distribution: On Measuring the Inter-Rater Reliability When the Observations are Scarce
Authors:
Serge Gladkoff,
Lifeng Han,
Goran Nenadic
Abstract:
In natural language processing (NLP) we always rely on human judgement as the gold-standard quality evaluation method. However, there has been an ongoing debate on how to better evaluate inter-rater reliability (IRR) levels for certain evaluation tasks, such as translation quality evaluation (TQE), especially when the data samples (observations) are very scarce. In this work, we first introduce the study on how to estimate the confidence interval for the measurement value when only one data (evaluation) point is available. Then, this leads to our example with two human-generated observational scores, for which we introduce the "Student's t-Distribution" method and explain how to use it to measure the IRR score using only these two data points, as well as the confidence intervals (CIs) of the quality evaluation. We give a quantitative analysis of how the evaluation confidence can be greatly improved by introducing more observations, even if only one extra observation. We encourage researchers to report their IRR scores by all possible means, e.g. using the Student's t-Distribution method whenever possible, thus making the NLP evaluation more meaningful, transparent, and trustworthy. This t-Distribution method can also be used outside of NLP fields to measure the IRR level for trustworthy evaluation of experimental investigations, whenever the observational data is scarce.
Keywords: Inter-Rater Reliability (IRR); Scarce Observations; Confidence Intervals (CIs); Natural Language Processing (NLP); Translation Quality Evaluation (TQE); Student's t-Distribution
Submitted 9 July, 2023; v1 submitted 8 March, 2023;
originally announced March 2023.
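The abstract above argues that even one extra observation sharply tightens a Student's t confidence interval. The sketch below illustrates that effect with the textbook two-sided 95% t-interval for a mean; it is not the paper's IRR score, whose definition is not given in the abstract. The function name `t_interval_95`, the hard-coded critical-value table, and the example scores are illustrative assumptions.

```python
import math
import statistics

# Standard two-sided 95% Student's t critical values, keyed by degrees of freedom.
T_CRIT_95 = {1: 12.706, 2: 4.303, 3: 3.182, 4: 2.776}


def t_interval_95(scores):
    """Return the (lower, upper) 95% t-interval for the mean of 2 to 5 scores."""
    n = len(scores)
    df = n - 1  # degrees of freedom
    mean = statistics.mean(scores)
    sem = statistics.stdev(scores) / math.sqrt(n)  # standard error of the mean
    half_width = T_CRIT_95[df] * sem
    return mean - half_width, mean + half_width


# Hypothetical rater scores on a 0-100 scale.
two = t_interval_95([80, 90])        # two observations, df = 1
three = t_interval_95([80, 90, 85])  # one extra observation, df = 2
```

With two scores the critical value is 12.706 and the half-width is about 63.5 points; adding a third score drops the critical value to 4.303 and the interval narrows considerably.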
-
Is Stubborn Mining Severe in Imperfect GHOST Bitcoin-like Blockchains? Quantitative Analysis
Authors:
Haoran Zhu,
Xiaolin Chang,
Jelena Mišić,
Vojislav B. Mišić,
Lei Han,
Zhi Chen
Abstract:
GHOST, like the longest-chain protocol, is a chain selection protocol, and its capability to resist selfish mining attacks has been validated in imperfect blockchains of Bitcoin and its variants (Bitcoin-like). This paper explores an analytical-model-based method to investigate the impact of stubborn mining attacks in imperfect GHOST Bitcoin-like blockchains. We first quantify chain dynamics based on a Markov chain and then derive formulas for miner revenue and system throughput. We also propose a new metric, "Hazard Index", which can be used to compare attack severity and to assist an attacker in determining whether it is profitable to conduct an attack. The experimental results show that 1) an attacker with more than 30% of the computing power can obtain large profits and severely degrade system throughput by launching a stubborn mining attack; 2) a rational attacker should not launch a stubborn mining attack if it has less than 25% of the computing power; 3) a stubborn mining attack causes more damage than a selfish mining attack under GHOST. Our work provides insight into stubborn mining attacks and is helpful in designing countermeasures.
Submitted 31 January, 2023;
originally announced February 2023.
-
Centralizers of nilpotent elements in basic classical Lie superalgebras in good characteristic
Authors:
Leyu Han
Abstract:
Let \mathfrak{g}=\mathfrak{g}_{\bar{0}}\oplus\mathfrak{g}_{\bar{1}} be a basic classical Lie superalgebra over an algebraically closed field \mathbb{K} whose characteristic p>0 is a good prime for \mathfrak{g}. Let G_{\bar{0}} be the reductive algebraic group over \mathbb{K} such that \mathrm{Lie}(G_{\bar{0}})=\mathfrak{g}_{\bar{0}}. Suppose e\in\mathfrak{g}_{\bar{0}} is nilpotent. Write \mathfrak{g}^{e} for the centralizer of e in \mathfrak{g} and \mathfrak{z}(\mathfrak{g}^{e}) for the centre of \mathfrak{g}^{e}. We calculate a basis for \mathfrak{g}^{e} and \mathfrak{z}(\mathfrak{g}^{e}) by using associated cocharacters τ:\mathbb{K}^{\times}\rightarrow G_{\bar{0}} of e. In addition, we give the classification of e which are reachable, strongly reachable or satisfy the Panyushev property for exceptional Lie superalgebras D(2,1;α), G(3) and F(4).
Submitted 24 October, 2022;
originally announced October 2022.
-
Reachable elements in basic classical Lie superalgebras
Authors:
Leyu Han
Abstract:
Let \mathfrak{g}=\mathfrak{g}_{\bar{0}}\oplus\mathfrak{g}_{\bar{1}} be a basic classical Lie superalgebra over \mathbb{C}, e\in\mathfrak{g}_{\bar{0}} a nilpotent element and \mathfrak{g}^{e} the centralizer of e in \mathfrak{g}. We study various properties of nilpotent elements in \mathfrak{g}, which have previously only been considered in the case of Lie algebras. In particular, we prove that e is reachable if and only if e satisfies the Panyushev property for \mathfrak{g}=\mathfrak{sl}(m|n), m\neq n or \mathfrak{psl}(n|n) and \mathfrak{osp}(m|2n). For exceptional Lie superalgebras \mathfrak{g}=D(2,1;α), G(3), F(4), we give the classification of e which are reachable, strongly reachable or satisfy the Panyushev property. In addition, we give bases for \mathfrak{g}^{e} and its centre \mathfrak{z}(\mathfrak{g}^{e}) for \mathfrak{g}=\mathfrak{psl}(n|n), which completes results of Han on the relationship between \dim\mathfrak{g}^{e}, \dim\mathfrak{z}(\mathfrak{g}^{e}) and the labelled Dynkin diagrams for all basic classical Lie superalgebras.
Submitted 20 September, 2022;
originally announced September 2022.
-
Centers of centralizers of nilpotent elements in exceptional Lie superalgebras
Authors:
Leyu Han
Abstract:
Let $\mathfrak{g}=\mathfrak{g}_{\bar{0}}\oplus\mathfrak{g}_{\bar{1}}$ be a finite-dimensional simple Lie superalgebra of type $D(2,1;α)$, $G(3)$ or $F(4)$ over $\mathbb{C}$. Let $G$ be the simply connected semisimple algebraic group over $\mathbb{C}$ such that $\mathrm{Lie}(G)=\mathfrak{g}_{\bar{0}}$. Suppose $e\in\mathfrak{g}_{\bar{0}}$ is nilpotent. We describe the centralizer $\mathfrak{g}^{e}$ of $e$ in $\mathfrak{g}$ and its centre $\mathfrak{z}(\mathfrak{g}^{e})$ especially. We also determine the labelled Dynkin diagram for $e$. We prove theorems relating the dimension of $\left(\mathfrak{z}(\mathfrak{g}^{e})\right)^{G^{e}}$ and the labelled Dynkin diagram.
Submitted 8 March, 2022;
originally announced March 2022.
-
Centres of centralizers of nilpotent elements in Lie superalgebras $\mathfrak{sl}(m|n)$ or $\mathfrak{osp}(m|2n)$
Authors:
Leyu Han
Abstract:
Let $\bar{G}$ be the simple algebraic supergroup $\mathrm{SL}(m|n)$ or $\mathrm{OSp}(m|2n)$ over $\mathbb{C}$. Let $\mathfrak{g}=\mathrm{Lie}(\bar{G})=\mathfrak{g}_{\bar{0}}\oplus\mathfrak{g}_{\bar{1}}$ and let $G=\bar{G}(\mathbb{C})$ where $\mathbb{C}$ is considered as a superalgebra concentrated in even degree. Suppose $e\in\mathfrak{g}_{\bar{0}}$ is nilpotent. We describe the centralizer $\mathfrak{g}^{e}$ of $e$ in $\mathfrak{g}$ and its centre $\mathfrak{z}(\mathfrak{g}^{e})$. In particular, we give bases for $\mathfrak{g}^{e}$, $\mathfrak{z}(\mathfrak{g}^{e})$ and $\left(\mathfrak{z}(\mathfrak{g}^{e})\right)^{G^{e}}$. We also determine the labelled Dynkin diagram $\varDelta$ with respect to $e$ and subsequently describe the relation between $\left(\mathfrak{z}(\mathfrak{g}^{e})\right)^{G^{e}}$ and $\varDelta$.
Submitted 8 March, 2022;
originally announced March 2022.
-
Resurgence and Partial Theta Series
Authors:
Li Han,
Yong Li,
David Sauzin,
Shanzhong Sun
Abstract:
We consider partial theta series associated with periodic sequences of coefficients, of the form $Θ(τ) := \sum_{n>0} n^νf(n) e^{iπn^2τ/M}$, with $ν$ non-negative integer and an $M$-periodic function $f : \mathbb{Z} \rightarrow \mathbb{C}$. Such a function is analytic in the half-plane $\{Im(τ)>0\}$ and as $τ$ tends non-tangentially to any $α\in\mathbb{Q}$, a formal power series appears in the asymptotic behaviour of $Θ(τ)$, depending on the parity of $ν$ and $f$. We discuss the summability and resurgence properties of these series by means of explicit formulas for their formal Borel transforms, and the consequences for the modularity properties of $Θ$, or its ``quantum modularity'' properties in the sense of Zagier's recent theory. The Discrete Fourier Transform of $f$ plays an unexpected role and leads to a number-theoretic analogue of Écalle's ``Bridge Equations''. The motto is: (quantum) modularity = Stokes phenomenon + Discrete Fourier Transform.
Submitted 7 July, 2022; v1 submitted 30 December, 2021;
originally announced December 2021.
-
Federated Adaptive Causal Estimation (FACE) of Target Treatment Effects
Authors:
Larry Han,
Jue Hou,
Kelly Cho,
Rui Duan,
Tianxi Cai
Abstract:
Federated learning of causal estimands may greatly improve estimation efficiency by leveraging data from multiple study sites, but robustness to heterogeneity and model misspecifications is vital for ensuring validity. We develop a Federated Adaptive Causal Estimation (FACE) framework to incorporate heterogeneous data from multiple sites to provide treatment effect estimation and inference for a flexibly specified target population of interest. FACE accounts for site-level heterogeneity in the distribution of covariates through density ratio weighting. To safely incorporate source sites and avoid negative transfer, we introduce an adaptive weighting procedure via a penalized regression, which achieves both consistency and optimal efficiency. Our strategy is communication-efficient and privacy-preserving, allowing participating sites to share summary statistics only once with other sites. We conduct both theoretical and numerical evaluations of FACE and apply it to conduct a comparative effectiveness study of BNT162b2 (Pfizer) and mRNA-1273 (Moderna) vaccines on COVID-19 outcomes in U.S. veterans using electronic health records from five VA regional sites. We show that compared to traditional methods, FACE meaningfully increases the precision of treatment effect estimates, with reductions in standard errors ranging from $26\%$ to $67\%$.
Submitted 5 October, 2023; v1 submitted 16 December, 2021;
originally announced December 2021.
-
Measuring Uncertainty in Translation Quality Evaluation (TQE)
Authors:
Serge Gladkoff,
Irina Sorokina,
Lifeng Han,
Alexandra Alekseeva
Abstract:
From both human translators (HT) and machine translation (MT) researchers' point of view, translation quality evaluation (TQE) is an essential task. Translation service providers (TSPs) have to deliver large volumes of translations which meet customer specifications with harsh constraints of required quality level in tight time-frames and costs. MT researchers strive to make their models better, which also requires reliable quality evaluation. While automatic machine translation evaluation (MTE) metrics and quality estimation (QE) tools are widely available and easy to access, existing automated tools are not good enough, and human assessments from professional translators (HAP) are often chosen as the gold standard \cite{han-etal-2021-TQA}. Human evaluations, however, are often accused of having low reliability and agreement. Is this caused by subjectivity, or is statistics at play? How can checking the entire text be avoided so that TQE becomes more efficient in cost and effort, and what is the optimal sample size of the translated text needed to reliably estimate the translation quality of the entire material? This work carries out research motivated by these questions to correctly estimate the confidence intervals \cite{Brown_etal2001Interval} depending on the sample size of the translated text, e.g. the number of words or sentences, that needs to be processed in the TQE workflow step for confident and reliable evaluation of overall translation quality. The methodology we applied for this work is from Bernoulli Statistical Distribution Modelling (BSDM) and Monte Carlo Sampling Analysis (MCSA).
Submitted 15 November, 2021;
originally announced November 2021.
-
Price Optimization with Practical Constraints
Authors:
Xiaojie Wang,
Hsin-Chan Huang,
Lanshan Han,
Alvin Lim
Abstract:
In this paper, we study a retailer price optimization problem which includes the practical constraints: maximum number of price changes and minimum amount of price change (if a change is recommended). We provide a closed-form formula for the Euclidean projection onto the feasible set defined by these two constraints, based on which a simple gradient projection algorithm is proposed to solve the price optimization problem. We study the convergence and solution quality of the proposed algorithm. We extend the base model to include upper/lower bounds on the individual product prices and solve it with some adjustments to the gradient projection algorithm. Numerical results are reported to demonstrate the performance of the proposed algorithm.
Submitted 19 April, 2021;
originally announced April 2021.
-
Monetizing Customer Load Data for an Energy Retailer: A Cooperative Game Approach
Authors:
Liyang Han,
Jalal Kazempour,
Pierre Pinson
Abstract:
When energy customers schedule loads ahead of time, this information, if acquired by their energy retailer, can improve the retailer's load forecasts. Better forecasts lead to wholesale purchase decisions that are likely to result in lower energy imbalance costs, and thus higher profits for the retailer. Therefore, this paper monetizes the value of the customer schedulable load data by quantifying the retailer's profit gain from adjusting the wholesale purchase based on such data. Using a cooperative game theoretic approach, the retailer translates their increased profit in expectation into the value of cooperation, and redistributes a portion of it among the customers as monetary incentives for them to continue providing their load data. Through case studies, this paper demonstrates the significance of the additional profit for the retailer from using the proposed framework, and evaluates the long-term monetary benefits to the customers based on different payoff allocation methods.
Submitted 10 August, 2021; v1 submitted 10 December, 2020;
originally announced December 2020.
-
A Note on the Sum of Non-Identically Distributed Doubly Truncated Normal Distributions
Authors:
Hao Chen,
Lanshan Han,
Alvin Lim
Abstract:
It is proved that the sum of n independent but non-identically distributed doubly truncated Normal distributions converges in distribution to a Normal distribution. It is also shown how the result can be applied in estimating a constrained mixed effects model.
Submitted 28 July, 2021; v1 submitted 18 August, 2020;
originally announced August 2020.
-
Spatial dependence and space-time trend in extreme events
Authors:
John H. J. Einmahl,
Ana Ferreira,
Laurens de Haan,
Claudia Neves,
Chen Zhou
Abstract:
The statistical theory of extremes is extended to observations that are non-stationary and not independent. The non-stationarity over time and space is controlled via the scedasis (tail scale) in the marginal distributions. Spatial dependence stems from multivariate extreme value theory. We establish asymptotic theory for both the weighted sequential tail empirical process and the weighted tail quantile process based on all observations, taken over time and space. The results yield two statistical tests for homoscedasticity in the tail, one in space and one in time. Further, we show that the common extreme value index can be estimated via a pseudo-maximum likelihood procedure based on pooling all (non-stationary and dependent) observations. Our leading example and application is rainfall in Northern Germany.
Submitted 9 March, 2020;
originally announced March 2020.
-
Knot Locating in Piecewise Linear Approximation
Authors:
Carlos Ugaz,
Lanshan Han,
Alvin Lim
Abstract:
Many separable nonlinear optimization problems can be approximated by replacing their nonlinear objective functions with piecewise linear functions. A natural question arising from this approach is how to break the interval of interest into subintervals (pieces) to achieve a good approximation. We present formulations to optimize the location of the knots. We apply a sequential quadratic programming method and a spectral projected gradient method to solve the problem. We report numerical experiments to show the effectiveness of the proposed approaches.
Submitted 6 September, 2019;
originally announced September 2019.
-
Quadratic Surface Support Vector Machine with L1 Norm Regularization
Authors:
Ahmad Mousavi,
Zheming Gao,
Lanshan Han,
Alvin Lim
Abstract:
We propose $\ell_1$ norm regularized quadratic surface support vector machine models for binary classification in supervised learning. We establish their desired theoretical properties, including the existence and uniqueness of the optimal solution, reduction to the standard SVMs over (almost) linearly separable data sets, and detection of true sparsity pattern over (almost) quadratically separable data sets if the penalty parameter of $\ell_1$ norm is large enough. We also demonstrate their promising practical efficiency by conducting various numerical experiments on both synthetic and publicly available benchmark data sets.
Submitted 30 January, 2021; v1 submitted 22 August, 2019;
originally announced August 2019.
-
The number of fuzzy subgroups of a finite abelian group of order $p^{n}q^{m}$
Authors:
Lingling Han,
Xiuyun Guo
Abstract:
The purpose of this paper is to determine the number of fuzzy subgroups of a finite abelian group of order $p^{n}q^{m}$. As an application of our main result, explicit formulas for the number of fuzzy subgroups of $\mathbb{Z}_{p}^{n}\times\mathbb{Z}_{q}^{m}$ and $\mathbb{Z}_{p^{n}}\times\mathbb{Z}_{q}^{m}$ are given.
Submitted 14 April, 2019;
originally announced April 2019.
-
Estimation of the Shapley Value of a Peer-to-Peer Energy Sharing Game using Coalitional Stratified Random Sampling
Authors:
Liyang Han,
Thomas Morstyn,
Malcolm McCulloch
Abstract:
Various peer-to-peer energy markets have emerged in recent years in an attempt to manage distributed energy resources in a more efficient way. One of the main challenges these models face is how to create and allocate incentives to participants. Cooperative game theory offers a methodology to financially reward prosumers based on their contributions made to the local energy coalition using the Shapley value, but its high computational complexity limits the size of the game. This paper explores a stratified sampling method proposed in existing literature for Shapley value estimation, and modifies the method for a peer-to-peer cooperative game to improve its scalability. Finally, selected case studies verify the effectiveness of the proposed coalitional stratified random sampling method and demonstrate results from large games.
Submitted 26 March, 2019;
originally announced March 2019.
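The Shapley value estimation described above can be illustrated with a minimal sketch of stratified sampling, assuming the strata are the sizes of the coalition a player joins. This is plain stratification by coalition size, not the paper's modified coalitional method; `shapley_stratified`, `value`, and `samples_per_stratum` are hypothetical names introduced for illustration.

```python
import random


def shapley_stratified(players, value, samples_per_stratum=50, seed=0):
    """Estimate Shapley values by sampling coalitions stratified by size.

    players: list of hashable player identifiers
    value:   characteristic function mapping a frozenset of players to a number
    """
    rng = random.Random(seed)
    n = len(players)
    estimates = {}
    for i in players:
        others = [p for p in players if p != i]
        total = 0.0
        # One stratum per size of the coalition that player i joins.
        for size in range(n):
            acc = 0.0
            for _ in range(samples_per_stratum):
                coalition = frozenset(rng.sample(others, size))
                # Marginal contribution of i to the sampled coalition.
                acc += value(coalition | {i}) - value(coalition)
            total += acc / samples_per_stratum
        # Each stratum carries equal weight 1/n in the Shapley value.
        estimates[i] = total / n
    return estimates


# Toy symmetric game: the value of a coalition is the square of its size.
toy_players = ["p1", "p2", "p3", "p4"]
print(shapley_stratified(toy_players, lambda s: len(s) ** 2))
```

The cost is on the order of `n * n * samples_per_stratum` evaluations of the characteristic function instead of the exponential number needed for the exact value, which is the kind of trade-off the abstract refers to.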
-
Improving the Scalability of a Prosumer Cooperative Game with K-Means Clustering
Authors:
Liyang Han,
Thomas Morstyn,
Constance Crozier,
Malcolm McCulloch
Abstract:
Among the various market structures under peer-to-peer energy sharing, one model based on cooperative game theory provides clear incentives for prosumers to collaboratively schedule their energy resources. The computational complexity of this model, however, increases exponentially with the number of participants. To address this issue, this paper proposes the application of K-means clustering to the energy profiles following the grand coalition optimization. The cooperative model is run with the "clustered players" to compute their payoff allocations, which are then further distributed among the prosumers within each cluster. Case studies show that the proposed method can significantly improve the scalability of the cooperative scheme while maintaining a high level of financial incentives for the prosumers.
Submitted 31 July, 2020; v1 submitted 26 March, 2019;
originally announced March 2019.
-
Uniform local well-posedness and inviscid limit for the Benjamin-Ono-Burgers equation
Authors:
Mingjuan Chen,
Boling Guo,
Lijia Han
Abstract:
In this paper, we study the Cauchy problem for the Benjamin-Ono-Burgers equation $\partial_t u-ε\partial_x^2 u+\mathcal{H}\partial_x^2u+u u_x=0$, where $\mathcal{H}$ denotes the Hilbert transform. We obtain that it is uniformly locally well-posed for small data in the refined Sobolev space $\widetilde{H}^σ(\mathbb{R})$($σ\geq 0$), whose low-frequency part is scaling critical and high-frequency part is equal to Sobolev space $H^σ$($σ\geq 0$). Furthermore, we also obtain its inviscid limit behavior in $\widetilde{H}^σ(\mathbb{R})$($σ\geq 0$).
Submitted 8 March, 2019;
originally announced March 2019.
-
A continuation method for tensor complementarity problems
Authors:
Lixing Han
Abstract:
We introduce a Kojima-Megiddo-Mizuno type continuation method for solving tensor complementarity problems. We show that there exists a bounded continuation trajectory when the tensor is strictly semi-positive and any limit point tracing the trajectory gives a solution of the tensor complementarity problem. Moreover, when the tensor is strong strictly semi-positive, tracing the trajectory will converge to the unique solution. Some numerical results are given to illustrate the effectiveness of the method.
Submitted 4 March, 2018;
originally announced March 2018.
-
Extreme Value Estimation for Discretely Sampled Continuous Processes
Authors:
Holger Drees,
Laurens de Haan,
Feridun Turkman
Abstract:
In environmental applications of extreme value statistics, the underlying stochastic process is often modeled either as a max-stable process in continuous time/space or as a process in the domain of attraction of such a max-stable process. In practice, however, the processes are typically only observed at discrete points and one has to resort to interpolation to fill in the gaps. We discuss the influence of such an interpolation on estimators of marginal parameters as well as estimators of the exponent measure. In particular, natural conditions on the fineness of the observational scheme are developed which ensure that asymptotically the interpolated estimators behave in the same way as the estimators which use fully observed continuous processes.
Submitted 9 February, 2018; v1 submitted 20 September, 2017;
originally announced September 2017.
-
A homotopy method for solving multilinear systems with M-tensors
Authors:
Lixing Han
Abstract:
Multilinear systems of equations arise in various applications, such as numerical partial differential equations, data mining, and tensor complementarity problems. In this paper, we propose a homotopy method for finding the unique positive solution to a multilinear system with a nonsingular M-tensor and a positive right side vector. We analyze the method and prove its convergence to the desired solution. We report some numerical results based on an implementation of the proposed method using a prediction-correction approach for path following.
Submitted 25 January, 2017;
originally announced January 2017.
-
A homotopy method for computing the largest eigenvalue of an irreducible nonnegative tensor
Authors:
Liping Chen,
Lixing Han,
Hongxia Yin,
Liangmin Zhou
Abstract:
In this paper we propose a homotopy method to compute the largest eigenvalue and a corresponding eigenvector of a nonnegative tensor. We prove that it converges to the desired eigenpair when the tensor is irreducible. We also implement the method using a prediction-correction approach for path following. Some numerical results are provided to illustrate the efficiency of the method.
Submitted 25 January, 2017;
originally announced January 2017.
-
Bias correction in multivariate extremes
Authors:
Anne-Laure Fougères,
Laurens de Haan,
Cécile Mercadier
Abstract:
The estimation of the extremal dependence structure is spoiled by the impact of the bias, which increases with the number of observations used for the estimation. Already known in the univariate setting, the bias correction procedure is studied in this paper under the multivariate framework. New families of estimators of the stable tail dependence function are obtained. They are asymptotically unbiased versions of the empirical estimator introduced by Huang [Statistics of bivariate extremes (1992) Erasmus Univ.]. Since the new estimators have a regular behavior with respect to the number of observations, it is possible to deduce aggregated versions so that the choice of the threshold is substantially simplified. An extensive simulation study is provided as well as an application on real data.
Submitted 2 April, 2015;
originally announced April 2015.
-
Absence of shocks for 1D EULER-POISSON system
Authors:
Yan Guo,
Lijia Han,
Jingjun Zhang
Abstract:
It is shown that smooth solutions with small amplitude to the 1D Euler-Poisson system for electrons persist forever with no shock formation.
Submitted 2 February, 2015; v1 submitted 2 February, 2015;
originally announced February 2015.
-
Computing tensor eigenvalues via homotopy methods
Authors:
Liping Chen,
Lixing Han,
Liangmin Zhou
Abstract:
We introduce the concept of mode-k generalized eigenvalues and eigenvectors of a tensor and prove some properties of such eigenpairs. In particular, we derive an upper bound for the number of equivalence classes of generalized tensor eigenpairs using mixed volume. Based on this bound and the structures of tensor eigenvalue problems, we propose two homotopy continuation type algorithms to solve tensor eigenproblems. With proper implementation, these methods can find all equivalence classes of isolated generalized eigenpairs and some generalized eigenpairs contained in the positive dimensional components (if there are any). We also introduce an algorithm that combines a heuristic approach and a Newton homotopy method to extract real generalized eigenpairs from the found complex generalized eigenpairs. A MATLAB software package TenEig has been developed to implement these methods. Numerical results are presented to illustrate the effectiveness and efficiency of TenEig for computing complex or real generalized eigenpairs.
Submitted 13 January, 2016; v1 submitted 17 January, 2015;
originally announced January 2015.
-
Bayesian Model Averaging with Exponentiated Least Square Loss
Authors:
Dong Dai,
Lei Han,
Ting Yang,
Tong Zhang
Abstract:
The model averaging problem is to average multiple models to achieve a prediction accuracy not much worse than that of the best single model in terms of mean squared error. It is known that if the models are misspecified, model averaging is superior to model selection. Specifically, let $n$ be the sample size, then the worst case regret of the former decays at a rate of $O(1/n)$ while the worst case regret of the latter decays at a rate of $O(1/\sqrt{n})$. The recently proposed $Q$-aggregation algorithm \citep{DaiRigZhang12} solves the model averaging problem with the optimal regret of $O(1/n)$ both in expectation and in deviation; however it suffers from two limitations: (1) for continuous dictionary, the proposed greedy algorithm for solving $Q$-aggregation is not applicable; (2) the formulation of $Q$-aggregation appears ad hoc without clear intuition. This paper examines a different approach to model averaging by considering a Bayes estimator for deviation optimal model averaging by using exponentiated least squares loss. We establish a primal-dual relationship of this estimator and that of $Q$-aggregation and propose new greedy procedures that satisfactorily resolve the above mentioned limitations of $Q$-aggregation.
Submitted 27 February, 2018; v1 submitted 6 August, 2014;
originally announced August 2014.
-
On the block maxima method in extreme value theory: PWM estimators
Authors:
Ana Ferreira,
Laurens de Haan
Abstract:
In extreme value theory, there are two fundamental approaches, both widely used: the block maxima (BM) method and the peaks-over-threshold (POT) method. Whereas much theoretical research has gone into the POT method, the BM method has not been studied thoroughly. The present paper aims at providing conditions under which the BM method can be justified. We also provide a theoretical comparative study of the methods, which is in general consistent with the vast literature on comparing the methods all based on simulated data and fully parametric models. The results indicate that the BM method is a rather efficient method under usual practical conditions. In this paper, we restrict attention to the i.i.d. case and focus on the probability weighted moment (PWM) estimators of Hosking, Wallis and Wood [Technometrics (1985) 27 251-261].
Submitted 30 December, 2014; v1 submitted 11 October, 2013;
originally announced October 2013.
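The two approaches contrasted in the abstract above differ first in which observations they keep. A minimal sketch of the two data reductions follows, assuming a plain list of observations; the helper names `block_maxima` and `peaks_over_threshold` are illustrative, and the PWM estimation step itself is not shown.

```python
def block_maxima(series, block_size):
    """Maximum of each complete block of `block_size` consecutive observations."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    n_blocks = len(series) // block_size  # an incomplete final block is dropped
    return [
        max(series[k * block_size:(k + 1) * block_size])
        for k in range(n_blocks)
    ]


def peaks_over_threshold(series, threshold):
    """Excesses over `threshold` for the observations that exceed it."""
    return [x - threshold for x in series if x > threshold]


data = [1, 5, 2, 8, 3, 7, 4, 9, 6]
print(block_maxima(data, 3))          # one maximum per block of three
print(peaks_over_threshold(data, 6))  # excesses of the values above 6
```

In the BM method an extreme value distribution is then fitted to the block maxima, while in the POT method a generalized Pareto distribution is fitted to the excesses.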
-
Estimation of extreme risk regions under multivariate regular variation
Authors:
Juan-Juan Cai,
John H. J. Einmahl,
Laurens de Haan
Abstract:
When considering d possibly dependent random variables, one is often interested in extreme risk regions, with very small probability p. We consider risk regions of the form $\{\mathbf{z}\in\mathbb{R}^d:f(\mathbf{z})\leq β\}$, where f is the joint density and $β$ a small number. Estimation of such an extreme risk region is difficult since it contains hardly any or no data. Using extreme value theory, we construct a natural estimator of an extreme risk region and prove a refined form of consistency, given a random sample of multivariate regularly varying random vectors. In a detailed simulation and comparison study, the good performance of the procedure is demonstrated. We also apply our estimator to financial data.
Submitted 22 November, 2012;
originally announced November 2012.
-
An unconstrained optimization approach for finding real eigenvalues of even order symmetric tensors
Authors:
Lixing Han
Abstract:
Let $n$ be a positive integer and $m$ be a positive even integer. Let ${\mathcal A}$ be an $m^{th}$ order $n$-dimensional real weakly symmetric tensor and ${\mathcal B}$ be a real weakly symmetric positive definite tensor of the same size. $λ\in R$ is called a ${\mathcal B}_r$-eigenvalue of ${\mathcal A}$ if ${\mathcal A} x^{m-1} = λ{\mathcal B} x^{m-1}$ for some $x \in R^n \backslash \{0\}$. In this paper, we introduce two unconstrained optimization problems and obtain some variational characterizations for the minimum and maximum ${\mathcal B}_r$-eigenvalues of ${\mathcal A}$. Our results extend Auchmuty's unconstrained variational principles for eigenvalues of real symmetric matrices. This unconstrained optimization approach can be used to find a Z-, H-, or D-eigenvalue of an even order weakly symmetric tensor. We provide some numerical results to illustrate the effectiveness of this approach for finding a Z-eigenvalue and for determining the positive semidefiniteness of an even order symmetric tensor.
Submitted 13 January, 2016; v1 submitted 22 March, 2012;
originally announced March 2012.
-
The generalized Pareto process; with a view towards application and simulation
Authors:
Ana Ferreira,
Laurens de Haan
Abstract:
In extreme value statistics, the peaks-over-threshold method is widely used. The method is based on the generalized Pareto distribution characterizing probabilities of exceedances over high thresholds in $\mathbb {R}^d$. We present a generalization of this concept in the space of continuous functions. We call this the generalized Pareto process. Differently from earlier papers, our definition is not based on a distribution function but on functional properties, and does not need a reference to a related max-stable process. As an application, we use the theory to simulate wind fields connected to disastrous storms on the basis of observed extreme but not disastrous storms. We also establish the peaks-over-threshold approach in function space.
Submitted 16 October, 2014; v1 submitted 12 March, 2012;
originally announced March 2012.
-
Estimating failure probabilities
Authors:
Holger Drees,
Laurens de Haan
Abstract:
In risk management, often the probability must be estimated that a random vector falls into an extreme failure set. In the framework of bivariate extreme value theory, we construct an estimator for such failure probabilities and analyze its asymptotic properties under natural conditions. It turns out that the estimation error is mainly determined by the accuracy of the statistical analysis of the marginal distributions if the extreme value approximation to the dependence structure is at least as accurate as the generalized Pareto approximation to the marginal distributions. Moreover, we establish confidence intervals and briefly discuss generalizations to higher dimensions and issues arising in practical applications as well.
Submitted 3 June, 2015; v1 submitted 4 July, 2011;
originally announced July 2011.
-
On tail trend detection: modeling relative risk
Authors:
Laurens de Haan,
Albert Klein Tank,
Cláudia Neves
Abstract:
The climate change dispute is about changes over time of environmental characteristics (such as rainfall). Some people say that a possible change is not so much in the mean but rather in the extreme phenomena (that is, the average rainfall may not change much but heavy storms may become more or less frequent). The paper studies changes over time in the probability that some high threshold is exceeded. The model is such that the threshold does not need to be specified; the results hold for any high threshold. For simplicity, a certain linear trend is studied, depending on one real parameter. Estimation and testing procedures (is there a trend?) are developed. Simulation results are presented. The method is applied to trends in heavy rainfall at 18 gauging stations across Germany and The Netherlands. A tentative conclusion is that the trend seems to depend on whether or not a station is close to the sea.
Submitted 3 April, 2013; v1 submitted 21 June, 2011;
originally announced June 2011.
-
Inviscid limit for the derivative Ginzburg-Landau equation with small data in higher spatial dimensions
Authors:
Lijia Han,
Baoxiang Wang,
Boling Guo
Abstract:
We study the inviscid limit for the Cauchy problem of the derivative Ginzburg-Landau equation in higher spatial dimensions n>2. We show that it is globally well-posed and that its solution converges to that of the derivative Schrödinger equation.
Submitted 7 April, 2010;
originally announced April 2010.
-
Perron-Frobenius theorem for nonnegative multilinear forms and extensions
Authors:
S. Friedland,
S. Gaubert,
L. Han
Abstract:
We prove an analog of the Perron-Frobenius theorem for multilinear forms with nonnegative coefficients and, more generally, for polynomial maps with nonnegative coefficients. We determine the geometric convergence rate of the power algorithm to the unique normalized eigenvector.
Submitted 20 November, 2010; v1 submitted 11 May, 2009;
originally announced May 2009.
-
Stationary max-stable fields associated to negative definite functions
Authors:
Zakhar Kabluchko,
Martin Schlather,
Laurens de Haan
Abstract:
Let $W_i,i\in{\mathbb{N}}$, be independent copies of a zero-mean Gaussian process $\{W(t),t\in{\mathbb{R}}^d\}$ with stationary increments and variance $\sigma^2(t)$. Independently of $W_i$, let $\sum_{i=1}^{\infty}\delta_{U_i}$ be a Poisson point process on the real line with intensity $e^{-y} dy$. We show that the law of the random family of functions $\{V_i(\cdot),i\in{\mathbb{N}}\}$, where $V_i(t)=U_i+W_i(t)-\sigma^2(t)/2$, is translation invariant. In particular, the process $\eta(t)=\bigvee_{i=1}^{\infty}V_i(t)$ is a stationary max-stable process with standard Gumbel margins. The process $\eta$ arises as a limit of a suitably normalized and rescaled pointwise maximum of $n$ i.i.d. stationary Gaussian processes as $n\to\infty$ if and only if $W$ is a (nonisotropic) fractional Brownian motion on ${\mathbb{R}}^d$. Under suitable conditions on $W$, the process $\eta$ has a mixed moving maxima representation.
Submitted 25 September, 2009; v1 submitted 17 June, 2008;
originally announced June 2008.
-
Weighted approximations of tail copula processes with application to testing the bivariate extreme value condition
Authors:
John H. J. Einmahl,
Laurens de Haan,
Deyuan Li
Abstract:
Consider $n$ i.i.d. random vectors on $\mathbb{R}^2$, with unknown, common distribution function $F$. Under a sharpening of the extreme value condition on $F$, we derive a weighted approximation of the corresponding tail copula process. Then we construct a test to check whether the extreme value condition holds by comparing two estimators of the limiting extreme value distribution, one obtained from the tail copula process and the other obtained by first estimating the spectral measure which is then used as a building block for the limiting extreme value distribution. We derive the limiting distribution of the test statistic from the aforementioned weighted approximation. This limiting distribution contains unknown functional parameters. Therefore, we show that a version with estimated parameters converges weakly to the true limiting distribution. Based on this result, the finite sample properties of our testing procedure are investigated through a simulation study. A real data application is also presented.
Submitted 13 November, 2006;
originally announced November 2006.
-
Spatial extremes: Models for the stationary case
Authors:
Laurens de Haan,
Teresa T. Pereira
Abstract:
The aim of this paper is to provide models for spatial extremes in the case of stationarity. The spatial dependence at extreme levels of a stationary process is modeled using an extension of the theory of max-stable processes of de Haan and Pickands [Probab. Theory Related Fields 72 (1986) 477--492]. We propose three one-dimensional and three two-dimensional models. These models depend on just one parameter or a few parameters that measure the strength of tail dependence as a function of the distance between locations. We also propose two estimators for this parameter and prove consistency under domain of attraction conditions and asymptotic normality under appropriate extra conditions.
Submitted 16 May, 2006;
originally announced May 2006.
-
On maximum likelihood estimation of the extreme value index
Authors:
Holger Drees,
Ana Ferreira,
Laurens de Haan
Abstract:
We prove asymptotic normality of the so-called maximum likelihood estimator of the extreme value index.
Submitted 5 July, 2004;
originally announced July 2004.