-
Aspects of a randomly growing cluster in $\reals^d,d\geq 2$
Authors:
Alan Frieze,
Ravi Kannan,
Wesley Pegden
Abstract:
We consider a simple model of a growing cluster of points in $\Re^d,d\geq 2$. Beginning with a point $X_1$ located at the origin, we generate a random sequence of points $X_1,X_2,\ldots,X_i,\ldots,$. To generate $X_{i},i\geq 2$ we choose a uniform integer $j$ in $[i-1]=\{1,2,\ldots,i-1\}$ and then let $X_{i}=X_j+D_i$ where $D_i=(δ_1,\ldots,δ_d)$. Here the $δ_j$ are independent copies of the Normal distribution $N(0,σ_i)$, where $σ_i=i^{-α}$ for some $α>0$. We prove that for any $α>0$ the resulting point set is bounded a.s., and moreover, that the points generated look like samples from a $β$-dimensional subset of $\Re^d$ from the standpoint of the minimum lengths of combinatorial structures on the point-sets, where $β=\min(d,1/α)$.
Submitted 6 January, 2025;
originally announced January 2025.
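The generation rule in the abstract above can be sketched in a few lines of Python. This is only an illustrative simulation, not the authors' code: the function name `grow_cluster` and its parameters are hypothetical, and it assumes that $σ_i$ in $N(0,σ_i)$ is the standard deviation (the abstract does not say whether it denotes the standard deviation or the variance).

```python
import random


def grow_cluster(n, d=2, alpha=0.5, seed=None):
    """Return n points X_1, ..., X_n of the growing-cluster model in R^d."""
    rng = random.Random(seed)
    points = [tuple(0.0 for _ in range(d))]  # X_1 sits at the origin
    for i in range(2, n + 1):
        # Uniform j in [i-1]; the list is 0-based, so index j-1 in 0..i-2.
        parent = points[rng.randrange(i - 1)]
        # Assumption: sigma_i = i^(-alpha) is used as the standard deviation.
        sigma = i ** (-alpha)
        points.append(tuple(x + rng.gauss(0.0, sigma) for x in parent))
    return points


if __name__ == "__main__":
    pts = grow_cluster(10_000, d=2, alpha=0.5, seed=0)
    radius = max(sum(x * x for x in p) ** 0.5 for p in pts)
    print(f"largest distance from the origin: {radius:.3f}")
```

Running it for increasing `n` and watching the largest distance from the origin gives an informal feel for the boundedness statement, though a finite simulation is of course no substitute for the proof.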
-
MBExplainer: Multilevel bandit-based explanations for downstream models with augmented graph embeddings
Authors:
Ashkan Golgoon,
Ryan Franks,
Khashayar Filom,
Arjun Ravi Kannan
Abstract:
In many industrial applications, it is common that the graph embeddings generated from training GNNs are used in an ensemble model where the embeddings are combined with other tabular features (e.g., original node or edge features) in a downstream ML task. The tabular features may even arise naturally if, e.g., one tries to build a graph such that some of the node or edge features are stored in a tabular format. Here we address the problem of explaining the output of such ensemble models for which the input features consist of learned neural graph embeddings combined with additional tabular features. We propose MBExplainer, a model-agnostic explanation approach for downstream models with augmented graph embeddings. MBExplainer returns a human-legible triple as an explanation for an instance prediction of the whole pipeline consisting of three components: a subgraph with the highest importance, the topmost important nodal features, and the topmost important augmented downstream features. A game-theoretic formulation is used to take the contributions of each component and their interactions into account by assigning three Shapley values corresponding to their own specific games. Finding the explanation requires an efficient search through the local search spaces corresponding to each component. MBExplainer applies a novel multilevel search algorithm that enables simultaneous pruning of local search spaces in a computationally tractable way. In particular, three interweaved Monte Carlo Tree Searches are used to iteratively prune the local search spaces. MBExplainer also includes a global search algorithm that uses contextual bandits to efficiently allocate pruning budget among the local search spaces. We show the effectiveness of MBExplainer by presenting a set of comprehensive numerical examples on multiple public graph datasets for both node and graph classification tasks.
Submitted 31 October, 2024;
originally announced November 2024.
-
Mechanistic interpretability of large language models with applications to the financial services industry
Authors:
Ashkan Golgoon,
Khashayar Filom,
Arjun Ravi Kannan
Abstract:
Large Language Models such as GPTs (Generative Pre-trained Transformers) exhibit remarkable capabilities across a broad spectrum of applications. Nevertheless, due to their intrinsic complexity, these models present substantial challenges in interpreting their internal decision-making processes. This lack of transparency poses critical challenges when it comes to their adoption by financial institutions, where concerns and accountability regarding bias, fairness, and reliability are of paramount importance. Mechanistic interpretability aims at reverse engineering complex AI models such as transformers. In this paper, we are pioneering the use of mechanistic interpretability to shed some light on the inner workings of large language models for use in financial services applications. We offer several examples of how algorithmic tasks can be designed for compliance monitoring purposes. In particular, we investigate GPT-2 Small's attention pattern when prompted to identify potential violations of Fair Lending laws. Using direct logit attribution, we study the contributions of each layer and its corresponding attention heads to the logit difference in the residual stream. Finally, we design clean and corrupted prompts and use activation patching as a causal intervention method to localize our task completion components further. We observe that the (positive) heads $10.2$ (head $2$, layer $10$), $10.7$, and $11.3$, as well as the (negative) heads $9.6$ and $10.6$ play a significant role in the task completion.
Submitted 15 October, 2024; v1 submitted 15 July, 2024;
originally announced July 2024.
-
Constructing cospectral graphs by unfolding non-bipartite graphs
Authors:
M. Rajesh Kannan,
Shivaramakrishna Pragada,
Hitesh Wankhede
Abstract:
In 2010, Butler introduced the unfolding operation on a bipartite graph to produce two bipartite graphs, which are cospectral for the adjacency and the normalized Laplacian matrices. In this article, we describe how the idea of unfolding a bipartite graph with respect to another bipartite graph can be extended to nonbipartite graphs. In particular, we describe how unfoldings involving reflexive bipartite, semi-reflexive bipartite, and multipartite graphs are used to obtain cospectral nonisomorphic graphs for the adjacency matrix.
Submitted 30 April, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
Bounds and extremal graphs for the energy of complex unit gain graphs
Authors:
Aniruddha Samanta,
M. Rajesh Kannan
Abstract:
A complex unit gain graph ($ \mathbb{T} $-gain graph), $ Φ=(G, \varphi) $ is a graph where the gain function $ \varphi $ assigns a unit complex number to each orientation of an edge of $ G $ and its inverse is assigned to the opposite orientation. The associated adjacency matrix $ A(Φ) $ is defined canonically. The energy $ \mathcal{E}(Φ) $ of a $ \mathbb{T} $-gain graph $ Φ$ is the sum of the absolute values of all eigenvalues of $ A(Φ) $. For any connected triangle-free $ \mathbb{T} $-gain graph $ Φ$ with the minimum vertex degree $ δ$, we establish a lower bound $ \mathcal{E}(Φ)\geq 2δ$ and characterize the equality. Then, we present a relationship between the characteristic and the matching polynomial of $ Φ$. Using this, we obtain an upper bound for the energy $ \mathcal{E}(Φ)\leq 2μ\sqrt{2Δ_e+1} $ and characterize the classes of graphs for which the bound is sharp, where $ μ$ and $ Δ_e$ are the matching number and the maximum edge degree of $ Φ$, respectively. Further, for any unicyclic graph $ G $, we study the gains for which the gain energy $ \mathcal{E}(Φ) $ attains the maximum/minimum among all $ \mathbb{T} $-gain graphs defined on $G$.
Submitted 28 December, 2023;
originally announced December 2023.
-
A clustering tool for interrogating finite element models based on eigenvectors of graph adjacency
Authors:
Ramaseshan Kannan
Abstract:
This note introduces an unsupervised learning algorithm to debug errors in finite element (FE) simulation models and details how it was productionised. The algorithm clusters degrees of freedom in the FE model using numerical properties of the adjacency of its stiffness matrix. The algorithm has been deployed as the `Model Stability Analysis' tool within the commercial structural FE suite Oasys GSA (www.oasys-software.com/gsa). It has been used successfully by end-users for debugging real-world FE models, and we present examples of the tool in action.
Submitted 24 October, 2023;
originally announced October 2023.
-
A note on the distance and distance signless Laplacian spectral radius of complements of trees
Authors:
Iswar Mahato,
M. Rajesh Kannan
Abstract:
In this article, we show that the generalized tree shift operation increases the distance spectral radius, distance signless Laplacian spectral radius, and the $D_α$-spectral radius of complements of trees. As a consequence of this result, we correct an ambiguity in the proofs of some of the known results.
Submitted 8 June, 2023;
originally announced June 2023.
-
Approximation of group explainers with coalition structure using Monte Carlo sampling on the product space of coalitions and features
Authors:
Konstandinos Kotsiopoulos,
Alexey Miroshnikov,
Khashayar Filom,
Arjun Ravi Kannan
Abstract:
In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor vector. By viewing these explainers as expectations over appropriate sample spaces, we design a novel Monte Carlo sampling algorithm that estimates them at a reduced complexity that depends linearly on the size of the background dataset. We set up a rigorous framework for the statistical analysis and obtain error bounds for our sampling methods. The advantage of this approach is that it is fast, easily implementable, and model-agnostic. Furthermore, it has similar statistical accuracy as other known estimation techniques that are more complex and model-specific. We provide rigorous proofs of statistical convergence, as well as numerical experiments whose results agree with our theoretical findings.
Submitted 18 April, 2024; v1 submitted 17 March, 2023;
originally announced March 2023.
-
Bounding-Focused Discretization Methods for the Global Optimization of Nonconvex Semi-Infinite Programs
Authors:
Evren M. Turan,
Johannes Jäschke,
Rohit Kannan
Abstract:
We use sensitivity analysis to design bounding-focused discretization (cutting-surface) methods for the global optimization of nonconvex semi-infinite programs (SIPs). We begin by formulating the optimal bounding-focused discretization of SIPs as a max-min problem and propose variants that are more computationally tractable. We then use parametric sensitivity theory to design an effective heuristic approach for solving these max-min problems. We also show how our new iterative discretization methods may be modified to ensure that the solutions of their discretizations converge to an optimal solution of the SIP. We then formulate optimal bounding-focused generalized discretization of SIPs as max-min problems and design heuristic algorithms for their solution. Numerical experiments on standard nonconvex SIP test instances from the literature demonstrate that our new bounding-focused discretization methods can significantly reduce the number of iterations for convergence relative to a state-of-the-art feasibility-focused discretization method.
Submitted 22 June, 2025; v1 submitted 28 February, 2023;
originally announced March 2023.
-
Extremal problems for the eccentricity matrices of complements of trees
Authors:
Iswar Mahato,
M. Rajesh Kannan
Abstract:
The eccentricity matrix of a connected graph $G$, denoted by $\mathcal{E}(G)$, is obtained from the distance matrix of $G$ by keeping the largest nonzero entries in each row and each column, and leaving zeros in the remaining ones. The $\mathcal{E}$-eigenvalues of $G$ are the eigenvalues of $\mathcal{E}(G)$, in which the largest one is the $\mathcal{E}$-spectral radius of $G$. The $\mathcal{E}$-energy of $G$ is the sum of the absolute values of all $\mathcal{E}$-eigenvalues of $G$. In this article, we study some of the extremal problems for eccentricity matrices of complements of trees and characterize the extremal graphs. First, we determine the unique tree whose complement has minimum (respectively, maximum) $\mathcal{E}$-spectral radius among the complements of trees. Then, we prove that the $\mathcal{E}$-eigenvalues of the complement of a tree are symmetric about the origin. As a consequence of these results, we characterize the trees whose complement has minimum (respectively, maximum) least $\mathcal{E}$-eigenvalue among the complements of trees. Finally, we discuss the extremal problems for the second largest $\mathcal{E}$-eigenvalue and the $\mathcal{E}$-energy of complements of trees and characterize the extremal graphs. As an application, we obtain Nordhaus-Gaddum type lower bounds for the second largest $\mathcal{E}$-eigenvalue and $\mathcal{E}$-energy of a tree and its complement.
Submitted 4 January, 2023;
originally announced January 2023.
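The eccentricity-matrix construction described in the abstract above can be illustrated with a short Python sketch. It reads "keeping the largest entries in each row and each column" as the usual rule that the entry $d(u,v)$ is kept exactly when $d(u,v)=\min\{e(u),e(v)\}$, where $e(\cdot)$ is the eccentricity; that reading, the function names, and the adjacency-list input format are assumptions made for illustration, and the graph is assumed to be connected and unweighted.

```python
from collections import deque


def distance_matrix(adj):
    """All-pairs shortest-path distances of a connected unweighted graph.

    adj is an adjacency list: adj[u] is the list of neighbours of vertex u.
    """
    n = len(adj)
    dist = [[0] * n for _ in range(n)]
    for source in range(n):
        seen = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search from source
            u = queue.popleft()
            for v in adj[u]:
                if v not in seen:
                    seen[v] = seen[u] + 1
                    queue.append(v)
        for v, d in seen.items():
            dist[source][v] = d
    return dist


def eccentricity_matrix(adj):
    """Keep d(u, v) when it equals min(e(u), e(v)); put 0 elsewhere."""
    dist = distance_matrix(adj)
    ecc = [max(row) for row in dist]  # eccentricity of each vertex
    n = len(adj)
    return [
        [
            dist[u][v] if u != v and dist[u][v] == min(ecc[u], ecc[v]) else 0
            for v in range(n)
        ]
        for u in range(n)
    ]


if __name__ == "__main__":
    path_p4 = [[1], [0, 2], [1, 3], [2]]  # the path 0 - 1 - 2 - 3
    for row in eccentricity_matrix(path_p4):
        print(row)
```

For the path on four vertices, only the distances 2 and 3 survive, which shows how the eccentricity matrix can be much sparser than the distance matrix it is derived from.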
-
Strong Partitioning and a Machine Learning Approximation for Accelerating the Global Optimization of Nonconvex QCQPs
Authors:
Rohit Kannan,
Harsha Nagarajan,
Deepjyoti Deka
Abstract:
We learn optimal instance-specific heuristics for the global minimization of nonconvex quadratically-constrained quadratic programs (QCQPs). Specifically, we consider partitioning-based convex mixed-integer programming relaxations for nonconvex QCQPs and propose the novel problem of strong partitioning to optimally partition variable domains without sacrificing global optimality. Since solving this max-min strong partitioning problem exactly can be very challenging, we design a local optimization method that leverages generalized gradients of the value function of its inner-minimization problem. However, even solving the strong partitioning problem to local optimality can be time-consuming. To address this, we propose a simple and practical machine learning (ML) approximation for homogeneous families of QCQPs. We conduct a detailed computational study on randomly generated QCQP families, including instances of the pooling problem, using the open-source global solver Alpine. Numerical experiments demonstrate that our ML approximation of strong partitioning reduces Alpine's solution time by a factor of 2 to 4.5 on average, with a maximum reduction factor of 10 to 200 across the different QCQP families.
Submitted 20 September, 2024; v1 submitted 31 December, 2022;
originally announced January 2023.
-
Minimizers for the energy of eccentricity matrices of trees
Authors:
Iswar Mahato,
M. Rajesh Kannan
Abstract:
The eccentricity matrix of a connected graph $G$, denoted by $\mathcal{E}(G)$, is obtained from the distance matrix of $G$ by keeping the largest nonzero entries in each row and each column and leaving zeros in the remaining ones. The eigenvalues of $\mathcal{E}(G)$ are the $\mathcal{E}$-eigenvalues of $G$. The eccentricity energy (or the $\mathcal{E}$-energy) of $G$ is the sum of the absolute values of all $\mathcal{E}$-eigenvalues of $G$. In this article, we determine the unique tree with the minimum second largest $\mathcal{E}$-eigenvalue among all trees on $n$ vertices other than the star. Also, we characterize the trees with minimum $\mathcal{E}$-energy among all trees on $n$ vertices.
Submitted 29 August, 2022;
originally announced August 2022.
-
Data-Driven Sample Average Approximation with Covariate Information
Authors:
Rohit Kannan,
Güzin Bayraksan,
James R. Luedtke
Abstract:
We study optimization for data-driven decision-making when we have observations of the uncertain parameters within the optimization model together with concurrent observations of covariates. Given a new covariate observation, the goal is to choose a decision that minimizes the expected cost conditioned on this observation. We investigate three data-driven frameworks that integrate a machine learning prediction model within a stochastic programming sample average approximation (SAA) for approximating the solution to this problem. Two of the SAA frameworks are new and use out-of-sample residuals of leave-one-out prediction models for scenario generation. The frameworks we investigate are flexible and accommodate parametric, nonparametric, and semiparametric regression techniques. We derive conditions on the data generation process, the prediction model, and the stochastic program under which solutions of these data-driven SAAs are consistent and asymptotically optimal, and also derive convergence rates and finite sample guarantees. Computational experiments validate our theoretical results, demonstrate the potential advantages of our data-driven formulations over existing approaches (even when the prediction model is misspecified), and illustrate the benefits of our new data-driven formulations in the limited data regime.
Submitted 27 July, 2022;
originally announced July 2022.
-
Squared distance matrices of trees with matrix weights
Authors:
Iswar Mahato,
M. Rajesh Kannan
Abstract:
Let $T$ be a tree on $n$ vertices whose edge weights are positive definite matrices of order $s$. The squared distance matrix of $T$, denoted by $Δ$, is the $ns \times ns$ block matrix with $Δ_{ij}=d(i,j)^2$, where $d(i,j)$ is the sum of the weights of the edges in the unique $(i,j)$-path. In this article, we obtain a formula for the determinant of $Δ$ and find $Δ^{-1}$ under some conditions.
Submitted 3 May, 2022;
originally announced May 2022.
-
Signed spectral Turán type theorems
Authors:
M. Rajesh Kannan,
Shivaramakrishna Pragada
Abstract:
A signed graph $Σ= (G, σ)$ is a graph where the function $σ$ assigns either $1$ or $-1$ to each edge of the simple graph $G$. The adjacency matrix of $Σ$, denoted by $A(Σ)$, is defined canonically. In a recent paper, Wang et al. extended the eigenvalue bounds of Hoffman and Cvetković to signed graphs. They proposed an open problem related to the balanced clique number and the largest eigenvalue of a signed graph. We solve a strengthened version of this open problem. As a byproduct, we give alternate proofs for some of the known classical bounds for the least eigenvalues of unsigned graphs. We extend Turán's inequality to signed graphs. Besides, we study the Bollobás and Nikiforov conjecture for signed graphs and show that the conjecture need not be true for signed graphs. Nevertheless, the conjecture holds for signed graphs under some assumptions. Finally, we study some of the relationships between the number of signed walks and the largest eigenvalue of a signed graph.
Submitted 5 January, 2023; v1 submitted 21 April, 2022;
originally announced April 2022.
-
On the eccentricity matrices of trees: Inertia and spectral symmetry
Authors:
Iswar Mahato,
M. Rajesh Kannan
Abstract:
The \textit{eccentricity matrix} $\mathcal{E}(G)$ of a connected graph $G$ is obtained from the distance matrix of $G$ by keeping the largest non-zero entries in each row and each column, and leaving zeros in the remaining ones. The eigenvalues of $\mathcal{E}(G)$ are the \textit{$\mathcal{E}$-eigenvalues} of $G$. In this article, we find the inertia of the eccentricity matrices of trees. Interestingly, any tree on more than $4$ vertices with odd diameter has two positive and two negative $\mathcal{E}$-eigenvalues (irrespective of the structure of the tree). A tree with even diameter has the same number of positive and negative $\mathcal{E}$-eigenvalues, which is equal to the number of 'diametrically distinguished' vertices (see Definition 3.1). Besides we prove that the spectrum of the eccentricity matrix of a tree is symmetric with respect to the origin if and only if the tree has odd diameter. As an application, we characterize the trees with three distinct $\mathcal{E}$-eigenvalues.
Submitted 30 March, 2022; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Model-agnostic bias mitigation methods with regressor distribution control for Wasserstein-based fairness metrics
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Ryan Franks,
Arjun Ravi Kannan
Abstract:
This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to the bias, we reduce the dimensionality of the problem by mitigating the bias originating from those predictors. The post-processing methodology involves reshaping the predictor distributions by balancing the positive and negative bias explanations and allows for the regressor bias to decrease. We design an algorithm that uses Bayesian optimization to construct the bias-performance efficient frontier over the family of post-processed models, from which an optimal model is selected. Our novel methodology performs optimization in low-dimensional spaces and avoids expensive model retraining.
Submitted 19 November, 2021;
originally announced November 2021.
-
On the construction of cospectral nonisomorphic bipartite graphs
Authors:
M. Rajesh Kannan,
Shivaramakrishna Pragada,
Hitesh Wankhede
Abstract:
In this article, we construct bipartite graphs which are cospectral for both the adjacency and normalized Laplacian matrices using partitioned tensor product. This extends the construction of Ji, Gong, and Wang \cite{ji-gong-wang}. Our proof of the cospectrality of adjacency matrices simplifies the proof of the bipartite case of Godsil and McKay's construction \cite{godsil-mckay-1976}, and shows that the corresponding normalized Laplacian matrices are also cospectral. We partially characterize the isomorphism in Godsil and McKay's construction, and generalize Ji et al.'s characterization of the isomorphism to biregular bipartite graphs. The essential idea in characterizing the isomorphism uses Hammack's cancellation law as opposed to Hall's marriage theorem used by Ji et al.
Submitted 10 April, 2022; v1 submitted 18 October, 2021;
originally announced October 2021.
-
Bit Complexity of Jordan Normal Form and Spectral Factorization
Authors:
Papri Dey,
Ravi Kannan,
Nick Ryder,
Nikhil Srivastava
Abstract:
We study the bit complexity of two related fundamental computational problems in linear algebra and control theory. Our results are: (1) An $\tilde{O}(n^{ω+3}a+n^4a^2+n^ω\log(1/ε))$ time algorithm for finding an $ε-$approximation to the Jordan Normal form of an integer matrix with $a-$bit entries, where $ω$ is the exponent of matrix multiplication. (2) An $\tilde{O}(n^6d^6a+n^4d^4a^2+n^3d^3\log(1/ε))$ time algorithm for $ε$-approximately computing the spectral factorization $P(x)=Q^*(x)Q(x)$ of a given monic $n\times n$ rational matrix polynomial of degree $2d$ with rational $a-$bit coefficients having $a-$bit common denominators, which satisfies $P(x)\succeq 0$ for all real $x$. The first algorithm is used as a subroutine in the second one.
Despite its central importance, polynomial complexity bounds were not previously known for spectral factorization, and for Jordan form the best previous running time was an unspecified polynomial in $n$ of degree at least twelve \cite{cai1994computing}. Our algorithms are simple and judiciously combine techniques from numerical and symbolic computation, yielding significant advantages over either approach by itself.
Submitted 25 November, 2022; v1 submitted 28 September, 2021;
originally announced September 2021.
-
Eccentricity energy change of complete multipartite graphs due to edge deletion
Authors:
Iswar Mahato,
M. Rajesh Kannan
Abstract:
The eccentricity matrix $\varepsilon(G)$ of a graph $G$ is obtained from the distance matrix of $G$ by retaining the largest distances in each row and each column, and leaving zeros in the remaining ones. The eccentricity energy of $G$ is the sum of the absolute values of the eigenvalues of $\varepsilon(G)$. Although the eccentricity matrices of graphs are closely related to the distance matrices of graphs, a number of properties of eccentricity matrices are substantially different from those of the distance matrices. The change in eccentricity energy of a graph due to an edge deletion is one such property. In this article, we give examples of graphs for which the eccentricity energy increases (resp., decreases) but the distance energy decreases (resp., increases) due to an edge deletion. Also, we prove that the eccentricity energy of the complete $k$-partite graph $K_{n_1,\hdots,n_k}$ with $k\geq 2$ and $ n_i\geq 2$ increases due to an edge deletion.
Submitted 7 July, 2021;
originally announced July 2021.
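As a rough illustration of the definition in the abstract above, the following pure-Python sketch builds the eccentricity matrix of a small connected graph. It assumes the usual entrywise reading of "retaining the largest distances in each row and each column": the entry $d(u,v)$ is kept exactly when it equals $\min(e(u), e(v))$, where $e$ denotes eccentricity. The function names and the adjacency-list input format are illustrative choices, not taken from the paper.

```python
from collections import deque


def distance_matrix(adj):
    """All-pairs distances of a connected, unweighted graph via BFS.

    adj is an adjacency list: adj[u] is the list of neighbours of u.
    """
    n = len(adj)
    dist = [[None] * n for _ in range(n)]
    for s in range(n):
        dist[s][s] = 0
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if dist[s][v] is None:
                    dist[s][v] = dist[s][u] + 1
                    queue.append(v)
    return dist


def eccentricity_matrix(adj):
    """Keep d(u, v) only where it equals min(e(u), e(v)); zero elsewhere."""
    dist = distance_matrix(adj)
    ecc = [max(row) for row in dist]  # eccentricity of each vertex
    n = len(adj)
    return [
        [dist[u][v] if dist[u][v] == min(ecc[u], ecc[v]) else 0 for v in range(n)]
        for u in range(n)
    ]


# Path on four vertices 0 - 1 - 2 - 3 (assumed example graph).
path_p4 = [[1], [0, 2], [1, 3], [2]]
eps_p4 = eccentricity_matrix(path_p4)
```

For the path on four vertices, only the distances 2 and 3 survive, so the eccentricity matrix differs visibly from the distance matrix; the eccentricity energy would then be the sum of the absolute values of its eigenvalues.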
-
Stability theory of game-theoretic group feature explanations for machine learning models
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Khashayar Filom,
Arjun Ravi Kannan
Abstract:
In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds for both marginal and conditional explanations. The differences between the two games are then elucidated, such as showing that the marginal explanations can become discontinuous on some naturally-designed domains, while the conditional explanations remain stable. In the second part of our work, group explanation methodologies are devised based on game values with coalition structure, where the features are grouped based on dependencies. We show analytically that grouping features this way has a stabilizing effect on the marginal operator on both group and individual levels, and allows for the unification of marginal and conditional explanations. Our results are verified in a number of numerical experiments where an information-theoretic measure of dependence is used for grouping.
Submitted 10 August, 2024; v1 submitted 22 February, 2021;
originally announced February 2021.
-
Bounds for the extremal eigenvalues of gain Laplacian matrices
Authors:
M. Rajesh Kannan,
Navish Kumar,
Shivaramakrishna Pragada
Abstract:
A complex unit gain graph ($\mathbb{T}$-gain graph), $Φ= (G, \varphi)$ is a graph where the function $\varphi$ assigns a unit complex number to each orientation of an edge of $G$, and its inverse is assigned to the opposite orientation. A $ \mathbb{T} $-gain graph $Φ$ is balanced if the product of the edge gains of each cycle (with a fixed orientation) is $1$. Signed graphs are special cases of $\mathbb{T}$-gain graphs.
The adjacency matrix of $Φ$, denoted by $ \mathbf{A}(Φ)$, is defined canonically. The gain Laplacian for $Φ$ is defined as $\mathbf{L}(Φ) = \mathbf{D}(Φ) - \mathbf{A}(Φ)$, where $\mathbf{D}(Φ)$ is the diagonal matrix whose diagonal entries are the degrees of the vertices of $G$. The minimum number of vertices (resp., edges) to be deleted from $Φ$ in order to get a balanced gain graph is called the frustration number (resp., frustration index). We show that the frustration number and the frustration index are bounded below by the smallest eigenvalue of $\mathbf{L}(Φ)$. We provide several lower and upper bounds for extremal eigenvalues of $\mathbf{L}(Φ)$ in terms of different graph parameters such as the number of edges, vertex degrees, and average $2$-degrees. Since signed graphs are particular cases of $\mathbb{T}$-gain graphs, all the bounds established in the paper hold for signed graphs. Most of the bounds established here are new for signed graphs. Finally, we perform a comparative analysis of all the bounds obtained in the paper against the state-of-the-art bounds available in the literature for randomly generated Erdős-Rényi graphs.
Some of the major highlights of our paper are the gain-dependent bounds, limit convergence of the bounds to the extremal eigenvalues, and optimal extremal bounds obtained by posing optimization problems to achieve the best possible bounds.
Submitted 9 May, 2021; v1 submitted 15 February, 2021;
originally announced February 2021.
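The definition $\mathbf{L}(Φ) = \mathbf{D}(Φ) - \mathbf{A}(Φ)$ can be tried out numerically. The sketch below assumes NumPy is available and uses a hypothetical input format (a dictionary from one orientation of each edge to its unit complex gain); the helper names are illustrative and not from the paper. It compares a balanced and an unbalanced triangle.

```python
import numpy as np


def gain_laplacian(n, gains):
    """Build L = D - A for a gain graph on vertices 0..n-1.

    gains maps one orientation (u, v) of each edge to a unit complex number;
    the opposite orientation receives the inverse, i.e. the complex conjugate.
    """
    A = np.zeros((n, n), dtype=complex)
    for (u, v), g in gains.items():
        A[u, v] = g
        A[v, u] = np.conj(g)
    degrees = np.count_nonzero(A, axis=1)  # degrees of the underlying graph
    return np.diag(degrees).astype(complex) - A


def smallest_eigenvalue(L):
    """L is Hermitian, so eigvalsh returns real eigenvalues in ascending order."""
    return float(np.linalg.eigvalsh(L)[0])


# Triangle with all gains 1: every cycle product is 1, so it is balanced.
balanced = gain_laplacian(3, {(0, 1): 1, (1, 2): 1, (0, 2): 1})
# Same triangle with one gain flipped to -1: the cycle product is -1.
unbalanced = gain_laplacian(3, {(0, 1): 1, (1, 2): 1, (0, 2): -1})

lam_balanced = smallest_eigenvalue(balanced)
lam_unbalanced = smallest_eigenvalue(unbalanced)
```

In this small case the balanced triangle has smallest eigenvalue 0, while the unbalanced one has smallest eigenvalue 1. Deleting a single edge or vertex of the unbalanced triangle leaves a balanced graph, so the value 1 is consistent with the lower bound stated in the abstract.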
-
Gain distance matrices for complex unit gain graphs
Authors:
Aniruddha Samanta,
M. Rajesh Kannan
Abstract:
A complex unit gain graph ($ \mathbb{T} $-gain graph), $ Φ=(G, \varphi) $ is a graph where the function $ \varphi $ assigns a unit complex number to each orientation of an edge of $ G $, and its inverse is assigned to the opposite orientation. In this article, we propose gain distance matrices for $ \mathbb{T} $-gain graphs. These notions generalize the corresponding known concepts of distance matrices and signed distance matrices. Shahul K. Hameed et al. introduced signed distance matrices and developed their properties. Motivated by their work, we establish several spectral properties, including some equivalences between balanced $ \mathbb{T} $-gain graphs and gain distance matrices. Furthermore, we introduce the notion of positively weighted $ \mathbb{T} $-gain graphs and study some of their properties. Using these properties, Acharya's and Stanić's spectral criteria for balance are deduced. Moreover, the notions of order independence and distance compatibility are studied. Besides, we obtain some characterizations for distance compatibility.
Submitted 27 January, 2021;
originally announced January 2021.
-
On the multiplicity of $A_α$-eigenvalues and the rank of complex unit gain graphs
Authors:
Aniruddha Samanta,
M. Rajesh Kannan
Abstract:
Let $ Φ=(G, \varphi) $ be a connected complex unit gain graph ($ \mathbb{T} $-gain graph) on a simple graph $ G $ with $ n $ vertices and maximum vertex degree $ Δ$. The associated adjacency matrix and degree matrix are denoted by $ A(Φ) $ and $ D(Φ) $, respectively. Let $ m_α(Φ,λ) $ be the multiplicity of $ λ$ as an eigenvalue of $ A_α(Φ) :=αD(Φ)+(1-α)A(Φ)$, for $ α\in[0,1) $. In this article, we establish that $ m_α(Φ, λ)\leq \frac{(Δ-2)n+2}{Δ-1}$, and characterize the classes of graphs for which equality holds. Furthermore, we establish a couple of bounds for the rank of $A(Φ)$ in terms of the maximum vertex degree and the number of vertices. One of the main results extends a result known for unweighted graphs and simplifies the proof in [15], and other results provide better bounds for $r(Φ)$ than the bounds known in [8].
Submitted 11 January, 2021;
originally announced January 2021.
-
Heteroscedasticity-aware residuals-based contextual stochastic optimization
Authors:
Rohit Kannan,
Güzin Bayraksan,
James Luedtke
Abstract:
We explore generalizations of some integrated learning and optimization frameworks for data-driven contextual stochastic optimization that can adapt to heteroscedasticity. We identify conditions on the stochastic program, data generation process, and the prediction setup under which these generalizations possess asymptotic and finite sample guarantees for a class of stochastic programs, including two-stage stochastic mixed-integer programs with continuous recourse. We verify that our assumptions hold for popular parametric and nonparametric regression methods.
Submitted 8 January, 2021;
originally announced January 2021.
-
Residuals-based distributionally robust optimization with covariate information
Authors:
Rohit Kannan,
Güzin Bayraksan,
James R. Luedtke
Abstract:
We consider data-driven approaches that integrate a machine learning prediction model within distributionally robust optimization (DRO) given limited joint observations of uncertain parameters and covariates. Our framework is flexible in the sense that it can accommodate a variety of regression setups and DRO ambiguity sets. We investigate asymptotic and finite sample properties of solutions obtained using Wasserstein, sample robust optimization, and phi-divergence-based ambiguity sets within our DRO formulations, and explore cross-validation approaches for sizing these ambiguity sets. Through numerical experiments, we validate our theoretical results, study the effectiveness of our approaches for sizing ambiguity sets, and illustrate the benefits of our DRO formulations in the limited data regime even when the prediction model is misspecified.
Submitted 25 May, 2022; v1 submitted 2 December, 2020;
originally announced December 2020.
-
Wasserstein-based fairness interpretability framework for machine learning models
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Ryan Franks,
Arjun Ravi Kannan
Abstract:
The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the favorability of both the model and predictors with respect to the non-protected class. The quantification is accomplished by the use of transport theory, which gives rise to the decomposition of the model bias and bias explanations to positive and negative contributions. To gain more insight into the role of favorability and allow for additivity of bias explanations, we adapt techniques from cooperative game theory.
Submitted 8 March, 2022; v1 submitted 5 November, 2020;
originally announced November 2020.
-
Normalized Laplacians for Gain Graphs
Authors:
M. Rajesh Kannan,
Navish Kumar,
Shivaramakrishna Pragada
Abstract:
We propose the notion of the normalized Laplacian matrix $\mathcal{L}(Φ)$ for gain graphs and study its properties in detail, providing insights and counterexamples along the way. We establish bounds for the eigenvalues of $\mathcal{L}(Φ)$ and characterize the classes of graphs for which equality holds. The relationships between balancedness, bipartiteness, and their connection to the spectrum of $\mathcal{L}(Φ)$ are also studied. Besides, we extend the edge version of eigenvalue interlacing to gain graphs. Thereupon, we determine the coefficients of the characteristic polynomial of $\mathcal{L}(Φ)$.
Submitted 10 March, 2022; v1 submitted 29 September, 2020;
originally announced September 2020.
-
Interval hulls of $N$-matrices and almost $P$-matrices
Authors:
Projesh Nath Choudhury,
M. Rajesh Kannan
Abstract:
We establish a characterization of almost $P$-matrices via a sign non-reversal property. In this we are inspired by the analogous results for $N$-matrices. Next, the interval hull of two $m \times n$ matrices $A=(a_{ij})$ and $B = (b_{ij})$, denoted by $\mathbb{I}(A,B)$, is the collection of all matrices $C \in \mathbb{R}^{m \times n}$ such that each $c_{ij}$ is a convex combination of $a_{ij}$ and $b_{ij}$. Using the sign non-reversal property, we identify a finite subset of $\mathbb{I}(A,B)$ that determines if all matrices in $\mathbb{I}(A,B)$ are $N$-matrices/almost $P$-matrices. This provides a test for an entire class of matrices simultaneously to be $N$-matrices/almost $P$-matrices. We also establish analogous results for semipositive and minimally semipositive matrices. These characterizations may be considered similar in spirit to that of $P$-matrices by Bialas-Garloff [Linear Algebra Appl. 1984] and Rohn-Rex [SIMAX 1996], and of positive definite matrices by Rohn [SIMAX 1994].
Submitted 4 September, 2020;
originally announced September 2020.
-
On the $A_α$-spectra of some join graphs
Authors:
Mainak Basunia,
Iswar Mahato,
M. Rajesh Kannan
Abstract:
Let $G$ be a simple, connected graph and let $A(G)$ be the adjacency matrix of $G$. If $D(G)$ is the diagonal matrix of the vertex degrees of $G$, then for every real $α\in [0,1]$, the matrix $A_α(G)$ is defined as $$A_α(G) = αD(G) + (1- α) A(G).$$ The eigenvalues of the matrix $A_α(G)$ form the $A_α$-spectrum of $G$. Let $G_1 \dot{\vee} G_2$, $G_1 \underline{\vee} G_2$, $G_1 \langle \textrm{v} \rangle G_2$ and $G_1 \langle \textrm{e} \rangle G_2$ denote the subdivision-vertex join, subdivision-edge join, $R$-vertex join and $R$-edge join of two graphs $G_1$ and $G_2$, respectively. In this paper, we compute the $A_α$-spectra of $G_1 \dot{\vee} G_2$, $G_1 \underline{\vee} G_2$, $G_1 \langle \textrm{v} \rangle G_2$ and $G_1 \langle \textrm{e} \rangle G_2$ for a regular graph $G_1$ and an arbitrary graph $G_2$ in terms of their $A_α$-eigenvalues. As an application of these results, we construct infinitely many pairs of $A_α$-cospectral graphs.
Submitted 24 August, 2020;
originally announced August 2020.
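The matrix $A_α(G) = αD(G) + (1-α)A(G)$ defined in the abstract above is straightforward to form numerically. The following is a minimal sketch assuming NumPy; the function name and the dense adjacency-matrix input are illustrative choices and do not come from the paper, which computes these spectra symbolically for join graphs.

```python
import numpy as np


def a_alpha_spectrum(adjacency, alpha):
    """Eigenvalues (ascending) of A_alpha = alpha * D + (1 - alpha) * A.

    adjacency is the symmetric 0/1 adjacency matrix of a simple graph.
    """
    A = np.asarray(adjacency, dtype=float)
    D = np.diag(A.sum(axis=1))  # diagonal matrix of vertex degrees
    return np.linalg.eigvalsh(alpha * D + (1.0 - alpha) * A)


# Complete graph K_3 as an assumed example.
k3 = [[0, 1, 1],
      [1, 0, 1],
      [1, 1, 0]]
spectrum_half = a_alpha_spectrum(k3, 0.5)   # interpolates between A and D
spectrum_zero = a_alpha_spectrum(k3, 0.0)   # reduces to the adjacency spectrum
```

For $K_3$ the adjacency eigenvalues are $2, -1, -1$ and every degree is $2$, so with $α = 1/2$ the $A_α$-eigenvalues are $2, 1/2, 1/2$. Two graphs would be $A_α$-cospectral when this function returns the same values for both.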
-
Sign non-reversal property for totally non-negative and totally positive matrices, and testing total positivity of their interval hull
Authors:
Projesh Nath Choudhury,
M. Rajesh Kannan,
Apoorva Khare
Abstract:
A matrix $A$ is totally positive (or non-negative) of order $k$, denoted $TP_k$ (or $TN_k$), if all minors of size $\leq k$ are positive (or non-negative). It is well-known that such matrices are characterized by the variation diminishing property together with the sign non-reversal property. We do away with the former, and show that $A$ is $TP_k$ if and only if every submatrix formed from at most $k$ consecutive rows and columns has the sign non-reversal property. In fact this can be strengthened to only consider test vectors in $\mathbb{R}^k$ with alternating signs. We also show a similar characterization for all $TN_k$ matrices - more strongly, both of these characterizations use a single vector (with alternating signs) for each square submatrix. These characterizations are novel, and similar in spirit to the fundamental results characterizing $TP$ matrices by Gantmacher-Krein [Compos. Math. 1937] and $P$-matrices by Gale-Nikaido [Math. Ann. 1965].
As an application, we study the interval hull $\mathbb{I}(A,B)$ of two $m \times n$ matrices $A=(a_{ij})$ and $B = (b_{ij})$. This is the collection of $C \in \mathbb{R}^{m \times n}$ such that each $c_{ij}$ is between $a_{ij}$ and $b_{ij}$. Using the sign non-reversal property, we identify a two-element subset of $\mathbb{I}(A,B)$ that detects the $TP_k$ property for all of $\mathbb{I}(A,B)$ for arbitrary $k \geq 1$. In particular, this provides a test for total positivity (of any order), simultaneously for an entire class of rectangular matrices. In parallel, we also provide a finite set to test the total non-negativity (of any order) of an interval hull $\mathbb{I}(A,B)$.
Submitted 13 February, 2021; v1 submitted 20 July, 2020;
originally announced July 2020.
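For reference, the definition of $TP_k$ and $TN_k$ in the abstract above can be checked directly by enumerating minors. The sketch below is only this brute-force definition in pure Python with exact rational arithmetic; it is not the sign non-reversal test from the paper, and its cost grows combinatorially with the matrix size, which is part of what makes shorter characterizations attractive. All names are illustrative.

```python
from fractions import Fraction
from itertools import combinations


def det(M):
    """Exact determinant by Gaussian elimination over the rationals."""
    M = [[Fraction(x) for x in row] for row in M]
    n = len(M)
    result = Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if M[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            M[c], M[pivot] = M[pivot], M[c]
            result = -result  # a row swap flips the sign
        result *= M[c][c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            for j in range(c, n):
                M[r][j] -= factor * M[c][j]
    return result


def is_totally_positive(A, k, strict=True):
    """True if all minors of size <= k are positive (strict) or non-negative."""
    m, n = len(A), len(A[0])
    for size in range(1, min(k, m, n) + 1):
        for rows in combinations(range(m), size):
            for cols in combinations(range(n), size):
                minor = det([[A[i][j] for j in cols] for i in rows])
                if minor < 0 or (strict and minor == 0):
                    return False
    return True


tp2_example = is_totally_positive([[1, 1], [1, 2]], 2)        # determinant 1
not_tp2_example = is_totally_positive([[1, 2], [3, 4]], 2)    # determinant -2
```

The second matrix has all entries positive, so it is $TP_1$, but its determinant is negative, so it fails $TP_2$. Passing `strict=False` checks $TN_k$ instead.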
-
Bounds for a solution set of linear complementarity problems over Hilbert spaces
Authors:
Projesh Nath Choudhury,
M. Rajesh Kannan,
K. C. Sivakumar
Abstract:
Let $H$ be a real Hilbert space. In this short note, using some of the properties of bounded linear operators with closed range defined on $H$, certain bounds for a specific convex subset of the solution set of infinite linear complementarity problems are established.
Submitted 29 June, 2020;
originally announced June 2020.
-
Bounds for the energy of a complex unit gain graph
Authors:
Aniruddha Samanta,
M. Rajesh Kannan
Abstract:
A $\mathbb{T}$-gain graph, $Φ= (G, \varphi)$, is a graph in which the function $\varphi$ assigns a unit complex number to each orientation of an edge, and its inverse is assigned to the opposite orientation. The associated adjacency matrix $ A(Φ) $ is defined canonically. The energy $ \mathcal{E}(Φ) $ of a $ \mathbb{T} $-gain graph $ Φ$ is the sum of the absolute values of all eigenvalues of $ A(Φ) $.
We study the notion of energy of a vertex of a $ \mathbb{T} $-gain graph, and establish bounds for it. For any $ \mathbb{T} $-gain graph $ Φ$, we prove that $2τ(G)-2c(G) \leq \mathcal{E}(Φ) \leq 2τ(G)\sqrt{Δ(G)}$, where $ τ(G), c(G)$ and $ Δ(G)$ are the vertex cover number, the number of odd cycles and the largest vertex degree of $ G $, respectively. Furthermore, using the properties of vertex energy, we characterize the classes of $ \mathbb{T} $-gain graphs for which $ \mathcal{E}(Φ)=2τ(G)-2c(G) $ holds. Also, we characterize the classes of $ \mathbb{T} $-gain graphs for which $\mathcal{E}(Φ)= 2τ(G)\sqrt{Δ(G)} $ holds. This characterization solves a general version of an open problem. In addition, we establish bounds for the energy in terms of the spectral radius of the associated adjacency matrix.
Submitted 18 May, 2020;
originally announced May 2020.
-
On dense subsets of matrices with distinct eigenvalues and distinct singular values
Authors:
Himadri Lal Das,
M. Rajesh Kannan
Abstract:
It is well known that the set of all $ n \times n $ matrices with distinct eigenvalues is a dense subset of the set of all real or complex $ n \times n $ matrices. In [Hartfiel, D. J. Dense sets of diagonalizable matrices. Proc. Amer. Math. Soc., 123(6): 1669-1672, 1995.], the author established a necessary and sufficient condition for a subspace of the set of all $n \times n$ matrices to have a dense subset of matrices with distinct eigenvalues. We are interested in finding a few necessary and sufficient conditions for a subset of the set of all $n \times n$ real or complex matrices to have a dense subset of matrices with distinct eigenvalues. Some of our results generalize the results of Hartfiel. Also, we study the existence of dense subsets of matrices with distinct singular values, distinct analytic eigenvalues, and distinct analytic singular values, respectively, in the subsets of the set of all real or complex matrices.
Submitted 29 March, 2020;
originally announced March 2020.
-
On the construction of cospectral graphs for the adjacency and normalized Laplacian matrices
Authors:
M. Rajesh Kannan,
Shivaramakrishna Pragada
Abstract:
In [Steve Butler. A note about cospectral graphs for the adjacency and normalized Laplacian matrices. Linear Multilinear Algebra, 58(3-4):387-390, 2010.], Butler constructed a family of bipartite graphs, which are cospectral for both the adjacency and the normalized Laplacian matrices. In this article, we extend this construction for generating larger classes of bipartite graphs, which are cospectral for both the adjacency and the normalized Laplacian matrices. Also, we provide a couple of constructions of non-bipartite graphs, which are cospectral for the adjacency matrices but not necessarily for the normalized Laplacian matrices.
Submitted 3 February, 2020;
originally announced February 2020.
-
Stochastic DC Optimal Power Flow With Reserve Saturation
Authors:
Rohit Kannan,
James R. Luedtke,
Line A. Roald
Abstract:
We propose an optimization framework for stochastic optimal power flow with uncertain loads and renewable generator capacity. Our model follows previous work in assuming that generator outputs respond to load imbalances according to an affine control policy, but introduces a model of saturation of generator reserves by assuming that when a generator's target level hits its limit, it abandons the affine policy and produces at that limit. This is a particularly interesting feature in models where wind power plants, which have uncertain upper generation limits, are scheduled to provide reserves to balance load fluctuations. The resulting model is a nonsmooth nonconvex two-stage stochastic program, and we use a stochastic approximation method to find stationary solutions to a smooth approximation. Computational results on 6-bus and 118-bus test instances demonstrate that by considering the effects of saturation, our model can yield solutions with lower expected generation costs (at the same target line violation probability level) than those obtained from a model that enforces the affine policy to stay within generator limits with high probability.
Submitted 10 October, 2019;
originally announced October 2019.
-
On the eigenvalue region of permutative doubly stochastic matrices
Authors:
Amrita Mandal,
Bibhas Adhikari,
M. Rajesh Kannan
Abstract:
This paper is devoted to the study of the eigenvalue region of the doubly stochastic matrices which are also permutative, that is, each row of such a matrix is a permutation of any other row. We call these matrices permutative doubly stochastic (PDS) matrices. A method is proposed to obtain a symbolic representation of all PDS matrices of order $n$ by finding equivalence classes of permutationally similar symbolic PDS matrices. This is a hard problem in general as it boils down to finding all Latin squares of order $n.$ However, explicit symbolic representations of matrices in these classes are determined in this paper when $n=2, 3, 4.$ It is shown that the eigenvalue regions are the same for doubly stochastic matrices and PDS matrices when $n=2, 3.$ It is also established that this is no longer true for $n=4,$ and two line segments are determined which belong to the eigenvalue region of doubly stochastic matrices but not to the eigenvalue region of PDS matrices. Thus a conjecture is developed for the boundary of the eigenvalue region of PDS matrices of order $4.$ Finally, inclusion theorems for the eigenvalue region of PDS matrices are proved when $n\geq 2.$
Submitted 18 April, 2020; v1 submitted 4 October, 2019;
originally announced October 2019.
-
Counting the Number of Non-Equivalent Classes of Fuzzy Matrices Using Combinatorial Techniques
Authors:
S. R. Kannan,
Rajesh Kumar Mohapatra
Abstract:
This paper constructs an explicit combinatorial formula for the number of all distinct fuzzy matrices of finite order, which leads to a new sequence. To obtain this sequence, we study the equivalence classes on the set of all fuzzy matrices of a given order under a suitable natural equivalence relation. In addition, this paper characterizes the properties of non-equivalent classes of fuzzy matrices of order n with elements having degrees of membership values anywhere in the closed unit interval [0,1]. Further, this paper derives some relevant results by enumerating all distinct fuzzy matrices of a given order in general. We achieve these results by incorporating the notion of k-level fuzzy matrices, chains, and flags (maximal chains).
Keywords: Fuzzy matrices; k-level fuzzy matrices; Chains; Flags; Binomial numbers
Submitted 26 September, 2019;
originally announced September 2019.
-
On the spectral radius and the energy of eccentricity matrix of a graph
Authors:
Iswar Mahato,
R. Gurusamy,
M. Rajesh Kannan,
S. Arockiaraj
Abstract:
The eccentricity matrix $\varepsilon(G)$ of a graph $G$ is obtained from the distance matrix by retaining the eccentricities (the largest distance) in each row and each column. In this paper, we give a characterization of the star graph, among the trees, in terms of invertibility of the associated eccentricity matrix. The largest eigenvalue of $\varepsilon(G)$ is called the $\varepsilon$-spectral radius, and the eccentricity energy (or the $\varepsilon$-energy) of $G$ is the sum of the absolute values of the eigenvalues of $\varepsilon(G)$. We establish some bounds for the $\varepsilon$-spectral radius and characterize the extreme graphs. Two graphs are said to be $\varepsilon$-equienergetic if they have the same $\varepsilon$-energy. For any $n \geq 5$, we construct a pair of $\varepsilon$-equienergetic graphs on $n$ vertices, which are not $\varepsilon$-cospectral.
Submitted 12 September, 2019;
originally announced September 2019.
-
PLANC: Parallel Low Rank Approximation with Non-negativity Constraints
Authors:
Srinivas Eswar,
Koby Hayashi,
Grey Ballard,
Ramakrishnan Kannan,
Michael A. Matheson,
Haesun Park
Abstract:
We consider the problem of low-rank approximation of massive dense non-negative tensor data, for example to discover latent patterns in video and imaging applications. As the size of data sets grows, single workstations are hitting bottlenecks in both computation time and available memory. We propose a distributed-memory parallel computing solution to handle massive data sets, loading the input data across the memories of multiple nodes and performing efficient and scalable parallel algorithms to compute the low-rank approximation. We present a software package called PLANC (Parallel Low Rank Approximation with Non-negativity Constraints), which implements our solution and allows for extension in terms of data (dense or sparse, matrices or tensors of any order), algorithm (e.g., from multiplicative updating techniques to alternating direction method of multipliers), and architecture (we exploit GPUs to accelerate the computation in this work). We describe our parallel distributions and algorithms, which are careful to avoid unnecessary communication and computation, show how to extend the software to include new algorithms and/or constraints, and report efficiency and scalability results for both synthetic and real-world data sets.
Submitted 30 August, 2019;
originally announced September 2019.
-
On the spectrum of complex unit gain graph
Authors:
Aniruddha Samanta,
M. Rajesh Kannan
Abstract:
A $\mathbb{T}$-gain graph is a simple graph in which a unit complex number is assigned to each orientation of an edge, and its inverse is assigned to the opposite orientation. The associated adjacency matrix is defined canonically, and is called the $\mathbb{T}$-gain adjacency matrix. Let $\mathbb{T}_{G} $ denote the collection of all $\mathbb{T}$-gain adjacency matrices on a graph $G$. In this article, we study the cospectrality of matrices in $\mathbb{T}_{G} $ and we establish equivalent conditions for a graph $G$ to be a tree in terms of the spectrum and the spectral radius of matrices in $\mathbb{T}_{G} $. We identify a class of connected graphs $\mathfrak{F^{'}}$ such that for each $G \in \mathfrak{F^{'}}$, the matrices in $\mathbb{T}_G$ have nonnegative real part up to diagonal unitary similarity. Then we establish bounds for the spectral radius of $\mathbb{T}$-gain adjacency matrices on $ G \in \mathfrak{F^{'}} $ in terms of their largest eigenvalues. Thereupon, we characterize $\mathbb{T}$-gain graphs for which the spectral radius of the associated $\mathbb{T}$-gain adjacency matrices equals the largest vertex degree of the underlying graph. These bounds generalize results known for the spectral radius of Hermitian adjacency matrices of digraphs and provide an alternate proof of a result about the sharpness of the bound in terms of the largest vertex degree established in [Krystal Guo, Bojan Mohar. Hermitian adjacency matrix of digraphs and mixed graphs. J. Graph Theory 85 (2017), no. 1, 217-248.].
Submitted 17 April, 2023; v1 submitted 28 August, 2019;
originally announced August 2019.
-
Spectra of eccentricity matrices of graphs
Authors:
Iswar Mahato,
R. Gurusamy,
M. Rajesh Kannan,
S. Arockiaraj
Abstract:
The eccentricity matrix of a connected graph $G$ is obtained from the distance matrix of $G$ by retaining the largest distances in each row and each column, and setting the remaining entries as $0$. In this article, a conjecture about the least eigenvalue of eccentricity matrices of trees, presented in the article [Jianfeng Wang, Mei Lu, Francesco Belardo, Milan Randic. The anti-adjacency matrix of a graph: Eccentricity matrix. Discrete Applied Mathematics, 251: 299-309, 2018.], is solved affirmatively. In addition to this, the spectra and the inertia of eccentricity matrices of various classes of graphs are investigated.
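The definition in the abstract above can be illustrated with a minimal sketch, assuming an unweighted connected graph given as adjacency lists. An entry of the distance matrix is the largest in its row or in its column exactly when $d_{ij}=\min(e(i),e(j))$, where $e(\cdot)$ is the eccentricity. The function names `distance_matrix` and `eccentricity_matrix` are illustrative and are not taken from the paper.

```python
from collections import deque


def distance_matrix(adj):
    """All-pairs shortest-path distances of a connected unweighted graph (BFS)."""
    n = len(adj)
    dist = [[0] * n for _ in range(n)]
    for source in range(n):
        seen = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in seen:
                    seen[v] = seen[u] + 1
                    queue.append(v)
        for v, d in seen.items():
            dist[source][v] = d
    return dist


def eccentricity_matrix(adj):
    """Keep d_ij only where it equals min(e(i), e(j)); set other entries to 0."""
    dist = distance_matrix(adj)
    n = len(adj)
    ecc = [max(row) for row in dist]  # eccentricity of each vertex
    return [
        [dist[i][j] if dist[i][j] == min(ecc[i], ecc[j]) else 0 for j in range(n)]
        for i in range(n)
    ]


# Path on four vertices 0-1-2-3, written as adjacency lists.
path4 = [[1], [0, 2], [1, 3], [2]]
print(eccentricity_matrix(path4))
```

For a star, every distance already equals the smaller of the two eccentricities involved, so the eccentricity matrix coincides with the distance matrix in that case.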
Submitted 7 February, 2019;
originally announced February 2019.
-
A stochastic approximation method for approximating the efficient frontier of chance-constrained nonlinear programs
Authors:
Rohit Kannan,
James Luedtke
Abstract:
We propose a stochastic approximation method for approximating the efficient frontier of chance-constrained nonlinear programs. Our approach is based on a bi-objective viewpoint of chance-constrained programs that seeks solutions on the efficient frontier of optimal objective value versus risk of constraint violation. To this end, we construct a reformulated problem whose objective is to minimize the probability of constraint violation subject to deterministic convex constraints (which include a bound on the objective function value). We adapt existing smoothing-based approaches for chance-constrained problems to derive a convergent sequence of smooth approximations of our reformulated problem, and apply a projected stochastic subgradient algorithm to solve it. In contrast with exterior sampling-based approaches (such as sample average approximation) that approximate the original chance-constrained program with one having finite support, our proposal converges to stationary solutions of a smooth approximation of the original problem, thereby avoiding poor local solutions that may be an artefact of a fixed sample. Our proposal also includes a tailored implementation of the smoothing-based approach that chooses key algorithmic parameters based on problem data. Computational results on four test problems from the literature indicate that our proposed approach can efficiently determine good approximations of the efficient frontier.
Submitted 28 May, 2020; v1 submitted 17 December, 2018;
originally announced December 2018.
-
Eigenvalue bounds for some classes of matrices associated with graphs
Authors:
Ranjit Mehatari,
M. Rajesh Kannan
Abstract:
For a given complex square matrix $A$ with constant row sum, we establish two new eigenvalue inclusion sets. Using these bounds, first we derive bounds for the second largest and smallest eigenvalues of adjacency matrices of $k$-regular graphs. Then, we establish some bounds for the second largest and the smallest eigenvalues of the normalized adjacency matrices of graphs and the second smallest eigenvalue and the largest eigenvalue of the Laplacian matrices of graphs. The sharpness of these bounds is verified by examples.
Submitted 25 August, 2020; v1 submitted 12 December, 2018;
originally announced December 2018.
-
On the adjacency matrix of a complex unit gain graph
Authors:
Ranjit Mehatari,
M. Rajesh Kannan,
Aniruddha Samanta
Abstract:
A complex unit gain graph is a simple graph in which each orientation of an edge is given a complex number with modulus 1 and its inverse is assigned to the opposite orientation of the edge. In this article, first we establish bounds for the eigenvalues of complex unit gain graphs. Then we study some of the properties of the adjacency matrix of a complex unit gain graph in connection with the characteristic and the permanental polynomials. Then we establish spectral properties of the adjacency matrices of complex unit gain graphs. In particular, using Perron-Frobenius theory, we establish a characterization for bipartite graphs in terms of the set of eigenvalues of the gain graph and the set of eigenvalues of the underlying graph. Also, we derive an equivalent condition on the gain so that the eigenvalues of the gain graph and the eigenvalues of the underlying graph are the same.
Submitted 3 October, 2019; v1 submitted 10 December, 2018;
originally announced December 2018.
-
Shifted CholeskyQR for computing the QR factorization of ill-conditioned matrices
Authors:
Takeshi Fukaya,
Ramaseshan Kannan,
Yuji Nakatsukasa,
Yusaku Yamamoto,
Yuka Yanagisawa
Abstract:
The Cholesky QR algorithm is an efficient communication-minimizing algorithm for computing the QR factorization of a tall-skinny matrix. Unfortunately, it suffers from inherent numerical instability and breakdown when the matrix is ill-conditioned. A recent work establishes that the instability can be cured by repeating the algorithm twice (called CholeskyQR2). However, the applicability of CholeskyQR2 is still limited by the requirement that the Cholesky factorization of the Gram matrix runs to completion, which means it does not always work for matrices $X$ with $κ_2(X)\gtrsim {\bf u}^{-\frac{1}{2}}$ where ${\bf u}$ is the unit roundoff. In this work we extend the applicability to $κ_2(X)=\mathcal{O}({\bf u}^{-1})$ by introducing a shift to the computed Gram matrix so as to guarantee the Cholesky factorization $R^TR= A^TA+sI$ succeeds numerically. We show that the computed $AR^{-1}$ has reduced condition number $\leq {\bf u}^{-\frac{1}{2}}$, for which CholeskyQR2 safely computes the QR factorization, yielding a computed $Q$ of orthogonality $\|Q^TQ-I\|_2$ and residual $\|A-QR\|_F/\|A\|_F$ both $\mathcal{O}({\bf u})$. Thus we obtain the required QR factorization by essentially running Cholesky QR thrice. We extensively analyze the resulting algorithm shiftedCholeskyQR to reveal its excellent numerical stability. shiftedCholeskyQR is also highly parallelizable, and applicable and effective also when working in an oblique inner product space. We illustrate our findings through experiments, in which we achieve significant (up to 40x) speedup over alternative methods.
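The three-pass structure described in this abstract can be sketched in plain Python as follows. This is only an illustration under stated assumptions: matrices are small lists of lists, the shift $s$ is supplied by the caller (how the authors choose it is not stated in the abstract), and all function names are hypothetical rather than taken from the authors' software.

```python
import math


def gram(A, shift=0.0):
    """Return A^T A + shift * I for a matrix stored as a list of rows."""
    m, n = len(A), len(A[0])
    G = [[sum(A[k][i] * A[k][j] for k in range(m)) for j in range(n)]
         for i in range(n)]
    for i in range(n):
        G[i][i] += shift
    return G


def cholesky_upper(G):
    """Upper-triangular R with R^T R = G; raises ValueError on breakdown."""
    n = len(G)
    R = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            s = G[i][j] - sum(R[k][i] * R[k][j] for k in range(i))
            if i == j:
                if s <= 0.0:
                    raise ValueError("Cholesky breakdown: non-positive pivot")
                R[i][i] = math.sqrt(s)
            else:
                R[i][j] = s / R[i][i]
    return R


def right_solve_upper(A, R):
    """Return Q = A R^{-1} by solving q R = a for every row a of A."""
    n = len(R)
    Q = []
    for a in A:
        q = [0.0] * n
        for j in range(n):
            q[j] = (a[j] - sum(q[k] * R[k][j] for k in range(j))) / R[j][j]
        Q.append(q)
    return Q


def matmul(X, Y):
    """Plain dense matrix product X Y."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def cholesky_qr(A, shift=0.0):
    """One Cholesky QR pass: R from chol(A^T A + shift I), then Q = A R^{-1}."""
    R = cholesky_upper(gram(A, shift))
    return right_solve_upper(A, R), R


def shifted_cholesky_qr3(A, shift):
    """Shifted pass as a preconditioner, followed by two unshifted passes."""
    Q1, R1 = cholesky_qr(A, shift)   # shifted pass; Q1 is not yet orthonormal
    Q2, R2 = cholesky_qr(Q1)         # first pass of CholeskyQR2
    Q, R3 = cholesky_qr(Q2)          # second pass of CholeskyQR2
    return Q, matmul(R3, matmul(R2, R1))
```

Because $A = Q_1R_1$ holds for any invertible $R_1$, the shift does not change the factorized matrix; it only makes the first Cholesky factorization succeed, and the product $R_3R_2R_1$ remains upper triangular.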
Submitted 28 September, 2018;
originally announced September 2018.
-
Parallel Nonnegative CP Decomposition of Dense Tensors
Authors:
Grey Ballard,
Koby Hayashi,
Ramakrishnan Kannan
Abstract:
The CP tensor decomposition is a low-rank approximation of a tensor. We present a distributed-memory parallel algorithm and implementation of an alternating optimization method for computing a CP decomposition of dense tensor data that can enforce nonnegativity of the computed low-rank factors. The principal task is to parallelize the matricized-tensor times Khatri-Rao product (MTTKRP) bottleneck subcomputation. The algorithm is computation efficient, using dimension trees to avoid redundant computation across MTTKRPs within the alternating method. Our approach is also communication efficient, using a data distribution and parallel algorithm across a multidimensional processor grid that can be tuned to minimize communication. We benchmark our software on synthetic as well as hyperspectral image and neuroscience dynamic functional connectivity data, demonstrating that our algorithm scales well to 100s of nodes (up to 4096 cores) and is faster and more general than the currently available parallel software.
Submitted 19 June, 2018;
originally announced June 2018.
-
A note on linear preservers on semipositive and minimal semipositive matrices
Authors:
Projesh Nath Choudhury,
M. Rajesh Kannan,
K. C. Sivakumar
Abstract:
Semipositive matrices (matrices that map at least one nonnegative vector to a positive vector) and minimally semipositive matrices (semipositive matrices with no semipositive column-deleted submatrix) are well studied in matrix theory. In this short note, we study the structure of linear maps which preserve the set of all semipositive and minimal semipositive matrices.
Submitted 19 June, 2018;
originally announced June 2018.
-
Resistance matrices of graphs with matrix weights
Authors:
Fouzul Atik,
Ravindra B Bapat,
M. Rajesh Kannan
Abstract:
The \emph{resistance matrix} of a simple connected graph $G$ is denoted by $R$, and is defined by $R =(r_{ij})$, where $r_{ij}$ is the resistance distance between the vertices $i$ and $j$ of $G$. In this paper, we consider the resistance matrix of a weighted graph whose edge weights are positive definite matrices of the same size. We derive a formula for the determinant and the inverse of the resistance matrix. Then, we establish an interlacing inequality for the eigenvalues of resistance and Laplacian matrices. Using this interlacing inequality, we obtain the inertia of the resistance matrix.
Submitted 4 April, 2018;
originally announced April 2018.
-
On distance and Laplacian matrices of trees with matrix weights
Authors:
Fouzul Atik,
M. Rajesh Kannan,
R. B. Bapat
Abstract:
The \emph{distance matrix} of a simple connected graph $G$ is $D(G)=(d_{ij})$, where $d_{ij}$ is the distance between the vertices $i$ and $j$ in $G$. We consider a weighted tree $T$ on $n$ vertices whose edge weights are square matrices of the same size. The distance $d_{ij}$ between the vertices $i$ and $j$ is the sum of the weight matrices of the edges in the unique path from $i$ to $j$. In this article we establish a characterization for trees in terms of the rank of the associated (matrix) weighted Laplacian matrix. Then we establish a necessary and sufficient condition for the distance matrix $D$, with matrix weights, to be invertible and the formula for the inverse of $D$, if it exists. Also we study some of the properties of the distance matrices of matrix weighted trees in connection with the Laplacian matrices, g-inverses and eigenvalues.
Submitted 27 October, 2017;
originally announced October 2017.