-
Kernel-Free Universum Quadratic Surface Twin Support Vector Machines for Imbalanced Data
Authors:
Hossein Moosaei,
Milan Hladík,
Ahmad Mousavi,
Zheming Gao,
Haojie Fu
Abstract:
Binary classification tasks with imbalanced classes pose significant challenges in machine learning. Traditional classifiers often struggle to accurately capture the characteristics of the minority class, resulting in biased models with subpar predictive performance. In this paper, we introduce a novel approach to tackle this issue by leveraging Universum points to support the minority class withi…
▽ More
Binary classification tasks with imbalanced classes pose significant challenges in machine learning. Traditional classifiers often struggle to accurately capture the characteristics of the minority class, resulting in biased models with subpar predictive performance. In this paper, we introduce a novel approach to tackle this issue by leveraging Universum points to support the minority class within quadratic twin support vector machine models. Unlike traditional classifiers, our models utilize quadratic surfaces instead of hyperplanes for binary classification, providing greater flexibility in modeling complex decision boundaries. By incorporating Universum points, our approach enhances classification accuracy and generalization performance on imbalanced datasets. We generated four artificial datasets to demonstrate the flexibility of the proposed methods. Additionally, we validated the effectiveness of our approach through empirical evaluations on benchmark datasets, showing superior performance compared to conventional classifiers and existing methods for imbalanced classification.
△ Less
Submitted 2 December, 2024;
originally announced December 2024.
-
A Brief Review of Explainable Artificial Intelligence in Healthcare
Authors:
Zahra Sadeghi,
Roohallah Alizadehsani,
Mehmet Akif Cifci,
Samina Kausar,
Rizwan Rehman,
Priyakshi Mahanta,
Pranjal Kumar Bora,
Ammar Almasri,
Rami S. Alkhawaldeh,
Sadiq Hussain,
Bilal Alatas,
Afshin Shoeibi,
Hossein Moosaei,
Milan Hladik,
Saeid Nahavandi,
Panos M. Pardalos
Abstract:
XAI refers to the techniques and methods for building AI applications which assist end users to interpret output and predictions of AI models. Black box AI applications in high-stakes decision-making situations, such as medical domain have increased the demand for transparency and explainability since wrong predictions may have severe consequences. Model explainability and interpretability are vit…
▽ More
XAI refers to the techniques and methods for building AI applications which assist end users to interpret output and predictions of AI models. Black box AI applications in high-stakes decision-making situations, such as medical domain have increased the demand for transparency and explainability since wrong predictions may have severe consequences. Model explainability and interpretability are vital successful deployment of AI models in healthcare practices. AI applications' underlying reasoning needs to be transparent to clinicians in order to gain their trust. This paper presents a systematic review of XAI aspects and challenges in the healthcare domain. The primary goals of this study are to review various XAI methods, their challenges, and related machine learning models in healthcare. The methods are discussed under six categories: Features-oriented methods, global methods, concept models, surrogate models, local pixel-based methods, and human-centric methods. Most importantly, the paper explores XAI role in healthcare problems to clarify its necessity in safety-critical applications. The paper intends to establish a comprehensive understanding of XAI-related applications in the healthcare field by reviewing the related experimental results. To facilitate future research for filling research gaps, the importance of XAI models from different viewpoints and their limitations are investigated.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Multi-task twin support vector machine with Universum data
Authors:
Hossein Moosaei,
Fatemeh Bazikar,
Milan Hladík
Abstract:
Multi-task learning (MTL) has emerged as a promising topic of machine learning in recent years, aiming to enhance the performance of numerous related learning tasks by exploiting beneficial information. During the training phase, most of the existing multi-task learning models concentrate entirely on the target task data and ignore the non-target task data contained in the target tasks. To address…
▽ More
Multi-task learning (MTL) has emerged as a promising topic of machine learning in recent years, aiming to enhance the performance of numerous related learning tasks by exploiting beneficial information. During the training phase, most of the existing multi-task learning models concentrate entirely on the target task data and ignore the non-target task data contained in the target tasks. To address this issue, Universum data, that do not correspond to any class of a classification problem, may be used as prior knowledge in the training model. This study looks at the challenge of multi-task learning using Universum data to employ non-target task data, which leads to better performance. It proposes a multi-task twin support vector machine with Universum data (UMTSVM) and provides two approaches to its solution. The first approach takes into account the dual formulation of UMTSVM and tries to solve a quadratic programming problem. The second approach formulates a least-squares version of UMTSVM and refers to it as LS-UMTSVM to further increase the generalization performance. The solution of the two primal problems in LS-UMTSVM is simplified to solving just two systems of linear equations, resulting in an incredibly simple and quick approach. Numerical experiments on several popular multi-task data sets and medical data sets demonstrate the efficiency of the proposed methods.
△ Less
Submitted 22 June, 2022;
originally announced June 2022.
-
Ontology Matching Through Absolute Orientation of Embedding Spaces
Authors:
Jan Portisch,
Guilherme Costa,
Karolin Stefani,
Katharina Kreplin,
Michael Hladik,
Heiko Paulheim
Abstract:
Ontology matching is a core task when creating interoperable and linked open datasets. In this paper, we explore a novel structure-based mapping approach which is based on knowledge graph embeddings: The ontologies to be matched are embedded, and an approach known as absolute orientation is used to align the two embedding spaces. Next to the approach, the paper presents a first, preliminary evalua…
▽ More
Ontology matching is a core task when creating interoperable and linked open datasets. In this paper, we explore a novel structure-based mapping approach which is based on knowledge graph embeddings: The ontologies to be matched are embedded, and an approach known as absolute orientation is used to align the two embedding spaces. Next to the approach, the paper presents a first, preliminary evaluation using synthetic and real-world datasets. We find in experiments with synthetic data, that the approach works very well on similarly structured graphs; it handles alignment noise better than size and structural differences in the ontologies.
△ Less
Submitted 8 April, 2022;
originally announced April 2022.
-
Background Knowledge in Schema Matching: Strategy vs. Data
Authors:
Jan Portisch,
Michael Hladik,
Heiko Paulheim
Abstract:
The use of external background knowledge can be beneficial for the task of matching schemas or ontologies automatically. In this paper, we exploit six general-purpose knowledge graphs as sources of background knowledge for the matching task. The background sources are evaluated by applying three different exploitation strategies. We find that explicit strategies still outperform latent ones and th…
▽ More
The use of external background knowledge can be beneficial for the task of matching schemas or ontologies automatically. In this paper, we exploit six general-purpose knowledge graphs as sources of background knowledge for the matching task. The background sources are evaluated by applying three different exploitation strategies. We find that explicit strategies still outperform latent ones and that the choice of the strategy has a greater impact on the final alignment than the actual background dataset on which the strategy is applied. While we could not identify a universally superior resource, BabelNet achieved consistently good results. Our best matcher configuration with BabelNet performs very competitively when compared to other matching systems even though no dataset-specific optimizations were made.
△ Less
Submitted 29 June, 2021;
originally announced July 2021.
-
Sparse Universum Quadratic Surface Support Vector Machine Models for Binary Classification
Authors:
Hossein Moosaei,
Ahmad Mousavi,
Milan Hladík,
Zheming Gao
Abstract:
In binary classification, kernel-free linear or quadratic support vector machines are proposed to avoid dealing with difficulties such as finding appropriate kernel functions or tuning their hyper-parameters. Furthermore, Universum data points, which do not belong to any class, can be exploited to embed prior knowledge into the corresponding models so that the generalization performance is improve…
▽ More
In binary classification, kernel-free linear or quadratic support vector machines are proposed to avoid dealing with difficulties such as finding appropriate kernel functions or tuning their hyper-parameters. Furthermore, Universum data points, which do not belong to any class, can be exploited to embed prior knowledge into the corresponding models so that the generalization performance is improved. In this paper, we design novel kernel-free Universum quadratic surface support vector machine models. Further, we propose the L1 norm regularized version that is beneficial for detecting potential sparsity patterns in the Hessian of the quadratic surface and reducing to the standard linear models if the data points are (almost) linearly separable. The proposed models are convex such that standard numerical solvers can be utilized for solving them. Nonetheless, we formulate a least squares version of the L1 norm regularized model and next, design an effective tailored algorithm that only requires solving one linear system. Several theoretical properties of these models are then reported/proved as well. We finally conduct numerical experiments on both artificial and public benchmark data sets to demonstrate the feasibility and effectiveness of the proposed models.
△ Less
Submitted 3 April, 2021;
originally announced April 2021.
-
Efficient Set-Based Approaches for the Reliable Computation of Robot Capabilities
Authors:
Joshua Pickard,
Vincent Padois,
Milan Hladík,
David Daney
Abstract:
To reliably model real robot characteristics, interval linear systems of equations allow to describe families of problems that consider sets of values. This allows to easily account for typical complexities such as sets of joint states and design parameter uncertainties. Inner approximations of the solutions to the interval linear systems can be used to describe the common capabilities of a roboti…
▽ More
To reliably model real robot characteristics, interval linear systems of equations allow to describe families of problems that consider sets of values. This allows to easily account for typical complexities such as sets of joint states and design parameter uncertainties. Inner approximations of the solutions to the interval linear systems can be used to describe the common capabilities of a robotic manipulator corresponding to the considered sets of values. In this work, several classes of problems are considered. For each class, reliable and efficient polytope, n-cube, and n-ball inner approximations are presented. The interval approaches usually proposed are inefficient because they are too computationally heavy for certain applications, such as control. We propose efficient new inner approximation theorems for the considered classes of problems. This allows for usage with real-time applications as well as rapid analysis of potential designs. Several applications are presented for a redundant planar manipulator including locally evaluating the manipulator's velocity, acceleration, and static force capabilities, and evaluating its future acceleration capabilities over a given time horizon.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
FinMatcher at FinSim-2: Hypernym Detection in the Financial Services Domain using Knowledge Graphs
Authors:
Jan Portisch,
Michael Hladik,
Heiko Paulheim
Abstract:
This paper presents the FinMatcher system and its results for the FinSim 2021 shared task which is co-located with the Workshop on Financial Technology on the Web (FinWeb) in conjunction with The Web Conference. The FinSim-2 shared task consists of a set of concept labels from the financial services domain. The goal is to find the most relevant top-level concept from a given set of concepts. The F…
▽ More
This paper presents the FinMatcher system and its results for the FinSim 2021 shared task which is co-located with the Workshop on Financial Technology on the Web (FinWeb) in conjunction with The Web Conference. The FinSim-2 shared task consists of a set of concept labels from the financial services domain. The goal is to find the most relevant top-level concept from a given set of concepts. The FinMatcher system exploits three publicly available knowledge graphs, namely WordNet, Wikidata, and WebIsALOD. The graphs are used to generate explicit features as well as latent features which are fed into a neural classifier to predict the closest hypernym.
△ Less
Submitted 2 March, 2021;
originally announced March 2021.
-
Positivity and convexity in incomplete cooperative games
Authors:
Martin Černý,
Jan Bok,
David Hartman,
Milan Hladík
Abstract:
Incomplete cooperative games generalise the classical model of cooperative games by omitting the values of some of the coalitions. This allows to incorporate uncertainty into the model and study the underlying games as well as possible payoff distribution based only on the partial information. In this paper we perform a systematic study of incomplete games, focusing on two important classes of coo…
▽ More
Incomplete cooperative games generalise the classical model of cooperative games by omitting the values of some of the coalitions. This allows to incorporate uncertainty into the model and study the underlying games as well as possible payoff distribution based only on the partial information. In this paper we perform a systematic study of incomplete games, focusing on two important classes of cooperative games: positive and convex games.
Regarding positivity, we generalise previous results for a special class of minimal incomplete games to general setting. We characterise non-extendability to a positive game by the existence of a certificate and provide a description of the set of positive extensions using its extreme games. The results are then used to obtain explicit formulas for several classes of incomplete games with special structures.
The second part deals with convexity. We begin with considering the case of non-negative minimal incomplete games. Then we survey existing results in the related theory of set functions, namely providing context to the problem of completing partial functions. We provide a characterisation of extendability and a full description of the set of symmetric convex extensions. The set serves as an approximation of the set of convex extensions.
Finally, we outline an entirely new perspective on a connection between incomplete cooperative games and cooperative interval games.
△ Less
Submitted 18 March, 2022; v1 submitted 16 October, 2020;
originally announced October 2020.
-
RDF2Vec Light -- A Lightweight Approach for Knowledge Graph Embeddings
Authors:
Jan Portisch,
Michael Hladik,
Heiko Paulheim
Abstract:
Knowledge graph embedding approaches represent nodes and edges of graphs as mathematical vectors. Current approaches focus on embedding complete knowledge graphs, i.e. all nodes and edges. This leads to very high computational requirements on large graphs such as DBpedia or Wikidata. However, for most downstream application scenarios, only a small subset of concepts is of actual interest. In this…
▽ More
Knowledge graph embedding approaches represent nodes and edges of graphs as mathematical vectors. Current approaches focus on embedding complete knowledge graphs, i.e. all nodes and edges. This leads to very high computational requirements on large graphs such as DBpedia or Wikidata. However, for most downstream application scenarios, only a small subset of concepts is of actual interest. In this paper, we present RDF2Vec Light, a lightweight embedding approach based on RDF2Vec which generates vectors for only a subset of entities. To that end, RDF2Vec Light only traverses and processes a subgraph of the knowledge graph. Our method allows the application of embeddings of very large knowledge graphs in scenarios where such embeddings were not possible before due to a significantly lower runtime and significantly reduced hardware requirements.
△ Less
Submitted 17 September, 2020; v1 submitted 16 September, 2020;
originally announced September 2020.
-
KGvec2go -- Knowledge Graph Embeddings as a Service
Authors:
Jan Portisch,
Michael Hladik,
Heiko Paulheim
Abstract:
In this paper, we present KGvec2go, a Web API for accessing and consuming graph embeddings in a light-weight fashion in downstream applications. Currently, we serve pre-trained embeddings for four knowledge graphs. We introduce the service and its usage, and we show further that the trained models have semantic value by evaluating them on multiple semantic benchmarks. The evaluation also reveals t…
▽ More
In this paper, we present KGvec2go, a Web API for accessing and consuming graph embeddings in a light-weight fashion in downstream applications. Currently, we serve pre-trained embeddings for four knowledge graphs. We introduce the service and its usage, and we show further that the trained models have semantic value by evaluating them on multiple semantic benchmarks. The evaluation also reveals that the combination of multiple models can lead to a better outcome than the best individual model.
△ Less
Submitted 9 March, 2020;
originally announced March 2020.
-
Duality Gap in Interval Linear Programming
Authors:
Jana Novotná,
Milan Hladík,
Tomáš Masařík
Abstract:
This paper deals with the problem of linear programming with inexact data represented by real closed intervals. Optimization problems with interval data arise in practical computations and they are of theoretical interest for more than forty years. We extend the concept of duality gap (DG), the difference between the primal and its dual optimal value, into interval linear programming. We consider…
▽ More
This paper deals with the problem of linear programming with inexact data represented by real closed intervals. Optimization problems with interval data arise in practical computations and they are of theoretical interest for more than forty years. We extend the concept of duality gap (DG), the difference between the primal and its dual optimal value, into interval linear programming. We consider two situations: First, DG is zero for every realization of interval parameters (the so called strongly zero DG) and, second, DG is zero for at least one realization of interval parameters (the so called weakly zero DG). We characterize strongly and weakly zero DG and its special case where the matrix of coefficients is real. We discuss computational complexity of testing weakly and strongly zero DG for commonly used types of interval linear programs and their variants with the real matrix of coefficients. We distinguish the NP-hard cases and the cases that are efficiently decidable. Based on DG conditions, we extend previous results about the bounds of the optimal value set given by Rohn. We provide equivalent statements for the bounds
△ Less
Submitted 15 February, 2018;
originally announced February 2018.
-
Interval Linear Algebra and Computational Complexity
Authors:
Jaroslav Horáček,
Milan Hladík,
Michal Černý
Abstract:
This work connects two mathematical fields - computational complexity and interval linear algebra. It introduces the basic topics of interval linear algebra - regularity and singularity, full column rank, solving a linear system, deciding solvability of a linear system, computing inverse matrix, eigenvalues, checking positive (semi)definiteness or stability. We discuss these problems and relations…
▽ More
This work connects two mathematical fields - computational complexity and interval linear algebra. It introduces the basic topics of interval linear algebra - regularity and singularity, full column rank, solving a linear system, deciding solvability of a linear system, computing inverse matrix, eigenvalues, checking positive (semi)definiteness or stability. We discuss these problems and relations between them from the view of computational complexity. Many problems in interval linear algebra are intractable, hence we emphasize subclasses of these problems that are easily solvable or decidable. The aim of this work is to provide a basic insight into this field and to provide materials for further reading and research.
△ Less
Submitted 31 January, 2016;
originally announced February 2016.
-
Selection-based Approach to Cooperative Interval Games
Authors:
Jan Bok,
Milan Hladík
Abstract:
Cooperative interval games are a generalized model of cooperative games in which the worth of every coalition corresponds to a closed interval representing the possible outcomes of its cooperation. Selections are all possible outcomes of the interval game with no additional uncertainty.
We introduce new selection-based classes of interval games and prove their characterization theorems and relat…
▽ More
Cooperative interval games are a generalized model of cooperative games in which the worth of every coalition corresponds to a closed interval representing the possible outcomes of its cooperation. Selections are all possible outcomes of the interval game with no additional uncertainty.
We introduce new selection-based classes of interval games and prove their characterization theorems and relations to existing classes based on the interval weakly better operator. We show new results regarding the core and imputations and examine a problem of equivalence for two different versions of the core, the main stability solution of cooperative games. Finally, we introduce the definition of strong imputation and strong core as universal solution concepts of interval games.
△ Less
Submitted 25 July, 2018; v1 submitted 14 October, 2014;
originally announced October 2014.
-
Efficient Solution of a Class of Quantified Constraints with Quantifier Prefix Exists-Forall
Authors:
Milan Hladík,
Stefan Ratschan
Abstract:
In various applications the search for certificates for certain properties (e.g., stability of dynamical systems, program termination) can be formulated as a quantified constraint solving problem with quantifier prefix exists-forall. In this paper, we present an algorithm for solving a certain class of such problems based on interval techniques in combination with conservative linear programming a…
▽ More
In various applications the search for certificates for certain properties (e.g., stability of dynamical systems, program termination) can be formulated as a quantified constraint solving problem with quantifier prefix exists-forall. In this paper, we present an algorithm for solving a certain class of such problems based on interval techniques in combination with conservative linear programming approximation. In comparison with previous work, the method is more general - allowing general Boolean structure in the input constraint, and more efficient - using splitting heuristics that learn from the success of previous linear programming approximations.
△ Less
Submitted 24 June, 2014; v1 submitted 20 December, 2013;
originally announced December 2013.
-
Characterizing and approximating eigenvalue sets of symmetric interval matrices
Authors:
Milan Hladik,
David Daney,
Elias Tsigaridas
Abstract:
We consider the eigenvalue problem for the case where the input matrix is symmetric and its entries perturb in some given intervals. We present a characterization of some of the exact boundary points, which allows us to introduce an inner approximation algorithm, that in many case estimates exact bounds. To our knowledge, this is the first algorithm that is able to guaran- tee exactness. We illust…
▽ More
We consider the eigenvalue problem for the case where the input matrix is symmetric and its entries perturb in some given intervals. We present a characterization of some of the exact boundary points, which allows us to introduce an inner approximation algorithm, that in many case estimates exact bounds. To our knowledge, this is the first algorithm that is able to guaran- tee exactness. We illustrate our approach by several examples and numerical experiments.
△ Less
Submitted 21 February, 2011;
originally announced February 2011.