-
Explainable Global Fairness Verification of Tree-Based Classifiers
Authors:
Stefano Calzavara,
Lorenzo Cazzaro,
Claudio Lucchese,
Federico Marcuzzi
Abstract:
We present a new approach to the global fairness verification of tree-based classifiers. Given a tree-based classifier and a set of sensitive features potentially leading to discrimination, our analysis synthesizes sufficient conditions for fairness, expressed as a set of traditional propositional logic formulas, which are readily understandable by human experts. The verified fairness guarantees a…
▽ More
We present a new approach to the global fairness verification of tree-based classifiers. Given a tree-based classifier and a set of sensitive features potentially leading to discrimination, our analysis synthesizes sufficient conditions for fairness, expressed as a set of traditional propositional logic formulas, which are readily understandable by human experts. The verified fairness guarantees are global, in that the formulas predicate over all the possible inputs of the classifier, rather than just a few specific test instances. Our analysis is formally proved both sound and complete. Experimental results on public datasets show that the analysis is precise, explainable to human experts and efficient enough for practical adoption.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
Beyond Robustness: Resilience Verification of Tree-Based Classifiers
Authors:
Stefano Calzavara,
Lorenzo Cazzaro,
Claudio Lucchese,
Federico Marcuzzi,
Salvatore Orlando
Abstract:
In this paper we criticize the robustness measure traditionally employed to assess the performance of machine learning models deployed in adversarial settings. To mitigate the limitations of robustness, we introduce a new measure called resilience and we focus on its verification. In particular, we discuss how resilience can be verified by combining a traditional robustness verification technique…
▽ More
In this paper we criticize the robustness measure traditionally employed to assess the performance of machine learning models deployed in adversarial settings. To mitigate the limitations of robustness, we introduce a new measure called resilience and we focus on its verification. In particular, we discuss how resilience can be verified by combining a traditional robustness verification technique with a data-independent stability analysis, which identifies a subset of the feature space where the model does not change its predictions despite adversarial manipulations. We then introduce a formally sound data-independent stability analysis for decision trees and decision tree ensembles, which we experimentally assess on public datasets and we leverage for resilience verification. Our results show that resilience verification is useful and feasible in practice, yielding a more reliable security assessment of both standard and robust decision tree models.
△ Less
Submitted 5 December, 2021;
originally announced December 2021.
-
Feature Partitioning for Robust Tree Ensembles and their Certification in Adversarial Scenarios
Authors:
Stefano Calzavara,
Claudio Lucchese,
Federico Marcuzzi,
Salvatore Orlando
Abstract:
Machine learning algorithms, however effective, are known to be vulnerable in adversarial scenarios where a malicious user may inject manipulated instances. In this work we focus on evasion attacks, where a model is trained in a safe environment and exposed to attacks at test time. The attacker aims at finding a minimal perturbation of a test instance that changes the model outcome.
We propose a…
▽ More
Machine learning algorithms, however effective, are known to be vulnerable in adversarial scenarios where a malicious user may inject manipulated instances. In this work we focus on evasion attacks, where a model is trained in a safe environment and exposed to attacks at test time. The attacker aims at finding a minimal perturbation of a test instance that changes the model outcome.
We propose a model-agnostic strategy that builds a robust ensemble by training its basic models on feature-based partitions of the given dataset. Our algorithm guarantees that the majority of the models in the ensemble cannot be affected by the attacker. We experimented the proposed strategy on decision tree ensembles, and we also propose an approximate certification method for tree ensembles that efficiently assess the minimal accuracy of a forest on a given dataset avoiding the costly computation of evasion attacks.
Experimental evaluation on publicly available datasets shows that proposed strategy outperforms state-of-the-art adversarial learning algorithms against evasion attacks.
△ Less
Submitted 7 April, 2020;
originally announced April 2020.
-
Machine Learning Tips and Tricks for Power Line Communications
Authors:
Andrea M. Tonello,
Nunzio A. Letizia,
Davide Righini,
Francesco Marcuzzi
Abstract:
A great deal of attention has been recently given to Machine Learning (ML) techniques in many different application fields. This paper provides a vision of what ML can do in Power Line Communications (PLC). We firstly and briefly describe classical formulations of ML, and distinguish deterministic from statistical learning models with relevance to communications. We then discuss ML applications in…
▽ More
A great deal of attention has been recently given to Machine Learning (ML) techniques in many different application fields. This paper provides a vision of what ML can do in Power Line Communications (PLC). We firstly and briefly describe classical formulations of ML, and distinguish deterministic from statistical learning models with relevance to communications. We then discuss ML applications in PLC for each layer, namely, for characterization and modeling, for the development of physical layer algorithms, for media access control and networking. Finally, other applications of PLC that can benefit from the usage of ML, as grid diagnostics, are analyzed. Illustrative numerical examples are reported to serve the purpose of validating the ideas and motivate future research endeavors in this stimulating signal/data processing field.
△ Less
Submitted 6 June, 2019; v1 submitted 24 April, 2019;
originally announced April 2019.
-
Analytic heuristics for a fast DSC-MRI
Authors:
Marco Virgulin,
Marco Castellaro,
Enrico Grisan,
Fabio Marcuzzi
Abstract:
In this paper we propose a deterministic approach for the reconstruction of Dynamic Susceptibility Contrast magnetic resonance imaging data and compare it with the compressed sensing solution existing in the literature for the same problem. Our study is based on the mathematical analysis of the problem, which is computationally intractable because of its non polynomial complexity, but suggests sim…
▽ More
In this paper we propose a deterministic approach for the reconstruction of Dynamic Susceptibility Contrast magnetic resonance imaging data and compare it with the compressed sensing solution existing in the literature for the same problem. Our study is based on the mathematical analysis of the problem, which is computationally intractable because of its non polynomial complexity, but suggests simple heuristics that perform quite well. We give results on real images and on artificial phantoms with added noise.
△ Less
Submitted 11 December, 2018;
originally announced December 2018.
-
Smart Routing for Improved PLC Backhauling of the Radio Access Network
Authors:
Francesco Marcuzzi,
Andrea M. Tonello
Abstract:
Transmission line theory enables the bottom up study of networks based on wireline infrastructures. This technique is here applied to a simulator that brings together powerline communication networks with small radio cells ones in a hybrid paradigm: this allows to implement a study of channel capacity and communication quality based on the geometrical properties of the network and the type of cabl…
▽ More
Transmission line theory enables the bottom up study of networks based on wireline infrastructures. This technique is here applied to a simulator that brings together powerline communication networks with small radio cells ones in a hybrid paradigm: this allows to implement a study of channel capacity and communication quality based on the geometrical properties of the network and the type of cables employed to enable connectivity. Results are shown regarding how deterministic properties related to performance in the network can be retrieved and used to enable a smart routing algorithm.
△ Less
Submitted 5 February, 2018;
originally announced March 2018.
-
Statistical Assessment of PLC Networking for Front-Hauling in Small Radio Cells
Authors:
Andrea M. Tonello,
Francesco Marcuzzi
Abstract:
The employment of power lines for communications (PLC) has been theorized almost a century ago; although the physical medium is not meant for data transmission, recent technical developments pushed the capacity boundary much higher than expected, allowing to consider PLC for new applications as the one considered in this contribution, i.e., as a solution for small cell back/front-hauling. In princ…
▽ More
The employment of power lines for communications (PLC) has been theorized almost a century ago; although the physical medium is not meant for data transmission, recent technical developments pushed the capacity boundary much higher than expected, allowing to consider PLC for new applications as the one considered in this contribution, i.e., as a solution for small cell back/front-hauling. In principle, back-hauling for cellular networks via PLC is very attractive given how pervasive the power line infrastructure is. Telecom operators are looking for solutions that can handle the traffic increase which is doublying every year. This calls for improvements of both cellular systems and front-hauling technologies that have to bridge the radio network with the core network. Small cells are currently advocated as the solution to spatially fragment the network and offer high capacity in densely populated areas; this is also due to the availability of spectrum at high frequency (mmWave links) which inherently offers large bandwidth but forces the cells to become smaller due to the short distance coverage. Therefore, it appears that mobile operators may eventually have to deploy an order of magnitude more (small) cells compared to existing networks having macro cells. This translates in a high density cellular network that poses more challenges for the back-haul.
△ Less
Submitted 9 November, 2016;
originally announced November 2016.