Search | arXiv e-print repository

Explainable Global Fairness Verification of Tree-Based Classifiers

Authors: Stefano Calzavara, Lorenzo Cazzaro, Claudio Lucchese, Federico Marcuzzi

Abstract: We present a new approach to the global fairness verification of tree-based classifiers. Given a tree-based classifier and a set of sensitive features potentially leading to discrimination, our analysis synthesizes sufficient conditions for fairness, expressed as a set of traditional propositional logic formulas, which are readily understandable by human experts. The verified fairness guarantees a… ▽ More We present a new approach to the global fairness verification of tree-based classifiers. Given a tree-based classifier and a set of sensitive features potentially leading to discrimination, our analysis synthesizes sufficient conditions for fairness, expressed as a set of traditional propositional logic formulas, which are readily understandable by human experts. The verified fairness guarantees are global, in that the formulas predicate over all the possible inputs of the classifier, rather than just a few specific test instances. Our analysis is formally proved both sound and complete. Experimental results on public datasets show that the analysis is precise, explainable to human experts and efficient enough for practical adoption. △ Less

Submitted 27 September, 2022; originally announced September 2022.

Comments: 15 pages with 7 figures

arXiv:2112.02705 [pdf, other]

Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Authors: Stefano Calzavara, Lorenzo Cazzaro, Claudio Lucchese, Federico Marcuzzi, Salvatore Orlando

Abstract: In this paper we criticize the robustness measure traditionally employed to assess the performance of machine learning models deployed in adversarial settings. To mitigate the limitations of robustness, we introduce a new measure called resilience and we focus on its verification. In particular, we discuss how resilience can be verified by combining a traditional robustness verification technique… ▽ More In this paper we criticize the robustness measure traditionally employed to assess the performance of machine learning models deployed in adversarial settings. To mitigate the limitations of robustness, we introduce a new measure called resilience and we focus on its verification. In particular, we discuss how resilience can be verified by combining a traditional robustness verification technique with a data-independent stability analysis, which identifies a subset of the feature space where the model does not change its predictions despite adversarial manipulations. We then introduce a formally sound data-independent stability analysis for decision trees and decision tree ensembles, which we experimentally assess on public datasets and we leverage for resilience verification. Our results show that resilience verification is useful and feasible in practice, yielding a more reliable security assessment of both standard and robust decision tree models. △ Less

Submitted 5 December, 2021; originally announced December 2021.

arXiv:2004.03295 [pdf, other]

Feature Partitioning for Robust Tree Ensembles and their Certification in Adversarial Scenarios

Authors: Stefano Calzavara, Claudio Lucchese, Federico Marcuzzi, Salvatore Orlando

Abstract: Machine learning algorithms, however effective, are known to be vulnerable in adversarial scenarios where a malicious user may inject manipulated instances. In this work we focus on evasion attacks, where a model is trained in a safe environment and exposed to attacks at test time. The attacker aims at finding a minimal perturbation of a test instance that changes the model outcome. We propose a… ▽ More Machine learning algorithms, however effective, are known to be vulnerable in adversarial scenarios where a malicious user may inject manipulated instances. In this work we focus on evasion attacks, where a model is trained in a safe environment and exposed to attacks at test time. The attacker aims at finding a minimal perturbation of a test instance that changes the model outcome. We propose a model-agnostic strategy that builds a robust ensemble by training its basic models on feature-based partitions of the given dataset. Our algorithm guarantees that the majority of the models in the ensemble cannot be affected by the attacker. We experimented the proposed strategy on decision tree ensembles, and we also propose an approximate certification method for tree ensembles that efficiently assess the minimal accuracy of a forest on a given dataset avoiding the costly computation of evasion attacks. Experimental evaluation on publicly available datasets shows that proposed strategy outperforms state-of-the-art adversarial learning algorithms against evasion attacks. △ Less

Submitted 7 April, 2020; originally announced April 2020.

arXiv:1904.11949 [pdf, other]

Machine Learning Tips and Tricks for Power Line Communications

Authors: Andrea M. Tonello, Nunzio A. Letizia, Davide Righini, Francesco Marcuzzi

Abstract: A great deal of attention has been recently given to Machine Learning (ML) techniques in many different application fields. This paper provides a vision of what ML can do in Power Line Communications (PLC). We firstly and briefly describe classical formulations of ML, and distinguish deterministic from statistical learning models with relevance to communications. We then discuss ML applications in… ▽ More A great deal of attention has been recently given to Machine Learning (ML) techniques in many different application fields. This paper provides a vision of what ML can do in Power Line Communications (PLC). We firstly and briefly describe classical formulations of ML, and distinguish deterministic from statistical learning models with relevance to communications. We then discuss ML applications in PLC for each layer, namely, for characterization and modeling, for the development of physical layer algorithms, for media access control and networking. Finally, other applications of PLC that can benefit from the usage of ML, as grid diagnostics, are analyzed. Illustrative numerical examples are reported to serve the purpose of validating the ideas and motivate future research endeavors in this stimulating signal/data processing field. △ Less

Submitted 6 June, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

Comments: Accepted for publication in IEEE Access. 19 pages, 15 figures, 142 references. Added Sec. II-C

arXiv:1812.04303 [pdf, other]

Analytic heuristics for a fast DSC-MRI

Authors: Marco Virgulin, Marco Castellaro, Enrico Grisan, Fabio Marcuzzi

Abstract: In this paper we propose a deterministic approach for the reconstruction of Dynamic Susceptibility Contrast magnetic resonance imaging data and compare it with the compressed sensing solution existing in the literature for the same problem. Our study is based on the mathematical analysis of the problem, which is computationally intractable because of its non polynomial complexity, but suggests sim… ▽ More In this paper we propose a deterministic approach for the reconstruction of Dynamic Susceptibility Contrast magnetic resonance imaging data and compare it with the compressed sensing solution existing in the literature for the same problem. Our study is based on the mathematical analysis of the problem, which is computationally intractable because of its non polynomial complexity, but suggests simple heuristics that perform quite well. We give results on real images and on artificial phantoms with added noise. △ Less

Submitted 11 December, 2018; originally announced December 2018.

MSC Class: 94A08; 65T60;

arXiv:1803.02666 [pdf, other]

Smart Routing for Improved PLC Backhauling of the Radio Access Network

Authors: Francesco Marcuzzi, Andrea M. Tonello

Abstract: Transmission line theory enables the bottom up study of networks based on wireline infrastructures. This technique is here applied to a simulator that brings together powerline communication networks with small radio cells ones in a hybrid paradigm: this allows to implement a study of channel capacity and communication quality based on the geometrical properties of the network and the type of cabl… ▽ More Transmission line theory enables the bottom up study of networks based on wireline infrastructures. This technique is here applied to a simulator that brings together powerline communication networks with small radio cells ones in a hybrid paradigm: this allows to implement a study of channel capacity and communication quality based on the geometrical properties of the network and the type of cables employed to enable connectivity. Results are shown regarding how deterministic properties related to performance in the network can be retrieved and used to enable a smart routing algorithm. △ Less

Submitted 5 February, 2018; originally announced March 2018.

Comments: 2 pages, 2 figures, 11th Workshop on Power Line Communications; September 21, 2017; Praha, Czech Republic

arXiv:1611.02881 [pdf, ps, other]

Statistical Assessment of PLC Networking for Front-Hauling in Small Radio Cells

Authors: Andrea M. Tonello, Francesco Marcuzzi

Abstract: The employment of power lines for communications (PLC) has been theorized almost a century ago; although the physical medium is not meant for data transmission, recent technical developments pushed the capacity boundary much higher than expected, allowing to consider PLC for new applications as the one considered in this contribution, i.e., as a solution for small cell back/front-hauling. In princ… ▽ More The employment of power lines for communications (PLC) has been theorized almost a century ago; although the physical medium is not meant for data transmission, recent technical developments pushed the capacity boundary much higher than expected, allowing to consider PLC for new applications as the one considered in this contribution, i.e., as a solution for small cell back/front-hauling. In principle, back-hauling for cellular networks via PLC is very attractive given how pervasive the power line infrastructure is. Telecom operators are looking for solutions that can handle the traffic increase which is doublying every year. This calls for improvements of both cellular systems and front-hauling technologies that have to bridge the radio network with the core network. Small cells are currently advocated as the solution to spatially fragment the network and offer high capacity in densely populated areas; this is also due to the availability of spectrum at high frequency (mmWave links) which inherently offers large bandwidth but forces the cells to become smaller due to the short distance coverage. Therefore, it appears that mobile operators may eventually have to deploy an order of magnitude more (small) cells compared to existing networks having macro cells. This translates in a high density cellular network that poses more challenges for the back-haul. △ Less

Submitted 9 November, 2016; originally announced November 2016.

Comments: This work has been presented at the 10th workshop on PLC (WSPLC 2016) -- Paris, France, 10-11 October 2016. The paper consists of 4 pages, 4 figures

Showing 1–7 of 7 results for author: Marcuzzi, F