Skip to main content

Showing 1–21 of 21 results for author: Heikkilä, M

.
  1. arXiv:2506.01396  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Mitigating Disparate Impact of Differentially Private Learning through Bounded Adaptive Clipping

    Authors: Linzh Zhao, Aki Rehn, Mikko A. Heikkilä, Razane Tajeddine, Antti Honkela

    Abstract: Differential privacy (DP) has become an essential framework for privacy-preserving machine learning. Existing DP learning methods, however, often have disparate impacts on model predictions, e.g., for minority groups. Gradient clipping, which is often used in DP learning, can suppress larger gradients from challenging samples. We show that this problem is amplified by adaptive clipping, which will… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: NeurIPS 2025 under review. 22 pages, 8 figures

    ACM Class: I.2.6; K.4.2

  2. arXiv:2411.12377  [pdf, other

    cs.LG

    Non-IID data in Federated Learning: A Survey with Taxonomy, Metrics, Methods, Frameworks and Future Directions

    Authors: Daniel M. Jimenez G., David Solans, Mikko Heikkila, Andrea Vitaletti, Nicolas Kourtellis, Aris Anagnostopoulos, Ioannis Chatzigiannakis

    Abstract: Recent advances in machine learning have highlighted Federated Learning (FL) as a promising approach that enables multiple distributed users (so-called clients) to collectively train ML models without sharing their private data. While this privacy-preserving method shows potential, it struggles when data across clients is not independent and identically distributed (non-IID) data. The latter remai… ▽ More

    Submitted 12 December, 2024; v1 submitted 19 November, 2024; originally announced November 2024.

  3. arXiv:2411.04680  [pdf, other

    cs.LG cs.CR

    Differential Privacy in Continual Learning: Which Labels to Update?

    Authors: Marlon Tobaben, Talal Alrawajfeh, Marcus Klasson, Mikko Heikkilä, Arno Solin, Antti Honkela

    Abstract: The goal of continual learning (CL) is to retain knowledge across tasks, but this conflicts with strict privacy required for sensitive training data that prevents storing or memorising individual samples. To address that, we combine CL and differential privacy (DP). We highlight that failing to account for privacy leakage through the set of labels a model can output can break the privacy of otherw… ▽ More

    Submitted 22 May, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: 39 pages, 13 figures

  4. arXiv:2407.19286  [pdf, other

    cs.LG cs.CR

    On Using Secure Aggregation in Differentially Private Federated Learning with Multiple Local Steps

    Authors: Mikko A. Heikkilä

    Abstract: Federated learning is a distributed learning setting where the main aim is to train machine learning models without having to share raw data but only what is required for learning. To guarantee training data privacy and high-utility models, differential privacy and secure aggregation techniques are often combined with federated learning. However, with fine-grained protection granularities, e.g., w… ▽ More

    Submitted 24 March, 2025; v1 submitted 27 July, 2024; originally announced July 2024.

    Comments: 22 pages. Published in TMLR 03/2025: https://openreview.net/forum?id=uxyWlXPuIg

  5. arXiv:2407.15224  [pdf, other

    cs.LG cs.AI cs.CR cs.CY

    PUFFLE: Balancing Privacy, Utility, and Fairness in Federated Learning

    Authors: Luca Corbucci, Mikko A Heikkila, David Solans Noguero, Anna Monreale, Nicolas Kourtellis

    Abstract: Training and deploying Machine Learning models that simultaneously adhere to principles of fairness and privacy while ensuring good utility poses a significant challenge. The interplay between these three factors of trustworthiness is frequently underestimated and remains insufficiently explored. Consequently, many efforts focus on ensuring only two of these factors, neglecting one in the process.… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  6. arXiv:2403.07937  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Speech Robust Bench: A Robustness Benchmark For Speech Recognition

    Authors: Muhammad A. Shah, David Solans Noguero, Mikko A. Heikkila, Bhiksha Raj, Nicolas Kourtellis

    Abstract: As Automatic Speech Recognition (ASR) models become ever more pervasive, it is important to ensure that they make reliable predictions under corruptions present in the physical and digital world. We propose Speech Robust Bench (SRB), a comprehensive benchmark for evaluating the robustness of ASR models to diverse corruptions. SRB is composed of 114 input perturbations which simulate an heterogeneo… ▽ More

    Submitted 9 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: submitted to NeurIPS datasets and benchmark track 2025

  7. arXiv:2402.03088  [pdf, ps, other

    quant-ph

    Locally unitary quantum state evolution is local

    Authors: Matias Heikkilä

    Abstract: We study the localization properties of bipartite channels, whose action on a subsystem yields a unitary channel. In particular we show that, under such channel, the subsystem must evolve independent of its environment. This point of view is another way to verify certain well-known conservation laws of quantum information in a generalized way. A no-go theorem for non classical conditional semantic… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  8. arXiv:2209.11595  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially private partitioned variational inference

    Authors: Mikko A. Heikkilä, Matthew Ashman, Siddharth Swaroop, Richard E. Turner, Antti Honkela

    Abstract: Learning a privacy-preserving model from sensitive data which are distributed across multiple devices is an increasingly important problem. The problem is often formulated in the federated learning context, with the aim of learning a single global model while keeping the data distributed. Moreover, Bayesian learning is a popular approach for modelling, since it naturally supports reliable uncertai… ▽ More

    Submitted 18 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Published in TMLR 04/2023: https://openreview.net/forum?id=55BcghgicI

    Journal ref: Transactions on Machine Learning Research, ISSN 2835-8856, 2023

  9. arXiv:2106.00477  [pdf, other

    cs.CR cs.LG stat.ML

    Tight Accounting in the Shuffle Model of Differential Privacy

    Authors: Antti Koskela, Mikko A. Heikkilä, Antti Honkela

    Abstract: Shuffle model of differential privacy is a novel distributed privacy model based on a combination of local privacy mechanisms and a secure shuffler. It has been shown that the additional randomisation provided by the shuffler improves privacy bounds compared to the purely local mechanisms. Accounting tight bounds, however, is complicated by the complexity brought by the shuffler. The recently prop… ▽ More

    Submitted 31 January, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 21 pages, 5 figures

  10. arXiv:2010.10928  [pdf, other

    cond-mat.mtrl-sci

    Role of ALD Al2O3 surface passivation on the performance of p-type Cu2O thin film transistors

    Authors: Mari Napari, Tahmida N. Huq, David J. Meeth, Mikko J. Heikkilä, Kham M. Niang, Han Wang, Tomi Iivonen, Haiyan Wang, Markku Leskelä, Mikko Ritala, Andrew J. Flewitt, Robert L. Z. Hoye, Judith L. MacManus-Driscol

    Abstract: High-performance p-type oxide thin film transistors (TFTs) have great potential for many semiconductor applications. However, these devices typically suffer from low hole mobility and high off-state currents. We fabricated p-type TFTs with a phase-pure polycrystalline Cu2O semiconductor channel grown by atomic layer deposition (ALD). The TFT switching characteristics were improved by applying a th… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: 25 pages, 6 figures, full paper

  11. arXiv:2007.05553  [pdf, other

    cs.CR cs.DC cs.LG stat.ML

    Differentially private cross-silo federated learning

    Authors: Mikko A. Heikkilä, Antti Koskela, Kana Shimizu, Samuel Kaski, Antti Honkela

    Abstract: Strict privacy is of paramount importance in distributed machine learning. Federated learning, with the main idea of communicating only what is needed for learning, has been recently introduced as a general approach for distributed learning to enhance learning and improve security. However, federated learning by itself does not guarantee any privacy for data subjects. To quantify and control how m… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: 14 pages, 5 figures

  12. arXiv:2006.07735  [pdf, other

    cs.NI eess.SP

    UAV-Aided Interference Assessment for Private 5G NR Deployments: Challenges and Solutions

    Authors: Jani Urama, Richard Wiren, Olga Galinina, Juhani Kauppi, Kimmo Hiltunen, Juha Erkkilä, Fedor Chernogorov, Pentti Eteläaho, Marjo Heikkilä, Johan Torsner, Sergey Andreev, Mikko Valkama

    Abstract: Industrial automation has created a high demand for private 5G networks, the deployment of which calls for an efficient and reliable solution to ensure strict compliance with the regulatory emission limits. While traditional methods for measuring outdoor interference include collecting real-world data by walking or driving, the use of unmanned aerial vehicles (UAVs) offers an attractive alternativ… ▽ More

    Submitted 13 June, 2020; originally announced June 2020.

    Comments: 7 pages, 4 figures

  13. arXiv:2004.14699  [pdf

    eess.SP cs.NI

    A 6G White Paper on Connectivity for Remote Areas

    Authors: Harri Saarnisaari, Sudhir Dixit, Mohamed-Slim Alouini, Abdelaali Chaoub, Marco Giordani, Adrian Kliks, Marja Matinmikko-Blue, Nan Zhang, Anuj Agrawal, Mats Andersson, Vimal Bhatia, Wei Cao, Yunfei Chen, Wei Feng, Marjo Heikkilä, Josep M. Jornet, Luciano Mendes, Heikki Karvonen, Brejesh Lall, Matti Latva-aho, Xiangling Li, Kalle Lähetkangas, Moshe T. Masonta, Alok Pandey, Pekka Pirinen , et al. (9 additional authors not shown)

    Abstract: In many places all over the world rural and remote areas lack proper connectivity that has led to increasing digital divide. These areas might have low population density, low incomes, etc., making them less attractive places to invest and operate connectivity networks. 6G could be the first mobile radio generation truly aiming to close the digital divide. However, in order to do so, special requi… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: A 6G white paper, 17 pages

  14. arXiv:2004.00625  [pdf, other

    astro-ph.EP astro-ph.IM

    Experimental constraints on the ordinary chondrite shock darkening caused by asteroid collisions

    Authors: T. Kohout, E. V. Petrova, G. A. Yakovlev, V. I. Grokhovsky, A. Penttilä, A. Maturilli, J. -G. Moreau, S. V. Berzin, J. Wasiljeff, I. A. Danilenko, D. A. Zamyatin, R. F. Muftakhetdinova, M. Heikkilä

    Abstract: Shock-induced changes in ordinary chondrite meteorites related to impacts or planetary collisions are known to be capable of altering their optical properties. Thus, one can hypothesize that a significant portion of the ordinary chondrite material may be hidden within the observed dark C/X asteroid population. The exact pressure-temperature conditions of the shock-induced darkening are not well co… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

    Journal ref: A&A 639, A146 (2020)

  15. arXiv:1901.10275  [pdf, other

    stat.ML cs.CR cs.LG stat.CO stat.ME

    Differentially Private Markov Chain Monte Carlo

    Authors: Mikko A. Heikkilä, Joonas Jälkö, Onur Dikmen, Antti Honkela

    Abstract: Recent developments in differentially private (DP) machine learning and DP Bayesian learning have enabled learning under strong privacy guarantees for the training data subjects. In this paper, we further extend the applicability of DP Bayesian learning by presenting the first general DP Markov chain Monte Carlo (MCMC) algorithm whose privacy-guarantees are not subject to unrealistic assumptions o… ▽ More

    Submitted 17 June, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: 22 pages, 12 figures

  16. arXiv:1901.10227  [pdf, other

    q-bio.QM cs.CR cs.LG stat.ML

    Representation Transfer for Differentially Private Drug Sensitivity Prediction

    Authors: Teppo Niinimäki, Mikko Heikkilä, Antti Honkela, Samuel Kaski

    Abstract: Motivation: Human genomic datasets often contain sensitive information that limits use and sharing of the data. In particular, simple anonymisation strategies fail to provide sufficient level of protection for genomic data, because the data are inherently identifiable. Differentially private machine learning can help by guaranteeing that the published results do not leak too much information about… ▽ More

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: 12 pages, 5 figures

    Journal ref: Bioinformatics 35(14):i218-i224, 2019

  17. arXiv:1811.05169  [pdf, other

    stat.ME

    Nonparametric geometric outlier detection

    Authors: Matias Heikkilä

    Abstract: Outlier detection is a major topic in robust statistics due to the high practical significance of anomalous observations. Many existing methods are, however, either parametric or cease to perform well when the data is far from linearly structured. In this paper, we propose a quantity, Delaunay outlyingness, that is a nonparametric outlyingness score applicable to data with complicated structure. T… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

  18. arXiv:1806.04907  [pdf, other

    cs.RO

    Kinematics and Dynamic Modeling of a Planar Hydraulic Elastomer Actuator

    Authors: Mahdi Momeni Kelageri, Mikko Heikkila, Jarno Jokinen, Matti Linjama, Reza Ghabcheloo

    Abstract: This paper presents modeling of a compliant 2D manipulator, a so called soft hydraulic/fluidic elastomer actuator. Our focus is on fiber-Reinforced Fluidic Elastomer Actuators (RFEA) driven by a constant pressure hydraulic supply and modulated on/off valves. We present a model that not only provides the dynamics behavior of the system but also the kinematics of the actuator. In addition to that, t… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: 8 pages, 10 figures

  19. arXiv:1806.04894  [pdf, other

    cs.RO

    Design, Fabrication and Control of an Hydraulic Elastomer Actuator

    Authors: Mahdi Momeni Kelageri, Mikko Heikkila, Minna Poikelispaa, Reza Ghabcheloo, Matti Linjama, Jyrki Vuorinen

    Abstract: This paper presents design, fabrication and control of a compliant 2D manipulator, a so called soft actuator. Our focus is on fiber-reinforced elastomer actuators driven by a constant pressure hydraulic supply and modulated on/off valves. For a given diameters, we study the effect of four different elastomer materials and that of number of reinforcement fiber turns on forces generated by the actua… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: 7 pages, 13 Figures

  20. arXiv:1703.01106  [pdf, other

    stat.ML cs.CR cs.LG stat.CO

    Differentially Private Bayesian Learning on Distributed Data

    Authors: Mikko Heikkilä, Eemil Lagerspetz, Samuel Kaski, Kana Shimizu, Sasu Tarkoma, Antti Honkela

    Abstract: Many applications of machine learning, for example in health care, would benefit from methods that can guarantee privacy of data subjects. Differential privacy (DP) has become established as a standard for protecting learning results. The standard DP algorithms require a single trusted party to have access to the entire data, which is a clear weakness. We consider DP Bayesian learning in a distrib… ▽ More

    Submitted 29 May, 2017; v1 submitted 3 March, 2017; originally announced March 2017.

    Comments: 13 pages, 7 figures. Modified text, changed algorithm used, included tests on additional dataset, fixed several errors, added proof of asymptotic efficiency to supplement

  21. arXiv:1511.08627  [pdf, ps, other

    math.ST

    On Asymptotic Properties of the Separating Hill Estimator

    Authors: Matias Heikkilä, Yves Dominicy, Pauliina Ilmonen

    Abstract: Modeling and understanding multivariate extreme events is challenging, but of great importance in various applications - e.g. in biostatistics, climatology, and finance. The separating Hill estimator can be used in estimating the extreme value index of a heavy tailed multivariate elliptical distribution. We consider the asymptotic behavior of the separating Hill estimator under estimated location… ▽ More

    Submitted 12 January, 2016; v1 submitted 27 November, 2015; originally announced November 2015.

    MSC Class: 60G70; 62H12