Showing 1–50 of 126 results for author: Kaski, S

  1. arXiv:2506.07805  [pdf, other]

    stat.ML cs.IT

    Generalization Analysis for Bayesian Optimal Experiment Design under Model Misspecification

    Authors: Roubing Tang, Sabina J. Sloman, Samuel Kaski

    Abstract: In many settings in science and industry, such as drug discovery and clinical trials, a central challenge is designing experiments under time and budget constraints. Bayesian Optimal Experimental Design (BOED) is a paradigm to pick maximally informative designs that has been increasingly applied to such problems. During training, BOED selects inputs according to a pre-determined acquisition criter… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2506.07259  [pdf, ps, other]

    stat.ML cs.LG

    ALINE: Joint Amortization for Bayesian Inference and Active Data Acquisition

    Authors: Daolang Huang, Xinyi Wen, Ayush Bharti, Samuel Kaski, Luigi Acerbi

    Abstract: Many critical applications, from autonomous scientific discovery to personalized medicine, demand systems that can both strategically acquire the most informative data and instantaneously perform inference based upon it. While amortized methods for Bayesian inference and experimental design offer part of the solution, neither approach is optimal in the most general and challenging task, where new… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 27 pages, 13 figures

  3. arXiv:2505.23496  [pdf, other]

    cs.LG stat.ML

    Epistemic Errors of Imperfect Multitask Learners When Distributions Shift

    Authors: Sabina J. Sloman, Michele Caprio, Samuel Kaski

    Abstract: When data are noisy, a statistical learner's goal is to resolve epistemic uncertainty about the data it will encounter at test-time, i.e., to identify the distribution of test (target) data. Many real-world learning settings introduce sources of epistemic uncertainty that cannot be resolved on the basis of training (source) data alone: The source data may arise from multiple tasks (multitask lear… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  4. arXiv:2505.21133  [pdf, other]

    cs.LG stat.ML

    Robust and Computation-Aware Gaussian Processes

    Authors: Marshal Arijona Sinaga, Julien Martinelli, Samuel Kaski

    Abstract: Gaussian processes (GPs) are widely used for regression and optimization tasks such as Bayesian optimization (BO) due to their expressiveness and principled uncertainty estimates. However, in settings with large datasets corrupted by outliers, standard GPs and their sparse approximations struggle with computational tractability and robustness. We introduce Robust Computation-aware Gaussian Process… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  5. arXiv:2503.00924  [pdf, other]

    stat.ML cs.LG

    PABBO: Preferential Amortized Black-Box Optimization

    Authors: Xinyu Zhang, Daolang Huang, Samuel Kaski, Julien Martinelli

    Abstract: Preferential Bayesian Optimization (PBO) is a sample-efficient method to learn latent user utilities from preferential feedback over a pair of designs. It relies on a statistical surrogate model for the latent function, usually a Gaussian process, and an acquisition strategy to select the next candidate pair to get user feedback on. Due to the non-conjugacy of the associated likelihood, every PBO… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: 25 pages, 17 figures. Accepted at the Thirteenth International Conference on Learning Representations (ICLR 2025)

  6. arXiv:2411.03263  [pdf, ps, other]

    cs.LG stat.ML

    Proxy-informed Bayesian transfer learning with unknown sources

    Authors: Sabina J. Sloman, Julien Martinelli, Samuel Kaski

    Abstract: Generalization outside the scope of one's training data requires leveraging prior knowledge about the effects that transfer, and the effects that don't, between different data sources. Transfer learning is a framework for specifying and refining this knowledge about sets of source (training) and target (prediction) data. A challenging open problem is addressing the empirical phenomenon of negative… ▽ More

    Submitted 13 June, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

    Comments: Accepted for UAI 2025

  7. arXiv:2411.02064  [pdf, other]

    stat.ML cs.LG

    Amortized Bayesian Experimental Design for Decision-Making

    Authors: Daolang Huang, Yujia Guo, Luigi Acerbi, Samuel Kaski

    Abstract: Many critical decisions, such as personalized medical diagnoses and product pricing, are made based on insights gained from designing, observing, and analyzing a series of experiments. This highlights the crucial role of experimental design, which goes beyond merely collecting information on system parameters as in traditional Bayesian experimental design (BED), but also plays a key part in facili… ▽ More

    Submitted 2 January, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: 20 pages, 6 figures. Accepted at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  8. arXiv:2410.15320  [pdf, other]

    stat.ML cs.LG

    Amortized Probabilistic Conditioning for Optimization, Simulation and Inference

    Authors: Paul E. Chang, Nasrulloh Loka, Daolang Huang, Ulpu Remes, Samuel Kaski, Luigi Acerbi

    Abstract: Amortized meta-learning methods based on pre-training have propelled fields like natural language processing and vision. Transformer-based neural processes and their variants are leading models for probabilistic meta-learning with a tractable objective. Often trained on synthetic data, these models implicitly capture essential latent information in the data-generation process. However, existing me… ▽ More

    Submitted 4 March, 2025; v1 submitted 20 October, 2024; originally announced October 2024.

    Comments: 39 pages, 24 figures. To appear in the 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025)

  9. arXiv:2410.07930  [pdf, other]

    stat.ML cs.LG stat.CO

    Cost-aware simulation-based inference

    Authors: Ayush Bharti, Daolang Huang, Samuel Kaski, François-Xavier Briol

    Abstract: Simulation-based inference (SBI) is the preferred framework for estimating parameters of intractable models in science and engineering. A significant challenge in this context is the large computational cost of simulating data from complex models, and the fact that this cost often depends on parameter values. We therefore propose cost-aware SBI methods which can significantly reduce the c… ▽ More

    Submitted 17 February, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

  10. arXiv:2410.07890  [pdf, other]

    stat.ML cs.LG

    Identifying latent disease factors differently expressed in patient subgroups using group factor analysis

    Authors: Fabio S. Ferreira, John Ashburner, Arabella Bouzigues, Chatrin Suksasilp, Lucy L. Russell, Phoebe H. Foster, Eve Ferry-Bolder, John C. van Swieten, Lize C. Jiskoot, Harro Seelaar, Raquel Sanchez-Valle, Robert Laforce, Caroline Graff, Daniela Galimberti, Rik Vandenberghe, Alexandre de Mendonca, Pietro Tiraboschi, Isabel Santana, Alexander Gerhard, Johannes Levin, Sandro Sorbi, Markus Otto, Florence Pasquier, Simon Ducharme, Chris R. Butler, et al. (11 additional authors not shown)

    Abstract: In this study, we propose a novel approach to uncover subgroup-specific and subgroup-common latent factors addressing the challenges posed by the heterogeneity of neurological and mental disorders, which hinder disease understanding, treatment development, and outcome prediction. The proposed approach, sparse Group Factor Analysis (GFA) with regularised horseshoe priors, was implemented with proba… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 38 pages, 14 figures

  11. arXiv:2406.03288  [pdf, other]

    cs.LG stat.ML

    Embarrassingly Parallel GFlowNets

    Authors: Tiago da Silva, Luiz Max Carvalho, Amauri Souza, Samuel Kaski, Diego Mesquita

    Abstract: GFlowNets are a promising alternative to MCMC sampling for discrete compositional random variables. Training GFlowNets requires repeated evaluations of the unnormalized target distribution or reward function. However, for large-scale posterior sampling, this may be prohibitive since it incurs traversing the data several times. Moreover, if the data are distributed across clients, employing standar… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  12. arXiv:2405.14657  [pdf, other]

    cs.LG stat.ML

    Heteroscedastic Preferential Bayesian Optimization with Informative Noise Distributions

    Authors: Marshal Arijona Sinaga, Julien Martinelli, Vikas Garg, Samuel Kaski

    Abstract: Preferential Bayesian optimization (PBO) is a sample-efficient framework for learning human preferences between candidate designs. PBO classically relies on homoscedastic noise models to represent human aleatoric uncertainty. Yet, such noise fails to accurately capture the varying levels of human aleatoric uncertainty, particularly when the user possesses partial knowledge among different pairs of… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  13. arXiv:2311.03002  [pdf, other]

    cs.LG stat.ML

    Estimating treatment effects from single-arm trials via latent-variable modeling

    Authors: Manuel Haussmann, Tran Minh Son Le, Viivi Halla-aho, Samu Kurki, Jussi V. Leinonen, Miika Koskinen, Samuel Kaski, Harri Lähdesmäki

    Abstract: Randomized controlled trials (RCTs) are the accepted standard for treatment effect estimation but they can be infeasible due to ethical reasons and prohibitive costs. Single-arm trials, where all patients belong to the treatment group, can be a viable alternative but require access to an external control group. We propose an identifiable deep latent-variable model for this scenario that can also a… ▽ More

    Submitted 4 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Published at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  14. arXiv:2310.14968  [pdf, other]

    cs.LG stat.ML

    Bayesian Active Learning in the Presence of Nuisance Parameters

    Authors: Sabina J. Sloman, Ayush Bharti, Julien Martinelli, Samuel Kaski

    Abstract: In many settings, such as scientific inference, optimization, and transfer learning, the learner has a well-defined objective, which can be treated as estimation of a target parameter, and no intrinsic interest in characterizing the entire data-generating process. Usually, the learner must also contend with additional sources of uncertainty or variables -- with nuisance parameters. Bayesian active… ▽ More

    Submitted 10 June, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted for UAI 2024

  15. arXiv:2310.12595  [pdf, other]

    cs.LG stat.ML

    Bayesian Meta-Learning for Improving Generalizability of Health Prediction Models With Similar Causal Mechanisms

    Authors: Sophie Wharrie, Lisa Eick, Lotta Mäkinen, Andrea Ganna, Samuel Kaski, FinnGen

    Abstract: Machine learning strategies like multi-task learning, meta-learning, and transfer learning enable efficient adaptation of machine learning models to specific applications in healthcare, such as prediction of various diseases, by leveraging generalizable knowledge across large datasets and multiple domains. In particular, Bayesian meta-learning methods pool data across related prediction tasks to l… ▽ More

    Submitted 30 December, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  16. arXiv:2310.11439  [pdf, other]

    cs.LG cs.AI stat.ML

    From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport

    Authors: Quentin Bouniot, Ievgen Redko, Anton Mallasto, Charlotte Laclau, Oliver Struckmeier, Karol Arndt, Markus Heinonen, Ville Kyrki, Samuel Kaski

    Abstract: In the last decade, we have witnessed the introduction of several novel deep neural network (DNN) architectures exhibiting ever-increasing performance across diverse tasks. Explaining the upward trend of their performance, however, remains difficult as different DNN architectures of comparable depth and width -- common factors associated with their expressive power -- may exhibit a drastically dif… ▽ More

    Submitted 6 April, 2025; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Code available at https://github.com/qbouniot/AffScoreDeep

    Journal ref: Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), 2025, pp. 25250-25260

  17. arXiv:2309.12032  [pdf, other]

    cs.LG stat.ML

    Human-in-the-Loop Causal Discovery under Latent Confounding using Ancestral GFlowNets

    Authors: Tiago da Silva, Eliezer Silva, António Góis, Dominik Heider, Samuel Kaski, Diego Mesquita, Adèle Ribeiro

    Abstract: Structure learning is the crux of causal inference. Notably, causal discovery (CD) algorithms are brittle when data is scarce, possibly inferring imprecise causal relations that contradict expert knowledge -- especially when considering latent confounders. To aggravate the issue, most CD methods do not provide uncertainty estimates, making it hard for users to interpret results and improve the inf… ▽ More

    Submitted 1 November, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  18. arXiv:2306.10915  [pdf, other]

    stat.ML cs.LG

    Practical Equivariances via Relational Conditional Neural Processes

    Authors: Daolang Huang, Manuel Haussmann, Ulpu Remes, ST John, Grégoire Clarté, Kevin Sebastian Luck, Samuel Kaski, Luigi Acerbi

    Abstract: Conditional Neural Processes (CNPs) are a class of metalearning models popular for combining the runtime efficiency of amortized inference with reliable uncertainty quantification. Many relevant machine learning tasks, such as in spatio-temporal modeling, Bayesian Optimization and continuous control, inherently contain equivariances -- for example to translation -- which the model can exploit for… ▽ More

    Submitted 5 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 38 pages, 8 figures. Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  19. arXiv:2306.02775  [pdf, other]

    stat.ML cs.LG

    Input-gradient space particle inference for neural network ensembles

    Authors: Trung Trinh, Markus Heinonen, Luigi Acerbi, Samuel Kaski

    Abstract: Deep Ensembles (DEs) demonstrate improved accuracy, calibration and robustness to perturbations over single neural networks partly due to their functional diversity. Particle-based variational inference (ParVI) methods enhance diversity by formalizing a repulsion term based on a network similarity kernel. However, weight-space repulsion is inefficient due to over-parameterization, while direct fun… ▽ More

    Submitted 5 March, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Published at ICLR 2024 (spotlight presentation). Code is available at https://github.com/AaltoPML/FoRDE

  20. arXiv:2305.15871  [pdf, other]

    stat.ML cs.LG stat.CO

    Learning Robust Statistics for Simulation-based Inference under Model Misspecification

    Authors: Daolang Huang, Ayush Bharti, Amauri Souza, Luigi Acerbi, Samuel Kaski

    Abstract: Simulation-based inference (SBI) methods such as approximate Bayesian computation (ABC), synthetic likelihood, and neural posterior estimation (NPE) rely on simulating statistics to infer parameters of intractable likelihood models. However, such methods are known to yield untrustworthy and misleading inference outcomes under model misspecification, thus hindering their widespread applicability. I… ▽ More

    Submitted 5 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 22 pages, 13 figures, Published at NeurIPS 2023

  21. arXiv:2305.14120  [pdf, other]

    cs.LG stat.ML

    Learning Relevant Contextual Variables Within Bayesian Optimization

    Authors: Julien Martinelli, Ayush Bharti, Armi Tiihonen, S. T. John, Louis Filstroff, Sabina J. Sloman, Patrick Rinke, Samuel Kaski

    Abstract: Contextual Bayesian Optimization (CBO) efficiently optimizes black-box functions with respect to design variables, while simultaneously integrating contextual information regarding the environment, such as experimental conditions. However, the relevance of contextual variables is not necessarily known beforehand. Moreover, contextual variables can sometimes be optimized themselves at an additional… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  22. arXiv:2305.11567  [pdf, other]

    cs.LG stat.ML

    TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series

    Authors: Alexander Nikitin, Letizia Iannucci, Samuel Kaski

    Abstract: Temporally indexed data are essential in a wide range of fields and of interest to machine learning researchers. Time series data, however, are often scarce or highly sensitive, which precludes the sharing of data between researchers and industrial organizations and the application of existing and new data-intensive ML methods. A possible solution to this bottleneck is to generate synthetic data.… ▽ More

    Submitted 9 July, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  23. arXiv:2304.05010  [pdf, other]

    stat.AP

    Characterizing personalized effects of family information on disease risk using graph representation learning

    Authors: Sophie Wharrie, Zhiyu Yang, Andrea Ganna, Samuel Kaski

    Abstract: Family history is considered a risk factor for many diseases because it implicitly captures shared genetic, environmental and lifestyle factors. Finland's nationwide electronic health record (EHR) system spanning multiple generations presents new opportunities for studying a connected network of medical histories for entire families. In this work we present a graph-based deep learning approach for… ▽ More

    Submitted 8 August, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Journal ref: Proceedings of the 8th Machine Learning for Healthcare Conference, in Proceedings of Machine Learning Research, 219 (2023) 824-845

  24. arXiv:2301.11674  [pdf, other]

    stat.ME stat.CO stat.ML

    Optimally-Weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference

    Authors: Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, François-Xavier Briol

    Abstract: Likelihood-free inference methods typically make use of a distance between simulated and real data. A common example is the maximum mean discrepancy (MMD), which has previously been used for approximate Bayesian computation, minimum distance estimation, generalised Bayesian inference, and within the nonparametric learning framework. The MMD is commonly estimated at a root-$m$ rate, where $m$ is th… ▽ More

    Submitted 10 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  25. arXiv:2210.15961  [pdf, other]

    stat.ML cs.CR cs.LG

    DPVIm: Differentially Private Variational Inference Improved

    Authors: Joonas Jälkö, Lukas Prediger, Antti Honkela, Samuel Kaski

    Abstract: Differentially private (DP) release of multidimensional statistics typically considers an aggregate sensitivity, e.g. the vector norm of a high-dimensional vector. However, different dimensions of that vector might have widely different magnitudes and therefore DP perturbation disproportionately affects the signal across dimensions. We observe this problem in the gradient release of the DP-SGD alg… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

  26. arXiv:2210.13937  [pdf, other]

    cs.LG stat.ML

    Multi-Fidelity Bayesian Optimization with Unreliable Information Sources

    Authors: Petrus Mikkola, Julien Martinelli, Louis Filstroff, Samuel Kaski

    Abstract: Bayesian optimization (BO) is a powerful framework for optimizing black-box, expensive-to-evaluate functions. Over the past decade, many algorithms have been proposed to integrate cheaper, lower-fidelity approximations of the objective function into the optimization process, with the goal of converging towards the global optimum at a reduced cost. This task is generally referred to as multi-fideli… ▽ More

    Submitted 24 February, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at AISTATS 2023. Code available at https://github.com/AaltoPML/rMFBO

  27. arXiv:2210.06032  [pdf, other]

    cs.LG cs.ET q-bio.BM stat.ML

    Modular Flows: Differential Molecular Generation

    Authors: Yogesh Verma, Samuel Kaski, Markus Heinonen, Vikas Garg

    Abstract: Generating new molecules is fundamental to advancing critical applications such as drug discovery and material synthesis. Flows can generate molecules effectively by inverting the encoding process, however, existing flow models either require artifactual dequantization or specific node/edge orderings, lack desiderata such as permutation invariance, or induce discrepancy between the encoding and th… ▽ More

    Submitted 13 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022. More info at: https://yogeshverma1998.github.io/ModFlow/

  28. arXiv:2206.02435  [pdf, other]

    stat.ML cs.LG

    Tackling covariate shift with node-based Bayesian neural networks

    Authors: Trung Trinh, Markus Heinonen, Luigi Acerbi, Samuel Kaski

    Abstract: Bayesian neural networks (BNNs) promise improved generalization under covariate shift by providing principled probabilistic representations of epistemic uncertainty. However, weight-based BNNs often struggle with high computational complexity of large-scale architectures and datasets. Node-based BNNs have recently been introduced as scalable alternatives, which induce epistemic uncertainty by mult… ▽ More

    Submitted 9 June, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Published at ICML 2022 (long oral presentation). Code is available at https://github.com/AaltoPML/node-BNN-covariate-shift

  29. arXiv:2205.14485  [pdf, other]

    stat.ML cs.CR cs.LG

    Noise-Aware Statistical Inference with Differentially Private Synthetic Data

    Authors: Ossi Räisä, Joonas Jälkö, Samuel Kaski, Antti Honkela

    Abstract: While generation of synthetic data under differential privacy (DP) has received a lot of attention in the data privacy community, analysis of synthetic data has received much less. Existing work has shown that simply analysing DP synthetic data as if it were real does not produce valid inferences of population-level quantities. For example, confidence intervals become too narrow, which we demonstr… ▽ More

    Submitted 24 February, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: 24 pages, 14 figures

  30. arXiv:2205.11136  [pdf, other]

    astro-ph.EP astro-ph.IM stat.AP

    Accounting for stellar activity signals in radial-velocity data by using Change Point Detection techniques

    Authors: U. Simola, A. Bonfanti, X. Dumusque, J. Cisewski-Kehe, S. Kaski, J. Corander

    Abstract: Active regions on the photosphere of a star have been the major obstacle for detecting Earth-like exoplanets using the radial velocity (RV) method. A commonly employed solution for addressing stellar activity is to assume a linear relationship between the RV observations and the activity indicators along the entire time series, and then remove the estimated contribution of activity from the variat… ▽ More

    Submitted 31 May, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: 31 pages, 18 Figures

    Journal ref: A&A 664, A127 (2022)

  31. arXiv:2202.11154  [pdf, other]

    stat.ML cs.LG stat.ME

    Parallel MCMC Without Embarrassing Failures

    Authors: Daniel Augusto de Souza, Diego Mesquita, Samuel Kaski, Luigi Acerbi

    Abstract: Embarrassingly parallel Markov Chain Monte Carlo (MCMC) exploits parallel computing to scale Bayesian inference to large datasets by using a two-step approach. First, MCMC is run in parallel on (sub)posteriors defined on data partitions. Then, a server combines local results. While efficient, this framework is very sensitive to the quality of subposterior sampling. Common sampling problems such as… ▽ More

    Submitted 29 March, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: To appear in the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022). For associated code, see https://github.com/spectraldani/pai/

  32. arXiv:2202.00095  [pdf, other]

    stat.ML cs.LG

    Deconfounded Representation Similarity for Comparison of Neural Networks

    Authors: Tianyu Cui, Yogesh Kumar, Pekka Marttinen, Samuel Kaski

    Abstract: Similarity metrics such as representational similarity analysis (RSA) and centered kernel alignment (CKA) have been used to compare layer-wise representations between neural networks. However, these metrics are confounded by the population structure of data items in the input space, leading to spuriously high similarity for even completely random neural networks and inconsistent domain relations i… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

  33. arXiv:2201.12090  [pdf, other]

    stat.ML cs.LG

    Approximate Bayesian Computation with Domain Expert in the Loop

    Authors: Ayush Bharti, Louis Filstroff, Samuel Kaski

    Abstract: Approximate Bayesian computation (ABC) is a popular likelihood-free inference method for models with intractable likelihood functions. As ABC methods usually rely on comparing summary statistics of observed and simulated data, the choice of the statistics is crucial. This choice involves a trade-off between loss of information and dimensionality reduction, and is often determined based on domain k… ▽ More

    Submitted 20 June, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Accepted for publication at ICML 2022. Code available at https://github.com/lfilstro/HITL-ABC

  34. arXiv:2112.12841  [pdf, other]

    stat.AP stat.ME

    ABC of the Future

    Authors: Henri Pesonen, Umberto Simola, Alvaro Köhn-Luque, Henri Vuollekoski, Xiaoran Lai, Arnoldo Frigessi, Samuel Kaski, David T. Frazier, Worapree Maneesoonthorn, Gael M. Martin, Jukka Corander

    Abstract: Approximate Bayesian computation (ABC) has advanced in two decades from a seminal idea to a practically applicable inference tool for simulator-based statistical models, which are becoming increasingly popular in many research domains. The computational feasibility of ABC for practical applications has been recently boosted by adopting techniques from machine learning to build surrogate models for… ▽ More

    Submitted 3 October, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: 29 pages, 7 figures. Update: added details to some of the sections, corrected typos, and clarified notation

  35. arXiv:2112.01380  [pdf, other]

    stat.ME

    Prior knowledge elicitation: The past, present, and future

    Authors: Petrus Mikkola, Osvaldo A. Martin, Suyog Chandramouli, Marcelo Hartmann, Oriol Abril Pla, Owen Thomas, Henri Pesonen, Jukka Corander, Aki Vehtari, Samuel Kaski, Paul-Christian Bürkner, Arto Klami

    Abstract: Specification of the prior distribution for a Bayesian model is a central part of the Bayesian workflow for data analysis, but it is often difficult even for statistical experts. In principle, prior elicitation transforms domain knowledge of various kinds into well-defined prior distributions, and offers a solution to the prior specification problem. In practice, however, we are still fairly far f… ▽ More

    Submitted 9 May, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 69 pages, 1 figure

  36. arXiv:2111.08524  [pdf, other]

    cs.LG stat.ML

    Non-separable Spatio-temporal Graph Kernels via SPDEs

    Authors: Alexander Nikitin, ST John, Arno Solin, Samuel Kaski

    Abstract: Gaussian processes (GPs) provide a principled and direct approach for inference and learning on graphs. However, the lack of justified graph kernels for spatio-temporal modelling has held back their use in graph problems. We leverage an explicit link between stochastic partial differential equations (SPDEs) and GPs on graphs, introduce a framework for deriving graph kernels via SPDEs, and derive n… ▽ More

    Submitted 27 December, 2024; v1 submitted 16 November, 2021; originally announced November 2021.

  37. arXiv:2111.01555  [pdf, other]

    cs.LG stat.ML

    Likelihood-Free Inference in State-Space Models with Unknown Dynamics

    Authors: Alexander Aushev, Thong Tran, Henri Pesonen, Andrew Howes, Samuel Kaski

    Abstract: Likelihood-free inference (LFI) has been successfully applied to state-space models, where the likelihood of observations is not available but synthetic observations generated by a black-box simulator can be used for inference instead. However, much of the research up to now has been restricted to cases in which a model of state transition dynamics can be formulated in advance and the simulation… ▽ More

    Submitted 20 February, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

  38. arXiv:2110.14426  [pdf, other]

    stat.ML cs.CR cs.LG

    Locally Differentially Private Bayesian Inference

    Authors: Tejas Kulkarni, Joonas Jälkö, Samuel Kaski, Antti Honkela

    Abstract: In recent years, local differential privacy (LDP) has emerged as a technique of choice for privacy-preserving data collection in several scenarios when the aggregator is not trustworthy. LDP provides client-side privacy by adding noise at the user's end. Thus, clients need not rely on the trustworthiness of the aggregator. In this work, we provide a noise-aware probabilistic modeling framework,… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  39. arXiv:2110.03768  [pdf, other]

    stat.ML cs.LG

    De-randomizing MCMC dynamics with the diffusion Stein operator

    Authors: Zheyang Shen, Markus Heinonen, Samuel Kaski

    Abstract: Approximate Bayesian inference estimates descriptors of an intractable target distribution - in essence, an optimization problem within a family of distributions. For example, Langevin dynamics (LD) extracts asymptotically exact samples from a diffusion process because the time evolution of its marginal distributions constitutes a curve that minimizes the KL-divergence via steepest descent in the… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 22 pages, 6 figures. NeurIPS 2021

  40. arXiv:2106.10905  [pdf, other]

    cs.LG stat.ML

    Variational multiple shooting for Bayesian ODEs with Gaussian processes

    Authors: Pashupati Hegde, Çağatay Yıldız, Harri Lähdesmäki, Samuel Kaski, Markus Heinonen

    Abstract: Recent machine learning advances have proposed black-box estimation of unknown continuous-time system dynamics directly from data. However, earlier works are based on approximative ODE solutions or point estimates. We propose a novel Bayesian nonparametric model that uses Gaussian processes to infer posteriors of unknown ODE systems directly from data. We derive sparse variational inference with d… ▽ More

    Submitted 17 July, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Camera-ready version at UAI 2022

  41. arXiv:2106.04193  [pdf, other]

    stat.ML cs.AI cs.LG

    Targeted Active Learning for Bayesian Decision-Making

    Authors: Louis Filstroff, Iiris Sundin, Petrus Mikkola, Aleksei Tiulpin, Juuso Kylmäoja, Samuel Kaski

    Abstract: Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the accuracy is not the end goal when the results are used for decision-making, for example in personalized medicine or economics. We argue that when acquiring samples sequentially, separating learning and decision-making is sub…

    Submitted 20 October, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  42. arXiv:2103.11648  [pdf, other]

    cs.LG cs.CR stat.ML

    D3p -- A Python Package for Differentially-Private Probabilistic Programming

    Authors: Lukas Prediger, Niki Loppi, Samuel Kaski, Antti Honkela

    Abstract: We present d3p, a software package designed to help field runtime-efficient, widely applicable Bayesian inference under differential privacy guarantees. d3p achieves general applicability to a wide range of probabilistic modelling problems by implementing the differentially private variational inference algorithm, allowing users to fit any parametric probabilistic model with a differentiable den…

    Submitted 15 September, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Journal ref: Proceedings on Privacy Enhancing Technologies, 2022(2), 407-425

  43. arXiv:2011.01226  [pdf, other]

    stat.ML cs.LG

    Sample-efficient reinforcement learning using deep Gaussian processes

    Authors: Charles Gadd, Markus Heinonen, Harri Lähdesmäki, Samuel Kaski

    Abstract: Reinforcement learning provides a framework for learning to control which actions to take towards completing a task through trial-and-error. In many applications observing interactions is costly, necessitating sample-efficient learning. In model-based reinforcement learning efficiency is improved by learning to simulate the world dynamics. The challenge is that model inaccuracies rapidly accumulat…

    Submitted 2 November, 2020; originally announced November 2020.

  44. arXiv:2011.00467  [pdf, other]

    cs.LG cs.CR stat.ML

    Differentially Private Bayesian Inference for Generalized Linear Models

    Authors: Tejas Kulkarni, Joonas Jälkö, Antti Koskela, Samuel Kaski, Antti Honkela

    Abstract: Generalized linear models (GLMs) such as logistic regression are among the most widely used arms in a data analyst's repertoire and are often used on sensitive datasets. A large body of prior works that investigate GLMs under differential privacy (DP) constraints provide only private point estimates of the regression coefficients, and are not able to quantify parameter uncertainty. In this work, with lo…

    Submitted 12 May, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

  45. arXiv:2010.13498  [pdf, other]

    stat.ML cs.LG

    Scalable Bayesian neural networks by layer-wise input augmentation

    Authors: Trung Trinh, Samuel Kaski, Markus Heinonen

    Abstract: We introduce implicit Bayesian neural networks, a simple and scalable approach for uncertainty representation in deep learning. The standard Bayesian approach to deep learning requires the impractical inference of the posterior distribution over millions of parameters. Instead, we propose to induce a distribution that captures the uncertainty over neural networks by augmenting each layer's inputs with…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 8 pages

  46. arXiv:2010.09327  [pdf, other]

    cs.LG stat.ML

    Bayesian Inference for Optimal Transport with Stochastic Cost

    Authors: Anton Mallasto, Markus Heinonen, Samuel Kaski

    Abstract: In machine learning and computer vision, optimal transport has had significant success in learning generative models and defining metric distances between structured and stochastic data objects that can be cast as probability measures. The key element of optimal transport is the so-called lifting of an exact cost (distance) function, defined on the sample space, to a cost (distance) betwee…

    Submitted 19 October, 2020; originally announced October 2020.

  47. arXiv:2010.09293  [pdf, other]

    cs.LG cs.CR stat.ML

    Privacy-preserving Data Sharing on Vertically Partitioned Data

    Authors: Razane Tajeddine, Joonas Jälkö, Samuel Kaski, Antti Honkela

    Abstract: In this work, we introduce a differentially private method for generating synthetic data from vertically partitioned data, i.e., where data of the same individuals is distributed across multiple data holders or parties. We present a differentially private stochastic gradient descent (DP-SGD) algorithm to train a mixture model over such partitioned data using variational inference. We modify…

    Submitted 2 September, 2022; v1 submitted 19 October, 2020; originally announced October 2020.

  48. arXiv:2009.06227  [pdf, other]

    cs.LG cs.MA stat.ML

    Teaching to Learn: Sequential Teaching of Agents with Inner States

    Authors: Mustafa Mert Celikok, Pierre-Alexandre Murena, Samuel Kaski

    Abstract: In sequential machine teaching, a teacher's objective is to provide the optimal sequence of inputs to sequential learners in order to guide them towards the best model. In this paper we extend this setting from current static one-data-set analyses to learners which change their learning algorithm or latent state to improve during learning, and to generalize to new datasets. We introduce a multi-ag…

    Submitted 14 September, 2020; originally announced September 2020.

  49. arXiv:2007.05553  [pdf, other]

    cs.CR cs.DC cs.LG stat.ML

    Differentially private cross-silo federated learning

    Authors: Mikko A. Heikkilä, Antti Koskela, Kana Shimizu, Samuel Kaski, Antti Honkela

    Abstract: Strict privacy is of paramount importance in distributed machine learning. Federated learning, with the main idea of communicating only what is needed for learning, has been recently introduced as a general approach for distributed learning to enhance learning and improve security. However, federated learning by itself does not guarantee any privacy for data subjects. To quantify and control how m…

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: 14 pages, 5 figures

  50. arXiv:2006.10571  [pdf, other]

    cs.LG stat.ML

    Likelihood-Free Inference with Deep Gaussian Processes

    Authors: Alexander Aushev, Henri Pesonen, Markus Heinonen, Jukka Corander, Samuel Kaski

    Abstract: In recent years, surrogate models have been successfully used in likelihood-free inference to decrease the number of simulator evaluations. The current state-of-the-art performance for this task has been achieved by Bayesian Optimization with Gaussian Processes (GPs). While this combination works well for unimodal target distributions, it restricts the flexibility and applicability of Bayesia…

    Submitted 5 October, 2021; v1 submitted 18 June, 2020; originally announced June 2020.