
Showing 1–5 of 5 results for author: Saengkyongam, S

  1. arXiv:2310.04295  [pdf, other]

    cs.LG cs.AI stat.ML

    Identifying Representations for Intervention Extrapolation

    Authors: Sorawit Saengkyongam, Elan Rosenfeld, Pradeep Ravikumar, Niklas Pfister, Jonas Peters

    Abstract: The premise of identifiable and causal representation learning is to improve the current representation learning paradigm in terms of generalizability or robustness. Despite recent progress in questions of identifiability, more theoretical results demonstrating concrete advantages of these methods for downstream tasks are needed. In this paper, we consider the task of intervention extrapolation: p…

    Submitted 5 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted at the International Conference on Learning Representations (ICLR) 2024

  2. arXiv:2306.10983  [pdf, other]

    stat.ML cs.LG

    Effect-Invariant Mechanisms for Policy Generalization

    Authors: Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja, Susan Murphy, Jonas Peters

    Abstract: Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to exploit invariant conditional distributions to learn models that generalize better to unseen environments. However, assuming invariance of entire conditional distributions (which we call f…

    Submitted 27 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  3. arXiv:2202.01864  [pdf, other]

    stat.ML cs.LG stat.ME

    Exploiting Independent Instruments: Identification and Distribution Generalization

    Authors: Sorawit Saengkyongam, Leonard Henckel, Niklas Pfister, Jonas Peters

    Abstract: Instrumental variable models allow us to identify a causal function between covariates $X$ and a response $Y$, even in the presence of unobserved confounding. Most of the existing estimators assume that the error term in the response $Y$ and the hidden confounders are uncorrelated with the instruments $Z$. This is often motivated by a graphical separation, an argument that also justifies independe…

    Submitted 22 September, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Accepted at ICML 2022

  4. arXiv:2106.00808  [pdf, other]

    cs.LG cs.AI stat.ML

    Invariant Policy Learning: A Causal Perspective

    Authors: Sorawit Saengkyongam, Nikolaj Thams, Jonas Peters, Niklas Pfister

    Abstract: Contextual bandit and reinforcement learning algorithms have been successfully used in various interactive learning systems such as online advertising, recommender systems, and dynamic pricing. However, they have yet to be widely adopted in high-stakes application domains, such as healthcare. One reason may be that existing approaches assume that the underlying mechanisms are static in the sense t…

    Submitted 22 September, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

  5. arXiv:1805.08845  [pdf, other]

    stat.ML cs.LG

    Counterfactual Mean Embeddings

    Authors: Krikamol Muandet, Motonobu Kanagawa, Sorawit Saengkyongam, Sanparith Marukatat

    Abstract: Counterfactual inference has become a ubiquitous tool in online advertisement, recommendation systems, medical diagnosis, and econometrics. Accurate modeling of outcome distributions associated with different interventions -- known as counterfactual distributions -- is crucial for the success of these applications. In this work, we propose to model counterfactual distributions using a novel Hilber…

    Submitted 10 July, 2021; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: 71 pages