Showing 1–16 of 16 results for author: Gurwicz, Y

  1. arXiv:2506.09891

    cs.LG cs.AI cs.CE physics.ao-ph

    Causal Climate Emulation with Bayesian Filtering

    Authors: Sebastian Hickman, Ilija Trajkovic, Julia Kaltenborn, Francis Pelletier, Alex Archibald, Yaniv Gurwicz, Peer Nowack, David Rolnick, Julien Boussard

    Abstract: Traditional models of climate change use complex systems of coupled equations to simulate physical processes across the Earth system. These simulations are highly computationally expensive, limiting our predictions of climate change and analyses of its causes and effects. Machine learning has the potential to quickly emulate data from climate models, but current approaches are not able to incorpor…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 32 pages, 21 figures

  2. arXiv:2412.07446

    cs.AI cs.CL cs.LG stat.ML

    A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Sungduk Yu, Estelle Aflalo, Vasudev Lal

    Abstract: Do generative pre-trained transformer (GPT) models, trained only to predict the next token, implicitly learn a world model from which a sequence is generated one token at a time? We address this question by deriving a causal interpretation of the attention mechanism in GPT, and suggesting a causal world model that arises from this interpretation. Furthermore, we propose that GPT models, at inferen…

    Submitted 2 May, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: International Conference on Machine Learning (ICML), 2025

  3. arXiv:2410.07013

    cs.LG

    Causal Representation Learning in Temporal Data via Single-Parent Decoding

    Authors: Philippe Brouillard, Sébastien Lachapelle, Julia Kaltenborn, Yaniv Gurwicz, Dhanya Sridhar, Alexandre Drouin, Peer Nowack, Jakob Runge, David Rolnick

    Abstract: Scientific research often seeks to understand the causal structure underlying high-level variables in a system. For example, climate scientists study how phenomena, such as El Niño, affect other climate processes at remote locations across the globe. However, scientists typically collect low-level measurements, such as geographically distributed temperature readings. From these, one needs to learn…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 33 pages, 17 figures

  4. arXiv:2408.15993

    cs.CV cs.LG physics.ao-ph

    ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution

    Authors: Sungduk Yu, Brian L. White, Anahita Bhiwandiwalla, Musashi Hinck, Matthew Lyle Olson, Yaniv Gurwicz, Raanan Y. Rohekar, Tung Nguyen, Vasudev Lal

    Abstract: Detecting and attributing temperature increases driven by climate change is crucial for understanding global warming and informing adaptation strategies. However, distinguishing human-induced climate signals from natural variability remains challenging for traditional detection and attribution (D&A) methods, which rely on identifying specific "fingerprints" -- spatial patterns expected to emerge f…

    Submitted 10 March, 2025; v1 submitted 28 August, 2024; originally announced August 2024.

  5. arXiv:2404.03118

    cs.CV

    LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models

    Authors: Gabriela Ben Melech Stan, Estelle Aflalo, Raanan Yehezkel Rohekar, Anahita Bhiwandiwalla, Shao-Yen Tseng, Matthew Lyle Olson, Yaniv Gurwicz, Chenfei Wu, Nan Duan, Vasudev Lal

    Abstract: In the rapidly evolving landscape of artificial intelligence, multi-modal large language models are emerging as a significant area of interest. These models, which combine various forms of data input, are becoming increasingly popular. However, understanding their internal mechanisms remains a complex task. Numerous advancements have been made in the field of explainability tools and mechanisms, y…

    Submitted 24 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  6. arXiv:2312.02858

    cs.LG cs.AI physics.ao-ph stat.ME

    Towards Causal Representations of Climate Model Data

    Authors: Julien Boussard, Chandni Nagda, Julia Kaltenborn, Charlotte Emilie Elektra Lange, Philippe Brouillard, Yaniv Gurwicz, Peer Nowack, David Rolnick

    Abstract: Climate models, such as Earth system models (ESMs), are crucial for simulating future climate change based on projected Shared Socioeconomic Pathways (SSP) greenhouse gas emissions scenarios. While ESMs are sophisticated and invaluable, machine learning-based emulators trained on existing simulation data can project additional climate scenarios much faster and are computationally efficient. Howeve…

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  7. arXiv:2311.03721

    cs.LG cs.AI cs.CE physics.ao-ph

    ClimateSet: A Large-Scale Climate Model Dataset for Machine Learning

    Authors: Julia Kaltenborn, Charlotte E. E. Lange, Venkatesh Ramesh, Philippe Brouillard, Yaniv Gurwicz, Chandni Nagda, Jakob Runge, Peer Nowack, David Rolnick

    Abstract: Climate models have been key for assessing the impact of climate change and simulating future climate scenarios. The machine learning (ML) community has taken an increased interest in supporting climate scientists' efforts on various tasks such as climate model emulation, downscaling, and prediction tasks. Many of those tasks have been addressed on datasets created with single climate models. Howe…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: To be published in the 37th Conference on Neural Information Processing Systems (NeurIPS 2023): Track on Datasets and Benchmarks. Project website: https://climateset.github.io/

  8. arXiv:2310.20307

    cs.AI cs.LG

    Causal Interpretation of Self-Attention in Pre-Trained Transformers

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov

    Abstract: We propose a causal interpretation of self-attention in the Transformer neural network architecture. We interpret self-attention as a mechanism that estimates a structural equation model for a given input sequence of symbols (tokens). The structural equation model can be interpreted, in turn, as a causal structure over the input symbols under the specific context of the input sequence. Importantly…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023). arXiv admin note: text overlap with arXiv:2210.10621

  9. arXiv:2306.00624

    cs.AI cs.LG stat.ML

    From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders

    Authors: Raanan Y. Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik

    Abstract: We present a constraint-based algorithm for learning causal structures from observational time-series data, in the presence of latent confounders. We assume a discrete-time, stationary structural vector autoregressive process, with both temporal and contemporaneous causal relations. One may ask if temporal and contemporaneous relations should be treated differently. The presented algorithm gradual…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Proceedings of the 40th International Conference on Machine Learning (ICML), 2023

  10. arXiv:2210.10621

    cs.IR cs.AI cs.LG stat.ML

    CLEAR: Causal Explanations from Attention in Neural Recommenders

    Authors: Shami Nisimov, Raanan Y. Rohekar, Yaniv Gurwicz, Guy Koren, Gal Novik

    Abstract: We present CLEAR, a method for learning session-specific causal graphs, in the possible presence of latent confounders, from attention in pre-trained attention-based recommenders. These causal graphs describe user behavior, within the context captured by attention, and can provide a counterfactual explanation for a recommendation. In essence, these causal graphs allow answering "why" questions uni…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Causality, Counterfactuals and Sequential Decision-Making for Recommender Systems (CONSEQUENCES) workshop at RecSys 2022, Seattle, WA, USA

  11. arXiv:2111.04095

    cs.LG cs.AI stat.ME stat.ML

    Iterative Causal Discovery in the Possible Presence of Latent Confounders and Selection Bias

    Authors: Raanan Y. Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik

    Abstract: We present a sound and complete algorithm, called iterative causal discovery (ICD), for recovering causal graphs in the presence of latent confounders and selection bias. ICD relies on the causal Markov and faithfulness assumptions and recovers the equivalence class of the underlying causal graph. It starts with a complete graph, and consists of a single iterative stage that gradually refines this…

    Submitted 17 January, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021). arXiv admin note: text overlap with arXiv:2012.07513

  12. arXiv:2107.05001

    stat.ML cs.AI cs.LG

    Improving Efficiency and Accuracy of Causal Discovery Using a Hierarchical Wrapper

    Authors: Shami Nisimov, Yaniv Gurwicz, Raanan Y. Rohekar, Gal Novik

    Abstract: Causal discovery from observational data is an important tool in many branches of science. Under certain assumptions it allows scientists to explain phenomena, predict, and make decisions. In the large sample limit, sound and complete causal discovery algorithms have been previously introduced, where a directed acyclic graph (DAG), or its equivalence class, representing causal relations is searche…

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: The 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021), Workshop on Tractable Probabilistic Modeling

  13. arXiv:2012.07513

    cs.AI cs.LG stat.ML

    A Single Iterative Step for Anytime Causal Discovery

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Gal Novik

    Abstract: We present a sound and complete algorithm for recovering causal graphs from observed, non-interventional data, in the possible presence of latent confounders and selection bias. We rely on the causal Markov and faithfulness assumptions and recover the equivalence class of the underlying causal graph by performing a series of conditional independence (CI) tests between observed variables. We propos…

    Submitted 24 December, 2020; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada, Workshop on Causal Discovery & Causality-Inspired Machine Learning

  14. arXiv:1905.13195

    stat.ML cs.AI cs.LG

    Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Gal Novik

    Abstract: Modeling uncertainty in deep neural networks, despite recent important advances, is still an open problem. Bayesian neural networks are a powerful solution, where the prior over network weights is a design choice, often a normal distribution or other distribution encouraging sparsity. However, this prior is agnostic to the generative process of the input data, which might lead to unwarranted gener…

    Submitted 27 October, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

  15. arXiv:1809.04828

    stat.ML cs.LG

    Bayesian Structure Learning by Recursive Bootstrap

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Guy Koren, Gal Novik

    Abstract: We address the problem of Bayesian structure learning for domains with hundreds of variables by employing non-parametric bootstrap, recursively. We propose a method that covers both model averaging and model selection in the same framework. The proposed method deals with the main weakness of constraint-based learning---sensitivity to errors in the independence tests---by a novel way of combining b…

    Submitted 13 September, 2018; originally announced September 2018.

  16. arXiv:1806.09141

    stat.ML cs.AI cs.LG

    Constructing Deep Neural Networks by Bayesian Network Structure Learning

    Authors: Raanan Y. Rohekar, Shami Nisimov, Yaniv Gurwicz, Guy Koren, Gal Novik

    Abstract: We introduce a principled approach for unsupervised structure learning of deep neural networks. We propose a new interpretation for depth and inter-layer connectivity where conditional independencies in the input distribution are encoded hierarchically in the network structure. Thus, the depth of the network is determined inherently. The proposed method casts the problem of neural network structur…

    Submitted 17 October, 2018; v1 submitted 24 June, 2018; originally announced June 2018.