Skip to main content

Showing 1–9 of 9 results for author: Nisimov, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.20307  [pdf, other

    cs.AI cs.LG

    Causal Interpretation of Self-Attention in Pre-Trained Transformers

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov

    Abstract: We propose a causal interpretation of self-attention in the Transformer neural network architecture. We interpret self-attention as a mechanism that estimates a structural equation model for a given input sequence of symbols (tokens). The structural equation model can be interpreted, in turn, as a causal structure over the input symbols under the specific context of the input sequence. Importantly… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023). arXiv admin note: text overlap with arXiv:2210.10621

  2. arXiv:2306.00624  [pdf, other

    cs.AI cs.LG stat.ML

    From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders

    Authors: Raanan Y. Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik

    Abstract: We present a constraint-based algorithm for learning causal structures from observational time-series data, in the presence of latent confounders. We assume a discrete-time, stationary structural vector autoregressive process, with both temporal and contemporaneous causal relations. One may ask if temporal and contemporaneous relations should be treated differently. The presented algorithm gradual… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Proceedings of the 40-th International Conference on Machine Learning (ICML), 2023

  3. arXiv:2210.10621  [pdf, other

    cs.IR cs.AI cs.LG stat.ML

    CLEAR: Causal Explanations from Attention in Neural Recommenders

    Authors: Shami Nisimov, Raanan Y. Rohekar, Yaniv Gurwicz, Guy Koren, Gal Novik

    Abstract: We present CLEAR, a method for learning session-specific causal graphs, in the possible presence of latent confounders, from attention in pre-trained attention-based recommenders. These causal graphs describe user behavior, within the context captured by attention, and can provide a counterfactual explanation for a recommendation. In essence, these causal graphs allow answering "why" questions uni… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Causality, Counterfactuals and Sequential Decision-Making for Recommender Systems (CONSEQUENCES) workshop at RecSys 2022, Seattle, WA, USA

  4. arXiv:2111.04095  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Iterative Causal Discovery in the Possible Presence of Latent Confounders and Selection Bias

    Authors: Raanan Y. Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik

    Abstract: We present a sound and complete algorithm, called iterative causal discovery (ICD), for recovering causal graphs in the presence of latent confounders and selection bias. ICD relies on the causal Markov and faithfulness assumptions and recovers the equivalence class of the underlying causal graph. It starts with a complete graph, and consists of a single iterative stage that gradually refines this… ▽ More

    Submitted 17 January, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021). arXiv admin note: text overlap with arXiv:2012.07513

  5. arXiv:2107.05001  [pdf, other

    stat.ML cs.AI cs.LG

    Improving Efficiency and Accuracy of Causal Discovery Using a Hierarchical Wrapper

    Authors: Shami Nisimov, Yaniv Gurwicz, Raanan Y. Rohekar, Gal Novik

    Abstract: Causal discovery from observational data is an important tool in many branches of science. Under certain assumptions it allows scientists to explain phenomena, predict, and make decisions. In the large sample limit, sound and complete causal discovery algorithms have been previously introduced, where a directed acyclic graph (DAG), or its equivalence class, representing causal relations is searche… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: The 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021), Workshop on Tractable Probabilistic Modeling

  6. arXiv:2012.07513  [pdf, other

    cs.AI cs.LG stat.ML

    A Single Iterative Step for Anytime Causal Discovery

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Gal Novik

    Abstract: We present a sound and complete algorithm for recovering causal graphs from observed, non-interventional data, in the possible presence of latent confounders and selection bias. We rely on the causal Markov and faithfulness assumptions and recover the equivalence class of the underlying causal graph by performing a series of conditional independence (CI) tests between observed variables. We propos… ▽ More

    Submitted 24 December, 2020; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada, Workshop on Causal Discovery & Causality-Inspired Machine Learning

  7. arXiv:1905.13195  [pdf, other

    stat.ML cs.AI cs.LG

    Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Gal Novik

    Abstract: Modeling uncertainty in deep neural networks, despite recent important advances, is still an open problem. Bayesian neural networks are a powerful solution, where the prior over network weights is a design choice, often a normal distribution or other distribution encouraging sparsity. However, this prior is agnostic to the generative process of the input data, which might lead to unwarranted gener… ▽ More

    Submitted 27 October, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

  8. arXiv:1809.04828  [pdf, other

    stat.ML cs.LG

    Bayesian Structure Learning by Recursive Bootstrap

    Authors: Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Guy Koren, Gal Novik

    Abstract: We address the problem of Bayesian structure learning for domains with hundreds of variables by employing non-parametric bootstrap, recursively. We propose a method that covers both model averaging and model selection in the same framework. The proposed method deals with the main weakness of constraint-based learning---sensitivity to errors in the independence tests---by a novel way of combining b… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

  9. arXiv:1806.09141  [pdf, other

    stat.ML cs.AI cs.LG

    Constructing Deep Neural Networks by Bayesian Network Structure Learning

    Authors: Raanan Y. Rohekar, Shami Nisimov, Yaniv Gurwicz, Guy Koren, Gal Novik

    Abstract: We introduce a principled approach for unsupervised structure learning of deep neural networks. We propose a new interpretation for depth and inter-layer connectivity where conditional independencies in the input distribution are encoded hierarchically in the network structure. Thus, the depth of the network is determined inherently. The proposed method casts the problem of neural network structur… ▽ More

    Submitted 17 October, 2018; v1 submitted 24 June, 2018; originally announced June 2018.