Showing 1–50 of 51 results for author: Ravanbakhsh, S

  1. arXiv:2505.12049  [pdf, ps, other]

    cs.LG cs.AI

    Beyond Scalar Rewards: An Axiomatic Framework for Lexicographic MDPs

    Authors: Mehran Shakerinava, Siamak Ravanbakhsh, Adam Oberman

    Abstract: Recent work has formalized the reward hypothesis through the lens of expected utility theory, by interpreting reward as utility. Hausner's foundational work showed that dropping the continuity axiom leads to a generalization of expected utility theory where utilities are lexicographically ordered vectors of arbitrary dimension. In this paper, we extend this result by identifying a simple and pract…

    Submitted 17 May, 2025; originally announced May 2025.

  2. arXiv:2503.10834  [pdf, other]

    stat.ML cs.LG

    On the Identifiability of Causal Abstractions

    Authors: Xiusi Li, Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Causal representation learning (CRL) enhances machine learning models' robustness and generalizability by learning structural causal models associated with data-generating processes. We focus on a family of CRL methods that uses contrastive data pairs in the observable space, generated before and after a random, unknown intervention, to identify the latent causal model. (Brehmer et al., 2022) show…

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 15 pages, 4 figures, published in AISTATS 2025

  3. arXiv:2502.03638  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models

    Authors: Daniel Levy, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Qiang Zhu, Kin Long Kelvin Lee, Mikhail Galkin, Santiago Miret, Siamak Ravanbakhsh

    Abstract: Generating novel crystalline materials has the potential to lead to advancements in fields such as electronics, energy storage, and catalysis. The defining characteristic of crystals is their symmetry, which plays a central role in determining their physical properties. However, existing crystal generation methods either fail to generate materials that display the symmetries of real-world crystals…

    Submitted 23 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 24 pages, 10 figures, International Conference on Learning Representations (ICLR) 2025

  4. arXiv:2501.07773  [pdf, other]

    cs.LG

    Symmetry-Aware Generative Modeling through Learned Canonicalization

    Authors: Kusha Sareen, Daniel Levy, Arnab Kumar Mondal, Sékou-Oumar Kaba, Tara Akhound-Sadegh, Siamak Ravanbakhsh

    Abstract: Generative modeling of symmetric densities has a range of applications in AI for science, from drug discovery to physics simulations. The existing generative modeling paradigm for invariant densities combines an invariant prior with an equivariant generative process. However, we observe that this technique is not necessary and has several drawbacks resulting from the limitations of equivariant net…

    Submitted 3 February, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: NeurReps 2024 Workshop Version

  5. arXiv:2410.01312  [pdf, other]

    cs.LG

    Sampling from Energy-based Policies using Diffusion

    Authors: Vineet Jain, Tara Akhound-Sadegh, Siamak Ravanbakhsh

    Abstract: Energy-based policies offer a flexible framework for modeling complex, multimodal behaviors in reinforcement learning (RL). In maximum entropy RL, the optimal policy is a Boltzmann distribution derived from the soft Q-function, but direct sampling from this distribution in continuous action spaces is computationally intractable. As a result, existing methods typically use simpler parametric distri…

    Submitted 2 October, 2024; originally announced October 2024.

  6. arXiv:2402.06121  [pdf, other]

    cs.LG stat.ML

    Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

    Authors: Tara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong

    Abstract: Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient -- and…

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024. Code for iDEM is available at https://github.com/jarridrb/dem

  7. arXiv:2402.04821  [pdf, other]

    cs.LG

    E(3)-Equivariant Mesh Neural Networks

    Authors: Thuan Trang, Nhat Khang Ngo, Daniel Levy, Thieu N. Vo, Siamak Ravanbakhsh, Truong Son Hy

    Abstract: Triangular meshes are widely used to represent three-dimensional objects. As a result, many recent works have addressed the need for geometric deep learning on 3D meshes. However, we observe that the complexities in many of these architectures do not translate to practical performance, and simple deep models for geometric graphs are competitive in practice. Motivated by this observation, we minimall…

    Submitted 18 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  8. arXiv:2312.09016  [pdf, other]

    cs.LG stat.ML

    Symmetry Breaking and Equivariant Neural Networks

    Authors: Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Using symmetry as an inductive bias in deep learning has been proven to be a principled approach for sample-efficient model design. However, the relationship between symmetry and the imperative for equivariance in neural networks is not always obvious. Here, we analyze a key limitation that arises in equivariant functions: their incapacity to break symmetry at the level of individual data samples.…

    Submitted 22 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 14 pages, 2 figures, Symmetry and Geometry in Neural Representations

  9. arXiv:2311.04293  [pdf, other]

    cs.LG

    Lie Point Symmetry and Physics Informed Networks

    Authors: Tara Akhound-Sadegh, Laurence Perreault-Levasseur, Johannes Brandstetter, Max Welling, Siamak Ravanbakhsh

    Abstract: Symmetries have been leveraged to improve the generalization of neural networks through different mechanisms from data augmentation to equivariant architectures. However, despite their potential, their integration into neural solvers for partial differential equations (PDEs) remains largely unexplored. We explore the integration of PDE symmetries, known as Lie point symmetries, in a major family o…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  10. arXiv:2311.03096  [pdf, other]

    cs.LG stat.ML

    Weight-Sharing Regularization

    Authors: Mehran Shakerinava, Motahareh Sohrabi, Siamak Ravanbakhsh, Simon Lacoste-Julien

    Abstract: Weight-sharing is ubiquitous in deep learning. Motivated by this, we propose a "weight-sharing regularization" penalty on the weights $w \in \mathbb{R}^d$ of a neural network, defined as $\mathcal{R}(w) = \frac{1}{d - 1}\sum_{i > j}^d |w_i - w_j|$. We study the proximal mapping of $\mathcal{R}$ and provide an intuitive interpretation of it in terms of a physical system of interacting particles. We…

    Submitted 10 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Our code is available at https://github.com/motahareh-sohrabi/weight-sharing-regularization

  11. arXiv:2310.02505  [pdf, other]

    cs.LG cs.AI

    Learning to Reach Goals via Diffusion

    Authors: Vineet Jain, Siamak Ravanbakhsh

    Abstract: We present a novel perspective on goal-conditioned reinforcement learning by framing it within the context of denoising diffusion models. Analogous to the diffusion process, where Gaussian noise is used to create random trajectories that walk away from the data manifold, we construct trajectories that move away from potential goal states. We then learn a goal-conditioned policy to reverse these de…

    Submitted 26 October, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

  12. arXiv:2310.01647  [pdf, other]

    cs.LG

    Equivariant Adaptation of Large Pretrained Models

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Sai Rajeswar, Siamak Ravanbakhsh

    Abstract: Equivariant networks are specifically designed to ensure consistent behavior with respect to a set of input transformations, leading to higher sample efficiency and more accurate and robust predictions. However, redesigning each component of prevalent deep neural network architectures to achieve chosen equivariance is a difficult problem and can result in a computationally expensive network during…

    Submitted 29 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 17 pages, 6 figures. Accepted to NeurIPS 2023

  13. arXiv:2309.03139  [pdf, other]

    cs.LG

    Using Multiple Vector Channels Improves E(n)-Equivariant Graph Neural Networks

    Authors: Daniel Levy, Sékou-Oumar Kaba, Carmelo Gonzales, Santiago Miret, Siamak Ravanbakhsh

    Abstract: We present a natural extension to E(n)-equivariant graph neural networks that uses multiple equivariant vectors per node. We formulate the extension and show that it improves performance across different physical systems benchmark tasks, with minimal differences in runtime or number of parameters. The proposed multichannel EGNN outperforms the standard singlechannel EGNN on N-body charged particle…

    Submitted 6 September, 2023; originally announced September 2023.

  14. arXiv:2306.11941  [pdf, other]

    cs.LG cs.AI

    Efficient Dynamics Modeling in Interactive Environments with Koopman Theory

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sai Rajeswar, Kaleem Siddiqi, Siamak Ravanbakhsh

    Abstract: The accurate modeling of dynamics in interactive environments is critical for successful long-range prediction. Such a capability could advance Reinforcement Learning (RL) and Planning algorithms, but achieving it is challenging. Inaccuracies in model estimates can compound, resulting in increased errors over long horizons. We approach this problem from the lens of Koopman theory, where the nonlin…

    Submitted 12 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2024 and EWRL 2023

  15. arXiv:2305.18593  [pdf, other]

    cs.LG cs.AI

    On Diffusion Modeling for Anomaly Detection

    Authors: Victor Livernoche, Vineet Jain, Yashar Hezaveh, Siamak Ravanbakhsh

    Abstract: Known for their impressive performance in generative modeling, diffusion models are attractive candidates for density-based anomaly detection. This paper investigates different variations of diffusion modeling for unsupervised and semi-supervised anomaly detection. In particular, we find that Denoising Diffusion Probabilistic Models (DDPM) are performant on anomaly detection benchmarks yet computati…

    Submitted 24 March, 2025; v1 submitted 29 May, 2023; originally announced May 2023.

  16. arXiv:2212.05596  [pdf, ps, other]

    astro-ph.GA

    Galaxies on graph neural networks: towards robust synthetic galaxy catalogs with deep generative models

    Authors: Yesukhei Jagvaral, Francois Lanusse, Sukhdeep Singh, Rachel Mandelbaum, Siamak Ravanbakhsh, Duncan Campbell

    Abstract: The future astronomical imaging surveys are set to provide precise constraints on cosmological parameters, such as dark energy. However, production of synthetic data for these surveys, to test and validate analysis methods, suffers from a very high computational cost. In particular, generating mock galaxy catalogs at sufficiently large volume and high resolution will soon become computationally un…

    Submitted 11 December, 2022; originally announced December 2022.

    Comments: Accepted as extended abstract at ICML 2022 Workshop on Machine Learning for Astrophysics. Condensed version of arXiv:2204.07077

  17. arXiv:2211.15420  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    Equivariant Networks for Crystal Structures

    Authors: Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Supervised learning with deep models has tremendous potential for applications in materials science. Recently, graph neural networks have been used in this context, drawing direct inspiration from models for molecules. However, materials are typically much more structured than molecules, which is a feature that these models do not leverage. In this work, we introduce a class of models that are equ…

    Submitted 15 January, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 10 pages, 4 figures + appendix

  18. arXiv:2211.06489  [pdf, other]

    cs.LG cs.AI

    Equivariance with Learned Canonicalization Functions

    Authors: Sékou-Oumar Kaba, Arnab Kumar Mondal, Yan Zhang, Yoshua Bengio, Siamak Ravanbakhsh

    Abstract: Symmetry-based neural networks often constrain the architecture in order to achieve invariance or equivariance to a group of transformations. In this paper, we propose an alternative that avoids this architectural constraint by learning to produce canonical representations of the data. These canonicalization functions can readily be plugged into non-equivariant backbone architectures. We offer exp…

    Submitted 7 July, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 21 pages, 5 figures

  19. arXiv:2210.14927  [pdf, other]

    astro-ph.IM astro-ph.CO

    Characterization Of Inpaint Residuals In Interferometric Measurements of the Epoch Of Reionization

    Authors: Michael Pagano, Jing Liu, Adrian Liu, Nicholas S. Kern, Aaron Ewall-Wice, Philip Bull, Robert Pascua, Siamak Ravanbakhsh, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, et al. (53 additional authors not shown)

    Abstract: Radio Frequency Interference (RFI) is one of the systematic challenges preventing 21cm interferometric instruments from detecting the Epoch of Reionization. To mitigate the effects of RFI on data analysis pipelines, numerous inpaint techniques have been developed to restore RFI corrupted data. We examine the qualitative and quantitative errors introduced into the visibilities and power spectrum du…

    Submitted 20 February, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: 21 pages, 13 figures

  20. arXiv:2206.13637  [pdf, ps, other]

    cs.AI cs.GT cs.LG

    Utility Theory for Sequential Decision Making

    Authors: Mehran Shakerinava, Siamak Ravanbakhsh

    Abstract: The von Neumann-Morgenstern (VNM) utility theorem shows that under certain axioms of rationality, decision-making is reduced to maximizing the expectation of some utility function. We extend these axioms to increasingly structured sequential decision making settings and identify the structure of the corresponding utility functions. In particular, we show that memoryless preferences lead to a utili…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  21. Galaxies and Halos on Graph Neural Networks: Deep Generative Modeling Scalar and Vector Quantities for Intrinsic Alignment

    Authors: Yesukhei Jagvaral, François Lanusse, Sukhdeep Singh, Rachel Mandelbaum, Siamak Ravanbakhsh, Duncan Campbell

    Abstract: In order to prepare for the upcoming wide-field cosmological surveys, large simulations of the Universe with realistic galaxy populations are required. In particular, the tendency of galaxies to naturally align towards overdensities, an effect called intrinsic alignments (IA), can be a major source of systematics in the weak lensing analysis. As the details of galaxy formation and evolution releva…

    Submitted 22 July, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: 15 pages, 7 figures (+2 figures in Appendix), matches published version at MNRAS

  22. arXiv:2203.13913  [pdf, other]

    cs.LG cs.AI cs.DS cs.NE stat.ML

    SpeqNets: Sparsity-aware Permutation-equivariant Graph Networks

    Authors: Christopher Morris, Gaurav Rattan, Sandra Kiefer, Siamak Ravanbakhsh

    Abstract: While (message-passing) graph neural networks have clear limitations in approximating permutation-equivariant functions over graphs or general relational data, more expressive, higher-order graph neural networks do not scale to large graphs. They either operate on $k$-order tensors or consider all $k$-node subgraphs, implying an exponential dependence on $k$ in memory requirements, and do not adap…

    Submitted 30 August, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: ICML 2022, fixed typo in Observation 1

  23. arXiv:2202.10930  [pdf, other]

    cs.LG cs.AI

    Transformation Coding: Simple Objectives for Equivariant Representations

    Authors: Mehran Shakerinava, Arnab Kumar Mondal, Siamak Ravanbakhsh

    Abstract: We present a simple non-generative approach to deep representation learning that seeks equivariant deep embedding through simple objectives. In contrast to existing equivariant networks, our transformation coding approach does not constrain the choice of the feed-forward layer or the architecture and allows for an unknown group action on the input space. We introduce several such transformation co…

    Submitted 18 February, 2022; originally announced February 2022.

  24. arXiv:2106.06662  [pdf, other]

    cs.LG

    Equivariant Networks for Pixelized Spheres

    Authors: Mehran Shakerinava, Siamak Ravanbakhsh

    Abstract: Pixelizations of Platonic solids such as the cube and icosahedron have been widely used to represent spherical data, from climate records to Cosmic Microwave Background maps. Platonic solids have well-known global symmetries. Once we pixelize each face of the solid, each face also possesses its own local symmetries in the form of Euclidean isometries. One way to combine these symmetries is through…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: Accepted to ICML 2021

  25. arXiv:2102.08382  [pdf, other]

    astro-ph.CO astro-ph.IM

    Recovering the Wedge Modes Lost to 21-cm Foregrounds

    Authors: Samuel Gagnon-Hartman, Yue Cui, Jacob Kennedy, Adrian Liu, Siamak Ravanbakhsh

    Abstract: One of the critical challenges facing imaging studies of the 21-cm signal at the Epoch of Reionization (EoR) is the separation of astrophysical foreground contamination. These foregrounds are known to lie in a wedge-shaped region of $(k_{\perp},k_{\parallel})$ Fourier space. Removing these Fourier modes excises the foregrounds at grave expense to image fidelity, since the cosmological information…

    Submitted 26 February, 2024; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: 15 pages, 15 figures, 2 tables. Appended erratum in v5 to describe necessary adjustment to reproduce performance in the face of overfitting, updated to match erratum accepted by MNRAS

    Journal ref: MNRAS 504, 4716 (2021)

  26. Deep Generative Models for Galaxy Image Simulations

    Authors: Francois Lanusse, Rachel Mandelbaum, Siamak Ravanbakhsh, Chun-Liang Li, Peter Freeman, Barnabas Poczos

    Abstract: Image simulations are essential tools for preparing and validating the analysis of current and future wide-field optical surveys. However, the galaxy models used as the basis for these simulations are typically limited to simple parametric light profiles, or use a fairly limited amount of available space-based data. In this work, we propose a methodology based on Deep Generative Models to create c…

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: 14 pages, submitted to MNRAS. Comments most welcome

  27. arXiv:2006.03627  [pdf, other]

    cs.LG cs.CV math.GR stat.ML

    Equivariant Maps for Hierarchical Structures

    Authors: Renhao Wang, Marjan Albooyeh, Siamak Ravanbakhsh

    Abstract: While using invariant and equivariant maps, it is possible to apply deep learning to a range of primitive data structures, a formalism for dealing with hierarchy is lacking. This is a significant issue because many practical structures are hierarchies of simple building blocks; some examples include sequences of sets, graphs of graphs, or multiresolution images. Observing that the symmetry of a hi…

    Submitted 23 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

  28. arXiv:2002.02912  [pdf, other]

    cs.LG cs.NE math.GR stat.ML

    Universal Equivariant Multilayer Perceptrons

    Authors: Siamak Ravanbakhsh

    Abstract: Group invariant and equivariant Multilayer Perceptrons (MLP), also known as Equivariant Networks, have achieved remarkable success in learning on a variety of data structures, such as sequences, images, sets, and graphs. Using tools from group theory, this paper proves the universality of a broad class of equivariant MLPs with a single hidden layer. In particular, it is shown that having a hidden…

    Submitted 24 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  29. LRP2020: Machine Learning Advantages in Canadian Astrophysics

    Authors: K. A. Venn, S. Fabbro, A Liu, Y. Hezaveh, L. Perreault-Levasseur, G. Eadie, S. Ellison, J. Woo, JJ. Kavelaars, K. M. Yi, R. Hlozek, J. Bovy, H. Teimoorinia, S. Ravanbakhsh, L. Spencer

    Abstract: The application of machine learning (ML) methods to the analysis of astrophysical datasets is on the rise, particularly as the computing power and complex algorithms become more powerful and accessible. As the field of ML enjoys a continuous stream of breakthroughs, its applications demonstrate the great potential of ML, ranging from achieving tens of millions of times increase in analysis speed (…

    Submitted 15 October, 2019; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: White paper E015 submitted to the Canadian Long Range Plan LRP2020

  30. arXiv:1905.11460  [pdf, other]

    cs.LG stat.ML

    Incidence Networks for Geometric Deep Learning

    Authors: Marjan Albooyeh, Daniele Bertolini, Siamak Ravanbakhsh

    Abstract: Sparse incidence tensors can represent a variety of structured data. For example, we may represent attributed graphs using their node-node, node-edge, or edge-edge incidence matrices. In higher dimensions, incidence tensors can represent simplicial complexes and polytopes. In this paper, we formalize incidence tensors, analyze their structure, and present the family of equivariant networks that op…

    Submitted 11 August, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: Last revised August 10, 2020

  31. arXiv:1903.09033  [pdf, other]

    cs.LG stat.ML

    Equivariant Entity-Relationship Networks

    Authors: Devon Graham, Junhao Wang, Siamak Ravanbakhsh

    Abstract: The relational model is a ubiquitous representation of big-data, in part due to its extensive use in databases. In this paper, we propose the Equivariant Entity-Relationship Network (EERN), which is a Multilayer Perceptron equivariant to the symmetry transformations of the Entity-Relationship model. To this end, we identify the most expressive family of linear maps that are exactly equivariant to…

    Submitted 5 June, 2020; v1 submitted 21 March, 2019; originally announced March 2019.

  32. arXiv:1812.03235  [pdf, other]

    cs.LG stat.ML

    Improved Knowledge Graph Embedding using Background Taxonomic Information

    Authors: Bahare Fatemi, Siamak Ravanbakhsh, David Poole

    Abstract: Knowledge graphs are used to represent relational information in terms of triples. To enable learning about domains, embedding models, such as tensor factorization models, can be used to make predictions of new triples. Often there is background taxonomic information (in terms of subclasses and subproperties) that should also be taken into account. We show that existing fully expressive (a.k.a. un…

    Submitted 7 December, 2018; originally announced December 2018.

  33. arXiv:1811.06533  [pdf, other]

    astro-ph.CO cs.AI cs.LG

    Learning to Predict the Cosmological Structure Formation

    Authors: Siyu He, Yin Li, Yu Feng, Shirley Ho, Siamak Ravanbakhsh, Wei Chen, Barnabás Póczos

    Abstract: Matter evolved under influence of gravity from minuscule density fluctuations. Non-perturbative structure formed hierarchically over all scales, and developed non-Gaussian features in the Universe, known as the Cosmic Web. To fully understand the structure formation of the Universe is one of the holy grails of modern astrophysics. Astrophysicists survey large volumes of the Universe and employ a l…

    Submitted 31 July, 2019; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: 8 pages, 5 figures, 1 table

    Journal ref: PNAS July 9, 2019 116 (28) 13825-13832

  34. arXiv:1806.11217  [pdf, other]

    cs.CV

    Subject2Vec: Generative-Discriminative Approach from a Set of Image Patches to a Vector

    Authors: Sumedha Singla, Mingming Gong, Siamak Ravanbakhsh, Frank Sciurba, Barnabas Poczos, Kayhan N. Batmanghelich

    Abstract: We propose an attention-based method that aggregates local image features to a subject-level representation for predicting disease severity. In contrast to classical deep learning that requires a fixed dimensional input, our method operates on a set of image patches; hence it can accommodate variable length input image without image resizing. The model learns a clinically interpretable subject-lev…

    Submitted 28 June, 2018; originally announced June 2018.

    Comments: MICCAI 2018

  35. arXiv:1803.02879  [pdf, other]

    stat.ML cs.LG

    Deep Models of Interactions Across Sets

    Authors: Jason Hartford, Devon R Graham, Kevin Leyton-Brown, Siamak Ravanbakhsh

    Abstract: We use deep learning to model interactions across two or more sets of objects, such as user-movie ratings, protein-drug bindings, or ternary user-item-tag interactions. The canonical representation of such interactions is a matrix (or a higher-dimensional tensor) with an exchangeability property: the encoding's meaning is not changed by permuting rows or columns. We argue that models should hence…

    Submitted 8 June, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

  36. arXiv:1711.02033  [pdf, other]

    astro-ph.CO cs.LG stat.ML

    Estimating Cosmological Parameters from the Dark Matter Distribution

    Authors: Siamak Ravanbakhsh, Junier Oliva, Sebastien Fromenteau, Layne C. Price, Shirley Ho, Jeff Schneider, Barnabas Poczos

    Abstract: A grand challenge of the 21st century cosmology is to accurately estimate the cosmological parameters of our Universe. A major approach to estimating the cosmological parameters is to use the large-scale matter distribution of the Universe. Galaxy surveys provide the means to map out cosmic large-scale structure in three dimensions. Information about galaxy locations is typically summarized in a "…

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: ICML 2016

  37. arXiv:1703.06114  [pdf, other]

    cs.LG stat.ML

    Deep Sets

    Authors: Manzil Zaheer, Satwik Kottur, Siamak Ravanbakhsh, Barnabas Poczos, Ruslan Salakhutdinov, Alexander Smola

    Abstract: We study the problem of designing models for machine learning tasks defined on \emph{sets}. In contrast to the traditional approach of operating on fixed dimensional vectors, we consider objective functions defined on sets that are invariant to permutations. Such problems are widespread, ranging from estimation of population statistics \cite{poczos13aistats}, to anomaly detection in piezometer data of…

    Submitted 14 April, 2018; v1 submitted 10 March, 2017; originally announced March 2017.

    Comments: NIPS 2017

  38. arXiv:1703.02642  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.GA

    CMU DeepLens: Deep Learning For Automatic Image-based Galaxy-Galaxy Strong Lens Finding

    Authors: Francois Lanusse, Quanbin Ma, Nan Li, Thomas E. Collett, Chun-Liang Li, Siamak Ravanbakhsh, Rachel Mandelbaum, Barnabas Poczos

    Abstract: Galaxy-scale strong gravitational lensing is not only a valuable probe of the dark matter distribution of massive galaxies, but can also provide valuable cosmological constraints, either by studying the population of strong lenses or by measuring time delays in lensed quasars. Due to the rarity of galaxy-scale strongly lensed systems, fast and reliable automated lens finding methods will be essent…

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: 12 pages, 9 figures, submitted to MNRAS

  39. arXiv:1702.08389  [pdf, other]

    stat.ML cs.NE

    Equivariance Through Parameter-Sharing

    Authors: Siamak Ravanbakhsh, Jeff Schneider, Barnabas Poczos

    Abstract: We propose to study equivariance in deep neural networks through parameter symmetries. In particular, given a group $\mathcal{G}$ that acts discretely on the input and output of a standard neural network layer $\phi_{W}: \Re^{M} \to \Re^{N}$, we show that $\phi_{W}$ is equivariant with respect to $\mathcal{G}$-action iff $\mathcal{G}$ explains the symmetries of the network parameters $W$. Inspired by th…

    Submitted 13 June, 2017; v1 submitted 27 February, 2017; originally announced February 2017.

    Comments: icml'17

  40. arXiv:1611.04500  [pdf, other]

    stat.ML cs.LG cs.NE

    Deep Learning with Sets and Point Clouds

    Authors: Siamak Ravanbakhsh, Jeff Schneider, Barnabas Poczos

    Abstract: We introduce a simple permutation equivariant layer for deep learning with set structure. This type of layer, obtained by parameter-sharing, has a simple implementation and linear-time complexity in the size of each set. We use deep permutation-invariant networks to perform point-cloud classification and MNIST-digit summation, where in both cases the output is invariant to permutations of the input…

    Submitted 23 February, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

  41. arXiv:1611.03879  [pdf, other]

    stat.ML cs.LG

    Annealing Gaussian into ReLU: a New Sampling Strategy for Leaky-ReLU RBM

    Authors: Chun-Liang Li, Siamak Ravanbakhsh, Barnabas Poczos

    Abstract: Restricted Boltzmann Machine (RBM) is a bipartite graphical model that is used as the building block in energy-based deep generative models. Due to numerical stability and quantifiability of the likelihood, RBM is commonly used with Bernoulli units. Here, we consider an alternative member of exponential family RBM with leaky rectified linear units -- called leaky RBM. We first study the joint and…

    Submitted 11 November, 2016; originally announced November 2016.

  42. arXiv:1609.05796  [pdf, other]

    astro-ph.IM astro-ph.CO cs.AI stat.ML

    Enabling Dark Energy Science with Deep Generative Models of Galaxy Images

    Authors: Siamak Ravanbakhsh, Francois Lanusse, Rachel Mandelbaum, Jeff Schneider, Barnabas Poczos

    Abstract: Understanding the nature of dark energy, the mysterious force driving the accelerated expansion of the Universe, is a major challenge of modern cosmology. The next generation of cosmological surveys, specifically designed to address this issue, rely on accurate measurements of the apparent shapes of distant galaxies. However, shape measurement methods suffer from various unavoidable biases and the… ▽ More

    Submitted 30 November, 2016; v1 submitted 19 September, 2016; originally announced September 2016.

  43. arXiv:1601.00034  [pdf, other

    stat.ML cs.LG cs.NE

    Stochastic Neural Networks with Monotonic Activation Functions

    Authors: Siamak Ravanbakhsh, Barnabas Poczos, Jeff Schneider, Dale Schuurmans, Russell Greiner

    Abstract: We propose a Laplace approximation that creates a stochastic unit from any smooth monotonic activation function, using only Gaussian noise. This paper investigates the application of this stochastic approximation in training a family of Restricted Boltzmann Machines (RBM) that are closely linked to Bregman divergences. This family, that we call exponential family RBM (Exp-RBM), is a subset of the… ▽ More

    Submitted 22 July, 2016; v1 submitted 31 December, 2015; originally announced January 2016.

    Comments: AISTATS 2016

  44. arXiv:1509.08535  [pdf, other

    math.ST cs.AI cs.DM stat.ML

    Boolean Matrix Factorization and Noisy Completion via Message Passing

    Authors: Siamak Ravanbakhsh, Barnabas Poczos, Russell Greiner

    Abstract: Boolean matrix factorization and Boolean matrix completion from noisy observations are desirable unsupervised data-analysis methods due to their interpretability, but hard to perform due to their NP-hardness. We treat these problems as maximum a posteriori inference problems in a graphical model and present a message passing approach that scales linearly with the number of observations and factors… ▽ More

    Submitted 4 February, 2016; v1 submitted 28 September, 2015; originally announced September 2015.

  45. arXiv:1508.05013  [pdf, other

    cs.AI cs.CC cs.DS math.AC math.PR

    Message Passing and Combinatorial Optimization

    Authors: Siamak Ravanbakhsh

    Abstract: Graphical models use the intuitive and well-studied methods of graph theory to implicitly represent dependencies between variables in large systems. They can model the global behaviour of a complex system by specifying only local factors. This thesis studies inference in discrete graphical models from an algebraic perspective and the ways inference can be used to express and approximate NP-hard co… ▽ More

    Submitted 20 August, 2015; originally announced August 2015.

    Comments: Ravanbakhsh, S. (2015), Message Passing and Combinatorial Optimization, PhD thesis, University of Alberta

  46. arXiv:1409.7410  [pdf, other

    cs.AI cs.CC math.RA

    Revisiting Algebra and Complexity of Inference in Graphical Models

    Authors: Siamak Ravanbakhsh, Russell Greiner

    Abstract: This paper studies the form and complexity of inference in graphical models using the abstraction offered by algebraic structures. In particular, we broadly formalize inference problems in graphical models by viewing them as a sequence of operations based on commutative semigroups. We then study the computational complexity of inference by organizing various problems into an "inference hierarchy".… ▽ More

    Submitted 3 May, 2015; v1 submitted 25 September, 2014; originally announced September 2014.

  47. arXiv:1409.1456  [pdf, other

    cs.AI cs.CE q-bio.QM

    Accurate, fully-automated NMR spectral profiling for metabolomics

    Authors: Siamak Ravanbakhsh, Philip Liu, Trent Bjorndahl, Rupasri Mandal, Jason R. Grant, Michael Wilson, Roman Eisner, Igor Sinelnikov, Xiaoyu Hu, Claudio Luchinat, Russell Greiner, David S. Wishart

    Abstract: Many diseases cause significant changes to the concentrations of small molecules (aka metabolites) that appear in a person's biofluids, which means such diseases can often be readily detected from a person's "metabolic profile". This information can be extracted from a biofluid's NMR spectrum. Today, this is often done manually by trained human experts, which means this process is relatively slow,… ▽ More

    Submitted 7 September, 2014; v1 submitted 4 September, 2014; originally announced September 2014.

    Journal ref: PLoS ONE 10(5): e0124219, 2015

  48. arXiv:1406.0941  [pdf, other

    cs.AI

    Augmentative Message Passing for Traveling Salesman Problem and Graph Partitioning

    Authors: Siamak Ravanbakhsh, Reihaneh Rabbany, Russell Greiner

    Abstract: The cutting plane method is an augmentative constrained optimization procedure that is often used with continuous-domain optimization techniques such as linear and convex programs. We investigate the viability of a similar idea within message passing -- which produces integral solutions -- in the context of two combinatorial problems: 1) For Traveling Salesman Problem (TSP), we propose a factor-gr… ▽ More

    Submitted 4 June, 2014; originally announced June 2014.

    Report number: Advances in Neural Information Processing Systems 27 (NIPS 2014)

  49. arXiv:1405.1436  [pdf, ps, other

    cs.NE cs.LG stat.ML

    Training Restricted Boltzmann Machine by Perturbation

    Authors: Siamak Ravanbakhsh, Russell Greiner, Brendan Frey

    Abstract: A new approach to maximum likelihood learning of discrete graphical models and RBM in particular is introduced. Our method, Perturb and Descend (PD), is inspired by two ideas: (I) the perturb and MAP method for sampling, and (II) learning by Contrastive Divergence minimization. In contrast to perturb and MAP, PD leverages training data to learn the models that do not allow efficient MAP estimation. During th… ▽ More

    Submitted 6 May, 2014; originally announced May 2014.

  50. arXiv:1401.6686  [pdf, other

    cs.AI cs.CC stat.ML

    Perturbed Message Passing for Constraint Satisfaction Problems

    Authors: Siamak Ravanbakhsh, Russell Greiner

    Abstract: We introduce an efficient message passing scheme for solving Constraint Satisfaction Problems (CSPs), which uses stochastic perturbation of Belief Propagation (BP) and Survey Propagation (SP) messages to bypass decimation and directly produce a single satisfying assignment. Our first CSP solver, called Perturbed Belief Propagation, smoothly interpolates two well-known inference procedures; it start… ▽ More

    Submitted 2 February, 2015; v1 submitted 26 January, 2014; originally announced January 2014.

    Journal ref: JMLR 16(Jul):1249-1274, 2015