Showing 1–6 of 6 results for author: Panigrahi, S S

  1. arXiv:2502.03638  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models

    Authors: Daniel Levy, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Qiang Zhu, Kin Long Kelvin Lee, Mikhail Galkin, Santiago Miret, Siamak Ravanbakhsh

    Abstract: Generating novel crystalline materials has the potential to lead to advancements in fields such as electronics, energy storage, and catalysis. The defining characteristic of crystals is their symmetry, which plays a central role in determining their physical properties. However, existing crystal generation methods either fail to generate materials that display the symmetries of real-world crystals…

    Submitted 23 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 24 pages, 10 figures, International Conference on Learning Representations (ICLR) 2025

  2. arXiv:2412.04626  [pdf, other]

    cs.LG cs.CL

    BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks

    Authors: Juan Rodriguez, Xiangru Jian, Siba Smarak Panigrahi, Tianyu Zhang, Aarash Feizi, Abhay Puri, Akshay Kalkunte, François Savard, Ahmed Masry, Shravan Nayak, Rabiul Awal, Mahsa Massoud, Amirhossein Abaskohi, Zichao Li, Suyuchen Wang, Pierre-André Noël, Mats Leon Richter, Saverio Vadacchino, Shubham Agarwal, Sanket Biswas, Sara Shanian, Ying Zhang, Noah Bolger, Kurt MacDonald, Simon Fauvel, et al. (18 additional authors not shown)

    Abstract: Multimodal AI has the potential to significantly enhance document-understanding tasks, such as processing receipts, understanding workflows, extracting data from documents, and summarizing reports. Code generation tasks that require long-structured outputs can also be enhanced by multimodality. Despite this, their use in commercial applications is often limited due to limited access to training da…

    Submitted 17 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: The project is hosted at https://bigdocs.github.io

    Journal ref: ICLR 2025 https://openreview.net/forum?id=UTgNFcpk0j

  3. arXiv:2405.14089  [pdf, other]

    cs.LG

    Improved Canonicalization for Model Agnostic Equivariance

    Authors: Siba Smarak Panigrahi, Arnab Kumar Mondal

    Abstract: This work introduces a novel approach to achieving architecture-agnostic equivariance in deep learning, particularly addressing the limitations of traditional layerwise equivariant architectures and the inefficiencies of the existing architecture-agnostic methods. Building equivariant models using traditional methods requires designing equivariant versions of existing models and training them from…

    Submitted 15 November, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to EquiVision workshop, CVPR 2024. 8 pages, 2 figures, 2 tables

  4. arXiv:2310.01647  [pdf, other]

    cs.LG

    Equivariant Adaptation of Large Pretrained Models

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Sai Rajeswar, Siamak Ravanbakhsh

    Abstract: Equivariant networks are specifically designed to ensure consistent behavior with respect to a set of input transformations, leading to higher sample efficiency and more accurate and robust predictions. However, redesigning each component of prevalent deep neural network architectures to achieve chosen equivariance is a difficult problem and can result in a computationally expensive network during…

    Submitted 29 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 17 pages, 6 figures. Accepted to NeurIPS 2023

  5. arXiv:2306.11941  [pdf, other]

    cs.LG cs.AI

    Efficient Dynamics Modeling in Interactive Environments with Koopman Theory

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sai Rajeswar, Kaleem Siddiqi, Siamak Ravanbakhsh

    Abstract: The accurate modeling of dynamics in interactive environments is critical for successful long-range prediction. Such a capability could advance Reinforcement Learning (RL) and Planning algorithms, but achieving it is challenging. Inaccuracies in model estimates can compound, resulting in increased errors over long horizons. We approach this problem from the lens of Koopman theory, where the nonlin…

    Submitted 12 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2024 and EWRL 2023

  6. arXiv:2110.12370  [pdf, other]

    cs.CL

    Team Enigma at ArgMining-EMNLP 2021: Leveraging Pre-trained Language Models for Key Point Matching

    Authors: Manav Nitin Kapadnis, Sohan Patnaik, Siba Smarak Panigrahi, Varun Madhavan, Abhilash Nandy

    Abstract: We present the system description for our submission towards the Key Point Analysis Shared Task at ArgMining 2021. Track 1 of the shared task requires participants to develop methods to predict the match score between each pair of arguments and keypoints, provided they belong to the same topic under the same stance. We leveraged existing state of the art pre-trained language models along with inco…

    Submitted 24 October, 2021; originally announced October 2021.