Showing 1–7 of 7 results for author: Mantri, S

Searching in archive cs.
  1. arXiv:2504.07337  [pdf, other]

    cs.LG cs.SI

    FLASH: Flexible Learning of Adaptive Sampling from History in Temporal Graph Neural Networks

    Authors: Or Feldman, Krishna Sri Ipsit Mantri, Carola-Bibiane Schönlieb, Chaim Baskin, Moshe Eliasof

    Abstract: Aggregating temporal signals from historic interactions is a key step in future link prediction on dynamic graphs. However, incorporating long histories is resource-intensive. Hence, temporal graph neural networks (TGNNs) often rely on historical neighbors sampling heuristics such as uniform sampling or recent neighbors selection. These heuristics are static and fail to adapt to the underlying gra…

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: 22 pages, 4 figures, 12 tables

  2. arXiv:2502.06029  [pdf, ps, other]

    cs.CV

    DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations

    Authors: Krishna Sri Ipsit Mantri, Carola-Bibiane Schönlieb, Bruno Ribeiro, Chaim Baskin, Moshe Eliasof

    Abstract: Pre-trained Vision Transformers now serve as powerful tools for computer vision. Yet, efficiently adapting them for multiple tasks remains a challenge that arises from the need to modify the rich hidden representations encoded by the learned weight matrices, without inducing interference between tasks. Current parameter-efficient methods like LoRA, which apply low-rank updates, force tasks to comp…

    Submitted 1 June, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: CVPR 2025, 14 pages

  3. arXiv:2501.08406  [pdf, other]

    cs.HC cs.CL cs.LG math.OC

    OptiChat: Bridging Optimization Models and Practitioners with Large Language Models

    Authors: Hao Chen, Gonzalo Esteban Constante-Flores, Krishna Sri Ipsit Mantri, Sai Madhukiran Kompalli, Akshdeep Singh Ahluwalia, Can Li

    Abstract: Optimization models have been applied to solve a wide variety of decision-making problems. These models are usually developed by optimization experts but are used by practitioners without optimization expertise in various application domains. As a result, practitioners often struggle to interact with and draw useful conclusions from optimization models independently. To fill this gap, we introduce…

    Submitted 14 January, 2025; originally announced January 2025.

  4. arXiv:2407.02013  [pdf, other]

    cs.LG

    DiGRAF: Diffeomorphic Graph-Adaptive Activation Function

    Authors: Krishna Sri Ipsit Mantri, Xinzhi Wang, Carola-Bibiane Schönlieb, Bruno Ribeiro, Beatrice Bevilacqua, Moshe Eliasof

    Abstract: In this paper, we propose a novel activation function tailored specifically for graph data in Graph Neural Networks (GNNs). Motivated by the need for graph-adaptive and flexible activation functions, we introduce DiGRAF, leveraging Continuous Piecewise-Affine Based (CPAB) transformations, which we augment with an additional GNN to learn a graph-adaptive diffeomorphic activation function in an end-…

    Submitted 30 October, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: NeurIPS 2024

  5. arXiv:2308.07052  [pdf]

    cs.CV cs.AI cs.CY cs.LG

    Diagnosis of Scalp Disorders using Machine Learning and Deep Learning Approach -- A Review

    Authors: Hrishabh Tiwari, Jatin Moolchandani, Shamla Mantri

    Abstract: The morbidity of scalp diseases is minuscule compared to other diseases, but the impact on the patient's life is enormous. It is common for people to experience scalp problems that include Dandruff, Psoriasis, Tinea-Capitis, Alopecia and Atopic-Dermatitis. In accordance with WHO research, approximately 70% of adults have problems with their scalp. It has been demonstrated in descriptive research t…

    Submitted 14 August, 2023; originally announced August 2023.

  6. arXiv:2306.05182  [pdf, other]

    cs.CV cs.LG

    Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models

    Authors: Krishna Sri Ipsit Mantri, Nevasini Sasikumar

    Abstract: Fashionable image generation aims to synthesize images of diverse fashion prevalent around the globe, helping fashion designers in real-time visualization by giving them a basic customized structure of how a specific design preference would look in real life and what further improvements can be made for enhanced customer satisfaction. Moreover, users can alone interact and generate fashionable ima…

    Submitted 15 May, 2023; originally announced June 2023.

    Comments: Third Workshop on Ethical Considerations in Creative applications of Computer Vision (EC3V) at CVPR 2023. arXiv admin note: substantial text overlap with arXiv:2301.02110, arXiv:2112.10752 by other authors

  7. arXiv:2305.13048  [pdf, other]

    cs.CL cs.AI

    RWKV: Reinventing RNNs for the Transformer Era

    Authors: Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran GV, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, et al. (9 additional authors not shown)

    Abstract: Transformers have revolutionized almost all natural language processing (NLP) tasks but suffer from memory and computational complexity that scales quadratically with sequence length. In contrast, recurrent neural networks (RNNs) exhibit linear scaling in memory and computational requirements but struggle to match the same performance as Transformers due to limitations in parallelization and scala…

    Submitted 10 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.