Skip to main content

Showing 1–7 of 7 results for author: Sai, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.09450  [pdf, ps, other

    cs.CL cs.AI

    UniToMBench: Integrating Perspective-Taking to Improve Theory of Mind in LLMs

    Authors: Prameshwar Thiyagarajan, Vaishnavi Parimi, Shamant Sai, Soumil Garg, Zhangir Meirbek, Nitin Yarlagadda, Kevin Zhu, Chris Kim

    Abstract: Theory of Mind (ToM), the ability to understand the mental states of oneself and others, remains a challenging area for large language models (LLMs), which often fail to predict human mental states accurately. In this paper, we introduce UniToMBench, a unified benchmark that integrates the strengths of SimToM and TOMBENCH to systematically improve and assess ToM capabilities in LLMs by integrating… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: Accepted at Conference of the North American Chapter of the Association for Computational Linguistics, Student Research Workshop 2025 (NAACL SRW 2025)

  2. arXiv:2501.10866  [pdf, other

    cs.LG

    QGAPHEnsemble : Combining Hybrid QLSTM Network Ensemble via Adaptive Weighting for Short Term Weather Forecasting

    Authors: Anuvab Sen, Udayon Sen, Mayukhi Paul, Apurba Prasad Padhy, Sujith Sai, Aakash Mallik, Chhandak Mallick

    Abstract: Accurate weather forecasting holds significant importance, serving as a crucial tool for decision-making in various industrial sectors. The limitations of statistical models, assuming independence among data points, highlight the need for advanced methodologies. The correlation between meteorological variables necessitate models capable of capturing complex dependencies. This research highlights t… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

    Comments: 8 pages and 9 figures, Accepted by the 15th IEEE International Symposium Series on Computational Intelligence (SSCI 2023), March 17-21, 2025, Trondheim, Norway

  3. arXiv:2501.00042  [pdf

    cs.LG cs.AI

    Resource-Efficient Transformer Architecture: Optimizing Memory and Execution Time for Real-Time Applications

    Authors: Krisvarish V, Priyadarshini T, K P Abhishek Sri Saai, Vaidehi Vijayakumar

    Abstract: This paper describes a memory-efficient transformer model designed to drive a reduction in memory usage and execution time by substantial orders of magnitude without impairing the model's performance near that of the original model. Recently, new architectures of transformers were presented, focused on parameter efficiency and computational optimization; however, such models usually require consid… ▽ More

    Submitted 25 December, 2024; originally announced January 2025.

    Comments: 5 pages, 1 figure

  4. arXiv:2405.20654  [pdf, other

    cs.CL cs.IR

    Passage-specific Prompt Tuning for Passage Reranking in Question Answering with Large Language Models

    Authors: Xuyang Wu, Zhiyuan Peng, Krishna Sravanthi Rajanala Sai, Hsin-Tai Wu, Yi Fang

    Abstract: Effective passage retrieval and reranking methods have been widely utilized to identify suitable candidates in open-domain question answering tasks, recent studies have resorted to LLMs for reranking the retrieved passages by the log-likelihood of the question conditioned on each passage. Although these methods have demonstrated promising results, the performance is notably sensitive to the human-… ▽ More

    Submitted 20 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted at Gen-IR@SIGIR24

  5. arXiv:2311.02087  [pdf

    cs.SD cs.AI cs.LG eess.AS eess.SP

    Design Of Rubble Analyzer Probe Using ML For Earthquake

    Authors: Abhishek Sebastian, R Pragna, K Vishal Vythianathan, Dasaraju Sohan Sai, U Shiva Sri Hari Al, R Anirudh, Apurv Choudhary

    Abstract: The earthquake rubble analyzer uses machine learning to detect human presence via ambient sounds, achieving 97.45% accuracy. It also provides real-time environmental data, aiding in assessing survival prospects for trapped individuals, crucial for post-earthquake rescue efforts

    Submitted 24 October, 2023; originally announced November 2023.

  6. arXiv:2110.03588  [pdf

    eess.IV cs.CV physics.med-ph

    A transformer-based deep learning approach for classifying brain metastases into primary organ sites using clinical whole brain MRI

    Authors: Qing Lyu, Sanjeev V. Namjoshi, Emory McTyre, Umit Topaloglu, Richard Barcus, Michael D. Chan, Christina K. Cramer, Waldemar Debinski, Metin N. Gurcan, Glenn J. Lesser, Hui-Kuan Lin, Reginald F. Munden, Boris C. Pasche, Kiran Kumar Solingapuram Sai, Roy E. Strowd, Stephen B. Tatter, Kounosuke Watabe, Wei Zhang, Ge Wang, Christopher T. Whitlow

    Abstract: Treatment decisions for brain metastatic disease rely on knowledge of the primary organ site, and currently made with biopsy and histology. Here we develop a novel deep learning approach for accurate non-invasive digital histology with whole-brain MRI data. Our IRB-approved single-site retrospective study was comprised of patients (n=1,399) referred for MRI treatment-planning and gamma knife radio… ▽ More

    Submitted 20 April, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

  7. arXiv:1908.09003  [pdf

    cs.CV eess.IV

    High Accurate Unhealthy Leaf Detection

    Authors: S. Mohan Sai, G. Gopichand, C. Vikas Reddy, K. Mona Teja

    Abstract: India is an agriculture-dependent country. As we all know that farming is the backbone of our country it is our responsibility to preserve the crops. However, we cannot stop the destruction of crops by natural calamities at least we have to try to protect our crops from diseases. To, detect a plant disease we need a fast automatic way. So, this paper presents a model to identify the particular dis… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: Page 4, 5 with 1 figure, and page 6 with 2 figures