Skip to main content

Showing 1–50 of 362 results for author: Mitra, P

.
  1. arXiv:2509.13401  [pdf, ps, other

    hep-th gr-qc quant-ph

    An effective density matrix for vacua in asymptotically flat gravity

    Authors: Temple He, Prahar Mitra, Kathryn M. Zurek

    Abstract: We explicitly construct the density matrix associated to the vacuum state of a large spherically symmetric causal diamond of area $A$ in four-dimensional asymptotically flat gravity. We achieve this using the soft effective action, which characterizes the low-energy gravitational degrees of freedom that arise in the long-distance limit of the Einstein-Hilbert action and consists of both the soft g… ▽ More

    Submitted 16 September, 2025; originally announced September 2025.

    Comments: 11 pages, 1 figure

    Report number: CALT-TH 2025-030

  2. arXiv:2508.18309  [pdf, ps, other

    physics.gen-ph

    Vector Differential Operators in arbitrary coordinates: a general approach

    Authors: Priyabrata Mitra, Dhrubaditya Mitra

    Abstract: We present a method for calculating the results of operation of differential operators operating on components of vector in generalized coordinates not restricted to orthogonal one. For this we use the relationships between covariant, contravariant and physical components of a vector and the idea of covariant differentiation. This not only simplifies vector calculus in common curvilinear coordinat… ▽ More

    Submitted 23 August, 2025; originally announced August 2025.

  3. arXiv:2507.02851  [pdf, ps, other

    cs.CL cs.AI cs.IT cs.LG eess.SY

    MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs

    Authors: Purbesh Mitra, Sennur Ulukus

    Abstract: Recent advancements in the reasoning capabilities of large language models (LLMs) show that employing group relative policy optimization (GRPO) algorithm for reinforcement learning (RL) training allows the models to use more thinking/reasoning tokens for generating better responses. However, LLMs can generate only a finite amount of tokens while maintaining attention to the previously generated to… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

  4. arXiv:2506.10737  [pdf, ps, other

    cs.CL cs.IR

    TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora

    Authors: Priyanka Kargupta, Nan Zhang, Yunyi Zhang, Rui Zhang, Prasenjit Mitra, Jiawei Han

    Abstract: The rapid evolution of scientific fields introduces challenges in organizing and retrieving scientific literature. While expert-curated taxonomies have traditionally addressed this need, the process is time-consuming and expensive. Furthermore, recent automatic taxonomy construction methods either (1) over-rely on a specific corpus, sacrificing generalizability, or (2) depend heavily on the genera… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Accepted to ACL 2025 Main Conference. Code available at: https://github.com/pkargupta/taxoadapt

  5. arXiv:2506.09957  [pdf, ps, other

    physics.app-ph

    Mechanism of Conductivity Enhancement of Polymers Employing Microbubble Lithography

    Authors: Anand Dev Ranjan, Dhananjay Mahapatra, Partha Mitra, Ayan Banerjee

    Abstract: The pursuit of green methodologies for fabricating optoelectronic devices necessitates the adoption of self-assembly-based strategies to engineer efficient and sustainable platforms. Microbubble lithography (MBL) stands out as a directed self-assembly technique, enabling real-time micropatterning of conductive structures. Notably, this approach achieves significant enhancements in the conductivity… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 9 Pages, 6 Figures

  6. arXiv:2506.06568  [pdf, ps, other

    physics.ins-det hep-ex nucl-ex

    Removal of spallation-induced tritium from silicon through diffusion

    Authors: R. Saldanha, D. Reading, P. E. Warwick, A. E. Chavarria, B. Loer, P. Mitra, L. Pagani, P. Privitera

    Abstract: Tritium, predominantly produced through spallation reactions caused by cosmic ray interactions, is a significant radioactive background for silicon-based rare event detection experiments, such as dark matter searches. We have investigated the feasibility of removing cosmogenic tritium from high-purity silicon intended for use in low-background experiments. We demonstrate that significant tritium r… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 17 pages, 10 figures, 2 tables

  7. arXiv:2505.17736  [pdf, ps, other

    cs.IR

    Modeling Ranking Properties with In-Context Learning

    Authors: Nilanjan Sinhababu, Andrew Parry, Debasis Ganguly, Pabitra Mitra

    Abstract: While standard IR models are mainly designed to optimize relevance, real-world search often needs to balance additional objectives such as diversity and fairness. These objectives depend on inter-document interactions and are commonly addressed using post-hoc heuristics or supervised learning methods, which require task-specific training for each ranking scenario and dataset. In this work, we prop… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 9 pages, 3 tables, 2 figures

  8. arXiv:2505.07754  [pdf

    q-bio.NC cs.CV

    Skeletonization of neuronal processes using Discrete Morse techniques from computational topology

    Authors: Samik Banerjee, Caleb Stam, Daniel J. Tward, Steven Savoia, Yusu Wang, Partha P. Mitra

    Abstract: To understand biological intelligence we need to map neuronal networks in vertebrate brains. Mapping mesoscale neural circuitry is done using injections of tracers that label groups of neurons whose axons project to different brain regions. Since many neurons are labeled, it is difficult to follow individual axons. Previous approaches have instead quantified the regional projections using the tota… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: Under Review in Nature

  9. arXiv:2504.21323  [pdf, other

    cs.CR cs.AI cs.LG

    How to Backdoor the Knowledge Distillation

    Authors: Chen Wu, Qian Ma, Prasenjit Mitra, Sencun Zhu

    Abstract: Knowledge distillation has become a cornerstone in modern machine learning systems, celebrated for its ability to transfer knowledge from a large, complex teacher model to a more efficient student model. Traditionally, this process is regarded as secure, assuming the teacher model is clean. This belief stems from conventional backdoor attacks relying on poisoned training data with backdoor trigger… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

  10. arXiv:2504.13089  [pdf, ps, other

    hep-ex hep-ph

    Absorption of Fermionic Dark Matter in the PICO-60 C$_{3}$F$_{8}$ Bubble Chamber

    Authors: E. Adams, B. Ali, R. Anderson-Dornan, I. J. Arnquist, M. Bai, D. Baxter, E. Behnke, B. Broerman, C. J. Chen, K. Clark, J. I. Collar, P. S. Cooper, D. Cranshaw, C. Cripe, M. Crisler, C. E. Dahl, M. Das, S. Das, S. Fallows, J. Farine, R. Filgas, A. García-Viltres, G. Giroux, O. Harris, H. Hawley-Herrera , et al. (36 additional authors not shown)

    Abstract: Fermionic dark matter absorption on nuclear targets via neutral current interactions is explored using a non-relativistic effective field theory framework. An analysis of data from the PICO-60 C$_{3}$F$_{8}$ bubble chamber sets leading constraints on spin-independent absorption for dark matter masses below 23 MeV/$\textit{c}^2$ and establishes the first limits on spin-dependent absorptive interact… ▽ More

    Submitted 24 June, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

  11. arXiv:2504.02010  [pdf, ps, other

    cs.LG cs.AI

    When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models

    Authors: Nan Zhang, Eugene Kwek, Yusen Zhang, Ngoc-Hieu Nguyen, Prasenjit Mitra, Rui Zhang

    Abstract: Compression methods, including quantization, distillation, and pruning, improve the computational efficiency of large reasoning models (LRMs). However, existing studies either fail to sufficiently compare all three compression methods on LRMs or lack in-depth interpretation analysis. In this paper, we investigate how the reasoning capabilities of LRMs are compromised during compression, through pe… ▽ More

    Submitted 1 October, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

  12. arXiv:2503.19309  [pdf, other

    cs.CL

    Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees

    Authors: Gollam Rabby, Diyana Muhammed, Prasenjit Mitra, Sören Auer

    Abstract: Scientific hypothesis generation is a fundamentally challenging task in research, requiring the synthesis of novel and empirically grounded insights. Traditional approaches rely on human intuition and domain expertise, while purely large language model (LLM) based methods often struggle to produce hypotheses that are both innovative and reliable. To address these limitations, we propose the Monte… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  13. arXiv:2503.19257  [pdf, other

    cs.CL cs.DL

    SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings

    Authors: Farhana Keya, Gollam Rabby, Prasenjit Mitra, Sahar Vahdati, Sören Auer, Yaser Jaradeh

    Abstract: Every scientific discovery starts with an idea inspired by prior work, interdisciplinary concepts, and emerging challenges. Recent advancements in large language models (LLMs) trained on scientific corpora have driven interest in AI-supported idea generation. However, generating context-aware, high-quality, and innovative ideas remains challenging. We introduce SCI-IDEA, a framework that uses LLM… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  14. arXiv:2503.05341  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Separating the bulk and interface contribution of spin-orbit torque in ferromagnet-Heavy metal bilayers tuned by variation of resistivity of heavy metal

    Authors: Abu Bakkar Miah, Dhananjaya Mahapatra, Soumik Aon, Harekrishna Bhunia, Partha Mitra

    Abstract: Harmonic Hall measurements were conducted on a series of Ferromagnetic metal/Heavy metal (FM/HM) bilayers with beta-Tungsten (W) as the HM and in-plane magnetized permalloy (Py) as the FM and the efficiencies of the two orthogonal components of the spin orbit-torque were extracted. Two sets of Hall bar-shaped devices were considered where the HM resistivity systematically varied over a wide range… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  15. arXiv:2502.05414  [pdf, other

    cs.LG cs.CL

    Graph-based Molecular In-context Learning Grounded on Morgan Fingerprints

    Authors: Ali Al-Lawati, Jason Lucas, Zhiwei Zhang, Prasenjit Mitra, Suhang Wang

    Abstract: In-context learning (ICL) effectively conditions large language models (LLMs) for molecular tasks, such as property prediction and molecule captioning, by embedding carefully selected demonstration examples into the input prompt. This approach avoids the computational overhead of extensive pertaining and fine-tuning. However, current prompt retrieval methods for molecular tasks have relied on mole… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  16. arXiv:2501.03166  [pdf, other

    cs.CL cs.LG

    Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text

    Authors: Ali Al-Lawati, Jason Lucas, Prasenjit Mitra

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance in various NLP tasks, including semantic parsing, which translates natural language into formal code representations. However, the reverse process, translating code into natural language, termed semantic captioning, has received less attention. This task is becoming increasingly important as LLMs are integrated into platforms fo… ▽ More

    Submitted 7 February, 2025; v1 submitted 6 January, 2025; originally announced January 2025.

    Journal ref: COLING 2025

  17. arXiv:2412.21200  [pdf, other

    cs.IT cs.CL cs.DC cs.LG cs.NI

    Distributed Mixture-of-Agents for Edge Inference with Large Language Models

    Authors: Purbesh Mitra, Priyanka Kaswan, Sennur Ulukus

    Abstract: Mixture-of-Agents (MoA) has recently been proposed as a method to enhance performance of large language models (LLMs), enabling multiple individual LLMs to work together for collaborative inference. This collaborative approach results in improved responses to user prompts compared to relying on a single LLM. In this paper, we consider such an MoA architecture in a distributed setting, where LLMs o… ▽ More

    Submitted 30 December, 2024; originally announced December 2024.

  18. arXiv:2412.06206  [pdf, other

    cs.CL cs.AI

    SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

    Authors: Nan Zhang, Prafulla Kumar Choubey, Alexander Fabbri, Gabriel Bernadett-Shapiro, Rui Zhang, Prasenjit Mitra, Caiming Xiong, Chien-Sheng Wu

    Abstract: Indexing is an important step towards strong performance in retrieval-augmented generation (RAG) systems. However, existing methods organize data based on either semantic similarity (similarity) or related information (relatedness), but do not cover both perspectives comprehensively. Our analysis reveals that modeling only one perspective results in insufficient knowledge synthesis, leading to sub… ▽ More

    Submitted 7 April, 2025; v1 submitted 8 December, 2024; originally announced December 2024.

    Comments: ICLR 2025

  19. arXiv:2411.19704  [pdf, other

    cs.NI eess.SP

    A PDD-Inspired Channel Estimation Scheme in NOMA Network

    Authors: Sumita Majhi, Pinaki Mitra

    Abstract: In 5G networks, non-orthogonal multiple access (NOMA) provides a number of benefits by providing uneven power distribution to multiple users at once. On the other hand, effective power allocation, successful successive interference cancellation (SIC), and user fairness all depend on precise channel state information (CSI). Because of dynamic channels, imperfect models, and feedback overhead, CSI p… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

  20. arXiv:2411.18712  [pdf, other

    hep-th gr-qc

    Charged Rotating Hairy Black Holes in AdS$_5 \times S^5$: Unveiling their Secrets

    Authors: Oscar J. C. Dias, Prahar Mitra, Jorge E. Santos

    Abstract: Using a mix of analytical and numerical methods, we construct new rotating, charged "hairy" black hole solutions of $D=5$, ${\cal N}=8$ gauged supergravity that are dual, via the AdS/CFT correspondence, to thermal states in $D=4$, ${\cal N}=4$ SYM at finite chemical and angular potential, thereby complementing and extending the results of [arXiv:1005.1287, arXiv:1806.01849, arXiv:1809.04084]. Thes… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 71 pages, 18 figures

  21. arXiv:2411.08346  [pdf, ps, other

    cond-mat.mes-hall

    Evidence of orbital Hall current induced correlation in second harmonic response of longitudinal and transverse voltage in light metal-ferromagnet bilayers

    Authors: Dhananjaya Mahapatra, Abu Bakkar Miah, HareKrishna Bhunia, Soumik Aon, Partha Mitra

    Abstract: We investigate the effect of orbital current arising from orbital Hall effect in thin films of Nb and Ti in ohmic contact with ferromagnetic Ni in the second harmonic longitudinal and transverse voltages in response to an a.c. current applied to the bilayer structures. Our experiments were analogous to those on Heavy Metal-Ferromagnet bilayers and we extract the Orbital Hall Torque efficiency and… ▽ More

    Submitted 11 June, 2025; v1 submitted 13 November, 2024; originally announced November 2024.

    Journal ref: Applied Physics Letters2025

  22. arXiv:2410.07625  [pdf, other

    cs.CV

    MorCode: Face Morphing Attack Generation using Generative Codebooks

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Sushma Venkatesh, Krothapalli Sreenivasa Rao, Pabitra Mitra, Rakesh Krishna

    Abstract: Face recognition systems (FRS) can be compromised by face morphing attacks, which blend textural and geometric information from multiple facial images. The rapid evolution of generative AI, especially Generative Adversarial Networks (GAN) or Diffusion models, where encoded images are interpolated to generate high-quality face morphing images. In this work, we present a novel method for the automat… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  23. arXiv:2409.17745  [pdf, other

    cs.IR cs.CL cs.LG

    Few-shot Prompting for Pairwise Ranking: An Effective Non-Parametric Retrieval Model

    Authors: Nilanjan Sinhababu, Andrew Parry, Debasis Ganguly, Debasis Samanta, Pabitra Mitra

    Abstract: A supervised ranking model, despite its advantage of being effective, usually involves complex processing - typically multiple stages of task-specific pre-training and fine-tuning. This has motivated researchers to explore simpler pipelines leveraging large language models (LLMs) that are capable of working in a zero-shot manner. However, since zero-shot inference does not make use of a training s… ▽ More

    Submitted 4 October, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted to EMNLP 2024

  24. arXiv:2409.13833  [pdf

    eess.SP

    Transfer Learning and Double U-Net Empowered Wave Propagation Model in Complex Indoor Environment

    Authors: Ziheng Fu, Swagato Mukherjee, Michael T. Lanagan, Prasenjit Mitra, Tarun Chawla, Ram M. Narayanan

    Abstract: A Machine Learning (ML) network based on transfer learning and transformer networks is applied to wave propagation models for complex indoor settings. This network is designed to predict signal propagation in environments with a variety of objects, effectively simulating the diverse range of furniture typically found in indoor spaces. We propose Attention U-Net with Efficient Networks as the backb… ▽ More

    Submitted 25 January, 2025; v1 submitted 20 September, 2024; originally announced September 2024.

  25. arXiv:2409.03427   

    astro-ph.IM astro-ph.HE hep-ex hep-ph

    The Giant Radio Array for Neutrino Detection (GRAND) Collaboration -- Contributions to the 10th International Workshop on Acoustic and Radio EeV Neutrino Detection Activities (ARENA 2024)

    Authors: Rafael Alves Batista, Aurélien Benoit-Lévy, Teresa Bister, Martina Bohacova, Mauricio Bustamante, Washington Carvalho, Yiren Chen, LingMei Cheng, Simon Chiche, Jean-Marc Colley, Pablo Correa, Nicoleta Cucu Laurenciu, Zigao Dai, Rogerio M. de Almeida, Beatriz de Errico, Sijbrand de Jong, João R. T. de Mello Neto, Krijn D de Vries, Valentin Decoene, Peter B. Denton, Bohao Duan, Kaikai Duan, Ralph Engel, William Erba, Yizhong Fan , et al. (100 additional authors not shown)

    Abstract: This is an index of the contributions by the Giant Radio Array for Neutrino Detection (GRAND) Collaboration to the 10th International Workshop on Acoustic and Radio EeV Neutrino Detection Activities (ARENA 2024, University of Chicago, June 11-14, 2024). The contributions include an overview of GRAND in its present and future incarnations, methods of radio-detection that are being developed for the… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: Note: To access the list of contributions, please follow the "HTML" link that can be found on the arXiv page

  26. Quantum Mechanics of a Spherically Symmetric Causal Diamond in Minkowski Spacetime

    Authors: Mathew W. Bub, Temple He, Prahar Mitra, Yiwen Zhang, Kathryn M. Zurek

    Abstract: We construct the phase space of a spherically symmetric causal diamond in $(d+2)$-dimensional Minkowski spacetime. Utilizing the covariant phase space formalism, we identify the relevant degrees of freedom that localize to the $d$-dimensional bifurcate horizon and, upon canonical quantization, determine their commutators. On this phase space, we find two Iyer-Wald charges. The first of these charg… ▽ More

    Submitted 25 March, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: 9 pages, 2 figures; v2: Added minor clarifications throughout, version to appear in PRL

    Report number: CALT-TH 2024-032

    Journal ref: Phys. Rev. Lett. 134, 121501 (2025)

  27. arXiv:2408.10926  [pdf, other

    astro-ph.IM hep-ex hep-ph

    GRANDlib: A simulation pipeline for the Giant Radio Array for Neutrino Detection (GRAND)

    Authors: GRAND Collaboration, Rafael Alves Batista, Aurélien Benoit-Lévy, Teresa Bister, Martina Bohacova, Mauricio Bustamante, Washington Carvalho, Yiren Chen, LingMei Cheng, Simon Chiche, Jean-Marc Colley, Pablo Correa, Nicoleta Cucu Laurenciu, Zigao Dai, Rogerio M. de Almeida, Beatriz de Errico, Sijbrand de Jong, João R. T. de Mello Neto, Krijn D. de Vries, Valentin Decoene, Peter B. Denton, Bohao Duan, Kaikai Duan, Ralph Engel, William Erba , et al. (90 additional authors not shown)

    Abstract: The operation of upcoming ultra-high-energy cosmic-ray, gamma-ray, and neutrino radio-detection experiments, like the Giant Radio Array for Neutrino Detection (GRAND), poses significant computational challenges involving the production of numerous simulations of particle showers and their detection, and a high data throughput. GRANDlib is an open-source software tool designed to meet these challen… ▽ More

    Submitted 11 December, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: 11 pages, 9 figures, plus appendices; Matches published version

    Journal ref: Computer Physics Communications, volume=308, pages=109461, issn=0010-4655 (2025)

  28. arXiv:2408.04362  [pdf, other

    cs.SD eess.AS

    NeuralMultiling: A Novel Neural Architecture Search for Smartphone based Multilingual Speaker Verification

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, K. Sreenivasa Rao, Pabitra Mitra

    Abstract: Multilingual speaker verification introduces the challenge of verifying a speaker in multiple languages. Existing systems were built using i-vector/x-vector approaches along with Bi-LSTMs, which were trained to discriminate speakers, irrespective of the language. Instead of exploring the design space manually, we propose a neural architecture search for multilingual speaker verification suitable f… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  29. arXiv:2407.17872  [pdf, other

    physics.ins-det astro-ph.CO

    The DAMIC-M Low Background Chamber

    Authors: I. Arnquist, N. Avalos, P. Bailly, D. Baxter, X. Bertou, M. Bogdan, C. Bourgeois, J. Brandt, A. Cadiou, N. Castello-Mor, A. E. Chavarria, M. Conde, J. Cuevas-Zepeda, A. Dastgheibi-Fard, C. De Dominicis, O. Deligny, R. Desani, M. Dhellot, J. Duarte-Campderros, E. Estrada, D. Florin, N. Gadola, R. Gaior, E. -L. Gkougkousis, J. Gonzalez Sanchez , et al. (44 additional authors not shown)

    Abstract: The DArk Matter In CCDs at Modane (DAMIC-M) experiment is designed to search for light dark matter (m$_χ$<10\,GeV/c$^2$) at the Laboratoire Souterrain de Modane (LSM) in France. DAMIC-M will use skipper charge-coupled devices (CCDs) as a kg-scale active detector target. Its single-electron resolution will enable eV-scale energy thresholds and thus world-leading sensitivity to a range of hidden sec… ▽ More

    Submitted 27 September, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Journal ref: 2024 JINST 19 T11010

  30. arXiv:2407.16672  [pdf, other

    cs.IT cs.ET cs.NI

    6G at $\frac{1}{6}g$: The Future of Cislunar Communications

    Authors: Sahan Liyanaarachchi, Stavros Mitrolaris, Purbesh Mitra, Sennur Ulukus

    Abstract: What will the future of cislunar communications be? The ever-expanding horizons of the space exploration missions, and the need for establishing sustainable space communication and navigation infrastructure necessitate to think this question thoroughly. In this article, we examine how some of the concepts of 6G technologies developed for terrestrial networks can be relevant in the context of cislu… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  31. arXiv:2407.05788  [pdf, other

    cs.LG cs.AI

    Automated Computational Energy Minimization of ML Algorithms using Constrained Bayesian Optimization

    Authors: Pallavi Mitra, Felix Biessmann

    Abstract: Bayesian optimization (BO) is an efficient framework for optimization of black-box objectives when function evaluations are costly and gradient information is not easily accessible. BO has been successfully applied to automate the task of hyperparameter optimization (HPO) in machine learning (ML) models with the primary objective of optimizing predictive performance on held-out data. In recent yea… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 13 pages

  32. arXiv:2406.13384  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Straight Through Gumbel Softmax Estimator based Bimodal Neural Architecture Search for Audio-Visual Deepfake Detection

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra, Vinod Rathod

    Abstract: Deepfakes are a major security risk for biometric authentication. This technology creates realistic fake videos that can impersonate real people, fooling systems that rely on facial features and voice patterns for identification. Existing multimodal deepfake detectors rely on conventional fusion methods, such as majority rule and ensemble voting, which often struggle to adapt to changing data char… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  33. arXiv:2406.11937  [pdf, other

    physics.ins-det hep-ex physics.data-an

    Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter

    Authors: M. Aamir, G. Adamov, T. Adams, C. Adloff, S. Afanasiev, C. Agrawal, C. Agrawal, A. Ahmad, H. A. Ahmed, S. Akbar, N. Akchurin, B. Akgul, B. Akgun, R. O. Akpinar, E. Aktas, A. Al Kadhim, V. Alexakhin, J. Alimena, J. Alison, A. Alpana, W. Alshehri, P. Alvarez Dominguez, M. Alyari, C. Amendola, R. B. Amir , et al. (550 additional authors not shown)

    Abstract: A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr… ▽ More

    Submitted 18 December, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Journal ref: JINST 19 (2024) P11025

  34. arXiv:2406.04478  [pdf, other

    cs.CL cs.LG

    PromptFix: Few-shot Backdoor Removal via Adversarial Prompt Tuning

    Authors: Tianrong Zhang, Zhaohan Xi, Ting Wang, Prasenjit Mitra, Jinghui Chen

    Abstract: Pre-trained language models (PLMs) have attracted enormous attention over the past few years with their unparalleled performances. Meanwhile, the soaring cost to train PLMs as well as their amazing generalizability have jointly contributed to few-shot fine-tuning and prompting as the most popular training paradigms for natural language processing (NLP) models. Nevertheless, existing studies have s… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: NAACL 2024

  35. arXiv:2405.20876  [pdf, other

    cs.CV cs.AI

    Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study

    Authors: Pallavi Mitra, Gesina Schwalbe, Nadja Klein

    Abstract: Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in many computer vision tasks. However, high computational and storage demands hinder their deployment into resource-constrained environments, such as embedded devices. Model pruning helps to meet these restrictions by reducing the model size, while maintaining superior performance. Meanwhile, safety-critical applicati… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 11 pages, 3 figures

  36. arXiv:2405.15442  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Precision Healthcare: Robust Fusion of Time Series and Image Data

    Authors: Ali Rasekh, Reza Heidari, Amir Hosein Haji Mohammad Rezaie, Parsa Sharifi Sedeh, Zahra Ahmadi, Prasenjit Mitra, Wolfgang Nejdl

    Abstract: With the increasing availability of diverse data types, particularly images and time series data from medical experiments, there is a growing demand for techniques designed to combine various modalities of data effectively. Our motivation comes from the important areas of predicting mortality and phenotyping where using different modalities of data could significantly improve our ability to predic… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  37. arXiv:2405.14023  [pdf, other

    cs.LG

    WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response

    Authors: Tianrong Zhang, Bochuan Cao, Yuanpu Cao, Lu Lin, Prasenjit Mitra, Jinghui Chen

    Abstract: The recent breakthrough in large language models (LLMs) such as ChatGPT has revolutionized production processes at an unprecedented pace. Alongside this progress also comes mounting concerns about LLMs' susceptibility to jailbreaking attacks, which leads to the generation of harmful or unsafe content. While safety alignment measures have been implemented in LLMs to mitigate existing jailbreak atte… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  38. Diamond of Infrared Equivalences in Abelian Gauge Theories

    Authors: Temple He, Prahar Mitra, Kathryn M. Zurek

    Abstract: We demonstrate a tree-level equivalence between four distinct infrared objects in $(d+2)$-dimensional abelian gauge theories. These are ($i$) the large gauge charge $Q_\varepsilon$ where the function $\varepsilon$ on the sphere parameterizing large gauge transformations is identified with the Goldstone mode $θ$ of spontaneously broken large gauge symmetry; ($ii$) the soft effective action that cap… ▽ More

    Submitted 26 November, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 8 pages, 1 figure; v2: Clarified discussion in Section III.C, minor clarification footnotes added, version to appear in PRD

    Report number: CALT-TH 2024-018

    Journal ref: Phys. Rev. D 110, 105018 (2024)

  39. arXiv:2405.06275  [pdf, other

    cs.CL

    Pruning as a Domain-specific LLM Extractor

    Authors: Nan Zhang, Yanchi Liu, Xujiang Zhao, Wei Cheng, Runxue Bao, Rui Zhang, Prasenjit Mitra, Haifeng Chen

    Abstract: Large Language Models (LLMs) have exhibited remarkable proficiency across a wide array of NLP tasks. However, the escalation in model size also engenders substantial deployment costs. While few efforts have explored model pruning techniques to reduce the size of LLMs, they mainly center on general or task-specific weights. This leads to suboptimal performance due to lacking specificity on the targ… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: NAACL 2024 Findings

  40. arXiv:2405.03439  [pdf, other

    cond-mat.mes-hall

    Anomalous Inverse Spin Hall Effect (AISHE) due to Unconventional Spin Currents in Ferromagnetic Films with Tailored Interfacial Magnetic Anisotropy

    Authors: Soumik Aon, Harekrishna Bhunia, Pratap Kumar Pal, Abu Bakkar Miah, Dhananjaya Mahapatra, Anjan Barman, Partha Mitra

    Abstract: A single layer ferromagnetic film magnetized in the plane of an ac current flow, exhibits a characteristic Hall voltage with harmonic and second harmonic components, which is attributed to the presence of spin currents with polarization non-collinear with the magnetization. A set of 30 nm thick permalloy (Py) films used in this study are deposited at an oblique angle with respect to the substrate… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  41. arXiv:2404.19749  [pdf, other

    cs.IT cs.LG cs.MA cs.NI eess.SP

    Scale-Robust Timely Asynchronous Decentralized Learning

    Authors: Purbesh Mitra, Sennur Ulukus

    Abstract: We consider an asynchronous decentralized learning system, which consists of a network of connected devices trying to learn a machine learning model without any centralized parameter server. The users in the network have their own local training data, which is used for learning across all the nodes in the network. The learning method consists of two processes, evolving simultaneously without any n… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  42. arXiv:2404.12679  [pdf, other

    cs.CV cs.CR

    MLSD-GAN -- Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra

    Abstract: Face-morphing attacks are a growing concern for biometric researchers, as they can be used to fool face recognition systems (FRS). These attacks can be generated at the image level (supervised) or representation level (unsupervised). Previous unsupervised morphing attacks have relied on generative adversarial networks (GANs). More recently, researchers have used linear interpolation of StyleGAN-en… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  43. WildGraph: Realistic Graph-based Trajectory Generation for Wildlife

    Authors: Ali Al-Lawati, Elsayed Eshra, Prasenjit Mitra

    Abstract: Trajectory generation is an important task in movement studies; it circumvents the privacy, ethical, and technical challenges of collecting real trajectories from the target population. In particular, real trajectories in the wildlife domain are scarce as a result of ethical and environmental constraints of the collection process. In this paper, we consider the problem of generating long-horizon t… ▽ More

    Submitted 7 February, 2025; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 12 pages, 7 figures, SIGSPATIAL '24

    Journal ref: SIGSPATIAL 2024: Proceedings of the 32nd ACM International Conference on Advances in Geographic Information System

  44. arXiv:2404.03934  [pdf, other

    cond-mat.mes-hall

    Direct Electrical Detection of Spin Chemical Potential Due to Spin Hall Effect in $β$-Tungsten and Platinum Using a Pair of Ferromagnetic and Normal Metal Voltage probes

    Authors: Soumik Aon, Abu Bakkar Miah, Arpita Mandal, Harekrishna Bhunia, Dhananjaya Mahapatra, Partha Mitra

    Abstract: The phenomenon of Spin Hall Effect (SHE) generates a pure spin current transverse to an applied current in materials with strong spin-orbit coupling, although not detectable through conventional electrical measurement. An intuitive Hall effect like measurement configuration is implemented to directly measure pure spin chemical potential of the accumulated spins at the edges of heavy metal (HM) cha… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  45. Information Security and Privacy in the Digital World: Some Selected Topics

    Authors: Jaydip Sen, Joceli Mayer, Subhasis Dasgupta, Subrata Nandi, Srinivasan Krishnaswamy, Pinaki Mitra, Mahendra Pratap Singh, Naga Prasanthi Kundeti, Chandra Sekhara Rao MVP, Sudha Sree Chekuri, Seshu Babu Pallapothu, Preethi Nanjundan, Jossy P. George, Abdelhadi El Allahi, Ilham Morino, Salma AIT Oussous, Siham Beloualid, Ahmed Tamtaoui, Abderrahim Bajit

    Abstract: In the era of generative artificial intelligence and the Internet of Things, while there is explosive growth in the volume of data and the associated need for processing, analysis, and storage, several new challenges are faced in identifying spurious and fake information and protecting the privacy of sensitive data. This has led to an increasing demand for more robust and resilient schemes for aut… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: Published by IntechOpen, London Uk in Nov 2023, the book contains 8 chapters spanning over 131 pages. arXiv admin note: text overlap with arXiv:2307.02055, arXiv:2304.00258

  46. arXiv:2403.15724  [pdf, other

    cs.CL cs.AI

    PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents

    Authors: Nan Zhang, Connor Heaton, Sean Timothy Okonsky, Prasenjit Mitra, Hilal Ezgi Toraman

    Abstract: Optical Character Recognition (OCR) is an established task with the objective of identifying the text present in an image. While many off-the-shelf OCR models exist, they are often trained for either scientific (e.g., formulae) or generic printed English text. Extracting text from chemistry publications requires an OCR model that is capable in both realms. Nougat, a recent tool, exhibits strong ab… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  47. An On-Shell Derivation of the Soft Effective Action in Abelian Gauge Theories

    Authors: Temple He, Prahar Mitra, Allic Sivaramakrishnan, Kathryn M. Zurek

    Abstract: We derive the soft effective action in $(d+2)$-dimensional abelian gauge theories from the on-shell action obeying Neumann boundary conditions at timelike and null infinity and Dirichlet boundary conditions at spatial infinity. This allows us to identify the on-shell degrees of freedom on the boundary with the soft modes living on the celestial sphere. Following the work of Donnelly and Wall, this… ▽ More

    Submitted 27 June, 2025; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 25 pages, version appearing in PRD; v2: Corrected typos in Appendix A

    Report number: CALT-TH 2024-012

  48. arXiv:2403.11100  [pdf, other

    cs.LG cs.CV cs.NE

    Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance

    Authors: Suryam Arnav Kalra, Arindam Biswas, Pabitra Mitra, Biswajit Basu

    Abstract: Expansion property of a graph refers to its strong connectivity as well as sparseness. It has been reported that deep neural networks can be pruned to a high degree of sparsity while maintaining their performance. Such pruning is essential for performing real time sequence learning tasks using recurrent neural networks in resource constrained platforms. We prune recurrent networks such as RNNs and… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted as tiny paper in ICLR 2024

    MSC Class: 05C68 ACM Class: I.2.6

  49. arXiv:2403.10141  [pdf, other

    cond-mat.mes-hall

    Anisotropic magneto-photothermal voltage in Sb2Te3 topological insulator thin films

    Authors: Subhadip Manna, Sambhu G Nath, Samrat Roy, Soumik Aon, Sayani Pal, Kanav Sharma, Dhananjaya Mahapatra, Partha Mitra, Sourin Das, Bipul Pal, Chiranjib Mitra

    Abstract: We studied longitudinal and Hall photothermal voltages under a planar magnetic field scan in epitaxial thin films of the Topological Insulator (TI) Sb2Te3, grown using pulsed laser deposition (PLD). Unlike prior research that utilised polarised light-induced photocurrent to investigate the TI, our study introduces advancements based on unpolarized light-induced local heating. This method yields a… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  50. arXiv:2403.04086  [pdf, other

    cs.LG

    Automated Multi-Task Learning for Joint Disease Prediction on Electronic Health Records

    Authors: Suhan Cui, Prasenjit Mitra

    Abstract: In the realm of big data and digital healthcare, Electronic Health Records (EHR) have become a rich source of information with the potential to improve patient care and medical research. In recent years, machine learning models have proliferated for analyzing EHR data to predict patients future health conditions. Among them, some studies advocate for multi-task learning (MTL) to jointly predict mu… ▽ More

    Submitted 8 October, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by NeurIPS 2024