Search | arXiv e-print repository

XRISM View of the Newly Detected Galactic Source MAXI J1744-294

Authors: Kaushik Chatterjee, Santanu Mondal, Biswaraj Palit, Chandra B. Singh, Sujoy Kumar Nath, Mayukh Pahari, Brajesh Kumar, Wei Wang, Hsiang-Kuang Chang, Xiaowei Liu

Abstract: The transient Galactic source MAXI J1744-294 went into an outburst in 2025 for the very first time. We study the spectral properties of this source during this outburst using archival data from the XRISM satellite for both of its Resolve and Xtend instruments. We have analyzed the source during one epoch, on March 03, 2025, or MJD 60737, on which XRISM data were available. Using both phenomenologi… ▽ More The transient Galactic source MAXI J1744-294 went into an outburst in 2025 for the very first time. We study the spectral properties of this source during this outburst using archival data from the XRISM satellite for both of its Resolve and Xtend instruments. We have analyzed the source during one epoch, on March 03, 2025, or MJD 60737, on which XRISM data were available. Using both phenomenological and physical model fitting approaches for continuum emissions, along with line emission and interstellar absorption models, we analyzed the spectral data in the broad 2-10 keV energy band. From our spectral analysis, we have found the existence of multiple iron lines, which are different components of the Fe XXV emission. These line complexes arise from two highly ionized plasmas with ionization rate ~ 10000 erg cm/s with distinct turbulent velocities: one broad (vturb ~ 2513 km/s) from hot gas at the inner accretion disk, and one narrow (vturb ~ 153 km/s ) scattered by nearby photoionized gas. The source is a moderately spinning black hole with a spin of 0.63-0.70, a mass of 5.7-10.1 Solar masses, and a disk inclination angle of 19-24 degrees. The spectral model fitted parameters suggest that the source is in the soft spectral state. The source is situated in a crowded field near the Galactic center, resulting in a very high hydrogen column density. △ Less

Submitted 7 July, 2025; v1 submitted 28 June, 2025; originally announced June 2025.

Comments: 10 Pages, 6 Figures, 2 Tables (Suggestions are welcome!)

arXiv:2506.22398 [pdf, ps, other]

Cost Effective Designs For Next-Generation Radio Telescopes

Authors: G B Raghavkrishna, Deeptangshu Banik, B Ravi Kumar, D Veeraswamy

Abstract: Radio astronomy has entered its golden era, with many revolutionary facilities such as SKA, ngVLA, and LOFAR2.0 coming online in the next decade. These facilities are certain to redefine radio astronomy. However, on smaller scales-such as at institutional or amateur levels-radio astronomy is still mostly practiced for educational or personal interest. The primary reason small-scale radio astronomi… ▽ More Radio astronomy has entered its golden era, with many revolutionary facilities such as SKA, ngVLA, and LOFAR2.0 coming online in the next decade. These facilities are certain to redefine radio astronomy. However, on smaller scales-such as at institutional or amateur levels-radio astronomy is still mostly practiced for educational or personal interest. The primary reason small-scale radio astronomical experiments rarely produce cutting-edge scientific results is the limitation of funding available for procuring or developing components similar to those used in professional-grade facilities. A second major reason is the lack of tools and skills required for simulating components for a complete telescope. In this work, we address the first of these challenges and suggest novel ideas and designs for cost-effective, next-generation radio telescopes that can be built at small scales. We also describe the observational strategies required to produce cutting-edge scientific results with these telescopes-results comparable to those from professional facilities. △ Less

Submitted 27 June, 2025; originally announced June 2025.

Comments: 14 pages, 3 figures

arXiv:2506.21543 [pdf, ps, other]

Detecting weighted hidden cliques

Authors: Urmisha Chatterjee, Karissa Huang, Ritabrata Karmakar, B. R. Vinay Kumar, Gábor Lugosi, Nandan Malhotra, Anirban Mandal, Maruf Alam Tarafdar

Abstract: We study a generalization of the classical hidden clique problem to graphs with real-valued edge weights. Formally, we define a hypothesis testing problem. Under the null hypothesis, edges of a complete graph on $n$ vertices are associated with independent and identically distributed edge weights from a distribution $P$. Under the alternate hypothesis, $k$ vertices are chosen at random and the edg… ▽ More We study a generalization of the classical hidden clique problem to graphs with real-valued edge weights. Formally, we define a hypothesis testing problem. Under the null hypothesis, edges of a complete graph on $n$ vertices are associated with independent and identically distributed edge weights from a distribution $P$. Under the alternate hypothesis, $k$ vertices are chosen at random and the edge weights between them are drawn from a distribution $Q$, while the remaining are sampled from $P$. The goal is to decide, upon observing the edge weights, which of the two hypotheses they were generated from. We investigate the problem under two different scenarios: (1) when $P$ and $Q$ are completely known, and (2) when there is only partial information of $P$ and $Q$. In the first scenario, we obtain statistical limits on $k$ when the two hypotheses are distinguishable, and when they are not. Additionally, in each of the scenarios, we provide bounds on the minimal risk of the hypothesis testing problem when $Q$ is not absolutely continuous with respect to $P$. We also provide computationally efficient spectral tests that can distinguish the two hypotheses as long as $k=Ω(\sqrt{n})$ in both the scenarios. △ Less

Submitted 26 June, 2025; originally announced June 2025.

MSC Class: 62F03

arXiv:2506.12103 [pdf, other]

The Amazon Nova Family of Models: Technical Report and Model Card

Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents and text. Amazon Nova Micro is a text-only model that delivers our lowest-latency responses at very low cost. Amazon Nova Canvas is an image generation model that creates professional grade images with rich customization controls. Amazon Nova Reel is a video generation model offering high-quality outputs, customization, and motion control. Our models were built responsibly and with a commitment to customer trust, security, and reliability. We report benchmarking results for core capabilities, agentic performance, long context, functional adaptation, runtime performance, and human evaluation. △ Less

Submitted 17 March, 2025; originally announced June 2025.

Comments: 48 pages, 10 figures

Report number: 20250317

arXiv:2506.11089 [pdf, ps, other]

Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM

Authors: Jeena Prakash, Blessingh Kumar, Kadri Hacioglu, Bidisha Sharma, Sindhuja Gopalan, Malolan Chetlur, Shankar Venkatesan, Andreas Stolcke

Abstract: Automatic speech recognition (ASR) models rely on high-quality transcribed data for effective training. Generating pseudo-labels for large unlabeled audio datasets often relies on complex pipelines that combine multiple ASR outputs through multi-stage processing, leading to error propagation, information loss and disjoint optimization. We propose a unified multi-ASR prompt-driven framework using p… ▽ More Automatic speech recognition (ASR) models rely on high-quality transcribed data for effective training. Generating pseudo-labels for large unlabeled audio datasets often relies on complex pipelines that combine multiple ASR outputs through multi-stage processing, leading to error propagation, information loss and disjoint optimization. We propose a unified multi-ASR prompt-driven framework using postprocessing by either textual or speech-based large language models (LLMs), replacing voting or other arbitration logic for reconciling the ensemble outputs. We perform a comparative study of multiple architectures with and without LLMs, showing significant improvements in transcription accuracy compared to traditional methods. Furthermore, we use the pseudo-labels generated by the various approaches to train semi-supervised ASR models for different datasets, again showing improved performance with textual and speechLLM transcriptions compared to baselines. △ Less

Submitted 5 June, 2025; originally announced June 2025.

arXiv:2505.24868 [pdf, other]

Consistent line clustering using geometric hypergraphs

Authors: Kalle Alaluusua, Konstantin Avrachenkov, B. R. Vinay Kumar, Lasse Leskelä

Abstract: Traditional data analysis often represents data as a weighted graph with pairwise similarities, but many problems do not naturally fit this framework. In line clustering, points in a Euclidean space must be grouped so that each cluster is well approximated by a line segment. Since any two points define a line, pairwise similarities fail to capture the structure of the problem, necessitating the us… ▽ More Traditional data analysis often represents data as a weighted graph with pairwise similarities, but many problems do not naturally fit this framework. In line clustering, points in a Euclidean space must be grouped so that each cluster is well approximated by a line segment. Since any two points define a line, pairwise similarities fail to capture the structure of the problem, necessitating the use of higher-order interactions modeled by geometric hypergraphs. We encode geometry into a 3-uniform hypergraph by treating sets of three points as hyperedges whenever they are approximately collinear. The resulting hypergraph contains information about the underlying line segments, which can then be extracted using community recovery algorithms. In contrast to classical hypergraph block models, latent geometric constraints in this construction introduce significant dependencies between hyperedges, which restricts the applicability of many standard theoretical tools. We aim to determine the fundamental limits of line clustering and evaluate hypergraph-based line clustering methods. To this end, we derive information-theoretic thresholds for exact and almost exact recovery for data generated from intersecting lines on a plane with additive Gaussian noise. We develop a polynomial-time spectral algorithm and show that it succeeds under noise conditions that match the information-theoretic bounds up to a polylogarithmic factor. △ Less

Submitted 30 May, 2025; originally announced May 2025.

Comments: 40 pages, 4 figures

MSC Class: 62H30; 62R10; 62C20; 05C65; 05C80; 62H12; 94A15; 90B15

arXiv:2505.21811 [pdf, ps, other]

Revisiting Self-attention for Cross-domain Sequential Recommendation

Authors: Clark Mingxuan Ju, Leonardo Neves, Bhuvesh Kumar, Liam Collins, Tong Zhao, Yuwei Qiu, Qing Dou, Sohail Nizam, Sen Yang, Neil Shah

Abstract: Sequential recommendation is a popular paradigm in modern recommender systems. In particular, one challenging problem in this space is cross-domain sequential recommendation (CDSR), which aims to predict future behaviors given user interactions across multiple domains. Existing CDSR frameworks are mostly built on the self-attention transformer and seek to improve by explicitly injecting additional… ▽ More Sequential recommendation is a popular paradigm in modern recommender systems. In particular, one challenging problem in this space is cross-domain sequential recommendation (CDSR), which aims to predict future behaviors given user interactions across multiple domains. Existing CDSR frameworks are mostly built on the self-attention transformer and seek to improve by explicitly injecting additional domain-specific components (e.g. domain-aware module blocks). While these additional components help, we argue they overlook the core self-attention module already present in the transformer, a naturally powerful tool to learn correlations among behaviors. In this work, we aim to improve the CDSR performance for simple models from a novel perspective of enhancing the self-attention. Specifically, we introduce a Pareto-optimal self-attention and formulate the cross-domain learning as a multi-objective problem, where we optimize the recommendation task while dynamically minimizing the cross-domain attention scores. Our approach automates knowledge transfer in CDSR (dubbed as AutoCDSR) -- it not only mitigates negative transfer but also encourages complementary knowledge exchange among auxiliary domains. Based on the idea, we further introduce AutoCDSR+, a more performant variant with slight additional cost. Our proposal is easy to implement and works as a plug-and-play module that can be incorporated into existing transformer-based recommenders. Besides flexibility, it is practical to deploy because it brings little extra computational overheads without heavy hyper-parameter tuning. AutoCDSR on average improves Recall@10 for SASRec and Bert4Rec by 9.8% and 16.0% and NDCG@10 by 12.0% and 16.7%, respectively. Code is available at https://github.com/snap-research/AutoCDSR. △ Less

Submitted 27 May, 2025; originally announced May 2025.

Comments: Accepted to KDD'25

arXiv:2505.19831 [pdf, ps, other]

SN 2024aecx: A double-peaked rapidly evolving Type IIb supernova at 11 Mpc

Authors: Xingzhu Zou, Brajesh Kumar, Rishabh Singh Teja, D. K. Sahu, Xinlei Chen, Avinash Singh, Weikang Lin, Xiangkun Liu, Dezi Liu, Hrishav Das, Mridweeka Singh, Yu Pan, Guowang Du, Helong Guo, Tao Wang, Xufeng Zhu, Jujia Zhang, Yuan Fang, Chenxu Liu, Kaushik Chatterjee, Yuan-Pei Yang, Liping Li, Qian Zhai, Edoardo P. Lagioia, Xueling Du , et al. (4 additional authors not shown)

Abstract: We present the results of low-resolution spectroscopic and densely sampled multiband simultaneous optical imaging ($ugi$ and $vrz$ bands) follow-up of supernova (SN) 2024aecx. The photometric data is supplemented with $Swift$/UVOT and ATLAS survey observations. The SN was discovered in the spiral galaxy NGC 3521 (distance $\sim$11 Mpc) within a day after the explosion. The early spectra of SN 2024… ▽ More We present the results of low-resolution spectroscopic and densely sampled multiband simultaneous optical imaging ($ugi$ and $vrz$ bands) follow-up of supernova (SN) 2024aecx. The photometric data is supplemented with $Swift$/UVOT and ATLAS survey observations. The SN was discovered in the spiral galaxy NGC 3521 (distance $\sim$11 Mpc) within a day after the explosion. The early spectra of SN 2024aecx show a weak signature of hydrogen lines, which disappeared in $\sim$30 days after the explosion. Light curves in all bands show a distinct feature of two peaks, and the first peak is likely due to the shock cooling emission. The early phase light curve evolution of SN 2024aecx has similarity with the typical Type IIb events, but the decay rate in different bands (e.g., $\rm Δm_{15}$ = 1.60 $\pm$ 0.05 mag, $g$-band) is significantly faster in the post-peak phase. It attained the secondary maximum in $\sim$19 days ($g$-band) with a peak absolute magnitude of M$_{g}$= -17.94 $\pm$ 0.10 mag. The color evolution of SN 2024aecx is displaying a red-blue-red trend between days $\sim$8 to 40. The analytical model fitting to the light curves reveals an envelope mass and progenitor radii in the range of $\sim$0.03 - 0.24 $M_\odot$ and $\sim$169 - 200 $R_\odot$, respectively. Modeling of the pseudo-bolometric light curve suggests that synthesized $^{56}$Ni in the explosion was $\sim$0.15 M$_{\odot}$ with ejecta mass and kinetic energy of $\sim$0.7 M$_{\odot}$ and $\sim$0.16 x 10$^{51}$ erg, respectively. The observational properties and modeling indicate that the SN 2024aecx progenitor belongs to the extended progenitor category. △ Less

Submitted 26 May, 2025; originally announced May 2025.

Comments: 22 pages, 14 figures, 4 tables, submitted

arXiv:2505.17332 [pdf, ps, other]

SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use

Authors: Hitesh Laxmichand Patel, Amit Agarwal, Arion Das, Bhargava Kumar, Srikant Panda, Priyaranjan Pattnayak, Taki Hasan Rafi, Tejaswini Kumar, Dong-Kyu Chae

Abstract: Enterprise customers are increasingly adopting Large Language Models (LLMs) for critical communication tasks, such as drafting emails, crafting sales pitches, and composing casual messages. Deploying such models across different regions requires them to understand diverse cultural and linguistic contexts and generate safe and respectful responses. For enterprise applications, it is crucial to miti… ▽ More Enterprise customers are increasingly adopting Large Language Models (LLMs) for critical communication tasks, such as drafting emails, crafting sales pitches, and composing casual messages. Deploying such models across different regions requires them to understand diverse cultural and linguistic contexts and generate safe and respectful responses. For enterprise applications, it is crucial to mitigate reputational risks, maintain trust, and ensure compliance by effectively identifying and handling unsafe or offensive language. To address this, we introduce SweEval, a benchmark simulating real-world scenarios with variations in tone (positive or negative) and context (formal or informal). The prompts explicitly instruct the model to include specific swear words while completing the task. This benchmark evaluates whether LLMs comply with or resist such inappropriate instructions and assesses their alignment with ethical frameworks, cultural nuances, and language comprehension capabilities. In order to advance research in building ethically aligned AI systems for enterprise use and beyond, we release the dataset and code: https://github.com/amitbcp/multilingual_profanity. △ Less

Submitted 22 May, 2025; originally announced May 2025.

Comments: Published in the Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2025), Industry Track, pages 558-582

ACM Class: I.2.7; I.2.6

arXiv:2505.15318 [pdf, ps, other]

Linear Convergence of Plug-and-Play Algorithms with Kernel Denoisers

Authors: Arghya Sinha, Bhartendu Kumar, Chirayu D. Athalye, Kunal N. Chaudhury

Abstract: The use of denoisers for image reconstruction has shown significant potential, especially for the Plug-and-Play (PnP) framework. In PnP, a powerful denoiser is used as an implicit regularizer in proximal algorithms such as ISTA and ADMM. The focus of this work is on the convergence of PnP iterates for linear inverse problems using kernel denoisers. It was shown in prior work that the update operat… ▽ More The use of denoisers for image reconstruction has shown significant potential, especially for the Plug-and-Play (PnP) framework. In PnP, a powerful denoiser is used as an implicit regularizer in proximal algorithms such as ISTA and ADMM. The focus of this work is on the convergence of PnP iterates for linear inverse problems using kernel denoisers. It was shown in prior work that the update operator in standard PnP is contractive for symmetric kernel denoisers under appropriate conditions on the denoiser and the linear forward operator. Consequently, we could establish global linear convergence of the iterates using the contraction mapping theorem. In this work, we develop a unified framework to establish global linear convergence for symmetric and nonsymmetric kernel denoisers. Additionally, we derive quantitative bounds on the contraction factor (convergence rate) for inpainting, deblurring, and superresolution. We present numerical results to validate our theoretical findings. △ Less

Submitted 21 May, 2025; originally announced May 2025.

Comments: 18 pages, 14 Figures

MSC Class: 94A08; 41A25; 65F10

arXiv:2505.10138 [pdf, ps, other]

Cislunar Mean-Motion Resonances: Definitions, Widths, and Comparisons with Resonant Satellites

Authors: Anjali Rawat, Bhanu Kumar, Aaron J. Rosengren, Shane D. Ross

Abstract: Lunar mean-motion resonances (MMRs) significantly shape cislunar dynamics beyond GEO, forming stable-unstable orbit pairs with corresponding intermingled chaotic and regular regions. The resonance zone is rigorously defined using the separatrix of unstable resonant periodic orbits surrounding stable quasi-periodic regions. Our study leverages the planar, circular, restricted three-body problem (PC… ▽ More Lunar mean-motion resonances (MMRs) significantly shape cislunar dynamics beyond GEO, forming stable-unstable orbit pairs with corresponding intermingled chaotic and regular regions. The resonance zone is rigorously defined using the separatrix of unstable resonant periodic orbits surrounding stable quasi-periodic regions. Our study leverages the planar, circular, restricted three-body problem (PCR3BP) to estimate the (stable) resonance widths and (unstable) chaotic resonance zones of influence of the 2:1 and 3:1 MMRs across various Jacobi constants, employing a Poincaré map at perigee and presenting findings in easily interpretable geocentric orbital elements. An analysis of the semi-major axis versus eccentricity plane reveals broader regions of resonance influence than those predicted by semi-analytical models based on the perturbed Kepler problem. A comparison with high-fidelity 3-dimensional ephemeris propagation of several spacecraft - TESS, IBEX, and Spektr-R - in these regions is made, which shows good agreement with the simplified CR3BP model. △ Less

Submitted 15 May, 2025; originally announced May 2025.

Comments: Submitted to The Journal of Guidance, Control, and Dynamics

arXiv:2505.05812 [pdf]

Towards order of magnitude X-ray dose reduction in breast cancer imaging using phase contrast and deep denoising

Authors: Ashkan Pakzad, Robert Turnbull, Simon J. Mutch, Thomas A. Leatham, Darren Lockie, Jane Fox, Beena Kumar, Daniel Häsermann, Christopher J. Hall, Anton Maksimenko, Benedicta D. Arhatari, Yakov I. Nesterets, Amir Entezam, Seyedamir T. Taba, Patrick C. Brennan, Timur E. Gureyev, Harry M. Quiney

Abstract: Breast cancer is the most frequently diagnosed human cancer in the United States at present. Early detection is crucial for its successful treatment. X-ray mammography and digital breast tomosynthesis are currently the main methods for breast cancer screening. However, both have known limitations in terms of their sensitivity and specificity to breast cancers, while also frequently causing patient… ▽ More Breast cancer is the most frequently diagnosed human cancer in the United States at present. Early detection is crucial for its successful treatment. X-ray mammography and digital breast tomosynthesis are currently the main methods for breast cancer screening. However, both have known limitations in terms of their sensitivity and specificity to breast cancers, while also frequently causing patient discomfort due to the requirement for breast compression. Breast computed tomography is a promising alternative, however, to obtain high-quality images, the X-ray dose needs to be sufficiently high. As the breast is highly radiosensitive, dose reduction is particularly important. Phase-contrast computed tomography (PCT) has been shown to produce higher-quality images at lower doses and has no need for breast compression. It is demonstrated in the present study that, when imaging full fresh mastectomy samples with PCT, deep learning-based image denoising can further reduce the radiation dose by a factor of 16 or more, without any loss of image quality. The image quality has been assessed both in terms of objective metrics, such as spatial resolution and contrast-to-noise ratio, as well as in an observer study by experienced medical imaging specialists and radiologists. This work was carried out in preparation for live patient PCT breast cancer imaging, initially at specialized synchrotron facilities. △ Less

Submitted 9 May, 2025; originally announced May 2025.

Comments: 16 pages, 3 figures, 1 table

arXiv:2504.21838 [pdf, ps, other]

Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat

Authors: Clark Mingxuan Ju, Leonardo Neves, Bhuvesh Kumar, Liam Collins, Tong Zhao, Yuwei Qiu, Qing Dou, Yang Zhou, Sohail Nizam, Rengim Ozturk, Yvette Liu, Sen Yang, Manish Malik, Neil Shah

Abstract: The development of powerful user representations is a key factor in the success of recommender systems (RecSys). Online platforms employ a range of RecSys techniques to personalize user experience across diverse in-app surfaces. User representations are often learned individually through user's historical interactions within each surface and user representations across different surfaces can be sh… ▽ More The development of powerful user representations is a key factor in the success of recommender systems (RecSys). Online platforms employ a range of RecSys techniques to personalize user experience across diverse in-app surfaces. User representations are often learned individually through user's historical interactions within each surface and user representations across different surfaces can be shared post-hoc as auxiliary features or additional retrieval sources. While effective, such schemes cannot directly encode collaborative filtering signals across different surfaces, hindering its capacity to discover complex relationships between user behaviors and preferences across the whole platform. To bridge this gap at Snapchat, we seek to conduct universal user modeling (UUM) across different in-app surfaces, learning general-purpose user representations which encode behaviors across surfaces. Instead of replacing domain-specific representations, UUM representations capture cross-domain trends, enriching existing representations with complementary information. This work discusses our efforts in developing initial UUM versions, practical challenges, technical choices and modeling and research directions with promising offline performance. Following successful A/B testing, UUM representations have been launched in production, powering multiple use cases and demonstrating their value. UUM embedding has been incorporated into (i) Long-form Video embedding-based retrieval, leading to 2.78% increase in Long-form Video Open Rate, (ii) Long-form Video L2 ranking, with 19.2% increase in Long-form Video View Time sum, (iii) Lens L2 ranking, leading to 1.76% increase in Lens play time, and (iv) Notification L2 ranking, with 0.87% increase in Notification Open Rate. △ Less

Submitted 9 June, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

Comments: Accepted to the industrial track of SIGIR'25

arXiv:2504.21701 [pdf, other]

Leptogenesis, $0νββ$ and lepton flavor violation in modular left-right asymmetric model with polyharmonic $Maaβ$ forms

Authors: Bhabana Kumar, Mrinal Kumar Das

Abstract: In the absence of supersymmetry, modular forms need not be holomorphic functions of the modulus $τ$. Using this idea, we construct a non-supersymmetric framework using polyharmonic $Maaβ$ forms. In this approach, the Yukawa coupling is no longer strictly holomorphic in $τ$ but instead incorporates both holomorphic and non-holomorphic components. We realize a non-supersymmetric, left-right asymmetr… ▽ More In the absence of supersymmetry, modular forms need not be holomorphic functions of the modulus $τ$. Using this idea, we construct a non-supersymmetric framework using polyharmonic $Maaβ$ forms. In this approach, the Yukawa coupling is no longer strictly holomorphic in $τ$ but instead incorporates both holomorphic and non-holomorphic components. We realize a non-supersymmetric, left-right asymmetric model based on the $Γ_3$ modular group, where the active neutrino masses are generated via an extended inverse seesaw mechanism. The model successfully predicts the sum of neutrino masses below the current experimental bound and accommodates neutrino mixing angles within the $3σ$ range. Given its strong predictive power in neutrino oscillation parameters, we further explore its implications for beyond Standard Model (BSM) phenomena, including neutrinoless double beta ($0νββ$) decay, lepton flavor violation (LFV), and baryogenesis via leptogenesis (BAU). Our findings indicate that the model predicts an effective Majorana mass and LFV branching ratios consistent with experimental constraints while also providing a viable explanation for the observed baryon asymmetry through resonant leptogenesis. △ Less

Submitted 30 April, 2025; originally announced April 2025.

Comments: 27 pages and 30 figures

arXiv:2504.17034 [pdf, other]

An extremely soft and weak fast X-ray transient associated with a luminous supernova

Authors: W. -X. Li, Z. -P. Zhu, X. -Z. Zou, J. -J. Geng, L. -D. Liu, Y. -H. Wang, R. -Z. Li, D. Xu, H. Sun, X. -F. Wang, Y. -W. Yu, B. Zhang, X. -F. Wu, Y. Yang, A. V. Filippenko, X. -W. Liu, W. -M. Yuan, D. Aguado, J. An, T. An, D. A. H. Buckley, A. J. Castro-Tirado, S. -Y. Fu, J. P. U. Fynbo, D. A. Howell , et al. (80 additional authors not shown)

Abstract: Long gamma-ray bursts (LGRBs), including their subclasses of low-luminosity GRBs (LL-GRBs) and X-ray flashes (XRFs) characterized by low spectral peak energies, are known to be associated with broad-lined Type Ic supernovae (SNe Ic-BL), which result from the core collapse of massive stars that lose their outer hydrogen and helium envelopes. However, the soft and weak end of the GRB/XRF population… ▽ More Long gamma-ray bursts (LGRBs), including their subclasses of low-luminosity GRBs (LL-GRBs) and X-ray flashes (XRFs) characterized by low spectral peak energies, are known to be associated with broad-lined Type Ic supernovae (SNe Ic-BL), which result from the core collapse of massive stars that lose their outer hydrogen and helium envelopes. However, the soft and weak end of the GRB/XRF population remains largely unexplored, due to the limited sensitivity to soft X-ray emission. Here we report the discovery of a fast X-ray transient, EP250108a, detected by the Einstein Probe (EP) in the soft X-ray band at redshift $z = 0.176$, which was followed up by extensive multiband observations. EP250108a shares similar X-ray luminosity as XRF\,060218, the prototype of XRFs, but it extends GRBs/XRFs down to the unprecedentedly soft and weak regimes, with its $E_{\rm peak} \lesssim 1.8\,\mathrm{keV}$ and $E_{\rm iso} \lesssim 10^{49}\, \mathrm{erg}$, respectively. Meanwhile, EP250108a is found to be associated with SN\,2025kg, one of the most luminous and possibly magnetar-powered SNe Ic-BL detected so far. Modeling of the well-sampled optical light curves favors a mildly relativistic outflow as the origin of this event. This discovery demonstrates that EP, with its unique capability, is opening a new observational window into the diverse outcomes of death of massive stars. △ Less

Submitted 23 April, 2025; originally announced April 2025.

Comments: 54 pages, 10 figures, submitted

arXiv:2503.15805 [pdf, ps, other]

Multiwavelength Analysis of GRB 250101A: From Gamma-ray Prompt Emission to Optical Afterglow

Authors: Guowang Du, Yehao Cheng, Yuan-Pei Yang, Jun Yang, Jinghua Zhang, Dan Zhu, Yu Pan, Yuan Fang, Xingzhu Zou, Brajesh Kumar, Helong Guo, Xufeng Zhu, Yangwei Zhang, Fanchuan Kong, Chenxi Shang, Xinlei Chen, Xiangkun Liu, Xiaowei Liu

Abstract: The interaction between the relativistic jet and the circumburst medium produces a multiwavelength afterglow of a gamma-ray burst (GRBs). In this work, we present multiwavelength properties of GRB~250101A based on the observations of Swift, Fermi and Mephisto. The spectral analysis of Swift/BAT and Fermi/GBM reveals a soft prompt spectrum with a low-energy photon index of $-1.18$ and a peak energy… ▽ More The interaction between the relativistic jet and the circumburst medium produces a multiwavelength afterglow of a gamma-ray burst (GRBs). In this work, we present multiwavelength properties of GRB~250101A based on the observations of Swift, Fermi and Mephisto. The spectral analysis of Swift/BAT and Fermi/GBM reveals a soft prompt spectrum with a low-energy photon index of $-1.18$ and a peak energy of 33 keV, and the isotropic energy is $1.4\times10^{52}~{\rm erg}$. The prompt emission of GRB 250101A aligns with Type II GRBs in the Amati relation. Meanwhile, our analysis indicates that GRB 250101A is an X-ray-rich or X-ray-dominated GRB, with intrinsic properties suggesting that it is relatively softer than most classical GRBs. Optical observation with Mephisto, beginning 197 s post-trigger, shows a single power-law decay in $uvgriz$ bands, with $F_{ν,\mathrm{obs}} \propto t^{-0.76} ν^{-1.21}$. The observed spectral index significantly exceeds theoretical predictions under standard afterglow models, suggesting a color excess of $\sim0.216$ mag. However, combining X-ray and optical afterglow, we find that GRB 250101A is more likely a ``normal burst'' rather than an ``optical-dark burst'', and the dust extinction effect plays an important role in the optical blue bands. Furthermore, there is a structural change at $T_0+2924$ s in the optical light curve, indicating a density drop of $\sim50$ \% in the interstellar medium at a distance of $\sim0.13~{\rm pc}$. Our analysis shows that this GRB clearly shows some unique characteristics in its observed X-ray rich prompt emission as well as the circumburst environment, implying a special progenitor. △ Less

Submitted 26 June, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

Comments: 20 pages, 10 figures, 3 tables. Accepted for publication in ApJ

arXiv:2503.14095 [pdf, other]

Towards Location-Specific Precipitation Projections Using Deep Neural Networks

Authors: Bipin Kumar, Bhvisy Kumar Yadav, Soumypdeep Mukhopadhyay, Rakshit Rohan, Bhupendra Bahadur Singh, Rajib Chattopadhyay, Nagraju Chilukoti, Atul Kumar Sahai

Abstract: Accurate precipitation estimates at individual locations are crucial for weather forecasting and spatial analysis. This study presents a paradigm shift by leveraging Deep Neural Networks (DNNs) to surpass traditional methods like Kriging for station-specific precipitation approximation. We propose two innovative NN architectures: one utilizing precipitation, elevation, and location, and another in… ▽ More Accurate precipitation estimates at individual locations are crucial for weather forecasting and spatial analysis. This study presents a paradigm shift by leveraging Deep Neural Networks (DNNs) to surpass traditional methods like Kriging for station-specific precipitation approximation. We propose two innovative NN architectures: one utilizing precipitation, elevation, and location, and another incorporating additional meteorological parameters like humidity, temperature, and wind speed. Trained on a vast dataset (1980-2019), these models outperform Kriging across various evaluation metrics (correlation coefficient, root mean square error, bias, and skill score) on a five-year validation set. This compelling evidence demonstrates the transformative power of deep learning for spatial prediction, offering a robust and precise alternative for station-specific precipitation estimation. △ Less

Submitted 18 March, 2025; originally announced March 2025.

Comments: 21 pages, 9 figures

arXiv:2503.09386 [pdf, ps, other]

Optimal control of fractional Poisson equation from non-local to local

Authors: Ram Manohar, Kedarnath Buda, B. V. Rathish Kumar

Abstract: In this article, the limiting behavior of the solution $\bar u_s$ of the optimal control problem subjected to the fractional Poisson equation $$(-Δ)^s u_s(x)=f_s(x), \quad x\in Ω$$ defined on domain $Ω$ bounded by smooth boundary with zero exterior boundary conditions $u_s(x)\equiv 0, \quad x \in Ω^c $ is established. We will prove that $\lim_{s\to 1^-} \bar u_s= \bar u$, where $\bar u$ is a solut… ▽ More In this article, the limiting behavior of the solution $\bar u_s$ of the optimal control problem subjected to the fractional Poisson equation $$(-Δ)^s u_s(x)=f_s(x), \quad x\in Ω$$ defined on domain $Ω$ bounded by smooth boundary with zero exterior boundary conditions $u_s(x)\equiv 0, \quad x \in Ω^c $ is established. We will prove that $\lim_{s\to 1^-} \bar u_s= \bar u$, where $\bar u$ is a solution of the optimal control problem subjected to classical Poisson equation $-Δu(x)=f(x), \quad x \in Ω$ and $u(x)=0, \quad x\in \partial Ω.$ △ Less

Submitted 12 March, 2025; originally announced March 2025.

Comments: 10 Pages, 3 Authors

MSC Class: $65\mathrm{N}30$; $65\mathrm{N}50$; $49\mathrm{J}20$; $65\mathrm{K}10$

arXiv:2503.05742 [pdf, other]

Adaptive SIPG method for approximations of boundary control problems governed by parabolic PDEs

Authors: Ram Manohar, B. V. Rathish Kumar, Kedarnath Buda, Rajen Kumar Sinha

Abstract: This study presents an aposteriori error analysis of adaptive finite element approximations of parabolic boundary control problems with bilateral box constraints that act on a Neumann boundary. The control problem is discretized using the symmetric interior penalty Galerkin (SIPG) technique. We derive both reliable and efficient type residual-based error estimators coupling with the data oscillati… ▽ More This study presents an aposteriori error analysis of adaptive finite element approximations of parabolic boundary control problems with bilateral box constraints that act on a Neumann boundary. The control problem is discretized using the symmetric interior penalty Galerkin (SIPG) technique. We derive both reliable and efficient type residual-based error estimators coupling with the data oscillations. The implementation of these error estimators serves as a guide for the adaptive mesh refinement process, indicating whether or not more refinement is required. Although the control error estimator effectively captured control approximation errors, it had limitations in guiding refinement localization in critical cases. To overcome this, an alternative control indicator was used in numerical tests. The results demonstrated the clear superiority of adaptive refinements over uniform refinements, confirming the proposed approach's effectiveness in achieving accurate solutions while optimizing computational efficiency. numerical experiment showcases the effectiveness of the derived error estimators. △ Less

Submitted 20 February, 2025; originally announced March 2025.

Comments: 41 pages, 26 Figures, 4 Authors

MSC Class: $65N30$; $65N50$; $49J20$; $65K10$

arXiv:2503.01872 [pdf, other]

FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance

Authors: Mintong Kang, Vinayshekhar Bannihatti Kumar, Shamik Roy, Abhishek Kumar, Sopan Khosla, Balakrishnan Murali Narayanaswamy, Rashmi Gangadharaiah

Abstract: Text-to-image diffusion models often exhibit biases toward specific demographic groups, such as generating more males than females when prompted to generate images of engineers, raising ethical concerns and limiting their adoption. In this paper, we tackle the challenge of mitigating generation bias towards any target attribute value (e.g., "male" for "gender") in diffusion models while preserving… ▽ More Text-to-image diffusion models often exhibit biases toward specific demographic groups, such as generating more males than females when prompted to generate images of engineers, raising ethical concerns and limiting their adoption. In this paper, we tackle the challenge of mitigating generation bias towards any target attribute value (e.g., "male" for "gender") in diffusion models while preserving generation quality. We propose FairGen, an adaptive latent guidance mechanism which controls the generation distribution during inference. In FairGen, a latent guidance module dynamically adjusts the diffusion process to enforce specific attributes, while a memory module tracks the generation statistics and steers latent guidance to align with the targeted fair distribution of the attribute values. Further, given the limitations of existing datasets in comprehensively assessing bias in diffusion models, we introduce a holistic bias evaluation benchmark HBE, covering diverse domains and incorporating complex prompts across various applications. Extensive evaluations on HBE and Stable Bias datasets demonstrate that FairGen outperforms existing bias mitigation approaches, achieving substantial bias reduction (e.g., 68.5% gender bias reduction on Stable Diffusion 2). Ablation studies highlight FairGen's ability to flexibly and precisely control generation distribution at any user-specified granularity, ensuring adaptive and targeted bias mitigation. △ Less

Submitted 25 February, 2025; originally announced March 2025.

Comments: Under submission

arXiv:2502.13108 [pdf, other]

Clinical QA 2.0: Multi-Task Learning for Answer Extraction and Categorization

Authors: Priyaranjan Pattnayak, Hitesh Laxmichand Patel, Amit Agarwal, Bhargava Kumar, Srikant Panda, Tejaswini Kumar

Abstract: Clinical Question Answering (CQA) plays a crucial role in medical decision-making, enabling physicians to extract relevant information from Electronic Medical Records (EMRs). While transformer-based models such as BERT, BioBERT, and ClinicalBERT have demonstrated state-of-the-art performance in CQA, existing models lack the ability to categorize extracted answers, which is critical for structured… ▽ More Clinical Question Answering (CQA) plays a crucial role in medical decision-making, enabling physicians to extract relevant information from Electronic Medical Records (EMRs). While transformer-based models such as BERT, BioBERT, and ClinicalBERT have demonstrated state-of-the-art performance in CQA, existing models lack the ability to categorize extracted answers, which is critical for structured retrieval, content filtering, and medical decision support. To address this limitation, we introduce a Multi-Task Learning (MTL) framework that jointly trains CQA models for both answer extraction and medical categorization. In addition to predicting answer spans, our model classifies responses into five standardized medical categories: Diagnosis, Medication, Symptoms, Procedure, and Lab Reports. This categorization enables more structured and interpretable outputs, making clinical QA models more useful in real-world healthcare settings. We evaluate our approach on emrQA, a large-scale dataset for medical question answering. Results show that MTL improves F1-score by 2.2% compared to standard fine-tuning, while achieving 90.7% accuracy in answer categorization. These findings suggest that MTL not only enhances CQA performance but also introduces an effective mechanism for categorization and structured medical information retrieval. △ Less

Submitted 23 April, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

arXiv:2502.12723 [pdf, other]

myEye2Wheeler: A Two-Wheeler Indian Driver Real-World Eye-Tracking Dataset

Authors: Bhaiya Vaibhaw Kumar, Deepti Rawat, Tanvi Kandalla, Aarnav Nagariya, Kavita Vemuri

Abstract: This paper presents the myEye2Wheeler dataset, a unique resource of real-world gaze behaviour of two-wheeler drivers navigating complex Indian traffic. Most datasets are from four-wheeler drivers on well-planned roads and homogeneous traffic. Our dataset offers a critical lens into the unique visual attention patterns and insights into the decision-making of Indian two-wheeler drivers. The analysi… ▽ More This paper presents the myEye2Wheeler dataset, a unique resource of real-world gaze behaviour of two-wheeler drivers navigating complex Indian traffic. Most datasets are from four-wheeler drivers on well-planned roads and homogeneous traffic. Our dataset offers a critical lens into the unique visual attention patterns and insights into the decision-making of Indian two-wheeler drivers. The analysis demonstrates that existing saliency models, like TASED-Net, perform less effectively on the myEye-2Wheeler dataset compared to when applied on the European 4-wheeler eye tracking datasets (DR(Eye)VE), highlighting the need for models specifically tailored to the traffic conditions. By introducing the dataset, we not only fill a significant gap in two-wheeler driver behaviour research in India but also emphasise the critical need for developing context-specific saliency models. The larger aim is to improve road safety for two-wheeler users and lane-planning to support a cost-effective mode of transport. △ Less

Submitted 18 February, 2025; originally announced February 2025.

arXiv:2502.00564 [pdf, other]

doi 10.1051/0004-6361/202452667

The 4m International Liquid Mirror Telescope: Construction, operation, and science

Authors: Jean Surdej, Paul Hickson, Kuntal Misra, Dipankar Banerjee, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Sara Filali, Joschua Hellemeier, Manisha Kharayat, Brajesh Kumar, Hitesh Kumar, Mukesh Kumar, T. S. Kumar, Priyanshi Kumari, Vibhore Negi, Anna Pospieszalska-Surdej, Sarath Prabhavu, Bikram Pradhan, Kumar Pranshu, Himanshu Rawat, B. Krishna Reddy, Arun Sasidharan Pillai , et al. (4 additional authors not shown)

Abstract: The International Liquid Mirror Telescope (ILMT) project was motivated by the need for an inexpensive 4 metre diameter optical telescope that could be devoted entirely to astronomical surveys. Its scientific programmes include the detection and study of transients, variable objects, asteroids, comets, space debris and low surface brightness galaxies. To this end, a collaboration was formed between… ▽ More The International Liquid Mirror Telescope (ILMT) project was motivated by the need for an inexpensive 4 metre diameter optical telescope that could be devoted entirely to astronomical surveys. Its scientific programmes include the detection and study of transients, variable objects, asteroids, comets, space debris and low surface brightness galaxies. To this end, a collaboration was formed between the Institute of Astrophysics and Geophysics (Liège University, Belgium), several Canadian universities (University of British Columbia, Laval University, University of Montreal, University of Toronto, York University, University of Victoria) and the Aryabhatta Research Institute of Observational Sciences (ARIES, India). After several years of design work in Belgium and construction in India on the ARIES Devasthal site, the telescope saw its first light on 29 April 2022. Its commissioning phase lasted from May 2022 until June 2023 (beginning of the monsoon). The ILMT was inaugurated on 21 March 2023 and has been in regular operation since October 2023. The telescope continuously observes the sky passing at the zenith using the SDSS g', r', and i' filters. This paper describes the ILMT, its operation, performance and shows some initial results. △ Less

Submitted 1 February, 2025; originally announced February 2025.

Comments: 12 pages, 18 figures, accepted for publication in Astronomy & Astrophysics

arXiv:2502.00556 [pdf, other]

PyLMT: A transient detection pipeline for the 4-m International Liquid Mirror Telescope

Authors: Kumar Pranshu, Kuntal Misra, Bhavya Ailawadhi, Monalisa Dubey, Naveen Dukiya, Sara Filali, Paul Hickson, Brajesh Kumar, Vibhore Negi, Jean Surdej

Abstract: The International Liquid Mirror Telescope (ILMT) is a 4-m aperture, zenith-pointing telescope with a field-of-view of 22', situated in the foothills of the Himalayas. The telescope operates in continuous survey mode, making it a useful instrument for time-domain astronomy, particularly for detecting transients, variable stars, active galactic nuclei variability, and asteroids. This paper presents… ▽ More The International Liquid Mirror Telescope (ILMT) is a 4-m aperture, zenith-pointing telescope with a field-of-view of 22', situated in the foothills of the Himalayas. The telescope operates in continuous survey mode, making it a useful instrument for time-domain astronomy, particularly for detecting transients, variable stars, active galactic nuclei variability, and asteroids. This paper presents the PyLMT transient detection pipeline to detect such transient/varying sources in the ILMT images. The pipeline utilises the image subtraction technique to compare a pair of images from the same field, identifying such sources in subtracted images with the help of convolutional neural networks (CNN) based real/bogus classifiers. The test accuracies determined for the real/bogus classifiers ranged from 94% to 98%. The resulting precision of the pipeline calculated over candidate alerts in the ILMT frames is 0.91. It also houses a CNN-aided transient candidate classifier that classifies the transient/variable candidates based on host morphology. The test accuracy of the candidate classifier is 98.6%. It has the provision to identify catalogued asteroids and other solar system objects using public databases. The median execution time of the pipeline is approximately 29 minutes per image of 17 minutes exposure. Relevant CNNs have been trained on data acquired with the ILMT during the cycle of October-November 2022. Subsequent tests on those images have confirmed the detection of numerous catalogued asteroids, variable stars, and other uncatalogued sources. The pipeline has been operational and has detected 12 extragalactic transients, including 2 new discoveries in the November 2023-May 2024 observation cycle. △ Less

Submitted 1 February, 2025; originally announced February 2025.

Comments: 21 pages, 26 figures, accepted for publication in MNRAS

arXiv:2412.21097 [pdf, ps, other]

Effects of asymmetric dark matter on a magnetized neutron star: A two-fluid approach

Authors: Pinku Routaray, Vishal Parmar, H. C. Das, Bharat Kumar, G. F. Burgio, H. -J. Schulze

Abstract: We study the interaction between dark matter (DM) and highly magnetized neutron stars (NSs), focusing on how DM particle mass, mass fraction, and magnetic field (MF) strength affect NS structure and stability. We consider self-interacting, nonannihilating, asymmetric fermionic DM that couples to NSs only through gravitational interaction. Using the Quantum Monte Carlo Relativistic Mean Field (QMC-… ▽ More We study the interaction between dark matter (DM) and highly magnetized neutron stars (NSs), focusing on how DM particle mass, mass fraction, and magnetic field (MF) strength affect NS structure and stability. We consider self-interacting, nonannihilating, asymmetric fermionic DM that couples to NSs only through gravitational interaction. Using the Quantum Monte Carlo Relativistic Mean Field (QMC-RMF4) model with density-dependent magnetic fields, we investigate the magnetized equation of state and examine the accumulation of DM under various conditions. Our results show that as the DM fraction increases, the maximum gravitational mass of the NS decreases, especially for heavier DM particles, while lighter DM particles can induce a transition from a dark core to a halo structure, increasing the maximum mass. Strong MFs soften the equation of state and reduce the dark mass a NS core can retain before transitioning to a halo. By comparing our results with observations from Neutro Star Interior Composition Explorer and GW170817, we identify the possible range of DM parameters for these objects. We find that the magnetic field slightly changes these limits, mainly affecting the maximum NS mass and tidal deformability. These findings provide key insights into how DM and MF jointly shape the mass-radius relation and the stability of DM-admixed magnetized NSs. △ Less

Submitted 3 June, 2025; v1 submitted 30 December, 2024; originally announced December 2024.

Comments: PHYSICAL REVIEW D 111, 103045 (2025)

arXiv:2412.20602 [pdf]

NLP-based Regulatory Compliance -- Using GPT 4.0 to Decode Regulatory Documents

Authors: Bimal Kumar, Dmitri Roussinov

Abstract: Large Language Models (LLMs) such as GPT-4.0 have shown significant promise in addressing the semantic complexities of regulatory documents, particularly in detecting inconsistencies and contradictions. This study evaluates GPT-4.0's ability to identify conflicts within regulatory requirements by analyzing a curated corpus with artificially injected ambiguities and contradictions, designed in coll… ▽ More Large Language Models (LLMs) such as GPT-4.0 have shown significant promise in addressing the semantic complexities of regulatory documents, particularly in detecting inconsistencies and contradictions. This study evaluates GPT-4.0's ability to identify conflicts within regulatory requirements by analyzing a curated corpus with artificially injected ambiguities and contradictions, designed in collaboration with architects and compliance engineers. Using metrics such as precision, recall, and F1 score, the experiment demonstrates GPT-4.0's effectiveness in detecting inconsistencies, with findings validated by human experts. The results highlight the potential of LLMs to enhance regulatory compliance processes, though further testing with larger datasets and domain-specific fine-tuning is needed to maximize accuracy and practical applicability. Future work will explore automated conflict resolution and real-world implementation through pilot projects with industry partners. △ Less

Submitted 29 December, 2024; originally announced December 2024.

Comments: accepted for presentation at Georg Nemetschek Institute Symposium & Expo on Artificial Intelligence for the Built World - Munich, Germany. 12 Sept 2024

arXiv:2412.19794 [pdf, ps, other]

MVTamperBench: Evaluating Robustness of Vision-Language Models

Authors: Amit Agarwal, Srikant Panda, Angeline Charles, Bhargava Kumar, Hitesh Patel, Priyaranjan Pattnayak, Taki Hasan Rafi, Tejaswini Kumar, Hansa Meghwani, Karan Gupta, Dong-Kyu Chae

Abstract: Multimodal Large Language Models (MLLMs), are recent advancement of Vision-Language Models (VLMs) that have driven major advances in video understanding. However, their vulnerability to adversarial tampering and manipulations remains underexplored. To address this gap, we introduce \textbf{MVTamperBench}, a benchmark that systematically evaluates MLLM robustness against five prevalent tampering te… ▽ More Multimodal Large Language Models (MLLMs), are recent advancement of Vision-Language Models (VLMs) that have driven major advances in video understanding. However, their vulnerability to adversarial tampering and manipulations remains underexplored. To address this gap, we introduce \textbf{MVTamperBench}, a benchmark that systematically evaluates MLLM robustness against five prevalent tampering techniques: rotation, masking, substitution, repetition, and dropping; based on real-world visual tampering scenarios such as surveillance interference, social media content edits, and misinformation injection. MVTamperBench comprises ~3.4K original videos, expanded into over ~17K tampered clips covering 19 distinct video manipulation tasks. This benchmark challenges models to detect manipulations in spatial and temporal coherence. We evaluate 45 recent MLLMs from 15+ model families. We reveal substantial variability in resilience across tampering types and show that larger parameter counts do not necessarily guarantee robustness. MVTamperBench sets a new benchmark for developing tamper-resilient MLLM in safety-critical applications, including detecting clickbait, preventing harmful content distribution, and enforcing policies on media platforms. We release all code, data, and benchmark to foster open research in trustworthy video understanding. Code: https://amitbcp.github.io/MVTamperBench/ Data: https://huggingface.co/datasets/Srikant86/MVTamperBench △ Less

Submitted 11 June, 2025; v1 submitted 27 December, 2024; originally announced December 2024.

MSC Class: 68T37; 68T05; 68Q32; 68T45; 94A08; 68T40; 68Q85 ACM Class: I.2.10; I.2.7; I.5.4; I.4.9; I.4.8; H.5.1

arXiv:2412.17759 [pdf, other]

Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy

Authors: Priyaranjan Pattnayak, Hitesh Laxmichand Patel, Bhargava Kumar, Amit Agarwal, Ishan Banerjee, Srikant Panda, Tejaswini Kumar

Abstract: Multimodal learning, a rapidly evolving field in artificial intelligence, seeks to construct more versatile and robust systems by integrating and analyzing diverse types of data, including text, images, audio, and video. Inspired by the human ability to assimilate information through many senses, this method enables applications such as text-to-video conversion, visual question answering, and imag… ▽ More Multimodal learning, a rapidly evolving field in artificial intelligence, seeks to construct more versatile and robust systems by integrating and analyzing diverse types of data, including text, images, audio, and video. Inspired by the human ability to assimilate information through many senses, this method enables applications such as text-to-video conversion, visual question answering, and image captioning. Recent developments in datasets that support multimodal language models (MLLMs) are highlighted in this overview. Large-scale multimodal datasets are essential because they allow for thorough testing and training of these models. With an emphasis on their contributions to the discipline, the study examines a variety of datasets, including those for training, domain-specific tasks, and real-world applications. It also emphasizes how crucial benchmark datasets are for assessing models' performance in a range of scenarios, scalability, and applicability. Since multimodal learning is always changing, overcoming these obstacles will help AI research and applications reach new heights. △ Less

Submitted 23 December, 2024; originally announced December 2024.

arXiv:2412.17709 [pdf]

doi 10.1088/1555-6611/adbad5

Wakefield generation and electron acceleration via propagation of radially polarized laser pulses in homogeneous plasma

Authors: Shivani Aggarwal, Saumya Singh, Dinkar Mishra, Bhupesh Kumar, Pallavi Jha

Abstract: The paper presents a study of wakefield generation and electron injection via propagation of radially polarized laser pulses in homogeneous pre-ionized plasma. The analytical study is based on Lorentz force and continuity equations. Perturbation technique and quasi-static approximation are used for evaluating the generated longitudinal wakefields. Trapping and acceleration of electrons are examine… ▽ More The paper presents a study of wakefield generation and electron injection via propagation of radially polarized laser pulses in homogeneous pre-ionized plasma. The analytical study is based on Lorentz force and continuity equations. Perturbation technique and quasi-static approximation are used for evaluating the generated longitudinal wakefields. Trapping and acceleration of electrons are examined by injecting a test electron in the generated wakefields. The results are compared with those obtained via linearly polarized laser pulses. The validation of analytical results is performed using the Fourier-Bessel particle-in-cell (FBPIC) simulation code. It is seen that there is a significant enhancement in amplitude of the longitudinal wakefield generated and electron energy gain via radially polarized laser pulses as compared to linearly polarized laser pulse case. △ Less

Submitted 23 December, 2024; originally announced December 2024.

arXiv:2412.15340 [pdf]

Second harmonic generation by radially polarized laser beam propagating in homogeneous plasma

Authors: Shivani Aggarwal, Saumya Singh, Dinkar Mishra, Bhupesh Kumar, Pallavi Jha

Abstract: An analytical study of second harmonic generation due to the interaction of radially polarized laser beam with homogeneous and unmagnetized plasma is presented. The analytical study is based on Lorentz force, continuity and electromagnetic wave equations. Amplitude of second harmonic radiation is derived with the help of current density and dispersion relation obtained at twice the fundamental fre… ▽ More An analytical study of second harmonic generation due to the interaction of radially polarized laser beam with homogeneous and unmagnetized plasma is presented. The analytical study is based on Lorentz force, continuity and electromagnetic wave equations. Amplitude of second harmonic radiation is derived with the help of current density and dispersion relation obtained at twice the fundamental frequency of the laser field. Perturbation technique is used for evaluation of current density. The variation of amplitude and efficiency of radially polarized second harmonic radiation with propagation distance is graphically depicted. It is seen that radially polarized laser propagating in plasma gives efficient second harmonic radiation generation. △ Less

Submitted 19 December, 2024; originally announced December 2024.

arXiv:2412.11641 [pdf]

Comparison of three reconstruction algorithms for low-dose phase-contrast computed tomography of the breast with synchrotron radiation

Authors: Sandro Donato, Simone Caputo, Luca Brombal, Bruno Golosio, Renata Longo, Giuliana Tromba, Raffaele G. Agostino, Gianluigi Greco, Benedicta D. Arhatari, Chris Hall, Anton Maksimenko, Daniel Hausermann, Darren Lockie, Jane Fox, Beena Kumar, Sarah Lewis, Patrick C. Brennan, Harry M. Quiney, Seyedamir Tavakoli Taba, Timur E. Gureyev

Abstract: Three different computed tomography (CT) reconstruction algorithms: Filtered Back Projection (FBP), Unified Tomographic Reconstruction (UTR) and customized Simultaneous Algebraic Reconstruction Technique (cSART), have been systematically compared and evaluated using experimental data from CT scans of ten fresh mastectomy samples collected at the Imaging and Medical beamline of the Australian Synch… ▽ More Three different computed tomography (CT) reconstruction algorithms: Filtered Back Projection (FBP), Unified Tomographic Reconstruction (UTR) and customized Simultaneous Algebraic Reconstruction Technique (cSART), have been systematically compared and evaluated using experimental data from CT scans of ten fresh mastectomy samples collected at the Imaging and Medical beamline of the Australian Synchrotron. All the scans were collected at the mean glandular dose of 2 mGy, using monochromatic X-rays with 32 keV energy, flat-panel detectors with 0.1 mm pixels and 6 meter distance between the rotation stage and the detector. Paganin's phase retrieval method was used in conjunction with all three CT reconstruction algorithms. The reconstructed images were compared in terms of the objective image quality characteristics, including spatial resolution, contrast, signal-to-noise, and contrast-to-noise ratios. The images were also evaluated by seven experienced medical imaging specialists, rating perceptible contrast, sharpness of tissue interfaces, image noise, calcification visibility and overall image quality. Of the three compared algorithms, cSART was clearly superior to UTR and FBP in terms of most measured objective image quality characteristics. At the same time, the results of the subjective quality evaluation consistently favoured the images reconstructed by FBP, followed by UTR, with cSART receiving lower scores on average. We argue that this apparent disagreement between the objective and subjective assessments of image quality can be explained by the importance assigned to image contrast in the subjective assessment, while the signal-to-noise ratio seemed to receive relatively low weighting. This study was conducted in preparation for phase-contrast breast CT imaging of live patients at Australian Synchrotron (Melbourne, Australia). △ Less

Submitted 16 December, 2024; originally announced December 2024.

Comments: 21 pages, 5 figures, 2 tables

arXiv:2412.03590 [pdf]

doi 10.17577/IJERTV13IS100138

Enhancing Document AI Data Generation Through Graph-Based Synthetic Layouts

Authors: Amit Agarwal, Hitesh Patel, Priyaranjan Pattnayak, Srikant Panda, Bhargava Kumar, Tejaswini Kumar

Abstract: The development of robust Document AI models has been constrained by limited access to high-quality, labeled datasets, primarily due to data privacy concerns, scarcity, and the high cost of manual annotation. Traditional methods of synthetic data generation, such as text and image augmentation, have proven effective for increasing data diversity but often fail to capture the complex layout structu… ▽ More The development of robust Document AI models has been constrained by limited access to high-quality, labeled datasets, primarily due to data privacy concerns, scarcity, and the high cost of manual annotation. Traditional methods of synthetic data generation, such as text and image augmentation, have proven effective for increasing data diversity but often fail to capture the complex layout structures present in real world documents. This paper proposes a novel approach to synthetic document layout generation using Graph Neural Networks (GNNs). By representing document elements (e.g., text blocks, images, tables) as nodes in a graph and their spatial relationships as edges, GNNs are trained to generate realistic and diverse document layouts. This method leverages graph-based learning to ensure structural coherence and semantic consistency, addressing the limitations of traditional augmentation techniques. The proposed framework is evaluated on tasks such as document classification, named entity recognition (NER), and information extraction, demonstrating significant performance improvements. Furthermore, we address the computational challenges of GNN based synthetic data generation and propose solutions to mitigate domain adaptation issues between synthetic and real-world datasets. Our experimental results show that graph-augmented document layouts outperform existing augmentation techniques, offering a scalable and flexible solution for training Document AI models. △ Less

Submitted 27 November, 2024; originally announced December 2024.

Comments: Published in IJERT, Volume 13, Issue 10 (October 2024)

ACM Class: I.2.6; I.2.7; I.5.4; H.3.3; H.2.8; G.2.2

Journal ref: IJERT, Volume 13, Issue 10, October 2024

arXiv:2411.14962 [pdf, other]

LLM for Barcodes: Generating Diverse Synthetic Data for Identity Documents

Authors: Hitesh Laxmichand Patel, Amit Agarwal, Bhargava Kumar, Karan Gupta, Priyaranjan Pattnayak

Abstract: Accurate barcode detection and decoding in Identity documents is crucial for applications like security, healthcare, and education, where reliable data extraction and verification are essential. However, building robust detection models is challenging due to the lack of diverse, realistic datasets an issue often tied to privacy concerns and the wide variety of document formats. Traditional tools l… ▽ More Accurate barcode detection and decoding in Identity documents is crucial for applications like security, healthcare, and education, where reliable data extraction and verification are essential. However, building robust detection models is challenging due to the lack of diverse, realistic datasets an issue often tied to privacy concerns and the wide variety of document formats. Traditional tools like Faker rely on predefined templates, making them less effective for capturing the complexity of real-world identity documents. In this paper, we introduce a new approach to synthetic data generation that uses LLMs to create contextually rich and realistic data without relying on predefined field. Using the vast knowledge LLMs have about different documents and content, our method creates data that reflects the variety found in real identity documents. This data is then encoded into barcode and overlayed on templates for documents such as Driver's licenses, Insurance cards, Student IDs. Our approach simplifies the process of dataset creation, eliminating the need for extensive domain knowledge or predefined fields. Compared to traditional methods like Faker, data generated by LLM demonstrates greater diversity and contextual relevance, leading to improved performance in barcode detection models. This scalable, privacy-first solution is a big step forward in advancing machine learning for automated document processing and identity verification. △ Less

Submitted 23 December, 2024; v1 submitted 22 November, 2024; originally announced November 2024.

Comments: 5 pages, 1 figures

arXiv:2411.10716 [pdf, other]

FlowScope: Enhancing Decision Making by Time Series Forecasting based on Prediction Optimization using HybridFlow Forecast Framework

Authors: Nitin Sagar Boyeena, Begari Susheel Kumar

Abstract: Time series forecasting is crucial in several sectors, such as meteorology, retail, healthcare, and finance. Accurately forecasting future trends and patterns is crucial for strategic planning and making well-informed decisions. In this case, it is crucial to include many forecasting methodologies. The strengths of Auto-regressive Integrated Moving Average (ARIMA) for linear time series, Seasonal… ▽ More Time series forecasting is crucial in several sectors, such as meteorology, retail, healthcare, and finance. Accurately forecasting future trends and patterns is crucial for strategic planning and making well-informed decisions. In this case, it is crucial to include many forecasting methodologies. The strengths of Auto-regressive Integrated Moving Average (ARIMA) for linear time series, Seasonal ARIMA models (SARIMA) for seasonal time series, Exponential Smoothing State Space Models (ETS) for handling errors and trends, and Long Short-Term Memory (LSTM) Neural Network model for complex pattern recognition have been combined to create a comprehensive framework called FlowScope. SARIMA excels in capturing seasonal variations, whereas ARIMA ensures effective handling of linear time series. ETS models excel in capturing trends and correcting errors, whereas LSTM networks excel in reflecting intricate temporal connections. By combining these methods from both machine learning and deep learning, we propose a deep-hybrid learning approach FlowScope which offers a versatile and robust platform for predicting time series data. This empowers enterprises to make informed decisions and optimize long-term strategies for maximum performance. Keywords: Time Series Forecasting, HybridFlow Forecast Framework, Deep-Hybrid Learning, Informed Decisions. △ Less

Submitted 16 November, 2024; originally announced November 2024.

Comments: 12 pages and 6 figures

MSC Class: 62M10 (Primary); 68T07 (Secondary) ACM Class: I.2.6; G.3; I.5

arXiv:2411.06947 [pdf, other]

Focused ion beam polishing based optimization of high-Q silica microdisk resonators

Authors: Lekshmi Eswaramoorthy, Parul Sharma, Brijesh Kumar, Abhay Anand V S, Anuj Kumar Singh, Kishor Kumar Mandal, Sudha Mokkapati, Anshuman Kumar

Abstract: Whispering gallery mode (WGM) microdisk resonators are promising optical devices that confine light efficiently and enable enhanced nonlinear optical effects. This work presents a novel approach to reduce sidewall roughness in SiO\textsubscript{2} microdisk resonators using focused ion beam (FIB) polishing. The microdisks, with varying diameter ranging from 5 to 20 $μ$m are fabricated using a mult… ▽ More Whispering gallery mode (WGM) microdisk resonators are promising optical devices that confine light efficiently and enable enhanced nonlinear optical effects. This work presents a novel approach to reduce sidewall roughness in SiO\textsubscript{2} microdisk resonators using focused ion beam (FIB) polishing. The microdisks, with varying diameter ranging from 5 to 20 $μ$m are fabricated using a multi-step fabrication scheme. However, the etching process introduces significant sidewall roughness, which increases with decreasing microdisk radius, degrading the resonators' quality. To address this issue, a FIB system is employed to polish the sidewalls, using optimized process parameters to minimize Ga ion implantation. White light interferometry measurements reveal a significant reduction in surface roughness from 7 nm to 20 nm for a 5 $μ$m diameter microdisk, leading to a substantial enhancement in the scattering quality factor (Qss) from $3\times 10^2$ to $2\times 10^6$. These findings demonstrate the effectiveness of FIB polishing in improving the quality of microdisk resonators and open up new possibilities for the fabrication of advanced photonic devices. △ Less

Submitted 11 November, 2024; originally announced November 2024.

arXiv:2411.06189 [pdf]

doi 10.1063/5.0247983

Twisted terahertz radiation generation using Laguerre-Gaussian laser pulse propagating in axially magnetized plasma

Authors: Dinkar Mishra, Saumya Singh, Bhupesh Kumar, Pallavi Jha

Abstract: We present analytical and simulation study of twisted terahertz (THz) radiation generation via propagation of a circularly polarized Laguerre Gaussian (LG) laser pulse in homogeneous plasma embedded in an axial magnetic field. Analytical formulation is based on perturbation technique and quasistatic approximation. Longitudinal and transverse wakefields generated via laser plasma interactions are e… ▽ More We present analytical and simulation study of twisted terahertz (THz) radiation generation via propagation of a circularly polarized Laguerre Gaussian (LG) laser pulse in homogeneous plasma embedded in an axial magnetic field. Analytical formulation is based on perturbation technique and quasistatic approximation. Longitudinal and transverse wakefields generated via laser plasma interactions are evaluated using Lorentz force and Maxwells equations in the mildly nonlinear regime. It is observed that two linearly polarized twisted terahertz (THz) radiation beams are generated in mutually perpendicular planes. Superposition of the two beams result in a single linearly polarized twisted THz radiation beam with modified amplitude and polarization direction. Three dimensional (3D) particle in cell (PIC) simulations are performed for this configuration using FBPIC code. Graphical comparison of amplitude of the resultant THz beam obtained via analytical and simulation studies is presented. △ Less

Submitted 9 November, 2024; originally announced November 2024.

arXiv:2410.16689 [pdf, other]

Astrophysical constraints on neutron star $f$-modes with a nonparametric equation of state representation

Authors: Sailesh Ranjan Mohanty, Utkarsh Mali, H. C. Das, Bharat Kumar, Philippe Landry

Abstract: We constrain the fundamental-mode ($f$-mode) oscillation frequencies of nonrotating neutron stars using a phenomenological Gaussian process model for the unknown dense-matter equation of state conditioned on a suite of gravitational-wave, radio and X-ray observations. We infer the quadrupolar $f$-mode frequency preferred by the astronomical data as a function of neutron star mass, with error estim… ▽ More We constrain the fundamental-mode ($f$-mode) oscillation frequencies of nonrotating neutron stars using a phenomenological Gaussian process model for the unknown dense-matter equation of state conditioned on a suite of gravitational-wave, radio and X-ray observations. We infer the quadrupolar $f$-mode frequency preferred by the astronomical data as a function of neutron star mass, with error estimates that quantify the impact of equation of state uncertainty, and compare it to the contact frequency for inspiralling neutron-star binaries, finding that resonance with the orbital frequency can be achieved for the coalescences with the most unequal mass ratio. For an optimally configured binary neutron star merger, we estimate the gravitational waveform's tidal phasing due to $f$-mode dynamical tides as $7^{+2}_{-3}$ rad at merger. We assess prospects for distinguishing $f$-mode dynamical tides with current and future-generation gravitational-wave observatories. △ Less

Submitted 22 October, 2024; originally announced October 2024.

Comments: 12 pages, 7 figures

arXiv:2410.14041 [pdf, other]

From Barriers to Tactics: A Behavioral Science-Informed Agentic Workflow for Personalized Nutrition Coaching

Authors: Eric Yang, Tomas Garcia, Hannah Williams, Bhawesh Kumar, Martin Ramé, Eileen Rivera, Yiran Ma, Jonathan Amar, Caricia Catalani, Yugang Jia

Abstract: Effective management of cardiometabolic conditions requires sustained positive nutrition habits, often hindered by complex and individualized barriers. Direct human management is simply not scalable, while previous attempts aimed at automating nutrition coaching lack the personalization needed to address these diverse challenges. This paper introduces a novel LLM-powered agentic workflow designed… ▽ More Effective management of cardiometabolic conditions requires sustained positive nutrition habits, often hindered by complex and individualized barriers. Direct human management is simply not scalable, while previous attempts aimed at automating nutrition coaching lack the personalization needed to address these diverse challenges. This paper introduces a novel LLM-powered agentic workflow designed to provide personalized nutrition coaching by directly targeting and mitigating patient-specific barriers. Grounded in behavioral science principles, the workflow leverages a comprehensive mapping of nutrition-related barriers to corresponding evidence-based strategies. A specialized LLM agent intentionally probes for and identifies the root cause of a patient's dietary struggles. Subsequently, a separate LLM agent delivers tailored tactics designed to overcome those specific barriers with patient context. We designed and validated our approach through a user study with individuals with cardiometabolic conditions, demonstrating the system's ability to accurately identify barriers and provide personalized guidance. Furthermore, we conducted a large-scale simulation study, grounding on real patient vignettes and expert-validated metrics, to evaluate the system's performance across a wide range of scenarios. Our findings demonstrate the potential of this LLM-powered agentic workflow to improve nutrition coaching by providing personalized, scalable, and behaviorally-informed interventions. △ Less

Submitted 17 October, 2024; originally announced October 2024.

Comments: 22 pages

arXiv:2410.13890 [pdf, other]

Machine Learning-Based Estimation of Superdroplet Growth Rates Using DNS Data

Authors: Divyaprakash, Nikita N. Makwana, Amitabh Bhattacharya, Bipin Kumar

Abstract: Droplet growth and size spectra play a crucial role in the microphysics of atmospheric clouds. However, it is challenging to represent droplet growth rate accurately in cloud-resolving models such as Large Eddy Simulations (LESs). The assumption of "well-mixed" condition within each grid cell, often made by traditional LES solvers, typically falls short near the edges of clouds, where sharp gradie… ▽ More Droplet growth and size spectra play a crucial role in the microphysics of atmospheric clouds. However, it is challenging to represent droplet growth rate accurately in cloud-resolving models such as Large Eddy Simulations (LESs). The assumption of "well-mixed" condition within each grid cell, often made by traditional LES solvers, typically falls short near the edges of clouds, where sharp gradients in water vapor supersaturation occur. This under-resolution of supersaturation gradients can lead to significant errors in prediction of droplet growth rate, which in turn affects the prediction of buoyancy at cloud edges, as well as forecast of precipitation. In "superdroplet" based LES model, a Lagrangian coarse-graining approach groups multiple droplets into superdroplets, each encompassing a specific number and size of actual droplets. The superdroplets are advected by the underlying LES velocity field, and the growth rate of these superdroplets is based on the filtered supersaturation field represented by the LES. To overcome the limitations of the "well-mixed" assumption, we propose a parameterization for superdroplet growth using high-fidelity Direct Numerical Simulation (DNS) data. We introduce a novel clustering algorithm to map droplets in DNS fields to superdroplets. The effective supersaturation at each superdroplet location is computed by averaging the unfiltered supersaturation of the associated droplets, which may differ from the value of filtered supersaturation at the superdroplet location. We then develop a machine learning-based parameterization to relate the effective growth rate of superdroplets to other filtered DNS flow variables. Preliminary results show a promising $R^2$ value of nearly 0.9 between the predicted and true effective supersaturation values for the superdroplets, for a range of superdroplet multiplicities. △ Less

Submitted 12 October, 2024; originally announced October 2024.

Comments: 17 pages, 10 figures

arXiv:2410.03789 [pdf, other]

Benchmarking Turbulence Models to Represent Cloud-Edge Mixing

Authors: Johannes Kainz, Nikitabahen N. Makwana, Bipin Kumar, S. Ravichandran, Johan Fries, Gaetano Sardina, Bernhard Mehlig, Fabian Hoffmann

Abstract: Considering turbulence is crucial to understanding clouds. However, covering all scales involved in the turbulent mixing of clouds with their environment is computationally challenging, urging the development of simpler models to represent some of the processes involved. By using full direct numerical simulations as a reference, this study compares several statistical approaches for representing s… ▽ More Considering turbulence is crucial to understanding clouds. However, covering all scales involved in the turbulent mixing of clouds with their environment is computationally challenging, urging the development of simpler models to represent some of the processes involved. By using full direct numerical simulations as a reference, this study compares several statistical approaches for representing small-scale turbulent mixing. All models use a comparable Lagrangian representation of cloud microphysics, and simulate the same cases of cloud-edge mixing, covering different ambient humidities and turbulence intensities. It is demonstrated that all statistical models represent the evolution of thermodynamics successfully, but not all models capture the changes in cloud microphysics (cloud droplet number concentration, droplet mean radius, and spectral width). Implications of these results for using the presented models as subgrid-scale schemes are discussed. △ Less

Submitted 3 October, 2024; originally announced October 2024.

arXiv:2410.01888 [pdf, other]

Conformal Prediction Sets Can Cause Disparate Impact

Authors: Jesse C. Cresswell, Bhargava Kumar, Yi Sui, Mouloud Belbahri

Abstract: Conformal prediction is a statistically rigorous method for quantifying uncertainty in models by having them output sets of predictions, with larger sets indicating more uncertainty. However, prediction sets are not inherently actionable; many applications require a single output to act on, not several. To overcome this limitation, prediction sets can be provided to a human who then makes an infor… ▽ More Conformal prediction is a statistically rigorous method for quantifying uncertainty in models by having them output sets of predictions, with larger sets indicating more uncertainty. However, prediction sets are not inherently actionable; many applications require a single output to act on, not several. To overcome this limitation, prediction sets can be provided to a human who then makes an informed decision. In any such system it is crucial to ensure the fairness of outcomes across protected groups, and researchers have proposed that Equalized Coverage be used as the standard for fairness. By conducting experiments with human participants, we demonstrate that providing prediction sets can lead to disparate impact in decisions. Disquietingly, we find that providing sets that satisfy Equalized Coverage actually increases disparate impact compared to marginal coverage. Instead of equalizing coverage, we propose to equalize set sizes across groups which empirically leads to lower disparate impact. △ Less

Submitted 13 February, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

Comments: ICLR 2025 Spotlight, https://openreview.net/forum?id=fZK6AQXlUU. Code and experimental data are available at https://github.com/layer6ai-labs/conformal-prediction-fairness

arXiv:2409.14716 [pdf, other]

Simultaneous Multiband Photometry of the Early Optical Afterglow of GRB 240825A with Mephisto

Authors: Yehao Cheng, Yu Pan, Yuan-Pei Yang, Jinghua Zhang, Guowang Du, Yuan Fang, Brajesh Kumar, Helong Guo, Xinzhong Er, Xinlei Chen, Chenxu Liu, Tao Wang, Zhenfei Qin, Yicheng Jin, Xingzhu Zou, Xuhui Han, Pinpin Zhang, Liping Xin, Chao Wu, Jianhui Lian, Xiangkun Liu, Xiaowei Liu

Abstract: Gamma-ray bursts (GRBs) are the most luminous transients in the universe. The interaction of the relativistic jet with the circumburst medium produces an afterglow and generates multiwavelength emission. In this work, we present simultaneous multiband photometry of GRB~240825A with the Multi-channel Photometric Survey Telescope (Mephisto) and analyze its temporal and spectral properties. The measu… ▽ More Gamma-ray bursts (GRBs) are the most luminous transients in the universe. The interaction of the relativistic jet with the circumburst medium produces an afterglow and generates multiwavelength emission. In this work, we present simultaneous multiband photometry of GRB~240825A with the Multi-channel Photometric Survey Telescope (Mephisto) and analyze its temporal and spectral properties. The measurement began 128 seconds after the GRB trigger and continued until the fourth day when the afterglow essentially diminished and the measured brightness was close to that of the host galaxy. Based on the multiband light curves in the $uvgriz$ bands, we find that the optical flux density satisfies $F_{ν,{\rm obs}}\propto t^{-1.34}ν^{-2.48}$ with a spectral index of $2.48$ much larger than those of most other GRBs. To reconcile the measured much softer spectral energy distribution (SED) with that predicted by the standard afterglow model, an extra host-galaxy extinction of $E_{B-V}\sim(0.37-0.57)$ mag is required. We interpreted this excess as arising from a dense circumburst medium. We further find that the SED of the optical afterglow hardened as the afterglow decayed and the color excess $E_{B-V}$ decreased $\sim0.26$ mag from 100 seconds to 3000 seconds after the GRB trigger. Finally, we analyze the properties of the host galaxy of GRB~240825A based on data from the SDSS, PanSTARRS and HSC-SSP surveys. For a host redshift of $z=0.659$, the stellar mass and star formation rate of the host galaxy are estimated to be $\log(M_*/M_\odot)=10.0^{+0.3}_{-0.3}$ and $\log({\rm SFR}/M_{\odot}{\rm yr}^{-1})= 0.6^{+0.8}_{-3.3}$, respectively, pointing to a gas-rich, star-forming, medium-size galaxy. △ Less

Submitted 11 December, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

Comments: 16 pages, 5 figures, 1 table. Accepted for publication in ApJ

arXiv:2409.02131 [pdf, other]

Impact of Dark Matter and Rotation on Neutron Star Properties

Authors: Pinku Routaray, Abirbhav Chakrawarty, Bharat Kumar

Abstract: In this study, we examine the combined effects of dark matter (DM) and rotation on the properties of neutron stars (NSs). We employ a self-interacting dark matter model, motivated by the neutron decay anomaly, within the relativistic mean-field formalism to explore its impact on both static and rotating NSs. The Hartle-Thorne approach is utilized to model rotating NSs, treating the DM interaction… ▽ More In this study, we examine the combined effects of dark matter (DM) and rotation on the properties of neutron stars (NSs). We employ a self-interacting dark matter model, motivated by the neutron decay anomaly, within the relativistic mean-field formalism to explore its impact on both static and rotating NSs. The Hartle-Thorne approach is utilized to model rotating NSs, treating the DM interaction strength ($G$) as a free parameter and considering angular velocity ($Ω$) for rotation. We investigate how DM influences the mass-shedding limit, determined using the Keplerian frequency, and analyze the variations in angular velocity at different DM interaction strengths to assess their effects on NS mass, radius, central energy density, and eccentricity. Our results indicate that while rotation increases mass and radius due to centrifugal forces, DM softens the EOS, reducing these properties, particularly at higher DM fractions. DM also reduces rotational deformation, leading to lower eccentricity compared to DM-free NSs at the same angular velocity. Additionally, we calculate the relative deviations in maximum rotational mass and canonical equatorial radius from their baseline values, finding that high DM fractions combined with low angular velocities result in significant reductions, while low DM fractions with high rotational speeds lead to positive deviations, indicating greater deformation. △ Less

Submitted 2 September, 2024; originally announced September 2024.

Comments: Comments are welcome

arXiv:2408.15216 [pdf, other]

doi 10.1103/PhysRevB.111.075112

Theoretical investigation of quantum oscillations of specific heat in Kondo insulators

Authors: Arnav Pushkar, Brijesh Kumar

Abstract: The electronic specific heat of Kondo insulators in magnetic field is studied for the half-filled Kondo lattice model on simple cubic lattice using a low-temperature theory in Kumar representation. The calculated specific heat is found to show quantum oscillations, which appear soon after the inversion transition and become prominent with decreasing Kondo coupling. Interestingly, it is noted that… ▽ More The electronic specific heat of Kondo insulators in magnetic field is studied for the half-filled Kondo lattice model on simple cubic lattice using a low-temperature theory in Kumar representation. The calculated specific heat is found to show quantum oscillations, which appear soon after the inversion transition and become prominent with decreasing Kondo coupling. Interestingly, it is noted that the field derivative of specific heat closely resembles the magnetic quantum oscillations, and exhibits more pronounced oscillations at finite temperatures than the magnetization itself. An empirical Lifshitz-Kosevich fit with two frequencies given by the theory describes these quantum oscillations reasonably well, where the frequencies correspond to the extremal areas on the surface of charge gap, a remnant of the Fermi surface in the insulating case. △ Less

Submitted 27 August, 2024; originally announced August 2024.

Comments: 12 pages, 8 figures, 1 table

Journal ref: Phys. Rev. B 111, 075112 (2025)

arXiv:2408.12969 [pdf, other]

Peering into the Heart of the Giant Molecular Cloud G148.24+00.41: A Deep Near-infrared View of the Newly Hatched Cluster FSR 655

Authors: Vineet Rawat, M. R. Samal, D. K. Ojha, Brajesh Kumar, Saurabh Sharma, J. Jose, Ram Sagar, R. K. Yadav

Abstract: We present a detailed near-infrared study of an embedded cluster located in the hub of the giant molecular cloud G148.24+00.41 of mass $\sim$10$^5$ $M_\odot$, with the TANSPEC instrument mounted on the 3.6 m Devasthal Optical Telescope. The hub is located near the geometric center of the cloud and represents its most massive clump. We studied the central 2 pc $\times$ 2 pc area of the hub with 5… ▽ More We present a detailed near-infrared study of an embedded cluster located in the hub of the giant molecular cloud G148.24+00.41 of mass $\sim$10$^5$ $M_\odot$, with the TANSPEC instrument mounted on the 3.6 m Devasthal Optical Telescope. The hub is located near the geometric center of the cloud and represents its most massive clump. We studied the central 2 pc $\times$ 2 pc area of the hub with 5$σ$ limiting magnitudes of 20.5, 20.1, and 18.6 mag in the $J$, $H$, and $K_s$ bands, respectively. Using the $K_s$-band luminosity function and comparing it with the synthetic clusters, we obtained the age of the cluster as $\sim$0.5 Myr, which was found to corroborate well with the visual extinction versus the age of nearby embedded clusters. We find that the present mass of the cluster is around $\sim$180 $M_\odot$, and the cluster is currently forming stars at a rate of $\sim$330 $M_\odot$ $\rm{Myr}^{-1}$, with an efficiency of $\sim$20%. The cluster is connected to an extended gas reservoir through a filamentary network; thus, we hypothesize that the cluster has the potential to become a richer cluster in a few Myr of time. △ Less

Submitted 23 August, 2024; originally announced August 2024.

Comments: 17 pages and 13 figures

arXiv:2407.18423 [pdf, other]

HDL-GPT: High-Quality HDL is All You Need

Authors: Bhuvnesh Kumar, Saurav Nanda, Ganapathy Parthasarathy, Pawan Patil, Austin Tsai, Parivesh Choudhary

Abstract: This paper presents Hardware Description Language Generative Pre-trained Transformers (HDL-GPT), a novel approach that leverages the vast repository of open-source High Definition Language (HDL) codes to train superior quality large code models. The core premise of this paper is the hypothesis that high-quality HDL is all you need to create models with exceptional performance and broad zero-shot g… ▽ More This paper presents Hardware Description Language Generative Pre-trained Transformers (HDL-GPT), a novel approach that leverages the vast repository of open-source High Definition Language (HDL) codes to train superior quality large code models. The core premise of this paper is the hypothesis that high-quality HDL is all you need to create models with exceptional performance and broad zero-shot generalization abilities. The paper elucidates the methods employed for the curation and augmentation of large corpora from open-source HDL code, transforming highly variable quality data into high-quality data through careful prompting and context maintenance. We demonstrate that the careful selection, filtering, and augmentation of data across HDLs can yield powerful models that surpass current state-of-the-art models. We also explore the impact of different fine-tuning methods on the quality of results. We describe experimental results across a range of fine-tuned SOTA LLMs, substantiating our claims. We demonstrate improvements of 50% to 200% over SOTA HDL models on current benchmarks in tasks ranging from HDL circuit explanations, code generation, formal and simulation testbench creation, triaging bugs, and fixing them. HDL-GPT opens new avenues for the development of advanced model training techniques for circuit design tasks. △ Less

Submitted 25 July, 2024; originally announced July 2024.

Comments: DAC 2024 Invited Paper

arXiv:2407.18044 [pdf, other]

The Geometry of Queries: Query-Based Innovations in Retrieval-Augmented Generation

Authors: Eric Yang, Jonathan Amar, Jong Ha Lee, Bhawesh Kumar, Yugang Jia

Abstract: Digital health chatbots powered by Large Language Models (LLMs) have the potential to significantly improve personal health management for chronic conditions by providing accessible and on-demand health coaching and question-answering. However, these chatbots risk providing unverified and inaccurate information because LLMs generate responses based on patterns learned from diverse internet data. R… ▽ More Digital health chatbots powered by Large Language Models (LLMs) have the potential to significantly improve personal health management for chronic conditions by providing accessible and on-demand health coaching and question-answering. However, these chatbots risk providing unverified and inaccurate information because LLMs generate responses based on patterns learned from diverse internet data. Retrieval Augmented Generation (RAG) can help mitigate hallucinations and inaccuracies in LLM responses by grounding it on reliable content. However, efficiently and accurately retrieving most relevant set of content for real-time user questions remains a challenge. In this work, we introduce Query-Based Retrieval Augmented Generation (QB-RAG), a novel approach that pre-computes a database of potential queries from a content base using LLMs. For an incoming patient question, QB-RAG efficiently matches it against this pre-generated query database using vector search, improving alignment between user questions and the content. We establish a theoretical foundation for QB-RAG and provide a comparative analysis of existing retrieval enhancement techniques for RAG systems. Finally, our empirical evaluation demonstrates that QB-RAG significantly improves the accuracy of healthcare question answering, paving the way for robust and trustworthy LLM applications in digital health. △ Less

Submitted 25 July, 2024; originally announced July 2024.

Comments: 22 pages

arXiv:2407.13190 [pdf, ps, other]

A Note on Generalized Locally Toeplitz Operators

Authors: V. B. Kiran Kumar, N. S. Sarathkumar

Abstract: Generalized Locally Toeplitz (GLT) matrix sequences arise from large linear systems that approximate Partial Differential Equations (PDEs), Fractional Differential Equations (FDEs), and Integro-Differential Equations (IDEs). GLT sequences of matrices have been developed to study the spectral/singular value behaviour of the numerical approximations to various PDEs, Fades and IDEs. These approximati… ▽ More Generalized Locally Toeplitz (GLT) matrix sequences arise from large linear systems that approximate Partial Differential Equations (PDEs), Fractional Differential Equations (FDEs), and Integro-Differential Equations (IDEs). GLT sequences of matrices have been developed to study the spectral/singular value behaviour of the numerical approximations to various PDEs, Fades and IDEs. These approximations can be achieved using any discretization method on appropriate grids through local techniques such as Finite Differences, Finite Elements, Finite Volumes, Isogeometric Analysis, and Discontinuous Galerkin methods. Spectral and singular value symbols are essential for analyzing the eigenvalue and singular value distributions of matrix sequences in the Weyl sense. In this article, we provide a comprehensive overview of the operator-theoretic aspect of GLT sequences. The theory of GLT sequences, along with findings on the asymptotic spectral distribution of perturbed matrix sequences, is a highly effective and successful method for calculating the spectral symbol f. Therefore, developing an automatic procedure to compute the spectral symbols of these matrix sequences would be advantageous, a task that Ahmed Ratnani, N S Sarathkumar, S. Serra-Capizzano have partially undertaken. As an application of the theory developed here, we propose an automatic procedure for computing the symbol of the underlying sequences of matrices, assuming they form a GLT sequence that meets mild conditions. △ Less

Submitted 24 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

arXiv:2406.19789 [pdf, other]

Investigation of Phase Shift and Travel Time of Acoustic Waves in the Lower Solar Atmosphere Using Multi-Height Velocities

Authors: Hirdesh Kumar, Brajesh Kumar, Shibu K. Mathew, A. Raja Bayanna, S. P. Rajaguru

Abstract: We report and discuss phase-shift and phase travel time of low-frequency (ν < 5.0 mHz) acoustic waves estimated within the photosphere and photosphere-chromosphere interface regions, utilizing multi-height velocities in the quiet Sun. The bisector method has been employed to estimate seven height velocities in the photosphere within the Fe I 6173 Å line scan, while nine height velocities are estim… ▽ More We report and discuss phase-shift and phase travel time of low-frequency (ν < 5.0 mHz) acoustic waves estimated within the photosphere and photosphere-chromosphere interface regions, utilizing multi-height velocities in the quiet Sun. The bisector method has been employed to estimate seven height velocities in the photosphere within the Fe I 6173 Å line scan, while nine height velocities are estimated from the chromospheric Ca II 8542 Å line scan observations obtained from the Narrow Band Imager instrument installed with the Multi-Application Solar Telescope operational at the Udaipur Solar Observatory, India. Utilizing fast Fourier transform at each pixel over the full field-of-view, phase shift and coherence have been estimated. The frequency and height-dependent phase shift integrated over the regions having an absolute line-of-sight magnetic field of less than 10 G indicates the non-evanescent nature of low-frequency acoustic waves within the photosphere and photosphere-chromosphere interface regions. Phase travel time estimated within the photosphere shows non-zero values, aligning with previous simulations and observations. Further, we report that the non-evanescent nature persists beyond the photosphere, encompassing the photospheric-chromospheric height range. We discuss possible factors contributing to the non-evanescent nature of low-frequency acoustic waves. Additionally, our observations reveal a downward propagation of high-frequency acoustic waves, indicating the refraction from higher layers in the solar atmosphere. This study contributes valuable insights into the understanding of the complex dynamics of acoustic waves within different lower solar atmospheric layers, shedding light on the non-evanescent nature and downward propagation of the acoustic waves. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: 12 pages, 10 figures, Accepted for Publication in the Astrophysical Journal

arXiv:2406.19132 [pdf, other]

Origin of extended Main Sequence Turn Off in open cluster NGC 2355

Authors: Jayanand Maurya, M. R. Samal, Louis Amard, Yu Zhang, Hubiao Niu, Sang Chul Kim, Y. C. Joshi, B. Kumar

Abstract: The presence of extended Main Sequence Turn-Off (eMSTO) in the open clusters has been attributed to various factors, such as spread in rotation rates, binary stars, and dust-like extinction from stellar excretion discs. We present a comprehensive analysis of the eMSTO in the open cluster NGC 2355. Using spectra from the Gaia-ESO archives, we find that the stars in the red part of the eMSTO have a… ▽ More The presence of extended Main Sequence Turn-Off (eMSTO) in the open clusters has been attributed to various factors, such as spread in rotation rates, binary stars, and dust-like extinction from stellar excretion discs. We present a comprehensive analysis of the eMSTO in the open cluster NGC 2355. Using spectra from the Gaia-ESO archives, we find that the stars in the red part of the eMSTO have a higher mean v sin i value of 135.3$\pm$4.6 km s$^{-1}$ compared to the stars in the blue part that have an average v sin i equal to 81.3$\pm$5.6 km s$^{-1}$. This suggests that the eMSTO in NGC 2355 is possibly caused by the spread in rotation rates of stars. We do not find any substantial evidence of the dust-like extinction from the eMSTO stars using ultraviolet data from the Swift survey. The estimated synchronization time for low mass ratio close binaries in the blue part of the eMSTO suggests that they would be mostly slow-rotating if present. However, the stars in the blue part of the eMSTO are preferentially located in the outer region of the cluster indicating that they may lack low mass ratio close binaries. The spread in rotation rates of eMSTO stars in NGC 2355 is most likely caused by the star-disc interaction mechanism. The stars in the lower main sequence beyond the eMSTO region of NGC 2355 are slow-rotating (mean v sin i = 26.5$\pm$1.3 km s$^{-1}$) possibly due to the magnetic braking of their rotations. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 11 pages, 12 figures, accepted for publication in MNRAS

Showing 1–50 of 558 results for author: Kumar, B