-
iVAMS 3.0: Hierarchical-Machine-Learning-Metamodel-Integrated Intelligent Verilog-AMS for Ultra-Fast, Accurate Mixed-Signal Design Optimization
Authors:
Saraju P. Mohanty,
Elias Kougianos
Abstract:
Analog/Mixed-Signal (AMS) circuits and systems continually present significant challenges to designers with the increase of design complexity and aggressive technology scaling. This is due to the large number of design factors and parameters that must be taken into account as well as the process variations which are prominent in nano-CMOS circuits. Design optimization techniques while presenting a…
▽ More
Analog/Mixed-Signal (AMS) circuits and systems continually present significant challenges to designers with the increase of design complexity and aggressive technology scaling. This is due to the large number of design factors and parameters that must be taken into account as well as the process variations which are prominent in nano-CMOS circuits. Design optimization techniques while presenting an accurate and fast design flow which can perform design optimization in reasonable time are still lacking. Even with techniques such as metamodeling that aid the design phase, there is still the need to improve them for accuracy and time cost. As a trade-off of the accuracy and speed, this paper presents a design flow for ultra-fast variability-aware optimization of nano-CMOS based physical design of analog circuits. It combines a Kriging bootstrapped Artificial Neural Network (ANN) metamodel with a Particle Swarm Optimization (PSO) based algorithm in the design optimization flow. The Kriging bootstrapped ANN metamodel provides a trade-off between analog-quality accuracy and scalability and can be effectively used for large and complex AMS circuits. The proposed technique uses Kriging to bootstrap target samples used for the ANN training. This introduces Kriging characteristics, which account for correlation effects between design parameters, to the ANN. The effectiveness of the design flow is demonstrated using a PLL as a case study with as many as 21 design parameters. It is observed that the bootstrapped Kriging metamodeling is 24X faster than simple ANN metamodeling. The layout optimization for such a complex circuit can be performed effectively in a short time using this approach. The optimization flow could achieve significant reductions in the mean and standard deviation of the PLL characteristics. Thus, the proposed research is a major contribution to design for cost.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
A White Paper on The Multi-Messenger Science Landscape in India
Authors:
Samsuzzaman Afroz,
Sanjib Kumar Agarwalla,
Dipankar Bhattacharya,
Soumya Bhattacharya,
Subir Bhattacharyya,
Varun Bhalerao,
Debanjan Bose,
Chinmay Borwanker,
Ishwara Chandra C. H.,
Aniruddha Chakraborty,
Indranil Chakraborty,
Sovan Chakraborty,
Debarati Chatterjee,
Varsha Chitnis,
Moon Moon Devi,
Sanjeev Dhurandhar,
Amol Dighe,
Bitan Ghosal,
Sourendu Gupta,
Arpan Hait,
Md Emanuel Hoque,
Pratik Majumdar,
Nilmani Mathur,
Harsh Mehta,
Subhendra Mohanty
, et al. (13 additional authors not shown)
Abstract:
The multi-messenger science using different observational windows to the Universe such as Gravitational Waves (GWs), Electromagnetic Waves (EMs), Cosmic Rays (CRs), and Neutrinos offer an opportunity to study from the scale of a neutron star to cosmological scales over a large cosmic time. At the smallest scales, we can explore the structure of the neutron star and the different energetics involve…
▽ More
The multi-messenger science using different observational windows to the Universe such as Gravitational Waves (GWs), Electromagnetic Waves (EMs), Cosmic Rays (CRs), and Neutrinos offer an opportunity to study from the scale of a neutron star to cosmological scales over a large cosmic time. At the smallest scales, we can explore the structure of the neutron star and the different energetics involved in the transition of a pre-merger neutron star to a post-merger neutron star. This will open up a window to study the properties of matter in extreme conditions and a guaranteed discovery space. On the other hand, at the largest cosmological scales, multi-messenger observations allow us to study the long-standing problems in physical cosmology related to the Hubble constant, dark matter, and dark energy by mapping the expansion history of the Universe using GW sources. Moreover, the multi-messenger studies of astrophysical systems such as white dwarfs, neutron stars, and black holes of different masses, all the way up to a high redshift Universe, will bring insightful understanding into the physical processes associated with them that are inaccessible otherwise. This white paper discusses the key cases in the domain of multi-messenger astronomy and the role of observatories in India which can explore uncharted territories and open discovery spaces in different branches of physics ranging from nuclear physics to astrophysics.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
hChain 4.0: A Secure and Scalable Permissioned Blockchain for EHR Management in Smart Healthcare
Authors:
Musharraf N. Alruwaill,
Saraju P. Mohanty,
Elias Kougianos
Abstract:
The growing utilization of Internet of Medical Things (IoMT) devices, including smartwatches and wearable medical devices, has facilitated real-time health monitoring and data analysis to enhance healthcare outcomes. These gadgets necessitate improved security measures to safeguard sensitive health data while tackling scalability issues in real-time settings. The proposed system, hChain 4.0, emplo…
▽ More
The growing utilization of Internet of Medical Things (IoMT) devices, including smartwatches and wearable medical devices, has facilitated real-time health monitoring and data analysis to enhance healthcare outcomes. These gadgets necessitate improved security measures to safeguard sensitive health data while tackling scalability issues in real-time settings. The proposed system, hChain 4.0, employs a permissioned blockchain to provide a secure and scalable data infrastructure designed to fulfill these needs. This stands in contrast to conventional systems, which are vulnerable to security flaws or rely on public blockchains, constrained by scalability and expense. The proposed approach introduces a high-privacy method in which health data are encrypted using the Advanced Encryption Standard (AES) for time-efficient encryption, combined with Partial Homomorphic Encryption (PHE) to enable secure computations on encrypted data, thereby enhancing privacy. Moreover, it utilizes private channels that enable isolated communication and ledger between stakeholders, ensuring robust privacy while supporting collaborative operations. The proposed framework enables anonymized health data sharing for medical research by pseudonymizing patient identity. Additionally, hChain 4.0 incorporates Attribute-Based Access Control (ABAC) to provide secure electronic health record (EHR) sharing among authorized parties, where ABAC ensures fine-grained permission management vital for multi-organizational healthcare settings. Experimental assessments indicate that the proposed approach achieves higher scalability, cost-effectiveness, and validated security.
△ Less
Submitted 19 May, 2025;
originally announced May 2025.
-
hChain: Blockchain Based Large Scale EHR Data Sharing with Enhanced Security and Privacy
Authors:
Musharraf Alruwaill,
Saraju Mohanty,
Elias Kougianos
Abstract:
Concerns regarding privacy and data security in conventional healthcare prompted alternative technologies. In smart healthcare, blockchain technology addresses existing concerns with security, privacy, and electronic healthcare transmission. Integration of Blockchain Technology with the Internet of Medical Things (IoMT) allows real-time monitoring of protected healthcare data. Utilizing edge devic…
▽ More
Concerns regarding privacy and data security in conventional healthcare prompted alternative technologies. In smart healthcare, blockchain technology addresses existing concerns with security, privacy, and electronic healthcare transmission. Integration of Blockchain Technology with the Internet of Medical Things (IoMT) allows real-time monitoring of protected healthcare data. Utilizing edge devices with IoMT devices is very advantageous for addressing security, computing, and storage challenges. Encryption using symmetric and asymmetric keys is used to conceal sensitive information from unauthorized parties. SHA256 is an algorithm for one-way hashing. It is used to verify that the data has not been altered, since if it had, the hash value would have changed. This article offers a blockchain-based smart healthcare system using IoMT devices for continuous patient monitoring. In addition, it employs edge resources in addition to IoMT devices to have extra computing power and storage to hash and encrypt incoming data before sending it to the blockchain. Symmetric key is utilized to keep the data private even in the blockchain, allowing the patient to safely communicate the data through smart contracts while preventing unauthorized physicians from seeing the data. Through the use of a verification node and blockchain, an asymmetric key is used for the signing and validation of patient data in the healthcare provider system. In addition to other security measures, location-based authentication is recommended to guarantee that data originates from the patient area. Through the edge device, SHA256 is utilized to secure the data's integrity and a secret key is used to maintain its secrecy. The hChain architecture improves the computing power of IoMT environments, the security of EHR sharing through smart contracts, and the privacy and authentication procedures.
△ Less
Submitted 18 May, 2025;
originally announced May 2025.
-
Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom
Authors:
Rishika Sen,
Sujoy Roychowdhury,
Sumit Soman,
H. G. Ranjani,
Srikhetra Mohanty
Abstract:
Knowledge Distillation (KD) is one of the approaches to reduce the size of Large Language Models (LLMs). A LLM with smaller number of model parameters (student) is trained to mimic the performance of a LLM of a larger size (teacher model) on a specific task. For domain-specific tasks, it is not clear if teacher or student model, or both, must be considered for domain adaptation. In this work, we s…
▽ More
Knowledge Distillation (KD) is one of the approaches to reduce the size of Large Language Models (LLMs). A LLM with smaller number of model parameters (student) is trained to mimic the performance of a LLM of a larger size (teacher model) on a specific task. For domain-specific tasks, it is not clear if teacher or student model, or both, must be considered for domain adaptation. In this work, we study this problem from perspective of telecom domain Question-Answering (QA) task. We systematically experiment with Supervised Fine-tuning (SFT) of teacher only, SFT of student only and SFT of both prior to KD. We design experiments to study the impact of vocabulary (same and different) and KD algorithms (vanilla KD and Dual Space KD, DSKD) on the distilled model. Multi-faceted evaluation of the distillation using 14 different metrics (N-gram, embedding and LLM-based metrics) is considered. Experimental results show that SFT of teacher improves performance of distilled model when both models have same vocabulary, irrespective of algorithm and metrics. Overall, SFT of both teacher and student results in better performance across all metrics, although the statistical significance of the same depends on the vocabulary of the teacher models.
△ Less
Submitted 28 April, 2025;
originally announced April 2025.
-
Explicit Lossless Vertex Expanders
Authors:
Jun-Ting Hsieh,
Alexander Lubotzky,
Sidhanth Mohanty,
Assaf Reiner,
Rachel Yun Zhang
Abstract:
We give the first construction of explicit constant-degree lossless vertex expanders. Specifically, for any $\varepsilon > 0$ and sufficiently large $d$, we give an explicit construction of an infinite family of $d$-regular graphs where every small set $S$ of vertices has $(1-\varepsilon)d|S|$ neighbors (which implies $(1-2\varepsilon)d|S|$ unique-neighbors). Our results also extend naturally to c…
▽ More
We give the first construction of explicit constant-degree lossless vertex expanders. Specifically, for any $\varepsilon > 0$ and sufficiently large $d$, we give an explicit construction of an infinite family of $d$-regular graphs where every small set $S$ of vertices has $(1-\varepsilon)d|S|$ neighbors (which implies $(1-2\varepsilon)d|S|$ unique-neighbors). Our results also extend naturally to construct biregular bipartite graphs of any constant imbalance, where small sets on each side have strong expansion guarantees. The graphs we construct admit a free group action, and hence realize new families of quantum LDPC codes of Lin and M. Hsieh with a linear time decoding algorithm.
Our construction is based on taking an appropriate product of a constant-sized lossless expander with a base graph constructed from Ramanujan Cayley cubical complexes.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
Using Artificial Neural Networks to Optimize Acceleration Due to Gravity g Measurement in a Compound Pendulum Experiment
Authors:
Sudakhina Prusty,
Raja Das,
Saralasrita Mohanty
Abstract:
In this study, we explore a novel approach of implementing the Artificial Neural Network (ANN) model to validate a traditional experiment performed in the undergraduate physics laboratory, e.g. measurement of acceleration due to gravity g using compound pendulum. The input layer for the ANN model comprises of different parameters of the compound pendulum experiment such as its effective length, ti…
▽ More
In this study, we explore a novel approach of implementing the Artificial Neural Network (ANN) model to validate a traditional experiment performed in the undergraduate physics laboratory, e.g. measurement of acceleration due to gravity g using compound pendulum. The input layer for the ANN model comprises of different parameters of the compound pendulum experiment such as its effective length, time period of oscillation and initial angular displacement. The model is first trained by using 70 percent of the experimental data. Then the trained model is validated and tested on the rest 30 percent of the experimental data which are treated as unseen data to predict the value of g. The ANN-predicted values are compared with the experimental value of g to assess precision. The average value of g was determined to be 1009.029797cm/s^2 with a random error of $\pm$6.817633 cm/s^2 through traditional experiment. However, the ANN-predicted value of g was 1009.029858 with a mean absolute error of 0.000592 cm/s^2. The authors propose that ANN-based methodologies could bridge the gap between theoretical understanding and practical application by introducing students to cutting-edge computational techniques. Such integration allows for a deeper engagement with the subject matter, encouraging critical thinking and problem-solving skills in experimental physics. This innovative approach underscores the transformative role of machine learning in modern physics education. The model's performance demonstrates high accuracy and robustness in predicting g, outperforming traditional empirical approaches. Our results indicate that ANN-based optimization can significantly improve the precision of gravitational measurements, offering a reliable and efficient tool for applications that require high-accuracy determination of gravitational acceleration.
△ Less
Submitted 2 April, 2025; v1 submitted 30 March, 2025;
originally announced March 2025.
-
Nanoparticle Deposition Techniques for Silica Nanoparticles: Synthesis, Electrophoretic Deposition, and Optimization- A review
Authors:
Srabani Karmakar,
Milind Deo,
Imteaz Rahaman,
Swomitra Kumar Mohanty
Abstract:
Silica nanoparticles have emerged as key building blocks for advanced applications in electronics, catalysis, energy storage, biomedicine, and environmental science. In this review, we focus on recent developments in both the synthesis and deposition of these nanoparticles, emphasizing the widely used Stöber method and the versatile technique of electrophoretic deposition (EPD). The Stöber method…
▽ More
Silica nanoparticles have emerged as key building blocks for advanced applications in electronics, catalysis, energy storage, biomedicine, and environmental science. In this review, we focus on recent developments in both the synthesis and deposition of these nanoparticles, emphasizing the widely used Stöber method and the versatile technique of electrophoretic deposition (EPD). The Stöber method is celebrated for its simplicity and reliability, offering precise control over particle size, morphology, and surface properties to produce uniform, monodisperse silica nanoparticles that meet high-quality standards for advanced applications. EPD, on the other hand, is a cost-effective, room-temperature process that enables uniform coatings on substrates with complex geometries. When compared to traditional techniques such as chemical vapor deposition, atomic layer deposition, and spin coating, EPD stands out due to its scalability, enhanced material compatibility, and ease of processing. Moreover, Future research should integrate AI-driven optimization with active learning to enhance electrophoretic deposition (EPD) of silica nanoparticles, leveraging predictive modeling and real-time adjustments for improved film quality and process efficiency. This approach promises to accelerate material discovery and enable scalable nanofabrication of advanced functional films.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
Sterile-neutrino search based on 259 days of KATRIN data
Authors:
Himal Acharya,
Max Aker,
Dominic Batzler,
Armen Beglarian,
Justus Beisenkötter,
Matteo Biassoni,
Benedikt Bieringer,
Yanina Biondi,
Matthias Böttcher,
Beate Bornschein,
Lutz Bornschein,
Marco Carminati,
Auttakit Chatrabhuti,
Suren Chilingaryan,
Deseada Díaz Barrero,
Byron A. Daniel,
Martin Descher,
Otokar Dragoun,
Guido Drexlin,
Frank Edzards,
Klaus Eitel,
Enrico Ellinger,
Ralph Engel,
Sanshiro Enomoto,
Luca Fallböhmer
, et al. (110 additional authors not shown)
Abstract:
Neutrinos are the most abundant fundamental matter particles in the Universe and play a crucial role in particle physics and cosmology. Neutrino oscillation, discovered about 25 years ago, reveals that the three known species mix with each other. Anomalous results from reactor and radioactive-source experiments suggest a possible fourth neutrino state, the sterile neutrino, which does not interact…
▽ More
Neutrinos are the most abundant fundamental matter particles in the Universe and play a crucial role in particle physics and cosmology. Neutrino oscillation, discovered about 25 years ago, reveals that the three known species mix with each other. Anomalous results from reactor and radioactive-source experiments suggest a possible fourth neutrino state, the sterile neutrino, which does not interact via the weak force. The KATRIN experiment, primarily designed to measure the neutrino mass via tritium $β$-decay, also searches for sterile neutrinos suggested by these anomalies. A sterile-neutrino signal would appear as a distortion in the $β$-decay energy spectrum, characterized by a discontinuity in curvature (kink) related to the sterile-neutrino mass. This signature, which depends only on the shape of the spectrum rather than its absolute normalization, offers a robust, complementary approach to reactor experiments. KATRIN examined the energy spectrum of 36 million tritium $β$-decay electrons recorded in 259 measurement days within the last 40 electronvolt below the endpoint. The results exclude a substantial part of the parameter space suggested by the gallium anomaly and challenge the Neutrino-4 claim. Together with other neutrino-disappearance experiments, KATRIN probes sterile-to-active mass splittings from a fraction of an electron-volt squared to several hundred electron-volts squared, excluding light sterile neutrinos with mixing angles above a few percent.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
Measurement of the inhomogeneity of the KATRIN tritium source electric potential by high-resolution spectroscopy of conversion electrons from $^{83m}$Kr
Authors:
H. Acharya,
M. Aker,
D. Batzler,
A. Beglarian,
J. Beisenkötter,
M. Biassoni,
B. Bieringer,
Y. Biondi,
F. Block,
B. Bornschein,
L. Bornschein,
M. Böttcher,
M. Carminati,
A. Chatrabhuti,
S. Chilingaryan,
B. A. Daniel,
M. Descher,
D. Díaz Barrero,
O. Dragoun,
G. Drexlin,
F. Edzards,
K. Eitel,
E. Ellinger,
R. Engel,
S. Enomoto
, et al. (108 additional authors not shown)
Abstract:
Precision spectroscopy of the electron spectrum of the tritium $β$-decay near the kinematic endpoint is a direct method to determine the effective electron antineutrino mass. The KArlsruhe TRItium Neutrino (KATRIN) experiment aims to determine this quantity with a sensitivity of better than 0.3$\,$eV (90$\,$% C.L.). An inhomogeneous electric potential in the tritium source of KATRIN can lead to di…
▽ More
Precision spectroscopy of the electron spectrum of the tritium $β$-decay near the kinematic endpoint is a direct method to determine the effective electron antineutrino mass. The KArlsruhe TRItium Neutrino (KATRIN) experiment aims to determine this quantity with a sensitivity of better than 0.3$\,$eV (90$\,$% C.L.). An inhomogeneous electric potential in the tritium source of KATRIN can lead to distortions of the $β$-spectrum, which directly impact the neutrino-mass observable. This effect can be quantified through precision spectroscopy of the conversion-electrons of co-circulated metastable $^{83m}$Kr. Therefore, dedicated, several-weeks long measurement campaigns have been performed within the KATRIN data taking schedule. In this work, we infer the tritium source potential observables from these measurements, and present their implications for the neutrino-mass determination.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
What's DAT? Three Case Studies of Measuring Software Development Productivity at Meta With Diff Authoring Time
Authors:
Moritz Beller,
Amanda Park,
Karim Nakad,
Akshay Patel,
Sarita Mohanty,
Ford Garberson,
Ian G. Malone,
Vaishali Garg,
Henri Verroken,
Andrew Kennedy,
Pavel Avgustinov
Abstract:
This paper introduces Diff Authoring Time (DAT), a powerful, yet conceptually simple approach to measuring software development productivity that enables rigorous experimentation. DAT is a time based metric, which assess how long engineers take to develop changes, using a privacy-aware telemetry system integrated with version control, the IDE, and the OS. We validate DAT through observational stud…
▽ More
This paper introduces Diff Authoring Time (DAT), a powerful, yet conceptually simple approach to measuring software development productivity that enables rigorous experimentation. DAT is a time based metric, which assess how long engineers take to develop changes, using a privacy-aware telemetry system integrated with version control, the IDE, and the OS. We validate DAT through observational studies, surveys, visualizations, and descriptive statistics. At Meta, DAT has powered experiments and case studies on more than 20 projects. Here, we highlight (1) an experiment on introducing mock types (a 14% DAT improvement), (2) the development of automatic memoization in the React compiler (33% improvement), and (3) an estimate of thousands of DAT hours saved annually through code sharing (> 50% improvement). DAT offers a precise, yet high-coverage measure for development productivity, aiding business decisions. It enhances development efficiency by aligning the internal development workflow with the experiment-driven culture of external product development. On the research front, DAT has enabled us to perform rigorous experimentation on long-standing software engineering questions such as "do types make development more efficient?"
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Testing the Starobinsky model of inflation with resonant cavities
Authors:
Subhendra Mohanty,
Sukanta Panda,
Archit Vidyarthi
Abstract:
We show that in the Starobinsky inflation model stochastic gravitational waves are produced when the scalaron - which is the massive scalar mode of the metric - decays into gravitons during reheating. This decay is accompanied by decay of scalaron into matter as well through a similar coupling, proving an efficient reheating stage. The stochastic gravitational waves thus produced have characterist…
▽ More
We show that in the Starobinsky inflation model stochastic gravitational waves are produced when the scalaron - which is the massive scalar mode of the metric - decays into gravitons during reheating. This decay is accompanied by decay of scalaron into matter as well through a similar coupling, proving an efficient reheating stage. The stochastic gravitational waves thus produced have characteristic strain $h_c\sim 10^{-35}-10^{-34}$ in the frequency range $10^{5}-10^{12}\, {\rm Hz}$ which makes them accessible to resonant cavity searches for graviton to photon conversions. Their detection could conclusively validate the Starobinsky inflation model.
△ Less
Submitted 9 March, 2025;
originally announced March 2025.
-
Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases
Authors:
Suvendu Mohanty
Abstract:
Recent advancements in Artificial Intelligence, particularly in Large Language Models (LLMs), have transformed natural language processing by improving generative capabilities. However, detecting biases embedded within these models remains a challenge. Subtle biases can propagate misinformation, influence decision-making, and reinforce stereotypes, raising ethical concerns. This study presents a d…
▽ More
Recent advancements in Artificial Intelligence, particularly in Large Language Models (LLMs), have transformed natural language processing by improving generative capabilities. However, detecting biases embedded within these models remains a challenge. Subtle biases can propagate misinformation, influence decision-making, and reinforce stereotypes, raising ethical concerns. This study presents a detection framework to identify nuanced biases in LLMs. The approach integrates contextual analysis, interpretability via attention mechanisms, and counterfactual data augmentation to capture hidden biases across linguistic contexts. The methodology employs contrastive prompts and synthetic datasets to analyze model behaviour across cultural, ideological, and demographic scenarios.
Quantitative analysis using benchmark datasets and qualitative assessments through expert reviews validate the effectiveness of the framework. Results show improvements in detecting subtle biases compared to conventional methods, which often fail to highlight disparities in model responses to race, gender, and socio-political contexts. The framework also identifies biases arising from imbalances in training data and model architectures. Continuous user feedback ensures adaptability and refinement. This research underscores the importance of proactive bias mitigation strategies and calls for collaboration between policymakers, AI developers, and regulators. The proposed detection mechanisms enhance model transparency and support responsible LLM deployment in sensitive applications such as education, legal systems, and healthcare. Future work will focus on real-time bias monitoring and cross-linguistic generalization to improve fairness and inclusivity in AI-driven communication tools.
△ Less
Submitted 7 March, 2025;
originally announced March 2025.
-
Gauge Invariant Effective Potential
Authors:
Debanjan Balui,
Joydeep Chakrabortty,
Debmalya Dey,
Subhendra Mohanty
Abstract:
We show that the long-standing problem of gauge dependence of the effective potential arises due to the factorisation of the determinant of operators, which is invalid when we take the zeta-regularised trace of the operators. We show by correcting for this assumption by computing the multiplicative anomaly, the gauge-dependent terms of the effective potential cancel. We also show that in two- and…
▽ More
We show that the long-standing problem of gauge dependence of the effective potential arises due to the factorisation of the determinant of operators, which is invalid when we take the zeta-regularised trace of the operators. We show by correcting for this assumption by computing the multiplicative anomaly, the gauge-dependent terms of the effective potential cancel. We also show that in two- and odd-dimensional non-compact spacetime manifolds where the multiplicative anomaly term is zero, the standard calculation of one-loop effective potential gives a gauge-independent result. These results are in support of our claim that the multiplicative anomaly may play a crucial role in removing the gauge dependence in the effective potential in the four-dimensional non-compact manifold. Noting the non-trivial aspects of this anomaly computation for a generic scenario, we propose the Heat-Kernel method to compute the effective potential where this anomaly emerges as a total derivative, thus redundant. We explicitly show how one can calculate the gauge independent, effective action and the Coleman-Weinberg effective potential by employing the Heat-Kernel method. Based on this result, we advocate the Heat-Kernel expansion as the most straightforward method, as it naturally deals with the matrix elliptic operator for the calculation of manifestly gauge independent, effective actions compared to other conventional methods.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
Why is the strength of a polymer network so low?
Authors:
Shaswat Mohanty,
Jose Blanchet,
Zhigang Suo,
Wei Cai
Abstract:
Experiments have long shown that a polymer network of covalent bonds commonly ruptures at a stress that is orders of magnitude lower than the strength of the covalent bonds. Here we investigate this large reduction in strength by coarse-grained molecular dynamics simulations. We show that the network ruptures by sequentially breaking a small fraction of bonds, and that each broken bond lies on the…
▽ More
Experiments have long shown that a polymer network of covalent bonds commonly ruptures at a stress that is orders of magnitude lower than the strength of the covalent bonds. Here we investigate this large reduction in strength by coarse-grained molecular dynamics simulations. We show that the network ruptures by sequentially breaking a small fraction of bonds, and that each broken bond lies on the minimum "shortest path". The shortest path is the path of the fewest bonds that connect two monomers at the opposite ends of the network. As the network is stretched, the minimum shortest path straightens and bears high tension set by covalent bonds, while most strands off the path deform by entropic elasticity. After a bond on the minimum shortest path breaks, the process repeats for the next minimum shortest path. As the network is stretched and bonds are broken, the scatter in lengths of the shortest paths first narrows, causing stress to rise, and then broadens, causing stress to decline. This sequential breaking of a small fraction of bonds causes the network to rupture at a stress that is orders of magnitude below the strength of the covalent bonds.
△ Less
Submitted 16 February, 2025;
originally announced February 2025.
-
Ground state properties of a spin-$\frac{5}{2}$ frustrated triangular lattice antiferromagnet NH$_{4}$Fe(PO$_{3}$F)$_2$
Authors:
S. Mohanty,
K. M. Ranjith,
C. S. Saramgi,
Y. Skourski,
B. Büchner,
H. -J. Grafe,
R. Nath
Abstract:
Structural and magnetic properties of a two-dimensional spin-$\frac{5}{2}$ frustrated triangular lattice antiferromagnet NH$_{4}$Fe(PO$_{3}$F)$_2$ are explored via x-ray diffraction, magnetic susceptibility, high-field magnetization, heat capacity, and $^{31}$P nuclear magnetic resonance experiments on a polycrystalline sample. The compound portrays distorted triangular units of the Fe$^{3+}$ ions…
▽ More
Structural and magnetic properties of a two-dimensional spin-$\frac{5}{2}$ frustrated triangular lattice antiferromagnet NH$_{4}$Fe(PO$_{3}$F)$_2$ are explored via x-ray diffraction, magnetic susceptibility, high-field magnetization, heat capacity, and $^{31}$P nuclear magnetic resonance experiments on a polycrystalline sample. The compound portrays distorted triangular units of the Fe$^{3+}$ ions with anisotropic bond lengths. The magnetic susceptibility shows a broad maxima around $T^{\rm{max}}_χ\simeq 12$ K, mimicking the short-range antiferromagnetic order of a low-dimensional spin system. The magnetic susceptibility and NMR shift could be modeled assuming the spin-$5/2$ isotropic triangular lattice model and the average value of the exchange coupling is estimated to be $J/k_{\rm B} \simeq 1.7$ K. This value of the exchange coupling is reproduced well from the saturation field of the pulse field data. It shows the onset of a magnetic ordering at $T_{\rm N} \simeq 5.7$ K, setting the frustration ratio of $f = \frac{|θ_{\rm CW}|}{T_{\rm N}} \simeq 5.7$. Such a value of $f$ reflects moderate magnetic frustration in the compound. The d$M$/d$H$ vs $H$ plots of the low temperature magnetic isotherms exhibit a sharp peak at $H_{\rm SF} \simeq 1.45$ T, suggesting a field-induced spin-flop transition and magnetic anisotropy. The rectangular shape of the $^{31}$P NMR spectra below $T_{\rm N}$ unfolds that the ordering is commensurate antiferromagnet type. Three distinct phase regimes are clearly discerned in the $H - T$ phase diagram, redolent of a frustrated magnet with in-plane (XY-type) anisotropy.
△ Less
Submitted 10 February, 2025;
originally announced February 2025.
-
Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture
Authors:
Mitul Goswami,
Sainath Dey,
Aniruddha Mukherjee,
Suneeta Mohanty,
Prasant Kumar Pattnaik
Abstract:
This study demonstrates a novel use of the U-Net architecture in the field of semantic segmentation to detect landforms using preprocessed satellite imagery. The study applies the U-Net model for effective feature extraction by using Convolutional Neural Network (CNN) segmentation techniques. Dropout is strategically used for regularization to improve the model's perseverance, and the Adam optimiz…
▽ More
This study demonstrates a novel use of the U-Net architecture in the field of semantic segmentation to detect landforms using preprocessed satellite imagery. The study applies the U-Net model for effective feature extraction by using Convolutional Neural Network (CNN) segmentation techniques. Dropout is strategically used for regularization to improve the model's perseverance, and the Adam optimizer is used for effective training. The study thoroughly assesses the performance of the U-Net architecture utilizing a large sample of preprocessed satellite topographical images. The model excels in semantic segmentation tasks, displaying high-resolution outputs, quick feature extraction, and flexibility to a wide range of applications. The findings highlight the U-Net architecture's substantial contribution to the advancement of machine learning and image processing technologies. The U-Net approach, which emphasizes pixel-wise categorization and comprehensive segmentation map production, is helpful in practical applications such as autonomous driving, disaster management, and land use planning. This study not only investigates the complexities of U-Net architecture for semantic segmentation, but also highlights its real-world applications in image classification, analysis, and landform identification. The study demonstrates the U-Net model's key significance in influencing the environment of modern technology.
△ Less
Submitted 8 February, 2025;
originally announced February 2025.
-
Enhancing Near Real Time AI-NWP Hurricane Forecasts: Improving Explainability and Performance Through Physics-Based Models and Land Surface Feedback
Authors:
Naveen Sudharsan,
Manmeet Singh,
Sasanka Talukdar,
Shyama Mohanty,
Harsh Kamath,
Krishna K. Osuri,
Hassan Dashtian,
Michael Young,
Zong-Liang Yang,
Clint Dawson,
L. Ruby Leung,
Sundararaman Gopalakrishnan,
Avichal Mehra,
Vijay Tallapragada,
Dev Niyogi
Abstract:
Hurricane track forecasting remains a significant challenge due to the complex interactions between the atmosphere, land, and ocean. Although AI-based numerical weather prediction models, such as Google Graphcast operation, have significantly improved hurricane track forecasts, they currently function as atmosphere-only models, omitting critical land and ocean interactions. To investigate the impa…
▽ More
Hurricane track forecasting remains a significant challenge due to the complex interactions between the atmosphere, land, and ocean. Although AI-based numerical weather prediction models, such as Google Graphcast operation, have significantly improved hurricane track forecasts, they currently function as atmosphere-only models, omitting critical land and ocean interactions. To investigate the impact of land feedback, we conducted independent simulations using the physics-based Hurricane WRF experimental model to assess how soil moisture variations influence storm trajectories. Our results show that land surface conditions significantly alter storm paths, demonstrating the importance of land-atmosphere coupling in hurricane prediction. Although recent advances have introduced AI-based atmosphere-ocean coupled models, a fully functional AI-driven atmosphere-land-ocean model does not yet exist. Our findings suggest that AI-NWP models could be further improved by incorporating land surface interactions, improving both forecast accuracy and explainability. Developing a fully coupled AI-based weather model would mark a critical step toward more reliable and physically consistent hurricane forecasting, with direct applications for disaster preparedness and risk mitigation.
△ Less
Submitted 3 February, 2025;
originally announced February 2025.
-
FruitPAL: An IoT-Enabled Framework for Automatic Monitoring of Fruit Consumption in Smart Healthcare
Authors:
Abdulrahman Alkinani,
Alakananda Mitra,
Saraju P. Mohanty,
Elias Kougianos
Abstract:
Fruits are rich sources of essential vitamins and nutrients that are vital for human health. This study introduces two fully automated devices, FruitPAL and its updated version, FruitPAL 2.0, which aim to promote safe fruit consumption while reducing health risks. Both devices leverage a high-quality dataset of fifteen fruit types and use advanced models- YOLOv8 and YOLOv5 V6.0- to enhance detecti…
▽ More
Fruits are rich sources of essential vitamins and nutrients that are vital for human health. This study introduces two fully automated devices, FruitPAL and its updated version, FruitPAL 2.0, which aim to promote safe fruit consumption while reducing health risks. Both devices leverage a high-quality dataset of fifteen fruit types and use advanced models- YOLOv8 and YOLOv5 V6.0- to enhance detection accuracy. The original FruitPAL device can identify various fruit types and notify caregivers if an allergic reaction is detected, thanks to YOLOv8's improved accuracy and rapid response time. Notifications are transmitted via the cloud to mobile devices, ensuring real-time updates and immediate accessibility. FruitPAL 2.0 builds upon this by not only detecting fruit but also estimating its nutritional value, thereby encouraging healthy consumption. Trained on the YOLOv5 V6.0 model, FruitPAL 2.0 analyzes fruit intake to provide users with valuable dietary insights. This study aims to promote fruit consumption by helping individuals make informed choices, balancing health benefits with allergy awareness. By alerting users to potential allergens while encouraging the consumption of nutrient-rich fruits, these devices support both health maintenance and dietary awareness.
△ Less
Submitted 20 January, 2025;
originally announced February 2025.
-
Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems
Authors:
Soham Roy,
Mitul Goswami,
Nisharg Nargund,
Suneeta Mohanty,
Prasant Kumar Pattnaik
Abstract:
This study introduces a system leveraging Large Language Models (LLMs) to extract text and enhance user interaction with PDF documents via a conversational interface. Utilizing Retrieval-Augmented Generation (RAG), the system provides informative responses to user inquiries while highlighting relevant passages within the PDF. Upon user upload, the system processes the PDF, employing sentence embed…
▽ More
This study introduces a system leveraging Large Language Models (LLMs) to extract text and enhance user interaction with PDF documents via a conversational interface. Utilizing Retrieval-Augmented Generation (RAG), the system provides informative responses to user inquiries while highlighting relevant passages within the PDF. Upon user upload, the system processes the PDF, employing sentence embeddings to create a document-specific vector store. This vector store enables efficient retrieval of pertinent sections in response to user queries. The LLM then engages in a conversational exchange, using the retrieved information to extract text and generate comprehensive, contextually aware answers. While our approach demonstrates competitive ROUGE values compared to existing state-of-the-art techniques for text extraction and summarization, we acknowledge that further qualitative evaluation is necessary to fully assess its effectiveness in real-world applications. The proposed system gives competitive ROUGE values as compared to existing state-of-the-art techniques for text extraction and summarization, thus offering a valuable tool for researchers, students, and anyone seeking to efficiently extract knowledge and gain insights from documents through an intuitive question-answering interface.
△ Less
Submitted 16 January, 2025;
originally announced January 2025.
-
Bridging Context Gaps: Enhancing Comprehension in Long-Form Social Conversations Through Contextualized Excerpts
Authors:
Shrestha Mohanty,
Sarah Xuan,
Jacob Jobraeel,
Anurag Kumar,
Deb Roy,
Jad Kabbara
Abstract:
We focus on enhancing comprehension in small-group recorded conversations, which serve as a medium to bring people together and provide a space for sharing personal stories and experiences on crucial social matters. One way to parse and convey information from these conversations is by sharing highlighted excerpts in subsequent conversations. This can help promote a collective understanding of rel…
▽ More
We focus on enhancing comprehension in small-group recorded conversations, which serve as a medium to bring people together and provide a space for sharing personal stories and experiences on crucial social matters. One way to parse and convey information from these conversations is by sharing highlighted excerpts in subsequent conversations. This can help promote a collective understanding of relevant issues, by highlighting perspectives and experiences to other groups of people who might otherwise be unfamiliar with and thus unable to relate to these experiences. The primary challenge that arises then is that excerpts taken from one conversation and shared in another setting might be missing crucial context or key elements that were previously introduced in the original conversation. This problem is exacerbated when conversations become lengthier and richer in themes and shared experiences. To address this, we explore how Large Language Models (LLMs) can enrich these excerpts by providing socially relevant context. We present approaches for effective contextualization to improve comprehension, readability, and empathy. We show significant improvements in understanding, as assessed through subjective and objective evaluations. While LLMs can offer valuable context, they struggle with capturing key social aspects. We release the Human-annotated Salient Excerpts (HSE) dataset to support future work. Additionally, we show how context-enriched excerpts can provide more focused and comprehensive conversation summaries.
△ Less
Submitted 27 December, 2024;
originally announced December 2024.
-
Bridging the Data Provenance Gap Across Text, Speech and Video
Authors:
Shayne Longpre,
Nikhil Singh,
Manuel Cherep,
Kushagra Tiwary,
Joanna Materzynska,
William Brannon,
Robert Mahari,
Naana Obeng-Marnu,
Manan Dey,
Mohammed Hamdy,
Nayan Saxena,
Ahmad Mustafa Anis,
Emad A. Alghamdi,
Vu Minh Chien,
Da Yin,
Kun Qian,
Yizhi Li,
Minnie Liang,
An Dinh,
Shrestha Mohanty,
Deividas Mataciunas,
Tobin South,
Jianguo Zhang,
Ariel N. Lee,
Campbell S. Lund
, et al. (18 additional authors not shown)
Abstract:
Progress in AI is driven largely by the scale and quality of training data. Despite this, there is a deficit of empirical analysis examining the attributes of well-established datasets beyond text. In this work we conduct the largest and first-of-its-kind longitudinal audit across modalities--popular text, speech, and video datasets--from their detailed sourcing trends and use restrictions to thei…
▽ More
Progress in AI is driven largely by the scale and quality of training data. Despite this, there is a deficit of empirical analysis examining the attributes of well-established datasets beyond text. In this work we conduct the largest and first-of-its-kind longitudinal audit across modalities--popular text, speech, and video datasets--from their detailed sourcing trends and use restrictions to their geographical and linguistic representation. Our manual analysis covers nearly 4000 public datasets between 1990-2024, spanning 608 languages, 798 sources, 659 organizations, and 67 countries. We find that multimodal machine learning applications have overwhelmingly turned to web-crawled, synthetic, and social media platforms, such as YouTube, for their training sets, eclipsing all other sources since 2019. Secondly, tracing the chain of dataset derivations we find that while less than 33% of datasets are restrictively licensed, over 80% of the source content in widely-used text, speech, and video datasets, carry non-commercial restrictions. Finally, counter to the rising number of languages and geographies represented in public AI training datasets, our audit demonstrates measures of relative geographical and multilingual representation have failed to significantly improve their coverage since 2013. We believe the breadth of our audit enables us to empirically examine trends in data sourcing, restrictions, and Western-centricity at an ecosystem-level, and that visibility into these questions are essential to progress in responsible AI. As a contribution to ongoing improvements in dataset transparency and responsible use, we release our entire multimodal audit, allowing practitioners to trace data provenance across text, speech, and video.
△ Less
Submitted 18 February, 2025; v1 submitted 18 December, 2024;
originally announced December 2024.
-
SprayCraft: Graph-Based Route Optimization for Variable Rate Precision Spraying
Authors:
Kiran K. Kethineni,
Saraju P. Mohanty,
Elias Kougianos,
Sanjukta Bhowmick,
Laavanya Rachakonda
Abstract:
To efficiently manage plant diseases, Agriculture Cyber-Physical Systems (A-CPS) have been developed to detect and localize disease infestations by integrating the Internet of Agro-Things (IoAT). By the nature of plant and pathogen interactions, the spread of a disease appears as a focus with density of infected plants and intensity of infection diminishing outwards. This gradient of infection nee…
▽ More
To efficiently manage plant diseases, Agriculture Cyber-Physical Systems (A-CPS) have been developed to detect and localize disease infestations by integrating the Internet of Agro-Things (IoAT). By the nature of plant and pathogen interactions, the spread of a disease appears as a focus with density of infected plants and intensity of infection diminishing outwards. This gradient of infection needs variable rate and precision pesticide spraying to efficiently utilize resources and effectively handle the diseases. This article, SprayCraft presents a graph based method for disease management A-CPS to identify disease hotspots and compute near optimal path for a spraying drone to perform variable rate precision spraying. It uses graph to represent the diseased locations and their spatial relation, Message Passing is performed over the graph to compute the probability of a location to be a disease hotspot. These probabilities also serve as disease intensity measures and are used for variable rate spraying at each location. Whereas, the graph is utilized to compute tour path by considering it as Traveling Salesman Problem (TSP) for precision spraying by the drone. Proposed method has been validated on synthetic data of locations of diseased locations in a farmland.
△ Less
Submitted 12 December, 2024;
originally announced December 2024.
-
Detection and parameter estimation of supermassive black hole ringdown signals using a pulsar timing array
Authors:
Xuan Tao,
Yan Wang,
Soumya D. Mohanty
Abstract:
Gravitational wave (GW) searches using pulsar timing arrays (PTAs) are commonly assumed to be limited to a GW frequency of $\lesssim 4\times 10^{-7}$Hz given by the Nyquist rate associated with the average observational cadence of $2$ weeks for a single pulsar. However, by taking advantage of asynchronous observations of multiple pulsars, a PTA can detect GW signals at higher frequencies. This all…
▽ More
Gravitational wave (GW) searches using pulsar timing arrays (PTAs) are commonly assumed to be limited to a GW frequency of $\lesssim 4\times 10^{-7}$Hz given by the Nyquist rate associated with the average observational cadence of $2$ weeks for a single pulsar. However, by taking advantage of asynchronous observations of multiple pulsars, a PTA can detect GW signals at higher frequencies. This allows a sufficiently large PTA to detect and characterize the ringdown signals emitted following the merger of supermassive binary black holes (SMBBHs), leading to stringent tests of the no-hair theorem in the mass range of such systems. Such large-scale PTAs are imminent with the advent of the FAST telescope and the upcoming era of the Square Kilometer Array (SKA). To scope out the data analysis challenges involved in such a search, we propose a likelihood-based method coupled with Particle Swarm Optimization and apply it to a simulated large-scale PTA comprised of $100$ pulsars, each having a timing residual noise standard deviation of $100$~nsec, with randomized observation times. Focusing on the dominant $(2,2)$ mode of the ringdown signal, we show that it is possible to achieve a $99\%$ detection probability with a false alarm probability below $0.2\%$ for an optimal signal-to-noise ratio (SNR) $>10$. This corresponds, for example, to an equal-mass non-spinning SMBBH with an observer frame chirp mass $M_c = 9.52\times10^{9}M_{\odot}$ at a luminosity distance of $D_L = 420$ Mpc.
△ Less
Submitted 29 March, 2025; v1 submitted 10 December, 2024;
originally announced December 2024.
-
Ionization chemistry in the inner disc: a combined treatment of ionic and thermionic emission and arbitrary grain size distributions
Authors:
Morgan Williams,
Subhanjoy Mohanty
Abstract:
In the inner regions of protoplanetary discs, ionization chemistry controls the fluid viscosity, and is thus key to understanding various accretion, outflow and planet formation processes. The ionization is driven by thermal and non-thermal processes in the gas-phase, as well as by dust-gas interactions that lead to grain charging and ionic and thermionic emission from grain surfaces. The latter d…
▽ More
In the inner regions of protoplanetary discs, ionization chemistry controls the fluid viscosity, and is thus key to understanding various accretion, outflow and planet formation processes. The ionization is driven by thermal and non-thermal processes in the gas-phase, as well as by dust-gas interactions that lead to grain charging and ionic and thermionic emission from grain surfaces. The latter dust-gas interactions are moreover a strong function of the grain size distribution. However, analyses of chemical networks that include ionic/thermionic emission have so far only considered grains of a single size (or only approximately treated the effects of a size distribution), while analyses that include a distribution of grain sizes have ignored ionic/thermionic emission. Here, we: (1) investigate a general chemical network, widely applicable in inner disc regions, that includes gas-phase reactions, ionic and thermionic emission, and an arbitrary grain size distribution; (2) present a numerical method to solve this network in equilibrium; and (3) elucidate a general method to estimate the chemical time-scale. We show that: (a) approximating a grain size distribution by an "effective dust-to-gas ratio" (as done in previous work) can predict significantly inaccurate grain charges; and (b) grain charging significantly alters grain collisional time-scales in the inner disc. For conditions generally found in the inner disc, this work facilitates: (i) calculation of fluid resistivities and viscosity; and (ii) inclusion of the effect of grain charging on grain fragmentation and coagulation (a critical effect that is often ignored).
△ Less
Submitted 21 November, 2024;
originally announced November 2024.
-
One Loop Thermal Effective Action
Authors:
Joydeep Chakrabortty,
Subhendra Mohanty
Abstract:
We compute the one loop effective action for a Quantum Field Theory at finite temperature, in the presence of background gauge fields, employing the Heat-Kernel method. This method enables us to compute the thermal corrections to the Wilson coefficients associated with effective operators up to arbitrary mass dimension, which emerge after integrating out heavy scalars and fermions from a generic U…
▽ More
We compute the one loop effective action for a Quantum Field Theory at finite temperature, in the presence of background gauge fields, employing the Heat-Kernel method. This method enables us to compute the thermal corrections to the Wilson coefficients associated with effective operators up to arbitrary mass dimension, which emerge after integrating out heavy scalars and fermions from a generic UV theory. The Heat-Kernel coefficients are functions of non-zero background `electric', `magnetic' fields, and Polyakov loops. A major application of our formalism is the calculation of the finite temperature Coleman-Weinberg potential in effective theories, necessary for the study of phase transitions. A novel feature of this work is the systematic calculation of the dependence of Polyakov loops on the thermal factors of Heat-Kernel coefficients and the Coleman-Weinberg potential. We study the effect of Polyakov loop factors on phase transitions and comment on future directions in applications of the results derived in this work.
△ Less
Submitted 25 February, 2025; v1 submitted 21 November, 2024;
originally announced November 2024.
-
Chemical evolution of an evaporating lava pool
Authors:
Alfred Curry,
Subhanjoy Mohanty,
James E. Owen
Abstract:
Many known rocky exoplanets are so highly irradiated that their dayside surfaces are molten, and `silicate atmospheres', composed of rock-forming elements, are generated above these lava pools. The compositions of these `lava planet' atmospheres are of great interest because they must be linked to the composition of the underlying rocky interiors. It may be possible to investigate these atmosphere…
▽ More
Many known rocky exoplanets are so highly irradiated that their dayside surfaces are molten, and `silicate atmospheres', composed of rock-forming elements, are generated above these lava pools. The compositions of these `lava planet' atmospheres are of great interest because they must be linked to the composition of the underlying rocky interiors. It may be possible to investigate these atmospheres, either by detecting them directly via emission spectroscopy or by observing the dust tails which trail the low mass `catastrophically evaporating planets'. In this work, we develop a simple chemical model of the lava pool--atmosphere system under mass loss, to study its evolution. Mass loss can occur both into space and from the day to the nightside. We show that the system reaches a steady state, where the material in the escaping atmosphere has the same composition as that melted into the lava pool from the mantle. We show that the catastrophically evaporating planets are likely to be in this evolved state. This means that the composition of their dust tails is likely to be a direct trace of the composition of the mantle material that is melted into the lava pool. We further show that, due to the strength of day-to-nightside atmospheric transport, this evolved state may even apply to relatively high-mass planets (>1 Earth Mass). Moreover, the low pressure of evolved atmospheres implies that non-detections may not be due to the total lack of an atmosphere. Both conclusions are important for the interpretation of future observations.
△ Less
Submitted 20 November, 2024;
originally announced November 2024.
-
Explicit Two-Sided Vertex Expanders Beyond the Spectral Barrier
Authors:
Jun-Ting Hsieh,
Ting-Chun Lin,
Sidhanth Mohanty,
Ryan O'Donnell,
Rachel Yun Zhang
Abstract:
We construct the first explicit two-sided vertex expanders that bypass the spectral barrier.
Previously, the strongest known explicit vertex expanders were given by $d$-regular Ramanujan graphs, whose spectral properties imply that every small subset of vertices $S$ has at least $0.5d|S|$ distinct neighbors. However, it is possible to construct Ramanujan graphs containing a small set $S$ with no…
▽ More
We construct the first explicit two-sided vertex expanders that bypass the spectral barrier.
Previously, the strongest known explicit vertex expanders were given by $d$-regular Ramanujan graphs, whose spectral properties imply that every small subset of vertices $S$ has at least $0.5d|S|$ distinct neighbors. However, it is possible to construct Ramanujan graphs containing a small set $S$ with no more than $0.5d|S|$ neighbors. In fact, no explicit construction was known to break the $0.5 d$-barrier.
In this work, we give an explicit construction of an infinite family of $d$-regular graphs (for large enough $d$) where every small set expands by a factor of $\approx 0.6d$. More generally, for large enough $d_1,d_2$, we give an infinite family of $(d_1,d_2)$-biregular graphs where small sets on the left expand by a factor of $\approx 0.6d_1$, and small sets on the right expand by a factor of $\approx 0.6d_2$. In fact, our construction satisfies an even stronger property: small sets on the left and right have unique-neighbor expansion $0.6d_1$ and $0.6d_2$ respectively.
Our construction follows the tripartite line product framework of Hsieh, McKenzie, Mohanty & Paredes, and instantiates it using the face-vertex incidence of the $4$-dimensional Ramanujan clique complex as its base component. As a key part of our analysis, we derive new bounds on the triangle density of small sets in the Ramanujan clique complex.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
Weak Poincaré Inequalities, Simulated Annealing, and Sampling from Spherical Spin Glasses
Authors:
Brice Huang,
Sidhanth Mohanty,
Amit Rajaraman,
David X. Wu
Abstract:
There has been a recent surge of powerful tools to show rapid mixing of Markov chains, via functional inequalities such as Poincaré inequalities. In many situations, Markov chains fail to mix rapidly from a worst-case initialization, yet are expected to approximately sample from a random initialization. For example, this occurs if the target distribution has metastable states, small clusters accou…
▽ More
There has been a recent surge of powerful tools to show rapid mixing of Markov chains, via functional inequalities such as Poincaré inequalities. In many situations, Markov chains fail to mix rapidly from a worst-case initialization, yet are expected to approximately sample from a random initialization. For example, this occurs if the target distribution has metastable states, small clusters accounting for a vanishing fraction of the mass that are essentially disconnected from the bulk of the measure. Under such conditions, a Poincaré inequality cannot hold, necessitating new tools to prove sampling guarantees.
We develop a framework to analyze simulated annealing, based on establishing so-called weak Poincaré inequalities. These inequalities imply mixing from a suitably warm start, and simulated annealing provides a way to chain such warm starts together into a sampling algorithm. We further identify a local-to-global principle to prove weak Poincaré inequalities, mirroring the spectral independence and localization schemes frameworks for analyzing mixing times of Markov chains.
As our main application, we prove that simulated annealing samples from the Gibbs measure of a spherical spin glass for inverse temperatures up to a natural threshold, matching recent algorithms based on algorithmic stochastic localization. This provides the first Markov chain sampling guarantee that holds beyond the uniqueness threshold for spherical spin glasses, where mixing from a worst-case initialization is provably slow due to the presence of metastable states. As an ingredient in our proof, we prove bounds on the operator norm of the covariance matrix of spherical spin glasses in the full replica-symmetric regime.
Additionally, we resolve a question related to sampling using data-based initializations.
△ Less
Submitted 22 November, 2024; v1 submitted 13 November, 2024;
originally announced November 2024.
-
Everything You Wanted to Know About Consumer Light Management in Smart Energy
Authors:
Prajnyajit Mohanty,
Umesh C. Pati,
Kamalakanta Mahapatra,
Saraju P. Mohanty
Abstract:
Consumer lighting plays a significant role in the development of smart cities and smart villages. With the advancement of (IoT) technology, smart lighting solutions have become more prevalent in residential areas as well. These solutions provide consumers with increased energy efficiency, added convenience, and improved security. On the other hand, the growing number of IoT devices has become a gl…
▽ More
Consumer lighting plays a significant role in the development of smart cities and smart villages. With the advancement of (IoT) technology, smart lighting solutions have become more prevalent in residential areas as well. These solutions provide consumers with increased energy efficiency, added convenience, and improved security. On the other hand, the growing number of IoT devices has become a global concern due to the carbon footprint and carbon emissions associated with these devices. The overuse of batteries increases maintenance and cost to IoT devices and simultaneously possesses adverse environmental effects, ultimately exacerbating the pace of climate change. Therefore, in tandom with the principles of Industry 4.0, it has become crucial for manufacturing and research industries to prioritize sustainable measures adhering to smart energy as a prevention to the negative impacts. Consequently, it has undoubtedly garnered global interest from scientists, researchers, and industrialists to integrate state-of-the-art technologies in order to solve the current issues in consumer light management systems making it a complete sustainable, and smart solution for consumer lighting application. This manuscript provides a thorough investigation of various methods as well as techniques to design a state-of-the-art IoT-enabled consumer light management system. It critically reviews the existing works done in consumer light management systems, emphasizing the significant limitations and the need for sustainability. The top-down approach of developing sustainable computing frameworks for IoT-enabled consumer light management has been reviewed based on the multidisciplinary technologies involved and state-of-the-art works in the respective domains. Lastly, this article concludes by highlighting possible avenues for future research.
△ Less
Submitted 13 November, 2024;
originally announced November 2024.
-
Magnetic properties of frustrated spin-$\frac{1}{2}$ capped-kagome antiferromagnet (CsBr)Cu$_5$V$_2$O$_{10}$
Authors:
S. Guchhait,
D. V. Ambika,
S. Mohanty,
Y. Furukawa,
R. Nath
Abstract:
The structural and magnetic properties of a spin-$\frac{1}{2}$ averievite (CsBr)Cu$_5$V$_2$O$_{10}$ are investigated by means of temperature-dependent x-ray diffraction, magnetization, heat capacity, and $^{51}$V nuclear magnetic resonance (NMR) measurements. The crystal structure (trigonal, $P\bar{3}$) features a frustrated capped-kagome lattice of the magnetic Cu$^{2+}$ ions. Magnetic susceptibi…
▽ More
The structural and magnetic properties of a spin-$\frac{1}{2}$ averievite (CsBr)Cu$_5$V$_2$O$_{10}$ are investigated by means of temperature-dependent x-ray diffraction, magnetization, heat capacity, and $^{51}$V nuclear magnetic resonance (NMR) measurements. The crystal structure (trigonal, $P\bar{3}$) features a frustrated capped-kagome lattice of the magnetic Cu$^{2+}$ ions. Magnetic susceptibility analysis indicates a large Curie-Weiss temperature of $θ_{\rm CW} \simeq-175$ K. Heat capacity signals the onset of a magnetic long-range-order (LRO) at $T_{\rm N}\simeq 21.5$ K at zero magnetic field due to the presence of significant inter-planer coupling in this system. The magnetic LRO below 27 K is further evident from the drastic change in the $^{51}$V NMR signal intensity and rapid enhancement in the $^{51}$V spin-lattice relaxation rate in a magnetic field of 6.3 T. The frustration index $f=|θ_{\rm CW}|/T_{\rm N} \simeq 8$ ascertains strong magnetic frustration in this compound. From the high-temperature value of the $^{51}$V NMR spin-lattice relaxation rate, the leading antiferromagnetic exchange interaction between the Cu$^{2+}$ ions is calculated to be $J/k_{\rm B}\simeq 136$ K.
△ Less
Submitted 9 November, 2024;
originally announced November 2024.
-
Astrophysical constraints on neutron star $f$-modes with a nonparametric equation of state representation
Authors:
Sailesh Ranjan Mohanty,
Utkarsh Mali,
H. C. Das,
Bharat Kumar,
Philippe Landry
Abstract:
We constrain the fundamental-mode ($f$-mode) oscillation frequencies of nonrotating neutron stars using a phenomenological Gaussian process model for the unknown dense-matter equation of state conditioned on a suite of gravitational-wave, radio and X-ray observations. We infer the quadrupolar $f$-mode frequency preferred by the astronomical data as a function of neutron star mass, with error estim…
▽ More
We constrain the fundamental-mode ($f$-mode) oscillation frequencies of nonrotating neutron stars using a phenomenological Gaussian process model for the unknown dense-matter equation of state conditioned on a suite of gravitational-wave, radio and X-ray observations. We infer the quadrupolar $f$-mode frequency preferred by the astronomical data as a function of neutron star mass, with error estimates that quantify the impact of equation of state uncertainty, and compare it to the contact frequency for inspiralling neutron-star binaries, finding that resonance with the orbital frequency can be achieved for the coalescences with the most unequal mass ratio. For an optimally configured binary neutron star merger, we estimate the gravitational waveform's tidal phasing due to $f$-mode dynamical tides as $7^{+2}_{-3}$ rad at merger. We assess prospects for distinguishing $f$-mode dynamical tides with current and future-generation gravitational-wave observatories.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
First constraints on general neutrino interactions based on KATRIN data
Authors:
M. Aker,
D. Batzler,
A. Beglarian,
J. Beisenkötter,
M. Biassoni,
B. Bieringer,
Y. Biondi,
F. Block,
B. Bornschein,
L. Bornschein,
M. Böttcher,
M. Carminati,
A. Chatrabhuti,
S. Chilingaryan,
B. A. Daniel,
M. Descher,
D. Díaz Barrero,
P. J. Doe,
O. Dragoun,
G. Drexlin,
F. Edzards,
K. Eitel,
E. Ellinger,
R. Engel,
S. Enomoto
, et al. (108 additional authors not shown)
Abstract:
The precision measurement of the tritium $β$-decay spectrum performed by the KATRIN experiment provides a unique way to search for general neutrino interactions (GNI). All theoretical allowed GNI terms involving neutrinos are incorporated into a low-energy effective field theory, and can be identified by specific signatures in the measured tritium $β$-spectrum. In this paper an effective descripti…
▽ More
The precision measurement of the tritium $β$-decay spectrum performed by the KATRIN experiment provides a unique way to search for general neutrino interactions (GNI). All theoretical allowed GNI terms involving neutrinos are incorporated into a low-energy effective field theory, and can be identified by specific signatures in the measured tritium $β$-spectrum. In this paper an effective description of the impact of GNI on the $β$-spectrum is formulated and the first constraints on the effective GNI parameters are derived based on the 4 million electrons collected in the second measurement campaign of KATRIN in 2019. In addition, constraints on selected types of interactions are investigated, thereby exploring the potential of KATRIN to search for more specific new physics cases, including a right-handed W boson, a charged Higgs or leptoquarks.
△ Less
Submitted 12 November, 2024; v1 submitted 14 October, 2024;
originally announced October 2024.
-
QPUF 2.0: Exploring Quantum Physical Unclonable Functions for Security-by-Design of Energy Cyber-Physical Systems
Authors:
Venkata K. V. V. Bathalapalli,
Saraju P. Mohanty,
Chenyun Pan,
Elias Kougianos
Abstract:
Sustainable advancement is being made to improve the efficiency of the generation, transmission, and distribution of renewable energy resources, as well as managing them to ensure the reliable operation of the smart grid. Supervisory control and data acquisition (SCADA) enables sustainable management of grid communication flow through its real-time data sensing, processing, and actuation capabilit…
▽ More
Sustainable advancement is being made to improve the efficiency of the generation, transmission, and distribution of renewable energy resources, as well as managing them to ensure the reliable operation of the smart grid. Supervisory control and data acquisition (SCADA) enables sustainable management of grid communication flow through its real-time data sensing, processing, and actuation capabilities at various levels in the energy distribution framework. The security vulnerabilities associated with the SCADA-enabled grid infrastructure and management could jeopardize the smart grid operations. This work explores the potential of Quantum Physical Unclonable Functions (QPUF) for the security, privacy, and reliability of the smart grid's energy transmission and distribution framework.
Quantum computing has emerged as a formidable security solution for high-performance computing applications through its probabilistic nature of information processing. This work has a quantum hardware-assisted security mechanism based on intrinsic properties of quantum hardware driven by quantum mechanics to provide tamper-proof security for quantum computing driven smart grid infrastructure. This work introduces a novel QPUF architecture using quantum logic gates based on quantum decoherence, entanglement, and superposition. This generates a unique bitstream for each quantum device as a fingerprint. The proposed QPUF design is evaluated on IBM and Google quantum systems and simulators. The deployment on the IBM quantum simulator (ibmq_qasm_simulator) has achieved an average Hamming distance of 50.07%, 51% randomness, and 86% of the keys showing 100% reliability.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Easydiagnos: a framework for accurate feature selection for automatic diagnosis in smart healthcare
Authors:
Prasenjit Maji,
Amit Kumar Mondal,
Hemanta Kumar Mondal,
Saraju P. Mohanty
Abstract:
The rapid advancements in artificial intelligence (AI) have revolutionized smart healthcare, driving innovations in wearable technologies, continuous monitoring devices, and intelligent diagnostic systems. However, security, explainability, robustness, and performance optimization challenges remain critical barriers to widespread adoption in clinical environments. This research presents an innovat…
▽ More
The rapid advancements in artificial intelligence (AI) have revolutionized smart healthcare, driving innovations in wearable technologies, continuous monitoring devices, and intelligent diagnostic systems. However, security, explainability, robustness, and performance optimization challenges remain critical barriers to widespread adoption in clinical environments. This research presents an innovative algorithmic method using the Adaptive Feature Evaluator (AFE) algorithm to improve feature selection in healthcare datasets and overcome problems. AFE integrating Genetic Algorithms (GA), Explainable Artificial Intelligence (XAI), and Permutation Combination Techniques (PCT), the algorithm optimizes Clinical Decision Support Systems (CDSS), thereby enhancing predictive accuracy and interpretability. The proposed method is validated across three diverse healthcare datasets using six distinct machine learning algorithms, demonstrating its robustness and superiority over conventional feature selection techniques. The results underscore the transformative potential of AFE in smart healthcare, enabling personalized and transparent patient care. Notably, the AFE algorithm, when combined with a Multi-layer Perceptron (MLP), achieved an accuracy of up to 98.5%, highlighting its capability to improve clinical decision-making processes in real-world healthcare applications.
△ Less
Submitted 30 September, 2024;
originally announced October 2024.
-
NUTRIVISION: A System for Automatic Diet Management in Smart Healthcare
Authors:
Madhumita Veeramreddy,
Ashok Kumar Pradhan,
Swetha Ghanta,
Laavanya Rachakonda,
Saraju P Mohanty
Abstract:
Maintaining health and fitness through a balanced diet is essential for preventing non communicable diseases such as heart disease, diabetes, and cancer. NutriVision combines smart healthcare with computer vision and machine learning to address the challenges of nutrition and dietary management. This paper introduces a novel system that can identify food items, estimate quantities, and provide com…
▽ More
Maintaining health and fitness through a balanced diet is essential for preventing non communicable diseases such as heart disease, diabetes, and cancer. NutriVision combines smart healthcare with computer vision and machine learning to address the challenges of nutrition and dietary management. This paper introduces a novel system that can identify food items, estimate quantities, and provide comprehensive nutritional information. NutriVision employs the Faster Region based Convolutional Neural Network, a deep learning algorithm that improves object detection by generating region proposals and then classifying those regions, making it highly effective for accurate and fast food identification even in complex and disorganized meal settings. Through smartphone based image capture, NutriVision delivers instant nutritional data, including macronutrient breakdown, calorie count, and micronutrient details. One of the standout features of NutriVision is its personalized nutritional analysis and diet recommendations, which are tailored to each user's dietary preferences, nutritional needs, and health history. By providing customized advice, NutriVision helps users achieve specific health and fitness goals, such as managing dietary restrictions or controlling weight. In addition to offering precise food detection and nutritional assessment, NutriVision supports smarter dietary decisions by integrating user data with recommendations that promote a balanced, healthful diet. This system presents a practical and advanced solution for nutrition management and has the potential to significantly influence how people approach their dietary choices, promoting healthier eating habits and overall well being. This paper discusses the design, performance evaluation, and prospective applications of the NutriVision system.
△ Less
Submitted 30 September, 2024;
originally announced September 2024.
-
Machine learning approaches for automatic defect detection in photovoltaic systems
Authors:
Swayam Rajat Mohanty,
Moin Uddin Maruf,
Vaibhav Singh,
Zeeshan Ahmad
Abstract:
Solar photovoltaic (PV) modules are prone to damage during manufacturing, installation and operation which reduces their power conversion efficiency. This diminishes their positive environmental impact over the lifecycle. Continuous monitoring of PV modules during operation via unmanned aerial vehicles is essential to ensure that defective panels are promptly replaced or repaired to maintain high…
▽ More
Solar photovoltaic (PV) modules are prone to damage during manufacturing, installation and operation which reduces their power conversion efficiency. This diminishes their positive environmental impact over the lifecycle. Continuous monitoring of PV modules during operation via unmanned aerial vehicles is essential to ensure that defective panels are promptly replaced or repaired to maintain high power conversion efficiencies. Computer vision provides an automatic, non-destructive and cost-effective tool for monitoring defects in large-scale PV plants. We review the current landscape of deep learning-based computer vision techniques used for detecting defects in solar modules. We compare and evaluate the existing approaches at different levels, namely the type of images used, data collection and processing method, deep learning architectures employed, and model interpretability. Most approaches use convolutional neural networks together with data augmentation or generative adversarial network-based techniques. We evaluate the deep learning approaches by performing interpretability analysis on classification tasks. This analysis reveals that the model focuses on the darker regions of the image to perform the classification. We find clear gaps in the existing approaches while also laying out the groundwork for mitigating these challenges when building new models. We conclude with the relevant research gaps that need to be addressed and approaches for progress in this field: integrating geometric deep learning with existing approaches for building more robust and reliable models, leveraging physics-based neural networks that combine domain expertise of physical laws to build more domain-aware deep learning models, and incorporating interpretability as a factor for building models that can be trusted. The review points towards a clear roadmap for making this technology commercially relevant.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
Generalizability of Graph Neural Network Force Fields for Predicting Solid-State Properties
Authors:
Shaswat Mohanty,
Yifan Wang,
Wei Cai
Abstract:
Machine-learned force fields (MLFFs) promise to offer a computationally efficient alternative to ab initio simulations for complex molecular systems. However, ensuring their generalizability beyond training data is crucial for their wide application in studying solid materials. This work investigates the ability of a graph neural network (GNN)-based MLFF, trained on Lennard-Jones Argon, to describ…
▽ More
Machine-learned force fields (MLFFs) promise to offer a computationally efficient alternative to ab initio simulations for complex molecular systems. However, ensuring their generalizability beyond training data is crucial for their wide application in studying solid materials. This work investigates the ability of a graph neural network (GNN)-based MLFF, trained on Lennard-Jones Argon, to describe solid-state phenomena not explicitly included during training. We assess the MLFF's performance in predicting phonon density of states (PDOS) for a perfect face-centered cubic (FCC) crystal structure at both zero and finite temperatures. Additionally, we evaluate vacancy migration rates and energy barriers in an imperfect crystal using direct molecular dynamics (MD) simulations and the string method. Notably, vacancy configurations were absent from the training data. Our results demonstrate the MLFF's capability to capture essential solid-state properties with good agreement to reference data, even for unseen configurations. We further discuss data engineering strategies to enhance the generalizability of MLFFs. The proposed set of benchmark tests and workflow for evaluating MLFF performance in describing perfect and imperfect crystals pave the way for reliable application of MLFFs in studying complex solid-state materials.
△ Less
Submitted 21 December, 2024; v1 submitted 15 September, 2024;
originally announced September 2024.
-
On the Relationship between Truth and Political Bias in Language Models
Authors:
Suyash Fulay,
William Brannon,
Shrestha Mohanty,
Cassandra Overney,
Elinor Poole-Dayan,
Deb Roy,
Jad Kabbara
Abstract:
Language model alignment research often attempts to ensure that models are not only helpful and harmless, but also truthful and unbiased. However, optimizing these objectives simultaneously can obscure how improving one aspect might impact the others. In this work, we focus on analyzing the relationship between two concepts essential in both language model alignment and political science: truthful…
▽ More
Language model alignment research often attempts to ensure that models are not only helpful and harmless, but also truthful and unbiased. However, optimizing these objectives simultaneously can obscure how improving one aspect might impact the others. In this work, we focus on analyzing the relationship between two concepts essential in both language model alignment and political science: truthfulness and political bias. We train reward models on various popular truthfulness datasets and subsequently evaluate their political bias. Our findings reveal that optimizing reward models for truthfulness on these datasets tends to result in a left-leaning political bias. We also find that existing open-source reward models (i.e., those trained on standard human preference datasets) already show a similar bias and that the bias is larger for larger models. These results raise important questions about the datasets used to represent truthfulness, potential limitations of aligning models to be both truthful and politically unbiased, and what language models capture about the relationship between truth and politics.
△ Less
Submitted 11 October, 2024; v1 submitted 8 September, 2024;
originally announced September 2024.
-
Gravitational radiation from binary systems in Unimodular gravity
Authors:
Indranil Chakraborty,
Soumya Jana,
Subhendra Mohanty
Abstract:
Unimodular gravity (UG) is classically considered identical to General Relativity (GR). However, due to restricted diffeomorphism symmetry, the Bianchi identites do not lead to the conservation of energy-momentum tensor. Thus, the conservation of energy-momentum tensor needs to be separately assumed in order to reconcile with GR. Relaxing this assumption, one finds that the conservation violation…
▽ More
Unimodular gravity (UG) is classically considered identical to General Relativity (GR). However, due to restricted diffeomorphism symmetry, the Bianchi identites do not lead to the conservation of energy-momentum tensor. Thus, the conservation of energy-momentum tensor needs to be separately assumed in order to reconcile with GR. Relaxing this assumption, one finds that the conservation violation can lead to differences with GR, which can be subsequently examined in astrophysical and cosmological scenarios. To this end, we examine the predictions of UG in the context of binary systems emitting gravitational radiation. Primarily, we show how the field equations involve a diffusion function which quantifies the measure of non-conservation. Due to this violation, the dispersion relation is modified. Incorporating these changes, we provide an expression for the energy loss by the binaries, which reduces to Peters-Mathews result in the GR limit. Using binary pulsar data, we constrain the theory parameter $ζ$ (which signifies non-conservation) by determining the rate of orbital decay. The strongest constrain on $ζ$ comes out to be $\vert ζ\vert \leq 5\times 10^{-4}$ which is better by an order of magnitude than an existing equivalent constraint coming from the tidal deformability of the neutron stars.
△ Less
Submitted 17 February, 2025; v1 submitted 4 September, 2024;
originally announced September 2024.
-
Measurement of the electric potential and the magnetic field in the shifted analysing plane of the KATRIN experiment
Authors:
M. Aker,
D. Batzler,
A. Beglarian,
J. Behrens,
J. Beisenkötter,
M. Biassoni,
B. Bieringer,
Y. Biondi,
F. Block,
S. Bobien,
M. Böttcher,
B. Bornschein,
L. Bornschein,
T. S. Caldwell,
M. Carminati,
A. Chatrabhuti,
S. Chilingaryan,
B. A. Daniel,
K. Debowski,
M. Descher,
D. Díaz Barrero,
P. J. Doe,
O. Dragoun,
G. Drexlin,
F. Edzards
, et al. (113 additional authors not shown)
Abstract:
The projected sensitivity of the effective electron neutrino-mass measurement with the KATRIN experiment is below 0.3 eV (90 % CL) after five years of data acquisition. The sensitivity is affected by the increased rate of the background electrons from KATRIN's main spectrometer. A special shifted-analysing-plane (SAP) configuration was developed to reduce this background by a factor of two. The co…
▽ More
The projected sensitivity of the effective electron neutrino-mass measurement with the KATRIN experiment is below 0.3 eV (90 % CL) after five years of data acquisition. The sensitivity is affected by the increased rate of the background electrons from KATRIN's main spectrometer. A special shifted-analysing-plane (SAP) configuration was developed to reduce this background by a factor of two. The complex layout of electromagnetic fields in the SAP configuration requires a robust method of estimating these fields. We present in this paper a dedicated calibration measurement of the fields using conversion electrons of gaseous $^\mathrm{83m}$Kr, which enables the neutrino-mass measurements in the SAP configuration.
△ Less
Submitted 9 August, 2024;
originally announced August 2024.
-
Consent in Crisis: The Rapid Decline of the AI Data Commons
Authors:
Shayne Longpre,
Robert Mahari,
Ariel Lee,
Campbell Lund,
Hamidah Oderinwale,
William Brannon,
Nayan Saxena,
Naana Obeng-Marnu,
Tobin South,
Cole Hunter,
Kevin Klyman,
Christopher Klamm,
Hailey Schoelkopf,
Nikhil Singh,
Manuel Cherep,
Ahmad Anis,
An Dinh,
Caroline Chitongo,
Da Yin,
Damien Sileo,
Deividas Mataciunas,
Diganta Misra,
Emad Alghamdi,
Enrico Shippole,
Jianguo Zhang
, et al. (24 additional authors not shown)
Abstract:
General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge, we conduct the first, large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training corpora. Our audit of 14,000 web domains provides an expansive view of crawlable web data and how co…
▽ More
General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge, we conduct the first, large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training corpora. Our audit of 14,000 web domains provides an expansive view of crawlable web data and how codified data use preferences are changing over time. We observe a proliferation of AI-specific clauses to limit use, acute differences in restrictions on AI developers, as well as general inconsistencies between websites' expressed intentions in their Terms of Service and their robots.txt. We diagnose these as symptoms of ineffective web protocols, not designed to cope with the widespread re-purposing of the internet for AI. Our longitudinal analyses show that in a single year (2023-2024) there has been a rapid crescendo of data restrictions from web sources, rendering ~5%+ of all tokens in C4, or 28%+ of the most actively maintained, critical sources in C4, fully restricted from use. For Terms of Service crawling restrictions, a full 45% of C4 is now restricted. If respected or enforced, these restrictions are rapidly biasing the diversity, freshness, and scaling laws for general-purpose AI systems. We hope to illustrate the emerging crises in data consent, for both developers and creators. The foreclosure of much of the open web will impact not only commercial AI, but also non-commercial AI and academic research.
△ Less
Submitted 24 July, 2024; v1 submitted 20 July, 2024;
originally announced July 2024.
-
Hierarchical search method for gravitational waves from stellar-mass binary black holes in noisy space-based detector data
Authors:
Yao Fu,
Yan Wang,
Soumya D. Mohanty
Abstract:
Future space-based laser interferometric detectors, such as LISA, will be able to detect gravitational waves (GWs) generated during the inspiral phase of stellar-mass binary black holes (SmBBHs). The detection and characterization of GWs from SmBBHs poses a formidable data analysis challenge, arising from the large number of wave cycles that make the search extremely sensitive to mismatches in sig…
▽ More
Future space-based laser interferometric detectors, such as LISA, will be able to detect gravitational waves (GWs) generated during the inspiral phase of stellar-mass binary black holes (SmBBHs). The detection and characterization of GWs from SmBBHs poses a formidable data analysis challenge, arising from the large number of wave cycles that make the search extremely sensitive to mismatches in signal and template parameters in a likelihood-based approach. This makes the search for the maximum of the likelihood function over the signal parameter space an extremely difficult task. We present a data analysis method that addresses this problem using both algorithmic innovations and hardware acceleration driven by GPUs. The method follows a hierarchical approach in which a semi-coherent $\mathcal{F}$-statistic is computed with different numbers of frequency domain partitions at different stages, with multiple particle swarm optimization (PSO) runs used in each stage for global optimization. An important step in the method is the judicious partitioning of the parameter space at each stage to improve the convergence probability of PSO and avoid premature convergence to noise-induced secondary maxima. The hierarchy of stages confines the semi-coherent searches to progressively smaller parameter ranges, with the final stage performing a search for the global maximum of the fully-coherent $\mathcal{F}$-statistic. We test our method on 2.5 years of a single LISA TDI combination and find that for an injected SmBBH signal with a SNR between $\approx 11$ and $\approx 14$, the method can estimate (i) the chirp mass with a relative error of $\lesssim 0.01\%$, (ii) the time of coalescence within $\approx 100$ sec, (iii) the sky location within $\approx 0.2$ ${\rm deg}^2$, and (iv) orbital eccentricity at a fiducial signal frequency of 10 mHz with a relative error of $\lesssim 1\%$. (abr.)
△ Less
Submitted 3 February, 2025; v1 submitted 15 July, 2024;
originally announced July 2024.
-
Evidence of quantum spin liquid state in a Cu$^{2+}$-based $S = 1/2$ triangular lattice antiferromagnet
Authors:
K. Bhattacharya,
S. Mohanty,
A. D. Hillier,
M. T. F. Telling,
R. Nath,
M. Majumder
Abstract:
The layered triangular lattice owing to $1:2$ order of $B$ and $B'$ sites in the triple perovskite $A_3 B B'_2$O$_9$ family provides an enticing domain for exploring the complex phenomena of quantum spin liquids (QSLs). We report a comprehensive investigation of the ground state properties of Sr$_3$CuTa$_2$O$_9$ that belongs to the above family, by employing magnetization, specific heat, and muon…
▽ More
The layered triangular lattice owing to $1:2$ order of $B$ and $B'$ sites in the triple perovskite $A_3 B B'_2$O$_9$ family provides an enticing domain for exploring the complex phenomena of quantum spin liquids (QSLs). We report a comprehensive investigation of the ground state properties of Sr$_3$CuTa$_2$O$_9$ that belongs to the above family, by employing magnetization, specific heat, and muon spin relaxation ($μ$SR) experiments down to the lowest temperature of 0.1~K. Analysis of the magnetic susceptibility indicates that the spin-lattice is a nearly isotropic $S = 1/2$ triangular lattice. We illustrate the observation of a gapless QSL, in which conventional spin ordering or freezing effects are absent, even at temperatures more than two orders of magnitude smaller than the exchange energy ($J_{\rm CW}/k_{\rm B} \simeq -5.04$~K). Magnetic specific heat in zero-field follows a power law, $C_{\rm m} \sim T^η$, below 1.2~K with $η\approx 2/3$, which is consistent with a theoretical proposal of the presence of spinon Fermi surface. Below 1.2~K, the $μ$SR relaxation rate shows no temperature dependence, suggesting persistent spin dynamics as expected for a QSL state. Delving deeper, we also analyze longitudinal field $μ$SR spectra revealing strong dynamical correlations in the spin-disordered ground state. All of these highlight the characteristics of spin entanglement in the QSL state.
△ Less
Submitted 14 July, 2024;
originally announced July 2024.
-
IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Authors:
Shrestha Mohanty,
Negar Arabzadeh,
Andrea Tupini,
Yuxuan Sun,
Alexey Skrynnik,
Artem Zholus,
Marc-Alexandre Côté,
Julia Kiseleva
Abstract:
Seamless interaction between AI agents and humans using natural language remains a key goal in AI research. This paper addresses the challenges of developing interactive agents capable of understanding and executing grounded natural language instructions through the IGLU competition at NeurIPS. Despite advancements, challenges such as a scarcity of appropriate datasets and the need for effective e…
▽ More
Seamless interaction between AI agents and humans using natural language remains a key goal in AI research. This paper addresses the challenges of developing interactive agents capable of understanding and executing grounded natural language instructions through the IGLU competition at NeurIPS. Despite advancements, challenges such as a scarcity of appropriate datasets and the need for effective evaluation platforms persist. We introduce a scalable data collection tool for gathering interactive grounded language instructions within a Minecraft-like environment, resulting in a Multi-Modal dataset with around 9,000 utterances and over 1,000 clarification questions. Additionally, we present a Human-in-the-Loop interactive evaluation platform for qualitative analysis and comparison of agent performance through multi-turn communication with human annotators. We offer to the community these assets referred to as IDAT (IGLU Dataset And Toolkit) which aim to advance the development of intelligent, interactive AI agents and provide essential resources for further research.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
SubLock: Sub-Circuit Replacement based Input Dependent Key-based Logic Locking for Robust IP Protection
Authors:
Vijaypal Singh Rathor,
Munesh Singh,
Kshira Sagar Sahoo,
Saraju P. Mohanty
Abstract:
Intellectual Property (IP) piracy, overbuilding, reverse engineering, and hardware Trojan are serious security concerns during integrated circuit (IC) development. Logic locking has proven to be a solid defence for mitigating these threats. The existing logic locking techniques are vulnerable to SAT-based attacks. However, several SAT-resistant logic locking methods are reported; they require sign…
▽ More
Intellectual Property (IP) piracy, overbuilding, reverse engineering, and hardware Trojan are serious security concerns during integrated circuit (IC) development. Logic locking has proven to be a solid defence for mitigating these threats. The existing logic locking techniques are vulnerable to SAT-based attacks. However, several SAT-resistant logic locking methods are reported; they require significant overhead. This paper proposes a novel input dependent key-based logic locking (IDKLL) that effectively prevents SAT-based attacks with low overhead. We first introduce a novel idea of IDKLL, where a design is locked such that it functions correctly for all input patterns only when their corresponding valid key sequences are applied. In contrast to conventional logic locking, the proposed IDKLL method uses multiple key sequences (instead of a single key sequence) as a valid key that provides correct functionality for all inputs. Further, we propose a sub-circuit replacement based IDKLL approach called SubLock that locks the design by replacing the original sub-circuitry with the corresponding IDKLL based locked circuit to prevent SAT attack with low overhead. The experimental evaluation on ISCAS benchmarks shows that the proposed SubLock mitigates the SAT attack with high security and reduced overhead over the well-known existing methods.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Direct neutrino-mass measurement based on 259 days of KATRIN data
Authors:
M. Aker,
D. Batzler,
A. Beglarian,
J. Behrens,
J. Beisenkötter,
M. Biassoni,
B. Bieringer,
Y. Biondi,
F. Block,
S. Bobien,
M. Böttcher,
B. Bornschein,
L. Bornschein,
T. S. Caldwell,
M. Carminati,
A. Chatrabhuti,
S. Chilingaryan,
B. A. Daniel,
K. Debowski,
M. Descher,
D. Díaz Barrero,
P. J. Doe,
O. Dragoun,
G. Drexlin,
F. Edzards
, et al. (124 additional authors not shown)
Abstract:
The fact that neutrinos carry a non-vanishing rest mass is evidence of physics beyond the Standard Model of elementary particles. Their absolute mass bears important relevance from particle physics to cosmology. In this work, we report on the search for the effective electron antineutrino mass with the KATRIN experiment. KATRIN performs precision spectroscopy of the tritium $β$-decay close to the…
▽ More
The fact that neutrinos carry a non-vanishing rest mass is evidence of physics beyond the Standard Model of elementary particles. Their absolute mass bears important relevance from particle physics to cosmology. In this work, we report on the search for the effective electron antineutrino mass with the KATRIN experiment. KATRIN performs precision spectroscopy of the tritium $β$-decay close to the kinematic endpoint. Based on the first five neutrino-mass measurement campaigns, we derive a best-fit value of $m_ν^{2} = {-0.14^{+0.13}_{-0.15}}~\mathrm{eV^2}$, resulting in an upper limit of $m_ν< {0.45}~\mathrm{eV}$ at 90 % confidence level. With six times the statistics of previous data sets, amounting to 36 million electrons collected in 259 measurement days, a substantial reduction of the background level and improved systematic uncertainties, this result tightens KATRIN's previous bound by a factor of almost two.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Designing Reconfigurable Interconnection Network of Heterogeneous Chiplets Using Kalman Filter
Authors:
Siamak Biglari,
Ruixiao Huang,
Hui Zhao,
Saraju Mohanty
Abstract:
Heterogeneous chiplets have been proposed for accelerating high-performance computing tasks. Integrated inside one package, CPU and GPU chiplets can share a common interconnection network that can be implemented through the interposer. However, CPU and GPU applications have very different traffic patterns in general. Without effective management of the network resource, some chiplets can suffer si…
▽ More
Heterogeneous chiplets have been proposed for accelerating high-performance computing tasks. Integrated inside one package, CPU and GPU chiplets can share a common interconnection network that can be implemented through the interposer. However, CPU and GPU applications have very different traffic patterns in general. Without effective management of the network resource, some chiplets can suffer significant performance degradation because the network bandwidth is taken away by communication-intensive applications. Therefore, techniques need to be developed to effectively manage the shared network resources. In a chiplet-based system, resource management needs to not only react in real-time but also be cost-efficient. In this work, we propose a reconfigurable network architecture, leveraging Kalman Filter to make accurate predictions on network resources needed by the applications and then adaptively change the resource allocation. Using our design, the network bandwidth can be fairly allocated to avoid starvation or performance degradation. Our evaluation results show that the proposed reconfigurable interconnection network can dynamically react to the changes in traffic demand of the chiplets and improve the system performance with low cost and design complexity.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
Locally Stationary Distributions: A Framework for Analyzing Slow-Mixing Markov Chains
Authors:
Kuikui Liu,
Sidhanth Mohanty,
Prasad Raghavendra,
Amit Rajaraman,
David X. Wu
Abstract:
Many natural Markov chains fail to mix to their stationary distribution in polynomially many steps. Often, this slow mixing is inevitable since it is computationally intractable to sample from their stationary measure.
Nevertheless, Markov chains can be shown to always converge quickly to measures that are locally stationary, i.e., measures that don't change over a small number of steps. These l…
▽ More
Many natural Markov chains fail to mix to their stationary distribution in polynomially many steps. Often, this slow mixing is inevitable since it is computationally intractable to sample from their stationary measure.
Nevertheless, Markov chains can be shown to always converge quickly to measures that are locally stationary, i.e., measures that don't change over a small number of steps. These locally stationary measures are analogous to local minima in continuous optimization, while stationary measures correspond to global minima.
While locally stationary measures can be statistically far from stationary measures, do they enjoy provable theoretical guarantees that have algorithmic implications? We study this question in this work and demonstrate three algorithmic applications of locally stationary measures:
1. We show that Glauber dynamics on the hardcore model can be used to find independent sets of size $Ω\left(\frac{\log d}{d} \cdot n\right)$ in triangle-free graphs of degree at most $d$.
2. Let $W$ be a symmetric real matrix with bounded spectral diameter and $v$ be a unit vector. Given the matrix $M = λvv^\top + W$ with a planted rank-one spike along vector $v$, for sufficiently large constant $λ$, Glauber dynamics on the Ising model defined by $M$ samples vectors $x \in \{\pm 1\}^n$ that have constant correlation with the vector $v$.
3. Let $M = A_{\mathbf{G}} - \frac{d}{n}\mathbf{1}\mathbf{1}^\top$ be a centered version of the adjacency matrix where the graph $\mathbf{G}$ is drawn from a sparse 2-community stochastic block model. We show that for sufficiently large constant signal-to-noise ratio, Glauber dynamics on the Ising model defined by $M$ samples vectors $x \in \{\pm 1\}^n$ that have constant correlation with the hidden community vector $\mathbfσ$.
△ Less
Submitted 8 April, 2025; v1 submitted 31 May, 2024;
originally announced May 2024.
-
First high peak and average power single-pass THz FEL based on high brightness photoinjector
Authors:
M. Krasilnikov,
Z. Aboulbanine,
G. Adhikari,
N. Aftab,
A. Asoyan,
P. Boonpornprasert,
H. Davtyan,
G. Georgiev,
J. Good,
A. Grebinyk,
M. Gross,
A. Hoffmann,
E. Kongmon,
X. -K. Li,
A. Lueangaramwong,
D. Melkumyan,
S. Mohanty,
R. Niemczyk,
A. Oppelt,
H. Qian,
C. Richard,
F. Stephan,
G. Vashchenko,
T. Weilbach,
X. Zhang
, et al. (9 additional authors not shown)
Abstract:
Advanced experiments using THz pump and X-ray probe pulses at modern free-electron lasers (FELs) like the European X-ray FEL require a frequency-tunable, high-power, narrow-band THz source maintaining the repetition rate and pulse structure of the X-ray pulses. This paper reports the first results from a THz source, that is based on a single-pass high-gain THz FEL operating with a central waveleng…
▽ More
Advanced experiments using THz pump and X-ray probe pulses at modern free-electron lasers (FELs) like the European X-ray FEL require a frequency-tunable, high-power, narrow-band THz source maintaining the repetition rate and pulse structure of the X-ray pulses. This paper reports the first results from a THz source, that is based on a single-pass high-gain THz FEL operating with a central wavelength of 100 micrometers. The THz FEL prototype is currently in operation at the Photo Injector Test facility at DESY in Zeuthen (PITZ) and uses the same type of electron source as the European XFEL photo injector. A self-amplified spontaneous emission (SASE) FEL was envisioned as the main mechanism for generating the THz pulses. Although the THz FEL at PITZ is supposed to use the same mechanism as at X-ray facilities, it cannot be considered as a simple scaling of the radiation wavelength because there is a large difference in the number of electrons per radiation wavelength, which is five orders of magnitude higher for the THz case. The bunching factor arising from the electron beam current profile contributes strongly to the initial spontaneous emission starting the FEL process. Proof-of-principle experiments were done at PITZ using an LCLS-I undulator to generate the first high-power, high-repetition-rate single-pass THz FEL radiation. Electron bunches with a beam energy of ~17 MeV and a bunch charge of up to several nC are used to generate THz pulses with a pulse energy of several tens of microjoules. For example, for an electron beam with a charge of ~2.4 nC, more than 100 microjoules were generated at a central wavelength of 100 micrometers. The narrowband spectrum was also demonstrated by spectral measurements. These proof-of-principle experiments pave the way for a tunable, high-repetition-rate THz source providing pulses with energies in the millijoule range.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.