-
Maximizing solubility in rock salt high-entropy oxides
Authors:
Matthew Furst,
Joseph Petruska,
Dhiya Srikanth,
Jacob T. Sivak,
Susan B. Sinnott,
Christina M. Rost,
Jon-Paul Maria,
Saeed S. I. Almishal
Abstract:
To explore and quantitatively map the cation-size mismatch solubility limits in high-entropy oxides (HEOs), we report on Ca$^{2+}$ substitution in prototypical MgCoNiCuZnO, because, while isovalent, Ca$^{2+}$ is 38% larger than its partners' average ionic radii. Using the thermodynamics-grounded bond-length distribution descriptor, we identify Ca$^{2+}$-Cu$^{2+}$ interactions as the primary prospe…
▽ More
To explore and quantitatively map the cation-size mismatch solubility limits in high-entropy oxides (HEOs), we report on Ca$^{2+}$ substitution in prototypical MgCoNiCuZnO, because, while isovalent, Ca$^{2+}$ is 38% larger than its partners' average ionic radii. Using the thermodynamics-grounded bond-length distribution descriptor, we identify Ca$^{2+}$-Cu$^{2+}$ interactions as the primary prospective lattice destabilizer. Bulk synthesis confirms only 4% Ca solubility with Cu at 950$^o$C, modestly rising to 5% after Cu removal at 1150$^o$C. We then employ far-from-equilibrium pulsed-laser deposition to investigate metastable solubility: epitaxial films incorporate 10% Ca with Cu and a full 20% Ca without, doubling and quadrupling the respective bulk limits. This Ca uptake additionally enables deterministic lattice-parameter control via Ca concentration. Overall, our results demonstrate both the extended solubility possible in HEO systems, particularly when accessing metastable states through quenching from high-energy plasma, and that the specific constellation of solid-solvent cations can be rationally engineered to minimize bond-length distributions when largely misfit cations are added, thus expanding the accessible compositional space.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
Structure-Dynamics Correlation and Its Link to Fragility and Dynamic Heterogeneity
Authors:
Mohit Sharma,
Srikanth Sastry,
Sarika Maitra Bhattacharyya
Abstract:
Understanding the connection between structure, dynamics, and fragility (the rate at which relaxation time grow with decreasing temperature) is central to unraveling the glass transition. Fragility is often linked to dynamic heterogeneity, and thus it is commonly assumed that if structure influences dynamics, more fragile systems should exhibit stronger structure dynamics correlations. In this stu…
▽ More
Understanding the connection between structure, dynamics, and fragility (the rate at which relaxation time grow with decreasing temperature) is central to unraveling the glass transition. Fragility is often linked to dynamic heterogeneity, and thus it is commonly assumed that if structure influences dynamics, more fragile systems should exhibit stronger structure dynamics correlations. In this study, we test the generality of this assumption using three model systems: Lennard-Jones (LJ) and Weeks--Chandler--Andersen, where fragility is tuned via density, and a modified LJ (q, p) system, where potential softness is changed to vary fragility. We employ a structural order parameter derived from the mean field caging potential and analyze energy barriers at both macroscopic and microscopic levels. While the macroscopic slope of the energy barrier, suitably defined, correlates with fragility, no consistent correlation is found for the microscopic energy barriers. Instead, the latter shows a strong correlation with an independently computed structure dynamics measure obtained from isoconfigurational ensemble. Surprisingly, the two systems with the highest structure dynamics correlation, LJ at rho = 1.1 and the (8, 5) model, are respectively the least and most fragile within their classes. These systems exhibit broad mobility distributions, bimodal displacement profiles, and high non-Gaussian parameters, all indicative of dynamic heterogeneity. However, their dynamic susceptibilities remain low, suggesting a decoupling between spatial correlation and temporal heterogeneity. Both systems lie in the enthalpy-dominated regime and are near the spinodal, suggesting mechanical instability as a source of heterogeneity. These findings challenge the conventional linkage among fragility, heterogeneity, and structure-dynamics correlation.
△ Less
Submitted 18 June, 2025; v1 submitted 15 June, 2025;
originally announced June 2025.
-
The Amazon Nova Family of Models: Technical Report and Model Card
Authors:
Amazon AGI,
Aaron Langford,
Aayush Shah,
Abhanshu Gupta,
Abhimanyu Bhatter,
Abhinav Goyal,
Abhinav Mathur,
Abhinav Mohanty,
Abhishek Kumar,
Abhishek Sethi,
Abi Komma,
Abner Pena,
Achin Jain,
Adam Kunysz,
Adam Opyrchal,
Adarsh Singh,
Aditya Rawal,
Adok Achar Budihal Prasad,
Adrià de Gispert,
Agnika Kumar,
Aishwarya Aryamane,
Ajay Nair,
Akilan M,
Akshaya Iyengar,
Akshaya Vishnu Kudlu Shanbhogue
, et al. (761 additional authors not shown)
Abstract:
We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents…
▽ More
We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents and text. Amazon Nova Micro is a text-only model that delivers our lowest-latency responses at very low cost. Amazon Nova Canvas is an image generation model that creates professional grade images with rich customization controls. Amazon Nova Reel is a video generation model offering high-quality outputs, customization, and motion control. Our models were built responsibly and with a commitment to customer trust, security, and reliability. We report benchmarking results for core capabilities, agentic performance, long context, functional adaptation, runtime performance, and human evaluation.
△ Less
Submitted 17 March, 2025;
originally announced June 2025.
-
A novel visual data-based diagnostic approach for estimation of regime transition in pool boiling
Authors:
Pranay Nirapure,
Ayushman Singh,
Srikanth Rangarajan,
Bahgat Sammakia
Abstract:
This study introduces a novel metric, the Index of Visual Similarity (IVS), to qualitatively characterize boiling heat transfer regimes using only visual data. The IVS is constructed by combining morphological similarity, through SIFT-based feature matching, with physical similarity, via vapor area estimation using Mask R-CNN. High-speed images of pool boiling on two distinct surfaces, polished co…
▽ More
This study introduces a novel metric, the Index of Visual Similarity (IVS), to qualitatively characterize boiling heat transfer regimes using only visual data. The IVS is constructed by combining morphological similarity, through SIFT-based feature matching, with physical similarity, via vapor area estimation using Mask R-CNN. High-speed images of pool boiling on two distinct surfaces, polished copper and porous copper foam, are employed to demonstrate the generalizability of the approach. IVS captures critical changes in bubble shape, size, and distribution that correspond to transitions in heat transfer mechanisms. The metric is validated against an equivalent metric, $Φ$, derived from measured heat transfer coefficients (HTC), showing strong correlation and reliability in detecting boiling regime transitions, including the onset of nucleate boiling and proximity to critical heat flux (CHF). Given experimental limitations in precisely measuring changes in HTC, the sensitivity of IVS to surface superheat is also examined to reinforce the credibility of IVS. IVS thus emerges as a powerful, rapid, and non-intrusive tool for real-time, image-based boiling diagnostics, with promising applications in phase change heat transfer.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Effect of Weak Measurement Reversal on Quantum Correlations in a Correlated Amplitude Damping Channel, with a Neural Network Perspective
Authors:
Venkat Abhignan,
Bidyut Bikash Boruah,
R. Srikanth,
Ashutosh Singh
Abstract:
We study the evolution of quantum correlations in Bell, Werner, and maximally entangled mixed states of two qubits subjected to correlated amplitude-damping channels. Our primary focus is to evaluate the robustness of entanglement as a resource for quantum information protocols such as dense coding, teleportation, and Einstein-Podolsky-Rosen (EPR) steering under the influence of noise. In addition…
▽ More
We study the evolution of quantum correlations in Bell, Werner, and maximally entangled mixed states of two qubits subjected to correlated amplitude-damping channels. Our primary focus is to evaluate the robustness of entanglement as a resource for quantum information protocols such as dense coding, teleportation, and Einstein-Podolsky-Rosen (EPR) steering under the influence of noise. In addition, we investigate the behaviour of other quantum correlations, including quantum discord and coherence, and analyze their hierarchy under decoherence. To counteract the detrimental effects of the channels, we apply the weak measurement and quantum measurement reversal (WMR) protocol, comparing the effectiveness of single-qubit and two-qubit WMR techniques. Our results show that the two-qubit WMR protocol significantly outperforms the single-qubit approach in preserving quantum correlations. Furthermore, we employ a neural network model to enhance our analysis of the relationship between different quantum correlation measures during the evolution. Using a MATLAB-based artificial neural network with 80 neurons across three hidden layers and trained with the Levenberg-Marquardt algorithm, we successfully predict trace distance discord from other correlations, achieving low prediction errors. Besides, our analysis of the neural network weights suggests that concurrence and EPR steering have the most positive influence on the accurate discord predictions.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Authors:
Andres Carofilis,
Pradeep Rangappa,
Srikanth Madikeri,
Shashi Kumar,
Sergio Burdisso,
Jeena Prakash,
Esau Villatoro-Tello,
Petr Motlicek,
Bidisha Sharma,
Kadri Hacioglu,
Shankar Venkatesan,
Saurabh Vyas,
Andreas Stolcke
Abstract:
Fine-tuning pretrained ASR models for specific domains is challenging when labeled data is scarce. But unlabeled audio and labeled data from related domains are often available. We propose an incremental semi-supervised learning pipeline that first integrates a small in-domain labeled set and an auxiliary dataset from a closely related domain, achieving a relative improvement of 4% over no auxilia…
▽ More
Fine-tuning pretrained ASR models for specific domains is challenging when labeled data is scarce. But unlabeled audio and labeled data from related domains are often available. We propose an incremental semi-supervised learning pipeline that first integrates a small in-domain labeled set and an auxiliary dataset from a closely related domain, achieving a relative improvement of 4% over no auxiliary data. Filtering based on multi-model consensus or named entity recognition (NER) is then applied to select and iteratively refine pseudo-labels, showing slower performance saturation compared to random selection. Evaluated on the multi-domain Wow call center and Fisher English corpora, it outperforms single-step fine-tuning. Consensus-based filtering outperforms other methods, providing up to 22.3% relative improvement on Wow and 24.8% on Fisher over single-step fine-tuning with random selection. NER is the second-best filter, providing competitive performance at a lower computational cost.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
OpenAg: Democratizing Agricultural Intelligence
Authors:
Srikanth Thudumu,
Jason Fisher
Abstract:
Agriculture is undergoing a major transformation driven by artificial intelligence (AI), machine learning, and knowledge representation technologies. However, current agricultural intelligence systems often lack contextual understanding, explainability, and adaptability, especially for smallholder farmers with limited resources. General-purpose large language models (LLMs), while powerful, typical…
▽ More
Agriculture is undergoing a major transformation driven by artificial intelligence (AI), machine learning, and knowledge representation technologies. However, current agricultural intelligence systems often lack contextual understanding, explainability, and adaptability, especially for smallholder farmers with limited resources. General-purpose large language models (LLMs), while powerful, typically lack the domain-specific knowledge and contextual reasoning needed for practical decision support in farming. They tend to produce recommendations that are too generic or unrealistic for real-world applications. To address these challenges, we present OpenAg, a comprehensive framework designed to advance agricultural artificial general intelligence (AGI). OpenAg combines domain-specific foundation models, neural knowledge graphs, multi-agent reasoning, causal explainability, and adaptive transfer learning to deliver context-aware, explainable, and actionable insights. The system includes: (i) a unified agricultural knowledge base that integrates scientific literature, sensor data, and farmer-generated knowledge; (ii) a neural agricultural knowledge graph for structured reasoning and inference; (iii) an adaptive multi-agent reasoning system where AI agents specialize and collaborate across agricultural domains; and (iv) a causal transparency mechanism that ensures AI recommendations are interpretable, scientifically grounded, and aligned with real-world constraints. OpenAg aims to bridge the gap between scientific knowledge and the tacit expertise of experienced farmers to support scalable and locally relevant agricultural decision-making.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning
Authors:
Hasin Us Sami,
Swapneel Sen,
Amit K. Roy-Chowdhury,
Srikanth V. Krishnamurthy,
Basak Guler
Abstract:
Federated learning (FL) allows multiple data-owners to collaboratively train machine learning models by exchanging local gradients, while keeping their private data on-device. To simultaneously enhance privacy and training efficiency, recently parameter-efficient fine-tuning (PEFT) of large-scale pretrained models has gained substantial attention in FL. While keeping a pretrained (backbone) model…
▽ More
Federated learning (FL) allows multiple data-owners to collaboratively train machine learning models by exchanging local gradients, while keeping their private data on-device. To simultaneously enhance privacy and training efficiency, recently parameter-efficient fine-tuning (PEFT) of large-scale pretrained models has gained substantial attention in FL. While keeping a pretrained (backbone) model frozen, each user fine-tunes only a few lightweight modules to be used in conjunction, to fit specific downstream applications. Accordingly, only the gradients with respect to these lightweight modules are shared with the server. In this work, we investigate how the privacy of the fine-tuning data of the users can be compromised via a malicious design of the pretrained model and trainable adapter modules. We demonstrate gradient inversion attacks on a popular PEFT mechanism, the adapter, which allow an attacker to reconstruct local data samples of a target user, using only the accessible adapter gradients. Via extensive experiments, we demonstrate that a large batch of fine-tuning images can be retrieved with high fidelity. Our attack highlights the need for privacy-preserving mechanisms for PEFT, while opening up several future directions. Our code is available at https://github.com/info-ucr/PEFTLeak.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering
Authors:
Pradeep Rangappa,
Andres Carofilis,
Jeena Prakash,
Shashi Kumar,
Sergio Burdisso,
Srikanth Madikeri,
Esau Villatoro-Tello,
Bidisha Sharma,
Petr Motlicek,
Kadri Hacioglu,
Shankar Venkatesan,
Saurabh Vyas,
Andreas Stolcke
Abstract:
Fine-tuning pretrained ASR models for specific domains is challenging for small organizations with limited labeled data and computational resources. Here, we explore different data selection pipelines and propose a robust approach that improves ASR adaptation by filtering pseudo-labels generated using Whisper (encoder-decoder) and Zipformer (transducer) models. Our approach integrates multiple sel…
▽ More
Fine-tuning pretrained ASR models for specific domains is challenging for small organizations with limited labeled data and computational resources. Here, we explore different data selection pipelines and propose a robust approach that improves ASR adaptation by filtering pseudo-labels generated using Whisper (encoder-decoder) and Zipformer (transducer) models. Our approach integrates multiple selection strategies -- including word error rate (WER) prediction, named entity recognition (NER), and character error rate (CER) analysis -- to extract high-quality training segments. We evaluate our method on Whisper and Zipformer using a 7500-hour baseline, comparing it to a CER-based approach relying on hypotheses from three ASR systems. Fine-tuning on 7500 hours of pseudo-labeled call center data achieves 12.3% WER, while our filtering reduces the dataset to 100 hours (1.4%) with similar performance; a similar trend is observed on Fisher English.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Compress, Gather, and Recompute: REFORMing Long-Context Processing in Transformers
Authors:
Woomin Song,
Sai Muralidhar Jayanthi,
Srikanth Ronanki,
Kanthashree Mysore Sathyendra,
Jinwoo Shin,
Aram Galstyan,
Shubham Katiyar,
Sravan Babu Bodapati
Abstract:
As large language models increasingly gain popularity in real-world applications, processing extremely long contexts, often exceeding the model's pre-trained context limits, has emerged as a critical challenge. While existing approaches to efficient long-context processing show promise, recurrent compression-based methods struggle with information preservation, whereas random access approaches req…
▽ More
As large language models increasingly gain popularity in real-world applications, processing extremely long contexts, often exceeding the model's pre-trained context limits, has emerged as a critical challenge. While existing approaches to efficient long-context processing show promise, recurrent compression-based methods struggle with information preservation, whereas random access approaches require substantial memory resources. We introduce REFORM, a novel inference framework that efficiently handles long contexts through a two-phase approach. First, it incrementally processes input chunks while maintaining a compressed KV cache, constructs cross-layer context embeddings, and utilizes early exit strategy for improved efficiency. Second, it identifies and gathers essential tokens via similarity matching and selectively recomputes the KV cache. Compared to baselines, REFORM achieves over 50% and 27% performance gains on RULER and BABILong respectively at 1M context length. It also outperforms baselines on Infinite-Bench and MM-NIAH, demonstrating flexibility across diverse tasks and domains. Additionally, REFORM reduces inference time by 30% and peak memory usage by 5%, achieving both efficiency and superior performance.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
The Disparate Effects of Partial Information in Bayesian Strategic Learning
Authors:
Srikanth Avasarala,
Serena Wang,
Juba Ziani
Abstract:
We study how partial information about scoring rules affects fairness in strategic learning settings. In strategic learning, a learner deploys a scoring rule, and agents respond strategically by modifying their features -- at some cost -- to improve their outcomes. However, in our work, agents do not observe the scoring rule directly; instead, they receive a noisy signal of said rule. We consider…
▽ More
We study how partial information about scoring rules affects fairness in strategic learning settings. In strategic learning, a learner deploys a scoring rule, and agents respond strategically by modifying their features -- at some cost -- to improve their outcomes. However, in our work, agents do not observe the scoring rule directly; instead, they receive a noisy signal of said rule. We consider two different agent models: (i) naive agents, who take the noisy signal at face value, and (ii) Bayesian agents, who update a prior belief based on the signal.
Our goal is to understand how disparities in outcomes arise between groups that differ in their costs of feature modification, and how these disparities vary with the level of transparency of the learner's rule. For naive agents, we show that utility disparities can grow unboundedly with noise, and that the group with lower costs can, perhaps counter-intuitively, be disproportionately harmed under limited transparency. In contrast, for Bayesian agents, disparities remain bounded. We provide a full characterization of disparities across groups as a function of the level of transparency and show that they can vary non-monotonically with noise; in particular, disparities are often minimized at intermediate levels of transparency. Finally, we extend our analysis to settings where groups differ not only in cost, but also in prior beliefs, and study how this asymmetry influences fairness.
△ Less
Submitted 31 May, 2025;
originally announced June 2025.
-
Supervised Quantum Machine Learning: A Future Outlook from Qubits to Enterprise Applications
Authors:
Srikanth Thudumu,
Jason Fisher,
Hung Du
Abstract:
Supervised Quantum Machine Learning (QML) represents an intersection of quantum computing and classical machine learning, aiming to use quantum resources to support model training and inference. This paper reviews recent developments in supervised QML, focusing on methods such as variational quantum circuits, quantum neural networks, and quantum kernel methods, along with hybrid quantum-classical…
▽ More
Supervised Quantum Machine Learning (QML) represents an intersection of quantum computing and classical machine learning, aiming to use quantum resources to support model training and inference. This paper reviews recent developments in supervised QML, focusing on methods such as variational quantum circuits, quantum neural networks, and quantum kernel methods, along with hybrid quantum-classical workflows. We examine recent experimental studies that show partial indications of quantum advantage and describe current limitations including noise, barren plateaus, scalability issues, and the lack of formal proofs of performance improvement over classical methods. The main contribution is a ten-year outlook (2025-2035) that outlines possible developments in supervised QML, including a roadmap describing conditions under which QML may be used in applied research and enterprise systems over the next decade.
△ Less
Submitted 17 June, 2025; v1 submitted 30 May, 2025;
originally announced May 2025.
-
Evaluating Gemini in an arena for learning
Authors:
LearnLM Team,
Abhinit Modi,
Aditya Srikanth Veerubhotla,
Aliya Rysbek,
Andrea Huber,
Ankit Anand,
Avishkar Bhoopchand,
Brett Wiltshire,
Daniel Gillick,
Daniel Kasenberg,
Eleni Sgouritsa,
Gal Elidan,
Hengrui Liu,
Holger Winnemoeller,
Irina Jurenka,
James Cohan,
Jennifer She,
Julia Wilkowski,
Kaiz Alarakyia,
Kevin R. McKee,
Komal Singh,
Lisa Wang,
Markus Kunesch,
Miruna Pîslar,
Niv Efron
, et al. (12 additional authors not shown)
Abstract:
Artificial intelligence (AI) is poised to transform education, but the research community lacks a robust, general benchmark to evaluate AI models for learning. To assess state-of-the-art support for educational use cases, we ran an "arena for learning" where educators and pedagogy experts conduct blind, head-to-head, multi-turn comparisons of leading AI models. In particular, $N = 189$ educators d…
▽ More
Artificial intelligence (AI) is poised to transform education, but the research community lacks a robust, general benchmark to evaluate AI models for learning. To assess state-of-the-art support for educational use cases, we ran an "arena for learning" where educators and pedagogy experts conduct blind, head-to-head, multi-turn comparisons of leading AI models. In particular, $N = 189$ educators drew from their experience to role-play realistic learning use cases, interacting with two models sequentially, after which $N = 206$ experts judged which model better supported the user's learning goals. The arena evaluated a slate of state-of-the-art models: Gemini 2.5 Pro, Claude 3.7 Sonnet, GPT-4o, and OpenAI o3. Excluding ties, experts preferred Gemini 2.5 Pro in 73.2% of these match-ups -- ranking it first overall in the arena. Gemini 2.5 Pro also demonstrated markedly higher performance across key principles of good pedagogy. Altogether, these results position Gemini 2.5 Pro as a leading model for learning.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
The spectrum of local dualisable modular representations
Authors:
Dave Benson,
Srikanth B. Iyengar,
Henning Krause,
Julia Pevtsova
Abstract:
For a point $\mathfrak{p}$ in the spectrum of the cohomology ring of a finite group $G$ over a field $k$, we calculate the spectrum for the subcategory of dualisable objects inside the tensor triangulated category of $\mathfrak{p}$-local and $\mathfrak{p}$-torsion objects in the (big) stable module category of the group algebra $kG$.
For a point $\mathfrak{p}$ in the spectrum of the cohomology ring of a finite group $G$ over a field $k$, we calculate the spectrum for the subcategory of dualisable objects inside the tensor triangulated category of $\mathfrak{p}$-local and $\mathfrak{p}$-torsion objects in the (big) stable module category of the group algebra $kG$.
△ Less
Submitted 25 May, 2025;
originally announced May 2025.
-
Towards Large Reasoning Models for Agriculture
Authors:
Hossein Zaremehrjerdi,
Shreyan Ganguly,
Ashlyn Rairdin,
Elizabeth Tranel,
Benjamin Feuer,
Juan Ignacio Di Salvo,
Srikanth Panthulugiri,
Hernan Torres Pacin,
Victoria Moser,
Sarah Jones,
Joscif G Raigne,
Yanben Shen,
Heidi M. Dornath,
Aditya Balu,
Adarsh Krishnamurthy,
Asheesh K Singh,
Arti Singh,
Baskar Ganapathysubramanian,
Chinmay Hegde,
Soumik Sarkar
Abstract:
Agricultural decision-making involves complex, context-specific reasoning, where choices about crops, practices, and interventions depend heavily on geographic, climatic, and economic conditions. Traditional large language models (LLMs) often fall short in navigating this nuanced problem due to limited reasoning capacity. We hypothesize that recent advances in large reasoning models (LRMs) can bet…
▽ More
Agricultural decision-making involves complex, context-specific reasoning, where choices about crops, practices, and interventions depend heavily on geographic, climatic, and economic conditions. Traditional large language models (LLMs) often fall short in navigating this nuanced problem due to limited reasoning capacity. We hypothesize that recent advances in large reasoning models (LRMs) can better handle such structured, domain-specific inference. To investigate this, we introduce AgReason, the first expert-curated open-ended science benchmark with 100 questions for agricultural reasoning. Evaluations across thirteen open-source and proprietary models reveal that LRMs outperform conventional ones, though notable challenges persist, with the strongest Gemini-based baseline achieving 36% accuracy. We also present AgThoughts, a large-scale dataset of 44.6K question-answer pairs generated with human oversight and equipped with synthetically generated reasoning traces. Using AgThoughts, we develop AgThinker, a suite of small reasoning models that can be run on consumer-grade GPUs, and show that our dataset can be effective in unlocking agricultural reasoning abilities in LLMs. Our project page is here: https://baskargroup.github.io/Ag_reasoning/
△ Less
Submitted 27 May, 2025; v1 submitted 25 May, 2025;
originally announced May 2025.
-
Designing and Implementing Robust Test Automation Frameworks using Cucumber BDD and Java
Authors:
Srikanth Srinivas,
Lagan Goel
Abstract:
Modern software development demands rapid, reliable testing methods to maintain high quality in increasingly complex systems. This paper details a comprehensive approach to designing and implementing robust test automation frameworks by leveraging Cucumber BDD with Java. By utilizing Cucumber BDD natural language syntax, the framework enables clear communication between technical and non-technical…
▽ More
Modern software development demands rapid, reliable testing methods to maintain high quality in increasingly complex systems. This paper details a comprehensive approach to designing and implementing robust test automation frameworks by leveraging Cucumber BDD with Java. By utilizing Cucumber BDD natural language syntax, the framework enables clear communication between technical and non-technical team members, ensuring that requirements are accurately translated into executable tests. Java, renowned for its versatility and extensive libraries, serves as the backbone for creating scalable, maintainable, and efficient test scripts. The framework described herein focuses on modular architecture, facilitating re usability and streamlined maintenance across diverse application domains. It systematically addresses challenges such as test data management, dynamic environment handling, and integration with continuous integration/continuous delivery pipelines. Empirical evaluations demonstrate that this integrated approach not only reduces manual testing effort but also significantly enhances defect detection and overall software reliability. The methodology encourages the adoption of best practices in test design, including clear documentation, iterative development, and automated reporting.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Assessing the Quality of AI-Generated Clinical Notes: A Validated Evaluation of a Large Language Model Scribe
Authors:
Erin Palm,
Astrit Manikantan,
Mark E. Pepin,
Herprit Mahal,
Srikanth Subramanya Belwadi
Abstract:
In medical practices across the United States, physicians have begun implementing generative artificial intelligence (AI) tools to perform the function of scribes in order to reduce the burden of documenting clinical encounters. Despite their widespread use, no established methods exist to gauge the quality of AI scribes. To address this gap, we developed a blinded study comparing the relative per…
▽ More
In medical practices across the United States, physicians have begun implementing generative artificial intelligence (AI) tools to perform the function of scribes in order to reduce the burden of documenting clinical encounters. Despite their widespread use, no established methods exist to gauge the quality of AI scribes. To address this gap, we developed a blinded study comparing the relative performance of large language model (LLM) generated clinical notes with those from field experts based on audio-recorded clinical encounters. Quantitative metrics from the Physician Documentation Quality Instrument (PDQI9) provided a framework to measure note quality, which we adapted to assess relative performance of AI generated notes. Clinical experts spanning 5 medical specialties used the PDQI9 tool to evaluate specialist-drafted Gold notes and LLM authored Ambient notes. Two evaluators from each specialty scored notes drafted from a total of 97 patient visits. We found uniformly high inter rater agreement (RWG greater than 0.7) between evaluators in general medicine, orthopedics, and obstetrics and gynecology, and moderate (RWG 0.5 to 0.7) to high inter rater agreement in pediatrics and cardiology. We found a modest yet significant difference in the overall note quality, wherein Gold notes achieved a score of 4.25 out of 5 and Ambient notes scored 4.20 out of 5 (p = 0.04). Our findings support the use of the PDQI9 instrument as a practical method to gauge the quality of LLM authored notes, as compared to human-authored notes.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
UBGAN: Enhancing Coded Speech with Blind and Guided Bandwidth Extension
Authors:
Kishan Gupta,
Srikanth Korse,
Andreas Brendel,
Nicola Pia,
Guillaume Fuchs
Abstract:
In practical application of speech codecs, a multitude of factors such as the quality of the radio connection, limiting hardware or required user experience necessitate trade-offs between achievable perceptual quality, engendered bitrate and computational complexity. Most conventional and neural speech codecs operate on wideband (WB) speech signals to achieve this compromise. To further enhance th…
▽ More
In practical application of speech codecs, a multitude of factors such as the quality of the radio connection, limiting hardware or required user experience necessitate trade-offs between achievable perceptual quality, engendered bitrate and computational complexity. Most conventional and neural speech codecs operate on wideband (WB) speech signals to achieve this compromise. To further enhance the perceptual quality of coded speech, bandwidth extension (BWE) of the transmitted speech is an attractive and popular technique in conventional speech coding. In contrast, neural speech codecs are typically trained end-to-end to a specific set of requirements and are often not easily adaptable. In particular, they are typically trained to operate at a single fixed sampling rate. With the Universal Bandwidth Extension Generative Adversarial Network (UBGAN), we propose a modular and lightweight GAN-based solution that increases the operational flexibility of a wide range of conventional and neural codecs. Our model operates in the subband domain and extends the bandwidth of WB signals from 8 kHz to 16 kHz, resulting in super-wideband (SWB) signals. We further introduce two variants, guided-UBGAN and blind-UBGAN, where the guided version transmits quantized learned representation as a side information at a very low bitrate additional to the bitrate of the codec, while blind-BWE operates without such side-information. Our subjective assessments demonstrate the advantage of UBGAN applied to WB codecs and highlight the generalization capacity of our proposed method across multiple codecs and bitrates.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Quantum steganography using catalytic and entanglement-assisted quantum codes
Authors:
Sanjoy Dutta,
Nihar Ranjan Dash,
Subhashish Banerjee,
R. Srikanth
Abstract:
Steganography is the technique for transmitting a secret message by employing subterfuge to conceal it in innocent-looking data, rather than by overt security measures as in cryptography. Typically, non-degenerate quantum error-correcting codes (QECCs) are used as the cover medium, with the stego message disguised as noise. As in cryptography, a large number of bits or ebits are pre-shared, in thi…
▽ More
Steganography is the technique for transmitting a secret message by employing subterfuge to conceal it in innocent-looking data, rather than by overt security measures as in cryptography. Typically, non-degenerate quantum error-correcting codes (QECCs) are used as the cover medium, with the stego message disguised as noise. As in cryptography, a large number of bits or ebits are pre-shared, in this case mainly in order to ensure the innocence effect. In this work we develop three steganographic protocols: first, a scheme based on catalytic quantum codes to minimize initial pre-shared resources; second, a scheme incorporating prior entanglement into QECCs in the form of possibly degenerate entanglement-assisted QECCs; third, a scheme that uses the phase bit of a pre-shared ebit, combined with QECCs.
△ Less
Submitted 21 May, 2025;
originally announced May 2025.
-
Coarse grained descriptions of the dynamics of yielding of amorphous solids under cyclic shear
Authors:
Debargha Sarkar,
Jishnu N. Nampoothiri,
Muhittin Mungan,
Jack T. Parley,
Peter Sollich,
Srikanth Sastry
Abstract:
Recent computer simulations reveal several intriguing features in the evolution of properties of amorphous solids subjected to repeated cyclic shear deformation. These include the divergence of the number of cycles to reach steady states as the yielding point is approached, a non-monotonic change of properties with cycles, and the possibility of a spectrum of frozen states. Theoretical attempts to…
▽ More
Recent computer simulations reveal several intriguing features in the evolution of properties of amorphous solids subjected to repeated cyclic shear deformation. These include the divergence of the number of cycles to reach steady states as the yielding point is approached, a non-monotonic change of properties with cycles, and the possibility of a spectrum of frozen states. Theoretical attempts to capture these properties through simple models, including the Ehrenfest model describing a random walk in a confining potential, have met partial success. Here, we show that incorporating the influence of mechanical noise through a feedback term leads to a genuine dynamical transition with characteristics reflecting those of yielding. Coarse graining the dynamics into a small number of variables leads to new insights regarding the dynamics of yielding.
△ Less
Submitted 20 May, 2025;
originally announced May 2025.
-
Mixed Signals: Understanding Model Disagreement in Multimodal Empathy Detection
Authors:
Maya Srikanth,
Run Chen,
Julia Hirschberg
Abstract:
Multimodal models play a key role in empathy detection, but their performance can suffer when modalities provide conflicting cues. To understand these failures, we examine cases where unimodal and multimodal predictions diverge. Using fine-tuned models for text, audio, and video, along with a gated fusion model, we find that such disagreements often reflect underlying ambiguity, as evidenced by an…
▽ More
Multimodal models play a key role in empathy detection, but their performance can suffer when modalities provide conflicting cues. To understand these failures, we examine cases where unimodal and multimodal predictions diverge. Using fine-tuned models for text, audio, and video, along with a gated fusion model, we find that such disagreements often reflect underlying ambiguity, as evidenced by annotator uncertainty. Our analysis shows that dominant signals in one modality can mislead fusion when unsupported by others. We also observe that humans, like models, do not consistently benefit from multimodal input. These insights position disagreement as a useful diagnostic signal for identifying challenging examples and improving empathy system robustness.
△ Less
Submitted 20 May, 2025;
originally announced May 2025.
-
Probing Reheating Phase via Non-Helical Magnetogenesis and Secondary Gravitational Waves
Authors:
Subhasis Maiti,
Debaprasad Maity,
Rohan Srikanth
Abstract:
In the past two decades, significant advancements have been made in observational techniques to enhance our understanding of the universe and its evolutionary processes. However, our knowledge of the post-inflation reheating phase remains limited due to its small-scale dynamics. Traditional observations, such as those of the Cosmic Microwave Background (CMB), primarily provide insights into large-…
▽ More
In the past two decades, significant advancements have been made in observational techniques to enhance our understanding of the universe and its evolutionary processes. However, our knowledge of the post-inflation reheating phase remains limited due to its small-scale dynamics. Traditional observations, such as those of the Cosmic Microwave Background (CMB), primarily provide insights into large-scale dynamics, making it challenging to glean information about the reheating era. In this paper, our primary aim is to explore how the generation of Gravitational Waves (GWs) spectra, resulting from electromagnetic fields in the early universe, can offer valuable insights into the Reheating dynamics. We investigate how the spectral shape of GWs varies across different frequency ranges, depending on the initial magnetic profile and reheating dynamics. For this, we consider a well-known non-helical magnetogenesis model, where the usual electromagnetic kinetic term is coupled with a background scalar. Notably, for such a scenario, we observe distinct spectral shapes with sufficiently high amplitudes for different reheating histories with the equation of state parametrized by ($w_{\rm re}$). We identify spectral breaks in the GW spectra for both $w_{\rm re}<1/3$ and $w_{\rm re}>1/3$ scenarios. We find that future GW experiments such as BBO, LISA, SKA, and DECIGO are well within the reach of observing those distinct spectral shapes and can potentially shed light on the underlying mechanism of the reheating phase.
△ Less
Submitted 19 May, 2025;
originally announced May 2025.
-
Trailblazer: Learning offroad costmaps for long range planning
Authors:
Kasi Viswanath,
Felix Sanchez,
Timothy Overbye,
Jason M. Gregory,
Srikanth Saripalli
Abstract:
Autonomous navigation in off-road environments remains a significant challenge in field robotics, particularly for Unmanned Ground Vehicles (UGVs) tasked with search and rescue, exploration, and surveillance. Effective long-range planning relies on the integration of onboard perception systems with prior environmental knowledge, such as satellite imagery and LiDAR data. This work introduces Trailb…
▽ More
Autonomous navigation in off-road environments remains a significant challenge in field robotics, particularly for Unmanned Ground Vehicles (UGVs) tasked with search and rescue, exploration, and surveillance. Effective long-range planning relies on the integration of onboard perception systems with prior environmental knowledge, such as satellite imagery and LiDAR data. This work introduces Trailblazer, a novel framework that automates the conversion of multi-modal sensor data into costmaps, enabling efficient path planning without manual tuning. Unlike traditional approaches, Trailblazer leverages imitation learning and a differentiable A* planner to learn costmaps directly from expert demonstrations, enhancing adaptability across diverse terrains. The proposed methodology was validated through extensive real-world testing, achieving robust performance in dynamic and complex environments, demonstrating Trailblazer's potential for scalable, efficient autonomous navigation.
△ Less
Submitted 10 June, 2025; v1 submitted 14 May, 2025;
originally announced May 2025.
-
Measuring $\mathbb{Z}_2$ invariants in dimer models and cross-coupled ladders with a programmable photonic molecule
Authors:
Sashank Kaushik Sridhar,
Rohith Srikanth,
Alexander R. Miller,
Ferguson J. McComb,
Avik Dutt
Abstract:
Topological models are characterized by a quantized topological invariant and provide a description of novel phases of matter that can exhibit localized edge states, corner modes, and chiral transport. We experimentally realize two 1-D lattices supporting symmetry-protected topology - the Su-Schrieffer-Heeger (SSH) and extended SSH models using the synthetic frequency dimension of coupled fiber ri…
▽ More
Topological models are characterized by a quantized topological invariant and provide a description of novel phases of matter that can exhibit localized edge states, corner modes, and chiral transport. We experimentally realize two 1-D lattices supporting symmetry-protected topology - the Su-Schrieffer-Heeger (SSH) and extended SSH models using the synthetic frequency dimension of coupled fiber ring resonators. We introduce and experimentally demonstrate cascaded heterodyning as a technique for low-noise, single-shot winding number measurements through the mean chiral displacement and band structure measurements. Through our robust setup and detection techniques we can extend our capability to realizing 1-D ladder models, demonstrating a modified Creutz ladder with a staggered flux with each plaquette. This highly reconfigurable and compact fiber optics platform for Hamiltonian simulation, along with a low-noise detection scheme, provides a path forward for chip-scale realizations.
△ Less
Submitted 7 May, 2025;
originally announced May 2025.
-
Jailbreak Detection in Clinical Training LLMs Using Feature-Based Predictive Models
Authors:
Tri Nguyen,
Lohith Srikanth Pentapalli,
Magnus Sieverding,
Laurah Turner,
Seth Overla,
Weibing Zheng,
Chris Zhou,
David Furniss,
Danielle Weber,
Michael Gharib,
Matt Kelleher,
Michael Shukis,
Cameron Pawlik,
Kelly Cohen
Abstract:
Jailbreaking in Large Language Models (LLMs) threatens their safe use in sensitive domains like education by allowing users to bypass ethical safeguards. This study focuses on detecting jailbreaks in 2-Sigma, a clinical education platform that simulates patient interactions using LLMs. We annotated over 2,300 prompts across 158 conversations using four linguistic variables shown to correlate stron…
▽ More
Jailbreaking in Large Language Models (LLMs) threatens their safe use in sensitive domains like education by allowing users to bypass ethical safeguards. This study focuses on detecting jailbreaks in 2-Sigma, a clinical education platform that simulates patient interactions using LLMs. We annotated over 2,300 prompts across 158 conversations using four linguistic variables shown to correlate strongly with jailbreak behavior. The extracted features were used to train several predictive models, including Decision Trees, Fuzzy Logic-based classifiers, Boosting methods, and Logistic Regression. Results show that feature-based predictive models consistently outperformed Prompt Engineering, with the Fuzzy Decision Tree achieving the best overall performance. Our findings demonstrate that linguistic-feature-based models are effective and explainable alternatives for jailbreak detection. We suggest future work explore hybrid frameworks that integrate prompt-based flexibility with rule-based robustness for real-time, spectrum-based jailbreak monitoring in educational LLMs.
△ Less
Submitted 21 April, 2025;
originally announced May 2025.
-
Fermionic Band Dispersions and an Evidence of Cooperon Excitations in a Spin-$1/2$ Trimer Chain
Authors:
P. Srikanth Patnaik,
Snehasish Sen,
A. K. Bera,
Sudhansu S. Mandal,
Anushree Roy,
S. M. Yusuf
Abstract:
We obtain the solution of the Hamiltonian of an antiferromagnetically coupled spin-$1/2$ trimer chain in terms of three bands that host three different species of fermions. While the lowest two bands correspond to spin-$1/2$ fermions, the fermions in the highest band are of spin-$3/2$. Because the bands are for different species of fermions, the particle-hole excitation channel across the bands is…
▽ More
We obtain the solution of the Hamiltonian of an antiferromagnetically coupled spin-$1/2$ trimer chain in terms of three bands that host three different species of fermions. While the lowest two bands correspond to spin-$1/2$ fermions, the fermions in the highest band are of spin-$3/2$. Because the bands are for different species of fermions, the particle-hole excitation channel across the bands is closed. However, fractionalized excitations as spin-$1/2$ and spin-$3/2$ fermions in pairs open a cooperon channel of excitations in Raman scattering. The background spectral intensity profile obtained by Raman scattering measurements in Na$_2$Cu$_3$Ge$_4$O$_{12}$ having a trimer chain consisting of spin-$1/2$ Cu ions, has comprehensively been shown to be consistent with these excitations.
△ Less
Submitted 30 April, 2025;
originally announced April 2025.
-
A Gradient-Optimized TSK Fuzzy Framework for Explainable Phishing Detection
Authors:
Lohith Srikanth Pentapalli,
Jon Salisbury,
Josette Riep,
Kelly Cohen
Abstract:
Phishing attacks represent an increasingly sophisticated and pervasive threat to individuals and organizations, causing significant financial losses, identity theft, and severe damage to institutional reputations. Existing phishing detection methods often struggle to simultaneously achieve high accuracy and explainability, either failing to detect novel attacks or operating as opaque black-box mod…
▽ More
Phishing attacks represent an increasingly sophisticated and pervasive threat to individuals and organizations, causing significant financial losses, identity theft, and severe damage to institutional reputations. Existing phishing detection methods often struggle to simultaneously achieve high accuracy and explainability, either failing to detect novel attacks or operating as opaque black-box models. To address this critical gap, we propose a novel phishing URL detection system based on a first-order Takagi-Sugeno-Kang (TSK) fuzzy inference model optimized through gradient-based techniques. Our approach intelligently combines the interpretability and human-like reasoning capabilities of fuzzy logic with the precision and adaptability provided by gradient optimization methods, specifically leveraging the Adam optimizer for efficient parameter tuning. Experiments conducted using a comprehensive dataset of over 235,000 URLs demonstrate rapid convergence, exceptional predictive performance (accuracy averaging 99.95% across 5 cross-validation folds, with a perfect AUC i.e. 1.00). Furthermore, optimized fuzzy rules and membership functions improve interoperability, clearly indicating how the model makes decisions - an essential feature for cybersecurity applications. This high-performance, transparent, and interpretable phishing detection framework significantly advances current cybersecurity defenses, providing practitioners with accurate and explainable decision-making tools.
△ Less
Submitted 25 April, 2025;
originally announced April 2025.
-
CiteFix: Enhancing RAG Accuracy Through Post-Processing Citation Correction
Authors:
Harsh Maheshwari,
Srikanth Tenneti,
Alwarappan Nakkiran
Abstract:
Retrieval Augmented Generation (RAG) has emerged as a powerful application of Large Language Models (LLMs), revolutionizing information search and consumption. RAG systems combine traditional search capabilities with LLMs to generate comprehensive answers to user queries, ideally with accurate citations. However, in our experience of developing a RAG product, LLMs often struggle with source attrib…
▽ More
Retrieval Augmented Generation (RAG) has emerged as a powerful application of Large Language Models (LLMs), revolutionizing information search and consumption. RAG systems combine traditional search capabilities with LLMs to generate comprehensive answers to user queries, ideally with accurate citations. However, in our experience of developing a RAG product, LLMs often struggle with source attribution, aligning with other industry studies reporting citation accuracy rates of only about 74% for popular generative search engines. To address this, we present efficient post-processing algorithms to improve citation accuracy in LLM-generated responses, with minimal impact on latency and cost. Our approaches cross-check generated citations against retrieved articles using methods including keyword + semantic matching, fine tuned model with BERTScore, and a lightweight LLM-based technique. Our experimental results demonstrate a relative improvement of 15.46% in the overall accuracy metrics of our RAG system. This significant enhancement potentially enables a shift from our current larger language model to a relatively smaller model that is approximately 12x more cost-effective and 3x faster in inference time, while maintaining comparable performance. This research contributes to enhancing the reliability and trustworthiness of AI-generated content in information retrieval and summarization tasks which is critical to gain customer trust especially in commercial products.
△ Less
Submitted 11 June, 2025; v1 submitted 22 April, 2025;
originally announced April 2025.
-
Minimal Magnetogenesis: The Role of Inflationary Perturbations and ALPs, and Its Gravitational Wave Signatures
Authors:
Subhasis Maiti,
Debaprasad Maity,
Rohan Srikanth
Abstract:
Any attempt to understand the ubiquitous nature of the magnetic field in the present universe seems to lead us towards its primordial origin. For large-scale magnetic fields, however, their strength and length scale may not necessarily originate from a singular primordial mechanism, namely inflationary magnetogenesis, which has been a popular consideration in the literature. In this paper, we prop…
▽ More
Any attempt to understand the ubiquitous nature of the magnetic field in the present universe seems to lead us towards its primordial origin. For large-scale magnetic fields, however, their strength and length scale may not necessarily originate from a singular primordial mechanism, namely inflationary magnetogenesis, which has been a popular consideration in the literature. In this paper, we propose a minimal scenario wherein a large-scale magnetic field is generated from the inflationary perturbation without any non-conformal coupling. Due to their origin in the inflationary scalar spectrum, these primordial fields are inherently weak, with their strength suppressed by the small amplitude of scalar fluctuations. We then consider the coupling between this large-scale weak primordial magnetic field and a light axion of mass $<10^{-28}$ eV, which is assumed to be frozen in a misaligned state until the photon decoupling. After the decoupling, when the universe enters into a dark age, the light axion coherently oscillates. By appropriately tuning the axion-photon coupling parameter $α$, we demonstrate that a large-scale magnetic field of sufficient strength can indeed be generated through tachyonic resonance. We further show that the produced magnetic field induces a unique spectrum with multiple peaks of secondary gravitational waves, which the upcoming CMB-S4 can probe through B-mode polarization. The strength can be sufficient enough to violate the PLANCK bound on tensor-to-scalar ratio $r \lesssim 0.036$. Such a violation leads to a constraint on $α\lesssim 80$. With this limiting value of the coupling, we find that present-day magnetic field strength could be as high as $10^{-10}$ Gauss at $\Mpc$ scale, consistent with observation.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
Algorithmic Prompt Generation for Diverse Human-like Teaming and Communication with Large Language Models
Authors:
Siddharth Srikanth,
Varun Bhatt,
Boshen Zhang,
Werner Hager,
Charles Michael Lewis,
Katia P. Sycara,
Aaquib Tabrez,
Stefanos Nikolaidis
Abstract:
Understanding how humans collaborate and communicate in teams is essential for improving human-agent teaming and AI-assisted decision-making. However, relying solely on data from large-scale user studies is impractical due to logistical, ethical, and practical constraints, necessitating synthetic models of multiple diverse human behaviors. Recently, agents powered by Large Language Models (LLMs) h…
▽ More
Understanding how humans collaborate and communicate in teams is essential for improving human-agent teaming and AI-assisted decision-making. However, relying solely on data from large-scale user studies is impractical due to logistical, ethical, and practical constraints, necessitating synthetic models of multiple diverse human behaviors. Recently, agents powered by Large Language Models (LLMs) have been shown to emulate human-like behavior in social settings. But, obtaining a large set of diverse behaviors requires manual effort in the form of designing prompts. On the other hand, Quality Diversity (QD) optimization has been shown to be capable of generating diverse Reinforcement Learning (RL) agent behavior. In this work, we combine QD optimization with LLM-powered agents to iteratively search for prompts that generate diverse team behavior in a long-horizon, multi-step collaborative environment. We first show, through a human-subjects experiment (n=54 participants), that humans exhibit diverse coordination and communication behavior in this domain. We then show that our approach can effectively replicate trends from human teaming data and also capture behaviors that are not easily observed without collecting large amounts of data. Our findings highlight the combination of QD and LLM-powered agents as an effective tool for studying teaming and communication strategies in multi-agent collaboration.
△ Less
Submitted 4 April, 2025;
originally announced April 2025.
-
Instabilities and bifurcations in turbulent porous media flow
Authors:
Vishal Srikanth,
Andrey V. Kuznetsov
Abstract:
Microscale turbulent flow in porous media is conducive to the development of flow instabilities due to strong vortical and shearing flow occurring within the pore space. When the flow instabilities around individual solid obstacles interact with numerous others within the porous medium, unique symmetry-breaking phenomena emerge as a result. This paper focuses on investigations of the vortex dynami…
▽ More
Microscale turbulent flow in porous media is conducive to the development of flow instabilities due to strong vortical and shearing flow occurring within the pore space. When the flow instabilities around individual solid obstacles interact with numerous others within the porous medium, unique symmetry-breaking phenomena emerge as a result. This paper focuses on investigations of the vortex dynamics and flow instabilities behind solid obstacles in porous media, emphasizing how solid obstacle geometry and porosity influence both microscale and macroscale flow behavior. Two distinct symmetry-breaking mechanisms were identified in different porosity ranges. In low porosity media (< 0.8), a "deviatory flow" phenomenon occurs, where the macroscale flow deviates from the direction of applied pressure gradient at Reynolds numbers above 500. Deviatory flow is a source of macroscale Reynolds stress anisotropy, which is counterbalanced by a diminished vortex core size. In the intermediate porosity regime (0.8-0.95), a "jetting flow" mechanism creates asymmetric microscale velocity channels in the pore space through temporally biased vortex shedding, occurring during the transition to turbulence. Both symmetry-breaking phenomena are critically influenced by solid obstacle shape, porosity, and Reynolds number. Circularity of solid obstacle geometry and an adequately high Reynolds number provide critical conditions for symmetry-breaking, whereas porosity can be used to parametrize the degree of symmetry-breaking. This paper provides fundamental insights into the intricate flow dynamics in porous media, offering a comprehensive understanding of how microscale vortex interactions generate macroscale flow asymmetries across different geometric configurations.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
Contradiction Detection in RAG Systems: Evaluating LLMs as Context Validators for Improved Information Consistency
Authors:
Vignesh Gokul,
Srikanth Tenneti,
Alwarappan Nakkiran
Abstract:
Retrieval Augmented Generation (RAG) systems have emerged as a powerful method for enhancing large language models (LLMs) with up-to-date information. However, the retrieval step in RAG can sometimes surface documents containing contradictory information, particularly in rapidly evolving domains such as news. These contradictions can significantly impact the performance of LLMs, leading to inconsi…
▽ More
Retrieval Augmented Generation (RAG) systems have emerged as a powerful method for enhancing large language models (LLMs) with up-to-date information. However, the retrieval step in RAG can sometimes surface documents containing contradictory information, particularly in rapidly evolving domains such as news. These contradictions can significantly impact the performance of LLMs, leading to inconsistent or erroneous outputs. This study addresses this critical challenge in two ways. First, we present a novel data generation framework to simulate different types of contradictions that may occur in the retrieval stage of a RAG system. Second, we evaluate the robustness of different LLMs in performing as context validators, assessing their ability to detect contradictory information within retrieved document sets. Our experimental results reveal that context validation remains a challenging task even for state-of-the-art LLMs, with performance varying significantly across different types of contradictions. While larger models generally perform better at contradiction detection, the effectiveness of different prompting strategies varies across tasks and model architectures. We find that chain-of-thought prompting shows notable improvements for some models but may hinder performance in others, highlighting the complexity of the task and the need for more robust approaches to context validation in RAG systems.
△ Less
Submitted 31 March, 2025;
originally announced April 2025.
-
Test and Calibration of the Solar Ultraviolet Imaging Telescope (SUIT) on board Aditya-L1
Authors:
Janmejoy Sarkar,
VN Nived,
Soumya Roy,
Rushikesh Deogaonkar,
Sreejith Padinhatteeri,
Raja Bayanna,
Ravi Kesharwani,
A. N. Ramaprakash,
Durgesh Tripathi,
Rahul Gopalakrishnan,
Bhushan Joshi,
. Sakya Sinha,
. Mahesh Burse,
Manoj Varma,
Anurag Tyagi,
Reena Yadav,
Chaitanya Rajarshi,
H. N. Adithya,
Abhijit Adoni,
Gazi A. Ahmed,
Dipankar Banerjee,
Rani Bhandare,
Bhargava Ram B. S.,
Kalpesh Chillal,
Pravin Chordia
, et al. (30 additional authors not shown)
Abstract:
The Solar Ultraviolet Imaging Telescope (SUIT) on board the AdityaL1 mission observes the Sun in the 200-400 nm wavelength range. This paper presents the results of various on ground and on board tests and their comparison with the specifications. Moreover, we also present the scheme for data calibration. We demonstrate that the test results are compliant with the specified figures, except the spa…
▽ More
The Solar Ultraviolet Imaging Telescope (SUIT) on board the AdityaL1 mission observes the Sun in the 200-400 nm wavelength range. This paper presents the results of various on ground and on board tests and their comparison with the specifications. Moreover, we also present the scheme for data calibration. We demonstrate that the test results are compliant with the specified figures, except the spatial resolution. Such discrepancy will limit the photometric measurements only, at a scale of 2.2" instead of 1.4" as originally envisioned. The results obtained here show that SUIT observations open up a new window for solar observations.
△ Less
Submitted 30 March, 2025;
originally announced March 2025.
-
AI-Powered Assistive Technologies for Visual Impairment
Authors:
Prudhvi Naayini,
Praveen Kumar Myakala,
Chiranjeevi Bura,
Anil Kumar Jonnalagadda,
Srikanth Kamatala
Abstract:
Artificial Intelligence (AI) is revolutionizing assistive technologies. It offers innovative solutions to enhance the quality of life for individuals with visual impairments. This review examines the development, applications, and impact of AI-powered tools in key domains, such as computer vision, natural language processing (NLP), and wearable devices. Specific advancements include object recogni…
▽ More
Artificial Intelligence (AI) is revolutionizing assistive technologies. It offers innovative solutions to enhance the quality of life for individuals with visual impairments. This review examines the development, applications, and impact of AI-powered tools in key domains, such as computer vision, natural language processing (NLP), and wearable devices. Specific advancements include object recognition for identifying everyday items, scene description for understanding surroundings, and NLP-driven text-to-speech systems for accessing digital information. Assistive technologies like smart glasses, smartphone applications, and AI-enabled navigation aids are discussed, demonstrating their ability to support independent travel, facilitate social interaction, and increase access to education and employment opportunities.
The integration of deep learning models, multimodal interfaces, and real-time data processing has transformed the functionality and usability of these tools, fostering inclusivity and empowerment. This article also addresses critical challenges, including ethical considerations, affordability, and adaptability in diverse environments. Future directions highlight the need for interdisciplinary collaboration to refine these technologies, ensuring equitable access and sustainable innovation. By providing a comprehensive overview, this review underscores AI's transformative potential in promoting independence, enhancing accessibility, and fostering social inclusion for visually impaired individuals.
△ Less
Submitted 13 January, 2025;
originally announced March 2025.
-
Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs
Authors:
Rupak Sarkar,
Neha Srikanth,
Taylor Hudson,
Rachel Rudinger,
Claire Bonial,
Philip Resnik
Abstract:
While it is commonly accepted that maintaining common ground plays a role in conversational success, little prior research exists connecting conversational grounding to success in task-oriented conversations. We study failures of grounding in the Ubuntu IRC dataset, where participants use text-only communication to resolve technical issues. We find that disruptions in conversational flow often ste…
▽ More
While it is commonly accepted that maintaining common ground plays a role in conversational success, little prior research exists connecting conversational grounding to success in task-oriented conversations. We study failures of grounding in the Ubuntu IRC dataset, where participants use text-only communication to resolve technical issues. We find that disruptions in conversational flow often stem from a misalignment in common ground, driven by a divergence in beliefs and assumptions held by participants. These disruptions, which we call conversational friction, significantly correlate with task success. We find that although LLMs can identify overt cases of conversational friction, they struggle with subtler and more context-dependent instances requiring pragmatic or domain-specific reasoning.
△ Less
Submitted 16 March, 2025;
originally announced March 2025.
-
Rouse Mode Analysis of Chain Relaxation in Reversibly Crosslinked Polymer Melts
Authors:
Rahul Karmakar,
Srikanth Sastry,
Sanat K. Kumar,
Tarak K. Patra
Abstract:
Polymer melts with chains undergoing reversible crosslinking have distinctively favorable dynamic properties, e.g., self healing and reprocessability. In these situations there are two relevant elementary time scales: the segmental and the sticker association times. A convenient framework to model these situations is the sticky Rouse model and here we perform hybrid moleculear dynamics (MD) Monte…
▽ More
Polymer melts with chains undergoing reversible crosslinking have distinctively favorable dynamic properties, e.g., self healing and reprocessability. In these situations there are two relevant elementary time scales: the segmental and the sticker association times. A convenient framework to model these situations is the sticky Rouse model and here we perform hybrid moleculear dynamics (MD) Monte Carlo (MC) simulations to examine its relevance. In agreement with the underpinning idea discussed above we find that reversibly crosslinked chains show two distinct modes of relaxation behavior depending on the magnitude of bond lifetimes. For bond lifetimes shorter than the chain end-to-end relaxation time, the polymers exhibit essentially Rouse like dynamics, but with an apparently increased local friction relative to the non-sticky analog. For longer bond lifetimes, the chains exhibit two modes of relaxation: the faster mode is independent of bond lifetime, but the slower mode is controlled by it. However, these slower mode results are not consistent with the predictions of the sticky Rouse model. Our Rouse mode analysis as a function of chain length, N, imply that this is likely a result of the relatively short chain length employed, but they nevertheless suggest that theories need to include these small chain effects if they are to be relevant to experimental systems with short chains following Rouse dynamics.
△ Less
Submitted 27 February, 2025;
originally announced March 2025.
-
Thermodynamics-Inspired High-Entropy Oxide Synthesis
Authors:
Saeed S. I. Almishal,
Matthew Furst,
Yueze Tan,
Jacob T. Sivak,
Gerald Bejger,
Dhiya Srikanth,
Joseph Petruska,
Christina M. Rost,
Susan B. Sinnott,
Long-Qing Chen,
Jon-Paul Maria
Abstract:
High-entropy oxide (HEO) thermodynamics transcend temperature-centric approaches, spanning a multidimensional landscape where oxygen chemical potential plays a decisive role. Here, we experimentally demonstrate how controlling the oxygen chemical potential coerces multivalent cations into divalent states in rock salt HEOs. We construct a preferred valence phase diagram based on thermodynamic stabi…
▽ More
High-entropy oxide (HEO) thermodynamics transcend temperature-centric approaches, spanning a multidimensional landscape where oxygen chemical potential plays a decisive role. Here, we experimentally demonstrate how controlling the oxygen chemical potential coerces multivalent cations into divalent states in rock salt HEOs. We construct a preferred valence phase diagram based on thermodynamic stability and equilibrium analysis, alongside a high throughput enthalpic stability map derived from atomistic calculations leveraging machine learning interatomic potentials. We identify and synthesize seven equimolar single-phase rock salt compositions that accommodate multivalent Mn, Fe, or both, as confirmed by X-ray diffraction and fluorescence. X-ray absorption fine structure spectra reveal predominantly divalent cations. Ultimately, we introduce oxygen chemical potential overlap as a key complementary descriptor predicting HEO stability and synthesizability. Although we focus on rock salt HEOs, our methods are chemically and structurally agnostic, providing a broadly adaptable framework for navigating HEOs thermodynamics and enabling a broader compositional range with contemporary property interest.
△ Less
Submitted 12 March, 2025; v1 submitted 10 March, 2025;
originally announced March 2025.
-
A comprehensive review on developments of synthetic dimensions
Authors:
Danying Yu,
Wange Song,
Luojia Wang,
Rohith Srikanth,
Sashank Kaushik Sridhar,
Tao Chen,
Chenxi Huang,
Guangzhen Li,
Xin Qiao,
Xiaoxiong Wu,
Zhaohui Dong,
Yanyan He,
Meng Xiao,
Xianfeng Chen,
Avik Dutt,
Bryce Gadway,
Luqi Yuan
Abstract:
The concept of synthetic dimensions has emerged as a powerful framework in photonics and atomic physics, enabling the exploration of high-dimensional physics beyond conventional spatial constraints. Originally developed for quantum simulations in high dimensions, synthetic dimensions have since demonstrated advantages in designing novel Hamiltonians and manipulating quantum or optical states for e…
▽ More
The concept of synthetic dimensions has emerged as a powerful framework in photonics and atomic physics, enabling the exploration of high-dimensional physics beyond conventional spatial constraints. Originally developed for quantum simulations in high dimensions, synthetic dimensions have since demonstrated advantages in designing novel Hamiltonians and manipulating quantum or optical states for exploring topological physics, and for applications in computing and information processing. Here we provide a comprehensive overview of progress in synthetic dimensions across photonic, atomic, and other physical platforms over the past decade. We showcase different approaches used to construct synthetic dimensions and highlight key physical phenomena enabled by the advantage of such a framework. By offering a unified perspective on developments in this field, we aim to provide insights into how synthetic dimensions can bridge fundamental physics and applied technologies, fostering interdisciplinary engagement in quantum simulation, atomic and photonic engineering, and information processing.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
Speculative Ad-hoc Querying
Authors:
Haoyu Li,
Srikanth Kandula,
Maria Angels de Luis Balaguer,
Aditya Akella,
Venkat Arun
Abstract:
Analyzing large datasets requires responsive query execution, but executing SQL queries on massive datasets can be slow. This paper explores whether query execution can begin even before the user has finished typing, allowing results to appear almost instantly. We propose SpeQL, a system that leverages Large Language Models (LLMs) to predict likely queries based on the database schema, the user's…
▽ More
Analyzing large datasets requires responsive query execution, but executing SQL queries on massive datasets can be slow. This paper explores whether query execution can begin even before the user has finished typing, allowing results to appear almost instantly. We propose SpeQL, a system that leverages Large Language Models (LLMs) to predict likely queries based on the database schema, the user's past queries, and their incomplete query. Since exact query prediction is infeasible, SpeQL speculates on partial queries in two ways: 1) it predicts the query structure to compile and plan queries in advance, and 2) it precomputes smaller temporary tables that are much smaller than the original database, but are still predicted to contain all information necessary to answer the user's final query. Additionally, SpeQL continuously displays results for speculated queries and subqueries in real time, aiding exploratory analysis. A utility/user study showed that SpeQL improved task completion time, and participants reported that its speculative display of results helped them discover patterns in the data more quickly. In the study, SpeQL improves user's query latency by up to $289\times$ and kept the overhead reasonable, at $\$4$ per hour.
△ Less
Submitted 1 March, 2025;
originally announced March 2025.
-
Learning Autonomy: Off-Road Navigation Enhanced by Human Input
Authors:
Akhil Nagariya,
Dimitar Filev,
Srikanth Saripalli,
Gaurav Pandey
Abstract:
In the area of autonomous driving, navigating off-road terrains presents a unique set of challenges, from unpredictable surfaces like grass and dirt to unexpected obstacles such as bushes and puddles. In this work, we present a novel learning-based local planner that addresses these challenges by directly capturing human driving nuances from real-world demonstrations using only a monocular camera.…
▽ More
In the area of autonomous driving, navigating off-road terrains presents a unique set of challenges, from unpredictable surfaces like grass and dirt to unexpected obstacles such as bushes and puddles. In this work, we present a novel learning-based local planner that addresses these challenges by directly capturing human driving nuances from real-world demonstrations using only a monocular camera. The key features of our planner are its ability to navigate in challenging off-road environments with various terrain types and its fast learning capabilities. By utilizing minimal human demonstration data (5-10 mins), it quickly learns to navigate in a wide array of off-road conditions. The local planner significantly reduces the real world data required to learn human driving preferences. This allows the planner to apply learned behaviors to real-world scenarios without the need for manual fine-tuning, demonstrating quick adjustment and adaptability in off-road autonomous driving technology.
△ Less
Submitted 14 May, 2025; v1 submitted 25 February, 2025;
originally announced February 2025.
-
Homological properties of the module of differentials
Authors:
Jürgen Herzog,
Benjamin Briggs,
Srikanth B. Iyengar
Abstract:
These notes were produced by Jürgen Herzog to accompany his lectures in Recife, Brazil, in 1980, on the homological algebra of noetherian local rings. They are are concerned with two conjectures made by Wolmer Vasconcelos: if the conormal module of a local ring has finite projective dimension, or if the module of differentials, taken over an appropriate field, has finite projective dimension, then…
▽ More
These notes were produced by Jürgen Herzog to accompany his lectures in Recife, Brazil, in 1980, on the homological algebra of noetherian local rings. They are are concerned with two conjectures made by Wolmer Vasconcelos: if the conormal module of a local ring has finite projective dimension, or if the module of differentials, taken over an appropriate field, has finite projective dimension, then the ring must be complete intersection. The notes present an accessible and self-contained account of the strongest results known at the time in connection with these problems; this includes a number of ideas that have not appeared elsewhere. In the last section, Herzog turns his attention to the cotangent complex, and conjectures himself that if the cotangent complex of a local ring has bounded homology groups, then the ring must be complete intersection. Among other results, he proves that the conjecture holds for local rings of characteristic zero over which all modules have rational Poincaré series.
Sadly Jürgen Herzog passed away in April of 2024. The notes in this form have been prepared in his memory, newly typeset and lightly edited. A short appendix has been added to survey some of the results of the intervening decades.
△ Less
Submitted 19 February, 2025;
originally announced February 2025.
-
AIDE: AI-Driven Exploration in the Space of Code
Authors:
Zhengyao Jiang,
Dominik Schmidt,
Dhruv Srikanth,
Dixing Xu,
Ian Kaplan,
Deniss Jacenko,
Yuxiang Wu
Abstract:
Machine learning, the foundation of modern artificial intelligence, has driven innovations that have fundamentally transformed the world. Yet, behind advancements lies a complex and often tedious process requiring labor and compute intensive iteration and experimentation. Engineers and scientists developing machine learning models spend much of their time on trial-and-error tasks instead of concep…
▽ More
Machine learning, the foundation of modern artificial intelligence, has driven innovations that have fundamentally transformed the world. Yet, behind advancements lies a complex and often tedious process requiring labor and compute intensive iteration and experimentation. Engineers and scientists developing machine learning models spend much of their time on trial-and-error tasks instead of conceptualizing innovative solutions or research hypotheses. To address this challenge, we introduce AI-Driven Exploration (AIDE), a machine learning engineering agent powered by large language models (LLMs). AIDE frames machine learning engineering as a code optimization problem, and formulates trial-and-error as a tree search in the space of potential solutions. By strategically reusing and refining promising solutions, AIDE effectively trades computational resources for enhanced performance, achieving state-of-the-art results on multiple machine learning engineering benchmarks, including our Kaggle evaluations, OpenAI MLE-Bench and METRs RE-Bench.
△ Less
Submitted 18 February, 2025;
originally announced February 2025.
-
A freeness criterion for complexes with derived actions
Authors:
Sylvain Brochard,
Srikanth B. Iyengar,
Chandrashekhar B. Khare
Abstract:
Inspired by the patching method of Calegari and Geraghty, and a conjecture of de Smit that has been proved by the first author, we present a conjectural freeness criterion without patching for complexes over commutative noetherian local rings with derived actions, and verify it in several cases.
Inspired by the patching method of Calegari and Geraghty, and a conjecture of de Smit that has been proved by the first author, we present a conjectural freeness criterion without patching for complexes over commutative noetherian local rings with derived actions, and verify it in several cases.
△ Less
Submitted 18 February, 2025;
originally announced February 2025.
-
Beyond the Lens: Quantifying the Impact of Scientific Documentaries through Amazon Reviews
Authors:
Jill Naiman,
Aria Pessianzadeh,
Hanyu Zhao,
AJ Christensen,
Kalina Borkiewicz,
Shriya Srikanth,
Anushka Gami,
Emma Maxwell,
Louisa Zhang,
Sri Nithya Yeragorla,
Rezvaneh Rezapour
Abstract:
Engaging the public with science is critical for a well-informed population. A popular method of scientific communication is documentaries. Once released, it can be difficult to assess the impact of such works on a large scale, due to the overhead required for in-depth audience feedback studies. In what follows, we overview our complementary approach to qualitative studies through quantitative imp…
▽ More
Engaging the public with science is critical for a well-informed population. A popular method of scientific communication is documentaries. Once released, it can be difficult to assess the impact of such works on a large scale, due to the overhead required for in-depth audience feedback studies. In what follows, we overview our complementary approach to qualitative studies through quantitative impact and sentiment analysis of Amazon reviews for several scientific documentaries. In addition to developing a novel impact category taxonomy for this analysis, we release a dataset containing 1296 human-annotated sentences from 1043 Amazon reviews for six movies created in whole or part by the Advanced Visualization Lab (AVL). This interdisciplinary team is housed at the National Center for Supercomputing Applications and consists of visualization designers who focus on cinematic presentations of scientific data. Using this data, we train and evaluate several machine learning and large language models, discussing their effectiveness and possible generalizability for documentaries beyond those focused on for this work. Themes are also extracted from our annotated dataset which, along with our large language model analysis, demonstrate a measure of the ability of scientific documentaries to engage with the public.
△ Less
Submitted 4 March, 2025; v1 submitted 12 February, 2025;
originally announced February 2025.
-
NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals
Authors:
Neha Srikanth,
Rachel Rudinger
Abstract:
Decomposition of text into atomic propositions is a flexible framework allowing for the closer inspection of input and output text. We use atomic decomposition of hypotheses in two natural language reasoning tasks, traditional NLI and defeasible NLI, to form atomic sub-problems, or granular inferences that models must weigh when solving the overall problem. These atomic sub-problems serve as a too…
▽ More
Decomposition of text into atomic propositions is a flexible framework allowing for the closer inspection of input and output text. We use atomic decomposition of hypotheses in two natural language reasoning tasks, traditional NLI and defeasible NLI, to form atomic sub-problems, or granular inferences that models must weigh when solving the overall problem. These atomic sub-problems serve as a tool to further understand the structure of both NLI and defeasible reasoning, probe a model's consistency and understanding of different inferences, and measure the diversity of examples in benchmark datasets. Our results indicate that LLMs still struggle with logical consistency on atomic NLI and defeasible NLI sub-problems. Lastly, we identify critical atomic sub-problems of defeasible NLI examples, or those that most contribute to the overall label, and propose a method to measure the inferential consistency of a model, a metric designed to capture the degree to which a model makes consistently correct or incorrect predictions about the same fact under different contexts.
△ Less
Submitted 7 March, 2025; v1 submitted 11 February, 2025;
originally announced February 2025.
-
Lepton flavor violation in the Majorana and Dirac scotogenic models
Authors:
Raghavendra Srikanth Hundi
Abstract:
In this work we have considered two minimal versions of scotogenic models, where neutrinos acquire masses through a radiative mechanism. We call these two models as Majorana and Dirac scotogenic models. In the former model, neutrinos have Majorana nature, and in the later one, neutrinos are Dirac particles. These two models are related to each other in terms of additional fields and symmetries of…
▽ More
In this work we have considered two minimal versions of scotogenic models, where neutrinos acquire masses through a radiative mechanism. We call these two models as Majorana and Dirac scotogenic models. In the former model, neutrinos have Majorana nature, and in the later one, neutrinos are Dirac particles. These two models are related to each other in terms of additional fields and symmetries of the model. Hence, to compare these two models in future experiments, we have analyzed lepton flavor violating (LFV) processes in both of them, in the charged lepton sector. We have found that the 3-body LFV decays in both these models can get different contributions. Among all the LFV decays and after satisfying relevant constraints, we have found that $τ\to3μ$ can have a branching ratio as high as $10^{-10}(10^{-11})$ in the Majorana(Dirac) scotogenic model. This branching ratio can be probed in the future planned experiments.
△ Less
Submitted 7 February, 2025;
originally announced February 2025.
-
The Algebraic Cost of a Boolean Sum
Authors:
Ian Orzel,
Srikanth Srinivasan,
Sébastien Tavenas,
Amir Yehudayoff
Abstract:
The P versus NP problem is about the computational power of an existential $\exists_{w \in \{0,1\}^n}$ quantifier. The VP versus VNP problem is about the power of a boolean sum $\sum_{w \in \{0,1\}^n}$ operation. We study the power of a single boolean sum $\sum_{w \in \{0,1\}}$, and prove that in some cases the cost of eliminating this sum is large. This identifies a fundamental difference between…
▽ More
The P versus NP problem is about the computational power of an existential $\exists_{w \in \{0,1\}^n}$ quantifier. The VP versus VNP problem is about the power of a boolean sum $\sum_{w \in \{0,1\}^n}$ operation. We study the power of a single boolean sum $\sum_{w \in \{0,1\}}$, and prove that in some cases the cost of eliminating this sum is large. This identifies a fundamental difference between the permanent and the determinant. This investigation also leads to the simplest proof we are aware of that the permanent is VNP-complete.
△ Less
Submitted 4 February, 2025;
originally announced February 2025.
-
The Dead Internet Theory: A Survey on Artificial Interactions and the Future of Social Media
Authors:
Prathamesh Muzumdar,
Sumanth Cheemalapati,
Srikanth Reddy RamiReddy,
Kuldeep Singh,
George Kurian,
Apoorva Muley
Abstract:
The Dead Internet Theory (DIT) suggests that much of today's internet, particularly social media, is dominated by non-human activity, AI-generated content, and corporate agendas, leading to a decline in authentic human interaction. This study explores the origins, core claims, and implications of DIT, emphasizing its relevance in the context of social media platforms. The theory emerged as a respo…
▽ More
The Dead Internet Theory (DIT) suggests that much of today's internet, particularly social media, is dominated by non-human activity, AI-generated content, and corporate agendas, leading to a decline in authentic human interaction. This study explores the origins, core claims, and implications of DIT, emphasizing its relevance in the context of social media platforms. The theory emerged as a response to the perceived homogenization of online spaces, highlighting issues like the proliferation of bots, algorithmically generated content, and the prioritization of engagement metrics over genuine user interaction. AI technologies play a central role in this phenomenon, as social media platforms increasingly use algorithms and machine learning to curate content, drive engagement, and maximize advertising revenue. While these tools enhance scalability and personalization, they also prioritize virality and consumption over authentic communication, contributing to the erosion of trust, the loss of content diversity, and a dehumanized internet experience. This study redefines DIT in the context of social media, proposing that the commodification of content consumption for revenue has taken precedence over meaningful human connectivity. By focusing on engagement metrics, platforms foster a sense of artificiality and disconnection, underscoring the need for human-centric approaches to revive authentic online interaction and community building.
△ Less
Submitted 6 January, 2025;
originally announced February 2025.
-
Determinants of Human Development Index (HDI): A Regression Analysis of Economic and Social Indicators
Authors:
Kuldeep Singh,
Sumanth Cheemalapati,
Srikanth Reddy RamiReddy,
George Kurian,
Prathamesh Muzumdar,
Apoorva Muley
Abstract:
This study aims to investigate the factors influencing the Human Development Index (HDI). Five variables-GDP per capita, health expenditure, education expenditure, infant mortality rate (per 1,000 live births), and average years of schooling-were analyzed to develop a regression model assessing their impact on HDI. The results indicate that GDP per capita, infant mortality rate, and average years…
▽ More
This study aims to investigate the factors influencing the Human Development Index (HDI). Five variables-GDP per capita, health expenditure, education expenditure, infant mortality rate (per 1,000 live births), and average years of schooling-were analyzed to develop a regression model assessing their impact on HDI. The results indicate that GDP per capita, infant mortality rate, and average years of schooling are significant predictors of HDI. Specifically, the study finds a positive relationship between GDP per capita and average years of schooling with HDI, while infant mortality rate is negatively associated with HDI.
△ Less
Submitted 6 January, 2025;
originally announced February 2025.
-
GO: The Great Outdoors Multimodal Dataset
Authors:
Peng Jiang,
Kasi Viswanath,
Akhil Nagariya,
George Chustz,
Maggie Wigness,
Philip Osteen,
Timothy Overbye,
Christian Ellis,
Long Quang,
Srikanth Saripalli
Abstract:
The Great Outdoors (GO) dataset is a multi-modal annotated data resource aimed at advancing ground robotics research in unstructured environments. This dataset provides the most comprehensive set of data modalities and annotations compared to existing off-road datasets. In total, the GO dataset includes six unique sensor types with high-quality semantic annotations and GPS traces to support tasks…
▽ More
The Great Outdoors (GO) dataset is a multi-modal annotated data resource aimed at advancing ground robotics research in unstructured environments. This dataset provides the most comprehensive set of data modalities and annotations compared to existing off-road datasets. In total, the GO dataset includes six unique sensor types with high-quality semantic annotations and GPS traces to support tasks such as semantic segmentation, object detection, and SLAM. The diverse environmental conditions represented in the dataset present significant real-world challenges that provide opportunities to develop more robust solutions to support the continued advancement of field robotics, autonomous exploration, and perception systems in natural environments. The dataset can be downloaded at: https://www.unmannedlab.org/the-great-outdoors-dataset/
△ Less
Submitted 31 January, 2025;
originally announced January 2025.