-
Exceptional point in a PT symmetric non-Hermitian terahertz plasmonic metasurface
Authors:
Anshul Bhardwaj,
Maidul Islam,
Chandan Kumar,
Anuraj Panwar,
Gagan Kumar
Abstract:
In this paper, we experimentally demonstrate a non-Hermitian open PT-symmetric terahertz metasurface comprising complementary plasmonic structures capable of exhibiting an exceptional point (EP). The metasurface consists of two resonators of different sizes, representing effective gain and loss elements, placed orthogonally in close proximity to realize a non-Hermitian configuration leading to a P…
▽ More
In this paper, we experimentally demonstrate a non-Hermitian open PT-symmetric terahertz metasurface comprising complementary plasmonic structures capable of exhibiting an exceptional point (EP). The metasurface consists of two resonators of different sizes, representing effective gain and loss elements, placed orthogonally in close proximity to realize a non-Hermitian configuration leading to a PT symmetry state. A diagonal displacement of one resonator within this strongly coupled near-field configuration leads to the emergence of an exceptional point, where the system undergoes a sudden phase transition from a PT symmetric to a PT-asymmetric state. Terahertz time-domain spectroscopy (THz-TDS) is performed on the fabricated samples to experimentally validate the phase transition observed in numerical simulations. We employ coupled mode theory (CMT) to analyse and distinguish between the PT-symmetric, exceptional point, and PT-asymmetric states. This theoretical framework enables the calculation of eigenvalues, phase spectra, and eigenmodes associated with the metamaterial design, thereby corroborating the simulation results. Furthermore, we construct Poincare sphere to visualize the orientation of the polarization states of the eigenmodes, which further indicates the presence of the exceptional point. This comprehensive study of exceptional point in a plasmonic system holds potential for the development of practical, highly sensitive terahertz devices, addressing limitations of conventional PT-symmetric systems that rely on traditional gain and loss media.
△ Less
Submitted 22 June, 2025;
originally announced June 2025.
-
Quantitative agreement between experiment and theory for Vibrational Circular Dichroism enhanced by electronically excited states
Authors:
Mariia Sapova,
Chandan Kumar,
Sahar Ashtari-Jafari,
Wybren J. Buma,
Lucas Visscher
Abstract:
Intensity enhancement in vibrational circular dichroism (VCD) arises in open-shell transition metal complexes from coupling between low-lying excited states (LLESs) and ground-state vibrational modes. In this work we apply Nafie's vibronic coupling theory to M(II)-(-)-sparteine-Cl$_2$ to investigate these enhancement effects. We show that the VCD intensity is extremely sensitive to the excitation…
▽ More
Intensity enhancement in vibrational circular dichroism (VCD) arises in open-shell transition metal complexes from coupling between low-lying excited states (LLESs) and ground-state vibrational modes. In this work we apply Nafie's vibronic coupling theory to M(II)-(-)-sparteine-Cl$_2$ to investigate these enhancement effects. We show that the VCD intensity is extremely sensitive to the excitation energies that neither time-dependent density functional theory (TDDFT) nor state-averaged complete active space self consistent field (SA-CASSCF) calculations can predict with sufficient accuracy. We argue that instead of using more accurate quantum chemistry methods these excitation energies can be treated as parameters and optimized against experimental spectra. With this approach we obtain simulated VCD similarity scores above 0.4, a threshold considered reliable for absolute configuration assignment.
△ Less
Submitted 19 June, 2025;
originally announced June 2025.
-
Threshold resummation for $W$-boson pair production at NNLO+NNLL
Authors:
Pulak Banerjee,
Chinmoy Dey,
M. C. Kumar,
Vaibhav Pandey
Abstract:
We present results for threshold resummation of the invariant mass distribution, for on-shell production of a pair of $W$-bosons at next-to-next-to-leading order + next-to-next-to-leading logarithmic (NNLO+NNLL) accuracy in QCD. Owing to its sensitivity to the self-interactions between gauge bosons, this process is important to investigate at the energies of the Large Hadron Collider (LHC). We ach…
▽ More
We present results for threshold resummation of the invariant mass distribution, for on-shell production of a pair of $W$-bosons at next-to-next-to-leading order + next-to-next-to-leading logarithmic (NNLO+NNLL) accuracy in QCD. Owing to its sensitivity to the self-interactions between gauge bosons, this process is important to investigate at the energies of the Large Hadron Collider (LHC). We achieve this resummation by exploiting the factorization properties of the soft and virtual parts of the partonic cross-section. Our analysis has been carried out for the invariant mass distribution up to $Q$ = 2500 GeV. At this highest $Q$ we find that, for 13.6 TeV LHC, the NNLL resummation enhances the NNLO cross-sections by about $6.3\%$ and reduces the conventional scale uncertainties from 6.8\% at NNLO to 4.1\% at NNLO+NNLL. We also estimate the intrinsic uncertainties due to the non-perturbative parton distribution functions at the highest perturbative order, for both fixed-order and resummed results, to be around 3\% for $Q \sim$ 2000 GeV.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion
Authors:
Kumud Tripathi,
Chowdam Venkata Kumar,
Pankaj Wasnik
Abstract:
Voice Activity Detection (VAD) plays a key role in speech processing, often utilizing hand-crafted or neural features. This study examines the effectiveness of Mel-Frequency Cepstral Coefficients (MFCCs) and pre-trained model (PTM) features, including wav2vec 2.0, HuBERT, WavLM, UniSpeech, MMS, and Whisper. We propose FusionVAD, a unified framework that combines both feature types using three fusi…
▽ More
Voice Activity Detection (VAD) plays a key role in speech processing, often utilizing hand-crafted or neural features. This study examines the effectiveness of Mel-Frequency Cepstral Coefficients (MFCCs) and pre-trained model (PTM) features, including wav2vec 2.0, HuBERT, WavLM, UniSpeech, MMS, and Whisper. We propose FusionVAD, a unified framework that combines both feature types using three fusion strategies: concatenation, addition, and cross-attention (CA). Experimental results reveal that simple fusion techniques, particularly addition, outperform CA in both accuracy and efficiency. Fusion-based models consistently surpass single-feature models, highlighting the complementary nature of MFCCs and PTM features. Notably, our best-performing fusion model exceeds the state-of-the-art Pyannote across multiple datasets, achieving an absolute average improvement of 2.04%. These results confirm that simple feature fusion enhances VAD robustness while maintaining computational efficiency.
△ Less
Submitted 2 June, 2025;
originally announced June 2025.
-
Magnetic Charge State Controlled Spin-Wave Dynamics in Nanoscale Three-Dimensional Artificial Spin Ice
Authors:
Chandan Kumar,
Amrit Kumar Mondal,
Sreya Pal,
Sayan Mathur,
Jay R. Scott,
Arjen van Den Berg,
Adekunle O. Adeyeye,
Sam Ladak,
Anjan Barman
Abstract:
Three-dimensional (3D) magnetic nanostructures offer a versatile platform for exploring complex spin textures and spin-wave (SW) dynamics, with implications in next-generation spintronic and magnonic technologies. Advances in 3D nanofabrication have allowed a wide-range of structures and phenomena to be realized. Whilst the study of simple cylindrical magnetic nanowires allows the realization of u…
▽ More
Three-dimensional (3D) magnetic nanostructures offer a versatile platform for exploring complex spin textures and spin-wave (SW) dynamics, with implications in next-generation spintronic and magnonic technologies. Advances in 3D nanofabrication have allowed a wide-range of structures and phenomena to be realized. Whilst the study of simple cylindrical magnetic nanowires allows the realization of ultrafast domain walls and a spin Cherenkov effect, placing such wires of complex cross-section into 3D arrangements allows one to produce magnetic metamaterials, known as artificial spin-ice (ASI), where the overall ground state and spin dynamics are governed by magnetostatic interactions between elements. Here, using Brillouin Light Scattering (BLS) we demonstrate the direct detection of magnetic charged states in a 3D-ASI system. The measured spin-wave modes in 3D-ASI are found to be directly controlled by the local magnetic charge configuration and the direction of the applied magnetic field. Micromagnetic simulations provide insight into the spatially selective excitation of spin waves and the evolution of magnetic microstates, uncovering a direct link to the field-dependent characteristics of the spin-wave spectrum. These findings make 3D-ASI architectures a promising system to realize reconfigurable, low-power magnonic devices with engineered collective dynamics.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
A Data-Driven Probabilistic Framework for Cascading Urban Risk Analysis Using Bayesian Networks
Authors:
Chunduru Rohith Kumar,
PHD Surya Shanmuk,
Prabhala Naga Srinivas,
Sri Venkatesh Lankalapalli,
Debasis Dwibedy
Abstract:
The increasing complexity of cascading risks in urban systems necessitates robust, data-driven frameworks to model interdependencies across multiple domains. This study presents a foundational Bayesian network-based approach for analyzing cross-domain risk propagation across key urban domains, including air, water, electricity, agriculture, health, infrastructure, weather, and climate. Directed Ac…
▽ More
The increasing complexity of cascading risks in urban systems necessitates robust, data-driven frameworks to model interdependencies across multiple domains. This study presents a foundational Bayesian network-based approach for analyzing cross-domain risk propagation across key urban domains, including air, water, electricity, agriculture, health, infrastructure, weather, and climate. Directed Acyclic Graphs (DAGs) are constructed using Bayesian Belief Networks (BBNs), with structure learning guided by Hill-Climbing search optimized through Bayesian Information Criterion (BIC) and K2 scoring. The framework is trained on a hybrid dataset that combines real-world urban indicators with synthetically generated data from Generative Adversarial Networks (GANs), and is further balanced using the Synthetic Minority Over-sampling Technique (SMOTE). Conditional Probability Tables (CPTs) derived from the learned structures enable interpretable probabilistic reasoning and quantify the likelihood of cascading failures. The results identify key intra- and inter-domain risk factors and demonstrate the framework's utility for proactive urban resilience planning. This work establishes a scalable, interpretable foundation for cascading risk assessment and serves as a basis for future empirical research in this emerging interdisciplinary field.
△ Less
Submitted 7 May, 2025;
originally announced May 2025.
-
Combinatorial Identities Using the Matrix Tree Theorem
Authors:
Nayana Shibu Deepthi,
Chanchal Kumar
Abstract:
The matrix tree theorem, initially formulated by Kirchhoff, is a fundamental result in algebraic graph theory that provides an elegant way to count spanning trees using the Laplacian determinant. In this paper, we explore some interesting applications of the matrix tree theorem. In particular, we present a combinatorial interpretation of a distribution of $(n-1)^{n-1}$, in the context of uprooted…
▽ More
The matrix tree theorem, initially formulated by Kirchhoff, is a fundamental result in algebraic graph theory that provides an elegant way to count spanning trees using the Laplacian determinant. In this paper, we explore some interesting applications of the matrix tree theorem. In particular, we present a combinatorial interpretation of a distribution of $(n-1)^{n-1}$, in the context of uprooted spanning trees of the complete graph $K_{n}$, which was previously obtained by Chauve--Dulucq--Guibert. Furthermore, we establish a combinatorial explanation for the distribution of $m^{n-1}n^{m-1}$, related to spanning trees of the complete bipartite graph $K_{m,n}$, which seems new.
△ Less
Submitted 30 April, 2025;
originally announced April 2025.
-
Crossover between the zeptosecond and attosecond physics
Authors:
T. Nandi,
Yash Kumar,
Adya P. Mishra,
Nishchal R. Dwivedi,
Chandra Kumar,
Gajendra Singh,
N. Sowmya,
H. C. Manjunatha,
Sudhir R. Jain,
A. S. Kheifets
Abstract:
Nuclear orbiting resonances have been revealed at the sub-barrier energies as an atomic phenomenon by means of x-ray spectroscopy experiments. This interpretation is supported by several phenomenological models and theoretical estimates of the nuclear orbiting timescale and cross-section, inelastic scattering cross section including both nuclear and Coulomb excitation, and the Wigner-Smith time de…
▽ More
Nuclear orbiting resonances have been revealed at the sub-barrier energies as an atomic phenomenon by means of x-ray spectroscopy experiments. This interpretation is supported by several phenomenological models and theoretical estimates of the nuclear orbiting timescale and cross-section, inelastic scattering cross section including both nuclear and Coulomb excitation, and the Wigner-Smith time delay. We demonstrate that a multi-photon exchange during nuclear orbiting is responsible for an atomic excitation. Furthermore, proximity of the projectile and target nucleus during the nuclear orbiting modifies the effective charge of the projectile. Even though this orbiting induced excitation is triggered in zeptoseconds, it can still be observed in the attosecond time scale because of the Wigner-Smith time delay inherent to autoionization. Thus, we demonstrate the crossover between the zeptosecond and attosecond time scales which are native to nuclear and atomic physics, respectively. Markedly, this crossover may be the reason for x-ray production from ultra short nuclear processes ($\leq 10^{-21}$ sec). This explanation is likely to resolve the fission time scale anomaly and can stimulate cross-disciplinary research ranging from solid state to high-energy physics.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Husimi phase distribution in non-Gaussian operations
Authors:
Ramniwas Meena,
Chandan Kumar,
Subhashish Banerjee
Abstract:
The Husimi phase distribution, an experimentally measurable quantity, is investigated for single-mode and two-mode squeezed vacuum states. The analysis highlights that non-Gaussian operations, i.e., photon subtraction (PS), photon addition (PA) and photon catalysis (PC), are effective tools for localizing phase distribution and enhancing phase robustness in the presence of noise, while PC enhances…
▽ More
The Husimi phase distribution, an experimentally measurable quantity, is investigated for single-mode and two-mode squeezed vacuum states. The analysis highlights that non-Gaussian operations, i.e., photon subtraction (PS), photon addition (PA) and photon catalysis (PC), are effective tools for localizing phase distribution and enhancing phase robustness in the presence of noise, while PC enhances phase sensitivity but leads to greater delocalization. The work highlights the perspective that combined effects of squeezing, beam splitter transmittance, and environmental interactions must be carefully considered when quantum state engineering protocols are designed and phase properties provide a valuable insight into this endeavour.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
No LLM is Free From Bias: A Comprehensive Study of Bias Evaluation in Large Language Models
Authors:
Charaka Vinayak Kumar,
Ashok Urlana,
Gopichand Kanumolu,
Bala Mallikarjunarao Garlapati,
Pruthwik Mishra
Abstract:
Advancements in Large Language Models (LLMs) have increased the performance of different natural language understanding as well as generation tasks. Although LLMs have breached the state-of-the-art performance in various tasks, they often reflect different forms of bias present in the training data. In the light of this perceived limitation, we provide a unified evaluation of benchmarks using a se…
▽ More
Advancements in Large Language Models (LLMs) have increased the performance of different natural language understanding as well as generation tasks. Although LLMs have breached the state-of-the-art performance in various tasks, they often reflect different forms of bias present in the training data. In the light of this perceived limitation, we provide a unified evaluation of benchmarks using a set of representative small and medium-sized LLMs that cover different forms of biases starting from physical characteristics to socio-economic categories. Moreover, we propose five prompting approaches to carry out the bias detection task across different aspects of bias. Further, we formulate three research questions to gain valuable insight in detecting biases in LLMs using different approaches and evaluation metrics across benchmarks. The results indicate that each of the selected LLMs suffer from one or the other form of bias with the Phi-3.5B model being the least biased. Finally, we conclude the paper with the identification of key challenges and possible future directions.
△ Less
Submitted 27 May, 2025; v1 submitted 14 March, 2025;
originally announced March 2025.
-
First- and Half-order Schemes for Regime Switching Stochastic Differential Equation with Non-differentiable Drift Coefficient
Authors:
Divyanshu Vashistha,
Chaman Kumar
Abstract:
An explicit first-order drift-randomized Milstein scheme for a regime switching stochastic differential equation is proposed and its bi-stability and rate of strong convergence are investigated for a non-differentiable drift coefficient.
Precisely, drift is Lipschitz continuous while diffusion along with its derivative is Lipschitz continuous.
Further, we explore the significance of evaluating…
▽ More
An explicit first-order drift-randomized Milstein scheme for a regime switching stochastic differential equation is proposed and its bi-stability and rate of strong convergence are investigated for a non-differentiable drift coefficient.
Precisely, drift is Lipschitz continuous while diffusion along with its derivative is Lipschitz continuous.
Further, we explore the significance of evaluating Brownian trajectories at every switching time of the underlying Markov chain in achieving the convergence rate $1.0$ of the proposed scheme.
In this context, possible variants of the scheme, namely modified randomized and reduced randomized schemes, are considered and their convergence rates are shown to be $1/2$.
Numerical experiments are performed to illustrate the convergence rates of these schemes along with their corresponding non-randomized versions.
Further, it is illustrated that the half-order non-randomized reduced and modified schemes outperforms the classical Euler scheme.
△ Less
Submitted 9 March, 2025;
originally announced March 2025.
-
HalluCounter: Reference-free LLM Hallucination Detection in the Wild!
Authors:
Ashok Urlana,
Gopichand Kanumolu,
Charaka Vinayak Kumar,
Bala Mallikarjunarao Garlapati,
Rahul Mishra
Abstract:
Response consistency-based, reference-free hallucination detection (RFHD) methods do not depend on internal model states, such as generation probabilities or gradients, which Grey-box models typically rely on but are inaccessible in closed-source LLMs. However, their inability to capture query-response alignment patterns often results in lower detection accuracy. Additionally, the lack of large-sc…
▽ More
Response consistency-based, reference-free hallucination detection (RFHD) methods do not depend on internal model states, such as generation probabilities or gradients, which Grey-box models typically rely on but are inaccessible in closed-source LLMs. However, their inability to capture query-response alignment patterns often results in lower detection accuracy. Additionally, the lack of large-scale benchmark datasets spanning diverse domains remains a challenge, as most existing datasets are limited in size and scope. To this end, we propose HalluCounter, a novel reference-free hallucination detection method that utilizes both response-response and query-response consistency and alignment patterns. This enables the training of a classifier that detects hallucinations and provides a confidence score and an optimal response for user queries. Furthermore, we introduce HalluCounterEval, a benchmark dataset comprising both synthetically generated and human-curated samples across multiple domains. Our method outperforms state-of-the-art approaches by a significant margin, achieving over 90\% average confidence in hallucination detection across datasets.
△ Less
Submitted 27 May, 2025; v1 submitted 6 March, 2025;
originally announced March 2025.
-
Next to Soft Threshold Resummation for $VH$ Production
Authors:
Arunima Bhattacharya,
Chinmoy Dey,
M. C. Kumar,
Vaibhav Pandey
Abstract:
We study the threshold effects for the associated production of a Higgs boson with a massive vector boson $(V=Z,W)$ in the $q\bar{q} \rightarrow V^\star \rightarrow VH$ process at the LHC. By leveraging the universality of threshold logarithms and employing soft-virtual (SV) and next-to-soft virtual (NSV) resummation techniques, we compute threshold corrections to next-to-next-to-leading logarithm…
▽ More
We study the threshold effects for the associated production of a Higgs boson with a massive vector boson $(V=Z,W)$ in the $q\bar{q} \rightarrow V^\star \rightarrow VH$ process at the LHC. By leveraging the universality of threshold logarithms and employing soft-virtual (SV) and next-to-soft virtual (NSV) resummation techniques, we compute threshold corrections to next-to-next-to-leading logarithmic accuracy. After matching the resummed predictions to the Next-to-Next-to-Leading order (NNLO) fixed order results, we present the invariant mass distribution to NNLO$+\overline{\text{NNLL}}$ accuracy in QCD for the current LHC energies and the total production cross sections. The $VH$ production channel is crucial for studying the couplings of the Higgs boson to the vector bosons $(W,Z)$ and understanding the mechanism of electroweak symmetry breaking. Precision measurements of this process help test the validity of the standard model (SM) and can reveal potential deviations indicating new physics.
△ Less
Submitted 27 February, 2025;
originally announced February 2025.
-
Continuous variable quantum teleportation, $U(2)$ invariant squeezing and non-Gaussian resource states
Authors:
Mohak Sharma,
Chandan Kumar,
Shikhar Arora,
Arvind
Abstract:
We investigate the role of quadrature squeezing in the quantum teleportation protocol for coherent states, using non-Gaussian resource states. For the two-mode systems, the non-Gaussian resource states that we use are obtained by an experimentally realizable scheme of photon subtraction, photon addition, and photon catalysis, on the two-mode squeezed vacuum, and two-mode squeezed thermal states. W…
▽ More
We investigate the role of quadrature squeezing in the quantum teleportation protocol for coherent states, using non-Gaussian resource states. For the two-mode systems, the non-Gaussian resource states that we use are obtained by an experimentally realizable scheme of photon subtraction, photon addition, and photon catalysis, on the two-mode squeezed vacuum, and two-mode squeezed thermal states. We first analyze the non-classical attribute of quadrature squeezing in these generated non-Gaussian states using the $U(2)$ invariant squeezing approach, which allows us to account for all possible quadratures. We then show that the presence of such non-classicality in non-Gaussian resource states is not necessary for successful quantum teleportation, a finding which is at variance with an earlier result in this direction. This result is important since it demonstrates how non-classicality other than quadrature squeezing present in the resource can be utilized for quantum teleportation.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
Provisioning Time-Based Subscription in NDN: A Secure and Efficient Access Control Scheme
Authors:
Nazatul H. Sultan,
Chandan Kumar,
Saurab Dulal,
Vijay Varadharajan,
Seyit Camtepe,
Surya Nepal
Abstract:
This paper proposes a novel encryption-based access control mechanism for Named Data Networking (NDN). The scheme allows data producers to share their content in encrypted form before transmitting it to consumers. The encryption mechanism incorporates time-based subscription access policies directly into the encrypted content, enabling only consumers with valid subscriptions to decrypt it. This ma…
▽ More
This paper proposes a novel encryption-based access control mechanism for Named Data Networking (NDN). The scheme allows data producers to share their content in encrypted form before transmitting it to consumers. The encryption mechanism incorporates time-based subscription access policies directly into the encrypted content, enabling only consumers with valid subscriptions to decrypt it. This makes the scheme well-suited for real-world, subscription-based applications like Netflix. Additionally, the scheme introduces an anonymous and unlinkable signature-based authentication mechanism that empowers edge routers to block bogus content requests at the network's entry point, thereby mitigating Denial of Service (DoS) attacks. A formal security proof demonstrates the scheme's resistance to Chosen Plaintext Attacks (CPA). Performance analysis, using Mini-NDN-based emulation and a Charm library implementation, further confirms the practicality of the scheme. Moreover, it outperforms closely related works in terms of functionality, security, and communication overhead.
△ Less
Submitted 27 January, 2025;
originally announced January 2025.
-
Soft gluon resummation for gluon fusion $ZH$ production
Authors:
Goutam Das,
Chinmoy Dey,
M. C. Kumar,
Kajal Samanta
Abstract:
We examine the effects of soft gluons on Higgs boson production in association with a $Z$ boson at the Large Hadron Collider (LHC). Utilizing the universal cusp anomalous dimensions and splitting kernels, we analyze effects of soft gluons on the gluon fusion $ZH$ process, focusing on the total production cross-section as well as the invariant mass distribution at the next-to-leading logarithmic le…
▽ More
We examine the effects of soft gluons on Higgs boson production in association with a $Z$ boson at the Large Hadron Collider (LHC). Utilizing the universal cusp anomalous dimensions and splitting kernels, we analyze effects of soft gluons on the gluon fusion $ZH$ process, focusing on the total production cross-section as well as the invariant mass distribution at the next-to-leading logarithmic level. Additionally, we estimate the next-to-soft effects on this subprocess to the same level of accuracy. A detailed phenomenological analysis is performed for the $13.6$ TeV LHC. Finally, combining these results with those from other subprocesses, we provide comprehensive predictions for the $ZH$ production cross-section and the invariant mass distribution that will be valuable for comparison with experimental data from the upcoming LHC run as well as the future hadron colliders.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
RUL forecasting for wind turbine predictive maintenance based on deep learning
Authors:
Syed Shazaib Shah,
Tan Daoliang,
Sah Chandan Kumar
Abstract:
Predictive maintenance (PdM) is increasingly pursued to reduce wind farm operation and maintenance costs by accurately predicting the remaining useful life (RUL) and strategically scheduling maintenance. However, the remoteness of wind farms often renders current methodologies ineffective, as they fail to provide a sufficiently reliable advance time window for maintenance planning, limiting PdM's…
▽ More
Predictive maintenance (PdM) is increasingly pursued to reduce wind farm operation and maintenance costs by accurately predicting the remaining useful life (RUL) and strategically scheduling maintenance. However, the remoteness of wind farms often renders current methodologies ineffective, as they fail to provide a sufficiently reliable advance time window for maintenance planning, limiting PdM's practicality. This study introduces a novel deep learning (DL) methodology for future RUL forecasting. By employing a multi-parametric attention-based DL approach that bypasses feature engineering, thereby minimizing the risk of human error, two models: ForeNet-2d and ForeNet-3d are proposed. These models successfully forecast the RUL for seven multifaceted wind turbine (WT) failures with a 2-week forecast window. The most precise forecast deviated by only 10 minutes from the actual RUL, while the least accurate prediction deviated by 1.8 days, with most predictions being off by only a few hours. This methodology offers a substantial time frame to access remote WTs and perform necessary maintenance, thereby enabling the practical implementation of PdM.
△ Less
Submitted 9 December, 2024;
originally announced December 2024.
-
Movie Recommendation using Web Crawling
Authors:
Pronit Raj,
Chandrashekhar Kumar,
Harshit Shekhar,
Amit Kumar,
Kritibas Paul,
Debasish Jana
Abstract:
In today's digital world, streaming platforms offer a vast array of movies, making it hard for users to find content matching their preferences. This paper explores integrating real time data from popular movie websites using advanced HTML scraping techniques and APIs. It also incorporates a recommendation system trained on a static Kaggle dataset, enhancing the relevance and freshness of suggesti…
▽ More
In today's digital world, streaming platforms offer a vast array of movies, making it hard for users to find content matching their preferences. This paper explores integrating real time data from popular movie websites using advanced HTML scraping techniques and APIs. It also incorporates a recommendation system trained on a static Kaggle dataset, enhancing the relevance and freshness of suggestions. By combining content based filtering, collaborative filtering, and a hybrid model, we create a system that utilizes both historical and real time data for more personalized suggestions. Our methodology shows that incorporating dynamic data not only boosts user satisfaction but also aligns recommendations with current viewing trends.
△ Less
Submitted 14 December, 2024;
originally announced December 2024.
-
Exotic Coherent Structures and Their Collisional Dynamics in a (3+1) dimensional Bogoyavlensky-Konopelchenko Equation
Authors:
C. Senthil Kumar,
R. Radha
Abstract:
In this paper, we analyse the (3+1) dimensional Bogoyavlensky - Konopelchenko equation. Using Painlevé Truncation approach, we have constructed solutions in terms of lower dimensional arbitrary functions of space and time. By suitably harnessing the arbitrary functions present in the solution, we have generated physically interesting solutions like periodic solutions, kinks, linear rogue waves, li…
▽ More
In this paper, we analyse the (3+1) dimensional Bogoyavlensky - Konopelchenko equation. Using Painlevé Truncation approach, we have constructed solutions in terms of lower dimensional arbitrary functions of space and time. By suitably harnessing the arbitrary functions present in the solution, we have generated physically interesting solutions like periodic solutions, kinks, linear rogue waves, line lumps, dipole lumps and hybrid dromions. It is interesting to note that unlike in (2+1) dimensional nonlinear partial differential equations, the line lumps interact and undergo elastic collision without exchange of energy which is confirmed by the asymptotic analysis. The hybrid dromions are also found to retain their amplitudes during interaction undergoing elastic collision. The highlight of the results is that one also observes the two nonparallel ghost solitons as well whose intersection gives rise to hybrid dromions, a phenomenon not witnessed in (2+1) dimensions.
△ Less
Submitted 13 December, 2024;
originally announced December 2024.
-
Pseudo-scalar Higgs decay to three parton amplitudes at NNLO to higher orders in dimensional regulator
Authors:
Pulak Banerjee,
Chinmoy Dey,
M. C. Kumar,
V. Ravindran
Abstract:
We present for the first time the second-order corrections of pseudo-scalar($A$) Higgs decay to three parton to higher orders in the dimensional regulator. We compute the one and two-loop amplitudes for processes, $A\to ggg$ and $A\to q\bar{q}g$ in the effective theory framework. With suitable crossing of the external momenta, these calculations are well-suited for predicting the differential dist…
▽ More
We present for the first time the second-order corrections of pseudo-scalar($A$) Higgs decay to three parton to higher orders in the dimensional regulator. We compute the one and two-loop amplitudes for processes, $A\to ggg$ and $A\to q\bar{q}g$ in the effective theory framework. With suitable crossing of the external momenta, these calculations are well-suited for predicting the differential distribution of pseudo-scalar Higgs in association with a jet at hadron colliders, up to next-to-next-to-leading order (NNLO) in the strong coupling constant. These results expanded to higher orders in dimensional regulator will contribute to the full three loop cross section. We implement the finite pieces of the amplitudes in a numerical code which can be used with any Monte Carlo phase space generator.
△ Less
Submitted 26 November, 2024;
originally announced November 2024.
-
Energy-efficient Federated Learning with Dynamic Model Size Allocation
Authors:
M S Chaitanya Kumar,
Sai Satya Narayana J,
Yunkai Bao,
Xin Wang,
Steve Drew
Abstract:
Federated Learning (FL) presents a paradigm shift towards distributed model training across isolated data repositories or edge devices without explicit data sharing. Despite of its advantages, FL is inherently less efficient than centralized training models, leading to increased energy consumption and, consequently, higher carbon emissions. In this paper, we propose CAMA, a carbon-aware FL framewo…
▽ More
Federated Learning (FL) presents a paradigm shift towards distributed model training across isolated data repositories or edge devices without explicit data sharing. Despite of its advantages, FL is inherently less efficient than centralized training models, leading to increased energy consumption and, consequently, higher carbon emissions. In this paper, we propose CAMA, a carbon-aware FL framework, promoting the operation on renewable excess energy and spare computing capacity, aiming to minimize operational carbon emissions. CAMA introduces a dynamic model adaptation strategy which adapts the model sizes based on the availability of energy and computing resources. Ordered dropout is integratged to enable the aggregation with varying model sizes. Empirical evaluations on real-world energy and load traces demonstrate that our method achieves faster convergence and ensures equitable client participation, while scaling efficiently to handle large numbers of clients. The source code of CAMA is available at https://github.com/denoslab/CAMA.
△ Less
Submitted 23 November, 2024;
originally announced November 2024.
-
Milstein-type schemes for McKean-Vlasov SDEs driven by Brownian motion and Poisson random measure (with super-linear coefficients)
Authors:
Sani Biswas,
Chaman Kumar,
Christoph Reisinger,
Verena Schwarz
Abstract:
In this work, we present a general Milstein-type scheme for McKean-Vlasov stochastic differential equations (SDEs) driven by Brownian motion and Poisson random measure and the associated system of interacting particles where drift, diffusion and jump coefficients may grow super-linearly in the state variable and linearly in the measure component. The strong rate of $\mathcal{L}^2$-convergence of t…
▽ More
In this work, we present a general Milstein-type scheme for McKean-Vlasov stochastic differential equations (SDEs) driven by Brownian motion and Poisson random measure and the associated system of interacting particles where drift, diffusion and jump coefficients may grow super-linearly in the state variable and linearly in the measure component. The strong rate of $\mathcal{L}^2$-convergence of the proposed scheme is shown to be arbitrarily close to one under appropriate regularity assumptions on the coefficients. For the derivation of the Milstein scheme and to show its strong rate of convergence, we provide an Itô formula for the interacting particle system connected with the McKean-Vlasov SDE driven by Brownian motion and Poisson random measure. Moreover, we use the notion of Lions derivative to examine our results. The two-fold challenges arising due to the presence of the empirical measure and super-linearity of the jump coefficient are resolved by identifying and exploiting an appropriate coercivity-type condition.
△ Less
Submitted 7 January, 2025; v1 submitted 18 November, 2024;
originally announced November 2024.
-
Mean-field equation for phase-modulated optical parametric oscillator
Authors:
A. D. Sanchez,
S. Chaitanya Kumar,
M. Ebrahim-Zadeh
Abstract:
The widely established techniques for the generation of ultrashort optical pulses rely on passive mode-locking of lasers, with the output pulse duration and emission spectrum determined by the intrinsic lifetime of laser transition in the gain medium. Due to the instantaneous nature of nonlinear gain, optical parametric oscillators (OPOs) are capable of generating optical radiation in all time sca…
▽ More
The widely established techniques for the generation of ultrashort optical pulses rely on passive mode-locking of lasers, with the output pulse duration and emission spectrum determined by the intrinsic lifetime of laser transition in the gain medium. Due to the instantaneous nature of nonlinear gain, optical parametric oscillators (OPOs) are capable of generating optical radiation in all time scales from continuous-wave (cw) to ultrashort femtosecond regime, if driven by laser pump sources in the corresponding time domain. In the ultrashort time scale, operation of OPOs conventionally relies on mode-locked pump lasers, with the concomitant disadvantages of large footprint and high cost. At the same time, the lack of gain storage mandates the use of synchronous pumping, resulting in increased complexity. In this paper, we present the concept of phase-modulated OPO driven by cw pump laser. The approach overcomes the traditional drawbacks of ultrafast OPOs, enabling femtosecond pulse generation without the need for synchronous pumping, resulting in a simplified, compact and cost-effective architecture using cw input pump lasers. We derive a mean-field equation for a degenerate $χ^{(2)}$ OPO driven by a cw laser with intracavity electro-optic modulator (EOM), and also including dispersion compensation. The new equation predicts the formation of stable femtosecond pulses ($<$200~fs), in both normal and anomalous dispersion regimes, with a controllable repetition rate determined by the frequency of the EOM. The remarkable functionality of the proposed scheme paves the way for the development of a new class of widely tunable coherent femtosecond light sources in both bulk and integrated format based on $χ^{(2)}$ OPOs using cw pump lasers.
△ Less
Submitted 7 October, 2024;
originally announced October 2024.
-
Quadratic frequency comb based on phase-modulated cw-driven optical parametric oscillator with intracavity dispersion control
Authors:
A. D. Sanchez,
S. Chaitanya Kumar,
M. Ebrahim-Zadeh
Abstract:
We report a novel configuration of bulk $χ^{(2)}$ optical parametric oscillator (OPO) capable of delivering coherent broadband phase-locked spectrum when driven by a continuous-wave (cw) pump laser. By deploying an electro-optic modulator (EOM) internal to a degenerate cw OPO based on MgO:sPPLT or MgO:PPLN and implementing intracavity dispersion control, we show that output spectra extending over…
▽ More
We report a novel configuration of bulk $χ^{(2)}$ optical parametric oscillator (OPO) capable of delivering coherent broadband phase-locked spectrum when driven by a continuous-wave (cw) pump laser. By deploying an electro-optic modulator (EOM) internal to a degenerate cw OPO based on MgO:sPPLT or MgO:PPLN and implementing intracavity dispersion control, we show that output spectra extending over 9 nm (119 nm) with well-defined phase coherence can be generated. Pumping the cw OPO at 532~nm (1550~nm) for degenerate operation in the normal (anomalous) dispersion regime at 1064~nm (3100~nm), and using the coupled-wave equations to simulate the OPO in the presence of intracavity dispersion compensation, we demonstrate that this device offers a new alternative for $χ^{(2)}$-based frequency comb generation. We also show that in the time domain the output of such a device corresponds to femtosecond quadratic solitons in both dispersion regimes. The described concept is generic, paving the way for the realization of a new class of coherent broadband frequency comb sources in different spectral regions based on $χ^{(2)}$ OPOs pumped by cw lasers.
△ Less
Submitted 7 October, 2024;
originally announced October 2024.
-
CUDA-based focused Gaussian beams second-harmonic generation efficiency calculator
Authors:
A. D. Sanchez,
S. Chaitanya Kumar,
M. Ebrahim-Zadeh
Abstract:
We present an object-oriented programming (OOP) CUDA-based package for fast and accurate simulation of second-harmonic generation (SHG) efficiency using focused Gaussian beams. The model includes linear as well as two-photon absorption that can ultimately lead to thermal lensing due to self-heating effects. Our approach speeds up calculations by nearly 40x (11x) without (with) temperature profiles…
▽ More
We present an object-oriented programming (OOP) CUDA-based package for fast and accurate simulation of second-harmonic generation (SHG) efficiency using focused Gaussian beams. The model includes linear as well as two-photon absorption that can ultimately lead to thermal lensing due to self-heating effects. Our approach speeds up calculations by nearly 40x (11x) without (with) temperature profiles with respect to an equivalent implementation using CPU. The package offers a valuable tool for experimental design and study of 3D field propagation in nonlinear three-wave interactions. It is useful for optimization of SHG-based experiments and mitigates undesired thermal effects, enabling improved oven designs and advanced device architectures, leading to stable, efficient high-power SHG.
△ Less
Submitted 7 October, 2024;
originally announced October 2024.
-
Threshold resummation for $Z$-boson pair production at NNLO+NNLL
Authors:
Pulak Banerjee,
Chinmoy Dey,
M. C. Kumar,
Vaibhav Pandey
Abstract:
The production of a pair of on-shell $Z$-bosons is an important process at the Large Hadron Collider. Owing to its large production cross section at the LHC, this process is very useful for SM precision studies, electroweak symmetry breaking sector as well as to unravel the possible new physics. In this work, we have performed the threshold resummation of the large logarithms that arise in the par…
▽ More
The production of a pair of on-shell $Z$-bosons is an important process at the Large Hadron Collider. Owing to its large production cross section at the LHC, this process is very useful for SM precision studies, electroweak symmetry breaking sector as well as to unravel the possible new physics. In this work, we have performed the threshold resummation of the large logarithms that arise in the partonic threshold limit $z \to 1$, up to Next-to-Next-to-Leading Logarithmic (NNLL) accuracy. The presence of the two-loop contributions in the process dependent resummation coefficient $g_0$ makes the numerical computation a non-trivial task. After matching the resummed predictions to the Next-to-Next-to-Leading order (NNLO) fixed order results, we present the invariant mass distribution to NNLO+NNLL accuracy in QCD for the current LHC energies. We find that in the high invariant mass region ($Q=1$ TeV), while the NNLO corrections are as large as $83\%$ with respect to the leading order, the NNLL contribution enhances the cross section by additional few percent, about $4\%$ for $13.6$ TeV LHC. In this invariant mass region, the conventional scale uncertainties in the fixed order results get reduced from $3.4\%$ at NNLO to about $2.6\%$ at NNLO+NNLL, and this reduction is expected to be more for higher $Q$ values.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
Comparing on-off detector and single photon detector in photon subtraction based continuous variable quantum teleportation
Authors:
Chandan Kumar,
Karunesh K. Mishra,
Sibasish Ghosh
Abstract:
We consider here two distinct photon detectors namely, single photon detector and on-off detector, to implement photon subtraction on a two-mode squeezed vacuum (TMSV) state. The two distinct photon subtracted TMSV states generated are utilized individually as resource states in continuous variable quantum teleportation. Owing to the fact that the two generated states have different success probab…
▽ More
We consider here two distinct photon detectors namely, single photon detector and on-off detector, to implement photon subtraction on a two-mode squeezed vacuum (TMSV) state. The two distinct photon subtracted TMSV states generated are utilized individually as resource states in continuous variable quantum teleportation. Owing to the fact that the two generated states have different success probabilities (of photon subtraction) and fidelities (of quantum teleportation), we consider the product of the success probability and fidelity enhancement as a figure of merit for the comparison of the two detectors. The results show that the single photon detector should be preferred over the on-off detector for the maximization of the considered figure of merit.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
Quantum Computing for Automotive Applications
Authors:
Carlos A. RiofrÃo,
Johannes Klepsch,
Jernej Rudi Finžgar,
Florian Kiwit,
Leonhard Hölscher,
Marvin Erdmann,
Lukas Müller,
Chandan Kumar,
Youssef Achari Berrada,
Andre Luckow
Abstract:
Quantum computing could impact various industries, with the automotive industry with many computational challenges, from optimizing supply chains and manufacturing to vehicle engineering, being particularly promising. This chapter investigates state-of-the-art quantum algorithms to enhance efficiency, accuracy, and scalability across the automotive value chain. We explore recent advances in quantu…
▽ More
Quantum computing could impact various industries, with the automotive industry with many computational challenges, from optimizing supply chains and manufacturing to vehicle engineering, being particularly promising. This chapter investigates state-of-the-art quantum algorithms to enhance efficiency, accuracy, and scalability across the automotive value chain. We explore recent advances in quantum optimization, machine learning, and numerical and chemistry simulations, highlighting their potential and limitations. We identify and discuss key challenges in near-term and fault-tolerant algorithms and their practical use in industrial applications. While quantum algorithms show potential in many application domains, current noisy intermediate-scale quantum hardware limits scale and, thus, business benefits. In the long term, fault-tolerant systems promise theoretical speedups; however, they also require further progress in hardware and software (e.g., related to error correction and data loading). We expect that with this progress, significant practical benefits will emerge eventually.
△ Less
Submitted 24 March, 2025; v1 submitted 21 September, 2024;
originally announced September 2024.
-
Probing Elastic Isotropy in Entropy Stabilized Transition Metal Oxides: Experimental Estimation of Single Crystal Elastic Constants from Polycrystalline Materials
Authors:
Lalith Kumar Bhaskar,
Niraja Moharana,
Hendrik Holz,
Rajaprakash Ramachandramoorthy,
K. C. Hari Kumar,
Ravi Kumar
Abstract:
Single Crystal Elastic Constants (SECs) are pivotal for understanding material deformation, validating interatomic potentials, and enabling crucial material simulations. The entropy stabilized oxide showcases intriguing properties, underscoring the necessity for the determination of precise SECs to establish reliable interatomic potential and unlock its full potential using simulations. This study…
▽ More
Single Crystal Elastic Constants (SECs) are pivotal for understanding material deformation, validating interatomic potentials, and enabling crucial material simulations. The entropy stabilized oxide showcases intriguing properties, underscoring the necessity for the determination of precise SECs to establish reliable interatomic potential and unlock its full potential using simulations. This study presents an innovative methodology for estimating SECs from polycrystalline materials, requiring only two diffraction elastic constants and isotropic elastic constants for crystals with cubic symmetry. Validation using phase-pure nickel demonstrated good agreement with existing literature values, with a maximum 11.5% deviation for $C_{12}$ values. Extending the methodology to [(MgNiCoCuZn)O], SECs were calculated: 219 GPa for $C_{11}$, 116 GPa for $C_{12}$, and 51 GPa for $C_{44}$. Comparison with literature-reported values from DFT calculations revealed a significant divergence, ranging from 25% to 59% in the bulk and shear modulus calculated using the Voigt-Reuss-Hill average. To comprehend this disparity, we conducted DFT calculations and thoroughly examined the factors influencing these values. This study not only introduces a straightforward and dependable SEC estimation methodology but also provides precise experimental SEC values for [(MgNiCoCuZn)O] entropy stabilized oxides at ambient conditions, crucial for developing accurate interatomic potentials in future research.
△ Less
Submitted 20 September, 2024;
originally announced September 2024.
-
Characterisation of Front-End Electronics of ChaSTE experiment onboard Chandayaan-3 lander
Authors:
K. Durga Prasad,
Chandan Kumar,
Sanjeev K. Mishra,
P. Kalyana S. Reddy,
Janmejay Kumar,
Tinkal Ladiya,
Arpit Patel,
Anil Bhardwaj
Abstract:
Chandra Surface Thermophysical Experiment (ChaSTE) is one of the payloads flown onboard the Chandrayaan-3 lander. The objective of the experiment is in-situ investigation of thermal behaviour of outermost 100 mm layer of the lunar surface by deploying a thermal probe. The probe consists of 10 temperature sensors (Platinum RTDs) mounted at different locations along the length of the probe to measur…
▽ More
Chandra Surface Thermophysical Experiment (ChaSTE) is one of the payloads flown onboard the Chandrayaan-3 lander. The objective of the experiment is in-situ investigation of thermal behaviour of outermost 100 mm layer of the lunar surface by deploying a thermal probe. The probe consists of 10 temperature sensors (Platinum RTDs) mounted at different locations along the length of the probe to measure lunar soil temperatures as a function of depth. A heater is also mounted on the probe for thermal conductivity measurements. The onboard electronics of ChaSTE has two parts, Front-End Electronics (FEE) and processing electronics (PE). The front-end electronics (FEE) card is responsible for carrying out necessary sensor signal conditioning,which includes exciting the RTD sensors,acquiring analog voltages and then converting the acquired analog signals to digital signals using an Analog to Digital Converter(ADC). The front-end card is further interfaced with the processing electronics card for digital processing and spacecraft interface.The calibration, characterisation and functional test activities of Front-End Electronics of ChaSTE were carried out with the objective of testing and ensuring proper functionality and performance.A two phase calibration process involving electronic offset correction and temperature calibration were carried out. All these activities were successfully completed and the results from them provided us with a really good understanding of the behaviour of the FEE under different thermal and electrical conditions as well as when subjected to the simulated conditions of the actual ChaSTE experiment. The performance of the ChaSTE front-end electronics was very much within the design margins and its behaviour in simulated lunar environment was as desired. The data from these activities is useful in the interpretation of the actual science data of ChaSTE.
△ Less
Submitted 30 August, 2024;
originally announced September 2024.
-
No Size Fits All: The Perils and Pitfalls of Leveraging LLMs Vary with Company Size
Authors:
Ashok Urlana,
Charaka Vinayak Kumar,
Bala Mallikarjunarao Garlapati,
Ajeet Kumar Singh,
Rahul Mishra
Abstract:
Large language models (LLMs) are playing a pivotal role in deploying strategic use cases across a range of organizations, from large pan-continental companies to emerging startups. The issues and challenges involved in the successful utilization of LLMs can vary significantly depending on the size of the organization. It is important to study and discuss these pertinent issues of LLM adaptation wi…
▽ More
Large language models (LLMs) are playing a pivotal role in deploying strategic use cases across a range of organizations, from large pan-continental companies to emerging startups. The issues and challenges involved in the successful utilization of LLMs can vary significantly depending on the size of the organization. It is important to study and discuss these pertinent issues of LLM adaptation with a focus on the scale of the industrial concerns and brainstorm possible solutions and prospective directions. Such a study has not been prominently featured in the current research literature. In this study, we adopt a threefold strategy: first, we conduct a case study with industry practitioners to formulate the key research questions; second, we examine existing industrial publications to address these questions; and finally, we provide a practical guide for industries to utilize LLMs more efficiently. We release the GitHub\footnote{\url{https://github.com/vinayakcse/IndustrialLLMsPapers}} repository with the most recent papers in the field.
△ Less
Submitted 1 December, 2024; v1 submitted 21 July, 2024;
originally announced August 2024.
-
Synchrotron x-ray diffraction and DFT study of non-centrosymmetric EuRhGe3 under high pressure
Authors:
N. S. Dhami,
V. Balédent,
I. Batistić,
O. Bednarchuk,
D. Kaczorowski,
J. P. Itié,
S. R. Shieh,
C. M. N. Kumar,
Y. Utsumi
Abstract:
Antiferromagnetic intermetallic compound EuRhGe3 crystalizes in a non-centrosymmetric BaNiSn3-type (I4mm) structure. We studied its pressure-dependent crystal structure by using synchrotron powder x-ray diffraction at room temperature. Our results show a smooth contraction of the unit cell volume by applying pressure while preserving I4mm symmetry. No structural transition was observed up to 35 GP…
▽ More
Antiferromagnetic intermetallic compound EuRhGe3 crystalizes in a non-centrosymmetric BaNiSn3-type (I4mm) structure. We studied its pressure-dependent crystal structure by using synchrotron powder x-ray diffraction at room temperature. Our results show a smooth contraction of the unit cell volume by applying pressure while preserving I4mm symmetry. No structural transition was observed up to 35 GPa. By the equation of state fitting analysis, the bulk modulus and its pressure derivative were determined to be 73 (1) GPa and 5.5 (2), respectively. Furthermore, similar to the isostructural EuCoGe3, an anisotropic compression of a and c lattice parameters was observed. Our experimental results show a good agreement with the pressure-dependent structural evolution expected from theoretical calculations below 13 GPa. Reflecting a strong deviation from integer Eu valence, the experimental volume data appear to be smaller than those of DFT calculated values at higher pressures.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Gemma 2: Improving Open Language Models at a Practical Size
Authors:
Gemma Team,
Morgane Riviere,
Shreya Pathak,
Pier Giuseppe Sessa,
Cassidy Hardin,
Surya Bhupatiraju,
Léonard Hussenot,
Thomas Mesnard,
Bobak Shahriari,
Alexandre Ramé,
Johan Ferret,
Peter Liu,
Pouya Tafti,
Abe Friesen,
Michelle Casbon,
Sabela Ramos,
Ravin Kumar,
Charline Le Lan,
Sammy Jerome,
Anton Tsitsulin,
Nino Vieillard,
Piotr Stanczyk,
Sertan Girgin,
Nikola Momchev,
Matt Hoffman
, et al. (173 additional authors not shown)
Abstract:
In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We al…
▽ More
In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We also train the 2B and 9B models with knowledge distillation (Hinton et al., 2015) instead of next token prediction. The resulting models deliver the best performance for their size, and even offer competitive alternatives to models that are 2-3 times bigger. We release all our models to the community.
△ Less
Submitted 2 October, 2024; v1 submitted 31 July, 2024;
originally announced August 2024.
-
SPOCKMIP: Segmentation of Vessels in MRAs with Enhanced Continuity using Maximum Intensity Projection as Loss
Authors:
Chethan Radhakrishna,
Karthikesh Varma Chintalapati,
Sri Chandana Hudukula Ram Kumar,
Raviteja Sutrave,
Hendrik Mattern,
Oliver Speck,
Andreas Nürnberger,
Soumick Chatterjee
Abstract:
Identification of vessel structures of different sizes in biomedical images is crucial in the diagnosis of many neurodegenerative diseases. However, the sparsity of good-quality annotations of such images makes the task of vessel segmentation challenging. Deep learning offers an efficient way to segment vessels of different sizes by learning their high-level feature representations and the spatial…
▽ More
Identification of vessel structures of different sizes in biomedical images is crucial in the diagnosis of many neurodegenerative diseases. However, the sparsity of good-quality annotations of such images makes the task of vessel segmentation challenging. Deep learning offers an efficient way to segment vessels of different sizes by learning their high-level feature representations and the spatial continuity of such features across dimensions. Semi-supervised patch-based approaches have been effective in identifying small vessels of one to two voxels in diameter. This study focuses on improving the segmentation quality by considering the spatial correlation of the features using the Maximum Intensity Projection~(MIP) as an additional loss criterion. Two methods are proposed with the incorporation of MIPs of label segmentation on the single~(z-axis) and multiple perceivable axes of the 3D volume. The proposed MIP-based methods produce segmentations with improved vessel continuity, which is evident in visual examinations of ROIs. Patch-based training is improved by introducing an additional loss term, MIP loss, to penalise the predicted discontinuity of vessels. A training set of 14 volumes is selected from the StudyForrest dataset comprising of 18 7-Tesla 3D Time-of-Flight~(ToF) Magnetic Resonance Angiography (MRA) images. The generalisation performance of the method is evaluated using the other unseen volumes in the dataset. It is observed that the proposed method with multi-axes MIP loss produces better quality segmentations with a median Dice of $80.245 \pm 0.129$. Also, the method with single-axis MIP loss produces segmentations with a median Dice of $79.749 \pm 0.109$. Furthermore, a visual comparison of the ROIs in the predicted segmentation reveals a significant improvement in the continuity of the vessels when MIP loss is incorporated into training.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Continuous variable quantum teleportation using photon subtracted and photon added two mode squeezed coherent state
Authors:
Shikhar Arora,
Chandan Kumar,
Arvind
Abstract:
We consider non-Gaussian states generated by photon subtraction (PS) and photon addition (PA) on two-mode squeezed coherent (TMSC) states, as resource states for continuous variable (CV) quantum teleportation (QT). To this end, we derive the Wigner characteristic function for the family of photon subtracted and photon added TMSC states, which is then utilized to calculate the fidelity of teleporti…
▽ More
We consider non-Gaussian states generated by photon subtraction (PS) and photon addition (PA) on two-mode squeezed coherent (TMSC) states, as resource states for continuous variable (CV) quantum teleportation (QT). To this end, we derive the Wigner characteristic function for the family of photon subtracted and photon added TMSC states, which is then utilized to calculate the fidelity of teleporting a single mode coherent state and a squeezed vacuum state. The analysis shows that while symmetric PS enhances the fidelity of QT in an extensive range of squeezing, asymmetric PS enhances the performance marginally and only in the low squeezing regime. The addition operations on the other hand are less useful, symmetric three-PA leads to a marginal improvement while the other addition operations are useless. We have considered the actual experimental setup for PS and PA operations and computed their success probabilities which should be kept in mind while advocating the use of these operations. We could compute the fidelity of QT for a broad range of states because we analytically derived the Wigner characteristic function for these family of states which we think will be useful for various other applications of these families of states.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Shear-Layer Perturbation Responses from Time-Resolved Schlieren Data
Authors:
Spencer L. Stahl,
Chandan Kumar,
Datta V. Gaitonde
Abstract:
A novel combination of physics-based and data-driven post-processing techniques is proposed to extract acoustic-related shear-layer perturbation responses directly from spatio-temporally resolved schlieren video. The physics-based component is derived from a momentum potential theory extension that extracts irrotational (acoustic and thermal) information from density gradients embedded in schliere…
▽ More
A novel combination of physics-based and data-driven post-processing techniques is proposed to extract acoustic-related shear-layer perturbation responses directly from spatio-temporally resolved schlieren video. The physics-based component is derived from a momentum potential theory extension that extracts irrotational (acoustic and thermal) information from density gradients embedded in schlieren pixel intensities. For the unheated shear layer, the method spotlights acoustic structures and tones otherwise hidden. The filtered data is then subjected to a data-driven Dynamic Mode Decomposition Reduced Order Model (DMD-ROM), which provides the response to forced perturbations. This method applies a learned linear model to isolate and quantify growth rates of acoustic phenomena suited for efficient parametric studies. A shear-layer comprised of two streams at Mach 2.461 and 0.175, corresponding to a convective Mach number 0.88 and containing shocks, is adopted for illustration. The overall perturbation response is first obtained using an impulse forcing in the wall normal direction of the splitter plate, extending in both subsonic and supersonic streams. Subsequently, impulse and harmonic forcings are independently applied in a local pixel-by-pixel manner for a precise receptivity study. The acoustic response shows a convective wavepacket and an acoustic burst from the splitter plate. The interaction with the primary shock and associated wave dispersion emits a second, slower, acoustic wave. Harmonic forcing indicates higher frequency-dependent sensitivity in the supersonic stream, with the most sensitive location near the outer boundary layer region. Excitation here yields an order of magnitude larger acoustic response compared to disturbances in the subsonic stream. Some receptive forcing inputs do not generate significant acoustic waves, which may guide excitation with low noise impact.
△ Less
Submitted 7 July, 2024;
originally announced July 2024.
-
Re-examination of the role of displacement and photon catalysis operation in continuous variable measurement device-independent quantum key distribution
Authors:
Chandan Kumar,
Arvind
Abstract:
We investigate the benefits of using $m$-photon catalysed two-mode squeezed coherent ($m$-PCTMSC) state in continuous variable measurement device-independent quantum key distribution (CV-MDI-QKD). To that end, we derive the Wigner characteristic function of the $m$-PCTMSC state and show that the 0-PCTMSC state is a Gaussian state and is an inferior choice as compared to the zero photon catalyzed t…
▽ More
We investigate the benefits of using $m$-photon catalysed two-mode squeezed coherent ($m$-PCTMSC) state in continuous variable measurement device-independent quantum key distribution (CV-MDI-QKD). To that end, we derive the Wigner characteristic function of the $m$-PCTMSC state and show that the 0-PCTMSC state is a Gaussian state and is an inferior choice as compared to the zero photon catalyzed two-mode squeezed vacuum state for CV-MDI-QKD. We carry out the optimization of the secret key rate with respect to all state parameters, namely variance, transmissivity, and displacement. Contrary to many recent proposals, the results show that zero- and single-photon catalysis operation provides only a marginal benefit in improving the maximum transmission distance. Secondly, we find that displacement offers no benefit in improving CV-MDI-QKD.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
High-entropy magnetism of murunskite
Authors:
D. Tolj,
P. Reddy,
I. Živković,
L. Akšamović,
J. R. Soh,
K. Komȩdera,
I. Biało,
C. M. N. Kumar,
T. Ivšić,
M. Novak,
O. Zaharko,
C. Ritter,
T. La Grange,
W. TabiÅ›,
I. Batistić,
L. Forró,
H. M. Rønnow,
D. K. Sunko,
N. Barišić
Abstract:
Murunskite (K$_2$FeCu$_3$S$_4$) is a bridging compound between the only two known families of high-temperature superconductors. It is a semiconductor like the parent compounds of cuprates, yet isostructural to metallic iron-pnictides. Moreover, like both families, it has an antiferromagnetic (AF)-like response with an ordered phase occurring below $\approx$ 100 K. Through comprehensive neutron, Mö…
▽ More
Murunskite (K$_2$FeCu$_3$S$_4$) is a bridging compound between the only two known families of high-temperature superconductors. It is a semiconductor like the parent compounds of cuprates, yet isostructural to metallic iron-pnictides. Moreover, like both families, it has an antiferromagnetic (AF)-like response with an ordered phase occurring below $\approx$ 100 K. Through comprehensive neutron, Mössbauer, and XPS measurements on single crystals, we unveil AF with a nearly commensurate quarter-zone wave vector. Intriguingly, the only identifiable magnetic atoms, iron, are randomly distributed over one-quarter of available crystallographic sites in 2D planes, while the remaining sites are occupied by closed-shell copper. Notably, any interpretation in terms of a spin-density wave is challenging, in contrast to the metallic iron-pnictides where Fermi-surface nesting can occur. Our findings align with a disordered-alloy picture featuring magnetic interactions up to second neighbors. Moreover, in the paramagnetic state, iron ions are either in Fe$^{3+}$ or Fe$^{2+}$ oxidation states, associated with two distinct paramagnetic sites identified by Mössbauer spectroscopy. Upon decreasing the temperature below the appearance of magnetic interactions, these two signals merge completely into a third, implying an orbital transition. It completes the cascade of (local) transitions that transform iron atoms from fully orbitally and magnetically disordered to homogeneously ordered in inverse space, but still randomly distributed in real space.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Optimization of state parameters in displacement assisted photon subtracted measurement-device-independent quantum key distribution
Authors:
Chandan Kumar,
Sarbani Chatterjee,
Arvind
Abstract:
Non-Gaussian operations, in particular, photon subtraction (PS), have been shown to enhance the performance of various quantum information processing tasks including continuous variable measurement device independent quantum key distribution (CV-MDI-QKD). This work investigates the role of non-Gaussian resource states, namely, the photon subtracted two-mode squeezed coherent (PSTMSC) (which includ…
▽ More
Non-Gaussian operations, in particular, photon subtraction (PS), have been shown to enhance the performance of various quantum information processing tasks including continuous variable measurement device independent quantum key distribution (CV-MDI-QKD). This work investigates the role of non-Gaussian resource states, namely, the photon subtracted two-mode squeezed coherent (PSTMSC) (which include photon subtracted two-mode squeezed vacuum (PSTMSV) as a special case) states in CV-MDI-QKD. To this end, we derive the Wigner characteristic function for the resource states, from which the covariance matrix and, finally, the secret key rate expressions are extracted. The optimization of the state parameters is undertaken to find the most suitable resource states in this family of states. There have been previous studies on the PSTMSV and PSTMSC states in CV-MDI-QKD that make use of PS operation. We evaluate such proposals and find to our surprise that both PSTMSC and PSTMSV resource states underperform as compared to the TMSV state rendering PS operation and displacement undesirable.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
No real advantage of photon subtraction and displacement in continuous variable measurement device independent quantum key distribution
Authors:
Chandan Kumar,
Sarbani Chatterjee,
Arvind
Abstract:
We critically analyse the role of single photon subtraction (SPS) and displacement in improving the performance of continuous variable measurement device independent quantum key distribution (CV-MDI-QKD). We consider CV-MDI-QKD with resource states generated by SPS on a displaced two-mode squeezed vacuum state. Optimizing the secret key rate with state parameters reveals that implementing SPS yiel…
▽ More
We critically analyse the role of single photon subtraction (SPS) and displacement in improving the performance of continuous variable measurement device independent quantum key distribution (CV-MDI-QKD). We consider CV-MDI-QKD with resource states generated by SPS on a displaced two-mode squeezed vacuum state. Optimizing the secret key rate with state parameters reveals that implementing SPS yields no benefits in improving the loss tolerance of CV-MDI-QKD. Additionally, we find that displacement too is not useful in improving the performance of CV-MDI-QKD. While our result is in contradistinction with the widely held belief in the field regarding the utility of SPS and displacement in CV-MDI-QKD, it also calls for a re-examination of the role of non-Gaussian operations in increasing the efficiency of various quantum information processing protocols.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Exploring the Decentraland Economy: Multifaceted Parcel Attributes, Key Insights, and Benchmarking
Authors:
Dipika Jha,
Ankit K. Bhagat,
Raju Halder,
Rajendra N. Paramanik,
Chandra M. Kumar
Abstract:
This paper presents a comprehensive Decentraland parcels dataset, called IITP-VDLand, sourced from diverse platforms such as Decentraland, OpenSea, Etherscan, Google BigQuery, and various Social Media Platforms. Unlike existing datasets which have limited attributes and records, IITP-VDLand offers a rich array of attributes, encompassing parcel characteristics, trading history, past activities, tr…
▽ More
This paper presents a comprehensive Decentraland parcels dataset, called IITP-VDLand, sourced from diverse platforms such as Decentraland, OpenSea, Etherscan, Google BigQuery, and various Social Media Platforms. Unlike existing datasets which have limited attributes and records, IITP-VDLand offers a rich array of attributes, encompassing parcel characteristics, trading history, past activities, transactions, and social media interactions. Alongside, we introduce a key attribute in the dataset, namely Rarity score, which measures the uniqueness of each parcel within the virtual world. Addressing the significant challenge posed by the dispersed nature of this data across various sources, we employ a systematic approach, utilizing both available APIs and custom scripts, to gather it. Subsequently, we meticulously curate and organize the information into four distinct fragments: (1) Characteristics, (2) OpenSea Trading History, (3) Ethereum Activity Transactions, and (4) Social Media. We envisage that this dataset would serve as a robust resource for training machine- and deep-learning models specifically designed to address real-world challenges within the domain of Decentraland parcels. The performance benchmarking of more than 20 state-of-the-art price prediction models on our dataset yields promising results, achieving a maximum R2 score of 0.8251 and an accuracy of 74.23% in case of Extra Trees Regressor and Classifier. The key findings reveal that the ensemble models perform better than both deep learning and linear models for our dataset. We observe a significant impact of coordinates, geographical proximity, rarity score, and few other economic indicators on the prediction of parcel prices.
△ Less
Submitted 2 March, 2025; v1 submitted 11 April, 2024;
originally announced April 2024.
-
TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques
Authors:
Ashok Urlana,
Aditya Saibewar,
Bala Mallikarjunarao Garlapati,
Charaka Vinayak Kumar,
Ajeet Kumar Singh,
Srinivasa Rao Chalamala
Abstract:
The Large Language Models (LLMs) exhibit remarkable ability to generate fluent content across a wide spectrum of user queries. However, this capability has raised concerns regarding misinformation and personal information leakage. In this paper, we present our methods for the SemEval2024 Task8, aiming to detect machine-generated text across various domains in both mono-lingual and multi-lingual co…
▽ More
The Large Language Models (LLMs) exhibit remarkable ability to generate fluent content across a wide spectrum of user queries. However, this capability has raised concerns regarding misinformation and personal information leakage. In this paper, we present our methods for the SemEval2024 Task8, aiming to detect machine-generated text across various domains in both mono-lingual and multi-lingual contexts. Our study comprehensively analyzes various methods to detect machine-generated text, including statistical, neural, and pre-trained model approaches. We also detail our experimental setup and perform a in-depth error analysis to evaluate the effectiveness of these methods. Our methods obtain an accuracy of 86.9\% on the test set of subtask-A mono and 83.7\% for subtask-B. Furthermore, we also highlight the challenges and essential factors for consideration in future studies.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Learn and Search: An Elegant Technique for Object Lookup using Contrastive Learning
Authors:
Chandan Kumar,
Jansel Herrera-Gerena,
John Just,
Matthew Darr,
Ali Jannesari
Abstract:
The rapid proliferation of digital content and the ever-growing need for precise object recognition and segmentation have driven the advancement of cutting-edge techniques in the field of object classification and segmentation. This paper introduces "Learn and Search", a novel approach for object lookup that leverages the power of contrastive learning to enhance the efficiency and effectiveness of…
▽ More
The rapid proliferation of digital content and the ever-growing need for precise object recognition and segmentation have driven the advancement of cutting-edge techniques in the field of object classification and segmentation. This paper introduces "Learn and Search", a novel approach for object lookup that leverages the power of contrastive learning to enhance the efficiency and effectiveness of retrieval systems.
In this study, we present an elegant and innovative methodology that integrates deep learning principles and contrastive learning to tackle the challenges of object search. Our extensive experimentation reveals compelling results, with "Learn and Search" achieving superior Similarity Grid Accuracy, showcasing its efficacy in discerning regions of utmost similarity within an image relative to a cropped image.
The seamless fusion of deep learning and contrastive learning to address the intricacies of object identification not only promises transformative applications in image recognition, recommendation systems, and content tagging but also revolutionizes content-based search and retrieval. The amalgamation of these techniques, as exemplified by "Learn and Search," represents a significant stride in the ongoing evolution of methodologies in the dynamic realm of object classification and segmentation.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1112 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 16 December, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Non-Gaussian two mode squeezed thermal states in continuous variable quantum teleportation
Authors:
Chandan Kumar
Abstract:
While photon catalyzed two mode squeezed vacuum state has been considered in context of quantum teleportation, similar studies have not been yet conducted for photon catalyzed two-mode squeezed thermal (TMST) state. This can be attributed to challenges involved in the evaluation of teleportation fidelity for photon catalyzed TMST state. In this article, we consider a practical scheme for the imple…
▽ More
While photon catalyzed two mode squeezed vacuum state has been considered in context of quantum teleportation, similar studies have not been yet conducted for photon catalyzed two-mode squeezed thermal (TMST) state. This can be attributed to challenges involved in the evaluation of teleportation fidelity for photon catalyzed TMST state. In this article, we consider a practical scheme for the implementation of non-Gaussian operation, viz., photon subtraction, photon addition, and photon catalysis, on TMST state. The generated states are employed as resources in continuous-variable quantum teleportation. The results show that the three non-Gaussian operations can enhance the teleportation fidelity. Considering the success probability of the non-Gaussian operations, we identify single-photon catalysis and single photon subtraction to be optimal for teleporting input coherent states, at low and intermediate squeezing levels.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
LLMs with Industrial Lens: Deciphering the Challenges and Prospects -- A Survey
Authors:
Ashok Urlana,
Charaka Vinayak Kumar,
Ajeet Kumar Singh,
Bala Mallikarjunarao Garlapati,
Srinivasa Rao Chalamala,
Rahul Mishra
Abstract:
Large language models (LLMs) have become the secret ingredient driving numerous industrial applications, showcasing their remarkable versatility across a diverse spectrum of tasks. From natural language processing and sentiment analysis to content generation and personalized recommendations, their unparalleled adaptability has facilitated widespread adoption across industries. This transformative…
▽ More
Large language models (LLMs) have become the secret ingredient driving numerous industrial applications, showcasing their remarkable versatility across a diverse spectrum of tasks. From natural language processing and sentiment analysis to content generation and personalized recommendations, their unparalleled adaptability has facilitated widespread adoption across industries. This transformative shift driven by LLMs underscores the need to explore the underlying associated challenges and avenues for enhancement in their utilization. In this paper, our objective is to unravel and evaluate the obstacles and opportunities inherent in leveraging LLMs within an industrial context. To this end, we conduct a survey involving a group of industry practitioners, develop four research questions derived from the insights gathered, and examine 68 industry papers to address these questions and derive meaningful conclusions. We maintain the Github repository with the most recent papers in the field.
△ Less
Submitted 27 May, 2025; v1 submitted 22 February, 2024;
originally announced February 2024.
-
Evaluating Ground State Energies of Chemical Systems with Low-Depth Quantum Circuits and High Accuracy
Authors:
Shuo Sun,
Chandan Kumar,
Kevin Shen,
Elvira Shishenina,
Christian B. Mendl
Abstract:
Solving electronic structure problems is considered one of the most promising applications of quantum computing. However, due to limitations imposed by the coherence time of qubits in the Noisy Intermediate Scale Quantum (NISQ) era or the capabilities of early fault-tolerant quantum devices, it is vital to design algorithms with low-depth circuits. In this work, we develop an enhanced Variational…
▽ More
Solving electronic structure problems is considered one of the most promising applications of quantum computing. However, due to limitations imposed by the coherence time of qubits in the Noisy Intermediate Scale Quantum (NISQ) era or the capabilities of early fault-tolerant quantum devices, it is vital to design algorithms with low-depth circuits. In this work, we develop an enhanced Variational Quantum Eigensolver (VQE) ansatz based on the Qubit Coupled Cluster (QCC) approach, which demands optimization over only $n$ parameters rather than the usual $n+2m$ parameters, where $n$ represents the number of Pauli string time evolution gates $e^{-itP}$, and $m$ is the number of qubits involved. We evaluate the ground state energies of $\mathrm{O_3}$, $\mathrm{Li_4}$, and $\mathrm{Cr_2}$, using CAS(2,2), (4,4) and (6,6) respectively in conjunction with our enhanced QCC ansatz, UCCSD (Unitary Coupled Cluster Single Double) ansatz, and canonical CCSD method as the active space solver, and compare with CASCI results. Finally, we assess our enhanced QCC ansatz on two distinct quantum hardware, IBM Kolkata and Quantinuum H1-1.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Unsupervised learning based object detection using Contrastive Learning
Authors:
Chandan Kumar,
Jansel Herrera-Gerena,
John Just,
Matthew Darr,
Ali Jannesari
Abstract:
Training image-based object detectors presents formidable challenges, as it entails not only the complexities of object detection but also the added intricacies of precisely localizing objects within potentially diverse and noisy environments. However, the collection of imagery itself can often be straightforward; for instance, cameras mounted in vehicles can effortlessly capture vast amounts of d…
▽ More
Training image-based object detectors presents formidable challenges, as it entails not only the complexities of object detection but also the added intricacies of precisely localizing objects within potentially diverse and noisy environments. However, the collection of imagery itself can often be straightforward; for instance, cameras mounted in vehicles can effortlessly capture vast amounts of data in various real-world scenarios. In light of this, we introduce a groundbreaking method for training single-stage object detectors through unsupervised/self-supervised learning.
Our state-of-the-art approach has the potential to revolutionize the labeling process, substantially reducing the time and cost associated with manual annotation. Furthermore, it paves the way for previously unattainable research opportunities, particularly for large, diverse, and challenging datasets lacking extensive labels.
In contrast to prevalent unsupervised learning methods that primarily target classification tasks, our approach takes on the unique challenge of object detection. We pioneer the concept of intra-image contrastive learning alongside inter-image counterparts, enabling the acquisition of crucial location information essential for object detection. The method adeptly learns and represents this location information, yielding informative heatmaps. Our results showcase an outstanding accuracy of \textbf{89.2\%}, marking a significant breakthrough of approximately \textbf{15x} over random initialization in the realm of unsupervised object detection within the field of computer vision.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
Large Language Models aren't all that you need
Authors:
Kiran Voderhobli Holla,
Chaithanya Kumar,
Aryan Singh
Abstract:
This paper describes the architecture and systems built towards solving the SemEval 2023 Task 2: MultiCoNER II (Multilingual Complex Named Entity Recognition) [1]. We evaluate two approaches (a) a traditional Conditional Random Fields model and (b) a Large Language Model (LLM) fine-tuned with a customized head and compare the two approaches. The novel ideas explored are: 1) Decaying auxiliary loss…
▽ More
This paper describes the architecture and systems built towards solving the SemEval 2023 Task 2: MultiCoNER II (Multilingual Complex Named Entity Recognition) [1]. We evaluate two approaches (a) a traditional Conditional Random Fields model and (b) a Large Language Model (LLM) fine-tuned with a customized head and compare the two approaches. The novel ideas explored are: 1) Decaying auxiliary loss (with residual) - where we train the model on an auxiliary task of Coarse-Grained NER and include this task as a part of the loss function 2) Triplet token blending - where we explore ways of blending the embeddings of neighboring tokens in the final NER layer prior to prediction 3) Task-optimal heads - where we explore a variety of custom heads and learning rates for the final layer of the LLM. We also explore multiple LLMs including GPT-3 and experiment with a variety of dropout and other hyperparameter settings before arriving at our final model which achieves micro & macro f1 of 0.85/0.84 (on dev) and 0.67/0.61 on the test data . We show that while pre-trained LLMs, by themselves, bring about a large improvement in scores as compared to traditional models, we also demonstrate that tangible improvements to the Macro-F1 score can be made by augmenting the LLM with additional feature/loss/model engineering techniques described above.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1326 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…
▽ More
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
△ Less
Submitted 9 May, 2025; v1 submitted 18 December, 2023;
originally announced December 2023.