Search | arXiv e-print repository

Contradiction Detection in RAG Systems: Evaluating LLMs as Context Validators for Improved Information Consistency

Authors: Vignesh Gokul, Srikanth Tenneti, Alwarappan Nakkiran

Abstract: Retrieval Augmented Generation (RAG) systems have emerged as a powerful method for enhancing large language models (LLMs) with up-to-date information. However, the retrieval step in RAG can sometimes surface documents containing contradictory information, particularly in rapidly evolving domains such as news. These contradictions can significantly impact the performance of LLMs, leading to inconsistent or erroneous outputs. This study addresses this critical challenge in two ways. First, we present a novel data generation framework to simulate different types of contradictions that may occur in the retrieval stage of a RAG system. Second, we evaluate the robustness of different LLMs in performing as context validators, assessing their ability to detect contradictory information within retrieved document sets. Our experimental results reveal that context validation remains a challenging task even for state-of-the-art LLMs, with performance varying significantly across different types of contradictions. While larger models generally perform better at contradiction detection, the effectiveness of different prompting strategies varies across tasks and model architectures. We find that chain-of-thought prompting shows notable improvements for some models but may hinder performance in others, highlighting the complexity of the task and the need for more robust approaches to context validation in RAG systems.

Submitted 31 March, 2025; originally announced April 2025.

arXiv:2501.14392

Emission from driven atoms in collective strong coupling with an optical cavity

Authors: V. R. Thakar, Arun Bahuleyan, V. I. Gokul, S. P. Dinesh, S. A. Rangwala

Abstract: We study self-sustained cavity emission from driven atoms in collective strong coupling. The cavity emission occurs over a wide range of atom-cavity and drive laser detunings without any external input to the cavity mode. Second-order correlation measurements ($g^2(τ)$) further reveal unanticipated phenomena in the observed cavity emission, such as (a) damped oscillations at two frequencies and (b) significantly distinct $g^2(τ)$ for different polarization components. The intricate relation between cavity emission intensity, drive laser detuning and atom-cavity detunings is explained. A possible mechanism for the damped oscillations with two frequency components in $g^2(τ)$ is suggested. Measurements show the existence of two separate polarization-decoupled mechanisms with distinct photon statistics, through which energy is transferred from the drive field to the cavity field. The statistical properties and mechanisms underlying cavity emission, as presented in this work, are expected to provide valuable insights for extending non-destructive detection techniques to the regime of collective strong coupling.

Submitted 24 January, 2025; originally announced January 2025.

arXiv:2409.05451

Detection of radiatively open systems using an optical cavity

Authors: V. I. Gokul, Arun Bahuleyan, Raghuveer Singh Yadav, S. P. Dinesh, V. R. Thakar, Rahul Sawant, S. A. Rangwala

Abstract: We experimentally demonstrate a cavity-based detection scheme for a cold atomic ensemble with a radiatively open transition. Our method exploits the collective strong coupling of atoms to the cavity mode, which results in off-resonant probing of the atomic ensemble, leading to a dramatic reduction in losses from the detection cycle. We then show the viability of this frequency measurement for detecting a small number of atoms and molecules by theoretical modelling. Compared with the most commonly used fluorescence method, we show that the cavity-based scheme allows rapid and prolonged detection of the system's evolution with minimal destruction.

Submitted 9 September, 2024; originally announced September 2024.

arXiv:2402.15114

Cavity based non-destructive detection of photoassociation in a dark MOT

Authors: V. I. Gokul, Arun Bahuleyan, S. P. Dinesh, V. R. Thakar, S. A. Rangwala

Abstract: The photoassociation (PA) of rubidium dimer (Rb2) in a dark magneto-optic trap (MOT) is studied using atom-cavity collective strong coupling. This allows non-destructive detection of the molecule formation process as well as rapid and repeated interrogation of the atom-molecule system. The vacuum Rabi splitting (VRS) measurements from the bright MOT are carefully calibrated against equivalent measurements with fluorescence. Further, loading rates in the dark MOT are determined using VRS. This method provides a reliable, fast, and non-destructive detection scheme for ultracold molecules when the atoms are non-fluorescing, using the free atoms coupled to a cavity.

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.06810

Evaluating Co-Creativity using Total Information Flow

Authors: Vignesh Gokul, Chris Francis, Shlomo Dubnov

Abstract: Co-creativity in music refers to two or more musicians or musical agents interacting with one another by composing or improvising music. However, this is a very subjective process and each musician has their own preference as to which improvisation is better for some context. In this paper, we aim to create a measure based on total information flow to quantitatively evaluate the co-creativity process in music. In other words, our measure is an indication of how "good" a creative musical process is. Our main hypothesis is that a good musical creation would maximize information flow between the participants captured by music voices recorded in separate tracks. We propose a method to compute the information flow using pre-trained generative models as entropy estimators. We demonstrate how our method matches with human perception using a qualitative study.

Submitted 9 February, 2024; originally announced February 2024.

arXiv:2401.02135

PosCUDA: Position based Convolution for Unlearnable Audio Datasets

Authors: Vignesh Gokul, Shlomo Dubnov

Abstract: Deep learning models require large amounts of clean data to achieve good performance. To avoid the cost of expensive data acquisition, researchers use the abundant data available on the internet. This raises significant privacy concerns on the potential misuse of personal data for model training without authorisation. Recent works such as CUDA propose solutions to this problem by adding class-wise blurs to make datasets unlearnable, i.e., a model can never use the acquired dataset for learning. However, these methods often reduce the quality of the data, making it useless for practical applications. We introduce PosCUDA, a position based convolution for creating unlearnable audio datasets. PosCUDA uses class-wise convolutions on small patches of audio. The locations of the patches are based on a private key for each class, hence the model learns the relations between positional blurs and labels, while failing to generalize. We empirically show that PosCUDA can achieve unlearnability while maintaining the quality of the original audio datasets. Our proposed method is also robust to different audio feature representations such as MFCC, raw audio and different architectures such as transformers, convolutional networks etc.

Submitted 4 January, 2024; originally announced January 2024.

arXiv:2310.16415

Dynamic Fabry-Perot cavity stabilization technique for atom-cavity experiments

Authors: S. P. Dinesh, V. R. Thakar, V. I. Gokul, Arun Bahuleyan, S. A. Rangwala

Abstract: We present a stabilization technique developed to lock and dynamically tune the resonant frequency of a moderate finesse Fabry-Pérot (FP) cavity used in precision atom-cavity quantum electrodynamics (QED) experiments. Most experimental setups with active stabilization either operate at one fixed resonant frequency or use transfer cavities to achieve the ability to tune the resonant frequency of the cavity. In this work, we present a simple and cost-effective solution to actively stabilize an optical cavity while achieving a dynamic tuning range of over 100 MHz with a precision under 1 MHz. Our unique scheme uses a reference laser locked to an electro-optic modulator (EOM) shifted saturation absorption spectroscopy (SAS) signal. The cavity is locked to the PDH error signal obtained from the dip in the reflected intensity of this reference laser. Our setup provides the feature to efficiently tune the resonant frequency of the cavity by only changing the EOM drive without unlocking and re-locking either the reference laser or the cavity. We present measurements of precision control of the resonant cavity frequency and vacuum Rabi splitting (VRS) to quantify the stability achieved and hence show that this technique is suitable for a variety of cavity QED experiments.

Submitted 25 October, 2023; originally announced October 2023.

arXiv:2103.09876

Bias-Free FedGAN: A Federated Approach to Generate Bias-Free Datasets

Authors: Vaikkunth Mugunthan, Vignesh Gokul, Lalana Kagal, Shlomo Dubnov

Abstract: Federated Generative Adversarial Network (FedGAN) is a communication-efficient approach to train a GAN across distributed clients without clients having to share their sensitive training data. In this paper, we experimentally show that FedGAN generates biased data points under non-independent-and-identically-distributed (non-iid) settings. Also, we propose Bias-Free FedGAN, an approach to generate bias-free synthetic datasets using FedGAN. Our approach generates metadata at the aggregator using the models received from clients and retrains the federated model to achieve bias-free results for image synthesis. Bias-Free FedGAN has the same communication cost as that of FedGAN. Experimental results on image datasets (MNIST and FashionMNIST) validate our claims.

Submitted 15 April, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

arXiv:2010.11398

DPD-InfoGAN: Differentially Private Distributed InfoGAN

Authors: Vaikkunth Mugunthan, Vignesh Gokul, Lalana Kagal, Shlomo Dubnov

Abstract: Generative Adversarial Networks (GANs) are deep learning architectures capable of generating synthetic datasets. Despite producing high-quality synthetic images, the default GAN has no control over the kinds of images it generates. The Information Maximizing GAN (InfoGAN) is a variant of the default GAN that introduces feature-control variables that are automatically learned by the framework, hence providing greater control over the different kinds of images produced. Due to the high model complexity of InfoGAN, the generative distribution tends to be concentrated around the training data points. This is a critical problem as the models may inadvertently expose the sensitive and private information present in the dataset. To address this problem, we propose a differentially private version of InfoGAN (DP-InfoGAN). We also extend our framework to a distributed setting (DPD-InfoGAN) to allow clients to learn different attributes present in other clients' datasets in a privacy-preserving manner. In our experiments, we show that both DP-InfoGAN and DPD-InfoGAN can synthesize high-quality images with flexible control over image attributes while preserving privacy.

Submitted 22 March, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

arXiv:1703.04364

Deep Learning for Skin Lesion Classification

Authors: P. Mirunalini, Aravindan Chandrabose, Vignesh Gokul, S. M. Jaisakthi

Abstract: Melanoma, a malignant form of skin cancer, is very threatening to life. Diagnosis of melanoma at an earlier stage is highly needed as it has a very high cure rate. Benign and malignant forms of skin cancer can be detected by analyzing the lesions present on the surface of the skin using dermoscopic images. In this work, an automated skin lesion detection system has been developed which learns the representation of the image using Google's pretrained CNN model known as Inception-v3. After obtaining the representation vector for our input dermoscopic images, we have trained a two-layer feed-forward neural network to classify the images as malignant or benign. The system also classifies the images based on the cause of the cancer, either due to melanocytic or non-melanocytic cells, using a different neural network. These classification tasks are part of the challenge organized by the International Skin Imaging Collaboration (ISIC) 2017. Our system learns to classify the images based on the model built using the training images given in the challenge, and the experimental results were evaluated using validation and test sets. Our system has achieved an overall accuracy of 65.8% for the validation set.

Submitted 13 March, 2017; originally announced March 2017.

Comments: 3 pages with 3 figures

Showing 1–10 of 10 results for author: Gokul, V