-
Superconductivity in Spin-Orbit coupled SU(8) Dirac Fermions on Honeycomb lattice
Authors:
Ankush Chaubey,
Basudeb Mondal,
Vijay B. Shenoy,
Subhro Bhattacharjee
Abstract:
We study superconducting (SC) phases that are naturally proximate to a spin-orbit coupled SU(8) Dirac semi-metal on a honeycomb lattice. This system, which offers enhanced low-energy symmetries, presents an interesting platform for realizing unconventional superconductivity in j=3/2 electrons. In particular, we find 72 superconducting charge-$2e$ fermion bilinears which, under classification of mi…
▽ More
We study superconducting (SC) phases that are naturally proximate to a spin-orbit coupled SU(8) Dirac semi-metal on a honeycomb lattice. This system, which offers enhanced low-energy symmetries, presents an interesting platform for realizing unconventional superconductivity in j=3/2 electrons. In particular, we find 72 superconducting charge-$2e$ fermion bilinears which, under classification of microscopic symmetries, lead to 12 different SCs -- four singlets, two doublets, and six triplets -- 7 of them are gapped and 5 are symmetry-protected nodal SCs. The strong spin-orbit coupling leads to locking of the spin of the Cooper pairs with real-space direction -- as is evident from the structure of the Cooper pair wave-functions -- leading to unusual non-unitary superconductors (even singlets), and with finite momentum pairing (for the triplets). This results, in many cases, in the magnitude of multiple pairing gaps being intricately dependent on the direction of the SC order-parameter. The present classification of SCs along with normal phases (Phys. Rev. B 108, 245106 (2023)) provides the complete list of naturally occurring phases in the vicinity of such a SU(8) Dirac semi-metal. This study allows for understanding the global phase diagram of such systems, stimulating further experimental work on candidate materials such as metallic halides (MX$_3$ with M=Zr, Hf, and X=Cl, Br). Further, it provides the starting point for the exploration of unconventional phase transitions in such systems.
△ Less
Submitted 8 May, 2025;
originally announced May 2025.
-
Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Authors:
Ashutosh Chaubey,
Xulang Guan,
Mohammad Soleymani
Abstract:
The human face plays a central role in social communication, necessitating the use of performant computer vision tools for human-centered applications. We propose Face-LLaVA, a multimodal large language model for face-centered, in-context learning, including facial expression and attribute recognition. Additionally, Face-LLaVA is able to generate natural language descriptions that can be used for…
▽ More
The human face plays a central role in social communication, necessitating the use of performant computer vision tools for human-centered applications. We propose Face-LLaVA, a multimodal large language model for face-centered, in-context learning, including facial expression and attribute recognition. Additionally, Face-LLaVA is able to generate natural language descriptions that can be used for reasoning. Leveraging existing visual databases, we first developed FaceInstruct-1M, a face-centered database for instruction tuning MLLMs for face processing. We then developed a novel face-specific visual encoder powered by Face-Region Guided Cross-Attention that integrates face geometry with local visual features. We evaluated the proposed method across nine different datasets and five different face processing tasks, including facial expression recognition, action unit detection, facial attribute detection, age estimation and deepfake detection. Face-LLaVA achieves superior results compared to existing open-source MLLMs and competitive performance compared to commercial solutions. Our model output also receives a higher reasoning rating by GPT under a zero-shot setting across all the tasks. Both our dataset and model wil be released at https://face-llava.github.io to support future advancements in social AI and foundational vision-language research.
△ Less
Submitted 9 April, 2025;
originally announced April 2025.
-
DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion
Authors:
Maksim Siniukov,
Di Chang,
Minh Tran,
Hongkun Gong,
Ashutosh Chaubey,
Mohammad Soleymani
Abstract:
Generating naturalistic and nuanced listener motions for extended interactions remains an open problem. Existing methods often rely on low-dimensional motion codes for facial behavior generation followed by photorealistic rendering, limiting both visual fidelity and expressive richness. To address these challenges, we introduce DiTaiListener, powered by a video diffusion model with multimodal cond…
▽ More
Generating naturalistic and nuanced listener motions for extended interactions remains an open problem. Existing methods often rely on low-dimensional motion codes for facial behavior generation followed by photorealistic rendering, limiting both visual fidelity and expressive richness. To address these challenges, we introduce DiTaiListener, powered by a video diffusion model with multimodal conditions. Our approach first generates short segments of listener responses conditioned on the speaker's speech and facial motions with DiTaiListener-Gen. It then refines the transitional frames via DiTaiListener-Edit for a seamless transition. Specifically, DiTaiListener-Gen adapts a Diffusion Transformer (DiT) for the task of listener head portrait generation by introducing a Causal Temporal Multimodal Adapter (CTM-Adapter) to process speakers' auditory and visual cues. CTM-Adapter integrates speakers' input in a causal manner into the video generation process to ensure temporally coherent listener responses. For long-form video generation, we introduce DiTaiListener-Edit, a transition refinement video-to-video diffusion model. The model fuses video segments into smooth and continuous videos, ensuring temporal consistency in facial expressions and image quality when merging short video segments produced by DiTaiListener-Gen. Quantitatively, DiTaiListener achieves the state-of-the-art performance on benchmark datasets in both photorealism (+73.8% in FID on RealTalk) and motion representation (+6.1% in FD metric on VICO) spaces. User studies confirm the superior performance of DiTaiListener, with the model being the clear preference in terms of feedback, diversity, and smoothness, outperforming competitors by a significant margin.
△ Less
Submitted 4 April, 2025;
originally announced April 2025.
-
ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising
Authors:
Ashutosh Chaubey,
Anoubhav Agarwaal,
Sartaki Sinha Roy,
Aayush Agrawal,
Susmita Ghose
Abstract:
Contextual advertising serves ads that are aligned to the content that the user is viewing. The rapid growth of video content on social platforms and streaming services, along with privacy concerns, has increased the need for contextual advertising. Placing the right ad in the right context creates a seamless and pleasant ad viewing experience, resulting in higher audience engagement and, ultimate…
▽ More
Contextual advertising serves ads that are aligned to the content that the user is viewing. The rapid growth of video content on social platforms and streaming services, along with privacy concerns, has increased the need for contextual advertising. Placing the right ad in the right context creates a seamless and pleasant ad viewing experience, resulting in higher audience engagement and, ultimately, better ad monetization. From a technology standpoint, effective contextual advertising requires a video retrieval system capable of understanding complex video content at a very granular level. Current text-to-video retrieval models based on joint multimodal training demand large datasets and computational resources, limiting their practicality and lacking the key functionalities required for ad ecosystem integration. We introduce ContextIQ, a multimodal expert-based video retrieval system designed specifically for contextual advertising. ContextIQ utilizes modality-specific experts-video, audio, transcript (captions), and metadata such as objects, actions, emotion, etc.-to create semantically rich video representations. We show that our system, without joint training, achieves better or comparable results to state-of-the-art models and commercial solutions on multiple text-to-video retrieval benchmarks. Our ablation studies highlight the benefits of leveraging multiple modalities for enhanced video retrieval accuracy instead of using a vision-language model alone. Furthermore, we show how video retrieval systems such as ContextIQ can be used for contextual advertising in an ad ecosystem while also addressing concerns related to brand safety and filtering inappropriate content.
△ Less
Submitted 29 March, 2025; v1 submitted 29 October, 2024;
originally announced October 2024.
-
Variational wave-functions for correlated metals
Authors:
Ankush Chaubey,
Harsh Nigam,
Subhro Bhattacharjee,
K. Sengupta
Abstract:
We study a set of many-body wave-functions of Fermions that are naturally written using momentum space basis and allow for quantum superposition of Fermion occupancy, $\{n_{\bf k}\}$. This {enables} us to capture the fluctuations of the Fermi-surface {(FS)} -- the singularly most important signature of a metal. We bench-mark our results in one spatial dimensions (1D) to show that these wave-functi…
▽ More
We study a set of many-body wave-functions of Fermions that are naturally written using momentum space basis and allow for quantum superposition of Fermion occupancy, $\{n_{\bf k}\}$. This {enables} us to capture the fluctuations of the Fermi-surface {(FS)} -- the singularly most important signature of a metal. We bench-mark our results in one spatial dimensions (1D) to show that these wave-functions allow for quantitative understanding of the Tomonaga-Luttinger liquid (TLL); computations of certain correlators using them can in fact be extended to larger systems sizes compared to conventional exact diagonalization (ED) allowing for a more systematic comparison with bosonization techniques. Finally we show that this basis may be useful for obtaining fixed-point wave-function for strongly correlated metals {in dimensions greater that one}. In particular, we study the case of coherent (equal) superposition of elliptical FS {in continuum (2D) and on a} square lattice{. In case of the former, our variational wave-function systematically interpolates between the phenomenology of the Fermi liquid ground state, i.e., finite single-Fermion residue at a sharp FS, to a non-Fermi liquid (NFL) with zero residue. In the NFL the jump in $\langle n_{\bf k}\rangle$ at the FS is replaced by a point of inflection (similar to a 1D TLL) whose contour is consistent with the Luttinger Theorem. In case of the square lattice, we} find highly anisotropic distribution of the quasi-particle residue, which, at finite resolution has an uncanny resemblance to the Fermi-arcs{, albeit at zero temperature,} seen in the pseudo-gap state of the cuprates.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition
Authors:
Ashutosh Chaubey,
Sparsh Sinha,
Susmita Ghose
Abstract:
Speaker identification systems are deployed in diverse environments, often different from the lab conditions on which they are trained and tested. In this paper, first, we show the problem of generalization using fixed thresholds (computed using EER metric) for imposter identification in unseen speaker recognition and then introduce a robust speaker-specific thresholding technique for better perfo…
▽ More
Speaker identification systems are deployed in diverse environments, often different from the lab conditions on which they are trained and tested. In this paper, first, we show the problem of generalization using fixed thresholds (computed using EER metric) for imposter identification in unseen speaker recognition and then introduce a robust speaker-specific thresholding technique for better performance. Secondly, inspired by the recent use of meta-learning techniques in speaker verification, we propose an end-to-end meta-learning framework for imposter detection which decouples the problem of imposter detection from unseen speaker identification. Thus, unlike most prior works that use some heuristics to detect imposters, the proposed network learns to detect imposters by leveraging the utterances of the enrolled speakers. Furthermore, we show the efficacy of the proposed techniques on VoxCeleb1, VCTK and the FFSVC 2022 datasets, beating the baselines by up to 10%.
△ Less
Submitted 30 September, 2023; v1 submitted 1 June, 2023;
originally announced June 2023.
-
Magnetized matter effects on dilaton photon mixing
Authors:
Ankur Chaubey,
Manoj K. Jaiswal,
Avijit K. Ganguly
Abstract:
Dilatons ($φ(x)$) are a class of bosonic scalar particles associated with scaling symmetry and its compensation (under the violations of the same). Due to two photon coupling, they can produce optical signatures in a magnetic field. In vacuum or plain matter they couple to one of the transversely polarized state of the photon. But in a magnetized matter, they couple to both the transversely polari…
▽ More
Dilatons ($φ(x)$) are a class of bosonic scalar particles associated with scaling symmetry and its compensation (under the violations of the same). Due to two photon coupling, they can produce optical signatures in a magnetic field. In vacuum or plain matter they couple to one of the transversely polarized state of the photon. But in a magnetized matter, they couple to both the transversely polarized state of photon (due to emergence of a parity violating part of photon self energy contribution from a magnetized matter). A part of this work is directed towards understanding the issue of mixing of scalar with various polarizations states of photon in a medium ( magnetized or unmagnetized ) due to the constraints from different discrete (CPT) symmetries associated with the interaction. Based on these symmetry aided arguments, the structure of the mixing matrix is found to be $3 \times 3$. Thus there exists non-zero finite probabilities of oscillation between different polarization states of photon to dilaton. Our analytical and numerical analysis show no existence of periodic oscillation length either in temporal or spatial direction for most general values of the parameters in the theory. Possible astrophysical consequences of these results, those can be detected through observations are discussed.
△ Less
Submitted 26 December, 2022; v1 submitted 14 December, 2022;
originally announced December 2022.
-
Improved Relation Networks for End-to-End Speaker Verification and Identification
Authors:
Ashutosh Chaubey,
Sparsh Sinha,
Susmita Ghose
Abstract:
Speaker identification systems in a real-world scenario are tasked to identify a speaker amongst a set of enrolled speakers given just a few samples for each enrolled speaker. This paper demonstrates the effectiveness of meta-learning and relation networks for this use case. We propose improved relation networks for speaker verification and few-shot (unseen) speaker identification. The use of rela…
▽ More
Speaker identification systems in a real-world scenario are tasked to identify a speaker amongst a set of enrolled speakers given just a few samples for each enrolled speaker. This paper demonstrates the effectiveness of meta-learning and relation networks for this use case. We propose improved relation networks for speaker verification and few-shot (unseen) speaker identification. The use of relation networks facilitates joint training of the frontend speaker encoder and the backend model. Inspired by the use of prototypical networks in speaker verification and to increase the discriminability of the speaker embeddings, we train the model to classify samples in the current episode amongst all speakers present in the training set. Furthermore, we propose a new training regime for faster model convergence by extracting more information from a given meta-learning episode with negligible extra computation. We evaluate the proposed techniques on VoxCeleb, SITW and VCTK datasets on the tasks of speaker verification and unseen speaker identification. The proposed approach outperforms the existing approaches consistently on both tasks.
△ Less
Submitted 21 July, 2022; v1 submitted 31 March, 2022;
originally announced March 2022.
-
OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis
Authors:
Sumit Shekhar,
Bhanu Prakash Reddy Guda,
Ashutosh Chaubey,
Ishan Jindal,
Avneet Jain
Abstract:
Documents are central to many business systems, and include forms, reports, contracts, invoices or purchase orders. The information in documents is typically in natural language, but can be organized in various layouts and formats. There have been recent spurt of interest in understanding document content with novel deep learning architectures. However, document understanding tasks need dense info…
▽ More
Documents are central to many business systems, and include forms, reports, contracts, invoices or purchase orders. The information in documents is typically in natural language, but can be organized in various layouts and formats. There have been recent spurt of interest in understanding document content with novel deep learning architectures. However, document understanding tasks need dense information annotations, which are costly to scale and generalize. Several active learning techniques have been proposed to reduce the overall budget of annotation while maintaining the performance of the underlying deep learning model. However, most of these techniques work only for classification problems. But content detection is a more complex task, and has been scarcely explored in active learning literature. In this paper, we propose \textit{OPAD}, a novel framework using reinforcement policy for active learning in content detection tasks for documents. The proposed framework learns the acquisition function to decide the samples to be selected while optimizing performance metrics that the tasks typically have. Furthermore, we extend to weak labelling scenarios to further reduce the cost of annotation significantly. We propose novel rewards to account for class imbalance and user feedback in the annotation interface, to improve the active learning method. We show superior performance of the proposed \textit{OPAD} framework for active learning for various tasks related to document understanding like layout parsing, object detection and named entity recognition. Ablation studies for human feedback and class imbalance rewards are presented, along with a comparison of annotation times for different approaches.
△ Less
Submitted 7 October, 2021; v1 submitted 1 October, 2021;
originally announced October 2021.
-
Differentiating Dilatons from Axions by their mixing with photons
Authors:
Ankur Chaubey,
Manoj K. Jaiswal,
Damini Singh,
Venktesh Singh,
Avijit K. Ganguly
Abstract:
According to the model ($Λ$CDM), based on deep cosmological observations, the current universe is constituted of 5$\%$ baryonic matter and 25 $\%$ non-baryonic cold dark matter (of speculative origin). These include quanta of scalar filed like dilaton($φ$) of scale symmetry origin and quanta of pseudoscalar field of extra standard model symmetry ( Peccei-Quinn) origin, like axion ($φ'$). These fie…
▽ More
According to the model ($Λ$CDM), based on deep cosmological observations, the current universe is constituted of 5$\%$ baryonic matter and 25 $\%$ non-baryonic cold dark matter (of speculative origin). These include quanta of scalar filed like dilaton($φ$) of scale symmetry origin and quanta of pseudoscalar field of extra standard model symmetry ( Peccei-Quinn) origin, like axion ($φ'$). These fields couple to di-photons through dim-5 operators. In magnetized medium, they in principle can interact with the three degrees of freedom (two transverse ($A_{\parallel,\perp}$) and one longitudinal ($A_{L}$)) of photon($γ$) as long as the total spin is conserved. Because of intrinsic spin being zero, both $φ$ and $φ'$ could in principle have interacted with $A_{L}$, (having $s_{z}=0$). However, out of $φ$ and $φ'$ only one interacts with $A_{L}$. Furthermore, the ambient external magnetic field and media, breaks the intrinsic Lorentz symmetry of the system invoking Charge conjugation, Parity and Time reversal symmetries, we analyse the mixing dynamics of $φγ$ and $φ'γ$ systems and the structural {\it difference} of their mixing pattern. The strength of electromagnetic (EM) signals due to $φγ$ and $φ'γ$ mixing as a result would be {\it different}. We conclude by commenting on the possibility of detecting this {\it difference} -- in polarimetric observables the EMS -- using the existing space-borne detectors.
△ Less
Submitted 14 January, 2022; v1 submitted 24 July, 2021;
originally announced July 2021.
-
Improvement in the performance of multilayer insulation technique and impact in the rare physics search experiments
Authors:
D. Singh,
M. K. Singh,
A. Chaubey,
A. K. Ganguly,
V. Singh
Abstract:
Providing thermal insulation to systems at very low temperature from surroundings, involves blocking the transport of thermal energy regular or enhanced, taking place through radiative, conductive and convective processes. For instance, the enhancement of radiative heat transport that takes place by infra red or far infra red light at low temperature is due to diffractive propagation. The waveleng…
▽ More
Providing thermal insulation to systems at very low temperature from surroundings, involves blocking the transport of thermal energy regular or enhanced, taking place through radiative, conductive and convective processes. For instance, the enhancement of radiative heat transport that takes place by infra red or far infra red light at low temperature is due to diffractive propagation. The wavelength of light in this part of the spectrum usually lie in the range of mm to cms. Hence it can get bent across an obstacle while propagating forward. Apart from radiative, the convective and conductive processes also get affected due to appearance of non linearities in the modes of lattice vibrations and anomalies in material transport due to the appearance of vorticity and turbulence in the intervening media. The Multilayer insulation technique has offered a robust thermal protective mechanism to provide proper insulation to the cold walls of the cryostats from the heat of the surroundings. This work is focused on the estimation of performance and efficiency of the MLI technique as well as exploration of its versatile applicability. Three different spacer materials such as Dacron, Glass tissue, and Silk net with radiation shields are selected for the intervening medium in the present study. This article explores the thermal performance of MLI system by changing the physical parameters, varying the geometry of the radiation shields perforation styles of radiation shields and by analyzing the effect of arrangement of radiation shields on the conduction heat load. This analysis is concluded by studying the possibility of using MLI technique in the health sector by reducing the evaporation rate of liquid Oxygen during pandemic situations e.g. in COVID19.
△ Less
Submitted 17 July, 2021;
originally announced July 2021.
-
Elliptic Flow of Particles under the Influence of Electromagnetic Field Evolution in Relativistic Heavy Ion Collision
Authors:
Tewodros Gezhagn,
A. K. Chaubey
Abstract:
The bending of flow of identified particles from the relativistic heavy-ion collision is investigated using the iEBE-VISHNU framework. The Maxwell's equations are applied to compute the incremental drift velocity and the change in the elliptic flow of particles from the four sources of the electric force, which are of coulomb (EC), Lorentz (EL), Faraday (EF) and Plasma-based. We find out that the…
▽ More
The bending of flow of identified particles from the relativistic heavy-ion collision is investigated using the iEBE-VISHNU framework. The Maxwell's equations are applied to compute the incremental drift velocity and the change in the elliptic flow of particles from the four sources of the electric force, which are of coulomb (EC), Lorentz (EL), Faraday (EF) and Plasma-based. We find out that the field evolution arouses flow at lower transverse momentum and suppresses it at higher. Heavier particles get higher initial push, and particles and their anti-particles get crudely the same elliptic flow changes. Moreover, elliptic flow is found to show different percentage increase for different collision energies. To conclude, the present study shows that besides the inclusion of electromagnetic fields, the increase in collision energy affects the elliptic flow of particles in a non-uniform fashion throughout the evolution. Further study by softening many of the crude assumptions we made and keeping the functionality of parameters is needed to establish a better understanding of the electromagnetic field evolution and its effects on the created system.
△ Less
Submitted 3 July, 2021;
originally announced July 2021.
-
Electromagnetic field evolution in relativistic heavy-ion collision and its effect on flow of particles
Authors:
Tewodros Gezhagn,
A. K. Chaubey
Abstract:
We compute the electromagnetic fields generated in relativistic heavy-ion collisions using the iEBE-VISHNU framework. We calculated the incremental drift velocity from the possible four sources of the electric force (coulomb, Lorentz, Faraday, and Plasma-based) on the particles created. The effect of this external electromagnetic field on the flow harmonics of particles was investigated, and we fo…
▽ More
We compute the electromagnetic fields generated in relativistic heavy-ion collisions using the iEBE-VISHNU framework. We calculated the incremental drift velocity from the possible four sources of the electric force (coulomb, Lorentz, Faraday, and Plasma-based) on the particles created. The effect of this external electromagnetic field on the flow harmonics of particles was investigated, and we found out that the flow harmonics values get suppressed and rouse in a non-uniform fashion throughout the evolution. More precisely, a maximum of close to three percent increase in elliptic flow was observed. We also found mass more dominant factor than charges for the change in flow harmonics due to the created electromagnetic field. On the top of that, the magnetic field perpendicular to the reaction plane is found to be sizable while the different radial electric forces were found to cancel out each other. Finally, we found out that the inclusion of an electromagnetic field affects the flow of particles by suppressing or rising it in a non-uniform fashion throughout the evolution.
△ Less
Submitted 3 July, 2021;
originally announced July 2021.
-
Exploring scalar-photon interactions in energetic astrophysical events
Authors:
Ankur Chaubey,
Manoj K. Jaiswal,
Avijit K. Ganguly
Abstract:
Scalar fields like dilaton appear in quantum field theory (QFT) due to scale symmetry breaking. Their appeal also extends to modified theories of gravity, like $F(R)$ gravity, Horva Lifshitz gravity etc. In unified theories they make their appearance through compactification of the extra dimension. Apart from resolving the issues of compactification scale and size, the particles of their fields ca…
▽ More
Scalar fields like dilaton appear in quantum field theory (QFT) due to scale symmetry breaking. Their appeal also extends to modified theories of gravity, like $F(R)$ gravity, Horva Lifshitz gravity etc. In unified theories they make their appearance through compactification of the extra dimension. Apart from resolving the issues of compactification scale and size, the particles of their fields can also turn out to be excellent candidate to solve the dark energy (DE) and dark matter (DM) problem of the universe. In this work we study their mixing dynamics with photons in a magnetized media, by incorporating the effect of parity violating part of the photon polarization tensor, evaluated in a finite density magnetized media. This piece, though in general is odd in the external magnetic field strength $eB$; in this work we however have retained terms to $O$($eB$). We are able to demonstrate in this work that, in magnetized medium a dilatonic scalar field $(φ)$ can excite the two transverse degrees of freedom (DOF) of the photons. One due to direct coupling and the other indirectly through the parity violating term originating due to magnetized medium effects. This results in the mixing dynamics being governed by, $3\times 3$ mixing matrices. This mixing results in making the underlying media optically active. In this work we focus on the spectro-polarimetric imprints of these particles, on the spectra of the electromagnetic (EM) fields of Gamma Ray Bursters (GRB). Focusing on a range of parameters (i.e., magnetic field strength, plasma frequency $(ω_{p})$, size of the magnetized volume, coupling strength to photons and their mass) we make an attempt to point out how space-borne detectors should be designed to optimise their detection possibility.
△ Less
Submitted 27 December, 2020;
originally announced December 2020.
-
Universal Adversarial Perturbations: A Survey
Authors:
Ashutosh Chaubey,
Nikhil Agrawal,
Kavya Barnwal,
Keerat K. Guliani,
Pramod Mehta
Abstract:
Over the past decade, Deep Learning has emerged as a useful and efficient tool to solve a wide variety of complex learning problems ranging from image classification to human pose estimation, which is challenging to solve using statistical machine learning algorithms. However, despite their superior performance, deep neural networks are susceptible to adversarial perturbations, which can cause the…
▽ More
Over the past decade, Deep Learning has emerged as a useful and efficient tool to solve a wide variety of complex learning problems ranging from image classification to human pose estimation, which is challenging to solve using statistical machine learning algorithms. However, despite their superior performance, deep neural networks are susceptible to adversarial perturbations, which can cause the network's prediction to change without making perceptible changes to the input image, thus creating severe security issues at the time of deployment of such systems. Recent works have shown the existence of Universal Adversarial Perturbations, which, when added to any image in a dataset, misclassifies it when passed through a target model. Such perturbations are more practical to deploy since there is minimal computation done during the actual attack. Several techniques have also been proposed to defend the neural networks against these perturbations. In this paper, we attempt to provide a detailed discussion on the various data-driven and data-independent methods for generating universal perturbations, along with measures to defend against such perturbations. We also cover the applications of such universal perturbations in various deep learning tasks.
△ Less
Submitted 16 May, 2020;
originally announced May 2020.
-
Mixing dynamics of dimension-five interactions (scalar/pseudoscalar-photon) in magnetized medium
Authors:
Ankur Chaubey,
Manoj K. Jaiswal,
Avijit K. Ganguly
Abstract:
In many extentions of standard model, dimension-5 scalar di-photon ($g_{γγφ}φ$ $F^{μν}F_{μν}$ ) or pseudoscalar di-photon ($g_{γγa}a\tilde{F}^{μν} F_{μν} $, ) interaction materializes due to scale symmetry breaking or $U_A(1)$ symmetry breaking. In a magnetized vacuum (i.e., in an external background field $\bar{F}_{μν} $) the transverse degrees of freedom of the photons-- for such systems-- can b…
▽ More
In many extentions of standard model, dimension-5 scalar di-photon ($g_{γγφ}φ$ $F^{μν}F_{μν}$ ) or pseudoscalar di-photon ($g_{γγa}a\tilde{F}^{μν} F_{μν} $, ) interaction materializes due to scale symmetry breaking or $U_A(1)$ symmetry breaking. In a magnetized vacuum (i.e., in an external background field $\bar{F}_{μν} $) the transverse degrees of freedom of the photons-- for such systems-- can be described in terms of the form factors constructed out of the background field strength tensor ($\bar{F}_{μν} $) and the same for dynamical photon ($f^{μν}$); they happen to be $\bar{F}_{μν}f^{μν}$ and ${\tilde{\bar{F}}_{μν}f^{μν}}$. These form factors transform differently under CP transformation. While $\bar{F}_{μν}f^{μν}$ (describing polarization orthogonal to B ($|γ_{||} >$)) is CP even,
the other one, ${\tilde{\bar{F}}_{μν}f^{μν}}$( describing polarization along B ($|γ_\bot >$), is CP odd. In the interaction Lagrangian, if the scalar is interchanged with the pseudoscalar, the role of the two form factors just gets interchanged. Thus for nearly degenerate strengths of the coupling constants ( $g_{γγφ}$ and $g_{γγ_a}$ ) and masses ($m_φ$ and $m_a$ ) of the respective candidates, proper identification of one from the other may become very difficult in laboratory or astrophysics based experiments. The basic motivation of this investigation is to reduce this uncertainty through incorporation of parity violating ({\it originating through magnetized medium effects }) part of the photon self-energy in the effective Lagrangian. This step, in turn affects the (Pseudo) Scalar Photon mixing dynamics drastically and brings out a significant change in the spectrum of the electromagnetic beam undergoing such interaction.
△ Less
Submitted 1 July, 2019;
originally announced July 2019.