Skip to main content

Showing 51–100 of 159 results for author: Nag, S

.
  1. arXiv:2307.05463  [pdf, other

    cs.CV

    EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone

    Authors: Shraman Pramanick, Yale Song, Sayan Nag, Kevin Qinghong Lin, Hardik Shah, Mike Zheng Shou, Rama Chellappa, Pengchuan Zhang

    Abstract: Video-language pre-training (VLP) has become increasingly important due to its ability to generalize to various vision and language tasks. However, existing egocentric VLP frameworks utilize separate video and language encoders and learn task-specific cross-modal information only during fine-tuning, limiting the development of a unified system. In this work, we introduce the second generation of e… ▽ More

    Submitted 18 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Published in ICCV 2023

  2. arXiv:2306.02680  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion

    Authors: Ahana Deb, Sayan Nag, Ayan Mahapatra, Soumitri Chattopadhyay, Aritra Marik, Pijush Kanti Gayen, Shankha Sanyal, Archi Banerjee, Samir Karmakar

    Abstract: Spoken languages often utilise intonation, rhythm, intensity, and structure, to communicate intention, which can be interpreted differently depending on the rhythm of speech of their utterance. These speech acts provide the foundation of communication and are unique in expression to the language. Recent advancements in attention-based models, demonstrating their ability to learn powerful represent… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

  3. arXiv:2304.00733  [pdf, other

    cs.CV

    Unbiased Scene Graph Generation in Videos

    Authors: Sayak Nag, Kyle Min, Subarna Tripathi, Amit K. Roy Chowdhury

    Abstract: The task of dynamic scene graph generation (SGG) from videos is complicated and challenging due to the inherent dynamics of a scene, temporal fluctuation of model predictions, and the long-tailed distribution of the visual relationships in addition to the already existing challenges in image-based SGG. Existing methods for dynamic SGG have primarily focused on capturing spatio-temporal context usi… ▽ More

    Submitted 29 June, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Published in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  4. arXiv:2303.14863  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

    Authors: Sauradip Nag, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang

    Abstract: We propose a new formulation of temporal action detection (TAD) with denoising diffusion, DiffTAD in short. Taking as input random temporal proposals, it can yield action proposals accurately given an untrimmed long video. This presents a generative modeling perspective, against previous discriminative learning manners. This capability is achieved by first diffusing the ground-truth proposals to r… ▽ More

    Submitted 14 July, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: ICCV 2023; Code available at https://github.com/sauradip/DiffusionTAD

  5. arXiv:2303.09695  [pdf, other

    cs.CV cs.GR cs.MM

    PersonalTailor: Personalizing 2D Pattern Design from 3D Garment Point Clouds

    Authors: Sauradip Nag, Anran Qi, Xiatian Zhu, Ariel Shamir

    Abstract: Garment pattern design aims to convert a 3D garment to the corresponding 2D panels and their sewing structure. Existing methods rely either on template fitting with heuristics and prior assumptions, or on model learning with complicated shape parameterization. Importantly, both approaches do not allow for personalization of the output garment, which today has increasing demands. To fill this deman… ▽ More

    Submitted 11 August, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Technical Report

  6. arXiv:2303.05556  [pdf, other

    cs.CV

    An Evaluation of Non-Contrastive Self-Supervised Learning for Federated Medical Image Analysis

    Authors: Soumitri Chattopadhyay, Soham Ganguly, Sreejit Chaudhury, Sayan Nag, Samiran Chattopadhyay

    Abstract: Privacy and annotation bottlenecks are two major issues that profoundly affect the practicality of machine learning-based medical image analysis. Although significant progress has been made in these areas, these issues are not yet fully resolved. In this paper, we seek to tackle these concerns head-on and systematically explore the applicability of non-contrastive self-supervised learning (SSL) al… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  7. arXiv:2303.02245  [pdf, other

    cs.CV

    Exploring Self-Supervised Representation Learning For Low-Resource Medical Image Analysis

    Authors: Soumitri Chattopadhyay, Soham Ganguly, Sreejit Chaudhury, Sayan Nag, Samiran Chattopadhyay

    Abstract: The success of self-supervised learning (SSL) has mostly been attributed to the availability of unlabeled yet large-scale datasets. However, in a specialized domain such as medical imaging which is a lot different from natural images, the assumption of data availability is unrealistic and impractical, as the data itself is scanty and found in small databases, collected for specific prognosis tasks… ▽ More

    Submitted 28 June, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted at IEEE ICIP 2023

  8. ViTA: A Vision Transformer Inference Accelerator for Edge Applications

    Authors: Shashank Nag, Gourav Datta, Souvik Kundu, Nitin Chandrachoodan, Peter A. Beerel

    Abstract: Vision Transformer models, such as ViT, Swin Transformer, and Transformer-in-Transformer, have recently gained significant traction in computer vision tasks due to their ability to capture the global relation between features which leads to superior performance. However, they are compute-heavy and difficult to deploy in resource-constrained edge devices. Existing hardware accelerators, including t… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted at ISCAS 2023

    Journal ref: 2023 IEEE International Symposium on Circuits and Systems (ISCAS), Monterey, CA, USA, 2023, pp. 1-5

  9. arXiv:2211.14924  [pdf, other

    cs.CV

    Post-Processing Temporal Action Detection

    Authors: Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

    Abstract: Existing Temporal Action Detection (TAD) methods typically take a pre-processing step in converting an input varying-length video into a fixed-length snippet representation sequence, before temporal boundary estimation and action classification. This pre-processing step would temporally downsample the video, reducing the inference resolution and hampering the detection performance in the original… ▽ More

    Submitted 3 March, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: CVPR 2023; Code available at https://github.com/sauradip/GAP

  10. arXiv:2211.14905  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Multi-Modal Few-Shot Temporal Action Detection

    Authors: Sauradip Nag, Mengmeng Xu, Xiatian Zhu, Juan-Manuel Perez-Rua, Bernard Ghanem, Yi-Zhe Song, Tao Xiang

    Abstract: Few-shot (FS) and zero-shot (ZS) learning are two different approaches for scaling temporal action detection (TAD) to new classes. The former adapts a pretrained vision model to a new task represented by as few as a single video per class, whilst the latter requires no training examples by exploiting a semantic description of the new class. In this work, we introduce a new multi-modality few-shot… ▽ More

    Submitted 27 March, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Technical Report

  11. arXiv:2210.15075  [pdf, other

    cs.CV

    IDEAL: Improved DEnse locAL Contrastive Learning for Semi-Supervised Medical Image Segmentation

    Authors: Hritam Basak, Soumitri Chattopadhyay, Rohit Kundu, Sayan Nag, Rammohan Mallipeddi

    Abstract: Due to the scarcity of labeled data, Contrastive Self-Supervised Learning (SSL) frameworks have lately shown great potential in several medical image analysis tasks. However, the existing contrastive mechanisms are sub-optimal for dense pixel-level segmentation tasks due to their inability to mine local features. To this end, we extend the concept of metric learning to the segmentation task, using… ▽ More

    Submitted 2 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Paper accepted for publication at IEEE ICASSP 2023

  12. arXiv:2210.04135  [pdf, other

    cs.CV cs.LG cs.MM

    VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment

    Authors: Shraman Pramanick, Li Jing, Sayan Nag, Jiachen Zhu, Hardik Shah, Yann LeCun, Rama Chellappa

    Abstract: Vision-language pre-training (VLP) has recently proven highly effective for various uni- and multi-modal downstream applications. However, most existing end-to-end VLP methods use high-resolution image-text box data to perform well on fine-grained region-level tasks, such as object detection, segmentation, and referring expression comprehension. Unfortunately, such high-resolution images with accu… ▽ More

    Submitted 29 October, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Published in TMLR 2023

  13. arXiv:2209.08905  [pdf, ps, other

    nucl-ex nucl-th

    Shape evolution in the rapidly rotating $^{140}$Gd nucleus

    Authors: H. Pai, S. Rajbanshi, Somnath Nag, Sajad Ali, R. Palit, G. Mukherjee, F. S. Babra, R. Banik, Soumik Bhattacharya, S. Biswas, S. Chakraborty, R. Donthi, S. Jadhav, Md. S. R. Laskar, B. S. Naidu, S. Nandi, A. Goswami

    Abstract: Ground state band of $^{140}$Gd has been investigated following their population in the $^{112}$Sn($^{35}$Cl,~$α$p2n)$^{140}$Gd reaction at 195 MeV of beam energy using a large array of Compton suppressed HPGe clovers as the detection setup. Apart from other spectroscopic measurements, level lifetimes of the states have been extracted using the Doppler Shift Attenuation Method. Extracted quadrupol… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  14. arXiv:2208.00955  [pdf, other

    cs.CV

    Large-Scale Product Retrieval with Weakly Supervised Representation Learning

    Authors: Xiao Han, Kam Woh Ng, Sauradip Nag, Zhiyu Qu

    Abstract: Large-scale weakly supervised product retrieval is a practically useful yet computationally challenging problem. This paper introduces a novel solution for the eBay Visual Search Challenge (eProduct) held at the Ninth Workshop on Fine-Grained Visual Categorisation workshop (FGVC9) of CVPR 2022. This competition presents two challenges: (a) E-commerce is a drastically fine-grained domain including… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: FGVC9 CVPR2022

  15. arXiv:2207.08184  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    Zero-Shot Temporal Action Detection via Vision-Language Prompting

    Authors: Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

    Abstract: Existing temporal action detection (TAD) methods rely on large training data including segment-level annotations, limited to recognizing previously seen classes alone during inference. Collecting and annotating a large training set for each class of interest is costly and hence unscalable. Zero-shot TAD (ZS-TAD) resolves this obstacle by enabling a pre-trained model to recognize any unseen action… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: ECCV 2022; Code available at https://github.com/sauradip/STALE

  16. arXiv:2207.07059  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Semi-Supervised Temporal Action Detection with Proposal-Free Masking

    Authors: Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

    Abstract: Existing temporal action detection (TAD) methods rely on a large number of training data with segment-level annotations. Collecting and annotating such a training set is thus highly expensive and unscalable. Semi-supervised TAD (SS-TAD) alleviates this problem by leveraging unlabeled videos freely available at scale. However, SS-TAD is also a much more challenging problem than supervised TAD, and… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: ECCV 2022; Code available at https://github.com/sauradip/SPOT

  17. arXiv:2207.06580  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning

    Authors: Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

    Abstract: Existing temporal action detection (TAD) methods rely on generating an overwhelmingly large number of proposals per video. This leads to complex model designs due to proposal generation and/or per-proposal action instance evaluation and the resultant high computational cost. In this work, for the first time, we propose a proposal-free Temporal Action detection model with Global Segmentation mask (… ▽ More

    Submitted 19 August, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: ECCV 2022; Code available at https://github.com/sauradip/TAGS

  18. ACLNet: An Attention and Clustering-based Cloud Segmentation Network

    Authors: Dhruv Makwana, Subhrajit Nag, Onkar Susladkar, Gayatri Deshmukh, Sai Chandra Teja R, Sparsh Mittal, C Krishna Mohan

    Abstract: We propose a novel deep learning model named ACLNet, for cloud segmentation from ground images. ACLNet uses both deep neural network and machine learning (ML) algorithm to extract complementary features. Specifically, it uses EfficientNet-B0 as the backbone, "`a trous spatial pyramid pooling" (ASPP) to learn at multiple receptive fields, and "global attention module" (GAM) to extract finegrained d… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: 11 pages, 3 figures, 5 tables, Published in remote sensing letters

    Journal ref: volume 13, pages 865-875, year 2022

  19. arXiv:2207.06001  [pdf, other

    q-bio.QM

    Studying the age of onset and detection of Chronic Myeloid Leukemia using a three-stage stochastic model

    Authors: Suryadeepto Nag, Ananda Shikhara Bhat, Siddhartha P. Chakrabarty

    Abstract: Chronic Myeloid Leukemia (CML) is a biphasic malignant clonal disorder that progresses, first with a chronic phase, where the cells have enhanced proliferation only, and then to a blast phase, where the cells have the ability of self-renewal. It is well-recognized that the Philadelphia chromosome (which contains the BCR-ABL fusion gene) is the "hallmark of CML". However, empirical studies have sho… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

  20. WaferSegClassNet -- A Light-weight Network for Classification and Segmentation of Semiconductor Wafer Defects

    Authors: Subhrajit Nag, Dhruv Makwana, Sai Chandra Teja R, Sparsh Mittal, C Krishna Mohan

    Abstract: As the integration density and design intricacy of semiconductor wafers increase, the magnitude and complexity of defects in them are also on the rise. Since the manual inspection of wafer defects is costly, an automated artificial intelligence (AI) based computer-vision approach is highly desired. The previous works on defect analysis have several limitations, such as low accuracy and the need fo… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: 11 pages, 2 figures, 7 tables, Published in Computers in Industry

    Journal ref: Volume 142, 2022, 103720, ISSN 0166-3615,

  21. arXiv:2207.00506  [pdf, other

    cs.CV cs.CG

    How Far Can I Go ? : A Self-Supervised Approach for Deterministic Video Depth Forecasting

    Authors: Sauradip Nag, Nisarg Shah, Anran Qi, Raghavendra Ramachandra

    Abstract: In this paper we present a novel self-supervised method to anticipate the depth estimate for a future, unobserved real-world urban scene. This work is the first to explore self-supervised learning for estimation of monocular depth of future unobserved frames of a video. Existing works rely on a large number of annotated samples to generate the probabilistic prediction of depth for unseen frames. H… ▽ More

    Submitted 8 July, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted in ML4AD Workshop, NeurIPS 2021

  22. arXiv:2111.07042  [pdf

    cs.RO eess.SY

    Agile Satellite Planning for Multi-Payload Observations for Earth Science

    Authors: Rich Levinson, Sreeja Nag, Vinay Ravindra

    Abstract: We present planning challenges, methods and preliminary results for a new model-based paradigm for earth observing systems in adaptive remote sensing. Our heuristically guided constraint optimization planner produces coordinated plans for multiple satellites, each with multiple instruments (payloads). The satellites are agile, meaning they can quickly maneuver to change viewing angles in response… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

    Journal ref: International Workshop on Planning & Scheduling for Space (IWPSS) 2021

  23. arXiv:2110.10552  [pdf, other

    cs.CV cs.LG cs.MM

    Few-Shot Temporal Action Localization with Query Adaptive Transformer

    Authors: Sauradip Nag, Xiatian Zhu, Tao Xiang

    Abstract: Existing temporal action localization (TAL) works rely on a large number of training videos with exhaustive segment-level annotation, preventing them from scaling to new classes. As a solution to this problem, few-shot TAL (FS-TAL) aims to adapt a model to a new class represented by as few as a single video. Exiting FS-TAL methods assume trimmed training videos for new classes. However, this setti… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: BMVC 2021

  24. arXiv:2109.04572  [pdf, other

    cs.LG cs.AI physics.data-an

    Deciphering Environmental Air Pollution with Large Scale City Data

    Authors: Mayukh Bhattacharyya, Sayan Nag, Udita Ghosh

    Abstract: Air pollution poses a serious threat to sustainable environmental conditions in the 21st century. Its importance in determining the health and living standards in urban settings is only expected to increase with time. Various factors ranging from artificial emissions to natural phenomena are known to be primary causal agents or influencers behind rising air pollution levels. However, the lack of l… ▽ More

    Submitted 15 June, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted as a Oral Spotlight Paper at International Joint Conference of Artificial Intelligence (IJCAI) 2022

  25. arXiv:2108.09598  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    SERF: Towards better training of deep neural networks using log-Softplus ERror activation Function

    Authors: Sayan Nag, Mayukh Bhattacharyya

    Abstract: Activation functions play a pivotal role in determining the training dynamics and neural network performance. The widely adopted activation function ReLU despite being simple and effective has few disadvantages including the Dying ReLU problem. In order to tackle such problems, we propose a novel activation function called Serf which is self-regularized and nonmonotonic in nature. Like Mish, Serf… ▽ More

    Submitted 24 August, 2021; v1 submitted 21 August, 2021; originally announced August 2021.

  26. arXiv:2108.00340  [pdf, other

    cs.CV

    Reconstruction guided Meta-learning for Few Shot Open Set Recognition

    Authors: Sayak Nag, Dripta S. Raychaudhuri, Sujoy Paul, Amit K. Roy-Chowdhury

    Abstract: In many applications, we are constrained to learn classifiers from very limited data (few-shot classification). The task becomes even more challenging if it is also required to identify samples from unknown categories (open-set classification). Learning a good abstraction for a class with very few samples is extremely difficult, especially under open-set settings. As a result, open-set recognition… ▽ More

    Submitted 30 September, 2023; v1 submitted 31 July, 2021; originally announced August 2021.

    Comments: Accepted for publication in IEEE Transactions in Pattern Analysis and Machine Intelligence (TPAMI)

  27. arXiv:2107.06518  [pdf, other

    q-fin.RM

    Single Event Transition Risk: A Measure for Long Term Carbon Exposure

    Authors: Suryadeepto Nag, Siddhartha P. Chakrabarty, Sankarshan Basu

    Abstract: Although there is a growing consensus that a low-carbon transition will be necessary to mitigate the accelerated climate change, the magnitude of transition-risk for investors is difficult to measure exactly. Investors are therefore constrained by the unavailability of suitable measures to quantify the magnitude of the risk and are forced to use the likes of absolute emissions data or ESG scores i… ▽ More

    Submitted 25 May, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

  28. arXiv:2105.12247  [pdf, other

    cs.LG cs.AI cs.CG cs.CV stat.ML

    GraphVICRegHSIC: Towards improved self-supervised representation learning for graphs with a hyrbid loss function

    Authors: Sayan Nag

    Abstract: Self-supervised learning and pre-training strategieshave developed over the last few years especiallyfor Convolutional Neural Networks (CNNs). Re-cently application of such methods can also be no-ticed for Graph Neural Networks (GNNs) . In thispaper, we have used a graph based self-supervisedlearning strategy with different loss functions (Bar-low Twins[Zbontaret al., 2021], HSIC[Tsaiet al.,2021],… ▽ More

    Submitted 26 November, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Paper Accepted in the Weakly Supervised Representation Learning Workshop, IJCAI 2021 (IJCAI2021-WSRL)

  29. arXiv:2105.02687  [pdf, ps, other

    gr-qc astro-ph.CO hep-th

    Anisotropic Multiverse with Varying $c$, $G$ and Study of Thermodynamics

    Authors: Ujjal Debnath, Soumak Nag

    Abstract: We assume the anisotropic model of the Universe in the framework of varying speed of light $c$ and varying gravitational constant $G$ theories and study different types of singularities. For the singularity models, we write the scale factors in terms of cosmic time and found some conditions for possible singularities. For future singularities, we assume the forms of varying speed of light and vary… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 8 pages

  30. arXiv:2105.00643  [pdf, other

    q-bio.PE

    Modeling the dynamics of COVID-19 transmission in India: Social Distancing, Regional Spread and Healthcare Capacity

    Authors: Suryadeepto Nag, Siddhartha P. Chakrabarty

    Abstract: In the new paradigm of health-centric governance, policy makers are in a constant need for appropriate metrics and estimates in order to determine the best policies in a non-arbitrary fashion. Thus, in this paper, a compartmentalized model for the transmission of COVID-19 is developed to facilitate policy making. A socially distanced compartment is added to the model and its utility in quantifying… ▽ More

    Submitted 19 April, 2022; v1 submitted 3 May, 2021; originally announced May 2021.

  31. arXiv:2104.04636  [pdf, ps, other

    math.PR

    Continuous-Time Higher Order Markov Chains: Formulation and Parameter Estimation

    Authors: Suryadeepto Nag

    Abstract: Stochastic processes find applications in modelling systems in a variety of disciplines. A large number of stochastic models considered are Markovian in nature. It is often observed that higher order Markov processes can model the data better. However most higher order Markov models are discrete. Here, we propose a novel continuous-time formulation of higher order Markov processes, as stochastic d… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

  32. arXiv:2102.07940  [pdf, other

    eess.SY

    Attitude Trajectory Optimization for Agile Satellites in Autonomous Remote Sensing Constellation

    Authors: Emmanuel Sin, Sreeja Nag, Vinay Ravindra, Alan Li, Murat Arcak

    Abstract: Agile attitude maneuvering maximizes the utility of remote sensing satellite constellations. By taking into account a satellite's physical properties and its actuator specifications, we may leverage the full performance potential of the attitude control system to conduct agile remote sensing beyond conventional slew-and-stabilize maneuvers. Employing a constellation of agile satellites, coordinate… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 24 pages, 27 figures

  33. arXiv:2102.06038  [pdf

    cs.SD cs.CL eess.AS

    A Fractal Approach to Characterize Emotions in Audio and Visual Domain: A Study on Cross-Modal Interaction

    Authors: Sayan Nag, Uddalok Sarkar, Shankha Sanyal, Archi Banerjee, Souparno Roy, Samir Karmakar, Ranjan Sengupta, Dipak Ghosh

    Abstract: It is already known that both auditory and visual stimulus is able to convey emotions in human mind to different extent. The strength or intensity of the emotional arousal vary depending on the type of stimulus chosen. In this study, we try to investigate the emotional arousal in a cross-modal scenario involving both auditory and visual stimulus while studying their source characteristics. A robus… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  34. arXiv:2102.06003  [pdf

    cs.SD cs.CL eess.AS

    Language Independent Emotion Quantification using Non linear Modelling of Speech

    Authors: Uddalok Sarkar, Sayan Nag, Chirayata Bhattacharya, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: At present emotion extraction from speech is a very important issue due to its diverse applications. Hence, it becomes absolutely necessary to obtain models that take into consideration the speaking styles of a person, vocal tract information, timbral qualities and other congenital information regarding his voice. Our speech production system is a nonlinear system like most other real world system… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  35. arXiv:2102.00616  [pdf

    cs.SD cs.LG cs.MM eess.AS

    Neural Network architectures to classify emotions in Indian Classical Music

    Authors: Uddalok Sarkar, Sayan Nag, Medha Basu, Archi Banerjee, Shankha Sanyal, Ranjan Sengupta, Dipak Ghosh

    Abstract: Music is often considered as the language of emotions. It has long been known to elicit emotions in human being and thus categorizing music based on the type of emotions they induce in human being is a very intriguing topic of research. When the task comes to classify emotions elicited by Indian Classical Music (ICM), it becomes much more challenging because of the inherent ambiguity associated wi… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  36. arXiv:2101.05458  [pdf, ps, other

    q-bio.NC nlin.CD physics.med-ph

    On the stability of equilibria of the physiologically-informed dynamic causal model

    Authors: Sayan Nag

    Abstract: Experimental manipulations perturb the neuronal activity. This phenomenon is manifested in the fMRI response. Dynamic causal model and its variants can model these neuronal responses along with the BOLD responses [1, 2, 3, 4, 5] . Physiologically-informed DCM (P-DCM) [5] gives state-of-the-art results in this aspect. But, P-DCM has more parameters compared to the standard DCM model and the stabili… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  37. arXiv:2012.05694  [pdf

    cs.CV cs.AI cs.GR cs.LG physics.data-an

    Lookahead optimizer improves the performance of Convolutional Autoencoders for reconstruction of natural images

    Authors: Sayan Nag

    Abstract: Autoencoders are a class of artificial neural networks which have gained a lot of attention in the recent past. Using the encoder block of an autoencoder the input image can be compressed into a meaningful representation. Then a decoder is employed to reconstruct the compressed representation back to a version which looks like the input image. It has plenty of applications in the field of data com… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

  38. arXiv:2010.09946  [pdf

    eess.SY astro-ph.IM

    Planning a Reference Constellation for Radiometric Cross-Calibration of Commercial Earth Observing Sensors

    Authors: Sreeja Nag, Philip Dabney, Vinay Ravindra, Cody Anderson

    Abstract: The Earth Observation planning community has access to tools that can propagate orbits and compute coverage of Earth observing imagers with customizable shapes and orientation, model the expected Earth Reflectance at various bands, epochs and directions, generate simplified instrument performance metrics for imagers and radars, and schedule single and multiple spacecraft payload operations. We are… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Journal ref: International Workshop on Planning and Scheduling for Space, Berkeley CA, July 2019

  39. arXiv:2010.09940  [pdf

    eess.SY

    Autonomous Scheduling of Agile Spacecraft Constellations with Delay Tolerant Networking for Reactive Imaging

    Authors: Sreeja Nag, Alan S. Li, Vinay Ravindra, Marc Sanchez Net, Kar-Ming Cheung, Rod Lammers, Brian Bledsoe

    Abstract: Small spacecraft now have precise attitude control systems available commercially, allowing them to slew in 3 degrees of freedom, and capture images within short notice. When combined with appropriate software, this agility can significantly increase response rate, revisit time and coverage. In prior work, we have demonstrated an algorithmic framework that combines orbital mechanics, attitude cont… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Journal ref: International Conference on Automated Planning and Scheduling SPARK Workshop, Berkeley, July 2019

  40. arXiv:2010.03350  [pdf, other

    q-fin.TR

    Modeling the commodity prices of base metals in Indian commodity market using a Higher Order Markovian Approach

    Authors: Suryadeepto Nag, Sankarshan Basu, Siddhartha P. Chakrabarty

    Abstract: A Higher Order Markovian (HOM) model to capture the dynamics of commodity prices is proposed as an alternative to a Markovian model. In particular, the order of the former model, is taken to be the delay, in the response of the industry, to the market information. This is then empirically analyzed for the prices of Copper Mini and four other bases metals, namely Aluminum, Lead, Nickel and Zinc, in… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  41. arXiv:2006.15100  [pdf, other

    cs.LG eess.SP stat.ML

    E2GC: Energy-efficient Group Convolution in Deep Neural Networks

    Authors: Nandan Kumar Jha, Rajat Saini, Subhrajit Nag, Sparsh Mittal

    Abstract: The number of groups ($g$) in group convolution (GConv) is selected to boost the predictive performance of deep neural networks (DNNs) in a compute and parameter efficient manner. However, we show that naive selection of $g$ in GConv creates an imbalance between the computational complexity and degree of data reuse, which leads to suboptimal energy efficiency in DNNs. We devise an optimum group si… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: Accepted as a conference paper in 2020 33rd International Conference on VLSI Design and 2020 19th International Conference on Embedded Systems (VLSID)

    ACM Class: I.5.1; I.5.2; I.5.5; C.0

    Journal ref: VLSID (2020) 155-160

  42. High spin states of $^{204}$At: isomeric states and shears band structure

    Authors: D. Kanjilal, S. K. Dey, S. S. Bhattacharjee, A. Bisoi, M. Das, C. C. Dey, S. Nag, R. Palit, S. Ray, S. Saha, J. Sethi, S. Saha

    Abstract: High-spin states of neutron deficient Trans-Lead nucleus $^{204}$At were populated up to $\sim 8\,{\rm MeV}$ excitation through the $^{12}$C + $^{197}$Au fusion evaporation reaction. Decay of the associated levels through prompt and delayed $γ$-ray emissions were studied to evaluate the underlying nuclear structure. The level scheme, which was partly known, was extended further. An isomeric… ▽ More

    Submitted 1 September, 2022; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this article is published in The European Physical Journal A and is available online at https://doi.org/10.1140/epja/s10050-022-00809-4

    Journal ref: Eur. Phys. J. A (2022) 58:159

  43. arXiv:2005.12524  [pdf

    cs.CV cs.MM

    A New Unified Method for Detecting Text from Marathon Runners and Sports Players in Video

    Authors: Sauradip Nag, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein

    Abstract: Detecting text located on the torsos of marathon runners and sports players in video is a challenging issue due to poor quality and adverse effects caused by flexible/colorful clothing, and different structures of human bodies or actions. This paper presents a new unified method for tackling the above challenges. The proposed method fuses gradient magnitude and direction coherence of text pixels i… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted in Pattern Recognition, Elsevier

  44. arXiv:2004.08248  [pdf

    eess.AS cs.SD nlin.CD q-bio.NC

    Acoustical classification of different speech acts using nonlinear methods

    Authors: Chirayata Bhattacharyya, Sourya Sengupta, Sayan Nag, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: A recitation is a way of combining the words together so that they have a sense of rhythm and thus an emotional content is imbibed within. In this study we envisaged to answer these questions in a scientific manner taking into consideration 5 (five) well known Bengali recitations of different poets conveying a variety of moods ranging from joy to sorrow. The clips were recited as well as read (in… ▽ More

    Submitted 5 August, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: 6 pages, 2 figures; Proceedings of WESPAC 2018, New Delhi, India, November 11-15, 2018

  45. arXiv:2004.07820  [pdf

    cs.SD cs.CL eess.AS

    Speaker Recognition in Bengali Language from Nonlinear Features

    Authors: Uddalok Sarkar, Soumyadeep Pal, Sayan Nag, Chirayata Bhattacharya, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: At present Automatic Speaker Recognition system is a very important issue due to its diverse applications. Hence, it becomes absolutely necessary to obtain models that take into consideration the speaking style of a person, vocal tract information, timbral qualities of his voice and other congenital information regarding his voice. The study of Bengali speech recognition and speaker identification… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: text overlap with arXiv:1612.00171, arXiv:1601.07709

  46. arXiv:2004.02071  [pdf, ps, other

    cs.CL

    Incorporating Bilingual Dictionaries for Low Resource Semi-Supervised Neural Machine Translation

    Authors: Sreyashi Nag, Mihir Kale, Varun Lakshminarasimhan, Swapnil Singhavi

    Abstract: We explore ways of incorporating bilingual dictionaries to enable semi-supervised neural machine translation. Conventional back-translation methods have shown success in leveraging target side monolingual data. However, since the quality of back-translation models is tied to the size of the available parallel corpora, this could adversely impact the synthetically generated sentences in a low resou… ▽ More

    Submitted 4 April, 2020; originally announced April 2020.

  47. arXiv:1912.05014  [pdf, other

    cs.CV cs.LG cs.MM

    Hybrid Style Siamese Network: Incorporating style loss in complementary apparels retrieval

    Authors: Mayukh Bhattacharyya, Sayan Nag

    Abstract: Image Retrieval grows to be an integral part of fashion e-commerce ecosystem as it keeps expanding in multitudes. Other than the retrieval of visually similar items, the retrieval of visually compatible or complementary items is also an important aspect of it. Normal Siamese Networks tend to work well on complementary items retrieval. But it fails to identify low level style features which make it… ▽ More

    Submitted 9 June, 2020; v1 submitted 23 November, 2019; originally announced December 2019.

    Comments: Paper Accepted in the Third Workshop on Computer Vision for Fashion, Art and Design, CVPR 2020

  48. arXiv:1912.03641  [pdf, other

    cs.CV

    SaLite : A light-weight model for salient object detection

    Authors: Kitty Varghese, Sauradip Nag

    Abstract: Salient object detection is a prevalent computer vision task that has applications ranging from abnormality detection to abnormality processing. Context modelling is an important criterion in the domain of saliency detection. A global context helps in determining the salient object in a given image by contrasting away other objects in the global view of the scene. However, the local context featur… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

    Comments: This was submitted to NCVPRIPG 2019

  49. arXiv:1906.12039  [pdf, ps, other

    cs.CL cs.LG

    Supervised Contextual Embeddings for Transfer Learning in Natural Language Processing Tasks

    Authors: Mihir Kale, Aditya Siddhant, Sreyashi Nag, Radhika Parik, Matthias Grabmair, Anthony Tomasic

    Abstract: Pre-trained word embeddings are the primary method for transfer learning in several Natural Language Processing (NLP) tasks. Recent works have focused on using unsupervised techniques such as language modeling to obtain these embeddings. In contrast, this work focuses on extracting representations from multiple pre-trained supervised models, which enriches word embeddings with task and domain spec… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

    Comments: Appeared in 2nd Learning from Limited Labeled Data (LLD) Workshop at ICLR 2019

  50. Can many-body localization persist in the presence of long-range interactions or long-range hopping?

    Authors: Sabyasachi Nag, Arti Garg

    Abstract: We study many-body localization (MBL) in a one-dimensional system of spinless fermions with a deterministic aperiodic potential in the presence of long-range interactions or long-range hopping. Based on perturbative arguments there is a common belief that MBL can exist only in systems with short-range interactions and short-range hopping. We analyze effects of power-law interactions and power-law… ▽ More

    Submitted 13 June, 2019; v1 submitted 18 March, 2019; originally announced March 2019.

    Comments: 13 Figures

    Journal ref: Phys. Rev. B 99, 224203 (2019)