
Showing 1–35 of 35 results for author: Vishwakarma, D K

  1. arXiv:2409.05136

    cs.CL

    MHS-STMA: Multimodal Hate Speech Detection via Scalable Transformer-Based Multilevel Attention Framework

    Authors: Anusha Chhabra, Dinesh Kumar Vishwakarma

    Abstract: Social media has a significant impact on people's lives. Hate speech on social media has emerged as one of society's most serious issues in recent years. Text and pictures are two forms of multimodal data that are distributed within articles. Unimodal analysis has been the primary emphasis of earlier approaches. Additionally, when doing multimodal analysis, researchers neglect to preserve the dist…

    Submitted 17 September, 2024; v1 submitted 8 September, 2024; originally announced September 2024.

  2. arXiv:2409.05134

    cs.CL

    Hate Content Detection via Novel Pre-Processing Sequencing and Ensemble Methods

    Authors: Anusha Chhabra, Dinesh Kumar Vishwakarma

    Abstract: Social media, particularly Twitter, has seen a significant increase in incidents like trolling and hate speech. Thus, identifying hate speech is the need of the hour. This paper introduces a computational framework to curb the hate content on the web. Specifically, this study presents an exhaustive study of pre-processing approaches by studying the impact of changing the sequence of text pre-proce…

    Submitted 8 September, 2024; originally announced September 2024.

  3. arXiv:2409.00896

    cs.CV

    A Noise and Edge extraction-based dual-branch method for Shallowfake and Deepfake Localization

    Authors: Deepak Dagar, Dinesh Kumar Vishwakarma

    Abstract: The trustworthiness of multimedia is being increasingly evaluated by advanced Image Manipulation Localization (IML) techniques, resulting in the emergence of the IML field. An effective manipulation model necessitates the extraction of non-semantic differential features between manipulated and legitimate sections to utilize artifacts. This requires direct comparisons between the two regions. Curr…

    Submitted 1 September, 2024; originally announced September 2024.

  4. arXiv:2408.16892

    cs.CV cs.LG

    Tex-ViT: A Generalizable, Robust, Texture-based dual-branch cross-attention deepfake detector

    Authors: Deepak Dagar, Dinesh Kumar Vishwakarma

    Abstract: Deepfakes, which employ GANs to produce highly realistic facial modification, are widely regarded as the prevailing method. Traditional CNNs have been able to identify bogus media, but they struggle to perform well on different datasets and are vulnerable to adversarial attacks due to their lack of robustness. Vision transformers have demonstrated potential in the realm of image classification probl…

    Submitted 29 August, 2024; originally announced August 2024.

  5. arXiv:2408.10248

    cs.CV cs.AI

    Target-Dependent Multimodal Sentiment Analysis Via Employing Visual-to Emotional-Caption Translation Network using Visual-Caption Pairs

    Authors: Ananya Pandey, Dinesh Kumar Vishwakarma

    Abstract: The natural language processing and multimedia field has seen a notable surge in interest in multimodal sentiment recognition. Hence, this study aims to employ Target-Dependent Multimodal Sentiment Analysis (TDMSA) to identify the level of sentiment associated with every target (aspect) stated within a multimodal post consisting of a visual-caption pair. Despite the recent advancements in multimod…

    Submitted 5 August, 2024; originally announced August 2024.

  6. arXiv:2408.10246

    cs.CV cs.AI cs.CL eess.AS

    VyAnG-Net: A Novel Multi-Modal Sarcasm Recognition Model by Uncovering Visual, Acoustic and Glossary Features

    Authors: Ananya Pandey, Dinesh Kumar Vishwakarma

    Abstract: Various linguistic and non-linguistic clues, such as excessive emphasis on a word, a shift in the tone of voice, or an awkward expression, frequently convey sarcasm. The computer vision problem of sarcasm recognition in conversation aims to identify hidden sarcastic, criticizing, and metaphorical information embedded in everyday dialogue. Previously, sarcasm recognition focused mainly on text. Stil…

    Submitted 5 August, 2024; originally announced August 2024.

  7. arXiv:2408.02595

    cs.CV cs.AI

    Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection

    Authors: Sajal Aggarwal, Ananya Pandey, Dinesh Kumar Vishwakarma

    Abstract: Sarcasm is a type of irony, characterized by an inherent mismatch between the literal interpretation and the intended connotation. Though sarcasm detection in text has been extensively studied, there are situations in which textual input alone might be insufficient to perceive sarcasm. The inclusion of additional contextual cues, such as images, is essential to recognize sarcasm in social media da…

    Submitted 5 August, 2024; originally announced August 2024.

  8. arXiv:2408.02571

    cs.CV cs.AI

    Contrastive Learning-based Multi Modal Architecture for Emoticon Prediction by Employing Image-Text Pairs

    Authors: Ananya Pandey, Dinesh Kumar Vishwakarma

    Abstract: Emoticons are symbolic representations that generally accompany textual content to visually enhance or summarize the true intention of a written message. Although widely utilized in the realm of social media, the core semantics of these emoticons have not been extensively explored based on multiple modalities. Incorporating textual and visual information within a single message develops an…

    Submitted 5 August, 2024; originally announced August 2024.

  9. arXiv:2401.06999

    cs.CV

    Datasets, Clues and State-of-the-Arts for Multimedia Forensics: An Extensive Review

    Authors: Ankit Yadav, Dinesh Kumar Vishwakarma

    Abstract: With the large chunks of social media data being created daily and the parallel rise of realistic multimedia tampering methods, detecting and localising tampering in images and videos has become essential. This survey focusses on approaches for tampering detection in multimedia data using deep learning models. Specifically, it presents a detailed analysis of benchmark datasets for malicious manipu…

    Submitted 13 January, 2024; originally announced January 2024.

  10. arXiv:2401.06998

    cs.CV

    Towards Effective Image Forensics via A Novel Computationally Efficient Framework and A New Image Splice Dataset

    Authors: Ankit Yadav, Dinesh Kumar Vishwakarma

    Abstract: Splice detection models are the need of the hour since splice manipulations can be used to mislead, spread rumors and create disharmony in society. However, there is a severe lack of image splicing datasets, which restricts the capabilities of deep learning models to extract discriminative features without overfitting. This manuscript presents two-fold contributions toward splice detection. Firstl…

    Submitted 13 January, 2024; originally announced January 2024.

  11. arXiv:2401.06995

    cs.CV

    A Visually Attentive Splice Localization Network with Multi-Domain Feature Extractor and Multi-Receptive Field Upsampler

    Authors: Ankit Yadav, Dinesh Kumar Vishwakarma

    Abstract: Image splice manipulation presents a severe challenge in today's society. With easy access to image manipulation tools, it is easier than ever to modify images that can mislead individuals, organizations or society. In this work, a novel "Visually Attentive Splice Localization Network with Multi-Domain Feature Extractor and Multi-Receptive Field Upsampler" has been proposed. It contains a unique…

    Submitted 13 January, 2024; originally announced January 2024.

  12. arXiv:2311.18676

    cs.SI cs.AI

    DQSSA: A Quantum-Inspired Solution for Maximizing Influence in Online Social Networks (Student Abstract)

    Authors: Aryaman Rao, Parth Singh, Dinesh Kumar Vishwakarma, Mukesh Prasad

    Abstract: Influence Maximization is the task of selecting optimal nodes maximising the influence spread in social networks. This study proposes a Discretized Quantum-based Salp Swarm Algorithm (DQSSA) for optimizing influence diffusion in social networks. By discretizing meta-heuristic algorithms and infusing them with quantum-inspired enhancements, we address issues like premature convergence and low effic…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: AAAI Conference on Artificial Intelligence 2024

  13. arXiv:2301.05220

    cs.CL cs.AI cs.LG

    Adversarial Adaptation for French Named Entity Recognition

    Authors: Arjun Choudhry, Inder Khatri, Pankaj Gupta, Aaryan Gupta, Maxime Nicol, Marie-Jean Meurs, Dinesh Kumar Vishwakarma

    Abstract: Named Entity Recognition (NER) is the task of identifying and classifying named entities in large-scale texts into predefined classes. NER in French and other relatively limited-resource languages cannot always benefit from approaches proposed for languages like English due to a dearth of large, robust datasets. In this paper, we present our work that aims to mitigate the effects of this dearth of…

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Preprint version of short paper accepted for the ECIR 2023 conference

  14. arXiv:2212.03692

    cs.CL

    Transformer-Based Named Entity Recognition for French Using Adversarial Adaptation to Similar Domain Corpora

    Authors: Arjun Choudhry, Pankaj Gupta, Inder Khatri, Aaryan Gupta, Maxime Nicol, Marie-Jean Meurs, Dinesh Kumar Vishwakarma

    Abstract: Named Entity Recognition (NER) involves the identification and classification of named entities in unstructured text into predefined classes. NER in languages with limited resources, like French, is still an open problem due to the lack of large, robust, labelled datasets. In this paper, we propose a transformer-based NER approach for French using adversarial adaptation to similar domain or genera…

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Author version of Student Abstract to appear in AAAI 2023 - Student Abstract and Poster Program

  15. arXiv:2211.17200

    cs.SI

    CKS: A Community-based K-shell Decomposition Approach using Community Bridge Nodes for Influence Maximization

    Authors: Inder Khatri, Aaryan Gupta, Arjun Choudhry, Aryan Tyagi, Dinesh Kumar Vishwakarma, Mukesh Prasad

    Abstract: Social networks have enabled user-specific advertisements and recommendations on their platforms, which puts a significant focus on Influence Maximisation (IM) for target advertising and related tasks. The aim is to identify nodes in the network which can maximize the spread of information through a diffusion cascade. We propose a community structures-based approach that employs K-Shell algorithm…

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: Accepted in the Student Abstract & Poster Presentation Track at AAAI 2023

  16. arXiv:2211.17108

    cs.CL

    An Emotion-guided Approach to Domain Adaptive Fake News Detection using Adversarial Learning

    Authors: Arkajyoti Chakraborty, Inder Khatri, Arjun Choudhry, Pankaj Gupta, Dinesh Kumar Vishwakarma, Mukesh Prasad

    Abstract: Recent works on fake news detection have shown the efficacy of using emotions as a feature for improved performance. However, the cross-domain impact of emotion-guided features for fake news detection still remains an open problem. In this work, we propose an emotion-guided, domain-adaptive, multi-task approach for cross-domain fake news detection, proving the efficacy of emotion-guided models in…

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: Accepted in the Student Abstract & Poster Presentation track at AAAI 2023. arXiv admin note: substantial text overlap with arXiv:2211.13718

  17. arXiv:2211.13718

    cs.CL

    Emotion-guided Cross-domain Fake News Detection using Adversarial Domain Adaptation

    Authors: Arjun Choudhry, Inder Khatri, Arkajyoti Chakraborty, Dinesh Kumar Vishwakarma, Mukesh Prasad

    Abstract: Recent works on fake news detection have shown the efficacy of using emotions as a feature or emotions-based features for improved performance. However, the impact of these emotion-guided features for fake news detection in cross-domain settings, where we face the problem of domain shift, is still largely unexplored. In this work, we evaluate the impact of emotion-guided features for cross-domain…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted as a Short Paper in the 19th International Conference on Natural Language Processing (ICON) 2022

  18. arXiv:2211.12374

    cs.CL cs.LG

    An Emotion-Aware Multi-Task Approach to Fake News and Rumour Detection using Transfer Learning

    Authors: Arjun Choudhry, Inder Khatri, Minni Jain, Dinesh Kumar Vishwakarma

    Abstract: Social networking sites, blogs, and online articles are instant sources of news for internet users globally. However, in the absence of strict regulations mandating the genuineness of every text on social media, it is probable that some of these texts are fake news or rumours. Their deceptive nature and ability to propagate instantly can have an adverse effect on society. This necessitates the nee…

    Submitted 7 December, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted in IEEE Transactions on Computational Social Systems; 18 pages, 5 figures

  19. arXiv:2211.09683

    cs.SI physics.soc-ph

    Influence Maximization in Social Networks using Discretized Harris Hawks Optimization Algorithm and Neighbour Scout Strategy

    Authors: Inder Khatri, Arjun Choudhry, Aryaman Rao, Aryan Tyagi, Dinesh Kumar Vishwakarma, Mukesh Prasad

    Abstract: Influence Maximization (IM) is the task of determining k optimal influential nodes in a social network to maximize the influence spread using a propagation model. IM is a prominent problem for viral marketing, and helps significantly in social media advertising. However, developing effective algorithms with minimal time complexity for real-world social networks still remains a challenge. While tra…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 24 pages, 7 figures

  20. arXiv:2211.09657

    cs.SI cs.AI

    A Spreader Ranking Algorithm for Extremely Low-budget Influence Maximization in Social Networks using Community Bridge Nodes

    Authors: Aaryan Gupta, Inder Khatri, Arjun Choudhry, Pranav Chandhok, Dinesh Kumar Vishwakarma, Mukesh Prasad

    Abstract: In recent years, social networking platforms have gained significant popularity among the masses for connecting with people and propagating one's thoughts and opinions. This has opened the door to user-specific advertisements and recommendations on these platforms, bringing along a significant focus on Influence Maximisation (IM) on social networks due to its wide applicability in target advertisi…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 21 pages, 7 figures

  21. arXiv:2112.08611

    cs.SI

    Clickbait in YouTube Prevention, Detection and Analysis of the Bait using Ensemble Learning

    Authors: Peya Mowar, Mini Jain, Ruchika Goel, Dinesh Kumar Vishwakarma

    Abstract: Unscrupulous content creators on YouTube employ deceptive techniques such as spam and clickbait to reach a broad audience and trick users into clicking on their videos to increase their advertisement revenue. Clickbait detection on YouTube requires an in-depth examination and analysis of the intricate relationship between the video content and the video descriptors (title and thumbnail). However, the cu…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 26 pages, 16 figures

  22. arXiv:2109.13476

    cs.SI

    Fake News Detection using Semi-Supervised Graph Convolutional Network

    Authors: Priyanka Meel, Dinesh Kumar Vishwakarma

    Abstract: Social media has become the central way for people to obtain and use news because it distributes information rapidly and cheaply. However, these features also make social media platforms a root cause of fake news distribution, with adverse consequences for both people and culture. Hence, detecting fake news has become a significant research interest for bringing feasible real t…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 25 pages, 7 figures

  23. arXiv:2109.13063

    cs.IR

    An Automated Multi-Web Platform Voting Framework to Predict Misleading Information Proliferated during COVID-19 Outbreak using Ensemble Method

    Authors: Deepika Varshney, Dinesh Kumar Vishwakarma

    Abstract: The spread of misleading information on social web platforms has fuelled huge panic and confusion among the public regarding the Corona disease, making the detection of such information of paramount importance. To address this issue, in this paper, we have developed an automated system that can collect and validate facts from multiple web platforms to decide the credibility of the content. To identify the credibil…

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: 22 pages, 6 figures

  24. arXiv:2109.12547

    cs.SI

    Multi-modal Fusion using Fine-tuned Self-attention and Transfer Learning for Veracity Analysis of Web Information

    Authors: Priyanka Meel, Dinesh Kumar Vishwakarma

    Abstract: The nuisance of misinformation and fake news has escalated manyfold since the advent of online social networks. Human consciousness and decision-making capabilities are negatively influenced by manipulated, fabricated, biased or unverified news posts. Therefore, there is a high demand for designing veracity analysis systems to detect fake information content in multiple data modalities. In an a…

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 31 pages, 12 figures

  25. arXiv:2109.09929

    cs.SI

    A Unified Approach of Detecting Misleading Images via Tracing its Instances on Web and Analysing its Past Context for the Verification of Content

    Authors: Deepika Varshney, Dinesh Kumar Vishwakarma

    Abstract: The verification of multimedia content over social media is a challenging and crucial issue that is gaining prominence in an age where user-generated content and online social web platforms are the leading sources in shaping and propagating news stories. As these sources allow users to share their opinions without restriction, opportunistic users often post misleading…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 22 pages, 8 figures

  26. arXiv:2109.06488

    cs.CL

    Multilevel profiling of situation and dialogue-based deep networks for movie genre classification using movie trailers

    Authors: Dinesh Kumar Vishwakarma, Mayank Jindal, Ayush Mittal, Aditya Sharma

    Abstract: Automated movie genre classification has emerged as an active and essential area of research and exploration. Short-duration movie trailers provide useful insights about the movie, as video content consists of cognitive and affective level features. Previous approaches focused on either cognitive or affective content analysis. In this paper, we propose a novel multi-modality: situati…

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: 21 pages, 7 figures

  27. arXiv:2105.05708

    cs.CV

    Deep and Shallow Covariance Feature Quantization for 3D Facial Expression Recognition

    Authors: Walid Hariri, Nadir Farah, Dinesh Kumar Vishwakarma

    Abstract: Facial expression recognition (FER) of 3D face scans has received a significant amount of attention in recent years. Most facial expression recognition methods have been proposed using mainly 2D images. These methods suffer from several issues like illumination changes and pose variations. Moreover, 2D mapping from 3D images may lack some geometric and topological characteristics of the fa…

    Submitted 12 May, 2021; originally announced May 2021.

  28. arXiv:2012.13318

    cs.CV

    Person Re-Identification using Deep Learning Networks: A Systematic Review

    Authors: Ankit Yadav, Dinesh Kumar Vishwakarma

    Abstract: Person re-identification has received a lot of attention from the research community in recent times. Due to its vital role in security-based applications, person re-identification lies at the heart of research relevant to tracking robberies, preventing terrorist attacks and other security-critical events. While the last decade has seen tremendous growth in re-id approaches, very little review lit…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 34 pages, 15 figures

  29. A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

    Authors: Ashima Yadav, Dinesh Kumar Vishwakarma

    Abstract: Multimodal sentiment analysis has attracted increasing attention with broad application prospects. Existing methods focus on a single modality, which fails to capture social media content spanning multiple modalities. Moreover, in multi-modal learning, most works have focused on simply combining the two modalities, without exploring the complicated correlations between them. This resulte…

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: 11 pages, 7 figures

    Journal ref: ACM Transactions on Multimedia Computing, Communications, and Applications, 2022

  30. arXiv:2011.10358

    cs.CL cs.IR

    A Deep Language-independent Network to analyze the impact of COVID-19 on the World via Sentiment Analysis

    Authors: Ashima Yadav, Dinesh Kumar Vishwakarma

    Abstract: Towards the end of 2019, Wuhan experienced an outbreak of novel coronavirus, which soon spread all over the world, resulting in a deadly pandemic that infected millions of people around the globe. The government and public health agencies followed many strategies to counter the fatal virus. However, the virus severely affected the social and economic lives of the people. In this paper, we extract…

    Submitted 20 November, 2020; originally announced November 2020.

  31. View-invariant Deep Architecture for Human Action Recognition using late fusion

    Authors: Chhavi Dhiman, Dinesh Kumar Vishwakarma

    Abstract: Human action recognition for unknown views is a challenging task. We propose a view-invariant deep human action recognition framework, which is a novel integration of two important action cues: motion and shape temporal dynamics (STD). The motion stream encapsulates the motion content of action as RGB Dynamic Images (RGB-DIs) which are processed by the fine-tuned InceptionV3 model. The STD stream…

    Submitted 8 December, 2019; originally announced December 2019.

    Comments: 10 pages, 7 figures

    Report number: 8960517

    Journal ref: 2019

  32. arXiv:1912.00576

    cs.CV

    Skeleton based Activity Recognition by Fusing Part-wise Spatio-temporal and Attention Driven Residues

    Authors: Chhavi Dhiman, Dinesh Kumar Vishwakarma, Paras Aggarwal

    Abstract: A wide range of intra-class variation within the same actions, together with inter-class similarity among actions, makes action recognition in videos very challenging. In this paper, we present a novel skeleton-based part-wise Spatiotemporal CNN RIAC Network-based 3D human action recognition framework to visualise the action dynamics in a part-wise manner and utilise ea…

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: 20 pages, 9 figures

  33. arXiv:1903.04090

    cs.CV

    A Hybrid Framework for Action Recognition in Low-Quality Video Sequences

    Authors: Tej Singh, Dinesh Kumar Vishwakarma

    Abstract: Vision-based activity recognition is essential for security, monitoring and surveillance applications. Further, real-time analysis often involves low-quality video that contains less information about the surroundings because of poor illumination and occlusions. It therefore needs a more robust and integrated model for low-quality and night security operations. In this context, we proposed a hybrid model for illum…

    Submitted 10 March, 2019; originally announced March 2019.

    Comments: 13 pages, 9 figures

  34. A Deep Structure of Person Re-Identification using Multi-Level Gaussian Models

    Authors: Dinesh Kumar Vishwakarma, Sakshi Upadhyay

    Abstract: Person re-identification is widely used in forensic, security, and surveillance systems, but it is a challenging task in real-life scenarios. Hence, in this work, a new feature descriptor model has been proposed using a multilayer framework of Gaussian distribution model on pixel features, which include color moments, color space values and Schmid filter responses.…

    Submitted 20 May, 2018; originally announced May 2018.

    Comments: 9 pages

    Report number: 8469037

    Journal ref: IEEE Transactions on Multi-Scale Computing Systems 4 (2018) 513 - 521

  35. arXiv:1611.06683

    cs.CV

    Covariate conscious approach for Gait recognition based upon Zernike moment invariants

    Authors: Himanshu Aggarwal, Dinesh K. Vishwakarma

    Abstract: Gait recognition, i.e., identification of an individual from his/her walking pattern, is an emerging field. While existing gait recognition techniques perform satisfactorily in normal walking conditions, their performance tends to suffer drastically with variations in clothing and carrying conditions. In this work, we propose a novel covariate cognizant framework to deal with the presence of such cova…

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: 11 pages