Skip to main content

Showing 1–40 of 40 results for author: Prasad, D K

.
  1. arXiv:2504.02912  [pdf, other

    cs.CV cs.AI cs.ET cs.LG

    Haphazard Inputs as Images in Online Learning

    Authors: Rohit Agarwal, Aryan Dessai, Arif Ahmed Sekh, Krishna Agarwal, Alexander Horsch, Dilip K. Prasad

    Abstract: The field of varying feature space in online learning settings, also known as haphazard inputs, is very prominent nowadays due to its applicability in various fields. However, the current solutions to haphazard inputs are model-dependent and cannot benefit from the existing advanced deep-learning methods, which necessitate inputs of fixed dimensions. Therefore, we propose to transform the varying… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Accepted at IJCNN 2025

  2. arXiv:2501.13135  [pdf, other

    q-bio.OT cs.AI physics.med-ph q-bio.SC

    Applications and Challenges of AI and Microscopy in Life Science Research: A Review

    Authors: Himanshu Buckchash, Gyanendra Kumar Verma, Dilip K. Prasad

    Abstract: The complexity of human biology and its intricate systems holds immense potential for advancing human health, disease treatment, and scientific discovery. However, traditional manual methods for studying biological interactions are often constrained by the sheer volume and complexity of biological data. Artificial Intelligence (AI), with its proven ability to analyze vast datasets, offers a transf… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

  3. arXiv:2410.17394  [pdf, other

    cs.LG cs.AI

    packetLSTM: Dynamic LSTM Framework for Streaming Data with Varying Feature Space

    Authors: Rohit Agarwal, Karaka Prasanth Naidu, Alexander Horsch, Krishna Agarwal, Dilip K. Prasad

    Abstract: We study the online learning problem characterized by the varying input feature space of streaming data. Although LSTMs have been employed to effectively capture the temporal nature of streaming data, they cannot handle the dimension-varying streams in an online learning setting. Therefore, we propose a dynamic LSTM-based novel method, called packetLSTM, to model the dimension-varying streams. The… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  4. arXiv:2409.10242  [pdf, other

    cs.LG cs.AI

    Hedging Is Not All You Need: A Simple Baseline for Online Learning Under Haphazard Inputs

    Authors: Himanshu Buckchash, Momojit Biswas, Rohit Agarwal, Dilip K. Prasad

    Abstract: Handling haphazard streaming data, such as data from edge devices, presents a challenging problem. Over time, the incoming data becomes inconsistent, with missing, faulty, or new inputs reappearing. Therefore, it requires models that are reliable. Recent methods to solve this problem depend on a hedging-based solution and require specialized elements like auxiliary dropouts, forked architectures,… ▽ More

    Submitted 30 December, 2024; v1 submitted 16 September, 2024; originally announced September 2024.

  5. arXiv:2405.05777  [pdf, other

    cs.CL cs.AI

    Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language

    Authors: Ronny Paul, Himanshu Buckchash, Shantipriya Parida, Dilip K. Prasad

    Abstract: Sámi, an indigenous language group comprising multiple languages, faces digital marginalization due to the limited availability of data and sophisticated language models designed for its linguistic intricacies. This work focuses on increasing technological participation for the Sámi language. We draw the attention of the ML community towards the language modeling problem of Ultra Low Resource (ULR… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  6. arXiv:2404.04903  [pdf, other

    cs.LG cs.AI

    Online Learning under Haphazard Input Conditions: A Comprehensive Review and Analysis

    Authors: Rohit Agarwal, Arijit Das, Alexander Horsch, Krishna Agarwal, Dilip K. Prasad

    Abstract: The domain of online learning has experienced multifaceted expansion owing to its prevalence in real-life applications. Nonetheless, this progression operates under the assumption that the input feature space of the streaming data remains constant. In this survey paper, we address the topic of online learning in the context of haphazard inputs, explicitly foregoing such an assumption. We discuss,… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  7. arXiv:2311.02538  [pdf, other

    cs.CV cs.AI

    Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols

    Authors: Iqra Qasim, Alexander Horsch, Dilip K. Prasad

    Abstract: Untrimmed videos have interrelated events, dependencies, context, overlapping events, object-object interactions, domain specificity, and other semantics that are worth highlighting while describing a video in natural language. Owing to such a vast diversity, a single sentence can only correctly describe a portion of the video. Dense Video Captioning (DVC) aims at detecting and describing differen… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 35 pages, 10 figures

  8. arXiv:2309.08698  [pdf, other

    cs.AI cs.LG

    No Imputation Needed: A Switch Approach to Irregularly Sampled Time Series

    Authors: Rohit Agarwal, Aman Sinha, Ayan Vishwakarma, Xavier Coubez, Marianne Clausel, Mathieu Constant, Alexander Horsch, Dilip K. Prasad

    Abstract: Modeling irregularly-sampled time series (ISTS) is challenging because of missing values. Most existing methods focus on handling ISTS by converting irregularly sampled data into regularly sampled data via imputation. These models assume an underlying missing mechanism, which may lead to unwanted bias and sub-optimal performance. We present SLAN (Switch LSTM Aggregate Network), which utilizes a gr… ▽ More

    Submitted 19 August, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  9. arXiv:2308.06983  [pdf, other

    cs.CV cs.AI cs.LG

    pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based Unsupervised Representation Learning Problems

    Authors: Momojit Biswas, Himanshu Buckchash, Dilip K. Prasad

    Abstract: Nearest neighbor (NN) sampling provides more semantic variations than pre-defined transformations for self-supervised learning (SSL) based image recognition problems. However, its performance is restricted by the quality of the support set, which holds positive samples for the contrastive loss. In this work, we show that the quality of the support set plays a crucial role in any nearest neighbor b… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 15 pages, 5 figures

  10. arXiv:2307.04149  [pdf, other

    cs.CV cs.AI cs.LG

    Latent Graph Attention for Enhanced Spatial Context

    Authors: Ayush Singh, Yash Bhambhu, Himanshu Buckchash, Deepak K. Gupta, Dilip K. Prasad

    Abstract: Global contexts in images are quite valuable in image-to-image translation problems. Conventional attention-based and graph-based models capture the global context to a large extent, however, these are computationally expensive. Moreover, the existing approaches are limited to only learning the pairwise semantic relation between any two points on the image. In this paper, we present Latent Graph A… ▽ More

    Submitted 12 July, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

    Comments: 20 pages, 7 figures

  11. arXiv:2306.05974  [pdf, other

    physics.optics

    Taxonomy of hybridly polarized Stokes vortex beams

    Authors: Gauri Arora, Ankit Butola, Ruchi Rajput, Rohit Agarwal, Krishna Agarwal, Alexander Horsch, Dilip K Prasad, Paramasivam Senthilkumaran

    Abstract: Structured beams carrying topological defects, namely phase and Stokes singularities, have gained extensive interest in numerous areas of optics. The non-separable spin and orbital angular momentum states of hybridly polarized Stokes singular beams provide additional freedom for manipulating optical fields. However, the characterization of hybridly polarized Stokes vortex beams remains challenging… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  12. arXiv:2303.05155  [pdf, other

    cs.LG cs.AI

    Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts

    Authors: Rohit Agarwal, Deepak Gupta, Alexander Horsch, Dilip K. Prasad

    Abstract: Many real-world applications based on online learning produce streaming data that is haphazard in nature, i.e., contains missing features, features becoming obsolete in time, the appearance of new features at later points in time and a lack of clarity on the total number of input features. These challenges make it hard to build a learnable system for such applications, and almost no work exists in… ▽ More

    Submitted 31 May, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted at Transactions on Machine Learning Research (TMLR). Link: https://openreview.net/pdf?id=R9CgBkeZ6Z

    Journal ref: Transactions on Machine Learning Research, 2023

  13. arXiv:2303.03050  [pdf, other

    cs.CV cs.AI cs.IR

    MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval

    Authors: Rohit Agarwal, Gyanendra Das, Saksham Aggarwal, Alexander Horsch, Dilip K. Prasad

    Abstract: Image retrieval has garnered growing interest in recent times. The current approaches are either supervised or self-supervised. These methods do not exploit the benefits of hybrid learning using both supervision and self-supervision. We present a novel Master Assistant Buddy Network (MABNet) for image retrieval which incorporates both learning mechanisms. MABNet consists of master and assistant bl… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023

  14. arXiv:2303.02095  [pdf, other

    cs.CV cs.LG

    Data-Efficient Training of CNNs and Transformers with Coresets: A Stability Perspective

    Authors: Animesh Gupta, Irtiza Hasan, Dilip K. Prasad, Deepak K. Gupta

    Abstract: Coreset selection is among the most effective ways to reduce the training time of CNNs, however, only limited is known on how the resultant models will behave under variations of the coreset size, and choice of datasets and models. Moreover, given the recent paradigm shift towards transformer-based models, it is still an open question how coreset selection would impact their performance. There are… ▽ More

    Submitted 10 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  15. arXiv:2303.01546  [pdf, other

    eess.IV cs.CV

    MiShape: 3D Shape Modelling of Mitochondria in Microscopy

    Authors: Abhinanda R. Punnakkal, Suyog S Jadhav, Alexander Horsch, Krishna Agarwal, Dilip K. Prasad

    Abstract: Fluorescence microscopy is a quintessential tool for observing cells and understanding the underlying mechanisms of life-sustaining processes of all living organisms. The problem of extracting 3D shape of mitochondria from fluorescence microscopy images remains unsolved due to the complex and varied shapes expressed by mitochondria and the poor resolving capacity of these microscopes. We propose a… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  16. arXiv:2301.13817  [pdf, other

    cs.CV

    Patch Gradient Descent: Training Neural Networks on Very Large Images

    Authors: Deepak K. Gupta, Gowreesh Mago, Arnav Chavan, Dilip K. Prasad

    Abstract: Traditional CNN models are trained and tested on relatively low resolution images (<300 px), and cannot be directly operated on large-scale images due to compute and memory constraints. We propose Patch Gradient Descent (PatchGD), an effective learning strategy that allows to train the existing CNN architectures on large-scale images in an end-to-end manner. PatchGD is based on the hypothesis that… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  17. arXiv:2211.13769  [pdf, other

    cs.CV cs.AI cs.LG

    On Designing Light-Weight Object Trackers through Network Pruning: Use CNNs or Transformers?

    Authors: Saksham Aggarwal, Taneesh Gupta, Pawan Kumar Sahu, Arnav Chavan, Rishabh Tiwari, Dilip K. Prasad, Deepak K. Gupta

    Abstract: Object trackers deployed on low-power devices need to be light-weight, however, most of the current state-of-the-art (SOTA) methods rely on using compute-heavy backbones built using CNNs or transformers. Large sizes of such models do not allow their deployment in low-power conditions and designing compressed variants of large tracking models is of great importance. This paper demonstrates how high… ▽ More

    Submitted 26 March, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted at IEEE ICASSP 2023

  18. arXiv:2211.06739  [pdf, other

    cs.CV

    Partial Binarization of Neural Networks for Budget-Aware Efficient Learning

    Authors: Udbhav Bamba, Neeraj Anand, Saksham Aggarwal, Dilip K. Prasad, Deepak K. Gupta

    Abstract: Binarization is a powerful compression technique for neural networks, significantly reducing FLOPs, but often results in a significant drop in model performance. To address this issue, partial binarization techniques have been developed, but a systematic approach to mixing binary and full-precision parameters in a single network is still lacking. In this paper, we propose a controlled approach to… ▽ More

    Submitted 8 November, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: Accepted at WACV 2023 Conference

  19. arXiv:2206.12681  [pdf, other

    cs.CV

    UltraMNIST Classification: A Benchmark to Train CNNs for Very Large Images

    Authors: Deepak K. Gupta, Udbhav Bamba, Abhishek Thakur, Akash Gupta, Suraj Sharan, Ertugrul Demir, Dilip K. Prasad

    Abstract: Convolutional neural network (CNN) approaches available in the current literature are designed to work primarily with low-resolution images. When applied on very large images, challenges related to GPU memory, smaller receptive field than needed for semantic correspondence and the need to incorporate multi-scale features arise. The resolution of input images can be reduced, however, with significa… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

  20. arXiv:2111.09109  [pdf, other

    cs.LG

    Physics-guided Loss Functions Improve Deep Learning Performance in Inverse Scattering

    Authors: Zicheng Liu, Mayank Roy, Dilip K. Prasad, Krishna Agarwal

    Abstract: Solving electromagnetic inverse scattering problems (ISPs) is challenging due to the intrinsic nonlinearity, ill-posedness, and expensive computational cost. Recently, deep neural network (DNN) techniques have been successfully applied on ISPs and shown potential of superior imaging over conventional methods. In this paper, we analyse the analogy between DNN solvers and traditional iterative algor… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

  21. Artefact removal in ground truth and noise model deficient sub-cellular nanoscopy images using auto-encoder deep learning

    Authors: Suyog Jadhav, Sebastian Acuña, Krishna Agarwal, Dilip K. prasad

    Abstract: Image denoising or artefact removal using deep learning is possible in the availability of supervised training dataset acquired in real experiments or synthesized using known noise models. Neither of the conditions can be fulfilled for nanoscopy (super-resolution optical microscopy) images that are generated from microscopy videos through statistical analysis techniques. Due to several physical co… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

    Comments: 22 pages, 13 figures

  22. arXiv:2008.12617  [pdf, other

    eess.IV cs.CV

    Simulation-supervised deep learning for analysing organelles states and behaviour in living cells

    Authors: Arif Ahmed Sekh, Ida S. Opstad, Rohit Agarwal, Asa Birna Birgisdottir, Truls Myrmel, Balpreet Singh Ahluwalia, Krishna Agarwal, Dilip K. Prasad

    Abstract: In many real-world scientific problems, generating ground truth (GT) for supervised learning is almost impossible. The causes include limitations imposed by scientific instrument, physical phenomenon itself, or the complexity of modeling. Performing artificial intelligence (AI) tasks such as segmentation, tracking, and analytics of small sub-cellular structures such as mitochondria in microscopy v… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: under review at NIPS 2020

  23. arXiv:2008.11828  [pdf, other

    cs.LG stat.ML

    Auxiliary Network: Scalable and agile online learning for dynamic system with inconsistently available inputs

    Authors: Rohit Agarwal, Arif Ahmed Sekh, Krishna Agarwal, Dilip K. Prasad

    Abstract: Streaming classification methods assume the number of input features is fixed and always received. But in many real-world scenarios demand is some input features are reliable while others are unreliable or inconsistent. In this paper, we propose a novel deep learning-based model called Auxiliary Network (Aux-Net), which is scalable and agile. It employs a weighted ensemble of classifiers to give a… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: under review at NIPS 2020

  24. arXiv:2008.06713  [pdf, other

    eess.IV cs.CV

    Single image dehazing for a variety of haze scenarios using back projected pyramid network

    Authors: Ayush Singh, Ajay Bhave, Dilip K. Prasad

    Abstract: Learning to dehaze single hazy images, especially using a small training dataset is quite challenging. We propose a novel generative adversarial network architecture for this problem, namely back projected pyramid network (BPPNet), that gives good performance for a variety of challenging haze conditions, including dense haze and inhomogeneous haze. Our architecture incorporates learning of multipl… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: 16 pages, 8 figures, to be published in Computer Vision ECCV 2020 Workshops

  25. arXiv:2007.02397  [pdf

    physics.bio-ph eess.IV

    High space-bandwidth in quantitative phase imaging using partially spatially coherent optical coherence microscopy and deep neural network

    Authors: Ankit Butola, Sheetal Raosaheb Kanade, Sunil Bhatt, Vishesh Kumar Dubey, Anand Kumar, Azeem Ahmad, Dilip K Prasad, Paramasivam Senthilkumaran, Balpreet Singh Ahluwalia, Dalip Singh Mehta

    Abstract: Quantitative phase microscopy (QPM) is a label-free technique that enables to monitor morphological changes at subcellular level. The performance of the QPM system in terms of spatial sensitivity and resolution depends on the coherence properties of the light source and the numerical aperture (NA) of objective lenses. Here, we propose high space-bandwidth QPM using partially spatially coherent opt… ▽ More

    Submitted 5 July, 2020; originally announced July 2020.

  26. arXiv:2004.00959  [pdf, other

    q-bio.PE cs.LG

    Neural network based country wise risk prediction of COVID-19

    Authors: Ratnabali Pal, Arif Ahmed Sekh, Samarjit Kar, Dilip K. Prasad

    Abstract: The recent worldwide outbreak of the novel coronavirus (COVID-19) has opened up new challenges to the research community. Artificial intelligence (AI) driven methods can be useful to predict the parameters, risks, and effects of such an epidemic. Such predictions can be helpful to control and prevent the spread of such diseases. The main challenges of applying AI is the small volume of data and th… ▽ More

    Submitted 16 September, 2020; v1 submitted 31 March, 2020; originally announced April 2020.

    Journal ref: Applied Sciences, 2020

  27. arXiv:2002.07377  [pdf

    physics.bio-ph eess.IV q-bio.QM

    High spatially sensitive quantitative phase imaging assisted with deep neural network for classification of human spermatozoa under stressed condition

    Authors: Ankit Butola, Daria Popova, Dilip K Prasad, Azeem Ahmad, Anowarul Habib, Jean Claude Tinguely, Purusotam Basnet, Ganesh Acharya, Paramasivam Senthilkumaran, Dalip Singh Mehta, Balpreet Singh Ahluwalia

    Abstract: Sperm cell motility and morphology observed under the bright field microscopy are the only criteria for selecting particular sperm cell during Intracytoplasmic Sperm Injection (ICSI) procedure of Assisted Reproductive Technology (ART). Several factors such as, oxidative stress, cryopreservation, heat, smoking and alcohol consumption, are negatively associated with the quality of sperm cell and fer… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

  28. arXiv:2002.00707  [pdf

    eess.IV

    Subsurface defect imaging in PZT ceramics using dual point contact excitation and detection

    Authors: H. Mahawar, K. Agarwal, D. K. Prasad, F. Melandso, A. Habib

    Abstract: The application of piezoelectric materials, such as Lead Zirconate Titanate (ZrxTi1-x) O3 (PZT) is increasing in multiple dynamic industries such as structural health monitoring, wireless energy harvesting devices, measuring blood flow, etc.The main aim of this paper is to denoise the images generated by dual point excitation and detection method for subsurface damage detection. Nonetheless, these… ▽ More

    Submitted 4 December, 2019; originally announced February 2020.

    Comments: 2 pages, 3 figures

  29. arXiv:1902.05657  [pdf, other

    cs.CV

    TMAV: Temporal Motionless Analysis of Video using CNN in MPSoC

    Authors: Somdip Dey, Amit K. Singh, Dilip K. Prasad, Klaus D. McDonald-Maier

    Abstract: Analyzing video for traffic categorization is an important pillar of Intelligent Transport Systems. However, it is difficult to analyze and predict traffic based on image frames because the representation of each frame may vary significantly within a short time period. This also would inaccurately represent the traffic over a longer period of time such as the case of video. We propose a novel bio-… ▽ More

    Submitted 18 February, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

    Comments: 11 pages, 5 figures, 2 tables

    ACM Class: I.4; I.2.1; C.1.4

  30. arXiv:1902.04955  [pdf, other

    cs.CV

    Can We Automate Diagrammatic Reasoning?

    Authors: Sk. Arif Ahmed, Debi Prosad Dogra, Samarjit Kar, Partha Pratim Roy, Dilip K. Prasad

    Abstract: Learning to solve diagrammatic reasoning (DR) can be a challenging but interesting problem to the computer vision research community. It is believed that next generation pattern recognition applications should be able to simulate human brain to understand and analyze reasoning of images. However, due to the lack of benchmarks of diagrammatic reasoning, the present research primarily focuses on vis… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

  31. arXiv:1812.09271  [pdf

    cs.CV

    Polygonal approximation of digital planar curve using novel significant measure

    Authors: Mangayarkarasi Ramaiah, Dilip K. Prasad

    Abstract: This paper presents an iterative smoothing technique for polygonal approximation of digital image boundary. The technique starts with finest initial segmentation points of a curve. The contribution of initially segmented points towards preserving the original shape of the image boundary is determined by computing the significant measure of every initial segmentation points which is sensitive to sh… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: 17 pages,15 figures

  32. arXiv:1812.02487  [pdf

    physics.med-ph physics.bio-ph

    Deep learning architecture LightOCT for diagnostic decision support using optical coherence tomography images of biological samples

    Authors: Ankit Butola, Dilip K. Prasad, Azeem Ahmad, Vishesh Dubey, Darakhshan Qaiser, Anurag Srivastava, Paramsivam Senthilkumaran, Balpreet Singh Ahluwalia, Dalip Singh Mehta

    Abstract: Optical coherence tomography (OCT) is being increasingly adopted as a label-free and non-invasive technique for biomedical applications such as cancer and ocular disease diagnosis. Diagnostic information for these tissues is manifest in textural and geometric features of the OCT images, which are used by human expertise to interpret and triage. However, it suffers delays due to the long process of… ▽ More

    Submitted 6 July, 2020; v1 submitted 6 December, 2018; originally announced December 2018.

  33. arXiv:1810.08317  [pdf

    cs.RO

    Enabling Grasp Action: Generalized Evaluation of Grasp Stability via Contact Stiffness from Contact Mechanics Insight

    Authors: Huixu Dong, Chen Qiu, Dilip K. Prasad, Ye Pan, Jiansheng Dai, I-Ming Chen

    Abstract: Performing a grasp is a pivotal capability for a robotic gripper. We propose a new evaluation approach of grasping stability via constructing a model of grasping stiffness based on the theory of contact mechanics. First, the mathematical models are built to explore soft contact and the general grasp stiffness between a finger and an object. Next, the grasping stiffness matrix is constructed to ref… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: 12 pages, 14 figures

  34. arXiv:1809.04659  [pdf, other

    cs.CV

    Are object detection assessment criteria ready for maritime computer vision?

    Authors: Dilip K. Prasad, Huixu Dong, Deepu Rajan, Chai Quek

    Abstract: Maritime vessels equipped with visible and infrared cameras can complement other conventional sensors for object detection. However, application of computer vision techniques in maritime domain received attention only recently. The maritime environment offers its own unique requirements and challenges. Assessment of the quality of detections is a fundamental need in computer vision. However, the c… ▽ More

    Submitted 17 November, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

    Journal ref: IEEE Transactions on Intelligent Transportation Systems (2020)

  35. arXiv:1702.00754  [pdf

    cs.CV

    Maritime situational awareness using adaptive multi-sensor management under hazy conditions

    Authors: D. K. Prasad, C. K. Prasath, D. Rajan, L. Rachmawati, E. Rajabally, C. Quek

    Abstract: This paper presents a multi-sensor architecture with an adaptive multi-sensor management system suitable for control and navigation of autonomous maritime vessels in hazy and poor-visibility conditions. This architecture resides in the autonomous maritime vessels. It augments the data from on-board imaging sensors and weather sensors with the AIS data and weather data from sensors on other vessels… ▽ More

    Submitted 2 February, 2017; originally announced February 2017.

    Comments: 11 pages, 2 figures, MTEC 2017

  36. arXiv:1701.08378  [pdf, other

    cs.CV

    MSCM-LiFe: Multi-scale cross modal linear feature for horizon detection in maritime images

    Authors: D. K. Prasad, D. Rajan, C. K. Prasath, L. Rachmawati, E. Rajabaly, C. Quek

    Abstract: This paper proposes a new method for horizon detection called the multi-scale cross modal linear feature. This method integrates three different concepts related to the presence of horizon in maritime images to increase the accuracy of horizon detection. Specifically it uses the persistence of horizon in multi-scale median filtering, and its detection as a linear feature commonly detected by two d… ▽ More

    Submitted 29 January, 2017; originally announced January 2017.

    Comments: 5 pages, 4 figures, IEEE TENCON 2016

  37. arXiv:1611.05842  [pdf, other

    cs.CV

    Video Processing from Electro-optical Sensors for Object Detection and Tracking in Maritime Environment: A Survey

    Authors: D. K. Prasad, D. Rajan, L. Rachmawati, E. Rajabaly, C. Quek

    Abstract: We present a survey on maritime object detection and tracking approaches, which are essential for the development of a navigational system for autonomous ships. The electro-optical (EO) sensor considered here is a video camera that operates in the visible or the infrared spectra, which conventionally complement radar and sonar and have demonstrated effectiveness for situational awareness at sea ha… ▽ More

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: 23 pages

  38. arXiv:1608.01079  [pdf, other

    cs.CV

    Challenges in video based object detection in maritime scenario using computer vision

    Authors: D. K. Prasad, C. K. Prasath, D. Rajan, L. Rachmawati, E. Rajabaly, C. Quek

    Abstract: This paper discusses the technical challenges in maritime image processing and machine vision problems for video streams generated by cameras. Even well documented problems of horizon detection and registration of frames in a video are very challenging in maritime scenarios. More advanced problems of background subtraction and object detection in video streams are very challenging. Challenges aris… ▽ More

    Submitted 3 August, 2016; originally announced August 2016.

  39. arXiv:1305.3885  [pdf

    cs.CV cs.CG

    Geometric primitive feature extraction - concepts, algorithms, and applications

    Authors: Dilip K. Prasad

    Abstract: This thesis presents important insights and concepts related to the topic of the extraction of geometric primitives from the edge contours of digital images. Three specific problems related to this topic have been studied, viz., polygonal approximation of digital curves, tangent estimation of digital curves, and ellipse fitting anddetection from digital curves. For the problem of polygonal approxi… ▽ More

    Submitted 16 May, 2013; originally announced May 2013.

    Comments: 333 pages

  40. arXiv:1302.5189  [pdf

    cs.CV

    Object Detection in Real Images

    Authors: Dilip K. Prasad

    Abstract: Object detection and recognition are important problems in computer vision. Since these problems are meta-heuristic, despite a lot of research, practically usable, intelligent, real-time, and dynamic object detection/recognition methods are still unavailable. We propose a new object detection/recognition method, which improves over the existing methods in every stage of the object detection/recogn… ▽ More

    Submitted 21 February, 2013; originally announced February 2013.