-
OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP
Authors:
Mohamad Hassan N C,
Divyam Gupta,
Mainak Singha,
Sai Bhargav Rongali,
Ankit Jha,
Muhammad Haris Khan,
Biplab Banerjee
Abstract:
We introduce Low-Shot Open-Set Domain Generalization (LSOSDG), a novel paradigm unifying low-shot learning with open-set domain generalization (ODG). While prompt-based methods using models like CLIP have advanced DG, they falter in low-data regimes (e.g., 1-shot) and lack precision in detecting open-set samples with fine-grained semantics related to training classes. To address these challenges,…
▽ More
We introduce Low-Shot Open-Set Domain Generalization (LSOSDG), a novel paradigm unifying low-shot learning with open-set domain generalization (ODG). While prompt-based methods using models like CLIP have advanced DG, they falter in low-data regimes (e.g., 1-shot) and lack precision in detecting open-set samples with fine-grained semantics related to training classes. To address these challenges, we propose OSLOPROMPT, an advanced prompt-learning framework for CLIP with two core innovations. First, to manage limited supervision across source domains and improve DG, we introduce a domain-agnostic prompt-learning mechanism that integrates adaptable domain-specific cues and visually guided semantic attributes through a novel cross-attention module, besides being supported by learnable domain- and class-generic visual prompts to enhance cross-modal adaptability. Second, to improve outlier rejection during inference, we classify unfamiliar samples as "unknown" and train specialized prompts with systematically synthesized pseudo-open samples that maintain fine-grained relationships to known classes, generated through a targeted query strategy with off-the-shelf foundation models. This strategy enhances feature learning, enabling our model to detect open samples with varied granularity more effectively. Extensive evaluations across five benchmarks demonstrate that OSLOPROMPT establishes a new state-of-the-art in LSOSDG, significantly outperforming existing methods.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering
Authors:
Sai Bhargav Rongali,
Mohamad Hassan N C,
Ankit Jha,
Neha Bhargava,
Saurabh Prasad,
Biplab Banerjee
Abstract:
This paper tackles the intricate challenge of video question-answering (VideoQA). Despite notable progress, current methods fall short of effectively integrating questions with video frames and semantic object-level abstractions to create question-aware video representations. We introduce Local-Global Question Aware Video Embedding (LGQAVE), which incorporates three major innovations to integrate…
▽ More
This paper tackles the intricate challenge of video question-answering (VideoQA). Despite notable progress, current methods fall short of effectively integrating questions with video frames and semantic object-level abstractions to create question-aware video representations. We introduce Local-Global Question Aware Video Embedding (LGQAVE), which incorporates three major innovations to integrate multi-modal knowledge better and emphasize semantic visual concepts relevant to specific questions. LGQAVE moves beyond traditional ad-hoc frame sampling by utilizing a cross-attention mechanism that precisely identifies the most relevant frames concerning the questions. It captures the dynamics of objects within these frames using distinct graphs, grounding them in question semantics with the miniGPT model. These graphs are processed by a question-aware dynamic graph transformer (Q-DGT), which refines the outputs to develop nuanced global and local video representations. An additional cross-attention module integrates these local and global embeddings to generate the final video embeddings, which a language model uses to generate answers. Extensive evaluations across multiple benchmarks demonstrate that LGQAVE significantly outperforms existing models in delivering accurate multi-choice and open-ended answers.
△ Less
Submitted 12 December, 2024;
originally announced December 2024.
-
CDAD-Net: Bridging Domain Gaps in Generalized Category Discovery
Authors:
Sai Bhargav Rongali,
Sarthak Mehrotra,
Ankit Jha,
Mohamad Hassan N C,
Shirsha Bose,
Tanisha Gupta,
Mainak Singha,
Biplab Banerjee
Abstract:
In Generalized Category Discovery (GCD), we cluster unlabeled samples of known and novel classes, leveraging a training dataset of known classes. A salient challenge arises due to domain shifts between these datasets. To address this, we present a novel setting: Across Domain Generalized Category Discovery (AD-GCD) and bring forth CDAD-NET (Class Discoverer Across Domains) as a remedy. CDAD-NET is…
▽ More
In Generalized Category Discovery (GCD), we cluster unlabeled samples of known and novel classes, leveraging a training dataset of known classes. A salient challenge arises due to domain shifts between these datasets. To address this, we present a novel setting: Across Domain Generalized Category Discovery (AD-GCD) and bring forth CDAD-NET (Class Discoverer Across Domains) as a remedy. CDAD-NET is architected to synchronize potential known class samples across both the labeled (source) and unlabeled (target) datasets, while emphasizing the distinct categorization of the target data. To facilitate this, we propose an entropy-driven adversarial learning strategy that accounts for the distance distributions of target samples relative to source-domain class prototypes. Parallelly, the discriminative nature of the shared space is upheld through a fusion of three metric learning objectives. In the source domain, our focus is on refining the proximity between samples and their affiliated class prototypes, while in the target domain, we integrate a neighborhood-centric contrastive learning mechanism, enriched with an adept neighborsmining approach. To further accentuate the nuanced feature interrelation among semantically aligned images, we champion the concept of conditional image inpainting, underscoring the premise that semantically analogous images prove more efficacious to the task than their disjointed counterparts. Experimentally, CDAD-NET eclipses existing literature with a performance increment of 8-15% on three AD-GCD benchmarks we present.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
A Holistic Approach on Smart Garment for Patients with Juvenile Idiopathic Arthritis
Authors:
Safal Choudhary,
Princy Randhawa,
Sampath Kumar P Jinka,
Shiva Prasad H. C
Abstract:
Juvenile Idiopathic Arthritis (JIA) is a widespread and chronic condition that affects children and adolescents worldwide. The person suffering from JIA is characterized by chronic joint inflammation leading to pain, swelling, stiffness, and limited body movements. Individuals suffering from JIA require ongoing treatment for their lifetime. Beyond inflammation, JIA patients have expressed concerns…
▽ More
Juvenile Idiopathic Arthritis (JIA) is a widespread and chronic condition that affects children and adolescents worldwide. The person suffering from JIA is characterized by chronic joint inflammation leading to pain, swelling, stiffness, and limited body movements. Individuals suffering from JIA require ongoing treatment for their lifetime. Beyond inflammation, JIA patients have expressed concerns about various factors and the lack of responsive services addressing their challenges. The implementation of smart garments offers a promising solution to assist individuals with Juvenile Idiopathic Arthritis in performing their daily activities. These garments are designed to seamlessly integrate technology and clothing, providing not only physical support but also addressing the psychological and emotional aspects of living with a chronic condition. By incorporating sensors, these smart garments can monitor joint movement, detect inflammation, and provide real-time feedback to both patients and healthcare providers. To tackle these comprehensive challenges, the research aims to offer a solution through the design of a smart garment, created with a holistic approach. This smart garment is intended to improve the overall well-being of JIA patients by enhancing their mobility, comfort, and overall quality of life. The integration of technology into clothing can potentially revolutionize the way JIA is managed, allowing patients to better manage their condition and minimize its impact on their daily lives. The synergy between healthcare and technology holds great potential in addressing the multifaceted challenges posed by Juvenile Idiopathic Arthritis patients. Through innovation and empathy, this research aims to pave the way for a brighter future for individuals living with Juvenile Idiopathic Arthritis.
△ Less
Submitted 25 December, 2023;
originally announced January 2024.
-
Review of Hybrid Load Balancing Algorithms in Cloud Computing Environment
Authors:
Chukwuneke Chiamaka Ijeoma,
Inyiama,
Hyacinth C.,
Amaefule Samuel,
Onyesolu Moses Okechukwu,
Asogwa Doris Chinedu
Abstract:
In cloud computing environment, load balancing is a key issue which is required to distribute the dynamic workload over multiple machines to make certain that no single machine is overloaded. In recent research, many organizations lose significant part of their revenues in handling the requests given by the clients over the web servers i.e. unable to balance the load for web servers which results…
▽ More
In cloud computing environment, load balancing is a key issue which is required to distribute the dynamic workload over multiple machines to make certain that no single machine is overloaded. In recent research, many organizations lose significant part of their revenues in handling the requests given by the clients over the web servers i.e. unable to balance the load for web servers which results in loss of data, delay in time and increased costs. Various static and dynamic algorithms have been proposed and implemented in the past but this have not been fully efficient for load balancing. This gave room to hybrid algorithms. Hybrid methods inherit the properties from both static and dynamic load balancing techniques and attempts at overcoming the limitation of both algorithms. This paper is a study of various hybrid load balancing algorithms in cloud computing environment.
△ Less
Submitted 26 February, 2022;
originally announced February 2022.
-
AI Based Waste classifier with Thermo-Rapid Composting
Authors:
Saswati kumari behera,
Aouthithiye Barathwaj SR Y,
Vasundhara L,
Saisudha G,
Haariharan N C
Abstract:
Waste management is a certainly a very complex and difficult process especially in very large cities. It needs immense man power and also uses up other resources such as electricity and fuel. This creates a need to use a novel method with help of latest technologies. Here in this article we present a new waste classification technique using Computer Vision (CV) and deep learning (DL). To further i…
▽ More
Waste management is a certainly a very complex and difficult process especially in very large cities. It needs immense man power and also uses up other resources such as electricity and fuel. This creates a need to use a novel method with help of latest technologies. Here in this article we present a new waste classification technique using Computer Vision (CV) and deep learning (DL). To further improve waste classification ability, support machine vectors (SVM) are used. We also decompose the degradable waste with help of rapid composting. In this article we have mainly worked on segregation of municipal solid waste (MSW). For this model, we use YOLOv3 (You Only Look Once) a computer vision-based algorithm popularly used to detect objects which is developed based on Convolution Neural Networks (CNNs) which is a machine learning (ML) based tool. They are extensively used to extract features from a data especially image-oriented data. In this article we propose a waste classification technique which will be faster and more efficient. And we decompose the biodegradable waste by Berkley Method of composting (BKC)
△ Less
Submitted 3 August, 2021;
originally announced August 2021.
-
A vendors evaluation using AHP for an Indian steel pipe manufacturing company
Authors:
Giridhar Kamath,
Rakesh Naik,
Shiva Prasad H C
Abstract:
To improve a firms supply chain performance it is essential to have a vendor evaluation process to be able to showcase an organizations success in the present aggressive market. Hence, the process of evaluating the vendor is a crucial task of the purchasing executives in supply chain management. The objective of this research is to propose a methodology to evaluate the vendors for a steel pipe man…
▽ More
To improve a firms supply chain performance it is essential to have a vendor evaluation process to be able to showcase an organizations success in the present aggressive market. Hence, the process of evaluating the vendor is a crucial task of the purchasing executives in supply chain management. The objective of this research is to propose a methodology to evaluate the vendors for a steel pipe manufacturing firm in Gujarat, India. For the purpose of the study, the Analytical Hierarchy Process was used to evaluate the best raw material vendor for this company. Multiple qualitative and quantitative criteria are involved in the vendor evaluation process. To solve the complex problem of vendor evaluation, a tradeoff between this multicriteria is important. The outcomes indicated that the AHP technique makes it simpler to assign weights for the different criteria for evaluating the vendor. Research findings showed that quality is the most important criterion followed by delivery, cost and vendor relationship management.
△ Less
Submitted 31 May, 2018;
originally announced June 2018.
-
Does supplier evaluation impact process improvement?
Authors:
Shiva Prasad H C,
Giridhar Kamath,
Gopalkrishna Barkur,
Rakesh Naik
Abstract:
The research explores and examines factors for supplier evaluation and its impact on process improvement particularly aiming on a steel pipe manufacturing firm in Gujarat, India. Data was collected using in-depth interview. The questionnaire primarily involves the perception of evaluation of supplier. Factors influencing supplier evaluation and its influence on process improvement is also examined…
▽ More
The research explores and examines factors for supplier evaluation and its impact on process improvement particularly aiming on a steel pipe manufacturing firm in Gujarat, India. Data was collected using in-depth interview. The questionnaire primarily involves the perception of evaluation of supplier. Factors influencing supplier evaluation and its influence on process improvement is also examined in this study. The model testing and validation were done using partial least square method. Outcomes signified that the factors that influence the evaluation of the supplier are quality, cost, delivery and supplier relationship management. The study depicted that quality and cost factors for supplier evaluation are insignificant. The delivery and supplier relationship management have the significant influence on the evaluation of the supplier. The research also depicted that supplier evaluation has a significant influence on process improvement. Many researchers have considered quality, cost and delivery as the factors for evaluating the suppliers. But for a company, it is quintessential to have a good relationship with the supplier. Hence, the factor, supplier relationship management is considered for the study. Also, the case study company focused more on quality and cost factors for the supplier evaluation of the firm. However, delivery and supplier relationship management are also equally important for a firm in evaluating the supplier.
△ Less
Submitted 31 May, 2018;
originally announced June 2018.
-
TrueChain: Highly Performant Decentralized Public Ledger
Authors:
Eric Zhang,
Hendrik C,
Yang Liu,
Archit Sharma,
Jasper L
Abstract:
In this paper we present the initial design of Minerva consensus protocol for Truechain and other technical details. Currently, it is widely believed in the blockchain community that a public chain cannot simultaneously achieve high performance, decentralization and security. This is true in the case of a Nakamoto chain (low performance) or a delegated proof of stake chain (partially centralized),…
▽ More
In this paper we present the initial design of Minerva consensus protocol for Truechain and other technical details. Currently, it is widely believed in the blockchain community that a public chain cannot simultaneously achieve high performance, decentralization and security. This is true in the case of a Nakamoto chain (low performance) or a delegated proof of stake chain (partially centralized), which are the most popular block chain solutions at time of writing. Our consensus design enjoys the same consistency, liveness, transaction finality and security guarantee, a de-facto with the Hybrid Consensus. We go on to propose the idea of a new virtual machine on top of Ethereum which adds permissioned-chain based transaction processing capabilities in a permissionless setting. We also use the idea of data sharding and speculative transactions, and evaluation of smart contracts in a sharding friendly virtual machine. Finally, we will briefly discuss our fundamentally ASIC resistant mining algorithm, Truehash.
△ Less
Submitted 1 December, 2018; v1 submitted 3 May, 2018;
originally announced May 2018.
-
An Improved Secretive Coded Caching Scheme exploiting Common Demands
Authors:
Hari Hara Suthan C,
Ishani Chugh,
Prasad Krishnan
Abstract:
Coded caching schemes on broadcast networks with user caches help to offload traffic from peak times to off-peak times by prefetching information from the server to the users during off-peak times and thus serving the users more efficiently during peak times using coded transmissions. We consider the problem of secretive coded caching which was proposed recently, in which a user should not be able…
▽ More
Coded caching schemes on broadcast networks with user caches help to offload traffic from peak times to off-peak times by prefetching information from the server to the users during off-peak times and thus serving the users more efficiently during peak times using coded transmissions. We consider the problem of secretive coded caching which was proposed recently, in which a user should not be able to decode any information about any file that the user has not demanded. We propose a new secretive coded caching scheme which has a lower average rate compared to the existing state-of-the-art scheme, for the same memory available at the users. The proposed scheme is based on exploiting the presence of common demands between multiple users.
△ Less
Submitted 26 August, 2017; v1 submitted 23 May, 2017;
originally announced May 2017.
-
Survey on Various Gesture Recognition Techniques for Interfacing Machines Based on Ambient Intelligence
Authors:
Harshith C,
Karthik R. Shastry,
Manoj Ravindran,
M. V. V. N. S. Srikanth,
Naveen Lakshmikhanth
Abstract:
Gesture recognition is mainly apprehensive on analyzing the functionality of human wits. The main goal of gesture recognition is to create a system which can recognize specific human gestures and use them to convey information or for device control. Hand gestures provide a separate complementary modality to speech for expressing ones ideas. Information associated with hand gestures in a conversati…
▽ More
Gesture recognition is mainly apprehensive on analyzing the functionality of human wits. The main goal of gesture recognition is to create a system which can recognize specific human gestures and use them to convey information or for device control. Hand gestures provide a separate complementary modality to speech for expressing ones ideas. Information associated with hand gestures in a conversation is degree,discourse structure, spatial and temporal structure. The approaches present can be mainly divided into Data-Glove Based and Vision Based approaches. An important face feature point is the nose tip. Since nose is the highest protruding point from the face. Besides that, it is not affected by facial expressions.Another important function of the nose is that it is able to indicate the head pose. Knowledge of the nose location will enable us to align an unknown 3D face with those in a face database. Eye detection is divided into eye position detection and eye contour detection. Existing works in eye detection can be classified into two major categories: traditional image-based passive approaches and the active IR based approaches. The former uses intensity and shape of eyes for detection and the latter works on the assumption that eyes have a reflection under near IR illumination and produce bright/dark pupil effect. The traditional methods can be broadly classified into three categories: template based methods,appearance based methods and feature based methods. The purpose of this paper is to compare various human Gesture recognition systems for interfacing machines directly to human wits without any corporeal media in an ambient environment.
△ Less
Submitted 30 November, 2010;
originally announced December 2010.