-
From Plain Text to Poetic Form: Generating Metrically-Constrained Sanskrit Verses
Authors:
Manoj Balaji Jagadeeshan,
Samarth Bhatia,
Pretam Ray,
Harshul Raj Surana,
Akhil Rajeev P,
Priya Mishra,
Annarao Kulkarni,
Ganesh Ramakrishnan,
Prathosh AP,
Pawan Goyal
Abstract:
Recent advances in large language models (LLMs) have significantly improved natural language generation, including creative tasks like poetry composition. However, most progress remains concentrated in high-resource languages. This raises an important question: Can LLMs be adapted for structured poetic generation in a low-resource, morphologically rich language such as Sanskrit? In this work, we i…
▽ More
Recent advances in large language models (LLMs) have significantly improved natural language generation, including creative tasks like poetry composition. However, most progress remains concentrated in high-resource languages. This raises an important question: Can LLMs be adapted for structured poetic generation in a low-resource, morphologically rich language such as Sanskrit? In this work, we introduce a dataset designed for translating English prose into structured Sanskrit verse, with strict adherence to classical metrical patterns, particularly the Anushtub meter. We evaluate a range of generative models-both open-source and proprietary-under multiple settings. Specifically, we explore constrained decoding strategies and instruction-based fine-tuning tailored to metrical and semantic fidelity. Our decoding approach achieves over 99% accuracy in producing syntactically valid poetic forms, substantially outperforming general-purpose models in meter conformity. Meanwhile, instruction-tuned variants show improved alignment with source meaning and poetic style, as supported by human assessments, albeit with marginal trade-offs in metrical precision.
△ Less
Submitted 31 May, 2025;
originally announced June 2025.
-
Experimental realization of all logic elements and memory latch in SC-CNN Chua's circuit
Authors:
Ashokkumar P,
Sathish Aravindh M,
Venkatesan A,
Lakshmanan M
Abstract:
The Chua's circuit is examined using a State Controlled-Cellular Neural Network (SC-CNN) framework with two logical square wave input signals. We illustrate, in particular, that this nonlinear circuit can generate all the basic logic operations, including OR/NOR, AND/NAND, and XOR/XNOR gates, by making use of the hopping of attractors which this circuit produces in different phase space regimes. F…
▽ More
The Chua's circuit is examined using a State Controlled-Cellular Neural Network (SC-CNN) framework with two logical square wave input signals. We illustrate, in particular, that this nonlinear circuit can generate all the basic logic operations, including OR/NOR, AND/NAND, and XOR/XNOR gates, by making use of the hopping of attractors which this circuit produces in different phase space regimes. Further, it is shown that besides two-inputs, the circuit emulates multi-input logic elements. Moreover, all these logic elements are effectively functioning for a tolerable limit of noise intensity. These observations are experimentally realized. Thus our investigation sheds new light in the field of digital technology where the existing static logic gates may be replaced or complemented by this kind of dynamical nonlinear circuits.
△ Less
Submitted 29 May, 2025;
originally announced May 2025.
-
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation
Authors:
Piyush Tiwary,
Kinjawl Bhattacharyya,
Prathosh A. P
Abstract:
Medical image segmentation models often struggle to generalize across different domains due to various reasons. Domain Generalization (DG) methods overcome this either through representation learning or data augmentation (DAug). While representation learning methods seek domain-invariant features, they often rely on ad-hoc techniques and lack formal guarantees. DAug methods, which enrich model rep…
▽ More
Medical image segmentation models often struggle to generalize across different domains due to various reasons. Domain Generalization (DG) methods overcome this either through representation learning or data augmentation (DAug). While representation learning methods seek domain-invariant features, they often rely on ad-hoc techniques and lack formal guarantees. DAug methods, which enrich model representations through synthetic samples, have shown comparable or superior performance to representation learning approaches. We propose LangDAug, a novel $\textbf{Lang}$evin $\textbf{D}$ata $\textbf{Aug}$mentation for multi-source domain generalization in 2D medical image segmentation. LangDAug leverages Energy-Based Models (EBMs) trained via contrastive divergence to traverse between source domains, generating intermediate samples through Langevin dynamics. Theoretical analysis shows that LangDAug induces a regularization effect, and for GLMs, it upper-bounds the Rademacher complexity by the intrinsic dimensionality of the data manifold. Through extensive experiments on Fundus segmentation and 2D MRI prostate segmentation benchmarks, we show that LangDAug outperforms state-of-the-art domain generalization methods and effectively complements existing domain-randomization approaches. The codebase for our method is available at https://github.com/backpropagator/LangDAug.
△ Less
Submitted 26 May, 2025;
originally announced May 2025.
-
Latent Mamba Operator for Partial Differential Equations
Authors:
Karn Tiwari,
Niladri Dutta,
N M Anoop Krishnan,
Prathosh A P
Abstract:
Neural operators have emerged as powerful data-driven frameworks for solving Partial Differential Equations (PDEs), offering significant speedups over numerical methods. However, existing neural operators struggle with scalability in high-dimensional spaces, incur high computational costs, and face challenges in capturing continuous and long-range dependencies in PDE dynamics. To address these lim…
▽ More
Neural operators have emerged as powerful data-driven frameworks for solving Partial Differential Equations (PDEs), offering significant speedups over numerical methods. However, existing neural operators struggle with scalability in high-dimensional spaces, incur high computational costs, and face challenges in capturing continuous and long-range dependencies in PDE dynamics. To address these limitations, we introduce the Latent Mamba Operator (LaMO), which integrates the efficiency of state-space models (SSMs) in latent space with the expressive power of kernel integral formulations in neural operators. We also establish a theoretical connection between state-space models (SSMs) and the kernel integral of neural operators. Extensive experiments across diverse PDE benchmarks on regular grids, structured meshes, and point clouds covering solid and fluid physics datasets, LaMOs achieve consistent state-of-the-art (SOTA) performance, with a 32.3% improvement over existing baselines in solution operator approximation, highlighting its efficacy in modeling complex PDE solutions.
△ Less
Submitted 28 May, 2025; v1 submitted 25 May, 2025;
originally announced May 2025.
-
Characterization of bi-parametric potentials and rate of convergence of truncated hypersingular integrals in the Dunkl setting
Authors:
Sandeep Kumar Verma,
Athulya P
Abstract:
In this work, we introduce the $β$-semigroup for $β> 0$, which unifies and extends the classical Poisson (for $β=1$) and heat (for $β=2$) semigroups within the Dunkl analysis framework. Leveraging this semigroup, we derive an explicit representation for the inverse of the Dunkl-Riesz potential and characterize the image of the function space $L_k^p(\mathbb{R}^n)$ for $1 \leq p < \frac{n + 2γ}α$. W…
▽ More
In this work, we introduce the $β$-semigroup for $β> 0$, which unifies and extends the classical Poisson (for $β=1$) and heat (for $β=2$) semigroups within the Dunkl analysis framework. Leveraging this semigroup, we derive an explicit representation for the inverse of the Dunkl-Riesz potential and characterize the image of the function space $L_k^p(\mathbb{R}^n)$ for $1 \leq p < \frac{n + 2γ}α$. We further define the bi-parametric potential of order $α$ by $$\mathfrak{S}_k^{(α,β)} = \left(I + (-Δ_k)^{β/2}\right)^{-α/β}$$ and establish its inverse along with a detailed description of the associated range space. Our approach employs a wavelet-based method that represents the inverse as the limit of truncated hypersingular integrals parameterized by $ε> 0$. To analyze the convergence of these approximations, we introduce the concept of $η$-smoothness at a point $x_0$ in the Dunkl setting. We show that if a function $f \in L_k^p(\mathbb{R}^n) \cap L_k^2(\mathbb{R}^n)$, for $1 \leq p \leq \infty$, possesses $η$-smoothness at $x_0$, then the truncated hypersingular approximations converge to $f(x_0)$ as $ε\to 0^+$.
△ Less
Submitted 3 June, 2025; v1 submitted 21 May, 2025;
originally announced May 2025.
-
Navigating AI Policy Landscapes: Insights into Human Rights Considerations Across IEEE Regions
Authors:
Angel Mary John,
Jerrin Thomas Panachakel,
Anusha S. P
Abstract:
This paper explores the integration of human rights considerations into AI regulatory frameworks across different IEEE regions - specifically the United States (Region 1-6), Europe (Region 8), China (part of Region 10), and Singapore (part of Region 10). While all acknowledge the transformative potential of AI and the necessity of ethical guidelines, their regulatory approaches significantly diffe…
▽ More
This paper explores the integration of human rights considerations into AI regulatory frameworks across different IEEE regions - specifically the United States (Region 1-6), Europe (Region 8), China (part of Region 10), and Singapore (part of Region 10). While all acknowledge the transformative potential of AI and the necessity of ethical guidelines, their regulatory approaches significantly differ. Europe exhibits a rigorous framework with stringent protections for individual rights, while the U.S. promotes innovation with less restrictive regulations. China emphasizes state control and societal order in its AI strategies. In contrast, Singapore's advisory framework encourages self-regulation and aligns closely with international norms. This comparative analysis underlines the need for ongoing global dialogue to harmonize AI regulations that safeguard human rights while promoting technological advancement, reflecting the diverse perspectives and priorities of each region.
△ Less
Submitted 27 April, 2025;
originally announced April 2025.
-
GOTHAM: Graph Class Incremental Learning Framework under Weak Supervision
Authors:
Aditya Hemant Shahane,
Prathosh A. P,
Sandeep Kumar
Abstract:
Graphs are growing rapidly, along with the number of distinct label categories associated with them. Applications like e-commerce, healthcare, recommendation systems, and various social media platforms are rapidly moving towards graph representation of data due to their ability to capture both structural and attribute information. One crucial task in graph analysis is node classification, where un…
▽ More
Graphs are growing rapidly, along with the number of distinct label categories associated with them. Applications like e-commerce, healthcare, recommendation systems, and various social media platforms are rapidly moving towards graph representation of data due to their ability to capture both structural and attribute information. One crucial task in graph analysis is node classification, where unlabeled nodes are categorized into predefined classes. In practice, novel classes appear incrementally sometimes with just a few labels (seen classes) or even without any labels (unseen classes), either because they are new or haven't been explored much. Traditional methods assume abundant labeled data for training, which isn't always feasible. We investigate a broader objective: \emph{Graph Class Incremental Learning under Weak Supervision (GCL)}, addressing this challenge by meta-training on base classes with limited labeled instances. During the incremental streams, novel classes can have few-shot or zero-shot representation. Our proposed framework GOTHAM efficiently accommodates these unlabeled nodes by finding the closest prototype representation, serving as class representatives in the attribute space. For Text-Attributed Graphs (TAGs), our framework additionally incorporates semantic information to enhance the representation. By employing teacher-student knowledge distillation to mitigate forgetting, GOTHAM achieves promising results across various tasks. Experiments on datasets such as Cora-ML, Amazon, and OBGN-Arxiv showcase the effectiveness of our approach in handling evolving graph data under limited supervision. The repository is available here: \href{https://github.com/adityashahane10/GOTHAM--Graph-based-Class-Incremental-Learning-Framework-under-Weak-Supervision}{\small \textcolor{blue}{Code}}
△ Less
Submitted 7 April, 2025;
originally announced April 2025.
-
zScore: A Universal Decentralised Reputation System for the Blockchain Economy
Authors:
Himanshu Udupi,
Ashutosh Sahoo,
Akshay S. P.,
Gurukiran S.,
Parag Paul,
Petrus C. Martens
Abstract:
Modern society functions on trust. The onchain economy, however, is built on the founding principles of trustless peer-to-peer interactions in an adversarial environment without a centralised body of trust and needs a verifiable system to quantify credibility to minimise bad economic activity. We provide a robust framework titled zScore, a core primitive for reputation derived from a wallet's onch…
▽ More
Modern society functions on trust. The onchain economy, however, is built on the founding principles of trustless peer-to-peer interactions in an adversarial environment without a centralised body of trust and needs a verifiable system to quantify credibility to minimise bad economic activity. We provide a robust framework titled zScore, a core primitive for reputation derived from a wallet's onchain behaviour using state-of-the-art AI neural network models combined with real-world credentials ported onchain through zkTLS. The initial results tested on retroactive data from lending protocols establish a strong correlation between a good zScore and healthy borrowing and repayment behaviour, making it a robust and decentralised alibi for creditworthiness; we highlight significant improvements from previous attempts by protocols like Cred showcasing its robustness. We also present a list of possible applications of our system in Section 5, thereby establishing its utility in rewarding actual value creation while filtering noise and suspicious activity and flagging malicious behaviour by bad actors.
△ Less
Submitted 17 February, 2025;
originally announced March 2025.
-
Cross multiscale vision transformer for deep fake detection
Authors:
Akhshan P,
Taneti Sanjay,
Chandrakala S
Abstract:
The proliferation of deep fake technology poses significant challenges to digital media authenticity, necessitating robust detection mechanisms. This project evaluates deep fake detection using the SP Cup's 2025 deep fake detection challenge dataset. We focused on exploring various deep learning models for detecting deep fake content, utilizing traditional deep learning techniques alongside newer…
▽ More
The proliferation of deep fake technology poses significant challenges to digital media authenticity, necessitating robust detection mechanisms. This project evaluates deep fake detection using the SP Cup's 2025 deep fake detection challenge dataset. We focused on exploring various deep learning models for detecting deep fake content, utilizing traditional deep learning techniques alongside newer architectures. Our approach involved training a series of models and rigorously assessing their performance using metrics such as accuracy.
△ Less
Submitted 2 February, 2025;
originally announced February 2025.
-
CoNOAir: A Neural Operator for Forecasting Carbon Monoxide Evolution in Cities
Authors:
Sanchit Bedi,
Karn Tiwari,
Prathosh A. P.,
Sri Harsha Kota,
N. M. Anoop Krishnan
Abstract:
Carbon Monoxide (CO) is a dominant pollutant in urban areas due to the energy generation from fossil fuels for industry, automobile, and domestic requirements. Forecasting the evolution of CO in real-time can enable the deployment of effective early warning systems and intervention strategies. However, the computational cost associated with the physics and chemistry-based simulation makes it prohi…
▽ More
Carbon Monoxide (CO) is a dominant pollutant in urban areas due to the energy generation from fossil fuels for industry, automobile, and domestic requirements. Forecasting the evolution of CO in real-time can enable the deployment of effective early warning systems and intervention strategies. However, the computational cost associated with the physics and chemistry-based simulation makes it prohibitive to implement such a model at the city and country scale. To address this challenge, here, we present a machine learning model based on neural operator, namely, Complex Neural Operator for Air Quality (CoNOAir), that can effectively forecast CO concentrations. We demonstrate this by developing a country-level model for short-term (hourly) and long-term (72-hour) forecasts of CO concentrations. Our model outperforms state-of-the-art models such as Fourier neural operators (FNO) and provides reliable predictions for both short and long-term forecasts. We further analyse the capability of the model to capture extreme events and generate forecasts in urban cities in India. Interestingly, we observe that the model predicts the next hour CO concentrations with R2 values greater than 0.95 for all the cities considered. The deployment of such a model can greatly assist the governing bodies to provide early warning, plan intervention strategies, and develop effective strategies by considering several what-if scenarios. Altogether, the present approach could provide a fillip to real-time predictions of CO pollution in urban cities.
△ Less
Submitted 13 January, 2025; v1 submitted 10 January, 2025;
originally announced January 2025.
-
CPPJoules: An Energy Measurement Tool for C++
Authors:
Shivadharshan S,
Akilesh P,
Rajrupa Chattaraj,
Sridhar Chimalakonda
Abstract:
With the increasing complexity of modern software and the demand for high performance, energy consumption has become a critical factor for developers and researchers. While much of the research community is focused on evaluating the energy consumption of machine learning and artificial intelligence systems -- often implemented in Python -- there is a gap when it comes to tools and frameworks for m…
▽ More
With the increasing complexity of modern software and the demand for high performance, energy consumption has become a critical factor for developers and researchers. While much of the research community is focused on evaluating the energy consumption of machine learning and artificial intelligence systems -- often implemented in Python -- there is a gap when it comes to tools and frameworks for measuring energy usage in other programming languages. C++, in particular, remains a foundational language for a wide range of software applications, from game development to parallel programming frameworks, yet lacks dedicated energy measurement solutions. To address this, we have developed CPPJoules, a tool built on top of Intel-RAPL to measure the energy consumption of C++ code snippets. We have evaluated the tool by measuring the energy consumption of the standard computational tasks from the Rosetta Code repository. The demonstration of the tool is available at \url{https://www.youtube.com/watch?v=GZXYF3AKzPk} and related artifacts at \url{https://rishalab.github.io/CPPJoules/}.
△ Less
Submitted 18 December, 2024;
originally announced December 2024.
-
GraPE: A Generate-Plan-Edit Framework for Compositional T2I Synthesis
Authors:
Ashish Goswami,
Satyam Kumar Modi,
Santhosh Rishi Deshineni,
Harman Singh,
Prathosh A. P,
Parag Singla
Abstract:
Text-to-image (T2I) generation has seen significant progress with diffusion models, enabling generation of photo-realistic images from text prompts. Despite this progress, existing methods still face challenges in following complex text prompts, especially those requiring compositional and multi-step reasoning. Given such complex instructions, SOTA models often make mistakes in faithfully modeling…
▽ More
Text-to-image (T2I) generation has seen significant progress with diffusion models, enabling generation of photo-realistic images from text prompts. Despite this progress, existing methods still face challenges in following complex text prompts, especially those requiring compositional and multi-step reasoning. Given such complex instructions, SOTA models often make mistakes in faithfully modeling object attributes, and relationships among them. In this work, we present an alternate paradigm for T2I synthesis, decomposing the task of complex multi-step generation into three steps, (a) Generate: we first generate an image using existing diffusion models (b) Plan: we make use of Multi-Modal LLMs (MLLMs) to identify the mistakes in the generated image expressed in terms of individual objects and their properties, and produce a sequence of corrective steps required in the form of an edit-plan. (c) Edit: we make use of an existing text-guided image editing models to sequentially execute our edit-plan over the generated image to get the desired image which is faithful to the original instruction. Our approach derives its strength from the fact that it is modular in nature, is training free, and can be applied over any combination of image generation and editing models. As an added contribution, we also develop a model capable of compositional editing, which further helps improve the overall accuracy of our proposed approach. Our method flexibly trades inference time compute with performance on compositional text prompts. We perform extensive experimental evaluation across 3 benchmarks and 10 T2I models including DALLE-3 and the latest -- SD-3.5-Large. Our approach not only improves the performance of the SOTA models, by upto 3 points, it also reduces the performance gap between weaker and stronger models. $\href{https://dair-iitd.github.io/GraPE/}{https://dair-iitd.github.io/GraPE/}$
△ Less
Submitted 11 March, 2025; v1 submitted 8 December, 2024;
originally announced December 2024.
-
UnDIVE: Generalized Underwater Video Enhancement Using Generative Priors
Authors:
Suhas Srinath,
Aditya Chandrasekar,
Hemang Jamadagni,
Rajiv Soundararajan,
Prathosh A P
Abstract:
With the rise of marine exploration, underwater imaging has gained significant attention as a research topic. Underwater video enhancement has become crucial for real-time computer vision tasks in marine exploration. However, most existing methods focus on enhancing individual frames and neglect video temporal dynamics, leading to visually poor enhancements. Furthermore, the lack of ground-truth r…
▽ More
With the rise of marine exploration, underwater imaging has gained significant attention as a research topic. Underwater video enhancement has become crucial for real-time computer vision tasks in marine exploration. However, most existing methods focus on enhancing individual frames and neglect video temporal dynamics, leading to visually poor enhancements. Furthermore, the lack of ground-truth references limits the use of abundant available underwater video data in many applications. To address these issues, we propose a two-stage framework for enhancing underwater videos. The first stage uses a denoising diffusion probabilistic model to learn a generative prior from unlabeled data, capturing robust and descriptive feature representations. In the second stage, this prior is incorporated into a physics-based image formulation for spatial enhancement, while also enforcing temporal consistency between video frames. Our method enables real-time and computationally-efficient processing of high-resolution underwater videos at lower resolutions, and offers efficient enhancement in the presence of diverse water-types. Extensive experiments on four datasets show that our approach generalizes well and outperforms existing enhancement methods. Our code is available at github.com/suhas-srinath/undive.
△ Less
Submitted 8 November, 2024;
originally announced November 2024.
-
Bifurcation in narrow gap spherical Couette flow
Authors:
Ananthu J. P.,
Manjul Sharma,
Sameen A.,
Vinod Narayanan
Abstract:
Incompressible Navier-Stokes equations in the spherical coordinates are solved using a pseudo-spectral method to simulate the problem of spherical Couette flow. The flow is investigated for a narrow gap ratio with only the inner sphere rotating. We find that the flow is sensitive to the initial conditions and have used various initial conditions to obtain di!erent branches of the bifurcation curve…
▽ More
Incompressible Navier-Stokes equations in the spherical coordinates are solved using a pseudo-spectral method to simulate the problem of spherical Couette flow. The flow is investigated for a narrow gap ratio with only the inner sphere rotating. We find that the flow is sensitive to the initial conditions and have used various initial conditions to obtain di!erent branches of the bifurcation curve of the flow. We have identified three di!erent branches dominated respectively by axisymmetric flow, traveling wave instability, and equatorial instability. The axisymmetric branch shows unsteadiness at large Reynolds numbers. The traveling wave instability branch shows spiral instability and is prominent near poles. The traveling wave instability branch further exhibits a reversal in the propagation direction of the spiral instability as the Reynolds number is increased. This branch also exhibits a multi-mode equatorial instability at larger Reynolds numbers. The equatorial instability branch exhibits twin jet streams on either side of the equator, which becomes unstable at larger Reynolds numbers. The flow topology on the three branches are also investigated in their phase space and the found to exhibit a chaotic behavior at large Reynolds numbers on the traveling wave instability branch.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
Revisiting BPR: A Replicability Study of a Common Recommender System Baseline
Authors:
Aleksandr Milogradskii,
Oleg Lashinin,
Alexander P,
Marina Ananyeva,
Sergey Kolesnikov
Abstract:
Bayesian Personalized Ranking (BPR), a collaborative filtering approach based on matrix factorization, frequently serves as a benchmark for recommender systems research. However, numerous studies often overlook the nuances of BPR implementation, claiming that it performs worse than newly proposed methods across various tasks. In this paper, we thoroughly examine the features of the BPR model, indi…
▽ More
Bayesian Personalized Ranking (BPR), a collaborative filtering approach based on matrix factorization, frequently serves as a benchmark for recommender systems research. However, numerous studies often overlook the nuances of BPR implementation, claiming that it performs worse than newly proposed methods across various tasks. In this paper, we thoroughly examine the features of the BPR model, indicating their impact on its performance, and investigate open-source BPR implementations. Our analysis reveals inconsistencies between these implementations and the original BPR paper, leading to a significant decrease in performance of up to 50% for specific implementations. Furthermore, through extensive experiments on real-world datasets under modern evaluation settings, we demonstrate that with proper tuning of its hyperparameters, the BPR model can achieve performance levels close to state-of-the-art methods on the top-n recommendation tasks and even outperform them on specific datasets. Specifically, on the Million Song Dataset, the BPR model with hyperparameters tuning statistically significantly outperforms Mult-VAE by 10% in NDCG@100 with binary relevance function.
△ Less
Submitted 18 October, 2024; v1 submitted 21 September, 2024;
originally announced September 2024.
-
Appraisal-Guided Proximal Policy Optimization: Modeling Psychological Disorders in Dynamic Grid World
Authors:
Hari Prasad,
Chinnu Jacob,
Imthias Ahamed T. P
Abstract:
The integration of artificial intelligence across multiple domains has emphasized the importance of replicating human-like cognitive processes in AI. By incorporating emotional intelligence into AI agents, their emotional stability can be evaluated to enhance their resilience and dependability in critical decision-making tasks. In this work, we develop a methodology for modeling psychological diso…
▽ More
The integration of artificial intelligence across multiple domains has emphasized the importance of replicating human-like cognitive processes in AI. By incorporating emotional intelligence into AI agents, their emotional stability can be evaluated to enhance their resilience and dependability in critical decision-making tasks. In this work, we develop a methodology for modeling psychological disorders using Reinforcement Learning (RL) agents. We utilized Appraisal theory to train RL agents in a dynamic grid world environment with an Appraisal-Guided Proximal Policy Optimization (AG-PPO) algorithm. Additionally, we investigated numerous reward-shaping strategies to simulate psychological disorders and regulate the behavior of the agents. A comparison of various configurations of the modified PPO algorithm identified variants that simulate Anxiety disorder and Obsessive-Compulsive Disorder (OCD)-like behavior in agents. Furthermore, we compared standard PPO with AG-PPO and its configurations, highlighting the performance improvement in terms of generalization capabilities. Finally, we conducted an analysis of the agents' behavioral patterns in complex test environments to evaluate the associated symptoms corresponding to the psychological disorders. Overall, our work showcases the benefits of the appraisal-guided PPO algorithm over the standard PPO algorithm and the potential to simulate psychological disorders in a controlled artificial environment and evaluate them on RL agents.
△ Less
Submitted 29 July, 2024;
originally announced July 2024.
-
Measurement of the Sequential $3α$ Process in the Photodissociation of $^{12}\mathrm{C}$
Authors:
Resmi K. Bharathan,
Midhun C. V,
M. M Musthafa,
Sreena M,
Silpa Ajaykumar,
Farhana Thesni M. P,
Swapna B,
Vafiya Thaslim T. T,
Shaima A,
Nived K,
Akhil R,
Anagha P. K,
Arunima Dev T. V,
Keerthi E. S,
Akshay K. S,
Arun P. V,
S. Ghugre
Abstract:
The cross sections for the $^{12}\mathrm{C}(γ,α)^{8}\mathrm{Be}\rightarrow 3α$ reaction have been successfully measured using exclusive coincidence between three $α$ particles, minimizing Compton background. Sequential breakup kinematics are evident, and the cross sections are presented as locally averaged histogram values. Theoretical \textsc{Fresco} CDCC-CRC calculations reproduce the experiment…
▽ More
The cross sections for the $^{12}\mathrm{C}(γ,α)^{8}\mathrm{Be}\rightarrow 3α$ reaction have been successfully measured using exclusive coincidence between three $α$ particles, minimizing Compton background. Sequential breakup kinematics are evident, and the cross sections are presented as locally averaged histogram values. Theoretical \textsc{Fresco} CDCC-CRC calculations reproduce the experimental data, showing that the process involves electromagnetic coupling to both $^{8}\mathrm{Be}^{0^+}$ and $^{8}\mathrm{Be}^{2^+}$ states. This study confirms that the $^{12}\mathrm{C}(γ,α)^{8}\mathrm{Be}\rightarrow 3α$ reaction proceeds via a sequential mechanism, crucial for understanding its significance in radiotherapy dosimetry.
△ Less
Submitted 29 July, 2024;
originally announced July 2024.
-
HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs
Authors:
Pranoy Panda,
Ankush Agarwal,
Chaitanya Devaguptapu,
Manohar Kaul,
Prathosh A P
Abstract:
Given unstructured text, Large Language Models (LLMs) are adept at answering simple (single-hop) questions. However, as the complexity of the questions increase, the performance of LLMs degrade. We believe this is due to the overhead associated with understanding the complex question followed by filtering and aggregating unstructured information in the raw text. Recent methods try to reduce this b…
▽ More
Given unstructured text, Large Language Models (LLMs) are adept at answering simple (single-hop) questions. However, as the complexity of the questions increase, the performance of LLMs degrade. We believe this is due to the overhead associated with understanding the complex question followed by filtering and aggregating unstructured information in the raw text. Recent methods try to reduce this burden by integrating structured knowledge triples into the raw text, aiming to provide a structured overview that simplifies information processing. However, this simplistic approach is query-agnostic and the extracted facts are ambiguous as they lack context. To address these drawbacks and to enable LLMs to answer complex (multi-hop) questions with ease, we propose to use a knowledge graph (KG) that is context-aware and is distilled to contain query-relevant information. The use of our compressed distilled KG as input to the LLM results in our method utilizing up to $67\%$ fewer tokens to represent the query relevant information present in the supporting documents, compared to the state-of-the-art (SoTA) method. Our experiments show consistent improvements over the SoTA across several metrics (EM, F1, BERTScore, and Human Eval) on two popular benchmark datasets (HotpotQA and MuSiQue).
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Deep Learning-Based Brain Image Segmentation for Automated Tumour Detection
Authors:
Suman Sourabh,
Murugappan Valliappan,
Narayana Darapaneni,
Anwesh R P
Abstract:
Introduction: The present study on the development and evaluation of an automated brain tumor segmentation technique based on deep learning using the 3D U-Net model. Objectives: The objective is to leverage state-of-the-art convolutional neural networks (CNNs) on a large dataset of brain MRI scans for segmentation. Methods: The proposed methodology applies pre-processing techniques for enhanced pe…
▽ More
Introduction: The present study on the development and evaluation of an automated brain tumor segmentation technique based on deep learning using the 3D U-Net model. Objectives: The objective is to leverage state-of-the-art convolutional neural networks (CNNs) on a large dataset of brain MRI scans for segmentation. Methods: The proposed methodology applies pre-processing techniques for enhanced performance and generalizability. Results: Extensive validation on an independent dataset confirms the model's robustness and potential for integration into clinical workflows. The study emphasizes the importance of data pre-processing and explores various hyperparameters to optimize the model's performance. The 3D U-Net, has given IoUs for training and validation dataset have been 0.8181 and 0.66 respectively. Conclusion: Ultimately, this comprehensive framework showcases the efficacy of deep learning in automating brain tumour detection, offering valuable support in clinical practice.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Music Recommendation Based on Facial Emotion Recognition
Authors:
Rajesh B,
Keerthana V,
Narayana Darapaneni,
Anwesh Reddy P
Abstract:
Introduction: Music provides an incredible avenue for individuals to express their thoughts and emotions, while also serving as a delightful mode of entertainment for enthusiasts and music lovers. Objectives: This paper presents a comprehensive approach to enhancing the user experience through the integration of emotion recognition, music recommendation, and explainable AI using GRAD-CAM. Methods:…
▽ More
Introduction: Music provides an incredible avenue for individuals to express their thoughts and emotions, while also serving as a delightful mode of entertainment for enthusiasts and music lovers. Objectives: This paper presents a comprehensive approach to enhancing the user experience through the integration of emotion recognition, music recommendation, and explainable AI using GRAD-CAM. Methods: The proposed methodology utilizes a ResNet50 model trained on the Facial Expression Recognition (FER) dataset, consisting of real images of individuals expressing various emotions. Results: The system achieves an accuracy of 82% in emotion classification. By leveraging GRAD-CAM, the model provides explanations for its predictions, allowing users to understand the reasoning behind the system's recommendations. The model is trained on both FER and real user datasets, which include labelled facial expressions, and real images of individuals expressing various emotions. The training process involves pre-processing the input images, extracting features through convolutional layers, reasoning with dense layers, and generating emotion predictions through the output layer. Conclusion: The proposed methodology, leveraging the Resnet50 model with ROI-based analysis and explainable AI techniques, offers a robust and interpretable solution for facial emotion detection paper.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
A Deep Look Into -- Automated Lung X-Ray Abnormality Detection System
Authors:
Nagullas KS,
Vivekanand. V,
Narayana Darapaneni,
Anwesh R P
Abstract:
Introduction: Automated Lung X-Ray Abnormality Detection System is the application which distinguish the normal x-ray images from infected x-ray images and highlight area considered for prediction, with the recent pandemic a need to have a non-conventional method and faster detecting diseases, for which X ray serves the purpose. Obectives: As of current situation any viral disease that is infectio…
▽ More
Introduction: Automated Lung X-Ray Abnormality Detection System is the application which distinguish the normal x-ray images from infected x-ray images and highlight area considered for prediction, with the recent pandemic a need to have a non-conventional method and faster detecting diseases, for which X ray serves the purpose. Obectives: As of current situation any viral disease that is infectious is potential pandemic, so there is need for cheap and early detection system. Methods: This research will help to eases the work of expert to do further analysis. Accuracy of three different preexisting models such as DenseNet, MobileNet and VGG16 were high but models over-fitted primarily due to black and white images. Results: This led to building up new method such as as V-BreathNet which gave more than 96% percent accuracy. Conclusion: Thus, it can be stated that not all state-of art CNN models can be used on B/W images. In conclusion not all state-of-art CNN models can be used on B/W images.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Partially Blinded Unlearning: Class Unlearning for Deep Networks a Bayesian Perspective
Authors:
Subhodip Panda,
Shashwat Sourav,
Prathosh A. P
Abstract:
In order to adhere to regulatory standards governing individual data privacy and safety, machine learning models must systematically eliminate information derived from specific subsets of a user's training data that can no longer be utilized. The emerging discipline of Machine Unlearning has arisen as a pivotal area of research, facilitating the process of selectively discarding information design…
▽ More
In order to adhere to regulatory standards governing individual data privacy and safety, machine learning models must systematically eliminate information derived from specific subsets of a user's training data that can no longer be utilized. The emerging discipline of Machine Unlearning has arisen as a pivotal area of research, facilitating the process of selectively discarding information designated to specific sets or classes of data from a pre-trained model, thereby eliminating the necessity for extensive retraining from scratch. The principal aim of this study is to formulate a methodology tailored for the purposeful elimination of information linked to a specific class of data from a pre-trained classification network. This intentional removal is crafted to degrade the model's performance specifically concerning the unlearned data class while concurrently minimizing any detrimental impacts on the model's performance in other classes. To achieve this goal, we frame the class unlearning problem from a Bayesian perspective, which yields a loss function that minimizes the log-likelihood associated with the unlearned data with a stability regularization in parameter space. This stability regularization incorporates Mohalanobis distance with respect to the Fisher Information matrix and $l_2$ distance from the pre-trained model parameters. Our novel approach, termed \textbf{Partially-Blinded Unlearning (PBU)}, surpasses existing state-of-the-art class unlearning methods, demonstrating superior effectiveness. Notably, PBU achieves this efficacy without requiring awareness of the entire training dataset but only to the unlearned data points, marking a distinctive feature of its performance.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks
Authors:
Aman Kumar,
Khushboo Anand,
Shubham Mandloi,
Ashutosh Mishra,
Avinash Thakur,
Neeraj Kasera,
Prathosh A P
Abstract:
Generative Adversarial Networks (GANs) have proven to exhibit remarkable performance and are widely used across many generative computer vision applications. However, the unprecedented demand for the deployment of GANs on resource-constrained edge devices still poses a challenge due to huge number of parameters involved in the generation process. This has led to focused attention on the area of co…
▽ More
Generative Adversarial Networks (GANs) have proven to exhibit remarkable performance and are widely used across many generative computer vision applications. However, the unprecedented demand for the deployment of GANs on resource-constrained edge devices still poses a challenge due to huge number of parameters involved in the generation process. This has led to focused attention on the area of compressing GANs. Most of the existing works use knowledge distillation with the overhead of teacher dependency. Moreover, there is no ability to control the degree of compression in these methods. Hence, we propose CoroNet-GAN for compressing GAN using the combined strength of differentiable pruning method via hypernetworks. The proposed method provides the advantage of performing controllable compression while training along with reducing training time by a substantial factor. Experiments have been done on various conditional GAN architectures (Pix2Pix and CycleGAN) to signify the effectiveness of our approach on multiple benchmark datasets such as Edges-to-Shoes, Horse-to-Zebra and Summer-to-Winter. The results obtained illustrate that our approach succeeds to outperform the baselines on Zebra-to-Horse and Summer-to-Winter achieving the best FID score of 32.3 and 72.3 respectively, yielding high-fidelity images across all the datasets. Additionally, our approach also outperforms the state-of-the-art methods in achieving better inference time on various smart-phone chipsets and data-types making it a feasible solution for deployment on edge devices.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Leveraging Internal Representations of Model for Magnetic Image Classification
Authors:
Adarsh N L,
Arun P V,
Alok Porwal,
Malcolm Aranha
Abstract:
Data generated by edge devices has the potential to train intelligent autonomous systems across various domains. Despite the emergence of diverse machine learning approaches addressing privacy concerns and utilizing distributed data, security issues persist due to the sensitive storage of data shards in disparate locations. This paper introduces a potentially groundbreaking paradigm for machine le…
▽ More
Data generated by edge devices has the potential to train intelligent autonomous systems across various domains. Despite the emergence of diverse machine learning approaches addressing privacy concerns and utilizing distributed data, security issues persist due to the sensitive storage of data shards in disparate locations. This paper introduces a potentially groundbreaking paradigm for machine learning model training, specifically designed for scenarios with only a single magnetic image and its corresponding label image available. We harness the capabilities of Deep Learning to generate concise yet informative samples, aiming to overcome data scarcity. Through the utilization of deep learning's internal representations, our objective is to efficiently address data scarcity issues and produce meaningful results. This methodology presents a promising avenue for training machine learning models with minimal data.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback
Authors:
Adarsh N L,
Arun P V,
Aravindh N L
Abstract:
Research on generative models to produce human-aligned / human-preferred outputs has seen significant recent contributions. Between text and image-generative models, we narrowed our focus to text-based generative models, particularly to produce captions for images that align with human preferences. In this research, we explored a potential method to amplify the performance of the Deep Neural Netwo…
▽ More
Research on generative models to produce human-aligned / human-preferred outputs has seen significant recent contributions. Between text and image-generative models, we narrowed our focus to text-based generative models, particularly to produce captions for images that align with human preferences. In this research, we explored a potential method to amplify the performance of the Deep Neural Network Model to generate captions that are preferred by humans. This was achieved by integrating Supervised Learning and Reinforcement Learning with Human Feedback (RLHF) using the Flickr8k dataset. Also, a novel loss function that is capable of optimizing the model based on human feedback is introduced. In this paper, we provide a concise sketch of our approach and results, hoping to contribute to the ongoing advances in the field of human-aligned generative AI models.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Cyclic Characters of Alternating Groups
Authors:
Amrutha P,
Amritanshu Prasad,
Velmurugan S
Abstract:
We determine the eigenvalues with multiplicity of each element of an alternating group in any irreducible representation. This is equivalent to determining the decomposition of cyclic representations of alternating groups into irreducibles. We characterize pairs $(w, V)$, where $w$ is an element and $V$ is an irreducible representation of an alternating group such that $w$ admits a non-zero invari…
▽ More
We determine the eigenvalues with multiplicity of each element of an alternating group in any irreducible representation. This is equivalent to determining the decomposition of cyclic representations of alternating groups into irreducibles. We characterize pairs $(w, V)$, where $w$ is an element and $V$ is an irreducible representation of an alternating group such that $w$ admits a non-zero invariant vector in $V$. We also establish large new families of global conjugacy classes for alternating groups, thereby giving a new proof of a result of Heide and Zalessky on the existence of such classes.
△ Less
Submitted 9 September, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Authors:
Saurabh Srivastava,
Annarose M B,
Anto P V,
Shashank Menon,
Ajay Sukumar,
Adwaith Samod T,
Alan Philipose,
Stevin Prince,
Sooraj Thomas
Abstract:
We propose a framework for robust evaluation of reasoning capabilities of language models, using functional variants of benchmarks. Models that solve a reasoning test should exhibit no difference in performance over the static version of a problem compared to a snapshot of the functional variant. We have rewritten the relevant fragment of the MATH benchmark into its functional variant MATH(), with…
▽ More
We propose a framework for robust evaluation of reasoning capabilities of language models, using functional variants of benchmarks. Models that solve a reasoning test should exhibit no difference in performance over the static version of a problem compared to a snapshot of the functional variant. We have rewritten the relevant fragment of the MATH benchmark into its functional variant MATH(), with functionalization of other benchmarks to follow. When evaluating current state-of-the-art models over snapshots of MATH(), we find a reasoning gap -- the percentage difference between the static and functional accuracies. We find reasoning gaps from 58.35% to 80.31% among the state-of-the-art closed and open weights models that perform well on static benchmarks, with the caveat that the gaps are likely to be smaller with more sophisticated prompting strategies. Here we show that models which anecdotally have good reasoning performance over real-world tasks, have quantifiable lower gaps, motivating the open problem of building "gap 0" models. Code for evaluation and new evaluation datasets, three MATH() snapshots, are publicly available at https://github.com/consequentai/fneval/.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Bivariate Bernstein Fractal Interpolation and Numerical Integration on Triangular Domains
Authors:
Aparna M. P.,
P. Paramanathan
Abstract:
The fundamental aim of this paper is to provide the approximation and numerical integration of a discrete set of data points with Bernstein fractal approach. Using Bernstein polynomials in the iterated function system, the paper initially proposes the numerical integration formula for the data set corresponding to univariate functions. The proposed formula of integration is shown to be convergent…
▽ More
The fundamental aim of this paper is to provide the approximation and numerical integration of a discrete set of data points with Bernstein fractal approach. Using Bernstein polynomials in the iterated function system, the paper initially proposes the numerical integration formula for the data set corresponding to univariate functions. The proposed formula of integration is shown to be convergent by examining the data sets of certain weierstrass functions.
The paper then extends the Bernstein fractal approximation and numerical integration technique to two dimensional interpolating regions. Bernstein polynomials defined over triangular domain has been used for the purpose. The triangular domain has been partitioned and the newly generated points are assigned colors in a particular manner to maintain the chromatic number as 3. Following the above mentioned construction and approximation of bivariate Bernstein fractal interpolation functions, the paper introduces the numerical double integration formula using the constructed functions. The convergence of the double integration formula towards the actual integral value of the data sets is displayed with the help of some examples including the benchmark functions. Both the newly introduced iterated function systems are verified for their hyperbolicity and the resultant fractal interpolation functions are shown to be continuous.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Hybrid subterahertz atmospheric pressure plasmatron for plasma chemical applications
Authors:
Sintsov S. V.,
Vodopyanov A. V.,
Mansfeld D. A.,
Fokin A. P.,
Ananichev A. A.,
Goryunov A. A.,
Preobrazhensky E. I.,
Chekmarev N. V.,
Glyavin M. Yu
Abstract:
This paper presents the results of an experimental study of a new hybrid plasmatron scheme, which was used to realize a gas discharge at atmospheric pressure supported by continuous focused submillimeter radiation with a frequency of 263 GHz. The implemented design allowed organizing a self-consistent interaction between submillimeter radiation and the supercritical plasma in a localized area both…
▽ More
This paper presents the results of an experimental study of a new hybrid plasmatron scheme, which was used to realize a gas discharge at atmospheric pressure supported by continuous focused submillimeter radiation with a frequency of 263 GHz. The implemented design allowed organizing a self-consistent interaction between submillimeter radiation and the supercritical plasma in a localized area both in terms of gas flow and electrodynamic. It is experimentally shown that the gas discharge absorbs up to 80% of the introduced submillimeter radiation power. The hybrid subterahertz plasmatron as an effective reactor for non-equilibrium plasma chemical processes was tested for the atmospheric nitrogen fixation.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Dynamic Multi Color Switching using Ultrathin Vanadium Oxide on Aluminium based Asymmetric Fabry-Perot Resonant Structure
Authors:
Shubhangi Saini,
Ashok P,
Amit Verma
Abstract:
Vanadium dioxide ($VO_{2}$) exhibits strong infrared optical switching due to its insulator-metal phase-transition property. However, in the visible wavelengths, it's intrinsic optical switching is quite low. Current research explores solutions like multilayering, intricate structural patterning, high thermal budget processes and costly metals for improved color switching. Nonetheless, the color g…
▽ More
Vanadium dioxide ($VO_{2}$) exhibits strong infrared optical switching due to its insulator-metal phase-transition property. However, in the visible wavelengths, it's intrinsic optical switching is quite low. Current research explores solutions like multilayering, intricate structural patterning, high thermal budget processes and costly metals for improved color switching. Nonetheless, the color gamut coverage with these methodologies remains notably limited. This work overcomes these limitations and demonstrates dynamic multi-colour switching covering a large color gamut using a simple, unpatterned, ultrathin ($\sim$ $\fracλ{14}$, where wavelength $λ$ is taken as 575 nm at the center of visible spectrum) asymmetric Fabry-Pérot structure of $VO_{2}$ on Aluminium (Al). We use the transfer matrix method to design the $VO_{2}/Aluminium\,(Al)/Sapphire$ structure for maximum visible reflectance switching. $VO_{2}$ films are synthesized using a simple, low thermal budget atmospheric oxidation of Vanadium (V). With varying oxidation durations, different colors of the oxidized samples are observed. Consistent and reversible color-switching is observed visibly and in reflectance measurements with the change in temperature from low (RT $\sim$ 30$^{\circ}$C) to high (HT $\sim$ 100$^{\circ}$C) or vice versa due to the phase transition property of the $VO_{2}$ layer in the structure. Compared to the existing studies, this work shows a significant change in chromaticities and covers a large color gamut when plotted on the CIE chromaticity diagram. This work has potential applications in the fields of display, thermochromic structures, and visible camouflage.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Bilayer Vanadium Dioxide Thin Film with Elevated Transition Temperatures and High Resistance Switching
Authors:
Achintya Dutta,
Ashok P,
Amit Verma
Abstract:
Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire…
▽ More
Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire substrate. To quantify the impact of the VO$_2$ buffer layer, a single-layer VO$_2$ film of the same thickness was also fabricated. The bilayer VO$_{2-x}$/VO$_2$ films wherein the top VO$_{2-x}$ film was under-oxidized demonstrated an elevation in TIMT reaching ~97 $^\circ$C, one of the highest reported to date for VO$_2$ films and is achieved in a doping-free manner. Our results also reveal a one-order increase in resistance switching, with the optimum bilayer VO$_2$/VO$_2$ film exhibiting ~3.6 orders of switching from 25 $^\circ$C to 110 $^\circ$C, compared to the optimum single-layer VO$_2$ reference film. This is accompanied by a one-order decrease in the on-state resistance in its metallic phase. The elevation in TIMT, coupled with increased strain extracted from the XRD characterization of the bilayer film, suggests the possibility of compressive strain along the c-axis. These VO$_{2-x}$/VO$_2$ films also demonstrate a significant change in the slope of their resistance vs temperature curves contrary to the conventional smooth transition. This feature was ascribed to the rutile/monoclinic quasi-heterostructure formed due to the top VO$_{2-x}$ film having a reduced TIMT. Our findings carry significant implications for both the lucid fabrication of VO$_2$ thin film devices as well as the study of phase transitions in correlated oxides.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Guided Prompting in SAM for Weakly Supervised Cell Segmentation in Histopathological Images
Authors:
Aayush Kumar Tyagi,
Vaibhav Mishra,
Prathosh A. P.,
Mausam
Abstract:
Cell segmentation in histopathological images plays a crucial role in understanding, diagnosing, and treating many diseases. However, data annotation for this is expensive since there can be a large number of cells per image, and expert pathologists are needed for labelling images. Instead, our paper focuses on using weak supervision -- annotation from related tasks -- to induce a segmenter. Recen…
▽ More
Cell segmentation in histopathological images plays a crucial role in understanding, diagnosing, and treating many diseases. However, data annotation for this is expensive since there can be a large number of cells per image, and expert pathologists are needed for labelling images. Instead, our paper focuses on using weak supervision -- annotation from related tasks -- to induce a segmenter. Recent foundation models, such as Segment Anything (SAM), can use prompts to leverage additional supervision during inference. SAM has performed remarkably well in natural image segmentation tasks; however, its applicability to cell segmentation has not been explored.
In response, we investigate guiding the prompting procedure in SAM for weakly supervised cell segmentation when only bounding box supervision is available. We develop two workflows: (1) an object detector's output as a test-time prompt to SAM (D-SAM), and (2) SAM as pseudo mask generator over training data to train a standalone segmentation model (SAM-S). On finding that both workflows have some complementary strengths, we develop an integer programming-based approach to reconcile the two sets of segmentation masks, achieving yet higher performance. We experiment on three publicly available cell segmentation datasets namely, ConSep, MoNuSeg, and TNBC, and find that all SAM-based solutions hugely outperform existing weakly supervised image segmentation models, obtaining 9-15 pt Dice gains.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
MalFake: A Multimodal Fake News Identification for Malayalam using Recurrent Neural Networks and VGG-16
Authors:
Adhish S. Sujan,
Ajitha. V,
Aleena Benny,
Amiya M. P.,
V. S. Anoop
Abstract:
The amount of news being consumed online has substantially expanded in recent years. Fake news has become increasingly common, especially in regional languages like Malayalam, due to the rapid publication and lack of editorial standards on some online sites. Fake news may have a terrible effect on society, causing people to make bad judgments, lose faith in authorities, and even engage in violent…
▽ More
The amount of news being consumed online has substantially expanded in recent years. Fake news has become increasingly common, especially in regional languages like Malayalam, due to the rapid publication and lack of editorial standards on some online sites. Fake news may have a terrible effect on society, causing people to make bad judgments, lose faith in authorities, and even engage in violent behavior. When we take into the context of India, there are many regional languages, and fake news is spreading in every language. Therefore, providing efficient techniques for identifying false information in regional tongues is crucial. Until now, little to no work has been done in Malayalam, extracting features from multiple modalities to classify fake news. Multimodal approaches are more accurate in detecting fake news, as features from multiple modalities are extracted to build the deep learning classification model. As far as we know, this is the first piece of work in Malayalam that uses multimodal deep learning to tackle false information. Models trained with more than one modality typically outperform models taught with only one modality. Our study in the Malayalam language utilizing multimodal deep learning is a significant step toward more effective misinformation detection and mitigation.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
CoNO: Complex Neural Operator for Continuous Dynamical Systems
Authors:
Karn Tiwari,
N M Anoop Krishnan,
Prathosh A P
Abstract:
Neural operators extend data-driven models to map between infinite-dimensional functional spaces. These models have successfully solved continuous dynamical systems represented by differential equations, viz weather forecasting, fluid flow, or solid mechanics. However, the existing operators still rely on real space, thereby losing rich representations potentially captured in the complex space by…
▽ More
Neural operators extend data-driven models to map between infinite-dimensional functional spaces. These models have successfully solved continuous dynamical systems represented by differential equations, viz weather forecasting, fluid flow, or solid mechanics. However, the existing operators still rely on real space, thereby losing rich representations potentially captured in the complex space by functional transforms. In this paper, we introduce a Complex Neural Operator (CoNO), that parameterizes the integral kernel in the complex fractional Fourier domain. Additionally, the model employing a complex-valued neural network along with aliasing-free activation functions preserves the complex values and complex algebraic properties, thereby enabling improved representation, robustness to noise, and generalization. We show that the model effectively captures the underlying partial differential equation with a single complex fractional Fourier transform. We perform an extensive empirical evaluation of CoNO on several datasets and additional tasks such as zero-shot super-resolution, evaluation of out-of-distribution data, data efficiency, and robustness to noise. CoNO exhibits comparable or superior performance to all the state-of-the-art models in these tasks. Altogether, CoNO presents a robust and superior model for modeling continuous dynamical systems, providing a fillip to scientific machine learning.
△ Less
Submitted 4 October, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
CoDBench: A Critical Evaluation of Data-driven Models for Continuous Dynamical Systems
Authors:
Priyanshu Burark,
Karn Tiwari,
Meer Mehran Rashid,
Prathosh A P,
N M Anoop Krishnan
Abstract:
Continuous dynamical systems, characterized by differential equations, are ubiquitously used to model several important problems: plasma dynamics, flow through porous media, weather forecasting, and epidemic dynamics. Recently, a wide range of data-driven models has been used successfully to model these systems. However, in contrast to established fields like computer vision, limited studies are a…
▽ More
Continuous dynamical systems, characterized by differential equations, are ubiquitously used to model several important problems: plasma dynamics, flow through porous media, weather forecasting, and epidemic dynamics. Recently, a wide range of data-driven models has been used successfully to model these systems. However, in contrast to established fields like computer vision, limited studies are available analyzing the strengths and potential applications of different classes of these models that could steer decision-making in scientific machine learning. Here, we introduce CodBench, an exhaustive benchmarking suite comprising 11 state-of-the-art data-driven models for solving differential equations. Specifically, we comprehensively evaluate 4 distinct categories of models, viz., feed forward neural networks, deep operator regression models, frequency-based neural operators, and transformer architectures against 8 widely applicable benchmark datasets encompassing challenges from fluid and solid mechanics. We conduct extensive experiments, assessing the operators' capabilities in learning, zero-shot super-resolution, data efficiency, robustness to noise, and computational efficiency. Interestingly, our findings highlight that current operators struggle with the newer mechanics datasets, motivating the need for more robust neural operators. All the datasets and codes will be shared in an easy-to-use fashion for the scientific community. We hope this resource will be an impetus for accelerated progress and exploration in modeling dynamical systems.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
Authors:
Piyush Tiwary,
Atri Guha,
Subhodip Panda,
Prathosh A. P
Abstract:
Owing to the growing concerns about privacy and regulatory compliance, it is desirable to regulate the output of generative models. To that end, the objective of this work is to prevent the generation of outputs containing undesired features from a pre-trained Generative Adversarial Network (GAN) where the underlying training data set is inaccessible. Our approach is inspired by the observation th…
▽ More
Owing to the growing concerns about privacy and regulatory compliance, it is desirable to regulate the output of generative models. To that end, the objective of this work is to prevent the generation of outputs containing undesired features from a pre-trained Generative Adversarial Network (GAN) where the underlying training data set is inaccessible. Our approach is inspired by the observation that the parameter space of GANs exhibits meaningful directions that can be leveraged to suppress specific undesired features. However, such directions usually result in the degradation of the quality of generated samples. Our proposed two-stage method, known as 'Adapt-then-Unlearn,' excels at unlearning such undesirable features while also maintaining the quality of generated samples. In the initial stage, we adapt a pre-trained GAN on a set of negative samples (containing undesired features) provided by the user. Subsequently, we train the original pre-trained GAN using positive samples, along with a repulsion regularizer. This regularizer encourages the learned model parameters to move away from the parameters of the adapted model (first stage) while not degrading the generation quality. We provide theoretical insights into the proposed method. To the best of our knowledge, our approach stands as the first method addressing unlearning within the realm of high-fidelity GANs (such as StyleGAN). We validate the effectiveness of our method through comprehensive experiments, encompassing both class-level unlearning on the MNIST and AFHQ dataset and feature-level unlearning tasks on the CelebA-HQ dataset. Our code and implementation is available at: https://github.com/atriguha/Adapt_Unlearn.
△ Less
Submitted 12 February, 2025; v1 submitted 25 September, 2023;
originally announced September 2023.
-
SN 2022jli: a type Ic supernova with periodic modulation of its light curve and an unusually long rise
Authors:
Moore T.,
Smartt S. J.,
Nicholl M.,
Srivastav S.,
Stevance H. F.,
Jess D. B.,
Grant S. D. T.,
Fulton M. D.,
Rhodes L.,
Sim S. A.,
Hirai R.,
Podsiadlowski P.,
Anderson J. P.,
Ashall C.,
Bate W.,
Fender R.,
Gutierrez C. P.,
Howell D. A.,
Huber M. E.,
Inserra C.,
Leloudas G.,
Monard L. A. G.,
Muller-Bravo T. E.,
Shappee B. J.,
Smith K. W.
, et al. (20 additional authors not shown)
Abstract:
We present multi-wavelength photometry and spectroscopy of SN 2022jli, an unprecedented Type Ic supernova discovered in the galaxy NGC 157 at a distance of $\approx$ 23 Mpc. The multi-band light curves reveal many remarkable characteristics. Peaking at a magnitude of $g=15.11\pm0.02$, the high-cadence photometry reveals 12.5$\pm0.2\ $day periodic undulations superimposed on the 200 day supernova d…
▽ More
We present multi-wavelength photometry and spectroscopy of SN 2022jli, an unprecedented Type Ic supernova discovered in the galaxy NGC 157 at a distance of $\approx$ 23 Mpc. The multi-band light curves reveal many remarkable characteristics. Peaking at a magnitude of $g=15.11\pm0.02$, the high-cadence photometry reveals 12.5$\pm0.2\ $day periodic undulations superimposed on the 200 day supernova decline. This periodicity is observed in the light curves from nine separate filter and instrument configurations with peak-to-peak amplitudes of $\simeq$ 0.1 mag. This is the first time that repeated periodic oscillations, over many cycles, have been detected in a supernova light curve. SN 2022jli also displays an extreme early excess which fades over $\approx$ 25 days followed by a rise to a peak luminosity of $L_{\rm opt} = 10^{42.1}$ erg s$^{-1}$. Although the exact explosion epoch is not constrained by data, the time from explosion to maximum light is $\gtrsim$ 59 days. The luminosity can be explained by a large ejecta mass ($M_{\rm ej}\approx12\pm6$M$_{\odot}$) powered by $^{56}$Ni but we find difficulty in quantitatively modelling the early excess with circumstellar interaction and cooling. Collision between the supernova ejecta and a binary companion is a possible source of this emission. We discuss the origin of the periodic variability in the light curve, including interaction of the SN ejecta with nested shells of circumstellar matter and neutron stars colliding with binary companions.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
HiveLink, an IoT based Smart Bee Hive Monitoring System
Authors:
Ajwin Dsouza,
Aditya P,
Sameer Hegde
Abstract:
HiveLink, the IoT-based Smart Bee Hive Monitoring System addresses the challenges faced by beekeepers in managing the influence of environmental impact, diseases, and collapse in honey bee colonies. Integrated with advanced sensors, the system monitors temperature, humidity, hive weight, and diurnal cycle. Leveraging IoT technology, the system provides real-time data, remote connectivity, and acti…
▽ More
HiveLink, the IoT-based Smart Bee Hive Monitoring System addresses the challenges faced by beekeepers in managing the influence of environmental impact, diseases, and collapse in honey bee colonies. Integrated with advanced sensors, the system monitors temperature, humidity, hive weight, and diurnal cycle. Leveraging IoT technology, the system provides real-time data, remote connectivity, and actionable insights for beekeepers. Monitoring the hive with the system enables early disease detection, proactive interventions, and optimized hive management. Minimizing manual inspections, enhancing productivity, and promoting sustainable practices to mitigate environmental impact and support honey bee populations. Therefore, this system is a demonstration of technology-driven solution to ensure the well-being of bee hives by facilitating data-driven decision-making and contributes to the resilience of beekeeping in the face of diverse challenges.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Enhancing Knee Osteoarthritis severity level classification using diffusion augmented images
Authors:
Paleti Nikhil Chowdary,
Gorantla V N S L Vishnu Vardhan,
Menta Sai Akshay,
Menta Sai Aashish,
Vadlapudi Sai Aravind,
Garapati Venkata Krishna Rayalu,
Aswathy P
Abstract:
This research paper explores the classification of knee osteoarthritis (OA) severity levels using advanced computer vision models and augmentation techniques. The study investigates the effectiveness of data preprocessing, including Contrast-Limited Adaptive Histogram Equalization (CLAHE), and data augmentation using diffusion models. Three experiments were conducted: training models on the origin…
▽ More
This research paper explores the classification of knee osteoarthritis (OA) severity levels using advanced computer vision models and augmentation techniques. The study investigates the effectiveness of data preprocessing, including Contrast-Limited Adaptive Histogram Equalization (CLAHE), and data augmentation using diffusion models. Three experiments were conducted: training models on the original dataset, training models on the preprocessed dataset, and training models on the augmented dataset. The results show that data preprocessing and augmentation significantly improve the accuracy of the models. The EfficientNetB3 model achieved the highest accuracy of 84\% on the augmented dataset. Additionally, attention visualization techniques, such as Grad-CAM, are utilized to provide detailed attention maps, enhancing the understanding and trustworthiness of the models. These findings highlight the potential of combining advanced models with augmented data and attention visualization for accurate knee OA severity classification.
△ Less
Submitted 17 September, 2023;
originally announced September 2023.
-
Neural Discovery of Permutation Subgroups
Authors:
Pavan Karjol,
Rohan Kashyap,
Prathosh A P
Abstract:
We consider the problem of discovering subgroup $H$ of permutation group $S_{n}$. Unlike the traditional $H$-invariant networks wherein $H$ is assumed to be known, we present a method to discover the underlying subgroup, given that it satisfies certain conditions. Our results show that one could discover any subgroup of type $S_{k} (k \leq n)$ by learning an $S_{n}$-invariant function and a linear…
▽ More
We consider the problem of discovering subgroup $H$ of permutation group $S_{n}$. Unlike the traditional $H$-invariant networks wherein $H$ is assumed to be known, we present a method to discover the underlying subgroup, given that it satisfies certain conditions. Our results show that one could discover any subgroup of type $S_{k} (k \leq n)$ by learning an $S_{n}$-invariant function and a linear transformation. We also prove similar results for cyclic and dihedral subgroups. Finally, we provide a general theorem that can be extended to discover other subgroups of $S_{n}$. We also demonstrate the applicability of our results through numerical experiments on image-digit sum and symmetric polynomial regression tasks.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
A Unified Framework for Discovering Discrete Symmetries
Authors:
Pavan Karjol,
Rohan Kashyap,
Aditya Gopalan,
Prathosh A. P
Abstract:
We consider the problem of learning a function respecting a symmetry from among a class of symmetries. We develop a unified framework that enables symmetry discovery across a broad range of subgroups including locally symmetric, dihedral and cyclic subgroups. At the core of the framework is a novel architecture composed of linear, matrix-valued and non-linear functions that expresses functions inv…
▽ More
We consider the problem of learning a function respecting a symmetry from among a class of symmetries. We develop a unified framework that enables symmetry discovery across a broad range of subgroups including locally symmetric, dihedral and cyclic subgroups. At the core of the framework is a novel architecture composed of linear, matrix-valued and non-linear functions that expresses functions invariant to these subgroups in a principled manner. The structure of the architecture enables us to leverage multi-armed bandit algorithms and gradient descent to efficiently optimize over the linear and the non-linear functions, respectively, and to infer the symmetry that is ultimately learnt. We also discuss the necessity of the matrix-valued functions in the architecture. Experiments on image-digit sum and polynomial regression tasks demonstrate the effectiveness of our approach.
△ Less
Submitted 27 October, 2023; v1 submitted 6 September, 2023;
originally announced September 2023.
-
GenSelfDiff-HIS: Generative Self-Supervision Using Diffusion for Histopathological Image Segmentation
Authors:
Vishnuvardhan Purma,
Suhas Srinath,
Seshan Srirangarajan,
Aanchal Kakkar,
Prathosh A. P
Abstract:
Histopathological image segmentation is a laborious and time-intensive task, often requiring analysis from experienced pathologists for accurate examinations. To reduce this burden, supervised machine-learning approaches have been adopted using large-scale annotated datasets for histopathological image analysis. However, in several scenarios, the availability of large-scale annotated data is a bot…
▽ More
Histopathological image segmentation is a laborious and time-intensive task, often requiring analysis from experienced pathologists for accurate examinations. To reduce this burden, supervised machine-learning approaches have been adopted using large-scale annotated datasets for histopathological image analysis. However, in several scenarios, the availability of large-scale annotated data is a bottleneck while training such models. Self-supervised learning (SSL) is an alternative paradigm that provides some respite by constructing models utilizing only the unannotated data which is often abundant. The basic idea of SSL is to train a network to perform one or many pseudo or pretext tasks on unannotated data and use it subsequently as the basis for a variety of downstream tasks. It is seen that the success of SSL depends critically on the considered pretext task. While there have been many efforts in designing pretext tasks for classification problems, there haven't been many attempts on SSL for histopathological segmentation. Motivated by this, we propose an SSL approach for segmenting histopathological images via generative diffusion models in this paper. Our method is based on the observation that diffusion models effectively solve an image-to-image translation task akin to a segmentation task. Hence, we propose generative diffusion as the pretext task for histopathological image segmentation. We also propose a multi-loss function-based fine-tuning for the downstream task. We validate our method using several metrics on two publically available datasets along with a newly proposed head and neck (HN) cancer dataset containing hematoxylin and eosin (H\&E) stained images along with annotations. Codes will be made public at https://github.com/suhas-srinath/GenSelfDiff-HIS.
△ Less
Submitted 10 September, 2024; v1 submitted 4 September, 2023;
originally announced September 2023.
-
On the Existence of Elementwise Invariant Vectors in Representations of Symmetric Groups
Authors:
Amrutha P,
Amritanshu Prasad,
Velmurugan S
Abstract:
We determine when a permutation with cycle type $μ$ admits a non-zero invariant vector in the irreducible representation $V_λ$ of the symmetric group. We find that a majority of pairs $(λ,μ)$ have this property, with only a few simple exceptions.
We determine when a permutation with cycle type $μ$ admits a non-zero invariant vector in the irreducible representation $V_λ$ of the symmetric group. We find that a majority of pairs $(λ,μ)$ have this property, with only a few simple exceptions.
△ Less
Submitted 30 October, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Correlating Medi-Claim Service by Deep Learning Neural Networks
Authors:
Jayanthi Vajiram,
Negha Senthil,
Nean Adhith. P
Abstract:
Medical insurance claims are of organized crimes related to patients, physicians, diagnostic centers, and insurance providers, forming a chain reaction that must be monitored constantly. These kinds of frauds affect the financial growth of both insured people and health insurance companies. The Convolution Neural Network architecture is used to detect fraudulent claims through a correlation study…
▽ More
Medical insurance claims are of organized crimes related to patients, physicians, diagnostic centers, and insurance providers, forming a chain reaction that must be monitored constantly. These kinds of frauds affect the financial growth of both insured people and health insurance companies. The Convolution Neural Network architecture is used to detect fraudulent claims through a correlation study of regression models, which helps to detect money laundering on different claims given by different providers. Supervised and unsupervised classifiers are used to detect fraud and non-fraud claims.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Exploration of legal implications of air and space travel for international and domestic travel and the Environment
Authors:
Jayanthi Vajiram,
Negha Senthil,
Nean Adhith. P,
Ritikaa. VN
Abstract:
The rapid growth of air and space travel in recent years has resulted in an increased demand for legal regulation in the aviation and aerospace fields. This paper provides an overview of air and space law, including the topics of aircraft accident investigations, air traffic control, international borders and law, and the regulation of space activities. With the increasing complexity of air and sp…
▽ More
The rapid growth of air and space travel in recent years has resulted in an increased demand for legal regulation in the aviation and aerospace fields. This paper provides an overview of air and space law, including the topics of aircraft accident investigations, air traffic control, international borders and law, and the regulation of space activities. With the increasing complexity of air and space travel, it is important to understand the legal implications of these activities. This paper examines the various legal aspects of air and space law, including the roles of national governments, international organizations, and private entities. It also provides an overview of the legal frameworks that govern these activities and the implications of international law. Finally, it considers the potential for future developments in the field of air and space law. This paper provides a comprehensive overview of the legal aspects of air and space travel and their implications for international and domestic travel, as well as for international business and other activities in the air and space domains.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
Multi-mission view of low-luminosity 'obscured' phase of GRS 1915+105
Authors:
Athulya M. P.,
Anuj Nandi
Abstract:
GRS 1915+105 is observed in an 'obscured' phase since May 2019, exhibiting steady and low X-ray luminosities, while being intervened by sporadic re-brightenings. In this work, we perform a comprehensive and wide-band analysis of the spectral and timing properties of the source during the period $2019-2021$ using AstroSat (SXT: $0.5-8$ keV; LAXPC: $3-60$ keV), NICER ($0.5-12$ keV), and NuSTAR (…
▽ More
GRS 1915+105 is observed in an 'obscured' phase since May 2019, exhibiting steady and low X-ray luminosities, while being intervened by sporadic re-brightenings. In this work, we perform a comprehensive and wide-band analysis of the spectral and timing properties of the source during the period $2019-2021$ using AstroSat (SXT: $0.5-8$ keV; LAXPC: $3-60$ keV), NICER ($0.5-12$ keV), and NuSTAR ($3-60$ keV) observations. Spectral analysis reveals the presence of a highly variable obscurer (N$_{H_{1}}\sim~10^{22} - 10^{24}$ atoms cm$^{-2}$) throughout the observation period. Source is detected in the Low/Hard state for most of the time, with the spectra being described by a Comptonised component ($Γ\sim 1.16 - 1.79$, kT$_{e}\sim 2-31$ keV). The source spectra steepen ($Γ\sim2.5$) indicating softening of the spectrum during the rise of the re-brightenings. Various emission and absorption lines corresponding to the neutral Fe-K$α$, Fe-XXV K$α$, Fe-XXVI K$α$, and the Ni-XXVIII K$α$ were detected with equivalent widths varying between 70 eV $-$ 3.5 keV. The column density of the absorbing plasma varied between $10^{16} - 10^{18}$ atoms cm$^{-2}$ at a distance $\leq2\times$10$^{10}$ cm. Interestingly, the source is also seen exhibiting various variability classes ($ρ, λ, δ, χ$) at relatively low luminosities ($\sim$0.01L$_{Edd}$) during the re-brightening phases. Different variability classes show signature of QPOs ($ν_{QPO}$: 20--180 mHz, rms$_{QPO}$: 7.5% - 16%). The source showed a maximum bolometric luminosity {(L$_{bol}$)} of $\sim$0.01L$_{Edd}$ (Re-brightening phases) and a minimum L$_{bol}$ of 0.004L$_{Edd}$ (Quiet phase) during the period. We discuss the possible disc dynamics around the black hole during this low-luminosity `obscured' phase.
△ Less
Submitted 9 July, 2023;
originally announced July 2023.
-
Hypergraph representation in brain network analysis
Authors:
Anagha P,
Selvakumar R
Abstract:
For the study of functional aspects of the brain network. This paper is a study on the hypergraph representation, based on the functional regions of the brain network. A new parameter that can measure how many multifunctioning regions each function contains and thereby the correlation of other functions with each function.
For the study of functional aspects of the brain network. This paper is a study on the hypergraph representation, based on the functional regions of the brain network. A new parameter that can measure how many multifunctioning regions each function contains and thereby the correlation of other functions with each function.
△ Less
Submitted 19 December, 2023; v1 submitted 5 May, 2023;
originally announced May 2023.
-
Analyzing travel time reliability of a bus route in a limited data set scenario: A case study
Authors:
Ashwini B P,
R Sumathi,
Sudhira H S
Abstract:
In this information era commuters prefer to know a reliable travel time to plan ahead of their journey using both public and private modes. In this direction reliability analysis using the location data of the buses is conducted in two folds in the current work; (i) Reliability analysis of a public transit service at route level, and (ii) Travel time reliability analysis of a route utilizing the l…
▽ More
In this information era commuters prefer to know a reliable travel time to plan ahead of their journey using both public and private modes. In this direction reliability analysis using the location data of the buses is conducted in two folds in the current work; (i) Reliability analysis of a public transit service at route level, and (ii) Travel time reliability analysis of a route utilizing the location data of the buses. The reliability parameters assessed for public transit service are headway, passenger waiting time, travel speed, and travel time as per the Service Level Benchmarks for Urban Transport by the National Urban Transport Policy, Government of India. And travel time reliability parameters such as Buffer Time Index, Travel Time Index, and Planning Time Index are assessed as per Federal Highway Administration, Department of Transportation, U S. The study is conducted in Tumakuru city, India for a significant bus route in a limited data sources scenario. The results suggest that (i) the Level of Service of the public transit service needs improvement. (ii)around 30% excess of average travel time is needed as buffer time. (iii) more than double the amount of free flow travel time must be planned during peak hours and in the worst case. In the future, the analysis conducted for the route can be extended for citywide performance analysis in both folds. Also, the same method can be applied to cities with similar demographics and traffic-related infrastructure.
△ Less
Submitted 31 March, 2023;
originally announced March 2023.
-
Bayesian Pseudo-Coresets via Contrastive Divergence
Authors:
Piyush Tiwary,
Kumar Shubham,
Vivek V. Kashyap,
Prathosh A. P
Abstract:
Bayesian methods provide an elegant framework for estimating parameter posteriors and quantification of uncertainty associated with probabilistic models. However, they often suffer from slow inference times. To address this challenge, Bayesian Pseudo-Coresets (BPC) have emerged as a promising solution. BPC methods aim to create a small synthetic dataset, known as pseudo-coresets, that approximates…
▽ More
Bayesian methods provide an elegant framework for estimating parameter posteriors and quantification of uncertainty associated with probabilistic models. However, they often suffer from slow inference times. To address this challenge, Bayesian Pseudo-Coresets (BPC) have emerged as a promising solution. BPC methods aim to create a small synthetic dataset, known as pseudo-coresets, that approximates the posterior inference achieved with the original dataset. This approximation is achieved by optimizing a divergence measure between the true posterior and the pseudo-coreset posterior. Various divergence measures have been proposed for constructing pseudo-coresets, with forward Kullback-Leibler (KL) divergence being the most successful. However, using forward KL divergence necessitates sampling from the pseudo-coreset posterior, often accomplished through approximate Gaussian variational distributions. Alternatively, one could employ Markov Chain Monte Carlo (MCMC) methods for sampling, but this becomes challenging in high-dimensional parameter spaces due to slow mixing. In this study, we introduce a novel approach for constructing pseudo-coresets by utilizing contrastive divergence. Importantly, optimizing contrastive divergence eliminates the need for approximations in the pseudo-coreset construction process. Furthermore, it enables the use of finite-step MCMC methods, alleviating the requirement for extensive mixing to reach a stationary distribution. To validate our method's effectiveness, we conduct extensive experiments on multiple datasets, demonstrating its superiority over existing BPC techniques.
△ Less
Submitted 8 May, 2024; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Some Coupled Fixed Point Theorems for (ψ, φ)- contraction with Applications to Fractals
Authors:
Athul P,
D. Ramesh Kumar
Abstract:
In this paper, we obtain coupled fixed point theorem for (ψ, φ)-contractions under some generalized conditions on the real valued functions ψand φdefined on (0,\infinity). Also, we present a generalized version of coupled fixed point theorem for the same (ψ, φ)- contractions. A new approach to fractal generation using the relation between fractals and fixed points is given in light of these fixed…
▽ More
In this paper, we obtain coupled fixed point theorem for (ψ, φ)-contractions under some generalized conditions on the real valued functions ψand φdefined on (0,\infinity). Also, we present a generalized version of coupled fixed point theorem for the same (ψ, φ)- contractions. A new approach to fractal generation using the relation between fractals and fixed points is given in light of these fixed point theorems. We establish a new type of iterated function system consisting of generalized (ψ, φ)-contractions. We also extend those results to coupled fractals.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.