Skip to main content

Showing 1–36 of 36 results for author: Venkatraman, S

.
  1. arXiv:2504.01243  [pdf, other

    cs.CV cs.AI cs.LG cs.RO eess.IV

    FUSION: Frequency-guided Underwater Spatial Image recOnstructioN

    Authors: Jaskaran Singh Walia, Shravan Venkatraman, Pavithra LK

    Abstract: Underwater images suffer from severe degradations, including color distortions, reduced visibility, and loss of structural details due to wavelength-dependent attenuation and scattering. Existing enhancement methods primarily focus on spatial-domain processing, neglecting the frequency domain's potential to capture global color distributions and long-range dependencies. To address these limitation… ▽ More

    Submitted 13 April, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

  2. arXiv:2503.18929  [pdf, other

    cs.LG

    Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

    Authors: Brian R. Bartoldson, Siddarth Venkatraman, James Diffenderfer, Moksh Jain, Tal Ben-Nun, Seanie Lee, Minsu Kim, Johan Obando-Ceron, Yoshua Bengio, Bhavya Kailkhura

    Abstract: Reinforcement learning (RL) is a critical component of large language model (LLM) post-training. However, existing on-policy algorithms used for post-training are inherently incompatible with the use of experience replay buffers, which can be populated scalably by distributed off-policy actors to enhance exploration as compute increases. We propose efficiently obtaining this benefit of replay buff… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  3. arXiv:2503.09746  [pdf, other

    cs.LG cs.AI stat.ML

    Solving Bayesian inverse problems with diffusion priors and off-policy RL

    Authors: Luca Scimeca, Siddarth Venkatraman, Moksh Jain, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yashar Hezaveh, Laurence Perreault-Levasseur, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: This paper presents a practical application of Relative Trajectory Balance (RTB), a recently introduced off-policy reinforcement learning (RL) objective that can asymptotically solve Bayesian inverse problems optimally. We extend the original work by using RTB to train conditional diffusion model posteriors from pretrained unconditional priors for challenging linear and non-linear inverse problems… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: Accepted as workshop paper at DeLTa workshop, ICLR 2025. arXiv admin note: substantial text overlap with arXiv:2405.20971

  4. arXiv:2502.06999  [pdf, other

    cs.LG

    Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models

    Authors: Siddarth Venkatraman, Mohsin Hasan, Minsu Kim, Luca Scimeca, Marcin Sendera, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Any well-behaved generative model over a variable $\mathbf{x}$ can be expressed as a deterministic transformation of an exogenous ('outsourced') Gaussian noise variable $\mathbf{z}$: $\mathbf{x}=f_θ(\mathbf{z})$. In such a model (\eg, a VAE, GAN, or continuous-time flow-based model), sampling of the target variable $\mathbf{x} \sim p_θ(\mathbf{x})$ is straightforward, but sampling from a posterior… ▽ More

    Submitted 28 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: ICML 2025; code: https://github.com/HyperPotatoNeo/Outsourced_Diffusion_Sampling

  5. arXiv:2501.19301  [pdf, other

    cs.CL cs.AI

    Beyond checkmate: exploring the creative chokepoints in AI text

    Authors: Nafis Irtiza Tripto, Saranya Venkatraman, Mahjabin Nahar, Dongwon Lee

    Abstract: Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP) and Artificial Intelligence (AI), unlocking unprecedented capabilities. This rapid advancement has spurred research into various aspects of LLMs, their text generation & reasoning capability, and potential misuse, fueling the necessity for robust detection methods. While numerous prior research has focused on detect… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: 18 pages, single columns, under review at Nature Machine Intelligence

  6. arXiv:2411.10843  [pdf

    eess.IV cs.AI cs.CV cs.LG

    A Novel Adaptive Hybrid Focal-Entropy Loss for Enhancing Diabetic Retinopathy Detection Using Convolutional Neural Networks

    Authors: Santhosh Malarvannan, Pandiyaraju V, Shravan Venkatraman, Abeshek A, Priyadarshini B, Kannan A

    Abstract: Diabetic retinopathy is a leading cause of blindness around the world and demands precise AI-based diagnostic tools. Traditional loss functions in multi-class classification, such as Categorical Cross-Entropy (CCE), are very common but break down with class imbalance, especially in cases with inherently challenging or overlapping classes, which leads to biased and less sensitive models. Since a he… ▽ More

    Submitted 23 April, 2025; v1 submitted 16 November, 2024; originally announced November 2024.

    Comments: 7 pages,7 figures

    MSC Class: 68T07; 92C55; 68U10 ACM Class: I.2.10; I.5.1; J.3

  7. arXiv:2411.09420  [pdf, other

    cs.CV cs.AI cs.LG

    SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers

    Authors: Shravan Venkatraman, Jaskaran Singh Walia, Joe Dhanith P R

    Abstract: Vision Transformers (ViTs) have redefined image classification by leveraging self-attention to capture complex patterns and long-range dependencies between image patches. However, a key challenge for ViTs is efficiently incorporating multi-scale feature representations, which is inherent in convolutional neural networks (CNNs) through their hierarchical structure. Graph transformers have made stri… ▽ More

    Submitted 7 January, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: 14 pages, 8 figures, 9 tables

    MSC Class: 68T07 ACM Class: I.2.10

  8. arXiv:2411.04032  [pdf, other

    cs.CL

    Beemo: Benchmark of Expert-edited Machine-generated Outputs

    Authors: Ekaterina Artemova, Jason Lucas, Saranya Venkatraman, Jooyoung Lee, Sergei Tilga, Adaku Uchendu, Vladislav Mikhailov

    Abstract: The rapid proliferation of large language models (LLMs) has increased the volume of machine-generated texts (MGTs) and blurred text authorship in various domains. However, most existing MGT benchmarks include single-author texts (human-written and machine-generated). This conventional design fails to capture more practical multi-author scenarios, where the user refines the LLM response for natural… ▽ More

    Submitted 17 March, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: Accepted to NAACL 2025

  9. arXiv:2409.17273  [pdf, other

    eess.IV cs.CV cs.LG

    Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI

    Authors: Shravan Venkatraman, Pandiyaraju V, Abeshek A, Aravintakshan S A, Pavan Kumar S, Kannan A, Madhan S

    Abstract: Brain tumors result from abnormal cell growth in brain tissue. If undiagnosed, they cause neurological deficits, including cognitive impairment, motor dysfunction, and sensory loss. As tumors grow, intracranial pressure increases, potentially leading to fatal complications such as brain herniation. Early diagnosis and treatment are crucial to controlling these effects and slowing tumor progression… ▽ More

    Submitted 18 March, 2025; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: 29 pages, 25 figures, 6 tables

    ACM Class: I.4.6

  10. A Novel Self-Attention-Enabled Weighted Ensemble-Based Convolutional Neural Network Framework for Distributed Denial of Service Attack Classification

    Authors: Kanthimathi S, Shravan Venkatraman, Jayasankar K S, Pranay Jiljith T, Jashwanth R

    Abstract: Distributed Denial of Service (DDoS) attacks are a major concern in network security, as they overwhelm systems with excessive traffic, compromise sensitive data, and disrupt network services. Accurately detecting these attacks is crucial to protecting network infrastructure. Traditional approaches, such as single Convolutional Neural Networks (CNNs) or conventional Machine Learning (ML) algorithm… ▽ More

    Submitted 12 October, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

    Comments: 19 pages, 3 tables, 9 figures

    ACM Class: I.2.6

  11. arXiv:2409.00804  [pdf

    eess.IV cs.CV cs.LG

    Leveraging SeNet and ResNet Synergy within an Encoder-Decoder Architecture for Glioma Detection

    Authors: Pandiyaraju V, Shravan Venkatraman, Abeshek A, Pavan Kumar S, Aravintakshan S A

    Abstract: Brain tumors are abnormalities that can severely impact a patient's health, leading to life-threatening conditions such as cancer. These can result in various debilitating effects, including neurological issues, cognitive impairment, motor and sensory deficits, as well as emotional and behavioral changes. These symptoms significantly affect a patient's quality of life, making early diagnosis and t… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: 9 pages, 6 figures, 1 table

    ACM Class: I.4.6

  12. arXiv:2407.18552  [pdf

    cs.MM cs.CL cs.CV cs.LG cs.SD eess.AS

    Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention

    Authors: Joe Dhanith P R, Shravan Venkatraman, Vigya Sharma, Santhosh Malarvannan, Modigari Narendra

    Abstract: Understanding emotions is a fundamental aspect of human communication. Integrating audio and video signals offers a more comprehensive understanding of emotional states compared to traditional methods that rely on a single data source, such as speech or facial expressions. Despite its potential, multimodal emotion recognition faces significant challenges, particularly in synchronization, feature e… ▽ More

    Submitted 19 February, 2025; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: 38 Pages, 9 Tables, 12 Figures

    ACM Class: F.2.2; I.2.7

  13. arXiv:2407.11753  [pdf

    cs.CV cs.AI cs.LG

    A Channel Attention-Driven Hybrid CNN Framework for Paddy Leaf Disease Detection

    Authors: Pandiyaraju V, Shravan Venkatraman, Abeshek A, Pavan Kumar S, Aravintakshan S A, Senthil Kumar A M, Kannan A

    Abstract: Farmers face various challenges when it comes to identifying diseases in rice leaves during their early stages of growth, which is a major reason for poor produce. Therefore, early and accurate disease identification is important in agriculture to avoid crop loss and improve cultivation. In this research, we propose a novel hybrid deep learning (DL) classifier designed by extending the Squeeze-and… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 17 pages, 4 tables, 10 figures

    ACM Class: F.2.2; I.2.7

  14. arXiv:2407.10921  [pdf, other

    eess.IV cs.CV cs.LG

    Leveraging Bi-Focal Perspectives and Granular Feature Integration for Accurate Reliable Early Alzheimer's Detection

    Authors: Shravan Venkatraman, Pandiyaraju V, Abeshek A, Pavan Kumar S, Aravintakshan S A

    Abstract: Being the most commonly known neurodegeneration, Alzheimer's Disease (AD) is annually diagnosed in millions of patients. The present medical scenario still finds the exact diagnosis and classification of AD through neuroimaging data as a challenging task. Traditional CNNs can extract a good amount of low-level information in an image while failing to extract high-level minuscule particles, which i… ▽ More

    Submitted 18 March, 2025; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 14 pages, 12 figures, 6 tables

    ACM Class: I.4.8; I.2.10; I.4.6

  15. arXiv:2407.02844  [pdf, other

    eess.IV cs.CV cs.LG

    Exploiting Precision Mapping and Component-Specific Feature Enhancement for Breast Cancer Segmentation and Identification

    Authors: Pandiyaraju V, Shravan Venkatraman, Pavan Kumar S, Santhosh Malarvannan, Kannan A

    Abstract: Breast cancer is one of the leading causes of death globally, and thus there is an urgent need for early and accurate diagnostic techniques. Although ultrasound imaging is a widely used technique for breast cancer screening, it faces challenges such as poor boundary delineation caused by variations in tumor morphology and reduced diagnostic accuracy due to inconsistent image quality. To address th… ▽ More

    Submitted 10 February, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 27 pages, 18 figures, 6 tables

    MSC Class: 68T07; 68T45; 92C55; 65D18 ACM Class: I.2.10; I.4.6; I.5.1; I.5.4

  16. arXiv:2406.12665  [pdf, other

    cs.CL cs.AI

    CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis

    Authors: Saranya Venkatraman, Nafis Irtiza Tripto, Dongwon Lee

    Abstract: The rise of unifying frameworks that enable seamless interoperability of Large Language Models (LLMs) has made LLM-LLM collaboration for open-ended tasks a possibility. Despite this, there have not been efforts to explore such collaborative writing. We take the next step beyond human-LLM collaboration to explore this multi-LLM scenario by generating the first exclusively LLM-generated collaborativ… ▽ More

    Submitted 10 February, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to NAACL Findings 2025

  17. arXiv:2405.20971  [pdf, other

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat… ▽ More

    Submitted 13 January, 2025; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024; code: https://github.com/GFNOrg/diffusion-finetuning

  18. arXiv:2403.00954  [pdf, other

    cs.HC

    ClassInSight: Designing Conversation Support Tools to Visualize Classroom Discussion for Personalized Teacher Professional Development

    Authors: Tricia J. Ngoon, S Sushil, Angela Stewart, Ung-Sang Lee, Saranya Venkatraman, Neil Thawani, Prasenjit Mitra, Sherice Clarke, John Zimmerman, Amy Ogan

    Abstract: Teaching is one of many professions for which personalized feedback and reflection can help improve dialogue and discussion between the professional and those they serve. However, professional development (PD) is often impersonal as human observation is labor-intensive. Data-driven PD tools in teaching are of growing interest, but open questions about how professionals engage with their data in pr… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  19. arXiv:2402.00835  [pdf, other

    cs.CL cs.AI cs.LG

    ALISON: Fast and Effective Stylometric Authorship Obfuscation

    Authors: Eric Xing, Saranya Venkatraman, Thai Le, Dongwon Lee

    Abstract: Authorship Attribution (AA) and Authorship Obfuscation (AO) are two competing tasks of increasing importance in privacy research. Modern AA leverages an author's consistent writing style to match a text to its author using an AA classifier. AO is the corresponding adversarial task, aiming to modify a text in such a way that its semantics are preserved, yet an AA model cannot correctly infer its au… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 figures, 4 tables. To be published in the Proceedings of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24)

    ACM Class: I.2.7; I.2.0

  20. arXiv:2311.08374  [pdf, other

    cs.CL

    A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts

    Authors: Nafis Irtiza Tripto, Saranya Venkatraman, Dominik Macko, Robert Moro, Ivan Srba, Adaku Uchendu, Thai Le, Dongwon Lee

    Abstract: In the realm of text manipulation and linguistic transformation, the question of authorship has been a subject of fascination and philosophical inquiry. Much like the Ship of Theseus paradox, which ponders whether a ship remains the same when each of its original planks is replaced, our research delves into an intriguing question: Does a text retain its original authorship when it undergoes numero… ▽ More

    Submitted 6 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: To appear in Association for Computational Linguistics (ACL 2024)

  21. arXiv:2310.12318  [pdf, other

    cs.CL cs.AI cs.CY cs.HC

    The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis

    Authors: Pranav Narayanan Venkit, Mukund Srinath, Sanjana Gautam, Saranya Venkatraman, Vipul Gupta, Rebecca J. Passonneau, Shomir Wilson

    Abstract: We conduct an inquiry into the sociotechnical aspects of sentiment analysis (SA) by critically examining 189 peer-reviewed papers on their applications, models, and datasets. Our investigation stems from the recognition that SA has become an integral component of diverse sociotechnical systems, exerting influence on both social and technical users. By delving into sociological and technological li… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted and will appear at the EMNLP 2023 Main Conference

  22. arXiv:2310.06202  [pdf, other

    cs.CL

    GPT-who: An Information Density-based Machine-Generated Text Detector

    Authors: Saranya Venkatraman, Adaku Uchendu, Dongwon Lee

    Abstract: The Uniform Information Density (UID) principle posits that humans prefer to spread information evenly during language production. We examine if this UID principle can help capture differences between Large Language Models (LLMs)-generated and human-generated texts. We propose GPT-who, the first psycholinguistically-inspired domain-agnostic statistical detector. This detector employs UID-based fea… ▽ More

    Submitted 3 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: To appear in Findings of the Association for Computational Linguistics: NAACL 2024

  23. arXiv:2309.06599  [pdf, other

    cs.LG

    Reasoning with Latent Diffusion in Offline Reinforcement Learning

    Authors: Siddarth Venkatraman, Shivesh Khaitan, Ravi Tej Akella, John Dolan, Jeff Schneider, Glen Berseth

    Abstract: Offline reinforcement learning (RL) holds promise as a means to learn high-reward policies from a static dataset, without the need for further environment interactions. However, a key challenge in offline RL lies in effectively stitching portions of suboptimal trajectories from the static dataset while avoiding extrapolation errors arising due to a lack of support in the dataset. Existing approach… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  24. arXiv:2308.09166  [pdf, other

    stat.ME math.DS

    Sparse reconstruction of ordinary differential equations with inference

    Authors: Sara Venkatraman, Sumanta Basu, Martin T. Wells

    Abstract: Sparse regression has emerged as a popular technique for learning dynamical systems from temporal data, beginning with the SINDy (Sparse Identification of Nonlinear Dynamics) framework proposed by arXiv:1509.03580. Quantifying the uncertainty inherent in differential equations learned from data remains an open problem, thus we propose leveraging recent advances in statistical inference for sparse… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  25. arXiv:2303.17006  [pdf, other

    cs.CL

    How do decoding algorithms distribute information in dialogue responses?

    Authors: Saranya Venkatraman, He He, David Reitter

    Abstract: Humans tend to follow the Uniform Information Density (UID) principle by distributing information evenly in utterances. We study if decoding algorithms implicitly follow this UID principle, and under what conditions adherence to UID might be desirable for dialogue generation. We generate responses using different decoding algorithms with GPT-2 on the Persona-Chat dataset and collect human judgment… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  26. arXiv:2301.13712  [pdf, other

    eess.SY cs.CR

    A Bi-Level Stochastic Game Model for PMU Placement in Power Grid with Cybersecurity Risks

    Authors: Saptarshi Ghosh, Murali Sankar Venkatraman, Shehab Ahmed, Charalambos Konstantinou

    Abstract: Phasor measurement units (PMUs) provide accurate and high-fidelity measurements in order to monitor the state of the power grid and support various control and planning tasks. However, PMUs have a high installation cost prohibiting their massive deployment. Minimizing the number of installed PMUs needs to be achieved while also maintaining full observability of the network. At the same time, data… ▽ More

    Submitted 15 April, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: 2023 IEEE Belgrade PowerTech

  27. arXiv:2203.07664  [pdf, other

    cs.CV cs.AI

    Can you even tell left from right? Presenting a new challenge for VQA

    Authors: Sai Raam Venkatraman, Rishi Rao, S. Balasubramanian, Chandra Sekhar Vorugunti, R. Raghunatha Sarma

    Abstract: Visual Question Answering (VQA) needs a means of evaluating the strengths and weaknesses of models. One aspect of such an evaluation is the evaluation of compositional generalisation, or the ability of a model to answer well on scenes whose scene-setups are different from the training set. Therefore, for this purpose, we need datasets whose train and test sets differ significantly in composition.… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  28. arXiv:2203.04563  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    MLNav: Learning to Safely Navigate on Martian Terrains

    Authors: Shreyansh Daftry, Neil Abcouwer, Tyler Del Sesto, Siddarth Venkatraman, Jialin Song, Lucas Igel, Amos Byon, Ugo Rosolia, Yisong Yue, Masahiro Ono

    Abstract: We present MLNav, a learning-enhanced path planning framework for safety-critical and resource-limited systems operating in complex environments, such as rovers navigating on Mars. MLNav makes judicious use of machine learning to enhance the efficiency of path planning while fully respecting safety constraints. In particular, the dominant computational cost in such safety-critical settings is runn… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: IEEE Robotics and Automation Letters (RA-L) and ICRA 2022

  29. An empirical Bayes approach to estimating dynamic models of co-regulated gene expression

    Authors: Sara Venkatraman, Sumanta Basu, Andrew G. Clark, Sofie Delbare, Myung Hee Lee, Martin T. Wells

    Abstract: Time-course gene expression datasets provide insight into the dynamics of complex biological processes, such as immune response and organ development. It is of interest to identify genes with similar temporal expression patterns because such genes are often biologically related. However, this task is challenging due to the high dimensionality of these datasets and the nonlinearity of gene expressi… ▽ More

    Submitted 31 December, 2021; originally announced December 2021.

  30. arXiv:2011.06022  [pdf, other

    cs.RO cs.LG

    Machine Learning Based Path Planning for Improved Rover Navigation (Pre-Print Version)

    Authors: Neil Abcouwer, Shreyansh Daftry, Siddarth Venkatraman, Tyler del Sesto, Olivier Toupet, Ravi Lanka, Jialin Song, Yisong Yue, Masahiro Ono

    Abstract: Enhanced AutoNav (ENav), the baseline surface navigation software for NASA's Perseverance rover, sorts a list of candidate paths for the rover to traverse, then uses the Approximate Clearance Evaluation (ACE) algorithm to evaluate whether the most highly ranked paths are safe. ACE is crucial for maintaining the safety of the rover, but is computationally expensive. If the most promising candidates… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 9 pages, 5 figures, Pre-Print, This work has been submitted to the IEEE for possible publication

    ACM Class: I.2.6; I.2.9; I.2.8

  31. arXiv:2010.01488  [pdf, other

    cs.LG stat.ML

    Learning Compositional Structures for Deep Learning: Why Routing-by-agreement is Necessary

    Authors: Sai Raam Venkatraman, Ankit Anand, S. Balasubramanian, R. Raghunatha Sarma

    Abstract: A formal description of the compositionality of neural networks is associated directly with the formal grammar-structure of the objects it seeks to represent. This formal grammar-structure specifies the kind of components that make up an object, and also the configurations they are allowed to be in. In other words, objects can be described as a parse-tree of its components -- a structure that can… ▽ More

    Submitted 6 October, 2020; v1 submitted 4 October, 2020; originally announced October 2020.

  32. arXiv:2008.01695  [pdf, other

    nlin.SI

    Exact Solutions Of Time Fractional Generalized Burgers-Fisher Equation Using Exp function and Exponential Rational Function Methods

    Authors: Ramya Selvaraj, Swaminathan Venkatraman

    Abstract: Using modified Riemann-Liouville derivative, the Exp function and Exponential rational function methods are implemented to solve the time-fractional generalized Burgers-Fisher equation (TF-GBF). The TF-GBF is transformed into a nonlinear ordinary differential equation (NLODE) by applying the transformation of traveling wave. The suggested methods are then introduced to formulate exact solutions fo… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

  33. arXiv:2003.13217  [pdf, other

    cs.MM cs.SD eess.AS

    Deep Residual Neural Networks for Image in Speech Steganography

    Authors: Shivam Agarwal, Siddarth Venkatraman

    Abstract: Steganography is the art of hiding a secret message inside a publicly visible carrier message. Ideally, it is done without modifying the carrier, and with minimal loss of information in the secret message. Recently, various deep learning based approaches to steganography have been applied to different message types. We propose a deep learning based technique to hide a source RGB image message insi… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

  34. arXiv:1908.01300  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Building Deep, Equivariant Capsule Networks

    Authors: Sairaam Venkatraman, S. Balasubramanian, R. Raghunatha Sarma

    Abstract: Capsule networks are constrained by the parameter-expensive nature of their layers, and the general lack of provable equivariance guarantees. We present a variation of capsule networks that aims to remedy this. We identify that learning all pair-wise part-whole relationships between capsules of successive layers is inefficient. Further, we also realise that the choice of prediction networks and th… ▽ More

    Submitted 26 September, 2019; v1 submitted 4 August, 2019; originally announced August 2019.

  35. arXiv:1808.06324   

    cs.LG stat.ML

    PAC-learning is Undecidable

    Authors: Sairaam Venkatraman, S Balasubramanian, R Raghunatha Sarma

    Abstract: The problem of attempting to learn the mapping between data and labels is the crux of any machine learning task. It is, therefore, of interest to the machine learning community on practical as well as theoretical counts to consider the existence of a test or criterion for deciding the feasibility of attempting to learn. We investigate the existence of such a criterion in the setting of PAC-learnin… ▽ More

    Submitted 20 October, 2022; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: Error in paper

  36. arXiv:1611.07703  [pdf, other

    cs.CV

    'Part'ly first among equals: Semantic part-based benchmarking for state-of-the-art object recognition systems

    Authors: Ravi Kiran Sarvadevabhatla, Shanthakumar Venkatraman, R. Venkatesh Babu

    Abstract: An examination of object recognition challenge leaderboards (ILSVRC, PASCAL-VOC) reveals that the top-performing classifiers typically exhibit small differences amongst themselves in terms of error rate/mAP. To better differentiate the top performers, additional criteria are required. Moreover, the (test) images, on which the performance scores are based, predominantly contain fully visible object… ▽ More

    Submitted 24 November, 2016; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: Extended version of our ACCV-2016 paper. Author formatting modified