Skip to main content

Showing 1–26 of 26 results for author: Iyer, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.18625  [pdf, ps, other

    math.AG cs.CV

    Tropical Geometry Based Edge Detection Using Min-Plus and Max-Plus Algebra

    Authors: Shivam Kumar Jha S, Jaya NN Iyer

    Abstract: This paper proposes a tropical geometry-based edge detection framework that reformulates convolution and gradient computations using min-plus and max-plus algebra. The tropical formulation emphasizes dominant intensity variations, contributing to sharper and more continuous edge representations. Three variants are explored: an adaptive threshold-based method, a multi-kernel min-plus method, and a… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    MSC Class: 14T90; 14-04

  2. arXiv:2503.02618  [pdf, other

    q-bio.NC cs.CV cs.LG

    ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish

    Authors: Jan-Matthis Lueckmann, Alexander Immer, Alex Bo-Yuan Chen, Peter H. Li, Mariela D. Petkova, Nirmala A. Iyer, Luuk Willem Hesselink, Aparna Dev, Gudrun Ihrke, Woohyun Park, Alyson Petruncio, Aubrey Weigel, Wyatt Korff, Florian Engert, Jeff W. Lichtman, Misha B. Ahrens, Michał Januszewski, Viren Jain

    Abstract: Data-driven benchmarks have led to significant progress in key scientific modeling domains including weather and structural biology. Here, we introduce the Zebrafish Activity Prediction Benchmark (ZAPBench) to measure progress on the problem of predicting cellular-resolution neural activity throughout an entire vertebrate brain. The benchmark is based on a novel dataset containing 4d light-sheet m… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  3. arXiv:2503.00073  [pdf, other

    cs.CV cs.LG q-bio.NC

    Forecasting Whole-Brain Neuronal Activity from Volumetric Video

    Authors: Alexander Immer, Jan-Matthis Lueckmann, Alex Bo-Yuan Chen, Peter H. Li, Mariela D. Petkova, Nirmala A. Iyer, Aparna Dev, Gudrun Ihrke, Woohyun Park, Alyson Petruncio, Aubrey Weigel, Wyatt Korff, Florian Engert, Jeff W. Lichtman, Misha B. Ahrens, Viren Jain, Michał Januszewski

    Abstract: Large-scale neuronal activity recordings with fluorescent calcium indicators are increasingly common, yielding high-resolution 2D or 3D videos. Traditional analysis pipelines reduce this data to 1D traces by segmenting regions of interest, leading to inevitable information loss. Inspired by the success of deep learning on minimally processed data in other domains, we investigate the potential of f… ▽ More

    Submitted 27 February, 2025; originally announced March 2025.

  4. arXiv:2502.01273  [pdf, other

    cs.SE cs.AI

    Analysis of Student-LLM Interaction in a Software Engineering Project

    Authors: Agrawal Naman, Ridwan Shariffdeen, Guanlin Wang, Sanka Rasnayaka, Ganesh Neelakanta Iyer

    Abstract: Large Language Models (LLMs) are becoming increasingly competent across various domains, educators are showing a growing interest in integrating these LLMs into the learning process. Especially in software engineering, LLMs have demonstrated qualitatively better capabilities in code summarization, code generation, and debugging. Despite various research on LLMs for software engineering tasks in pr… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 8 pages

    ACM Class: D.2.3

  5. arXiv:2501.10363  [pdf, other

    cs.CY cs.SE

    A Web-Based IDE for DevOps Learning in Software Engineering Higher Education

    Authors: Ganesh Neelakanta Iyer, Andrew Goh Yisheng, Metilda Chee Heng Er, Weng Xian Choong, Shao Wei Koh

    Abstract: DevOps can be best explained as people working together to conceive, build and deliver secure software at top speed. DevOps practices enable software development (dev) and operations (ops) teams to accelerate delivery through automation, collaboration, fast feedback, and iterative improvement. It is now an integral part of the information technology industry, and students should be aware of it bef… ▽ More

    Submitted 8 December, 2024; originally announced January 2025.

  6. arXiv:2411.01251  [pdf

    eess.IV cs.CV cs.LG

    Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures

    Authors: Ameya Uppina, S Navaneetha Krishnan, Talluri Krishna Sai Teja, Nikhil N Iyer, Joe Dhanith P R

    Abstract: Diabetic Retinopathy DR is a severe complication of diabetes. Damaged or abnormal blood vessels can cause loss of vision. The need for massive screening of a large population of diabetic patients has generated an interest in a computer-aided fully automatic diagnosis of DR. In the realm of Deep learning frameworks, particularly convolutional neural networks CNNs, have shown great interest and prom… ▽ More

    Submitted 20 January, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

  7. arXiv:2408.09434  [pdf, other

    cs.CL cs.AI

    HySem: A context length optimized LLM pipeline for unstructured tabular extraction

    Authors: Narayanan PP, Anantharaman Palacode Narayana Iyer

    Abstract: Regulatory compliance reporting in the pharmaceutical industry relies on detailed tables, but these are often under-utilized beyond compliance due to their unstructured format and arbitrary content. Extracting and semantically representing tabular data is challenging due to diverse table presentations. Large Language Models (LLMs) demonstrate substantial potential for semantic representation, yet… ▽ More

    Submitted 5 October, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

    Comments: 19 pages, 7 tables, 10 figures, 2 algorithms

    ACM Class: F.2.2; I.2.7

  8. arXiv:2407.04734  [pdf, other

    eess.SP cs.ET cs.LG cs.NI

    Neuro-Symbolic Fusion of Wi-Fi Sensing Data for Passive Radar with Inter-Modal Knowledge Transfer

    Authors: Marco Cominelli, Francesco Gringoli, Lance M. Kaplan, Mani B. Srivastava, Trevor Bihl, Erik P. Blasch, Nandini Iyer, Federico Cerutti

    Abstract: Wi-Fi devices, akin to passive radars, can discern human activities within indoor settings due to the human body's interaction with electromagnetic signals. Current Wi-Fi sensing applications predominantly employ data-driven learning techniques to associate the fluctuations in the physical properties of the communication channel with the human activity causing them. However, these techniques often… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 8 pages, 9 figures, accepted at 27th International Conference on Information Fusion (FUSION 2024)

  9. arXiv:2401.16186  [pdf, other

    cs.SE cs.AI

    An Empirical Study on Usage and Perceptions of LLMs in a Software Engineering Project

    Authors: Sanka Rasnayaka, Guanlin Wang, Ridwan Shariffdeen, Ganesh Neelakanta Iyer

    Abstract: Large Language Models (LLMs) represent a leap in artificial intelligence, excelling in tasks using human language(s). Although the main focus of general-purpose LLMs is not code generation, they have shown promising results in the domain. However, the usefulness of LLMs in an academic software engineering project has not been fully explored yet. In this study, we explore the usefulness of LLMs for… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages, 6 figures, accepted for publication at the LLM4Code workshop @ ICSE 2024

    ACM Class: D.2.3

  10. arXiv:2401.00809  [pdf, other

    cs.LG

    A review on different techniques used to combat the non-IID and heterogeneous nature of data in FL

    Authors: Venkataraman Natarajan Iyer

    Abstract: Federated Learning (FL) is a machine-learning approach enabling collaborative model training across multiple decentralized edge devices that hold local data samples, all without exchanging these samples. This collaborative process occurs under the supervision of a central server orchestrating the training or via a peer-to-peer network. The significance of FL is particularly pronounced in industrie… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  11. arXiv:2310.11153  [pdf, other

    cs.CV eess.SP

    Unsupervised Pre-Training Using Masked Autoencoders for ECG Analysis

    Authors: Guoxin Wang, Qingyuan Wang, Ganesh Neelakanta Iyer, Avishek Nag, Deepu John

    Abstract: Unsupervised learning methods have become increasingly important in deep learning due to their demonstrated large utilization of datasets and higher accuracy in computer vision and natural language processing tasks. There is a growing trend to extend unsupervised learning methods to other domains, which helps to utilize a large amount of unlabelled data. This paper proposes an unsupervised pre-tra… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE Biomedical Circuits and Systems (BIOCAS) 2023

  12. arXiv:2307.16318  [pdf, other

    cs.RO

    Efficient Q-Learning over Visit Frequency Maps for Multi-agent Exploration of Unknown Environments

    Authors: Xuyang Chen, Ashvin N. Iyer, Zixing Wang, Ahmed H. Qureshi

    Abstract: The robot exploration task has been widely studied with applications spanning from novel environment mapping to item delivery. For some time-critical tasks, such as rescue catastrophes, the agent is required to explore as efficiently as possible. Recently, Visit Frequency-based map representation achieved great success in such scenarios by discouraging repetitive visits with a frequency-based pena… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Accepted by IROS 2023. 8 pages

  13. Enhancing Classification with Hierarchical Scalable Query on Fusion Transformer

    Authors: Sudeep Kumar Sahoo, Sathish Chalasani, Abhishek Joshi, Kiran Nanjunda Iyer

    Abstract: Real-world vision based applications require fine-grained classification for various area of interest like e-commerce, mobile applications, warehouse management, etc. where reducing the severity of mistakes and improving the classification accuracy is of utmost importance. This paper proposes a method to boost fine-grained classification through a hierarchical approach via learnable independent qu… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 6 pages, 7 figures Published in IEEE ICCE 2023

    ACM Class: I.2.10; I.4.8; I.5.1

    Journal ref: 2023 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2023, pp. 1-6

  14. arXiv:2208.10787  [pdf, other

    cs.CV

    Semantic Driven Energy based Out-of-Distribution Detection

    Authors: Abhishek Joshi, Sathish Chalasani, Kiran Nanjunda Iyer

    Abstract: Detecting Out-of-Distribution (OOD) samples in real world visual applications like classification or object detection has become a necessary precondition in today's deployment of Deep Learning systems. Many techniques have been proposed, of which Energy based OOD methods have proved to be promising and achieved impressive performance. We propose semantic driven energy based method, which is an end… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: accepted at International Joint Conference on Neural Networks (IJCNN) 2022

  15. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  16. arXiv:2112.04552  [pdf, other

    cs.CE cs.AI cs.LG

    PATO: Producibility-Aware Topology Optimization using Deep Learning for Metal Additive Manufacturing

    Authors: Naresh S. Iyer, Amir M. Mirzendehdel, Sathyanarayanan Raghavan, Yang Jiao, Erva Ulu, Morad Behandish, Saigopal Nelaturi, Dean M. Robinson

    Abstract: In this paper, we propose PATO-a producibility-aware topology optimization (TO) framework to help efficiently explore the design space of components fabricated using metal additive manufacturing (AM), while ensuring manufacturability with respect to cracking. Specifically, parts fabricated through Laser Powder Bed Fusion are prone to defects such as warpage or cracking due to high residual stress… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  17. arXiv:2105.14526  [pdf, other

    cs.LG

    LRTuner: A Learning Rate Tuner for Deep Neural Networks

    Authors: Nikhil Iyer, V Thejas, Nipun Kwatra, Ramachandran Ramjee, Muthian Sivathanu

    Abstract: One very important hyperparameter for training deep neural networks is the learning rate schedule of the optimizer. The choice of learning rate schedule determines the computational cost of getting close to a minima, how close you actually get to the minima, and most importantly the kind of local minima (wide/narrow) attained. The kind of minima attained has a significant impact on the generalizat… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: 17 pages

  18. arXiv:2003.03977  [pdf, other

    cs.LG stat.ML

    Wide-minima Density Hypothesis and the Explore-Exploit Learning Rate Schedule

    Authors: Nikhil Iyer, V Thejas, Nipun Kwatra, Ramachandran Ramjee, Muthian Sivathanu

    Abstract: Several papers argue that wide minima generalize better than narrow minima. In this paper, through detailed experiments that not only corroborate the generalization properties of wide minima, we also provide empirical evidence for a new hypothesis that the density of wide minima is likely lower than the density of narrow minima. Further, motivated by this hypothesis, we design a novel explore-expl… ▽ More

    Submitted 1 June, 2021; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 34 pages

  19. Variational Encoder-based Reliable Classification

    Authors: Chitresh Bhushan, Zhaoyuan Yang, Nurali Virani, Naresh Iyer

    Abstract: Machine learning models provide statistically impressive results which might be individually unreliable. To provide reliability, we propose an Epistemic Classifier (EC) that can provide justification of its belief using support from the training dataset as well as quality of reconstruction. Our approach is based on modified variational auto-encoders that can identify a semantically meaningful low-… ▽ More

    Submitted 17 October, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Published in ICIP 2020. Typos fixed in revision

    Journal ref: IEEE International Conference on Image Processing (2020) 1941-1945

  20. arXiv:1911.07391  [pdf, other

    cs.LG stat.ML

    Justification-Based Reliability in Machine Learning

    Authors: Nurali Virani, Naresh Iyer, Zhaoyuan Yang

    Abstract: With the advent of Deep Learning, the field of machine learning (ML) has surpassed human-level performance on diverse classification tasks. At the same time, there is a stark need to characterize and quantify reliability of a model's prediction on individual samples. This is especially true in application of such models in safety-critical domains of industrial control and healthcare. To address th… ▽ More

    Submitted 14 November, 2021; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: Extended version of paper accepted at AAAI 2020 with supplementary materials, update remark and fix typo

  21. arXiv:1902.09972  [pdf, other

    cs.CR cs.LG

    Design of intentional backdoors in sequential models

    Authors: Zhaoyuan Yang, Naresh Iyer, Johan Reimann, Nurali Virani

    Abstract: Recent work has demonstrated robust mechanisms by which attacks can be orchestrated on machine learning models. In contrast to adversarial examples, backdoor or trojan attacks embed surgically modified samples with targeted labels in the model training process to cause the targeted model to learn to misclassify chosen samples in the presence of specific triggers, while keeping the model performanc… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

  22. Learning Contextual Bandits in a Non-stationary Environment

    Authors: Qingyun Wu, Naveen Iyer, Hongning Wang

    Abstract: Multi-armed bandit algorithms have become a reference solution for handling the explore/exploit dilemma in recommender systems, and many other important real-world problems, such as display advertisement. However, such algorithms usually assume a stationary reward distribution, which hardly holds in practice as users' preferences are dynamic. This inevitably costs a recommender system consistent s… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: 10 pages, 13 figures, To appear on ACM Special Interest Group on Information Retrieval (SIGIR) 2018

  23. arXiv:1702.02289  [pdf, other

    cs.SD

    Neural Network Based Speaker Classification and Verification Systems with Enhanced Features

    Authors: Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, Ram Sundaram, Aravind Ganapathiraju

    Abstract: This work presents a novel framework based on feed-forward neural network for text-independent speaker classification and verification, two related systems of speaker recognition. With optimized features and model training, it achieves 100% classification rate in classification and less than 6% Equal Error Rate (ERR), using merely about 1 second and 5 seconds of data respectively. Features with st… ▽ More

    Submitted 7 February, 2017; originally announced February 2017.

    Comments: Intelligent Systems Conference 2017, Sep. 7-8 2017, London, UK. arXiv admin note: text overlap with arXiv:1702.02285

  24. arXiv:1702.02285  [pdf, other

    cs.SD

    Speaker Change Detection Using Features through A Neural Network Speaker Classifier

    Authors: Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, Aravind Ganapathiraju

    Abstract: The mechanism proposed here is for real-time speaker change detection in conversations, which firstly trains a neural network text-independent speaker classifier using in-domain speaker data. Through the network, features of conversational speech from out-of-domain speakers are then converted into likelihood vectors, i.e. similarity scores comparing to the in-domain speakers. These transformed fea… ▽ More

    Submitted 7 February, 2017; originally announced February 2017.

    Comments: Intelligent System Conference 2017, Sep. 7-8, 2017, London, UK. arXiv admin note: text overlap with arXiv:1702.02289

  25. arXiv:1606.08821  [pdf, other

    cs.CL

    Generation and Pruning of Pronunciation Variants to Improve ASR Accuracy

    Authors: Zhenhao Ge, Aravind Ganapathiraju, Ananth N. Iyer, Scott A. Randal, Felix I. Wyss

    Abstract: Speech recognition, especially name recognition, is widely used in phone services such as company directory dialers, stock quote providers or location finders. It is usually challenging due to pronunciation variations. This paper proposes an efficient and robust data-driven technique which automatically learns acceptable word pronunciations and updates the pronunciation dictionary to build a bette… ▽ More

    Submitted 28 June, 2016; originally announced June 2016.

    Comments: Interspeech 2016

  26. A Factorized Recurrent Neural Network based architecture for medium to large vocabulary Language Modelling

    Authors: Anantharaman Palacode Narayana Iyer

    Abstract: Statistical language models are central to many applications that use semantics. Recurrent Neural Networks (RNN) are known to produce state of the art results for language modelling, outperforming their traditional n-gram counterparts in many cases. To generate a probability distribution across a vocabulary, these models require a softmax output layer that linearly increases in size with the size… ▽ More

    Submitted 4 February, 2016; originally announced February 2016.

    Comments: 8 pages