Skip to main content

Showing 1–25 of 25 results for author: Agastya

Searching in archive cs. Search in all archives.
.
  1. Interference Detection in Spectrum-Blind Multi-User Optical Spectrum as a Service

    Authors: Agastya Raj, Daniel C. Kilper, Marco Ruffini

    Abstract: With the growing demand for high-bandwidth, low-latency applications, Optical Spectrum as a Service (OSaaS) is of interest for flexible bandwidth allocation within Elastic Optical Networks (EONs) and Open Line Systems (OLS). While OSaaS facilitates transparent connectivity and resource sharing among users, it raises concerns over potential network vulnerabilities due to shared fiber access and int… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: This is a preprint of a paper accepted and published in the Journal of Optical Communications and Networking (JOCN). The final published version is available at: https://doi.org/10.1364/JOCN.551188

    Journal ref: Journal of Optical Communications and Networking, Vol. 17, Issue 8, pp. C117-C126 (2025)

  2. arXiv:2504.13125  [pdf, other

    cs.CL cs.AI cs.LG

    LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard

    Authors: Varun Rao, Youran Sun, Mahendra Kumar, Tejas Mutneja, Agastya Mukherjee, Haizhao Yang

    Abstract: This paper investigates the application of large language models (LLMs) to financial tasks. We fine-tuned foundation models using the Open FinLLM Leaderboard as a benchmark. Building on Qwen2.5 and Deepseek-R1, we employed techniques including supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement learning (RL) to enhance their financial capabilities. The fine-tuned… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  3. arXiv:2503.18495  [pdf, other

    cs.NI eess.SP

    Real-Time Streaming Telemetry Based Detection and Mitigation of OOK and Power Interference in Multi-User OSaaS Networks

    Authors: Agastya Raj, Devika Dass, Daniel C. Kilper, Marco Ruffini

    Abstract: We present a framework to identify and mitigate rogue OOK signals and user-generated power interference in a multi-user Optical-Spectrum-as-a-Service network. Experimental tests on the OpenIreland-testbed achieve up to 89% detection rate within 10 seconds of an interference event.

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: This paper is a preprint of a paper submitted to OFC 2025

  4. arXiv:2503.17094  [pdf, other

    cs.NI

    Transfer Learning for EDFA Gain Modeling: A Semi-Supervised Approach Using Internal Amplifier Features

    Authors: Agastya Raj, Dan Kilper, Marco Ruffini

    Abstract: The gain spectrum of an Erbium-Doped Fiber Amplifier (EDFA) has a complex dependence on channel loading, pump power, and operating mode, making accurate modeling difficult to achieve. Machine Learning (ML) based modeling methods can achieve high accuracy, but they require comprehensive data collection. We present a novel ML-based Semi-Supervised, Self-Normalizing Neural Network (SS-NN) framework t… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: This paper is a preprint of a paper accepted to IEEE Future Networks World Forum (FNWF) 2024

  5. arXiv:2503.17079  [pdf, other

    cs.NI

    Interference Identification in Multi-User Optical Spectrum as a Service using Convolutional Neural Networks

    Authors: Agastya Raj, Zehao Wang, Frank Slyne, Tingjun Chen, Dan Kilper, Marco Ruffini

    Abstract: We introduce a ML-based architecture for network operators to detect impairments from specific OSaaS users while blind to the users' internal spectrum details. Experimental studies with three OSaaS users demonstrate the model's capability to accurately classify the source of impairments, achieving classification accuracy of 94.2%.

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: This paper is a preprint of a paper accepted to ECOC 2024 and is subject to Institution of Engineering and Technology Copyright. A copy of record will be available at IET Digital Library

  6. arXiv:2503.17072  [pdf, other

    cs.LG cs.NI

    Multi-Span Optical Power Spectrum Evolution Modeling using ML-based Multi-Decoder Attention Framework

    Authors: Agastya Raj, Zehao Wang, Frank Slyne, Tingjun Chen, Dan Kilper, Marco Ruffini

    Abstract: We implement a ML-based attention framework with component-specific decoders, improving optical power spectrum prediction in multi-span networks. By reducing the need for in-depth training on each component, the framework can be scaled to multi-span topologies with minimal data collection, making it suitable for brown-field scenarios.

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: This paper is a preprint of a paper accepted in ECOC 2024 and is subject to Institution of Engineering and Technology Copyright. A copy of record will be available at IET Digital Library

  7. arXiv:2412.14315  [pdf, other

    stat.ML cs.DS cs.LG cs.SI

    On the Robustness of Spectral Algorithms for Semirandom Stochastic Block Models

    Authors: Aditya Bhaskara, Agastya Vibhuti Jha, Michael Kapralov, Naren Sarayu Manoj, Davide Mazzali, Weronika Wrzos-Kaminska

    Abstract: In a graph bisection problem, we are given a graph $G$ with two equally-sized unlabeled communities, and the goal is to recover the vertices in these communities. A popular heuristic, known as spectral clustering, is to output an estimated community assignment based on the eigenvector corresponding to the second smallest eigenvalue of the Laplacian of $G$. Spectral algorithms can be shown to prova… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: 45 pages. NeurIPS 2024

  8. arXiv:2407.03525  [pdf, ps, other

    cs.CL

    UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization

    Authors: Md Nayem Uddin, Amir Saeidi, Divij Handa, Agastya Seth, Tran Cao Son, Eduardo Blanco, Steven R. Corman, Chitta Baral

    Abstract: This paper introduces UnSeenTimeQA, a novel data contamination-free time-sensitive question-answering (TSQA) benchmark. It differs from existing TSQA benchmarks by avoiding web-searchable queries grounded in the real world. We present a series of time-sensitive event scenarios based on synthetically generated facts. It requires large language models (LLMs) to engage in genuine temporal reasoning w… ▽ More

    Submitted 2 June, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2025 (Main)

  9. arXiv:2405.11844  [pdf

    cs.AR cs.ET

    NeRTCAM: CAM-Based CMOS Implementation of Reference Frames for Neuromorphic Processors

    Authors: Harideep Nair, William Leyman, Agastya Sampath, Quinn Jacobson, John Paul Shen

    Abstract: Neuromorphic architectures mimicking biological neural networks have been proposed as a much more efficient alternative to conventional von Neumann architectures for the exploding compute demands of AI workloads. Recent neuroscience theory on intelligence suggests that Cortical Columns (CCs) are the fundamental compute units in the neocortex and intelligence arises from CC's ability to store, pred… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted and Presented at Neuro-Inspired Computational Elements (NICE) Conference, La Jolla, CA. 2024

  10. arXiv:2405.09755  [pdf, other

    cs.CV cs.RO

    Collision Avoidance Metric for 3D Camera Evaluation

    Authors: Vage Taamazyan, Alberto Dall'olio, Agastya Kalra

    Abstract: 3D cameras have emerged as a critical source of information for applications in robotics and autonomous driving. These cameras provide robots with the ability to capture and utilize point clouds, enabling them to navigate their surroundings and avoid collisions with other objects. However, current standard camera evaluation metrics often fail to consider the specific application context. These met… ▽ More

    Submitted 8 July, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  11. arXiv:2404.01049  [pdf, other

    astro-ph.IM cs.LG

    A Novel Sector-Based Algorithm for an Optimized Star-Galaxy Classification

    Authors: Anumanchi Agastya Sai Ram Likhit, Divyansh Tripathi, Akshay Agarwal

    Abstract: This paper introduces a novel sector-based methodology for star-galaxy classification, leveraging the latest Sloan Digital Sky Survey data (SDSS-DR18). By strategically segmenting the sky into sectors aligned with SDSS observational patterns and employing a dedicated convolutional neural network (CNN), we achieve state-of-the-art performance for star galaxy classification. Our preliminary results… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Journal ref: The Second Tiny Papers Track at ICLR 2024

  12. arXiv:2401.00287  [pdf, other

    cs.CL

    The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness

    Authors: Neeraj Varshney, Pavel Dolin, Agastya Seth, Chitta Baral

    Abstract: As Large Language Models (LLMs) play an increasingly pivotal role in natural language processing applications, their safety concerns become critical areas of NLP research. This paper presents Safety and Over-Defensiveness Evaluation (SODE) benchmark: a collection of diverse safe and unsafe prompts with carefully designed evaluation methods that facilitate systematic evaluation, comparison, and ana… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  13. arXiv:2308.02233  [pdf, other

    cs.NI cs.LG

    Self-Normalizing Neural Network, Enabling One Shot Transfer Learning for Modeling EDFA Wavelength Dependent Gain

    Authors: Agastya Raj, Zehao Wang, Frank Slyne, Tingjun Chen, Dan Kilper, Marco Ruffini

    Abstract: We present a novel ML framework for modeling the wavelength-dependent gain of multiple EDFAs, based on semi-supervised, self-normalizing neural networks, enabling one-shot transfer learning. Our experiments on 22 EDFAs in Open Ireland and COSMOS testbeds show high-accuracy transfer-learning even when operated across different amplifier types.

    Submitted 21 October, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: This paper is a preprint of a paper submitted to ECOC 2023 and is subject to Institution of Engineering and Technology Copyright. If accepted, the copy of record will be available at IET Digital Library

  14. arXiv:2207.12724  [pdf

    cs.NE cs.CL

    An Automated News Bias Classifier Using Caenorhabditis Elegans Inspired Recursive Feedback Network Architecture

    Authors: Agastya Sridharan, Natarajan S

    Abstract: Traditional approaches to classify the political bias of news articles have failed to generate accurate, generalizable results. Existing networks premised on CNNs and DNNs lack a model to identify and extrapolate subtle indicators of bias like word choice, context, and presentation. In this paper, we propose a network architecture that achieves human-level accuracy in assigning bias classification… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: The paper is under review for AACL-IJCNLP

  15. arXiv:2112.07499  [pdf, other

    cs.DS cs.AI cs.DM

    Reconfiguring Shortest Paths in Graphs

    Authors: Kshitij Gajjar, Agastya Vibhuti Jha, Manish Kumar, Abhiruk Lahiri

    Abstract: Reconfiguring two shortest paths in a graph means modifying one shortest path to the other by changing one vertex at a time so that all the intermediate paths are also shortest paths. This problem has several natural applications, namely: (a) revamping road networks, (b) rerouting data packets in synchronous multiprocessing setting, (c) the shipping container stowage problem, and (d) the train mar… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 28 pages, 14 figures. To be presented at AAAI 2022

    MSC Class: 68Q25; 05C85; 68T99 ACM Class: F.2.2

  16. arXiv:2109.13488  [pdf, other

    cs.CV

    Towards Rotation Invariance in Object Detection

    Authors: Agastya Kalra, Guy Stoppi, Bradley Brown, Rishav Agarwal, Achuta Kadambi

    Abstract: Rotation augmentations generally improve a model's invariance/equivariance to rotation - except in object detection. In object detection the shape is not known, therefore rotation creates a label ambiguity. We show that the de-facto method for bounding box label rotation, the Largest Box Method, creates very large labels, leading to poor performance and in many cases worse performance than using n… ▽ More

    Submitted 30 September, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Accepted ICCV 2021

  17. arXiv:2108.05484  [pdf, other

    cs.CV cs.LG

    Self-supervised Contrastive Learning for Irrigation Detection in Satellite Imagery

    Authors: Chitra Agastya, Sirak Ghebremusse, Ian Anderson, Colorado Reed, Hossein Vahabi, Alberto Todeschini

    Abstract: Climate change has caused reductions in river runoffs and aquifer recharge resulting in an increasingly unsustainable crop water demand from reduced freshwater availability. Achieving food security while deploying water in a sustainable manner will continue to be a major challenge necessitating careful monitoring and tracking of agricultural water usage. Historically, monitoring water usage has be… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

  18. arXiv:2103.02843  [pdf

    cs.DC cs.CE cs.LG physics.bio-ph q-bio.QM

    Pandemic Drugs at Pandemic Speed: Infrastructure for Accelerating COVID-19 Drug Discovery with Hybrid Machine Learning- and Physics-based Simulations on High Performance Computers

    Authors: Agastya P. Bhati, Shunzhou Wan, Dario Alfè, Austin R. Clyde, Mathis Bode, Li Tan, Mikhail Titov, Andre Merzky, Matteo Turilli, Shantenu Jha, Roger R. Highfield, Walter Rocchia, Nicola Scafuri, Sauro Succi, Dieter Kranzlmüller, Gerald Mathias, David Wifling, Yann Donon, Alberto Di Meglio, Sofia Vallecorsa, Heng Ma, Anda Trifan, Arvind Ramanathan, Tom Brettin, Alexander Partin , et al. (4 additional authors not shown)

    Abstract: The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods… ▽ More

    Submitted 4 September, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Journal ref: Interface Focus. 2021. 11 (6): 20210018

  19. arXiv:2010.10517  [pdf, other

    cs.DC cs.CE

    Scalable HPC and AI Infrastructure for COVID-19 Therapeutics

    Authors: Hyungro Lee, Andre Merzky, Li Tan, Mikhail Titov, Matteo Turilli, Dario Alfe, Agastya Bhati, Alex Brace, Austin Clyde, Peter Coveney, Heng Ma, Arvind Ramanathan, Rick Stevens, Anda Trifan, Hubertus Van Dam, Shunzhou Wan, Sean Wilkinson, Shantenu Jha

    Abstract: COVID-19 has claimed more 1 million lives and resulted in over 40 million infections. There is an urgent need to identify drugs that can inhibit SARS-CoV-2. In response, the DOE recently established the Medical Therapeutics project as part of the National Virtual Biotechnology Laboratory, and tasked it with creating the computational infrastructure and methods necessary to advance therapeutics dev… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  20. arXiv:2010.06574  [pdf, other

    cs.DC cs.CE q-bio.QM

    IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads

    Authors: Aymen Al Saadi, Dario Alfe, Yadu Babuji, Agastya Bhati, Ben Blaiszik, Thomas Brettin, Kyle Chard, Ryan Chard, Peter Coveney, Anda Trifan, Alex Brace, Austin Clyde, Ian Foster, Tom Gibbs, Shantenu Jha, Kristopher Keipert, Thorsten Kurth, Dieter Kranzlmüller, Hyungro Lee, Zhuozhao Li, Heng Ma, Andre Merzky, Gerald Mathias, Alexander Partin, Junqi Yin , et al. (11 additional authors not shown)

    Abstract: The drug discovery process currently employed in the pharmaceutical industry typically requires about 10 years and $2-3 billion to deliver one new drug. This is both too expensive and too slow, especially in emergencies like the COVID-19 pandemic. In silicomethodologies need to be improved to better select lead compounds that can proceed to later stages of the drug discovery protocol accelerating… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

  21. arXiv:2001.03194  [pdf, other

    cs.CV

    MatrixNets: A New Scale and Aspect Ratio Aware Architecture for Object Detection

    Authors: Abdullah Rashwan, Rishav Agarwal, Agastya Kalra, Pascal Poupart

    Abstract: We present MatrixNets (xNets), a new deep architecture for object detection. xNets map objects with similar sizes and aspect ratios into many specialized layers, allowing xNets to provide a scale and aspect ratio aware architecture. We leverage xNets to enhance single-stage object detection frameworks. First, we apply xNets on anchor-based object detection, for which we predict object centers and… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: This is the full paper for arXiv:1908.04646 with more applications, experiments, and ablation study

  22. arXiv:1908.04646  [pdf, other

    cs.CV

    Matrix Nets: A New Deep Architecture for Object Detection

    Authors: Abdullah Rashwan, Agastya Kalra, Pascal Poupart

    Abstract: We present Matrix Nets (xNets), a new deep architecture for object detection. xNets map objects with different sizes and aspect ratios into layers where the sizes and the aspect ratios of the objects within their layers are nearly uniform. Hence, xNets provide a scale and aspect ratio aware architecture. We leverage xNets to enhance key-points based object detection. Our architecture achieves mAP… ▽ More

    Submitted 14 August, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: Short paper, stay tuned for the full paper!

  23. arXiv:1904.07435  [pdf, other

    cs.CV cs.LG

    Photofeeler-D3: A Neural Network with Voter Modeling for Dating Photo Impression Prediction

    Authors: Agastya Kalra, Ben Peterson

    Abstract: In just a few years, online dating has become the dominant way that young people meet to date, making the deceptively error-prone task of picking good dating profile photos vital to a generation's ability to form romantic connections. Until now, artificial intelligence approaches to Dating Photo Impression Prediction (DPIP) have been very inaccurate, unadaptable to real-world application, and have… ▽ More

    Submitted 10 May, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: 10 pages, 3 figures, 5 tables

  24. arXiv:1701.05265  [pdf, other

    stat.ML cs.LG

    Online Structure Learning for Sum-Product Networks with Gaussian Leaves

    Authors: Wilson Hsu, Agastya Kalra, Pascal Poupart

    Abstract: Sum-product networks have recently emerged as an attractive representation due to their dual view as a special type of deep neural network with clear semantics and a special type of probabilistic graphical model for which inference is always tractable. Those properties follow from some conditions (i.e., completeness and decomposability) that must be respected by the structure of the network. As a… ▽ More

    Submitted 18 January, 2017; originally announced January 2017.

  25. arXiv:1512.02194  [pdf, other

    cs.DC physics.comp-ph

    FabSim: facilitating computational research through automation on large-scale and distributed e-infrastructures

    Authors: Derek Groen, Agastya Bhati, James Suter, James Hetherington, Stefan Zasada, Peter Coveney

    Abstract: We present FabSim, a toolkit developed to simplify a range of computational tasks for researchers in diverse disciplines. FabSim is flexible, adaptable, and allows users to perform a wide range of tasks with ease. It also provides a systematic way to automate the use of resourcess, including HPC and distributed resources, and to make tasks easier to repeat by recording contextual information. To d… ▽ More

    Submitted 7 December, 2015; originally announced December 2015.

    Comments: 29 pages, 8 figures, 2 tables, submitted