Skip to main content

Showing 1–50 of 121 results for author: Ajit

Searching in archive cs. Search in all archives.
.
  1. Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models

    Authors: Hongwei Shang, Nguyen Vo, Nitin Yadav, Tian Zhang, Ajit Puthenputhussery, Xunfan Cai, Shuyi Chen, Prijith Chandran, Changsung Kang

    Abstract: Ensuring the products displayed in e-commerce search results are relevant to users queries is crucial for improving the user experience. With their advanced semantic understanding, deep learning models have been widely used for relevance matching in search tasks. While large language models (LLMs) offer superior ranking capabilities, it is challenging to deploy LLMs in real-time systems due to the… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

    Comments: 9 pages, published at WWWW'25

    Journal ref: The Web Conference 2025

  2. arXiv:2504.19955  [pdf, ps, other

    cs.LG cs.IT

    Robust Federated Personalised Mean Estimation for the Gaussian Mixture Model

    Authors: Malhar A. Managoli, Vinod M. Prabhakaran, Suhas Diggavi

    Abstract: Federated learning with heterogeneous data and personalization has received significant recent attention. Separately, robustness to corrupted data in the context of federated learning has also been studied. In this paper we explore combining personalization for heterogeneous data with robustness, where a constant fraction of the clients are corrupted. Motivated by this broad problem, we formulate… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  3. arXiv:2503.15228  [pdf, other

    cs.NI eess.SP

    Sensing-Based Beamformed Resource Allocation in Standalone Millimeter-Wave Vehicular Networks

    Authors: Alessandro Traspadini, Anay Ajit Deshpande, Marco Giordani, Chinmay Mahabal, Takayuki Shimizu, Michele Zorzi

    Abstract: In 3GPP New Radio (NR) Vehicle-to-Everything (V2X), the new standard for next-generation vehicular networks, vehicles can autonomously select sidelink resources for data transmission, which permits network operations without cellular coverage. However, standalone resource allocation is uncoordinated, and is complicated by the high mobility of the nodes that may introduce unforeseen channel collisi… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 7 pages, 8 figures, 3 tables. Accepted for publication in the 2025 IEEE International Conference on Communications (ICC). \c{opyright} 2025 IEEE. A. Traspadini, A. A. Deshpande, M. Giordani, C. Mahabal, T. Shimizu, and M. Zorzi, "Sensing-Based Beamformed Resource Allocation in Standalone Millimeter-Wave Vehicular Networks," in Proc. IEEE International Conference on Communications (ICC), 2025

  4. arXiv:2502.19825  [pdf, other

    stat.ML cs.LG

    Fast Debiasing of the LASSO Estimator

    Authors: Shuvayan Banerjee, James Saunderson, Radhendushka Srivastava, Ajit Rajwade

    Abstract: In high-dimensional sparse regression, the \textsc{Lasso} estimator offers excellent theoretical guarantees but is well-known to produce biased estimates. To address this, \cite{Javanmard2014} introduced a method to ``debias" the \textsc{Lasso} estimates for a random sub-Gaussian sensing matrix $\boldsymbol{A}$. Their approach relies on computing an ``approximate inverse" $\boldsymbol{M}$ of the m… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  5. arXiv:2501.18229  [pdf, other

    cs.RO

    GPD: Guided Polynomial Diffusion for Motion Planning

    Authors: Ajit Srikanth, Parth Mahanjan, Kallol Saha, Vishal Mandadi, Pranjal Paul, Pawan Wadhwani, Brojeshwar Bhowmick, Arun Singh, Madhava Krishna

    Abstract: Diffusion-based motion planners are becoming popular due to their well-established performance improvements, stemming from sample diversity and the ease of incorporating new constraints directly during inference. However, a primary limitation of the diffusion process is the requirement for a substantial number of denoising steps, especially when the denoising process is coupled with gradient-based… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  6. arXiv:2501.15724  [pdf, other

    cs.CV cs.AI

    A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks

    Authors: Dong Li, Guihong Wan, Xintao Wu, Xinyu Wu, Ajit J. Nirmal, Christine G. Lian, Peter K. Sorger, Yevgeniy R. Semenov, Chen Zhao

    Abstract: Computational pathology foundation models (CPathFMs) have emerged as a powerful approach for analyzing histopathological data, leveraging self-supervised learning to extract robust feature representations from unlabeled whole-slide images. These models, categorized into uni-modal and multi-modal frameworks, have demonstrated promise in automating complex pathology tasks such as segmentation, class… ▽ More

    Submitted 25 February, 2025; v1 submitted 26 January, 2025; originally announced January 2025.

  7. arXiv:2501.12938  [pdf, ps, other

    cs.IT

    Robust Hypothesis Testing with Abstention

    Authors: Malhar A. Managoli, K. R. Sahasranand, Vinod M. Prabhakaran

    Abstract: We study the binary hypothesis testing problem where an adversary may potentially corrupt a fraction of the samples. The detector is, however, permitted to abstain from making a decision if (and only if) the adversary is present. We consider a few natural "contamination models" and characterize for them the trade-off between the error exponents of the four types of errors -- errors of deciding in… ▽ More

    Submitted 23 January, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

  8. arXiv:2501.02872  [pdf, other

    cs.CV

    Two-Dimensional Unknown View Tomography from Unknown Angle Distributions

    Authors: Kaishva Chintan Shah, Karthik S. Gurumoorthy, Ajit Rajwade

    Abstract: This study presents a technique for 2D tomography under unknown viewing angles when the distribution of the viewing angles is also unknown. Unknown view tomography (UVT) is a problem encountered in cryo-electron microscopy and in the geometric calibration of CT systems. There exists a moderate-sized literature on the 2D UVT problem, but most existing 2D UVT algorithms assume knowledge of the angle… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: Accepted to the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2025

  9. arXiv:2412.04637  [pdf

    cs.IR cs.AI cs.LG

    Semantic Retrieval at Walmart

    Authors: Alessandro Magnani, Feng Liu, Suthee Chaidaroon, Sachin Yadav, Praveen Reddy Suram, Ajit Puthenputhussery, Sijie Chen, Min Xie, Anirudh Kashi, Tony Lee, Ciya Liao

    Abstract: In product search, the retrieval of candidate products before re-ranking is more critical and challenging than other search like web search, especially for tail queries, which have a complex and specific search intent. In this paper, we present a hybrid system for e-commerce search deployed at Walmart that combines traditional inverted index and embedding-based neural retrieval to better answer us… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 9 page, 2 figures, 10 tables, KDD 2022

  10. Meta Learning to Rank for Sparsely Supervised Queries

    Authors: Xuyang Wu, Ajit Puthenputhussery, Hongwei Shang, Changsung Kang, Yi Fang

    Abstract: Supervisory signals are a critical resource for training learning to rank models. In many real-world search and retrieval scenarios, these signals may not be readily available or could be costly to obtain for some queries. The examples include domains where labeling requires professional expertise, applications with strong privacy constraints, and user engagement information that are too scarce. W… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: Accepted at TOIS

  11. arXiv:2409.05345  [pdf, other

    stat.ML cs.IT cs.LG

    Robust Non-adaptive Group Testing under Errors in Group Membership Specifications

    Authors: Shuvayan Banerjee, Radhendushka Srivastava, James Saunderson, Ajit Rajwade

    Abstract: Given $p$ samples, each of which may or may not be defective, group testing (GT) aims to determine their defect status by performing tests on $n < p$ `groups', where a group is formed by mixing a subset of the $p$ samples. Assuming that the number of defective samples is very small compared to $p$, GT algorithms have provided excellent recovery of the status of all $p$ samples with even a small nu… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  12. Optimizing Structured Data Processing through Robotic Process Automation

    Authors: Vivek Bhardwaj, Ajit Noonia, Sandeep Chaurasia, Mukesh Kumar, Abdulnaser Rashid, Mohamed Tahar Ben Othman

    Abstract: Robotic Process Automation (RPA) has emerged as a game-changing technology in data extraction, revolutionizing the way organizations process and analyze large volumes of documents such as invoices, purchase orders, and payment advices. This study investigates the use of RPA for structured data extraction and evaluates its advantages over manual processes. By comparing human-performed tasks with th… ▽ More

    Submitted 31 October, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

    Journal ref: Journal Européen des Systèmes Automatisés, Vol. 57, No. 5, pp. 1523-1530 (2024)

  13. arXiv:2407.01198  [pdf, ps, other

    math.CO cs.DM

    Cycles of weight divisible by $k$

    Authors: Ajit A. Diwan

    Abstract: A weighted (directed) graph is a (directed) graph with integer weights assigned to its vertices and edges. The weight of a subgraph is the sum of weights of vertices and edges in the subgraph. The problem of determining the largest order $f(k)$ of a weighted complete directed graph that does not contain a directed cycle of weight divisible by $k$, for an integer $k \ge 2$, was raised by Alon and K… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: The article that proves the optimal bound for odd k (arXiv:2406.19855) appeared after this had been submitted

    MSC Class: 05C35 05C22 05C38

  14. arXiv:2406.17542  [pdf, ps, other

    cs.LG cs.AI cs.CL

    CDQuant: Greedy Coordinate Descent for Accurate LLM Quantization

    Authors: Pranav Ajit Nair, Arun Sai Suggala

    Abstract: Large language models (LLMs) have recently demonstrated remarkable performance across diverse language tasks. But their deployment is often constrained by their substantial computational and storage requirements. Quantization has emerged as a key technique for addressing this challenge, enabling the compression of large models with minimal impact on performance. The recent GPTQ algorithm, a post-t… ▽ More

    Submitted 22 October, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  15. arXiv:2406.00247  [pdf, other

    cs.IR cs.AI

    Large Language Models for Relevance Judgment in Product Search

    Authors: Navid Mehrdad, Hrushikesh Mohapatra, Mossaab Bagdouri, Prijith Chandran, Alessandro Magnani, Xunfan Cai, Ajit Puthenputhussery, Sachin Yadav, Tony Lee, ChengXiang Zhai, Ciya Liao

    Abstract: High relevance of retrieved and re-ranked items to the search query is the cornerstone of successful product search, yet measuring relevance of items to queries is one of the most challenging tasks in product information retrieval, and quality of product search is highly influenced by the precision and scale of available relevance-labelled data. In this paper, we present an array of techniques for… ▽ More

    Submitted 16 July, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

    Comments: 10 pages, 1 figure, 11 tables - SIGIR 2024, LLM4Eval

    ACM Class: H.3.3; I.2.7

  16. MunchSonic: Tracking Fine-grained Dietary Actions through Active Acoustic Sensing on Eyeglasses

    Authors: Saif Mahmud, Devansh Agarwal, Ashwin Ajit, Qikang Liang, Thalia Viranda, Francois Guimbretiere, Cheng Zhang

    Abstract: We introduce MunchSonic, an AI-powered active acoustic sensing system integrated into eyeglasses to track fine-grained dietary actions. MunchSonic emits inaudible ultrasonic waves from the eyeglass frame, with the reflected signals capturing detailed positions and movements of body parts, including the mouth, jaw, arms, and hands involved in eating. These signals are processed by a deep learning p… ▽ More

    Submitted 2 August, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: 8 pages, 7 figures

  17. Ethics Pathways: A Design Activity for Reflecting on Ethics Engagement in HCI Research

    Authors: Inha Cha, Ajit G. Pillai, Richmond Y. Wong

    Abstract: This paper introduces Ethics Pathways, a design activity aimed at understanding HCI and design researchers' ethics engagements and flows during their research process. Despite a strong ethical commitment in these fields, challenges persist in grasping the complexity of researchers' engagement with ethics -- practices conducted to operationalize ethics -- in situated institutional contexts. Ethics… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted at ACM Designing Interactive Systems (DIS) 2024

  18. Planar cycle-extendable graphs

    Authors: Aditya Y Dalwadi, Kapil R Shenvi Pause, Ajit A Diwan, Nishad Kothari

    Abstract: For most problems pertaining to perfect matchings, one may restrict attention to matching covered graphs - that is, connected nontrivial graphs with the property that each edge belongs to some perfect matching. There is extensive literature on these graphs that are also known as 1-extendable graphs (since each edge extends to a perfect matching) including an ear decomposition theorem due to Lovász… ▽ More

    Submitted 12 May, 2025; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: The last author Nishad Kothari would like to acknowledge Rajat Adak (currently a PhD student at IISc) for many discussions on cycle-extendability (while he was a BSc student at CMI)

    Journal ref: Discrete Mathematics & Theoretical Computer Science, vol. 27:2, Graph Theory (May 13, 2025) dmtcs:13929

  19. arXiv:2405.05211  [pdf, ps, other

    cs.IT

    Broadcast Channel Synthesis from Shared Randomness

    Authors: Malhar A. Managoli, Vinod M. Prabhakaran

    Abstract: We study the problem of synthesising a two-user broadcast channel using a common message, where each output terminal shares an independent source of randomness with the input terminal. This generalises two problems studied in the literature (Cuff, IEEE Trans. Inform. Theory, 2013; Kurri et.al., IEEE Trans. Inform. Theory, 2021). We give an inner bound on the tradeoff region between the rates of co… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  20. arXiv:2405.02585  [pdf, ps, other

    cs.IT

    Maximal Guesswork Leakage

    Authors: Gowtham R. Kurri, Malhar Managoli, Vinod M. Prabhakaran

    Abstract: We introduce the study of information leakage through \emph{guesswork}, the minimum expected number of guesses required to guess a random variable. In particular, we define \emph{maximal guesswork leakage} as the multiplicative decrease, upon observing $Y$, of the guesswork of a randomized function of $X$, maximized over all such randomized functions. We also study a pointwise form of the leakage… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 6 pages. Extended version of a paper accepted to ISIT 2024

  21. arXiv:2404.17390  [pdf, other

    cs.HC cs.AI

    How Could AI Support Design Education? A Study Across Fields Fuels Situating Analytics

    Authors: Ajit Jain, Andruid Kerne, Hannah Fowler, Jinsil Seo, Galen Newman, Nic Lupfer, Aaron Perrine

    Abstract: We use the process and findings from a case study of design educators' practices of assessment and feedback to fuel theorizing about how to make AI useful in service of human experience. We build on Suchman's theory of situated actions. We perform a qualitative study of 11 educators in 5 fields, who teach design processes situated in project-based learning contexts. Through qualitative data gather… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 31 pages, 3 figures, Submitted to ACM

    ACM Class: H.5.2

  22. arXiv:2404.13933  [pdf

    cs.HC

    Comparison of On-Orbit Manual Attitude Control Methods for Non-Docking Spacecraft Through Virtual Reality Simulation

    Authors: Ajit Krishnan, Himanshu Vishwakarma, Maharudra Kharsade, Pradipta Biswas

    Abstract: On-orbit manual attitude control of manned spacecraft is accomplished using external visual references and some method of three axis attitude control. All past, present, and developmental spacecraft feature the capability to manually control attitude for deorbit. National Aeronautics and Space Administration (NASA) spacecraft permit an aircraft windshield type front view, wherein an arc of the Ear… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    ACM Class: H.5.2

  23. arXiv:2404.13924  [pdf, other

    cs.HC cs.ET

    ActSonic: Recognizing Everyday Activities from Inaudible Acoustic Wave Around the Body

    Authors: Saif Mahmud, Vineet Parikh, Qikang Liang, Ke Li, Ruidong Zhang, Ashwin Ajit, Vipin Gunda, Devansh Agarwal, François Guimbretière, Cheng Zhang

    Abstract: We present ActSonic, an intelligent, low-power active acoustic sensing system integrated into eyeglasses that can recognize 27 different everyday activities (e.g., eating, drinking, toothbrushing) from inaudible acoustic waves around the body. It requires only a pair of miniature speakers and microphones mounted on each hinge of the eyeglasses to emit ultrasonic waves, creating an acoustic aura ar… ▽ More

    Submitted 25 November, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Volume 8, Issue 4, November 2024, IMWUT/UbiComp 2025

  24. arXiv:2404.06445  [pdf, ps, other

    math.CO cs.DM

    Extremal minimal bipartite matching covered graphs

    Authors: Amit Kumar Mallik, Ajit A. Diwan, Nishad Kothari

    Abstract: A connected graph, on four or more vertices, is matching covered if every edge is present in some perfect matching. An ear decomposition theorem (similar to the one for $2$-connected graphs) exists for bipartite matching covered graphs due to Hetyei. From the results and proofs of Lovász and Plummer, that rely on Hetyei's theorem, one may deduce that any minimal bipartite matching covered graph ha… ▽ More

    Submitted 11 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Submitted to Innovations in Graph Theory

  25. arXiv:2404.05417  [pdf, other

    cs.HC cs.AI cs.CY

    Indexing Analytics to Instances: How Integrating a Dashboard can Support Design Education

    Authors: Ajit Jain, Andruid Kerne, Nic Lupfer, Gabriel Britain, Aaron Perrine, Yoonsuck Choe, John Keyser, Ruihong Huang, Jinsil Seo, Annie Sungkajun, Robert Lightfoot, Timothy McGuire

    Abstract: We investigate how to use AI-based analytics to support design education. The analytics at hand measure multiscale design, that is, students' use of space and scale to visually and conceptually organize their design work. With the goal of making the analytics intelligible to instructors, we developed a research artifact integrating a design analytics dashboard with design instances, and the design… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 22 pages, 4 figures, Submitted to ACM DIS

    ACM Class: H.5.2

  26. arXiv:2403.12120  [pdf, other

    astro-ph.IM astro-ph.SR cs.LG

    Light Curve Classification with DistClassiPy: a new distance-based classifier

    Authors: Siddharth Chaini, Ashish Mahabal, Ajit Kembhavi, Federica B. Bianco

    Abstract: The rise of synoptic sky surveys has ushered in an era of big data in time-domain astronomy, making data science and machine learning essential tools for studying celestial objects. While tree-based models (e.g. Random Forests) and deep learning models dominate the field, we explore the use of different distance metrics to aid in the classification of astrophysical objects. We developed DistClassi… ▽ More

    Submitted 25 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in Astronomy and Computing (2024). 24 pages, 19 figures

  27. arXiv:2402.08644  [pdf, other

    cs.AI cs.CL

    Tandem Transformers for Inference Efficient LLMs

    Authors: Aishwarya P S, Pranav Ajit Nair, Yashas Samaga, Toby Boyd, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

    Abstract: The autoregressive nature of conventional large language models (LLMs) inherently limits inference speed, as tokens are generated sequentially. While speculative and parallel decoding techniques attempt to mitigate this, they face limitations: either relying on less accurate smaller models for generation or failing to fully leverage the base LLM's representations. We introduce a novel architectu… ▽ More

    Submitted 20 October, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  28. arXiv:2402.07637  [pdf, other

    eess.SP cs.CV eess.IV

    Compressive Recovery of Signals Defined on Perturbed Graphs

    Authors: Sabyasachi Ghosh, Ajit Rajwade

    Abstract: Recovery of signals with elements defined on the nodes of a graph, from compressive measurements is an important problem, which can arise in various domains such as sensor networks, image reconstruction and group testing. In some scenarios, the graph may not be accurately known, and there may exist a few edge additions or deletions relative to a ground truth graph. Such perturbations, even if smal… ▽ More

    Submitted 16 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 18 pages, 15 figures. v2: Minor correction in ref [32]

  29. arXiv:2312.04294  [pdf, ps, other

    cs.NI

    Energy-Efficient Internet of Things Monitoring with Content-Based Wake-Up Radio

    Authors: Anay Ajit Deshpande, Federico Chiariotti, Andrea Zanella

    Abstract: The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled. However, polling-based WUR may still lead to wasted energy if values sensed by the polled sensors provide no new information to the… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  30. arXiv:2312.01532  [pdf, other

    cs.HC cs.CL

    Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments

    Authors: Shanqing Cai, Subhashini Venugopalan, Katie Seaver, Xiang Xiao, Katrin Tomanek, Sri Jalasutram, Meredith Ringel Morris, Shaun Kane, Ajit Narayanan, Robert L. MacDonald, Emily Kornman, Daniel Vance, Blair Casey, Steve M. Gleason, Philip Q. Nelson, Michael P. Brenner

    Abstract: Finding ways to accelerate text input for individuals with profound motor impairments has been a long-standing area of research. Closing the speed gap for augmentative and alternative communication (AAC) devices such as eye-tracking keyboards is important for improving the quality of life for such individuals. Recent advances in neural networks of natural language pose new opportunities for re-thi… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  31. arXiv:2311.13821  [pdf, other

    cs.LG cs.AI cs.CE stat.AP

    HypUC: Hyperfine Uncertainty Calibration with Gradient-boosted Corrections for Reliable Regression on Imbalanced Electrocardiograms

    Authors: Uddeshya Upadhyay, Sairam Bade, Arjun Puranik, Shahir Asfahan, Melwin Babu, Francisco Lopez-Jimenez, Samuel J. Asirvatham, Ashim Prasad, Ajit Rajasekharan, Samir Awasthi, Rakesh Barve

    Abstract: The automated analysis of medical time series, such as the electrocardiogram (ECG), electroencephalogram (EEG), pulse oximetry, etc, has the potential to serve as a valuable tool for diagnostic decisions, allowing for remote monitoring of patients and more efficient use of expensive and time-consuming medical procedures. Deep neural networks (DNNs) have been demonstrated to process such signals ef… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Published at TMLR

    Journal ref: Transactions on Machine Learning Research (TMLR), 2023

  32. arXiv:2311.02573  [pdf, other

    cs.DS cs.CV

    Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection

    Authors: Harsh Shah, Kashish Mittal, Ajit Rajwade

    Abstract: This work presents an adaptive group testing framework for the range-based high dimensional near neighbor search problem. Our method efficiently marks each item in a database as neighbor or non-neighbor of a query point, based on a cosine distance threshold without exhaustive search. Like other methods for large scale retrieval, our approach exploits the assumption that most of the items in the da… ▽ More

    Submitted 6 September, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: 28 pages (including Supplementary Material)

  33. arXiv:2310.15233  [pdf, other

    gr-qc astro-ph.HE astro-ph.IM cs.AI cs.LG

    New approach to template banks of gravitational waves with higher harmonics: Reducing matched-filtering cost by over an order of magnitude

    Authors: Digvijay Wadekar, Tejaswi Venumadhav, Ajit Kumar Mehta, Javier Roulet, Seth Olsen, Jonathan Mushkin, Barak Zackay, Matias Zaldarriaga

    Abstract: Searches for gravitational wave events use models, or templates, for the signals of interest. The templates used in current searches in the LIGO-Virgo-Kagra (LVK) data model the dominant quadrupole mode $(\ell,|m|)=(2,2)$ of the signals, and omit sub-dominant higher-order modes (HM) such as $(\ell,|m|)=(3,3)$, $(4,4)$, which are predicted by general relativity. This omission reduces search sensiti… ▽ More

    Submitted 16 October, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 12+2 pages, 8+1 figures. The code for generating our template banks and reproducing the plots in our paper is publicly available at https://github.com/JayWadekar/gwIAS-HM

    Journal ref: Phys. Rev. D 110, 084035 (2024)

  34. EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning

    Authors: Kallol Saha, Vishal Mandadi, Jayaram Reddy, Ajit Srikanth, Aditya Agarwal, Bipasha Sen, Arun Singh, Madhava Krishna

    Abstract: Classical motion planning for robotic manipulation includes a set of general algorithms that aim to minimize a scene-specific cost of executing a given plan. This approach offers remarkable adaptability, as they can be directly used off-the-shelf for any new scene without needing specific training datasets. However, without a prior understanding of what diverse valid trajectories are and without s… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 8 pages, 8 figures, submitted to ICRA 2024 (International Conference on Robotics and Automation)

    Journal ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)

  35. Low-Latency Massive Access with Multicast Wake Up Radio

    Authors: Anay Ajit Deshpande, Federico Chiariotti, Andrea Zanella

    Abstract: The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled, saving energy. However, polling-based Time Division Multiple Access (TDMA) may significantly increase data transmission delay if pac… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 2023 21st Mediterranean Communication and Computer Networking Conference (MedComNet)

  36. arXiv:2306.10797  [pdf, other

    eess.SY cs.LG math.DS

    Variability of echo state network prediction horizon for partially observed dynamical systems

    Authors: Ajit Mahata, Reetish Padhi, Amit Apte

    Abstract: Study of dynamical systems using partial state observation is an important problem due to its applicability to many real-world systems. We address the problem by studying an echo state network (ESN) framework with partial state input with partial or full state output. Application to the Lorenz system and Chua's oscillator (both numerically simulated and experimental systems) demonstrate the effect… ▽ More

    Submitted 5 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  37. arXiv:2306.05495  [pdf, other

    cs.CV cs.LG

    Is Attentional Channel Processing Design Required? Comprehensive Analysis Of Robustness Between Vision Transformers And Fully Attentional Networks

    Authors: Abhishri Ajit Medewar, Swanand Ashokrao Kavitkar

    Abstract: The robustness testing has been performed for standard CNN models and Vision Transformers, however there is a lack of comprehensive study between the robustness of traditional Vision Transformers without an extra attentional channel design and the latest fully attentional network(FAN) models. So in this paper, we use the ImageNet dataset to compare the robustness of fully attentional network(FAN)… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 4 pages, 12 figures

  38. arXiv:2306.04944  [pdf, ps, other

    math.CO cs.DM

    Colouring planar graphs with a precoloured induced cycle

    Authors: Ajit Diwan

    Abstract: Let $C$ be a cycle and $f : V(C) \rightarrow \{c_1,c_2,\ldots,c_k\}$ a proper $k$-colouring of $C$ for some $k \ge 4$. We say the colouring $f$ is safe if for any planar graph $G$ in which $C$ is an induced cycle, there exists a proper $k$-colouring $f'$ of $G$ such that $f'(v) = f(v)$ for all $v \in V(C)$. The only safe $4$-colouring is any proper colouring of a triangle. We give a simple necessa… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 18 pages

    MSC Class: 05C10; 05C15

  39. arXiv:2305.16820  [pdf, other

    cs.CL cs.AI

    Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization

    Authors: Pranav Ajit Nair, Sukomal Pal, Pradeepika Verma

    Abstract: Domain generalization is hitherto an underexplored area applied in abstractive summarization. Moreover, most existing works on domain generalization have sophisticated training algorithms. In this paper, we propose a lightweight, weight averaging based, Domain Aligned Prefix Averaging approach to domain generalization for abstractive summarization. Given a number of source domains, our method firs… ▽ More

    Submitted 29 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 13 pages, Accepted to ACL 2023 Findings

  40. arXiv:2305.15108  [pdf, other

    cs.CL

    The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

    Authors: Debayan Banerjee, Pranav Ajit Nair, Ricardo Usbeck, Chris Biemann

    Abstract: In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs)… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted as a short paper to ACL 2023 findings

  41. arXiv:2305.07639  [pdf, other

    cs.CV cs.LG

    Efficient Neural Network based Classification and Outlier Detection for Image Moderation using Compressed Sensing and Group Testing

    Authors: Sabyasachi Ghosh, Sanyam Saxena, Ajit Rajwade

    Abstract: Popular social media platforms employ neural network based image moderation engines to classify images uploaded on them as having potentially objectionable content. Such moderation engines must answer a large number of queries with heavy computational cost, even though the actual number of images with objectionable content is usually a tiny fraction. Inspired by recent work on Neural Group Testing… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  42. arXiv:2305.04883  [pdf, other

    q-bio.GN cs.LG

    Fuzzy Gene Selection and Cancer Classification Based on Deep Learning Model

    Authors: Mahmood Khalsan, Mu Mu, Eman Salih Al-Shamery, Lee Machado, Suraj Ajit, Michael Opoku Agyeman

    Abstract: Machine learning (ML) approaches have been used to develop highly accurate and efficient applications in many fields including bio-medical science. However, even with advanced ML techniques, cancer classification using gene expression data is still complicated because of the high dimensionality of the datasets employed. We developed a new fuzzy gene selection technique (FGS) to identify informativ… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Journal of Intelligent Information Systems (25,17)

  43. arXiv:2304.11507  [pdf, other

    cs.LG cs.AI

    Machine learning framework for end-to-end implementation of Incident duration prediction

    Authors: Smrithi Ajit, Varsha R Mouli, Skylar Knickerbocker, Jonathan S. Wood

    Abstract: Traffic congestion caused by non-recurring incidents such as vehicle crashes and debris is a key issue for Traffic Management Centers (TMCs). Clearing incidents in a timely manner is essential for improving safety and reducing delays and emissions for the traveling public. However, TMCs and other responders face a challenge in predicting the duration of incidents (until the roadway is clear), maki… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

  44. arXiv:2304.11277  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

    Authors: Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Pritam Damania, Bernard Nguyen, Geeta Chauhan, Yuchen Hao, Ajit Mathews, Shen Li

    Abstract: It is widely acknowledged that large models have the potential to deliver superior performance across a broad range of domains. Despite the remarkable progress made in the field of machine learning systems research, which has enabled the development and exploration of large models, such abilities remain confined to a small group of advanced users and industry leaders, resulting in an implicit tech… ▽ More

    Submitted 12 September, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  45. arXiv:2304.08769  [pdf, ps, other

    cs.LG cs.MA

    Cooperative Multi-Agent Reinforcement Learning for Inventory Management

    Authors: Madhav Khirwar, Karthik S. Gurumoorthy, Ankit Ajit Jain, Shantala Manchenahally

    Abstract: With Reinforcement Learning (RL) for inventory management (IM) being a nascent field of research, approaches tend to be limited to simple, linear environments with implementations that are minor modifications of off-the-shelf RL algorithms. Scaling these simplistic environments to a real-world supply chain comes with a few challenges such as: minimizing the computational requirements of the enviro… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 14 pages, 5 figures

  46. arXiv:2304.08740  [pdf, other

    stat.ML cs.LG eess.SP

    Estimating Joint Probability Distribution With Low-Rank Tensor Decomposition, Radon Transforms and Dictionaries

    Authors: Pranava Singhal, Waqar Mirza, Ajit Rajwade, Karthik S. Gurumoorthy

    Abstract: In this paper, we describe a method for estimating the joint probability density from data samples by assuming that the underlying distribution can be decomposed as a mixture of product densities with few mixture components. Prior works have used such a decomposition to estimate the joint density from lower-dimensional marginals, which can be estimated more reliably with the same number of samples… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    MSC Class: 62G07

  47. arXiv:2304.06376  [pdf, other

    cs.CV

    Signal Reconstruction from Samples at Unknown Locations with Application to 2D Unknown View Tomography

    Authors: Sheel Shah, Kaishva Shah, Karthik S. Gurumoorthy, Ajit Rajwade

    Abstract: It is well known that a band-limited signal can be reconstructed from its uniformly spaced samples if the sampling rate is sufficiently high. More recently, it has been proved that one can reconstruct a 1D band-limited signal even if the exact sample locations are unknown, but given a uniform distribution of the sample locations and their ordering in 1D. In this work, we extend the analytical erro… ▽ More

    Submitted 18 December, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: This is a preprint of a paper accepted to Signal Processing (Elsevier)

  48. arXiv:2304.00086  [pdf, other

    econ.GN cs.AI cs.LG stat.AP

    Machine Learning for Economics Research: When What and How?

    Authors: Ajit Desai

    Abstract: This article provides a curated review of selected papers published in prominent economics journals that use machine learning (ML) tools for research and policy analysis. The review focuses on three key questions: (1) when ML is used in economics, (2) what ML models are commonly preferred, and (3) how they are used for economic applications. The review highlights that ML is particularly used to pr… ▽ More

    Submitted 20 April, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

  49. arXiv:2303.13284  [pdf, other

    cs.CL cs.DB cs.IR

    GETT-QA: Graph Embedding based T2T Transformer for Knowledge Graph Question Answering

    Authors: Debayan Banerjee, Pranav Ajit Nair, Ricardo Usbeck, Chris Biemann

    Abstract: In this work, we present an end-to-end Knowledge Graph Question Answering (KGQA) system named GETT-QA. GETT-QA uses T5, a popular text-to-text pre-trained language model. The model takes a question in natural language as input and produces a simpler form of the intended SPARQL query. In the simpler form, the model does not directly produce entity and relation IDs. Instead, it produces correspondin… ▽ More

    Submitted 28 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 16 pages single column format accepted at ESWC 2023 research track

  50. arXiv:2303.06277  [pdf, other

    cs.CV

    SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction

    Authors: Avinash Ajit Nargund, Misha Sra

    Abstract: 3D human motion prediction is a research area of high significance and a challenge in computer vision. It is useful for the design of many applications including robotics and autonomous driving. Traditionally, autogregressive models have been used to predict human motion. However, these models have high computation needs and error accumulation that make it difficult to use them for realtime applic… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.