Showing 1–50 of 71 results for author: Shetty, S

  1. arXiv:2505.19364  [pdf, ps, other]

    cs.CR

    RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks

    Authors: Amit Chakraborty, Sayyed Farid Ahamed, Sandip Roy, Soumya Banerjee, Kevin Choi, Abdul Rahman, Alison Hu, Edward Bowen, Sachin Shetty

    Abstract: Machine Learning as a Service (MLaaS) enables users to leverage powerful machine learning models through cloud-based APIs, offering scalability and ease of deployment. However, these services are vulnerable to model extraction attacks, where adversaries repeatedly query the application programming interface (API) to reconstruct a functionally similar model, compromising intellectual property and s…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Presented at the IEEE International Wireless Communications and Mobile Computing Conference (IWCMC) 2025

    ACM Class: I.2.6; D.4.6; K.6.5

  2. arXiv:2504.21656  [pdf]

    cs.NI

    DBSCAN-based Vehicle Clustering and UAV Placement for NOMA-based Resource Management in Cellular V2X Communications

    Authors: Hossein Davoudi, Behrouz Shahgholi Ghahfarokhi, Neda Moghim, Sachin Shetty

    Abstract: In the future wireless networks, terrestrial, aerial, space, and maritime wireless networks are integrated into a unified network to meet the needs of a fully connected global network. Nowadays, vehicular communication has become one of the challenging applications of wireless networks. In this article, we aim to address the radio resource management in Cellular V2X (C-V2X) networks using Unmanned…

    Submitted 30 April, 2025; originally announced April 2025.

  3. arXiv:2504.20939  [pdf, other]

    cs.NI eess.SP

    Flexible Semantic-Aware Resource Allocation: Serving More Users Through Similarity Range Constraints

    Authors: Nasrin Gholami, Neda Moghim, Behrouz Shahgholi Ghahfarokhi, Pouyan Salavati, Christo Kurisummoottil Thomas, Sachin Shetty, Tahereh Rahmati

    Abstract: Semantic communication (SemCom) aims to enhance the resource efficiency of next-generation networks by transmitting the underlying meaning of messages, focusing on information relevant to the end user. Existing literature on SemCom primarily emphasizes learning the encoder and decoder through end-to-end deep learning frameworks, with the objective of minimizing a task-specific semantic loss functi…

    Submitted 29 April, 2025; originally announced April 2025.

  4. arXiv:2504.18671  [pdf, other]

    cs.AI

    Proof-of-TBI -- Fine-Tuned Vision Language Model Consortium and OpenAI-o3 Reasoning LLM-Based Medical Diagnosis Support System for Mild Traumatic Brain Injury (TBI) Prediction

    Authors: Ross Gore, Eranga Bandara, Sachin Shetty, Alberto E. Musto, Pratip Rana, Ambrosio Valencia-Romero, Christopher Rhea, Lobat Tayebi, Heather Richter, Atmaram Yarlagadda, Donna Edmonds, Steven Wallace, Donna Broshek

    Abstract: Mild Traumatic Brain Injury (TBI) detection presents significant challenges due to the subtle and often ambiguous presentation of symptoms in medical imaging, making accurate diagnosis a complex task. To address these challenges, we propose Proof-of-TBI, a medical diagnosis support system that integrates multiple fine-tuned vision-language models with the OpenAI-o3 reasoning large language model (…

    Submitted 25 April, 2025; originally announced April 2025.

  5. arXiv:2503.09513  [pdf, other]

    cs.CR cs.AI

    RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment

    Authors: Md Morshed Alam, Lokesh Chandra Das, Sandip Roy, Sachin Shetty, Weichao Wang

    Abstract: Internet of Things (IoT) platforms with trigger-action capability allow event conditions to trigger actions in IoT devices autonomously by creating a chain of interactions. Adversaries exploit this chain of interactions to maliciously inject fake event conditions into IoT hubs, triggering unauthorized actions on target IoT devices to implement remote injection attacks. Existing defense mechanisms…

    Submitted 12 March, 2025; originally announced March 2025.

  6. arXiv:2502.20509  [pdf, other]

    cs.CV

    CoCa-CXR: Contrastive Captioners Learn Strong Temporal Structures for Chest X-Ray Vision-Language Understanding

    Authors: Yixiong Chen, Shawn Xu, Andrew Sellergren, Yossi Matias, Avinatan Hassidim, Shravya Shetty, Daniel Golden, Alan Yuille, Lin Yang

    Abstract: Vision-language models have proven to be of great benefit for medical image analysis since they learn rich semantics from both images and reports. Prior efforts have focused on better alignment of image and text representations to enhance image understanding. However, though explicit reference to a prior image is common in Chest X-Ray (CXR) reports, aligning progression descriptions with the seman…

    Submitted 27 February, 2025; originally announced February 2025.

  7. arXiv:2502.10536  [pdf, other]

    cs.CV cs.AI cs.LG

    PolyPath: Adapting a Large Multimodal Model for Multi-slide Pathology Report Generation

    Authors: Faruk Ahmed, Lin Yang, Tiam Jaroensri, Andrew Sellergren, Yossi Matias, Avinatan Hassidim, Greg S. Corrado, Dale R. Webster, Shravya Shetty, Shruthi Prabhakara, Yun Liu, Daniel Golden, Ellery Wulczyn, David F. Steiner

    Abstract: The interpretation of histopathology cases underlies many important diagnostic and treatment decisions in medicine. Notably, this process typically requires pathologists to integrate and summarize findings across multiple slides per case. Existing vision-language capabilities in computational pathology have so far been largely limited to small regions of interest, larger regions at low magnificati…

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: 8 main pages, 21 pages in total

  8. arXiv:2412.17462  [pdf, other]

    cs.RO

    Sampling-Based Constrained Motion Planning with Products of Experts

    Authors: Amirreza Razmjoo, Teng Xue, Suhan Shetty, Sylvain Calinon

    Abstract: We present a novel approach to enhance the performance of sampling-based Model Predictive Control (MPC) in constrained optimization by leveraging products of experts. Our methodology divides the main problem into two components: one focused on optimality and the other on feasibility. By combining the solutions from each component, represented as distributions, we apply products of experts to imple…

    Submitted 23 December, 2024; originally announced December 2024.

  9. arXiv:2412.11829  [pdf, other]

    cs.RO

    Robust Contact-rich Manipulation through Implicit Motor Adaptation

    Authors: Teng Xue, Amirreza Razmjoo, Suhan Shetty, Sylvain Calinon

    Abstract: Contact-rich manipulation plays an important role in daily human activities. However, uncertain physical parameters often pose significant challenges for both planning and control. A promising strategy is to develop policies that are robust across a wide range of parameters. Domain adaptation and domain randomization are widely used, but they tend to either limit generalization to new instances or…

    Submitted 28 May, 2025; v1 submitted 16 December, 2024; originally announced December 2024.

  10. arXiv:2412.05183  [pdf, other]

    cs.LG cs.CR

    Privacy Drift: Evolving Privacy Concerns in Incremental Learning

    Authors: Sayyed Farid Ahamed, Soumya Banerjee, Sandip Roy, Aayush Kapoor, Marc Vucovich, Kevin Choi, Abdul Rahman, Edward Bowen, Sachin Shetty

    Abstract: In the evolving landscape of machine learning (ML), Federated Learning (FL) presents a paradigm shift towards decentralized model training while preserving user data privacy. This paper introduces the concept of "privacy drift", an innovative framework that parallels the well-known phenomenon of concept drift. While concept drift addresses the variability in model accuracy over time due to change…

    Submitted 6 December, 2024; originally announced December 2024.

    Comments: 6 pages, 7 figures, Accepted in IEEE ICNC 25

  11. arXiv:2412.05091  [pdf, other]

    cs.DB cs.CR

    Designing a Secure, Scalable, and Cost-Effective Cloud Storage Solution: A Novel Approach to Data Management using NextCloud, TrueNAS, and QEMU/KVM

    Authors: Prakash Aryan, Sujala Deepak Shetty

    Abstract: This paper presents a novel approach to cloud storage challenges by integrating NextCloud, TrueNAS, and QEMU/KVM. Our research demonstrates how this combination creates a robust, flexible, and economical cloud storage system suitable for various applications. We detail the architecture, highlighting TrueNAS's ZFS-based storage, QEMU/KVM's virtualization, and NextCloud's user interface. Extensive t…

    Submitted 6 December, 2024; originally announced December 2024.

  12. arXiv:2411.15128  [pdf, other]

    cs.LG cs.AI cs.CV cs.MM eess.IV

    Health AI Developer Foundations

    Authors: Atilla P. Kiraly, Sebastien Baur, Kenneth Philbrick, Fereshteh Mahvar, Liron Yatziv, Tiffany Chen, Bram Sterling, Nick George, Fayaz Jamil, Jing Tang, Kai Bailey, Faruk Ahmed, Akshay Goel, Abbi Ward, Lin Yang, Andrew Sellergren, Yossi Matias, Avinatan Hassidim, Shravya Shetty, Daniel Golden, Shekoofeh Azizi, David F. Steiner, Yun Liu, Tim Thelin, Rory Pilgrim, et al. (1 additional author not shown)

    Abstract: Robust medical Machine Learning (ML) models have the potential to revolutionize healthcare by accelerating clinical research, improving workflows and outcomes, and producing novel insights or capabilities. Developing such ML models from scratch is cost prohibitive and requires substantial compute, data, and time (e.g., expert labeling). To address these challenges, we introduce Health AI Developer…

    Submitted 26 November, 2024; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: 16 pages, 8 figures

  13. arXiv:2411.07207  [pdf, other]

    cs.LG cs.CY

    General Geospatial Inference with a Population Dynamics Foundation Model

    Authors: Mohit Agarwal, Mimi Sun, Chaitanya Kamath, Arbaaz Muslim, Prithul Sarker, Joydeep Paul, Hector Yee, Marcin Sieniek, Kim Jablonski, Yael Mayer, David Fork, Sheila de Guia, Jamie McPike, Adam Boulanger, Tomer Shekel, David Schottlander, Yao Xiao, Manjit Chakravarthy Manukonda, Yun Liu, Neslihan Bulut, Sami Abu-el-haija, Bryan Perozzi, Monica Bharel, Von Nguyen, Luke Barrington, et al. (7 additional authors not shown)

    Abstract: Supporting the health and well-being of dynamic populations around the world requires governmental agencies, organizations and researchers to understand and reason over complex relationships between human behavior and local contexts in order to identify high-risk groups and strategically allocate limited resources. Traditional approaches to these classes of problems often entail developing manuall…

    Submitted 29 January, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

    Comments: 28 pages, 16 figures, preprint; v4: updated authors

  14. arXiv:2410.22721  [pdf, other]

    cs.LG

    Community search signatures as foundation features for human-centered geospatial modeling

    Authors: Mimi Sun, Chaitanya Kamath, Mohit Agarwal, Arbaaz Muslim, Hector Yee, David Schottlander, Shailesh Bavadekar, Niv Efron, Shravya Shetty, Gautam Prasad

    Abstract: Aggregated relative search frequencies offer a unique composite signal reflecting people's habits, concerns, interests, intents, and general information needs, which are not found in other readily available datasets. Temporal search trends have been successfully used in time series modeling across a variety of domains such as infectious diseases, unemployment rates, and retail sales. However, most…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 8 pages, 8 figures, presented at the DMLR workshop at ICML 2024

  15. arXiv:2410.18560  [pdf]

    cs.AI

    Explainable News Summarization -- Analysis and mitigation of Disagreement Problem

    Authors: Seema Aswani, Sujala D. Shetty

    Abstract: Explainable AI (XAI) techniques for text summarization provide valuable understanding of how the summaries are generated. Recent studies have highlighted a major challenge in this area, known as the disagreement problem. This problem occurs when different XAI methods offer contradictory explanations for the summary generated from the same input article. This inconsistency across XAI methods has be…

    Submitted 24 October, 2024; originally announced October 2024.

  16. arXiv:2410.12245  [pdf, other]

    eess.IV cs.CV

    Advancing Healthcare: Innovative ML Approaches for Improved Medical Imaging in Data-Constrained Environments

    Authors: Al Amin, Kamrul Hasan, Saleh Zein-Sabatto, Liang Hong, Sachin Shetty, Imtiaz Ahmed, Tariqul Islam

    Abstract: Healthcare industries face challenges when experiencing rare diseases due to limited samples. Artificial Intelligence (AI) communities overcome this situation to create synthetic data which is an ethical and privacy issue in the medical domain. This research introduces the CAT-U-Net framework as a new approach to overcome these limitations, which enhances feature extraction from medical images wit…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 7 pages, 7 figures

  17. arXiv:2410.11600  [pdf, other]

    cs.RO

    Robust Manipulation Primitive Learning via Domain Contraction

    Authors: Teng Xue, Amirreza Razmjoo, Suhan Shetty, Sylvain Calinon

    Abstract: Contact-rich manipulation plays an important role in human daily activities, but uncertain parameters pose significant challenges for robots to achieve comparable performance through planning and control. To address this issue, domain adaptation and domain randomization have been proposed for robust policy learning. However, they either lose the generalization ability across diverse instances or p…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: Conference on Robot Learning (CoRL), 2024

  18. arXiv:2410.02637  [pdf, other]

    cs.AI cs.CV

    Plots Unlock Time-Series Understanding in Multimodal Models

    Authors: Mayank Daswani, Mathias M. J. Bellaiche, Marc Wilson, Desislav Ivanov, Mikhail Papkov, Eva Schnider, Jing Tang, Kay Lamerigts, Gabriela Botea, Michael A. Sanchez, Yojan Patel, Shruthi Prabhakara, Shravya Shetty, Umesh Telang

    Abstract: While multimodal foundation models can now natively work with data beyond text, they remain underutilized in analyzing the considerable amounts of multi-dimensional time-series data in fields like healthcare, finance, and social sciences, representing a missed opportunity for richer, data-driven insights. This paper proposes a simple but effective method that leverages the existing vision encoders…

    Submitted 28 November, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: 57 pages

  19. arXiv:2407.19119  [pdf, other]

    cs.LG cs.AI cs.CR

    Accuracy-Privacy Trade-off in the Mitigation of Membership Inference Attack in Federated Learning

    Authors: Sayyed Farid Ahamed, Soumya Banerjee, Sandip Roy, Devin Quinn, Marc Vucovich, Kevin Choi, Abdul Rahman, Alison Hu, Edward Bowen, Sachin Shetty

    Abstract: Over the last few years, federated learning (FL) has emerged as a prominent method in machine learning, emphasizing privacy preservation by allowing multiple clients to collaboratively build a model while keeping their training data private. Despite this focus on privacy, FL models are susceptible to various attacks, including membership inference attacks (MIAs), posing a serious threat to data co…

    Submitted 26 July, 2024; originally announced July 2024.

  20. arXiv:2406.19578  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    PathAlign: A vision-language model for whole slide images in histopathology

    Authors: Faruk Ahmed, Andrew Sellergren, Lin Yang, Shawn Xu, Boris Babenko, Abbi Ward, Niels Olson, Arash Mohtashamian, Yossi Matias, Greg S. Corrado, Quang Duong, Dale R. Webster, Shravya Shetty, Daniel Golden, Yun Liu, David F. Steiner, Ellery Wulczyn

    Abstract: Microscopic interpretation of histopathology images underlies many important diagnostic and treatment decisions. While advances in vision-language modeling raise new opportunities for analysis of such images, the gigapixel-scale size of whole slide images (WSIs) introduces unique challenges. Additionally, pathology reports simultaneously highlight key findings from small regions while also aggrega…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 9 main pages and 19 pages of supplemental material; 3 main tables, 3 main figures and 11 supplemental tables, 7 supplemental figures

  21. arXiv:2406.19112  [pdf, other]

    cs.LG

    A Teacher Is Worth A Million Instructions

    Authors: Nikhil Kothari, Ravindra Nayak, Shreyas Shetty, Amey Patil, Nikesh Garera

    Abstract: Large Language Models (LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent limitations in training methods create substantial difficulties to train relatively smaller models with 7B and 13B parameters. In our research, we suggest an improved…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures

  22. arXiv:2406.06474  [pdf, other]

    cs.AI cs.CL

    Towards a Personal Health Large Language Model

    Authors: Justin Cosentino, Anastasiya Belyaeva, Xin Liu, Nicholas A. Furlotte, Zhun Yang, Chace Lee, Erik Schenck, Yojan Patel, Jian Cui, Logan Douglas Schneider, Robby Bryant, Ryan G. Gomes, Allen Jiang, Roy Lee, Yun Liu, Javier Perez, Jameson K. Rogers, Cathy Speed, Shyam Tailor, Megan Walker, Jeffrey Yu, Tim Althoff, Conor Heneghan, John Hernandez, Mark Malhotra, et al. (9 additional authors not shown)

    Abstract: In health, most large language model (LLM) research has focused on clinical tasks. However, mobile and wearable devices, which are rarely integrated into such tasks, provide rich, longitudinal data for personal health monitoring. Here we present Personal Health Large Language Model (PH-LLM), fine-tuned from Gemini for understanding and reasoning over numerical time-series personal health data. We…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 72 pages

  23. arXiv:2405.04082  [pdf, other]

    cs.RO

    Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning

    Authors: Teng Xue, Amirreza Razmjoo, Suhan Shetty, Sylvain Calinon

    Abstract: Recent advances in robot skill learning have unlocked the potential to construct task-agnostic skill libraries, facilitating the seamless sequencing of multiple simple manipulation primitives (a.k.a. skills) to tackle significantly more complex tasks. Nevertheless, determining the optimal sequence for independently learned skills remains an open problem, particularly when the objective is given sole…

    Submitted 16 July, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: In Proc. Robotics: Science and Systems (RSS), 2024

  24. arXiv:2405.03162  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Advancing Multimodal Medical Capabilities of Gemini

    Authors: Lin Yang, Shawn Xu, Andrew Sellergren, Timo Kohlberger, Yuchen Zhou, Ira Ktena, Atilla Kiraly, Faruk Ahmed, Farhad Hormozdiari, Tiam Jaroensri, Eric Wang, Ellery Wulczyn, Fayaz Jamil, Theo Guidroz, Chuck Lau, Siyuan Qiao, Yun Liu, Akshay Goel, Kendall Park, Arnav Agharwal, Nick George, Yang Wang, Ryutaro Tanno, David G. T. Barrett, Wei-Hung Weng, et al. (22 additional authors not shown)

    Abstract: Many clinical tasks require an understanding of specialized data, such as medical images and genomics, which is not typically found in general-purpose large multimodal models. Building upon Gemini's multimodal models, we develop several models within the new Med-Gemini family that inherit core capabilities of Gemini and are optimized for medical use via fine-tuning with 2D and 3D radiology, histop…

    Submitted 6 May, 2024; originally announced May 2024.

  25. arXiv:2403.06388  [pdf, other]

    cs.CR cs.LG

    A Zero Trust Framework for Realization and Defense Against Generative AI Attacks in Power Grid

    Authors: Md. Shirajum Munir, Sravanthi Proddatoori, Manjushree Muralidhara, Walid Saad, Zhu Han, Sachin Shetty

    Abstract: Understanding the potential of generative AI (GenAI)-based attacks on the power grid is a fundamental challenge that must be addressed in order to protect the power grid by realizing and validating risk in new attack vectors. In this paper, a novel zero trust framework for a power grid supply chain (PGSC) is proposed. This framework facilitates early detection of potential GenAI-driven attack vect…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted article by IEEE International Conference on Communications (ICC 2024), Copyright 2024 IEEE

  26. arXiv:2403.02522  [pdf, other]

    cs.LG cs.AI

    HeAR -- Health Acoustic Representations

    Authors: Sebastien Baur, Zaid Nabulsi, Wei-Hung Weng, Jake Garrison, Louis Blankemeier, Sam Fishman, Christina Chen, Sujay Kakarmath, Minyoi Maimbolwa, Nsala Sanjase, Brian Shuma, Yossi Matias, Greg S. Corrado, Shwetak Patel, Shravya Shetty, Shruthi Prabhakara, Monde Muyoyeta, Diego Ardila

    Abstract: Health acoustic sounds such as coughs and breaths are known to contain useful health signals with significant potential for monitoring health and disease, yet are underexplored in the medical machine learning community. The existing deep learning systems for health acoustics are often narrowly trained and evaluated on a single task, which is limited by data and may hinder generalization to other t…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 4 tables, 4 figures, 6 supplementary tables, 3 supplementary figures

  27. arXiv:2401.10289  [pdf]

    cs.ET cs.AI cs.LG cs.NE

    Design and development of opto-neural processors for simulation of neural networks trained in image detection for potential implementation in hybrid robotics

    Authors: Sanjana Shetty

    Abstract: Neural networks have been employed for a wide range of processing applications like image processing, motor control, object detection and many others. Living neural networks offer advantages of lower power consumption, faster processing, and biological realism. Optogenetics offers high spatial and temporal control over biological neurons and presents potential in training live neural networks. Thi…

    Submitted 16 January, 2024; originally announced January 2024.

  28. Performance Analysis of Fixed Broadband Wireless Access in mmWave Band in 5G

    Authors: Soumya Banerjee, Sarada Prasad Gochhayat, Sachin Shetty

    Abstract: An end-to-end fiber-based network holds the potential to provide multi-gigabit fixed access to end-users. However, deploying fiber access, especially in areas where fiber is non-existent, can be time-consuming and costly, resulting in delayed returns for Operators. This work investigates transmission data from fixed broadband wireless access in the mmWave band in 5G. Given the growing interest in…

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 6 pages, 16 figures, Published in ICNC 22

  29. arXiv:2312.00051  [pdf, other]

    cs.CR cs.AI cs.LG

    MIA-BAD: An Approach for Enhancing Membership Inference Attack and its Mitigation with Federated Learning

    Authors: Soumya Banerjee, Sandip Roy, Sayyed Farid Ahamed, Devin Quinn, Marc Vucovich, Dhruv Nandakumar, Kevin Choi, Abdul Rahman, Edward Bowen, Sachin Shetty

    Abstract: The membership inference attack (MIA) is a popular paradigm for compromising the privacy of a machine learning (ML) model. MIA exploits the natural inclination of ML models to overfit upon the training data. MIAs are trained to distinguish between training and testing prediction confidence to infer membership information. Federated Learning (FL) is a privacy-preserving ML paradigm that enables mul…

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 6 pages, 5 figures, Accepted to be published in ICNC 23

  30. arXiv:2311.18260  [pdf, other]

    eess.IV cs.CL cs.CV cs.LG

    Consensus, dissensus and synergy between clinicians and specialist foundation models in radiology report generation

    Authors: Ryutaro Tanno, David G. T. Barrett, Andrew Sellergren, Sumedh Ghaisas, Sumanth Dathathri, Abigail See, Johannes Welbl, Karan Singhal, Shekoofeh Azizi, Tao Tu, Mike Schaekermann, Rhys May, Roy Lee, SiWai Man, Zahra Ahmed, Sara Mahdavi, Yossi Matias, Joelle Barral, Ali Eslami, Danielle Belgrave, Vivek Natarajan, Shravya Shetty, Pushmeet Kohli, Po-Sen Huang, Alan Karthikesalingam, et al. (1 additional author not shown)

    Abstract: Radiology reports are an instrumental part of modern medicine, informing key clinical decisions such as diagnosis and treatment. The worldwide shortage of radiologists, however, restricts access to expert care and imposes heavy workloads, contributing to avoidable errors and delays in report delivery. While recent progress in automated report generation with vision-language models offers clear pote…

    Submitted 20 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

  31. arXiv:2311.17097  [pdf, other]

    cs.LG cs.AI cs.CR cs.NI

    Anonymous Jamming Detection in 5G with Bayesian Network Model Based Inference Analysis

    Authors: Ying Wang, Shashank Jere, Soumya Banerjee, Lingjia Liu, Sachin Shetty, Shehadi Dayekh

    Abstract: Jamming and intrusion detection are critical in 5G research, aiming to maintain reliability, prevent user experience degradation, and avoid infrastructure failure. This paper introduces an anonymous jamming detection model for 5G based on signal parameters from the protocol stacks. The system uses supervised and unsupervised learning for real-time, high-accuracy detection of jamming, including unk…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 6 pages, 9 figures, Published in HPSR22. arXiv admin note: text overlap with arXiv:2304.13660

  32. arXiv:2309.05227  [pdf, other]

    cs.CL cs.AI

    Detecting Natural Language Biases with Prompt-based Learning

    Authors: Md Abdul Aowal, Maliha T Islam, Priyanka Mary Mammen, Sandesh Shetty

    Abstract: In this project, we want to explore the newly emerging field of prompt engineering and apply it to the downstream task of detecting LM biases. More concretely, we explore how to design prompts that can indicate 4 different types of biases: (1) gender, (2) race, (3) sexual orientation, and (4) religion-based. Within our project, we experiment with different manually crafted prompts that can draw ou…

    Submitted 11 September, 2023; originally announced September 2023.

  33. arXiv:2308.01317  [pdf]

    cs.CV eess.IV

    ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders

    Authors: Shawn Xu, Lin Yang, Christopher Kelly, Marcin Sieniek, Timo Kohlberger, Martin Ma, Wei-Hung Weng, Atilla Kiraly, Sahar Kazemzadeh, Zakkai Melamed, Jungyeon Park, Patricia Strachan, Yun Liu, Chuck Lau, Preeti Singh, Christina Chen, Mozziyar Etemadi, Sreenivasa Raju Kalidindi, Yossi Matias, Katherine Chou, Greg S. Corrado, Shravya Shetty, Daniel Tse, Shruthi Prabhakara, Daniel Golden, et al. (3 additional authors not shown)

    Abstract: In this work, we present an approach, which we call Embeddings for Language/Image-aligned X-Rays, or ELIXR, that leverages a language-aligned image encoder combined or grafted onto a fixed LLM, PaLM 2, to perform a broad range of chest X-ray tasks. We train this lightweight adapter architecture using images paired with corresponding free-text radiology reports from the MIMIC-CXR dataset. ELIXR ach…

    Submitted 7 September, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  34. arXiv:2307.09018  [pdf, other]

    q-bio.QM cs.LG

    Multimodal LLMs for health grounded in individual-specific data

    Authors: Anastasiya Belyaeva, Justin Cosentino, Farhad Hormozdiari, Krish Eswaran, Shravya Shetty, Greg Corrado, Andrew Carroll, Cory Y. McLean, Nicholas A. Furlotte

    Abstract: Foundation large language models (LLMs) have shown an impressive ability to solve tasks across a wide range of fields including health. To effectively solve personalized health tasks, LLMs need the ability to ingest a diversity of data modalities that are relevant to an individual's health status. In this paper, we take a step towards creating multimodal LLMs for health that are grounded in indivi…

    Submitted 20 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  35. arXiv:2306.07993  [pdf, other]

    cs.CR cs.AI cs.LG

    Trustworthy Artificial Intelligence Framework for Proactive Detection and Risk Explanation of Cyber Attacks in Smart Grid

    Authors: Md. Shirajum Munir, Sachin Shetty, Danda B. Rawat

    Abstract: The rapid growth of distributed energy resources (DERs), such as renewable energy sources, generators, consumers, and prosumers in the smart grid infrastructure, poses significant cybersecurity and trust challenges to the grid controller. Consequently, it is crucial to identify adversarial tactics and measure the strength of the attacker's DER. To enable a trustworthy smart grid controller, this w…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Submitted for peer review

  36. arXiv:2305.05648  [pdf]

    cs.CV cs.AI cs.LG

    Predicting Cardiovascular Disease Risk using Photoplethysmography and Deep Learning

    Authors: Wei-Hung Weng, Sebastien Baur, Mayank Daswani, Christina Chen, Lauren Harrell, Sujay Kakarmath, Mariam Jabara, Babak Behsaz, Cory Y. McLean, Yossi Matias, Greg S. Corrado, Shravya Shetty, Shruthi Prabhakara, Yun Liu, Goodarz Danaei, Diego Ardila

    Abstract: Cardiovascular diseases (CVDs) are responsible for a large proportion of premature deaths in low- and middle-income countries. Early CVD detection and intervention is critical in these populations, yet many existing CVD risk scores require a physical examination or lab measurements, which can be challenging in such health systems due to limited accessibility. Here we investigated the potential to…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: main: 24 pages (3 tables, 2 figures, 42 references), supplementary: 25 pages (9 tables, 4 figures, 11 references)

  37. arXiv:2304.10946  [pdf, other]

    cs.CL cs.LG q-bio.BM

    CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models

    Authors: Tianhao Li, Sandesh Shetty, Advaith Kamath, Ajay Jaiswal, Xianqian Jiang, Ying Ding, Yejin Kim

    Abstract: Large pre-trained language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology, has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structure…

    Submitted 17 April, 2023; originally announced April 2023.

  38. Understanding metric-related pitfalls in image analysis validation

    Authors: Annika Reinke, Minu D. Tizabi, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, A. Emre Kavur, Tim Rädsch, Carole H. Sudre, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Arriel Benis, Matthew Blaschko, Florian Buettner, M. Jorge Cardoso, Veronika Cheplygina, Jianxu Chen, Evangelia Christodoulou, Beth A. Cimini, Gary S. Collins, Keyvan Farahani, Luciana Ferrer, Adrian Galdran, Bram van Ginneken, et al. (53 additional authors not shown)

    Abstract: Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibilit…

    Submitted 23 February, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Shared first authors: Annika Reinke and Minu D. Tizabi; shared senior authors: Lena Maier-Hein and Paul F. Jäger. Published in Nature Methods. arXiv admin note: text overlap with arXiv:2206.01653

    Journal ref: Nature methods, 1-13 (2024)

  39. arXiv:2211.09006  [pdf, other]

    cs.RO

    ToolFlowNet: Robotic Manipulation with Tools via Predicting Tool Flow from Point Clouds

    Authors: Daniel Seita, Yufei Wang, Sarthak J. Shetty, Edward Yao Li, Zackory Erickson, David Held

    Abstract: Point clouds are a widely available and canonical data modality which convey the 3D geometry of a scene. Despite significant progress in classification and segmentation from point clouds, policy learning from such a modality remains challenging, and most prior works in imitation learning focus on learning policies from images or state information. In this paper, we propose a novel framework for le…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Conference on Robot Learning (CoRL), 2022. Supplementary material is available at https://sites.google.com/view/point-cloud-policy/home

  40. arXiv:2210.15430  [pdf, other]

    cs.CY cs.AI cs.LG

    Student-centric Model of Learning Management System Activity and Academic Performance: from Correlation to Causation

    Authors: Varun Mandalapu, Lujie Karen Chen, Sushruta Shetty, Zhiyuan Chen, Jiaqi Gong

    Abstract: In recent years, there is a lot of interest in modeling students' digital traces in Learning Management System (LMS) to understand students' learning behavior patterns including aspects of meta-cognition and self-regulation, with the ultimate goal to turn those insights into actionable information to support students to improve their learning outcomes. In achieving this goal, however, there are tw…

    Submitted 29 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

  41. arXiv:2210.06649  [pdf, other]

    cs.AI cs.NI

    Neuro-symbolic Explainable Artificial Intelligence Twin for Zero-touch IoE in Wireless Network

    Authors: Md. Shirajum Munir, Ki Tae Kim, Apurba Adhikary, Walid Saad, Sachin Shetty, Seong-Bae Park, Choong Seon Hong

    Abstract: Explainable artificial intelligence (XAI) twin systems will be a fundamental enabler of zero-touch network and service management (ZSM) for sixth-generation (6G) wireless networks. A reliable XAI twin system for ZSM requires two composites: an extreme analytical ability for discretizing the physical behavior of the Internet of Everything (IoE) and rigorous methods for characterizing the reasoning…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Submitted to a journal for peer review

  42. arXiv:2206.05077  [pdf, other]

    cs.RO cs.LG eess.SP eess.SY math.OC

    Tensor Train for Global Optimization Problems in Robotics

    Authors: Suhan Shetty, Teguh Lembono, Tobias Loew, Sylvain Calinon

    Abstract: The convergence of many numerical optimization techniques is highly dependent on the initial guess given to the solver. To address this issue, we propose a novel approach that utilizes tensor methods to initialize existing optimization solvers near global optima. Our method does not require access to a database of good solutions. We first transform the cost function, which depends on both task par…

    Submitted 22 November, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: 25 pages, 21 figures

  43. Metrics reloaded: Recommendations for image analysis validation

    Authors: Lena Maier-Hein, Annika Reinke, Patrick Godau, Minu D. Tizabi, Florian Buettner, Evangelia Christodoulou, Ben Glocker, Fabian Isensee, Jens Kleesiek, Michal Kozubek, Mauricio Reyes, Michael A. Riegler, Manuel Wiesenfarth, A. Emre Kavur, Carole H. Sudre, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, Tim Rädsch, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Arriel Benis, Matthew Blaschko, et al. (49 additional authors not shown)

    Abstract: Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international ex…

    Submitted 23 February, 2024; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Shared first authors: Lena Maier-Hein, Annika Reinke. Published in Nature Methods. arXiv admin note: substantial text overlap with arXiv:2104.05642

    Journal ref: Nature methods, 1-18 (2024)

  44. arXiv:2205.12231  [pdf, other]

    cs.CV cs.GR

    ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions

    Authors: Difan Liu, Sandesh Shetty, Tobias Hinz, Matthew Fisher, Richard Zhang, Taesung Park, Evangelos Kalogerakis

    Abstract: We present ASSET, a neural architecture for automatically modifying an input high-resolution image according to a user's edits on its semantic segmentation map. Our architecture is based on a transformer with a novel attention mechanism. Our key idea is to sparsify the transformer's attention matrix at high resolutions, guided by dense attention extracted at lower image resolutions. While previous…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: SIGGRAPH 2022 - Journal Track

  45. arXiv:2203.11903  [pdf]

    cs.LG cs.CV eess.IV

    Enabling faster and more reliable sonographic assessment of gestational age through machine learning

    Authors: Chace Lee, Angelica Willis, Christina Chen, Marcin Sieniek, Akib Uddin, Jonny Wong, Rory Pilgrim, Katherine Chou, Daniel Tse, Shravya Shetty, Ryan G. Gomes

    Abstract: Fetal ultrasounds are an essential part of prenatal care and can be used to estimate gestational age (GA). Accurate GA assessment is important for providing appropriate prenatal care throughout pregnancy and identifying complications such as fetal growth disorders. Since derivation of GA from manual fetal biometry measurements (head, abdomen, femur) are operator-dependent and time-consuming, there…

    Submitted 22 March, 2022; originally announced March 2022.

  46. arXiv:2203.10139  [pdf]

    cs.LG cs.AI cs.CV eess.IV

    AI system for fetal ultrasound in low-resource settings

    Authors: Ryan G. Gomes, Bellington Vwalika, Chace Lee, Angelica Willis, Marcin Sieniek, Joan T. Price, Christina Chen, Margaret P. Kasaro, James A. Taylor, Elizabeth M. Stringer, Scott Mayer McKinney, Ntazana Sindano, George E. Dahl, William Goodnight III, Justin Gilmer, Benjamin H. Chi, Charles Lau, Terry Spitz, T Saensuksopa, Kris Liu, Jonny Wong, Rory Pilgrim, Akib Uddin, Greg Corrado, Lily Peng, et al. (4 additional authors not shown)

    Abstract: Despite considerable progress in maternal healthcare, maternal and perinatal deaths remain high in low-to-middle income countries. Fetal ultrasound is an important component of antenatal care, but shortage of adequately trained healthcare workers has limited its adoption. We developed and validated an artificial intelligence (AI) system that uses novice-acquired "blind sweep" ultrasound videos to…

    Submitted 18 March, 2022; originally announced March 2022.

  47. arXiv:2203.05931  [pdf, other]

    stat.ML cs.LG

    FedSyn: Synthetic Data Generation using Federated Learning

    Authors: Monik Raj Behera, Sudhir Upadhyay, Suresh Shetty, Sudha Priyadarshini, Palka Patel, Ker Farn Lee

    Abstract: As Deep Learning algorithms continue to evolve and become more sophisticated, they require massive datasets for model training and efficacy of models. Some of those data requirements can be met with the help of existing datasets within the organizations. Current Machine Learning practices can be leveraged to generate synthetic data from an existing dataset. Further, it is well established that div…

    Submitted 5 April, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

  48. arXiv:2202.07764  [pdf, other]

    quant-ph cs.CR cs.NI physics.optics

    Paving the Way towards 800 Gbps Quantum-Secured Optical Channel Deployment in Mission-Critical Environments

    Authors: Marco Pistoia, Omar Amer, Monik R. Behera, Joseph A. Dolphin, James F. Dynes, Benny John, Paul A. Haigh, Yasushi Kawakura, David H. Kramer, Jeffrey Lyon, Navid Moazzami, Tulasi D. Movva, Antigoni Polychroniadou, Suresh Shetty, Greg Sysak, Farzam Toudeh-Fallah, Sudhir Upadhyay, Robert I. Woodward, Andrew J. Shields

    Abstract: This article describes experimental research studies conducted towards understanding the implementation aspects of high-capacity quantum-secured optical channels in mission-critical metro-scale operational environments using Quantum Key Distribution (QKD) technology. To the best of our knowledge, this is the first time that an 800 Gbps quantum-secured optical channel -- along with several other De…

    Submitted 2 March, 2023; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: 11 pages, 9 figures, 2 tables

    Journal ref: Quantum Science and Technology, Institute of Physics, May 2023

  49. Succinct Differentiation of Disparate Boosting Ensemble Learning Methods for Prognostication of Polycystic Ovary Syndrome Diagnosis

    Authors: Abhishek Gupta, Sannidhi Shetty, Raunak Joshi, Ronald Melwin Laban

    Abstract: Prognostication of medical problems using the clinical data by leveraging the Machine Learning techniques with stellar precision is one of the most important real world challenges at the present time. Considering the medical problem of Polycystic Ovary Syndrome also known as PCOS is an emerging problem in women aged from 15 to 49. Diagnosing this disorder by using various Boosting Ensemble Methods…

    Submitted 13 August, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

    Comments: 8 pages, 5 figures, 3 tables, Published in the Proceedings of IEEE 2021 International Conference on Advances in Computing, Communication and Control (ICAC3'21) 7th Edition

  50. arXiv:2107.10243  [pdf, other]

    cs.CR cs.LG

    Federated Learning using Smart Contracts on Blockchains, based on Reward Driven Approach

    Authors: Monik Raj Behera, Sudhir Upadhyay, Suresh Shetty

    Abstract: Over the recent years, Federated machine learning continues to gain interest and momentum where there is a need to draw insights from data while preserving the data provider's privacy. However, one among other existing challenges in the adoption of federated learning has been the lack of fair, transparent and universally agreed incentivization schemes for rewarding the federated learning contribut…

    Submitted 25 March, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: 9 pages, 7 figures and 1 table