Skip to main content

Showing 1–5 of 5 results for author: Vashisht, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.04565  [pdf, other

    cs.LG cs.CR

    Private Federated Learning In Real World Application -- A Case Study

    Authors: An Ji, Bortik Bandyopadhyay, Congzheng Song, Natarajan Krishnaswami, Prabal Vashisht, Rigel Smiroldo, Isabel Litton, Sayantan Mahinder, Mona Chitnis, Andrew W Hill

    Abstract: This paper presents an implementation of machine learning model training using private federated learning (PFL) on edge devices. We introduce a novel framework that uses PFL to address the challenge of training a model using users' private data. The framework ensures that user data remain on individual devices, with only essential model updates transmitted to a central server for aggregation with… ▽ More

    Submitted 10 February, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

  2. arXiv:2409.04081  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity

    Authors: Yicheng Fu, Raviteja Anantha, Prabal Vashisht, Jianpeng Cheng, Etai Littwin

    Abstract: Generating user intent from a sequence of user interface (UI) actions is a core challenge in comprehensive UI understanding. Recent advancements in multimodal large language models (MLLMs) have led to substantial progress in this area, but their demands for extensive model parameters, computing power, and high latency makes them impractical for scenarios requiring lightweight, on-device solutions… ▽ More

    Submitted 2 October, 2024; v1 submitted 6 September, 2024; originally announced September 2024.

  3. arXiv:2404.17749  [pdf, other

    cs.AI cs.CL

    UMass-BioNLP at MEDIQA-M3G 2024: DermPrompt -- A Systematic Exploration of Prompt Engineering with GPT-4V for Dermatological Diagnosis

    Authors: Parth Vashisht, Abhilasha Lodha, Mukta Maddipatla, Zonghai Yao, Avijit Mitra, Zhichao Yang, Junda Wang, Sunjae Kwon, Hong Yu

    Abstract: This paper presents our team's participation in the MEDIQA-ClinicalNLP2024 shared task B. We present a novel approach to diagnosing clinical dermatology cases by integrating large multimodal models, specifically leveraging the capabilities of GPT-4V under a retriever and a re-ranker framework. Our investigation reveals that GPT-4V, when used as a retrieval agent, can accurately retrieve the correc… ▽ More

    Submitted 8 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL-ClinicalNLP workshop 2024

  4. arXiv:2402.13919  [pdf, other

    cs.CL cs.AI

    SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization

    Authors: Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, Hong Yu

    Abstract: Large Language Models (LLMs) such as GPT & Llama have demonstrated significant achievements in summarization tasks but struggle with factual inaccuracies, a critical issue in clinical NLP applications where errors could lead to serious consequences. To counter the high costs and limited availability of expert-annotated data for factual alignment, this study introduces an innovative pipeline that u… ▽ More

    Submitted 2 October, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Equal contribution for the first two authors; To appear in proceedings of the Main Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024

  5. arXiv:2303.00171  [pdf, other

    cs.LG cs.AI eess.AS

    DTW-SiameseNet: Dynamic Time Warped Siamese Network for Mispronunciation Detection and Correction

    Authors: Raviteja Anantha, Kriti Bhasin, Daniela de la Parra Aguilar, Prabal Vashisht, Becci Williamson, Srinivas Chappidi

    Abstract: Personal Digital Assistants (PDAs) - such as Siri, Alexa and Google Assistant, to name a few - play an increasingly important role to access information and complete tasks spanning multiple domains, and by diverse groups of users. A text-to-speech (TTS) module allows PDAs to interact in a natural, human-like manner, and play a vital role when the interaction involves people with visual impairments… ▽ More

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: Preprint version