Showing 1–14 of 14 results for author: Gandhi, N

  1. arXiv:2504.12495  [pdf, other]

    cs.CL cs.CY

    Beyond Text: Characterizing Domain Expert Needs in Document Research

    Authors: Sireesh Gururaja, Nupoor Gandhi, Jeremiah Milbauer, Emma Strubell

    Abstract: Working with documents is a key part of almost any knowledge work, from contextualizing research in a literature review to reviewing legal precedent. Recently, as their capabilities have expanded, primarily text-based NLP systems have often been billed as able to assist or even automate this kind of work. But to what extent are these systems able to model these tasks as experts conceptualize and p…

    Submitted 16 April, 2025; originally announced April 2025.

  2. arXiv:2503.06732  [pdf, other]

    cs.LG

    Data Efficient Subset Training with Differential Privacy

    Authors: Ninad Jayesh Gandhi, Moparthy Venkata Subrahmanya Sri Harsha

    Abstract: Private machine learning introduces a trade-off between the privacy budget and training performance. Training convergence is substantially slower and extensive hyperparameter tuning is required. Consequently, efficient methods to conduct private training of models are thoroughly investigated in the literature. To this end, we investigate the strength of the data efficient model training methods in…

    Submitted 9 March, 2025; originally announced March 2025.

  3. arXiv:2410.08327  [pdf, other]

    cs.CL

    Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains

    Authors: Krithika Ramesh, Nupoor Gandhi, Pulkit Madaan, Lisa Bauer, Charith Peris, Anjalie Field

    Abstract: The difficulty of anonymizing text data hinders the development and deployment of NLP in high-stakes domains that involve private data, such as healthcare and social services. Poorly anonymized sensitive data cannot be easily shared with annotators or external researchers, nor can it be used to train public models. In this work, we explore the feasibility of using synthetic data generated from dif…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP 2024 (Findings)

  4. arXiv:2401.15075  [pdf, other]

    cs.CV cs.AI cs.GR

    Annotated Hands for Generative Models

    Authors: Yue Yang, Atith N Gandhi, Greg Turk

    Abstract: Generative models such as GANs and diffusion models have demonstrated impressive image generation capabilities. Despite these successes, these systems are surprisingly poor at creating images with hands. We propose a novel training framework for generative models that substantially improves the ability of such systems to create hand images. Our approach is to augment the training images with three…

    Submitted 26 January, 2024; originally announced January 2024.

  5. Machine Vision Using Cellphone Camera: A Comparison of deep networks for classifying three challenging denominations of Indian Coins

    Authors: Keyur D. Joshi, Dhruv Shah, Varshil Shah, Nilay Gandhi, Sanket J. Shah, Sanket B. Shah

    Abstract: Indian currency coins come in a variety of denominations. Of all the varieties, Rs.1, Rs.2, and Rs.5 have similar diameters. The majority of the coin styles in market circulation for denominations of Rs.1 and Rs.2 are nearly the same except for the numerals on their reverse side. If a coin is resting on its obverse side, the correct denomination is not distinguishable by humans. Therefore, it was hypo…

    Submitted 12 May, 2023; originally announced June 2023.

    Comments: 6 Pages, 4 Figures, 6 Tables, Conference paper

  6. Examining risks of racial biases in NLP tools for child protective services

    Authors: Anjalie Field, Amanda Coston, Nupoor Gandhi, Alexandra Chouldechova, Emily Putnam-Hornstein, David Steier, Yulia Tsvetkov

    Abstract: Although much literature has established the presence of demographic bias in natural language processing (NLP) models, most work relies on curated bias metrics that may not be reflective of real-world applications. At the same time, practitioners are increasingly using algorithmic tools in high-stakes settings, with particular recent interest in NLP. In this work, we focus on one such setting: chi…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: In 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23)

  7. arXiv:2210.07602  [pdf, other]

    cs.CL

    Mention Annotations Alone Enable Efficient Domain Adaptation for Coreference Resolution

    Authors: Nupoor Gandhi, Anjalie Field, Emma Strubell

    Abstract: Although recent neural models for coreference resolution have led to substantial improvements on benchmark datasets, transferring these models to new target domains containing out-of-vocabulary spans and requiring differing annotation schemes remains challenging. Typical approaches involve continued training on annotated target-domain data, but obtaining annotations is costly and time-consuming. W…

    Submitted 30 May, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  8. arXiv:2109.09811  [pdf, other]

    cs.LG cs.CL

    Improving Span Representation for Domain-adapted Coreference Resolution

    Authors: Nupoor Gandhi, Anjalie Field, Yulia Tsvetkov

    Abstract: Recent work has shown fine-tuning neural coreference models can produce strong performance when adapting to different domains. However, at the same time, this can require a large amount of annotated target examples. In this work, we focus on supervised domain adaptation for clinical notes, proposing the use of concept knowledge to more efficiently adapt coreference models to a new domain. We devel…

    Submitted 20 September, 2021; originally announced September 2021.

  9. arXiv:2106.09461  [pdf]

    cs.LG cs.AI cs.MA cs.RO eess.SY

    Modelling resource allocation in uncertain system environment through deep reinforcement learning

    Authors: Neel Gandhi, Shakti Mishra

    Abstract: Reinforcement learning has applications in the fields of mechatronics, robotics, and other resource-constrained control systems. The problem of resource allocation is primarily solved using traditional predefined techniques and modern deep learning methods. The drawback of predefined and most deep learning methods for resource allocation is that they fail to meet the requirements in cases of uncertain system envir…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted at IRMAS'21

  10. DeepGamble: Towards unlocking real-time player intelligence using multi-layer instance segmentation and attribute detection

    Authors: Danish Syed, Naman Gandhi, Arushi Arora, Nilesh Kadam

    Abstract: Annually the gaming industry spends approximately $15 billion in marketing reinvestment. However, this amount is spent without any consideration for the skill and luck of the player. For a casino, an unskilled player could fetch ~4 times more revenue than a skilled player. This paper describes a video recognition system that is based on an extension of the Mask R-CNN model. Our system digitizes th…

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)

  11. arXiv:2010.14027  [pdf, other]

    cs.DC cs.PF

    EdgeBench: A Workflow-based Benchmark for Edge Computing

    Authors: Qirui Yang, Runyu Jin, Nabil Gandhi, Xiongzi Ge, Hoda Aghaei Khouzani, Ming Zhao

    Abstract: Edge computing has been developed to utilize multiple tiers of resources for privacy, cost and Quality of Service (QoS) reasons. Edge workloads are characteristically data-driven and latency-sensitive. Because of this, edge systems have developed to be both heterogeneous and distributed. The unique characteristics of edge workloads and edge systems have motivated EdgeBench, a workflow-based b…

    Submitted 26 October, 2020; originally announced October 2020.

  12. arXiv:2005.02124  [pdf]

    eess.SP cs.IT

    Realization of MIMO Channel Model for Spatial Diversity with Capacity and SNR Multiplexing Gains

    Authors: Subrato Bharati, Prajoy Podder, Niketa Gandhi, Ajith Abraham

    Abstract: Multiple input multiple output (MIMO) system transmission is a popular diversity technique to improve the reliability of a communication system where transmitter, communication channel and receiver are the important elements. Data transmission reliability can be ensured when the bit error rate is very low. Normally, multiple antenna elements are used at both the transmitting and receiving section…

    Submitted 24 April, 2020; originally announced May 2020.

    Comments: 16 pages, 13 figures

    Journal ref: International Journal of Computer Information Systems and Industrial Management Applications ISSN: 2150-7988, Volume 12, 2020

  13. arXiv:1910.09324  [pdf, other]

    cs.IR cs.LG

    Multi-dimensional Features for Prediction with Tweets

    Authors: Nupoor Gandhi, Alex Morales, Dolores Albarracin

    Abstract: With the rise of opioid abuse in the US, there has been a growth of overlapping hotspots for overdose-related and HIV-related deaths in Springfield, Boston, Fall River, New Bedford, and parts of Cape Cod. With a large part of the population, including rural communities, active on social media, it is crucial that we leverage the predictive power of social media as a preventive measure. We explore the p…

    Submitted 15 October, 2019; originally announced October 2019.

  14. Increase Apparent Public Speaking Fluency By Speech Augmentation

    Authors: Sagnik Das, Nisha Gandhi, Tejas Naik, Roy Shilkrot

    Abstract: Fluent and confident speech is desirable to every speaker. But professional speech delivery requires a great deal of experience and practice. In this paper, we propose a speech stream manipulation system which can help non-professional speakers to produce fluent, professional-like speech content, in turn contributing towards better listener engagement and comprehension. We propose to achieve thi…

    Submitted 3 August, 2019; v1 submitted 8 December, 2018; originally announced December 2018.

    Journal ref: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)