Skip to main content

Showing 1–9 of 9 results for author: Nayak, N V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.13344  [pdf, ps, other

    cs.LG cs.CL q-bio.BM

    K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction

    Authors: Tassallah Abdullahi, Ioanna Gemou, Nihal V. Nayak, Ghulam Murtaza, Stephen H. Bach, Carsten Eickhoff, Ritambhara Singh

    Abstract: Biomedical knowledge graphs (KGs) encode rich, structured information critical for drug discovery tasks, but extracting meaningful insights from large-scale KGs remains challenging due to their complex structure. Existing biomedical subgraph retrieval methods are tailored for graph neural networks (GNNs), limiting compatibility with other paradigms, including large language models (LLMs). We intro… ▽ More

    Submitted 28 May, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Journal ref: Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2 (KDD '25), August 3--7, 2025, Toronto, ON, Canada

  2. arXiv:2410.23261  [pdf, other

    cs.CL cs.LG

    $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources

    Authors: Apoorv Khandelwal, Tian Yun, Nihal V. Nayak, Jack Merullo, Stephen H. Bach, Chen Sun, Ellie Pavlick

    Abstract: Pre-training is notoriously compute-intensive and academic researchers are notoriously under-resourced. It is, therefore, commonly assumed that academics can't pre-train models. In this paper, we seek to clarify this assumption. We first survey academic researchers to learn about their available compute and then empirically measure the time to replicate models on such resources. We introduce a ben… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

  3. arXiv:2402.18334  [pdf, other

    cs.CL cs.LG

    Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation

    Authors: Nihal V. Nayak, Yiyang Nan, Avi Trost, Stephen H. Bach

    Abstract: We introduce Bonito, an open-source model for conditional task generation that converts unannotated text into task-specific training datasets for instruction tuning. We aim to enable zero-shot task adaptation of large language models on users' specialized, private data. We train Bonito by fine-tuning a pretrained large language model on a new large-scale dataset with 1.65M examples created by remi… ▽ More

    Submitted 11 September, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: ACL Findings 2024

  4. arXiv:2212.10537  [pdf, other

    cs.CV cs.AI cs.CL

    Does CLIP Bind Concepts? Probing Compositionality in Large Image Models

    Authors: Martha Lewis, Nihal V. Nayak, Peilin Yu, Qinan Yu, Jack Merullo, Stephen H. Bach, Ellie Pavlick

    Abstract: Large-scale neural network models combining text and images have made incredible progress in recent years. However, it remains an open question to what extent such models encode compositional representations of the concepts over which they operate, such as correctly identifying "red cube" by reasoning over the constituents "red" and "cube". In this work, we focus on the ability of a large pretrain… ▽ More

    Submitted 30 August, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Lewis and Nayak contributed equally

    Journal ref: In Findings of the Association for Computational Linguistics, EACL 2024, pages 1487 - 1500, Malta. Association for Computational Linguistics

  5. arXiv:2210.00064  [pdf, other

    cs.LG

    CEREAL: Few-Sample Clustering Evaluation

    Authors: Nihal V. Nayak, Ethan R. Elenberg, Clemens Rosenbaum

    Abstract: Evaluating clustering quality with reliable evaluation metrics like normalized mutual information (NMI) requires labeled data that can be expensive to annotate. We focus on the underexplored problem of estimating clustering quality with limited labels. We adapt existing approaches from the few-sample model evaluation literature to actively sub-sample, with a learned surrogate model, the most infor… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

  6. arXiv:2204.03574  [pdf, other

    cs.LG cs.CL cs.CV

    Learning to Compose Soft Prompts for Compositional Zero-Shot Learning

    Authors: Nihal V. Nayak, Peilin Yu, Stephen H. Bach

    Abstract: We introduce compositional soft prompting (CSP), a parameter-efficient learning technique to improve the zero-shot compositionality of large-scale pretrained vision-language models (VLMs) like CLIP. We develop CSP for compositional zero-shot learning, the task of predicting unseen attribute-object compositions (e.g., old cat and young tiger). VLMs have a flexible text encoder that can represent ar… ▽ More

    Submitted 24 April, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: ICLR 2023

  7. arXiv:2202.01279  [pdf, other

    cs.LG cs.CL

    PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

    Authors: Stephen H. Bach, Victor Sanh, Zheng-Xin Yong, Albert Webson, Colin Raffel, Nihal V. Nayak, Abheesht Sharma, Taewoon Kim, M Saiful Bari, Thibault Fevry, Zaid Alyafeai, Manan Dey, Andrea Santilli, Zhiqing Sun, Srulik Ben-David, Canwen Xu, Gunjan Chhablani, Han Wang, Jason Alan Fries, Maged S. Al-shaibani, Shanya Sharma, Urmish Thakker, Khalid Almubarak, Xiangru Tang, Dragomir Radev , et al. (2 additional authors not shown)

    Abstract: PromptSource is a system for creating, sharing, and using natural language prompts. Prompts are functions that map an example from a dataset to a natural language input and target output. Using prompts to train and query language models is an emerging area in NLP that requires new tools that let users develop and refine these prompts collaboratively. PromptSource addresses the emergent challenges… ▽ More

    Submitted 29 March, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: ACL 2022 Demo

  8. arXiv:2111.04798  [pdf, other

    cs.LG cs.CV

    TAGLETS: A System for Automatic Semi-Supervised Learning with Auxiliary Data

    Authors: Wasu Piriyakulkij, Cristina Menghini, Ross Briden, Nihal V. Nayak, Jeffrey Zhu, Elaheh Raisi, Stephen H. Bach

    Abstract: Machine learning practitioners often have access to a spectrum of data: labeled data for the target task (which is often limited), unlabeled data, and auxiliary data, the many available labeled datasets for other tasks. We describe TAGLETS, a system built to study techniques for automatically exploiting all three types of data and creating high-quality, servable classifiers. The key components of… ▽ More

    Submitted 5 May, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Paper published at MLSys 2022. It passed the artifact evaluation earning two ACM badges: (1) Artifacts Evaluated Functional v1.1 and (2) Artifacts Available v1.1

  9. arXiv:2006.10713  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Zero-Shot Learning with Common Sense Knowledge Graphs

    Authors: Nihal V. Nayak, Stephen H. Bach

    Abstract: Zero-shot learning relies on semantic class representations such as hand-engineered attributes or learned embeddings to predict classes without any labeled examples. We propose to learn class representations by embedding nodes from common sense knowledge graphs in a vector space. Common sense knowledge graphs are an untapped source of explicit high-level knowledge that requires little human effort… ▽ More

    Submitted 25 August, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Paper published in TMLR