Showing 1–1 of 1 results for author: Pandeshwar, G

  1. arXiv:2410.00260

    cs.CL cs.AI cs.LG

    DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining

    Authors: Vinayak Arannil, Neha Narwal, Sourav Sanjukta Bhabesh, Sai Nikhil Thirandas, Darren Yow-Bang Wang, Graham Horwood, Alex Anto Chirayath, Gouri Pandeshwar

    Abstract: Large Language Models (LLMs) have shown remarkable ability to generalize effectively across numerous industry domains while executing a range of tasks. Many of these competencies are obtained from the data utilized during the pre-training phase of the Language Models (LMs). However, these models exhibit limitations when tasked with performing in specialized or low-resource industry domains. More r…

    Submitted 9 October, 2024; v1 submitted 30 September, 2024; originally announced October 2024.