
Showing 1–6 of 6 results for author: Awasthy, P

  1. arXiv:2502.20204  [pdf, other]

    cs.IR cs.CL

    Granite Embedding Models

    Authors: Parul Awasthy, Aashka Trivedi, Yulong Li, Mihaela Bornea, David Cox, Abraham Daniels, Martin Franz, Gabe Goodhart, Bhavani Iyer, Vishwajeet Kumar, Luis Lastras, Scott McCarley, Rudra Murthy, Vignesh P, Sara Rosenthal, Salim Roukos, Jaydeep Sen, Sukriti Sharma, Avirup Sil, Kate Soule, Arafat Sultan, Radu Florian

    Abstract: We introduce the Granite Embedding models, a family of encoder-based embedding models designed for retrieval tasks, spanning dense-retrieval and sparse retrieval architectures, with both English and Multilingual capabilities. This report provides the technical details of training these highly effective 12 layer embedding models, along with their efficient 6 layer distilled counterparts. Extensive…

    Submitted 27 February, 2025; originally announced February 2025.

  2. arXiv:2401.06356  [pdf, other]

    cs.LG

    An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation

    Authors: Md Arafat Sultan, Aashka Trivedi, Parul Awasthy, Avirup Sil

    Abstract: We present a large-scale empirical study of how choices of configuration parameters affect performance in knowledge distillation (KD). An example of such a KD parameter is the measure of distance between the predictions of the teacher and the student, common choices for which include the mean squared error (MSE) and the KL-divergence. Although scattered efforts have been made to understand the dif…

    Submitted 18 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  3. arXiv:2010.08652  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Cross-Lingual Relation Extraction with Transformers

    Authors: Jian Ni, Taesun Moon, Parul Awasthy, Radu Florian

    Abstract: Relation extraction (RE) is one of the most important tasks in information extraction, as it provides essential information for many NLP applications. In this paper, we propose a cross-lingual RE approach that does not require any human annotation in a target language or any cross-lingual resources. Building upon unsupervised cross-lingual representation learning frameworks, we develop several dee…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 11 pages

  4. arXiv:2009.07317  [pdf, other]

    cs.CL

    Cascaded Models for Better Fine-Grained Named Entity Recognition

    Authors: Parul Awasthy, Taesun Moon, Jian Ni, Radu Florian

    Abstract: Named Entity Recognition (NER) is an essential precursor task for many natural language applications, such as relation extraction or event extraction. Much of the NER research has been done on datasets with few classes of entity types (e.g. PER, LOC, ORG, MISC), but many real world applications (disaster relief, complex event extraction, law enforcement) can benefit from a larger NER typeset. More…

    Submitted 15 September, 2020; originally announced September 2020.

  5. arXiv:2009.07188  [pdf, other]

    cs.CL

    Event Presence Prediction Helps Trigger Detection Across Languages

    Authors: Parul Awasthy, Tahira Naseem, Jian Ni, Taesun Moon, Radu Florian

    Abstract: The task of event detection and classification is central to most information retrieval applications. We show that a Transformer based architecture can effectively model event extraction as a sequence labeling task. We propose a combination of sentence level and token level training objectives that significantly boosts the performance of a BERT based event extraction model. Our approach achieves a…

    Submitted 15 September, 2020; originally announced September 2020.

  6. arXiv:1912.01389  [pdf, other]

    cs.CL cs.LG stat.ML

    Towards Lingua Franca Named Entity Recognition with BERT

    Authors: Taesun Moon, Parul Awasthy, Jian Ni, Radu Florian

    Abstract: Information extraction is an important task in NLP, enabling the automatic extraction of data for relational database filling. Historically, research and data was produced for English text, followed in subsequent years by datasets in Arabic, Chinese (ACE/OntoNotes), Dutch, Spanish, German (CoNLL evaluations), and many others. The natural tendency has been to treat each language as a different data…

    Submitted 12 December, 2019; v1 submitted 19 November, 2019; originally announced December 2019.