Skip to main content

Showing 1–3 of 3 results for author: Hall, K B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.11755  [pdf, other

    cs.CL cs.IR

    Promptagator: Few-shot Dense Retrieval From 8 Examples

    Authors: Zhuyun Dai, Vincent Y. Zhao, Ji Ma, Yi Luan, Jianmo Ni, Jing Lu, Anton Bakalov, Kelvin Guu, Keith B. Hall, Ming-Wei Chang

    Abstract: Much recent research on information retrieval has focused on how to transfer from one task (typically with abundant supervised data) to various other tasks where supervision is limited, with the implicit assumption that it is possible to generalize from one task to all the rest. However, this overlooks the fact that there are many diverse and unique retrieval tasks, each targeting different search… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

  2. arXiv:2112.07899  [pdf, other

    cs.IR cs.CL

    Large Dual Encoders Are Generalizable Retrievers

    Authors: Jianmo Ni, Chen Qu, Jing Lu, Zhuyun Dai, Gustavo Hernández Ábrego, Ji Ma, Vincent Y. Zhao, Yi Luan, Keith B. Hall, Ming-Wei Chang, Yinfei Yang

    Abstract: It has been shown that dual encoders trained on one domain often fail to generalize to other domains for retrieval tasks. One widespread belief is that the bottleneck layer of a dual encoder, where the final score is simply a dot-product between a query vector and a passage vector, is too limited to make dual encoders an effective retrieval model for out-of-domain generalization. In this paper, we… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  3. arXiv:2108.08877  [pdf, other

    cs.CL

    Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models

    Authors: Jianmo Ni, Gustavo Hernández Ábrego, Noah Constant, Ji Ma, Keith B. Hall, Daniel Cer, Yinfei Yang

    Abstract: We provide the first exploration of sentence embeddings from text-to-text transformers (T5). Sentence embeddings are broadly useful for language processing tasks. While T5 achieves impressive performance on language tasks cast as sequence-to-sequence mapping problems, it is unclear how to produce sentence embeddings from encoder-decoder models. We investigate three methods for extracting T5 senten… ▽ More

    Submitted 14 December, 2021; v1 submitted 19 August, 2021; originally announced August 2021.