Skip to main content

Showing 1–5 of 5 results for author: Eckstein, I

.
  1. arXiv:2109.00702  [pdf, other

    cs.CL

    ShopTalk: A System for Conversational Faceted Search

    Authors: Gurmeet Manku, James Lee-Thorp, Bhargav Kanagal, Joshua Ainslie, Jingchen Feng, Zach Pearson, Ebenezer Anjorin, Sudeep Gandhe, Ilya Eckstein, Jim Rosswog, Sumit Sanghai, Michael Pohl, Larry Adams, D. Sivakumar

    Abstract: We present ShopTalk, a multi-turn conversational faceted search system for shopping that is designed to handle large and complex schemas that are beyond the scope of state of the art slot-filling systems. ShopTalk decouples dialog management from fulfillment, thereby allowing the dialog understanding system to be domain-agnostic and not tied to the particular shopping application. The dialog under… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  2. arXiv:2105.04241  [pdf, other

    cs.CL cs.LG

    ReadTwice: Reading Very Large Documents with Memories

    Authors: Yury Zemlyanskiy, Joshua Ainslie, Michiel de Jong, Philip Pham, Ilya Eckstein, Fei Sha

    Abstract: Knowledge-intensive tasks such as question answering often require assimilating information from different sections of large inputs such as books or article collections. We propose ReadTwice, a simple and effective technique that combines several strengths of prior approaches to model long-range dependencies with Transformers. The main idea is to read text in small segments, in parallel, summarizi… ▽ More

    Submitted 11 May, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: To appear in the proceedings of NAACL 2021

  3. arXiv:2105.03824  [pdf, other

    cs.CL cs.LG

    FNet: Mixing Tokens with Fourier Transforms

    Authors: James Lee-Thorp, Joshua Ainslie, Ilya Eckstein, Santiago Ontanon

    Abstract: We show that Transformer encoder architectures can be sped up, with limited accuracy costs, by replacing the self-attention sublayers with simple linear transformations that "mix" input tokens. These linear mixers, along with standard nonlinearities in feed-forward layers, prove competent at modeling semantic relationships in several text classification tasks. Most surprisingly, we find that repla… ▽ More

    Submitted 26 May, 2022; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: To appear at NAACL 2022

  4. arXiv:2102.13247  [pdf, other

    cs.CL

    DOCENT: Learning Self-Supervised Entity Representations from Large Document Collections

    Authors: Yury Zemlyanskiy, Sudeep Gandhe, Ruining He, Bhargav Kanagal, Anirudh Ravula, Juraj Gottweis, Fei Sha, Ilya Eckstein

    Abstract: This paper explores learning rich self-supervised entity representations from large amounts of the associated text. Once pre-trained, these models become applicable to multiple entity-centric tasks such as ranked retrieval, knowledge base completion, question answering, and more. Unlike other methods that harvest self-supervision signals based merely on a local context within a sentence, we radica… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: To appear in the proceedings of EACL 2021

  5. arXiv:2009.01265  [pdf, ps, other

    cs.CR

    Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (version 1.0)

    Authors: Shailesh Bavadekar, Andrew Dai, John Davis, Damien Desfontaines, Ilya Eckstein, Katie Everett, Alex Fabrikant, Gerardo Flores, Evgeniy Gabrilovich, Krishna Gadepalli, Shane Glass, Rayman Huang, Chaitanya Kamath, Dennis Kraft, Akim Kumok, Hinali Marfatia, Yael Mayer, Benjamin Miller, Adam Pearce, Irippuge Milinda Perera, Venky Ramachandran, Karthik Raman, Thomas Roessler, Izhak Shafran, Tomer Shekel , et al. (5 additional authors not shown)

    Abstract: This report describes the aggregation and anonymization process applied to the initial version of COVID-19 Search Trends symptoms dataset (published at https://goo.gle/covid19symptomdataset on September 2, 2020), a publicly available dataset that shows aggregated, anonymized trends in Google searches for symptoms (and some related topics). The anonymization process is designed to protect the daily… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.