Skip to main content

Showing 1–8 of 8 results for author: Lovelace, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.15962  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Pre-training Large Memory Language Models with Internal and External Knowledge

    Authors: Linxi Zhao, Sofian Zalouk, Christian K. Belardi, Justin Lovelace, Jin Peng Zhou, Kilian Q. Weinberger, Yoav Artzi, Jennifer J. Sun

    Abstract: Neural language models are black-boxes -- both linguistic patterns and factual knowledge are distributed across billions of opaque parameters. This entangled encoding makes it difficult to reliably inspect, verify, or update specific facts. We propose a new class of language models, Large Memory Language Models (LMLM) with a pre-training recipe that stores factual knowledge in both internal weight… ▽ More

    Submitted 2 July, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Code, models, and data available at https://github.com/kilian-group/LMLM

  2. arXiv:2409.03717  [pdf, other

    cs.SD cs.AI cs.LG

    Sample-Efficient Diffusion for Text-To-Speech Synthesis

    Authors: Justin Lovelace, Soham Ray, Kwangyoun Kim, Kilian Q. Weinberger, Felix Wu

    Abstract: This work introduces Sample-Efficient Speech Diffusion (SESD), an algorithm for effective speech synthesis in modest data regimes through latent diffusion. It is based on a novel diffusion architecture, that we call U-Audio Transformer (U-AT), that efficiently scales to long sequences and operates in the latent space of a pre-trained audio autoencoder. Conditioned on character-aware language model… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: Interspeech 2024

  3. arXiv:2408.04220  [pdf, other

    cs.CL cs.LG

    Diffusion Guided Language Modeling

    Authors: Justin Lovelace, Varsha Kishore, Yiwei Chen, Kilian Q. Weinberger

    Abstract: Current language models demonstrate remarkable proficiency in text generation. However, for many applications it is desirable to control attributes, such as sentiment, or toxicity, of the generated language -- ideally tailored towards each specific use case and target audience. For auto-regressive language models, existing guidance methods are prone to decoding errors that cascade during generatio… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: ACL Findings 2024

  4. arXiv:2307.10323  [pdf, other

    cs.IR cs.CL cs.LG

    IncDSI: Incrementally Updatable Document Retrieval

    Authors: Varsha Kishore, Chao Wan, Justin Lovelace, Yoav Artzi, Kilian Q. Weinberger

    Abstract: Differentiable Search Index is a recently proposed paradigm for document retrieval, that encodes information about a corpus of documents within the parameters of a neural network and directly maps queries to corresponding documents. These models have achieved state-of-the-art performances for document retrieval across many benchmarks. These kinds of models have a significant limitation: it is not… ▽ More

    Submitted 19 August, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

  5. arXiv:2212.09462  [pdf, other

    cs.CL cs.LG

    Latent Diffusion for Language Generation

    Authors: Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman, Kilian Q. Weinberger

    Abstract: Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have presented diffusion as an alternative to existing pretrained language models. We view diffusion and existing language models as complementary. We demonstrate that enc… ▽ More

    Submitted 7 November, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023

  6. arXiv:2106.06555  [pdf, other

    cs.LG

    Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network

    Authors: Justin Lovelace, Denis Newman-Griffis, Shikhar Vashishth, Jill Fain Lehman, Carolyn Penstein Rosé

    Abstract: Knowledge Graph (KG) completion research usually focuses on densely connected benchmark datasets that are not representative of real KGs. We curate two KG datasets that include biomedical and encyclopedic knowledge and use an existing commonsense KG dataset to explore KG completion in the more realistic setting where dense connectivity is not guaranteed. We develop a deep convolutional network tha… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

  7. arXiv:2008.01197  [pdf, other

    cs.IR cs.LG

    Dynamically Extracting Outcome-Specific Problem Lists from Clinical Notes with Guided Multi-Headed Attention

    Authors: Justin Lovelace, Nathan C. Hurley, Adrian D. Haimovich, Bobak J. Mortazavi

    Abstract: Problem lists are intended to provide clinicians with a relevant summary of patient medical issues and are embedded in many electronic health record systems. Despite their importance, problem lists are often cluttered with resolved or currently irrelevant conditions. In this work, we develop a novel end-to-end framework that first extracts diagnosis and procedure information from clinical notes an… ▽ More

    Submitted 25 July, 2020; originally announced August 2020.

    Comments: To appear in the proceedings of the Machine Learning for Healthcare Conference (MLHC) 2020. Accepted papers can be viewed at https://www.mlforhc.org/accepted-papers

  8. arXiv:1910.14095  [pdf, other

    cs.LG stat.ML

    Explainable Prediction of Adverse Outcomes Using Clinical Notes

    Authors: Justin R. Lovelace, Nathan C. Hurley, Adrian D. Haimovich, Bobak J. Mortazavi

    Abstract: Clinical notes contain a large amount of clinically valuable information that is ignored in many clinical decision support systems due to the difficulty that comes with mining that information. Recent work has found success leveraging deep learning models for the prediction of clinical outcomes using clinical notes. However, these models fail to provide clinically relevant and interpretable inform… ▽ More

    Submitted 12 November, 2019; v1 submitted 30 October, 2019; originally announced October 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract