Skip to main content

Showing 1–2 of 2 results for author: Cerqueira, R

Searching in archive physics. Search in all archives.
.
  1. arXiv:2407.20267  [pdf, other

    cs.LG cs.AI physics.chem-ph

    A Large Encoder-Decoder Family of Foundation Models For Chemical Language

    Authors: Eduardo Soares, Victor Shirasuna, Emilio Vital Brazil, Renato Cerqueira, Dmitry Zubarev, Kristin Schmidt

    Abstract: Large-scale pre-training methodologies for chemical language models represent a breakthrough in cheminformatics. These methods excel in tasks such as property prediction and molecule generation by learning contextualized representations of input tokens through self-supervised learning on large unlabeled corpora. Typically, this involves pre-training on unlabeled data followed by fine-tuning on spe… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 14 pages, 3 figures, 14 tables

  2. arXiv:2306.14919  [pdf, other

    physics.chem-ph cs.LG q-bio.QM

    Beyond Chemical Language: A Multimodal Approach to Enhance Molecular Property Prediction

    Authors: Eduardo Soares, Emilio Vital Brazil, Karen Fiorela Aquino Gutierrez, Renato Cerqueira, Dan Sanders, Kristin Schmidt, Dmitry Zubarev

    Abstract: We present a novel multimodal language model approach for predicting molecular properties by combining chemical language representation with physicochemical features. Our approach, MULTIMODAL-MOLFORMER, utilizes a causal multistage feature selection method that identifies physicochemical features based on their direct causal effect on a specific target property. These causal features are then inte… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 14 pages, 6 Figures, 5 tables. Submited to NEURIPS 2023, Under review

    ACM Class: J.2; I.2.1