Showing 1–17 of 17 results for author: Brazil, E V

Searching in archive cs.
  1. arXiv:2502.01774  [pdf, other]

    cs.LG cs.AI

    Grokking Explained: A Statistical Phenomenon

    Authors: Breno W. Carvalho, Artur S. d'Avila Garcez, Luís C. Lamb, Emílio Vital Brazil

    Abstract: Grokking, or delayed generalization, is an intriguing learning phenomenon where test set loss decreases sharply only after a model's training set loss has converged. This challenges conventional understanding of the training dynamics in deep learning networks. In this paper, we formalize and investigate grokking, highlighting that a key factor in its emergence is a distribution shift between train…

    Submitted 3 February, 2025; originally announced February 2025.

  2. arXiv:2410.12348  [pdf, other]

    cs.CE

    SELF-BART : A Transformer-based Molecular Representation Model using SELFIES

    Authors: Indra Priyadarsini, Seiji Takeda, Lisa Hamada, Emilio Vital Brazil, Eduardo Soares, Hajime Shinohara

    Abstract: Large-scale molecular representation methods have revolutionized applications in material science, such as drug discovery, chemical modeling, and material design. With the rise of transformers, models now learn representations directly from molecular structures. In this study, we develop an encoder-decoder model based on BART that is capable of learning molecular representations and generating new mo…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: NeurIPS AI4Mat 2024

  3. arXiv:2407.20267  [pdf, other]

    cs.LG cs.AI physics.chem-ph

    A Large Encoder-Decoder Family of Foundation Models For Chemical Language

    Authors: Eduardo Soares, Victor Shirasuna, Emilio Vital Brazil, Renato Cerqueira, Dmitry Zubarev, Kristin Schmidt

    Abstract: Large-scale pre-training methodologies for chemical language models represent a breakthrough in cheminformatics. These methods excel in tasks such as property prediction and molecule generation by learning contextualized representations of input tokens through self-supervised learning on large unlabeled corpora. Typically, this involves pre-training on unlabeled data followed by fine-tuning on spe…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 14 pages, 3 figures, 14 tables

  4. arXiv:2310.13802  [pdf, other]

    cs.LG cs.AI cs.CE q-bio.QM

    Improving Molecular Properties Prediction Through Latent Space Fusion

    Authors: Eduardo Soares, Akihiro Kishimoto, Emilio Vital Brazil, Seiji Takeda, Hiroshi Kajino, Renato Cerqueira

    Abstract: Pre-trained Language Models have emerged as promising tools for predicting molecular properties, yet their development is in its early stages, necessitating further research to enhance their efficacy and address challenges such as generalization and sample efficiency. In this paper, we present a multi-view approach that combines latent spaces derived from state-of-the-art chemical models. Our appr…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures. Submitted to the AI4Science Workshop, NeurIPS 2023

  5. arXiv:2308.12152  [pdf, other]

    cs.GR cs.CG

    Geo-Sketcher: Rapid 3D Geological Modeling using Geological and Topographic Map Sketches

    Authors: Ronan Amorim, Emilio Vital Brazil, Faramarz Samavati, Mario Costa Sousa

    Abstract: The construction of 3D geological models is an essential task in oil/gas exploration, development and production. However, it is a cumbersome, time-consuming and error-prone task mainly because of the model's geometric and topological complexity. Model construction is usually separated into interpretation and 3D modeling, performed by different highly specialized individuals, which leads to i…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 21 pages, 30 Figures

  6. arXiv:2306.14919  [pdf, other]

    physics.chem-ph cs.LG q-bio.QM

    Beyond Chemical Language: A Multimodal Approach to Enhance Molecular Property Prediction

    Authors: Eduardo Soares, Emilio Vital Brazil, Karen Fiorela Aquino Gutierrez, Renato Cerqueira, Dan Sanders, Kristin Schmidt, Dmitry Zubarev

    Abstract: We present a novel multimodal language model approach for predicting molecular properties by combining chemical language representation with physicochemical features. Our approach, MULTIMODAL-MOLFORMER, utilizes a causal multistage feature selection method that identifies physicochemical features based on their direct causal effect on a specific target property. These causal features are then inte…

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 14 pages, 6 figures, 5 tables. Submitted to NeurIPS 2023, under review

    ACM Class: J.2; I.2.1

  7. arXiv:2305.07530  [pdf, other]

    cs.HC

    Retrospective End-User Walkthrough: A Method for Assessing How People Combine Multiple AI Models in Decision-Making Systems

    Authors: Vagner Figueredo de Santana, Larissa Monteiro Da Fonseca Galeno, Emilio Vital Brazil, Aliza Heching, Renato Cerqueira

    Abstract: Evaluating human-AI decision-making systems is an emerging challenge as new ways of combining multiple AI models towards a specific goal are proposed every day. As humans interact with AI in decision-making systems, multiple factors may be present in a task including trust, interpretability, and explainability, amongst others. In this context, this work proposes a retrospective method to support a…

    Submitted 12 May, 2023; originally announced May 2023.

  8. arXiv:2303.05545  [pdf, other]

    cs.LG

    Position Paper on Dataset Engineering to Accelerate Science

    Authors: Emilio Vital Brazil, Eduardo Soares, Lucas Villa Real, Leonardo Azevedo, Vinicius Segura, Luiz Zerkowski, Renato Cerqueira

    Abstract: Data is a critical element in any discovery process. In the last decades, we observed exponential growth in the volume of available data and the technology to manipulate it. However, data is only practical when one can structure it for a well-defined task. For instance, we need a corpus of text broken into sentences to train a natural language machine-learning model. In this work, we will use the…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Published at 2nd Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE) https://ai-2-ase.github.io/papers/16%5cSubmission%5cAAAI_Dataset_Engineering-8.pdf

  9. arXiv:2303.05288  [pdf, other]

    cs.AI cs.MA

    Knowledge-augmented Risk Assessment (KaRA): a hybrid-intelligence framework for supporting knowledge-intensive risk assessment of prospect candidates

    Authors: Carlos Raoni Mendes, Emilio Vital Brazil, Vinicius Segura, Renato Cerqueira

    Abstract: Evaluating the potential of a prospective candidate is a common task in multiple decision-making processes in different industries. We refer to a prospect as something or someone that could potentially produce positive results in a given context, e.g., an area where an oil company could find oil, a compound that, when synthesized, results in a material with required properties, and so on. In many…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Published at 2nd Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE) https://ai-2-ase.github.io/papers/17%5cSubmission%5cKaRA_AI2ASE_Workshop-3.pdf. arXiv admin note: text overlap with arXiv:2211.04257

  10. arXiv:2211.04257  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Toward Human-AI Co-creation to Accelerate Material Discovery

    Authors: Dmitry Zubarev, Carlos Raoni Mendes, Emilio Vital Brazil, Renato Cerqueira, Kristin Schmidt, Vinicius Segura, Juliana Jansen Ferreira, Dan Sanders

    Abstract: There is an increasing need in our society to achieve faster advances in Science to tackle urgent problems, such as climate changes, environmental hazards, sustainable energy systems, pandemics, among others. In certain domains like chemistry, scientific discovery carries the extra burden of assessing risks of the proposed novel solutions before moving to the experimental stage. Despite several re…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: 9 pages, 5 figures, NeurIPS 2022 WS: AI4Science

  11. arXiv:2010.00330  [pdf, other]

    cs.DB cs.AI cs.DC cs.LG

    Workflow Provenance in the Lifecycle of Scientific Machine Learning

    Authors: Renan Souza, Leonardo G. Azevedo, Vítor Lourenço, Elton Soares, Raphael Thiago, Rafael Brandão, Daniel Civitarese, Emilio Vital Brazil, Marcio Moreno, Patrick Valduriez, Marta Mattoso, Renato Cerqueira, Marco A. S. Netto

    Abstract: Machine Learning (ML) has already fundamentally changed several businesses. More recently, it has also been profoundly impacting the computational science and engineering domains, like geoscience, climate science, and health science. In these domains, users need to perform comprehensive data analyses combining scientific data and ML models to provide for critical requirements, such as reproducibil…

    Submitted 25 August, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

    Comments: 21 pages, 10 figures, text overlap with arXiv:1910.04223, a workshop paper being extended in this journal paper

    MSC Class: 65Y05; 68P15 ACM Class: I.2; H.2; C.4; J.2

    Journal ref: Concurrency Computation Practice Experience. 2021;e6544

  12. arXiv:1910.04223  [pdf, other]

    cs.DC cs.DB cs.LG

    Provenance Data in the Machine Learning Lifecycle in Computational Science and Engineering

    Authors: Renan Souza, Leonardo Azevedo, Vítor Lourenço, Elton Soares, Raphael Thiago, Rafael Brandão, Daniel Civitarese, Emilio Vital Brazil, Marcio Moreno, Patrick Valduriez, Marta Mattoso, Renato Cerqueira, Marco A. S. Netto

    Abstract: Machine Learning (ML) has become essential in several industries. In Computational Science and Engineering (CSE), the complexity of the ML lifecycle comes from the large variety of data, scientists' expertise, tools, and workflows. If data are not tracked properly during the lifecycle, it becomes unfeasible to recreate an ML model from scratch or to explain to stakeholders how it was created. The m…

    Submitted 21 October, 2019; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: 10 pages, 7 figures, Accepted at Workflows in Support of Large-scale Science (WORKS) co-located with the ACM/IEEE International Conference for High Performance Computing, Networking, Storage, and Analysis (SC) 2019, Denver, Colorado

    MSC Class: 65Y05; 68P15 ACM Class: I.2; H.2; C.4; J.2

  13. arXiv:1905.04307  [pdf, other]

    eess.IV cs.LG

    Semantic Segmentation of Seismic Images

    Authors: Daniel Civitarese, Daniela Szwarcman, Emilio Vital Brazil, Bianca Zadrozny

    Abstract: Almost all work to understand Earth's subsurface on a large scale relies on the interpretation of seismic surveys by experts who segment the survey (usually a cube) into layers; a process that is very time demanding. In this paper, we present a new deep neural network architecture specially designed to semantically segment seismic images with a minimal amount of training data. To achieve this, we…

    Submitted 10 May, 2019; originally announced May 2019.

    Comments: 7 pages, 5 figures

  14. arXiv:1904.00770  [pdf, other]

    cs.LG cs.CV physics.geo-ph stat.ML

    Netherlands Dataset: A New Public Dataset for Machine Learning in Seismic Interpretation

    Authors: Reinaldo Mozart Silva, Lais Baroni, Rodrigo S. Ferreira, Daniel Civitarese, Daniela Szwarcman, Emilio Vital Brazil

    Abstract: Machine learning and, more specifically, deep learning algorithms have seen remarkable growth in their popularity and usefulness in the last years. This is arguably due to three main factors: powerful computers, new techniques to train deeper networks and larger datasets. Although the first two are readily available in modern computers and ML libraries, the last one remains a challenge for many do…

    Submitted 26 March, 2019; originally announced April 2019.

    Comments: 5 pages, 5 figures

  15. arXiv:1903.12060  [pdf, other]

    physics.geo-ph cs.DL

    Penobscot Dataset: Fostering Machine Learning Development for Seismic Interpretation

    Authors: Lais Baroni, Reinaldo Mozart Silva, Rodrigo S. Ferreira, Daniel Civitarese, Daniela Szwarcman, Emilio Vital Brazil

    Abstract: We have seen in the past years the flourishing of machine and deep learning algorithms in several applications such as image classification and segmentation, object detection and recognition, among many others. This was only possible, in part, because datasets like ImageNet -- with +14 million labeled images -- were created and made publicly available, providing researchers with a common ground to…

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: 5 pages, 5 figures

  16. arXiv:1406.7025  [pdf, other]

    cs.GR

    DASS: Detail Aware Sketch-Based Surface Modeling

    Authors: Emilio Vital Brazil

    Abstract: We present a sketch-based modeling system suitable for detail editing, based on a multilevel representation for surfaces. The main advantage of this representation is that it allows for the control of local (details) and global changes of the model. We used an adaptive mesh (4-8 mesh) and developed a label theory to construct a manifold structure, which is responsible for controlling local editing of the m…

    Submitted 26 June, 2014; originally announced June 2014.

    ACM Class: I.3.5

  17. The Cost of Perfection for Matchings in Graphs

    Authors: Emilio Vital Brazil, Guilherme D. da Fonseca, Celina de Figueiredo, Diana Sasaki

    Abstract: Perfect matchings and maximum weight matchings are two fundamental combinatorial structures. We consider the ratio between the maximum weight of a perfect matching and the maximum weight of a general matching. Motivated by the computer graphics application in triangle meshes, where we seek to convert a triangulation into a quadrangulation by merging pairs of adjacent triangles, we focus mainly on…

    Submitted 4 December, 2014; v1 submitted 11 April, 2012; originally announced April 2012.

    Journal ref: Discrete Applied Mathematics, 210:112-122, 2016