Skip to main content

Showing 1–26 of 26 results for author: Godoy, F

.
  1. arXiv:2505.08135  [pdf, ps, other

    cs.SE cs.AI cs.DC cs.PF

    Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions

    Authors: Keita Teranishi, Harshitha Menon, William F. Godoy, Prasanna Balaprakash, David Bau, Tal Ben-Nun, Abhinav Bhatele, Franz Franchetti, Michael Franusich, Todd Gamblin, Giorgis Georgakoudis, Tom Goldstein, Arjun Guha, Steven Hahn, Costin Iancu, Zheming Jin, Terry Jones, Tze Meng Low, Het Mankad, Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Daniel Nichols, Konstantinos Parasyris, Swaroop Pophale, Pedro Valero-Lara , et al. (3 additional authors not shown)

    Abstract: We discuss the challenges and propose research directions for using AI to revolutionize the development of high-performance computing (HPC) software. AI technologies, in particular large language models, have transformed every aspect of software development. For its part, HPC software is recognized as a highly specialized scientific field of its own. We discuss the challenges associated with lever… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 12 pages, 1 Figure, Accepted at "The 1st International Workshop on Foundational Large Language Models Advances for HPC" LLM4HPC to be held in conjunction with ISC High Performance 2025

  2. arXiv:2505.05623  [pdf, other

    cs.PF cs.CE cs.DC

    Characterizing GPU Energy Usage in Exascale-Ready Portable Science Applications

    Authors: William F. Godoy, Oscar Hernandez, Paul R. C. Kent, Maria Patrou, Kazi Asifuzzaman, Narasinga Rao Miniskar, Pedro Valero-Lara, Jeffrey S. Vetter, Matthew D. Sinclair, Jason Lowe-Power, Bobby R. Bruce

    Abstract: We characterize the GPU energy usage of two widely adopted exascale-ready applications representing two classes of particle and mesh solvers: (i) QMCPACK, a quantum Monte Carlo package, and (ii) AMReXCastro, an adaptive mesh astrophysical code. We analyze power, temperature, utilization, and energy traces from double-/single (mixed)-precision benchmarks on NVIDIA's A100 and H100 and AMD's MI250X G… ▽ More

    Submitted 16 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: 13 pages, 8 figures, 3 tables. Accepted at the Energy Efficiency with Sustainable Performance: Techniques, Tools, and Best Practices, EESP Workshop, in conjunction with ISC High Performance 2025

  3. arXiv:2409.07299  [pdf, other

    physics.app-ph physics.chem-ph

    From Memory Traces to Surface Chemistry: Decoding REDOX Reactions

    Authors: Ana Luiza Costa Silva, Rafael Schio Wengenroth Silva, Lucas Augusto Moisés, Adenilson José Chiquito, Marcio Peron Franco de Godoy, Fabian Hartmann, Victor Lopez-Richard

    Abstract: Gas and moisture sensing devices leveraging the resistive switching effect in transition metal oxide memristors promise to revolutionize next-generation, nano-scaled, cost-effective, and environmentally sustainable sensor solutions. These sensors encode readouts in resistance state changes based on gas concentration, yet their nonlinear current-voltage characteristics offer richer dynamics, captur… ▽ More

    Submitted 28 October, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

  4. arXiv:2312.02200  [pdf, other

    cs.CV cs.AI stat.AP

    An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets

    Authors: Maya Srikanth, Jeremy Irvin, Brian Wesley Hill, Felipe Godoy, Ishan Sabane, Andrew Y. Ng

    Abstract: Major advancements in computer vision can primarily be attributed to the use of labeled datasets. However, acquiring labels for datasets often results in errors which can harm model performance. Recent works have proposed methods to automatically identify mislabeled images, but developing strategies to effectively implement them in real world datasets has been sparsely explored. Towards improved d… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  5. arXiv:2311.16061  [pdf

    cond-mat.mtrl-sci physics.optics

    Evaluation of microscale crystallinity modification induced by laser writing on Mn3O4 thin films

    Authors: Camila Ianhez-Pereira, Akhil Kuriakose, Ariano De Giovanni Rodrigues, Ana Luiza Costa Silva, Ottavia Jedrkiewicz, Monica Bollani, Marcio Peron Franco de Godoy

    Abstract: Defining microstructures and managing local crystallinity allow the implementation of several functionalities in thin film technology. The use of ultrashort Bessel beams for bulk crystallinity modification has garnered considerable attention as a versatile technique for semiconductor materials, dielectrics, or metal oxide substrates. The aim of this work is the quantitative evaluation of the cryst… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 27 pages

  6. Julia as a unifying end-to-end workflow language on the Frontier exascale system

    Authors: William F. Godoy, Pedro Valero-Lara, Caira Anderson, Katrina W. Lee, Ana Gainaru, Rafael Ferreira da Silva, Jeffrey S. Vetter

    Abstract: We evaluate Julia as a single language and ecosystem paradigm powered by LLVM to develop workflow components for high-performance computing. We run a Gray-Scott, 2-variable diffusion-reaction application using a memory-bound, 7-point stencil kernel on Frontier, the US Department of Energy's first exascale supercomputer. We evaluate the performance, scaling, and trade-offs of (i) the computational… ▽ More

    Submitted 27 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 11 pages, 8 figures, accepted at the 18th Workshop on Workflows in Support of Large-Scale Science (WORKS23), IEEE/ACM The International Conference for High Performance Computing, Networking, Storage, and Analysis, SC23

  7. arXiv:2309.07103  [pdf, other

    cs.SE cs.AI cs.DC cs.PL

    Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation

    Authors: Pedro Valero-Lara, Alexis Huante, Mustafa Al Lail, William F. Godoy, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter

    Abstract: We evaluate the use of the open-source Llama-2 model for generating well-known, high-performance computing kernels (e.g., AXPY, GEMV, GEMM) on different parallel programming models and languages (e.g., C++: OpenMP, OpenMP Offload, OpenACC, CUDA, HIP; Fortran: OpenMP, OpenMP Offload, OpenACC; Python: numpy, Numba, pyCUDA, cuPy; and Julia: Threads, CUDA.jl, AMDGPU.jl). We built upon our previous wor… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted at LCPC 2023, The 36th International Workshop on Languages and Compilers for Parallel Computing http://www.lcpcworkshop.org/LCPC23/ . 13 pages, 5 figures, 1 table

  8. arXiv:2307.11502  [pdf, other

    cs.SE cs.DC physics.comp-ph

    Software engineering to sustain a high-performance computing scientific application: QMCPACK

    Authors: William F. Godoy, Steven E. Hahn, Michael M. Walsh, Philip W. Fackler, Jaron T. Krogel, Peter W. Doak, Paul R. C. Kent, Alfredo A. Correa, Ye Luo, Mark Dewing

    Abstract: We provide an overview of the software engineering efforts and their impact in QMCPACK, a production-level ab-initio Quantum Monte Carlo open-source code targeting high-performance computing (HPC) systems. Aspects included are: (i) strategic expansion of continuous integration (CI) targeting CPUs, using GitHub Actions runners, and NVIDIA and AMD GPUs in pre-exascale systems, using self-hosted hard… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted at the first US-RSE Conference, USRSE2023, https://us-rse.org/usrse23/, 8 pages, 3 figures, 4 tables

  9. arXiv:2306.15121  [pdf, other

    cs.AI cs.ET cs.PL

    Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation

    Authors: William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter

    Abstract: We evaluate AI-assisted generative capabilities on fundamental numerical kernels in high-performance computing (HPC), including AXPY, GEMV, GEMM, SpMV, Jacobi Stencil, and CG. We test the generated kernel codes for a variety of language-supported programming models, including (1) C++ (e.g., OpenMP [including offload], OpenACC, Kokkos, SyCL, CUDA, and HIP), (2) Fortran (e.g., OpenMP [including offl… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at the Sixteenth International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), 2023 to be held in conjunction with ICPP 2023: The 52nd International Conference on Parallel Processing. 10 pages, 6 figures, 5 tables

  10. arXiv:2304.08393  [pdf, other

    gr-qc astro-ph.CO astro-ph.HE

    Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1670 additional authors not shown)

    Abstract: Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 28 pages, 11 figures

    Report number: LIGO-P2200031

  11. Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes

    Authors: William F. Godoy, Pedro Valero-Lara, T. Elise Dettling, Christian Trefftz, Ian Jorquera, Thomas Sheehy, Ross G. Miller, Marc Gonzalez-Tallada, Jeffrey S. Vetter, Valentin Churavy

    Abstract: We explore the performance and portability of the high-level programming models: the LLVM-based Julia and Python/Numba, and Kokkos on high-performance computing (HPC) nodes: AMD Epyc CPUs and MI250X graphical processing units (GPUs) on Frontier's test bed Crusher system and Ampere's Arm-based CPUs and NVIDIA's A100 GPUs on the Wombat system at the Oak Ridge Leadership Computing Facilities. We comp… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted at the 28th HIPS workshop, held in conjunction with IPDPS 2023. 10 pages, 9 figures

  12. arXiv:2212.01477  [pdf, other

    astro-ph.HE astro-ph.CO

    Search for subsolar-mass black hole binaries in the second part of Advanced LIGO's and Advanced Virgo's third observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1680 additional authors not shown)

    Abstract: We describe a search for gravitational waves from compact binaries with at least one component with mass 0.2 $M_\odot$ -- $1.0 M_\odot$ and mass ratio $q \geq 0.1$ in Advanced LIGO and Advanced Virgo data collected between 1 November 2019, 15:00 UTC and 27 March 2020, 17:00 UTC. No signals were detected. The most significant candidate has a false alarm rate of 0.2 $\mathrm{yr}^{-1}$. We estimate t… ▽ More

    Submitted 26 January, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: https://dcc.ligo.org/P2200139

  13. Giving RSEs a Larger Stage through the Better Scientific Software Fellowship

    Authors: William F. Godoy, Ritu Arora, Keith Beattie, David E. Bernholdt, Sarah E. Bratt, Daniel S. Katz, Ignacio Laguna, Amiya K. Maji, Addi Malviya Thakur, Rafael M. Mudafort, Nitin Sukhija, Damian Rouson, Cindy Rubio-González, Karan Vahi

    Abstract: The Better Scientific Software Fellowship (BSSwF) was launched in 2018 to foster and promote practices, processes, and tools to improve developer productivity and software sustainability of scientific codes. BSSwF's vision is to grow the community with practitioners, leaders, mentors, and consultants to increase the visibility of scientific software production and sustainability. Over the last fiv… ▽ More

    Submitted 14 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: submitted to Computing in Science & Engineering (CiSE), Special Issue on the Future of Research Software Engineers in the US

  14. arXiv:2211.02740  [pdf, other

    cs.DC

    Bridging HPC Communities through the Julia Programming Language

    Authors: Valentin Churavy, William F Godoy, Carsten Bauer, Hendrik Ranocha, Michael Schlottke-Lakemper, Ludovic Räss, Johannes Blaschke, Mosè Giordano, Erik Schnetter, Samuel Omlin, Jeffrey S. Vetter, Alan Edelman

    Abstract: The Julia programming language has evolved into a modern alternative to fill existing gaps in scientific computing and data science applications. Julia leverages a unified and coordinated single-language and ecosystem paradigm and has a proven track record of achieving high performance without sacrificing user productivity. These aspects make Julia a viable alternative to high-performance computin… ▽ More

    Submitted 10 November, 2022; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: 20 pages; improved image quality

  15. arXiv:2209.02863  [pdf

    astro-ph.HE gr-qc

    Model-based cross-correlation search for gravitational waves from the low-mass X-ray binary Scorpius X-1 in LIGO O3 data

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1670 additional authors not shown)

    Abstract: We present the results of a model-based search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1 using LIGO detector data from the third observing run of Advanced LIGO, Advanced Virgo and KAGRA. This is a semicoherent search which uses details of the signal model to coherently combine data separated by less than a specified coherence time, which can be adjusted to bala… ▽ More

    Submitted 2 January, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: 19 pages, Open Access Journal PDF

    Report number: LIGO-P2100110-v13

    Journal ref: The Astrophysical Journal Letters, 941, L30 (2022)

  16. arXiv:2209.02610  [pdf, ps, other

    cs.SE

    A perspective to navigate the National Laboratory environment for RSE career growth

    Authors: William F Godoy

    Abstract: This paper shares a perspective for the research software engineering (RSE) community to navigate the National Laboratory landscape. The RSE role is a recent concept that led to organizational challenges to place and evaluate their impact, costs and benefits. The premise is that RSEs are a natural fit into the current landscape and can use traditional career growth strategies in science: publicati… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: 2 pages, paper presented at the RSE-HPC workshop https://us-rse.org/rse-hpc-2022/ , part of Supercomputing 2022 https://sc22.supercomputing.org/

  17. Modeling pre-Exascale AMR Parallel I/O Workloads via Proxy Applications

    Authors: William F Godoy, Jenna Delozier, Gregory R Watson

    Abstract: The present work investigates the modeling of pre-exascale input/output (I/O) workloads of Adaptive Mesh Refinement (AMR) simulations through a simple proxy application. We collect data from the AMReX Castro framework running on the Summit supercomputer for a wide range of scales and mesh partitions for the hydrodynamic Sedov case as a baseline to provide sufficient coverage to the formulated prox… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: 10 pages, 11 figures, accepted at Seventeenth International Workshop on Automatic Performance Tuning, iWAPT2022, held in conjunction with IEEE IPDPS 2022

  18. A Survey on Sustainable Software Ecosystems to Support Experimental and Observational Science at Oak Ridge National Laboratory

    Authors: David E Bernholdt, Mathieu Doucet, William F Godoy, Addi Malviya-Thakur, Gregory R Watson

    Abstract: In the search for a sustainable approach for software ecosystems that supports experimental and observational science (EOS) across Oak Ridge National Laboratory (ORNL), we conducted a survey to understand the current and future landscape of EOS software and data. This paper describes the survey design we used to identify significant areas of interest, gaps, and potential opportunities, followed by… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 14 pages, no figures, only tables

    Journal ref: ICCS 2022, SE4Science Workshop

  19. Efficient loading of reduced data ensembles produced at ORNL SNS/HFIR neutron time-of-flight facilities

    Authors: William F Godoy, Andrei T Savici, Steven E Hahn, Peter F Peterson

    Abstract: We present algorithmic improvements to the loading operations of certain reduced data ensembles produced from neutron scattering experiments at Oak Ridge National Laboratory (ORNL) facilities. Ensembles from multiple measurements are required to cover a wide range of the phase space of a sample material of interest. They are stored using the standard NeXus schema on individual HDF5 files. This mak… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 7 pages, 6 figures, 4 tables, The Second International Workshop on Big Data Reduction held with 2021 IEEE International Conference on Big Data

  20. Transitioning from file-based HPC workflows to streaming data pipelines with openPMD and ADIOS2

    Authors: Franz Poeschel, Juncheng E, William F. Godoy, Norbert Podhorszki, Scott Klasky, Greg Eisenhauer, Philip E. Davis, Lipeng Wan, Ana Gainaru, Junmin Gu, Fabian Koller, René Widera, Michael Bussmann, Axel Huebl

    Abstract: This paper aims to create a transition path from file-based IO to streaming-based workflows for scientific applications in an HPC environment. By using the openPMP-api, traditional workflows limited by filesystem bottlenecks can be overcome and flexibly extended for in situ analysis. The openPMD-api is a library for the description of scientific data according to the Open Standard for Particle-Mes… ▽ More

    Submitted 19 January, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: 18 pages, 9 figures, SMC2021, supplementary material at https://zenodo.org/record/4906276

  21. arXiv:2102.00507  [pdf, ps, other

    gr-qc

    Radiating jump conditions in General Relativity

    Authors: L. F Castañeda-Godoy, J. Ospino, L. A. Núñez

    Abstract: We present a unified description of spherical discontinuity surfaces in General Relativity based on two parameters: mass function and surface permeability. The surfaces considered are: \textit{Impulsive fronts}, massive permeable layer; \textit{Surface layers}, massive impermeable layer; \textit{Shock fronts}, massless permeable surface; and \textit{Boundary surfaces}, massless impermeable surface… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  22. Efficient Data Management in Neutron Scattering Data Reduction Workflows at ORNL

    Authors: William F Godoy, Peter F Peterson, Steven E Hahn, Jay J Billings

    Abstract: Oak Ridge National Laboratory (ORNL) experimental neutron science facilities produce 1.2\,TB a day of raw event-based data that is stored using the standard metadata-rich NeXus schema built on top of the HDF5 file format. Performance of several data reduction workflows is largely determined by the amount of time spent on the loading and processing algorithms in Mantid, an open-source data analysis… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 7 pages, 4 figures, International Workshop on Big Data Reduction held with 2020 IEEE International Conference on Big Data

  23. arXiv:2011.11773  [pdf, other

    cs.CL

    Advancing Humor-Focused Sentiment Analysis through Improved Contextualized Embeddings and Model Architecture

    Authors: Felipe Godoy

    Abstract: Humor is a natural and fundamental component of human interactions. When correctly applied, humor allows us to express thoughts and feelings conveniently and effectively, increasing interpersonal affection, likeability, and trust. However, understanding the use of humor is a computationally challenging task from the perspective of humor-aware language processing models. As language models become u… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

  24. Effects of Reducing Heat Treatment on the Structural and the Magnetic Properties of Mn:ZnO Ceramics

    Authors: V. M. Almeida Lage, R. T. da Silva, A. Mesquita, M. P. F. de Godoy, X. Gratens, d V. A. Chitta, H. B. de Carvalho

    Abstract: Polycrystalline bulk Mn:ZnO ceramics with Mn nominal concentrations of 6, 11, 17 and 22 at.% were prepared trough solid-state reaction method and subjected to a heat treatment in reducing atmosphere (Ar (95%) and H2 (5%)). The samples were studied with particular emphasis on their compositions, structural, and magnetic properties. A detailed microstructural and chemical analysis confirms the Mn do… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

    Comments: 19 pages; 7 figures

    Journal ref: Journal of Alloys and Compounds 863 (2021) 158320

  25. Defect Induced Room Temperature Ferromagnetism in High Quality Co-doped ZnO Bulk Samples

    Authors: M. P. F. de Godoy, X. Gratens, V. A. Chitta, A. Mesquita, M. M de Lima Jr., A. Cantarero, G. Rahman, J. M. Morbec, H. B. de Carvalho

    Abstract: The nature of the often reported room temperature ferromagnetism in transition metal doped oxides is still a matter of huge debate. Herein we report on room temperature ferromagnetism in high quality Co-doped ZnO (Zn1-xCoxO) bulk samples synthesized via standard solid-state reaction route. Reference paramagnetic Co-doped ZnO samples with low level of structural defects are subjected to heat treatm… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 33 pages, 9 figures

    Journal ref: Journal of Alloys and Compounds 859 (2021) 157772

  26. arXiv:1703.07383  [pdf, ps, other

    physics.comp-ph physics.med-ph

    Mathematical Model with Autoregressive Process for Electrocardiogram Signals

    Authors: Ronaldo M Evaristo, Antonio M Batista, Ricardo L Viana, Kelly C Iarosz, José D Szezech Jr, Moacir F de Godoy

    Abstract: The cardiovascular system is composed of the heart, blood and blood vessels. Regarding the heart, cardiac conditions are determined by the electrocardiogram, that is a noninvasive medical procedure. In this work, we propose autoregressive process in a mathematical model based on coupled differential equations in order to model electrocardiogram signals. Our results are compared with experimental t… ▽ More

    Submitted 6 November, 2017; v1 submitted 21 March, 2017; originally announced March 2017.

    Journal ref: Communications in Nonlinear Science and Numerical Simulation, Volume 57, Pages 415-421, 2018