-
Kilometer-Scale E3SM Land Model Simulation over North America
Authors:
Dali Wang,
Chen Wang,
Qinglei Cao,
Peter Schwartz,
Fengming Yuan,
Jayesh Krishna,
Danqing Wu,
Danial Ricciuto,
Peter Thornton,
Shih-Chieh Kao,
Michele Thornton,
Kathryn Mohror
Abstract:
The development of a kilometer-scale E3SM Land Model (km-scale ELM) is an integral part of the E3SM project, which seeks to advance energy-related Earth system science research with state-of-the-art modeling and simulation capabilities on exascale computing systems. Through the utilization of high-fidelity data products, such as atmospheric forcing and soil properties, the km-scale ELM plays a cri…
▽ More
The development of a kilometer-scale E3SM Land Model (km-scale ELM) is an integral part of the E3SM project, which seeks to advance energy-related Earth system science research with state-of-the-art modeling and simulation capabilities on exascale computing systems. Through the utilization of high-fidelity data products, such as atmospheric forcing and soil properties, the km-scale ELM plays a critical role in accurately modeling geographical characteristics and extreme weather occurrences. The model is vital for enhancing our comprehension and prediction of climate patterns, as well as their effects on ecosystems and human activities.
This study showcases the first set of full-capability, km-scale ELM simulations over various computational domains, including simulations encompassing 21.6 million land gridcells, reflecting approximately 21.5 million square kilometers of North America at a 1 km x 1 km resolution. We present the largest km-scale ELM simulation using up to 100,800 CPU cores across 2,400 nodes. This continental-scale simulation is 300 times larger than any previous studies, and the computational resources used are about 400 times larger than those used in prior efforts. Both strong and weak scaling tests have been conducted, revealing exceptional performance efficiency and resource utilization.
The km-scale ELM uses the common E3SM modeling infrastructure and a general data toolkit known as KiloCraft. Consequently, it can be readily adapted for both fully-coupled E3SM simulations and data-driven simulations over specific areas, ranging from a single gridcell to the entire North America.
△ Less
Submitted 19 January, 2025;
originally announced January 2025.
-
S3LLM: Large-Scale Scientific Software Understanding with LLMs using Source, Metadata, and Document
Authors:
Kareem Shaik,
Dali Wang,
Weijian Zheng,
Qinglei Cao,
Heng Fan,
Peter Schwartz,
Yunhe Feng
Abstract:
The understanding of large-scale scientific software poses significant challenges due to its diverse codebase, extensive code length, and target computing architectures. The emergence of generative AI, specifically large language models (LLMs), provides novel pathways for understanding such complex scientific codes. This paper presents S3LLM, an LLM-based framework designed to enable the examinati…
▽ More
The understanding of large-scale scientific software poses significant challenges due to its diverse codebase, extensive code length, and target computing architectures. The emergence of generative AI, specifically large language models (LLMs), provides novel pathways for understanding such complex scientific codes. This paper presents S3LLM, an LLM-based framework designed to enable the examination of source code, code metadata, and summarized information in conjunction with textual technical reports in an interactive, conversational manner through a user-friendly interface. S3LLM leverages open-source LLaMA-2 models to enhance code analysis through the automatic transformation of natural language queries into domain-specific language (DSL) queries. Specifically, it translates these queries into Feature Query Language (FQL), enabling efficient scanning and parsing of entire code repositories. In addition, S3LLM is equipped to handle diverse metadata types, including DOT, SQL, and customized formats. Furthermore, S3LLM incorporates retrieval augmented generation (RAG) and LangChain technologies to directly query extensive documents. S3LLM demonstrates the potential of using locally deployed open-source LLMs for the rapid understanding of large-scale scientific computing software, eliminating the need for extensive coding expertise, and thereby making the process more efficient and effective. S3LLM is available at https://github.com/ResponsibleAILab/s3llm.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Impact of gender on the formation and outcome of mentoring relationships in academic research
Authors:
Leah P. Schwartz,
Jean LiƩnard,
Stephen V. David
Abstract:
Despite increasing representation in graduate training programs, a disproportionate number of women leave academic research before obtaining an independent position. To understand factors underlying this trend, we analyzed a multidisciplinary database of Ph.D. and postdoctoral mentoring relationships covering the years 2000-2020, focusing on data from the life sciences. Student and mentor gender a…
▽ More
Despite increasing representation in graduate training programs, a disproportionate number of women leave academic research before obtaining an independent position. To understand factors underlying this trend, we analyzed a multidisciplinary database of Ph.D. and postdoctoral mentoring relationships covering the years 2000-2020, focusing on data from the life sciences. Student and mentor gender are both associated with differences in rates of student's continuation to independent mentor positions of their own. Although trainees of women mentors are less likely to take on independent positions than trainees of men mentors, this effect is reduced substantially after controlling for several measurements of mentor status. Thus the effect of mentor gender can be explained at least partially by gender disparities in social and financial resources available to mentors. Because trainees and mentors tend to be of the same gender, this association between mentor gender and academic continuation disproportionately impacts women trainees. On average, gender homophily in graduate training is unrelated to mentor status. A notable exception to this trend is the special case of scientists having been granted an outstanding distinction, evidenced by membership in the National Academy of Sciences, being a grantee of the Howard Hughes Medical Institute, or having been awarded the Nobel Prize. This group of mentors trains men graduate students at higher rates than their most successful colleagues. These results suggest that, in addition to other factors that limit career choices for women trainees, gender inequities in mentors' access to resources and prestige contribute to women's attrition from independent research positions.
△ Less
Submitted 4 May, 2022; v1 submitted 15 April, 2021;
originally announced April 2021.
-
High-order Discretization of a Gyrokinetic Vlasov Model in Edge Plasma Geometry
Authors:
Milo R. Dorr,
Phillip Colella,
Mikhail A. Dorf,
Debojyoti Ghosh,
Jeffrey A. F. Hittinger,
Peter O. Schwartz
Abstract:
We present a high-order spatial discretization of a continuum gyrokinetic Vlasov model in axisymmetric tokamak edge plasma geometries. Such models describe the phase space advection of plasma species distribution functions in the absence of collisions. The gyrokinetic model is posed in a four-dimensional phase space, upon which a grid is imposed when discretized. To mitigate the computational cost…
▽ More
We present a high-order spatial discretization of a continuum gyrokinetic Vlasov model in axisymmetric tokamak edge plasma geometries. Such models describe the phase space advection of plasma species distribution functions in the absence of collisions. The gyrokinetic model is posed in a four-dimensional phase space, upon which a grid is imposed when discretized. To mitigate the computational cost associated with high-dimensional grids, we employ a high-order discretization to reduce the grid size needed to achieve a given level of accuracy relative to lower-order methods. Strong anisotropy induced by the magnetic field motivates the use of mapped coordinate grids aligned with magnetic flux surfaces. The natural partitioning of the edge geometry by the separatrix between the closed and open field line regions leads to the consideration of multiple mapped blocks, in what is known as a mapped multiblock (MMB) approach. We describe the specialization of a more general formalism that we have developed for the construction of high-order, finite-volume discretizations on MMB grids, yielding the accurate evaluation of the gyrokinetic Vlasov operator, the metric factors resulting from the MMB coordinate mappings, and the interaction of blocks at adjacent boundaries. Our conservative formulation of the gyrokinetic Vlasov model incorporates the fact that the phase space velocity has zero divergence, which must be preserved discretely to avoid truncation error accumulation. We describe an approach for the discrete evaluation of the gyrokinetic phase space velocity that preserves the divergence-free property to machine precision.
△ Less
Submitted 5 December, 2017;
originally announced December 2017.