-
A hybrid origin for the Martian atmosphere
Authors:
Kaveh Pahlevan,
Laura Schaefer,
Don Porcelli
Abstract:
The Martian isotopic record displays a dichotomy in volatile compositions. Interior volatiles from the mantle record a chondritic heritage (e.g., H, N, Kr, Xe) whereas the atmospheric reservoir of Kr and Xe - which do not currently experience escape - record heritage from a solar-like source. Motivated by disparate inferences on the source of Martian atmospheric volatiles (outgassed versus nebular captured), we consider hybrid-source accretionary atmospheres in which a high molecular weight (e.g., CO2-rich) outgassed component is mixed in with the low molecular weight H2-rich nebular atmosphere. We conduct calculations of nebular capture with and without a high molecular weight outgassed component mixed into the atmosphere during the lifetime of the solar nebula. Mixing an outgassed component into the nebular layer enhances the captured gas inventory by 1-3 orders of magnitude - depending on the outgassed inventory - relative to "pure" nebular capture. These observations and calculations suggest that the Martian atmosphere arose as a subequal mixture of outgassed and nebular-derived components, and provide a framework for assessing the role of various mechanisms of gas loss over the entire history of the planet.
Submitted 20 February, 2025; v1 submitted 20 October, 2024;
originally announced October 2024.
-
The scaling behaviour of localised and extended states in one-dimensional tight-binding models with disorder
Authors:
Luca Schaefer,
Barbara Drossel
Abstract:
We investigate two one-dimensional tight-binding models with disorder that have extended states at zero energy. We use exact and partial diagonalisation of the Hamiltonian to obtain the eigenmodes and the associated participation ratios, and the transfer-matrix method to determine the localisation length. The first model has no on-site disorder, but random couplings. While the participation ratio remains finite at zero energy, the localisation length diverges logarithmically as the energy goes to zero. We provide an intuitive derivation of this logarithmic divergence based on the weak coupling of the two sublattices. The second model has a conserved quantity as the row sums of the Hamiltonian are zero. This model can be represented as a harmonic chain with random couplings, or as a diffusion model on a lattice with random links. We find, in agreement with existing analytical calculations, that the number of system-spanning eigenmodes increases proportionally to the square root of the system size, and we relate this power law to other power laws that characterise the scaling behaviour of the eigenmodes, the participation ratio, the localisation length, and their dependence on energy and system size. When disorder is so strong that the smallest hopping terms can be arbitrarily close to zero, all these power laws change, and we show a crossover between the two scaling regimes. All these results are explained by intuitive arguments based on scaling.
Submitted 4 October, 2024;
originally announced October 2024.
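As a rough illustration of the first model described in the abstract above (random couplings, no on-site disorder), the following Python sketch builds a small chain Hamiltonian, diagonalises it exactly, and computes a participation ratio for each eigenmode. The participation-ratio definition used here, the uniform coupling distribution, the open boundary conditions, and all function names are assumptions made for illustration; none of them is taken from the paper.

```python
import numpy as np


def random_hopping_chain(n_sites, rng, low=0.5, high=1.5):
    """Tridiagonal Hamiltonian: zero diagonal, random nearest-neighbour couplings.

    The uniform distribution on [low, high] is an illustrative assumption.
    """
    couplings = rng.uniform(low, high, size=n_sites - 1)
    return np.diag(couplings, k=1) + np.diag(couplings, k=-1)


def participation_ratio(mode):
    """PR = (sum |psi|^2)^2 / sum |psi|^4.

    Equals 1 for a mode on a single site and N for a uniform mode on N sites.
    """
    weights = np.abs(mode) ** 2
    return weights.sum() ** 2 / np.sum(weights ** 2)


def spectrum_and_pr(hamiltonian):
    """Exact diagonalisation followed by the PR of every eigenmode."""
    energies, modes = np.linalg.eigh(hamiltonian)
    ratios = np.array(
        [participation_ratio(modes[:, k]) for k in range(modes.shape[1])]
    )
    return energies, ratios


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    energies, ratios = spectrum_and_pr(random_hopping_chain(201, rng))
    k = int(np.argmin(np.abs(energies)))  # eigenmode closest to zero energy
    print("E =", energies[k], "PR =", ratios[k])
```

Because the chain only couples neighbouring sites, the spectrum of this sketch is symmetric about zero, which gives a quick sanity check on the construction.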
-
Towards a self-consistent evaluation of gas dwarf scenarios for temperate sub-Neptunes
Authors:
Frances E. Rigby,
Lorenzo Pica-Ciamarra,
Måns Holmberg,
Nikku Madhusudhan,
Savvas Constantinou,
Laura Schaefer,
Jie Deng,
Kanani K. M. Lee,
Julianne I. Moses
Abstract:
The recent JWST detections of carbon-bearing molecules in a habitable-zone sub-Neptune have opened a new era in the study of low-mass exoplanets. The sub-Neptune regime spans a wide diversity of planetary interiors and atmospheres not witnessed in the solar system, including mini-Neptunes, super-Earths, and water worlds. Recent works have investigated the possibility of gas dwarfs, with rocky interiors and thick H$_2$-rich atmospheres, to explain aspects of the sub-Neptune population, including the radius valley. Interactions between the H$_2$-rich envelope and a potential magma ocean may lead to observable atmospheric signatures. We report a coupled interior-atmosphere modelling framework for gas dwarfs to investigate the plausibility of magma oceans on such planets and their observable diagnostics. We find that the surface-atmosphere interactions and atmospheric composition are sensitive to a wide range of parameters, including the atmospheric and internal structure, mineral composition, volatile solubility and atmospheric chemistry. While magma oceans are typically associated with high-temperature rocky planets, we assess whether such conditions may be admissible and observable for temperate sub-Neptunes. We find that a holistic modelling approach is required for this purpose and to avoid unphysical model solutions. Using our model framework and considering the habitable-zone sub-Neptune K2-18 b as a case study, we find that its observed atmospheric composition is incompatible with a magma ocean scenario. We identify key atmospheric molecular and elemental diagnostics, including the abundances of CO$_2$, CO, NH$_3$ and, potentially, S-bearing species. Our study also underscores the need for fundamental material properties for accurate modelling of such planets.
Submitted 5 September, 2024;
originally announced September 2024.
-
Approximability of the Containment Problem for Zonotopes and Ellipsotopes
Authors:
Adrian Kulmburg,
Lukas Schäfer,
Matthias Althoff
Abstract:
The zonotope containment problem, i.e., whether one zonotope is contained in another, is a central problem in control theory. Applications include detecting faults and robustifying controllers by computing invariant sets, and obtaining fixed points in reachability analysis. Despite the inherent co-NP-hardness of this problem, an approximation algorithm developed by S. Sadraddini and R. Tedrake has gained widespread recognition for its swift execution and consistent reliability in practice. In our study, we substantiate the precision of the algorithm with a definitive proof, elucidating the empirical accuracy observed in practice. Our proof hinges on establishing a connection between the containment problem and the computation of matrix norms, thereby enabling the extension of the approximation algorithm to encompass ellipsotopes -- a broader class of sets derived from zonotopes. We also explore the computational complexity of the ellipsotope containment problem with a focus on approximability. Finally, we present new methods to compute safe sets for linear dynamical systems, demonstrating the practical relevance of approximating the ellipsotope containment problem.
Submitted 6 March, 2025; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Combinatorial Printing of Functionally Graded Solid-State Electrolyte for High-Voltage Lithium Metal Batteries
Authors:
Qiang Jiang,
Stephanie Atampugre,
Yipu Du,
Lingyu Yang,
Jennifer L. Schaefer,
Yanliang Zhang
Abstract:
Heterogeneous multilayered solid-state electrolytes (HMSSEs) have been widely explored for their broadened working voltage range and compatibility with electrodes. However, due to the limitations of traditional manufacturing methods such as casting, the interfaces between electrolyte layers in an HMSSE can severely decrease the ionic conductivity. Here, a novel combinatory aerosol jet printing (CAJP) method is introduced to fabricate a functionally graded solid-state electrolyte (FGSSE) without a sharp interface. Owing to the unique ability of CAJP (in-situ mixing and instantaneous tuning of the mixing ratio), an FGSSE with smooth microscale compositional gradation is achieved. Electrochemical tests show that the FGSSE has excellent oxidative stability exceeding 5.5 V and improved conductivity (>7 times that of an analogous HMSSE). By decoupling the total resistance, we show that the resistance from the electrolyte/electrolyte interface of the HMSSE is 5.7 times the total resistance of the FGSSE. The Li/FGSSE/NCM622 cell can be stably run for more than 200 cycles along with improved rate performance.
Submitted 13 April, 2024;
originally announced April 2024.
-
On a coarse invertibility spectrum for coarse groups
Authors:
Leo Schäfer,
Federico Vigolo
Abstract:
We introduce a coarse algebraic invariant for coarse groups and use it to differentiate various coarsifications of the group of integers. This lets us answer two questions posed by Leitner and the second author. The invariant is obtained by considering the set of exponents n such that taking n-th powers defines a coarse equivalence of the coarse group.
Submitted 7 April, 2025; v1 submitted 12 April, 2024;
originally announced April 2024.
-
Complementing cell taxonomies with a multicellular functional analysis of tissues
Authors:
Ricardo Omar Ramirez Flores,
Philipp Sven Lars Schäfer,
Leonie Küchenhoff,
Julio Saez-Rodriguez
Abstract:
The application of single-cell molecular profiling coupled with spatial technologies has enabled charting cellular heterogeneity in reference tissues and in disease. This new wave of molecular data has highlighted the expected diversity of single-cell dynamics upon shared external cues and spatial organizations. However, little is known about the relationship between single-cell heterogeneity and the emergence and maintenance of robust multicellular processes in developed tissues and its role in (patho)physiology. Here, we present emerging computational modeling strategies that use increasingly available large-scale cross-condition single-cell and spatial datasets to study multicellular organization in tissues and complement cell taxonomies. This perspective should enable us to better understand how cells within tissues collectively process information and adapt synchronized responses in disease contexts and to bridge the gap between structural changes and functions in tissues.
Submitted 11 March, 2024;
originally announced March 2024.
-
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Authors:
Lukas Schäfer,
Logan Jones,
Anssi Kanervisto,
Yuhan Cao,
Tabish Rashid,
Raluca Georgescu,
Dave Bignell,
Siddhartha Sen,
Andrea Treviño Gavito,
Sam Devlin
Abstract:
Video games have served as useful benchmarks for the decision-making community, but going beyond Atari games towards modern games has been prohibitively expensive for the vast majority of the research community. Prior work in modern video games typically relied on game-specific integration to obtain game features and enable online training, or on existing large datasets. An alternative approach is to train agents using imitation learning to play video games purely from images. However, this setting poses a fundamental question: which visual encoders obtain representations that retain information critical for decision making? To answer this question, we conduct a systematic study of imitation learning with publicly available pre-trained visual encoders compared to the typical task-specific end-to-end training approach in Minecraft, Counter-Strike: Global Offensive, and Minecraft Dungeons. Our results show that end-to-end training can be effective with comparably low-resolution images and only minutes of demonstrations, but significant improvements can be gained by utilising pre-trained encoders such as DINOv2 depending on the game. In addition to enabling effective decision making, we show that pre-trained encoders can make decision-making research in video games more accessible by significantly reducing the cost of training.
Submitted 30 April, 2025; v1 submitted 4 December, 2023;
originally announced December 2023.
-
Parametric Matroid Interdiction
Authors:
Nils Hausbrandt,
Oliver Bachtler,
Stefan Ruzika,
Luca E. Schäfer
Abstract:
We introduce the parametric matroid one-interdiction problem. Given a matroid, each element of its ground set is associated with a weight that depends linearly on a real parameter from a given parameter interval. The goal is to find, for each parameter value, one element that, when being removed, maximizes the weight of a minimum weight basis. The complexity of this problem can be measured by the number of slope changes of the piecewise linear function mapping the parameter to the weight of the optimal solution of the parametric matroid one-interdiction problem. We provide two polynomial upper bounds as well as a lower bound on the number of these slope changes. Using these, we develop algorithms that require a polynomial number of independence tests and analyse their running time in the special case of graphical matroids.
Submitted 8 October, 2023;
originally announced October 2023.
-
Outgassing Composition of the Murchison Meteorite: Implications for Volatile Depletion of Planetesimals and Interior-atmosphere Connections for Terrestrial Exoplanets
Authors:
Maggie A. Thompson,
Myriam Telus,
Graham Harper Edwards,
Laura Schaefer,
Jasmeet Dhaliwal,
Brian Dreyer,
Jonathan J. Fortney,
Kyle Kim
Abstract:
Outgassing is a central process during the formation and evolution of terrestrial planets and their atmospheres both within and beyond the solar system. Although terrestrial planets' early atmospheres likely form via outgassing during planetary accretion, the connection between a planet's bulk composition and its initial atmospheric properties is not well understood. One way to inform this connection is to analyze the outgassing compositions of meteorites, and in particular carbonaceous chondrites, because they are some of the most volatile-rich, primitive materials (in terms of their bulk compositions) that are available for direct study. In addition, they may serve as compositional analogs for the building block materials of terrestrial planets in our solar system and around other Sun-like stars. This study builds upon previous outgassing experiments that monitored the abundances of volatile species (e.g., H2O, CO, and CO2) released from the Murchison meteorite. To gain a more complete understanding of Murchison's outgassing composition, we perform a series of heating experiments under atmospheric pressure (1 bar) and vacuum ($10^{-9}$ bar) conditions on samples of the Murchison meteorite, followed by bulk element analysis to inform the outgassing trends of a suite of major elements in Murchison (e.g., Fe, Mg, Zn, and S). Under both pressure conditions, sulfur outgasses significantly at the highest temperatures (800-1000 °C). For the samples heated under vacuum conditions, we also detect outgassing of zinc. Combined with prior outgassing experiments, this study provides important insights into the volatile depletion patterns of undifferentiated planetesimals and the early outgassing compositions of terrestrial exoplanets.
Submitted 3 October, 2023;
originally announced October 2023.
-
Locally Adaptive Shrinkage Priors for Trends and Breaks in Count Time Series
Authors:
Toryn L. J. Schafer,
David S. Matteson
Abstract:
Non-stationary count time series characterized by features such as abrupt changes and fluctuations about the trend arise in many scientific domains, including biophysics, ecology, energy, epidemiology, and social science. Current approaches for integer-valued time series lack the flexibility to capture local transient features, while more flexible models for continuous data types are inadequate for universal applications to integer-valued responses such as settings with small counts. We present a modeling framework, the negative binomial Bayesian trend filter (NB-BTF), that offers an adaptive model-based solution to capturing multiscale features with valid integer-valued inference for trend filtering. The framework is a hierarchical Bayesian model with a dynamic global-local shrinkage process. The flexibility of the global-local process allows for the necessary local regularization while the temporal dependence induces a locally smooth trend. In simulation, the NB-BTF outperforms a number of alternative trend filtering methods. Then, we demonstrate the method on weekly power outage frequency in Massachusetts townships. Power outage frequency is characterized by a nominal low level with occasional spikes. These illustrations show the estimation of a smooth, non-stationary trend with adequate uncertainty quantification.
Submitted 31 August, 2023;
originally announced September 2023.
-
No thick carbon dioxide atmosphere on the rocky exoplanet TRAPPIST-1 c
Authors:
Sebastian Zieba,
Laura Kreidberg,
Elsa Ducrot,
Michaël Gillon,
Caroline Morley,
Laura Schaefer,
Patrick Tamburo,
Daniel D. B. Koll,
Xintong Lyu,
Lorena Acuña,
Eric Agol,
Aishwarya R. Iyer,
Renyu Hu,
Andrew P. Lincowski,
Victoria S. Meadows,
Franck Selsis,
Emeline Bolmont,
Avi M. Mandell,
Gabrielle Suissa
Abstract:
Seven rocky planets orbit the nearby dwarf star TRAPPIST-1, providing a unique opportunity to search for atmospheres on small planets outside the Solar System (Gillon et al., 2017). Thanks to the recent launch of JWST, possible atmospheric constituents such as carbon dioxide (CO2) are now detectable (Morley et al., 2017; Lincowski et al., 2018). Recent JWST observations of the innermost planet TRAPPIST-1 b showed that it is most probably a bare rock without any CO2 in its atmosphere (Greene et al., 2023). Here we report the detection of thermal emission from the dayside of TRAPPIST-1 c with the Mid-Infrared Instrument (MIRI) on JWST at 15 micron. We measure a planet-to-star flux ratio of fp/fs = 421 +/- 94 parts per million (ppm) which corresponds to an inferred dayside brightness temperature of 380 +/- 31 K. This high dayside temperature disfavours a thick, CO2-rich atmosphere on the planet. The data rule out cloud-free O2/CO2 mixtures with surface pressures ranging from 10 bar (with 10 ppm CO2) to 0.1 bar (pure CO2). A Venus-analogue atmosphere with sulfuric acid clouds is also disfavoured at 2.6 sigma confidence. Thinner atmospheres or bare-rock surfaces are consistent with our measured planet-to-star flux ratio. The absence of a thick, CO2-rich atmosphere on TRAPPIST-1 c suggests a relatively volatile-poor formation history, with less than $9.5^{+7.5}_{-2.3}$ Earth oceans of water. If all planets in the system formed in the same way, this would indicate a limited reservoir of volatiles for the potentially habitable planets in the system.
Submitted 16 June, 2023;
originally announced June 2023.
-
Magnetic properties of Nd6Fe13Cu single crystals
Authors:
Jianing Liu,
Ruiwen Xie,
Alex Aubert,
Lukas Schäfer,
Hongbin Zhang,
Oliver Gutfleisch,
Konstantin Skokov
Abstract:
Understanding the coercivity mechanism in high-performance Nd-Fe-B permanent magnets relies on the analysis of the magnetic properties of all phases present in the magnets. When Cu is added to such compounds, a new Nd6Fe13Cu grain boundary phase is formed; however, the magnetic properties of this phase and its role in the magnetic decoupling of the matrix Nd2Fe14B grains are still insufficiently studied. In this work, we have grown Nd6Fe13Cu single crystals by the reactive flux method and studied their magnetic properties in detail. It is observed that below the Néel temperature (TN = 410 K), Nd6Fe13Cu is antiferromagnetic in zero magnetic field, whereas when a magnetic field is applied along the a-axis, a spin-flop transition occurs at approx. 6 T, indicating a strong competition between antiferromagnetic and ferromagnetic interactions in the two Nd layers below and above the Cu layers. Our atomistic spin dynamics simulation confirms that an increase in temperature and/or magnetic field can significantly change the antiferromagnetic coupling between the two Nd layers below and above the Cu layers, which, in turn, is the reason for the observed spin-flop transition. These results suggest that the role of the antiferromagnetic Nd6Fe13Cu grain boundary phase in the coercivity enhancement of Nd-Fe-B-Cu magnets is more complex than previously thought, mainly due to the competition between its antiferro- and ferro-magnetic exchange interactions.
Submitted 4 May, 2023;
originally announced May 2023.
-
Magneto-active composites with locally tailored stiffness produced by laser powder bed fusion
Authors:
Kilian Schäfer,
Matthias Lutzi,
Muhammad Bilal Khan,
Lukas Schäfer,
Konstantin Skokov,
Imants Dirba,
Sebastian Bruns,
Iman Valizadeh,
Oliver Weeger,
Claas Hartmann,
Mario Kupnik,
Esmaeil Adabifiroozjaei,
Leopoldo Molina-Luna,
Oliver Gutfleisch
Abstract:
Additive manufacturing technologies enable the production of complex and bioinspired shapes using magneto-responsive materials, which find diverse applications in soft robotics. Particularly, the development of composites with controlled gradients in mechanical properties offers new prospects for advancements in magneto-active materials. However, achieving such composites with gradients typically involves complex multi-material printing procedures. In this study, a single-step laser powder bed fusion (LPBF) process is proposed that enables precise local adjustments of the mechanical stiffness within magneto-active composites. By utilizing distinct laser parameters in specific regions of a composite containing thermoplastic polyurethane and atomized magnetic powder derived from hard magnetic Nd-Fe-B, the stiffness of the composite can be modified within the range of 2 to 22 MPa. Various magneto-responsive actuators with locally tailored stiffness are fabricated and their magnetic performance is investigated. The enhanced response exhibited by actuators with locally adjusted mechanical properties in comparison to their homogeneous counterparts with identical geometries is shown. As a demonstration of a biomedical application, a magnetically responsive stent with localized adjustment is presented with the ability to meet specific requirements in terms of geometry and local stiffness based on an individual's anatomy and disease condition. The proposed method presents an approach for creating functionally graded materials using LPBF, not only for magneto-active materials but also for several other structural and functional materials.
Submitted 20 December, 2023; v1 submitted 4 May, 2023;
originally announced May 2023.
-
The Premartensite and Martensite in Fe50Rh50 System
Authors:
Esmaeil Adabifiroozjaei,
Fernando Maccari,
Lukas Schaefer,
Tianshu Jiang,
Oscar Recalde-Benitez,
Alisa Chirkova,
Navid Shayanfar,
Imants Dirba,
Nagaarjhuna A Kani,
Olga Shuleshova,
Robert Winkler,
Alexander Zintler,
Ziyuan Rao,
Lukas Pfeuffer,
Andras Kovacs,
Rafal E Dunin-Borkowski,
Michael Farle,
Konstantin Skokov,
Baptiste Gault,
Markus Gruner,
Oliver Gutfleisch,
Leopoldo Molina-Luna
Abstract:
Metallic/intermetallic materials with BCC structures hold an intrinsic instability due to phonon softening along the [110] direction, causing a transformation from BCC to lower-symmetry phases when the BCC structures are thermally or mechanically stressed. The Fe50Rh50 binary system is one of the exceptional BCC structures (ordered B2) that has not yet shown such a transformation upon application of thermal stress, although mechanical deformation results in a transformation from B2 to disordered FCC (gamma) and L10 phases. Here, a comprehensive transmission electron microscopy (TEM) study is conducted on thermally stressed samples of Fe50Rh50 quenched into water and liquid nitrogen from 1150 °C and 1250 °C. The results show that samples quenched from 1150 °C into water and liquid nitrogen exhibit 1/4{110} and 1/2{110} satellite reflections, the latter of which is expected from phonon dispersion curves obtained by density functional theory calculations. Therefore, it is believed that Fe50Rh50 maintains the B2 structure in a premartensite state. Once Fe50Rh50 is quenched from 1250 °C into liquid nitrogen, formation of two short-range ordered tetragonal phases with different c/a ratios (~1.15 and 1.4) is observed, in line with phases formed in a mechanically deformed (30%) sample. Based on our observations, an accurate atomistic shear model ({110}<1-10>) is presented that describes the martensitic transformation of B2 to these tetragonal phases. These findings offer implications useful for understanding the magnetic and physical characteristics of metallic/intermetallic materials.
Submitted 3 May, 2023; v1 submitted 2 May, 2023;
originally announced May 2023.
-
Using Offline Data to Speed Up Reinforcement Learning in Procedurally Generated Environments
Authors:
Alain Andres,
Lukas Schäfer,
Stefano V. Albrecht,
Javier Del Ser
Abstract:
One of the key challenges of Reinforcement Learning (RL) is the ability of agents to generalise their learned policy to unseen settings. Moreover, training RL agents requires large numbers of interactions with the environment. Motivated by the recent success of Offline RL and Imitation Learning (IL), we conduct a study to investigate whether agents can leverage offline data in the form of trajectories to improve the sample-efficiency in procedurally generated environments. We consider two settings of using IL from offline data for RL: (1) pre-training a policy before online RL training and (2) concurrently training a policy with online RL and IL from offline data. We analyse the impact of the quality (optimality of trajectories) and diversity (number of trajectories and covered level) of available offline trajectories on the effectiveness of both approaches. Across four well-known sparse reward tasks in the MiniGrid environment, we find that using IL for pre-training and concurrently during online RL training both consistently improve the sample-efficiency while converging to optimal policies. Furthermore, we show that pre-training a policy from as few as two trajectories can make the difference between learning an optimal policy at the end of online training and not learning at all. Our findings motivate the widespread adoption of IL for pre-training and concurrent IL in procedurally generated environments whenever offline trajectories are available or can be generated.
Submitted 8 December, 2024; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Authors:
Lukas Schäfer,
Oliver Slumbers,
Stephen McAleer,
Yali Du,
Stefano V. Albrecht,
David Mguni
Abstract:
Multi-agent reinforcement learning (MARL) requires agents to explore within a vast joint action space to find joint actions that lead to coordination. Existing value-based MARL algorithms commonly rely on random exploration, such as $ε$-greedy, which is unsystematic and inefficient at identifying effective actions in multi-agent problems. Additionally, the concurrent training of the policies of multiple agents can render the optimisation non-stationary. This can lead to unstable value estimates and high-variance gradients, and can ultimately hinder coordination between agents. To address these challenges, we propose ensemble value functions for multi-agent exploration (EMAX). EMAX is a framework to seamlessly extend value-based MARL algorithms. EMAX leverages an ensemble of value functions for each agent to guide their exploration, reduce the variance of their optimisation, and make their policies more robust to miscoordination. EMAX achieves these benefits in three ways: (1) it systematically guides the exploration of agents with a UCB policy towards parts of the environment that require multiple agents to coordinate; (2) it computes average value estimates across the ensemble as target values to reduce the variance of gradients and make optimisation more stable; (3) during evaluation, it selects actions following a majority vote across the ensemble to reduce the likelihood of miscoordination. We first instantiate independent DQN with EMAX and evaluate it in 11 general-sum tasks with sparse rewards. We show that EMAX improves final evaluation returns by 185% across all tasks. We then evaluate EMAX on top of IDQN, VDN and QMIX in 21 common-reward tasks, and show that EMAX improves sample efficiency and final evaluation returns across all tasks over all three vanilla algorithms by 60%, 47%, and 538%, respectively.
Submitted 6 February, 2025; v1 submitted 7 February, 2023;
originally announced February 2023.
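The three mechanisms named in the EMAX abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the form of the UCB policy, so the common "ensemble mean plus `beta` times ensemble standard deviation" rule is assumed here, and the function names, the `beta` and `gamma` parameters, and the array layout (one row of action values per ensemble member, for a single agent) are all hypothetical.

```python
import numpy as np


def ucb_action(q_ensemble: np.ndarray, beta: float = 1.0) -> int:
    """Exploration: pick the action maximising mean + beta * std over the ensemble.

    q_ensemble has shape (ensemble_size, num_actions). The mean-plus-std form
    is an assumed UCB rule; the abstract does not specify it.
    """
    mean = q_ensemble.mean(axis=0)
    std = q_ensemble.std(axis=0)
    return int(np.argmax(mean + beta * std))


def ensemble_target(next_q_ensemble: np.ndarray, reward: float, gamma: float = 0.99) -> float:
    """Target value built from value estimates averaged across the ensemble."""
    mean_next = next_q_ensemble.mean(axis=0)
    return float(reward + gamma * mean_next.max())


def majority_vote_action(q_ensemble: np.ndarray) -> int:
    """Evaluation: each ensemble member votes for its greedy action."""
    greedy_actions = q_ensemble.argmax(axis=1)
    votes = np.bincount(greedy_actions, minlength=q_ensemble.shape[1])
    return int(np.argmax(votes))


# Toy ensemble of three value functions over three actions (made-up numbers).
q = np.array([
    [1.0, 0.0, 0.5],
    [1.0, 2.0, 0.5],
    [1.0, -2.0, 0.5],
])
explore = ucb_action(q, beta=1.0)       # favours the action the ensemble disagrees on
evaluate = majority_vote_action(q)      # favours the action most members prefer
target = ensemble_target(q, reward=1.0, gamma=0.5)
```

In this toy case the UCB rule is drawn to the action with the largest disagreement between ensemble members, while the majority vote settles on the action most members rank highest.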
-
p-median location interdiction on trees
Authors:
Lena Leiß,
Till Heller,
Luca E. Schäfer,
Manuel Streicher,
Stefan Ruzika
Abstract:
In p-median location interdiction the aim is to find a subset of edges in a graph, such that the objective value of the p-median problem in the same graph without the selected edges is as large as possible.
We prove that this problem is NP-hard even on acyclic graphs. Restricting the problem to trees with unit lengths on the edges, unit interdiction costs, and a single edge interdiction, we provide an algorithm which solves the problem in polynomial time. Furthermore, we investigate path graphs with unit and arbitrary lengths. For the former case, we present an algorithm in which multiple edges can be interdicted. For the latter case, we present a method to compute an optimal solution for one interdiction step, which can also be extended to multiple interdicted edges.
Submitted 31 January, 2023;
originally announced January 2023.
-
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Authors:
Aleksandar Krnjaic,
Raul D. Steleac,
Jonathan D. Thomas,
Georgios Papoudakis,
Lukas Schäfer,
Andrew Wing Keung To,
Kuan-Ho Lao,
Murat Cubuktepe,
Matthew Haley,
Peter Börsting,
Stefano V. Albrecht
Abstract:
We consider a warehouse in which dozens of mobile robots and human pickers work together to collect and deliver items within the warehouse. The fundamental problem we tackle, called the order-picking problem, is how these worker agents must coordinate their movement and actions in the warehouse to maximise performance in this task. Established industry methods using heuristic approaches require large engineering efforts to optimise for innately variable warehouse configurations. In contrast, multi-agent reinforcement learning (MARL) can be flexibly applied to diverse warehouse configurations (e.g. size, layout, number/types of workers, item replenishment frequency), and different types of order-picking paradigms (e.g. Goods-to-Person and Person-to-Goods), as the agents can learn how to cooperate optimally through experience. We develop hierarchical MARL algorithms in which a manager agent assigns goals to worker agents, and the policies of the manager and workers are co-trained toward maximising a global objective (e.g. pick rate). Our hierarchical algorithms achieve significant gains in sample efficiency over baseline MARL algorithms and overall pick rates over multiple established industry heuristics in a diverse set of warehouse configurations and different order-picking paradigms.
Submitted 30 August, 2024; v1 submitted 22 December, 2022;
originally announced December 2022.
-
A machine learning based algorithm selection method to solve the minimum cost flow problem
Authors:
Philipp Herrmann,
Anna Meyer,
Stefan Ruzika,
Luca E. Schäfer,
Fabian von der Warth
Abstract:
The minimum cost flow problem is one of the most studied network optimization problems and appears in numerous applications. Some efficient algorithms exist for this problem, which are freely available in the form of libraries or software packages. It is noticeable that none of these solvers is better than the other solution methods on all instances. Thus, the question arises whether the fastest algorithm can be selected for a given instance based on the characteristics of the instance. To this end, we train several machine learning classifiers to predict the fastest among a given set of solvers. We accomplish this by creating a representative data set of 81,000 instances and characterizing each of these instances by a vector of relevant features. To achieve better performance, we conduct a grid search to optimize the hyperparameters of the classifiers. Finally, we evaluate the different classifiers by means of accuracy. It is shown that tree-based models appear to adapt and exploit the relevant structures of the minimum-cost flow problem particularly well on a large number of instances, predicting the fastest solver with an accuracy of more than 90%.
Submitted 3 October, 2022;
originally announced October 2022.
-
A primordial atmospheric origin of hydrospheric deuterium enrichment on Mars
Authors:
Kaveh Pahlevan,
Laura Schaefer,
Linda T. Elkins-Tanton,
Steven J. Desch,
Peter R. Buseck
Abstract:
The deuterium-to-hydrogen (D/H or 2H/1H) ratio of Martian atmospheric water (~6x standard mean ocean water, SMOW) is higher than that of known sources, requiring planetary enrichment. A recent measurement by NASA's Mars Science Laboratory rover Curiosity of >3 Gyr clays yields a D/H ratio ~3x SMOW, demonstrating that most enrichment occurs early in Mars's history. As on Venus, Mars's D/H enrichment is thought to reflect preferential loss to space of 1H (protium) relative to 2H (deuterium), but the global environmental context of large and early hydrogen losses remains to be determined. Here, we apply a recent model of primordial atmosphere evolution to Mars, link the magma ocean of the accretion epoch with a subsequent water-ocean epoch, and calculate the behavior of deuterium for comparison with the observed record. We find that a ~2-3x hydrospheric deuterium-enrichment is produced if the Martian magma ocean is chemically reducing at last equilibration with the primordial atmosphere, making H2-CO the initially dominant species, with minor abundances of H2O-CO2. Reducing gases - in particular H2 - can cause greenhouse warming and prevent a water ocean from freezing immediately after the magma ocean epoch. Moreover, the pressure-temperature conditions are high enough to produce ocean-atmosphere H2O-H2 isotopic equilibrium such that surface H2O strongly concentrates deuterium relative to H2, which preferentially takes up protium and escapes from the primordial atmosphere. The proposed scenario of primordial H2-rich outgassing and escape suggests significant durations (>Myr) of chemical conditions on the Martian surface conducive to prebiotic chemistry immediately following Martian accretion.
Submitted 21 September, 2022;
originally announced September 2022.
-
The SAMI Galaxy Survey: Using concentrated star-formation and stellar population ages to understand environmental quenching
Authors:
Di Wang,
Scott M. Croom,
Julia J. Bryant,
Sam P. Vaughan,
Adam L. Schaefer,
Francesco D'Eugenio,
Stefania Barsanti,
Sarah Brough,
Claudia del P. Lagos,
Anne M. Medling,
Sree Oh,
Jesse van de Sande,
Giulia Santucci,
Joss Bland-Hawthorn,
Michael Goodwin,
Brent Groves,
Jon Lawrence,
Matt S. Owers,
Samuel Richards
Abstract:
We study environmental quenching using the spatial distribution of current star-formation and stellar population ages with the full SAMI Galaxy Survey. By using a star-formation concentration index [C-index, defined as log10(r_{50,Halpha}/r_{50,cont})], we separate our sample into regular galaxies (C-index>-0.2) and galaxies with centrally concentrated star-formation (SF-concentrated; C-index<-0.2). Concentrated star-formation is a potential indicator of galaxies currently undergoing 'outside-in' quenching. Our environments cover ungrouped galaxies, low-mass groups (M_200<10^12.5 M_sun), high-mass groups (M_200 in the range 10^{12.5-14} M_sun) and clusters (M_200>10^14 M_sun). We find the fraction of SF-concentrated galaxies increases as halo mass increases with 9±2 per cent, 8±3 per cent, 19±4 per cent and 29±4 per cent for ungrouped galaxies, low-mass groups, high-mass groups and clusters, respectively. We interpret these results as evidence for 'outside-in' quenching in groups and clusters. To investigate the quenching time-scale in SF-concentrated galaxies, we calculate light-weighted age (Age_L) and mass-weighted age (Age_M) using full spectral fitting, as well as the Dn4000 and Hdelta_A indices. We assume that the average galaxy age radial profile before entering a group or cluster is similar to ungrouped regular galaxies. At large radius (1-2 R_e), SF-concentrated galaxies in high-mass groups have older ages than ungrouped regular galaxies with an age difference of 1.83±0.38 Gyr for Age_L and 1.34±0.56 Gyr for Age_M. This suggests that while 'outside-in' quenching can be effective in groups, the process will not quickly quench the entire galaxy. In contrast, the ages at 1-2 R_e of cluster SF-concentrated galaxies and ungrouped regular galaxies are consistent (0.19±0.21 Gyr for Age_L, 0.40±0.61 Gyr for Age_M), suggesting the quenching process must be rapid.
Submitted 1 September, 2022;
originally announced September 2022.
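The C-index defined in the SAMI abstract above is a simple log ratio of two half-light radii, so the sample split can be written in a few lines. The sketch below is only an illustration of that definition: the function names and the example radii are hypothetical, and the two radii are assumed to be given in the same units.

```python
import math


def c_index(r50_halpha: float, r50_cont: float) -> float:
    """Star-formation concentration index: log10(r_50,Halpha / r_50,cont)."""
    return math.log10(r50_halpha / r50_cont)


def is_sf_concentrated(r50_halpha: float, r50_cont: float, threshold: float = -0.2) -> bool:
    """True when the C-index falls below the -0.2 cut quoted in the abstract."""
    return c_index(r50_halpha, r50_cont) < threshold


# Made-up radii (same units): H-alpha emission half as extended as the continuum.
compact_sf = is_sf_concentrated(r50_halpha=2.0, r50_cont=4.0)
# Made-up radii: H-alpha emission as extended as the continuum.
regular = is_sf_concentrated(r50_halpha=4.0, r50_cont=4.0)
```

A galaxy whose H-alpha half-light radius is half its continuum half-light radius has a C-index of about -0.3 and so falls on the SF-concentrated side of the cut.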
-
Deep Reinforcement Learning for Multi-Agent Interaction
Authors:
Ibrahim H. Ahmed,
Cillian Brewitt,
Ignacio Carlucho,
Filippos Christianos,
Mhairi Dunion,
Elliot Fosong,
Samuel Garcin,
Shangmin Guo,
Balint Gyevnar,
Trevor McInroe,
Georgios Papoudakis,
Arrasy Rahman,
Lukas Schäfer,
Massimiliano Tamborski,
Giuseppe Vecchio,
Cheng Wang,
Stefano V. Albrecht
Abstract:
The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning. Towards this goal, the Autonomous Agents Research Group develops novel machine learning algorithms for autonomous systems control, with a specific focus on deep reinforcement learning and multi-agent reinforcement learning. Research problems include scalable learning of coordinated agent policies and inter-agent communication; reasoning about the behaviours, goals, and composition of other agents from limited observations; and sample-efficient learning based on intrinsic motivation, curriculum learning, causal inference, and representation learning. This article provides a broad overview of the ongoing research portfolio of the group and discusses open problems for future directions.
Submitted 2 August, 2022;
originally announced August 2022.
-
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
Authors:
Lukas Schäfer,
Filippos Christianos,
Amos Storkey,
Stefano V. Albrecht
Abstract:
Successful deployment of multi-agent reinforcement learning often requires agents to adapt their behaviour. In this work, we discuss the problem of teamwork adaptation in which a team of agents needs to adapt their policies to solve novel tasks with limited fine-tuning. Motivated by the intuition that agents need to be able to identify and distinguish tasks in order to adapt their behaviour to the current task, we propose to learn multi-agent task embeddings (MATE). These task embeddings are trained using an encoder-decoder architecture optimised for reconstruction of the transition and reward functions which uniquely identify tasks. We show that a team of agents is able to adapt to novel tasks when provided with task embeddings. We propose three MATE training paradigms: independent MATE, centralised MATE, and mixed MATE which vary in the information used for the task encoding. We show that the embeddings learned by MATE identify tasks and provide useful information which agents leverage during adaptation to novel tasks.
Submitted 20 November, 2023; v1 submitted 5 July, 2022;
originally announced July 2022.
-
Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning
Authors:
Trevor McInroe,
Lukas Schäfer,
Stefano V. Albrecht
Abstract:
Learning control from pixels is difficult for reinforcement learning (RL) agents because representation learning and policy learning are intertwined. Previous approaches remedy this issue with auxiliary representation learning tasks, but they either do not consider the temporal aspect of the problem or only consider single-step transitions, which may cause learning inefficiencies if important environmental changes take many steps to manifest. We propose Hierarchical $k$-Step Latent (HKSL), an auxiliary task that learns multiple representations via a hierarchy of forward models that learn to communicate and an ensemble of $n$-step critics that all operate at varying magnitudes of step skipping. We evaluate HKSL in a suite of 30 robotic control tasks with and without distractors and a task of our creation. We find that HKSL either converges to higher episodic returns or reaches optimal episodic returns more quickly than several alternative representation learning approaches. Furthermore, we find that HKSL's representations capture task-relevant details accurately across timescales (even in the presence of distractors) and that communication channels between hierarchy levels organize information based on both sides of the communication process, both of which improve sample efficiency.
Submitted 29 January, 2024; v1 submitted 22 June, 2022;
originally announced June 2022.
-
The Equilibrium Tide: An Updated Prescription for Population Synthesis Codes
Authors:
Holly P. Preece,
Adrian S. Hamers,
Patrick G. Neunteufel,
Adam L. Schafer,
Christopher A. Tout
Abstract:
We present an updated prescription for the equilibrium tides suitable for population synthesis codes. A grid of 1D evolutionary models was created and the viscous time-scale was calculated for each detailed model. A metallicity-dependent power-law relation was fitted to both the convective cores and convective envelopes of the models. The prescription was implemented into the population synthesis code BSE and predicts a 16.5% reduction in the overall number of mergers, with those involving main-sequence stars most affected. The new prescription also reduces the overall supernova rate by 3.6%, with individual channels being differently affected. The single-degenerate Ia supernova occurrence is reduced by 12.8%. The merging of two carbon-oxygen white dwarfs to cause a Ia supernova occurs 16% less frequently. The number of sub-synchronously rotating stars in close binaries is substantially increased with our prescription, as is the number of non-circularized systems at the start of common-envelope evolution.
Submitted 23 June, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking
Authors:
Hanna Krasowski,
Jakob Thumm,
Marlon Müller,
Lukas Schäfer,
Xiao Wang,
Matthias Althoff
Abstract:
Ensuring the safety of reinforcement learning (RL) algorithms is crucial to unlock their potential for many real-world tasks. However, vanilla RL and most safe RL approaches do not guarantee safety. In recent years, several methods have been proposed to provide hard safety guarantees for RL, which is essential for applications where unsafe actions could have disastrous consequences. Nevertheless, there is no comprehensive comparison of these provably safe RL methods. Therefore, we introduce a categorization of existing provably safe RL methods, present the conceptual foundations for both continuous and discrete action spaces, and empirically benchmark existing methods. We categorize the methods based on how they adapt the action: action replacement, action projection, and action masking. Our experiments on an inverted pendulum and a quadrotor stabilization task indicate that action replacement is the best-performing approach for these applications despite its comparatively simple realization. Furthermore, adding a reward penalty every time the safety verification is engaged improved training performance in our experiments. Finally, we provide practical guidance on selecting provably safe RL approaches depending on the safety specification, RL algorithm, and type of action space.
Submitted 18 November, 2023; v1 submitted 13 May, 2022;
originally announced May 2022.
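The three action-adaptation categories named in the abstract above (replacement, projection, masking) can be illustrated with a toy sketch. This is not the benchmark code from the paper. It assumes a safety verifier has already produced a boolean `safe_mask` over discrete actions, or a safe box `[low, high]` for continuous actions; all function names and the fallback action are hypothetical, and projection onto a box is used as the simplest possible stand-in for a general projection onto a safe set.

```python
import numpy as np


def replace_action(proposed: int, safe_mask: np.ndarray, fallback: int) -> int:
    """Action replacement: keep the proposed action if verified safe, else use a safe fallback."""
    if not safe_mask[fallback]:
        raise ValueError("fallback action must itself be verified safe")
    return proposed if safe_mask[proposed] else fallback


def project_action(action: np.ndarray, low: float, high: float) -> np.ndarray:
    """Action projection (simplified): closest point in an assumed safe box [low, high]."""
    return np.clip(action, low, high)


def mask_action(q_values: np.ndarray, safe_mask: np.ndarray) -> int:
    """Action masking: choose the best action among those verified safe."""
    if not safe_mask.any():
        raise ValueError("no safe action available")
    masked = np.where(safe_mask, q_values, -np.inf)
    return int(np.argmax(masked))


# Toy example: action 0 has the highest value but is flagged unsafe.
q_values = np.array([3.0, 1.0, 2.0])
safe_mask = np.array([False, True, True])

replaced = replace_action(proposed=0, safe_mask=safe_mask, fallback=1)
masked = mask_action(q_values, safe_mask)
projected = project_action(np.array([1.5, -0.2]), low=0.0, high=1.0)
```

Replacement and projection correct an action after the agent proposes it, whereas masking restricts the choice beforehand, so the agent only ever selects among actions already verified as safe.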
-
SDSS-IV MaNGA: Exploring the local scaling relations for N/O
Authors:
Adam L. Schaefer,
Christy Tremonti,
Guinevere Kauffmann,
Brett H. Andrews,
Matthew A. Bershady,
Nicholas F. Boardman,
Kevin Bundy,
Niv Drory,
José G. Fernández-Trincado,
Holly P. Preece,
Rogério Riffel,
Rogemar A. Riffel,
Sebastián F. Sánchez
Abstract:
We present, for the first time, the relationship between local stellar mass surface density, $\mathrm{Σ_{*}}$, and N/O derived from SDSS-IV MaNGA data, using a sample of $792765$ high signal-to-noise ratio star-forming spaxels. Using a combination of phenomenological modelling and partial correlation analysis, we find that $\mathrm{Σ_{*}}$ alone is insufficient to predict the N/O in MaNGA spaxels, and that there is an additional dependence on the local star formation rate surface density, $\mathrm{Σ_{SFR}}$. This effect is a factor of $3$ stronger than the dependence of 12+log(O/H) on $\mathrm{Σ_{SFR}}$. Surprisingly, we find that the local N/O scaling relations also depend on the total galaxy stellar mass at fixed $Σ_{*}$ as well as the galaxy size at fixed stellar mass. We find that more compact galaxies are more nitrogen rich, even when $\mathrm{Σ_{*}}$ and $\mathrm{Σ_{SFR}}$ are controlled for. We show that $\sim50\%$ of the variance of N/O is explained by the total stellar mass and size. Thus, the evolution of nitrogen in galaxies is set by more than just local effects and does not simply track the build up of oxygen in galaxies. The precise form of the N/O-O/H relation is therefore sensitive to the sample of galaxies from which it is derived. This result casts doubt on the universal applicability of nitrogen-based strong-line metallicity indicators derived in the local universe.
Submitted 31 March, 2022;
originally announced March 2022.
-
Geophysical Evolution During Rocky Planet Formation
Authors:
Tim Lichtenberg,
Laura K. Schaefer,
Miki Nakajima,
Rebecca A. Fischer
Abstract:
Progressive astronomical characterization of planet-forming disks and rocky exoplanets highlights the need for increasing interdisciplinary efforts to understand the birth and life cycle of terrestrial worlds in a unified picture. Here, we review major geophysical and geochemical processes that shape the evolution of rocky planets and their precursor planetesimals during planetary formation and early evolution, and how these map onto the astrophysical timeline and varying accretion environments of planetary growth. The evolution of the coupled core-mantle-atmosphere system of growing protoplanets diverges in thermal, compositional, and structural states to first order, and ultimately shapes key planetary characteristics that can discern planets harboring clement surface conditions from those that do not. Astronomical campaigns seeking to investigate rocky exoplanets will require significant advances in laboratory characterization of planetary materials and time- and spatially-resolved theoretical models of planetary evolution, to extend planetary science beyond the Solar System and constrain the origins and frequency of habitable worlds like our own.
Submitted 18 March, 2022;
originally announced March 2022.
-
Drift vs Shift: Decoupling Trends and Changepoint Analysis
Authors:
Haoxuan Wu,
Toryn L. J. Schafer,
Sean Ryan,
David S. Matteson
Abstract:
We introduce a new approach for decoupling trends (drift) and changepoints (shifts) in time series. Our locally adaptive model-based approach for robustly decoupling combines Bayesian trend filtering and machine learning based regularization. An over-parameterized Bayesian dynamic linear model (DLM) is first applied to characterize drift. Then a weighted penalized likelihood estimator is paired with the estimated DLM posterior distribution to identify shifts. We show how Bayesian DLMs specified with so-called shrinkage priors can provide smooth estimates of underlying trends in the presence of complex noise components. However, their inability to shrink exactly to zero inhibits direct changepoint detection. In contrast, penalized likelihood methods are highly effective in locating changepoints. However, they require data with simple patterns in both signal and noise. The proposed decoupling approach combines the strengths of both, i.e. the flexibility of Bayesian DLMs with the hard thresholding property of penalized likelihood estimators, to provide changepoint analysis in complex, modern settings. The proposed framework is outlier robust and can identify a variety of changes, including in mean and slope. It is also easily extended for analysis of parameter shifts in time-varying parameter models like dynamic regressions. We illustrate the flexibility and contrast the performance and robustness of our approach with several alternative methods across a wide range of simulations and application examples.
Submitted 6 January, 2024; v1 submitted 17 January, 2022;
originally announced January 2022.
-
Analysis of animal-related electric outages using species distribution models and community science data
Authors:
Mei-Ling E. Feng,
Olukunle O. Owolabi,
Toryn L. J. Schafer,
Sanhita Sengupta,
Lan Wang,
David S. Matteson,
Judy P. Che-Castaldo,
Deborah A. Sunter
Abstract:
Animal-related outages (AROs) are a prevalent form of outages in electrical distribution systems. Animal-infrastructure interactions vary across focal species and regions, underlining the need to study the animal-outage relationship in more species and diverse systems. Animal activity has been used as an indicator of reliability in the electrical grid system and to describe temporal patterns in AROs. However, these ARO models have been limited by a lack of available estimates of species activity, instead approximating activity based on seasonal and weather patterns in animal-related outage records and characteristics of broad taxonomic groups, e.g., squirrels. We highlight publicly available resources to fill the ecological data gap that is limiting joint analyses between ecology and energy sectors. Species distribution models (SDMs), a common technique to model the distribution of a species across geographic space and time, paired with data sourced from eBird, a community science database for bird observations, provided us with species-specific estimates of activity to model spatio-temporal patterns of AROs. These flexible, species-specific estimates can allow future animal indicators of grid reliability to be investigated in more diverse regions and ecological communities, providing a better understanding of the variation that exists in the animal-outage relationship. AROs were best modeled by accounting for multiple outage-prone species activity patterns and their unique relationships with seasonality and habitat availability. Different species were important for modeling outages in different landscapes and seasons depending on their distribution and migration behavior. We recommend that future models of AROs include species-specific activity data that account for the diverse spectrum of spatio-temporal activity patterns that outage-prone animals exhibit.
Submitted 22 December, 2021;
originally announced December 2021.
-
Role of Variable Renewable Energy Penetration on Electricity Price and its Volatility Across Independent System Operators in the United States
Authors:
Olukunle O. Owolabi,
Toryn L. J. Schafer,
Georgia E. Smits,
Sanhita Sengupta,
Sean E. Ryan,
Lan Wang,
David S. Matteson,
Mila Getmansky Sherman,
Deborah A. Sunter
Abstract:
The U.S. electrical grid has undergone substantial transformation with increased penetration of wind and solar -- forms of variable renewable energy (VRE). Despite the benefits of VRE for decarbonization, it has garnered some controversy for inducing unwanted effects in regional electricity markets. In this study, we examine the effect of VRE penetration on the system electricity price and price volatility, based on hourly, real-time, historical data from six Independent System Operators (ISOs) in the U.S., using quantile and skew t-distribution regressions. After correcting for temporal effects, we found that an increase in VRE penetration is associated with a decrease in system electricity price in all ISOs studied. The increase in VRE penetration is associated with a decrease in temporal price volatility in five out of six ISOs studied. The relationships are non-linear. These results are consistent with the modern portfolio theory where diverse volatile assets may lead to more stable and less risky portfolios.
Submitted 28 November, 2022; v1 submitted 10 November, 2021;
originally announced December 2021.
-
The Seventeenth Data Release of the Sloan Digital Sky Surveys: Complete Release of MaNGA, MaStar and APOGEE-2 Data
Authors:
Abdurro'uf,
Katherine Accetta,
Conny Aerts,
Victor Silva Aguirre,
Romina Ahumada,
Nikhil Ajgaonkar,
N. Filiz Ak,
Shadab Alam,
Carlos Allende Prieto,
Andres Almeida,
Friedrich Anders,
Scott F. Anderson,
Brett H. Andrews,
Borja Anguiano,
Erik Aquino-Ortiz,
Alfonso Aragon-Salamanca,
Maria Argudo-Fernandez,
Metin Ata,
Marie Aubert,
Vladimir Avila-Reese,
Carles Badenes,
Rodolfo H. Barba,
Kat Barger,
Jorge K. Barrera-Ballesteros,
Rachael L. Beaton
, et al. (316 additional authors not shown)
Abstract:
This paper documents the seventeenth data release (DR17) from the Sloan Digital Sky Surveys; the fifth and final release from the fourth phase (SDSS-IV). DR17 contains the complete release of the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, which reached its goal of surveying over 10,000 nearby galaxies. The complete release of the MaNGA Stellar Library (MaStar) accompanies this data, providing observations of almost 30,000 stars through the MaNGA instrument during bright time. DR17 also contains the complete release of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) survey, which publicly releases infra-red spectra of over 650,000 stars. The main sample from the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), as well as the sub-survey Time Domain Spectroscopic Survey (TDSS) data, were fully released in DR16. New single-fiber optical spectroscopy released in DR17 is from the SPectroscopic IDentification of ERosita Survey (SPIDERS) sub-survey and the eBOSS-RM program. Along with the primary data sets, DR17 includes 25 new or updated Value Added Catalogs (VACs). This paper concludes the release of SDSS-IV survey data. SDSS continues into its fifth phase with observations already underway for the Milky Way Mapper (MWM), Local Volume Mapper (LVM) and Black Hole Mapper (BHM) surveys.
Submitted 13 January, 2022; v1 submitted 3 December, 2021;
originally announced December 2021.
-
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning
Authors:
Rujie Zhong,
Duohan Zhang,
Lukas Schäfer,
Stefano V. Albrecht,
Josiah P. Hanna
Abstract:
Reinforcement learning (RL) algorithms are often categorized as either on-policy or off-policy depending on whether they use data from a target policy of interest or from a different behavior policy. In this paper, we study a subtle distinction between on-policy data and on-policy sampling in the context of the RL sub-problem of policy evaluation. We observe that on-policy sampling may fail to match the expected distribution of on-policy data after observing only a finite number of trajectories and this failure hinders data-efficient policy evaluation. Towards improved data-efficiency, we show how non-i.i.d., off-policy sampling can produce data that more closely matches the expected on-policy data distribution and consequently increases the accuracy of the Monte Carlo estimator for policy evaluation. We introduce a method called Robust On-Policy Sampling and demonstrate theoretically and empirically that it produces data that converges faster to the expected on-policy distribution compared to on-policy sampling. Empirically, we show that this faster convergence leads to lower mean squared error policy value estimates.
Submitted 10 October, 2022; v1 submitted 29 November, 2021;
originally announced November 2021.
-
Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
Authors:
Trevor McInroe,
Lukas Schäfer,
Stefano V. Albrecht
Abstract:
Deep reinforcement learning (RL) agents that exist in high-dimensional state spaces, such as those composed of images, have interconnected learning burdens. Agents must learn an action-selection policy that completes their given task, which requires them to learn a representation of the state space that discerns between useful and useless information. The reward function is the only supervised feedback that RL agents receive, which causes a representation learning bottleneck that can manifest in poor sample efficiency. We present $k$-Step Latent (KSL), a new representation learning method that enforces temporal consistency of representations via a self-supervised auxiliary task wherein agents learn to recurrently predict action-conditioned representations of the state space. The state encoder learned by KSL produces low-dimensional representations that make optimization of the RL task more sample efficient. Altogether, KSL produces state-of-the-art results in both data efficiency and asymptotic performance in the popular PlaNet benchmark suite. Our analyses show that KSL produces encoders that generalize better to new tasks unseen during training, and its representations are more strongly tied to reward, are more invariant to perturbations in the state space, and move more smoothly through the temporal axis of the RL problem than other methods such as DrQ, RAD, CURL, and SAC-AE.
Submitted 10 October, 2021;
originally announced October 2021.
-
The air over there: exploring exoplanet atmospheres
Authors:
Laura Schaefer,
Vivien Parmentier
Abstract:
Atmospheric compositions for rocky exoplanets will depend strongly on the bulk planetary composition and the orbital position of the planet. Non-traditional gases may be present in the atmospheres of exceptionally hot planets. Atmospheres of more clement planets will depend on the abundances of volatiles acquired during planet formation and atmospheric removal processes, including escape, condensation, and reaction with the surface. While observations of exoplanet atmospheres to date have focused on giant planets, a series of new space and ground-based observatories over the coming decade will revolutionize the precision and spectral resolution with which we are able to probe exoplanet atmospheres. This article consolidates lessons learned from the study of giant planet atmospheres, and points to the observations and challenges on the horizon for terrestrial planets.
△ Less
Submitted 18 August, 2021;
originally announced August 2021.
-
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Authors:
Lukas Schäfer,
Filippos Christianos,
Josiah P. Hanna,
Stefano V. Albrecht
Abstract:
Intrinsic rewards can improve exploration in reinforcement learning, but the exploration process may suffer from instability caused by non-stationary reward shaping and strong dependency on hyperparameters. In this work, we introduce Decoupled RL (DeRL) as a general framework which trains separate policies for intrinsically-motivated exploration and exploitation. Such decoupling allows DeRL to leverage the benefits of intrinsic rewards for exploration while demonstrating improved robustness and sample efficiency. We evaluate DeRL algorithms in two sparse-reward environments with multiple types of intrinsic rewards. Our results show that DeRL is more robust to varying scale and rate of decay of intrinsic rewards and converges to the same evaluation returns as intrinsically-motivated baselines in fewer interactions. Lastly, we discuss the challenge of distribution shift and show that divergence constraint regularisers can successfully minimise instability caused by divergence of exploration and exploitation policies.
Submitted 9 February, 2022; v1 submitted 19 July, 2021;
originally announced July 2021.
-
Composition of Terrestrial Exoplanet Atmospheres from Meteorite Outgassing Experiments
Authors:
Maggie A. Thompson,
Myriam Telus,
Laura Schaefer,
Jonathan J. Fortney,
Toyanath Joshi,
David Lederman
Abstract:
Terrestrial exoplanets likely form initial atmospheres through outgassing during and after accretion, although there is currently no first-principles understanding of how to connect a planet's bulk composition to its early atmospheric properties. Important insights into this connection can be gained by assaying meteorites, representative samples of planetary building blocks. We perform laboratory outgassing experiments that use a mass spectrometer to measure the abundances of volatiles released when meteorite samples are heated to 1200 $^{\circ}$C. We find that outgassing from three carbonaceous chondrite samples consistently produces H$_2$O-rich (averaged ~66 %) atmospheres but with significant amounts of CO (~18 %) and CO$_2$ (~15 %) as well as smaller quantities of H$_2$ and H$_2$S (up to 1 %). These results provide experimental constraints on the initial chemical composition in theoretical models of terrestrial planet atmospheres, supplying abundances for principal gas species as a function of temperature.
△ Less
Submitted 16 April, 2021;
originally announced April 2021.
-
Water on Hot Rocky Exoplanets
Authors:
Edwin S. Kite,
Laura Schaefer
Abstract:
Data suggest that most rocky exoplanets with orbital period $p$ $<$ 100 d ("hot" rocky exoplanets) formed as gas-rich sub-Neptunes that subsequently lost most of their envelopes, but whether these rocky exoplanets still have atmospheres is unknown. We identify a pathway by which 1-1.7 $R_{Earth}$ (1-10 $M_{Earth}$) rocky exoplanets with orbital periods of 10-100 days can acquire long-lived 10-2000 bar atmospheres that are H$_2$O-dominated, with mean molecular weight $>$10. These atmospheres form during the planets' evolution from sub-Neptunes into rocky exoplanets. H$_2$O that is made by reduction of iron oxides in the silicate magma is highly soluble in the magma, forming a dissolved reservoir that is protected from loss so long as the H$_2$-dominated atmosphere persists. The large size of the dissolved reservoir buffers the H$_2$O atmosphere against loss after the H$_2$ has dispersed. Within our model, a long-lived, water-dominated atmosphere is a common outcome for efficient interaction between a nebula-derived atmosphere (peak atmosphere mass fraction 0.1-0.6 wt%) and oxidized magma ($>$5 wt% FeO), followed by atmospheric loss. This idea predicts that most rocky planets that have orbital periods of 10-100 days and that have radii within 0.1-0.2 $R_{Earth}$ of the lower edge of the radius valley still retain H$_2$O atmospheres. This prediction is imminently testable with JWST and has implications for the interpretation of data for transiting super-Earths.
Submitted 13 March, 2021;
originally announced March 2021.
-
Magneto-electric Tuning of Pinning-Type Permanent Magnets through Atomic-Scale Engineering of Grain Boundaries
Authors:
Xinglong Ye,
Fengkai Yan,
Lukas Schaefer,
Di Wang,
Holger Geßwein,
Wu Wang,
Mohammed Reda Chellali,
Leigh T. Stephenson,
Konstantin Skokov,
Oliver Gutfleisch,
Dierk Raabe,
Horst Hahn,
Baptiste Gault,
Robert Kruk
Abstract:
Pinning-type magnets maintaining high coercivity, i.e. the ability to sustain magnetization, at high temperature are at the core of thriving clean-energy technologies. Among these, Sm2Co17-based magnets are excellent candidates owing to their high-temperature stability. However, despite decades of efforts to optimize the intragranular microstructure, the coercivity currently only reaches 20~30% of the theoretical limits. Here, the roles of the grain-interior nanostructure and the grain boundaries in controlling coercivity are disentangled by an emerging magneto-electric approach. Through hydrogen charging/discharging by applying voltages of only ~ 1 V, the coercivity is reversibly tuned by an unprecedented value of ~ 1.3 T. In situ magneto-structural measurements and atomic-scale tracking of hydrogen atoms reveal that the segregation of hydrogen atoms at the grain boundaries, rather than the change of the crystal structure, dominates the reversible and substantial change of coercivity. Hydrogen lowers the local magnetocrystalline anisotropy and facilitates the magnetization reversal starting from the grain boundaries. Our study reveals the previously neglected critical role of grain boundaries in the conventional magnetisation-switching paradigm, suggesting a critical reconsideration of strategies to overcome the coercivity limits in permanent magnets, via for instance atomic-scale grain boundary engineering.
Submitted 10 February, 2021;
originally announced February 2021.
-
Critical Risk Indicators (CRIs) for the electric power grid: A survey and discussion of interconnected effects
Authors:
Judy P. Che-Castaldo,
Rémi Cousin,
Stefani Daryanto,
Grace Deng,
Mei-Ling E. Feng,
Rajesh K. Gupta,
Dezhi Hong,
Ryan M. McGranaghan,
Olukunle O. Owolabi,
Tianyi Qu,
Wei Ren,
Toryn L. J. Schafer,
Ashutosh Sharma,
Chaopeng Shen,
Mila Getmansky Sherman,
Deborah A. Sunter,
Lan Wang,
David S. Matteson
Abstract:
The electric power grid is a critical societal resource connecting multiple infrastructural domains such as agriculture, transportation, and manufacturing. The electrical grid as an infrastructure is shaped by human activity and public policy in terms of demand and supply requirements. Further, the grid is subject to changes and stresses due to solar weather, climate, hydrology, and ecology. The emerging interconnected and complex network dependencies make such interactions increasingly dynamic, causing potentially large swings and thus presenting new challenges for managing the coupled human-natural system. This paper provides a survey of models and methods that seek to explore the significant interconnected impact of the electric power grid and interdependent domains. We also provide relevant critical risk indicators (CRIs) across diverse domains that may influence electric power grid risks, including climate, ecology, hydrology, finance, space weather, and agriculture. We discuss the convergence of indicators from individual domains to explore possible systemic risk, i.e., holistic risk arising from cross-domain interconnections. Our study provides an important first step towards data-driven analysis and predictive modeling of risks in the coupled interconnected systems. Further, we propose a compositional approach to risk assessment that incorporates diverse domain expertise and information, data science, and computer science to identify domain-specific CRIs and their union in systemic risk indicators.
Submitted 9 June, 2021; v1 submitted 19 January, 2021;
originally announced January 2021.
-
SDSS-IV/MaNGA: Can impulsive gaseous inflows explain steep oxygen abundance profiles & anomalously-low-metallicity regions?
Authors:
Zachary J. Pace,
Christy Tremonti,
Adam L. Schaefer,
David V. Stark,
Catherine A. Witherspoon,
Karen L. Masters,
Niv Drory,
Kai Zhang
Abstract:
Gaseous inflows are necessary suppliers of galaxies' star-forming fuel, but are difficult to characterize at the survey scale. We use integral-field spectroscopic measurements of gas-phase metallicity and single-dish radio measurements of total atomic gas mass to estimate the magnitude and frequency of gaseous inflows incident on star-forming galaxies. We reveal a mutual correlation between steep oxygen abundance profiles between $0.25-1.5 R_e$, increased variability of metallicity between $1.25-1.75 R_e$, and elevated HI content at fixed total galaxy stellar mass. Employing a simple but intuitive inflow model, we find that galaxies with total stellar mass less than $10^{10.1} {\rm M_{\odot}}$ have local oxygen abundance profiles consistent with reinvigoration by inflows. Approximately 10-25% of low-mass galaxies possess signatures of recent accretion, with estimated typical enhancements of approximately 10-90% in local gas mass surface density. Higher-mass galaxies have limited evidence for such inflows. The large diversity of HI mass implies that inflow-associated gas ought to reside far from the star-forming disk. We therefore propose that a combination of high HI mass, steep metallicity profile between $0.25-1.5 R_e$, and wide metallicity distribution function between $1.25 - 1.75 R_e$ be employed to target possible hosts of inflowing gas for high-resolution radio follow-up.
Submitted 23 December, 2020;
originally announced December 2020.
-
Trend and Variance Adaptive Bayesian Changepoint Analysis & Local Outlier Scoring
Authors:
Haoxuan Wu,
Toryn L. J. Schafer,
David S. Matteson
Abstract:
We adaptively estimate both changepoints and local outlier processes in a Bayesian dynamic linear model with global-local shrinkage priors in a novel model we call Adaptive Bayesian Changepoints with Outliers (ABCO). We utilize a state-space approach to identify a dynamic signal in the presence of outliers and measurement error with stochastic volatility. We find that global state equation parameters are inadequate for most real applications and we include local parameters to track noise at each time-step. This setup provides a flexible framework to detect unspecified changepoints in complex series, such as those with large interruptions in local trends, with robustness to outliers and heteroskedastic noise. Finally, we compare our algorithm against several alternatives to demonstrate its efficacy in diverse simulation scenarios and two empirical examples on the U.S. economy.
Submitted 13 March, 2024; v1 submitted 18 November, 2020;
originally announced November 2020.
-
On the Bicriterion Maximum Flow Network Interdiction Problem
Authors:
Luca E. Schäfer,
Stefan Ruzika,
Sven O. Krumke,
Carlos M. Fonseca
Abstract:
This article focuses on a biobjective extension of the maximum flow network interdiction problem, where each arc in the network is associated with two capacity values. Two maximum flows from a source to a sink are to be computed independently of each other with respect to the first and second capacity function, respectively, while an interdictor aims to minimize the value of both maximum flows by interdicting arcs. We show that this problem is intractable and that the decision problem, which asks whether or not a feasible interdiction strategy is efficient, is NP-complete. We propose a pseudopolynomial time algorithm in the case of two-terminal series-parallel graphs and positive integer-valued interdiction costs. We extend this algorithm to a fully polynomial-time approximation scheme for the case of unit interdiction costs by appropriately partitioning the objective space.
Submitted 8 October, 2020; v1 submitted 6 October, 2020;
originally announced October 2020.
-
Bayesian Inverse Reinforcement Learning for Collective Animal Movement
Authors:
Toryn L. J. Schafer,
Christopher K. Wikle,
Mevin B. Hooten
Abstract:
Agent-based methods allow for defining simple rules that generate complex group behaviors. The governing rules of such models are typically set a priori and parameters are tuned from observed behavior trajectories. Instead of making simplifying assumptions across all anticipated scenarios, inverse reinforcement learning provides inference on the short-term (local) rules governing long-term behavior policies by using properties of a Markov decision process. We use the computationally efficient linearly-solvable Markov decision process to learn the local rules governing collective movement for a simulation of the self-propelled particle (SPP) model and a data application for a captive guppy population. The estimation of the behavioral decision costs is done in a Bayesian framework with basis function smoothing. We recover the true costs in the SPP simulation and find that the guppies value collective movement more than targeted movement toward shelter.
Submitted 11 June, 2022; v1 submitted 8 September, 2020;
originally announced September 2020.
-
Exogeoscience and Its Role in Characterizing Exoplanet Habitability and the Detectability of Life
Authors:
Cayman T. Unterborn,
Paul K. Byrne,
Ariel D. Anbar,
Giada Arney,
David Brain,
Steve J. Desch,
Bradford J. Foley,
Martha S. Gilmore,
Hilairy E. Hartnett,
Wade G. Henning,
Marc M. Hirschmann,
Noam R. Izenberg,
Stephen R. Kane,
Edwin S. Kite,
Laura Kreidberg,
Kanani K. M. Lee,
Timothy W. Lyons,
Stephanie L. Olson,
Wendy R. Panero,
Noah J. Planavsky,
Christopher T. Reinhard,
Joseph P. Renaud,
Laura K. Schaefer,
Edward W. Schwieterman,
Linda E. Sohl
, et al. (2 additional authors not shown)
Abstract:
The search for exoplanetary life must encompass the complex geological processes reflected in an exoplanet's atmosphere, or we risk reporting false positive and false negative detections. To do this, we must nurture the nascent discipline of "exogeoscience" to fully integrate astronomers, astrophysicists, geoscientists, oceanographers, atmospheric chemists and biologists. Increased funding for interdisciplinary research programs, supporting existing and future multidisciplinary research nodes, and developing research incubators is key to transforming true exogeoscience from an aspiration to a reality.
Submitted 23 July, 2020; v1 submitted 16 July, 2020;
originally announced July 2020.
-
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Authors:
Georgios Papoudakis,
Filippos Christianos,
Lukas Schäfer,
Stefano V. Albrecht
Abstract:
Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used evaluation tasks and criteria, making comparisons between approaches difficult. In this work, we provide a systematic evaluation and comparison of three different classes of MARL algorithms (independent learning, centralised multi-agent policy gradient, value decomposition) in a diverse range of cooperative multi-agent learning tasks. Our experiments serve as a reference for the expected performance of algorithms across different learning tasks, and we provide insights regarding the effectiveness of different learning approaches. We open-source EPyMARL, which extends the PyMARL codebase to include additional algorithms and allow for flexible configuration of algorithm implementation details such as parameter sharing. Finally, we open-source two environments for multi-agent research which focus on coordination under sparse rewards.
Submitted 9 November, 2021; v1 submitted 14 June, 2020;
originally announced June 2020.
-
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Authors:
Filippos Christianos,
Lukas Schäfer,
Stefano V. Albrecht
Abstract:
Exploration in multi-agent reinforcement learning is a challenging problem, especially in environments with sparse rewards. We propose a general method for efficient exploration by sharing experience amongst agents. Our proposed algorithm, called Shared Experience Actor-Critic (SEAC), applies experience sharing in an actor-critic framework. We evaluate SEAC in a collection of sparse-reward multi-agent environments and find that it consistently outperforms two baselines and two state-of-the-art algorithms by learning in fewer steps and converging to higher returns. In some harder environments, experience sharing makes the difference between learning to solve the task and not learning at all.
Submitted 19 May, 2021; v1 submitted 12 June, 2020;
originally announced June 2020.
-
Distributed lag models to identify the cumulative effects of training and recovery in athletes using multivariate ordinal wellness data
Authors:
Erin M. Schliep,
Toryn L. J. Schafer,
Matthew Hawkey
Abstract:
Subjective wellness data can provide important information on the well-being of athletes and be used to maximize player performance and to detect and prevent injury. Wellness data, which are often ordinal and multivariate, include metrics relating to the physical, mental, and emotional status of the athlete. Training and recovery can have significant short- and long-term effects on athlete wellness, and these effects can vary across individuals. We develop a joint multivariate latent factor model for ordinal response data to investigate the effects of training and recovery on athlete wellness. We use a latent factor distributed lag model to capture the cumulative effects of training and recovery through time. Current efforts using subjective wellness data have averaged over these metrics to create a univariate summary of wellness; however, this approach can mask important information in the data. Our multivariate model leverages each ordinal variable and can be used to identify the relative importance of each in monitoring athlete wellness. The model is applied to athlete daily wellness, training, and recovery data collected across two Major League Soccer seasons.
Submitted 18 May, 2020;
originally announced May 2020.
-
The two player shortest path network interdiction problem
Authors:
Simon Busam,
Luca E. Schäfer,
Stefan Ruzika
Abstract:
In this article, we study a biobjective extension of the shortest path network interdiction problem. Each arc in the network is associated with two integer length values, and two players compute their respective shortest paths from source to sink independently of each other while an interdictor tries to lengthen both shortest paths by removing arcs. We show that this problem is intractable and that deciding whether a feasible interdiction strategy is efficient is NP-complete. We provide a solution procedure to solve the problem on two-terminal series-parallel graphs in pseudopolynomial time.
Submitted 23 August, 2020; v1 submitted 17 April, 2020;
originally announced April 2020.