-
Applying computational protein design to therapeutic antibody discovery -- current state and perspectives
Authors:
Weronika Bielska,
Igor Jaszczyszyn,
Pawel Dudzic,
Bartosz Janusz,
Dawid Chomicz,
Sonia Wrobel,
Victor Greiff,
Ryan Feehan,
Jared Adolf-Bryfogle,
Konrad Krawczyk
Abstract:
Machine learning applications in protein sciences have ushered in a new era for designing molecules in silico. Antibodies, which currently form the largest group of biologics in clinical use, stand to benefit greatly from this shift. Despite the proliferation of these protein design tools, their direct application to antibodies is often limited by the unique structural biology of these molecules. Here, we review the current computational methods for antibody design, highlighting their role in advancing computational drug discovery.
Submitted 2 March, 2025;
originally announced March 2025.
-
Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions
Authors:
Sujan Sai Gannamaneni,
Rohil Prakash Rao,
Michael Mock,
Maram Akila,
Stefan Wrobel
Abstract:
Slice discovery methods (SDMs) are prominent algorithms for finding systematic weaknesses in DNNs. They identify top-k semantically coherent slices/subsets of data where a DNN-under-test has low performance. To be directly useful, slices should be aligned with human-understandable and relevant dimensions, which, for example, are defined by safety and domain experts as part of the operational design domain (ODD). While SDMs can be applied effectively to structured data, their application to image data is complicated by the lack of semantic metadata. To address these issues, we present an algorithm that combines foundation models for zero-shot image classification to generate semantic metadata with methods for combinatorial search to find systematic weaknesses in images. In contrast to existing approaches, ours identifies weak slices that are in line with pre-defined human-understandable dimensions. As the algorithm includes foundation models, its intermediate and final results may not always be exact. Therefore, we include an approach to address the impact of noisy metadata. We validate our algorithm on both synthetic and real-world datasets, demonstrating its ability to recover human-understandable systematic weaknesses. Furthermore, using our approach, we identify systematic weaknesses of multiple pre-trained and publicly available state-of-the-art computer vision DNNs.
Submitted 6 March, 2025; v1 submitted 17 February, 2025;
originally announced February 2025.
-
Reinforcement Learning for Efficient Returns Management
Authors:
Pascal Linden,
Nathalie Paul,
Tim Wirtz,
Stefan Wrobel
Abstract:
In retail warehouses, returned products are typically placed in intermediate storage until a decision regarding further shipment to stores is made. The longer products are held in storage, the higher the inefficiency and costs of the returns management process, since enough storage area has to be provided and maintained while the products are not placed for sale. To reduce the average product storage time, we consider an alternative solution where reallocation decisions for products can be made instantly upon their arrival in the warehouse, allowing only a limited number of products to still be stored simultaneously. We cast the problem as an online multiple knapsack problem and propose a novel reinforcement learning approach to pack the items (products) into the knapsacks (stores) such that the overall value (expected revenue) is maximized. Empirical evaluations on simulated data demonstrate that, compared to the usual offline decision procedure, our approach comes with a performance gap of only 3% while significantly reducing the average storage time of a product by 96%.
Submitted 24 January, 2025;
originally announced January 2025.
-
Efficient training of machine learning potentials for metallic glasses: CuZrAl validation
Authors:
Antoni Wadowski,
Anshul D. S. Parmar,
Jesper Byggmästar,
Jan S. Wróbel,
Mikko J. Alava,
Silvia Bonfanti
Abstract:
Interatomic potentials play a vital role in revealing microscopic details and structure-property relations, which are fundamental for multiscale simulations and for assisting high-throughput experiments. For metallic glasses, developing these potentials is challenging due to the complexity of their unique disordered structure. As a result, chemistry-specific interaction potentials for this important class of materials are often missing. Here, we address this gap by implementing an efficient methodology for designing machine learning interatomic potentials (MLIPs) for metallic glasses, and we benchmark it with the widely studied CuZrAl system. By combining a Lennard-Jones surrogate model with swap-Monte Carlo sampling and Density Functional Theory (DFT) corrections, we capture diverse amorphous structures from 14 decades of supercooling. These distinct structures provide robust and efficient training of the model and applicability to a wider spectrum of energies. This approach reduces the need for extensive DFT and ab initio optimization datasets, while maintaining high accuracy. Our MLIP shows results comparable to the classical Embedded Atom Method (EAM) available for CuZrAl, in predicting structural, energetic, and mechanical properties. This work paves the way for the development of new MLIPs for complex metallic glasses, including emerging multicomponent and high entropy metallic glasses.
Submitted 31 December, 2024;
originally announced January 2025.
-
Segregation, ordering, and precipitation in refractory alloys
Authors:
Jesper Byggmästar,
Damian Sobieraj,
Jan S. Wróbel,
Daniel K. Schreiber,
Osman El-Atwani,
Enrique Martinez,
Duc Nguyen-Manh
Abstract:
Tungsten-based low-activation high-entropy alloys are possible candidates for next-generation fusion reactors due to their exceptional tolerance to irradiation, thermal loads, and stress. We develop an accurate and efficient machine-learned interatomic potential for the W-Ta-Cr-V system and use it in hybrid Monte Carlo molecular dynamics simulations of ordering and segregation to all common types of defects in WTaCrV. The predictions are compared to atom probe tomography analysis of segregation and precipitation in WTaCrV thin films. By also considering two other alloys, WTaV and MoNbTaVW, we are able to draw general conclusions about preferred segregation in refractory alloys and the reasons behind it, guiding future alloy design and elucidating experimental observations. We show that the experimentally observed CrV precipitates in WTaCrV form semicoherent bcc-to-bcc interfaces with the surrounding matrix, as coherent precipitates are not thermodynamically stable due to excessive lattice mismatch. The predictions from simulations align well with our atom probe tomography analysis as well as previous experimental observations.
Submitted 18 December, 2024;
originally announced December 2024.
-
Enhancing Irradiation Resistance in Refractory Medium Entropy Alloys with Simplified Chemistry
Authors:
M. A. Tunes,
D. Parkison,
B. Sun,
P. Willenshofer,
S. Samberger,
B. K. Derby,
J. K. S. Baldwin,
S. J. Fensin,
D. Sobieraj,
J. S. Wróbel,
J. Byggmästar,
S. Pogatscher,
E. Martinez,
D. Nguyen-Manh,
O. El-Atwani
Abstract:
Refractory High-Entropy Alloys (RHEAs) hold promising potential to be used as structural materials in future nuclear fusion reactors, where W and its alloys are currently leading candidates. Fusion materials must be able to withstand extreme conditions, such as (i) severe radiation damage arising from highly energetic neutrons, (ii) embrittlement caused by implantation of H and He ions, and (iii) exposure to extremely high temperatures and thermal gradients. Recent research demonstrated that two RHEAs, WTaCrV and WTaCrVHf, can outperform both coarse-grained and nanocrystalline W in terms of their radiation response and microstructural stability. Chemical complexity and nanocrystallinity enhance the radiation tolerance of these new RHEAs, but their multi-element nature, including low-melting Cr, complicates bulk fabrication and limits practical applications. We demonstrate that reducing the number of alloying elements while retaining high radiation tolerance is possible within the ternary system W-Ta-V via synthesis of two novel nanocrystalline refractory medium-entropy alloys (RMEAs): W$_{53}$Ta$_{44}$V$_{3}$ and W$_{53}$Ta$_{42}$V$_{5}$ (in at.\%). We experimentally show that the radiation response of the W-Ta-V system can be tailored by small additions of V, and this experimental result was validated by theoretical analysis of chemical short-range order (CSRO) from combined ab initio atomistic Monte Carlo modeling. Computational analysis predicts that a small change in V concentration has a significant effect on the Ta-V CSRO between W$_{53}$Ta$_{44}$V$_{3}$ and W$_{53}$Ta$_{42}$V$_{5}$, leading to radiation-resistant microstructures in these RMEAs from a chemistry standpoint. We deviate from the original high-entropy alloy concept to show that high radiation resistance can be achieved in systems with simplified chemical complexity.
Submitted 21 June, 2024;
originally announced June 2024.
-
Constraint programming methods in three-dimensional container packing
Authors:
Szymon Wróbel
Abstract:
Cutting and packing problems arise in many areas that are, at first glance, unconnected; it is therefore beneficial to have a good understanding of their underlying structure in order to select proper techniques for finding solutions. Cutting and packing problems are a class of combinatorial problems in which two classes of objects are specified, big and small items, and the task is to place the small items within the big items. Even in the 1-dimensional case, bin-packing is strongly NP-hard (Garey 1978), which suggests that exact solutions may not be found in a reasonable time for bigger instances. Many approaches to packing problems have been presented in the literature, e.g. mixed-integer programming, approximation algorithms, heuristic solutions, and local search algorithms, including metaheuristic approaches like Tabu Search or Simulated Annealing.
The main goal of this work is to review existing solutions, survey the variants arising from industry applications, present a solution based on constraint programming, and compare its performance with the results in the literature. Optimization with constraint programming is a method that searches for the global optimum; hence it may require a higher workload compared to the heuristic and local search approaches, which may finish in a local optimum. The performance of the presented model will be measured on test data that have been used in many articles presenting a variety of approaches to three-dimensional container packing, which will allow us to compare the efficiency of the constraint programming model with other methods used in operational research.
Submitted 9 November, 2023;
originally announced November 2023.
-
Guideline for Trustworthy Artificial Intelligence -- AI Assessment Catalog
Authors:
Maximilian Poretschkin,
Anna Schmitz,
Maram Akila,
Linara Adilova,
Daniel Becker,
Armin B. Cremers,
Dirk Hecker,
Sebastian Houben,
Michael Mock,
Julia Rosenzweig,
Joachim Sicking,
Elena Schulz,
Angelika Voss,
Stefan Wrobel
Abstract:
Artificial Intelligence (AI) has made impressive progress in recent years and represents a key technology that has a crucial impact on the economy and society. However, it is clear that AI and business models based on it can only reach their full potential if AI applications are developed according to high quality standards and are effectively protected against new AI risks. For instance, AI bears the risk of unfair treatment of individuals when processing personal data, e.g., to support credit lending or staff recruitment decisions. The emergence of these new risks is closely linked to the fact that the behavior of AI applications, particularly those based on Machine Learning (ML), is essentially learned from large volumes of data and is not predetermined by fixed programmed rules.
Thus, the issue of the trustworthiness of AI applications is crucial and is the subject of numerous major publications by stakeholders in politics, business and society. In addition, there is mutual agreement that the requirements for trustworthy AI, which are often described in an abstract way, must now be made clear and tangible. One challenge to overcome here relates to the fact that the specific quality criteria for an AI application depend heavily on the application context and possible measures to fulfill them in turn depend heavily on the AI technology used. Lastly, practical assessment procedures are needed to evaluate whether specific AI applications have been developed according to adequate quality standards. This AI assessment catalog addresses exactly this point and is intended for two target groups: Firstly, it provides developers with a guideline for systematically making their AI applications trustworthy. Secondly, it guides assessors and auditors on how to examine AI applications for trustworthiness in a structured way.
Submitted 20 June, 2023;
originally announced July 2023.
-
Robustness in Fatigue Strength Estimation
Authors:
Dorina Weichert,
Alexander Kister,
Sebastian Houben,
Gunar Ernis,
Stefan Wrobel
Abstract:
Fatigue strength estimation is a costly manual material characterization process in which state-of-the-art approaches follow a standardized experiment and analysis procedure. In this paper, we examine a modular, Machine Learning-based approach for fatigue strength estimation that is likely to reduce the number of experiments and, thus, the overall experimental costs. Despite its high potential, deployment of a new approach in a real-life lab requires more than the theoretical definition and simulation. Therefore, we study the robustness of the approach against misspecification of the prior and discretization of the specified loads. We identify its applicability and its advantageous behavior over the state-of-the-art methods, potentially reducing the number of costly experiments.
Submitted 2 December, 2022;
originally announced December 2022.
-
An innovative materials design protocol for the development of novel refractory high-entropy alloys for extreme environments
Authors:
O. El Atwani,
H. T. Vo,
M. Tunes,
C. Lee,
A. Alvarado,
N. Krienke,
J. D. Poplawsky,
A. A. Kohnert,
J. Gigax,
W. -Y. Chen,
M. Li,
Y. Wang,
J. S. Wróbel,
Duc Nguyen-Manh,
J. K. S. Baldwin,
U. Tukac,
E. Aydogan,
S. Fensin,
E. Martinez
Abstract:
In the quest for new materials that can withstand severe irradiation and mechanical extremes for advanced applications (e.g. fission reactors, fusion devices, space applications, etc.), design, prediction and control of advanced materials beyond current material designs become a paramount goal. Here, through a combined experimental and simulation methodology, the design of a new nanocrystalline refractory high entropy alloy (RHEA) system is established. Compositions of this alloy, assessed under extreme environments and in situ electron-microscopy, revealed both high mechanical strength and thermal stability, grain refinement under heavy ion irradiation and outstanding irradiation resistance to dual-beam irradiation and helium implantation, marked by remarkable resistance to defect generation, growth and coalescence. The experimental and modeling results, which demonstrated notable agreement, can be applied to design and rapidly assess other alloys subjected to extreme environmental conditions.
Submitted 28 October, 2022;
originally announced October 2022.
-
First-principles analysis of the Al-rich corner of Al-Li-Cu phase diagram
Authors:
S. Liu,
J. S. Wróbel,
J. LLorca
Abstract:
The phase diagram of the Al-Li-Cu system in the Al-rich region was determined by means of first-principles calculations and statistical mechanics. The mixing enthalpies of many configurations for different lattices in the whole Al-Li-Cu system were determined by density functional theory simulations to find the stable phases in the convex hull. They were fitted with a cluster expansion to calculate the free energy of the configurations with different compositions as a function of temperature in the Al-rich region (Al content > 40 at. %) by means of Monte Carlo simulations. It was found that the ground state phases in the Al-rich part of the Al-Li-Cu phase diagram were α-Al, θ' (Al2Cu), δ' (Al3Li), δ (AlLi) and T1 (Al6Cu4Li3), while θ'' (Al3Cu), T1' (Al2CuLi) and Al3Cu2Li were found on the lowest mixing enthalpy surfaces of their lattices and were metastable. α-Al, δ and T1 are stable phases in the whole temperature range while δ' becomes metastable at very low temperature and θ (Al2Cu) replaces θ' as the stable phase at approximately 550 K due to the vibrational entropic contribution. In addition, the phase diagram in the Al-rich region was built and shown in isothermal sections from 100 K to 900 K. These sections were in good agreement with the limited experimental data in the literature and provided new information regarding the stability, solubility and stoichiometry of the different phases. This information is important to understand the precipitation mechanisms during high temperature aging.
Submitted 28 June, 2022;
originally announced June 2022.
-
A Fast Heuristic for Computing Geodesic Cores in Large Networks
Authors:
Florian Seiffarth,
Tamás Horváth,
Stefan Wrobel
Abstract:
Motivated by the increasing interest in applications of graph geodesic convexity in machine learning and data mining, we present a heuristic for computing the geodesic convex hull of node sets in networks. It generates a set of almost maximal outerplanar spanning subgraphs for the input graph, computes the geodesic closure in each of these graphs, and regards a node as an element of the convex hull if it belongs to the closed sets for at least a user-specified number of outerplanar graphs. Our heuristic algorithm runs in time linear in the number of edges of the input graph, i.e., it is faster by one order of magnitude than the standard algorithm computing the closure exactly. Its performance is evaluated empirically by approximating convexity-based core-periphery decomposition of networks. Our experimental results with large real-world networks show that for most networks, the proposed heuristic was able to produce close approximations significantly faster than the standard algorithm computing the exact convex hulls. For example, while our algorithm calculated an approximate core-periphery decomposition in 5 hours or less for networks with more than 20 million edges, the standard algorithm did not terminate within 50 days.
Submitted 15 June, 2022;
originally announced June 2022.
-
Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs
Authors:
Nathalie Paul,
Tim Wirtz,
Stefan Wrobel,
Alexander Kister
Abstract:
We interpret solving the multi-vehicle routing problem as a team Markov game with partially observable costs. For a given set of customers to serve, the playing agents (vehicles) have the common goal of determining the team-optimal agent routes with minimal total cost. Each agent thereby observes only its own cost. Our multi-agent reinforcement learning approach, the so-called multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to solve the problem by iteratively rewriting solutions. Parallel agent action execution and partial observability require new rewriting rules for the game. We propose the introduction of a so-called pool in the system which serves as a collection point for unvisited nodes. It enables agents to act simultaneously and exchange nodes in a conflict-free manner. We realize limited disclosure of agent-specific costs by only sharing them during learning. During inference, each agent acts in a decentralized manner, solely based on its own cost. First empirical results on small problem sizes demonstrate that we reach a performance close to the employed OR-Tools benchmark which operates in the perfect cost information setting.
Submitted 13 June, 2022;
originally announced June 2022.
-
Tailored Uncertainty Estimation for Deep Learning Systems
Authors:
Joachim Sicking,
Maram Akila,
Jan David Schneider,
Fabian Hüger,
Peter Schlicht,
Tim Wirtz,
Stefan Wrobel
Abstract:
Uncertainty estimation bears the potential to make deep learning (DL) systems more reliable. Standard techniques for uncertainty estimation, however, come with specific combinations of strengths and weaknesses, e.g., with respect to estimation quality, generalization abilities and computational complexity. To actually harness the potential of uncertainty quantification, estimators are required whose properties closely match the requirements of a given use case. In this work, we propose a framework that, firstly, structures and shapes these requirements, secondly, guides the selection of a suitable uncertainty estimation method and, thirdly, provides strategies to validate this choice and to uncover structural weaknesses. By contributing tailored uncertainty estimation in this sense, our framework helps to foster trustworthy DL systems. Moreover, it anticipates prospective machine learning regulations that require, e.g., in the EU, evidence for the technical appropriateness of machine learning systems. Our framework provides such evidence for system components modeling uncertainty.
Submitted 29 April, 2022;
originally announced April 2022.
-
Graph Filtration Kernels
Authors:
Till Hendrik Schulz,
Pascal Welke,
Stefan Wrobel
Abstract:
The majority of popular graph kernels are based on the concept of Haussler's $\mathcal{R}$-convolution kernel and define graph similarities in terms of mutual substructures. In this work, we enrich these similarity measures by considering graph filtrations: Using meaningful orders on the set of edges, which allow the construction of a sequence of nested graphs, we can consider a graph at multiple granularities. For one thing, this provides access to features on different levels of resolution. Furthermore, rather than simply comparing frequencies of features in graphs, it allows for their comparison in terms of when and for how long they exist in the sequences. In this work, we propose a family of graph kernels that incorporate these existence intervals of features. While our approach can be applied to arbitrary graph features, we particularly highlight Weisfeiler-Lehman vertex labels, leading to efficient kernels. We show that using Weisfeiler-Lehman labels over certain filtrations strictly increases the expressive power over the ordinary Weisfeiler-Lehman procedure in terms of deciding graph isomorphism. In fact, this result directly yields more powerful graph kernels based on such features and has implications for graph neural networks due to their close relationship to the Weisfeiler-Lehman method. We empirically validate the expressive power of our graph kernels and show significant improvements over state-of-the-art graph kernels in terms of predictive performance on various real-world benchmark datasets.
Submitted 22 October, 2021;
originally announced October 2021.
-
Learning Weakly Convex Sets in Metric Spaces
Authors:
Eike Stadtländer,
Tamás Horváth,
Stefan Wrobel
Abstract:
One of the central problems studied in the theory of machine learning is the question of whether, for a given class of hypotheses, it is possible to efficiently find a consistent hypothesis, i.e., one that has zero training error. While problems involving convex hypotheses have been extensively studied, the question of whether efficient learning is possible for non-convex hypotheses composed of possibly several disconnected regions is still less understood. Although it was shown quite a while ago that efficient learning of weakly convex hypotheses, a parameterized relaxation of convex hypotheses, is possible for the special case of Boolean functions, the question of whether this idea can be developed into a generic paradigm has not been studied yet. In this paper, we provide a positive answer and show that the consistent hypothesis finding problem can indeed be solved in polynomial time for a broad class of weakly convex hypotheses over metric spaces. To this end, we propose a general domain-independent algorithm for finding consistent weakly convex hypotheses and prove sufficient conditions for its efficiency that characterize the corresponding hypothesis classes. To illustrate our general algorithm and its properties, we discuss several non-trivial learning examples to demonstrate how it can be used to efficiently solve the corresponding consistent hypothesis finding problem. Without the weak convexity constraint, these problems are known to be computationally intractable. We then proceed to show that the general idea of our algorithm can even be extended to the case of extensional weakly convex hypotheses, as they naturally arise, e.g., when performing vertex classification in graphs. We prove that using our extended algorithm, the problem can be solved in polynomial time provided the distances in the domain can be computed efficiently.
Submitted 19 March, 2024; v1 submitted 10 May, 2021;
originally announced May 2021.
-
First principles model for voids decorated by transmutation solutes: Short-range order effects and application to neutron irradiated tungsten
Authors:
Duc Nguyen-Manh,
Jan S. Wrobel,
Michael Klimenkov,
Matthew J. Lloyd,
Luca Messina,
Sergei L. Dudarev
Abstract:
Understanding how properties of materials change due to nuclear transmutations is a major challenge for the design of structural components for a fusion power plant. In this study, by combining a first-principles matrix Hamiltonian approach with thermodynamic integration, we investigate quasi-steady state configurations of multi-component alloys, containing defects, over a broad range of temperature and composition. The model enables simulating transmutation-induced segregation effects in materials, including tungsten where the phenomenon is strongly pronounced. Finite-temperature analysis shows that voids are decorated by Re and Os, but there is no decoration by tantalum (Ta). The difference between the elements is correlated with the sign of the short range order (SRO) parameter between impurity and vacancy species, in agreement with Atom Probe Tomography (APT) observations of irradiated W-Re, W-Os, W-Ta alloys in the solid solution limit. Statistical analyses of Re and Os impurities in vacancy-rich tungsten show that the SRO effects involving the two solutes are highly sensitive to the background concentration of the species. In quaternary W-Re-Os-Vac alloys containing 1.5% Re and 0.1% Os, the SRO Re-Os parameter is negative at 1200 K, driving the formation of concentrated Re and Os precipitates. Comparison with experimental Transmission Electron Microscopy (TEM) and APT data on W samples irradiated at the High Flux Reactor (HFR) shows that the model explains the origin of anomalous segregation of transmutation products (Re, Os) to vacancy clusters and voids in the high temperature limit pertinent to the operating conditions of a fusion power plant.
Submitted 2 February, 2021;
originally announced February 2021.
-
A Generalized Weisfeiler-Lehman Graph Kernel
Authors:
Till Hendrik Schulz,
Tamás Horváth,
Pascal Welke,
Stefan Wrobel
Abstract:
The Weisfeiler-Lehman graph kernels are among the most prevalent graph kernels due to their remarkable time complexity and predictive performance. Their key concept is based on an implicit comparison of neighborhood-representing trees with respect to equality (i.e., isomorphism). This binary-valued comparison is, however, arguably too rigid for defining suitable similarity measures over graphs. To overcome this limitation, we propose a generalization of Weisfeiler-Lehman graph kernels which takes into account the similarity between trees rather than equality. We achieve this using a specifically fitted variation of the well-known tree edit distance which can be calculated efficiently. We empirically show that our approach significantly outperforms state-of-the-art methods in terms of predictive performance on datasets containing structurally more complex graphs beyond the typically considered molecular graphs.
Submitted 20 January, 2021;
originally announced January 2021.
-
A Novel Regression Loss for Non-Parametric Uncertainty Optimization
Authors:
Joachim Sicking,
Maram Akila,
Maximilian Pintz,
Tim Wirtz,
Asja Fischer,
Stefan Wrobel
Abstract:
Quantification of uncertainty is one of the most promising approaches to establish safe machine learning. Despite its importance, it is far from being generally solved, especially for neural networks. One of the most commonly used approaches so far is Monte Carlo dropout, which is computationally cheap and easy to apply in practice. However, it can underestimate the uncertainty. We propose a new objective, referred to as second-moment loss (SML), to address this issue. While the full network is encouraged to model the mean, the dropout networks are explicitly used to optimize the model variance. We intensively study the performance of the new objective on various UCI regression datasets. Compared to state-of-the-art deep ensembles, SML leads to comparable prediction accuracies and uncertainty estimates while only requiring a single model. Under distribution shift, we observe moderate improvements. As a side result, we introduce an intuitive Wasserstein distance-based uncertainty measure that is non-saturating and thus allows resolving quality differences between any two uncertainty estimates.
Submitted 7 January, 2021;
originally announced January 2021.
-
Wasserstein Dropout
Authors:
Joachim Sicking,
Maram Akila,
Maximilian Pintz,
Tim Wirtz,
Asja Fischer,
Stefan Wrobel
Abstract:
Despite its importance for safe machine learning, uncertainty quantification for neural networks is far from being solved. State-of-the-art approaches to estimate neural uncertainties are often hybrid, combining parametric models with explicit or implicit (dropout-based) ensembling. We take another pathway and propose a novel approach to uncertainty quantification for regression tasks, Wasserstein dropout, that is purely non-parametric. Technically, it captures aleatoric uncertainty by means of dropout-based sub-network distributions. This is accomplished by a new objective which minimizes the Wasserstein distance between the label distribution and the model distribution. An extensive empirical analysis shows that Wasserstein dropout outperforms state-of-the-art methods, on vanilla test data as well as under distributional shift, in terms of producing more accurate and stable uncertainty estimates.
Submitted 2 December, 2021; v1 submitted 23 December, 2020;
originally announced December 2020.
-
Elastic dipole tensors and relaxation volumes of point defects in concentrated random magnetic Fe-Cr alloys
Authors:
Jan S. Wróbel,
Marcin R. Zemła,
Duc Nguyen-Manh,
Pär Olsson,
Luca Messina,
Christophe Domain,
Tomasz Wejrzanowski,
Sergei L. Dudarev
Abstract:
Point defects in body-centred cubic Fe, Cr and concentrated random magnetic Fe-Cr are investigated using density functional theory and theory of elasticity. The volume of a substitutional Cr atom in ferromagnetic bcc Fe is approximately 18% larger than the volume of a host Fe atom, whereas the volume of a substitutional Fe atom in antiferromagnetic bcc Cr is 5% smaller than the volume of a host Cr atom. Elastic dipole $\boldsymbol{P}$ and relaxation volume $\boldsymbol{\Omega}$ tensors of vacancies and self-interstitial atom (SIA) defects exhibit large fluctuations, with vacancies having negative and SIA defects having large positive relaxation volumes. Dipole tensors of vacancies are nearly isotropic across the entire alloy composition range, with diagonal elements $P_{ii}$ decreasing as a function of Cr content. Fe-Fe and Fe-Cr SIA dumbbells are more anisotropic than Cr-Cr dumbbells. Fluctuations of elastic dipole tensors of SIA defects are primarily associated with the variable crystallographic orientations of the dumbbells. Statistical properties of tensors $\boldsymbol{P}$ and $\boldsymbol{\Omega}$ are analysed using their principal invariants, suggesting that point defects differ significantly in alloys containing below and above 10 at.% Cr. The relaxation volume of a vacancy depends sensitively on whether it occupies an Fe or a Cr lattice site. A correlation between elastic relaxation volumes and magnetic moments of defects found in this study suggests that magnetism is a significant factor influencing elastic fields of defects in Fe-Cr alloys.
Submitted 30 July, 2020;
originally announced July 2020.
-
Learning Syllogism with Euler Neural-Networks
Authors:
Tiansi Dong,
Chengjiang Li,
Christian Bauckhage,
Juanzi Li,
Stefan Wrobel,
Armin B. Cremers
Abstract:
Traditional neural networks represent everything as a vector, and are able to approximate a subset of logical reasoning to a certain degree. As basic logic relations are better represented by topological relations between regions, we propose a novel neural network that represents everything as a ball and is able to learn topological configurations as an Euler diagram. Hence the name Euler Neural-Network (ENN). The central vector of a ball is a vector that can inherit the representation power of a traditional neural network. ENN distinguishes four spatial statuses between balls, namely, being disconnected, being partially overlapped, being part of, and being inverse part of. Within each status, ideal values are defined for efficient reasoning. A novel back-propagation algorithm with six Rectified Spatial Units (ReSU) can optimize an Euler diagram representing logical premises, from which logical conclusions can be deduced. In contrast to traditional neural networks, ENN can precisely represent all 24 different structures of Syllogism. Two large datasets are created: one, extracted from WordNet-3.0, covers all types of Syllogism reasoning; the other contains all family relations extracted from DBpedia. Experimental results confirm the superior power of ENN in logical representation and reasoning. Datasets and source code are available upon request.
Submitted 20 July, 2020; v1 submitted 14 July, 2020;
originally announced July 2020.
-
Maximal Closed Set and Half-Space Separations in Finite Closure Systems
Authors:
Florian Seiffarth,
Tamas Horvath,
Stefan Wrobel
Abstract:
Several concept learning problems can be regarded as special cases of half-space separation in abstract closure systems over finite ground sets. For the typical scenario that the closure system is implicitly given via a closure operator, we show that the half-space separation problem is NP-complete. As a first approach to overcome this negative result, we relax the problem to maximal closed set separation, give a generic greedy algorithm solving this problem with a linear number of closure operator calls, and show that this bound is sharp. For a second direction, we consider Kakutani closure systems and prove that they are algorithmically characterized by the greedy algorithm. As a first special case of the general problem setting, we consider Kakutani closure systems over graphs and give a sufficient condition for this kind of closure system in terms of forbidden graph minors. For a second special case, we then focus on closure systems over finite lattices, give an improved adaptation of the generic greedy algorithm, and present an application concerning subsumption lattices.
Submitted 13 June, 2022; v1 submitted 13 January, 2020;
originally announced January 2020.
-
On discrete Hardy-Littlewood maximal functions over the balls in $\mathbb Z^d$: dimension-free estimates
Authors:
Jean Bourgain,
Mariusz Mirek,
Elias M. Stein,
Błażej Wróbel
Abstract:
We show that the discrete Hardy-Littlewood maximal functions associated with the Euclidean balls in $\mathbb Z^d$ with dyadic radii have bounds independent of the dimension on $\ell^p(\mathbb Z^d)$ for $p\in[2, \infty]$.
Submitted 2 November, 2019; v1 submitted 1 December, 2018;
originally announced December 2018.
-
Outstanding Radiation Resistance of Tungsten-based High Entropy Alloys
Authors:
O. El-Atwani,
N. Li,
M. Li,
A. Devaraj,
M. Schneider,
D. Sobieraj,
J. S. Wrobel,
D. D. Nguyen-Manh,
S. A. Maloy,
E. Martinez
Abstract:
A novel W-based refractory high entropy alloy with outstanding radiation resistance has been developed. The alloy was grown as thin films showing a bimodal grain size distribution in the nanocrystalline and ultrafine regimes and a unique 4 nm lamella-like structure revealed by atom probe tomography (APT). Transmission electron microscopy (TEM) and X-ray diffraction show an underlying body-centered cubic crystalline structure with certain black spots appearing after thermal annealing at elevated temperatures. Thorough analysis based on TEM and APT correlated the black spots with second phase particles rich in Cr and V. After both in situ and ex situ irradiation, these precipitates evolve to quasi-spherical particles with no sign of irradiation-created dislocation loops even after 8 dpa at either room temperature or 1073 K. Furthermore, nanomechanical testing shows a large hardness of 14 GPa in the as-deposited samples, with a slight increase after thermal annealing and almost negligible irradiation hardening. Theoretical modeling based on ab initio methodologies combined with Monte Carlo techniques predicts the formation of Cr and V rich second phase particles and points at equal mobilities of point defects as the origin of the exceptional radiation tolerance. The fact that these alloys are suitable for bulk production coupled with the exceptional radiation and mechanical properties makes them ideal structural materials for applications requiring extreme conditions.
Submitted 5 November, 2018;
originally announced November 2018.
-
Efficient Decentralized Deep Learning by Dynamic Model Averaging
Authors:
Michael Kamp,
Linara Adilova,
Joachim Sicking,
Fabian Hüger,
Peter Schlicht,
Tim Wirtz,
Stefan Wrobel
Abstract:
We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol handles different phases of model training equally well and adapts quickly to concept drifts. This leads to a reduction of communication by an order of magnitude compared to periodically communicating state-of-the-art approaches. Moreover, we derive a communication bound that scales well with the hardness of the serialized learning problem. The reduction in communication comes at almost no cost, as the predictive performance remains virtually unchanged. Indeed, the proposed protocol retains the loss bounds of periodically averaging schemes. An extensive empirical evaluation validates a major improvement in the trade-off between model performance and communication, which could be beneficial for numerous decentralized learning applications, such as autonomous driving, or voice recognition and image classification on mobile phones.
Submitted 13 November, 2018; v1 submitted 9 July, 2018;
originally announced July 2018.
-
Dense Pooling layers in Fully Convolutional Network for Skin Lesion Segmentation
Authors:
Ebrahim Nasr-Esfahani,
Shima Rafiei,
Mohammad H. Jafari,
Nader Karimi,
James S. Wrobel,
S. M. Reza Soroushmehr,
Shadrokh Samavi,
Kayvan Najarian
Abstract:
One of the essential tasks in medical image analysis is segmentation and accurate detection of borders. Lesion segmentation in skin images is an essential step in the computerized detection of skin cancer. However, many of the state-of-the-art segmentation methods have deficiencies in their border detection phase. In this paper, a new class of fully convolutional networks is proposed, with new dense pooling layers for segmentation of lesion regions in skin images. This network achieves highly accurate segmentation on skin lesion datasets, outperforming state-of-the-art algorithms in skin lesion segmentation.
Submitted 31 August, 2019; v1 submitted 29 December, 2017;
originally announced December 2017.
-
Dynamic Simulation of Structural Phase Transitions in Magnetic Iron
Authors:
Pui-Wai Ma,
S. L. Dudarev,
Jan S. Wróbel
Abstract:
The occurrence of bcc-fcc ($α$-$γ$) and fcc-bcc ($γ$-$δ$) phase transitions in magnetic iron stems from the interplay between magnetic excitations and lattice vibrations. However, this fact has never been proven by a direct dynamic simulation, treating non-collinear magnetic fluctuations and dynamics of atoms, and their coupling at a finite temperature. Starting from a large set of data generated by ab initio simulations, we derive non-collinear magnetic many-body potentials for bcc and fcc iron describing fluctuations in the vicinity of near-perfect lattice positions. We then use spin-lattice dynamics simulations to evaluate the difference between free energies of bcc and fcc phases, assessing their relative stability within a unified dynamic picture. We find two intersections between the bcc and fcc free energy curves, which correspond to the $α$-$γ$ bcc-fcc and $γ$-$δ$ fcc-bcc phase transitions. The maximum fcc-bcc free energy difference over the temperature interval between the two phase transition points is 2 meV, in agreement with other experimental and theoretical estimates.
Submitted 23 June, 2017;
originally announced June 2017.
-
Adiabatic Quantum Computing for Binary Clustering
Authors:
Christian Bauckhage,
Eduardo Brito,
Kostadin Cvejoski,
Cesar Ojeda,
Rafet Sifa,
Stefan Wrobel
Abstract:
Quantum computing for machine learning attracts increasing attention, and recent technological developments suggest that adiabatic quantum computing in particular may soon be of practical interest. In this paper, we therefore consider this paradigm and discuss how to adapt it to the problem of binary clustering. Numerical simulations demonstrate the feasibility of our approach and illustrate how systems of qubits adiabatically evolve towards a solution.
Submitted 17 June, 2017;
originally announced June 2017.
-
Short-range order in high entropy alloys: Theoretical formulation and application to Mo-Nb-Ta-V-W system
Authors:
A. Fernandez-Caballero,
J. S. Wrobel,
P. M. Mummery,
D. Nguyen-Manh
Abstract:
In high-entropy alloys (HEAs), the local chemical fluctuations from the disordered solid solution state into segregation, precipitation and ordering configurations are complex due to the large number of elements. In this work, the cluster expansion (CE) Hamiltonian for multi-component alloy systems is developed in order to investigate the temperature dependence of chemical ordering in HEAs, which arises from the deviation of the configuration entropy from that of the ideal solid solution. Analytic expressions for Warren-Cowley short-range order (SRO) parameters are derived for a five-component alloy system. The theoretical formulation is used to investigate the evolution of the ten different SRO parameters in the MoNbTaVW and the sub-quaternary systems obtained by Monte Carlo simulations within the combined CE and first-principles formalism.
Submitted 4 May, 2017;
originally announced May 2017.
-
Using Echo State Networks for Cryptography
Authors:
Rajkumar Ramamurthy,
Christian Bauckhage,
Krisztian Buza,
Stefan Wrobel
Abstract:
Echo state networks are simple recurrent neural networks that are easy to implement and train. Despite their simplicity, they show a form of memory and can predict or regenerate sequences of data. We make use of this property to realize a novel neural cryptography scheme. The key idea is to assume that Alice and Bob share a copy of an echo state network. If Alice trains her copy to memorize a message, she can communicate the trained part of the network to Bob, who plugs it into his copy to regenerate the message. Considering a byte-level representation of input and output, the technique applies to arbitrary types of data (texts, images, audio files, etc.), and practical experiments reveal it to satisfy the fundamental cryptographic properties of diffusion and confusion.
Submitted 4 April, 2017;
originally announced April 2017.
-
Radiation-induced segregation in dilute Re-W solid solutions
Authors:
Jan S. Wrobel,
Duc Nguyen-Manh,
Krzysztof J. Kurzydlowski,
Sergei L. Dudarev
Abstract:
The occurrence of segregation in highly dilute alloys under irradiation is an unusual phenomenon that has so far eluded theoretical explanation. Using ab initio calculations, we are able to explain the origin of radiation-induced rhenium segregation in dilute tungsten-rhenium alloys.
Submitted 13 April, 2016;
originally announced April 2016.
-
Magnetic Cluster Expansion model for random and ordered magnetic face-centered cubic Fe-Ni-Cr alloys
Authors:
M. Y. Lavrentiev,
J. S. Wróbel,
D. Nguyen-Manh,
S. L. Dudarev,
M. G. Ganchenkova
Abstract:
A Magnetic Cluster Expansion (MCE) model for ternary face-centered cubic Fe-Ni-Cr alloys has been developed using DFT data spanning binary and ternary alloy configurations. Using this MCE model Hamiltonian, we perform Monte Carlo simulations and explore magnetic structures of alloys over the entire range of alloy compositions, considering both random and ordered alloy structures. In random alloys, the removal of the magnetic collinearity constraint reduces the total magnetic moment but does not affect the predicted range of compositions where the alloys adopt low-temperature ferromagnetic configurations. During alloying of ordered fcc Fe-Ni compounds with Cr, chromium atoms tend to replace nickel rather than iron atoms. Replacement of Ni by Cr in alloys with high iron content increases the Curie temperature of the alloys. This can be explained by strong antiferromagnetic Fe-Cr coupling, similar to that found in bcc Fe-Cr solutions, where the Curie temperature increase, predicted by simulations as a function of Cr concentration, is confirmed by experimental observations.
Submitted 9 March, 2015;
originally announced March 2015.
-
Phase stability of ternary fcc and bcc Fe-Cr-Ni alloys
Authors:
Jan S. Wrobel,
Duc Nguyen-Manh,
Mikhail Yu. Lavrentiev,
Marek Muzyk,
Sergei L. Dudarev
Abstract:
The phase stability of fcc and bcc magnetic binary Fe-Cr, Fe-Ni, Cr-Ni alloys and ternary Fe-Cr-Ni alloys is investigated using a combination of density functional theory (DFT), Cluster Expansion (CE) and Magnetic Cluster Expansion (MCE). Energies, magnetic moments, and volumes of more than 500 alloy structures are evaluated using DFT, and the most stable magnetic configurations are compared with experimental data. Deviations from the Vegard law in fcc Fe-Cr-Ni alloys, associated with non-linear variation of atomic magnetic moments as functions of alloy composition, are observed. Accuracy of the CE model is assessed against the DFT data, where for ternary alloys the cross-validation error is smaller than 12 meV/atom. A set of cluster interaction parameters is defined for each alloy and is used for predicting new ordered alloy structures. The fcc Fe2CrNi phase with Cu2NiZn-like structure is predicted as the global ground state with the lowest chemical ordering temperature of 650 K. DFT-based Monte Carlo (MC) simulations are used for assessing finite temperature fcc-bcc phase stability and order-disorder transitions in Fe-Cr-Ni alloys. Enthalpies of formation of ternary alloys calculated from MC simulations at 1600 K combined with magnetic correction derived from MCE are in excellent agreement with experimental values measured at 1565 K. Chemical order is analysed, as a function of temperature and composition, in terms of the Warren-Cowley short-range order (SRO) parameters and effective chemical pairwise interactions.
Submitted 2 February, 2015; v1 submitted 2 October, 2014;
originally announced October 2014.
-
Magnetic and Thermodynamic properties of face-centered cubic Fe-Ni alloys
Authors:
M. Yu. Lavrentiev,
J. S. Wrobel,
D. Nguyen-Manh,
S. L. Dudarev
Abstract:
A model lattice ab initio parameterised Hamiltonian spanning a broad range of alloy compositions and a large variety of chemical and magnetic configurations has been developed for face-centered cubic Fe-Ni alloys. Thermodynamic and magnetic properties of the alloys are explored using configuration and magnetic Monte Carlo simulations in a temperature range extending well over 1000 K.
Submitted 3 April, 2014;
originally announced April 2014.