-
Deposition Rates in Thermal Laser Epitaxy: Simulation and Experiment
Authors:
Thomas J. Smart,
Bilen Emek Abali,
Hans Boschker,
Wolfgang Braun
Abstract:
The modeling of deposition rates in Thermal Laser Epitaxy (TLE) is essential for the accurate prediction of the evaporation process and for improved dynamic process control. We demonstrate excellent agreement between experimental data and a model based on a finite element simulation that describes the temperature distribution of an elemental source when irradiated with continuous wave laser radiat…
▽ More
The modeling of deposition rates in Thermal Laser Epitaxy (TLE) is essential for the accurate prediction of the evaporation process and for improved dynamic process control. We demonstrate excellent agreement between experimental data and a model based on a finite element simulation that describes the temperature distribution of an elemental source when irradiated with continuous wave laser radiation. The simulation strongly depends on the thermophysical constants of the material, data of which is lacking for many elements. Effective values for the parameters may be determined with precision by means of an unambiguous reference provided by the melting point of the material, which is directly observed during the experiments. TLE may therefore be used to study the high temperature thermophysical and optical properties of the elements.
△ Less
Submitted 3 January, 2025;
originally announced January 2025.
-
A comparative analysis for different finite element types in strain-gradient elasticity simulations performed on Firedrake and FEniCS
Authors:
B. Cagri Sarar,
M. Erden Yildizdag,
Francesco Fabbrocino,
B. Emek Abali
Abstract:
The layer-upon-layer approach in additive manufacturing, open or closed cells in polymeric or metallic foams involve an intrinsic microstructure tailored to the underlying applications. Homogenization of such architectured materials creates metamaterials modeled by higher-gradient models, specifically when the microstructure's characteristic length is comparable to the length scale of the structur…
▽ More
The layer-upon-layer approach in additive manufacturing, open or closed cells in polymeric or metallic foams involve an intrinsic microstructure tailored to the underlying applications. Homogenization of such architectured materials creates metamaterials modeled by higher-gradient models, specifically when the microstructure's characteristic length is comparable to the length scale of the structure. In this study, we conduct a comparative analysis of various finite elements methods for solving problems in strain-gradient elasticity. We employ open-source packages from Firedrake and FEniCS. Different finite element formulations are tested: we implement Lagrange, Argyris, Hermite elements, a Hu--Washizu type (mixed) formulation, as well as isogeometric analysis with Non-Uniform Rational B-Splines (NURBS). For the numerical study, we investigate one- and two-dimensional problems discussed in the literature of strain-gradient modeling. All developed codes are open-access to encourage research in Finite Element Method (FEM) based computation of generalized continua.
△ Less
Submitted 16 June, 2025; v1 submitted 18 November, 2024;
originally announced November 2024.
-
Determining parameters in generalized thermomechanics for metamaterials by means of asymptotic homogenization
Authors:
Bozo Vazic,
Bilen Emek Abali,
Pania Newell
Abstract:
Advancement in manufacturing methods enable designing so called metamaterials with a tailor-made microstructure. Microstructure affects materials response within a length-scale, where we model this behavior by using the generalized thermomechanics. Strain gradient theory is employed as a higher-order theory with thermodynamics modeled as a first-order theory. Developing multiphysics models for het…
▽ More
Advancement in manufacturing methods enable designing so called metamaterials with a tailor-made microstructure. Microstructure affects materials response within a length-scale, where we model this behavior by using the generalized thermomechanics. Strain gradient theory is employed as a higher-order theory with thermodynamics modeled as a first-order theory. Developing multiphysics models for heterogeneous materials is indeed a challenge and even this ``simplest'' model in generalized thermomechanics causes dozens of parameters to be determined. We develop a computational model by using a given microstructure, modeled as a periodic domain, and numerically calculate all parameters by means of asymptotic homogenization. Finite element method (FEM) is employed with the aid of open-source codes (FEniCS). Some example with symmetric and random distribution of voids in a model problem verifies the method and provides an example at which length-scale we need to consider generalized thermoeleasticity in composite materials.
△ Less
Submitted 7 July, 2022;
originally announced July 2022.
-
Verification of asymptotic homogenization method developed for periodic architected materials in strain gradient continuum
Authors:
Hua Yang,
Bilen Emek Abali,
Wolfgang H. Müller,
Salma Barboura,
Jia Li
Abstract:
Strain gradient theory is an accurate model for capturing the size effect and localization phenomena. However, the challenge in identification of corresponding constitutive parameters limits the practical application of the theory. We present and utilize asymptotic homogenization herein. All parameters in rank four, five, and six tensors are determined with the demonstrated computational approach.…
▽ More
Strain gradient theory is an accurate model for capturing the size effect and localization phenomena. However, the challenge in identification of corresponding constitutive parameters limits the practical application of the theory. We present and utilize asymptotic homogenization herein. All parameters in rank four, five, and six tensors are determined with the demonstrated computational approach. Examples for epoxy carbon fiber composite, metal matrix composite, and aluminum foam illustrate the effectiveness and versatility of the proposed method. The influences of volume fraction of matrix, the stack of RVEs, and the varying unit cell lengths on the identified parameters are investigated. The homogenization computational tool is applicable to a wide class materials and makes use of open-source codes in FEniCS. We make all of the codes publicly available in order to encourage a transparent scientific exchange.
△ Less
Submitted 18 December, 2021; v1 submitted 9 June, 2021;
originally announced June 2021.
-
EFloat: Entropy-coded Floating Point Format for Compressing Vector Embedding Models
Authors:
Rajesh Bordawekar,
Bulent Abali,
Ming-Hung Chen
Abstract:
In a large class of deep learning models, including vector embedding models such as word and database embeddings, we observe that floating point exponent values cluster around a few unique values, permitting entropy based data compression. Entropy coding compresses fixed-length values with variable-length codes, encoding most probable values with fewer bits. We propose the EFloat compressed floati…
▽ More
In a large class of deep learning models, including vector embedding models such as word and database embeddings, we observe that floating point exponent values cluster around a few unique values, permitting entropy based data compression. Entropy coding compresses fixed-length values with variable-length codes, encoding most probable values with fewer bits. We propose the EFloat compressed floating point number format that uses a variable field boundary between the exponent and significand fields. EFloat uses entropy coding on exponent values and signs to minimize the average width of the exponent and sign fields, while preserving the original FP32 exponent range unchanged. Saved bits become part of the significand field increasing the EFloat numeric precision by 4.3 bits on average compared to other reduced-precision floating point formats. EFloat makes 8-bit and even smaller floats practical without sacrificing the exponent range of a 32-bit floating point representation. We currently use the EFloat format for saving memory capacity and bandwidth consumption of large vector embedding models such as those used for database embeddings. Using the RMS error as metric, we demonstrate that EFloat provides higher accuracy than other floating point formats with equal bit budget. The EF12 format with 12-bit budget has less end-to-end application error than the 16-bit BFloat16. EF16 with 16-bit budget has an RMS-error 17 to 35 times less than BF16 RMS-error for a diverse set of embedding models. When making similarity and dissimilarity queries, using the NDCG ranking metric, EFloat matches the result quality of prior floating point representations with larger bit budgets.
△ Less
Submitted 2 February, 2022; v1 submitted 4 February, 2021;
originally announced February 2021.
-
Additive manufacturing introduced substructure and computational determination of metamaterials parameters by means of the asymptotic homogenization
Authors:
Bilen Emek Abali,
Emilio Barchiesi
Abstract:
Metamaterials exhibit materials response deviation from conventional elasticity. This phenomenon is captured by the generalized elasticity as a result of extending the theory at the expense of introducing additional parameters. These parameters are linked to internal length scales. Describing on a macroscopic level a material possessing a substructure at a microscopic length scale calls for introd…
▽ More
Metamaterials exhibit materials response deviation from conventional elasticity. This phenomenon is captured by the generalized elasticity as a result of extending the theory at the expense of introducing additional parameters. These parameters are linked to internal length scales. Describing on a macroscopic level a material possessing a substructure at a microscopic length scale calls for introducing additional constitutive parameters. Therefore, in principle, an asymptotic homogenization is feasible to determine these parameters given an accurate knowledge on the substructure. Especially in additive manufacturing, known under the infill ratio, topology optimization introduces a substructure leading to higher order terms in mechanical response. Hence, weight reduction creates a metamaterial with an accurately known substructure. Herein, we develop a computational scheme using both scales for numerically identifying metamaterials parameters. As a specific example we apply it on a honeycomb substructure and discuss the infill ratio. Such a computational approach is applicable to a wide class substructures and makes use of open-source codes; we make it publicly available for a transparent scientific exchange.
△ Less
Submitted 14 October, 2020; v1 submitted 26 September, 2020;
originally announced September 2020.
-
Touché: Towards Ideal and Efficient Cache Compression By Mitigating Tag Area Overheads
Authors:
Seokin Hong,
Bulent Abali,
Alper Buyuktosunoglu,
Michael B. Healy,
Prashant J. Nair
Abstract:
Compression is seen as a simple technique to increase the effective cache capacity. Unfortunately, compression techniques either incur tag area overheads or restrict data placement to only include neighboring compressed cache blocks to mitigate tag area overheads. Ideally, we should be able to place arbitrary compressed cache blocks without any placement restrictions and tag area overheads.
This…
▽ More
Compression is seen as a simple technique to increase the effective cache capacity. Unfortunately, compression techniques either incur tag area overheads or restrict data placement to only include neighboring compressed cache blocks to mitigate tag area overheads. Ideally, we should be able to place arbitrary compressed cache blocks without any placement restrictions and tag area overheads.
This paper proposes Touché, a framework that enables storing multiple arbitrary compressed cache blocks within a physical cacheline without any tag area overheads. The Touché framework consists of three components. The first component, called the ``Signature'' (SIGN) engine, creates shortened signatures from the tag addresses of compressed blocks. Due to this, the SIGN engine can store multiple signatures in each tag entry. On a cache access, the physical cacheline is accessed only if there is a signature match (which has a negligible probability of false positive). The second component, called the ``Tag Appended Data'' (TADA) mechanism, stores the full tag addresses with data. TADA enables Touché to detect false positive signature matches by ensuring that the actual tag address is available for comparison. The third component, called the ``Superblock Marker'' (SMARK) mechanism, uses a unique marker in the tag entry to indicate the occurrence of compressed cache blocks from neighboring physical addresses in the same cacheline. Touché is completely hardware-based and achieves an average speedup of 12\% (ideal 13\%) when compared to an uncompressed baseline.
△ Less
Submitted 2 September, 2019;
originally announced September 2019.
-
Determination of Metamaterial Parameters by Means of a Homogenization Approach Based on Asymptotic Analysis
Authors:
H. Yang,
B. E. Abali,
W. H. Müller,
D. Timofeev
Abstract:
Owing to additive manufacturing techniques, a structure at millimeter length scale (macroscale) can be produced by using a lattice substructure at micrometer length scale (microscale). Such a system is called a metamaterial at the macroscale as the mechanical characteristics deviate from the characteristics at the microscale. As a remedy, metamaterial is modeled by using additional parameters; we…
▽ More
Owing to additive manufacturing techniques, a structure at millimeter length scale (macroscale) can be produced by using a lattice substructure at micrometer length scale (microscale). Such a system is called a metamaterial at the macroscale as the mechanical characteristics deviate from the characteristics at the microscale. As a remedy, metamaterial is modeled by using additional parameters; we intend to determine them. A homogenization approach based on the asymptotic analysis establishes a connection between these different characteristics at micro- and macroscales. A linear elastic first order theory at the microscale is related to a linear elastic second order theory at the macroscale. Relation for parameters at the macroscale is derived by using the equivalence of energy at macro- and microscales within a so-called Representative Volume Element (RVE). Determination of parameters are succeeded by solving a boundary value problem with the Finite Element Method (FEM). The proposed approach guarantees that the additional parameters vanish if the material is purely homogeneous, in other words, it is fully compatible with conventional homogenization schemes based on spatial averaging techniques. Moreover, the proposed approach is reliable as it ensures that such resolved additional parameters are not sensitive to choices of RVE consisting in the repetition of smaller RVEs but depend upon the intrinsic size of the structure.
△ Less
Submitted 18 June, 2019;
originally announced June 2019.
-
Theory and computation of electromagnetic fields and thermomechanical structure interaction for systems undergoing large deformations
Authors:
B. E. Abali,
A. F. Queiruga
Abstract:
For an accurate description of electromagneto-thermomechanical systems, electromagnetic fields need to be described in a Eulerian frame, whereby the thermomechanics is solved in a Lagrangean frame. It is possible to map the Eulerian frame to the current placement of the matter and the Lagrangean frame to a reference placement. We present a rigorous and thermodynamically consistent derivation of go…
▽ More
For an accurate description of electromagneto-thermomechanical systems, electromagnetic fields need to be described in a Eulerian frame, whereby the thermomechanics is solved in a Lagrangean frame. It is possible to map the Eulerian frame to the current placement of the matter and the Lagrangean frame to a reference placement. We present a rigorous and thermodynamically consistent derivation of governing equations for fully coupled electromagneto-thermomechanical systems properly handling finite deformations. A clear separation of the different frames is necessary. There are various attempts to formulate electromagnetism in the Lagrangean frame, or even to compute all fields in the current placement. Both formulations are challenging and heavily discussed in the literature. In this work, we propose another solution scheme that exploits the capabilities of advanced computational tools. Instead of amending the formulation, we can solve thermomechanics in the Lagrangean frame and electromagnetism in the Eulerian frame and manage the interaction between the fields. The approach is similar to its analog in fluid structure interaction, but more challenging because the field equations in electromagnetism must also be solved within the solid body while following their own different set of transformation rules. We additionally present a mesh-morphing algorithm necessary to accommodate finite deformations to solve the electromagnetic fields outside of the material body. We illustrate the use of the new formulation by developing an open-source implementation using the FEniCS package and applying this implementation to several engineering problems in electromagnetic structure interaction undergoing large deformations.
△ Less
Submitted 4 June, 2019; v1 submitted 28 March, 2018;
originally announced March 2018.
-
An accurate finite element method for the numerical solution of isothermal and incompressible flow of viscous fluid
Authors:
Bilen Emek Abali
Abstract:
Despite its numerical challenges, finite element method is used to compute viscous fluid flow. A consensus on the cause of numerical problems has been reached; however, general algorithms---allowing a robust and accurate simulation for any process---are still missing. Either a very high computational cost is necessary for a direct numerical solution (DNS) or some limiting procedure is used by addi…
▽ More
Despite its numerical challenges, finite element method is used to compute viscous fluid flow. A consensus on the cause of numerical problems has been reached; however, general algorithms---allowing a robust and accurate simulation for any process---are still missing. Either a very high computational cost is necessary for a direct numerical solution (DNS) or some limiting procedure is used by adding artificial dissipation to the system. These stabilization methods are useful; however, they are often applied relative to the element size such that a local monotonous convergence is challenging to acquire. We need a computational strategy for solving viscous fluid flow using solely the balance equations. In this work, we present a general procedure solving fluid mechanics problems without use of any stabilization or splitting schemes. Hence, its generalization to multiphysics applications is straightforward. We discuss emerging numerical problems and present the methodology rigorously. Implementation is achieved by using open-source packages and the accuracy as well as the robustness is demonstrated by comparing results to the closed-form solutions and also by solving well-known benchmarking problems.
△ Less
Submitted 8 December, 2018; v1 submitted 4 September, 2017;
originally announced September 2017.
-
Disaggregated and optically interconnected memory: when will it be cost effective?
Authors:
Bulent Abali,
Richard J. Eickemeyer,
Hubertus Franke,
Chung-Sheng Li,
Marc A. Taubenblatt
Abstract:
The "Disaggregated Server" concept has been proposed for datacenters where the same type server resources are aggregated in their respective pools, for example a compute pool, memory pool, network pool, and a storage pool. Each server is constructed dynamically by allocating the right amount of resources from these pools according to the workload's requirements. Modularity, higher packaging and co…
▽ More
The "Disaggregated Server" concept has been proposed for datacenters where the same type server resources are aggregated in their respective pools, for example a compute pool, memory pool, network pool, and a storage pool. Each server is constructed dynamically by allocating the right amount of resources from these pools according to the workload's requirements. Modularity, higher packaging and cooling efficiencies, and higher resource utilization are among the suggested benefits. With the emergence of very large datacenters, "clouds" containing tens of thousands of servers, datacenter efficiency has become an important topic. Few computer chip and systems vendors are working on and making frequent announcements on silicon photonics and disaggregated memory systems.
In this paper we study the trade-off between cost and performance of building a disaggregated memory system where DRAM modules in the datacenter are pooled, for example in memory-only chassis and racks. The compute pool and the memory pool are interconnected by an optical interconnect to overcome the distance and bandwidth issues of electrical fabrics. We construct a simple cost model that includes the cost of latency, cost of bandwidth and the savings expected from a disaggregated memory system. We then identify the level at which a disaggregated memory system becomes cost competitive with a traditional direct attached memory system.
Our analysis shows that a rack-scale disaggregated memory system will have a non-trivial performance penalty, and at the datacenter scale the penalty is impractically high, and the optical interconnect costs are at least a factor of 10 more expensive than where they should be when compared to the traditional direct attached memory systems.
△ Less
Submitted 3 March, 2015;
originally announced March 2015.