-
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Authors:
Gheorghe Comanici,
Eric Bieber,
Mike Schaekermann,
Ice Pasupat,
Noveen Sachdeva,
Inderjit Dhillon,
Marcel Blistein,
Ori Ram,
Dan Zhang,
Evan Rosen,
Luke Marris,
Sam Petulla,
Colin Gaffney,
Asaf Aharoni,
Nathan Lintz,
Tiago Cardal Pais,
Henrik Jacobsson,
Idan Szpektor,
Nan-Jiang Jiang,
Krishna Haridasan,
Ahmed Omran,
Nikunj Saunshi,
Dara Bahri,
Gaurav Mishra,
Eric Chu
, et al. (3264 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde…
▽ More
In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal understanding and it is now able to process up to 3 hours of video content. Its unique combination of long context, multimodal and reasoning capabilities can be combined to unlock new agentic workflows. Gemini 2.5 Flash provides excellent reasoning abilities at a fraction of the compute and latency requirements and Gemini 2.0 Flash and Flash-Lite provide high performance at low latency and cost. Taken together, the Gemini 2.X model generation spans the full Pareto frontier of model capability vs cost, allowing users to explore the boundaries of what is possible with complex agentic problem solving.
△ Less
Submitted 11 July, 2025; v1 submitted 7 July, 2025;
originally announced July 2025.
-
Redundancy Parameterization of the ABB YuMi Robot Arm
Authors:
Alexander J. Elias,
John T. Wen
Abstract:
The ABB YuMi is a 7-DOF collaborative robot arm with a complex, redundant kinematic structure. Path planning for the YuMi is challenging, especially with joint limits considered. The redundant degree of freedom is parameterized by the Shoulder-Elbow-Wrist (SEW) angle, called the arm angle by ABB, but the exact definition must be known for path planning outside the RobotStudio simulator. We provide…
▽ More
The ABB YuMi is a 7-DOF collaborative robot arm with a complex, redundant kinematic structure. Path planning for the YuMi is challenging, especially with joint limits considered. The redundant degree of freedom is parameterized by the Shoulder-Elbow-Wrist (SEW) angle, called the arm angle by ABB, but the exact definition must be known for path planning outside the RobotStudio simulator. We provide the first complete and validated definition of the SEW angle used for the YuMi. It follows the conventional SEW angle formulation with the shoulder-elbow direction chosen to be the direction of the fourth joint axis. Our definition also specifies the shoulder location, making it compatible with any choice of reference vector. A previous attempt to define the SEW angle exists in the literature, but it is incomplete and deviates from the behavior observed in RobotStudio. Because our formulation fits within the general SEW angle framework, we also obtain the expression for the SEW angle Jacobian and complete numerical conditions for all algorithmic singularities. Finally, we demonstrate using IK-Geo, our inverse kinematics (IK) solver based on subproblem decomposition, to find all IK solutions using 2D search. Code examples are available in a publicly accessible repository.
△ Less
Submitted 29 May, 2025;
originally announced May 2025.
-
Simulating Heterogeneity within Elastic and Inelastic Discrete Mechanical Models
Authors:
Jan Raisinger,
Qiwei Zhang,
John E. Bolander,
Jan Eliáš
Abstract:
Two approaches to incorporate heterogeneity in discrete models are compared. In the first, standard approach, the heterogeneity is dictated by geometrical structure of the discrete system. In the second approach, the heterogeneity is imposed by randomizing material parameters of the contacts between the rigid bodies. A similar randomization strategy is often adopted in continuous homogeneous model…
▽ More
Two approaches to incorporate heterogeneity in discrete models are compared. In the first, standard approach, the heterogeneity is dictated by geometrical structure of the discrete system. In the second approach, the heterogeneity is imposed by randomizing material parameters of the contacts between the rigid bodies. A similar randomization strategy is often adopted in continuous homogeneous models. The study investigates both the elastic and fracture behaviors of these model types, and compares their local and macroscale responses. It is found that the stress oscillations present in the standard discrete models built on heterogeneous geometric structures cannot be replicated by randomization of the elastically homogeneous discrete system. The marginal distributions and dependencies between the stress tensor components cannot be adequately matched. Therefore, there is a fundamental difference between these two views on discrete models. The numerical experiments performed in the paper showed that an identical response can be achieved at the macroscale by tuning the material parameters. However, the local behavior, fracturing, and internal dependencies are quite different. These findings provide insight into the potential for controlled random assignment of heterogeneity in homogeneous models. They also demonstrate the need for experimental data capable of verifying the correctness of such an approach.
△ Less
Submitted 25 June, 2025; v1 submitted 29 April, 2025;
originally announced April 2025.
-
Path Planning and Optimization for Cuspidal 6R Manipulators
Authors:
Alexander J. Elias,
John T. Wen
Abstract:
A cuspidal robot can move from one inverse kinematics (IK) solution to another without crossing a singularity. Multiple industrial robots are cuspidal. They tend to have a beautiful mechanical design, but they pose path planning challenges. A task-space path may have a valid IK solution for each point along the path, but a continuous joint-space path may depend on the choice of the IK solution or…
▽ More
A cuspidal robot can move from one inverse kinematics (IK) solution to another without crossing a singularity. Multiple industrial robots are cuspidal. They tend to have a beautiful mechanical design, but they pose path planning challenges. A task-space path may have a valid IK solution for each point along the path, but a continuous joint-space path may depend on the choice of the IK solution or even be infeasible. This paper presents new analysis, path planning, and optimization methods to enhance the utility of cuspidal robots. We first demonstrate an efficient method to identify cuspidal robots and show, for the first time, that the ABB GoFa and certain robots with three parallel joint axes are cuspidal. We then propose a new path planning method for cuspidal robots by finding all IK solutions for each point along a task-space path and constructing a graph to connect each vertex corresponding to an IK solution. Graph edges have a weight based on the optimization metric, such as minimizing joint velocity. The optimal feasible path is the shortest path in the graph. This method can find non-singular paths as well as smooth paths which pass through singularities. Finally, we incorporate this path planning method into a path optimization algorithm. Given a fixed workspace toolpath, we optimize the offset of the toolpath in the robot base frame while ensuring continuous joint motion. Code examples are available in a publicly accessible repository.
△ Less
Submitted 29 April, 2025; v1 submitted 30 January, 2025;
originally announced January 2025.
-
Application of $J$-Integral to a Random Elastic Medium
Authors:
Jan Eliáš,
Josef Martinásek,
Jia-Liang Le
Abstract:
This study investigates the use of the $J$-integral to compute the statistics of the energy release rate of a random elastic medium. The spatial variability of the elastic modulus is modeled as a homogeneous lognormal random field. Within the framework of Monte Carlo simulation, a modified contour integral is applied to evaluate the first and second statistical moments of the energy release rate.…
▽ More
This study investigates the use of the $J$-integral to compute the statistics of the energy release rate of a random elastic medium. The spatial variability of the elastic modulus is modeled as a homogeneous lognormal random field. Within the framework of Monte Carlo simulation, a modified contour integral is applied to evaluate the first and second statistical moments of the energy release rate. These results are compared with the energy release rate calculated from the potential energy function. The comparison shows that, if the random field of elastic modulus is homogeneous in space, the path independence of the classical $J$-integral remains valid for calculating the mean energy release rate. However, this path independence does not extend to the higher order statistical moments. The simulation further reveals the effect of the correlation length of the spatially varying elastic modulus on the energy release rate of the specimen.
△ Less
Submitted 6 January, 2025;
originally announced January 2025.
-
Macroscopic stress, couple stress and flux tensors derived through energetic equivalence from microscopic continuous and discrete heterogeneous finite representative volumes
Authors:
Jan Eliáš,
Gianluca Cusatis
Abstract:
This paper presents a rigorous derivation of equations to evaluate the macroscopic stress tensor, the couple stress tensor, and the flux vector equivalent to underlying microscopic fields in continuous and discrete heterogeneous systems with independent displacements and rotations. Contrary to the classical asymptotic expansion homogenization, finite size representative volume is considered. First…
▽ More
This paper presents a rigorous derivation of equations to evaluate the macroscopic stress tensor, the couple stress tensor, and the flux vector equivalent to underlying microscopic fields in continuous and discrete heterogeneous systems with independent displacements and rotations. Contrary to the classical asymptotic expansion homogenization, finite size representative volume is considered. First, the macroscopic quantities are derived for a heterogeneous Cosserat continuum. The resulting continuum equations are discretized to provide macroscopic quantities in discrete heterogeneous systems. Finally, the expressions for discrete system are derived once again, this time considering the discrete nature directly.
The formulations are presented in two variants, considering either internal or external forces, couples, and fluxes. The derivation is based on the virtual work equivalence and elucidates the fundamental significance of the couple stress tensor in the context of balance equations and admissible virtual deformation modes. Notably, an additional term in the couple stress tensor formula emerges, explaining its dependence on the reference system and position of the macroscopic point. The resulting equations are verified by comparing their predictions with known analytical solutions and results of other numerical models under both steady state and transient conditions.
△ Less
Submitted 30 December, 2024; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Scaling Instructable Agents Across Many Simulated Worlds
Authors:
SIMA Team,
Maria Abi Raad,
Arun Ahuja,
Catarina Barros,
Frederic Besse,
Andrew Bolt,
Adrian Bolton,
Bethanie Brownfield,
Gavin Buttimore,
Max Cant,
Sarah Chakera,
Stephanie C. Y. Chan,
Jeff Clune,
Adrian Collister,
Vikki Copeman,
Alex Cullum,
Ishita Dasgupta,
Dario de Cesare,
Julia Di Trapani,
Yani Donchev,
Emma Dunleavy,
Martin Engelcke,
Ryan Faulkner,
Frankie Garcia,
Charles Gbadamosi
, et al. (69 additional authors not shown)
Abstract:
Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio…
▽ More
Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructions across a diverse range of virtual 3D environments, including curated research environments as well as open-ended, commercial video games. Our goal is to develop an instructable agent that can accomplish anything a human can do in any simulated 3D environment. Our approach focuses on language-driven generality while imposing minimal assumptions. Our agents interact with environments in real-time using a generic, human-like interface: the inputs are image observations and language instructions and the outputs are keyboard-and-mouse actions. This general approach is challenging, but it allows agents to ground language across many visually complex and semantically rich environments while also allowing us to readily run agents in new environments. In this paper we describe our motivation and goal, the initial progress we have made, and promising preliminary results on several diverse research environments and a variety of commercial video games.
△ Less
Submitted 11 October, 2024; v1 submitted 13 March, 2024;
originally announced April 2024.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1326 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…
▽ More
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
△ Less
Submitted 9 May, 2025; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Redundancy parameterization and inverse kinematics of 7-DOF revolute manipulators
Authors:
Alexander J. Elias,
John T. Wen
Abstract:
Seven degree-of-freedom (DOF) robot arms have one redundant DOF which does not change the motion of the end effector. The redundant DOF offers greater manipulability of the arm configuration to avoid obstacles and singularities, but it must be parameterized to fully specify the joint angles for a given end effector pose. For 7-DOF revolute (7R) manipulators, we introduce a new concept of generaliz…
▽ More
Seven degree-of-freedom (DOF) robot arms have one redundant DOF which does not change the motion of the end effector. The redundant DOF offers greater manipulability of the arm configuration to avoid obstacles and singularities, but it must be parameterized to fully specify the joint angles for a given end effector pose. For 7-DOF revolute (7R) manipulators, we introduce a new concept of generalized shoulder-elbow-wrist (SEW) angle, a generalization of the conventional SEW angle but with an arbitrary choice of the reference direction function. The SEW angle is widely used and easy for human operators to visualize as a rotation of the elbow about the shoulder-wrist line. Since other redundancy parameterizations including the conventional SEW angle encounter an algorithmic singularity along a line in the workspace, we introduce a special choice of the reference direction function called the stereographic SEW angle which has a singularity only along a half-line, which can be placed out of reach. We prove that such a singularity is unavoidable for any parameterization. We also include expressions for the SEW angle Jacobian along with singularity analysis. Finally, we provide efficient and singularity-robust inverse kinematics solutions for most known 7R manipulators using the general SEW angle and the subproblem decomposition method. These solutions are often closed-form but may sometimes involve a 1D or 2D search in the general case. Search-based solutions may be converted to finding zeros of a high-order polynomial. Inverse kinematics solutions, examples, and evaluations are available in a publicly accessible repository.
△ Less
Submitted 20 March, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Adaptive discretization refinement for discrete models of coupled mechanics and mass transport in concrete
Authors:
Jan Mašek,
Josef Květon,
Jan Eliáš
Abstract:
An adaptive discretization refinement strategy for steady state discrete mesoscale models of coupled mechanics and mass transport in concrete is presented. Coupling is provided by two phenomena: the Biot's theory of poromechanics and an effect of cracks on material permeability coefficient. The model kinematics is derived from rigid body motion of Voronoi cells obtained by tessellation of the doma…
▽ More
An adaptive discretization refinement strategy for steady state discrete mesoscale models of coupled mechanics and mass transport in concrete is presented. Coupling is provided by two phenomena: the Biot's theory of poromechanics and an effect of cracks on material permeability coefficient. The model kinematics is derived from rigid body motion of Voronoi cells obtained by tessellation of the domain. Starting with a coarse discretization, the density of Voronoi generator points is adaptively increased on the fly in regions where the maximum principal stress exceeds a chosen threshold. Purely elastic behavior is assumed in the coarse discretization, therefore no transfer of history/state variables is needed. Examples showing (i) computational time savings achieved via the adaptive technique and (ii) an agreement of the outputs from the fine and adaptive models during simulations of hydraulic fracturing and three-point bending combined with a fluid pressure loading are presented.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
A framework for fully autonomous design of materials via multiobjective optimization and active learning: challenges and next steps
Authors:
Tyler H. Chang,
Jakob R. Elias,
Stefan M. Wild,
Santanu Chaudhuri,
Joseph A. Libera
Abstract:
In order to deploy machine learning in a real-world self-driving laboratory where data acquisition is costly and there are multiple competing design criteria, systems need to be able to intelligently sample while balancing performance trade-offs and constraints. For these reasons, we present an active learning process based on multiobjective black-box optimization with continuously updated machine…
▽ More
In order to deploy machine learning in a real-world self-driving laboratory where data acquisition is costly and there are multiple competing design criteria, systems need to be able to intelligently sample while balancing performance trade-offs and constraints. For these reasons, we present an active learning process based on multiobjective black-box optimization with continuously updated machine learning models. This workflow is built on open-source technologies for real-time data streaming and modular multiobjective optimization software development. We demonstrate a proof of concept for this workflow through the autonomous operation of a continuous-flow chemistry laboratory, which identifies ideal manufacturing conditions for the electrolyte 2,2,2-trifluoroethyl methyl carbonate.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
IK-Geo: Unified Robot Inverse Kinematics Using Subproblem Decomposition
Authors:
Alexander J. Elias,
John T. Wen
Abstract:
This paper presents the open-source robot inverse kinematics (IK) solver IK-Geo, the fastest general IK solver based on published literature. In this unifying approach, IK for any 6-DOF all-revolute (6R) manipulator is decomposed into six canonical geometric subproblems solved by intersecting circles with other geometric objects. We present new efficient and singularity-robust solutions to these s…
▽ More
This paper presents the open-source robot inverse kinematics (IK) solver IK-Geo, the fastest general IK solver based on published literature. In this unifying approach, IK for any 6-DOF all-revolute (6R) manipulator is decomposed into six canonical geometric subproblems solved by intersecting circles with other geometric objects. We present new efficient and singularity-robust solutions to these subproblems using geometric and linear algebra methods. IK-Geo finds all IK solutions including singular solutions and sometimes least-squares solutions by solving for subproblem solutions in all cases, including in a continuous and sometimes least-squares sense when a solution does not exist. Robots are classified into kinematic families based on cases of intersecting or parallel joint axes, and robots in the same family use the same IK algorithm. 6R robots with three intersecting or parallel axes are solved in closed form, and all solutions are found exactly without iteration. Other 6R robots are efficiently solved by searching for zeros of an error function of one or two joint angles. The subproblem and IK solutions are easy to understand, implement, test, and modify, meaning this method is readily ported to new languages and environments. We connect our geometric method with less efficient but more robust polynomial-based methods: rather than using search, subproblems and error functions may be written in terms of the tangent half-angle of one joint. This results in a system of multivariate polynomial equations from which the univariate polynomial with zeros corresponding to IK solutions is readily derived.
△ Less
Submitted 19 February, 2024; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Improving alignment of dialogue agents via targeted human judgements
Authors:
Amelia Glaese,
Nat McAleese,
Maja Trębacz,
John Aslanides,
Vlad Firoiu,
Timo Ewalds,
Maribeth Rauh,
Laura Weidinger,
Martin Chadwick,
Phoebe Thacker,
Lucy Campbell-Gillingham,
Jonathan Uesato,
Po-Sen Huang,
Ramona Comanescu,
Fan Yang,
Abigail See,
Sumanth Dathathri,
Rory Greig,
Charlie Chen,
Doug Fritz,
Jaume Sanchez Elias,
Richard Green,
Soňa Mokrá,
Nicholas Fernando,
Boxi Wu
, et al. (9 additional authors not shown)
Abstract:
We present Sparrow, an information-seeking dialogue agent trained to be more helpful, correct, and harmless compared to prompted language model baselines. We use reinforcement learning from human feedback to train our models with two new additions to help human raters judge agent behaviour. First, to make our agent more helpful and harmless, we break down the requirements for good dialogue into na…
▽ More
We present Sparrow, an information-seeking dialogue agent trained to be more helpful, correct, and harmless compared to prompted language model baselines. We use reinforcement learning from human feedback to train our models with two new additions to help human raters judge agent behaviour. First, to make our agent more helpful and harmless, we break down the requirements for good dialogue into natural language rules the agent should follow, and ask raters about each rule separately. We demonstrate that this breakdown enables us to collect more targeted human judgements of agent behaviour and allows for more efficient rule-conditional reward models. Second, our agent provides evidence from sources supporting factual claims when collecting preference judgements over model statements. For factual questions, evidence provided by Sparrow supports the sampled response 78% of the time. Sparrow is preferred more often than baselines while being more resilient to adversarial probing by humans, violating our rules only 8% of the time when probed. Finally, we conduct extensive analyses showing that though our model learns to follow our rules it can exhibit distributional biases.
△ Less
Submitted 28 September, 2022;
originally announced September 2022.
-
Homogenization of discrete mesoscale model of concrete for coupled mass transport and mechanics by asymptotic expansion
Authors:
Jan Eliáš,
Gianluca Cusatis
Abstract:
Mass transport phenomenon in concrete structures is strongly coupled with their mechanical behavior. The first coupling fabric is the Biot's theory according to which fluid pressure interacts with solid stress state and volumetric deformation rate of the solid induces changes in fluid pressure. Another coupling mechanism emerges with cracks which serve as channels for the fluid to flow through the…
▽ More
Mass transport phenomenon in concrete structures is strongly coupled with their mechanical behavior. The first coupling fabric is the Biot's theory according to which fluid pressure interacts with solid stress state and volumetric deformation rate of the solid induces changes in fluid pressure. Another coupling mechanism emerges with cracks which serve as channels for the fluid to flow through them and provide volume for fluid storage. Especially the second coupling mechanism presents a challenge for numerical modeling as it requires detailed knowledge about cracking process. Discrete mesoscale mechanical models coupled with mass transport offer simple and robust way to solve the problem. On the other hand, however, they are computationally demanding. In order to reduce this computational burden, the present paper applies the asymptotic expansion homogenization technique to the coupled problem to deliver (i) continuous and homogeneous description of the macroscopic problem which can be easily solved by the finite element method, (ii) discrete and heterogeneous mesoscale problem in the periodic setup attached to each integration point of the macroscale along with (iii) equations providing communication between these two scales. The transient terms appear at the macroscale only, as well as the Biot's coupling terms. The coupling through cracking is treated at the mesoscale by changing conductivity of the conduit elements according to the mechanical solution, otherwise the two mesoscale steady state problems are decoupled and can be therefore solved in a sequence. This paper presents verification studies showing performance of the homogenized solution.
△ Less
Submitted 3 September, 2022;
originally announced September 2022.
-
Real-Time Streaming and Event-driven Control of Scientific Experiments
Authors:
Jakob R. Elias,
Ryan Chard,
Maksim Levental,
Zhengchun Liu,
Ian Foster,
Santanu Chaudhuri
Abstract:
Advancements in scientific instrument sensors and connected devices provide unprecedented insight into ongoing experiments and present new opportunities for control, optimization, and steering. However, the diversity of sensors and heterogeneity of their data result in make it challenging to fully realize these new opportunities. Organizing and synthesizing diverse data streams in near-real-time r…
▽ More
Advancements in scientific instrument sensors and connected devices provide unprecedented insight into ongoing experiments and present new opportunities for control, optimization, and steering. However, the diversity of sensors and heterogeneity of their data result in make it challenging to fully realize these new opportunities. Organizing and synthesizing diverse data streams in near-real-time requires both rich automation and Machine Learning (ML). To efficiently utilize ML during an experiment, the entire ML lifecycle must be addressed, including refining experiment configurations, retraining models, and applying decisions-tasks that require an equally diverse array of computational resources spanning centralized HPC to the accelerators at the edge. Here we present the Manufacturing Data and Machine Learning platform (MDML). The MDML is designed to standardize the research and operational environment for advanced data analytics and ML-enabled automated process optimization by providing the cyberinfrastructure to integrate sensor data streams and AI in cyber-physical systems for in-situ analysis. To achieve this, the MDML provides a fabric to receive and aggregate IoT data and simultaneously orchestrate remote computation across the computing continuum. In this paper we describe the MDML and show how it is used in advanced manufacturing to act on IoT data and orchestrate distributed ML to guide experiments.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
Towards Online Steering of Flame Spray Pyrolysis Nanoparticle Synthesis
Authors:
Maksim Levental,
Ryan Chard,
Joseph A. Libera,
Kyle Chard,
Aarthi Koripelly,
Jakob R. Elias,
Marcus Schwarting,
Ben Blaiszik,
Marius Stan,
Santanu Chaudhuri,
Ian Foster
Abstract:
Flame Spray Pyrolysis (FSP) is a manufacturing technique to mass produce engineered nanoparticles for applications in catalysis, energy materials, composites, and more. FSP instruments are highly dependent on a number of adjustable parameters, including fuel injection rate, fuel-oxygen mixtures, and temperature, which can greatly affect the quality, quantity, and properties of the yielded nanopart…
▽ More
Flame Spray Pyrolysis (FSP) is a manufacturing technique to mass produce engineered nanoparticles for applications in catalysis, energy materials, composites, and more. FSP instruments are highly dependent on a number of adjustable parameters, including fuel injection rate, fuel-oxygen mixtures, and temperature, which can greatly affect the quality, quantity, and properties of the yielded nanoparticles. Optimizing FSP synthesis requires monitoring, analyzing, characterizing, and modifying experimental conditions.Here, we propose a hybrid CPU-GPU Difference of Gaussians (DoG)method for characterizing the volume distribution of unburnt solution, so as to enable near-real-time optimization and steering of FSP experiments. Comparisons against standard implementations show our method to be an order of magnitude more efficient. This surrogate signal can be deployed as a component of an online end-to-end pipeline that maximizes the synthesis yield.
△ Less
Submitted 16 October, 2020;
originally announced October 2020.
-
The Manufacturing Data and Machine Learning Platform: Enabling Real-time Monitoring and Control of Scientific Experiments via IoT
Authors:
Jakob R. Elias,
Ryan Chard,
Joseph A. Libera,
Ian Foster,
Santanu Chaudhuri
Abstract:
IoT devices and sensor networks present new opportunities for measuring, monitoring, and guiding scientific experiments. Sensors, cameras, and instruments can be combined to provide previously unachievable insights into the state of ongoing experiments. However, IoT devices can vary greatly in the type, volume, and velocity of data they generate, making it challenging to fully realize this potenti…
▽ More
IoT devices and sensor networks present new opportunities for measuring, monitoring, and guiding scientific experiments. Sensors, cameras, and instruments can be combined to provide previously unachievable insights into the state of ongoing experiments. However, IoT devices can vary greatly in the type, volume, and velocity of data they generate, making it challenging to fully realize this potential. Indeed, synergizing diverse IoT data streams in near-real time can require the use of machine learning (ML). In addition, new tools and technologies are required to facilitate the collection, aggregation, and manipulation of sensor data in order to simplify the application of ML models and in turn, fully realize the utility of IoT devices in laboratories. Here we will demonstrate how the use of the Argonne-developed Manufacturing Data and Machine Learning (MDML) platform can analyze and use IoT devices in a manufacturing experiment. MDML is designed to standardize the research and operational environment for advanced data analytics and AI-enabled automated process optimization by providing the infrastructure to integrate AI in cyber-physical systems for in situ analysis. We will show that MDML is capable of processing diverse IoT data streams, using multiple computing resources, and integrating ML models to guide an experiment.
△ Less
Submitted 27 May, 2020;
originally announced May 2020.
-
Joint Planning of Network Slicing and Mobile Edge Computing: Models and Algorithms
Authors:
Bin Xiang,
Jocelyne Elias,
Fabio Martignon,
Elisabetta Di Nitto
Abstract:
Multi-access Edge Computing (MEC) facilitates the deployment of critical applications with stringent QoS requirements, latency in particular. This paper considers the problem of jointly planning the availability of computational resources at the edge, the slicing of mobile network and edge computation resources, and the routing of heterogeneous traffic types to the various slices. These aspects ar…
▽ More
Multi-access Edge Computing (MEC) facilitates the deployment of critical applications with stringent QoS requirements, latency in particular. This paper considers the problem of jointly planning the availability of computational resources at the edge, the slicing of mobile network and edge computation resources, and the routing of heterogeneous traffic types to the various slices. These aspects are intertwined and must be addressed together to provide the desired QoS to all mobile users and traffic types still keeping costs under control. We formulate our problem as a mixed-integer nonlinear program (MINLP) and we define a heuristic, named Neighbor Exploration and Sequential Fixing (NESF), to facilitate the solution of the problem. The approach allows network operators to fine tune the network operation cost and the total latency experienced by users. We evaluate the performance of the proposed model and heuristic against two natural greedy approaches. We show the impact of the variation of all the considered parameters (viz., different types of traffic, tolerable latency, network topology and bandwidth, computation and link capacity) on the defined model. Numerical results demonstrate that NESF is very effective, achieving near-optimal planning and resource allocation solutions in a very short computing time even for large-scale network scenarios.
△ Less
Submitted 8 September, 2021; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Fracture in random quasibrittle media: I. Discrete mesoscale simulations of load capacity and fracture process zone
Authors:
Jan Eliáš,
Miroslav Vořechovský
Abstract:
Numerical simulations of concrete fracture performed with a probabilistic mesoscale discrete model are presented. The model represents a substantial part of material randomness by assigning random locations to the largest aggregates. The remaining part of randomness is introduced by causing material parameters to fluctuate randomly via a homogeneous random field. An extensive numerical study perfo…
▽ More
Numerical simulations of concrete fracture performed with a probabilistic mesoscale discrete model are presented. The model represents a substantial part of material randomness by assigning random locations to the largest aggregates. The remaining part of randomness is introduced by causing material parameters to fluctuate randomly via a homogeneous random field. An extensive numerical study performed with the model considers prisms loaded in uniaxial tension with both fixed and rotating platens, and also beams with and without a notch loaded in three point bending. The results show the nontrivial effect of (i) autocorrelation length and (ii) variance of the random field on the fracture behavior of the model. Statistics of the peak load are presented as well as the size and shape of the fracture process zone at the moment when the maximum load is attained. Local averaging within the fracture process zone and weakest-link are identified as underlying mechanisms explaining the reported results. The companion paper, Part II [64], introduces an analytical model capable of predicting the distribution of the peak load obtained with the probabilistic discrete model via the simple estimation of extremes of a random field obtained as moving average of local strength.
△ Less
Submitted 14 June, 2020; v1 submitted 5 February, 2020;
originally announced February 2020.
-
Elastic properties of isotropic discrete systems: Connections between geometric structure and Poisson's ratio
Authors:
Jan Eliáš
Abstract:
The use of discrete material representation in numerical models is advantageous due to the straightforward way it takes into account material heterogeneity and randomness, and the discrete and orientated nature of cracks. Unfortunately, it also restricts the macroscopic Poisson's ratio and therefore narrows its applicability. The paper studies the Poisson's ratio of a discrete model analytically.…
▽ More
The use of discrete material representation in numerical models is advantageous due to the straightforward way it takes into account material heterogeneity and randomness, and the discrete and orientated nature of cracks. Unfortunately, it also restricts the macroscopic Poisson's ratio and therefore narrows its applicability. The paper studies the Poisson's ratio of a discrete model analytically. It derives theoretical limits for cases where the geometry of the model is completely arbitrary, but isotropic in the statistical sense. It is shown that the widest limits are obtained for models where normal directions of contacts between discrete units are parallel with the vectors connecting these units. Any deviation from parallelism causes the limits to shrink. A comparison of the derived equations to the results of the actual numerical model is presented. It shows relatively large deviations from the theory because the fundamental assumptions behind the theoretical derivations are largely violated in systems with complex geometry. The real shrinking of the Poisson's ratio limit is less severe compared to that which is theoretically derived.
△ Less
Submitted 25 December, 2019; v1 submitted 12 July, 2019;
originally announced July 2019.
-
Optimal Geographic Caching in Cellular Networks with Linear Content Coding
Authors:
Jocelyne Elias,
Bartłomiej Błaszczyszyn
Abstract:
We state and solve a problem of the optimal geographic caching of content in cellular networks, where linear combinations of contents are stored in the caches of base stations. We consider a general content popularity distribution and a general distribution of the number of stations covering the typical location in the network. We are looking for a policy of content caching maximizing the probabil…
▽ More
We state and solve a problem of the optimal geographic caching of content in cellular networks, where linear combinations of contents are stored in the caches of base stations. We consider a general content popularity distribution and a general distribution of the number of stations covering the typical location in the network. We are looking for a policy of content caching maximizing the probability of serving the typical content request from the caches of covering stations. The problem has a special form of monotone sub-modular set function maximization. Using dynamic programming, we find a deterministic policy solving the problem. We also consider two natural greedy caching policies. We evaluate our policies considering two popular stochastic geometric coverage models: the Boolean one and the Signal-to-Interference-and-Noise-Ratio one, assuming Zipf popularity distribution. Our numerical results show that the proposed deterministic policies are in general not worse than some randomized policy considered in the literature and can further improve the total hit probability in the moderately high coverage regime.
△ Less
Submitted 27 April, 2017;
originally announced April 2017.
-
Quasi-Orthogonal Space-Time-Frequency Trellis Codes for MIMO-OFDM Systems
Authors:
J. Robinson Ebi Elias,
R. Rajesh
Abstract:
The main objective of this project is to design the full-rate Space-Time-Frequency Trellis code (STFTC), which is based on Quasi-Orthogonal designs for Multiple-Input Multiple-Output (MIMO) Orthogonal Frequency Division Multiplexing (OFDM) systems. The proposed Quasi-Orthogonal Space-Time-Frequency Trellis code combines set partitioning and the structure of quasi-orthogonal space-frequency designs…
▽ More
The main objective of this project is to design the full-rate Space-Time-Frequency Trellis code (STFTC), which is based on Quasi-Orthogonal designs for Multiple-Input Multiple-Output (MIMO) Orthogonal Frequency Division Multiplexing (OFDM) systems. The proposed Quasi-Orthogonal Space-Time-Frequency Trellis code combines set partitioning and the structure of quasi-orthogonal space-frequency designs in a systematic way. In addition to multipath diversity and transmit diversity, the proposed code provides receive diversity, array gain, and achieve high-coding gain over a frequency selective fading channel. As simulation results demonstrate, the code outperforms the existing Quasi-Orthogonal Space-Time-Frequency Trellis codes in terms of frame error rate performance.
△ Less
Submitted 12 June, 2012;
originally announced July 2012.