Search | arXiv e-print repository

Rapid yet accurate Tile-circuit and device modeling for Analog In-Memory Computing

Authors: J. Luquin, C. Mackin, S. Ambrogio, A. Chen, F. Baldi, G. Miralles, M. J. Rasch, J. Büchel, M. Lalwani, W. Ponghiran, P. Solomon, H. Tsai, G. W. Burr, P. Narayanan

Abstract: Analog In-Memory Compute (AIMC) can improve the energy efficiency of Deep Learning by orders of magnitude. Yet analog-domain device and circuit non-idealities -- within the analog ``Tiles'' performing Matrix-Vector Multiply (MVM) operations -- can degrade neural-network task accuracy. We quantify the impact of low-level distortions and noise, and develop a mathematical model for Multiply-ACcumulat… ▽ More Analog In-Memory Compute (AIMC) can improve the energy efficiency of Deep Learning by orders of magnitude. Yet analog-domain device and circuit non-idealities -- within the analog ``Tiles'' performing Matrix-Vector Multiply (MVM) operations -- can degrade neural-network task accuracy. We quantify the impact of low-level distortions and noise, and develop a mathematical model for Multiply-ACcumulate (MAC) operations mapped to analog tiles. Instantaneous-current IR-drop (the most significant circuit non-ideality), and ADC quantization effects are fully captured by this model, which can predict MVM tile-outputs both rapidly and accurately, as compared to much slower rigorous circuit simulations. A statistical model of PCM read noise at nanosecond timescales is derived from -- and matched against -- experimental measurements. We integrate these (statistical) device and (deterministic) circuit effects into a PyTorch-based framework to assess the accuracy impact on the BERT and ALBERT Transformer networks. We show that hardware-aware fine-tuning using simple Gaussian noise provides resilience against ADC quantization and PCM read noise effects, but is less effective against IR-drop. This is because IR-drop -- although deterministic -- is non-linear, is changing significantly during the time-integration window, and is ultimately dependent on all the excitations being introduced in parallel into the analog tile. The apparent inability of simple Gaussian noise applied during training to properly prepare a DNN network for IR-drop during inference implies that more complex training approaches -- incorporating advances such as the Tile-circuit model introduced here -- will be critical for resilient deployment of large neural networks onto AIMC hardware. △ Less

Submitted 5 May, 2025; originally announced June 2025.

arXiv:2505.11067 [pdf, other]

Assessing the Performance of Analog Training for Transfer Learning

Authors: Omobayode Fagbohungbe, Corey Lammie, Malte J. Rasch, Takashi Ando, Tayfun Gokmen, Vijay Narayanan

Abstract: Analog in-memory computing is a next-generation computing paradigm that promises fast, parallel, and energy-efficient deep learning training and transfer learning (TL). However, achieving this promise has remained elusive due to a lack of suitable training algorithms. Analog memory devices exhibit asymmetric and non-linear switching behavior in addition to device-to-device variation, meaning that… ▽ More Analog in-memory computing is a next-generation computing paradigm that promises fast, parallel, and energy-efficient deep learning training and transfer learning (TL). However, achieving this promise has remained elusive due to a lack of suitable training algorithms. Analog memory devices exhibit asymmetric and non-linear switching behavior in addition to device-to-device variation, meaning that most, if not all, of the current off-the-shelf training algorithms cannot achieve good training outcomes. Also, recently introduced algorithms have enjoyed limited attention, as they require bi-directionally switching devices of unrealistically high symmetry and precision and are highly sensitive. A new algorithm chopped TTv2 (c-TTv2), has been introduced, which leverages the chopped technique to address many of the challenges mentioned above. In this paper, we assess the performance of the c-TTv2 algorithm for analog TL using a Swin-ViT model on a subset of the CIFAR100 dataset. We also investigate the robustness of our algorithm to changes in some device specifications, including weight transfer noise, symmetry point skew, and symmetry point variability △ Less

Submitted 16 May, 2025; originally announced May 2025.

arXiv:2504.16562 [pdf, other]

A Vision for AI-Driven Adaptation of Dynamic AR Content to Users and Environments

Authors: Julian Rasch, Florian Müller, Francesco Chiossi

Abstract: Augmented Reality (AR) is transforming the way we interact with virtual information in the physical world. By overlaying digital content in real-world environments, AR enables new forms of immersive and engaging experiences. However, existing AR systems often struggle to effectively manage the many interactive possibilities that AR presents. This vision paper speculates on AI-driven approaches for… ▽ More Augmented Reality (AR) is transforming the way we interact with virtual information in the physical world. By overlaying digital content in real-world environments, AR enables new forms of immersive and engaging experiences. However, existing AR systems often struggle to effectively manage the many interactive possibilities that AR presents. This vision paper speculates on AI-driven approaches for adaptive AR content placement, dynamically adjusting to user movement and environmental changes. By leveraging machine learning methods, such a system would intelligently manage content distribution between AR projections integrated into the external environment and fixed static content, enabling seamless UI layout and potentially reducing users' cognitive load. By exploring the possibilities of AI-driven dynamic AR content placement, we aim to envision new opportunities for innovation and improvement in various industries, from urban navigation and workplace productivity to immersive learning and beyond. This paper outlines a vision for the development of more intuitive, engaging, and effective AI-powered AR experiences. △ Less

Submitted 23 April, 2025; originally announced April 2025.

arXiv:2502.20944 [pdf, other]

doi 10.1145/3706598.3714258

AR You on Track? Investigating Effects of Augmented Reality Anchoring on Dual-Task Performance While Walking

Authors: Julian Rasch, Matthias Wilhalm, Florian Müller, Francesco Chiossi

Abstract: With the increasing spread of AR head-mounted displays suitable for everyday use, interaction with information becomes ubiquitous, even while walking. However, this requires constant shifts of our attention between walking and interacting with virtual information to fulfill both tasks adequately. Accordingly, we as a community need a thorough understanding of the mutual influences of walking and i… ▽ More With the increasing spread of AR head-mounted displays suitable for everyday use, interaction with information becomes ubiquitous, even while walking. However, this requires constant shifts of our attention between walking and interacting with virtual information to fulfill both tasks adequately. Accordingly, we as a community need a thorough understanding of the mutual influences of walking and interacting with digital information to design safe yet effective interactions. Thus, we systematically investigate the effects of different AR anchors (hand, head, torso) and task difficulties on user experience and performance. We engage participants (n=26) in a dual-task paradigm involving a visual working memory task while walking. We assess the impact of dual-tasking on both virtual and walking performance, and subjective evaluations of mental and physical load. Our results show that head-anchored AR content least affected walking while allowing for fast and accurate virtual task interaction, while hand-anchored content increased reaction times and workload. △ Less

Submitted 4 March, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

arXiv:2502.03069 [pdf, other]

doi 10.1145/3706598.3713720

CreepyCoCreator? Investigating AI Representation Modes for 3D Object Co-Creation in Virtual Reality

Authors: Julian Rasch, Julia Töws, Teresa Hirzle, Florian Müller, Martin Schmitz

Abstract: Generative AI in Virtual Reality offers the potential for collaborative object-building, yet challenges remain in aligning AI contributions with user expectations. In particular, users often struggle to understand and collaborate with AI when its actions are not transparently represented. This paper thus explores the co-creative object-building process through a Wizard-of-Oz study, focusing on how… ▽ More Generative AI in Virtual Reality offers the potential for collaborative object-building, yet challenges remain in aligning AI contributions with user expectations. In particular, users often struggle to understand and collaborate with AI when its actions are not transparently represented. This paper thus explores the co-creative object-building process through a Wizard-of-Oz study, focusing on how AI can effectively convey its intent to users during object customization in Virtual Reality. Inspired by human-to-human collaboration, we focus on three representation modes: the presence of an embodied avatar, whether the AI's contributions are visualized immediately or incrementally, and whether the areas modified are highlighted in advance. The findings provide insights into how these factors affect user perception and interaction with object-generating AI tools in Virtual Reality as well as satisfaction and ownership of the created objects. The results offer design implications for co-creative world-building systems, aiming to foster more effective and satisfying collaborations between humans and AI in Virtual Reality. △ Less

Submitted 21 February, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

Comments: To appear: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems - CHI '25

arXiv:2406.12774 [pdf, other]

Towards Exact Gradient-based Training on Analog In-memory Computing

Authors: Zhaoxian Wu, Tayfun Gokmen, Malte J. Rasch, Tianyi Chen

Abstract: Given the high economic and environmental costs of using large vision or language models, analog in-memory accelerators present a promising solution for energy-efficient AI. While inference on analog accelerators has been studied recently, the training perspective is underexplored. Recent studies have shown that the "workhorse" of digital AI training - stochastic gradient descent (SGD) algorithm c… ▽ More Given the high economic and environmental costs of using large vision or language models, analog in-memory accelerators present a promising solution for energy-efficient AI. While inference on analog accelerators has been studied recently, the training perspective is underexplored. Recent studies have shown that the "workhorse" of digital AI training - stochastic gradient descent (SGD) algorithm converges inexactly when applied to model training on non-ideal devices. This paper puts forth a theoretical foundation for gradient-based training on analog devices. We begin by characterizing the non-convergent issue of SGD, which is caused by the asymmetric updates on the analog devices. We then provide a lower bound of the asymptotic error to show that there is a fundamental performance limit of SGD-based analog training rather than an artifact of our analysis. To address this issue, we study a heuristic analog algorithm called Tiki-Taka that has recently exhibited superior empirical performance compared to SGD and rigorously show its ability to exactly converge to a critical point and hence eliminates the asymptotic error. The simulations verify the correctness of the analyses. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 10 pages, 5 figures,2 tables

arXiv:2406.04871 [pdf, other]

doi 10.1145/3643834.3661557

Mind Mansion: Exploring Metaphorical Interactions to Engage with Negative Thoughts in Virtual Reality

Authors: Julian Rasch, Michelle Johanna Zender, Sophia Sakel, Nadine Wagener

Abstract: Recurrent negative thoughts can significantly disrupt daily life and contribute to negative emotional states. Facing, confronting, and noticing such thoughts without support can be challenging. To provide a playful setting and leverage the technical maturation of Virtual Reality (VR), our VR experience, Mind Mansion, places the user in an initially cluttered virtual apartment. Here we utilize esta… ▽ More Recurrent negative thoughts can significantly disrupt daily life and contribute to negative emotional states. Facing, confronting, and noticing such thoughts without support can be challenging. To provide a playful setting and leverage the technical maturation of Virtual Reality (VR), our VR experience, Mind Mansion, places the user in an initially cluttered virtual apartment. Here we utilize established concepts from traditional therapy and metaphors identified in prior works to let users engage metaphorically with representations of thoughts, gradually sorting the space, fostering awareness of thoughts, and supporting mental self-care. The results of our user study (n = 30) reveal that Mind Mansion encourages the exploration of alternative perspectives, fosters acceptance, and potentially offers new coping mechanisms. Our findings suggest that this VR intervention can reduce negative affect and improve overall emotional awareness. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: To appear in Proceedings of the Designing Interactive Systems Conference (DIS '24), July 1-5, 2024, IT University of Copenhagen, Denmark

arXiv:2403.11756 [pdf, other]

doi 10.1145/3613904.3642864

Just Undo It: Exploring Undo Mechanics in Multi-User Virtual Reality

Authors: Julian Rasch, Florian Perzl, Yannick Weiss, Florian Müller

Abstract: With the proliferation of VR and a metaverse on the horizon, many multi-user activities are migrating to the VR world, calling for effective collaboration support. As one key feature, traditional collaborative systems provide users with undo mechanics to reverse errors and other unwanted changes. While undo has been extensively researched in this domain and is now considered industry standard, it… ▽ More With the proliferation of VR and a metaverse on the horizon, many multi-user activities are migrating to the VR world, calling for effective collaboration support. As one key feature, traditional collaborative systems provide users with undo mechanics to reverse errors and other unwanted changes. While undo has been extensively researched in this domain and is now considered industry standard, it is strikingly absent for VR systems in research and industry. This work addresses this research gap by exploring different undo techniques for basic object manipulation in different collaboration modes in VR. We conducted a study involving 32 participants organized in teams of two. Here, we studied users' performance and preferences in a tower stacking task, varying the available undo techniques and their mode of collaboration. The results suggest that users desire and use undo in VR and that the choice of the undo technique impacts users' performance and social connection. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: To appear in Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA

arXiv:2307.09357 [pdf, other]

doi 10.1063/5.0168089

Using the IBM Analog In-Memory Hardware Acceleration Kit for Neural Network Training and Inference

Authors: Manuel Le Gallo, Corey Lammie, Julian Buechel, Fabio Carta, Omobayode Fagbohungbe, Charles Mackin, Hsinyu Tsai, Vijay Narayanan, Abu Sebastian, Kaoutar El Maghraoui, Malte J. Rasch

Abstract: Analog In-Memory Computing (AIMC) is a promising approach to reduce the latency and energy consumption of Deep Neural Network (DNN) inference and training. However, the noisy and non-linear device characteristics, and the non-ideal peripheral circuitry in AIMC chips, require adapting DNNs to be deployed on such hardware to achieve equivalent accuracy to digital computing. In this tutorial, we prov… ▽ More Analog In-Memory Computing (AIMC) is a promising approach to reduce the latency and energy consumption of Deep Neural Network (DNN) inference and training. However, the noisy and non-linear device characteristics, and the non-ideal peripheral circuitry in AIMC chips, require adapting DNNs to be deployed on such hardware to achieve equivalent accuracy to digital computing. In this tutorial, we provide a deep dive into how such adaptations can be achieved and evaluated using the recently released IBM Analog Hardware Acceleration Kit (AIHWKit), freely available at https://github.com/IBM/aihwkit. The AIHWKit is a Python library that simulates inference and training of DNNs using AIMC. We present an in-depth description of the AIHWKit design, functionality, and best practices to properly perform inference and training. We also present an overview of the Analog AI Cloud Composer, a platform that provides the benefits of using the AIHWKit simulation in a fully managed cloud setting along with physical AIMC hardware access, freely available at https://aihw-composer.draco.res.ibm.com. Finally, we show examples on how users can expand and customize AIHWKit for their own needs. This tutorial is accompanied by comprehensive Jupyter Notebook code examples that can be run using AIHWKit, which can be downloaded from https://github.com/IBM/aihwkit/tree/master/notebooks/tutorial. △ Less

Submitted 26 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

Journal ref: APL Machine Learning (2023) 1 (4): 041102

arXiv:2306.05377 [pdf, other]

Numerical coupling of aerosol emissions, dry removal, and turbulent mixing in the E3SM Atmosphere Model version 1 (EAMv1), part I: dust budget analyses and the impacts of a revised coupling scheme

Authors: Hui Wan, Kai Zhang, Christopher J. Vogl, Carol S. Woodward, Richard C. Easter, Philip J. Rasch, Yan Feng, Hailong Wang

Abstract: An earlier study evaluating the dust life cycle in the Energy Exascale Earth System Model (E3SM) Atmosphere Model version 1 (EAMv1) has revealed that the simulated global mean dust lifetime is substantially shorter when higher vertical resolution is used, primarily due to significant strengthening of dust dry removal in source regions. This paper demonstrates that the sequential splitting of aeros… ▽ More An earlier study evaluating the dust life cycle in the Energy Exascale Earth System Model (E3SM) Atmosphere Model version 1 (EAMv1) has revealed that the simulated global mean dust lifetime is substantially shorter when higher vertical resolution is used, primarily due to significant strengthening of dust dry removal in source regions. This paper demonstrates that the sequential splitting of aerosol emissions, dry removal, and turbulent mixing in the model's time integration loop, especially the calculation of dry removal after surface emissions and before turbulent mixing, is the primary reason for the vertical resolution sensitivity reported in that earlier study. Based on this reasoning, we propose a simple revision to the numerical process coupling scheme, which moves the application of the surface emissions to after dry removal and before turbulent mixing. The revised scheme allows newly emitted particles to be transported aloft by turbulence before being removed from the atmosphere, and hence better resembles the dust life cycle in the real world. Sensitivity experiments are conducted and analyzed to evaluate the impact of the revised coupling on the simulated aerosol climatology in EAMv1. △ Less

Submitted 17 June, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

arXiv:2305.07859 [pdf, other]

HAiVA: Hybrid AI-assisted Visual Analysis Framework to Study the Effects of Cloud Properties on Climate Patterns

Authors: Subhashis Hazarika, Haruki Hirasawa, Sookyung Kim, Kalai Ramea, Salva R. Cachay, Peetak Mitra, Dipti Hingmire, Hansi Singh, Phil J. Rasch

Abstract: Clouds have a significant impact on the Earth's climate system. They play a vital role in modulating Earth's radiation budget and driving regional changes in temperature and precipitation. This makes clouds ideal for climate intervention techniques like Marine Cloud Brightening (MCB) which refers to modification in cloud reflectivity, thereby cooling the surrounding region. However, to avoid unint… ▽ More Clouds have a significant impact on the Earth's climate system. They play a vital role in modulating Earth's radiation budget and driving regional changes in temperature and precipitation. This makes clouds ideal for climate intervention techniques like Marine Cloud Brightening (MCB) which refers to modification in cloud reflectivity, thereby cooling the surrounding region. However, to avoid unintended effects of MCB, we need a better understanding of the complex cloud to climate response function. Designing and testing such interventions scenarios with conventional Earth System Models is computationally expensive. Therefore, we propose a hybrid AI-assisted visual analysis framework to drive such scientific studies and facilitate interactive what-if investigation of different MCB intervention scenarios to assess their intended and unintended impacts on climate patterns. We work with a team of climate scientists to develop a suite of hybrid AI models emulating cloud-climate response function and design a tightly coupled frontend interactive visual analysis system to perform different MCB intervention experiments. △ Less

Submitted 13 May, 2023; originally announced May 2023.

arXiv:2303.15800 [pdf, other]

doi 10.1145/3544548.3581557

UndoPort: Exploring the Influence of Undo-Actions for Locomotion in Virtual Reality on the Efficiency, Spatial Understanding and User Experience

Authors: Florian Müller, Arantxa Ye, Dominik Schön, Julian Rasch

Abstract: When we get lost in Virtual Reality (VR) or want to return to a previous location, we use the same methods of locomotion for the way back as for the way forward. This is time-consuming and requires additional physical orientation changes, increasing the risk of getting tangled in the headsets' cables. In this paper, we propose the use of undo actions to revert locomotion steps in VR. We explore ei… ▽ More When we get lost in Virtual Reality (VR) or want to return to a previous location, we use the same methods of locomotion for the way back as for the way forward. This is time-consuming and requires additional physical orientation changes, increasing the risk of getting tangled in the headsets' cables. In this paper, we propose the use of undo actions to revert locomotion steps in VR. We explore eight different variations of undo actions as extensions of point\&teleport, based on the possibility to undo position and orientation changes together with two different visualizations of the undo step (discrete and continuous). We contribute the results of a controlled experiment with 24 participants investigating the efficiency and orientation of the undo techniques in a radial maze task. We found that the combination of position and orientation undo together with a discrete visualization resulted in the highest efficiency without increasing orientation errors. △ Less

Submitted 6 April, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

Comments: To appear in Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI 23), April 23-28, 2023, Hamburg, Germany. ACM, New York, NY, USA, 15 pages

Journal ref: In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23). Association for Computing Machinery, New York, NY, USA, Article 234, 1-15

arXiv:2303.04721 [pdf, ps, other]

doi 10.1038/s41467-024-51221-z

Fast offset corrected in-memory training

Authors: Malte J. Rasch, Fabio Carta, Omebayode Fagbohungbe, Tayfun Gokmen

Abstract: In-memory computing with resistive crossbar arrays has been suggested to accelerate deep-learning workloads in highly efficient manner. To unleash the full potential of in-memory computing, it is desirable to accelerate the training as well as inference for large deep neural networks (DNNs). In the past, specialized in-memory training algorithms have been proposed that not only accelerate the forw… ▽ More In-memory computing with resistive crossbar arrays has been suggested to accelerate deep-learning workloads in highly efficient manner. To unleash the full potential of in-memory computing, it is desirable to accelerate the training as well as inference for large deep neural networks (DNNs). In the past, specialized in-memory training algorithms have been proposed that not only accelerate the forward and backward passes, but also establish tricks to update the weight in-memory and in parallel. However, the state-of-the-art algorithm (Tiki-Taka version 2 (TTv2)) still requires near perfect offset correction and suffers from potential biases that might occur due to programming and estimation inaccuracies, as well as longer-term instabilities of the device materials. Here we propose and describe two new and improved algorithms for in-memory computing (Chopped-TTv2 (c-TTv2) and Analog Gradient Accumulation with Dynamic reference (AGAD)), that retain the same runtime complexity but correct for any remaining offsets using choppers. These algorithms greatly relax the device requirements and thus expanding the scope of possible materials potentially employed for such fast in-memory DNN training. △ Less

Submitted 8 March, 2023; originally announced March 2023.

Comments: 14 pages, 10 figures

arXiv:2302.08469 [pdf, ps, other]

Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators

Authors: Malte J. Rasch, Charles Mackin, Manuel Le Gallo, An Chen, Andrea Fasoli, Frederic Odermatt, Ning Li, S. R. Nandakumar, Pritish Narayanan, Hsinyu Tsai, Geoffrey W. Burr, Abu Sebastian, Vijay Narayanan

Abstract: Analog in-memory computing (AIMC) -- a promising approach for energy-efficient acceleration of deep learning workloads -- computes matrix-vector multiplications (MVMs) but only approximately, due to nonidealities that often are non-deterministic or nonlinear. This can adversely impact the achievable deep neural network (DNN) inference accuracy as compared to a conventional floating point (FP) impl… ▽ More Analog in-memory computing (AIMC) -- a promising approach for energy-efficient acceleration of deep learning workloads -- computes matrix-vector multiplications (MVMs) but only approximately, due to nonidealities that often are non-deterministic or nonlinear. This can adversely impact the achievable deep neural network (DNN) inference accuracy as compared to a conventional floating point (FP) implementation. While retraining has previously been suggested to improve robustness, prior work has explored only a few DNN topologies, using disparate and overly simplified AIMC hardware models. Here, we use hardware-aware (HWA) training to systematically examine the accuracy of AIMC for multiple common artificial intelligence (AI) workloads across multiple DNN topologies, and investigate sensitivity and robustness to a broad set of nonidealities. By introducing a new and highly realistic AIMC crossbar-model, we improve significantly on earlier retraining approaches. We show that many large-scale DNNs of various topologies, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), and transformers, can in fact be successfully retrained to show iso-accuracy on AIMC. Our results further suggest that AIMC nonidealities that add noise to the inputs or outputs, not the weights, have the largest impact on DNN accuracy, and that RNNs are particularly robust to all nonidealities. △ Less

Submitted 16 February, 2023; originally announced February 2023.

Comments: 35 pages, 7 figures, 5 tables

arXiv:2302.03258 [pdf, other]

Climate Intervention Analysis using AI Model Guided by Statistical Physics Principles

Authors: Soo Kyung Kim, Kalai Ramea, Salva Rühling Cachay, Haruki Hirasawa, Subhashis Hazarika, Dipti Hingmire, Peetak Mitra, Philip J. Rasch, Hansi A. Singh

Abstract: The availability of training data remains a significant obstacle for the implementation of machine learning in scientific applications. In particular, estimating how a system might respond to external forcings or perturbations requires specialized labeled data or targeted simulations, which may be computationally intensive to generate at scale. In this study, we propose a novel solution to this ch… ▽ More The availability of training data remains a significant obstacle for the implementation of machine learning in scientific applications. In particular, estimating how a system might respond to external forcings or perturbations requires specialized labeled data or targeted simulations, which may be computationally intensive to generate at scale. In this study, we propose a novel solution to this challenge by utilizing a principle from statistical physics known as the Fluctuation-Dissipation Theorem (FDT) to discover knowledge using an AI model that can rapidly produce scenarios for different external forcings. By leveraging FDT, we are able to extract information encoded in a large dataset produced by Earth System Models, which includes 8250 years of internal climate fluctuations, to estimate the climate system's response to forcings. Our model, AiBEDO, is capable of capturing the complex, multi-timescale effects of radiation perturbations on global and regional surface climate, allowing for a substantial acceleration of the exploration of the impacts of spatially-heterogenous climate forcers. To demonstrate the utility of AiBEDO, we use the example of a climate intervention technique called Marine Cloud Brightening, with the ultimate goal of optimizing the spatial pattern of cloud brightening to achieve regional climate targets and prevent known climate tipping points. While we showcase the effectiveness of our approach in the context of climate science, it is generally applicable to other scientific disciplines that are limited by the extensive computational demands of domain simulation models. Source code of AiBEDO framework is made available at https://github.com/kramea/kdd_aibedo. A sample dataset is made available at https://doi.org/10.5281/zenodo.7597027. Additional data available upon request. △ Less

Submitted 7 February, 2023; originally announced February 2023.

arXiv:2302.01957 [pdf, other]

Accelerating exploration of Marine Cloud Brightening impacts on tipping points Using an AI Implementation of Fluctuation-Dissipation Theorem

Authors: Haruki Hirasawa, Sookyung Kim, Peetak Mitra, Subhashis Hazarika, Salva Ruhling-Cachay, Dipti Hingmire, Kalai Ramea, Hansi Singh, Philip J. Rasch

Abstract: Marine cloud brightening (MCB) is a proposed climate intervention technology to partially offset greenhouse gas warming and possibly avoid crossing climate tipping points. The impacts of MCB on regional climate are typically estimated using computationally expensive Earth System Model (ESM) simulations, preventing a thorough assessment of the large possibility space of potential MCB interventions.… ▽ More Marine cloud brightening (MCB) is a proposed climate intervention technology to partially offset greenhouse gas warming and possibly avoid crossing climate tipping points. The impacts of MCB on regional climate are typically estimated using computationally expensive Earth System Model (ESM) simulations, preventing a thorough assessment of the large possibility space of potential MCB interventions. Here, we describe an AI model, named AiBEDO, that can be used to rapidly projects climate responses to forcings via a novel application of the Fluctuation-Dissipation Theorem (FDT). AiBEDO is a Multilayer Perceptron (MLP) model that uses maps monthly-mean radiation anomalies to surface climate anomalies at a range of time lags. By leveraging a large existing dataset of ESM simulations containing internal climate noise, we use AiBEDO to construct an FDT operator that successfully projects climate responses to MCB forcing, when evaluated against ESM simulations. We propose that AiBEDO-FDT can be used to optimize MCB forcing patterns to reduce tipping point risks while minimizing negative side effects in other parts of the climate. △ Less

Submitted 3 February, 2023; originally announced February 2023.

Comments: AAAI Spring Symposium conference full paper

ACM Class: J.2; I.2.1

arXiv:2210.13820 [pdf, other]

Supporting Electronics Learning through Augmented Reality

Authors: Thomas Kosch, Julian Rasch, Albrecht Schmidt, Sebastian Feger

Abstract: Understanding electronics is a critical area in the maker scene. Many of the makers' projects require electronics knowledge to connect microcontrollers with sensors and actuators. Yet, learning electronics is challenging, as internal component processes remain invisible, and students often fear personal harm or component damage. Augmented Reality (AR) applications are developed to support electron… ▽ More Understanding electronics is a critical area in the maker scene. Many of the makers' projects require electronics knowledge to connect microcontrollers with sensors and actuators. Yet, learning electronics is challenging, as internal component processes remain invisible, and students often fear personal harm or component damage. Augmented Reality (AR) applications are developed to support electronics learning and visualize complex processes. This paper reflects on related work around AR and electronics that characterize open research challenges around the four characteristics functionality, fidelity, feedback type, and interactivity. △ Less

Submitted 25 October, 2022; originally announced October 2022.

ACM Class: H.5.1

arXiv:2110.03772 [pdf, other]

CondiDiag1.0: A flexible online diagnostic tool for conditional sampling and budget analysis in the E3SM atmosphere model (EAM)

Authors: Hui Wan, Kai Zhang, Philip J. Rasch, Vincent E. Larson, Xubin Zeng, Shixuan Zhang, Ross Dixon

Abstract: Numerical models used in weather and climate prediction take into account a comprehensive set of atmospheric processes such as the resolved and unresolved fluid dynamics, radiative transfer, cloud and aerosol life cycles, and mass or energy exchanges with the Earth's surface. In order to identify model deficiencies and improve predictive skills, it is important to obtain process-level understandin… ▽ More Numerical models used in weather and climate prediction take into account a comprehensive set of atmospheric processes such as the resolved and unresolved fluid dynamics, radiative transfer, cloud and aerosol life cycles, and mass or energy exchanges with the Earth's surface. In order to identify model deficiencies and improve predictive skills, it is important to obtain process-level understanding of the interactions between different processes. Conditional sampling and budget analysis are powerful tools for process-oriented model evaluation, but they often require tedious ad hoc coding and large amounts of instantaneous model output, resulting in inefficient use of human and computing resources. This paper presents an online diagnostic tool that addresses this challenge by monitoring model variables in a generic manner as they evolve within the time integration cycle. The tool is convenient to use. It allows users to select sampling conditions and specify monitored variables at run time. Both the evolving values of the model variables and their increments caused by different atmospheric processes can be monitored and archived. Online calculation of vertical integrals is also supported. Multiple sampling conditions can be monitored in a single simulation in combination with unconditional sampling. The paper explains in detail the design and implementation of the tool in the Energy Exascale Earth System Model (E3SM) version 1. The usage is demonstrated through three examples: a global budget analysis of dust aerosol mass concentration, a composite analysis of sea salt emission and its dependency on surface wind speed, and a conditionally sampled relative humidity budget. The tool is expected to be easily portable to closely related atmospheric models that use the same or similar data structures and time integration methods. △ Less

Submitted 7 October, 2021; originally announced October 2021.

arXiv:2105.07733 [pdf, other]

Knowledge State Networks for Effective Skill Assessment in Atomic Learning

Authors: Julian Rasch, David Middelbeck

Abstract: The goal of this paper is to introduce a new framework for fast and effective knowledge state assessments in the context of personalized, skill-based online learning. We use knowledge state networks - specific neural networks trained on assessment data of previous learners - to predict the full knowledge state of other learners from only partial information about their skills. In combination with… ▽ More The goal of this paper is to introduce a new framework for fast and effective knowledge state assessments in the context of personalized, skill-based online learning. We use knowledge state networks - specific neural networks trained on assessment data of previous learners - to predict the full knowledge state of other learners from only partial information about their skills. In combination with a matching assessment strategy for asking discriminative questions we demonstrate that our approach leads to a significant speed-up of the assessment process - in terms of the necessary number of assessment questions - in comparison to standard assessment designs. In practice, the presented methods enable personalized, skill-based online learning also for skill ontologies of very fine granularity without deteriorating the associated learning experience by a lengthy assessment process. △ Less

Submitted 17 May, 2021; originally announced May 2021.

Comments: submitted to JEDM

arXiv:2104.02184 [pdf, ps, other]

doi 10.1109/AICAS51828.2021.9458494

A flexible and fast PyTorch toolkit for simulating training and inference on analog crossbar arrays

Authors: Malte J. Rasch, Diego Moreda, Tayfun Gokmen, Manuel Le Gallo, Fabio Carta, Cindy Goldberg, Kaoutar El Maghraoui, Abu Sebastian, Vijay Narayanan

Abstract: We introduce the IBM Analog Hardware Acceleration Kit, a new and first of a kind open source toolkit to simulate analog crossbar arrays in a convenient fashion from within PyTorch (freely available at https://github.com/IBM/aihwkit). The toolkit is under active development and is centered around the concept of an "analog tile" which captures the computations performed on a crossbar array. Analog t… ▽ More We introduce the IBM Analog Hardware Acceleration Kit, a new and first of a kind open source toolkit to simulate analog crossbar arrays in a convenient fashion from within PyTorch (freely available at https://github.com/IBM/aihwkit). The toolkit is under active development and is centered around the concept of an "analog tile" which captures the computations performed on a crossbar array. Analog tiles are building blocks that can be used to extend existing network modules with analog components and compose arbitrary artificial neural networks (ANNs) using the flexibility of the PyTorch framework. Analog tiles can be conveniently configured to emulate a plethora of different analog hardware characteristics and their non-idealities, such as device-to-device and cycle-to-cycle variations, resistive device response curves, and weight and output noise. Additionally, the toolkit makes it possible to design custom unit cell configurations and to use advanced analog optimization algorithms such as Tiki-Taka. Moreover, the backward and update behavior can be set to "ideal" to enable hardware-aware training features for chips that target inference acceleration only. To evaluate the inference accuracy of such chips over time, we provide statistical programming noise and drift models calibrated on phase-change memory hardware. Our new toolkit is fully GPU accelerated and can be used to conveniently estimate the impact of material properties and non-idealities of future analog technology on the accuracy for arbitrary ANNs. △ Less

Submitted 5 April, 2021; originally announced April 2021.

Comments: Submitted to AICAS2021

arXiv:2010.07479 [pdf, other]

Quantifying and attributing time step sensitivities in present-day climate simulations conducted with EAMv1

Authors: Hui Wan, Shixuan Zhang, Philip J. Rasch, Vincent E. Larson, Xubin Zeng, Huiping Yan

Abstract: This study assesses the relative importance of time integration error in present-day climate simulations conducted with the atmosphere component of the Energy Exascale Earth System Model version 1 (EAMv1) at 1-degree horizontal resolution. We show that a factor-of-6 reduction of time step size in all major parts of the model leads to significant changes in the long-term mean climate. These changes… ▽ More This study assesses the relative importance of time integration error in present-day climate simulations conducted with the atmosphere component of the Energy Exascale Earth System Model version 1 (EAMv1) at 1-degree horizontal resolution. We show that a factor-of-6 reduction of time step size in all major parts of the model leads to significant changes in the long-term mean climate. These changes imply that the reduction of temporal truncation errors leads to a notable although unsurprising degradation of agreement between the simulated and observed present-day climate; the model would require retuning to regain optimal climate fidelity in the absence of those truncation errors. A coarse-grained attribution of the time step sensitivities is carried out by separately shortening time steps used in various components of EAM or by revising the numerical coupling between some processes. The results provide useful clues to help better understand the root causes of time step sensitivities in EAM. The experimentation strategy used here can also provide a pathway for other models to identify and reduce time integration errors. △ Less

Submitted 28 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

arXiv:2004.11124 [pdf, other]

Group-level selection avoids the tragedy of the commons

Authors: Arend Hintze, Jochen Staudacher, Katja Gelhar, Alexander Pothmann, Juliana Rasch, Daniel Wildegger

Abstract: The public goods game is a famous example illustrating the tragedy of the commons. In this game cooperating individuals contribute to a pool, which in turn is distributed to all members of the group, including defectors who reap the same rewards as cooperators without having made a contribution before. The question is now, how to incentivize group members to all cooperate as it maximizes the commo… ▽ More The public goods game is a famous example illustrating the tragedy of the commons. In this game cooperating individuals contribute to a pool, which in turn is distributed to all members of the group, including defectors who reap the same rewards as cooperators without having made a contribution before. The question is now, how to incentivize group members to all cooperate as it maximizes the common good. While costly punishment presents one such method, the cost of punishment still reduces the common good. Here we show how group-level selection can be such an incentive, and specifically how even fractions of group-level selection can overcome the benefits defectors receive. Further, we show how punishment and group-level selection interact. This work suggests that a redistribution similar to a basic income that is coupled to the economic success of the entire group could overcome the tragedy of the commons. △ Less

Submitted 23 April, 2020; originally announced April 2020.

Comments: 15 pages, 2 figures

arXiv:1906.02698 [pdf, ps, other]

doi 10.1109/MDAT.2019.2952341

Training large-scale ANNs on simulated resistive crossbar arrays

Authors: Malte J. Rasch, Tayfun Gokmen, Wilfried Haensch

Abstract: Accelerating training of artificial neural networks (ANN) with analog resistive crossbar arrays is a promising idea. While the concept has been verified on very small ANNs and toy data sets (such as MNIST), more realistically sized ANNs and datasets have not yet been tackled. However, it is to be expected that device materials and hardware design constraints, such as noisy computations, finite num… ▽ More Accelerating training of artificial neural networks (ANN) with analog resistive crossbar arrays is a promising idea. While the concept has been verified on very small ANNs and toy data sets (such as MNIST), more realistically sized ANNs and datasets have not yet been tackled. However, it is to be expected that device materials and hardware design constraints, such as noisy computations, finite number of resistive states of the device materials, saturating weight and activation ranges, and limited precision of analog-to-digital converters, will cause significant challenges to the successful training of state-of-the-art ANNs. By using analog hardware aware ANN training simulations, we here explore a number of simple algorithmic compensatory measures to cope with analog noise and limited weight and output ranges and resolutions, that dramatically improve the simulated training performances on RPU arrays on intermediately to large-scale ANNs. △ Less

Submitted 6 June, 2019; originally announced June 2019.

arXiv:1807.01356 [pdf, ps, other]

doi 10.3389/fnins.2019.00753

Efficient ConvNets for Analog Arrays

Authors: Malte J. Rasch, Tayfun Gokmen, Mattia Rigotti, Wilfried Haensch

Abstract: Analog arrays are a promising upcoming hardware technology with the potential to drastically speed up deep learning. Their main advantage is that they compute matrix-vector products in constant time, irrespective of the size of the matrix. However, early convolution layers in ConvNets map very unfavorably onto analog arrays, because kernel matrices are typically small and the constant time operati… ▽ More Analog arrays are a promising upcoming hardware technology with the potential to drastically speed up deep learning. Their main advantage is that they compute matrix-vector products in constant time, irrespective of the size of the matrix. However, early convolution layers in ConvNets map very unfavorably onto analog arrays, because kernel matrices are typically small and the constant time operation needs to be sequentially iterated a large number of times, reducing the speed up advantage for ConvNets. Here, we propose to replicate the kernel matrix of a convolution layer on distinct analog arrays, and randomly divide parts of the compute among them, so that multiple kernel matrices are trained in parallel. With this modification, analog arrays execute ConvNets with an acceleration factor that is proportional to the number of kernel matrices used per layer (here tested 16-128). Despite having more free parameters, we show analytically and in numerical experiments that this convolution architecture is self-regularizing and implicitly learns similar filters across arrays. We also report superior performance on a number of datasets and increased robustness to adversarial attacks. Our investigation suggests to revise the notion that mixed analog-digital hardware is not suitable for ConvNets. △ Less

Submitted 3 July, 2018; originally announced July 2018.

arXiv:1806.10038 [pdf, other]

doi 10.1088/1361-6420/aaf6f5

Convergence rates and structure of solutions of inverse problems with imperfect forward models

Authors: Martin Burger, Yury Korolev, Julian Rasch

Abstract: The goal of this paper is to further develop an approach to inverse problems with imperfect forward operators that is based on partially ordered spaces. Studying the dual problem yields useful insights into the convergence of the regularised solutions and allow us to obtain convergence rates in terms of Bregman distances - as usual in inverse problems, under an additional assumption on the exact s… ▽ More The goal of this paper is to further develop an approach to inverse problems with imperfect forward operators that is based on partially ordered spaces. Studying the dual problem yields useful insights into the convergence of the regularised solutions and allow us to obtain convergence rates in terms of Bregman distances - as usual in inverse problems, under an additional assumption on the exact solution called the source condition. These results are obtained for general absolutely one-homogeneous functionals. In the special case of TV-based regularisation we also study the structure of regularised solutions and prove convergence of their level sets to those of an exact solution. Finally, using the developed theory, we adapt the concept of debiasing to inverse problems with imperfect operators and propose an approach to pointwise error estimation in TV-based regularisation. △ Less

Submitted 20 November, 2018; v1 submitted 26 June, 2018; originally announced June 2018.

MSC Class: 65J20; 94A08; 49N45; 49N30

arXiv:1806.00166 [pdf]

doi 10.3389/fnins.2018.00745

Training LSTM Networks with Resistive Cross-Point Devices

Authors: Tayfun Gokmen, Malte Rasch, Wilfried Haensch

Abstract: In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional neural networks. In this work, we further extend the RPU concept for training recurrent neural networks (RNNs) namely LSTMs. We show that the mapping of recurre… ▽ More In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional neural networks. In this work, we further extend the RPU concept for training recurrent neural networks (RNNs) namely LSTMs. We show that the mapping of recurrent layers is very similar to the mapping of fully connected layers and therefore the RPU concept can potentially provide large acceleration factors for RNNs as well. In addition, we study the effect of various device imperfections and system parameters on training performance. Symmetry of updates becomes even more crucial for RNNs; already a few percent asymmetry results in an increase in the test error compared to the ideal case trained with floating point numbers. Furthermore, the input signal resolution to device arrays needs to be at least 7 bits for successful training. However, we show that a stochastic rounding scheme can reduce the input signal resolution back to 5 bits. Further, we find that RPU device variations and hardware noise are enough to mitigate overfitting, so that there is less need for using dropout. We note that the models trained here are roughly 1500 times larger than the fully connected network trained on MNIST dataset in terms of the total number of multiplication and summation operations performed per epoch. Thus, here we attempt to study the validity of the RPU approach for large scale networks. △ Less

Submitted 31 May, 2018; originally announced June 2018.

Comments: 17 pages, 5 figures

arXiv:1803.10576 [pdf, other]

Inexact First-Order Primal-Dual Algorithms

Authors: Julian Rasch, Antonin Chambolle

Abstract: In this paper we investigate the convergence of a recently popular class of first-order primal-dual algorithms for saddle point problems under the presence of errors occurring in the proximal maps and gradients. We study several types of errors and show that, provided a sufficient decay of these errors, the same convergence rates as for the error-free algorithm can be established. More precisely,… ▽ More In this paper we investigate the convergence of a recently popular class of first-order primal-dual algorithms for saddle point problems under the presence of errors occurring in the proximal maps and gradients. We study several types of errors and show that, provided a sufficient decay of these errors, the same convergence rates as for the error-free algorithm can be established. More precisely, we prove the (optimal) $O(1/N)$ convergence to a saddle point in finite dimensions for the class of non-smooth problems considered in this paper, and prove a $O(1/N^2)$ or even linear $O(θ^N)$ convergence rate if either the primal or dual objective respectively both are strongly convex. Moreover we show that also under a slower decay of errors we can establish rates, however slower and directly depending on the decay of the errors. We demonstrate the performance and practical use of the algorithms on the example of nested algorithms and show how they can be used to split the global objective more efficiently. △ Less

Submitted 24 February, 2020; v1 submitted 28 March, 2018; originally announced March 2018.

Comments: update after revision

arXiv:1712.00099 [pdf, other]

Dynamic MRI Reconstruction from Undersampled Data with an Anatomical Prescan

Authors: Julian Rasch, Ville Kolehmainen, Riikka Nivajärvi, Mikko Kettunen, Olli Gröhn, Martin Burger, Eva-Maria Brinkmann

Abstract: The goal of dynamic magnetic resonance imaging (dynamic MRI) is to visualize tissue properties and their local changes over time that are traceable in the MR signal. We propose a new variational approach for the reconstruction of subsampled dynamic MR data, which combines smooth, temporal regularization with spatial total variation regularization. In particular, it furthermore uses the infimal con… ▽ More The goal of dynamic magnetic resonance imaging (dynamic MRI) is to visualize tissue properties and their local changes over time that are traceable in the MR signal. We propose a new variational approach for the reconstruction of subsampled dynamic MR data, which combines smooth, temporal regularization with spatial total variation regularization. In particular, it furthermore uses the infimal convolution of two total variation Bregman distances to incorporate structural a-priori information from an anatomical MRI prescan into the reconstruction of the dynamic image sequence. The method promotes the reconstructed image sequence to have a high structural similarity to the anatomical prior, while still allowing for local intensity changes which are smooth in time. The approach is evaluated using artificial data simulating functional magnetic resonance imaging (fMRI), and experimental dynamic contrast-enhanced magnetic resonance data from small animal imaging using radial golden angle sampling of the k-space. △ Less

Submitted 30 November, 2017; originally announced December 2017.

arXiv:1710.05705 [pdf, other]

doi 10.1088/1361-6420/aaaf63

Blind Image Fusion for Hyperspectral Imaging with the Directional Total Variation

Authors: Leon Bungert, David A. Coomes, Matthias J. Ehrhardt, Jennifer Rasch, Rafael Reisenhofer, Carola-Bibiane Schönlieb

Abstract: Hyperspectral imaging is a cutting-edge type of remote sensing used for mapping vegetation properties, rock minerals and other materials. A major drawback of hyperspectral imaging devices is their intrinsic low spatial resolution. In this paper, we propose a method for increasing the spatial resolution of a hyperspectral image by fusing it with an image of higher spatial resolution that was obtain… ▽ More Hyperspectral imaging is a cutting-edge type of remote sensing used for mapping vegetation properties, rock minerals and other materials. A major drawback of hyperspectral imaging devices is their intrinsic low spatial resolution. In this paper, we propose a method for increasing the spatial resolution of a hyperspectral image by fusing it with an image of higher spatial resolution that was obtained with a different imaging modality. This is accomplished by solving a variational problem in which the regularization functional is the directional total variation. To accommodate for possible mis-registrations between the two images, we consider a non-convex blind super-resolution problem where both a fused image and the corresponding convolution kernel are estimated. Using this approach, our model can realign the given images if needed. Our experimental results indicate that the non-convexity is negligible in practice and that reliable solutions can be computed using a variety of different optimization algorithms. Numerical results on real remote sensing data from plant sciences and urban monitoring show the potential of the proposed method and suggests that it is robust with respect to the regularization parameters, mis-registration and the shape of the kernel. △ Less

Submitted 9 April, 2018; v1 submitted 4 October, 2017; originally announced October 2017.

Comments: 24 pages, 18 figures, published in Inverse Problems, typo corrected, figure added

MSC Class: 49M37; 65K10; 90C30; 90C90

Journal ref: Inverse Problems, 34(4), 044003, 2018

arXiv:1704.06073 [pdf, other]

doi 10.1088/1361-6420/aa9425

Joint Reconstruction via Coupled Bregman Iterations with Applications to PET-MR Imaging

Authors: Julian Rasch, Eva-Maria Brinkmann, Martin Burger

Abstract: Joint reconstruction has recently attracted a lot of attention, especially in the field of medical multi-modality imaging such as PET-MRI. Most of the developed methods rely on the comparison of image gradients, or more precisely their location, direction and magnitude, to make use of structural similarities between the images. A challenge and still an open issue for most of the methods is to hand… ▽ More Joint reconstruction has recently attracted a lot of attention, especially in the field of medical multi-modality imaging such as PET-MRI. Most of the developed methods rely on the comparison of image gradients, or more precisely their location, direction and magnitude, to make use of structural similarities between the images. A challenge and still an open issue for most of the methods is to handle images in entirely different scales, i.e. different magnitudes of gradients that cannot be dealt with by a global scaling of the data. We propose the use of generalized Bregman distances and infimal convolutions thereof with regard to the well-known total variation functional. The use of a total variation subgradient respectively the involved vector field rather than an image gradient naturally excludes the magnitudes of gradients, which in particular solves the scaling behavior. Additionally, the presented method features a weighting that allows to control the amount of interaction between channels. We give insights into the general behavior of the method, before we further tailor it to a particular application, namely PET-MRI joint reconstruction. To do so, we compute joint reconstruction results from blurry Poisson data for PET and undersampled Fourier data from MRI and show that we can gain a mutual benefit for both modalities. In particular, the results are superior to the respective separate reconstructions and other joint reconstruction methods. △ Less

Submitted 26 September, 2017; v1 submitted 20 April, 2017; originally announced April 2017.

Comments: Submitted

arXiv:1606.05113 [pdf, other]

Bias-Reduction in Variational Regularization

Authors: Eva-Maria Brinkmann, Martin Burger, Julian Rasch, Camille Sutour

Abstract: The aim of this paper is to introduce and study a two-step debiasing method for variational regularization. After solving the standard variational problem, the key idea is to add a consecutive debiasing step minimizing the data fidelity on an appropriate set, the so-called model manifold. The latter is defined by Bregman distances or infimal convolutions thereof, using the (uniquely defined) subgr… ▽ More The aim of this paper is to introduce and study a two-step debiasing method for variational regularization. After solving the standard variational problem, the key idea is to add a consecutive debiasing step minimizing the data fidelity on an appropriate set, the so-called model manifold. The latter is defined by Bregman distances or infimal convolutions thereof, using the (uniquely defined) subgradient appearing in the optimality condition of the variational method. For particular settings, such as anisotropic $\ell^1$ and TV-type regularization, previously used debiasing techniques are shown to be special cases. The proposed approach is however easily applicable to a wider range of regularizations. The two-step debiasing is shown to be well-defined and to optimally reduce bias in a certain setting. In addition to visual and PSNR-based evaluations, different notions of bias and variance decompositions are investigated in numerical studies. The improvements offered by the proposed scheme are demonstrated and its performance is shown to be comparable to optimal results obtained with Bregman iterations. △ Less

Submitted 22 June, 2017; v1 submitted 16 June, 2016; originally announced June 2016.

Comments: Accepted by JMIV

arXiv:1605.06480 [pdf, other]

doi 10.1175/MWR-D-17-0345.1

Recent progress and review of issues related to Physics Dynamics Coupling in geophysical models

Authors: Markus Gross, Hui Wan, Philip J. Rasch, Peter M. Caldwell, David L. Williamson, Daniel Klocke, Christiane Jablonowski, Diana R. Thatcher, Nigel Wood, Mike Cullen, Bob Beare, Martin Willett, Florian Lemarié, Eric Blayo, Sylvie Malardel, Piet Termonia, Almut Gassmann, Peter H. Lauritzen, Hans Johansen, Colin M. Zarzycki, Koichi Sakaguchi, Ruby Leung

Abstract: Geophysical models of the atmosphere and ocean invariably involve parameterizations. These represent two distinct areas: Subgrid processes that the model cannot resolve, and diabatic sources in the equations, due to radiation for example. Hence, coupling between these physics parameterizations and the resolved fluid dynamics and also between the dynamics of the air and water, is necessary. In this… ▽ More Geophysical models of the atmosphere and ocean invariably involve parameterizations. These represent two distinct areas: Subgrid processes that the model cannot resolve, and diabatic sources in the equations, due to radiation for example. Hence, coupling between these physics parameterizations and the resolved fluid dynamics and also between the dynamics of the air and water, is necessary. In this paper weather and climate models are used to illustrate the problems. Nevertheless the same applies to other geophysical models. This coupling is an important aspect of geophysical models. However, often model development is strictly segregated into either physics or dynamics. As a consequence, this area has many unanswered questions. Recent developments in the design of dynamical cores, extended process physics and predicted future changes of the computational infrastructure are increasing complexity. This paper reviews the state-of-the-art of the physics-dynamics coupling in geophysical models, surveys the analysis techniques, and illustrates open questions in this field. This paper focuses on two objectives: To illustrate the phenomenology of the coupling problem with references to examples in the literature and to show how the problem can be analysed. Proposals are made on how to advance the understanding and upcoming challenges with emerging modeling strategies. This paper is of interest to model developers who aim to improve the models and have to make choices on and test new implementations, to users who have to understand choices presented to them and finally users of outputs, who have to distinguish physical features from numerical problems in the model data. △ Less

Submitted 12 June, 2017; v1 submitted 20 May, 2016; originally announced May 2016.

arXiv:1210.5039 [pdf]

doi 10.1109/TASC.2012.2233851

Highly responsive Y-Ba-Cu-O thin film THz detectors with picosecond time resolution

Authors: P. Thoma, J. Raasch, A. Scheuring, M. Hofherr, K. Ilin, S. Wünsch, A. Semenov, H. -W. Hübers, V. Judin, A. -S. Müller, N. Smale, J. Hänisch, B. Holzapfel, M. Siegel

Abstract: High-temperature superconducting YBa2Cu3O7-d (YBCO) thin-film detectors with improved responsivities were developed for fast time-domain measurements in the THz frequency range. YBCO thin films of 30 nm thickness were patterned to micro- and nanobridges and embedded into planar log-spiral THz antennas. The YBCO thin-film detectors were characterized with continuous wave radiation at 0.65 THz. Resp… ▽ More High-temperature superconducting YBa2Cu3O7-d (YBCO) thin-film detectors with improved responsivities were developed for fast time-domain measurements in the THz frequency range. YBCO thin films of 30 nm thickness were patterned to micro- and nanobridges and embedded into planar log-spiral THz antennas. The YBCO thin-film detectors were characterized with continuous wave radiation at 0.65 THz. Responsivity values as high as 710 V/W were found for the YBCO nanobridges. Pulsed measurements in the THz frequency range were performed at the electron storage ring ANKA from the Karlsruhe Institute of Technology (KIT). Due to the high responsivities of the nanobridges no biasing was required for the detection of the coherent synchrotron radiation pulses achieving very good agreement between the measured pulse shapes and simulations. △ Less

Submitted 18 October, 2012; originally announced October 2012.

arXiv:0907.4850 [pdf, ps, other]

doi 10.1103/PhysRevB.80.104431

Magnetoelastic coupling in triangular lattice antiferromagnet CuCrS2

Authors: Julia C. E. Rasch, Martin Boehm, Clemens Ritter, Hannu Mutka, Jürg Schefer, Lukas Keller, Galina M. Abramova, Antonio Cervellino, Jörg F. Löffler

Abstract: CuCrS2 is a triangular lattice Heisenberg antiferromagnet with a rhombohedral crystal structure. We report on neutron and synchrotron powder diffraction results which reveal a monoclinic lattice distortion at the magnetic transition and verify a magnetoelastic coupling. CuCrS2 is therefore an interesting material to study the influence of magnetism on the relief of geometrical frustration. CuCrS2 is a triangular lattice Heisenberg antiferromagnet with a rhombohedral crystal structure. We report on neutron and synchrotron powder diffraction results which reveal a monoclinic lattice distortion at the magnetic transition and verify a magnetoelastic coupling. CuCrS2 is therefore an interesting material to study the influence of magnetism on the relief of geometrical frustration. △ Less

Submitted 28 July, 2009; originally announced July 2009.

Comments: 6 pages, 6 figures, 1 table

Journal ref: Phys. Rev. B 80, 104431 (2009)

arXiv:0810.5296 [pdf, ps, other]

doi 10.1016/j.jssc.2009.02.001

Structural properties of Pb3Mn7O15 determined from high-resolution synchrotron powder diffraction

Authors: J. C. E. Rasch, D. V. Sheptyakov, J. Schefer, L. Keller, M. Böhm, F. Gozzo, N. V. Volkov, K. A. Sablina, G. A. Petrakovskii, H. Grimmer, K. Conder, J. F. Löffler

Abstract: We report on the crystallographic structure of the layered compound Pb3Mn7O15. Previous analysis based on laboratory X-ray data at room temperature gave contradictory results in terms of the description of the unit cell. Motivated by recent magnetic bulk measurements of this system, we re-investigated the chemical structure with high-resolution synchrotron powder diffraction at temperatures betw… ▽ More We report on the crystallographic structure of the layered compound Pb3Mn7O15. Previous analysis based on laboratory X-ray data at room temperature gave contradictory results in terms of the description of the unit cell. Motivated by recent magnetic bulk measurements of this system, we re-investigated the chemical structure with high-resolution synchrotron powder diffraction at temperatures between 15 K and 295 K. Our results show that the crystal structure of stoichiometric Pb3Mn7O15 has a pronounced 2-dimensional character and can be described in the orthorhombic space group Pnma. △ Less

Submitted 29 October, 2008; originally announced October 2008.

Comments: 6 pages, 4 figures, 2 tables

Journal ref: J. Solid State Chem. 182, 1188 (2009)

arXiv:0805.2368 [pdf, ps, other]

A Kernel Method for the Two-Sample Problem

Authors: Arthur Gretton, Karsten Borgwardt, Malte J. Rasch, Bernhard Scholkopf, Alexander J. Smola

Abstract: We propose a framework for analyzing and comparing distributions, allowing us to design statistical tests to determine if two samples are drawn from different distributions. Our test statistic is the largest difference in expectations over functions in the unit ball of a reproducing kernel Hilbert space (RKHS). We present two tests based on large deviation bounds for the test statistic, while a… ▽ More We propose a framework for analyzing and comparing distributions, allowing us to design statistical tests to determine if two samples are drawn from different distributions. Our test statistic is the largest difference in expectations over functions in the unit ball of a reproducing kernel Hilbert space (RKHS). We present two tests based on large deviation bounds for the test statistic, while a third is based on the asymptotic distribution of this statistic. The test statistic can be computed in quadratic time, although efficient linear time approximations are available. Several classical metrics on distributions are recovered when the function space used to compute the difference in expectations is allowed to be more general (eg. a Banach space). We apply our two-sample tests to a variety of problems, including attribute matching for databases using the Hungarian marriage method, where they perform strongly. Excellent performance is also obtained when comparing distributions over graphs, for which these are the first such tests. △ Less

Submitted 15 May, 2008; originally announced May 2008.

ACM Class: G.3; I.2.6

Showing 1–36 of 36 results for author: Rasch, J