-
Federated Isolation Forest for Efficient Anomaly Detection on Edge IoT Systems
Authors:
Pavle Vasiljevic,
Milica Matic,
Miroslav Popovic
Abstract:
Recently, federated learning frameworks such as Python TestBed for Federated Learning Algorithms and MicroPython TestBed for Federated Learning Algorithms have emerged to tackle user privacy concerns and efficiency in embedded systems. Even more recently, an efficient federated anomaly detection algorithm, FLiForest, based on Isolation Forests has been developed, offering a low-resource, unsupervised method well-suited for edge deployment and continuous learning. In this paper, we present an application of Isolation Forest-based temperature anomaly detection, developed using the previously mentioned federated learning frameworks, aimed at small edge devices and IoT systems running MicroPython. The system has been experimentally evaluated, achieving over 96% accuracy in distinguishing normal from abnormal readings and above 78% precision in detecting anomalies across all tested configurations, while maintaining a memory usage below 160 KB during model training. These results highlight its suitability for resource-constrained environments and edge systems, while upholding federated learning principles of data privacy and collaborative learning.
Submitted 5 June, 2025;
originally announced June 2025.
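The Isolation Forest idea behind the abstract above can be illustrated with a minimal single-device sketch for one-dimensional temperature readings. This is not the FLiForest algorithm and uses neither of the TestBed frameworks: the function names (`fit_forest`, `anomaly_score`), the tree count, and the sample size are assumptions chosen for illustration, and the federated aggregation step is omitted entirely.

```python
import math
import random

EULER_GAMMA = 0.5772156649

def c_factor(n):
    """Average path length of an unsuccessful BST search over n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    # ln(n - 1) + gamma approximates the harmonic number H(n - 1).
    return 2.0 * (math.log(n - 1) + EULER_GAMMA) - 2.0 * (n - 1) / n

def build_tree(values, depth, max_depth, rng):
    """Recursively split readings at a random threshold between min and max."""
    if len(values) <= 1 or depth >= max_depth:
        return ("leaf", len(values))
    lo, hi = min(values), max(values)
    if lo == hi:
        return ("leaf", len(values))
    split = rng.uniform(lo, hi)
    left = [v for v in values if v < split]
    right = [v for v in values if v >= split]
    return ("node", split,
            build_tree(left, depth + 1, max_depth, rng),
            build_tree(right, depth + 1, max_depth, rng))

def path_length(tree, x, depth=0):
    """Depth at which x reaches a leaf, adjusted for unsplit leaf size."""
    if tree[0] == "leaf":
        return depth + c_factor(tree[1])
    _, split, left, right = tree
    return path_length(left if x < split else right, x, depth + 1)

def fit_forest(values, n_trees=50, sample_size=64, seed=0):
    """Build n_trees isolation trees, each on a random subsample (hypothetical defaults)."""
    if len(values) < 2:
        raise ValueError("need at least two readings")
    rng = random.Random(seed)
    size = min(sample_size, len(values))
    max_depth = math.ceil(math.log2(size))
    forest = [build_tree(rng.sample(values, size), 0, max_depth, rng)
              for _ in range(n_trees)]
    return forest, size

def anomaly_score(forest, size, x):
    """Score in (0, 1]; readings that are isolated quickly score closer to 1."""
    mean_path = sum(path_length(t, x) for t in forest) / len(forest)
    return 2.0 ** (-mean_path / c_factor(size))
```

With toy data such as 63 readings between 21 and 23 degrees plus one reading of 60 degrees, `anomaly_score` should rate the 60-degree reading noticeably higher than a typical one, because a random split separates it from the rest after very few steps.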
-
Globally Consistent RGB-D SLAM with 2D Gaussian Splatting
Authors:
Xingguang Zhong,
Yue Pan,
Liren Jin,
Marija Popović,
Jens Behley,
Cyrill Stachniss
Abstract:
Recently, 3D Gaussian splatting-based RGB-D SLAM has shown remarkable performance in high-fidelity 3D reconstruction. However, the lack of depth rendering consistency and efficient loop closure limits the quality of its geometric reconstructions and its ability to perform globally consistent mapping online. In this paper, we present 2DGS-SLAM, an RGB-D SLAM system using 2D Gaussian splatting as the map representation. By leveraging the depth-consistent rendering property of the 2D variant, we propose an accurate camera pose optimization method and achieve geometrically accurate 3D reconstruction. In addition, we implement efficient loop detection and camera relocalization by leveraging MASt3R, a 3D foundation model, and achieve efficient map updates by maintaining a local active map. Experiments show that our 2DGS-SLAM approach achieves superior tracking accuracy, higher surface reconstruction quality, and more consistent global map reconstruction compared to existing rendering-based SLAM methods, while maintaining high-fidelity image rendering and improved computational efficiency.
Submitted 1 June, 2025;
originally announced June 2025.
-
Cell size heterogeneity controls crystallization of the developing fruit fly wing
Authors:
Kartik Chhajed,
Franz S. Gruber,
Natalie A. Dye,
Frank Jülicher,
Marko Popović
Abstract:
A fundamental question in biology is to understand how patterns and shapes emerge from the collective interplay of large numbers of cells. Cells forming two-dimensional epithelial tissues behave as active materials that undergo remodeling and spontaneous shape changes. Focussing on the fly wing as a model system, we find that the cellular packing in the wing epithelium transitions from a disordered packing to an ordered, crystalline packing. We investigate biophysical mechanisms controlling this crystallization process. While previous studies propose a role of tissue shear flow in establishing the ordered cell packing in the fly wing, we reveal a role of cell size heterogeneity. Indeed, we find that even if tissue shear has been inhibited, cell packings in the fruit fly wing epithelium transition from a disordered to an ordered packing. We propose that the transition is controlled by the cell size heterogeneity, which is quantified by the cell size polydispersity. We use the vertex model of epithelial tissues to show that there is a critical value of cell size polydispersity above which cellular packings are disordered and below which they form a crystalline packing. By analyzing experimental data we find that cell size polydispersity decreases during fly wing development. The observed dynamics of tissue crystallization are consistent with the slow ordering kinetics we observe in the vertex model. Therefore, although tissue shear does not control the transition, it significantly enhances the rate of tissue-scale ordering by facilitating alignment of locally ordered crystallites.
Submitted 8 May, 2025;
originally announced May 2025.
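The abstract above compares cell size polydispersity against a critical value but does not define either quantity. As a rough sketch, assuming polydispersity is taken as the coefficient of variation of cell areas (standard deviation divided by mean, a common convention that may differ from the authors' definition), the comparison could look like the following. The `critical` argument is a hypothetical parameter supplied by the caller; no value for it is given in the abstract.

```python
import math

def polydispersity(areas):
    """Coefficient of variation of cell areas (assumed definition)."""
    mean = sum(areas) / len(areas)
    variance = sum((a - mean) ** 2 for a in areas) / len(areas)
    return math.sqrt(variance) / mean

def expected_ordered(areas, critical):
    """True if polydispersity lies below the caller-supplied critical value."""
    return polydispersity(areas) < critical
```

For example, four cells of equal area give a polydispersity of 0, while two cells with areas 1 and 3 give 0.5.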
-
DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction
Authors:
Sicong Pan,
Liren Jin,
Xuying Huang,
Cyrill Stachniss,
Marija Popović,
Maren Bennewitz
Abstract:
Active object reconstruction is crucial for many robotic applications. A key aspect in these scenarios is generating object-specific view configurations to obtain informative measurements for reconstruction. One-shot view planning enables efficient data collection by predicting all views at once, eliminating the need for time-consuming online replanning. Our primary insight is to leverage the generative power of 3D diffusion models as valuable prior information. By conditioning on initial multi-view images, we exploit the priors from the 3D diffusion model to generate an approximate object model, serving as the foundation for our view planning. Our novel approach integrates the geometric and textural distributions of the object model into the view planning process, generating views that focus on the complex parts of the object to be reconstructed. We validate the proposed active object reconstruction system through both simulation and real-world experiments, demonstrating the effectiveness of using 3D diffusion priors for one-shot view planning.
Submitted 15 April, 2025;
originally announced April 2025.
-
Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments
Authors:
Rajitha de Silva,
Jonathan Cox,
Marija Popovic,
Cesar Cadena,
Cyrill Stachniss,
Riccardo Polvara
Abstract:
Robust robot navigation in outdoor environments requires accurate perception systems capable of handling visual challenges such as repetitive structures and changing appearances. Visual feature matching is crucial to vision-based pipelines but remains particularly challenging in natural outdoor settings due to perceptual aliasing. We address this issue in vineyards, where repetitive vine trunks and other natural elements generate ambiguous descriptors that hinder reliable feature matching. We hypothesise that semantic information tied to keypoint positions can alleviate perceptual aliasing by enhancing keypoint descriptor distinctiveness. To this end, we introduce a keypoint semantic integration technique that improves the descriptors in semantically meaningful regions within the image, enabling more accurate differentiation even among visually similar local features. We validate this approach in two vineyard perception tasks: (i) relative pose estimation and (ii) visual localisation. Across all tested keypoint types and descriptors, our method improves matching accuracy by 12.6%, demonstrating its effectiveness over multiple months in challenging vineyard conditions.
Submitted 11 March, 2025;
originally announced March 2025.
-
Discrete Gaussian Process Representations for Optimising UAV-based Precision Weed Mapping
Authors:
Jacob Swindell,
Madeleine Darbyshire,
Marija Popovic,
Riccardo Polvara
Abstract:
Accurate agricultural weed mapping using UAVs is crucial for precision farming applications. Traditional methods rely on orthomosaic stitching from rigid flight paths, which is computationally intensive and time-consuming. Gaussian Process (GP)-based mapping offers continuous modelling of the underlying variable (i.e. weed distribution) but requires discretisation for practical tasks like path planning or visualisation. Current implementations often default to quadtrees or gridmaps without systematically evaluating alternatives. This study compares five discretisation methods: quadtrees, wedgelets, top-down binary space partition (BSP) trees using least square error (LSE), bottom-up BSP trees using graph merging, and variable-resolution hexagonal grids. Evaluations on real-world weed distributions measure visual similarity, mean squared error (MSE), and computational efficiency. Results show quadtrees perform best overall, but alternatives excel in specific scenarios: hexagons or BSP LSE suit fields with large, dominant weed patches, while quadtrees are optimal for dispersed small-scale distributions. These findings highlight the need to tailor discretisation approaches to weed distribution patterns (patch size, density, coverage) rather than relying on default methods. By choosing representations based on the underlying distribution, we can improve mapping accuracy and efficiency for precision agriculture applications.
Submitted 10 March, 2025;
originally announced March 2025.
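Of the five discretisation methods named in the abstract above, the quadtree is the simplest to sketch. The version below assumes the continuous GP estimate has already been sampled onto a square grid whose side is a power of two, and it splits a block whenever the variance of its cells exceeds a threshold. The splitting criterion, the `max_var` parameter, and the function name are illustrative assumptions, not details taken from the paper.

```python
def quadtree_leaves(grid, x0, y0, size, max_var, min_size=1):
    """Return (x0, y0, size, mean) leaves covering a square block of grid.

    grid is a list of rows; size must be a power of two (assumption).
    """
    cells = [grid[y][x]
             for y in range(y0, y0 + size)
             for x in range(x0, x0 + size)]
    mean = sum(cells) / len(cells)
    variance = sum((c - mean) ** 2 for c in cells) / len(cells)
    # Stop when the block is homogeneous enough or cannot be split further.
    if size <= min_size or variance <= max_var:
        return [(x0, y0, size, mean)]
    half = size // 2
    leaves = []
    for dy in (0, half):
        for dx in (0, half):
            leaves.extend(
                quadtree_leaves(grid, x0 + dx, y0 + dy, half, max_var, min_size))
    return leaves
```

On a 4x4 grid that is empty except for a single weed cell in one corner, this yields three 2x2 leaves and four 1x1 leaves, so resolution is spent only where the values vary.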
-
PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map
Authors:
Yue Pan,
Xingguang Zhong,
Liren Jin,
Louis Wiesmann,
Marija Popović,
Jens Behley,
Cyrill Stachniss
Abstract:
Robots require high-fidelity reconstructions of their environment for effective operation. Such scene representations should be both geometrically accurate and photorealistic to support downstream tasks. While this can be achieved by building distance fields from range sensors and radiance fields from cameras, the scalable incremental mapping of both fields consistently and at the same time with high quality remains challenging. In this paper, we propose a novel map representation that unifies a continuous signed distance field and a Gaussian splatting radiance field within an elastic and compact point-based implicit neural map. By enforcing geometric consistency between these fields, we achieve mutual improvements by exploiting both modalities. We devise a LiDAR-visual SLAM system called PINGS using the proposed map representation and evaluate it on several challenging large-scale datasets. Experimental results demonstrate that PINGS can incrementally build globally consistent distance and radiance fields encoded with a compact set of neural points. Compared to the state-of-the-art methods, PINGS achieves superior photometric and geometric rendering at novel views by leveraging the constraints from the distance field. Furthermore, by utilizing dense photometric cues and multi-view consistency from the radiance field, PINGS produces more accurate distance fields, leading to improved odometry estimation and mesh reconstruction.
Submitted 8 February, 2025;
originally announced February 2025.
-
ActiveGS: Active Scene Reconstruction Using Gaussian Splatting
Authors:
Liren Jin,
Xingguang Zhong,
Yue Pan,
Jens Behley,
Cyrill Stachniss,
Marija Popović
Abstract:
Robotics applications often rely on scene reconstructions to enable downstream tasks. In this work, we tackle the challenge of actively building an accurate map of an unknown scene using an RGB-D camera on a mobile platform. We propose a hybrid map representation that combines a Gaussian splatting map with a coarse voxel map, leveraging the strengths of both representations: the high-fidelity scene reconstruction capabilities of Gaussian splatting and the spatial modelling strengths of the voxel map. At the core of our framework is an effective confidence modelling technique for the Gaussian splatting map to identify under-reconstructed areas, while utilising spatial information from the voxel map to target unexplored areas and assist in collision-free path planning. By actively collecting scene information in under-reconstructed and unexplored areas for map updates, our approach achieves superior Gaussian splatting reconstruction results compared to state-of-the-art approaches. Additionally, we demonstrate the real-world applicability of our framework using an unmanned aerial vehicle.
Submitted 8 April, 2025; v1 submitted 23 December, 2024;
originally announced December 2024.
-
Particle transport in a correlated ratchet
Authors:
Saloni Saxena,
Marko Popović,
Frank Jülicher
Abstract:
One of the many measures of the non-equilibrium nature of a system is the existence of a non-zero steady-state current, which is especially relevant for many biological systems. To this end, we study the non-equilibrium dynamics of a particle moving in a tilted colored noise ratchet in two different situations. In the first, the colored noise variable is reset to a specific value every time the particle transitions from one well to another in the ratchet. Contrary to intuition, we find that the current magnitude decreases as the correlation time of the noise increases, and increases monotonically with noise strength. The average displacement of the particle is against the tilt, which implies that the particle performs work. We then consider a variation of the same problem in which the colored noise process is allowed to evolve freely without any resetting at the transitions. Again, the average displacement is against the potential. However, the current magnitude increases with the correlation time, and there is an optimal noise strength that maximizes the current magnitude. Finally, we provide quantitative arguments to explain these findings and their relevance to active biological matter such as tissues.
Submitted 12 December, 2024;
originally announced December 2024.
-
Scalable Feedback Stabilization of Quantum Light Sources on a CMOS Chip
Authors:
Danielius Kramnik,
Imbert Wang,
Anirudh Ramesh,
Josep M. Fargas Cabanillas,
Ðorđe Gluhović,
Sidney Buchbinder,
Panagiotis Zarkos,
Christos Adamopoulos,
Prem Kumar,
Vladimir M. Stojanović,
Miloš A. Popović
Abstract:
Silicon photonics is a leading platform for realizing the vast numbers of physical qubits needed for useful quantum information processing because it leverages mature complementary metal-oxide-semiconductor (CMOS) manufacturing to integrate on-chip thousands of optical devices for generating and manipulating quantum states of light. A challenge to the practical operation and scale-up of silicon quantum-photonic integrated circuits, however, is the need to control their extreme sensitivity to process and temperature variations, free-carrier and self-heating nonlinearities, and thermal crosstalk. To date these challenges have been partially addressed using bulky off-chip electronics, sacrificing many benefits of a chip-scale platform. Here, we demonstrate the first electronic-photonic quantum system-on-chip (EPQSoC) consisting of quantum-correlated photon-pair sources stabilized via on-chip feedback control circuits, all fabricated in a high-volume 45 nm CMOS microelectronics foundry. We use non-invasive photocurrent sensing in a tunable microring cavity photon-pair source to actively lock it to a fixed pump laser while operating in the quantum regime, enabling large-scale microring-based quantum systems. In this first demonstration of such a capability, we achieve a high CAR of 134 with an ultra-low g^(2)(0) of 0.021 at 2.2 kHz off-chip detected pair rate and 3.3 MHz/mW^2 on-chip pair generation efficiency, and over 100 kHz off-chip detected pair rate at higher pump powers (1.5 MHz on-chip). These sources maintain stable quantum properties in the presence of temperature variations, operating reliably in practical settings with many adjacent devices creating thermal disturbances on the same chip. Such dense electronic-photonic integration enables implementation and control of quantum-photonic systems at the scale required for useful quantum information processing with CMOS-fabricated chips.
Submitted 8 November, 2024;
originally announced November 2024.
-
Towards Map-Agnostic Policies for Adaptive Informative Path Planning
Authors:
Julius Rückin,
David Morilla-Cabello,
Cyrill Stachniss,
Eduardo Montijano,
Marija Popović
Abstract:
Robots are frequently tasked to gather relevant sensor data in unknown terrains. A key challenge for classical path planning algorithms used for autonomous information gathering is adaptively replanning paths online as the terrain is explored given limited onboard compute resources. Recently, learning-based approaches emerged that train planning policies offline and enable computationally efficient online replanning by performing policy inference. These approaches are designed and trained for terrain monitoring missions assuming a single specific map representation, which limits their applicability to different terrains. To address these issues, we propose a novel formulation of the adaptive informative path planning problem unified across different map representations, enabling training and deploying planning policies in a larger variety of monitoring missions. Experimental results validate that our novel formulation easily integrates with classical non-learning-based planning approaches while maintaining their performance. Our trained planning policy performs similarly to state-of-the-art map-specifically trained policies. We validate our learned policy on unseen real-world terrain datasets.
Submitted 7 April, 2025; v1 submitted 22 October, 2024;
originally announced October 2024.
-
Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites
Authors:
Miroslav Popovic,
Marko Popovic,
Miodrag Djukic,
Ilija Basicevic
Abstract:
Python Testbed for Federated Learning Algorithms (PTB-FLA) is a simple FL framework targeting smart Internet of Things in edge systems that provides both generic centralized and decentralized FL algorithms, which implement the corresponding FL orchestration protocols that were formally verified using the process algebra CSP. This approach is appropriate for systems with stationary nodes but cannot be applied to systems with moving nodes. In this paper, we use celestial mechanics to model spacecraft movement, and timed automata (TA) to formalize and verify the centralized FL orchestration protocol, in two phases. In the first phase, we created a conventional TA model to prove traditional properties, namely deadlock freeness and termination. In the second phase, we created a stochastic TA model to prove timing correctness and to estimate termination probability.
Submitted 22 January, 2025; v1 submitted 17 October, 2024;
originally announced October 2024.
-
Active Learning of Robot Vision Using Adaptive Path Planning
Authors:
Julius Rückin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Robots need robust and flexible vision systems to perceive and reason about their environments beyond geometry. Most such systems build upon deep learning approaches. As autonomous robots are commonly deployed in initially unknown environments, pre-training on static datasets cannot always capture the variety of domains and limits the robot's vision performance during missions. Recently, self-supervised as well as fully supervised active learning methods emerged to improve robotic vision. These approaches rely on large in-domain pre-training datasets or require substantial human labelling effort. To address these issues, we present a recent adaptive planning framework for efficient training data collection to substantially reduce human labelling requirements in semantic terrain monitoring missions. To this end, we combine high-quality human labels with automatically generated pseudo labels. Experimental results show that the framework reaches segmentation performance close to fully supervised approaches with drastically reduced human labelling effort while outperforming purely self-supervised approaches. We discuss the advantages and limitations of current methods and outline valuable future research avenues towards more robust and flexible robotic vision systems in unknown environments.
Submitted 14 October, 2024;
originally announced October 2024.
-
UlcerGPT: A Multimodal Approach Leveraging Large Language and Vision Models for Diabetic Foot Ulcer Image Transcription
Authors:
Reza Basiri,
Ali Abedi,
Chau Nguyen,
Milos R. Popovic,
Shehroz S. Khan
Abstract:
Diabetic foot ulcers (DFUs) are a leading cause of hospitalizations and lower limb amputations, placing a substantial burden on patients and healthcare systems. Early detection and accurate classification of DFUs are critical for preventing serious complications, yet many patients experience delays in receiving care due to limited access to specialized services. Telehealth has emerged as a promising solution, improving access to care and reducing the need for in-person visits. The integration of artificial intelligence and pattern recognition into telemedicine has further enhanced DFU management by enabling automatic detection, classification, and monitoring from images. Despite advancements in artificial intelligence-driven approaches for DFU image analysis, the application of large language models for DFU image transcription has not yet been explored. To address this gap, we introduce UlcerGPT, a novel multimodal approach leveraging large language and vision models for DFU image transcription. This framework combines advanced vision and language models, such as Large Language and Vision Assistant and Chat Generative Pre-trained Transformer, to transcribe DFU images by jointly detecting, classifying, and localizing regions of interest. Through detailed experiments on a public dataset, evaluated by expert clinicians, UlcerGPT demonstrates promising results in the accuracy and efficiency of DFU transcription, offering potential support for clinicians in delivering timely care via telemedicine.
Submitted 2 October, 2024;
originally announced October 2024.
-
Implicit Dynamical Flow Fusion (IDFF) for Generative Modeling
Authors:
Mohammad R. Rezaei,
Milos R. Popovic,
Milad Lankarany,
Rahul G. Krishnan
Abstract:
Conditional Flow Matching (CFM) models can generate high-quality samples from a non-informative prior, but they can be slow, often needing hundreds of network evaluations (NFE). To address this, we propose Implicit Dynamical Flow Fusion (IDFF); IDFF learns a new vector field with an additional momentum term that enables taking longer steps during sample generation while maintaining the fidelity of the generated distribution. Consequently, IDFFs reduce the NFEs by a factor of ten (relative to CFMs) without sacrificing sample quality, enabling rapid sampling and efficient handling of image and time-series data generation tasks. We evaluate IDFF on standard benchmarks such as CIFAR-10 and CelebA for image generation, where we achieve likelihood and quality performance comparable to CFMs and diffusion-based models with fewer NFEs. IDFF also shows superior performance in time-series modeling, including molecular simulation and sea surface temperature (SST) datasets, highlighting its versatility and effectiveness across different domains. GitHub repository: https://github.com/MrRezaeiUofT/IDFF
Submitted 27 May, 2025; v1 submitted 22 September, 2024;
originally announced September 2024.
-
Preliminary WMT24 Ranking of General MT Systems and LLMs
Authors:
Tom Kocmi,
Eleftherios Avramidis,
Rachel Bawden,
Ondrej Bojar,
Anton Dvorkovich,
Christian Federmann,
Mark Fishel,
Markus Freitag,
Thamme Gowda,
Roman Grundkiewicz,
Barry Haddow,
Marzena Karpinska,
Philipp Koehn,
Benjamin Marie,
Kenton Murray,
Masaaki Nagata,
Martin Popel,
Maja Popovic,
Mariya Shmatova,
Steinþór Steingrímsson,
Vilém Zouhar
Abstract:
This is the preliminary ranking of WMT24 General MT systems based on automatic metrics. The official ranking will be a human evaluation, which is superior to the automatic ranking and supersedes it. The purpose of this report is not to interpret any findings but only to provide preliminary results to the participants of the General MT task that may be useful during the writing of the system submission.
Submitted 29 July, 2024;
originally announced July 2024.
-
Dating ancient manuscripts using radiocarbon and AI-based writing style analysis
Authors:
Mladen Popović,
Maruf A. Dhali,
Lambert Schomaker,
Johannes van der Plicht,
Kaare Lund Rasmussen,
Jacopo La Nasa,
Ilaria Degano,
Maria Perla Colombini,
Eibert Tigchelaar
Abstract:
Determining the chronology of ancient handwritten manuscripts is essential for reconstructing the evolution of ideas. For the Dead Sea Scrolls, this is particularly important. However, there is an almost complete lack of date-bearing manuscripts evenly distributed across the timeline and written in similar scripts available for palaeographic comparison. Here, we present Enoch, a state-of-the-art AI-based date-prediction model, trained on the basis of new radiocarbon-dated samples of the scrolls. Enoch uses established handwriting-style descriptors and applies Bayesian ridge regression. The challenge of this study is that the number of radiocarbon-dated manuscripts is small, while current machine learning requires an abundance of training data. We show that by using combined angular and allographic writing style feature vectors and applying Bayesian ridge regression, Enoch could predict the radiocarbon-based dates from style, supported by leave-one-out validation, with varied MAEs of 27.9 to 30.7 years relative to the radiocarbon dating. Enoch was then used to estimate the dates of 135 unseen manuscripts, revealing that 79 per cent of the samples were considered 'realistic' upon palaeographic post-hoc evaluation. We present a new chronology of the scrolls. The radiocarbon ranges and Enoch's style-based predictions are often older than the traditionally assumed palaeographic estimates. In the range of 300-50 BCE, Enoch's date prediction provides an improved granularity. The study is in line with current developments in multimodal machine-learning techniques, and the methods can be used for date prediction in other partially-dated manuscript collections. This research shows how Enoch's quantitative, probability-based approach can be a tool for palaeographers and historians, re-dating ancient Jewish key texts and contributing to current debates on Jewish and Christian origins.
Submitted 18 October, 2024; v1 submitted 26 June, 2024;
originally announced July 2024.
-
Closed-Loop Binary Media-Based Modulation
Authors:
Majid Nasiri Khormuji,
Branislav M. Popovic
Abstract:
We present analytical results for Binary Media-Based Modulation (B-MBM) over fading channels for single-antenna receivers. We show that open-loop B-MBM, in the absence of feedback, achieves a diversity order of only one. However, with feedback and optimal weight selection in closed-loop configurations, a diversity order of two becomes achievable. Notably, closed-loop B-MBM with analytically computed optimal weights performs equivalently to Alamouti-coded BPSK transmission, demonstrating feasibility even with just one radio frequency chain when feedback is available.
Submitted 3 July, 2024;
originally announced July 2024.
-
Error Span Annotation: A Balanced Approach for Human Evaluation of Machine Translation
Authors:
Tom Kocmi,
Vilém Zouhar,
Eleftherios Avramidis,
Roman Grundkiewicz,
Marzena Karpinska,
Maja Popović,
Mrinmaya Sachan,
Mariya Shmatova
Abstract:
High-quality Machine Translation (MT) evaluation relies heavily on human judgments. Comprehensive error classification methods, such as Multidimensional Quality Metrics (MQM), are expensive as they are time-consuming and can only be done by experts, whose availability may be limited especially for low-resource languages. On the other hand, just assigning overall scores, like Direct Assessment (DA), is simpler and faster and can be done by translators of any level, but is less reliable. In this paper, we introduce Error Span Annotation (ESA), a human evaluation protocol which combines the continuous rating of DA with the high-level error severity span marking of MQM. We validate ESA by comparing it to MQM and DA for 12 MT systems and one human reference translation (English to German) from WMT23. The results show that ESA offers faster and cheaper annotations than MQM at the same quality level, without the requirement of expensive MQM experts.
Submitted 18 October, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Cell divisions imprint long lasting elastic strain fields in epithelial tissues
Authors:
Ali Tahaei,
Romina Pisticello-Gómez,
S Suganthan,
Greta Cwikla,
Jana F. Fuhrmann,
Natalie A. Dye,
Marko Popović
Abstract:
A hallmark of biological tissues, viewed as complex cellular materials, is the active generation of mechanical stresses by cellular processes, such as cell divisions. Each cellular event generates a force dipole that deforms the surrounding tissue. Therefore, a quantitative description of these force dipoles, and their consequences on tissue mechanics, is one of the central problems in understanding the overall tissue mechanics. In this work we analyze previously published experimental data on fruit fly \textit{D. melanogaster} wing epithelia to quantitatively describe the deformation fields induced by a cell-scale force dipole. We find that the measured deformation field can be explained by a simple model of fly epithelium as a linearly elastic sheet. This fact allows us to use measurements of the strain field around cellular events, such as cell divisions, to infer the magnitude and dynamics of the mechanical forces they generate. In particular, we find that cell divisions exert a transient isotropic force dipole field, corresponding to the temporary localisation of the cell nucleus to the tissue surface during the division, and traceless-symmetric force dipole field that remains detectable from the tissue strain field for up to about $3.5$ hours after the division. This is the timescale on which elastic strains are erased by other mechanical processes and therefore it corresponds to the tissue fluidization timescale. In summary, we have developed a method to infer force dipoles induced by cell divisions, by observing the strain field in the surrounding tissues. Using this method we quantitatively characterize mechanical forces generated during a cell division, and their effects on the tissue mechanics.
Submitted 5 June, 2024;
originally announced June 2024.
-
ZTF SN Ia DR2: Study of Type Ia Supernova lightcurve fits
Authors:
M. Rigault,
M. Smith,
N. Regnault,
D. W. Kenworthy,
K. Maguire,
A. Goobar,
G. Dimitriadis,
M. Amenouche,
M. Aubert,
C. Barjou-Delayre,
C. E. Bellm,
U. Burgaz,
B. Carreres,
Y. Copin,
M. Deckers,
T. de Jaeger,
S. Dhawan,
F. Feinstein,
D. Fouchez,
L. Galbany,
M. Ginolin,
J. M. Graham,
Y. -L. Kim,
M. Kowalski,
D. Kuhn
, et al. (12 additional authors not shown)
Abstract:
Type Ia supernova (SN Ia) cosmology relies on the estimation of lightcurve parameters to derive precision distances that lead to the estimation of cosmological parameters. The empirical SALT2 lightcurve modeling that relies on only two parameters, a stretch x1, and a color c, has been used by the community for almost two decades. In this paper we study the ability of the SALT2 model to fit the nearly 3000 cosmology-grade SN Ia lightcurves from the second release of the Zwicky Transient Facility (ZTF) cosmology science working group. While the ZTF data was not used to train SALT2, the algorithm models the ZTF SN Ia optical lightcurves remarkably well, except for lightcurve points prior to -10 d from maximum, where the training critically lacks statistics. We find that the lightcurve fitting is robust against the considered choice of phase-range, but we show the [-10; +40] d range to be optimal in terms of statistics and accuracy. We do not detect any significant features in the lightcurve fit residuals that could be connected to the host environment. Potential systematic population differences related to the SN Ia host properties might thus not be accounted for by the addition of extra lightcurve parameters. However, a small but significant inconsistency between residuals of blue- and red-SN Ia strongly suggests the existence of a phase-dependent color term, with potential implications for the use of SNe Ia in precision cosmology. We thus encourage modellers to explore this avenue and we emphasize that SN Ia cosmology must include a SALT2 retraining to accurately model the lightcurves and avoid biasing the derivation of cosmological parameters.
Submitted 2 December, 2024; v1 submitted 4 June, 2024;
originally announced June 2024.
-
Fast Transaction Scheduling in Blockchain Sharding
Authors:
Ramesh Adhikari,
Costas Busch,
Miroslav Popovic
Abstract:
Sharding is a promising technique for addressing the scalability issues of blockchain, and this technique is especially important for IoT, edge, or mobile computing. It divides the $n$ participating nodes into $s$ disjoint groups called shards, where each shard processes transactions in parallel. We examine batch scheduling problems on the shard graph $G_s$, where we find efficient schedules for a set of transactions. First, we present a centralized scheduler where one of the shards is considered as a leader, who receives the transaction information from all of the other shards and determines the schedule to process the transactions. For general graphs, where a transaction and its accessing objects are arbitrarily far from each other with a maximum distance $d$, the centralized scheduler provides $O(kd)$ approximation to the optimal schedule, where $k$ is the maximum number of shards each transaction accesses. Next, we provide a centralized scheduler with a bucketing approach that offers improved bounds for the case where $G_s$ is a line graph, or the $k$ objects are randomly selected. Finally, we provide a distributed scheduler where shards do not require global transaction information. We achieve this by using a hierarchical clustering of the shards and using the centralized scheduler in each cluster. We show that the distributed scheduler has a competitive ratio of $O(A_{CS} \cdot \log d \cdot \log s)$, where $A_{CS}$ is the approximation ratio of the centralized scheduler. To our knowledge, we are the first to give provably fast transaction scheduling algorithms for blockchain sharding systems. We also present simulation results for our schedulers and compare their performance with a lock-based approach. The results show that our schedulers are generally better with up to 3x lower latency and 2x higher throughput.
Submitted 16 January, 2025; v1 submitted 23 May, 2024;
originally announced May 2024.
-
MicroPython Testbed for Federated Learning Algorithms
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Ilija Basicevic
Abstract:
Recently, Python Testbed for Federated Learning Algorithms emerged as a low-code framework, amenable to generative large language models, for developing decentralized and distributed applications, primarily targeting edge systems, by nonprofessional programmers with the help of emerging artificial intelligence tools. This light framework is written in pure Python to be easy to install and to fit into a small IoT memory. It supports formally verified generic centralized and decentralized federated learning algorithms, as well as the peer-to-peer data exchange used in time division multiplexing communication, and its current main limitation is that all the application instances can run only on a single PC. This paper presents the MicroPython Testbed for Federated Learning Algorithms, the new framework that overcomes its predecessor's limitation such that individual application instances may run on different network nodes like PCs and IoTs, primarily in edge systems. The new framework carries on the pure Python ideal, is based on asynchronous I/O abstractions, and runs on MicroPython, and therefore is a great match for IoTs and devices in edge systems. The new framework was experimentally validated on a wireless network comprising PCs and Raspberry Pi Pico W boards, by using application examples originally developed for the predecessor framework.
Submitted 22 January, 2025; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Learning-based Methods for Adaptive Informative Path Planning
Authors:
Marija Popovic,
Joshua Ott,
Julius Rückin,
Mykel J. Kochenderfer
Abstract:
Adaptive informative path planning (AIPP) is important to many robotics applications, enabling mobile robots to efficiently collect useful data about initially unknown environments. In addition, learning-based methods are increasingly used in robotics to enhance adaptability, versatility, and robustness across diverse and complex tasks. Our survey explores research on applying robotic learning to AIPP, bridging the gap between these two research fields. We begin by providing a unified mathematical framework for general AIPP problems. Next, we establish two complementary taxonomies of current work from the perspectives of (i) learning algorithms and (ii) robotic applications. We explore synergies, recent trends, and highlight the benefits of learning-based methods in AIPP frameworks. Finally, we discuss key challenges and promising future directions to enable more generally applicable and robust robotic data-gathering systems through learning. We provide a comprehensive catalogue of papers reviewed in our survey, including publicly available repositories, to facilitate future studies in the field.
Submitted 23 July, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning
Authors:
Sicong Pan,
Liren Jin,
Xuying Huang,
Cyrill Stachniss,
Marija Popović,
Maren Bennewitz
Abstract:
Object reconstruction is relevant for many autonomous robotic tasks that require interaction with the environment. A key challenge in such scenarios is planning view configurations to collect informative measurements for reconstructing an initially unknown object. One-shot view planning enables efficient data collection by predicting view configurations and planning the globally shortest path connecting all views at once. However, prior knowledge about the object is required to conduct one-shot view planning. In this work, we propose a novel one-shot view planning approach that utilizes the powerful 3D generation capabilities of diffusion models as priors. By incorporating such geometric priors into our pipeline, we achieve effective one-shot view planning starting with only a single RGB image of the object to be reconstructed. Our planning experiments in simulation and real-world setups indicate that our approach balances well between object reconstruction quality and movement cost.
Submitted 15 September, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
STAIR: Semantic-Targeted Active Implicit Reconstruction
Authors:
Liren Jin,
Haofei Kuang,
Yue Pan,
Cyrill Stachniss,
Marija Popović
Abstract:
Many autonomous robotic applications require object-level understanding when deployed. Actively reconstructing objects of interest, i.e. objects with specific semantic meanings, is therefore relevant for a robot to perform downstream tasks in an initially unknown environment. In this work, we propose a novel framework for semantic-targeted active reconstruction using posed RGB-D measurements and 2D semantic labels as input. The key components of our framework are a semantic implicit neural representation and a compatible planning utility function based on semantic rendering and uncertainty estimation, enabling adaptive view planning to target objects of interest. Our planning approach achieves better reconstruction performance in terms of mesh and novel view rendering quality compared to implicit reconstruction baselines that do not consider semantics for view planning. Our framework further outperforms a state-of-the-art semantic-targeted active reconstruction pipeline based on explicit maps, justifying our choice of utilising implicit neural representations to tackle semantic-targeted active reconstruction problems.
Submitted 17 March, 2024;
originally announced March 2024.
-
Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning
Authors:
Apoorva Vashisth,
Julius Rückin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Autonomous robots are often employed for data collection due to their efficiency and low labour costs. A key task in robotic data acquisition is planning paths through an initially unknown environment to collect observations given platform-specific resource constraints, such as limited battery life. Adaptive online path planning in 3D environments is challenging due to the large set of valid actions and the presence of unknown occlusions. To address these issues, we propose a novel deep reinforcement learning approach for adaptively replanning robot paths to map targets of interest in unknown 3D environments. A key aspect of our approach is a dynamically constructed graph that restricts planning actions local to the robot, allowing us to react to newly discovered static obstacles and targets of interest. For replanning, we propose a new reward function that balances between exploring the unknown environment and exploiting online-discovered targets of interest. Our experiments show that our method enables more efficient target discovery compared to state-of-the-art learning and non-learning baselines. We also showcase our approach for orchard monitoring using an unmanned aerial vehicle in a photorealistic simulator. We open-source our code and model at: https://github.com/dmar-bonn/ipp-rl-3d.
Submitted 5 July, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
Ductile-to-brittle transition and yielding in soft amorphous materials: perspectives and open questions
Authors:
Thibaut Divoux,
Elisabeth Agoritsas,
Stefano Aime,
Catherine Barentin,
Jean-Louis Barrat,
Roberto Benzi,
Ludovic Berthier,
Dapeng Bi,
Giulio Biroli,
Daniel Bonn,
Philippe Bourrianne,
Mehdi Bouzid,
Emanuela Del Gado,
Hélène Delanoë-Ayari,
Kasra Farain,
Suzanne Fielding,
Matthias Fuchs,
Jasper van der Gucht,
Silke Henkes,
Maziyar Jalaal,
Yogesh M. Joshi,
Anaël Lemaître,
Robert L. Leheny,
Sébastien Manneville,
Kirsten Martens
, et al. (15 additional authors not shown)
Abstract:
Soft amorphous materials are viscoelastic solids ubiquitously found around us, from clays and cementitious pastes to emulsions and physical gels encountered in food or biomedical engineering. Under an external deformation, these materials undergo a noteworthy transition from a solid to a liquid state that reshapes the material microstructure. This yielding transition was the main theme of a workshop held from January 9 to 13, 2023 at the Lorentz Center in Leiden. The manuscript presented here offers a critical perspective on the subject, synthesizing insights from the various brainstorming sessions and informal discussions that unfolded during this week of vibrant exchange of ideas. The result of these exchanges takes the form of a series of open questions that represent outstanding experimental, numerical, and theoretical challenges to be tackled in the near future.
Submitted 21 December, 2023;
originally announced December 2023.
-
Developing Elementary Federated Learning Algorithms Leveraging the ChatGPT
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Ilija Basicevic
Abstract:
The Python Testbed for Federated Learning Algorithms is a simple Python FL framework that is easy to use for ML&AI developers who do not need to be professional programmers, and this paper shows that it is also amenable to emerging AI tools. In this paper, we successfully developed three elementary FL algorithms using the following three-step process: (i) specify context, (ii) ask ChatGPT to complete server and clients' callback functions, and (iii) verify the generated code.
Submitted 8 January, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Semi-Supervised Active Learning for Semantic Segmentation in Unknown Environments Using Informative Path Planning
Authors:
Julius Rückin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Semantic segmentation enables robots to perceive and reason about their environments beyond geometry. Most of such systems build upon deep learning approaches. As autonomous robots are commonly deployed in initially unknown environments, pre-training on static datasets cannot always capture the variety of domains and limits the robot's perception performance during missions. Recently, self-supervised and fully supervised active learning methods emerged to improve a robot's vision. These approaches rely on large in-domain pre-training datasets or require substantial human labelling effort. We propose a planning method for semi-supervised active learning of semantic segmentation that substantially reduces human labelling requirements compared to fully supervised approaches. We leverage an adaptive map-based planner guided towards the frontiers of unexplored space with high model uncertainty collecting training data for human labelling. A key aspect of our approach is to combine the sparse high-quality human labels with pseudo labels automatically extracted from highly certain environment map areas. Experimental results show that our method reaches segmentation performance close to fully supervised approaches with drastically reduced human labelling effort while outperforming self-supervised approaches.
Submitted 26 January, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Domain-Specific Deep Learning Feature Extractor for Diabetic Foot Ulcer Detection
Authors:
Reza Basiri,
Milos R. Popovic,
Shehroz S. Khan
Abstract:
Diabetic Foot Ulcer (DFU) is a condition requiring constant monitoring and evaluations for treatment. The DFU patient population is on the rise and will soon outpace the available health resources. Autonomous monitoring and evaluation of DFU wounds is a much-needed area in health care. In this paper, we evaluate and identify the most accurate feature extractor that is the core basis for developing a deep-learning wound detection network. For the evaluation, we used mAP and F1-score on the publicly available DFU2020 dataset. A combination of UNet and EfficientNetb3 feature extractor resulted in the best evaluation among the 14 networks compared. UNet and EfficientNetb3 can be used as the classifier in the development of a comprehensive DFU domain-specific autonomous wound detection pipeline.
Submitted 27 November, 2023;
originally announced November 2023.
-
Beamforming Performances of Holographic Surfaces
Authors:
Peng Wang,
Majid Nasiri Khormuji,
Branislav M. Popovic
Abstract:
In this paper, we investigate the beamforming performances of holographic surfaces implemented as lossless antenna arrays with less than half-wavelength spacing. We first develop a method to quantify the mutual coupling effect among the antennas in an array. The developed coupling model is general and applicable to arrays with arbitrary distribution of any type of antennas with arbitrary structure, physical size and radiation power pattern. In particular, it reduces to a neat analytical expression for arbitrarily deployed isotropic antenna arrays. We then discuss the beamforming design for holographic surfaces, and in particular provide analytical beamforming characterizations for arrays with two arbitrarily spaced isotropic antennas. Numerical results indicate that, by accounting for the mutual coupling effect between antennas, the array densification by packing more antennas in a given surface aperture can significantly enhance both the beamforming gain and spatial resolution of the system. The beamforming gain enhancement and beamwidth reduction can be several dBs higher than, and more than half of, those achieved by the conventional half-wavelength spaced antenna arrays in the same surface aperture. The gains of densification become saturated when the antenna spacing is below a critical value, and the saturated gain reduces as the surface aperture increases.
Submitted 8 November, 2023;
originally announced November 2023.
-
Synthesizing Diabetic Foot Ulcer Images with Diffusion Model
Authors:
Reza Basiri,
Karim Manji,
Francois Harton,
Alisha Poonja,
Milos R. Popovic,
Shehroz S. Khan
Abstract:
Diabetic Foot Ulcer (DFU) is a serious skin wound requiring specialized care. However, real DFU datasets are limited, hindering clinical training and research activities. In recent years, generative adversarial networks and diffusion models have emerged as powerful tools for generating synthetic images with remarkable realism and diversity in many applications. This paper explores the potential of diffusion models for synthesizing DFU images and evaluates their authenticity through expert clinician assessments. Additionally, evaluation metrics such as Frechet Inception Distance (FID) and Kernel Inception Distance (KID) are examined to assess the quality of the synthetic DFU images. A dataset of 2,000 DFU images is used for training the diffusion model, and the synthetic images are generated by applying diffusion processes. The results indicate that the diffusion model successfully synthesizes visually indistinguishable DFU images. 70% of the time, clinicians marked synthetic DFU images as real DFUs. However, clinicians demonstrate higher unanimous confidence in rating real images than synthetic ones. The study also reveals that FID and KID metrics do not significantly align with clinicians' assessments, suggesting alternative evaluation approaches are needed. The findings highlight the potential of diffusion models for generating synthetic DFU images and their impact on medical training programs and research in wound detection and classification.
Submitted 30 October, 2023;
originally announced October 2023.
-
Transverse Emittance Reduction in Muon Beams by Ionization Cooling
Authors:
The MICE Collaboration,
M. Bogomilov,
R. Tsenov,
G. Vankova-Kirilova,
Y. P. Song,
J. Y. Tang,
Z. H. Li,
R. Bertoni,
M. Bonesini,
F. Chignoli,
R. Mazza,
A. de Bari,
D. Orestano,
L. Tortora,
Y. Kuno,
H. Sakamoto,
A. Sato,
S. Ishimoto,
M. Chung,
C. K. Sung,
F. Filthaut,
M. Fedorov,
D. Jokovic,
D. Maletic,
M. Savic
, et al. (112 additional authors not shown)
Abstract:
Accelerated muon beams have been considered for next-generation studies of high-energy lepton-antilepton collisions and neutrino oscillations. However, high-brightness muon beams have not yet been produced. The main challenge for muon acceleration and storage stems from the large phase-space volume occupied by the beam, derived from the muon production mechanism through the decay of pions from proton collisions. Ionization cooling is the technique proposed to decrease the muon beam phase-space volume. Here we demonstrate a clear signal of ionization cooling through the observation of transverse emittance reduction in beams that traverse lithium hydride or liquid hydrogen absorbers in the Muon Ionization Cooling Experiment (MICE). The measurement is well reproduced by the simulation of the experiment and the theoretical model. The results shown here represent a substantial advance towards the realization of muon-based facilities that could operate at the energy and intensity frontiers.
Submitted 13 October, 2023; v1 submitted 9 October, 2023;
originally announced October 2023.
-
A Federated Learning Algorithms Development Paradigm
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Ilija Basicevic
Abstract:
At present many distributed and decentralized frameworks for federated learning algorithms are already available. However, development of such a framework targeting smart Internet of Things in edge systems is still an open challenge. A solution to that challenge named Python Testbed for Federated Learning Algorithms (PTB-FLA) appeared recently. This solution is written in pure Python, it supports both centralized and decentralized algorithms, and its usage was validated and illustrated by three simple algorithm examples. In this paper, we present the federated learning algorithms development paradigm based on PTB-FLA. The paradigm comprises the four phases named by the code they produce: (1) the sequential code, (2) the federated sequential code, (3) the federated sequential code with callbacks, and (4) the PTB-FLA code. The development paradigm is validated and illustrated in the case study on logistic regression, where both centralized and decentralized algorithms are developed.
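As a rough illustration of the centralized case mentioned in the abstract, the sketch below runs generic federated averaging for a one-feature logistic regression. It does not use the PTB-FLA API: the function names (`local_update`, `server_average`, `log_loss`), the toy data, and the hyperparameters are all assumptions made for this example.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def local_update(weights, data, lr=0.5, epochs=5):
    """Client step (hypothetical): batch gradient descent on local (x, y) pairs."""
    w, b = weights
    for _ in range(epochs):
        gw = gb = 0.0
        for x, y in data:
            err = sigmoid(w * x + b) - y  # gradient of the log-loss w.r.t. the logit
            gw += err * x
            gb += err
        w -= lr * gw / len(data)
        b -= lr * gb / len(data)
    return (w, b)

def server_average(client_weights):
    """Server step (hypothetical): unweighted mean of the client models."""
    n = len(client_weights)
    return (sum(w for w, _ in client_weights) / n,
            sum(b for _, b in client_weights) / n)

def log_loss(weights, data):
    """Mean binary cross-entropy of the model on (x, y) pairs."""
    w, b = weights
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for x, y in data:
        p = sigmoid(w * x + b)
        total += y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return -total / len(data)

# Toy data: each client keeps its own samples; only model weights are exchanged.
clients = [
    [(-2.0, 0), (-1.0, 0), (1.5, 1)],
    [(-0.5, 0), (1.0, 1), (2.0, 1)],
    [(-1.5, 0), (0.5, 1), (2.5, 1)],
]

model = (0.0, 0.0)
for _ in range(20):  # 20 communication rounds
    model = server_average([local_update(model, d) for d in clients])
```

In a decentralized variant, each node would average with its neighbours' models instead of relying on a single server; that variant is not shown here.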
Submitted 3 December, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Active Implicit Reconstruction Using One-Shot View Planning
Authors:
Hao Hu,
Sicong Pan,
Liren Jin,
Marija Popović,
Maren Bennewitz
Abstract:
Active object reconstruction using autonomous robots is gaining great interest. A primary goal in this task is to maximize the information of the object to be reconstructed, given limited on-board resources. Previous view planning methods exhibit inefficiency since they rely on an iterative paradigm based on explicit representations, consisting of (1) planning a path to the next-best view only; and (2) requiring a considerable number of less-gain views in terms of surface coverage. To address these limitations, we propose to integrate implicit representations into the One-Shot View Planning (OSVP). The key idea behind our approach is to use implicit representations to obtain the small missing surface areas instead of observing them with extra views. Therefore, we design a deep neural network, named OSVP, to directly predict a set of views given a dense point cloud refined from an initial sparse observation. To train our OSVP network, we generate supervision labels using dense point clouds refined by implicit representations and set covering optimization problems. Simulated experiments show that our method achieves sufficient reconstruction quality, outperforming several baselines under limited view and movement budgets. We further demonstrate the applicability of our approach in a real-world object reconstruction scenario.
Submitted 13 February, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF?
Authors:
Sicong Pan,
Liren Jin,
Hao Hu,
Marija Popović,
Maren Bennewitz
Abstract:
Neural Radiance Fields (NeRFs) are gaining significant interest for online active object reconstruction due to their exceptional memory efficiency and requirement for only posed RGB inputs. Previous NeRF-based view planning methods exhibit computational inefficiency since they rely on an iterative paradigm, consisting of (1) retraining the NeRF when new images arrive; and (2) planning a path to the next best view only. To address these limitations, we propose a non-iterative pipeline based on the Prediction of the Required number of Views (PRV). The key idea behind our approach is that the required number of views to reconstruct an object depends on its complexity. Therefore, we design a deep neural network, named PRVNet, to predict the required number of views, allowing us to tailor the data acquisition based on the object complexity and plan a globally shortest path. To train our PRVNet, we generate supervision labels using the ShapeNet dataset. Simulated experiments show that our PRV-based view planning method outperforms baselines, achieving good reconstruction quality while significantly reducing movement cost and planning time. We further justify the generalization ability of our approach in a real-world experiment.
Submitted 13 February, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
Fine-Resolution Silicon Photonic Wavelength-Selective Switch Using Hybrid Multimode Racetrack Resonators
Authors:
Lucas M. Cohen,
Saleha Fatema,
Vivek V. Wankhade,
Navin B. Lingaraju,
Bohan Zhang,
Deniz Onural,
Milos Popovic,
Andrew M. Weiner
Abstract:
In this work, we describe a procedure for synthesizing racetrack resonators with large quality factors and apply it to realize a multi-channel wavelength-selective switch (WSS) on a silicon photonic chip. We first determine the contribution of each component primitive to propagation loss in a racetrack resonator and use this data to develop a model for the frequency response of arbitrary-order, coupled-racetrack channel dropping filters. We design second-order racetrack filters based on this model and cascade multiple such filters to form a 1x7 WSS. We find good agreement between our model and device performance, with second-order racetrack filters that have ~1 dB of drop-port loss, ~2 GHz FWHM linewidth, and low optical crosstalk due to the quick filter roll-off of ~5.3 dB/GHz. Using a control algorithm, we show three-channel operation of our WSS with a channel spacing of only 10 GHz. Owing to the high quality factor and quick roll-off of our filter design, adjacent channel crosstalk is measured to be <-25 dB for channels spaced on a 10 GHz grid. As a further demonstration, we use five of seven WSS channels to perform a demultiplexing operation on both an 8 GHz and a 10 GHz grid. These results suggest that a low-loss WSS with fine channel resolution can be realized in a scalable manner using the silicon photonics platform.
Submitted 29 September, 2023;
originally announced September 2023.
-
Perceptual Factors for Environmental Modeling in Robotic Active Perception
Authors:
David Morilla-Cabello,
Jonas Westheider,
Marija Popovic,
Eduardo Montijano
Abstract:
Accurately assessing the potential value of new sensor observations is a critical aspect of planning for active perception. This task is particularly challenging when reasoning about high-level scene understanding using measurements from vision-based neural networks. Due to appearance-based reasoning, the measurements are susceptible to several environmental effects such as the presence of occluders, variations in lighting conditions, and redundancy of information due to similarity in appearance between nearby viewpoints. To address this, we propose a new active perception framework incorporating an arbitrary number of perceptual effects in planning and fusion. Our method models the correlation with the environment by a set of general functions termed perceptual factors to construct a perceptual map, which quantifies the aggregated influence of the environment on candidate viewpoints. This information is seamlessly incorporated into the planning and fusion processes by adjusting the uncertainty associated with measurements to weigh their contributions. We evaluate our perceptual maps in a simulated environment that reproduces environmental conditions common in robotics applications. Our results show that, by accounting for environmental effects within our perceptual maps, we improve state estimation by correctly selecting viewpoints and correctly accounting for measurement noise when it is affected by environmental factors. We furthermore deploy our approach on a ground robot to showcase its applicability for real-world active perception missions.
Submitted 10 October, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Permutation Polynomial Interleaved Zadoff-Chu Sequences
Authors:
Fredrik Berggren,
Branislav M. Popovic
Abstract:
Constant amplitude zero autocorrelation (CAZAC) sequences have modulus one and an ideal periodic autocorrelation function. Such sequences are used in cellular radio communications systems, e.g., for reference signals, synchronization signals and random access preambles. We propose a new family of CAZAC sequences, which is constructed by interleaving a Zadoff-Chu sequence by a quadratic permutation polynomial (QPP), or by a permutation polynomial whose inverse is a QPP. It is demonstrated that a set of orthogonal interleaved Zadoff-Chu sequences can be constructed by proper choice of QPPs.
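To make the construction concrete, the following minimal sketch interleaves a short Zadoff-Chu sequence with a quadratic permutation polynomial and checks the CAZAC property numerically. The parameters (length N = 9, root u = 1, QPP coefficients f1 = 1, f2 = 3) and the helper names are assumptions chosen for illustration, not values taken from the paper; the Zadoff-Chu definition used is the common odd-length form.

```python
import cmath
from math import gcd

def zadoff_chu(u, N):
    """Odd-length Zadoff-Chu sequence x[n] = exp(-j*pi*u*n*(n+1)/N)."""
    if N % 2 == 0 or gcd(u, N) != 1:
        raise ValueError("this sketch assumes odd N and gcd(u, N) == 1")
    return [cmath.exp(-1j * cmath.pi * u * n * (n + 1) / N) for n in range(N)]

def qpp(f1, f2, N):
    """Index map pi(n) = (f1*n + f2*n^2) mod N.

    Not every (f1, f2) gives a permutation; the caller must check.
    """
    return [(f1 * n + f2 * n * n) % N for n in range(N)]

def periodic_autocorr(seq):
    """Periodic autocorrelation r[d] = sum_n seq[(n+d) mod N] * conj(seq[n])."""
    N = len(seq)
    return [sum(seq[(n + d) % N] * seq[n].conjugate() for n in range(N))
            for d in range(N)]

# Illustrative parameters (assumed, not from the paper).
N, u = 9, 1
perm = qpp(1, 3, N)
assert sorted(perm) == list(range(N)), "chosen polynomial is not a permutation"

x = zadoff_chu(u, N)
y = [x[p] for p in perm]  # interleaved sequence y[n] = x[pi(n)]

# Constant amplitude: every element has modulus one.
max_amp_error = max(abs(abs(v) - 1.0) for v in y)
# Zero autocorrelation: all nonzero periodic lags vanish.
max_sidelobe = max(abs(r) for r in periodic_autocorr(y)[1:])
```

For these particular parameters both `max_amp_error` and `max_sidelobe` are zero up to floating-point rounding, so the interleaved sequence is CAZAC. Other coefficient choices need to be checked individually.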
Submitted 26 April, 2024; v1 submitted 28 June, 2023;
originally announced June 2023.
-
Correct orchestration of Federated Learning generic algorithms: formalisation and verification in CSP
Authors:
Ivan Prokić,
Silvia Ghilezan,
Simona Kašterović,
Miroslav Popovic,
Marko Popovic,
Ivan Kaštelan
Abstract:
Federated learning (FL) is a machine learning setting where clients keep the training data decentralised and collaboratively train a model either under the coordination of a central server (centralised FL) or in a peer-to-peer network (decentralised FL). Correct orchestration is one of the main challenges. In this paper, we formally verify the correctness of two generic FL algorithms, a centralised and a decentralised one, using the CSP process calculus and the PAT model checker. The CSP models consist of CSP processes corresponding to generic FL algorithm instances. PAT automatically proves the correctness of the two generic FL algorithms by proving their deadlock freeness (safety property) and successful termination (liveness property). The CSP models are constructed bottom-up by hand as a faithful representation of the real Python code and are automatically checked top-down by PAT.
Submitted 26 June, 2023;
originally announced June 2023.
-
A Simple Python Testbed for Federated Learning Algorithms
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Silvia Ghilezan
Abstract:
Nowadays many researchers are developing various distributed and decentralized frameworks for federated learning algorithms. However, development of such a framework targeting smart Internet of Things in edge systems is still an open challenge. In this paper, we present our solution to that challenge called Python Testbed for Federated Learning Algorithms. The solution is written in pure Python, and it supports both centralized and decentralized algorithms. The usage of the presented solution is both validated and illustrated by three simple algorithm examples.
Submitted 18 July, 2023; v1 submitted 31 May, 2023;
originally announced May 2023.
-
PSTM Transaction Scheduler Verification Based on CSP and Testing
Authors:
Miroslav Popovic,
Marko Popovic,
Branislav Kordic,
Huibiao Zhu
Abstract:
Many online transaction scheduler architectures and algorithms for various software transactional memories have been designed in order to maintain good system performance even for high concurrency workloads. Most of these algorithms were directly implemented in a target programming language, and experimentally evaluated, without theoretical proofs of correctness and analysis of their performance. Only a small number of these algorithms were modeled using formal methods, such as process algebra CSP, in order to verify that they satisfy properties such as deadlock-freeness and starvation-freeness. However, as this paper shows, using solely formal methods has its disadvantages, too. In this paper, we first analyze the previous CSP model of PSTM transaction scheduler by comparing the model checker PAT results with the manually derived expected results, for the given test workloads. Next, according to the results of this analysis, we correct and extend the CSP model. Finally, based on PAT results for the new CSP model, we analyze the performance of PSTM online transaction scheduling algorithms from the perspective of makespan, number of aborts, and throughput. Based on our findings, we may conclude that for the complete formal verification of trustworthy software, both its formal verification and its testing must be used jointly.
Submitted 15 May, 2023;
originally announced May 2023.
-
Scaling Description of Dynamical Heterogeneity and Avalanches of Relaxation in Glass-Forming Liquids
Authors:
Ali Tahaei,
Giulio Biroli,
Misaki Ozawa,
Marko Popović,
Matthieu Wyart
Abstract:
We provide a theoretical description of dynamical heterogeneities in glass-forming liquids, based on the premise that relaxation occurs via local rearrangements coupled by elasticity. In our framework, the growth of the dynamical correlation length $ξ$ and of the correlation volume $χ_4$ is controlled by a zero-temperature fixed point. We connect this critical behavior to the properties of the distribution of local energy barriers at zero temperature. Our description makes a direct connection between dynamical heterogeneities and avalanche-type relaxation associated with dynamic facilitation, allowing us to relate the size distribution of heterogeneities to their time evolution. Within an avalanche, a local region relaxes multiple times, and more often the larger the avalanche. This property, related to the nature of the zero-temperature fixed point, directly leads to decoupling of particle diffusion and relaxation time (the so-called Stokes-Einstein violation). Our most salient predictions are tested and confirmed by numerical simulations of scalar and tensorial thermal elasto-plastic models.
Submitted 3 August, 2023; v1 submitted 29 April, 2023;
originally announced May 2023.
-
Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction
Authors:
Zakary Georgis-Yap,
Milos R. Popovic,
Shehroz S. Khan
Abstract:
Epilepsy affects more than 50 million people worldwide, making it one of the world's most prevalent neurological diseases. The main symptom of epilepsy is seizures, which occur abruptly and can cause serious injury or death. The ability to predict the occurrence of an epileptic seizure could alleviate many risks and stresses people with epilepsy face. We formulate the problem as detecting preictal (or pre-seizure) EEG, with reference to normal EEG, as a precursor to an incoming seizure. To this end, we developed several supervised deep learning approaches to identify preictal EEG from normal EEG. We further develop novel unsupervised deep learning approaches that train the models on only normal EEG and detect pre-seizure EEG as an anomalous event. These deep learning models were trained and evaluated on two large EEG seizure datasets in a person-specific manner. We found that both supervised and unsupervised approaches are feasible; however, their performance varies depending on the patient, approach and architecture. This new line of research has the potential to develop therapeutic interventions and save human lives.
Submitted 3 February, 2024; v1 submitted 24 April, 2023;
originally announced April 2023.
-
Theory of rheology and aging of protein condensates
Authors:
Ryota Takaki,
Louise Jawerth,
Marko Popović,
Frank Jülicher
Abstract:
Biological condensates are assemblies of proteins and nucleic acids that form membraneless compartments in cells and play essential roles in cellular functions. In many cases they exhibit the physical properties of liquid droplets that coexist in a surrounding fluid. Recently, quantitative studies on the material properties of biological condensates have become available, revealing complex material properties. In vitro experiments have shown that protein condensates exhibit time-dependent material properties, similar to aging in glasses. To understand this phenomenon from a theoretical perspective, we develop a rheological model based on the physical picture of protein diffusion and stochastic binding inside condensates. The complex nature of protein interactions is captured by a distribution of binding energies, incorporated in a trap model originally developed to study glass transitions. Our model can describe diffusion of constituent particles, as well as the material response to time-dependent forces, and it recapitulates the age-dependent relaxation time of Maxwell glass observed experimentally both in active and passive rheology. We derive a generalized fluctuation-response relation for our model in which the relaxation function does not obey time translation invariance. Our study sheds light on the complex material properties of biological condensates and provides a theoretical framework for understanding their aging behavior.
Submitted 30 June, 2023; v1 submitted 31 March, 2023;
originally announced March 2023.
-
Graph-based View Motion Planning for Fruit Detection
Authors:
Tobias Zaenker,
Julius Rückin,
Rohit Menon,
Marija Popović,
Maren Bennewitz
Abstract:
Crop monitoring is crucial for maximizing agricultural productivity and efficiency. However, monitoring large and complex structures such as sweet pepper plants presents significant challenges, especially due to frequent occlusions of the fruits. Traditional next-best view planning can lead to unstructured and inefficient coverage of the crops. To address this, we propose a novel view motion planner that builds a graph network of viable view poses and trajectories between nearby poses, thereby considering robot motion constraints. The planner searches the graphs for view sequences with the highest accumulated information gain, allowing for efficient pepper plant monitoring while minimizing occlusions. The generated view poses aim at both sufficiently covering already detected and discovering new fruits. The graph and the corresponding best view pose sequence are computed with a limited horizon and are adaptively updated in fixed time intervals as the system gathers new information. We demonstrate the effectiveness of our approach through simulated and real-world experiments using a robotic arm equipped with an RGB-D camera and mounted on a trolley. As the experimental results show, our planner produces view pose sequences to systematically cover the crops and leads to increased fruit coverage when given a limited time in comparison to a state-of-the-art single next-best view planner.
Submitted 15 August, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
NeU-NBV: Next Best View Planning Using Uncertainty Estimation in Image-Based Neural Rendering
Authors:
Liren Jin,
Xieyuanli Chen,
Julius Rückin,
Marija Popović
Abstract:
Autonomous robotic tasks require actively perceiving the environment to achieve application-specific goals. In this paper, we address the problem of positioning an RGB camera to collect the most informative images to represent an unknown scene, given a limited measurement budget. We propose a novel mapless planning framework to iteratively plan the next best camera view based on collected image measurements. A key aspect of our approach is a new technique for uncertainty estimation in image-based neural rendering, which guides measurement acquisition at the most uncertain view among view candidates, thus maximising the information value during data collection. By incrementally adding new measurements into our image collection, our approach efficiently explores an unknown scene in a mapless manner. We show that our uncertainty estimation is generalisable and valuable for view planning in unknown scenes. Our planning experiments using synthetic and real-world data verify that our uncertainty-guided approach finds informative images leading to more accurate scene representations when compared against baselines.
Submitted 23 July, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning
Authors:
Jonas Westheider,
Julius Rückin,
Marija Popović
Abstract:
Efficient aerial data collection is important in many remote sensing applications. In large-scale monitoring scenarios, deploying a team of unmanned aerial vehicles (UAVs) offers improved spatial coverage and robustness against individual failures. However, a key challenge is cooperative path planning for the UAVs to efficiently achieve a joint mission goal. We propose a novel multi-agent informative path planning approach based on deep reinforcement learning for adaptive terrain monitoring scenarios using UAV teams. We introduce new network feature representations to effectively learn path planning in a 3D workspace. By leveraging a counterfactual baseline, our approach explicitly addresses credit assignment to learn cooperative behaviour. Our experimental evaluation shows improved planning performance, i.e., it maps regions of interest more quickly, with respect to non-counterfactual variants. Results on synthetic and real-world data show that our approach has superior performance compared to state-of-the-art non-learning-based methods, while being transferable to varying team sizes and communication constraints.
Submitted 2 March, 2023;
originally announced March 2023.
-
An Informative Path Planning Framework for Active Learning in UAV-based Semantic Mapping
Authors:
Julius Rückin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Unmanned aerial vehicles (UAVs) are frequently used for aerial mapping and general monitoring tasks. Recent progress in deep learning enabled automated semantic segmentation of imagery to facilitate the interpretation of large-scale complex environments. Commonly used supervised deep learning for segmentation relies on large amounts of pixel-wise labelled data, which is tedious and costly to annotate. The domain-specific visual appearance of aerial environments often prevents the usage of models pre-trained on publicly available datasets. To address this, we propose a novel general planning framework for UAVs to autonomously acquire informative training images for model re-training. We leverage multiple acquisition functions and fuse them into probabilistic terrain maps. Our framework combines the mapped acquisition function information into the UAV's planning objectives. In this way, the UAV adaptively acquires informative aerial images to be manually labelled for model re-training. Experimental results on real-world data and in a photorealistic simulation show that our framework maximises model performance and drastically reduces labelling efforts. Our map-based planners outperform state-of-the-art local planning.
Submitted 6 September, 2023; v1 submitted 7 February, 2023;
originally announced February 2023.