Search | arXiv e-print repository

arXiv:2407.12008 [pdf, other]

Enabling an OpenStack-based cloud on top of RISC-V hardware

Authors: Diego Marrón, Aaron Call, Josep Ll. Berral, Ramon Nou

Abstract: The European Union's technological sovereignty strategy centers around the RISC-V Instruction Set Architecture, with the European Processor Initiative leading efforts to build production-ready processors. Focusing on realizing a functional RISC-V cloud ecosystem, the Vitamin-V European project developed an OpenStack cluster utilizing genuine hardware. In this poster, we detail our efforts in porting and setting up the cluster and the many software services that OpenStack requires to run properly on real hardware, with the goal of building a minimal viable prototype OpenStack cluster. The cluster is almost functional, and we expect it to be complete in the next few months.

Submitted 17 June, 2024; originally announced July 2024.

Comments: RISC-V Summit Europe 2024

arXiv:2407.00052 [pdf]

Vitamin-V: Expanding Open-Source RISC-V Cloud Environments

Authors: Ramon Canal, Stefano Di Carlo, Dimitris Gizopoulos, Alberto Scionti, Francesco Lubrano, Josep-Lluís Berral, Aaron Call, Diego Marron, Konstantinos Nikas, Dionisios Pnevmatikatos, Daniel Raho, Alvise Rigo, Yannis Papaefstathiou, José María Arnau, Angelos Arelakis

Abstract: As one of the key contributions of Vitamin-V (a 2023-2025 Horizon Europe project), we develop a complete RISC-V open-source software stack for cloud services with performance comparable to its cloud-dominant x86 counterpart. In this paper, we detail the software suites and applications ported, plus the three cloud setups under evaluation.

Submitted 12 June, 2024; originally announced July 2024.

Comments: RISC-V Summit Europe 2024, 24-28 June 2024

arXiv:1905.05881 [pdf, other]

Resource-aware Elastic Swap Random Forest for Evolving Data Streams

Authors: Diego Marrón, Eduard Ayguadé, José Ramon Herrero, Albert Bifet

Abstract: Continual learning based on data stream mining deals with ubiquitous sources of Big Data arriving at high velocity and in real time. Adaptive Random Forest (ARF) is a popular ensemble method used for continual learning due to its simplicity in combining adaptive leveraging bagging with fast random Hoeffding trees. While the default ARF size provides competitive accuracy, it is usually over-provisioned, resulting in the use of additional classifiers that only contribute to increasing CPU and memory consumption with marginal impact on the overall accuracy. This paper presents Elastic Swap Random Forest (ESRF), a method for reducing the number of trees in the ARF ensemble while providing similar accuracy. ESRF extends ARF with two orthogonal components: 1) a swap component that splits learners into two sets based on their accuracy (only classifiers with the highest accuracy are used to make predictions); and 2) an elastic component for dynamically increasing or decreasing the number of classifiers in the ensemble. The experimental evaluation of ESRF and comparison with the original ARF shows how the two new components contribute to reducing the number of classifiers up to one third while providing almost the same accuracy, resulting in speed-ups in terms of per-sample execution time close to 3x.

Submitted 14 May, 2019; originally announced May 2019.

arXiv:1511.00971 [pdf, other]

Data Stream Classification using Random Feature Functions and Novel Method Combinations

Authors: Diego Marrón, Jesse Read, Albert Bifet, Nacho Navarro

Abstract: Big Data streams are becoming faster, bigger, and more commonplace. In this scenario, Hoeffding Trees are an established method for classification. Several extensions exist, including high-performing ensemble setups such as online and leveraging bagging. Also, $k$-nearest neighbors is a popular choice, with most extensions dealing with the inherent performance limitations over a potentially infinite stream. At the same time, gradient descent methods are becoming increasingly popular, owing in part to the successes of deep learning. Although deep neural networks can learn incrementally, they have so far proved too sensitive to hyper-parameter options and initial conditions to be considered an effective 'off-the-shelf' data-streams solution. In this work, we look at combinations of Hoeffding trees, nearest neighbour, and gradient descent methods with a streaming preprocessing approach in the form of a random feature functions filter for additional predictive power. We further extend the investigation to implementing methods on GPUs, which we test on some large real-world datasets, and show the benefits of using GPUs for data-stream learning due to their high scalability. Our empirical evaluation yields positive results for the novel approaches that we experiment with, highlighting important issues and shedding light on promising future directions in approaches to data-stream classification.

Submitted 3 November, 2015; originally announced November 2015.

Comments: 20 pages, journal

Showing 1–4 of 4 results for author: Marrón, D