Showing 1–10 of 10 results for author: Olcoz, K

Searching in archive cs.
  1. Balanced segmentation of CNNs for multi-TPU inference

    Authors: Jorge Villarrubia, Luis Costero, Francisco D. Igual, Katzalin Olcoz

    Abstract: In this paper, we propose different alternatives for convolutional neural networks (CNNs) segmentation, addressing inference processes on computing architectures composed by multiple Edge TPUs. Specifically, we compare the inference performance for a number of state-of-the-art CNN models taking as a reference inference times on one TPU and a compiler-based pipelined inference implementation as pro…

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: Accepted for publication at The Journal of Supercomputing. The final published version is available in: https://doi.org/10.1007/s11227-024-06605-9

    ACM Class: C.1.3; B.8

    Journal ref: The Journal of Supercomputing, 2025

  2. Improving inference time in multi-TPU systems with profiled model segmentation

    Authors: Jorge Villarrubia, Luis Costero, Francisco D. Igual, Katzalin Olcoz

    Abstract: In this paper, we systematically evaluate the inference performance of the Edge TPU by Google for neural networks with different characteristics. Specifically, we determine that, given the limited amount of on-chip memory on the Edge TPU, accesses to external (host) memory rapidly become an important performance bottleneck. We demonstrate how multiple devices can be jointly used to alleviate the b…

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: Accepted for publication at the 2023 Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP). The final published version is available in IEEE Xplore: https://doi.org/10.1109/PDP59025.2023.00020

    ACM Class: C.1.3; B.8

    Journal ref: 2023 Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

  3. arXiv:2411.14954  [pdf]

    cs.AR

    Teaching Experiences using the RVfpga Package

    Authors: D. Chaver, S. Harris, L. Pinuel, O. Kindgren, R. Kravitz, J. I. Gomez, F. Castro, K. Olcoz, J. Villalba, A. Grinshpun, F. Gabbay, L. Seed, R. Duarte, M. Lopez, O. Alonso, R. Owen

    Abstract: The RVfpga course offers a solid introduction to computer architecture using the RISC-V instruction set and FPGA technology. It focuses on providing hands-on experience with real-world RISC-V cores, the VeeR EH1 and the VeeR EL2, developed by Western Digital a few years ago and currently hosted by ChipsAlliance. This course is particularly aimed at educators and students in computer science, compu…

    Submitted 22 November, 2024; originally announced November 2024.

  4. arXiv:2404.15324  [pdf, other]

    eess.SP cs.AI eess.SY

    Advanced simulation-based predictive modelling for solar irradiance sensor farms

    Authors: José L. Risco-Martín, Ignacio-Iker Prado-Rujas, Javier Campoy, María S. Pérez, Katzalin Olcoz

    Abstract: As solar power continues to grow and replace traditional energy sources, the need for reliable forecasting models becomes increasingly important to ensure the stability and efficiency of the grid. However, the management of these models still needs to be improved, and new tools and technologies are required to handle the deployment and control of solar facilities. This work introduces a novel fram…

    Submitted 5 April, 2024; originally announced April 2024.

    Journal ref: Journal of Simulation, pp. 1-18, 2024

  5. Energy efficiency optimization of task-parallel codes on asymmetric architectures

    Authors: Luis Costero, Francisco D. Igual, Katzalin Olcoz, Francisco Tirado

    Abstract: We present a family of policies that, integrated within a runtime task scheduler (Nanox), pursue the goal of improving the energy efficiency of task-parallel executions with no intervention from the programmer. The proposed policies tackle the problem by modifying the core operating frequency via DVFS mechanisms, or by enabling/disabling the mapping of tasks to specific cores at selected execution…

    Submitted 9 February, 2024; originally announced February 2024.

  6. Leveraging knowledge-as-a-service (KaaS) for QoS-aware resource management in multi-user video transcoding

    Authors: Luis Costero, Francisco D. Igual, Katzalin Olcoz, Francisco Tirado

    Abstract: The coexistence of parallel applications in shared computing nodes, each one featuring different Quality of Service (QoS) requirements, carries out new challenges to improve resource occupation while keeping acceptable rates in terms of QoS. As more application-specific and system-wide metrics are included as QoS dimensions, or under situations in which resource-usage limits are strict, building a…

    Submitted 7 February, 2024; originally announced February 2024.

    Journal ref: Journal of Supercomputing 76, pp. 9388 to 9403 (2020)

  7. Optimization of a Line Detection Algorithm for Autonomous Vehicles on a RISC-V with Accelerator

    Authors: María José Belda, Katzalin Olcoz, Fernando Castro, Francisco Tirado

    Abstract: In recent years, autonomous vehicles have attracted the attention of many research groups, both in academia and business, including researchers from leading companies such as Google, Uber and Tesla. This type of vehicles are equipped with systems that are subject to very strict requirements, essentially aimed at performing safe operations -- both for potential passengers and pedestrians -- as well…

    Submitted 1 February, 2024; originally announced February 2024.

    Journal ref: Journal of Computer Science & Technology, 22(2), 2022

  8. A Unified Cloud-Enabled Discrete Event Parallel and Distributed Simulation Architecture

    Authors: José L. Risco-Martín, Kevin Henares, Saurabh Mittal, Luis F. Almendras, Katzalin Olcoz

    Abstract: Cloud simulation environments today are largely employed to model and simulate complex systems for remote accessibility and variable capacity requirements. In this regard, scalability issues in Modeling and Simulation (M&S) computational requirements can be tackled through the elasticity of on-demand Cloud deployment. However, implementing a high performance cloud M&S framework following these e…

    Submitted 22 February, 2023; originally announced February 2023.

    Journal ref: Simulation Modelling Practice and Theory, 118, 2022

  9. arXiv:1904.11268  [pdf, other]

    cs.CR

    Detecting time-fragmented cache attacks against AES using Performance Monitoring Counters

    Authors: Iván Prada, Francisco D. Igual, Katzalin Olcoz

    Abstract: Cache timing attacks use shared caches in multi-core processors as side channels to extract information from victim processes. These attacks are particularly dangerous in cloud infrastructures, in which the deployed countermeasures cause collateral effects in terms of performance loss and increase in energy consumption. We propose to monitor the victim process using an independent monitoring (dete…

    Submitted 25 April, 2019; originally announced April 2019.

  10. arXiv:1509.02058  [pdf, other]

    cs.DC

    Revisiting Conventional Task Schedulers to Exploit Asymmetry in ARM big.LITTLE Architectures for Dense Linear Algebra

    Authors: Luis Costero, Francisco D. Igual, Katzalin Olcoz, Enrique S. Quintana-Ortí

    Abstract: Dealing with asymmetry in the architecture opens a plethora of questions from the perspective of scheduling task-parallel applications, and there exist early attempts to address this problem via ad-hoc strategies embedded into a runtime framework. In this paper we take a different path, which consists in addressing the complexity of the problem at the library level, via a few asymmetry-aware funda…

    Submitted 7 September, 2015; originally announced September 2015.