Skip to main content

Showing 1–24 of 24 results for author: Garcia, A L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.01352  [pdf, other

    cs.LG cs.CR

    Metric Privacy in Federated Learning for Medical Imaging: Improving Convergence and Preventing Client Inference Attacks

    Authors: Judith Sáinz-Pardo Díaz, Andreas Athanasiou, Kangsoo Jung, Catuscia Palamidessi, Álvaro López García

    Abstract: Federated learning is a distributed learning technique that allows training a global model with the participation of different data owners without the need to share raw data. This architecture is orchestrated by a central server that aggregates the local models from the clients. This server may be trusted, but not all nodes in the network. Then, differential privacy (DP) can be used to privatize t… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  2. arXiv:2501.15949  [pdf, other

    cs.LG

    Enhancing the Convergence of Federated Learning Aggregation Strategies with Limited Data

    Authors: Judith Sáinz-Pardo Díaz, Álvaro López García

    Abstract: The development of deep learning techniques is a leading field applied to cases in which medical data is used, particularly in cases of image diagnosis. This type of data has privacy and legal restrictions that in many cases prevent it from being processed from central servers. However, in this area collaboration between different research centers, in order to create models as robust as possible,… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  3. arXiv:2408.10766  [pdf, ps, other

    cs.CR cs.DB cs.SE

    An Open Source Python Library for Anonymizing Sensitive Data

    Authors: Judith Sáinz-Pardo Díaz, Álvaro López García

    Abstract: Open science is a fundamental pillar to promote scientific progress and collaboration, based on the principles of open data, open source and open access. However, the requirements for publishing and sharing open data are in many cases difficult to meet in compliance with strict data protection regulations. Consequently, researchers need to rely on proven methods that allow them to anonymize their… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Preprint under review

  4. arXiv:2408.05761  [pdf, other

    cs.LG physics.ao-ph

    Personalized Federated Learning for improving radar based precipitation nowcasting on heterogeneous areas

    Authors: Judith Sáinz-Pardo Díaz, María Castrillo, Juraj Bartok, Ignacio Heredia Cachá, Irina Malkin Ondík, Ivan Martynovskyi, Khadijeh Alibabaei, Lisana Berberi, Valentin Kozlov, Álvaro López García

    Abstract: The increasing generation of data in different areas of life, such as the environment, highlights the need to explore new techniques for processing and exploiting data for useful purposes. In this context, artificial intelligence techniques, especially through deep learning models, are key tools to be used on the large amount of data that can be obtained, for example, from weather radars. In many… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: Accepted for publication in Earth Science Informatics

  5. arXiv:2312.11531  [pdf, other

    q-fin.ST cs.LG cs.SI

    The irruption of cryptocurrencies into Twitter cashtags: a classifying solution

    Authors: Ana Fernández Vilas, Rebeca Díaz Redondo, Antón Lorenzo García

    Abstract: There is a consensus about the good sensing characteristics of Twitter to mine and uncover knowledge in financial markets, being considered a relevant feeder for taking decisions about buying or holding stock shares and even for detecting stock manipulation. Although Twitter hashtags allow to aggregate topic-related content, a specific mechanism for financial information also exists: Cashtag. Howe… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: EEE Access, vol. 8, 2020

  6. arXiv:2305.07415  [pdf, other

    cs.LG cs.CR cs.DB

    Comparison of machine learning models applied on anonymized data with different techniques

    Authors: Judith Sáinz-Pardo Díaz, Álvaro López García

    Abstract: Anonymization techniques based on obfuscating the quasi-identifiers by means of value generalization hierarchies are widely used to achieve preset levels of privacy. To prevent different types of attacks against database privacy it is necessary to apply several anonymization techniques beyond the classical k-anonymity or $\ell$-diversity. However, the application of these methods is directly conne… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted for publication: IEEE International Conference in Cyber Security and Resilience 2023 (IEEE CSR)

  7. pyCANON: A Python library to check the level of anonymity of a dataset

    Authors: Judith Sáinz-Pardo Díaz, Álvaro López García

    Abstract: Openly sharing data with sensitive attributes and privacy restrictions is a challenging task. In this document we present the implementation of pyCANON, a Python library and command line interface (CLI) to check and assess the level of anonymity of a dataset through some of the most common anonymization techniques: k-anonymity, ($α$,k)-anonymity, $\ell$-diversity, entropy $\ell$-diversity, recursi… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  8. A Container-Based Workflow for Distributed Training of Deep Learning Algorithms in HPC Clusters

    Authors: Jose González-Abad, Álvaro López García, Valentin Y. Kozlov

    Abstract: Deep learning has been postulated as a solution for numerous problems in different branches of science. Given the resource-intensive nature of these models, they often need to be executed on specialized hardware such graphical processing units (GPUs) in a distributed manner. In the academic field, researchers get access to this kind of resources through High Performance Computing (HPC) clusters. T… ▽ More

    Submitted 14 November, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: This version of the article has been accepted for publication but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/s10586-022-03798-7

  9. arXiv:2207.08581  [pdf, other

    cs.LG cs.AI cs.CV eess.IV

    Study of the performance and scalability of federated learning for medical imaging with intermittent clients

    Authors: Judith Sáinz-Pardo Díaz, Álvaro López García

    Abstract: Federated learning is a data decentralization privacy-preserving technique used to perform machine or deep learning in a secure way. In this paper we present theoretical aspects about federated learning, such as the presentation of an aggregation operator, different types of federated learning, and issues to be taken into account in relation to the distribution of data from the clients, together w… ▽ More

    Submitted 3 November, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

  10. Forecasting COVID-19 spreading trough an ensemble of classical and machine learning models: Spain's case study

    Authors: Ignacio Heredia Cacha, Judith Sainz-Pardo Díaz, María Castrillo Melguizo, Álvaro López García

    Abstract: In this work we evaluate the applicability of an ensemble of population models and machine learning models to predict the near future evolution of the COVID-19 pandemic, with a particular use case in Spain. We rely solely in open and public datasets, fusing incidence, vaccination, human mobility and weather data to feed our machine learning models (Random Forest, Gradient Boosting, k-Nearest Neigh… ▽ More

    Submitted 12 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

  11. arXiv:2110.12025  [pdf, other

    cs.NI cs.LG

    Interaction and Conflict Management in AI-assisted Operational Control Loops in 6G

    Authors: Saeedeh Parsaeefard, Pooyan Habibi, Alberto Leon Garcia

    Abstract: This paper studies autonomous and AI-assisted control loops (ACLs) in the next generation of wireless networks in the lens of multi-agent environments. We will study the diverse interactions and conflict management among these loops. We propose "interaction and conflict management" (ICM) modules to achieve coherent, consistent and interactions among these ACLs. We introduce three categories of ACL… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  12. arXiv:2104.12608  [pdf, other

    cs.NI cs.DC cs.LG

    Generalized ADMM in Distributed Learning via Variational Inequality

    Authors: Saeedeh Parsaeefard, Alberto Leon Garcia

    Abstract: Due to the explosion in size and complexity of modern data sets and privacy concerns of data holders, it is increasingly important to be able to solve machine learning problems in distributed manners. The Alternating Direction Method of Multipliers (ADMM) through the concept of consensus variables is a practical algorithm in this context where its diverse variations and its performance have been s… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

  13. arXiv:2104.11700  [pdf, other

    cs.LG cs.DC cs.NE cs.NI

    Robust Federated Learning by Mixture of Experts

    Authors: Saeedeh Parsaeefard, Sayed Ehsan Etesami, Alberto Leon Garcia

    Abstract: We present a novel weighted average model based on the mixture of experts (MoE) concept to provide robustness in Federated learning (FL) against the poisoned/corrupted/outdated local models. These threats along with the non-IID nature of data sets can considerably diminish the accuracy of the FL model. Our proposed MoE-FL setup relies on the trust between users and the server where the users share… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  14. arXiv:2006.05027  [pdf, ps, other

    cs.IT cs.NI

    Stochastic Geometry-Based Modeling and Analysis of Beam Management in 5G

    Authors: Sanket S. Kalamkar, Fuad M. Abinader Jr., François Baccelli, Andrea S. Marcano Fani, and Luis G. Uzeda Garcia

    Abstract: Beam management is central in the operation of dense 5G cellular networks. Focusing the energy radiated to mobile terminals (MTs) by increasing the number of beams per cell increases signal power and decreases interference, and has hence the potential to bring major improvements on area spectral efficiency (ASE). This benefit, however, comes with unavoidable overheads that increase with the number… ▽ More

    Submitted 14 September, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: This is a work in progress. Your comments are welcome. 9 pages, 6 figures

  15. Estimation of high frequency nutrient concentrations from water quality surrogates using machine learning methods

    Authors: María Castrillo, Álvaro López García

    Abstract: Continuous high frequency water quality monitoring is becoming a critical task to support water management. Despite the advancements in sensor technologies, certain variables cannot be easily and/or economically monitored in-situ and in real time. In these cases, surrogate measures can be used to make estimations by means of data-driven models. In this work, variables that are commonly measured in… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

    Journal ref: Water Research. Volume 172, 1 April 2020, 115490

  16. arXiv:1912.05571  [pdf, other

    cs.LG cs.DC stat.ML

    Representation of Federated Learning via Worst-Case Robust Optimization Theory

    Authors: Saeedeh Parsaeefard, Iman Tabrizian, Alberto Leon Garcia

    Abstract: Federated learning (FL) is a distributed learning approach where a set of end-user devices participate in the learning process by acting on their isolated local data sets. Here, we process local data sets of users where worst-case optimization theory is used to reformulate the FL problem where the impact of local data sets in training phase is considered as an uncertain function bounded in a close… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

  17. An efficient cloud scheduler design supporting preemptible instances

    Authors: Álvaro López García, Enol Fernández-del-Castillo, Isabel Campos Plasencia

    Abstract: Maximizing resource utilization by performing an efficient resource provisioning is a key factor for any cloud provider: commercial actors can maximize their revenues, whereas scientific and non-commercial providers can maximize their infrastructure utilization. Traditionally, batch systems have allowed data centers to fill their resources as much as possible by using backfilling and similar techn… ▽ More

    Submitted 28 January, 2020; v1 submitted 27 December, 2018; originally announced December 2018.

    Journal ref: Future Generation Computer Systems (2019)

  18. umd-verification: Automation of Software Validation for the EGI federated e-Infrastructure

    Authors: Pablo Orviz Fernandez, Joao Pina, Alvaro Lopez Garcia, Isabel Campos Plasencia, Mario David, Jorge Gomes

    Abstract: Supporting e-Science in the EGI e-Infrastructure requires extensive and reliable software, for advanced computing use, deployed across over approximately 300 European and worldwide data centers. The Unified Middleware Distribution (UMD) and Cloud Middleware Distribution (CMD) are the channels to deliver the software for the EGI e-Infrastructure consumption. The software is compiled, validated and… ▽ More

    Submitted 30 July, 2018; originally announced July 2018.

    Comments: This is the author's pre-print version of this work. The final publication is available at http://dx.doi.org/10.1007/s10723-018-9454-2

    Journal ref: Journal of Grid COmputing (2018) 1-14

  19. Efficient image deployment in cloud environments

    Authors: Álvaro López García, Enol Fernández del Castillo

    Abstract: The biggest overhead for the instantiation of a virtual machine in a cloud infrastructure is the time spent in transferring the image of the virtual machine into the physical node that executes it. This overhead becomes larger for requests composed of several virtual machines to be started concurrently, and the illusion of flexibility and elasticity usually associated with the cloud computing mode… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Journal ref: Journal of Network and Computer Applications, Volume 63, 2016, Pages 140-149

  20. Standards for enabling heterogeneous IaaS cloud federations

    Authors: Álvaro López García, Enol Fernández del Castillo, Pablo Orviz Fernández

    Abstract: Technology market is continuing a rapid growth phase where different resource providers and Cloud Management Frameworks are positioning to provide ad-hoc solutions -in terms of management interfaces, information discovery or billing- trying to differentiate from competitors but that as a result remain incompatible between them when addressing more complex scenarios like federated clouds. Grasping… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Journal ref: Computer Standards & Interfaces, Volume 47, 2016, Pages 19-23,

  21. Orchestrating Complex Application Architectures in Heterogeneous Clouds

    Authors: Miguel Caballer, Sahdev Zala, Álvaro López García, Germán Moltó, Pablo Orviz Fernández, Mathieu Velten

    Abstract: Private cloud infrastructures are now widely deployed and adopted across technology industries and research institutions. Although cloud computing has emerged as a reality, it is now known that a single cloud provider cannot fully satisfy complex user requirements. This has resulted in a growing interest in developing hybrid cloud solutions that bind together distinct and heterogeneous cloud infra… ▽ More

    Submitted 9 November, 2017; originally announced November 2017.

    Journal ref: J Grid Computing (2017)

  22. Resource provisioning in Science Clouds: Requirements and challenges

    Authors: Álvaro López García, Enol Fernández-del-Castillo, Pablo Orviz Fernández, Isabel Campos Plasencia, Jesús Marco de Lucas

    Abstract: Cloud computing has permeated into the information technology industry in the last few years, and it is emerging nowadays in scientific environments. Science user communities are demanding a broad range of computing power to satisfy the needs of high-performance applications, such as local clusters, high-performance computing systems, and computing grids. Different workloads are needed from differ… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

    Journal ref: Software: Practice and Experience. 2017;1-13

  23. Improved Cloud resource allocation: how INDIGO-DataCloud is overcoming the current limitations in Cloud schedulers

    Authors: Alvaro Lopez Garcia, Lisa Zangrando, Massimo Sgaravatto, Vincent Llorens, Sara Vallero, Valentina Zaccolo, Stefano Bagnasco, Sonia Taneja, Stefano Dal Pra, Davide Salomoni, Giacinto Donvito

    Abstract: Performing efficient resource provisioning is a fundamental aspect for any resource provider. Local Resource Management Systems (LRMS) have been used in data centers for decades in order to obtain the best usage of the resources, providing their fair usage and partitioning for the users. In contrast, current cloud schedulers are normally based on the immediate allocation of resources on a first-co… ▽ More

    Submitted 20 July, 2017; originally announced July 2017.

    Journal ref: 2017 J. Phys.: Conf. Ser. 898 092010

  24. arXiv:1309.6109  [pdf, ps, other

    cs.DC cs.CE

    Analysis of Scientific Cloud Computing requirements

    Authors: Álvaro López García, Enol Fernández del Castillo

    Abstract: While the requirements of enterprise and web applications have driven the development of Cloud computing, some of its key features, such as customized environments and rapid elasticity, could also benefit scientific applications. However, neither virtualization techniques nor Cloud-like access to resources is common in scientific computing centers due to the negative perception of the impact that… ▽ More

    Submitted 22 June, 2015; v1 submitted 24 September, 2013; originally announced September 2013.

    Journal ref: 7th Iberian Grid Infrastructure Conference (IBERGRID 2013) proceedings, pp 147-158