Skip to main content

Showing 1–50 of 56 results for author: Cruz, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.13537  [pdf, ps, other

    cs.LG cs.DC

    FedTilt: Towards Multi-Level Fairness-Preserving and Robust Federated Learning

    Authors: Binghui Zhang, Luis Mares De La Cruz, Binghui Wang

    Abstract: Federated Learning (FL) is an emerging decentralized learning paradigm that can partly address the privacy concern that cannot be handled by traditional centralized and distributed learning. Further, to make FL practical, it is also necessary to consider constraints such as fairness and robustness. However, existing robust FL methods often produce unfair models, and existing fair FL methods only c… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: 13 pages

  2. arXiv:2501.07180  [pdf

    cs.RO cs.HC eess.SY

    Evaluating Robotic Approach Techniques for the Insertion of a Straight Instrument into a Vitreoretinal Surgery Trocar

    Authors: Ross Henry, Martin Huber, Anestis Mablekos-Alexiou, Carlo Seneci, Mohamed Abdelaziz, Hans Natalius, Lyndon da Cruz, Christos Bergeles

    Abstract: Advances in vitreoretinal robotic surgery enable precise techniques for gene therapies. This study evaluates three robotic approaches using the 7-DoF robotic arm for docking a micro-precise tool to a trocar: fully co-manipulated, hybrid co-manipulated/teleoperated, and hybrid with camera assistance. The fully co-manipulated approach was the fastest but had a 42% success rate. Hybrid methods showed… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: 2 Pages, 2 Figures, 1 Table

  3. arXiv:2412.10063  [pdf, other

    cs.SE

    Unveiling the Energy Vampires: A Methodology for Debugging Software Energy Consumption

    Authors: Enrique Barba Roque, Luis Cruz, Thomas Durieux

    Abstract: Energy consumption in software systems is becoming increasingly important, especially in large-scale deployments. However, debugging energy-related issues remains challenging due to the lack of specialized tools. This paper presents an energy debugging methodology for identifying and isolating energy consumption hotspots in software systems. We demonstrate the methodology's effectiveness through a… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

  4. arXiv:2411.01018  [pdf

    eess.AS cs.SD eess.SP

    An incremental algorithm based on multichannel non-negative matrix partial co-factorization for ambient denoising in auscultation

    Authors: Juan De La Torre Cruz, Francisco Jesus Canadas Quesada, Damian Martinez-Munoz, Nicolas Ruiz Reyes, Sebastian Garcia Galan, Julio Jose Carabias Orti

    Abstract: The aim of this study is to implement a method to remove ambient noise in biomedical sounds captured in auscultation. We propose an incremental approach based on multichannel non-negative matrix partial co-factorization (NMPCF) for ambient denoising focusing on high noisy environment with a Signal-to-Noise Ratio (SNR) <= -5 dB. The first contribution applies NMPCF assuming that ambient noise can b… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Journal ref: Applied Acoustics, volume 182, November 2021, 108229

  5. arXiv:2409.10996  [pdf, other

    cs.LG

    GINTRIP: Interpretable Temporal Graph Regression using Information bottleneck and Prototype-based method

    Authors: Ali Royat, Seyed Mohamad Moghadas, Lesley De Cruz, Adrian Munteanu

    Abstract: Deep neural networks (DNNs) have demonstrated remarkable performance across various domains, yet their application to temporal graph regression tasks faces significant challenges regarding interpretability. This critical issue, rooted in the inherent complexity of both DNNs and underlying spatio-temporal patterns in the graph, calls for innovative solutions. While interpretability concerns in Grap… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  6. arXiv:2408.00153  [pdf, other

    cs.SE

    Understanding Feedback Mechanisms in Machine Learning Jupyter Notebooks

    Authors: Arumoy Shome, Luis Cruz, Diomidis Spinellis, Arie van Deursen

    Abstract: The machine learning development lifecycle is characterized by iterative and exploratory processes that rely on feedback mechanisms to ensure data and model integrity. Despite the critical role of feedback in machine learning engineering, no prior research has been conducted to identify and understand these mechanisms. To address this knowledge gap, we mine 297.8 thousand Jupyter notebooks and ana… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

  7. arXiv:2406.18142  [pdf, other

    cs.SE cs.AI

    Innovating for Tomorrow: The Convergence of SE and Green AI

    Authors: Luís Cruz, Xavier Franch Gutierrez, Silverio Martínez-Fernández

    Abstract: The latest advancements in machine learning, specifically in foundation models, are revolutionizing the frontiers of existing software engineering (SE) processes. This is a bi-directional phenomona, where 1) software systems are now challenged to provide AI-enabled features to their users, and 2) AI is used to automate tasks within the software development lifecycle. In an era where sustainability… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted in SE 2030 - International Workshop on Software Engineering in 2030

  8. arXiv:2405.17451  [pdf, other

    cs.LG cs.AI cs.CY cs.SE

    Green AI in Action: Strategic Model Selection for Ensembles in Production

    Authors: Nienke Nijkamp, June Sallou, Niels van der Heijden, Luís Cruz

    Abstract: Integrating Artificial Intelligence (AI) into software systems has significantly enhanced their capabilities while escalating energy demands. Ensemble learning, combining predictions from multiple models to form a single prediction, intensifies this problem due to cumulative energy consumption. This paper presents a novel approach to model selection that addresses the challenge of balancing the ac… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 9 pages. Accepted at the 1st ACM International Conference on AI-powered Software (AIware), 2024

  9. arXiv:2404.18314  [pdf, other

    cs.LG nlin.CD physics.ao-ph

    DIRESA, a distance-preserving nonlinear dimension reduction technique based on regularized autoencoders

    Authors: Geert De Paepe, Lesley De Cruz

    Abstract: In meteorology, finding similar weather patterns or analogs in historical datasets can be useful for data assimilation, forecasting, and postprocessing. In climate science, analogs in historical and climate projection data are used for attribution and impact studies. However, most of the time, those large weather and climate datasets are nearline. This means that they must be downloaded, which tak… ▽ More

    Submitted 27 April, 2025; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 28 pages, 19 figures, 6 tables (including Appendices); accepted for publication in Artificial Intelligence for the Earth Systems

  10. arXiv:2401.14093  [pdf, other

    cs.SE cs.LG

    McUDI: Model-Centric Unsupervised Degradation Indicator for Failure Prediction AIOps Solutions

    Authors: Lorena Poenaru-Olaru, Luis Cruz, Jan Rellermeyer, Arie van Deursen

    Abstract: Due to the continuous change in operational data, AIOps solutions suffer from performance degradation over time. Although periodic retraining is the state-of-the-art technique to preserve the failure prediction AIOps models' performance over time, this technique requires a considerable amount of labeled data to retrain. In AIOps obtaining label data is expensive since it requires the availability… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  11. arXiv:2401.07697  [pdf, other

    cs.LG cs.CY cs.SE

    Data vs. Model Machine Learning Fairness Testing: An Empirical Study

    Authors: Arumoy Shome, Luis Cruz, Arie van Deursen

    Abstract: Although several fairness definitions and bias mitigation techniques exist in the literature, all existing solutions evaluate fairness of Machine Learning (ML) systems after the training stage. In this paper, we take the first steps towards evaluating a more holistic approach by testing for fairness both before and after model training. We evaluate the effectiveness of the proposed approach and po… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  12. arXiv:2401.07696  [pdf, other

    cs.SE

    Towards Automatic Translation of Machine Learning Visual Insights to Analytical Assertions

    Authors: Arumoy Shome, Luis Cruz, Arie van Deursen

    Abstract: We present our vision for developing an automated tool capable of translating visual properties observed in Machine Learning (ML) visualisations into Python assertions. The tool aims to streamline the process of manually verifying these visualisations in the ML development cycle, which is critical as real-world data and assumptions often change post-deployment. In a prior study, we mined $54,070$… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  13. Energy Patterns for Web: An Exploratory Study

    Authors: Pooja Rani, Jonas Zellweger, Veronika Kousadianos, Luis Cruz, Timo Kehrer, Alberto Bacchelli

    Abstract: As the energy footprint generated by software is increasing at an alarming rate, understanding how to develop energy-efficient applications has become a necessity. Previous work has introduced catalogs of coding practices, also known as energy patterns. These patterns are yet limited to Mobile or third-party libraries. In this study, we focus on the Web domain--a main source of energy consumption.… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  14. arXiv:2312.13897  [pdf, other

    cs.SE

    EnergiBridge: Empowering Software Sustainability through Cross-Platform Energy Measurement

    Authors: June Sallou, Luís Cruz, Thomas Durieux

    Abstract: In the continually evolving realm of software engineering, the need to address software energy consumption has gained increasing prominence. However, the absence of a platform-independent tool that facilitates straightforward energy measurements remains a notable gap. This paper presents EnergiBridge, a cross-platform measurement utility that provides support for Linux, Windows, and MacOS, as well… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  15. INRISCO: INcident monitoRing In Smart COmmunities

    Authors: Mónica Aguilar Igartua, Florina Almenares, Rebeca P. Díaz Redondo, Manuela I. Martín, Jordi Forné, Celeste Campo, Ana Fernández, Luis J. de la Cruz, Carlos García-Rubio, Andrés Marínn, Ahmad Mohamad Mezher, Daniel Díaz, Héctor Cerezo, David Rebollo-Monedero, Patricia Arias, Francisco Rico

    Abstract: Major advances in information and communication technologies (ICTs) make citizens to be considered as sensors in motion. Carrying their mobile devices, moving in their connected vehicles or actively participating in social networks, citizens provide a wealth of information that, after properly processing, can support numerous applications for the benefit of the community. In the context of smart c… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Journal ref: EEE Access, vol. 8, 2020

  16. Is Your Anomaly Detector Ready for Change? Adapting AIOps Solutions to the Real World

    Authors: Lorena Poenaru-Olaru, Natalia Karpova, Luis Cruz, Jan Rellermeyer, Arie van Deursen

    Abstract: Anomaly detection techniques are essential in automating the monitoring of IT systems and operations. These techniques imply that machine learning algorithms are trained on operational data corresponding to a specific period of time and that they are continuously evaluated on newly emerging data. Operational data is constantly changing over time, which affects the performance of deployed anomaly d… ▽ More

    Submitted 11 April, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  17. arXiv:2308.04436  [pdf, other

    cs.AI cs.CY

    The Two Faces of AI in Green Mobile Computing: A Literature Review

    Authors: Wander Siemers, June Sallou, Luís Cruz

    Abstract: Artificial intelligence is bringing ever new functionalities to the realm of mobile devices that are now considered essential (e.g., camera and voice assistants, recommender systems). Yet, operating artificial intelligence takes up a substantial amount of energy. However, artificial intelligence is also being used to enable more energy-efficient solutions for mobile systems. Hence, artificial inte… ▽ More

    Submitted 21 July, 2023; originally announced August 2023.

    Comments: 9 pages, 5 figures. Accepted at Euromicro Conference Series on Software Engineering and Advanced Applications (SEAA) 2023

  18. arXiv:2307.11434  [pdf, other

    cs.LG cs.AI cs.CV cs.SE

    Batching for Green AI -- An Exploratory Study on Inference

    Authors: Tim Yarally, Luís Cruz, Daniel Feitosa, June Sallou, Arie van Deursen

    Abstract: The batch size is an essential parameter to tune during the development of new neural networks. Amongst other quality indicators, it has a large degree of influence on the model's accuracy, generalisability, training times and parallelisability. This fact is generally known and commonly studied. However, during the application phase of a deep learning model, when the model is utilised by an end-us… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 8 pages, 4 figures, 1 table. Accepted at Euromicro Conference Series on Software Engineering and Advanced Applications (SEAA) 2023

  19. arXiv:2307.10296  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Automated Semantic Segmentation in Mammography Images

    Authors: Cesar A. Sierra-Franco, Jan Hurtado, Victor de A. Thomaz, Leonardo C. da Cruz, Santiago V. Silva, Alberto B. Raposo

    Abstract: Mammography images are widely used to detect non-palpable breast lesions or nodules, preventing cancer and providing the opportunity to plan interventions when necessary. The identification of some structures of interest is essential to make a diagnosis and evaluate image adequacy. Thus, computer-aided detection systems can be helpful in assisting medical interpretation by automatically segmenting… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 6 pages

  20. arXiv:2307.05520  [pdf, other

    cs.LG cs.CY cs.SE

    How to use model architecture and training environment to estimate the energy consumption of DL training

    Authors: Santiago del Rey, Silverio Martínez-Fernández, Luís Cruz, Xavier Franch

    Abstract: To raise awareness of the huge impact Deep Learning (DL) has on the environment, several works have tried to estimate the energy consumption and carbon footprint of DL-based systems across their life cycle. However, the estimations for energy consumption in the training stage usually rely on assumptions that have not been thoroughly tested. This study aims to move past these assumptions by leverag… ▽ More

    Submitted 21 November, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 32 pages, 11 figures, under review in ACM Transactions on Software Engineering and Methodology (TOSEM). This work is an extension of arXiv:2307.05520v3 [cs.LG]

  21. arXiv:2305.16849  [pdf, other

    cs.SE cs.LG

    Green Runner: A tool for efficient model selection from model repositories

    Authors: Jai Kannan, Scott Barnett, Anj Simmons, Taylan Selvi, Luis Cruz

    Abstract: Deep learning models have become essential in software engineering, enabling intelligent features like image captioning and document generation. However, their popularity raises concerns about environmental impact and inefficient model selection. This paper introduces GreenRunnerGPT, a novel tool for efficiently selecting deep learning models based on specific use cases. It employs a large languag… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  22. arXiv:2305.04988  [pdf, ps, other

    cs.SE

    Towards Understanding Machine Learning Testing in Practise

    Authors: Arumoy Shome, Luis Cruz, Arie van Deursen

    Abstract: Visualisations drive all aspects of the Machine Learning (ML) Development Cycle but remain a vastly untapped resource by the research community. ML testing is a highly interactive and cognitive process which demands a human-in-the-loop approach. Besides writing tests for the code base, bulk of the evaluation requires application of domain expertise to generate and interpret visualisations. To gain… ▽ More

    Submitted 22 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  23. arXiv:2303.13972  [pdf, other

    cs.LG cs.SE

    Uncovering Energy-Efficient Practices in Deep Learning Training: Preliminary Steps Towards Green AI

    Authors: Tim Yarally, Luís Cruz, Daniel Feitosa, June Sallou, Arie van Deursen

    Abstract: Modern AI practices all strive towards the same goal: better results. In the context of deep learning, the term "results" often refers to the achieved accuracy on a competitive problem set. In this paper, we adopt an idea from the emerging field of Green AI to consider energy consumption as a metric of equal importance to accuracy and to reduce any irrelevant tasks or energy usage. We examine the… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 12 pages, 9 figures, 5 tables. Accepted to CAIN23, 2nd International Conference on AI Engineering - Software Engineering for AI

  24. arXiv:2301.11047  [pdf, other

    cs.AI

    A Systematic Review of Green AI

    Authors: Roberto Verdecchia, June Sallou, Luís Cruz

    Abstract: With the ever-growing adoption of AI-based systems, the carbon footprint of AI is no longer negligible. AI researchers and practitioners are therefore urged to hold themselves accountable for the carbon emissions of the AI models they design and use. This led in recent years to the appearance of researches tackling AI environmental sustainability, a field referred to as Green AI. Despite the rapid… ▽ More

    Submitted 5 May, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Journal WIREs Data Mining and Knowledge Discovery. 16 pages, 12 figures

  25. arXiv:2212.02985  [pdf, other

    cs.LG cs.AI cs.CY

    Multi-Layer Personalized Federated Learning for Mitigating Biases in Student Predictive Analytics

    Authors: Yun-Wei Chu, Seyyedali Hosseinalipour, Elizabeth Tenorio, Laura Cruz, Kerrie Douglas, Andrew Lan, Christopher Brinton

    Abstract: Conventional methods for student modeling, which involve predicting grades based on measured activities, struggle to provide accurate results for minority/underrepresented student groups due to data availability biases. In this paper, we propose a Multi-Layer Personalized Federated Learning (MLPFL) methodology that optimizes inference accuracy over different layers of student grouping criteria, su… ▽ More

    Submitted 28 May, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: IEEE Transactions on Emerging Topics in Computing, 2024

  26. arXiv:2211.13098  [pdf, other

    cs.LG cs.AI

    Are Concept Drift Detectors Reliable Alarming Systems? -- A Comparative Study

    Authors: Lorena Poenaru-Olaru, Luis Cruz, Arie van Deursen, Jan S. Rellermeyer

    Abstract: As machine learning models increasingly replace traditional business logic in the production system, their lifecycle management is becoming a significant concern. Once deployed into production, the machine learning models are constantly evaluated on new streaming data. Given the continuous data flow, shifting data, also known as concept drift, is ubiquitous in such settings. Concept drift usually… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  27. arXiv:2210.05635  [pdf, other

    cs.CV

    Oflib: Facilitating Operations with and on Optical Flow Fields in Python

    Authors: Claudio Ravasio, Lyndon Da Cruz, Christos Bergeles

    Abstract: We present a robust theoretical framework for the characterisation and manipulation of optical flow, i.e 2D vector fields, in the context of their use in motion estimation algorithms and beyond. The definition of two frames of reference guides the mathematical derivation of flow field application, inversion, evaluation, and composition operations. This structured approach is then used as the found… ▽ More

    Submitted 14 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: "What is Motion for?" - ECCV 2022 Workshop Submission

  28. arXiv:2208.01182  [pdf

    cs.LG cs.AI cs.CY

    Mitigating Biases in Student Performance Prediction via Attention-Based Personalized Federated Learning

    Authors: Yun-Wei Chu, Seyyedali Hosseinalipour, Elizabeth Tenorio, Laura Cruz, Kerrie Douglas, Andrew Lan, Christopher Brinton

    Abstract: Traditional learning-based approaches to student modeling generalize poorly to underrepresented student groups due to biases in data availability. In this paper, we propose a methodology for predicting student performance from their online learning activities that optimizes inference accuracy over different demographic groups such as race and gender. Building upon recent foundations in federated l… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 10 pages, CIKM 2022

  29. arXiv:2205.03790  [pdf, other

    cs.SE cs.LG

    MLSmellHound: A Context-Aware Code Analysis Tool

    Authors: Jai Kannan, Scott Barnett, Luís Cruz, Anj Simmons, Akash Agarwal

    Abstract: Meeting the rise of industry demand to incorporate machine learning (ML) components into software systems requires interdisciplinary teams contributing to a shared code base. To maintain consistency, reduce defects and ensure maintainability, developers use code analysis tools to aid them in identifying defects and maintaining standards. With the inclusion of machine learning, tools must account f… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: ICSE 2022 NIER track

  30. Data-Centric Green AI: An Exploratory Empirical Study

    Authors: Roberto Verdecchia, Luís Cruz, June Sallou, Michelle Lin, James Wickenden, Estelle Hotellier

    Abstract: With the growing availability of large-scale datasets, and the popularization of affordable storage and computational capabilities, the energy consumed by AI is becoming a growing concern. To address this issue, in recent years, studies have focused on demonstrating how AI energy efficiency can be improved by tuning the model training strategy. Nevertheless, how modifications applied to datasets c… ▽ More

    Submitted 7 April, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: 11 pages, 3 figures, 2 tables. Accepted at the 8th ICT for Sustainability Conference (ICT4S) 2022

  31. arXiv:2203.13746  [pdf, other

    cs.SE cs.AI

    Code Smells for Machine Learning Applications

    Authors: Haiyin Zhang, Luís Cruz, Arie van Deursen

    Abstract: The popularity of machine learning has wildly expanded in recent years. Machine learning techniques have been heatedly studied in academia and applied in the industry to create business value. However, there is a lack of guidelines for code quality in machine learning applications. In particular, code smells have rarely been studied in this domain. Although machine learning code is usually integra… ▽ More

    Submitted 30 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at CAIN

    MSC Class: 68-04

  32. arXiv:2203.13409  [pdf, other

    cs.CV

    Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation

    Authors: Theodoros Pissas, Claudio S. Ravasio, Lyndon Da Cruz, Christos Bergeles

    Abstract: This work considers supervised contrastive learning for semantic segmentation. We apply contrastive learning to enhance the discriminative power of the multi-scale features extracted by semantic segmentation networks. Our key methodological insight is to leverage samples from the feature spaces emanating from multiple stages of a model's encoder itself requiring neither data augmentation nor onlin… ▽ More

    Submitted 19 July, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: to appear at ECCV 2022

  33. Data Smells in Public Datasets

    Authors: Arumoy Shome, Luis Cruz, Arie van Deursen

    Abstract: The adoption of Artificial Intelligence (AI) in high-stakes domains such as healthcare, wildlife preservation, autonomous driving and criminal justice system calls for a data-centric approach to AI. Data scientists spend the majority of their time studying and wrangling the data, yet tools to aid them with data analysis are lacking. This study identifies the recurrent data quality issues in public… ▽ More

    Submitted 25 March, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  34. arXiv:2201.08246  [pdf, other

    cs.SE cs.AI

    "Project smells" -- Experiences in Analysing the Software Quality of ML Projects with mllint

    Authors: Bart van Oort, Luís Cruz, Babak Loni, Arie van Deursen

    Abstract: Machine Learning (ML) projects incur novel challenges in their development and productionisation over traditional software applications, though established principles and best practices in ensuring the project's software quality still apply. While using static analysis to catch code smells has been shown to improve software quality attributes, it is only a small piece of the software quality puzzl… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted at ICSE SEIP 2022

    MSC Class: 68-06

  35. arXiv:2111.00901  [pdf, other

    cs.LG

    Click-Based Student Performance Prediction: A Clustering Guided Meta-Learning Approach

    Authors: Yun-Wei Chu, Elizabeth Tenorio, Laura Cruz, Kerrie Douglas, Andrew S. Lan, Christopher G. Brinton

    Abstract: We study the problem of predicting student knowledge acquisition in online courses from clickstream behavior. Motivated by the proliferation of eLearning lecture delivery, we specifically focus on student in-video activity in lectures videos, which consist of content and in-video quizzes. Our methodology for predicting in-video quiz performance is based on three key ideas we develop. First, we mod… ▽ More

    Submitted 15 November, 2021; v1 submitted 28 October, 2021; originally announced November 2021.

    Comments: 10 pages, IEEE BigData 2021

  36. arXiv:2110.10965  [pdf, other

    eess.IV cs.CV

    2020 CATARACTS Semantic Segmentation Challenge

    Authors: Imanol Luengo, Maria Grammatikopoulou, Rahim Mohammadi, Chris Walsh, Chinedu Innocent Nwoye, Deepak Alapatt, Nicolas Padoy, Zhen-Liang Ni, Chen-Chen Fan, Gui-Bin Bian, Zeng-Guang Hou, Heonjin Ha, Jiacheng Wang, Haojie Wang, Dong Guo, Lu Wang, Guotai Wang, Mobarakol Islam, Bharat Giddwani, Ren Hongliang, Theodoros Pissas, Claudio Ravasio, Martin Huber, Jeremy Birch, Joan M. Nunez Do Rio , et al. (15 additional authors not shown)

    Abstract: Surgical scene segmentation is essential for anatomy and instrument localization which can be further used to assess tissue-instrument interactions during a surgical procedure. In 2017, the Challenge on Automatic Tool Annotation for cataRACT Surgery (CATARACTS) released 50 cataract surgery videos accompanied by instrument usage annotations. These annotations included frame-level instrument presenc… ▽ More

    Submitted 24 February, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

  37. arXiv:2108.06119  [pdf, other

    cs.CV

    Effective semantic segmentation in Cataract Surgery: What matters most?

    Authors: Theodoros Pissas, Claudio Ravasio, Lyndon Da Cruz, Christos Bergeles

    Abstract: Our work proposes neural network design choices that set the state-of-the-art on a challenging public benchmark on cataract surgery, CaDIS. Our methodology achieves strong performance across three semantic segmentation tasks with increasingly granular surgical tool class sets by effectively handling class imbalance, an inherent challenge in any surgical video. We consider and evaluate two conceptu… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: MICCAI 2021

  38. arXiv:2108.03028  [pdf, ps, other

    cs.SE

    Green Software Lab: Towards an Engineering Discipline for Green Software

    Authors: Rui Abreu, Marco Couto, Luís Cruz, Jácome Cunha, João Paulo Fernandes, Rui Pereira, Alexandre Perez, João Saraiva

    Abstract: This report describes the research goals and results of the Green Software Lab (GSL) research project. This was a project funded by Fundação para a Ciência e a Tecnologia (FCT) -- the Portuguese research foundation -- under reference POCI-01-0145-FEDER-016718, that ran from January 2016 till July 2020. This report includes the complete document reporting the results achieved during the project e… ▽ More

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: 19 pages. Project's Final Report

  39. arXiv:2106.03271  [pdf, other

    cs.SE cs.CR

    Fixing Vulnerabilities Potentially Hinders Maintainability

    Authors: Sofia Reis, Rui Abreu, Luis Cruz

    Abstract: Security is a requirement of utmost importance to produce high-quality software. However, there is still a considerable amount of vulnerabilities being discovered and fixed almost weekly. We hypothesize that developers affect the maintainability of their codebases when patching vulnerabilities. This paper evaluates the impact of patches to improve security on the maintainability of open-source sof… ▽ More

    Submitted 12 September, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted at the Empirical Software Engineering Journal

  40. arXiv:2103.10248  [pdf, other

    cs.LG cs.AI cs.SE

    Systematic Mapping Study on the Machine Learning Lifecycle

    Authors: Yuanhao Xie, Luís Cruz, Petra Heck, Jan S. Rellermeyer

    Abstract: The development of artificial intelligence (AI) has made various industries eager to explore the benefits of AI. There is an increasing amount of research surrounding AI, most of which is centred on the development of new AI algorithms and techniques. However, the advent of AI is bringing an increasing set of practical problems related to AI model lifecycle management that need to be investigated.… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted at WAIN21: 1st Workshop on AI Engineering - Software Engineering for AI

    MSC Class: 68T01 (Primary) ACM Class: D.2.9; I.2.5

  41. arXiv:2103.04146  [pdf, other

    cs.SE cs.AI cs.LG

    The Prevalence of Code Smells in Machine Learning projects

    Authors: Bart van Oort, Luís Cruz, Maurício Aniche, Arie van Deursen

    Abstract: Artificial Intelligence (AI) and Machine Learning (ML) are pervasive in the current computer science landscape. Yet, there still exists a lack of software engineering experience and best practices in this field. One such best practice, static code analysis, can be used to find code smells, i.e., (potential) defects in the source code, refactoring opportunities, and violations of common coding stan… ▽ More

    Submitted 6 March, 2021; originally announced March 2021.

    Comments: Submitted and accepted to 2021 IEEE/ACM 1st Workshop on AI Engineering - Software Engineering for AI (WAIN)

  42. Building a SDN Enterprise WLAN Based on Virtual APs

    Authors: Luis Sequeira, Juan Luis de la Cruz, Jose Ruiz-Mas, Jose Saldana, Julian Fernandez-Navajas, Jose Almodovar

    Abstract: In this letter the development and testing of an open enterprise Wi-Fi solution based on virtual APs, managed by a central WLAN controller is presented. It allows seamless handovers between APs in different channels, maintaining the QoS of real-time services. The potential scalability issues associated to the beacon generation and channel assignment have been addressed. A battery of tests has been… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Journal ref: IEEE Communications Letters. Vol. 21, no. 2, pp. 374-377, Feb. 2017

  43. arXiv:2010.02716  [pdf, other

    cs.SE

    AI Lifecycle Models Need To Be Revised. An Exploratory Study in Fintech

    Authors: Mark Haakman, Luís Cruz, Hennie Huijgens, Arie van Deursen

    Abstract: Tech-leading organizations are embracing the forthcoming artificial intelligence revolution. Intelligent systems are replacing and cooperating with traditional software components. Thus, the same development processes and standards in software engineering ought to be complied in artificial intelligence systems. This study aims to understand the processes by which artificial intelligence-based syst… ▽ More

    Submitted 2 June, 2021; v1 submitted 3 October, 2020; originally announced October 2020.

    Comments: Accepted in Empirical Software Engineering in April, 2021

    MSC Class: 68T01 ACM Class: I.2.0; D.2.9

  44. arXiv:2009.08198  [pdf

    cs.LG stat.ML

    Multi-objective dynamic programming with limited precision

    Authors: L. Mandow, J. L. Pérez de la Cruz, N. Pozas

    Abstract: This paper addresses the problem of approximating the set of all solutions for Multi-objective Markov Decision Processes. We show that in the vast majority of interesting cases, the number of solutions is exponential or even infinite. In order to overcome this difficulty we propose to approximate the set of all solutions by means of a limited precision approach based on White's multi-objective val… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    ACM Class: F.2.2; I.2.8

  45. arXiv:2007.16071  [pdf

    cs.NI

    Can a Wi-Fi WLAN Support a First Person Shooter?

    Authors: Jose Saldana, Juan Luis de la Cruz, Luis Sequeira, Julian Fernandez-Navajas, Jose Ruiz-Mas

    Abstract: In corporate and commercial environments, the deployment of a set of coordinated Wi-Fi APs is becoming a common solution to provide Internet coverage to moving users. In these scenarios, real-time services as online games can also be present. This paper presents a set of experiments developed in a test scenario where an end device moves between different APs while generating game traffic. A WLAN s… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

    Journal ref: NetGames 2015, Zagreb, Croatia, December 3-4, 2015. ISBN 978-1-5090-0067-8

  46. arXiv:2007.07032  [pdf

    cs.MM

    QUALINET White Paper on Definitions of Immersive Media Experience (IMEx)

    Authors: Andrew Perkis, Christian Timmerer, Sabina Baraković, Jasmina Baraković Husić, Søren Bech, Sebastian Bosse, Jean Botev, Kjell Brunnström, Luis Cruz, Katrien De Moor, Andrea de Polo Saibanti, Wouter Durnez, Sebastian Egger-Lampl, Ulrich Engelke, Tiago H. Falk, Jesús Gutiérrez, Asim Hameed, Andrew Hines, Tanja Kojic, Dragan Kukolj, Eirini Liotou, Dragorad Milovanovic, Sebastian Möller, Niall Murray, Babak Naderi , et al. (19 additional authors not shown)

    Abstract: With the coming of age of virtual/augmented reality and interactive media, numerous definitions, frameworks, and models of immersion have emerged across different fields ranging from computer graphics to literary works. Immersion is oftentimes used interchangeably with presence as both concepts are closely related. However, there are noticeable interdisciplinary differences regarding definitions,… ▽ More

    Submitted 24 November, 2020; v1 submitted 10 June, 2020; originally announced July 2020.

  47. arXiv:1910.08768  [pdf, other

    cs.SE

    On the Energy Footprint of Mobile Testing Frameworks

    Authors: Luís Cruz, Rui Abreu

    Abstract: High energy consumption is a challenging issue that an ever increasing number of mobile applications face today. However, energy consumption is being tested in an ad hoc way, despite being an important non-functional requirement of an application. Such limitation becomes particularly disconcerting during software testing: on the one hand, developers do not really know how to measure energy; on the… ▽ More

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: Accepted for publication at IEEE Transactions on Software Engineering (TSE)

  48. arXiv:1908.08332  [pdf, other

    cs.SE cs.PF

    Do Energy-oriented Changes Hinder Maintainability?

    Authors: Luis Cruz, Rui Abreu, John Grundy, Li Li, Xin Xia

    Abstract: Energy efficiency is a crucial quality requirement for mobile applications. However, improving energy efficiency is far from trivial as developers lack the knowledge and tools to aid in this activity. In this paper we study the impact of changes to improve energy efficiency on the maintainability of Android applications. Using a dataset containing 539 energy efficiency-oriented commits, we measure… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: International Conference on Software Maintenance and Evolution - ICSME 2019

  49. An Analysis of 35+ Million Jobs of Travis CI

    Authors: Thomas Durieux, Rui Abreu, Martin Monperrus, Tegawendé F. Bissyandé, Luís Cruz

    Abstract: Travis CI handles automatically thousands of builds every day to, amongst other things, provide valuable feedback to thousands of open-source developers. In this paper, we investigate Travis CI to firstly understand who is using it, and when they start to use it. Secondly, we investigate how the developers use Travis CI and finally, how frequently the developers change the Travis CI configurations… ▽ More

    Submitted 28 September, 2019; v1 submitted 20 April, 2019; originally announced April 2019.

    Journal ref: Proceedings of the International Conference on Software Maintenance and Evolution (ICSME), 2019

  50. arXiv:1904.02206  [pdf, other

    cs.LG stat.ML

    Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning

    Authors: Gabriel V. de la Cruz Jr., Yunshu Du, Matthew E. Taylor

    Abstract: Deep Reinforcement Learning (DRL) algorithms are known to be data inefficient. One reason is that a DRL agent learns both the feature and the policy tabula rasa. Integrating prior knowledge into DRL algorithms is one way to improve learning efficiency since it helps to build helpful representations. In this work, we consider incorporating human knowledge to accelerate the asynchronous advantage ac… ▽ More

    Submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted in Adaptive and Learning Agents (ALA) Workshop at AAMAS