Search | arXiv e-print repository

An analysis of data variation and bias in image-based dermatological datasets for machine learning classification

Authors: Francisco Filho, Emanoel Santos, Rodrigo Mota, Kelvin Cunha, Fabio Papais, Amanda Arruda, Mateus Baltazar, Camila Vieira, José Gabriel Tavares, Rafael Barros, Othon Souza, Thales Bezerra, Natalia Lopes, Érico Moutinho, Jéssica Guido, Shirley Cruz, Paulo Borba, Tsang Ing Ren

Abstract: AI algorithms have become valuable in aiding professionals in healthcare. The increasing confidence obtained by these models is helpful in critical decision demands. In clinical dermatology, classification models can detect malignant lesions on patients' skin using only RGB images as input. However, most learning-based methods employ data acquired from dermoscopic datasets on training, which are l… ▽ More AI algorithms have become valuable in aiding professionals in healthcare. The increasing confidence obtained by these models is helpful in critical decision demands. In clinical dermatology, classification models can detect malignant lesions on patients' skin using only RGB images as input. However, most learning-based methods employ data acquired from dermoscopic datasets on training, which are large and validated by a gold standard. Clinical models aim to deal with classification on users' smartphone cameras that do not contain the corresponding resolution provided by dermoscopy. Also, clinical applications bring new challenges. It can contain captures from uncontrolled environments, skin tone variations, viewpoint changes, noises in data and labels, and unbalanced classes. A possible alternative would be to use transfer learning to deal with the clinical images. However, as the number of samples is low, it can cause degradations on the model's performance; the source distribution used in training differs from the test set. This work aims to evaluate the gap between dermoscopic and clinical samples and understand how the dataset variations impact training. It assesses the main differences between distributions that disturb the model's prediction. Finally, from experiments on different architectures, we argue how to combine the data from divergent distributions, decreasing the impact on the model's final accuracy. △ Less

Submitted 11 February, 2025; v1 submitted 15 January, 2025; originally announced January 2025.

Comments: 10 pages, 1 figure

ACM Class: I.5.4; J.3

arXiv:2402.05048 [pdf, other]

How VADER is your AI? Towards a definition of artificial intelligence systems appropriate for regulation

Authors: Leonardo C. T. Bezerra, Alexander E. I. Brownlee, Luana Ferraz Alvarenga, Renan Cipriano Moioli, Thais Vasconcelos Batista

Abstract: Artificial intelligence (AI) has driven many information and communication technology (ICT) breakthroughs. Nonetheless, the scope of ICT systems has expanded far beyond AI since the Turing test proposal. Critically, recent AI regulation proposals adopt AI definitions affecting ICT techniques, approaches, and systems that are not AI. In some cases, even works from mathematics, statistics, and engin… ▽ More Artificial intelligence (AI) has driven many information and communication technology (ICT) breakthroughs. Nonetheless, the scope of ICT systems has expanded far beyond AI since the Turing test proposal. Critically, recent AI regulation proposals adopt AI definitions affecting ICT techniques, approaches, and systems that are not AI. In some cases, even works from mathematics, statistics, and engineering would be affected. Worryingly, AI misdefinitions are observed from Western societies to the Global South. In this paper, we propose a framework to score how validated as appropriately-defined for regulation (VADER) an AI definition is. Our online, publicly-available VADER framework scores the coverage of premises that should underlie AI definitions for regulation, which aim to (i) reproduce principles observed in other successful technology regulations, and (ii) include all AI techniques and approaches while excluding non-AI works. Regarding the latter, our score is based on a dataset of representative AI, non-AI ICT, and non-ICT examples. We demonstrate our contribution by reviewing the AI regulation proposals of key players, namely the United States, United Kingdom, European Union, and Brazil. Importantly, none of the proposals assessed achieve the appropriateness score, ranging from a revision need to a concrete risk to ICT systems and works from other fields. △ Less

Submitted 24 January, 2025; v1 submitted 7 February, 2024; originally announced February 2024.

ACM Class: I.2.0

arXiv:2103.00535 [pdf, other]

A multi-objective time series analysis of community mobility reduction comparing first and second COVID-19 waves

Authors: Gabriela Cavalcante da Silva, Fernanda Monteiro de Almeida, Sabrina Oliveira, Leonardo C. T. Bezerra, Elizabeth F. Wanner, Ricardo H. C. Takahashi

Abstract: With the logistic challenges faced by most countries for the production, distribution, and application of vaccines for the novel coronavirus disease~(COVID-19), social distancing~(SD) remains the most tangible approach to mitigate the spread of the virus. To assist SD monitoring, several tech companies have made publicly available anonymized mobility data. In this work, we conduct a multi-objectiv… ▽ More With the logistic challenges faced by most countries for the production, distribution, and application of vaccines for the novel coronavirus disease~(COVID-19), social distancing~(SD) remains the most tangible approach to mitigate the spread of the virus. To assist SD monitoring, several tech companies have made publicly available anonymized mobility data. In this work, we conduct a multi-objective mobility reduction rate comparison between the first and second COVID-19 waves in several localities from America and Europe using Google community mobility reports~(CMR) data. Through multi-dimensional visualization, we are able to compare in a Pareto-compliant way the reduction in mobility from the different lockdown periods for each locality selected, simultaneously considering all place categories provided in CMR. In addition, our analysis comprises a 56-day lockdown period for each locality and COVID-19 wave, which we analyze both as 56-day periods and as 14-day consecutive windows. Results vary considerably as a function of the locality considered, particularly when the temporal evolution of the mobility reduction is considered. We thus discuss each locality individually, relating social distancing measures and the reduction observed. △ Less

Submitted 28 February, 2021; originally announced March 2021.

arXiv:2009.10648 [pdf, other]

Google COVID-19 community mobility reports: insights from multi-criteria decision making

Authors: Gabriela Cavalcante da Silvaa, Sabrina Oliveirab, Elizabeth F. Wanner, Leonardo C. T. Bezerra

Abstract: Social distancing (SD) has been critical in the fight against the novel coronavirus disease (COVID-19). To aid SD monitoring, many technology companies have made available mobility data, the most prominent example being the community mobility reports (CMR) provided by Google. Given the wide range of research fields that have been drawing insights from CMR data, there has been a rising concern for… ▽ More Social distancing (SD) has been critical in the fight against the novel coronavirus disease (COVID-19). To aid SD monitoring, many technology companies have made available mobility data, the most prominent example being the community mobility reports (CMR) provided by Google. Given the wide range of research fields that have been drawing insights from CMR data, there has been a rising concern for methodological discussion on how to use them. Indeed, Google recently released their own guidelines, concerning the nature of the place categories and the need for calibrating regional values. In this work, we discuss how measures developed in the field of multi-criteria decision making (MCDM) might benefit researchers analyzing this data. Concretely, we discuss how Pareto dominance and performance measures adopted in MCDM enable the mobility evaluation for (i) multiple categories for a given time period and (ii) multiple categories over multiple time periods. We empirically demonstrate these approaches conducting both a region- and country-level analysis, comparing some of the most relevant outbreak examples from different continents. △ Less

Submitted 17 September, 2020; originally announced September 2020.

Showing 1–4 of 4 results for author: Bezerra, T