Showing 1–14 of 14 results for author: Ferreira, I

Searching in archive cs.
  1. arXiv:2505.08005  [pdf, ps, other]

    cs.SE

    Assessing the Bug-Proneness of Refactored Code: A Longitudinal Multi-Project Study

    Authors: Isabella Ferreira, Lawrence Arkoh, Anderson Uchôa, Ana Carla Bibiano, Alessandro Garcia, Wesley K. G. Assunção

    Abstract: Refactoring is a common practice in software development, aimed at improving the internal code structure in order to make it easier to understand and modify. Consequently, it is often assumed that refactoring makes the code less prone to bugs. However, in practice, refactoring is a complex task and applied in different ways (e.g., various refactoring types, single vs. composite refactorings) and w…

    Submitted 12 May, 2025; originally announced May 2025.

  2. FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion

    Authors: Alef Iury Siqueira Ferreira, Lucas Rafael Gris, Augusto Seben da Rosa, Frederico Santos de Oliveira, Edresson Casanova, Rafael Teixeira Sousa, Arnaldo Candido Junior, Anderson da Silva Soares, Arlindo Galvão Filho

    Abstract: This work presents FreeSVC, a promising multilingual singing voice conversion approach that leverages an enhanced VITS model with Speaker-invariant Clustering (SPIN) for better content representation and the State-of-the-Art (SOTA) speaker encoder ECAPA2. FreeSVC incorporates trainable language embeddings to handle multiple languages and employs an advanced speaker encoder to disentangle speaker c…

    Submitted 9 January, 2025; originally announced January 2025.

  3. arXiv:2409.19474  [pdf, other]

    cs.CV cs.AI

    FairPIVARA: Reducing and Assessing Biases in CLIP-Based Multimodal Models

    Authors: Diego A. B. Moreira, Alef Iury Ferreira, Jhessica Silva, Gabriel Oliveira dos Santos, Luiz Pereira, João Medrado Gondim, Gustavo Bonil, Helena Maia, Nádia da Silva, Simone Tiemi Hashiguti, Jefersson A. dos Santos, Helio Pedrini, Sandra Avila

    Abstract: Despite significant advancements and pervasive use of vision-language models, a paucity of studies has addressed their ethical implications. These models typically require extensive training data, often from hastily reviewed text and image datasets, leading to highly imbalanced datasets and ethical concerns. Additionally, models initially trained in English are frequently fine-tuned for other lang…

    Submitted 4 October, 2024; v1 submitted 28 September, 2024; originally announced September 2024.

    Comments: 14 pages, 10 figures. Accepted to 35th British Machine Vision Conference (BMVC 2024), Workshop on Privacy, Fairness, Accountability and Transparency in Computer Vision

  4. arXiv:2409.11600  [pdf, other]

    cs.PL cs.AI cs.LG

    No Saved Kaleidosope: an 100% Jitted Neural Network Coding Language with Pythonic Syntax

    Authors: Augusto Seben da Rosa, Marlon Daniel Angeli, Jorge Aikes Junior, Alef Iury Ferreira, Lucas Rafael Gris, Anderson da Silva Soares, Arnaldo Candido Junior, Frederico Santos de Oliveira, Gabriel Trevisan Damke, Rafael Teixeira Sousa

    Abstract: We developed a jitted compiler for training Artificial Neural Networks using C++, LLVM and CUDA. It features object-oriented characteristics, strong typing, parallel workers for data pre-processing, pythonic syntax for expressions, PyTorch-like model declaration and Automatic Differentiation. We implement the mechanisms of cache and pooling in order to manage VRAM, cuBLAS for high performance matr…

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 12 pages, 3 figures and 3 tables

    MSC Class: 68T07 ACM Class: D.3; I.2; I.4; I.7

  5. arXiv:2401.17474  [pdf, other]

    cs.DC math.NA

    Parallelization Strategies for the Randomized Kaczmarz Algorithm on Large-Scale Dense Systems

    Authors: Inês Ferreira, Juan A. Acebrón, José Monteiro

    Abstract: The Kaczmarz algorithm is an iterative technique designed to solve consistent linear systems of equations. It falls within the category of row-action methods, focusing on handling one equation per iteration. This characteristic makes it especially useful in solving very large systems. The recent introduction of a randomized version, the Randomized Kaczmarz method, renewed interest in the algorithm…

    Submitted 30 January, 2024; originally announced January 2024.

    MSC Class: 15A06; 15A52; 65F10; 65F20; 68W20; 65Y05; 68W10; 68W15

  6. arXiv:2310.13683  [pdf, other]

    cs.LG

    CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages

    Authors: Gabriel Oliveira dos Santos, Diego A. B. Moreira, Alef Iury Ferreira, Jhessica Silva, Luiz Pereira, Pedro Bueno, Thiago Sousa, Helena Maia, Nádia Da Silva, Esther Colombini, Helio Pedrini, Sandra Avila

    Abstract: This work introduces CAPIVARA, a cost-efficient framework designed to enhance the performance of multilingual CLIP models in low-resource languages. While CLIP has excelled in zero-shot vision-language tasks, the resource-intensive nature of model training remains challenging. Many datasets lack linguistic diversity, featuring solely English descriptions for images. CAPIVARA addresses this by augm…

    Submitted 23 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

  7. arXiv:2307.03682  [pdf]

    cs.CY stat.AP

    Anonymising Clinical Data for Secondary Use

    Authors: Irene Ferreira, Chris Harbron, Alex Hughes, Tamsin Sargood, Christoph Gerlinger

    Abstract: Secondary use of data already collected in clinical studies has become more and more popular in recent years, with the commitment of the pharmaceutical industry and many academic institutions in Europe and the US to provide access to their clinical trial data. Whilst this clearly provides societal benefit in helping to progress medical research, this has to be balanced against protection of subjec…

    Submitted 17 May, 2023; originally announced July 2023.

    Comments: 25 pages

  8. arXiv:2208.10602  [pdf, other]

    cs.CR cs.NI

    ABL: An original active blacklist based on a modification of the SMTP

    Authors: Pablo M. Oliveira, Mateus B. Vieira, Isaac C. Ferreira, João P. R. R. Leite, Edvard M. Oliveira, Bruno T. Kuehne, Edmilson M. Moreira, Otávio A. S. Carpinteiro

    Abstract: This paper presents a novel Active Blacklist (ABL) based on a modification of the Simple Mail Transfer Protocol (SMTP). ABL was implemented in the Mail Transfer Agent (MTA) Postfix of the e-mail server Zimbra and assessed exhaustively in a series of experiments. The modified server Zimbra showed computational performance and costs similar to those of the original server Zimbra when receiving legit…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 18 pages, 6 figures, 5 tables

  9. arXiv:2207.14418  [pdf, other]

    cs.CL cs.SD eess.AS

    Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge

    Authors: Alef Iury Siqueira Ferreira, Gustavo dos Reis Oliveira

    Abstract: This paper presents our efforts to build a robust ASR model for the shared task Automatic Speech Recognition for spontaneous and prepared speech & Speech Emotion Recognition in Portuguese (SE&R 2022). The goal of the challenge is to advance the ASR research for the Portuguese language, considering prepared and spontaneous speech in different dialects. Our method consists of fine-tuning an ASR model…

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: Proceedings of the First Workshop on Automatic Speech Recognition for Spontaneous and Prepared Speech & Speech Emotion Recognition in Portuguese (SE&R 2022), co-located with PROPOR 2022

  10. Incivility Detection in Open Source Code Review and Issue Discussions

    Authors: Isabella Ferreira, Ahlaam Rafiq, Jinghui Cheng

    Abstract: Given the democratic nature of open source development, code review and issue discussions may be uncivil. Incivility, defined as features of discussion that convey an unnecessarily disrespectful tone, can have negative consequences to open source communities. To prevent or minimize these negative consequences, open source platforms have included mechanisms for removing uncivil language from the di…

    Submitted 18 December, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 18 pages

  11. arXiv:2204.05114  [pdf, other]

    cs.CV cs.LG eess.IV

    PetroGAN: A novel GAN-based approach to generate realistic, label-free petrographic datasets

    Authors: I. Ferreira, L. Ochoa, A. Koeshidayatullah

    Abstract: Deep learning architectures have enriched data analytics in the geosciences, complementing traditional approaches to geological problems. Although deep learning applications in geosciences show encouraging signs, the actual potential remains untapped. This is primarily because geological datasets, particularly petrography, are limited, time-consuming, and expensive to obtain, requiring in-depth kn…

    Submitted 6 April, 2022; originally announced April 2022.

  12. How heated is it? Understanding GitHub locked issues

    Authors: Isabella Ferreira, Bram Adams, Jinghui Cheng

    Abstract: Although issues of open source software are created to discuss and solve technical problems, conversations can become heated, with discussants getting angry and/or agitated for a variety of reasons, such as poor suggestions or violation of community conventions. To prevent and mitigate discussions from getting heated, tools like GitHub have introduced the ability to lock issue discussions that vio…

    Submitted 31 March, 2022; originally announced April 2022.

    Journal ref: In 19th International Conference on Mining Software Repositories (MSR'22), May 23-24, 2022, Pittsburgh, PA, USA

  13. arXiv:2111.13414  [pdf, other]

    cs.NI eess.SY

    Optimizing Packet Reception Rates for Low Duty-Cycle BLE Relay Nodes

    Authors: Nuno Paulino, Luís Pessoa, André Branquinho, Rafael Tavares, Igor Ferreira

    Abstract: In order to achieve the full potential of the Internet-of-Things, connectivity between devices should be ubiquitous and efficient. Wireless mesh networks are a critical component to achieve this ubiquitous connectivity for a wide range of services, and are composed of terminal devices (i.e., nodes), such as sensors of various types, and wall powered gateway devices, which provide further internet…

    Submitted 29 November, 2021; v1 submitted 26 November, 2021; originally announced November 2021.

  14. arXiv:2108.09905  [pdf, other]

    cs.SE cs.HC

    The "Shut the f**k up" Phenomenon: Characterizing Incivility in Open Source Code Review Discussions

    Authors: Isabella Ferreira, Jinghui Cheng, Bram Adams

    Abstract: Code review is an important quality assurance activity for software development. Code review discussions among developers and maintainers can be heated and sometimes involve personal attacks and unnecessary disrespectful comments, demonstrating, therefore, incivility. Although incivility in public discussions has received increasing attention from researchers in different domains, the knowledge ab…

    Submitted 22 August, 2021; originally announced August 2021.