Showing 1–14 of 14 results for author: Jordao, A

  1. arXiv:2506.15954  [pdf, ps, other]

    cs.LG

    One Period to Rule Them All: Identifying Critical Learning Periods in Deep Networks

    Authors: Vinicius Yuiti Fukase, Heitor Gama, Barbara Bueno, Lucas Libanio, Anna Helena Reali Costa, Artur Jordao

    Abstract: Critical Learning Periods comprehend an important phenomenon involving deep learning, where early epochs play a decisive role in the success of many training recipes, such as data augmentation. Existing works confirm the existence of this phenomenon and provide useful insights. However, the literature lacks efforts to precisely identify when critical periods occur. In this work, we fill this gap b…

    Submitted 18 June, 2025; originally announced June 2025.

  2. arXiv:2506.04513  [pdf, ps, other]

    cs.CV

    Pruning Everything, Everywhere, All at Once

    Authors: Gustavo Henrique do Nascimento, Ian Pons, Anna Helena Reali Costa, Artur Jordao

    Abstract: Deep learning stands as the modern paradigm for solving cognitive tasks. However, as the problem complexity increases, models grow deeper and computationally prohibitive, hindering advancements in real-world and resource-constrained applications. Extensive studies reveal that pruning structures in these models efficiently reduces model complexity and improves computational efficiency. Successful s…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: To be published in International Joint Conference on Neural Networks (IJCNN), 2025

  3. arXiv:2505.12100  [pdf, ps, other]

    cs.CL cs.AI

    Improving Fairness in LLMs Through Testing-Time Adversaries

    Authors: Isabela Pereira Gregio, Ian Pons, Anna Helena Reali Costa, Artur Jordão

    Abstract: Large Language Models (LLMs) push the boundaries in natural language processing and generative AI, driving progress across various aspects of modern society. Unfortunately, the pervasive issue of bias in LLMs responses (i.e., predictions) poses a significant and open challenge, hindering their application in tasks involving ethical sensitivity and responsible decision-making. In this work, we pro…

    Submitted 17 May, 2025; originally announced May 2025.

  4. arXiv:2504.21174  [pdf, ps, other]

    cs.LG

    Efficient LLMs with AMP: Attention Heads and MLP Pruning

    Authors: Leandro Giusti Mugnaini, Bruno Lopes Yamamoto, Lucas Lauton de Alcantara, Victor Zacarias, Edson Bollis, Lucas Pellicer, Anna Helena Reali Costa, Artur Jordao

    Abstract: Deep learning drives a new wave in computing systems and triggers the automation of increasingly complex problems. In particular, Large Language Models (LLMs) have significantly advanced cognitive tasks, often matching or even surpassing human-level performance. However, their extensive parameters result in high computational costs and slow inference, posing challenges for deployment in resource-l…

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: To be published in International Joint Conference on Neural Networks (IJCNN), 2025

  5. arXiv:2411.14345  [pdf, other]

    cs.LG cs.CV

    Layer Pruning with Consensus: A Triple-Win Solution

    Authors: Leandro Giusti Mugnaini, Carolina Tavares Duarte, Anna H. Reali Costa, Artur Jordao

    Abstract: Layer pruning offers a promising alternative to standard structured pruning, effectively reducing computational costs, latency, and memory footprint. While notable layer-pruning approaches aim to detect unimportant layers for removal, they often rely on single criteria that may not fully capture the complex, underlying properties of layers. We propose a novel approach that combines multiple simila…

    Submitted 21 November, 2024; originally announced November 2024.

  6. arXiv:2405.17081  [pdf, other]

    cs.LG

    Effective Layer Pruning Through Similarity Metric Perspective

    Authors: Ian Pons, Bruno Yamamoto, Anna H. Reali Costa, Artur Jordao

    Abstract: Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the field. Extensive research demonstrated that pruning structures from these models is a straightforward approach to reducing network complexity. In this direction,…

    Submitted 4 November, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Published at International Conference on Pattern Recognition (ICPR), 2024. Oral presentation

  7. arXiv:2301.10835  [pdf, other]

    cs.LG

    When Layers Play the Lottery, all Tickets Win at Initialization

    Authors: Artur Jordao, George Correa de Araujo, Helena de Almeida Maia, Helio Pedrini

    Abstract: Pruning is a standard technique for reducing the computational cost of deep networks. Many advances in pruning leverage concepts from the Lottery Ticket Hypothesis (LTH). LTH reveals that inside a trained dense network exists sparse subnetworks (tickets) able to achieve similar accuracy (i.e., win the lottery - winning tickets). Pruning at initialization focuses on finding winning tickets without…

    Submitted 19 March, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Published at International Conference on Computer Vision Workshop (ICCV), 2023

  8. arXiv:2108.04890  [pdf, other]

    cs.CV

    On the Effect of Pruning on Adversarial Robustness

    Authors: Artur Jordao, Helio Pedrini

    Abstract: Pruning is a well-known mechanism for reducing the computational cost of deep convolutional networks. However, studies have shown the potential of pruning as a form of regularization, which reduces overfitting and improves generalization. We demonstrate that this family of strategies provides additional benefits beyond computational performance and generalization. Our analyses reveal that pruning…

    Submitted 24 November, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: Published at International Conference on Computer Vision Workshop (ICCVW), 2021

  9. arXiv:2004.11178  [pdf, other]

    cs.CV

    Stage-Wise Neural Architecture Search

    Authors: Artur Jordao, Fernando Akio, Maiko Lie, William Robson Schwartz

    Abstract: Modern convolutional networks such as ResNet and NASNet have achieved state-of-the-art results in many computer vision applications. These architectures consist of stages, which are sets of layers that operate on representations in the same resolution. It has been demonstrated that increasing the number of layers in each stage improves the prediction ability of the network. However, the resulting…

    Submitted 19 October, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: Accepted for publication at International Conference on Pattern Recognition (ICPR) 2020

  10. arXiv:1910.02319  [pdf, ps, other]

    cs.CV cs.LG

    Covariance-free Partial Least Squares: An Incremental Dimensionality Reduction Method

    Authors: Artur Jordao, Maiko Lie, Victor Hugo Cunha de Melo, William Robson Schwartz

    Abstract: Dimensionality reduction plays an important role in computer vision problems since it reduces computational cost and is often capable of yielding more discriminative data representation. In this context, Partial Least Squares (PLS) has presented notable results in tasks such as image classification and neural network optimization. However, PLS is infeasible on large datasets, such as ImageNet, bec…

    Submitted 10 November, 2020; v1 submitted 5 October, 2019; originally announced October 2019.

    Comments: Accepted for publication at Winter Conference on Applications of Computer Vision (WACV) 2021

  11. arXiv:1810.07610  [pdf, other]

    cs.CV

    Pruning Deep Neural Networks using Partial Least Squares

    Authors: Artur Jordao, Ricardo Kloss, Fernando Yamada, William Robson Schwartz

    Abstract: Modern pattern recognition methods are based on convolutional networks since they are able to learn complex patterns that benefit the classification. However, convolutional networks are computationally expensive and require a considerable amount of memory, which limits their deployment on low-power and resource-constrained systems. To handle these problems, recent approaches have proposed pruning…

    Submitted 19 September, 2019; v1 submitted 17 October, 2018; originally announced October 2018.

    Journal ref: British Machine Vision Conference Workshop, 2019

  12. arXiv:1806.05226  [pdf, other]

    cs.CV

    Human Activity Recognition Based on Wearable Sensor Data: A Standardization of the State-of-the-Art

    Authors: Artur Jordao, Antonio C. Nazare Jr., Jessica Sena, William Robson Schwartz

    Abstract: Human activity recognition based on wearable sensor data has been an attractive research topic due to its application in areas such as healthcare and smart environments. In this context, many works have presented remarkable results using accelerometer, gyroscope and magnetometer data to represent the activities categories. However, current studies do not consider important issues that lead to skew…

    Submitted 1 February, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

  13. arXiv:1806.03361  [pdf, other]

    cs.CV cs.AI

    A Content-Based Late Fusion Approach Applied to Pedestrian Detection

    Authors: Jessica Sena, Artur Jordao, William Robson Schwartz

    Abstract: The variety of pedestrians detectors proposed in recent years has encouraged some works to fuse pedestrian detectors to achieve a more accurate detection. The intuition behind is to combine the detectors based on its spatial consensus. We propose a novel method called Content-Based Spatial Consensus (CSBC), which, in addition to relying on spatial consensus, considers the content of the detection…

    Submitted 8 June, 2018; originally announced June 2018.

  14. arXiv:1711.02652  [pdf, other]

    cs.CV

    Latent hypernet: Exploring all Layers from Convolutional Neural Networks

    Authors: Artur Jordao, Ricardo Kloss, William Robson Schwartz

    Abstract: Since Convolutional Neural Networks (ConvNets) are able to simultaneously learn features and classifiers to discriminate different categories of activities, recent works have employed ConvNets approaches to perform human activity recognition (HAR) based on wearable sensors, allowing the removal of expensive human work and expert knowledge. However, these approaches have their power of discriminati…

    Submitted 16 November, 2018; v1 submitted 7 November, 2017; originally announced November 2017.