Skip to main content

Showing 1–50 of 77 results for author: Dao, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11180  [pdf, ps, other

    cs.SE cs.AI cs.ET eess.SY

    Beyond Formal Semantics for Capabilities and Skills: Model Context Protocol in Manufacturing

    Authors: Luis Miguel Vieira da Silva, Aljosha Köcher, Felix Gehlhoff

    Abstract: Explicit modeling of capabilities and skills -- whether based on ontologies, Asset Administration Shells, or other technologies -- requires considerable manual effort and often results in representations that are not easily accessible to Large Language Models (LLMs). In this work-in-progress paper, we present an alternative approach based on the recently introduced Model Context Protocol (MCP). MC… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2506.08795  [pdf, other

    cs.RO cs.AI

    Towards Biosignals-Free Autonomous Prosthetic Hand Control via Imitation Learning

    Authors: Kaijie Shi, Wanglong Lu, Hanli Zhao, Vinicius Prado da Fonseca, Ting Zou, Xianta Jiang

    Abstract: Limb loss affects millions globally, impairing physical function and reducing quality of life. Most traditional surface electromyographic (sEMG) and semi-autonomous methods require users to generate myoelectric signals for each control, imposing physically and mentally taxing demands. This study aims to develop a fully autonomous control system that enables a prosthetic hand to automatically grasp… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2505.20456  [pdf, ps, other

    eess.SP cs.LG

    Federated Learning-Distillation Alternation for Resource-Constrained IoT

    Authors: Rafael Valente da Silva, Onel L. Alcaraz López, Richard Demo Souza

    Abstract: Federated learning (FL) faces significant challenges in Internet of Things (IoT) networks due to device limitations in energy and communication resources, especially when considering the large size of FL models. From an energy perspective, the challenge is aggravated if devices rely on energy harvesting (EH), as energy availability can vary significantly over time, influencing the average number o… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  4. arXiv:2505.03295  [pdf, other

    cs.AI cs.RO cs.SE

    Capability-Driven Skill Generation with LLMs: A RAG-Based Approach for Reusing Existing Libraries and Interfaces

    Authors: Luis Miguel Vieira da Silva, Aljosha Köcher, Nicolas König, Felix Gehlhoff, Alexander Fay

    Abstract: Modern automation systems increasingly rely on modular architectures, with capabilities and skills as one solution approach. Capabilities define the functions of resources in a machine-readable form and skills provide the concrete implementations that realize those capabilities. However, the development of a skill implementation conforming to a corresponding capability remains a time-consuming and… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  5. arXiv:2504.07951  [pdf, other

    cs.CV

    Scaling Laws for Native Multimodal Models

    Authors: Mustafa Shukor, Enrico Fini, Victor Guilherme Turrisi da Costa, Matthieu Cord, Joshua Susskind, Alaaeldin El-Nouby

    Abstract: Building general-purpose models that can effectively perceive the world through multimodal signals has been a long-standing goal. Current approaches involve integrating separately pre-trained components, such as connecting vision encoders to LLMs and continuing multimodal training. While such approaches exhibit remarkable sample efficiency, it remains an open question whether such late-fusion arch… ▽ More

    Submitted 11 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

    Comments: 31 pages, 26 figures, 13 tables

  6. arXiv:2504.00709  [pdf

    astro-ph.IM astro-ph.EP cs.AI cs.LG

    Science Autonomy using Machine Learning for Astrobiology

    Authors: Victoria Da Poian, Bethany Theiling, Eric Lyness, David Burtt, Abigail R. Azari, Joey Pasterski, Luoth Chou, Melissa Trainer, Ryan Danell, Desmond Kaplan, Xiang Li, Lily Clough, Brett McKinney, Lukas Mandrake, Bill Diamond, Caroline Freissinet

    Abstract: In recent decades, artificial intelligence (AI) including machine learning (ML) have become vital for space missions enabling rapid data processing, advanced pattern recognition, and enhanced insight extraction. These tools are especially valuable in astrobiology applications, where models must distinguish biotic patterns from complex abiotic backgrounds. Advancing the integration of autonomy thro… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: 8 pages (expanded citations compared to 5 page submitted version for DARES white papers), a white paper for the 2025 NASA Decadal Astrobiology Research and Exploration Strategy (DARES)

  7. arXiv:2501.04267  [pdf, other

    cs.NI

    A 5G-Edge Architecture for Computational Offloading of Computer Vision Applications

    Authors: Marcelo V. B. da Silva, Maria Barbosa, Anderson Queiroz, Kelvin L. Dias

    Abstract: Processing computer vision applications (CVA) on mobile devices is challenging due to limited battery life and computing power. While cloud-based remote processing of CVA offers abundant computational resources, it introduces latency issues that can hinder real-time applications. To overcome this problem, computational offloading to edge servers has been adopted by industry and academic research.… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

    Comments: Accept on conference the 39th International Conference on Information Networking (ICOIN 2025): 6 pages, 8 figures, 1 table

  8. arXiv:2412.19226  [pdf, other

    cs.NI cs.AI cs.CV cs.LG

    VINEVI: A Virtualized Network Vision Architecture for Smart Monitoring of Heterogeneous Applications and Infrastructures

    Authors: Rodrigo Moreira, Hugo G. V. O. da Cunha, Larissa F. Rodrigues Moreira, Flávio de Oliveira Silva

    Abstract: Monitoring heterogeneous infrastructures and applications is essential to cope with user requirements properly, but it still lacks enhancements. The well-known state-of-the-art methods and tools do not support seamless monitoring of bare-metal, low-cost infrastructures, neither hosted nor virtualized services with fine-grained details. This work proposes VIrtualized NEtwork VIsion architecture (VI… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

    Comments: 12 pages

    Journal ref: International Conference on Advanced Information Networking and Applications (AINA-2022)

  9. arXiv:2411.14402  [pdf, other

    cs.CV cs.LG

    Multimodal Autoregressive Pre-training of Large Vision Encoders

    Authors: Enrico Fini, Mustafa Shukor, Xiujun Li, Philipp Dufter, Michal Klein, David Haldimann, Sai Aitharaju, Victor Guilherme Turrisi da Costa, Louis Béthune, Zhe Gan, Alexander T Toshev, Marcin Eichner, Moin Nabi, Yinfei Yang, Joshua M. Susskind, Alaaeldin El-Nouby

    Abstract: We introduce a novel method for pre-training of large-scale vision encoders. Building on recent advancements in autoregressive pre-training of vision models, we extend this framework to a multimodal setting, i.e., images and text. In this paper, we present AIMV2, a family of generalist vision encoders characterized by a straightforward pre-training process, scalability, and remarkable performance… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: https://github.com/apple/ml-aim

  10. arXiv:2411.03484  [pdf, other

    cond-mat.mtrl-sci cs.IR

    Automated, LLM enabled extraction of synthesis details for reticular materials from scientific literature

    Authors: Viviane Torres da Silva, Alexandre Rademaker, Krystelle Lionti, Ronaldo Giro, Geisa Lima, Sandro Fiorini, Marcelo Archanjo, Breno W. Carvalho, Rodrigo Neumann, Anaximandro Souza, João Pedro Souza, Gabriela de Valnisio, Carmen Nilda Paz, Renato Cerqueira, Mathias Steiner

    Abstract: Automated knowledge extraction from scientific literature can potentially accelerate materials discovery. We have investigated an approach for extracting synthesis protocols for reticular materials from scientific literature using large language models (LLMs). To that end, we introduce a Knowledge Extraction Pipeline (KEP) that automatizes LLM-assisted paragraph classification and information extr… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 16 pages

  11. arXiv:2410.16331  [pdf, other

    quant-ph cs.ET cs.LG

    Exploring Quantum Neural Networks for Demand Forecasting

    Authors: Gleydson Fernandes de Jesus, Maria Heloísa Fraga da Silva, Otto Menegasso Pires, Lucas Cruz da Silva, Clebson dos Santos Cruz, Valéria Loureiro da Silva

    Abstract: Forecasting demand for assets and services can be addressed in various markets, providing a competitive advantage when the predictive models used demonstrate high accuracy. However, the training of machine learning models incurs high computational costs, which may limit the training of prediction models based on available computational capacity. In this context, this paper presents an approach for… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 22 pages, 13 figures, 10 tables

  12. arXiv:2410.08905  [pdf, other

    cs.CL

    Lifelong Event Detection via Optimal Transport

    Authors: Viet Dao, Van-Cuong Pham, Quyen Tran, Thanh-Thien Le, Linh Ngo Van, Thien Huu Nguyen

    Abstract: Continual Event Detection (CED) poses a formidable challenge due to the catastrophic forgetting phenomenon, where learning new tasks (with new coming event types) hampers performance on previous ones. In this paper, we introduce a novel approach, Lifelong Event Detection via Optimal Transport (LEDOT), that leverages optimal transport principles to align the optimization of our classification modul… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP 2024

  13. arXiv:2409.01506  [pdf, other

    cs.CV

    Less is more: concatenating videos for Sign Language Translation from a small set of signs

    Authors: David Vinicius da Silva, Valter Estevam, David Menotti

    Abstract: The limited amount of labeled data for training the Brazilian Sign Language (Libras) to Portuguese Translation models is a challenging problem due to video collection and annotation costs. This paper proposes generating sign language content by concatenating short clips containing isolated signals for training Sign Language Translation models. We employ the V-LIBRASIL dataset, composed of 4,089 si… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: SIBGRAPI 2024

  14. arXiv:2409.00045  [pdf, other

    cs.CV

    PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in Colonoscopy

    Authors: Debesh Jha, Nikhil Kumar Tomar, Vanshali Sharma, Quoc-Huy Trinh, Koushik Biswas, Hongyi Pan, Ritika K. Jha, Gorkem Durak, Alexander Hann, Jonas Varkey, Hang Viet Dao, Long Van Dao, Binh Phuc Nguyen, Nikolaos Papachrysos, Brandon Rieders, Peter Thelin Schmidt, Enrik Geissler, Tyler Berzin, Pål Halvorsen, Michael A. Riegler, Thomas de Lange, Ulas Bagci

    Abstract: Colonoscopy is the primary method for examination, detection, and removal of polyps. However, challenges such as variations among the endoscopists' skills, bowel quality preparation, and the complex nature of the large intestine contribute to high polyp miss-rate. These missed polyps can develop into cancer later, underscoring the importance of improving the detection methods. To address this gap… ▽ More

    Submitted 3 January, 2025; v1 submitted 19 August, 2024; originally announced September 2024.

    Comments: 3 Figures, 6 tables

  15. Intelligent Urban Traffic Management via Semantic Interoperability across Multiple Heterogeneous Mobility Data Sources

    Authors: Mario Scrocca, Marco Grassi, Marco Comerio, Valentina Anita Carriero, Tiago Delgado Dias, Ana Vieira Da Silva, Irene Celino

    Abstract: The integrated exploitation of data sources in the mobility domain is key to providing added-value services to passengers, transport companies and authorities. Indeed, multiple stakeholders operate and maintain different kinds of data but several interoperability issues limit their effective usage. In this paper, we present an architecture enabled by Semantic Web technologies to overcome such issu… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: In Use paper accepted for publication at the 23rd International Semantic Web Conference (ISWC) 2024. This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution will be published in the conference proceedings

  16. arXiv:2407.02876  [pdf, other

    cs.RO eess.SY

    Prävention und Beseitigung von Fehlerursachen im Kontext von unbemannten Fahrzeugen

    Authors: Aron Schnakenbeck, Christoph Sieber, Luis Miguel Vieira da Silva, Felix Gehlhoff, Alexander Fay

    Abstract: Mobile robots, becoming increasingly autonomous, are capable of operating in diverse and unknown environments. This flexibility allows them to fulfill goals independently and adapting their actions dynamically without rigidly predefined control codes. However, their autonomous behavior complicates guaranteeing safety and reliability due to the limited influence of a human operator to accurately su… ▽ More

    Submitted 4 November, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: Language: German. Dieser Beitrag wird eingereicht in: "dtec.bw-Beiträge der Helmut-Schmidt-Universität/Universität der Bundeswehr Hamburg: Forschungsaktivitäten im Zentrum für Digitalisierungs- und Technologieforschung der Bundeswehr dtec.bw"

  17. arXiv:2406.13128  [pdf, other

    cs.CV cs.LG

    A New Approach for Evaluating and Improving the Performance of Segmentation Algorithms on Hard-to-Detect Blood Vessels

    Authors: João Pedro Parella, Matheus Viana da Silva, Cesar Henrique Comin

    Abstract: Many studies regarding the vasculature of biological tissues involve the segmentation of the blood vessels in a sample followed by the creation of a graph structure to model the vasculature. The graph is then used to extract relevant vascular properties. Small segmentation errors can lead to largely distinct connectivity patterns and a high degree of variability of the extracted properties. Nevert… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  18. Toward a Method to Generate Capability Ontologies from Natural Language Descriptions

    Authors: Luis Miguel Vieira da Silva, Aljosha Köcher, Felix Gehlhoff, Alexander Fay

    Abstract: To achieve a flexible and adaptable system, capability ontologies are increasingly leveraged to describe functions in a machine-interpretable way. However, modeling such complex ontological descriptions is still a manual and error-prone task that requires a significant amount of effort and ontology expertise. This contribution presents an innovative method to automate capability ontology modeling… ▽ More

    Submitted 18 October, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: \c{opyright} 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  19. On the Use of Large Language Models to Generate Capability Ontologies

    Authors: Luis Miguel Vieira da Silva, Aljosha Köcher, Felix Gehlhoff, Alexander Fay

    Abstract: Capability ontologies are increasingly used to model functionalities of systems or machines. The creation of such ontological models with all properties and constraints of capabilities is very complex and can only be done by ontology experts. However, Large Language Models (LLMs) have shown that they can generate machine-interpretable models from natural language text input and thus support engine… ▽ More

    Submitted 18 October, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: \c{opyright} 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  20. arXiv:2404.10170  [pdf, other

    cs.CV cs.AI

    High-Resolution Detection of Earth Structural Heterogeneities from Seismic Amplitudes using Convolutional Neural Networks with Attention layers

    Authors: Luiz Schirmer, Guilherme Schardong, Vinícius da Silva, Rogério Santos, Hélio Lopes

    Abstract: Earth structural heterogeneities have a remarkable role in the petroleum economy for both exploration and production projects. Automatic detection of detailed structural heterogeneities is challenging when considering modern machine learning techniques like deep neural networks. Typically, these techniques can be an excellent tool for assisted interpretation of such heterogeneities, but it heavily… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  21. arXiv:2403.10304  [pdf, ps, other

    cs.AI cs.DB

    KIF: A Wikidata-Based Framework for Integrating Heterogeneous Knowledge Sources

    Authors: Guilherme Lima, João M. B. Rodrigues, Marcelo Machado, Elton Soares, Sandro R. Fiorini, Raphael Thiago, Leonardo G. Azevedo, Viviane T. da Silva, Renato Cerqueira

    Abstract: We present a Wikidata-based framework, called KIF, for virtually integrating heterogeneous knowledge sources. KIF is written in Python and is released as open-source. It leverages Wikidata's data model and vocabulary plus user-defined mappings to construct a unified view of the underlying sources while keeping track of the context and provenance of their statements. The underlying sources can be t… ▽ More

    Submitted 24 July, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  22. arXiv:2403.01417  [pdf, other

    cs.LG cs.DC

    Asyn2F: An Asynchronous Federated Learning Framework with Bidirectional Model Aggregation

    Authors: Tien-Dung Cao, Nguyen T. Vuong, Thai Q. Le, Hoang V. N. Dao, Tram Truong-Huu

    Abstract: In federated learning, the models can be trained synchronously or asynchronously. Many research works have focused on developing an aggregation method for the server to aggregate multiple local models into the global model with improved performance. They ignore the heterogeneity of the training workers, which causes the delay in the training of the local models, leading to the obsolete information… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  23. arXiv:2402.18511  [pdf

    cs.RO

    Leveraging Compliant Tactile Perception for Haptic Blind Surface Reconstruction

    Authors: Laurent Yves Emile Ramos Cheret, Vinicius Prado da Fonseca, Thiago Eustaquio Alves de Oliveira

    Abstract: Non-flat surfaces pose difficulties for robots operating in unstructured environments. Reconstructions of uneven surfaces may only be partially possible due to non-compliant end-effectors and limitations on vision systems such as transparency, reflections, and occlusions. This study achieves blind surface reconstruction by harnessing the robotic manipulator's kinematic data and a compliant tactile… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 9 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  24. arXiv:2312.12598  [pdf, other

    cs.SE cs.AI

    A Case Study on Test Case Construction with Large Language Models: Unveiling Practical Insights and Challenges

    Authors: Roberto Francisco de Lima Junior, Luiz Fernando Paes de Barros Presta, Lucca Santos Borborema, Vanderson Nogueira da Silva, Marcio Leal de Melo Dahia, Anderson Carlos Sousa e Santos

    Abstract: This paper presents a detailed case study examining the application of Large Language Models (LLMs) in the construction of test cases within the context of software engineering. LLMs, characterized by their advanced natural language processing capabilities, are increasingly garnering attention as tools to automate and enhance various aspects of the software development life cycle. Leveraging a cas… ▽ More

    Submitted 21 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

  25. arXiv:2312.08801  [pdf, other

    cs.AI cs.LO

    Automated Process Planning Based on a Semantic Capability Model and SMT

    Authors: Aljosha Köcher, Luis Miguel Vieira da Silva, Alexander Fay

    Abstract: In research of manufacturing systems and autonomous robots, the term capability is used for a machine-interpretable specification of a system function. Approaches in this research area develop information models that capture all information relevant to interpret the requirements, effects and behavior of functions. These approaches are intended to overcome the heterogeneity resulting from the vario… ▽ More

    Submitted 14 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Presented at CAIPI Workshop at AAAI 2024

  26. arXiv:2312.03046  [pdf, other

    cs.CV

    Diversified in-domain synthesis with efficient fine-tuning for few-shot classification

    Authors: Victor G. Turrisi da Costa, Nicola Dall'Asen, Yiming Wang, Nicu Sebe, Elisa Ricci

    Abstract: Few-shot image classification aims to learn an image classifier using only a small set of labeled examples per class. A recent research direction for improving few-shot classifiers involves augmenting the labelled samples with synthetic images created by state-of-the-art text-to-image generation models. Following this trend, we propose Diversified In-domain Synthesis with Efficient Fine-tuning (DI… ▽ More

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 14 pages, 6 figures, 8 tables

  27. AdaSub: Stochastic Optimization Using Second-Order Information in Low-Dimensional Subspaces

    Authors: João Victor Galvão da Mata, Martin S. Andersen

    Abstract: We introduce AdaSub, a stochastic optimization algorithm that computes a search direction based on second-order information in a low-dimensional subspace that is defined adaptively based on available current and past information. Compared to first-order methods, second-order methods exhibit better convergence characteristics, but the need to compute the Hessian matrix at each iteration results in… ▽ More

    Submitted 6 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Published in: 2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA)

  28. arXiv:2310.14553  [pdf, other

    cs.RO cs.AI cs.MA

    Denoising Opponents Position in Partial Observation Environment

    Authors: Aref Sayareh, Aria Sardari, Vahid Khoddami, Nader Zare, Vinicius Prado da Fonseca, Amilcar Soares

    Abstract: The RoboCup competitions hold various leagues, and the Soccer Simulation 2D League is a major among them. Soccer Simulation 2D (SS2D) match involves two teams, including 11 players and a coach for each team, competing against each other. The players can only communicate with the Soccer Simulation Server during the game. Several code bases are released publicly to simplify team development. So rese… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  29. arXiv:2310.11344  [pdf, other

    cs.CL cs.AI

    The effect of stemming and lemmatization on Portuguese fake news text classification

    Authors: Lucca de Freitas Santos, Murilo Varges da Silva

    Abstract: With the popularization of the internet, smartphones and social media, information is being spread quickly and easily way, which implies bigger traffic of information in the world, but there is a problem that is harming society with the dissemination of fake news. With a bigger flow of information, some people are trying to disseminate deceptive information and fake news. The automatic detection o… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  30. arXiv:2309.06033  [pdf, other

    eess.SP cs.IT cs.LG

    Energy-Aware Federated Learning with Distributed User Sampling and Multichannel ALOHA

    Authors: Rafael Valente da Silva, Onel L. Alcaraz López, Richard Demo Souza

    Abstract: Distributed learning on edge devices has attracted increased attention with the advent of federated learning (FL). Notably, edge devices often have limited battery and heterogeneous energy availability, while multiple rounds are required in FL for convergence, intensifying the need for energy efficiency. Energy depletion may hinder the training process and the efficient utilization of the trained… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  31. Neural Implicit Morphing of Face Images

    Authors: Guilherme Schardong, Tiago Novello, Hallison Paz, Iurii Medvedev, Vinícius da Silva, Luiz Velho, Nuno Gonçalves

    Abstract: Face morphing is a problem in computer graphics with numerous artistic and forensic applications. It is challenging due to variations in pose, lighting, gender, and ethnicity. This task consists of a warping for feature alignment and a blending for a seamless transition between the warped images. We propose to leverage coord-based neural networks to represent such warpings and blendings of face im… ▽ More

    Submitted 13 June, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: 14 pages, 20 figures, accepted for CVPR 2024

    ACM Class: I.4.8; I.4.10

  32. arXiv:2307.10018  [pdf, other

    cs.RO cs.AI

    RobôCIn Small Size League Extended Team Description Paper for RoboCup 2023

    Authors: Aline Lima de Oliveira, Cauê Addae da Silva Gomes, Cecília Virginia Santos da Silva, Charles Matheus de Sousa Alves, Danilo Andrade Martins de Souza, Driele Pires Ferreira Araújo Xavier, Edgleyson Pereira da Silva, Felipe Bezerra Martins, Lucas Henrique Cavalcanti Santos, Lucas Dias Maciel, Matheus Paixão Gumercindo dos Santos, Matheus Lafayette Vasconcelos, Matheus Vinícius Teotonio do Nascimento Andrade, João Guilherme Oliveira Carvalho de Melo, João Pedro Souza Pereira de Moura, José Ronald da Silva, José Victor Silva Cruz, Pedro Henrique Santana de Morais, Pedro Paulo Salman de Oliveira, Riei Joaquim Matos Rodrigues, Roberto Costa Fernandes, Ryan Vinicius Santos Morais, Tamara Mayara Ramos Teobaldo, Washington Igor dos Santos Silva, Edna Natividade Silva Barros

    Abstract: RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Ou… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  33. Toward a Mapping of Capability and Skill Models using Asset Administration Shells and Ontologies

    Authors: Luis Miguel Vieira da Silva, Aljosha Köcher, Milapji Singh Gill, Marco Weiss, Alexander Fay

    Abstract: In order to react efficiently to changes in production, resources and their functions must be integrated into plants in accordance with the plug and produce principle. In this context, research on so-called capabilities and skills has shown promise. However, there are currently two incompatible approaches to modeling capabilities and skills. On the one hand, formal descriptions using ontologies ha… ▽ More

    Submitted 28 April, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: \c{opyright} 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  34. arXiv:2305.11994  [pdf, other

    cs.LG eess.IV

    ISP meets Deep Learning: A Survey on Deep Learning Methods for Image Signal Processing

    Authors: Matheus Henrique Marques da Silva, Jhessica Victoria Santos da Silva, Rodrigo Reis Arrais, Wladimir Barroso Guedes de Araújo Neto, Leonardo Tadeu Lopes, Guilherme Augusto Bileki, Iago Oliveira Lima, Lucas Borges Rondon, Bruno Melo de Souza, Mayara Costa Regazio, Rodolfo Coelho Dalapicola, Claudio Filipi Gonçalves dos Santos

    Abstract: The entire Image Signal Processor (ISP) of a camera relies on several processes to transform the data from the Color Filter Array (CFA) sensor, such as demosaicing, denoising, and enhancement. These processes can be executed either by some hardware or via software. In recent years, Deep Learning has emerged as one solution for some of them or even to replace the entire ISP using a single neural ne… ▽ More

    Submitted 23 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  35. arXiv:2305.11033  [pdf, other

    cs.CV cs.AI cs.LG

    Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature

    Authors: Ana Cláudia Akemi Matsuki de Faria, Felype de Castro Bastos, José Victor Nogueira Alves da Silva, Vitor Lopes Fabris, Valeska de Sousa Uchoa, Décio Gonçalves de Aguiar Neto, Claudio Filipi Goncalves dos Santos

    Abstract: Visual Question Answering (VQA) is an emerging area of interest for researches, being a recent problem in natural language processing and image prediction. In this area, an algorithm needs to answer questions about certain images. As of the writing of this survey, 25 recent studies were analyzed. Besides, 6 datasets were analyzed and provided their link to download. In this work, several recent pi… ▽ More

    Submitted 2 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 30 pages. arXiv admin note: text overlap with arXiv:2104.00926, arXiv:2110.02526, arXiv:2108.02059, arXiv:1908.01801 by other authors

  36. arXiv:2305.07511  [pdf, ps, other

    cs.LG cs.AI cs.CY eess.IV

    eXplainable Artificial Intelligence on Medical Images: A Survey

    Authors: Matteus Vargas Simão da Silva, Rodrigo Reis Arrais, Jhessica Victoria Santos da Silva, Felipe Souza Tânios, Mateus Antonio Chinelatto, Natalia Backhaus Pereira, Renata De Paris, Lucas Cesar Ferreira Domingos, Rodrigo Dória Villaça, Vitor Lopes Fabris, Nayara Rossi Brito da Silva, Ana Claudia Akemi Matsuki de Faria, Jose Victor Nogueira Alves da Silva, Fabiana Cristina Queiroz de Oliveira Marucci, Francisco Alves de Souza Neto, Danilo Xavier Silva, Vitor Yukio Kondo, Claudio Filipi Gonçalves dos Santos

    Abstract: Over the last few years, the number of works about deep learning applied to the medical field has increased enormously. The necessity of a rigorous assessment of these models is required to explain these results to all people involved in medical exams. A recent field in the machine learning area is explainable artificial intelligence, also known as XAI, which targets to explain the results of such… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  37. arXiv:2302.08905  [pdf, other

    cs.DL

    GraphLED: A graph-based approach to process and visualise linked engineering documents

    Authors: Vanessa Telles da Silva, Lucas de Angelo Martins Ribeiro, Willian Borges de Lemos, Sílvia Silva da Costa Botelho, Nelson Lopes Duarte Filho, Marcelo Rita Pias

    Abstract: The architecture, engineering and construction (AEC) sector extensively uses documents supporting product and process development. As part of this, organisations should handle big data of hundreds, or even thousands, of technical documents strongly linked together, including CAD design of industrial plants, equipment purchase orders, quality certificates, and part material analysis. However, analy… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  38. arXiv:2301.04517  [pdf, other

    cs.CV

    A new dataset for measuring the performance of blood vessel segmentation methods under distribution shifts

    Authors: Matheus Viana da Silva, Natália de Carvalho Santos, Julie Ouellette, Baptiste Lacoste, Cesar Henrique Comin

    Abstract: Creating a dataset for training supervised machine learning algorithms can be a demanding task. This is especially true for medical image segmentation since one or more specialists are usually required for image annotation, and creating ground truth labels for just a single image can take up to several hours. In addition, it is paramount that the annotated samples represent well the different cond… ▽ More

    Submitted 18 April, 2024; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: This work has been submitted to the IEEE for possible publication

  39. arXiv:2301.03322  [pdf, other

    cs.CV

    Simplifying Open-Set Video Domain Adaptation with Contrastive Learning

    Authors: Giacomo Zara, Victor Guilherme Turrisi da Costa, Subhankar Roy, Paolo Rota, Elisa Ricci

    Abstract: In an effort to reduce annotation costs in action recognition, unsupervised video domain adaptation methods have been proposed that aim to adapt a predictive model from a labelled dataset (i.e., source domain) to an unlabelled dataset (i.e., target domain). In this work we address a more realistic scenario, called open-set video domain adaptation (OUVDA), where the target dataset contains "unknown… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: Currently under review at Computer Vision and Image Understanding (CVIU) journal

  40. Extractive Text Summarization Using Generalized Additive Models with Interactions for Sentence Selection

    Authors: Vinícius Camargo da Silva, João Paulo Papa, Kelton Augusto Pontara da Costa

    Abstract: Automatic Text Summarization (ATS) is becoming relevant with the growth of textual data; however, with the popularization of public large-scale datasets, some recent machine learning approaches have focused on dense models and architectures that, despite producing notable results, usually turn out in models difficult to interpret. Given the challenge behind interpretable learning-based text summar… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  41. arXiv:2210.02390  [pdf, other

    cs.CV cs.AI cs.LG

    Bayesian Prompt Learning for Image-Language Model Generalization

    Authors: Mohammad Mahdi Derakhshani, Enrique Sanchez, Adrian Bulat, Victor Guilherme Turrisi da Costa, Cees G. M. Snoek, Georgios Tzimiropoulos, Brais Martinez

    Abstract: Foundational image-language models have generated considerable interest due to their efficient adaptation to downstream tasks by prompt learning. Prompt learning treats part of the language model input as trainable while freezing the rest, and optimizes an Empirical Risk Minimization objective. However, Empirical Risk Minimization is known to suffer from distributional shifts which hurt generaliza… ▽ More

    Submitted 20 August, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ICCV 2023

  42. A Capability and Skill Model for Heterogeneous Autonomous Robots

    Authors: Luis Miguel Vieira da Silva, Aljosha Köcher, Alexander Fay

    Abstract: Teams of heterogeneous autonomous robots become increasingly important due to their facilitation of various complex tasks. For such heterogeneous robots, there is currently no consistent way of describing the functions that each robot provides. In the field of manufacturing, capability modeling is considered a promising approach to semantically model functions provided by different machines. This… ▽ More

    Submitted 9 February, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

  43. arXiv:2208.01712  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    No Pattern, No Recognition: a Survey about Reproducibility and Distortion Issues of Text Clustering and Topic Modeling

    Authors: Marília Costa Rosendo Silva, Felipe Alves Siqueira, João Pedro Mantovani Tarrega, João Vitor Pataca Beinotti, Augusto Sousa Nunes, Miguel de Mattos Gardini, Vinícius Adolfo Pereira da Silva, Nádia Félix Felipe da Silva, André Carlos Ponce de Leon Ferreira de Carvalho

    Abstract: Extracting knowledge from unlabeled texts using machine learning algorithms can be complex. Document categorization and information retrieval are two applications that may benefit from unsupervised learning (e.g., text clustering and topic modeling), including exploratory data analysis. However, the unsupervised learning paradigm poses reproducibility issues. The initialization can lead to variabi… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    ACM Class: I.2; I.2.7; I.5.3

  44. arXiv:2207.12842  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation for Video Transformers in Action Recognition

    Authors: Victor G. Turrisi da Costa, Giacomo Zara, Paolo Rota, Thiago Oliveira-Santos, Nicu Sebe, Vittorio Murino, Elisa Ricci

    Abstract: Over the last few years, Unsupervised Domain Adaptation (UDA) techniques have acquired remarkable importance and popularity in computer vision. However, when compared to the extensive literature available for images, the field of videos is still relatively unexplored. On the other hand, the performance of a model in action recognition is heavily affected by domain shift. In this paper, we propose… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted at ICPR 2022

  45. arXiv:2207.10649  [pdf, other

    cs.CL cs.LG

    Multilingual Disinformation Detection for Digital Advertising

    Authors: Zofia Trstanova, Nadir El Manouzi, Maryline Chen, Andre L. V. da Cunha, Sergei Ivanov

    Abstract: In today's world, the presence of online disinformation and propaganda is more widespread than ever. Independent publishers are funded mostly via digital advertising, which is unfortunately also the case for those publishing disinformation content. The question of how to remove such publishers from advertising inventory has long been ignored, despite the negative impact on the open internet. In th… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Disinformation Countermeasures and Machine Learning Workshop at ICML 2022

  46. Modeling and Executing Production Processes with Capabilities and Skills using Ontologies and BPMN

    Authors: Aljosha Köcher, Luis Miguel Vieira da Silva, Alexander Fay

    Abstract: Current challenges of the manufacturing industry require modular and changeable manufacturing systems that can be adapted to variable conditions with little effort. At the same time, production recipes typically represent important company know-how that should not be directly tied to changing plant configurations. Thus, there is a need to model general production recipes independent of specific pl… ▽ More

    Submitted 4 November, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: \c{opyright} 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  47. Neural Implicit Surface Evolution

    Authors: Tiago Novello, Vinicius da Silva, Guilherme Schardong, Luiz Schirmer, Helio Lopes, Luiz Velho

    Abstract: This work investigates the use of smooth neural networks for modeling dynamic variations of implicit surfaces under the level set equation (LSE). For this, it extends the representation of neural implicit surfaces to the space-time $\mathbb{R}^3\times \mathbb{R}$, which opens up mechanisms for continuous geometric transformations. Examples include evolving an initial surface towards general vector… ▽ More

    Submitted 20 August, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

  48. Exploring Differential Geometry in Neural Implicits

    Authors: Tiago Novello, Guilherme Schardong, Luiz Schirmer, Vinicius da Silva, Helio Lopes, Luiz Velho

    Abstract: We introduce a neural implicit framework that exploits the differentiable properties of neural networks and the discrete geometry of point-sampled surfaces to approximate them as the level sets of neural implicit functions. To train a neural implicit function, we propose a loss functional that approximates a signed distance function, and allows terms with high-order derivatives, such as the alig… ▽ More

    Submitted 20 August, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

  49. arXiv:2201.09147  [pdf, other

    cs.GR

    Neural Implicit Mapping via Nested Neighborhoods

    Authors: Vinícius da Silva, Tiago Novello, Guilherme Schardong, Luiz Schirmer, Hélio Lopes, Luiz Velho

    Abstract: We introduce a novel approach for rendering static and dynamic 3D neural signed distance functions (SDF) in real-time. We rely on nested neighborhoods of zero-level sets of neural SDFs, and mappings between them. This framework supports animations and achieves real-time performance without the use of spatial data-structures. It consists of three uncoupled algorithms representing the rendering step… ▽ More

    Submitted 6 December, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: 9 pages, 8 figures

    ACM Class: I.3.7; I.3.5

  50. arXiv:2112.04215  [pdf, other

    cs.CV cs.LG

    Self-Supervised Models are Continual Learners

    Authors: Enrico Fini, Victor G. Turrisi da Costa, Xavier Alameda-Pineda, Elisa Ricci, Karteek Alahari, Julien Mairal

    Abstract: Self-supervised models have been shown to produce comparable or better visual representations than their supervised counterparts when trained offline on unlabeled data at scale. However, their efficacy is catastrophically reduced in a Continual Learning (CL) scenario where data is presented to the model sequentially. In this paper, we show that self-supervised loss functions can be seamlessly conv… ▽ More

    Submitted 1 April, 2022; v1 submitted 8 December, 2021; originally announced December 2021.