
Showing 1–30 of 30 results for author: Soares, J

Searching in archive cs.
  1. arXiv:2504.20872  [pdf, other]

    cs.CV

    FLIM-based Salient Object Detection Networks with Adaptive Decoders

    Authors: Gilson Junior Soares, Matheus Abrantes Cerqueira, Jancarlo F. Gomes, Laurent Najman, Silvio Jamil F. Guimarães, Alexandre Xavier Falcão

    Abstract: Salient Object Detection (SOD) methods can locate objects that stand out in an image, assign higher values to their pixels in a saliency map, and binarize the map outputting a predicted segmentation mask. A recent tendency is to investigate pre-trained lightweight models rather than deep neural networks in SOD tasks, coping with applications under limited computational resources. In this context,…

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: This work has been submitted to the Journal of the Brazilian Computer Society (JBCS)

  2. Multi-Sensor Fusion for Quadruped Robot State Estimation using Invariant Filtering and Smoothing

    Authors: Ylenia Nisticò, Hajun Kim, João Carlos Virgolino Soares, Geoff Fink, Hae-Won Park, Claudio Semini

    Abstract: This letter introduces two multi-sensor state estimation frameworks for quadruped robots, built on the Invariant Extended Kalman Filter (InEKF) and Invariant Smoother (IS). The proposed methods, named E-InEKF and E-IS, fuse kinematics, IMU, LiDAR, and GPS data to mitigate position drift, particularly along the z-axis, a common issue in proprioceptive-based approaches. We derived observation models…

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters

  3. arXiv:2503.12101  [pdf, other]

    cs.RO eess.SP

    MUSE: A Real-Time Multi-Sensor State Estimator for Quadruped Robots

    Authors: Ylenia Nisticò, João Carlos Virgolino Soares, Lorenzo Amatucci, Geoff Fink, Claudio Semini

    Abstract: This paper introduces an innovative state estimator, MUSE (MUlti-sensor State Estimator), designed to enhance state estimation's accuracy and real-time performance in quadruped robot navigation. The proposed state estimator builds upon our previous work presented in [1]. It integrates data from a range of onboard sensors, including IMUs, encoders, cameras, and LiDARs, to deliver a comprehensive an…

    Submitted 27 March, 2025; v1 submitted 15 March, 2025; originally announced March 2025.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters

  4. arXiv:2503.07743  [pdf, other]

    cs.CV cs.RO

    SANDRO: a Robust Solver with a Splitting Strategy for Point Cloud Registration

    Authors: Michael Adlerstein, João Carlos Virgolino Soares, Angelo Bratta, Claudio Semini

    Abstract: Point cloud registration is a critical problem in computer vision and robotics, especially in the field of navigation. Current methods often fail when faced with high outlier rates or take a long time to converge to a suitable solution. In this work, we introduce a novel algorithm for point cloud registration called SANDRO (Splitting strategy for point cloud Alignment using Non-convex anD Robust O…

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2025

  5. arXiv:2501.09837  [pdf, other]

    eess.SP cs.IT cs.NI

    Complex-Valued Neural Networks for Ultra-Reliable Massive MIMO

    Authors: Pedro Benevenuto Valadares, Jonathan Aguiar Soares, Kayol Mayer, Dalton Soares Arantes

    Abstract: In the evolving landscape of 5G and 6G networks, the demands extend beyond high data rates, ultra-low latency, and extensive coverage, increasingly emphasizing the need for reliability. This paper proposes an ultra-reliable multiple-input multiple-output (MIMO) scheme utilizing quasi-orthogonal space-time block coding (QOSTBC) combined with singular value decomposition (SVD) for channel state info…

    Submitted 16 January, 2025; originally announced January 2025.

  6. arXiv:2501.08347  [pdf, other]

    cs.CV cs.AI

    SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval

    Authors: Bhavin Jawade, Joao V. B. Soares, Kapil Thadani, Deen Dayal Mohan, Amir Erfan Eshratifar, Benjamin Culpepper, Paloma de Juan, Srirangaraj Setlur, Venu Govindaraju

    Abstract: Compositional image retrieval (CIR) is a multimodal learning task where a model combines a query image with a user-provided text modification to retrieve a target image. CIR finds applications in a variety of domains including product retrieval (e-commerce) and web search. Existing methods primarily focus on fully-supervised learning, wherein models are trained on datasets of labeled triplets such…

    Submitted 12 January, 2025; originally announced January 2025.

    Comments: Paper accepted at WACV 2025 in round 1

  7. arXiv:2410.05256  [pdf, other]

    cs.RO

    Proprioceptive State Estimation for Quadruped Robots using Invariant Kalman Filtering and Scale-Variant Robust Cost Functions

    Authors: Hilton Marques Souza Santana, João Carlos Virgolino Soares, Ylenia Nisticò, Marco Antonio Meggiolaro, Claudio Semini

    Abstract: Accurate state estimation is crucial for legged robot locomotion, as it provides the necessary information to allow control and navigation. However, it is also challenging, especially in scenarios with uneven and slippery terrain. This paper presents a new Invariant Extended Kalman filter for legged robot state estimation using only proprioceptive sensors. We formulate the methodology by combining…

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted to the IEEE-RAS International Conference on Humanoid Robots 2024

  8. arXiv:2409.18878  [pdf]

    cs.CL cs.AI cs.CY cs.IR

    Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models

    Authors: Zehan Li, Yan Hu, Scott Lane, Salih Selek, Lokesh Shahani, Rodrigo Machado-Vieira, Jair Soares, Hua Xu, Hongfang Liu, Ming Huang

    Abstract: Accurate identification and categorization of suicidal events can yield better suicide precautions, reducing operational burden, and improving care quality in high-acuity psychiatric settings. Pre-trained language models offer promise for identifying suicidality from unstructured clinical narratives. We evaluated the performance of four BERT-based models using two fine-tuning strategies (multiple…

    Submitted 3 October, 2024; v1 submitted 27 September, 2024; originally announced September 2024.

    Comments: submitted to AMIA Informatics Summit 2025 as a conference paper

  9. arXiv:2408.16472  [pdf, other]

    cs.CV

    Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry

    Authors: Michael Adlerstein, Angelo Bratta, João Carlos Virgolino Soares, Giovanni Dessy, Miguel Fernandes, Matteo Gatti, Claudio Semini

    Abstract: Grapevine winter pruning is a labor-intensive and repetitive process that significantly influences the quality and quantity of the grape harvest and produced wine of the following season. It requires a careful and expert detection of the point to be cut. Because of its complexity, repetitive nature and time constraint, the task requires skilled labor that needs to be trained. This extended abstrac…

    Submitted 29 August, 2024; originally announced August 2024.

  10. arXiv:2405.02177  [pdf, other]

    cs.RO

    Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation

    Authors: Gabriel Fischer Abati, João Carlos Virgolino Soares, Vivian Suzano Medeiros, Marco Antonio Meggiolaro, Claudio Semini

    Abstract: The majority of visual SLAM systems are not robust in dynamic scenarios. The ones that deal with dynamic objects in the scenes usually rely on deep-learning-based methods to detect and filter these objects. However, these methods cannot deal with unknown moving objects. This work presents Panoptic-SLAM, an open-source visual SLAM system robust to dynamic environments, even in the presence of unkno…

    Submitted 3 May, 2024; originally announced May 2024.

  11. arXiv:2404.10157  [pdf, other]

    cs.CV cs.LG

    Salient Object-Aware Background Generation using Text-Guided Diffusion Models

    Authors: Amir Erfan Eshratifar, Joao V. B. Soares, Kapil Thadani, Shaunak Mishra, Mikhail Kuznetsov, Yueh-Ning Ku, Paloma de Juan

    Abstract: Generating background scenes for salient objects plays a crucial role across various domains including creative design and e-commerce, as it enhances the presentation and context of subjects by integrating them into tailored environments. Background generation can be framed as a task of text-conditioned outpainting, where the goal is to extend image content beyond a salient object's boundaries on…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted for publication at CVPR 2024's Generative Models for Computer Vision workshop

  12. arXiv:2403.13124  [pdf, other]

    cs.RO

    Cooperative Modular Manipulation with Numerous Cable-Driven Robots for Assistive Construction and Gap Crossing

    Authors: Kevin Murphy, Joao C. V. Soares, Justin K. Yim, Dustin Nottage, Ahmet Soylemezoglu, Joao Ramos

    Abstract: Soldiers in the field often need to cross negative obstacles, such as rivers or canyons, to reach goals or safety. Military gap crossing involves on-site temporary bridges construction. However, this procedure is conducted with dangerous, time and labor intensive operations, and specialized machinery. We envision a scalable robotic solution inspired by advancements in force-controlled and Cable Dr…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 8 pages, 9 figures. Submitted to IROS 2024

  13. arXiv:2311.11046  [pdf]

    q-bio.QM cs.LG q-bio.NC

    Classification of Major Depressive Disorder Using Vertex-Wise Brain Sulcal Depth, Curvature, and Thickness with a Deep and a Shallow Learning Model

    Authors: Roberto Goya-Maldonado, Tracy Erwin-Grabner, Ling-Li Zeng, Christopher R. K. Ching, Andre Aleman, Alyssa R. Amod, Zeynep Basgoze, Francesco Benedetti, Bianca Besteher, Katharina Brosch, Robin Bülow, Romain Colle, Colm G. Connolly, Emmanuelle Corruble, Baptiste Couvy-Duchesne, Kathryn Cullen, Udo Dannlowski, Christopher G. Davey, Annemiek Dols, Jan Ernsting, Jennifer W. Evans, Lukas Fisch, Paola Fuentes-Claramonte, Ali Saffet Gonul, Ian H. Gotlib, et al. (62 additional authors not shown)

    Abstract: Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, h…

    Submitted 24 January, 2025; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2206.08122

  14. HAL 9000: a Risk Manager for ITSs

    Authors: Tadeu Freitas, Carlos Novo, Joao Soares, Ines Dutra, Manuel E. Correia, Behnam Shariati, Rolando Martins

    Abstract: HAL 9000 is an Intrusion Tolerant Systems (ITSs) Risk Manager, which assesses configuration risks against potential intrusions. It utilizes gathered threat knowledge and remains operational, even in the absence of updated information. Based on its advice, the ITSs can dynamically and proactively adapt to recent threats to minimize and mitigate future intrusions from malicious adversaries. Our goal…

    Submitted 21 March, 2025; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 10 pages, 4 figures

  15. On the Computational Complexities of Complex-valued Neural Networks

    Authors: Kayol Soares Mayer, Jonathan Aguiar Soares, Ariadne Arrais Cruz, Dalton Soares Arantes

    Abstract: Complex-valued neural networks (CVNNs) are nonlinear filters used in the digital signal processing of complex-domain data. Compared with real-valued neural networks (RVNNs), CVNNs can directly handle complex-valued input and output signals due to their complex domain parameters and activation functions. With the trend toward low-power systems, computational complexity analysis has become essential…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: IEEE Latin-American Conference on Communications

    Journal ref: IEEE Latin-American Conference on Communications (LATINCOM 2023)

  16. CVNN-based Channel Estimation and Equalization in OFDM Systems Without Cyclic Prefix

    Authors: Heitor dos Santos Sousa, Jonathan Aguiar Soares, Kayol Soares Mayer, Dalton Soares Arantes

    Abstract: In modern communication systems operating with Orthogonal Frequency-Division Multiplexing (OFDM), channel estimation requires minimal complexity with one-tap equalizers. However, this depends on cyclic prefixes, which must be sufficiently large to cover the channel impulse response. Conversely, the use of cyclic prefix (CP) decreases the useful information that can be conveyed in an OFDM frame, th…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: XLI Simpósio Brasileiro de Telecomunicações e Processamento Digital de Sinais - SBrT 2023

    Journal ref: XLI Simpósio Brasileiro de Telecomunicações e Processamento de Sinais (SBrT 2023)

  17. Matrices inducing generalized metric on sequences

    Authors: Eloi Araujo, Fábio V. Martinez, Carlos H. A. Higa, José Soares

    Abstract: Sequence comparison is a basic task to capture similarities and differences between two or more sequences of symbols, with countless applications such as in computational biology. An alignment is a way to compare sequences, where a given scoring function determines the degree of similarity between them. Many scoring functions are obtained from scoring matrices. However, not all scoring matrices in…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 40 pages, 2 figures

    Journal ref: Discrete Applied Mathematics, Volume 332, 15 June 2023, Pages 135-154

  18. SoccerNet 2022 Challenges Results

    Authors: Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, et al. (69 additional authors not shown)

    Abstract: The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestamps in long untrimmed videos, (2) replay grounding, focusing on retrieving the live moment of an action shown in a replay, (3) pitch localization, focusing on det…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ACM MMSports 2022

  19. PCA-based Channel Estimation for MIMO Communications

    Authors: Jonathan Aguiar Soares, Kayol Soares Mayer, Pedro Benevenuto Valadares, Dalton Soares Arantes

    Abstract: In multiple-input multiple-output communications, channel estimation is paramount to keep base stations and users on track. This paper proposes a novel PCA-based (principal component analysis) channel estimation approach for MIMO orthogonal frequency division multiplexing systems. The channel frequency response is firstly estimated with the least squares method, and then PCA is used to filter only t…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 5 pages, 7 figures, XL SIMPÓSIO BRASILEIRO DE TELECOMUNICAÇÕES E PROCESSAMENTO DE SINAIS (SBrT 2022)

  20. arXiv:2209.10710  [pdf, other]

    cs.RO

    Visual Localization and Mapping in Dynamic and Changing Environments

    Authors: João Carlos Virgolino Soares, Vivian Suzano Medeiros, Gabriel Fischer Abati, Marcelo Becker, Glauco Caurin, Marcelo Gattass, Marco Antonio Meggiolaro

    Abstract: The real-world deployment of fully autonomous mobile robots depends on a robust SLAM (Simultaneous Localization and Mapping) system, capable of handling dynamic environments, where objects are moving in front of the robot, and changing environments, where objects are moved or replaced after the robot has already mapped the scene. This paper presents Changing-SLAM, a method for robust Visual SLAM i…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 14 pages, 13 figures

  21. arXiv:2206.07846  [pdf, ps, other]

    cs.CV

    Action Spotting using Dense Detection Anchors Revisited: Submission to the SoccerNet Challenge 2022

    Authors: João V. B. Soares, Avijit Shah

    Abstract: This brief technical report describes our submission to the Action Spotting SoccerNet Challenge 2022. The challenge was part of the CVPR 2022 ActivityNet Workshop. Our submission was based on a recently proposed method which focuses on increasing temporal precision via a densely sampled set of detection anchors. Due to its emphasis on temporal precision, this approach had shown significant improve…

    Submitted 3 August, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: v2: a few more experiments, more detailed method description

  22. arXiv:2205.10450  [pdf, other]

    cs.CV

    Temporally Precise Action Spotting in Soccer Videos Using Dense Detection Anchors

    Authors: João V. B. Soares, Avijit Shah, Topojoy Biswas

    Abstract: We present a model for temporally precise action spotting in videos, which uses a dense set of detection anchors, predicting a detection confidence and corresponding fine-grained temporal displacement for each anchor. We experiment with two trunk architectures, both of which are able to incorporate large temporal contexts while preserving the smaller-scale features required for precise localizatio…

    Submitted 11 July, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted in International Conference on Image Processing (ICIP), 2022

  23. Analyzing Flight Delay Prediction Under Concept Drift

    Authors: Lucas Giusti, Leonardo Carvalho, Antonio Tadeu Gomes, Rafaelli Coutinho, Jorge Soares, Eduardo Ogasawara

    Abstract: Flight delays impose challenges that impact any flight transportation system. Predicting when they are going to occur is an important way to mitigate this issue. However, the behavior of the flight delay system varies through time. This phenomenon is known in predictive analytics as concept drift. This paper investigates the prediction performance of different drift handling strategies in aviation…

    Submitted 4 April, 2021; originally announced April 2021.

  24. An effective and friendly tool for seed image analysis

    Authors: Andrea Loddo, Cecilia Di Ruberto, A. M. P. G. Vale, Mariano Ucchesu, J. M. Soares, Gianluigi Bacchetta

    Abstract: Image analysis is an essential field for several topics in the life sciences, such as biology or botany. In particular, the analysis of seeds (e.g. fossil research) can provide significant information on their evolution, the history of agriculture, plant domestication and knowledge of diets in ancient times. This work aims to present software that performs image analysis for feature extraction and…

    Submitted 23 July, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

  25. arXiv:2012.06414  [pdf]

    cs.CV

    A new automatic approach to seed image analysis: From acquisition to segmentation

    Authors: A. M. P. G. Vale, M. Ucchesu, C. Di Ruberto, A. Loddo, J. M. Soares, G. Bacchetta

    Abstract: Image Analysis offers a new tool for classifying vascular plant species based on the morphological and colorimetric features of the seeds, and has made significant contributions in systematic studies. However, in order to extract the morphological and colorimetric features, it is necessary to segment the image containing the samples to be analysed. This stage represents one of the most challenging…

    Submitted 11 December, 2020; originally announced December 2020.

  26. arXiv:1911.11099  [pdf, ps, other]

    cs.DM math.CO

    Polyhedral study of the Convex Recoloring problem

    Authors: Manoel Campêlo, Phablo F. S. Moura, Joel C. Soares

    Abstract: A coloring of the vertices of a connected graph is convex if each color class induces a connected subgraph. We address the convex recoloring (CR) problem defined as follows. Given a graph $G$ and a coloring of its vertices, recolor a minimum number of vertices of $G$ so that the resulting coloring is convex. This problem, known to be NP-hard even on paths, was firstly motivated by applications on…

    Submitted 25 November, 2019; originally announced November 2019.

  27. arXiv:1906.05963  [pdf, other]

    cs.CV cs.CL

    Image Captioning: Transforming Objects into Words

    Authors: Simao Herdade, Armin Kappeler, Kofi Boakye, Joao Soares

    Abstract: Image captioning models typically follow an encoder-decoder architecture which uses abstract image feature vectors as input to the encoder. One of the most successful algorithms uses feature vectors extracted from the region proposals obtained from an object detector. In this work we introduce the Object Relation Transformer, that builds upon this approach by explicitly incorporating information a…

    Submitted 11 January, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: 10 pages

  28. A Review on Flight Delay Prediction

    Authors: Alice Sternberg, Jorge Soares, Diego Carvalho, Eduardo Ogasawara

    Abstract: Flight delays hurt airlines, airports, and passengers. Their prediction is crucial during the decision-making process for all players of commercial aviation. Moreover, the development of accurate prediction models for flight delays became cumbersome due to the complexity of air transportation system, the number of methods for prediction, and the deluge of flight data. In this context, this paper p…

    Submitted 4 April, 2021; v1 submitted 15 March, 2017; originally announced March 2017.

  29. arXiv:1701.01941  [pdf]

    cs.CV

    Multi-Objective Software Suite of Two-Dimensional Shape Descriptors for Object-Based Image Analysis

    Authors: Andrea Baraldi, João V. B. Soares

    Abstract: In recent years two sets of planar (2D) shape attributes, provided with an intuitive physical meaning, were proposed to the remote sensing community by, respectively, Nagao & Matsuyama and Shackelford & Davis in their seminal works on the increasingly popular geographic object based image analysis (GEOBIA) paradigm. These two published sets of intuitive geometric features were selected as initial…

    Submitted 2 February, 2017; v1 submitted 8 January, 2017; originally announced January 2017.

  30. Retinal Vessel Segmentation Using the 2-D Morlet Wavelet and Supervised Classification

    Authors: João V. B. Soares, Jorge J. G. Leandro, Roberto M. Cesar Jr., Herbert F. Jelinek, Michael J. Cree

    Abstract: We present a method for automated segmentation of the vasculature in retinal images. The method produces segmentations by classifying each image pixel as vessel or non-vessel, based on the pixel's feature vector. Feature vectors are composed of the pixel's intensity and continuous two-dimensional Morlet wavelet transform responses taken at multiple scales. The Morlet wavelet is capable of tuning…

    Submitted 11 May, 2006; v1 submitted 30 September, 2005; originally announced October 2005.

    Comments: 9 pages, 7 figures and 1 table. Accepted for publication in IEEE Trans Med Imag; added copyright notice

    Journal ref: IEEE Trans Med Imag, Vol. 25, no. 9, pp. 1214- 1222, Sep. 2006.