Skip to main content

Showing 1–50 of 200 results for author: Costa, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.22345  [pdf, ps, other

    cs.SI physics.soc-ph

    A Systematic Approach for Studying How Topological Measurements Respond to Complex Networks Modifications

    Authors: Alexandre Benatti, Roberto M. Cesar Jr., Luciano da F. Costa

    Abstract: Different types of graphs and complex networks have been characterized, analyzed, and modeled based on measurements of their respective topology. However, the available networks may constitute approximations of the original structure as a consequence of sampling incompleteness, noise, and/or error in the representation of that structure. Therefore, it becomes of particular interest to quantify how… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 38 pages and 18 figures

  2. Scaling Up: Revisiting Mining Android Sandboxes at Scale for Malware Classification

    Authors: Francisco Costa, Ismael Medeiros, Leandro Oliveira, João Calássio, Rodrigo Bonifácio, Krishna Narasimhan, Mira Mezini, Márcio Ribeiro

    Abstract: The widespread use of smartphones in daily life has raised concerns about privacy and security among researchers and practitioners. Privacy issues are generally highly prevalent in mobile applications, particularly targeting the Android platform, the most popular mobile operating system. For this reason, several techniques have been proposed to identify malicious behavior in Android applications,… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: 12 pages, 2 figures, ECOOP 2025

  3. Continuous signal sparse encoding using analog neuromorphic variability

    Authors: Filippo Costa, Chiara De Luca

    Abstract: Achieving fast and reliable temporal signal encoding is crucial for low-power, always-on systems. While current spike-based encoding algorithms rely on complex networks or precise timing references, simple and robust encoding models can be obtained by leveraging the intrinsic properties of analog hardware substrates. We propose an encoding framework inspired by biological principles that leverages… ▽ More

    Submitted 22 April, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  4. arXiv:2501.06003  [pdf, other

    cs.LG

    Learning to generate feasible graphs using graph grammars

    Authors: Stefan Mautner, Rolf Backofen, Fabrizio Costa

    Abstract: Generative methods for graphs need to be sufficiently flexible to model complex dependencies between sets of nodes. At the same time, the generated graphs need to satisfy domain-dependent feasibility conditions, that is, they should not violate certain constraints that would make their interpretation impossible within the given application domain (e.g. a molecular graph where an atom has a very la… ▽ More

    Submitted 21 January, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

  5. arXiv:2410.13385  [pdf, other

    eess.AS cs.CL cs.SD

    On the Use of Audio to Improve Dialogue Policies

    Authors: Daniel Roncel, Federico Costa, Javier Hernando

    Abstract: With the significant progress of speech technologies, spoken goal-oriented dialogue systems are becoming increasingly popular. One of the main modules of a dialogue system is typically the dialogue policy, which is responsible for determining system actions. This component usually relies only on audio transcriptions, being strongly dependent on their quality and ignoring very important extralingui… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: IberSpeech 2024

  6. arXiv:2410.11151  [pdf

    stat.ME cs.IT stat.AP

    Discovering the critical number of respondents to validate an item in a questionnaire: The Binomial Cut-level Content Validity proposal

    Authors: Helder Gomes Costa, Eduardo Shimoda, José Fabiano da Serra Costa, Aldo Shimoya, Edilvando Pereira Eufrazio

    Abstract: The question that drives this research is: "How to discover the number of respondents that are necessary to validate items of a questionnaire as actually essential to reach the questionnaire's proposal?" Among the efforts in this subject, \cite{Lawshe1975, Wilson2012, Ayre_CVR_2014} approached this issue by proposing and refining the Content Validation Ratio (CVR) that looks to identify items that… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 17 pages, 1 figure

    MSC Class: 62P05; 62P10; 62P10; 62P12; 62P15; 62P20; 62P25; 90B50; 91B06; 90C29; 90C31; 91A35; 91B06 ACM Class: G.3; H.4; H.4.2

  7. arXiv:2409.11389  [pdf, other

    cs.LG physics.soc-ph

    Normalization in Proportional Feature Spaces

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: The subject of features normalization plays an important central role in data representation, characterization, visualization, analysis, comparison, classification, and modeling, as it can substantially influence and be influenced by all of these activities and respective aspects. The selection of an appropriate normalization method needs to take into account the type and characteristics of the in… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 31 pages, 10 figures

  8. arXiv:2409.01213  [pdf, other

    cs.LG physics.soc-ph

    Supervised Pattern Recognition Involving Skewed Feature Densities

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: Pattern recognition constitutes a particularly important task underlying a great deal of scientific and technologica activities. At the same time, pattern recognition involves several challenges, including the choice of features to represent the data elements, as well as possible respective transformations. In the present work, the classification potential of the Euclidean distance and a dissimila… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 25 page and 16 figures

  9. arXiv:2408.08971  [pdf, other

    cs.CL

    A Multi-Task and Multi-Label Classification Model for Implicit Discourse Relation Recognition

    Authors: Nelson Filipe Costa, Leila Kosseim

    Abstract: We address the inherent ambiguity in Implicit Discourse Relation Recognition (IDRR) by introducing a novel multi-task classification model capable of learning both multi-label and single-label representations of discourse relations. Our model is trained exclusively on the DiscoGeM corpus and evaluated both on the DiscoGeM and the PDTB 3.0 corpus. We establish the first benchmark on multi-label IDR… ▽ More

    Submitted 28 October, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

  10. arXiv:2406.15636  [pdf, other

    cs.SI physics.soc-ph

    Simple Games on Complex Networks

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: The relationship between topology and dynamics of complex systems has motivated continuing interest from the scientific community. In the present work, we address this interesting topic from the perspective of simple games, involving two teams playing according to a small set of simple rules, taking place on four types of complex networks. Starting from a minimalist game, characterized by full sym… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures

  11. Double Multi-Head Attention Multimodal System for Odyssey 2024 Speech Emotion Recognition Challenge

    Authors: Federico Costa, Miquel India, Javier Hernando

    Abstract: As computer-based applications are becoming more integrated into our daily lives, the importance of Speech Emotion Recognition (SER) has increased significantly. Promoting research with innovative approaches in SER, the Odyssey 2024 Speech Emotion Recognition Challenge was organized as part of the Odyssey 2024 Speaker and Language Recognition Workshop. In this paper we describe the Double Multi-He… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Odyssey 2024: The Speaker and Language Recognition Workshop

    Journal ref: Proc. The Speaker and Language Recognition Workshop (Odyssey 2024), 266-273

  12. arXiv:2406.03587  [pdf, other

    physics.soc-ph cs.SI

    Subsuming Complex Networks by Node Walks

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: The concept of node walk in graphs and complex networks has been addressed, consisting of one or more nodes that move into adjacent nodes, henceforth incorporating the respective connections. This type of dynamics is then applied to subsume complex networks. Three types of networks (Erdós- Rény, Barabási-Albert, as well as a geometric model) are considered, while three node walks heuristics (unifo… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 14 pages and 7 figures

  13. arXiv:2405.15498  [pdf, other

    cs.SI physics.soc-ph

    Node Accessibility Characterization of Radially-Grown Structures

    Authors: Alexandre Benatti, Roberto M. Cesar Jr., Luciano da F. Costa

    Abstract: Complex systems have motivated continuing interest from the scientific community, leading to new concepts and methods. Growing systems represent a case of particular interest, as their topological, geometrical, and also dynamical properties change along time, as new elements are incorporated into the existing structure. In the present work, an approach is the case in which systems grown radially a… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 14 pages, 8 figures

  14. Speaker Characterization by means of Attention Pooling

    Authors: Federico Costa, Miquel India, Javier Hernando

    Abstract: State-of-the-art Deep Learning systems for speaker verification are commonly based on speaker embedding extractors. These architectures are usually composed of a feature extractor front-end together with a pooling layer to encode variable-length utterances into fixed-length speaker vectors. The authors have recently proposed the use of a Double Multi-Head Self-Attention pooling for speaker recogni… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: IberSpeech 2022

    Journal ref: Proc. IberSPEECH 2022, 166-170

  15. Dirigent: Lightweight Serverless Orchestration

    Authors: Lazar Cvetković, François Costa, Mihajlo Djokic, Michal Friedman, Ana Klimovic

    Abstract: While Function as a Service (FaaS) platforms can initialize function sandboxes on worker nodes in 10-100s of milliseconds, the latency to schedule functions in real FaaS clusters can be orders of magnitude higher. The current approach of building FaaS cluster managers on top of legacy orchestration systems (e.g., Kubernetes) leads to high scheduling delays when clusters experience high sandbox chu… ▽ More

    Submitted 28 October, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  16. arXiv:2403.17713  [pdf, other

    physics.soc-ph cs.SI

    Distance-Based Hierarchical Cutting of Complex Networks with Non-Preferential and Preferential Choice of Seeds

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: Graphs and complex networks can be successively separated into connected components associated to respective seed nodes, therefore establishing a respective hierarchical organization. In the present work, we study the properties of the hierarchical structure implied by distance-based cutting of Erdős-Rényi, Barabási-Albert, and a specific geometric network. Two main situations are considered regar… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 15 pages and 9 figures

  17. arXiv:2403.06876  [pdf, other

    cs.SI physics.soc-ph

    Hierarchical Cutting of Complex Networks Performed by Random Walks

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: Several interesting approaches have been reported in the literature on complex networks, random walks, and hierarchy of graphs. While many of these works perform random walks on stable, fixed networks, in the present work we address the situation in which the connections traversed by each step of a uniformly random walks are progressively removed, yielding a successively less interconnected struct… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 16 pages and 9 figures

  18. arXiv:2403.02821  [pdf, other

    cs.LG cs.CE math.OC

    An Adaptive Hydropower Management Approach for Downstream Ecosystem Preservation

    Authors: C. Coelho, M. Jing, M. Fernanda P. Costa, L. L. Ferrás

    Abstract: Hydropower plants play a pivotal role in advancing clean and sustainable energy production, contributing significantly to the global transition towards renewable energy sources. However, hydropower plants are currently perceived both positively as sources of renewable energy and negatively as disruptors of ecosystems. In this work, we highlight the overlooked potential of using hydropower plant as… ▽ More

    Submitted 4 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    ACM Class: J.2; I.5.1; G.1.6

  19. arXiv:2403.02737  [pdf, other

    cs.LG cs.CE math.NA

    Neural Fractional Differential Equations

    Authors: C. Coelho, M. Fernanda P. Costa, L. L. Ferrás

    Abstract: Fractional Differential Equations (FDEs) are essential tools for modelling complex systems in science and engineering. They extend the traditional concepts of differentiation and integration to non-integer orders, enabling a more precise representation of processes characterised by non-local and memory-dependent behaviours. This property is useful in systems where variables do not respond to cha… ▽ More

    Submitted 25 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    MSC Class: G.1; G.1.10; G.4; I.5.1

    Journal ref: Applied Mathematical Modelling (2025): 116060

  20. arXiv:2403.02730  [pdf, other

    cs.LG cs.CE math.OC

    A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks

    Authors: C. Coelho, M. Fernanda P. Costa, L. L. Ferrás

    Abstract: Real-world systems are often formulated as constrained optimization problems. Techniques to incorporate constraints into Neural Networks (NN), such as Neural Ordinary Differential Equations (Neural ODEs), have been used. However, these introduce hyperparameters that require manual tuning through trial and error, raising doubts about the successful incorporation of constraints into the generated mo… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    MSC Class: 35A01; 65L10; 65L12; 65L20; 65L70 ACM Class: I.5.1; G.1.6

  21. arXiv:2402.01198  [pdf, other

    cs.IT eess.SP

    Physical Layer Location Privacy in SIMO Communication Using Fake Path Injection

    Authors: Trong Duy Tran, Maxime Ferreira Da Costa, Linh Trung Nguyen

    Abstract: Fake path injection is an emerging paradigm for inducing privacy over wireless networks. In this paper, fake paths are injected by the transmitters into a single-input multiple-output (SIMO) communication channel to obscure their physical location from an eavesdropper. The case where the receiver (Bob) and the eavesdropper (Eve) use a linear uniform array to locate the transmitter's (Alice) positi… ▽ More

    Submitted 3 February, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

  22. arXiv:2401.17887  [pdf, other

    cs.SI

    Detecting Groups in Directed and Non-Directed Bipartite Networks

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: Bipartite networks provide an effective resource for representing, characterizing, and modeling several abstract and real-world systems and structures involving binary relations, which include food webs, social interactions, and customer-product relationships. Of particular interest is the problem of, given a specific bipartite network, to identify possible respective groups or clusters characteri… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 22 pages, 13 figures

  23. arXiv:2312.00859  [pdf, other

    physics.soc-ph cs.GR

    Random Walks Performed by Topologically-Specific Agents on Complex Networks

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: Random walks by single-node agents have been systematically conducted on various types of complex networks in order to investigate how their topologies can affect the dynamics of the agents. However, by fitting any network node, these agents do not engage in topological interactions with the network. In the present work, we describe random walks on complex networks performed by agents that are act… ▽ More

    Submitted 15 May, 2025; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 21 pages, 15 figures

  24. arXiv:2311.14817  [pdf, ps, other

    physics.soc-ph cs.SI

    Quantifying edge relevance for epidemic spreading via the semi-metric topology of complex networks

    Authors: David Soriano Paños, Felipe Xavier Costa, Luis M. Rocha

    Abstract: Sparsification aims at extracting a reduced core of associations that best preserves both the dynamics and topology of networks while reducing the computational cost of simulations. We show that the semi-metric topology of complex networks yields a natural and algebraically-principled sparsification that outperforms existing methods on those goals. Weighted graphs whose edges represent distances b… ▽ More

    Submitted 4 June, 2025; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: 13 pages, 4 figures. Supplementary Text: 12 pages, 1 table, 9 figures

  25. arXiv:2311.13293  [pdf, other

    cs.LG math.OC

    The Influence of Neural Networks on Hydropower Plant Management in Agriculture: Addressing Challenges and Exploring Untapped Opportunities

    Authors: C. Coelho, M. Fernanda P. Costa, L. L. Ferrás

    Abstract: Hydropower plants are crucial for stable renewable energy and serve as vital water sources for sustainable agriculture. However, it is essential to assess the current water management practices associated with hydropower plant management software. A key concern is the potential conflict between electricity generation and agricultural water needs. Prioritising water for electricity generation can r… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    MSC Class: 68T07 ACM Class: G.1.6; J.2; I.2.m

  26. arXiv:2311.09867  [pdf, other

    cs.MA

    Parallel and Sequential Resources Networks

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: A large number of real and abstract systems involve the transformation of some basic resource into respective products under the action of multiple processing agents, which can be understood as multiple-agent production systems (MAP). At each discrete time instant, for each agent, a fraction of the resources is assumed to be kept, forwarded to other agents, or converted into work with some efficie… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 21 pages, 12 figures

  27. arXiv:2311.04925  [pdf

    cs.CL cs.AI

    Investigating Deep-Learning NLP for Automating the Extraction of Oncology Efficacy Endpoints from Scientific Literature

    Authors: Aline Gendrin-Brokmann, Eden Harrison, Julianne Noveras, Leonidas Souliotis, Harris Vince, Ines Smit, Francisco Costa, David Milward, Sashka Dimitrievska, Paul Metcalfe, Emilie Louvet

    Abstract: Benchmarking drug efficacy is a critical step in clinical trial design and planning. The challenge is that much of the data on efficacy endpoints is stored in scientific papers in free text form, so extraction of such data is currently a largely manual task. Our objective is to automate this task as much as possible. In this study we have developed and optimised a framework to extract efficacy end… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  28. arXiv:2311.04133  [pdf, other

    cs.SI

    Simple Bundles of Complex Networks

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: Complex networks can be used to represent and model an ample diversity of abstract and real-world systems and structures. A good deal of the research on these structures has focused on specific topological properties, including node degree, shortest paths, and modularity. In the present work, we develop an approach aimed at identifying and characterizing simple bundles of interconnections between… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 29 pages, 21 figures

  29. arXiv:2309.12949  [pdf, other

    cs.IT eess.SP

    Guaranteed Private Communication with Secret Block Structure

    Authors: Maxime Ferreira Da Costa, Jianxiu Li, Urbashi Mitra

    Abstract: A novel private communication framework is proposed where privacy is induced by transmitting over a channel instances of linear inverse problems that are identifiable to the legitimate receiver but unidentifiable to an eavesdropper. The gap in identifiability is created in the framework by leveraging secret knowledge between the transmitter and the legitimate receiver. Specifically, the case where… ▽ More

    Submitted 22 July, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

  30. arXiv:2308.14541  [pdf, other

    cs.NE

    Multilayer Multiset Neuronal Networks -- MMNNs

    Authors: Alexandre Benatti, Luciano da Fontoura Costa

    Abstract: The coincidence similarity index, based on a combination of the Jaccard and overlap similarity indices, has noticeable properties in comparing and classifying data, including enhanced selectivity and sensitivity, intrinsic normalization, and robustness to data perturbations and outliers. These features allow multiset neurons, which are based on the coincidence similarity operation, to perform effe… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 32 pages, 21 figures

  31. arXiv:2307.15208  [pdf, other

    eess.IV cs.CV

    Generative AI for Medical Imaging: extending the MONAI Framework

    Authors: Walter H. L. Pinaya, Mark S. Graham, Eric Kerfoot, Petru-Daniel Tudosiu, Jessica Dafflon, Virginia Fernandez, Pedro Sanchez, Julia Wolleb, Pedro F. da Costa, Ashay Patel, Hyungjin Chung, Can Zhao, Wei Peng, Zelong Liu, Xueyan Mei, Oeslle Lucena, Jong Chul Ye, Sotirios A. Tsaftaris, Prerna Dogra, Andrew Feng, Marc Modat, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Recent advances in generative AI have brought incredible breakthroughs in several areas, including medical imaging. These generative models have tremendous potential not only to help safely share medical data via synthetic datasets but also to perform an array of diverse applications, such as anomaly detection, image-to-image translation, denoising, and MRI reconstruction. However, due to the comp… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  32. arXiv:2307.14940  [pdf, other

    cs.LG math.OC

    A Self-Adaptive Penalty Method for Integrating Prior Knowledge Constraints into Neural ODEs

    Authors: C. Coelho, M. Fernanda P. Costa, L. L. Ferrás

    Abstract: The continuous dynamics of natural systems has been effectively modelled using Neural Ordinary Differential Equations (Neural ODEs). However, for accurate and meaningful predictions, it is crucial that the models follow the underlying rules or laws that govern these systems. In this work, we propose a self-adaptive penalty algorithm for Neural ODEs to enable modelling of constrained natural system… ▽ More

    Submitted 5 March, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    ACM Class: I.5.1; G.1.6

  33. arXiv:2307.10123  [pdf, other

    cs.CV

    Two Approaches to Supervised Image Segmentation

    Authors: Alexandre Benatti, Luciano da F. Costa

    Abstract: Though performed almost effortlessly by humans, segmenting 2D gray-scale or color images into respective regions of interest (e.g.~background, objects, or portions of objects) constitutes one of the greatest challenges in science and technology as a consequence of several effects including dimensionality reduction(3D to 2D), noise, reflections, shades, and occlusions, among many other possibilitie… ▽ More

    Submitted 22 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 38 pages, 19 figures

  34. Enhancing Continuous Time Series Modelling with a Latent ODE-LSTM Approach

    Authors: C. Coelho, M. Fernanda P. Costa, L. L. Ferrás

    Abstract: Due to their dynamic properties such as irregular sampling rate and high-frequency sampling, Continuous Time Series (CTS) are found in many applications. Since CTS with irregular sampling rate are difficult to model with standard Recurrent Neural Networks (RNNs), RNNs have been generalised to have continuous-time hidden dynamics defined by a Neural Ordinary Differential Equation (Neural ODE), lead… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    ACM Class: I.5.1; G.1.7

    Journal ref: Applied Mathematics and Computation 475 (2024): 128727

  35. Neural Chronos ODE: Unveiling Temporal Patterns and Forecasting Future and Past Trends in Time Series Data

    Authors: C. Coelho, M. Fernanda P. Costa, L. L. Ferrás

    Abstract: This work introduces Neural Chronos Ordinary Differential Equations (Neural CODE), a deep neural network architecture that fits a continuous-time ODE dynamics for predicting the chronology of a system both forward and backward in time. To train the model, we solve the ODE as an initial value problem and a final value problem, similar to Neural ODEs. We also explore two approaches to combining Neur… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: Under review at journal

    ACM Class: I.5.1; G.1.7

    Journal ref: Expert Systems with Applications (2025): 126784

  36. arXiv:2306.06553  [pdf, other

    cs.CV cs.AI cs.LG

    Hinting Pipeline and Multivariate Regression CNN for Maize Kernel Counting on the Ear

    Authors: Felipe Araújo, Igor Gadelha, Rodrigo Tsukahara, Luiz Pita, Filipe Costa, Igor Vaz, Andreza Santos, Guilherme Fôlego

    Abstract: Maize is a highly nutritional cereal widely used for human and animal consumption and also as raw material by the biofuels industries. This highlights the importance of precisely quantifying the corn grain productivity in season, helping the commercialization process, operationalization, and critical decision-making. Considering the manual labor cost of counting maize kernels, we propose in this w… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

  37. arXiv:2305.18315  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    CDJUR-BR -- A Golden Collection of Legal Document from Brazilian Justice with Fine-Grained Named Entities

    Authors: Antonio Mauricio, Vladia Pinheiro, Vasco Furtado, João Araújo Monteiro Neto, Francisco das Chagas Jucá Bomfim, André Câmara Ferreira da Costa, Raquel Silveira, Nilsiton Aragão

    Abstract: A basic task for most Legal Artificial Intelligence (Legal AI) applications is Named Entity Recognition (NER). However, texts produced in the context of legal practice make references to entities that are not trivially recognized by the currently available NERs. There is a lack of categorization of legislation, jurisprudence, evidence, penalties, the roles of people in a legal process (judge, lawy… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 15 pages, in Portuguese language, 3 figures, 5 tables

  38. arXiv:2301.02940  [pdf, ps, other

    eess.SY cs.AI eess.SP

    GA-Aided Directivity in Volumetric and Planar Massive-Antenna Array Design

    Authors: Bruno Felipe Costa, Taufik Abrão

    Abstract: The problem of directivity enhancement, leading to the increase in the directivity gain over a certain desired angle of arrival/departure (AoA/AoD), is considered in this work. A new formulation of the volumetric array directivity problem is proposed using the rectangular coordinates to describe each antenna element and the desired azimuth and elevation angles with a general element pattern. Such… ▽ More

    Submitted 7 January, 2023; originally announced January 2023.

    Comments: 25pages

    Journal ref: COSTA, BRUNO FELIPE ; Abrão, Taufik . GA-aided directivity in volumetric and planar massive-antenna array design. SIGNAL PROCESSING, v. 205, p. 108857, 2023

  39. arXiv:2212.04984  [pdf, other

    cs.LG cs.AI

    Transformer-based normative modelling for anomaly detection of early schizophrenia

    Authors: Pedro F Da Costa, Jessica Dafflon, Sergio Leonardo Mendes, João Ricardo Sato, M. Jorge Cardoso, Robert Leech, Emily JH Jones, Walter H. L. Pinaya

    Abstract: Despite the impact of psychiatric disorders on clinical health, early-stage diagnosis remains a challenge. Machine learning studies have shown that classifiers tend to be overly narrow in the diagnosis prediction task. The overlap between conditions leads to high heterogeneity among participants that is not adequately captured by classification models. To address this issue, normative approaches h… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 10 pages, 2 figures, 2 tables, presented at NeurIPS22@PAI4MH

  40. arXiv:2210.02334  [pdf, other

    cs.CL cs.LG

    Using Full-Text Content to Characterize and Identify Best Seller Books

    Authors: Giovana D. da Silva, Filipi N. Silva, Henrique F. de Arruda, Bárbara C. e Souza, Luciano da F. Costa, Diego R. Amancio

    Abstract: Artistic pieces can be studied from several perspectives, one example being their reception among readers over time. In the present work, we approach this interesting topic from the standpoint of literary works, particularly assessing the task of predicting whether a book will become a best seller. Dissimilarly from previous approaches, we focused on the full content of books and considered visual… ▽ More

    Submitted 11 May, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

  41. arXiv:2209.07162  [pdf, other

    eess.IV cs.CV q-bio.QM

    Brain Imaging Generation with Latent Diffusion Models

    Authors: Walter H. L. Pinaya, Petru-Daniel Tudosiu, Jessica Dafflon, Pedro F da Costa, Virginia Fernandez, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Deep neural networks have brought remarkable breakthroughs in medical image analysis. However, due to their data-hungry nature, the modest dataset sizes in medical imaging projects might be hindering their full potential. Generating synthetic data provides a promising alternative, allowing to complement training datasets and conducting medical image research at a larger scale. Diffusion models rec… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 10 pages, 3 figures, Accepted in the Deep Generative Models workshop @ MICCAI 2022

  42. arXiv:2209.01181  [pdf, other

    cs.SI cs.DS physics.soc-ph

    The distance backbone of directed networks

    Authors: Felipe Xavier Costa, Rion Brattig Correia, Luis M. Rocha

    Abstract: In weighted graphs the shortest path between two nodes is often reached through an indirect path, out of all possible connections, leading to structural redundancies which play key roles in the dynamics and evolution of complex networks. We have previously developed a parameter-free, algebraically-principled methodology to uncover such redundancy and reveal the distance backbone of weighted graphs… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted at the 11th International Conference on Complex Networks and their Applications

  43. arXiv:2208.10073  [pdf, other

    cs.IT eess.SP math.NA

    Local Geometry of Nonconvex Spike Deconvolution from Low-Pass Measurements

    Authors: Maxime Ferreira Da Costa, Yuejie Chi

    Abstract: Spike deconvolution is the problem of recovering the point sources from their convolution with a known point spread function, which plays a fundamental role in many sensing and imaging applications. In this paper, we investigate the local geometry of recovering the parameters of point sources$\unicode{x2014}$including both amplitudes and locations$\unicode{x2014}$by minimizing a natural nonconvex… ▽ More

    Submitted 27 February, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

  44. arXiv:2206.03461  [pdf, other

    cs.CV eess.IV q-bio.QM

    Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models

    Authors: Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Deep generative models have emerged as promising tools for detecting arbitrary anomalies in data, dispensing with the necessity for manual labelling. Recently, autoregressive transformers have achieved state-of-the-art performance for anomaly detection in medical imaging. Nonetheless, these models still have some intrinsic weaknesses, such as requiring images to be modelled as 1D sequences, the ac… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  45. arXiv:2202.02932  [pdf, other

    cs.IT eess.SP math.NA

    On the Stability of Super-Resolution and a Beurling-Selberg Type Extremal Problem

    Authors: Maxime Ferreira Da Costa, Urbashi Mitra

    Abstract: Super-resolution estimation is the problem of recovering a stream of spikes (point sources) from the noisy observation of a few numbers of its first trigonometric moments. The performance of super-resolution is recognized to be intimately related to the separation between the spikes to recover. A novel notion of stability of the Fisher information matrix (FIM) of the super-resolution problem is in… ▽ More

    Submitted 15 May, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

  46. Text characterization based on recurrence networks

    Authors: Bárbara C. e Souza, Filipi N. Silva, Henrique F. de Arruda, Giovana D. da Silva, Luciano da F. Costa, Diego R. Amancio

    Abstract: Several complex systems are characterized by presenting intricate characteristics taking place at several scales of time and space. These multiscale characterizations are used in various applications, including better understanding diseases, characterizing transportation systems, and comparison between cities, among others. In particular, texts are also characterized by a hierarchical structure th… ▽ More

    Submitted 2 May, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    Journal ref: Information Sciences (2023)

  47. arXiv:2112.01369  [pdf, other

    stat.ME cs.IT

    The Classic Cross-Correlation and the Real-Valued Jaccard and Coincidence Indices

    Authors: Luciano da F. Costa

    Abstract: In this work we describe and compare the classic inner product and Pearson correlation coefficient as well as the recently introduced real-valued Jaccard and coincidence indices. Special attention is given to diverse schemes for taking into account the signs of the operands, as well as on the study of the geometry of the scalar field surface related to the generalized multiset binary operations un… ▽ More

    Submitted 25 November, 2021; originally announced December 2021.

    Comments: 9 pages, 8 figure. A preprint

  48. arXiv:2111.08516  [pdf, other

    cs.LG

    Multiset Neurons

    Authors: Luciano da F. Costa

    Abstract: The present work reports a comparative performance of artificial neurons obtained in terms of the real-valued Jaccard and coincidence similarity indices and respectively derived functionals. The interiority index and classic cross-correlation are also included for comparison purposes. After presenting the basic concepts related to real-valued multisets and the adopted similarity metrics, including… ▽ More

    Submitted 23 April, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

    Comments: 21 pages, 32 figures. A preprint of a work submitted to a scientific journal (first revision)

  49. arXiv:2111.08514  [pdf, other

    cs.ET cs.LG

    Multiset Signal Processing and Electronics

    Authors: Luciano da F. Costa

    Abstract: Multisets are an intuitive extension of the traditional concept of sets that allow repetition of elements, with the number of times each element appears being understood as the respective multiplicity. Recent generalizations of multisets to real-valued functions, accounting for possibly negative values, have paved the way to a number of interesting implications and applications, including respecti… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

    Comments: 7 pages, 8 figures. A preprint of a work submitted to a scientific journal

  50. arXiv:2111.08513  [pdf, other

    cs.LG

    Comparing Cross Correlation-Based Similarities

    Authors: Luciano da F. Costa

    Abstract: The real-valued Jaccard and coincidence indices, in addition to their conceptual and computational simplicity, have been verified to be able to provide promising results in tasks such as template matching, tending to yield peaks that are sharper and narrower than those typically obtained by standard cross-correlation, while also attenuating substantially secondary matchings. In this work, the mult… ▽ More

    Submitted 21 November, 2021; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: 13 pages, 8 figures. A preprint