Showing 1–50 of 111 results for author: Silva, T

Searching in archive cs.
  1. arXiv:2506.11020  [pdf, other]

    cs.SE cs.AI

    Extracting Knowledge Graphs from User Stories using LangChain

    Authors: Thayná Camargo da Silva

    Abstract: This thesis introduces a novel methodology for the automated generation of knowledge graphs from user stories by leveraging the advanced capabilities of Large Language Models. Utilizing the LangChain framework as a basis, the User Story Graph Transformer module was developed to extract nodes and relationships from user stories using an LLM to construct accurate knowledge graphs. This innovative tec…

    Submitted 14 May, 2025; originally announced June 2025.

    Comments: Master thesis work

  2. arXiv:2506.08274  [pdf, ps, other]

    cs.LG stat.ML

    The Impact of Feature Scaling In Machine Learning: Effects on Regression and Classification Tasks

    Authors: João Manoel Herrera Pinheiro, Suzana Vilas Boas de Oliveira, Thiago Henrique Segreto Silva, Pedro Antonio Rabelo Saraiva, Enzo Ferreira de Souza, Leonardo André Ambrosio, Marcelo Becker

    Abstract: This research addresses the critical lack of comprehensive studies on feature scaling by systematically evaluating 12 scaling techniques - including several less common transformations - across 14 different Machine Learning algorithms and 16 datasets for classification and regression tasks. We meticulously analyzed impacts on predictive performance (using metrics such as accuracy, MAE, MSE, and…

    Submitted 11 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

    Comments: 27 pages

  3. arXiv:2506.06357  [pdf, ps, other]

    eess.SP cs.IT math.ST

    Cascaded Multiwire-PLC/Multiple-VLC System: Characterization and Performance

    Authors: Hugerles S. Silva, Higo T. P. Silva, Paulo V. B. Tomé, Felipe A. P. Figueiredo, Edson P. da Silva, Rausley A. A. de Souza

    Abstract: This paper proposes a cascaded multiwire-power line communication (PLC)/multiple-visible light communication (VLC) system. This hybrid architecture offers low installation cost, enhanced performance, practical feasibility, and a wide range of applications. Novel analytical expressions are derived for key statistics and outage probability, bit error probability, and ergodic channel capacity metrics…

    Submitted 3 June, 2025; originally announced June 2025.

  4. arXiv:2505.21533  [pdf, ps, other]

    cs.CV cs.LG

    Self-Organizing Visual Prototypes for Non-Parametric Representation Learning

    Authors: Thalles Silva, Helio Pedrini, Adín Ramírez Rivera

    Abstract: We present Self-Organizing Visual Prototypes (SOP), a new training technique for unsupervised visual feature learning. Unlike existing prototypical self-supervised learning (SSL) methods that rely on a single prototype to encode all relevant features of a hidden cluster in the data, we propose the SOP strategy. In this strategy, a prototype is represented by many semantically similar representatio…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Accepted at ICML 2025, code at https://github.com/sthalles/sop

  5. arXiv:2505.04566  [pdf, other]

    cs.LG

    Multitask LSTM for Arboviral Outbreak Prediction Using Public Health Data

    Authors: Lucas R. C. Farias, Talita P. Silva, Pedro H. M. Araujo

    Abstract: This paper presents a multitask learning approach based on long-short-term memory (LSTM) networks for the joint prediction of arboviral outbreaks and case counts of dengue, chikungunya, and Zika in Recife, Brazil. Leveraging historical public health data from DataSUS (2017-2023), the proposed model concurrently performs binary classification (outbreak detection) and regression (case forecasting) t…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 6 pages, 4 figures

  6. Beyond authorship: Analyzing contributions in PLOS ONE and the challenges of appropriate attribution

    Authors: Abdelghani Maddi, Jaime A. Teixeira da Silva

    Abstract: This study aims to evaluate the accuracy of authorship attributions in scientific publications, focusing on the fairness and precision of individual contributions within academic works. The study analyzes 81,823 publications from the journal PLOS ONE, covering the period from January 2018 to June 2023. It examines the authorship attributions within these publications to try and determine the preva…

    Submitted 24 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Journal ref: Abdelghani Maddi, Jaime A. Teixeira da Silva. Beyond authorship: Analyzing contributions in PLOS ONE and the challenges of appropriate attribution[J]. Journal of Data and Information Science, 2024

  7. Design and Implementation of the Transparent, Interpretable, and Multimodal (TIM) AR Personal Assistant

    Authors: Erin McGowan, Joao Rulff, Sonia Castelo, Guande Wu, Shaoyu Chen, Roque Lopez, Bea Steers, Iran R. Roman, Fabio F. Dias, Jing Qian, Parikshit Solunke, Michael Middleton, Ryan McKendrick, Claudio T. Silva

    Abstract: The concept of an AI assistant for task guidance is rapidly shifting from a science fiction staple to an impending reality. Such a system is inherently complex, requiring models for perceptual grounding, attention, and reasoning, an intuitive interface that adapts to the performer's needs, and the orchestration of data streams from many sensors. Moreover, all data acquired by the system must be re…

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Copyright 2025 IEEE. All rights reserved, including rights for text and data mining and training of artificial intelligence and similar technologies. Personal use is permitted, but republication/redistribution requires IEEE permission. Article accepted for publication in IEEE Computer Graphics and Applications. This is the author's version, content may change prior to final publication

  8. arXiv:2503.15618  [pdf, other]

    cs.IT eess.SP

    On the Secrecy Performance of $α$-$\mathcal{F}$ Channels with Pointing Errors

    Authors: Gabriel M. C. Neves, Hugerles S. Silva, Higo T. P. Silva, Wamberto J. L. Queiroz, Felipe A. P. Figueiredo, Rausley A. A. de Souza

    Abstract: This paper investigates the physical layer security (PLS) performance of $α$-$\mathcal{F}$ fading channels with pointing errors under passive and active eavesdropping scenarios. Novel analytical expressions are derived for key PLS metrics, including the probability of strictly positive secrecy capacity, the average secrecy capacity, and the secure outage probability. An asymptotic analysis is also…

    Submitted 19 March, 2025; originally announced March 2025.

  9. Sensing Movement: Contemporary Dance Workshops with People who are Blind or have Low Vision and Dance Teachers

    Authors: Madhuka Thisuri De Silva, Jim Smiley, Sarah Goodwin, Leona M Holloway, Matthew Butler

    Abstract: Dance teachers rely primarily on verbal instructions and visual demonstrations to convey key dance concepts and movement. These techniques, however, have limitations in supporting students who are blind or have low vision (BLV). This work explores the role technology can play in supporting instruction for BLV students, as well as improvisation with their instructor. Through a series of design work…

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: Accepted to appear at ACM CHI Conference on Human Factors in Computing Systems (CHI '25), April 26 - May 1, 2025, Yokohama, Japan

  10. arXiv:2502.12350  [pdf, other]

    cs.CE

    Mamute: high-performance computing for geophysical methods

    Authors: João B. Fernandes, Antônio D. S. Oliveira, Mateus C. A. T. Silva, Felipe H. Santos-da-Silva, Vitor H. M. Rodrigues, Kleiton A. Schneider, Calebe P. Bianchini, João M. de Araujo, Tiago Barros, Ítalo A. S. Assis, Samuel Xavier-de-Souza

    Abstract: Due to their high computational cost, geophysical applications are typically designed to run in large computing systems. Because of that, such applications must implement several high-performance techniques to use the computational resources better. In this paper, we present Mamute, a software that delivers wave equation-based geophysical methods. Mamute implements two geophysical methods: seismic…

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 24 pages, 6 figures, Journal

  11. arXiv:2411.05899  [pdf, other]

    cs.LG

    Streaming Bayes GFlowNets

    Authors: Tiago da Silva, Daniel Augusto de Souza, Diego Mesquita

    Abstract: Bayes' rule naturally allows for inference refinement in a streaming fashion, without the need to recompute posteriors from scratch whenever new data arrives. In principle, Bayesian streaming is straightforward: we update our prior with the available data and use the resulting posterior as a prior when processing the next data chunk. In practice, however, this recipe entails i) approximating an in…

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 25 pages, 8 figures

  12. arXiv:2411.03484  [pdf, other]

    cond-mat.mtrl-sci cs.IR

    Automated, LLM enabled extraction of synthesis details for reticular materials from scientific literature

    Authors: Viviane Torres da Silva, Alexandre Rademaker, Krystelle Lionti, Ronaldo Giro, Geisa Lima, Sandro Fiorini, Marcelo Archanjo, Breno W. Carvalho, Rodrigo Neumann, Anaximandro Souza, João Pedro Souza, Gabriela de Valnisio, Carmen Nilda Paz, Renato Cerqueira, Mathias Steiner

    Abstract: Automated knowledge extraction from scientific literature can potentially accelerate materials discovery. We have investigated an approach for extracting synthesis protocols for reticular materials from scientific literature using large language models (LLMs). To that end, we introduce a Knowledge Extraction Pipeline (KEP) that automatizes LLM-assisted paragraph classification and information extr…

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 16 pages

  13. arXiv:2410.16605  [pdf, other]

    cs.RO

    EnKode: Active Learning of Unknown Flows with Koopman Operators

    Authors: Alice Kate Li, Thales C. Silva, M. Ani Hsieh

    Abstract: In this letter, we address the task of adaptive sampling to model vector fields. When modeling environmental phenomena with a robot, gathering high resolution information can be resource intensive. Actively gathering data and modeling flows with the data is a more efficient alternative. However, in such scenarios, data is often sparse and thus requires flow modeling techniques that are effective a…

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  14. arXiv:2410.13937  [pdf, ps, other]

    quant-ph cs.CC

    Quantum computational complexity of matrix functions

    Authors: Santiago Cifuentes, Samson Wang, Thais L. Silva, Mario Berta, Leandro Aolita

    Abstract: We investigate the dividing line between classical and quantum computational power in estimating properties of matrix functions. More precisely, we study the computational complexity of two primitive problems: given a function $f$ and a Hermitian matrix $A$, compute a matrix element of $f(A)$ or compute a local measurement on $f(A)|0\rangle^{\otimes n}$, with $|0\rangle^{\otimes n}$ an $n$-qubit r…

    Submitted 22 April, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: 10+30 pages, 1 table, 2 figures

  15. arXiv:2410.09355  [pdf, other]

    cs.LG stat.ML

    On Divergence Measures for Training GFlowNets

    Authors: Tiago da Silva, Eliezer de Souza da Silva, Diego Mesquita

    Abstract: Generative Flow Networks (GFlowNets) are amortized inference models designed to sample from unnormalized distributions over composable objects, with applications in generative modeling for tasks in fields such as causal discovery, NLP, and drug discovery. Traditionally, the training procedure for GFlowNets seeks to minimize the expected log-squared difference between a proposal (forward policy) an…

    Submitted 21 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted at NeurIPS 2024, https://openreview.net/forum?id=N5H4z0Pzvn

    MSC Class: 68T05 ACM Class: G.3; I.5.1; I.2.8; I.2.6

  16. Understanding Challenges and Opportunities in Body Movement Education of People who are Blind or have Low Vision

    Authors: Madhuka Thisuri De Silva, Sarah Goodwin, Leona M Holloway, Matthew Butler

    Abstract: Actively participating in body movement such as dance, sports, and fitness activities is challenging for people who are blind or have low vision (BLV). Teachers primarily rely on verbal instructions and physical demonstrations with limited accessibility. Recent work shows that technology can support body movement education for BLV people. However, there is limited involvement with the BLV communit…

    Submitted 30 September, 2024; originally announced September 2024.

  17. arXiv:2407.17486  [pdf, other]

    cs.CV cs.LG

    Learning from Memory: Non-Parametric Memory Augmented Self-Supervised Learning of Visual Features

    Authors: Thalles Silva, Helio Pedrini, Adín Ramírez Rivera

    Abstract: This paper introduces a novel approach to improving the training stability of self-supervised learning (SSL) methods by leveraging a non-parametric memory of seen concepts. The proposed method involves augmenting a neural network with a memory component to stochastically compare current image views with previously encountered concepts. Additionally, we introduce stochastic memory blocks to regular…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: To appear in ICML 2024. Code at https://github.com/sthalles/MaSSL

  18. arXiv:2406.03288  [pdf, other]

    cs.LG stat.ML

    Embarrassingly Parallel GFlowNets

    Authors: Tiago da Silva, Luiz Max Carvalho, Amauri Souza, Samuel Kaski, Diego Mesquita

    Abstract: GFlowNets are a promising alternative to MCMC sampling for discrete compositional random variables. Training GFlowNets requires repeated evaluations of the unnormalized target distribution or reward function. However, for large-scale posterior sampling, this may be prohibitive since it incurs traversing the data several times. Moreover, if the data are distributed across clients, employing standar…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  19. arXiv:2405.20670  [pdf]

    cs.DL

    Twitter should now be referred to as X: How academics, journals and publishers need to make the nomenclatural transition

    Authors: Jaime A. Teixeira da Silva, Serhii Nazarovets

    Abstract: Here, we note how academics, journals and publishers should no longer refer to the social media platform Twitter as such, rather as X. Relying on Google Scholar, we found 16 examples of papers published in the last months of 2023 - essentially during the transition period between Twitter and X - that used Twitter and X, but in different ways. Unlike that transition period in which the binary Twitt…

    Submitted 31 May, 2024; originally announced May 2024.

  20. arXiv:2405.13957  [pdf, other]

    cs.LG

    Exploring the Relationship Between Feature Attribution Methods and Model Performance

    Authors: Priscylla Silva, Claudio T. Silva, Luis Gustavo Nonato

    Abstract: Machine learning and deep learning models are pivotal in educational contexts, particularly in predicting student success. Despite their widespread application, a significant gap persists in comprehending the factors influencing these models' predictions, especially in explainability within education. This work addresses this gap by employing nine distinct explanation methods and conducting a comp…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: AAAI2024 Workshop on AI for Education - Bridging Innovation and Responsibility

  21. T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

    Authors: Evandro S. Ortigossa, Fábio F. Dias, Brian Barr, Claudio T. Silva, Luis Gustavo Nonato

    Abstract: The development of machine learning applications has increased significantly in recent years, motivated by the remarkable ability of learning-powered systems to discover and generalize intricate patterns hidden in massive datasets. Modern learning models, while powerful, often exhibit a complexity level that renders them opaque black boxes, lacking transparency and hindering our understanding of t…

    Submitted 24 April, 2025; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Copyright 2025 IEEE. All rights reserved, including rights for text, data mining and training of artificial intelligence and similar technologies. Personal use is permitted, but republication/redistribution requires IEEE permission. Article accepted for publication in IEEE Intelligent Systems. This author's version includes the supplementary material. Content may change prior to final publication

  22. arXiv:2404.13219  [pdf, other]

    cs.SI cs.CY

    Bubble reachers and uncivil discourse in polarized online public sphere

    Authors: Jordan K Kobellarz, Milos Brocic, Daniel Silver, Thiago H Silva

    Abstract: Early optimism saw possibilities for social media to renew democratic discourse, marked by hopes for individuals from diverse backgrounds to find opportunities to learn from and interact with others different from themselves. This optimism quickly waned as social media seemed to breed ideological homophily marked by "filter bubble" or "echo chambers." A typical response to the sense of fragmentati…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 41 pages, 5 figures

  23. arXiv:2404.08712  [pdf, other]

    econ.GN cs.LG physics.soc-ph

    Machine learning and economic forecasting: the role of international trade networks

    Authors: Thiago C. Silva, Paulo V. B. Wilhelm, Diego R. Amancio

    Abstract: This study examines the effects of de-globalization trends on international trade networks and their role in improving forecasts for economic growth. Using section-level trade data from nearly 200 countries from 2010 to 2022, we identify significant shifts in the network topology driven by rising trade policy uncertainty. Our analysis highlights key global players through centrality rankings, with…

    Submitted 11 April, 2024; originally announced April 2024.

  24. arXiv:2403.10304  [pdf, ps, other]

    cs.AI cs.DB

    KIF: A Wikidata-Based Framework for Integrating Heterogeneous Knowledge Sources

    Authors: Guilherme Lima, João M. B. Rodrigues, Marcelo Machado, Elton Soares, Sandro R. Fiorini, Raphael Thiago, Leonardo G. Azevedo, Viviane T. da Silva, Renato Cerqueira

    Abstract: We present a Wikidata-based framework, called KIF, for virtually integrating heterogeneous knowledge sources. KIF is written in Python and is released as open-source. It leverages Wikidata's data model and vocabulary plus user-defined mappings to construct a unified view of the underlying sources while keeping track of the context and provenance of their statements. The underlying sources can be t…

    Submitted 24 July, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  25. arXiv:2402.17905  [pdf, other]

    cs.LG cs.CY cs.SI

    Using Graph Neural Networks to Predict Local Culture

    Authors: Thiago H Silva, Daniel Silver

    Abstract: Urban research has long recognized that neighbourhoods are dynamic and relational. However, lack of data, methodologies, and computer processing power have hampered a formal quantitative examination of neighbourhood relational dynamics. To make progress on this issue, this study proposes a graph neural network (GNN) approach that permits combining and evaluating multiple sources of information abo…

    Submitted 22 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 14 pages, 5 figures

  26. arXiv:2402.17866  [pdf, other]

    cs.SI cs.CE

    Towards spatiotemporal integration of bus transit with data-driven approaches

    Authors: Júlio Borges, Altieris M. Peixoto, Thiago H. Silva, Anelise Munaretto, Ricardo Luders

    Abstract: This study aims to propose an approach for spatiotemporal integration of bus transit, which enables users to change bus lines by paying a single fare. This could increase bus transit efficiency and, consequently, help to make this mode of transportation more attractive. Usually, this strategy is allowed for a few hours in a non-restricted area; thus, certain walking distance areas behave like "vir…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 20 pages, 16 figures

  27. Reimagining TaxiVis through an Immersive Space-Time Cube metaphor and reflecting on potential benefits of Immersive Analytics for urban data exploration

    Authors: Jorge Wagner, Claudio T. Silva, Wolfgang Stuerzlinger, Luciana Nedel

    Abstract: Current visualization research has identified the potential of more immersive settings for data exploration, leveraging VR and AR technologies. To explore how a traditional visualization system could be adapted into an immersive framework, and how it could benefit from this, we decided to revisit a landmark paper presented ten years ago at IEEE VIS. TaxiVis, by Ferreira et al., enabled interactive…

    Submitted 23 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Published in the proceedings of the IEEE VR 2024 conference

    ACM Class: H.5.1

    Journal ref: 2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR), Orlando, FL, USA, 2024, pp. 827-838

  28. arXiv:2401.07356  [pdf, other]

    cs.SE

    BUGSPHP: A dataset for Automated Program Repair in PHP

    Authors: K. D. Pramod, W. T. N. De Silva, W. U. K. Thabrew, Ridwan Shariffdeen, Sandareka Wickramanayake

    Abstract: Automated Program Repair (APR) improves developer productivity by saving debugging and bug-fixing time. While APR has been extensively explored for C/C++ and Java programs, there is little research on bugs in PHP programs due to the lack of a benchmark PHP bug dataset. This is surprising given that PHP has been one of the most widely used server-side languages for over two decades, being used in a…

    Submitted 21 January, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  29. arXiv:2311.05281  [pdf, other]

    cs.CR cs.SE

    Finding Software Vulnerabilities in Open-Source C Projects via Bounded Model Checking

    Authors: Janislley Oliveira de Sousa, Bruno Carvalho de Farias, Thales Araujo da Silva, Eddie Batista de Lima Filho, Lucas C. Cordeiro

    Abstract: Computer-based systems have solved several domain problems, including industrial, military, education, and wearable. Nevertheless, such arrangements need high-quality software to guarantee security and safety as both are mandatory for modern software products. We advocate that bounded model-checking techniques can efficiently detect vulnerabilities in general software systems. However, such an app…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 27 pages, submitted to STTT journal

  30. arXiv:2310.12692  [pdf, other]

    cs.CV cs.LG

    Representation Learning via Consistent Assignment of Views over Random Partitions

    Authors: Thalles Silva, Adín Ramírez Rivera

    Abstract: We present Consistent Assignment of Views over Random Partitions (CARP), a self-supervised clustering method for representation learning of visual features. CARP learns prototypes in an end-to-end online fashion using gradient descent without additional non-differentiable modules to solve the cluster assignment problem. CARP optimizes a new pretext task based on random partitions of prototypes tha…

    Submitted 27 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: To appear in NeurIPS 2023. Code available at https://github.com/sthalles/carp

  31. arXiv:2310.00527  [pdf, other]

    cs.CV

    Self-supervised Learning of Contextualized Local Visual Embeddings

    Authors: Thalles Santos Silva, Helio Pedrini, Adín Ramírez Rivera

    Abstract: We present Contextualized Local Visual Embeddings (CLoVE), a self-supervised convolutional-based method that learns representations suited for dense prediction tasks. CLoVE deviates from current methods and optimizes a single loss function that operates at the level of contextualized local embeddings learned from output feature maps of convolution neural network (CNN) encoders. To learn contextual…

    Submitted 4 October, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: Pre-print. 4th Visual Inductive Priors for Data-Efficient Deep Learning Workshop ICCV 2023. Code at https://github.com/sthalles/CLoVE

    ACM Class: I.4.6; I.4.7

    Journal ref: 4th Visual Inductive Priors for Data-Efficient Deep Learning Workshop ICCV 2023

  32. arXiv:2309.13494  [pdf, other]

    cs.RO cs.MA

    Communication-Constrained Multi-Robot Exploration with Intermittent Rendezvous

    Authors: Alysson Ribeiro da Silva, Luiz Chaimowicz, Thales Costa Silva, Ani Hsieh

    Abstract: Communication constraints can significantly impact robots' ability to share information, coordinate their movements, and synchronize their actions, thus limiting coordination in Multi-Robot Exploration (MRE) applications. In this work, we address these challenges by modeling the MRE application as a DEC-POMDP and designing a joint policy that follows a rendezvous plan. This policy allows robots to…

    Submitted 23 July, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: 7 pages, 12 figures, 1 table, video: https://youtu.be/EuVbCoyjuIY

  33. arXiv:2309.12032  [pdf, other]

    cs.LG stat.ML

    Human-in-the-Loop Causal Discovery under Latent Confounding using Ancestral GFlowNets

    Authors: Tiago da Silva, Eliezer Silva, António Góis, Dominik Heider, Samuel Kaski, Diego Mesquita, Adèle Ribeiro

    Abstract: Structure learning is the crux of causal inference. Notably, causal discovery (CD) algorithms are brittle when data is scarce, possibly inferring imprecise causal relations that contradict expert knowledge -- especially when considering latent confounders. To aggravate the issue, most CD methods do not provide uncertainty estimates, making it hard for users to interpret results and improve the inf…

    Submitted 1 November, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  34. arXiv:2309.04700  [pdf, other]

    cs.CR

    From Programming Bugs to Multimillion-Dollar Scams: An Analysis of Trapdoor Tokens on Uniswap

    Authors: Phuong Duy Huynh, Thisal De Silva, Son Hoang Dau, Xiaodong Li, Iqbal Gondal, Emanuele Viterbo

    Abstract: We investigate in this work a recently emerged type of scam ERC-20 token called Trapdoor, which has cost investors billions of US dollars on Uniswap, the largest decentralised exchange on Ethereum, from 2020 to 2023. In essence, Trapdoor tokens allow users to buy but preventing them from selling by embedding logical bugs and/or owner-only features in their smart contracts. By manually inspecting a…

    Submitted 19 December, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: 22 pages, 11 figures

  35. arXiv:2309.00172  [pdf, other]

    cs.AI

    Detecting Evidence of Organization in groups by Trajectories

    Authors: T. F. Silva, J. E. B. Maia

    Abstract: Effective detection of organizations is essential for fighting crime and maintaining public safety, especially considering the limited human resources and tools to deal with each group that exhibits co-movement patterns. This paper focuses on solving the Network Structure Inference (NSI) challenge. Thus, we introduce two new approaches to detect network structure inferences based on agent trajecto…

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 17 pages, 16 figures, 3 algorithms, 1 table

  36. arXiv:2308.00570  [pdf, other]

    cs.RO eess.SY

    Enhancing Sample Efficiency and Uncertainty Compensation in Learning-based Model Predictive Control for Aerial Robots

    Authors: Kong Yao Chee, Thales C. Silva, M. Ani Hsieh, George J. Pappas

    Abstract: The recent increase in data availability and reliability has led to a surge in the development of learning-based model predictive control (MPC) frameworks for robot systems. Despite attaining substantial performance improvements over their non-learning counterparts, many of these frameworks rely on an offline learning procedure to synthesize a dynamics model. This implies that uncertainties encoun…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 7 pages, 7 figures. Accepted for publication in the proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

  37. arXiv:2303.15931  [pdf]

    cs.RO

    FC Portugal 3D Simulation Team: Team Description Paper 2020

    Authors: Nuno Lau, Luis Paulo Reis, David Simoes, Mohammadreza Kasaei, Miguel Abreu, Tiago Silva, Francisco Resende

    Abstract: The FC Portugal 3D team is developed upon the structure of our previous Simulation league 2D/3D teams and our standard platform league team. Our research concerning the robot low-level skills is focused on developing behaviors that may be applied on real robots with minimal adaptation using model-based approaches. Our research on high-level soccer coordination methodologies and team playing is mai…

    Submitted 28 March, 2023; originally announced March 2023.

  38. arXiv:2302.08905  [pdf, other]

    cs.DL

    GraphLED: A graph-based approach to process and visualise linked engineering documents

    Authors: Vanessa Telles da Silva, Lucas de Angelo Martins Ribeiro, Willian Borges de Lemos, Sílvia Silva da Costa Botelho, Nelson Lopes Duarte Filho, Marcelo Rita Pias

    Abstract: The architecture, engineering and construction (AEC) sector extensively uses documents supporting product and process development. As part of this, organisations should handle big data of hundreds, or even thousands, of technical documents strongly linked together, including CAD design of industrial plants, equipment purchase orders, quality certificates, and part material analysis. However, analy…

    Submitted 15 February, 2023; originally announced February 2023.

  39. arXiv:2301.06168  [pdf, other]

    cs.DL

    Using citation networks to evaluate the impact of text length on the identification of relevant concepts

    Authors: Jorge A. V. Tohalino, Thiago C. Silva, Diego R. Amancio

    Abstract: The identification of the most significant concepts in unstructured data is of critical importance in various practical applications. Despite the large number of methods that have been put forth to extract the main topics of texts, a limited number of studies have probed the impact of the text length on the performance of keyword extraction (KE) methods. In this study, we adopted a network-based a…

    Submitted 15 January, 2023; originally announced January 2023.

  40. arXiv:2212.11447  [pdf, other]

    cs.RO

    Stochastic Nonlinear Ensemble Modeling and Control for Robot Team Environmental Monitoring

    Authors: Victoria Edwards, Thales C. Silva, M. Ani Hsieh

    Abstract: We seek methods to model, control, and analyze robot teams performing environmental monitoring tasks. During environmental monitoring, the goal is to have teams of robots collect various data throughout a fixed region for extended periods of time. Standard bottom-up task assignment methods do not scale as the number of robots and task locations increases and require computationally expensive repla…

    Submitted 21 December, 2022; originally announced December 2022.

  41. arXiv:2212.09816  [pdf, other]

    cs.RO

    Proportional Control for Stochastic Regulation on Allocation of Multi-Robots

    Authors: Thales C. Silva, Victoria Edwards, M. Ani Hsieh

    Abstract: Any strategy used to distribute a robot ensemble over a set of sequential tasks is subject to inaccuracy due to robot-level uncertainties and environmental influences on the robots' behavior. We approach the problem of inaccuracy during task allocation by modeling and controlling the overall ensemble behavior. Our model represents the allocation problem as a stochastic jump process and we regulate…

    Submitted 19 December, 2022; originally announced December 2022.

  42. arXiv:2212.09808  [pdf, other]

    cs.RO

    Receding Horizon Control on the Broadcast of Information in Stochastic Networks

    Authors: Thales C. Silva, Li Shen, Xi Yu, M. Ani Hsieh

    Abstract: This paper focuses on the broadcast of information on robot networks with stochastic network interconnection topologies. Problematic communication networks are almost unavoidable in areas where we wish to deploy multi-robotic systems, usually due to a lack of environmental consistency, accessibility, and structure. We tackle this problem by modeling the broadcast of information in a multi-robot co…

    Submitted 19 December, 2022; originally announced December 2022.

  43. arXiv:2211.01438  [pdf, other]

    eess.AS cs.CL cs.SD

    Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

    Authors: Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang

    Abstract: This work studies the use of attention masking in transformer transducer based speech recognition for building a single configurable model for different deployment scenarios. We present a comprehensive set of experiments comparing fixed masking, where the same attention mask is applied at every frame, with chunked masking, where the attention mask for each frame is determined by chunk boundaries,…

    Submitted 18 April, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: To appear in ICASSP 2023

    Journal ref: International Conference on Acoustics, Speech, and Signal Processing, 2023

  44. arXiv:2210.12214  [pdf, ps, other]

    cs.SD cs.CL eess.AS

    Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation

    Authors: Thien Nguyen, Nathalie Tran, Liuhui Deng, Thiago Fraga da Silva, Matthew Radzihovsky, Roger Hsiao, Henry Mason, Stefan Braun, Erik McDermott, Dogan Can, Pawel Swietojanski, Lyan Verwimp, Sibel Oyman, Tresi Arvizo, Honza Silovsky, Arnab Ghoshal, Mathieu Martel, Bharat Ram Ambati, Mohamed Ali

    Abstract: Code-switching describes the practice of using more than one language in the same sentence. In this study, we investigate how to optimize a neural transducer based bilingual automatic speech recognition (ASR) model for code-switching speech. Focusing on the scenario where the ASR model is trained without supervised code-switching data, we found that semi-supervised training and synthetic code-swit…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 5 pages, 1 figure, submitted to ICASSP 2023, *: equal contributions

  45. arXiv:2209.10665  [pdf]

    cs.SI

    Changing the Scene: applying four models of social evolution to the scenescape

    Authors: Daniel Silver, Thiago H Silva, Patrick Adler

    Abstract: This paper elaborates a multi-model approach to studying how local scenes change. We refer to this as the "4 D's" of scene change: development, differentiation, defense, and diffusion. Each posits somewhat distinct change processes, and has its own tradition of theory and empirical research, which we briefly review. After summarizing some major trends in scenes and amenities in the US context, for…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Published at Journal Wuhan University

    Journal ref: Journal Wuhan University 2022

  46. arXiv:2209.06034  [pdf]

    cs.SE cs.HC

    Assessing User Interface Design Artifacts: A Tool-Supported Behavior-Based Approach

    Authors: Thiago Rocha Silva, Marco Winckler

    Abstract: Behaviour-Driven Development (BDD) has emerged in the last years as a powerful methodology to specify testable and executable user requirements through stories and scenarios. With the support of external testing frameworks, BDD stories can be used to automatically assess the behavior of a fully functional software system. This article describes a toolset which extends BDD with the aim of providing…

    Submitted 13 September, 2022; originally announced September 2022.

  47. arXiv:2207.07035  [pdf, other]

    cs.SI

    Characterizing Nodes and Edges in Dynamic Attributed Networks: A Social-based Approach

    Authors: Thiago H. P. Silva, Alberto H. F. Laender, Pedro O. S. Vaz de Melo

    Abstract: How to characterize nodes and edges in dynamic attributed networks based on social aspects? We address this problem by exploring the strength of the ties between actors and their associated attributes over time, thus capturing the social roles of the actors and the meaning of their dynamic interactions in different social network scenarios. For this, we apply social concepts to promote a better un…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: 11 pages, 5 figures

  48. arXiv:2206.14097  [pdf, other]

    cs.IR

    Item Matching using Text Description and Similarity Search

    Authors: Ana Paula Appel, Anderson Luis de Paula Silva, Adriana Reigota Silva, Caique Dutra Santos, Thiago Logo da Silva, Rafael Poggi de Araujo, Luiz Carlos Faray de Aquino

    Abstract: In this paper, we focus on the problem of item matching using only the description. Those specific items not only lack a unique code but also contain short text descriptions, making the item matching process difficult. Our goal is to compare products using only the description provided by the purchase process. Therefore, evaluating other characteristics and differences can uncover possible flaws d…

    Submitted 1 July, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

  49. arXiv:2206.13677  [pdf, other]

    cs.CV cs.HC

    Towards Global-Scale Crowd+AI Techniques to Map and Assess Sidewalks for People with Disabilities

    Authors: Maryam Hosseini, Mikey Saugstad, Fabio Miranda, Andres Sevtsuk, Claudio T. Silva, Jon E. Froehlich

    Abstract: There is a lack of data on the location, condition, and accessibility of sidewalks across the world, which not only impacts where and how people travel but also fundamentally limits interactive mapping tools and urban analytics. In this paper, we describe initial work in semi-automatically building a sidewalk network topology from satellite imagery using hierarchical multi-scale attention models,…

    Submitted 18 August, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: CVPR 2022 AVA (Accessibility, Vision, and Autonomy Meet) Workshop

  50. arXiv:2206.03375  [pdf, other]

    quant-ph cs.CC math.CO

    Walking on Vertices and Edges by Continuous-Time Quantum Walk

    Authors: Caue F. T. Silva, Daniel Posner, Renato Portugal

    Abstract: The quantum walk dynamics obey the laws of quantum mechanics with an extra locality constraint, which demands that the evolution operator is local in the sense that the walker must visit the neighboring locations before endeavoring to distant places. Usually, the Hamiltonian is obtained from either the adjacency or the laplacian matrix of the graph and the walker hops from vertices to neighboring…

    Submitted 20 December, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 15 pages

    Journal ref: Quantum Information Processing, 22(2):93, Jan 2023