
Showing 1–50 of 99 results for author: da Silva, C

Searching in archive cs.
  1. arXiv:2506.11020  [pdf, other]

    cs.SE cs.AI

    Extracting Knowledge Graphs from User Stories using LangChain

    Authors: Thayná Camargo da Silva

    Abstract: This thesis introduces a novel methodology for the automated generation of knowledge graphs from user stories by leveraging the advanced capabilities of Large Language Models. Utilizing the LangChain framework as a basis, the User Story Graph Transformer module was developed to extract nodes and relationships from user stories using an LLM to construct accurate knowledge graphs. This innovative tec…

    Submitted 14 May, 2025; originally announced June 2025.

    Comments: Master thesis work

  2. arXiv:2506.05631  [pdf, ps, other]

    astro-ph.SR astro-ph.EP astro-ph.IM cs.LG

    The TESS Ten Thousand Catalog: 10,001 uniformly-vetted and -validated Eclipsing Binary Stars detected in Full-Frame Image data by machine learning and analyzed by citizen scientists

    Authors: Veselin B. Kostov, Brian P. Powell, Aline U. Fornear, Marco Z. Di Fraia, Robert Gagliano, Thomas L. Jacobs, Julien S. de Lambilly, Hugo A. Durantini Luca, Steven R. Majewski, Mark Omohundro, Jerome Orosz, Saul A. Rappaport, Ryan Salik, Donald Short, William Welsh, Svetoslav Alexandrov, Cledison Marcos da Silva, Erika Dunning, Gerd Guhne, Marc Huten, Michiharu Hyogo, Davide Iannone, Sam Lee, Christian Magliano, Manya Sharma, et al. (14 additional authors not shown)

    Abstract: The Transiting Exoplanet Survey Satellite (TESS) has surveyed nearly the entire sky in Full-Frame Image mode with a time resolution of 200 seconds to 30 minutes and a temporal baseline of at least 27 days. In addition to the primary goal of discovering new exoplanets, TESS is exceptionally capable at detecting variable stars, and in particular short-period eclipsing binaries which are relatively c…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 40 pages, 39 figures, 4 tables

  3. arXiv:2505.00787  [pdf, ps, other]

    cs.LG cs.AI

    Constructing an Optimal Behavior Basis for the Option Keyboard

    Authors: Lucas N. Alegre, Ana L. C. Bazzan, André Barreto, Bruno C. da Silva

    Abstract: Multi-task reinforcement learning aims to quickly identify solutions for new tasks with minimal or no additional interaction with the environment. Generalized Policy Improvement (GPI) addresses this by combining a set of base policies to produce a new one that is at least as good -- though not necessarily optimal -- as any individual base policy. Optimality can be ensured, particularly in the line…

    Submitted 1 May, 2025; originally announced May 2025.

    MSC Class: I.2

  4. arXiv:2503.12508  [pdf, other]

    cs.RO eess.SY

    Closed-Loop Control and Disturbance Mitigation of an Underwater Multi-Segment Continuum Manipulator

    Authors: Kyle L. Walker, Hsing-Yu Chen, Alix J. Partridge, Lucas Cruz da Silva, Adam A. Stokes, Francesco Giorgio-Serchi

    Abstract: The use of soft and compliant manipulators in marine environments represents a promising paradigm shift for subsea inspection, with devices better suited to tasks owing to their ability to safely conform to items during contact. However, limitations driven by material characteristics often restrict the reach of such devices, with the complexity of obtaining state estimations making control non-tri…

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: Accepted for presentation at RoboSoft 2025, Lausanne

  5. arXiv:2503.05895  [pdf, ps, other]

    cs.CC

    Minimum cost flow decomposition on arc-coloured networks

    Authors: Claudio Carvalho Neto, Ana Karolinna Maia, Cláudia Linhares Sales, Jonas Costa Ferreira da Silva

    Abstract: A network $\mathcal{N}$ is formed by a (multi)digraph $D$ together with a \emph{capacity function} $u : A(D) \to R_+$, and it is denoted by $\mathcal{N} = (D,u)$. A flow on $\mathcal{N}$ is a function $x: A(D) \to R_+$ such that $x(a) \leq u(a)$ for all $a \in A(D)$, and it is said to be $k$-splittable if it can be decomposed into up to $k$ paths. We say that a flow is $λ$-uniform if its value on…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 20 pages, 10 figures

  6. arXiv:2502.20021  [pdf, other]

    cs.CY cs.SE

    Systems-of-Systems for Environmental Sustainability: A Systematic Mapping Study

    Authors: Ana Clara Araújo Gomes da Silva, Gilmar Teixeira Junior, Lívia Mancine C. de Campos, Renato F. Bulcão-Neto, Valdemar Vicente Graciano Neto

    Abstract: Environmental sustainability in Systems-of-Systems (SoS) is an emerging field that seeks to integrate technological solutions to promote the efficient management of natural resources. While systematic reviews address sustainability in the context of Smart Cities (a category of SoS), a systematic study synthesizing the existing knowledge on environmental sustainability applied to SoS in general doe…

    Submitted 27 February, 2025; originally announced February 2025.

  7. arXiv:2501.04845  [pdf, ps, other]

    physics.ins-det cs.LG hep-ex nucl-ex

    Intelligent experiments through real-time AI: Fast Data Processing and Autonomous Detector Control for sPHENIX and future EIC detectors

    Authors: J. Kvapil, G. Borca-Tasciuc, H. Bossi, K. Chen, Y. Chen, Y. Corrales Morales, H. Da Costa, C. Da Silva, C. Dean, J. Durham, S. Fu, C. Hao, P. Harris, O. Hen, H. Jheng, Y. Lee, P. Li, X. Li, Y. Lin, M. X. Liu, V. Loncar, J. P. Mitrevski, A. Olvera, M. L. Purschke, J. S. Renck, et al. (8 additional authors not shown)

    Abstract: This R&D project, initiated by the DOE Nuclear Physics AI-Machine Learning initiative in 2022, leverages AI to address data processing challenges in high-energy nuclear experiments (RHIC, LHC, and future EIC). Our focus is on developing a demonstrator for real-time processing of high-rate data streams from sPHENIX experiment tracking detectors. The limitations of a 15 kHz maximum trigger rate imp…

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: proceedings for 42nd International Conference on High Energy Physics (ICHEP2024), 18-24 July 2024, Prague, Czech Republic

    Report number: LA-UR-24-30394

  8. An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging

    Authors: Públio Elon Correa da Silva, Jurandy Almeida

    Abstract: Deep learning (DL) technologies can transform agriculture by improving crop health monitoring and management, thus improving food safety. In this paper, we explore the potential of edge computing for real-time classification of leaf diseases using thermal imaging. We present a thermal image dataset for plant disease classification and evaluate deep learning models, including InceptionV3, MobileNet…

    Submitted 6 November, 2024; originally announced November 2024.

    Journal ref: IEEE Geoscience and Remote Sensing Letters (2024)

  9. arXiv:2410.21274  [pdf, other]

    cs.NE cs.DM math.OC

    High-level hybridization of heuristics and metaheuristics to solve symmetric TSP: a comparative study

    Authors: Carlos Alberto da Silva Junior, Roberto Yuji Tanaka, Luiz Carlos Farias da Silva, Angelo Passaro

    Abstract: The Travelling Salesman Problem - TSP is one of the most explored problems in the scientific literature to solve real problems regarding the economy, transportation, and logistics, to cite a few cases. Adapting TSP to solve different problems has originated several variants of the optimization problem with more complex objectives and different restrictions. Metaheuristics have been used to solve t…

    Submitted 28 October, 2024; originally announced October 2024.

  10. arXiv:2410.19193  [pdf, other]

    cs.CL cs.AI cs.LG cs.SI stat.ML

    Enriching GNNs with Text Contextual Representations for Detecting Disinformation Campaigns on Social Media

    Authors: Bruno Croso Cunha da Silva, Thomas Palmeira Ferraz, Roseli De Deus Lopes

    Abstract: Disinformation on social media poses both societal and technical challenges, requiring robust detection systems. While previous studies have integrated textual information into propagation networks, they have yet to fully leverage the advancements in Transformer-based language models for high-quality contextual text representations. This work addresses this gap by incorporating Transformer-based t…

    Submitted 22 November, 2024; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: Work still in progress. Accepted as Extended Abstract Poster at LoG Conference 2024

  11. arXiv:2410.17865  [pdf, other]

    cs.LG

    Population stratification for prediction of mortality in post-AKI patients

    Authors: Flavio S. Correa da Silva, Simon Sawhney

    Abstract: Acute kidney injury (AKI) is a serious clinical condition that affects up to 20% of hospitalised patients. AKI is associated with short term unplanned hospital readmission and post-discharge mortality risk. Patient risk and healthcare expenditures can be minimised by followup planning grounded on predictive models and machine learning. Since AKI is multi-factorial, predictive models specialised in…

    Submitted 23 October, 2024; originally announced October 2024.

  12. arXiv:2410.16331  [pdf, other]

    quant-ph cs.ET cs.LG

    Exploring Quantum Neural Networks for Demand Forecasting

    Authors: Gleydson Fernandes de Jesus, Maria Heloísa Fraga da Silva, Otto Menegasso Pires, Lucas Cruz da Silva, Clebson dos Santos Cruz, Valéria Loureiro da Silva

    Abstract: Forecasting demand for assets and services can be addressed in various markets, providing a competitive advantage when the predictive models used demonstrate high accuracy. However, the training of machine learning models incurs high computational costs, which may limit the training of prediction models based on available computational capacity. In this context, this paper presents an approach for…

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 22 pages, 13 figures, 10 tables

  13. arXiv:2410.02415  [pdf, other]

    eess.SY cs.NI

    Cellular Network Densification: a System-level Analysis with IAB, NCR and RIS

    Authors: Gabriel C. M. da Silva, Victor F. Monteiro, Diego A. Sousa, Darlan C. Moreira, Tarcisio F. Maciel, Fco. Rafael M. Lima, Behrooz Makki

    Abstract: As the number of user equipments increases in fifth generation (5G) and beyond, it is desired to densify the cellular network with auxiliary nodes assisting the base stations. Examples of these nodes are integrated access and backhaul (IAB) nodes, network-controlled repeaters (NCRs) and reconfigurable intelligent surfaces (RISs). In this context, this work presents a system level overview of these…

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Paper submitted to IEEE Systems Journal

  14. arXiv:2410.02172  [pdf, other]

    cs.LG cs.AI stat.ML

    Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation

    Authors: Shreyas Chaudhari, Ameet Deshpande, Bruno Castro da Silva, Philip S. Thomas

    Abstract: Evaluating policies using off-policy data is crucial for applying reinforcement learning to real-world problems such as healthcare and autonomous driving. Previous methods for off-policy evaluation (OPE) generally suffer from high variance or irreducible bias, leading to unacceptably high prediction errors. In this work, we introduce STAR, a framework for OPE that encompasses a broad range of esti…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted at the Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

  15. arXiv:2409.16218  [pdf, other]

    cs.LG cs.AI

    Problem-oriented AutoML in Clustering

    Authors: Matheus Camilo da Silva, Gabriel Marques Tavares, Eric Medvet, Sylvio Barbon Junior

    Abstract: The Problem-oriented AutoML in Clustering (PoAC) framework introduces a novel, flexible approach to automating clustering tasks by addressing the shortcomings of traditional AutoML solutions. Conventional methods often rely on predefined internal Clustering Validity Indexes (CVIs) and static meta-features, limiting their adaptability and effectiveness across diverse clustering tasks. In contrast,…

    Submitted 24 September, 2024; originally announced September 2024.

  16. arXiv:2409.11267  [pdf, other]

    eess.SY cs.AI cs.LG

    Integrating Reinforcement Learning and Model Predictive Control with Applications to Microgrids

    Authors: Caio Fabio Oliveira da Silva, Azita Dabiri, Bart De Schutter

    Abstract: This work proposes an approach that integrates reinforcement learning and model predictive control (MPC) to solve finite-horizon optimal control problems in mixed-logical dynamical systems efficiently. Optimization-based control of such systems with discrete and continuous decision variables entails the online solution of mixed-integer linear programs, which suffer from the curse of dimensionality…

    Submitted 14 April, 2025; v1 submitted 17 September, 2024; originally announced September 2024.

  17. arXiv:2409.04424  [pdf, other]

    eess.IV cs.CV cs.GR

    Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques

    Authors: Davide Clode da Silva, Marina Musse Bernardes, Nathalia Giacomini Ceretta, Gabriel Vaz de Souza, Gabriel Fonseca Silva, Rafael Heitor Bordini, Soraia Raupp Musse

    Abstract: Machine learning has significantly advanced healthcare by aiding in disease prevention and treatment identification. However, accessing patient data can be challenging due to privacy concerns and strict regulations. Generating synthetic, realistic data offers a potential solution for overcoming these limitations, and recent studies suggest that fine-tuning foundation models can produce such data e…

    Submitted 6 September, 2024; originally announced September 2024.

  18. arXiv:2408.13084  [pdf, other]

    cs.HC cs.AI

    Avatar Visual Similarity for Social HCI: Increasing Self-Awareness

    Authors: Bernhard Hilpert, Claudio Alves da Silva, Leon Christidis, Chirag Bhuvaneshwara, Patrick Gebhard, Fabrizio Nunnari, Dimitra Tsovaltzi

    Abstract: Self-awareness is a critical factor in social human-human interaction and, hence, in social HCI interaction. Increasing self-awareness through mirrors or video recordings is common in face-to-face trainings, since it influences antecedents of self-awareness like explicit identification and implicit affective identification (affinity). However, increasing self-awareness has been scarcely examined i…

    Submitted 23 August, 2024; originally announced August 2024.

  19. arXiv:2407.19051  [pdf, other]

    cs.NI cs.AI

    Towards a Transformer-Based Pre-trained Model for IoT Traffic Classification

    Authors: Bruna Bazaluk, Mosab Hamdan, Mustafa Ghaleb, Mohammed S. M. Gismalla, Flavio S. Correa da Silva, Daniel Macêdo Batista

    Abstract: The classification of IoT traffic is important to improve the efficiency and security of IoT-based networks. As the state-of-the-art classification methods are based on Deep Learning, most of the current results require a large amount of data to be trained. Thereby, in real-life situations, where there is a scarce amount of IoT traffic data, the models would not perform so well. Consequently, thes…

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: Updated version of: B. Bazaluk, M. Hamdan, M. Ghaleb, M. S. M. Gismalla, F. S. Correa da Silva and D. M. Batista, "Towards a Transformer-Based Pre-trained Model for IoT Traffic Classification," NOMS 2024-2024 IEEE Network Operations and Management Symposium, Seoul, Korea, Republic of, 2024, pp. 1-7, doi: 10.1109/NOMS59830.2024.10575448

  20. arXiv:2407.02669  [pdf, other]

    cs.NI eess.SY

    Impact of Network Deployment on the Performance of NCR-assisted Networks

    Authors: Gabriel C. M. da Silva, Diego A. Sousa, Victor F. Monteiro, Darlan C. Moreira, Tarcisio F. Maciel, Fco. Rafael M. Lima, Behrooz Makki

    Abstract: To address the need of coverage enhancement in the fifth generation (5G) of wireless cellular telecommunications, while taking into account possible bottlenecks related to deploying fiber based backhaul (e.g., required cost and time), the 3rd generation partnership project (3GPP) proposed in Release 18 the concept of network-controlled repeaters (NCRs). NCRs enhance previous radio frequency (RF) r…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Paper accepted for publication in the conference proceedings of "19th International Symposium on Wireless Communication Systems" (ISWCS)

  21. arXiv:2406.16241  [pdf, other]

    cs.LG stat.ME

    Position: Benchmarking is Limited in Reinforcement Learning Research

    Authors: Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas

    Abstract: Novel reinforcement learning algorithms, or improvements on existing ones, are commonly justified by evaluating their performance on benchmark environments and are compared to an ever-changing set of standard algorithms. However, despite numerous calls for improvements, experimental practices continue to produce misleading or unsupported claims. One reason for the ongoing substandard practices is…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 19 pages, 13 figures, The Forty-first International Conference on Machine Learning (ICML 2024)

  22. arXiv:2406.04377  [pdf, other]

    eess.IV cs.LG

    Combining Graph Neural Network and Mamba to Capture Local and Global Tissue Spatial Relationships in Whole Slide Images

    Authors: Ruiwen Ding, Kha-Dinh Luong, Erika Rodriguez, Ana Cristina Araujo Lemos da Silva, William Hsu

    Abstract: In computational pathology, extracting spatial features from gigapixel whole slide images (WSIs) is a fundamental task, but due to their large size, WSIs are typically segmented into smaller tiles. A critical aspect of this analysis is aggregating information from these tiles to make predictions at the WSI level. We introduce a model that combines a message-passing graph neural network (GNN) with…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  23. A Modular, Tendon Driven Variable Stiffness Manipulator with Internal Routing for Improved Stability and Increased Payload Capacity

    Authors: Kyle L. Walker, Alix J. Partridge, Hsing-Yu Chen, Rahul R. Ramachandran, Adam A. Stokes, Kenjiro Tadakuma, Lucas Cruz da Silva, Francesco Giorgio-Serchi

    Abstract: Stability and reliable operation under a spectrum of environmental conditions is still an open challenge for soft and continuum style manipulators. The inability to carry sufficient load and effectively reject external disturbances are two drawbacks which limit the scale of continuum designs, preventing widespread adoption of this technology. To tackle these problems, this work details the design…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: To be presented at ICRA 2024, Yokohama, Japan. 6 pages

  24. arXiv:2404.08555  [pdf, other]

    cs.LG cs.AI cs.CL

    RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

    Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

    Abstract: State-of-the-art large language models (LLMs) have become indispensable tools for various tasks. However, training LLMs to serve as effective assistants for humans requires careful consideration. A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hal…

    Submitted 15 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  25. arXiv:2403.06164  [pdf, other]

    cs.CV

    Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation

    Authors: Paweł A. Pierzchlewicz, Caio O. da Silva, R. James Cotton, Fabian H. Sinz

    Abstract: Single camera 3D pose estimation is an ill-defined problem due to inherent ambiguities from depth, occlusion or keypoint noise. Multi-hypothesis pose estimation accounts for this uncertainty by providing multiple 3D poses consistent with the 2D measurements. Current research has predominantly concentrated on generating multiple hypotheses for single frame static pose estimation or single hypothesi…

    Submitted 27 September, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  26. arXiv:2402.16968  [pdf, ps, other]

    cs.CR cs.AI

    A Survey of Large Language Models in Cybersecurity

    Authors: Gabriel de Jesus Coelho da Silva, Carlos Becker Westphall

    Abstract: Large Language Models (LLMs) have quickly risen to prominence due to their ability to perform at or close to the state-of-the-art in a variety of fields while handling natural language. An important field of research is the application of such models in the cybersecurity context. This survey aims to identify where in the field of cybersecurity LLMs have already been applied, the ways in which they…

    Submitted 26 February, 2024; originally announced February 2024.

  27. arXiv:2401.16182  [pdf, other]

    cs.CL cs.AI

    LLaMandement: Large Language Models for Summarization of French Legislative Proposals

    Authors: Joseph Gesnouin, Yannis Tannier, Christophe Gomes Da Silva, Hatim Tapory, Camille Brier, Hugo Simon, Raphael Rozenberg, Hermann Woehrel, Mehdi El Yakaabi, Thomas Binder, Guillaume Marie, Emilie Caron, Mathile Nogueira, Thomas Fontas, Laure Puydebois, Marie Theophile, Stephane Morandi, Mael Petit, David Creissac, Pauline Ennouchy, Elise Valetoux, Celine Visade, Severine Balloux, Emmanuel Cortes, Pierre-Etienne Devineau, et al. (3 additional authors not shown)

    Abstract: This report introduces LLaMandement, a state-of-the-art Large Language Model, fine-tuned by the French government and designed to enhance the efficiency and efficacy of processing parliamentary sessions (including the production of bench memoranda and documents required for interministerial meetings) by generating neutral summaries of legislative proposals. Addressing the administrative challenges…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 21 pages, 9 figures

  28. arXiv:2312.12972  [pdf, other]

    cs.LG

    From Past to Future: Rethinking Eligibility Traces

    Authors: Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva

    Abstract: In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit assignment to preceding states. From this investigation emerges the concept of a novel value function, which we refer to as the \emph{bidirectional value functio…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted in The 38th Annual AAAI Conference on Artificial Intelligence

  29. Deep encoder-decoder hierarchical convolutional neural networks for conjugate heat transfer surrogate modeling

    Authors: Takiah Ebbs-Picken, David A. Romero, Carlos M. Da Silva, Cristina H. Amon

    Abstract: Conjugate heat transfer (CHT) analyses are vital for the design of many energy systems. However, high-fidelity CHT numerical simulations are computationally intensive, which limits their applications such as design optimization, where hundreds to thousands of evaluations are required. In this work, we develop a modular deep encoder-decoder hierarchical (DeepEDH) convolutional neural network, a nov…

    Submitted 17 December, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: Revised version published in Applied Energy (https://doi.org/10.1016/j.apenergy.2024.123723)

    Journal ref: Applied Energy 372 (2024) 123723

  30. arXiv:2310.19007  [pdf, other]

    cs.LG

    Behavior Alignment via Reward Function Optimization

    Authors: Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva

    Abstract: Designing reward functions for efficiently guiding reinforcement learning (RL) agents toward specific behaviors is a complex task. This is challenging since it requires the identification of reward structures that are not sparse and that avoid inadvertently inducing undesirable behaviors. Naively modifying the reward structure to offer denser and more frequent feedback can lead to unintended outco…

    Submitted 31 October, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: (Spotlight) Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  31. arXiv:2309.00176  [pdf, other]

    cs.RO

    Parallel Distributional Prioritized Deep Reinforcement Learning for Unmanned Aerial Vehicles

    Authors: Alisson Henrique Kolling, Victor Augusto Kich, Junior Costa de Jesus, Andressa Cavalcante da Silva, Ricardo Bedin Grando, Paulo Lilles Jorge Drews-Jr, Daniel F. T. Gamarra

    Abstract: This work presents a study on parallel and distributional deep reinforcement learning applied to the mapless navigation of UAVs. For this, we developed an approach based on the Soft Actor-Critic method, producing a distributed and distributional variant named PDSAC, and compared it with a second one based on the traditional SAC algorithm. In addition, we also embodied a prioritized memory system i…

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 7 pages, 6 figures. Approved at LARS 2023

  32. arXiv:2307.10018  [pdf, other]

    cs.RO cs.AI

    RobôCIn Small Size League Extended Team Description Paper for RoboCup 2023

    Authors: Aline Lima de Oliveira, Cauê Addae da Silva Gomes, Cecília Virginia Santos da Silva, Charles Matheus de Sousa Alves, Danilo Andrade Martins de Souza, Driele Pires Ferreira Araújo Xavier, Edgleyson Pereira da Silva, Felipe Bezerra Martins, Lucas Henrique Cavalcanti Santos, Lucas Dias Maciel, Matheus Paixão Gumercindo dos Santos, Matheus Lafayette Vasconcelos, Matheus Vinícius Teotonio do Nascimento Andrade, João Guilherme Oliveira Carvalho de Melo, João Pedro Souza Pereira de Moura, José Ronald da Silva, José Victor Silva Cruz, Pedro Henrique Santana de Morais, Pedro Paulo Salman de Oliveira, Riei Joaquim Matos Rodrigues, Roberto Costa Fernandes, Ryan Vinicius Santos Morais, Tamara Mayara Ramos Teobaldo, Washington Igor dos Santos Silva, Edna Natividade Silva Barros

    Abstract: RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Ou…

    Submitted 19 July, 2023; originally announced July 2023.

  33. arXiv:2305.09838  [pdf, other]

    cs.LG cs.AI

    Coagent Networks: Generalized and Scaled

    Authors: James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas

    Abstract: Coagent networks for reinforcement learning (RL) [Thomas and Barto, 2011] provide a powerful and flexible framework for deriving principled learning rules for arbitrary stochastic neural networks. The coagent framework offers an alternative to backpropagation-based deep learning (BDL) that overcomes some of backpropagation's main limitations. For example, coagent networks can compute different par…

    Submitted 16 May, 2023; originally announced May 2023.

  34. arXiv:2301.11173  [pdf, other]

    cs.RO cs.AI

    Double Deep Reinforcement Learning Techniques for Low Dimensional Sensing Mapless Navigation of Terrestrial Mobile Robots

    Authors: Linda Dotto de Moraes, Victor Augusto Kich, Alisson Henrique Kolling, Jair Augusto Bottega, Raul Steinmetz, Emerson Cassiano da Silva, Ricardo Bedin Grando, Anselmo Rafael Cuckla, Daniel Fernando Tello Gamarra

    Abstract: In this work, we present two Deep Reinforcement Learning (Deep-RL) approaches to enhance the problem of mapless navigation for a terrestrial mobile robot. Our methodology focuses on comparing a Deep-RL technique based on the Deep Q-Network (DQN) algorithm with a second one based on the Double Deep Q-Network (DDQN) algorithm. We use 24 laser measurement samples and the relative position and angle of…

    Submitted 26 January, 2023; originally announced January 2023.

    Journal ref: International Conference on Intelligent Systems Design and Applications, 2022

  35. arXiv:2301.10330  [pdf, other]

    cs.LG cs.AI

    Off-Policy Evaluation for Action-Dependent Non-Stationary Environments

    Authors: Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno Castro da Silva, Emma Brunskil, Philip S. Thomas

    Abstract: Methods for sequential decision-making are often built upon a foundational assumption that the underlying decision process is stationary. This limits the application of such methods because real-world problems are often subject to changes due to external factors (passive non-stationarity), changes induced by interactions with the system itself (active non-stationarity), or both (hybrid non-station…

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: Accepted at Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  36. Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization

    Authors: Lucas N. Alegre, Ana L. C. Bazzan, Diederik M. Roijers, Ann Nowé, Bruno C. da Silva

    Abstract: Multi-objective reinforcement learning (MORL) algorithms tackle sequential decision problems where agents may have different preferences over (possibly conflicting) reward functions. Such algorithms often learn a set of policies (each optimized for a particular agent preference) that can later be used to solve problems with novel preferences. We introduce a novel algorithm that uses Generalized Po…

    Submitted 23 March, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Accepted to AAMAS 2023

  37. Extractive Text Summarization Using Generalized Additive Models with Interactions for Sentence Selection

    Authors: Vinícius Camargo da Silva, João Paulo Papa, Kelton Augusto Pontara da Costa

    Abstract: Automatic Text Summarization (ATS) is becoming relevant with the growth of textual data; however, with the popularization of public large-scale datasets, some recent machine learning approaches have focused on dense models and architectures that, despite producing notable results, usually result in models that are difficult to interpret. Given the challenge behind interpretable learning-based text summar…

    Submitted 20 December, 2022; originally announced December 2022.

  38. Adopting Microservices and DevOps in the Cyber-Physical Systems Domain: A Rapid Review and Case Study

    Authors: Jonas Fritzsch, Justus Bogner, Markus Haug, Ana Cristina Franco da Silva, Carolin Rubner, Matthias Saft, Horst Sauer, Stefan Wagner

    Abstract: The domain of cyber-physical systems (CPS) has recently seen strong growth, e.g., due to the rise of the Internet of Things (IoT) in industrial domains, commonly referred to as "Industry 4.0". However, CPS challenges like the strong hardware focus can impact modern software development practices, especially in the context of modernizing legacy systems. While microservices and DevOps have been wide…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 10 pages, 8 figures, accepted for publication at "Software: Practice and Experience - Wiley Online Library"

  39. arXiv:2208.14501  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    Model-Based Reinforcement Learning with SINDy

    Authors: Rushiv Arora, Bruno Castro da Silva, Eliot Moss

    Abstract: We draw on the latest advancements in the physics community to propose a novel method for discovering the governing non-linear dynamics of physical systems in reinforcement learning (RL). We establish that this method is capable of discovering the underlying dynamics using significantly fewer trajectories (as little as one rollout with $\leq 30$ time steps) than state of the art model learning alg…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 8 pages, 1 figure, 1 table, 1 algorithm, presented at the Decision Awareness in Reinforcement Learning workshop held at the International Conference on Machine Learning, 22 July 2022, Baltimore MD, USA

  40. arXiv:2208.11744  [pdf, other]

    cs.LG cs.AI cs.CY

    Enforcing Delayed-Impact Fairness Guarantees

    Authors: Aline Weber, Blossom Metevier, Yuriy Brun, Philip S. Thomas, Bruno Castro da Silva

    Abstract: Recent research has shown that seemingly fair machine learning models, when used to inform decisions that have an impact on peoples' lives or well-being (e.g., applications involving education, employment, and lending), can inadvertently increase social inequality in the long term. This is because prior fairness-aware algorithms only consider static fairness constraints, such as equal opportunity…

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 24 pages, 5 figures

  41. arXiv:2207.08007  [pdf, other]

    math.CO cs.DM

    A family of counterexamples for a conjecture of Berge on $α$-diperfect digraphs

    Authors: Caroline Aparecida de Paula Silva, Cândida Nunes da Silva, Orlando Lee

    Abstract: Let $D$ be a digraph. A stable set $S$ of $D$ and a path partition $\mathcal{P}$ of $D$ are orthogonal if every path $P \in \mathcal{P}$ contains exactly one vertex of $S$. In 1982, Berge defined the class of $α$-diperfect digraphs. A digraph $D$ is $α$-diperfect if for every maximum stable set $S$ of $D$ there is a path partition $\mathcal{P}$ of $D$ orthogonal to $S$ and this property holds for…

    Submitted 28 July, 2022; v1 submitted 16 July, 2022; originally announced July 2022.

  42. arXiv:2207.03225  [pdf, other]

    cs.SE cs.CR

    Towards Immediate Feedback for Security Relevant Code in Development Environments

    Authors: Markus Haug, Ana Cristina Franco Da Silva, Stefan Wagner

    Abstract: Nowadays, the correct use of cryptography libraries is essential to ensure the necessary information security in different kinds of applications. A common practice in software development is the use of static application security testing (SAST) tools to analyze code regarding security vulnerabilities. Most of these tools are designed to run separately from development environments. Their results a…

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: submitted to the 16th Symposium and Summer School On Service-Oriented Computing 2022

  43. Sequence-aware multimodal page classification of Brazilian legal documents

    Authors: Pedro H. Luz de Araujo, Ana Paula G. S. de Almeida, Fabricio A. Braz, Nilton C. da Silva, Flavio de Barros Vidal, Teofilo E. de Campos

    Abstract: The Brazilian Supreme Court receives tens of thousands of cases each semester. Court employees spend thousands of hours to execute the initial analysis and classification of those cases -- which takes effort away from posterior, more complex stages of the case management workflow. In this paper, we explore multimodal classification of documents from Brazil's Supreme Court. We train and evaluate ou…

    Submitted 15 July, 2022; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: 11 pages, 6 figures. This preprint, which was originally written on 8 April 2021, has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this article is published in the International Journal on Document Analysis and Recognition, and is available online at https://doi.org/10.1007/s10032-022-00406-7 and https://rdcu.be/cRvvV

    Journal ref: International Journal on Document Analysis and Recognition, 2022

  44. arXiv:2206.12293  [pdf, other]

    cs.CL

    Text and author-level political inference using heterogeneous knowledge representations

    Authors: Samuel Caetano da Silva, Ivandre Paraboni

    Abstract: The inference of politically-charged information from text data is a popular research topic in Natural Language Processing (NLP) at both text- and author-level. In recent years, studies of this kind have been implemented with the aid of representations from transformers such as BERT. Despite considerable success, however, we may ask whether results may be improved even further by combining transfo…

    Submitted 29 July, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

  45. arXiv:2206.11326  [pdf, other]

    cs.LG cs.AI

    Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer

    Authors: Lucas N. Alegre, Ana L. C. Bazzan, Bruno C. da Silva

    Abstract: In many real-world applications, reinforcement learning (RL) agents might have to solve multiple tasks, each one typically modeled via a reward function. If reward functions are expressed linearly, and the agent has previously learned a set of policies for different tasks, successor features (SFs) can be exploited to combine such policies and identify reasonable solutions for new problems. However…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: Proceedings of the 39th International Conference on Machine Learning (ICML'22)

  46. arXiv:2204.13857  [pdf]

    cs.CV

    Equine radiograph classification using deep convolutional neural networks

    Authors: Raniere Gaia Costa da Silva, Ambika Prasad Mishra, Christopher Riggs, Michael Doube

    Abstract: Purpose: To assess the capability of deep convolutional neural networks to classify anatomical location and projection from a series of 48 standard views of racehorse limbs. Materials and Methods: 9504 equine pre-import radiographs were used to train, validate, and test six deep learning architectures available as part of the open source machine learning framework PyTorch. Results: ResNet-34 a…

    Submitted 28 April, 2022; originally announced April 2022.

  47. arXiv:2204.03706  [pdf, other]

    cs.IR cs.LG

    Introducing a Framework and a Decision Protocol to Calibrate Recommender Systems

    Authors: Diego Corrêa da Silva, Frederico Araújo Durão

    Abstract: Recommender Systems use the user's profile to generate a recommendation list with unknown items to a target user. Although the primary goal of traditional recommendation systems is to deliver the most relevant items, such an effort unintentionally can cause collateral effects including low diversity and unbalanced genres or categories, benefiting particular groups of categories. This paper propose…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 12 Tables and 5 figures. Submitted to a journal

  48. arXiv:2203.12600  [pdf, other]

    q-fin.GN cs.CR cs.CY

    Standing Forest Coin (SFC)

    Authors: Marcelo de A. Borges, Guido L. de S. Filho, Cicero Inacio da Silva, Anderson M. P. Barros, Raul V. B. J. Britto, Nivaldo M. de C. Junior, Daniel F. L. de Souza

    Abstract: This article describes a proposal to create a digital currency that allows the decentralized collection of resources directed to initiatives and activities that aim to protect the Brazilian Amazon ecosystem by using blockchain and digital contracts. In addition to the digital currency, the goal is to design a smart contract based in oracles to ensure credibility and security for investors and dono…

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: in Portuguese

    MSC Class: 58-04 ACM Class: J.7

  49. arXiv:2112.13819  [pdf, other]

    cs.RO

    Trajectory Planning for Hybrid Unmanned Aerial Underwater Vehicles with Smooth Media Transition

    Authors: Pedro Miranda Pinheiro, Armando Alves Neto, Ricardo Bedin Grando, Cesar Bastos da Silva, Vivian Misaki Aoki, Dayana Cardoso, Alexandre Campos Horn, Paulo Lilles Jorge Drews-Jr

    Abstract: In the last decade, a great effort has been employed in the study of Hybrid Unmanned Aerial Underwater Vehicles, robots that can easily fly and dive into the water with different levels of mechanical adaptation. However, most of this literature is concentrated on physical design, practical issues of construction, and, more recently, low-level control strategies. Little has been done in the context…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: Accepted to the Journal of Intelligent & Robotic Systems

  50. arXiv:2112.13721  [pdf, other]

    math.NA cs.MS

    Variational symplectic diagonally implicit Runge-Kutta methods for isospectral systems

    Authors: Clauson Carvalho da Silva, Christian Lessig

    Abstract: Isospectral flows appear in a variety of applications, e.g. the Toda lattice in solid state physics or in discrete models for two-dimensional hydrodynamics, with the isospectral property often corresponding to mathematically or physically important conservation laws. Their most prominent feature, i.e. the conservation of the eigenvalues of the matrix state variable, should therefore be retained wh…

    Submitted 27 December, 2021; originally announced December 2021.

    MSC Class: 65L06; 65P10