Skip to main content

Showing 1–31 of 31 results for author: Silva, B

Searching in archive eess. Search in all archives.
.
  1. TerraTrace: Temporal Signature Land Use Mapping System

    Authors: Angela Busheska, Vikram Iyer, Bruno Silva, Peder Olsen, Ranveer Chandra, Vaishnavi Ranganathan

    Abstract: Understanding land use over time is critical to tracking events related to climate change, like deforestation. However, satellite-based remote sensing tools which are used for monitoring struggle to differentiate vegetation types in farms and orchards from forests. We observe that metrics such as the Normalized Difference Vegetation Index (NDVI), based on plant photosynthesis, have unique temporal… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  2. arXiv:2411.19032  [pdf, other

    eess.SP cs.NI

    Machine Learning for Spectrum Sharing: A Survey

    Authors: Francisco R. V. Guimarães, José Mairton B. da Silva Jr., Charles Casimiro Cavalcante, Gabor Fodor, Mats Bengtsson, Carlo Fischione

    Abstract: The 5th generation (5G) of wireless systems is being deployed with the aim to provide many sets of wireless communication services, such as low data rates for a massive amount of devices, broadband, low latency, and industrial wireless access. Such an aim is even more complex in the next generation wireless systems (6G) where wireless connectivity is expected to serve any connected intelligent uni… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

    Comments: Published at NOW Foundations and Trends in Networking

    Journal ref: Foundations and Trends in Networking: Vol. 14: No. 1-2, pp 1-159, 2024

  3. arXiv:2403.02043  [pdf, other

    eess.IV cs.CV

    Iterative Occlusion-Aware Light Field Depth Estimation using 4D Geometrical Cues

    Authors: Rui Lourenço, Lucas Thomaz, Eduardo A. B. Silva, Sergio M. M. Faria

    Abstract: Light field cameras and multi-camera arrays have emerged as promising solutions for accurately estimating depth by passively capturing light information. This is possible because the 3D information of a scene is embedded in the 4D light field geometry. Commonly, depth estimation methods extract this information relying on gradient information, heuristic-based optimisation models, or learning-based… ▽ More

    Submitted 14 May, 2025; v1 submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2306.12962  [pdf, other

    eess.SY cs.LG math.DS physics.comp-ph

    PyKoopman: A Python Package for Data-Driven Approximation of the Koopman Operator

    Authors: Shaowu Pan, Eurika Kaiser, Brian M. de Silva, J. Nathan Kutz, Steven L. Brunton

    Abstract: PyKoopman is a Python package for the data-driven approximation of the Koopman operator associated with a dynamical system. The Koopman operator is a principled linear embedding of nonlinear dynamics and facilitates the prediction, estimation, and control of strongly nonlinear dynamics using linear systems theory. In particular, PyKoopman provides tools for data-driven system identification for un… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 16 pages

  5. arXiv:2305.07511  [pdf, ps, other

    cs.LG cs.AI cs.CY eess.IV

    eXplainable Artificial Intelligence on Medical Images: A Survey

    Authors: Matteus Vargas Simão da Silva, Rodrigo Reis Arrais, Jhessica Victoria Santos da Silva, Felipe Souza Tânios, Mateus Antonio Chinelatto, Natalia Backhaus Pereira, Renata De Paris, Lucas Cesar Ferreira Domingos, Rodrigo Dória Villaça, Vitor Lopes Fabris, Nayara Rossi Brito da Silva, Ana Claudia Akemi Matsuki de Faria, Jose Victor Nogueira Alves da Silva, Fabiana Cristina Queiroz de Oliveira Marucci, Francisco Alves de Souza Neto, Danilo Xavier Silva, Vitor Yukio Kondo, Claudio Filipi Gonçalves dos Santos

    Abstract: Over the last few years, the number of works about deep learning applied to the medical field has increased enormously. The necessity of a rigorous assessment of these models is required to explain these results to all people involved in medical exams. A recent field in the machine learning area is explainable artificial intelligence, also known as XAI, which targets to explain the results of such… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  6. arXiv:2303.00577  [pdf, ps, other

    eess.SP cs.DC cs.NI

    Computing Functions Over-the-Air Using Digital Modulations

    Authors: Saeed Razavikia, Jose Mairton Barros da Silva Jr, Carlo Fischione

    Abstract: Over-the-air computation (AirComp) is a known technique in which wireless devices transmit values by analog amplitude modulation so that a function of these values is computed over the communication channel at a common receiver. The physical reason is the superposition properties of the electromagnetic waves, which naturally return sums of analog values. Consequently, the applications of AirComp a… ▽ More

    Submitted 20 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: submitted version to the IEEE ICC conference

  7. arXiv:2302.03022  [pdf, other

    cs.CV cs.RO eess.IV

    SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery

    Authors: Joao Cartucho, Alistair Weld, Samyakh Tukra, Haozheng Xu, Hiroki Matsuzaki, Taiyo Ishikawa, Minjun Kwon, Yong Eun Jang, Kwang-Ju Kim, Gwang Lee, Bizhe Bai, Lueder Kahrs, Lars Boecking, Simeon Allmendinger, Leopold Muller, Yitong Zhang, Yueming Jin, Sophia Bano, Francisco Vasconcelos, Wolfgang Reiter, Jonas Hajek, Bruno Silva, Estevao Lima, Joao L. Vilaca, Sandro Queiros , et al. (1 additional authors not shown)

    Abstract: This paper introduces the ``SurgT: Surgical Tracking" challenge which was organised in conjunction with MICCAI 2022. There were two purposes for the creation of this challenge: (1) the establishment of the first standardised benchmark for the research community to assess soft-tissue trackers; and (2) to encourage the development of unsupervised deep learning methods, given the lack of annotated da… ▽ More

    Submitted 30 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  8. Physical Layer Security Techniques Applied to Vehicle-to-Everything Networks

    Authors: Leonardo Barbosa da Silva, Evelio Martín Garcia Fernández, Ândrei Camponogara

    Abstract: Physical Layer Security (PLS) is an emerging concept in the field of secrecy for wireless communications that can be used alongside cryptography to prevent unauthorized devices from eavesdropping a legitimate transmission. It offers low computational cost and overhead by injecting an interfering signal in the wiretap channels of potential eavesdroppers. This paper discusses the benefits of the Art… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: 5 pages, 6 figures

  9. arXiv:2212.13950  [pdf, other

    cs.IT eess.SP

    Mixed Coherent and Non-Coherent Transmission for Multi-CPU Cell-Free Systems

    Authors: Roberto P. Antonioli, Iran M. Braga Jr., Gabor Fodor, Yuri C. B. Silva, Walter C. Freitas Jr

    Abstract: Existing works on cell-free systems consider either coherent or non-coherent downlink data transmission and a network deployment with a single central processing unit (CPU). While it is known that coherent transmission outperforms noncoherent transmission when assuming unlimited fronthaul links, the former requires a perfect timing synchronization, which is practically not viable over a large netw… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Submitted for possible publication in IEEE conference

  10. arXiv:2212.13804  [pdf, ps, other

    eess.SP

    A Distributed Game-Theoretic Solution for Power Management in the Uplink of Cell-Free Systems

    Authors: Juno V. Saraiva, Roberto P. Antonioli, Gábor Fodor, Walter C. Freitas Jr., Yuri C. B. Silva

    Abstract: This paper investigates cell-free massive multiple input multiple output systems with a particular focus on uplink power allocation. In these systems, uplink power control is highly non-trivial, since a single user terminal is associated with multiple intended receiving base stations. In addition, in cell-free systems, distributed power control schemes that address the inherent spectral and energy… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Accepted at IEEE Globecom 2022

  11. arXiv:2212.13798  [pdf, ps, other

    eess.SP

    Efficient Battery Usage in Wireless-Powered Cell-Free Systems with Self-Energy Recycling

    Authors: Iran M. Braga Jr., Roberto P. Antonioli, Gabor Fodor, Yuri C. B. Silva, Walter C. Freitas Jr

    Abstract: This paper investigates wireless-powered cell-free systems, in which the users send their uplink data signal while simultaneously harvesting energy from network nodes and user terminals - including the transmitting user terminal itself - by performing self-energy recycling. In this rather general setting, a closed-form lower bound of the amount of harvested energy and the achieved signal-to-interf… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Accepted as a correspondance at IEEE TVT

  12. arXiv:2211.04152  [pdf, other

    cs.LG eess.SP math.OC

    Federated Learning Using Three-Operator ADMM

    Authors: Shashi Kant, José Mairton B. da Silva Jr., Gabor Fodor, Bo Göransson, Mats Bengtsson, Carlo Fischione

    Abstract: Federated learning (FL) has emerged as an instance of distributed machine learning paradigm that avoids the transmission of data generated on the users' side. Although data are not transmitted, edge devices have to deal with limited communication bandwidths, data heterogeneity, and straggler effects due to the limited computational resources of users' devices. A prominent approach to overcome such… ▽ More

    Submitted 25 March, 2024; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: accepted to IEEE Journal of Selected Topics in Signal Processing, 2022

  13. arXiv:2210.17469  [pdf, ps, other

    cs.LG cs.DC eess.SP

    Blind Asynchronous Over-the-Air Federated Edge Learning

    Authors: Saeed Razavikia, Jaume Anguera Peris, Jose Mairton B. da Silva Jr, Carlo Fischione

    Abstract: Federated Edge Learning (FEEL) is a distributed machine learning technique where each device contributes to training a global inference model by independently performing local computations with their data. More recently, FEEL has been merged with over-the-air computation (OAC), where the global model is calculated over the air by leveraging the superposition of analog signals. However, when implem… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

  14. arXiv:2208.14501  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Model-Based Reinforcement Learning with SINDy

    Authors: Rushiv Arora, Bruno Castro da Silva, Eliot Moss

    Abstract: We draw on the latest advancements in the physics community to propose a novel method for discovering the governing non-linear dynamics of physical systems in reinforcement learning (RL). We establish that this method is capable of discovering the underlying dynamics using significantly fewer trajectories (as little as one rollout with $\leq 30$ time steps) than state of the art model learning alg… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 8 pages, 1 figure, 1 table, 1 algorithm, presented at the Decision Awareness in Reinforcement Learning workshop held at the International Conference on Machine Learning, 22 July 2022, Baltimore MD, USA

  15. arXiv:2111.08481  [pdf, other

    eess.SY cs.LG physics.flu-dyn

    PySINDy: A comprehensive Python package for robust sparse system identification

    Authors: Alan A. Kaptanoglu, Brian M. de Silva, Urban Fasel, Kadierdan Kaheman, Andy J. Goldschmidt, Jared L. Callaham, Charles B. Delahunt, Zachary G. Nicolaou, Kathleen Champion, Jean-Christophe Loiseau, J. Nathan Kutz, Steven L. Brunton

    Abstract: Automated data-driven modeling, the process of directly discovering the governing equations of a system from data, is increasingly being used across the scientific community. PySINDy is a Python package that provides tools for applying the sparse identification of nonlinear dynamics (SINDy) approach to data-driven model discovery. In this major update to PySINDy, we implement several advanced feat… ▽ More

    Submitted 25 January, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

  16. arXiv:2108.13109  [pdf, other

    physics.med-ph eess.IV

    Signal-carrying speckle in Optical Coherence Tomography: a methodological review on biomedical applications

    Authors: Vania Bastos Silva, Danilo Andrade De Jesus, Stefan Klein, Theo van Walsum, João Cardoso, Luisa Sánchez Brea, Pedro G. Vaz

    Abstract: Significance: Speckle has historically been considered a source of noise in coherent light imaging. However, a number of works in optical coherence tomography (OCT) imaging have shown that speckle patterns may contain relevant information regarding sub-resolution and structural properties of the tissues from which it is originated. Aim: The objective of this work is to provide a comprehensive ov… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

  17. arXiv:2106.11447  [pdf, other

    eess.IV cs.CV cs.LG

    Encoder-Decoder Architectures for Clinically Relevant Coronary Artery Segmentation

    Authors: João Lourenço Silva, Miguel Nobre Menezes, Tiago Rodrigues, Beatriz Silva, Fausto J. Pinto, Arlindo L. Oliveira

    Abstract: Coronary X-ray angiography is a crucial clinical procedure for the diagnosis and treatment of coronary artery disease, which accounts for roughly 16% of global deaths every year. However, the images acquired in these procedures have low resolution and poor contrast, making lesion detection and assessment challenging. Accurate coronary artery segmentation not only helps mitigate these problems, but… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  18. arXiv:2105.09128  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    XCycles Backprojection Acoustic Super-Resolution

    Authors: Feras Almasri, Jurgen Vandendriessche, Laurent Segers, Bruno da Silva, An Braeken, Kris Steenhaut, Abdellah Touhafi, Olivier Debeir

    Abstract: The computer vision community has paid much attention to the development of visible image super-resolution (SR) using deep neural networks (DNNs) and has achieved impressive results. The advancement of non-visible light sensors, such as acoustic imaging sensors, has attracted much attention, as they allow people to visualize the intensity of sound waves beyond the visible spectrum. However, becaus… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Journal ref: Sensors 2021, 21, 3453

  19. arXiv:2104.12749  [pdf, other

    eess.SP cs.IT

    Simultaneous Wireless Information and Power Transfer for Federated Learning

    Authors: José Mairton B. da Silva Jr., Konstantinos Ntougias, Ioannis Krikidis, Gábor Fodor, Carlo Fischione

    Abstract: In the Internet of Things, learning is one of most prominent tasks. In this paper, we consider an Internet of Things scenario where federated learning is used with simultaneous transmission of model data and wireless power. We investigate the trade-off between the number of communication rounds and communication round time while harvesting energy to compensate the energy expenditure. We formulate… ▽ More

    Submitted 21 July, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Accepted to appear in the IEEE International Workshop on Signal Processing Advances in Wireless Communications (SPAWC) in Lucca, Italy, Sep. 2021

  20. arXiv:2103.15968  [pdf, other

    eess.SP

    Joint Resource Allocation and Transceiver Design for Sum-Rate Maximization under Latency Constraints in Multicell MU-MIMO Systems

    Authors: Iran M. Braga Jr., Roberto P. Antonioli, Gabor Fodor, Yuri C. B. Silva, Carlos F. M. e Silva, Walter C. Freitas Jr

    Abstract: Due to the continuous advancements of orthogonal frequency division multiplexing (OFDM) and multiple antenna techniques, multiuser multiple input multiple output (MU-MIMO) OFDM is a key enabler of both fourth and fifth generation networks. In this paper, we consider the problem of weighted sum-rate maximization under latency constraints in finite buffer multicell MU-MIMO OFDM systems. Unlike previ… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted at IEEE Transactions on Communications

  21. arXiv:2102.13476  [pdf, other

    eess.SP cs.LG math.OC

    PySensors: A Python Package for Sparse Sensor Placement

    Authors: Brian M. de Silva, Krithika Manohar, Emily Clark, Bingni W. Brunton, Steven L. Brunton, J. Nathan Kutz

    Abstract: PySensors is a Python package for selecting and placing a sparse set of sensors for classification and reconstruction tasks. Specifically, PySensors implements algorithms for data-driven sparse sensor placement optimization for reconstruction (SSPOR) and sparse sensor placement optimization for classification (SSPOC). In this work we provide a brief description of the mathematical algorithms and t… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

  22. arXiv:2010.11317  [pdf, other

    cs.IT cs.NI eess.SP

    Full-Duplex and Dynamic-TDD: Pushing the Limits of Spectrum Reuse in Multi-Cell Communications

    Authors: José Mairton B. da Silva Jr., Gustav Wikström, Ratheesh K. Mungara, Carlo Fischione

    Abstract: Although in cellular networks full-duplex and dynamic time-division duplexing promise increased spectrum efficiency, their potential is so far challenged by increased interference. While previous studies have shown that self-interference can be suppressed to a sufficient level, we show that the cross-link interference for both duplexing modes, especially from base station to base station, is the r… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: 15 pages, 6 figures. Accepted to IEEE Wireless Communications - Special Issue on Full Duplex Communications Theory, Standardization and Practice

  23. arXiv:2008.13492  [pdf, other

    eess.SP cs.LG

    Wireless for Machine Learning

    Authors: Henrik Hellström, José Mairton B. da Silva Jr, Mohammad Mohammadi Amiri, Mingzhe Chen, Viktoria Fodor, H. Vincent Poor, Carlo Fischione

    Abstract: As data generation increasingly takes place on devices without a wired connection, machine learning (ML) related traffic will be ubiquitous in wireless networks. Many studies have shown that traditional wireless protocols are highly inefficient or unsustainable to support ML, which creates the need for new wireless communication methods. In this survey, we give an exhaustive review of the state-of… ▽ More

    Submitted 9 June, 2022; v1 submitted 31 August, 2020; originally announced August 2020.

  24. arXiv:2007.14863  [pdf, other

    cs.CV eess.IV

    Automatic Detection of Aedes aegypti Breeding Grounds Based on Deep Networks with Spatio-Temporal Consistency

    Authors: Wesley L. Passos, Gabriel M. Araujo, Amaro A. de Lima, Sergio L. Netto, Eduardo A. B. da Silva

    Abstract: Every year, the Aedes aegypti mosquito infects millions of people with diseases such as dengue, zika, chikungunya, and urban yellow fever. The main form to combat these diseases is to avoid mosquito reproduction by searching for and eliminating the potential mosquito breeding grounds. In this work, we introduce a comprehensive dataset of aerial videos, acquired with an unmanned aerial vehicle, con… ▽ More

    Submitted 27 November, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

  25. arXiv:2006.13380  [pdf, other

    eess.SY

    Physics-informed machine learning for sensor fault detection with flight test data

    Authors: Brian M. de Silva, Jared Callaham, Jonathan Jonker, Nicholas Goebel, Jennifer Klemisch, Darren McDonald, Nathan Hicks, J. Nathan Kutz, Steven L. Brunton, Aleksandr Y. Aravkin

    Abstract: We develop data-driven algorithms to fully automate sensor fault detection in systems governed by underlying physics. The proposed machine learning method uses a time series of typical behavior to approximate the evolution of measurements of interest by a linear time-invariant system. Given additional data from related sensors, a Kalman observer is used to maintain a separate real-time estimate of… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: 21 pages, 10 figures, submitted to AIAA

  26. Hippocampus Segmentation on Epilepsy and Alzheimer's Disease Studies with Multiple Convolutional Neural Networks

    Authors: Diedre Carmo, Bruna Silva, Clarissa Yasuda, Letícia Rittner, Roberto Lotufo

    Abstract: Hippocampus segmentation on magnetic resonance imaging is of key importance for the diagnosis, treatment decision and investigation of neuropsychiatric disorders. Automatic segmentation is an active research field, with many recent models using deep learning. Most current state-of-the art hippocampus segmentation methods train their methods on healthy or Alzheimer's disease patients from public da… ▽ More

    Submitted 10 February, 2021; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: Code is available at https://github.com/dscarmo/e2dhipseg Published in Heliyon: https://www.sciencedirect.com/science/article/pii/S2405844021003315

    Journal ref: Heliyon, Volume 7, Issue 2, 2021

  27. A Multistage Method for SCMA Codebook Design Based on MDS Codes

    Authors: Bruno Fontana da Silva, Danilo Silva, Bartolomeu F. Uchôa-Filho, Didier Le Ruyet

    Abstract: Sparse Code Multiple Access (SCMA) has been recently proposed for the future generation of wireless communication standards. SCMA system design involves specifying several parameters. In order to simplify the procedure, most works consider a multistage design approach. Two main stages are usually emphasized in these methods: sparse signatures design (equivalently, resource allocation) and codebook… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: Submitted to IEEE Wireless Communication Letters

  28. arXiv:1902.04487  [pdf, other

    eess.IV cs.CV cs.LG

    Extended 2D Consensus Hippocampus Segmentation

    Authors: Diedre Carmo, Bruna Silva, Clarissa Yasuda, Letícia Rittner, Roberto Lotufo

    Abstract: Hippocampus segmentation plays a key role in diagnosing various brain disorders such as Alzheimer's disease, epilepsy, multiple sclerosis, cancer, depression and others. Nowadays, segmentation is still mainly performed manually by specialists. Segmentation done by experts is considered to be a gold-standard when evaluating automated methods, buts it is a time consuming and arduos task, requiring s… ▽ More

    Submitted 13 May, 2020; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: This was published as an extended abstract in MIDL 2019 [arXiv:1907.08612]. An alpha version of the code is available at https://github.com/dscarmo/e2dhipseg. More experiments on improvements to the method and code are ongoing. Future updates are to be expected. A new, more complete paper is published in arXiv:2001.05058

    Report number: MIDL/2019/ExtendedAbstract/Sygx97DaKV

  29. arXiv:1801.03717  [pdf, ps, other

    cs.IT cs.NI eess.SP

    How to Split UL/DL Antennas in Full-Duplex Cellular Networks

    Authors: José Mairton B. da Silva Jr., Hadi Ghauch, Gábor Fodor, Carlo Fischione

    Abstract: To further improve the potential of full-duplex communications, networks may employ multiple antennas at the base station or user equipment. To this end, networks that employ current radios usually deal with self-interference and multi-user interference by beamforming techniques. Although previous works investigated beamforming design to improve spectral efficiency, the fundamental question of how… ▽ More

    Submitted 23 May, 2018; v1 submitted 11 January, 2018; originally announced January 2018.

    Comments: 7 pages, 4 figures. Accepted to IEEE ICC 2018 Workshop on Full-Duplex Communications for Future Wireless Networks

  30. arXiv:1712.00789  [pdf

    physics.med-ph cs.NE eess.IV

    Reconstruction of Electrical Impedance Tomography Using Fish School Search, Non-Blind Search, and Genetic Algorithm

    Authors: Valter Augusto de Freitas Barbosa, Reiga Ramalho Ribeiro, Allan Rivalles Souza Feitosa, Victor Luiz Bezerra Araújo da Silva, Arthur Diego Dias Rocha, Rafaela Covello de Freitas, Ricardo Emmanuel de Souza, Wellington Pinheiro dos Santos

    Abstract: Electrical Impedance Tomography (EIT) is a noninvasive imaging technique that does not use ionizing radiation, with application both in environmental sciences and in health. Image reconstruction is performed by solving an inverse problem and ill-posed. Evolutionary Computation and Swarm Intelligence have become a source of methods for solving inverse problems. Fish School Search (FSS) is a promisi… ▽ More

    Submitted 3 December, 2017; originally announced December 2017.

    Journal ref: International Journal of Swarm Intelligence Research, Volume 8, Issue 2, 2017

  31. arXiv:1711.09048  [pdf, other

    cs.AI cs.RO eess.SY

    A Compression-Inspired Framework for Macro Discovery

    Authors: Francisco M. Garcia, Bruno C. da Silva, Philip S. Thomas

    Abstract: In this paper we consider the problem of how a reinforcement learning agent tasked with solving a set of related Markov decision processes can use knowledge acquired early in its lifetime to improve its ability to more rapidly solve novel, but related, tasks. One way of exploiting this experience is by identifying recurrent patterns in trajectories obtained from well-performing policies. We propos… ▽ More

    Submitted 22 February, 2019; v1 submitted 24 November, 2017; originally announced November 2017.

    Comments: Accepted as Extended Abstract, AAMAS, 2019