Search | arXiv e-print repository

Cascaded Multiwire-PLC/Multiple-VLC System: Characterization and Performance

Authors: Hugerles S. Silva, Higo T. P. Silva, Paulo V. B. Tomé, Felipe A. P. Figueiredo, Edson P. da Silva, Rausley A. A. de Souza

Abstract: This paper proposes a cascaded multiwire-power line communication (PLC)/multiple-visible light communication (VLC) system. This hybrid architecture offers low installation cost, enhanced performance, practical feasibility, and a wide range of applications. Novel analytical expressions are derived for key statistics and outage probability, bit error probability, and ergodic channel capacity metrics… ▽ More This paper proposes a cascaded multiwire-power line communication (PLC)/multiple-visible light communication (VLC) system. This hybrid architecture offers low installation cost, enhanced performance, practical feasibility, and a wide range of applications. Novel analytical expressions are derived for key statistics and outage probability, bit error probability, and ergodic channel capacity metrics. Furthermore, the analytical results are validated through Monte Carlo simulations, with several performance curves presented under various channel and PLC/VLC system parameters. All expressions derived in this work are original and have not been previously published. Our proposed system proves feasible for smart environments, green communication systems, internet of things networks, industrial environments, and next-generation networks. △ Less

Submitted 3 June, 2025; originally announced June 2025.

arXiv:2505.17259 [pdf, ps, other]

Understanding the Algorithm Behind Audio Key Detection

Authors: Henrique Perez G. Silva

Abstract: The determination of musical key is a fundamental aspect of music theory and perception, providing a harmonic context for melodies and chord progressions. Automating this process, known as automatic key detection, is a significant task in the field of Music Information Retrieval (MIR). This article outlines an algorithmic methodology for estimating the musical key of an audio recording by analyzin… ▽ More The determination of musical key is a fundamental aspect of music theory and perception, providing a harmonic context for melodies and chord progressions. Automating this process, known as automatic key detection, is a significant task in the field of Music Information Retrieval (MIR). This article outlines an algorithmic methodology for estimating the musical key of an audio recording by analyzing its tonal content through digital signal processing techniques and comparison with theoretical key profiles. △ Less

Submitted 22 May, 2025; originally announced May 2025.

Comments: Preprint. Describes an algorithmic approach to musical key detection implemented in Python. Includes conceptual explanation of audio feature extraction and key profile matching

MSC Class: 94A12; 00A69; 68T10 ACM Class: H.5.5; I.5.4

arXiv:2503.15618 [pdf, other]

On the Secrecy Performance of $α$-$\mathcal{F}$ Channels with Pointing Errors

Authors: Gabriel M. C. Neves, Hugerles S. Silva, Higo T. P. Silva, Wamberto J. L. Queiroz, Felipe A. P. Figueiredo, Rausley A. A. de Souza

Abstract: This paper investigates the physical layer security (PLS) performance of $α$-$\mathcal{F}$ fading channels with pointing errors under passive and active eavesdropping scenarios. Novel analytical expressions are derived for key PLS metrics, including the probability of strictly positive secrecy capacity, the average secrecy capacity, and the secure outage probability. An asymptotic analysis is also… ▽ More This paper investigates the physical layer security (PLS) performance of $α$-$\mathcal{F}$ fading channels with pointing errors under passive and active eavesdropping scenarios. Novel analytical expressions are derived for key PLS metrics, including the probability of strictly positive secrecy capacity, the average secrecy capacity, and the secure outage probability. An asymptotic analysis is also investigated to provide further insights into the system behavior under high signal-to-noise ratio conditions. The analytical results are validated through Monte Carlo simulations, with several performance curves presented for a range of channel and system parameters. All expressions derived in this work are original and have not been previously published. △ Less

Submitted 19 March, 2025; originally announced March 2025.

arXiv:2502.00861 [pdf, other]

Multivariable Stochastic Newton-Based Extremum Seeking with Delays

Authors: Paulo Cesar Souza Silva, Paulo Cesar Pellanda, Tiago Roux Oliveira

Abstract: This paper presents a Newton-based stochastic extremum-seeking control method for real-time optimization in multi-input systems with distinct input delays. It combines predictor-based feedback and Hessian inverse estimation via stochastic perturbations to enable delay compensation with user-defined convergence rates. The method ensures exponential stability and convergence near the unknown extremu… ▽ More This paper presents a Newton-based stochastic extremum-seeking control method for real-time optimization in multi-input systems with distinct input delays. It combines predictor-based feedback and Hessian inverse estimation via stochastic perturbations to enable delay compensation with user-defined convergence rates. The method ensures exponential stability and convergence near the unknown extremum, even under long delays. It extends to multi-input, single-output systems with cross-coupled channels. Stability is analyzed using backstepping and infinite-dimensional averaging. Numerical simulations demonstrate its effectiveness in handling time-delayed channels, showcasing both the challenges and benefits of real-time optimization in distributed parameter settings. △ Less

Submitted 2 February, 2025; originally announced February 2025.

Comments: 28 pages, 13 figures

arXiv:2412.10956 [pdf, ps, other]

Iterative Detection and Decoding for Clustered Cell-Free Massive MIMO Networks

Authors: T. Ssettumba, S. Mashdour, L. Landau, P. da Silva, R. C. de Lamare

Abstract: In this letter, we propose an iterative soft interference cancellation scheme for intra-cluster (ICL) and out-of-cluster (OCL) interference mitigation in user-centric clustered cell-free massive multiple-antenna networks. We propose a minimum mean-square error receive filter with a novel modified parallel interference cancellation scheme to mitigate ICL and OCL interference. Unlike prior work, we… ▽ More In this letter, we propose an iterative soft interference cancellation scheme for intra-cluster (ICL) and out-of-cluster (OCL) interference mitigation in user-centric clustered cell-free massive multiple-antenna networks. We propose a minimum mean-square error receive filter with a novel modified parallel interference cancellation scheme to mitigate ICL and OCL interference. Unlike prior work, we model the OCL interference and devise a least squares estimator to perform OCL interference estimation. An iterative detection and decoding scheme that adopts low-density parity check codes and incorporates the OCL interference estimate is developed. Simulations assess the proposed scheme against existing techniques in terms of bit error rate performance. △ Less

Submitted 14 December, 2024; originally announced December 2024.

Comments: 4 figures, 6 pages

arXiv:2411.10580 [pdf, other]

Gradient-Based Stochastic Extremum-Seeking Control for Multivariable Systems with Distinct Input Delays

Authors: Paulo Cesar Souza Silva, Paulo Cesar Pellanda, Tiago Roux Oliveira

Abstract: This paper addresses the design and analysis of a multivariable gradient-based stochastic extremum-seeking control method for multi-input systems with arbitrary input delays. The approach accommodates systems with distinct time delays across input channels and achieves local exponential stability of the closed-loop system, guaranteeing convergence to a small neighborhood around the extremum point.… ▽ More This paper addresses the design and analysis of a multivariable gradient-based stochastic extremum-seeking control method for multi-input systems with arbitrary input delays. The approach accommodates systems with distinct time delays across input channels and achieves local exponential stability of the closed-loop system, guaranteeing convergence to a small neighborhood around the extremum point. By incorporating phase compensation for dither signals and a novel predictor-feedback mechanism with averaging-based estimates of the unknown gradient and Hessian, the proposed method overcomes traditional challenges associated with arbitrary, distinct input delays. Unlike previous work on deterministic multiparameter extremum-seeking with distinct input delays, this stability analysis is achieved without using backstepping transformations, simplifying the predictor design and enabling a more straightforward implementation. Specifically, the direct application of Artstein's reduction approach results in delay- and system-dimension-independent convergence rates, enhancing practical applicability. A numerical example illustrates the robust performance and advantages of the proposed delay-compensated stochastic extremum-seeking method. △ Less

Submitted 15 November, 2024; originally announced November 2024.

Comments: 8 pages, 8 figures

arXiv:2410.02770 [pdf, other]

Insightful Railway Track Evaluation: Leveraging NARX Feature Interpretation

Authors: P. H. O. Silva, A. S. Cerqueira, E. G. Nepomuceno

Abstract: The classification of time series is essential for extracting meaningful insights and aiding decision-making in engineering domains. Parametric modeling techniques like NARX are invaluable for comprehending intricate processes, such as environmental time series, owing to their easily interpretable and transparent structures. This article introduces a classification algorithm, Logistic-NARX Multino… ▽ More The classification of time series is essential for extracting meaningful insights and aiding decision-making in engineering domains. Parametric modeling techniques like NARX are invaluable for comprehending intricate processes, such as environmental time series, owing to their easily interpretable and transparent structures. This article introduces a classification algorithm, Logistic-NARX Multinomial, which merges the NARX methodology with logistic regression. This approach not only produces interpretable models but also effectively tackles challenges associated with multiclass classification. Furthermore, this study introduces an innovative methodology tailored for the railway sector, offering a tool by employing NARX models to interpret the multitude of features derived from onboard sensors. This solution provides profound insights through feature importance analysis, enabling informed decision-making regarding safety and maintenance. △ Less

Submitted 17 September, 2024; originally announced October 2024.

Comments: In English. CBA 2024 - XXV Brazilian Congress of Automation (CBA - XXV Congresso Brasileiro de Automática)

arXiv:2308.06309 [pdf, other]

Predicting Resilience with Neural Networks

Authors: Karen da Mata, Priscila Silva, Lance Fiondella

Abstract: Resilience engineering studies the ability of a system to survive and recover from disruptive events, which finds applications in several domains. Most studies emphasize resilience metrics to quantify system performance, whereas recent studies propose statistical modeling approaches to project system recovery time after degradation. Moreover, past studies are either performed on data after recover… ▽ More Resilience engineering studies the ability of a system to survive and recover from disruptive events, which finds applications in several domains. Most studies emphasize resilience metrics to quantify system performance, whereas recent studies propose statistical modeling approaches to project system recovery time after degradation. Moreover, past studies are either performed on data after recovering or limited to idealized trends. Therefore, this paper proposes three alternative neural network (NN) approaches including (i) Artificial Neural Networks, (ii) Recurrent Neural Networks, and (iii) Long-Short Term Memory (LSTM) to model and predict system performance, including negative and positive factors driving resilience to quantify the impact of disruptive events and restorative activities. Goodness-of-fit measures are computed to evaluate the models and compared with a classical statistical model, including mean squared error and adjusted R squared. Our results indicate that NN models outperformed the traditional model on all goodness-of-fit measures. More specifically, LSTMs achieved an over 60\% higher adjusted R squared, and decreased predictive error by 34-fold compared to the traditional method. These results suggest that NN models to predict resilience are both feasible and accurate and may find practical use in many important domains. △ Less

Submitted 11 August, 2023; originally announced August 2023.

arXiv:2307.01700 [pdf, other]

Data-driven load disturbance rejection

Authors: Róger W. P. da Silva, Diego Eckhard

Abstract: Data-driven direct methods are still growing in popularity almost three decades after they were introduced. These methods use data collected from the process to identify optimal controller's parameters with little knowledge about the process itself. However, most of those works focus on the problem of reference tracking, whereas many of the problems faced in real-life are of disturbance rejection… ▽ More Data-driven direct methods are still growing in popularity almost three decades after they were introduced. These methods use data collected from the process to identify optimal controller's parameters with little knowledge about the process itself. However, most of those works focus on the problem of reference tracking, whereas many of the problems faced in real-life are of disturbance rejection or attenuation. Also, the vastly majority of those works identify the parameters of linearly parametrized controllers, which amounts to fixing the poles of the controller's transfer function. Although the identification of the controller's poles is not prohibitive, as hinted by some of the papers, there is little effort on presenting a data-driven solution capable of doing so. With all that in mind, this work proposes a data-driven approach which is able to identify the zeros and the poles of a linear controller aiming at disturbance rejection. Two different one-step ahead predictors are proposed, one that is linear on the parameters and another that is non-linear. Also, two different techniques are employed to estimate the controller parameters, the first one minimizes the quadratic norm of the prediction error while the second one minimizes the correlation between the prediction error and an external signal. Simulations show the effectiveness of the proposed methods to estimate the optimal controller parameters of restricted order controllers aiming at disturbance rejection. △ Less

Submitted 4 July, 2023; originally announced July 2023.

arXiv:2305.19719 [pdf, other]

doi 10.1109/JIOT.2023.3310587

Low-Complexity Dynamic Directional Modulation: Vulnerability and Information Leakage

Authors: Pedro E. Gória Silva, Adam Narbudowicz, Nicola Marchetti, Pedro H. J. Nardelli, Rausley A. A. de Souza, Jules M. Moualeu

Abstract: In this paper, the privacy of wireless transmissions is improved through the use of an efficient technique termed dynamic directional modulation (DDM), and is subsequently assessed in terms of the measure of information leakage. Recently, a variation of DDM termed low-power dynamic directional modulation (LPDDM) has attracted significant attention as a prominent secure transmission method due to i… ▽ More In this paper, the privacy of wireless transmissions is improved through the use of an efficient technique termed dynamic directional modulation (DDM), and is subsequently assessed in terms of the measure of information leakage. Recently, a variation of DDM termed low-power dynamic directional modulation (LPDDM) has attracted significant attention as a prominent secure transmission method due to its ability to further improve the privacy of wireless communications. Roughly speaking, this modulation operates by randomly selecting the transmitting antenna from an antenna array whose radiation pattern is well known. Thereafter, the modulator adjusts the constellation phase so as to ensure that only the legitimate receiver recovers the information. To begin with, we highlight some privacy boundaries inherent to the underlying system. In addition, we propose features that the antenna array must meet in order to increase the privacy of a wireless communication system. Last, we adopt a uniform circular monopole antenna array with equiprobable transmitting antennas in order to assess the impact of DDM on the information leakage. It is shown that the bit error rate, while being a useful metric in the evaluation of wireless communication systems, does not provide the full information about the vulnerability of the underlying system. △ Less

Submitted 31 May, 2023; originally announced May 2023.

arXiv:2305.19710 [pdf, other]

doi 10.1109/MNET.2023.3329192

Semantic-Functional Communications in Cyber-Physical Systems

Authors: Pedro E. Goria Silva, Pedro H. J. Nardelli, Arthur S. de Sena, Harun Siljak, Niko Nevaranta, Nicola Marchetti, Rausley A. A. de Souza

Abstract: This paper explores the use of semantic knowledge inherent in the cyber-physical system (CPS) under study in order to minimize the use of explicit communication, which refers to the use of physical radio resources to transmit potentially informative data. It is assumed that the acquired data have a function in the system, usually related to its state estimation, which may trigger control actions.… ▽ More This paper explores the use of semantic knowledge inherent in the cyber-physical system (CPS) under study in order to minimize the use of explicit communication, which refers to the use of physical radio resources to transmit potentially informative data. It is assumed that the acquired data have a function in the system, usually related to its state estimation, which may trigger control actions. We propose that a semantic-functional approach can leverage the semantic-enabled implicit communication while guaranteeing that the system maintains functionality under the required performance. We illustrate the potential of this proposal through simulations of a swarm of drones jointly performing remote sensing in a given area. Our numerical results demonstrate that the proposed method offers the best design option regarding the ability to accomplish a previously established task -- remote sensing in the addressed case -- while minimising the use of radio resources by controlling the trade-offs that jointly determine the CPS performance and its effectiveness in the use of resources. In this sense, we establish a fundamental relationship between energy, communication, and functionality considering a given end application. △ Less

Submitted 31 May, 2023; originally announced May 2023.

arXiv:2305.19696 [pdf, other]

An Efficient Machine Learning-based Channel Prediction Technique for OFDM Sub-Bands

Authors: Pedro E. G. Silva, Jules M. Moualeu, Pedro H. Nardelli, Rausley A. A. de Souza

Abstract: The acquisition of accurate channel state information (CSI) is of utmost importance since it provides performance improvement of wireless communication systems. However, acquiring accurate CSI, which can be done through channel estimation or channel prediction, is an intricate task due to the complexity of the time-varying and frequency selectivity of the wireless environment. To this end, we prop… ▽ More The acquisition of accurate channel state information (CSI) is of utmost importance since it provides performance improvement of wireless communication systems. However, acquiring accurate CSI, which can be done through channel estimation or channel prediction, is an intricate task due to the complexity of the time-varying and frequency selectivity of the wireless environment. To this end, we propose an efficient machine learning (ML)-based technique for channel prediction in orthogonal frequency-division multiplexing (OFDM) sub-bands. The novelty of the proposed approach lies in the training of channel fading samples used to estimate future channel behaviour in selective fading. △ Less

Submitted 31 May, 2023; originally announced May 2023.

arXiv:2302.12620 [pdf, other]

Adaptive Turbo Equalization of Probabilistically Shaped Constellations

Authors: Edson Porto da Silva, Metodi Plamenov Yankov

Abstract: Fiber nonlinearity compensation of probabilistically shaped constellations with adaptive turbo equalization is investigated for the first time. Potential for more than 100% transmission reach extension is demonstrated by combining probabilistic shaping, single-channel digital backpropagation, and adaptive turbo equalization. Fiber nonlinearity compensation of probabilistically shaped constellations with adaptive turbo equalization is investigated for the first time. Potential for more than 100% transmission reach extension is demonstrated by combining probabilistic shaping, single-channel digital backpropagation, and adaptive turbo equalization. △ Less

Submitted 24 February, 2023; originally announced February 2023.

arXiv:2211.14372 [pdf, other]

Interpretability Analysis of Deep Models for COVID-19 Detection

Authors: Daniel Peixoto Pinto da Silva, Edresson Casanova, Lucas Rafael Stefanel Gris, Arnaldo Candido Junior, Marcelo Finger, Flaviane Svartman, Beatriz Raposo, Marcus Vinícius Moreira Martins, Sandra Maria Aluísio, Larissa Cristina Berti, João Paulo Teixeira

Abstract: During the outbreak of COVID-19 pandemic, several research areas joined efforts to mitigate the damages caused by SARS-CoV-2. In this paper we present an interpretability analysis of a convolutional neural network based model for COVID-19 detection in audios. We investigate which features are important for model decision process, investigating spectrograms, F0, F0 standard deviation, sex and age.… ▽ More During the outbreak of COVID-19 pandemic, several research areas joined efforts to mitigate the damages caused by SARS-CoV-2. In this paper we present an interpretability analysis of a convolutional neural network based model for COVID-19 detection in audios. We investigate which features are important for model decision process, investigating spectrograms, F0, F0 standard deviation, sex and age. Following, we analyse model decisions by generating heat maps for the trained models to capture their attention during the decision process. Focusing on a explainable Inteligence Artificial approach, we show that studied models can taken unbiased decisions even in the presence of spurious data in the training set, given the adequate preprocessing steps. Our best model has 94.44% of accuracy in detection, with results indicating that models favors spectrograms for the decision process, particularly, high energy areas in the spectrogram related to prosodic domains, while F0 also leads to efficient COVID-19 detection. △ Less

Submitted 25 November, 2022; originally announced November 2022.

Comments: 14 pages, 4 figures

arXiv:2204.03223 [pdf, other]

A novel semantic-functional approach for multiuser event-trigger communication

Authors: Pedro E. Gória Silva, Plínio S. Dester, Harun Siljak, Nicola Marchetti, Pedro H. J. Nardelli, Rausley A. A. de Souza

Abstract: This work introduces a new perspective for physical media sharing in multiuser communication by jointly considering (i) the meaning of the transmitted message and (ii) its function at the end user. Specifically, we have defined a scenario where multiple users (sensors) are continuously transmitting their own states concerning a predetermined event. On the receiver side there is an alarm monitoring… ▽ More This work introduces a new perspective for physical media sharing in multiuser communication by jointly considering (i) the meaning of the transmitted message and (ii) its function at the end user. Specifically, we have defined a scenario where multiple users (sensors) are continuously transmitting their own states concerning a predetermined event. On the receiver side there is an alarm monitoring system, whose function is to decide whether such a predetermined event has happened in a certain time period and, if yes, in which user. The media access control protocol proposed constitutes an alternative approach to the conventional physical layer methods, because the receiver does not decode the received waveform directly; rather, the relative position of the absence or presence of energy within a multidimensional resource space carries the (semantic) information. The protocol introduced here provides high efficiency in multiuser networks that operate with event-triggered sampling by enabling a constructive reconstruction of transmission collisions. We have demonstrated that the proposed method leads to a better event transmission efficiency than conventional methods like TDMA and slotted ALOHA. Remarkably, the proposed method achieves 100\% efficiency and 0\% error probability in almost all the studied cases, while consistently outperforming TDMA and slotted ALOHA. △ Less

Submitted 3 May, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

arXiv:2110.15731 [pdf, other]

CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese

Authors: Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio

Abstract: Automatic Speech recognition (ASR) is a complex and challenging task. In recent years, there have been significant advances in the area. In particular, for the Brazilian Portuguese (BP) language, there were about 376 hours public available for ASR task until the second half of 2020. With the release of new datasets in early 2021, this number increased to 574 hours. The existing resources, however,… ▽ More Automatic Speech recognition (ASR) is a complex and challenging task. In recent years, there have been significant advances in the area. In particular, for the Brazilian Portuguese (BP) language, there were about 376 hours public available for ASR task until the second half of 2020. With the release of new datasets in early 2021, this number increased to 574 hours. The existing resources, however, are composed of audios containing only read and prepared speech. There is a lack of datasets including spontaneous speech, which are essential in different ASR applications. This paper presents CORAA (Corpus of Annotated Audios) v1. with 290.77 hours, a publicly available dataset for ASR in BP containing validated pairs (audio-transcription). CORAA also contains European Portuguese audios (4.69 hours). We also present a public ASR model based on Wav2Vec 2.0 XLSR-53 and fine-tuned over CORAA. Our model achieved a Word Error Rate of 24.18% on CORAA test set and 20.08% on Common Voice test set. When measuring the Character Error Rate, we obtained 11.02% and 6.34% for CORAA and Common Voice, respectively. CORAA corpora were assembled to both improve ASR models in BP with phenomena from spontaneous speech and motivate young researchers to start their studies on ASR for Portuguese. All the corpora are publicly available at https://github.com/nilc-nlp/CORAA under the CC BY-NC-ND 4.0 license. △ Less

Submitted 18 November, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

Comments: This paper is under consideration at Language Resources and Evaluation (LREV)

arXiv:2109.02148 [pdf, other]

Adaptive Turbo Equalization for Nonlinearity Compensation in WDM Systems

Authors: Edson Porto da Silva, Metodi Plamenov Yankov

Abstract: In this paper, the performance of adaptive turbo equalization for nonlinearity compensation (NLC) is investigated. A turbo equalization scheme is proposed where a recursive least-squares (RLS) algorithm is used as an adaptive channel estimator to track the time-varying intersymbol interference (ISI) coefficients associated with inter-channel nonlinear interference (NLI) model. The estimated channe… ▽ More In this paper, the performance of adaptive turbo equalization for nonlinearity compensation (NLC) is investigated. A turbo equalization scheme is proposed where a recursive least-squares (RLS) algorithm is used as an adaptive channel estimator to track the time-varying intersymbol interference (ISI) coefficients associated with inter-channel nonlinear interference (NLI) model. The estimated channel coefficients are used by a MIMO 2x2 soft-input soft-output (SISO) linear minimum mean square error (LMMSE) equalizer to compensate for the time-varying ISI. The SISO LMMSE equalizer and the SISO forward error correction (FEC) decoder exchange extrinsic information in every turbo iteration, allowing the receiver to improve the performance of the channel estimation and the equalization, achieving lower bit-error-rate (BER) values. The proposed scheme is investigated for polarization multiplexed 64QAM and 256QAM, although it applies to any proper modulation format. Extensive numerical results are presented. It is shown that the scheme allows up to 0.7 dB extra gain in effectively received signal-to-noise ratio (SNR) and up to 0.2 bits/symbol/pol in generalized mutual information (GMI), on top of the gain provided by single-channel digital backpropagation. △ Less

Submitted 5 September, 2021; originally announced September 2021.

arXiv:2104.06826 [pdf, other]

Towards Automatic Model Specialization for Edge Video Analytics

Authors: Daniel Rivas, Francesc Guim, Jordà Polo, Pubudu M. Silva, Josep Ll. Berral, David Carrera

Abstract: Judging by popular and generic computer vision challenges, such as the ImageNet or PASCAL VOC, neural networks have proven to be exceptionally accurate in recognition tasks. However, state-of-the-art accuracy often comes at a high computational price, requiring hardware acceleration to achieve real-time performance, while use cases, such as smart cities, require images from fixed cameras to be ana… ▽ More Judging by popular and generic computer vision challenges, such as the ImageNet or PASCAL VOC, neural networks have proven to be exceptionally accurate in recognition tasks. However, state-of-the-art accuracy often comes at a high computational price, requiring hardware acceleration to achieve real-time performance, while use cases, such as smart cities, require images from fixed cameras to be analyzed in real-time. Due to the amount of network bandwidth these streams would generate, we cannot rely on offloading compute to a centralized cloud. Thus, a distributed edge cloud is expected to process images locally. However, the edge is, by nature, resource-constrained, which puts a limit on the computational complexity that can execute. Yet, there is a need for a meeting point between the edge and accurate real-time video analytics. Specializing lightweight models on a per-camera basis may help but it quickly becomes unfeasible as the number of cameras grows unless the process is automated. In this paper, we present and evaluate COVA (Contextually Optimized Video Analytics), a framework to assist in the automatic specialization of models for video analytics in edge cameras. COVA automatically improves the accuracy of lightweight models through their specialization. Moreover, we discuss and review each step involved in the process to understand the different trade-offs that each one entails. Additionally, we show how the sole assumption of static cameras allows us to make a series of considerations that greatly simplify the scope of the problem. Finally, experiments show that state-of-the-art models, i.e., able to generalize to unseen environments, can be effectively used as teachers to tailor smaller networks to a specific context, boosting accuracy at a constant computational cost. Results show that our COVA can automatically improve accuracy of pre-trained models by an average of 21%. △ Less

Submitted 13 December, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

arXiv:2010.02645 [pdf, other]

Multirotors from Takeoff to Real-Time Full Identification Using the Modified Relay Feedback Test and Deep Neural Networks

Authors: Abdulla Ayyad, Mohamad Chehadeh, Pedro Silva, Mohamad Wahbah, Oussama Abdul Hay, Igor Boiko, Yahya Zweiri

Abstract: Low cost real-time identification of multirotor unmanned aerial vehicle (UAV) dynamics is an active area of research supported by the surge in demand and emerging application domains. Such real-time identification capabilities shorten development time and cost, making UAVs' technology more accessible, and enable a wide variety of advanced applications. In this paper, we present a novel comprehensi… ▽ More Low cost real-time identification of multirotor unmanned aerial vehicle (UAV) dynamics is an active area of research supported by the surge in demand and emerging application domains. Such real-time identification capabilities shorten development time and cost, making UAVs' technology more accessible, and enable a wide variety of advanced applications. In this paper, we present a novel comprehensive approach, called DNN-MRFT, for real-time identification and tuning of multirotor UAVs using the Modified Relay Feedback Test (MRFT) and Deep Neural Networks (DNN). The main contribution is the development of a generalized framework for the application of DNN-MRFT to higher-order systems. One of the notable advantages of DNN-MRFT is the exact estimation of identified process gain, which mitigates the inaccuracies introduced due to the use of the describing function method in approximating the response of Lure's systems. A secondary contribution is a generalized controller based on DNN-MRFT that takes-off a UAV with unknown dynamics and identifies the inner loops dynamics in-flight. Using the developed framework, DNN-MRFT is sequentially applied to the outer translational loops of the UAV utilizing in-flight results obtained for the inner attitude loops. DNN-MRFT takes on average 15 seconds to get the full knowledge of multirotor UAV dynamics and without any further tuning or calibration the UAV would be able to pass through a vertical window, and accurately follow trajectories achieving state-of-the-art performance. Such demonstrated accuracy, speed, and robustness of identification pushes the limits of state-of-the-art in real-time identification of UAVs. △ Less

Submitted 6 September, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

Comments: 19 pages, 11 figures, Submitted to IEEE Transactions on Control System Technology. A supplementary video for the work presented in this paper can be accessed from https://youtu.be/07RtnZxTJRM

arXiv:2009.14625 [pdf, other]

The Cubli: Modeling and Nonlinear Control Utilizing Unit Complex Numbers

Authors: Fabio Bobrow, Bruno A. Angelico, Paulo S. P. da Silva

Abstract: This paper covers the modeling and nonlinear control of the Cubli, a cube with three reaction wheels mounted on orthogonal faces that becomes a reaction wheel-based 1D/3D inverted pendulum when positioned in one of its edges (1D) or vertices (3D). Instead of angles, unit complex numbers are used as control states for the 1D configuration. This approach is useful not only to get rid of trigonometri… ▽ More This paper covers the modeling and nonlinear control of the Cubli, a cube with three reaction wheels mounted on orthogonal faces that becomes a reaction wheel-based 1D/3D inverted pendulum when positioned in one of its edges (1D) or vertices (3D). Instead of angles, unit complex numbers are used as control states for the 1D configuration. This approach is useful not only to get rid of trigonometric functions, but mainly because it is a specific case of the 3D configuration, that utilizes unit ultra-complex numbers (quaternions) as system states, and therefore facilitates its understanding. The derived nonlinear control law is equivalent to a linear one and is characterized by only three straightforward tuning parameters. Experiment results are presented to validate modeling and control. △ Less

Submitted 28 September, 2020; originally announced September 2020.

Comments: Paper submitted to ACC 2021

arXiv:2004.05717 [pdf, other]

doi 10.1007/s42600-021-00151-6

Towards an Effective and Efficient Deep Learning Model for COVID-19 Patterns Detection in X-ray Images

Authors: Eduardo Luz, Pedro Lopes Silva, Rodrigo Silva, Ludmila Silva, Gladston Moreira, David Menotti

Abstract: Confronting the pandemic of COVID-19, is nowadays one of the most prominent challenges of the human species. A key factor in slowing down the virus propagation is the rapid diagnosis and isolation of infected patients. The standard method for COVID-19 identification, the Reverse transcription polymerase chain reaction method, is time-consuming and in short supply due to the pandemic. Thus, researc… ▽ More Confronting the pandemic of COVID-19, is nowadays one of the most prominent challenges of the human species. A key factor in slowing down the virus propagation is the rapid diagnosis and isolation of infected patients. The standard method for COVID-19 identification, the Reverse transcription polymerase chain reaction method, is time-consuming and in short supply due to the pandemic. Thus, researchers have been looking for alternative screening methods and deep learning applied to chest X-rays of patients has been showing promising results. Despite their success, the computational cost of these methods remains high, which imposes difficulties to their accessibility and availability. Thus, the main goal of this work is to propose an accurate yet efficient method in terms of memory and processing time for the problem of COVID-19 screening in chest X-rays. Methods: To achieve the defined objective we exploit and extend the EfficientNet family of deep artificial neural networks which are known for their high accuracy and low footprints in other applications. We also exploit the underlying taxonomy of the problem with a hierarchical classifier. A dataset of 13,569 X-ray images divided into healthy, non-COVID-19 pneumonia, and COVID-19 patients is used to train the proposed approaches and other 5 competing architectures. Finally, 231 images of the three classes were used to assess the quality of the methods. Results: The results show that the proposed approach was able to produce a high-quality model, with an overall accuracy of 93.9%, COVID-19, sensitivity of 96.8% and positive prediction of 100%, while having from 5 to 30 times fewer parameters than other than the other tested architectures. Larger and more heterogeneous databases are still needed for validation before claiming that deep learning can assist physicians in the task of detecting COVID-19 in X-ray images. △ Less

Submitted 24 April, 2021; v1 submitted 12 April, 2020; originally announced April 2020.

Comments: This is a preprint of an article published in Research on Biomedical Engineering. The final authenticated version is available online at https://doi.org/10.1007/s42600-021-00151-6

arXiv:2002.11213 [pdf, other]

Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models

Authors: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, Lucas Rafael Stefanel Gris, Hamilton Pereira da Silva, Sandra Maria Aluisio, Moacir Antonelli Ponti

Abstract: In this paper we present an efficient method for training models for speaker recognition using small or under-resourced datasets. This method requires less data than other SOTA (State-Of-The-Art) methods, e.g. the Angular Prototypical and GE2E loss functions, while achieving similar results to those methods. This is done using the knowledge of the reconstruction of a phoneme in the speaker's voice… ▽ More In this paper we present an efficient method for training models for speaker recognition using small or under-resourced datasets. This method requires less data than other SOTA (State-Of-The-Art) methods, e.g. the Angular Prototypical and GE2E loss functions, while achieving similar results to those methods. This is done using the knowledge of the reconstruction of a phoneme in the speaker's voice. For this purpose, a new dataset was built, composed of 40 male speakers, who read sentences in Portuguese, totaling approximately 3h. We compare the three best architectures trained using our method to select the best one, which is the one with a shallow architecture. Then, we compared this model with the SOTA method for the speaker recognition task: the Fast ResNet-34 trained with approximately 2,000 hours, using the loss functions Angular Prototypical and GE2E. Three experiments were carried out with datasets in different languages. Among these three experiments, our model achieved the second best result in two experiments and the best result in one of them. This highlights the importance of our method, which proved to be a great competitor to SOTA speaker recognition models, with 500x less data and a simpler approach. △ Less

Submitted 18 June, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

Comments: Submitted to BRACIS

arXiv:1910.06143 [pdf, other]

Detection of Muscle Fatigue Using Variable Bit Flow Modulation and Cross Correlation of Electromyographic Signals

Authors: L. C. Medeiros, P. H. O. Silva, V. H. S. Lopes, A. F. Oliveira, E. B. Pereira, E. G. Nepomuceno

Abstract: The surface electromyography (sEMG) analysis can provide information on muscle fatigue status by estimation of muscle fibre conduction velocity (MFCV), a measure of the travelling speed of motor unit action potentials in muscle tissue. This paper proposes a technique for MFCV estimation using cross-correlation methods and variable bitstream modulation. The technique displays an estimate based on a… ▽ More The surface electromyography (sEMG) analysis can provide information on muscle fatigue status by estimation of muscle fibre conduction velocity (MFCV), a measure of the travelling speed of motor unit action potentials in muscle tissue. This paper proposes a technique for MFCV estimation using cross-correlation methods and variable bitstream modulation. The technique displays an estimate based on a set of data generated by the gain variation modulation, providing an average estimate of the MFCV. The observed trend of MFCV decrease correlates with the fatigue state of the observed muscle. Finally, the values found were compared with information from the literature, validating the method and showing the advantages of using variable modulation. △ Less

Submitted 14 October, 2019; originally announced October 2019.

Comments: SBAI 2019 - Simposio Brasileiro de Automacao Inteligente - Ouro Preto. 6 pages. In Portuguese

arXiv:1903.01544 [pdf, ps, other]

doi 10.1109/JLT.2019.2904102

Dual-polarization NFDM transmission with continuous and discrete spectral modulation

Authors: F. Da Ros, S. Civelli, S. Gaiarin, E. P. da Silva, N. De Renzis, M. Secondini, D. Zibar

Abstract: Nonlinear distortion experienced by signals during their propagation through optical fibers strongly limits the throughput of optical communication systems. Recently, a strong research focus has been dedicated to nonlinearity mitigation and compensation techniques. At the same time, a more disruptive approach, the nonlinear Fourier transform (NFT), aims at designing signaling schemes more suited t… ▽ More Nonlinear distortion experienced by signals during their propagation through optical fibers strongly limits the throughput of optical communication systems. Recently, a strong research focus has been dedicated to nonlinearity mitigation and compensation techniques. At the same time, a more disruptive approach, the nonlinear Fourier transform (NFT), aims at designing signaling schemes more suited to the nonlinear fiber channel. In a short period, impressive results have been reported by modulating either the continuous spectrum or the discrete spectrum. Additionally, very recent works further introduced the opportunity to modulate both spectra for single polarization transmission. Here, we extend the joint modulation scheme to dual-polarization transmission by introducing the framework to construct a dual-polarization optical signal with the desired continuous and discrete spectra. After a brief analysis of the numerical algorithms used to implement the proposed scheme, the first experimental demonstration of dual-polarization joint nonlinear frequency division multiplexing (NFDM) modulation is reported for up to 3200 km of low-loss transmission fiber. The proposed dual-polarization joint modulation schemes enables to exploit all the degrees of freedom for modulation (both polarizations and both spectra) provided by a single-mode fiber (SMF). △ Less

Submitted 26 February, 2019; originally announced March 2019.

Journal ref: Journal of Lightwave Technology, pre-print 2019

arXiv:1901.05303 [pdf, other]

A Customized System to Assess Foot Plantar Pressure: A Case Study on Calloused and Normal Feet

Authors: A. M. A. I. Rathnayaka, W. N. D. Perera, H. P. Savindu, K. C. M. Madarasingha, S. P. Ranasinghe, H. G. T. V. Thuduwage, A. U. Kulathilaka, P. Silva, S. Jayasinghe, K. T. D. Kahaduwa, A. C. De Silva

Abstract: Foot plantar pressure monitoring is an important tool for biomechanical assessment of posture, foot complications due to callus formation and wounds and for sports applications. The pronounced cost associated with commercial plantar pressure monitoring systems and inflexibility of custom analyzing data in such systems prompted the development of a versatile system with minimized cost. This study f… ▽ More Foot plantar pressure monitoring is an important tool for biomechanical assessment of posture, foot complications due to callus formation and wounds and for sports applications. The pronounced cost associated with commercial plantar pressure monitoring systems and inflexibility of custom analyzing data in such systems prompted the development of a versatile system with minimized cost. This study focuses on the development of such a system with high speed data acquisition providing analysis tools for assessing plantar pressure variations of diabetic patients with calloused feet. The new system is capable of achieving a frame rate of 155 Hz which is ideal for pressure monitoring during both standing and walking. The system was verified using 10 normal subjects and 5 diabetic subjects with calluses on in their feet. Results indicate significantly high mechanical stresses on skin beneath callus and postural disorders during standing, in subjects with calluses. △ Less

Submitted 10 January, 2019; originally announced January 2019.

Comments: 5 pages, 7 figures, 2018 IEEE Region Ten Symposium (Tensymp), pp 202-206 2018

arXiv:1812.04708 [pdf, other]

Non-local Operational Anisotropic Diffusion Filter

Authors: Fábio A. M. Cappabianco, Petrus P. C. E. da Silva

Abstract: High-frequency noise is present in several modalities of medical images. It originates from the acquisition process and may be related to the scanner configurations, the scanned body, or to other external factors. This way, prospective filters are an important tool to improve the image quality. In this paper, we propose a non-local weighted operational anisotropic diffusion filter and evaluate its… ▽ More High-frequency noise is present in several modalities of medical images. It originates from the acquisition process and may be related to the scanner configurations, the scanned body, or to other external factors. This way, prospective filters are an important tool to improve the image quality. In this paper, we propose a non-local weighted operational anisotropic diffusion filter and evaluate its effect on magnetic resonance images and on kV/CBCT radiotherapy images. We also provide a detailed analysis of non-local parameter settings. Results show that the new filter enhances previous local implementations and has potential application in radiotherapy treatments. △ Less

Submitted 11 December, 2018; originally announced December 2018.

Comments: 7 pages, 10 figures

arXiv:1810.02184 [pdf, other]

doi 10.1109/JLT.2018.2882638

Perturbation-based FEC-assisted Iterative Nonlinearity Compensation for WDM Systems

Authors: Edson P. da Silva, Metodi P. Yankov, Francesco Da Ros, Toshio Morioka, Leif K. Oxenløwe

Abstract: A perturbation-based nonlinear compensation scheme assisted by a feedback from the forward error correction (FEC) decoder is numerically and experimentally investigated. It is shown by numerical simulations and transmission experiments that a feedback from the FEC decoder enables improved compensation performance, allowing the receiver to operate very close to the full data-aided performance bound… ▽ More A perturbation-based nonlinear compensation scheme assisted by a feedback from the forward error correction (FEC) decoder is numerically and experimentally investigated. It is shown by numerical simulations and transmission experiments that a feedback from the FEC decoder enables improved compensation performance, allowing the receiver to operate very close to the full data-aided performance bounds. The experimental analysis considers the dispersion uncompensated transmission of a 5 x 32 GBd WDM system with DP-16QAM and DP-64QAM after 4200 km and 1120 km, respectively. The experimental results show that the proposed scheme outperforms single-channel digital backpropagation. A perturbation-based nonlinear compensation scheme assisted by a feedback from the forward error correction (FEC) decoder is numerically and experimentally investigated. It is shown by numerical simulations and transmission experiments that a feedback from the FEC decoder enables improved compensation performance, allowing the receiver to operate very close to the full data-aided performance bounds. The experimental analysis considers the dispersion uncompensated transmission of a 5 x 32 GBd WDM system with DP-16QAM and DP-64QAM after 4200 km and 1120 km, respectively. The experimental results show that the proposed scheme outperforms single-channel digital backpropagation. △ Less

Submitted 4 October, 2018; originally announced October 2018.

arXiv:1808.06532 [pdf]

doi 10.1063/1.4978945

Optical wavelength conversion of high bandwidth phase-encoded signals in a high FOM 50cm CMOS compatible waveguide

Authors: Francesco Da Ros, Edson Porto da Silva, Darko Zibar, Sai T. Chu, Brent E. Little, Roberto Morandotti, Michael Galili, David J. Moss, Leif K. Oxenløwe

Abstract: We demonstrate wavelength conversion of QAM signals including 32GBd QPSK and 10GBd 16QAM in a 50cm long high index doped glass spiral waveguide. The quality of the generated idlers over a 10nm bandwidth is sufficient to achieve a BER performance below the HD FEC threshold (less than 3.8 x 10-3), with an OSNR penalty of less than 0.3 dB compared to the original signal. Our results confirm that this… ▽ More We demonstrate wavelength conversion of QAM signals including 32GBd QPSK and 10GBd 16QAM in a 50cm long high index doped glass spiral waveguide. The quality of the generated idlers over a 10nm bandwidth is sufficient to achieve a BER performance below the HD FEC threshold (less than 3.8 x 10-3), with an OSNR penalty of less than 0.3 dB compared to the original signal. Our results confirm that this is a promising platform for nonlinear optical signal processing, a result of both very low linear propagation loss (less than 0.07 dB/cm) and the large material bandgap that ensures negligible nonlinear loss at telecom wavelengths. △ Less

Submitted 7 August, 2018; originally announced August 2018.

Comments: 14 pages, 8 figures, 23 references

Journal ref: Applied Physics Letters Photonics Volume 2 Article 046105 (2017)

Showing 1–28 of 28 results for author: Silva, P