-
Block-Weighted Lasso for Joint Optimization of Memory Depth and Kernels in Wideband DPD
Authors:
Jinfei Wang,
Yi Ma,
Fei Tong,
Ziming He
Abstract:
The optimizations of both memory depth and kernel functions are critical for wideband digital pre-distortion (DPD). However, the memory depth is usually determined via exhaustive search over a wide range for the sake of linearization optimality, followed by the kernel selection of each memory depth, yielding excessive computational cost. In this letter, we aim to provide an efficient solution that…
▽ More
The optimizations of both memory depth and kernel functions are critical for wideband digital pre-distortion (DPD). However, the memory depth is usually determined via exhaustive search over a wide range for the sake of linearization optimality, followed by the kernel selection of each memory depth, yielding excessive computational cost. In this letter, we aim to provide an efficient solution that jointly optimizes the memory depth and kernels while preserving reasonable linearization performance. Specifically, we propose to formulate this optimization as a blockweighted least absolute shrinkage and selection operator (Lasso) problem, where kernels are assigned regularization weights based on their polynomial orders. Then, a block coordinate descent algorithm is introduced to solve the block-weighted Lasso problem. Measurement results on a generalized memory polynomial (GMP) model demonstrates that our proposed solution reduces memory depth by 31.6% and kernel count by 85% compared to the full GMP, while achieving -46.4 dB error vector magnitude (EVM) for signals of 80 MHz bandwidth. In addition, the proposed solution outperforms both the full GMP and the GMP pruned by standard Lasso by at least 0.7 dB in EVM.
△ Less
Submitted 18 April, 2025;
originally announced April 2025.
-
Exploiting Non-uniform Quantization for Enhanced ILC in Wideband Digital Pre-distortion
Authors:
Jinfei Wang,
Yi Ma,
Fei Tong,
Ziming He
Abstract:
In this paper, it is identified that lowering the reference level at the vector signal analyzer can significantly improve the performance of iterative learning control (ILC). We present a mathematical explanation for this phenomenon, where the signals experience logarithmic transform prior to analogue-to-digital conversion, resulting in non-uniform quantization. This process reduces the quantizati…
▽ More
In this paper, it is identified that lowering the reference level at the vector signal analyzer can significantly improve the performance of iterative learning control (ILC). We present a mathematical explanation for this phenomenon, where the signals experience logarithmic transform prior to analogue-to-digital conversion, resulting in non-uniform quantization. This process reduces the quantization noise of low-amplitude signals that constitute a substantial portion of orthogonal frequency division multiplexing (OFDM) signals, thereby improving ILC performance. Measurement results show that compared to setting the reference level to the peak amplitude, lowering the reference level achieves 3 dB improvement on error vector magnitude (EVM) and 15 dB improvement on normalized mean square error (NMSE) for 320 MHz WiFi OFDM signals.
△ Less
Submitted 28 February, 2025; v1 submitted 12 February, 2025;
originally announced February 2025.
-
Design and Implementation of mmWave Surface Wave Enabled Fluid Antennas and Experimental Results for Fluid Antenna Multiple Access
Authors:
Yuanjun Shen,
Boyi Tang,
Shuai Gao,
Kin-Fai Tong,
Hang Wong,
Kai-Kit Wong,
Yangyang Zhang
Abstract:
While multiple-input multiple-output (MIMO) technologies continue to advance, concerns arise as to how MIMO can remain scalable if more users are to be accommodated with an increasing number of antennas at the base station (BS) in the upcoming sixth generation (6G). Recently, the concept of fluid antenna system (FAS) has emerged, which promotes position flexibility to enable transmitter channel st…
▽ More
While multiple-input multiple-output (MIMO) technologies continue to advance, concerns arise as to how MIMO can remain scalable if more users are to be accommodated with an increasing number of antennas at the base station (BS) in the upcoming sixth generation (6G). Recently, the concept of fluid antenna system (FAS) has emerged, which promotes position flexibility to enable transmitter channel state information (CSI) free spatial multiple access on one radio frequency (RF) chain. On the theoretical side, the fluid antenna multiple access (FAMA) approach offers a scalable alternative to massive MIMO spatial multiplexing. However, FAMA lacks experimental validation and the hardware implementation of FAS remains a mysterious approach. The aim of this paper is to provide a novel hardware design for FAS and evaluate the performance of FAMA using experimental data. Our FAS design is based on a dynamically reconfigurable "fluid" radiator which is capable of adjusting its position within a predefined space. One single-channel fluid antenna (SCFA) and one double-channel fluid antenna (DCFA) are designed, electromagnetically simulated, fabricated, and measured. The measured radiation patterns of prototypes are imported into channel and network models for evaluating their performance in FAMA. The experimental results demonstrate that in the 5G millimeter-wave (mmWave) bands (24-30 GHz), the FAS prototypes can vary their gain up to an averaged value of 11 dBi. In the case of 4-user FAMA, the double-channel FAS can significantly reduce outage probability by 57% and increases the multiplexing gain to 2.27 when compared to a static omnidirectional antenna.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
On Propagation Characteristics of Reconfigurable Surface Wave Platform: Simulation and Experimental Verification
Authors:
Z. Chu,
K. F. Tong,
K. K. Wong,
C. B. Chae,
C. H. Chan
Abstract:
Reconfigurable intelligent surface (RIS) as a smart reflector is revolutionizing research for next-generation wireless communications. Complementing this is a concept of using RIS as an efficient propagation medium for potentially superior path loss characteristics. Motivated by a recent porous surface architecture that facilitates reconfigurable pathways with cavities filled with fluid metal, thi…
▽ More
Reconfigurable intelligent surface (RIS) as a smart reflector is revolutionizing research for next-generation wireless communications. Complementing this is a concept of using RIS as an efficient propagation medium for potentially superior path loss characteristics. Motivated by a recent porous surface architecture that facilitates reconfigurable pathways with cavities filled with fluid metal, this paper studies the propagation characteristics of different pathway configurations in different lossy materials on the reconfigurable surface wave platform by using a commercial full electromagnetic simulation software and S-parameters experiments. This paper also looks into the best scheme to switch between a straight pathway and a $90^\circ$-bend and attempts to quantify the additional path loss when making a turn. Our experimental results verify the simulation results, showing the effectiveness of the proposed reconfigurable surface wave platform for a wide-band, low path loss and highly programmable communications.
△ Less
Submitted 2 August, 2023; v1 submitted 26 April, 2023;
originally announced April 2023.
-
Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting
Authors:
Jie Wang,
Yuji Liu,
Binling Wang,
Yiming Zhi,
Song Li,
Shipeng Xia,
Jiayang Zhang,
Feng Tong,
Lin Li,
Qingyang Hong
Abstract:
This paper describes a spatial-aware speaker diarization system for the multi-channel multi-party meeting. The diarization system obtains direction information of speaker by microphone array. Speaker spatial embedding is generated by xvector and s-vector derived from superdirective beamforming (SDB) which makes the embedding more robust. Specifically, we propose a novel multi-channel sequence-to-s…
▽ More
This paper describes a spatial-aware speaker diarization system for the multi-channel multi-party meeting. The diarization system obtains direction information of speaker by microphone array. Speaker spatial embedding is generated by xvector and s-vector derived from superdirective beamforming (SDB) which makes the embedding more robust. Specifically, we propose a novel multi-channel sequence-to-sequence neural network architecture named discriminative multi-stream neural network (DMSNet) which consists of attention superdirective beamforming (ASDB) block and Conformer encoder. The proposed ASDB is a self-adapted channel-wise block that extracts the latent spatial features of array audios by modeling interdependencies between channels. We explore DMSNet to address overlapped speech problem on multi-channel audio and achieve 93.53% accuracy on evaluation set. By performing DMSNet based overlapped speech detection (OSD) module, the diarization error rate (DER) of cluster-based diarization system decrease significantly from 13.45% to 7.64%.
△ Less
Submitted 24 September, 2022;
originally announced September 2022.
-
Constellation-Oriented Perturbation for Scalable-Complexity MIMO Nonlinear Precoding
Authors:
Jinfei Wang,
Yi Ma,
Na Yi,
Rahim Tafazolli,
Fei Tong
Abstract:
In this paper, a novel nonlinear precoding (NLP) technique, namely constellation-oriented perturbation (COP), is proposed to tackle the scalability problem inherent in conventional NLP techniques. The basic concept of COP is to apply vector perturbation (VP) in the constellation domain instead of symbol domain; as often used in conventional techniques. By this means, the computational complexity o…
▽ More
In this paper, a novel nonlinear precoding (NLP) technique, namely constellation-oriented perturbation (COP), is proposed to tackle the scalability problem inherent in conventional NLP techniques. The basic concept of COP is to apply vector perturbation (VP) in the constellation domain instead of symbol domain; as often used in conventional techniques. By this means, the computational complexity of COP is made independent to the size of multi-antenna (i.e., MIMO) networks. Instead, it is related to the size of symbol constellation. Through widely linear transform, it is shown that COP has its complexity flexibly scalable in the constellation domain to achieve a good complexity-performance tradeoff. Our computer simulations show that COP can offer very comparable performance with the optimum VP in small MIMO systems. Moreover, it significantly outperforms current sub-optimum VP approaches (such as degree-2 VP) in large MIMO whilst maintaining much lower computational complexity.
△ Less
Submitted 4 September, 2022; v1 submitted 3 August, 2022;
originally announced August 2022.
-
On Surface Wave Propagation Characteristics of Porosity-Based Reconfigurable Surface
Authors:
Z. Chu,
K. K. Wong,
K. F. Tong
Abstract:
Reconfigurable surfaces facilitating energy-efficient, intelligent surface wave propagation have recently emerged as a technology that finds applications in many-core systems and 6G wireless communications. In this paper, we consider the porosity-based reconfigurable surface where there are cavities that can be filled on-demand with fluid metal such as Galinstan, in order to create adaptable chann…
▽ More
Reconfigurable surfaces facilitating energy-efficient, intelligent surface wave propagation have recently emerged as a technology that finds applications in many-core systems and 6G wireless communications. In this paper, we consider the porosity-based reconfigurable surface where there are cavities that can be filled on-demand with fluid metal such as Galinstan, in order to create adaptable channels for efficient wave propagation. We aim to investigate the propagation phenomenon of signal fluctuation resulting from the diffraction of discrete porosity and study how different porosity patterns affect this phenomenon. Our results cover the frequency range between 21.7GHz and 31.6GHz when a WR-34 waveguide is used as the transducer.
△ Less
Submitted 22 June, 2022;
originally announced June 2022.
-
Deep Representation Decomposition for Rate-Invariant Speaker Verification
Authors:
Fuchuan Tong,
Siqi Zheng,
Haodong Zhou,
Xingjia Xie,
Qingyang Hong,
Lin Li
Abstract:
While promising performance for speaker verification has been achieved by deep speaker embeddings, the advantage would reduce in the case of speaking-style variability. Speaking rate mismatch is often observed in practical speaker verification systems, which may actually degrade the system performance. To reduce intra-class discrepancy caused by speaking rate, we propose a deep representation deco…
▽ More
While promising performance for speaker verification has been achieved by deep speaker embeddings, the advantage would reduce in the case of speaking-style variability. Speaking rate mismatch is often observed in practical speaker verification systems, which may actually degrade the system performance. To reduce intra-class discrepancy caused by speaking rate, we propose a deep representation decomposition approach with adversarial learning to learn speaking rate-invariant speaker embeddings. Specifically, adopting an attention block, we decompose the original embedding into an identity-related component and a rate-related component through multi-task training. Additionally, to reduce the latent relationship between the two decomposed components, we further propose a cosine mapping block to train the parameters adversarially to minimize the cosine similarity between the two decomposed components. As a result, identity-related features become robust to speaking rate and then are used for verification. Experiments are conducted on VoxCeleb1 data and HI-MIA data to demonstrate the effectiveness of our proposed approach.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.
-
Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data
Authors:
Fuchuan Tong,
Siqi Zheng,
Min Zhang,
Yafeng Chen,
Hongbin Suo,
Qingyang Hong,
Lin Li
Abstract:
Unsupervised clustering on speakers is becoming increasingly important for its potential uses in semi-supervised learning. In reality, we are often presented with enormous amounts of unlabeled data from multi-party meetings and discussions. An effective unsupervised clustering approach would allow us to significantly increase the amount of training data without additional costs for annotations. Re…
▽ More
Unsupervised clustering on speakers is becoming increasingly important for its potential uses in semi-supervised learning. In reality, we are often presented with enormous amounts of unlabeled data from multi-party meetings and discussions. An effective unsupervised clustering approach would allow us to significantly increase the amount of training data without additional costs for annotations. Recently, methods based on graph convolutional networks (GCN) have received growing attention for unsupervised clustering, as these methods exploit the connectivity patterns between nodes to improve learning performance. In this work, we present a GCN-based approach for semi-supervised learning. Given a pre-trained embedding extractor, a graph convolutional network is trained on the labeled data and clusters unlabeled data with "pseudo-labels". We present a self-correcting training mechanism that iteratively runs the cluster-train-correct process on pseudo-labels. We show that this proposed approach effectively uses unlabeled data and improves speaker recognition accuracy.
△ Less
Submitted 25 April, 2022;
originally announced April 2022.
-
The xmuspeech system for multi-channel multi-party meeting transcription challenge
Authors:
Jie Wang,
Yuji Liu,
Binling Wang,
Yiming Zhi,
Song Li1,
Shipeng Xia,
Jiayang Zhang,
Lin Li1,
Qingyang Hong,
Feng Tong
Abstract:
This paper describes the system developed by the XMUSPEECH team for the Multi-channel Multi-party Meeting Transcription Challenge (M2MeT). For the speaker diarization task, we propose a multi-channel speaker diarization system that obtains spatial information of speaker by Difference of Arrival (DOA) technology. Speaker-spatial embedding is generated by x-vector and s-vector derived from Filter-an…
▽ More
This paper describes the system developed by the XMUSPEECH team for the Multi-channel Multi-party Meeting Transcription Challenge (M2MeT). For the speaker diarization task, we propose a multi-channel speaker diarization system that obtains spatial information of speaker by Difference of Arrival (DOA) technology. Speaker-spatial embedding is generated by x-vector and s-vector derived from Filter-and-Sum Beamforming (FSB) which makes the embedding more robust. Specifically, we propose a novel multi-channel sequence-to-sequence neural network architecture named Discriminative Multi-stream Neural Network (DMSNet) which consists of Attention Filter-and-Sum block (AFSB) and Conformer encoder. We explore DMSNet to address overlapped speech problem on multi-channel audio. Compared with LSTM based OSD module, we achieve a decreases of 10.1% in Detection Error Rate(DetER). By performing DMSNet based OSD module, the DER of cluster-based diarization system decrease significantly form 13.44% to 7.63%. Our best fusion system achieves 7.09% and 9.80% of the diarization error rate (DER) on evaluation set and test set.
△ Less
Submitted 11 February, 2022;
originally announced February 2022.
-
XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021
Authors:
Jie Wang,
Fuchuang Tong,
Zhicong Chen,
Lin Li,
Qingyang Hong,
Haodong Zhou
Abstract:
This paper describes the XMUSPEECH speaker recognition and diarisation systems for the VoxCeleb Speaker Recognition Challenge 2021. For track 2, we evaluate two systems including ResNet34-SE and ECAPA-TDNN. For track 4, an important part of our system is VAD module which greatly improves the performance. Our best submission on the track 4 obtained on the evaluation set DER 5.54% and JER 27.11%, wh…
▽ More
This paper describes the XMUSPEECH speaker recognition and diarisation systems for the VoxCeleb Speaker Recognition Challenge 2021. For track 2, we evaluate two systems including ResNet34-SE and ECAPA-TDNN. For track 4, an important part of our system is VAD module which greatly improves the performance. Our best submission on the track 4 obtained on the evaluation set DER 5.54% and JER 27.11%, while the performance on the development set is DER 2.92% and JER 20.84%.
△ Less
Submitted 6 September, 2021;
originally announced September 2021.
-
Image-to-Graph Convolutional Network for Deformable Shape Reconstruction from a Single Projection Image
Authors:
M. Nakao,
F. Tong,
M. Nakamura,
T. Matsuda
Abstract:
Shape reconstruction of deformable organs from two-dimensional X-ray images is a key technology for image-guided intervention. In this paper, we propose an image-to-graph convolutional network (IGCN) for deformable shape reconstruction from a single-viewpoint projection image. The IGCN learns relationship between shape/deformation variability and the deep image features based on a deformation mapp…
▽ More
Shape reconstruction of deformable organs from two-dimensional X-ray images is a key technology for image-guided intervention. In this paper, we propose an image-to-graph convolutional network (IGCN) for deformable shape reconstruction from a single-viewpoint projection image. The IGCN learns relationship between shape/deformation variability and the deep image features based on a deformation mapping scheme. In experiments targeted to the respiratory motion of abdominal organs, we confirmed the proposed framework with a regularized loss function can reconstruct liver shapes from a single digitally reconstructed radiograph with a mean distance error of 3.6mm.
△ Less
Submitted 31 August, 2021; v1 submitted 27 August, 2021;
originally announced August 2021.
-
Enhancing and Localizing Surface Wave Propagation with Reconfigurable Surfaces
Authors:
Z. Chu,
K. K. Wong,
K. F. Tong
Abstract:
As an attempt to develop a reconfigurable surface architecture that can use liquid metal such as Galinstan to shape surface channels on demand, this paper considers a punctured surface where cavities are evenly distributed and can be filled with liquid metal potentially via digitally controlled pumps. In this paper, we look at the benefits of such architecture in terms of surface-wave signal enhan…
▽ More
As an attempt to develop a reconfigurable surface architecture that can use liquid metal such as Galinstan to shape surface channels on demand, this paper considers a punctured surface where cavities are evenly distributed and can be filled with liquid metal potentially via digitally controlled pumps. In this paper, we look at the benefits of such architecture in terms of surface-wave signal enhancement and isolation, and examine how various system parameters impact the performance using full wave 3-dimensional electromagnetic simulations. It is shown that extraordinary signal shaping can be obtained.
△ Less
Submitted 19 June, 2021;
originally announced June 2021.
-
Reconfigurable Surface Wave Platform Using Fluidic Conductive Structures
Authors:
Z. Chu,
K. K. Wong,
K. F. Tong
Abstract:
Surface wave inherently has less propagation loss as it adheres to the surface and minimizes unwanted dissipation in space. Recently, they find applications in network-on-chip (NoC) communications and intelligent surface aided mobile networked communications. This paper puts forward a reconfigurable surface wave platform (RSWP) that utilizes liquid metal to produce highly energy-efficient and adap…
▽ More
Surface wave inherently has less propagation loss as it adheres to the surface and minimizes unwanted dissipation in space. Recently, they find applications in network-on-chip (NoC) communications and intelligent surface aided mobile networked communications. This paper puts forward a reconfigurable surface wave platform (RSWP) that utilizes liquid metal to produce highly energy-efficient and adaptive pathways for surface wave transmission. Our simulation results illustrate that the proposed RSWP using Galinstan can obtain a $25{\rm dB}$ gain in the electric field for a propagation distance of $35λ$ at $30{\rm GHz}$ where $λ$ denotes the wavelength. Moreover, less than $1{\rm dB}$ loss is observed even at a distance of $50λ$, and a pathway with right-angled turns can also be created with only a $3.5{\rm dB}$ loss at the turn.
△ Less
Submitted 22 May, 2021;
originally announced May 2021.
-
On global convergence of area-constrained formations of hierarchical multi-agent systems
Authors:
Toshiharu Sugie,
Fei Tong,
Brian D. O. Anderson,
Zhiyong Sun
Abstract:
This paper is concerned with a formation shaping problem for point agents in a two-dimensional space, where control avoids the possibility of reflection ambiguities. One solution for this type of problems was given first for three or four agents by considering a potential function which consists of both the distance error and the signed area terms. Then, by exploiting a hierarchical control strate…
▽ More
This paper is concerned with a formation shaping problem for point agents in a two-dimensional space, where control avoids the possibility of reflection ambiguities. One solution for this type of problems was given first for three or four agents by considering a potential function which consists of both the distance error and the signed area terms. Then, by exploiting a hierarchical control strategy with such potential functions, the method was extended to any number of agents recently. However, a specific gain on the signed area term must be employed there, and it does not guarantee the global convergence. To overcome this issue, this paper provides a necessary and sufficient condition for the global convergence, subject to the constraint that the desired formation consists of isosceles triangles only. This clarifies the admissible range of the gain on the signed area for this case. In addition, as for formations consisting of arbitrary triangles, it is shown when high gain on the signed area is admissible for global convergence.
△ Less
Submitted 4 September, 2020;
originally announced September 2020.
-
A Dataset of Human Motion Status Using IR-UWB Through-wall Radar
Authors:
Zhengliang Zhu,
Degui Yang,
Junchao Zhang,
Feng Tong
Abstract:
Ultra-wideband (UWB) through-wall radar has a wide range of applications in non-contact human information detection and monitoring. With the integration of machine learning technology, its potential prospects include the physiological monitoring of patients in the hospital environment and the daily monitoring at home. Although many target detection methods of UWB through-wall radar based on machin…
▽ More
Ultra-wideband (UWB) through-wall radar has a wide range of applications in non-contact human information detection and monitoring. With the integration of machine learning technology, its potential prospects include the physiological monitoring of patients in the hospital environment and the daily monitoring at home. Although many target detection methods of UWB through-wall radar based on machine learning have been proposed, there is a lack of an opensource dataset to evaluate the performance of the algorithm. This published dataset was measured by impulse radio UWB (IR-UWB) through-wall radar system. Three test subjects were measured in different environments and several defined motion statuses. Using the presented dataset, we propose a human-motion-status recognition method using a convolutional neural network (CNN), the detailed dataset partition method and recognition process flow is given. On the well-trained network, the recognition accuracy of testing data for three kinds of motion statuses is higher than 99.7%. The dataset presented in this paper considers a simple environment. Therefore, we call on all organizations in the UWB radar field to cooperate to build opensource datasets to further promote the development of UWB through-wall radar.
△ Less
Submitted 31 August, 2020;
originally announced August 2020.