Search | arXiv e-print repository

Block-Weighted Lasso for Joint Optimization of Memory Depth and Kernels in Wideband DPD

Authors: Jinfei Wang, Yi Ma, Fei Tong, Ziming He

Abstract: The optimizations of both memory depth and kernel functions are critical for wideband digital pre-distortion (DPD). However, the memory depth is usually determined via exhaustive search over a wide range for the sake of linearization optimality, followed by the kernel selection of each memory depth, yielding excessive computational cost. In this letter, we aim to provide an efficient solution that jointly optimizes the memory depth and kernels while preserving reasonable linearization performance. Specifically, we propose to formulate this optimization as a block-weighted least absolute shrinkage and selection operator (Lasso) problem, where kernels are assigned regularization weights based on their polynomial orders. Then, a block coordinate descent algorithm is introduced to solve the block-weighted Lasso problem. Measurement results on a generalized memory polynomial (GMP) model demonstrate that our proposed solution reduces memory depth by 31.6% and kernel count by 85% compared to the full GMP, while achieving -46.4 dB error vector magnitude (EVM) for signals of 80 MHz bandwidth. In addition, the proposed solution outperforms both the full GMP and the GMP pruned by standard Lasso by at least 0.7 dB in EVM.

Submitted 18 April, 2025; originally announced April 2025.

Comments: 4 pages, 1 figure

arXiv:2502.08360 [pdf, other]

Exploiting Non-uniform Quantization for Enhanced ILC in Wideband Digital Pre-distortion

Authors: Jinfei Wang, Yi Ma, Fei Tong, Ziming He

Abstract: In this paper, it is identified that lowering the reference level at the vector signal analyzer can significantly improve the performance of iterative learning control (ILC). We present a mathematical explanation for this phenomenon, where the signals experience a logarithmic transform prior to analogue-to-digital conversion, resulting in non-uniform quantization. This process reduces the quantization noise of low-amplitude signals that constitute a substantial portion of orthogonal frequency division multiplexing (OFDM) signals, thereby improving ILC performance. Measurement results show that compared to setting the reference level to the peak amplitude, lowering the reference level achieves a 3 dB improvement in error vector magnitude (EVM) and a 15 dB improvement in normalized mean square error (NMSE) for 320 MHz WiFi OFDM signals.

Submitted 28 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

Comments: 4 pages, 7 figures, WAMICON 2025

arXiv:2405.09663 [pdf]

Design and Implementation of mmWave Surface Wave Enabled Fluid Antennas and Experimental Results for Fluid Antenna Multiple Access

Authors: Yuanjun Shen, Boyi Tang, Shuai Gao, Kin-Fai Tong, Hang Wong, Kai-Kit Wong, Yangyang Zhang

Abstract: While multiple-input multiple-output (MIMO) technologies continue to advance, concerns arise as to how MIMO can remain scalable if more users are to be accommodated with an increasing number of antennas at the base station (BS) in the upcoming sixth generation (6G). Recently, the concept of fluid antenna system (FAS) has emerged, which promotes position flexibility to enable transmitter channel state information (CSI) free spatial multiple access on one radio frequency (RF) chain. On the theoretical side, the fluid antenna multiple access (FAMA) approach offers a scalable alternative to massive MIMO spatial multiplexing. However, FAMA lacks experimental validation and the hardware implementation of FAS remains a mysterious approach. The aim of this paper is to provide a novel hardware design for FAS and evaluate the performance of FAMA using experimental data. Our FAS design is based on a dynamically reconfigurable "fluid" radiator which is capable of adjusting its position within a predefined space. One single-channel fluid antenna (SCFA) and one double-channel fluid antenna (DCFA) are designed, electromagnetically simulated, fabricated, and measured. The measured radiation patterns of prototypes are imported into channel and network models for evaluating their performance in FAMA. The experimental results demonstrate that in the 5G millimeter-wave (mmWave) bands (24-30 GHz), the FAS prototypes can vary their gain up to an averaged value of 11 dBi. In the case of 4-user FAMA, the double-channel FAS can significantly reduce outage probability by 57% and increase the multiplexing gain to 2.27 when compared to a static omnidirectional antenna.

Submitted 15 May, 2024; originally announced May 2024.

Comments: Submitted to IEEE Transactions on Antennas and Propagation

arXiv:2304.13903 [pdf, other]

On Propagation Characteristics of Reconfigurable Surface Wave Platform: Simulation and Experimental Verification

Authors: Z. Chu, K. F. Tong, K. K. Wong, C. B. Chae, C. H. Chan

Abstract: Reconfigurable intelligent surface (RIS) as a smart reflector is revolutionizing research for next-generation wireless communications. Complementing this is a concept of using RIS as an efficient propagation medium for potentially superior path loss characteristics. Motivated by a recent porous surface architecture that facilitates reconfigurable pathways with cavities filled with fluid metal, this paper studies the propagation characteristics of different pathway configurations in different lossy materials on the reconfigurable surface wave platform by using commercial full electromagnetic simulation software and S-parameter experiments. This paper also looks into the best scheme to switch between a straight pathway and a $90^\circ$-bend and attempts to quantify the additional path loss when making a turn. Our experimental results verify the simulation results, showing the effectiveness of the proposed reconfigurable surface wave platform for wide-band, low-path-loss and highly programmable communications.

Submitted 2 August, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

Comments: Submitted to IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2023

arXiv:2209.12002 [pdf, other]

doi 10.21437/Interspeech.2022-11412

Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting

Authors: Jie Wang, Yuji Liu, Binling Wang, Yiming Zhi, Song Li, Shipeng Xia, Jiayang Zhang, Feng Tong, Lin Li, Qingyang Hong

Abstract: This paper describes a spatial-aware speaker diarization system for the multi-channel multi-party meeting. The diarization system obtains direction information of the speaker from a microphone array. Speaker spatial embedding is generated by x-vector and s-vector derived from superdirective beamforming (SDB), which makes the embedding more robust. Specifically, we propose a novel multi-channel sequence-to-sequence neural network architecture named discriminative multi-stream neural network (DMSNet), which consists of an attention superdirective beamforming (ASDB) block and a Conformer encoder. The proposed ASDB is a self-adapted channel-wise block that extracts the latent spatial features of array audios by modeling interdependencies between channels. We explore DMSNet to address the overlapped speech problem on multi-channel audio and achieve 93.53% accuracy on the evaluation set. By using the DMSNet-based overlapped speech detection (OSD) module, the diarization error rate (DER) of the cluster-based diarization system decreases significantly from 13.45% to 7.64%.

Submitted 24 September, 2022; originally announced September 2022.

Comments: Accepted by Interspeech 2022. arXiv admin note: text overlap with arXiv:2202.05744

arXiv:2208.02418 [pdf, ps, other]

Constellation-Oriented Perturbation for Scalable-Complexity MIMO Nonlinear Precoding

Authors: Jinfei Wang, Yi Ma, Na Yi, Rahim Tafazolli, Fei Tong

Abstract: In this paper, a novel nonlinear precoding (NLP) technique, namely constellation-oriented perturbation (COP), is proposed to tackle the scalability problem inherent in conventional NLP techniques. The basic concept of COP is to apply vector perturbation (VP) in the constellation domain instead of the symbol domain, as is often done in conventional techniques. By this means, the computational complexity of COP is made independent of the size of multi-antenna (i.e., MIMO) networks. Instead, it is related to the size of the symbol constellation. Through widely linear transform, it is shown that COP has its complexity flexibly scalable in the constellation domain to achieve a good complexity-performance tradeoff. Our computer simulations show that COP can offer very comparable performance with the optimum VP in small MIMO systems. Moreover, it significantly outperforms current sub-optimum VP approaches (such as degree-2 VP) in large MIMO whilst maintaining much lower computational complexity.

Submitted 4 September, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

Comments: 6 pages, 6 figures, published at GLOBECOM 2022

arXiv:2206.11401 [pdf, other]

On Surface Wave Propagation Characteristics of Porosity-Based Reconfigurable Surface

Authors: Z. Chu, K. K. Wong, K. F. Tong

Abstract: Reconfigurable surfaces facilitating energy-efficient, intelligent surface wave propagation have recently emerged as a technology that finds applications in many-core systems and 6G wireless communications. In this paper, we consider the porosity-based reconfigurable surface where there are cavities that can be filled on-demand with fluid metal such as Galinstan, in order to create adaptable channels for efficient wave propagation. We aim to investigate the propagation phenomenon of signal fluctuation resulting from the diffraction of discrete porosity and study how different porosity patterns affect this phenomenon. Our results cover the frequency range between 21.7 GHz and 31.6 GHz when a WR-34 waveguide is used as the transducer.

Submitted 22 June, 2022; originally announced June 2022.

Comments: Submitted to the 2022 Asia-Pacific Microwave Conference (APMC 2022), Nov. 29-Dec. 2, 2022, Yokohama

arXiv:2205.14294 [pdf, other]

Deep Representation Decomposition for Rate-Invariant Speaker Verification

Authors: Fuchuan Tong, Siqi Zheng, Haodong Zhou, Xingjia Xie, Qingyang Hong, Lin Li

Abstract: While promising performance for speaker verification has been achieved by deep speaker embeddings, the advantage would reduce in the case of speaking-style variability. Speaking rate mismatch is often observed in practical speaker verification systems, which may actually degrade the system performance. To reduce intra-class discrepancy caused by speaking rate, we propose a deep representation decomposition approach with adversarial learning to learn speaking rate-invariant speaker embeddings. Specifically, adopting an attention block, we decompose the original embedding into an identity-related component and a rate-related component through multi-task training. Additionally, to reduce the latent relationship between the two decomposed components, we further propose a cosine mapping block to train the parameters adversarially to minimize the cosine similarity between the two decomposed components. As a result, identity-related features become robust to speaking rate and then are used for verification. Experiments are conducted on VoxCeleb1 data and HI-MIA data to demonstrate the effectiveness of our proposed approach.

Submitted 27 May, 2022; originally announced May 2022.

Comments: Accepted by Odyssey 2022

arXiv:2204.11501 [pdf, other]

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data

Authors: Fuchuan Tong, Siqi Zheng, Min Zhang, Yafeng Chen, Hongbin Suo, Qingyang Hong, Lin Li

Abstract: Unsupervised clustering on speakers is becoming increasingly important for its potential uses in semi-supervised learning. In reality, we are often presented with enormous amounts of unlabeled data from multi-party meetings and discussions. An effective unsupervised clustering approach would allow us to significantly increase the amount of training data without additional costs for annotations. Recently, methods based on graph convolutional networks (GCN) have received growing attention for unsupervised clustering, as these methods exploit the connectivity patterns between nodes to improve learning performance. In this work, we present a GCN-based approach for semi-supervised learning. Given a pre-trained embedding extractor, a graph convolutional network is trained on the labeled data and clusters unlabeled data with "pseudo-labels". We present a self-correcting training mechanism that iteratively runs the cluster-train-correct process on pseudo-labels. We show that this proposed approach effectively uses unlabeled data and improves speaker recognition accuracy.

Submitted 25 April, 2022; originally announced April 2022.

Comments: Accepted by ICASSP 2022

arXiv:2202.05744 [pdf, other]

The xmuspeech system for multi-channel multi-party meeting transcription challenge

Authors: Jie Wang, Yuji Liu, Binling Wang, Yiming Zhi, Song Li, Shipeng Xia, Jiayang Zhang, Lin Li, Qingyang Hong, Feng Tong

Abstract: This paper describes the system developed by the XMUSPEECH team for the Multi-channel Multi-party Meeting Transcription Challenge (M2MeT). For the speaker diarization task, we propose a multi-channel speaker diarization system that obtains spatial information of the speaker by Direction of Arrival (DOA) technology. Speaker-spatial embedding is generated by x-vector and s-vector derived from Filter-and-Sum Beamforming (FSB), which makes the embedding more robust. Specifically, we propose a novel multi-channel sequence-to-sequence neural network architecture named Discriminative Multi-stream Neural Network (DMSNet), which consists of an Attention Filter-and-Sum block (AFSB) and a Conformer encoder. We explore DMSNet to address the overlapped speech problem on multi-channel audio. Compared with the LSTM-based OSD module, we achieve a decrease of 10.1% in Detection Error Rate (DetER). By using the DMSNet-based OSD module, the DER of the cluster-based diarization system decreases significantly from 13.44% to 7.63%. Our best fusion system achieves 7.09% and 9.80% diarization error rate (DER) on the evaluation set and test set, respectively.

Submitted 11 February, 2022; originally announced February 2022.

arXiv:2109.02549 [pdf, ps, other]

XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021

Authors: Jie Wang, Fuchuang Tong, Zhicong Chen, Lin Li, Qingyang Hong, Haodong Zhou

Abstract: This paper describes the XMUSPEECH speaker recognition and diarisation systems for the VoxCeleb Speaker Recognition Challenge 2021. For track 2, we evaluate two systems including ResNet34-SE and ECAPA-TDNN. For track 4, an important part of our system is the VAD module, which greatly improves the performance. Our best submission on track 4 obtained DER 5.54% and JER 27.11% on the evaluation set, while the performance on the development set is DER 2.92% and JER 20.84%.

Submitted 6 September, 2021; originally announced September 2021.

arXiv:2108.12533 [pdf, other]

doi 10.1007/978-3-030-87202-1_25

Image-to-Graph Convolutional Network for Deformable Shape Reconstruction from a Single Projection Image

Authors: M. Nakao, F. Tong, M. Nakamura, T. Matsuda

Abstract: Shape reconstruction of deformable organs from two-dimensional X-ray images is a key technology for image-guided intervention. In this paper, we propose an image-to-graph convolutional network (IGCN) for deformable shape reconstruction from a single-viewpoint projection image. The IGCN learns the relationship between shape/deformation variability and the deep image features based on a deformation mapping scheme. In experiments targeting the respiratory motion of abdominal organs, we confirmed that the proposed framework with a regularized loss function can reconstruct liver shapes from a single digitally reconstructed radiograph with a mean distance error of 3.6 mm.

Submitted 31 August, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

Comments: This paper will appear in MICCAI 2021

Journal ref: International Conference on Medical Image Computing and Computer Assisted Intervention 2021 (MICCAI)

arXiv:2106.10569 [pdf, other]

Enhancing and Localizing Surface Wave Propagation with Reconfigurable Surfaces

Authors: Z. Chu, K. K. Wong, K. F. Tong

Abstract: As an attempt to develop a reconfigurable surface architecture that can use liquid metal such as Galinstan to shape surface channels on demand, this paper considers a punctured surface where cavities are evenly distributed and can be filled with liquid metal potentially via digitally controlled pumps. In this paper, we look at the benefits of such architecture in terms of surface-wave signal enhancement and isolation, and examine how various system parameters impact the performance using full wave 3-dimensional electromagnetic simulations. It is shown that extraordinary signal shaping can be obtained.

Submitted 19 June, 2021; originally announced June 2021.

Comments: Submitted to 2021 IEEE International Symposium on Antennas and Propagation, Taipei, Taiwan, 2021

arXiv:2105.10810 [pdf, other]

Reconfigurable Surface Wave Platform Using Fluidic Conductive Structures

Authors: Z. Chu, K. K. Wong, K. F. Tong

Abstract: Surface waves inherently have less propagation loss as they adhere to the surface and minimize unwanted dissipation in space. Recently, they have found applications in network-on-chip (NoC) communications and intelligent surface aided mobile networked communications. This paper puts forward a reconfigurable surface wave platform (RSWP) that utilizes liquid metal to produce highly energy-efficient and adaptive pathways for surface wave transmission. Our simulation results illustrate that the proposed RSWP using Galinstan can obtain a $25{\rm dB}$ gain in the electric field for a propagation distance of $35λ$ at $30{\rm GHz}$ where $λ$ denotes the wavelength. Moreover, less than $1{\rm dB}$ loss is observed even at a distance of $50λ$, and a pathway with right-angled turns can also be created with only a $3.5{\rm dB}$ loss at the turn.

Submitted 22 May, 2021; originally announced May 2021.

Comments: Submitted to 2021 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, Singapore, 2021

arXiv:2009.03048 [pdf, other]

On global convergence of area-constrained formations of hierarchical multi-agent systems

Authors: Toshiharu Sugie, Fei Tong, Brian D. O. Anderson, Zhiyong Sun

Abstract: This paper is concerned with a formation shaping problem for point agents in a two-dimensional space, where control avoids the possibility of reflection ambiguities. One solution for this type of problem was given first for three or four agents by considering a potential function which consists of both the distance error and the signed area terms. Then, by exploiting a hierarchical control strategy with such potential functions, the method was recently extended to any number of agents. However, a specific gain on the signed area term must be employed there, and it does not guarantee the global convergence. To overcome this issue, this paper provides a necessary and sufficient condition for the global convergence, subject to the constraint that the desired formation consists of isosceles triangles only. This clarifies the admissible range of the gain on the signed area for this case. In addition, as for formations consisting of arbitrary triangles, it is shown when high gain on the signed area is admissible for global convergence.

Submitted 4 September, 2020; originally announced September 2020.

Comments: Accepted in the 59th IEEE Conference on Decision and Control (CDC 2020). arXiv admin note: text overlap with arXiv:1808.00312

arXiv:2008.13598 [pdf]

A Dataset of Human Motion Status Using IR-UWB Through-wall Radar

Authors: Zhengliang Zhu, Degui Yang, Junchao Zhang, Feng Tong

Abstract: Ultra-wideband (UWB) through-wall radar has a wide range of applications in non-contact human information detection and monitoring. With the integration of machine learning technology, its potential prospects include the physiological monitoring of patients in the hospital environment and the daily monitoring at home. Although many target detection methods of UWB through-wall radar based on machine learning have been proposed, there is a lack of an open-source dataset to evaluate the performance of the algorithm. This published dataset was measured by an impulse radio UWB (IR-UWB) through-wall radar system. Three test subjects were measured in different environments and several defined motion statuses. Using the presented dataset, we propose a human-motion-status recognition method using a convolutional neural network (CNN); the detailed dataset partition method and recognition process flow are given. On the well-trained network, the recognition accuracy of testing data for three kinds of motion statuses is higher than 99.7%. The dataset presented in this paper considers a simple environment. Therefore, we call on all organizations in the UWB radar field to cooperate to build open-source datasets to further promote the development of UWB through-wall radar.

Submitted 31 August, 2020; originally announced August 2020.

Comments: 13 figures

MSC Class: H.4

Showing 1–16 of 16 results for author: Tong, F