-
FDMA-Based Passive Multiple Users SWIPT Utilizing Resonant Beams
Authors:
Yixuan Guo,
Mingliang Xiong,
Wen Fang,
Qingwei Jiang,
Qingwen Liu,
Gang Yan
Abstract:
The rapid development of IoT technology has led to a shortage of spectrum resources and energy, giving rise to simultaneous wireless information and power transfer (SWIPT) technology. However, traditional multiple input multiple output (MIMO)-based SWIPT faces challenges in target detection. We have designed a passive multi-user resonant beam system (MU-RBS) that can achieve efficient power transf…
▽ More
The rapid development of IoT technology has led to a shortage of spectrum resources and energy, giving rise to simultaneous wireless information and power transfer (SWIPT) technology. However, traditional multiple input multiple output (MIMO)-based SWIPT faces challenges in target detection. We have designed a passive multi-user resonant beam system (MU-RBS) that can achieve efficient power transfer and communication through adaptive beam alignment. The frequency division multiple access (FDMA) is employed in the downlink (DL) channel, while frequency conversion is utilized in the uplink (UL) channel to avoid echo interference and co-channel interference, and the system architecture design and corresponding mathematical model are presented. The simulation results show that MU-RBS can achieve adaptive beam-forming without the target transmitting pilot signals, has high directivity, and as the number of iterations increases, the power transmission efficiency, signal-to-noise ratio and spectral efficiency of the UL and DL are continuously optimized until the system reaches the optimal state.
△ Less
Submitted 24 May, 2025;
originally announced May 2025.
-
Movable Antenna Aided Multiuser Communications: Antenna Position Optimization Based on Statistical Channel Information
Authors:
Ge Yan,
Lipeng Zhu,
Rui Zhang
Abstract:
The movable antenna (MA) technology has attracted great attention recently due to its promising capability in improving wireless channel conditions by flexibly adjusting antenna positions. To reap maximal performance gains of MA systems, existing works mainly focus on MA position optimization to cater to the instantaneous channel state information (CSI). However, the resulting real-time antenna mo…
▽ More
The movable antenna (MA) technology has attracted great attention recently due to its promising capability in improving wireless channel conditions by flexibly adjusting antenna positions. To reap maximal performance gains of MA systems, existing works mainly focus on MA position optimization to cater to the instantaneous channel state information (CSI). However, the resulting real-time antenna movement may face challenges in practical implementation due to the additional time overhead and energy consumption required, especially in fast time-varying channel scenarios. To address this issue, we propose in this paper a new approach to optimize the MA positions based on the users' statistical CSI over a large timescale. In particular, we propose a general field response based statistical channel model to characterize the random channel variations caused by the local movement of users. Based on this model, a two-timescale optimization problem is formulated to maximize the ergodic sum rate of multiple users, where the precoding matrix and the positions of MAs at the base station (BS) are optimized based on the instantaneous and statistical CSI, respectively. To solve this non-convex optimization problem, a log-barrier penalized gradient ascent algorithm is developed to optimize the MA positions, where two methods are proposed to approximate the ergodic sum rate and its gradients with different complexities. Finally, we present simulation results to evaluate the performance of the proposed design and algorithms based on practical channels generated by ray-tracing. The results verify the performance advantages of MA systems compared to their fixed-position antenna (FPA) counterparts in terms of long-term rate improvement, especially for scenarios with more diverse channel power distributions in the angular domain.
△ Less
Submitted 28 February, 2025;
originally announced February 2025.
-
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
Authors:
Ailin Huang,
Boyong Wu,
Bruce Wang,
Chao Yan,
Chen Hu,
Chengli Feng,
Fei Tian,
Feiyu Shen,
Jingbei Li,
Mingrui Chen,
Peng Liu,
Ruihang Miao,
Wang You,
Xi Chen,
Xuerui Yang,
Yechang Huang,
Yuxiang Zhang,
Zheng Gong,
Zixin Zhang,
Hongyu Zhou,
Jianjian Sun,
Brian Li,
Chengting Feng,
Changyi Wan,
Hanpeng Hu
, et al. (120 additional authors not shown)
Abstract:
Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in voice data collection, weakness in dynamic control, and limited intelligence. To address these challenges, this paper introduces Step-Audio, the first production-ready open-source solution. Key contribu…
▽ More
Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in voice data collection, weakness in dynamic control, and limited intelligence. To address these challenges, this paper introduces Step-Audio, the first production-ready open-source solution. Key contributions include: 1) a 130B-parameter unified speech-text multi-modal model that achieves unified understanding and generation, with the Step-Audio-Chat version open-sourced; 2) a generative speech data engine that establishes an affordable voice cloning framework and produces the open-sourced lightweight Step-Audio-TTS-3B model through distillation; 3) an instruction-driven fine control system enabling dynamic adjustments across dialects, emotions, singing, and RAP; 4) an enhanced cognitive architecture augmented with tool calling and role-playing abilities to manage complex tasks effectively. Based on our new StepEval-Audio-360 evaluation benchmark, Step-Audio achieves state-of-the-art performance in human evaluations, especially in terms of instruction following. On open-source benchmarks like LLaMA Question, shows 9.3% average performance improvement, demonstrating our commitment to advancing the development of open-source multi-modal language technologies. Our code and models are available at https://github.com/stepfun-ai/Step-Audio.
△ Less
Submitted 18 February, 2025; v1 submitted 17 February, 2025;
originally announced February 2025.
-
Integrated Sensing and Communication System Based on Radio Frequency Resonance Beam
Authors:
Yixuan Guo,
Shuaifan Xia,
Mingliang Xiong,
Qingwen Liu,
Wen Fang,
Qingwei Jiang,
Gang Yan,
Jiangchuan Mu
Abstract:
To address the complex beam control in traditional multiple-input multiple-output (MIMO) systems, researchers have proposed adaptive beam alignment using retro-directive antenna (RDA) arrays. This approach creates echo resonance between the base station (BS) and user equipment (UE), significantly reducing computational load. However, conventional resonant beam systems (RBS) suffer from echo interf…
▽ More
To address the complex beam control in traditional multiple-input multiple-output (MIMO) systems, researchers have proposed adaptive beam alignment using retro-directive antenna (RDA) arrays. This approach creates echo resonance between the base station (BS) and user equipment (UE), significantly reducing computational load. However, conventional resonant beam systems (RBS) suffer from echo interference due to the shared uplink and downlink frequency. Therefore, this paper proposes an innovative resonance beam-based integrated sensing and communication (RB-ISAC) system designed for efficient passive sensing and bidirectional communication. In this system, the UE operates passively, with both the BS and UE utilizing a phase conjugation and frequency conversion structure to decouple uplink and downlink carrier frequencies, ensuring continuous electromagnetic wave oscillation between the two ends. Effective compensation for signal propagation loss enables resonance after multiple oscillations. At this point, the beam's field forms a low-diffraction-loss, highly focused pattern, automatically aligning the transmitter and receiver. This enables high-precision passive positioning alongside robust uplink and downlink communication. Simulation results demonstrate the proposed system achieves resonance within multiple iterations, supporting uplink and downlink communication up to 5 m, and enabling passive direction of arrival (DOA) estimation with an error under 2$^\circ$ .
△ Less
Submitted 5 June, 2025; v1 submitted 30 January, 2025;
originally announced January 2025.
-
Resonant Beam Multi-Target DOA Estimation
Authors:
Yixuan Guo,
Qingwei Jiang,
Mingliang Xiong,
Wen Fang,
Mingqing Liu,
Qingqing Zhang,
Qingwen Liu,
Gang Yan
Abstract:
With the increasing demand for internet of things (IoT) applications, especially for location-based services, how to locate passive mobile targets (MTs) with minimal beam control has become a challenge. Resonant beam systems are considered promising IoT technologies with advantages such as beam self-alignment and energy concentration. To establish a resonant system in the radio frequency (RF) band…
▽ More
With the increasing demand for internet of things (IoT) applications, especially for location-based services, how to locate passive mobile targets (MTs) with minimal beam control has become a challenge. Resonant beam systems are considered promising IoT technologies with advantages such as beam self-alignment and energy concentration. To establish a resonant system in the radio frequency (RF) band and achieve multi-target localization, this paper designs a multi-target resonant system architecture, allowing a single base station (BS) to independently connect with multiple MTs. By employing a retro-directive array, a multi-channel cyclic model is established to realize one-to-many electromagnetic wave propagation and MT direction-of-arrival (DOA) estimation through echo resonance. Simulation results show that the proposed system supports resonant establishment between the BS and multiple MTs. This helps the BS to still have high DOA estimation accuracy in the face of multiple passive MTs, and can ensure that the DOA error is less than 1 degree within a range of 6 meters at a 50degree field of view, with higher accuracy than active beamforming localization systems.
△ Less
Submitted 13 February, 2025; v1 submitted 20 December, 2024;
originally announced December 2024.
-
Resonant Beam Enabled Passive 3D Positioning
Authors:
Yixuan Guo,
Mingliang Xiong,
Wen Fang,
Qingwei Jiang,
Mengyuan Xu,
Qingwen Liu,
Gang Yan
Abstract:
With the rapid development of the internet of things (IoT), location-based services are becoming increasingly prominent in various aspects of social life, and accurate location information is crucial. However, RF-based indoor positioning solutions are severely limited in positioning accuracy due to signal transmission losses and directional difficulties, and optical indoor positioning methods requ…
▽ More
With the rapid development of the internet of things (IoT), location-based services are becoming increasingly prominent in various aspects of social life, and accurate location information is crucial. However, RF-based indoor positioning solutions are severely limited in positioning accuracy due to signal transmission losses and directional difficulties, and optical indoor positioning methods require high propagation conditions. To achieve higher accuracy in indoor positioning, we utilize the principle of resonance to design a triangulation-based resonant beam positioning system (TRBPS) in the RF band. The proposed system employs phase-conjugation antenna arrays and resonance mechanism to achieve energy concentration and beam self-alignment, without requiring active signals from the target for positioning and complex beam control algorithms. Numerical evaluations indicate that TRBPS can achieve millimeter-level accuracy within a range of 3.6 m without the need for additional embedded systems.
△ Less
Submitted 20 December, 2024;
originally announced December 2024.
-
Resonant Beam Enabled DoA Estimation in Passive Positioning System
Authors:
Yixuan Guo,
Qingwei Jiang,
Mengyuan Xu,
Wen Fang,
Qingwen Liu,
Gang Yan,
Qunhui Yang,
Hai Lu
Abstract:
The rapid advancement of the next generation of communications and internet of things (IoT) technologies has made the provision of location-based services for diverse devices an increasingly pressing necessity. Localizing devices with/without intelligent computing abilities, including both active and passive devices is essential, especially in indoor scenarios. For traditional RF positioning syste…
▽ More
The rapid advancement of the next generation of communications and internet of things (IoT) technologies has made the provision of location-based services for diverse devices an increasingly pressing necessity. Localizing devices with/without intelligent computing abilities, including both active and passive devices is essential, especially in indoor scenarios. For traditional RF positioning systems, aligning transmission signals and dealing with signal interference in complex environments are inevitable challenges. Therefore, this paper proposed a new passive positioning system, the RF-band resonant beam positioning system (RF-RBPS), which achieves energy concentration and beam alignment by amplifying echoes between the base station (BS) and the passive target (PT), without the need for complex channel estimation and time-consuming beamforming and provides high-precision direction of arrival (DoA) estimation for battery-free targets using the resonant mechanism. The direction information of the PT is estimated using the multiple signal classification (MUSIC) algorithm at the end of BS. The feasibility of the proposed system is validated through theoretical analysis and simulations. Results indicate that the proposed RF-RBPS surpasses RF-band active positioning system (RF-APS) in precision, achieving millimeter-level precision at 2m within an elevation angle of 35$^\circ$, and an error of less than 3cm at 2.5m within an elevation angle of 35$^\circ$.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
Power Measurement Enabled Channel Autocorrelation Matrix Estimation for IRS-Assisted Wireless Communication
Authors:
Ge Yan,
Lipeng Zhu,
Rui Zhang
Abstract:
By reconfiguring wireless channels via passive signal reflection, intelligent reflecting surface (IRS) can bring significant performance enhancement for wireless communication systems. However, such performance improvement generally relies on the knowledge of channel state information (CSI) for IRS-involved links. Prior works on IRS CSI acquisition mainly estimate IRS-cascaded channels based on th…
▽ More
By reconfiguring wireless channels via passive signal reflection, intelligent reflecting surface (IRS) can bring significant performance enhancement for wireless communication systems. However, such performance improvement generally relies on the knowledge of channel state information (CSI) for IRS-involved links. Prior works on IRS CSI acquisition mainly estimate IRS-cascaded channels based on the extra pilot signals received at the users/base station (BS) with time-varying IRS reflections, which, however, needs to modify the existing channel training/estimation protocols of wireless systems. To address this issue, we propose in this paper a new channel estimation scheme for IRS-assisted communication systems based on the received signal power measured at the user terminal, which is practically attainable without the need of changing the current protocol. Due to the lack of signal phase information in measured power, the autocorrelation matrix of the BS-IRS-user cascaded channel is estimated by solving an equivalent rank-minimization problem. To this end, a low-rank-approaching (LRA) algorithm is proposed by employing the fractional programming and alternating optimization techniques. To reduce computational complexity, an approximate LRA (ALRA) algorithm is also developed. Furthermore, these two algorithms are extended to be robust against the receiver noise and quantization error in power measurement. Simulation results are provided to verify the effectiveness of the proposed channel estimation algorithms as well as the IRS passive reflection design based on the estimated channel autocorrelation matrix.
△ Less
Submitted 20 July, 2024;
originally announced July 2024.
-
Receiver Resonant Frequency Adaptive Tracking in Wireless Power Transfer Systems Using Primary Variable Capacitor
Authors:
Chang Liu,
Wei Han,
Guangyu Yan,
Bowang Zhang,
Chunlin Li
Abstract:
Parameter variations within the resonant network of wireless power transfer (WPT) systems can cause drift in the resonant frequency, leading to a detuned system that requires higher power capacity and experiences reduced transfer efficiency. To address this issue, this paper presents an adaptive online receiver resonant frequency tracking scheme based solely on primary-side detection. The proposed…
▽ More
Parameter variations within the resonant network of wireless power transfer (WPT) systems can cause drift in the resonant frequency, leading to a detuned system that requires higher power capacity and experiences reduced transfer efficiency. To address this issue, this paper presents an adaptive online receiver resonant frequency tracking scheme based solely on primary-side detection. The proposed method effectively compensates for parameter fluctuations in both primary and secondary resonators. The core of this approach is a switch-controlled capacitor (SCC) with a control angle calibrated during a system self-check process prior to high-power charging. Additionally, a two-step perturb-and-observe algorithm has been developed to perform online tracking while minimizing disturbances to the output power. Post-tracking, zero-voltage switching (ZVS) conditions can be achieved within a specified detuning range. To validate the efficacy of the proposed system, a 200W experimental platform was constructed. The measured results demonstrate that resonance is consistently maintained within the 79-90 kHz frequency range, as specified by the SAE J2954 standard. The maximum frequency tracking error and efficiency increase are 0.7 kHz and 9%, respectively. Notably, the tracking process is completed in less than 1 ms.
△ Less
Submitted 12 September, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Multitask frame-level learning for few-shot sound event detection
Authors:
Liang Zou,
Genwei Yan,
Ruoyu Wang,
Jun Du,
Meng Lei,
Tian Gao,
Xin Fang
Abstract:
This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples. However, prevailing methods methods in few-shot SED predominantly rely on segment-level predictions, which often providing detailed, fine-grained predictions, particularly for events of brief duration. Although frame-level prediction strategies have been…
▽ More
This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples. However, prevailing methods methods in few-shot SED predominantly rely on segment-level predictions, which often providing detailed, fine-grained predictions, particularly for events of brief duration. Although frame-level prediction strategies have been proposed to overcome these limitations, these strategies commonly face difficulties with prediction truncation caused by background noise. To alleviate this issue, we introduces an innovative multitask frame-level SED framework. In addition, we introduce TimeFilterAug, a linear timing mask for data augmentation, to increase the model's robustness and adaptability to diverse acoustic environments. The proposed method achieves a F-score of 63.8%, securing the 1st rank in the few-shot bioacoustic event detection category of the Detection and Classification of Acoustic Scenes and Events Challenge 2023.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Power-Flow-Embedded Projection Conic Matrix Completion for Low-Observable Distribution Systems
Authors:
Xuzhuo Wang,
Guoan Yan,
Zhengshuo Li
Abstract:
A low-observable distribution system has insufficient measurements for conventional weighted least square state estimators. Matrix completion state estimators have been suggested, but their computational times could be prohibitive. To resolve this problem, a novel and efficient power-flow-embedded projection conic matrix completion method customized for low-observable distribution systems is propo…
▽ More
A low-observable distribution system has insufficient measurements for conventional weighted least square state estimators. Matrix completion state estimators have been suggested, but their computational times could be prohibitive. To resolve this problem, a novel and efficient power-flow-embedded projection conic matrix completion method customized for low-observable distribution systems is proposed in this letter. This method can yield more accurate state estimations (2-fold improvement) in a much shorter time (5% or less) than other methods. Case studies on different-scale systems have demonstrated the efficacy of the proposed method when applied to low-observable distribution system state estimation problems.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Outlier-immune Data-driven Linear Power Flow Model Construction via Mixed-Integer Programming
Authors:
Guoan Yan,
Zhengshuo Li
Abstract:
The common approaches to construct a data-driven linear power flow (DD-LPF) model cannot completely eliminate the adverse impacts of outliers in a training dataset. In this letter, a novel outlier-immune DD-LPF model construction method via mixed-integer programming is presented for automatically and optimally identifying outliers to form a more accurate LPF model. Two acceleration solution strate…
▽ More
The common approaches to construct a data-driven linear power flow (DD-LPF) model cannot completely eliminate the adverse impacts of outliers in a training dataset. In this letter, a novel outlier-immune DD-LPF model construction method via mixed-integer programming is presented for automatically and optimally identifying outliers to form a more accurate LPF model. Two acceleration solution strategies are further suggested to reduce the computational time. Case studies demonstrate the superior accuracy and comparable computational time of the proposed method when compared to three common approaches.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Channel Autocorrelation Estimation for IRS-Aided Wireless Communications Based on Power Measurements
Authors:
Ge Yan,
Lipeng Zhu,
Rui Zhang
Abstract:
Intelligent reflecting surface (IRS) can bring significant performance enhancement for wireless communication systems by reconfiguring wireless channels via passive signal reflection. However, such performance improvement generally relies on the knowledge of channel state information (CSI) for IRS-associated links. Prior IRS channel estimation strategies mainly estimate IRS-cascaded channels based…
▽ More
Intelligent reflecting surface (IRS) can bring significant performance enhancement for wireless communication systems by reconfiguring wireless channels via passive signal reflection. However, such performance improvement generally relies on the knowledge of channel state information (CSI) for IRS-associated links. Prior IRS channel estimation strategies mainly estimate IRS-cascaded channels based on the excessive pilot signals received at the users/base station (BS) with time-varying IRS reflections, which, however, are not compatible with the existing channel training/estimation protocol for cellular networks. To address this issue, we propose in this paper a new channel estimation scheme for IRS-assisted communication systems based on the received signal power measured at the user, which is practically attainable without the need of changing the current protocol. Specifically, due to the lack of signal phase information in power measurements, the autocorrelation matrix of the BS-IRS-user cascaded channel is estimated by solving equivalent matrix-rank-minimization problems. Simulation results are provided to verify the effectiveness of the proposed channel estimation algorithm as well as the IRS passive reflection design based on the estimated channel autocorrelation matrix.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report
Authors:
Andrey Ignatov,
Radu Timofte,
Jin Zhang,
Feng Zhang,
Gaocheng Yu,
Zhe Ma,
Hongbin Wang,
Minsu Kwon,
Haotian Qian,
Wentao Tong,
Pan Mu,
Ziping Wang,
Guangjing Yan,
Brian Lee,
Lei Fei,
Huaijin Chen,
Hyebin Cho,
Byeongjun Kwon,
Munchurl Kim,
Mingyang Qian,
Huixin Ma,
Yanan Li,
Xiaotao Wang,
Lei Lei
Abstract:
As mobile cameras with compact optics are unable to produce a strong bokeh effect, lots of interest is now devoted to deep learning-based solutions for this task. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based bokeh effect rendering approach that can run on modern smartphone GPUs using TensorFlow Lite. The participants were provided with a large-scale EBB!…
▽ More
As mobile cameras with compact optics are unable to produce a strong bokeh effect, lots of interest is now devoted to deep learning-based solutions for this task. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based bokeh effect rendering approach that can run on modern smartphone GPUs using TensorFlow Lite. The participants were provided with a large-scale EBB! bokeh dataset consisting of 5K shallow / wide depth-of-field image pairs captured using the Canon 7D DSLR camera. The runtime of the resulting models was evaluated on the Kirin 9000's Mali GPU that provides excellent acceleration results for the majority of common deep learning ops. A detailed description of all models developed in this challenge is provided in this paper.
△ Less
Submitted 7 November, 2022;
originally announced November 2022.
-
Network Topology Inference based on Timing Meta-Data
Authors:
Wenbo Du,
Tao Tan,
Haijun Zhang,
Xianbin Cao,
Gang Yan,
Osvaldo Simeone
Abstract:
Consider a processor having access only to meta-data consisting of the timings of data packets and acknowledgment (ACK) packets from all nodes in a network. The meta-data report the source node of each packet, but not the destination nodes or the contents of the packets. The goal of the processor is to infer the network topology based solely on such information. Prior work leveraged causality metr…
▽ More
Consider a processor having access only to meta-data consisting of the timings of data packets and acknowledgment (ACK) packets from all nodes in a network. The meta-data report the source node of each packet, but not the destination nodes or the contents of the packets. The goal of the processor is to infer the network topology based solely on such information. Prior work leveraged causality metrics to identify which links are active. If the data timings and ACK timings of two nodes -- say node 1 and node 2, respectively -- are causally related, this may be taken as evidence that node 1 is communicating to node 2 (which sends back ACK packets to node 1). This paper starts with the observation that packet losses can weaken the causality relationship between data and ACK timing streams. To obviate this problem, a new Expectation Maximization (EM)-based algorithm is introduced -- EM-causality discovery algorithm (EM-CDA) -- which treats packet losses as latent variables. EM-CDA iterates between the estimation of packet losses and the evaluation of causality metrics. The method is validated through extensive experiments in wireless sensor networks on the NS-3 simulation platform.
△ Less
Submitted 11 October, 2022;
originally announced October 2022.
-
Analysis Method of Strapdown Inertial Navigation Error Distribution Based on Covariance Matrix Decomposition
Authors:
Xiaokang Yang,
Gongmin Yan,
Fan Liu,
Bofan Guan,
Sihai Li
Abstract:
Error distribution analysis is an important assistant technology for the research of SINS(Strapdown Inertial Navigation System). Error distribution result can provide the contribution of different errors to final navigation error, which is helpful for modifying and optimizing SINS. To realize decomposing the navigation error into parts that caused by each error source, the SINS error state space m…
▽ More
Error distribution analysis is an important assistant technology for the research of SINS(Strapdown Inertial Navigation System). Error distribution result can provide the contribution of different errors to final navigation error, which is helpful for modifying and optimizing SINS. To realize decomposing the navigation error into parts that caused by each error source, the SINS error state space model is established and covariance matrix is decomposed according to error sources. The proposed error distribution analysis method based on 34-dimension SINS error model can quantitatively analyze the contribution to the end navigation error of initial errors, IMU(Inertial Measurement Unit) bias, IMU scale factor errors, mounting errors of gyroscopes and accelerometers, and IMU stochastic errors. The simulations in static condition and single axis rotation condition indict that the distribution result of proposed analysis method accords with the law of error propagation. After trajectory determined, the corresponding error distribution result will be calculated with the proposed method. Compared with the Monte-Carlo method and other method based on covariance matrix, the proposed method uses more complete error model, considers the interaction effect of error sources and can be easily realized with less computation.
△ Less
Submitted 8 September, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
A GNSS Aided Initial Alignment Method for MEMS-IMU Based on Backtracking Algorithm and Backward Filtering
Authors:
Xiaokang Yang,
Gongmin Yan,
Hao Yang,
Sihai Li
Abstract:
To obtain a high-accuracy position with SINS(Strapdown Inertial Navigation System), initial alignment needs to determine initial attitude rapidly and accurately. High-accuracy grade IMU(Inertial Measurement Uint) can obtain the initial attitude indenpendently, however, the low-accuracy grade gyroscope doesn't adapt to determine the heading angle, hence the initial attitude matrix will not be obtai…
▽ More
To obtain a high-accuracy position with SINS(Strapdown Inertial Navigation System), initial alignment needs to determine initial attitude rapidly and accurately. High-accuracy grade IMU(Inertial Measurement Uint) can obtain the initial attitude indenpendently, however, the low-accuracy grade gyroscope doesn't adapt to determine the heading angle, hence the initial attitude matrix will not be obtained. If using large misalignment angle model to estiamting heading angle, the convergence time will become much longer. For solving these two problems, a novel alignment algorithm combined backtracking algorithm and reverse navigation updating method with GNSS(Global Navigation Satellite System) aiding is proposed herein. The simulation and land vehicle test were finished to evaluate the alignment accuracy of the proposed algorithm. The horizontal misalignment is less than 2.3 arcmin and the heading misalignment is less than 10.1 arcmin in test. The proposed algorithm is a feasible and practical alignment method for low-cost IMU to obtain initial attitude in short term and large misalignment condition aided by GNSS.
△ Less
Submitted 28 February, 2022;
originally announced February 2022.
-
Information Prebuilt Recurrent Reconstruction Network for Video Super-Resolution
Authors:
Shuyun Wang,
Ming Yu,
Cuihong Xue,
Yingchun Guo,
Gang Yan
Abstract:
The video super-resolution (VSR) method based on the recurrent convolutional network has strong temporal modeling capability for video sequences. However, the temporal receptive field of different recurrent units in the unidirectional recurrent network is unbalanced. Earlier reconstruction frames receive less spatio-temporal information, resulting in fuzziness or artifacts. Although the bidirectio…
▽ More
The video super-resolution (VSR) method based on the recurrent convolutional network has strong temporal modeling capability for video sequences. However, the temporal receptive field of different recurrent units in the unidirectional recurrent network is unbalanced. Earlier reconstruction frames receive less spatio-temporal information, resulting in fuzziness or artifacts. Although the bidirectional recurrent network can alleviate this problem, it requires more memory space and fails to perform many tasks with low latency requirements. To solve the above problems, we propose an end-to-end information prebuilt recurrent reconstruction network (IPRRN), consisting of an information prebuilt network (IPNet) and a recurrent reconstruction network (RRNet). By integrating sufficient information from the front of the video to build the hidden state needed for the initially recurrent unit to help restore the earlier frames, the information prebuilt network balances the input information difference at different time steps. In addition, we demonstrate an efficient recurrent reconstruction network, which outperforms the existing unidirectional recurrent schemes in all aspects. Many experiments have verified the effectiveness of the network we propose, which can effectively achieve better quantitative and qualitative evaluation performance compared to the existing state-of-the-art methods.
△ Less
Submitted 2 February, 2023; v1 submitted 10 December, 2021;
originally announced December 2021.
-
A New Entity Extraction Method Based on Machine Reading Comprehension
Authors:
Xiaobo Jiang,
Kun He,
Jiajun He,
Guangyu Yan
Abstract:
Entity extraction is a key technology for obtaining information from massive texts in natural language processing. The further interaction between them does not meet the standards of human reading comprehension, thus limiting the understanding of the model, and also the omission or misjudgment of the answer (ie the target entity) due to the reasoning question. An effective MRC-based entity extract…
▽ More
Entity extraction is a key technology for obtaining information from massive texts in natural language processing. The further interaction between them does not meet the standards of human reading comprehension, thus limiting the understanding of the model, and also the omission or misjudgment of the answer (ie the target entity) due to the reasoning question. An effective MRC-based entity extraction model-MRC-I2DP, which uses the proposed gated attention-attracting mechanism to adjust the restoration of each part of the text pair, creating problems and thinking for multi-level interactive attention calculations to increase the target entity It also uses the proposed 2D probability coding module, TALU function and mask mechanism to strengthen the detection of all possible targets of the target, thereby improving the probability and accuracy of prediction. Experiments have proved that MRC-I2DP represents an overall state-of-the-art model in 7 from the scientific and public domains, achieving a performance improvement of up to compared to the model model in F1.
△ Less
Submitted 20 August, 2021; v1 submitted 13 August, 2021;
originally announced August 2021.
-
Magnetic Resonance Spectroscopy Deep Learning Denoising Using Few In Vivo Data
Authors:
Dicheng Chen,
Wanqi Hu,
Huiting Liu,
Yirong Zhou,
Tianyu Qiu,
Yihui Huang,
Zi Wang,
Jiazheng Wang,
Liangjie Lin,
Zhigang Wu,
Hao Chen,
Xi Chen,
Gen Yan,
Di Guo,
Jianzhong Lin,
Xiaobo Qu
Abstract:
Magnetic Resonance Spectroscopy (MRS) is a noninvasive tool to reveal metabolic information. One challenge of 1H-MRS is the low Signal-Noise Ratio (SNR). To improve the SNR, a typical approach is to perform Signal Averaging (SA) with M repeated samples. The data acquisition time, however, is increased by M times accordingly, and a complete clinical MRS scan takes approximately 10 minutes at a comm…
▽ More
Magnetic Resonance Spectroscopy (MRS) is a noninvasive tool to reveal metabolic information. One challenge of 1H-MRS is the low Signal-Noise Ratio (SNR). To improve the SNR, a typical approach is to perform Signal Averaging (SA) with M repeated samples. The data acquisition time, however, is increased by M times accordingly, and a complete clinical MRS scan takes approximately 10 minutes at a common setting M=128. Recently, deep learning has been introduced to improve the SNR but most of them use the simulated data as the training set. This may hinder the MRS applications since some potential differences, such as acquisition system imperfections, and physiological and psychologic conditions may exist between the simulated and in vivo data. Here, we proposed a new scheme that purely used the repeated samples of realistic data. A deep learning model, Refusion Long Short-Term Memory (ReLSTM), was designed to learn the mapping from the low SNR time-domain data (24 SA) to the high SNR one (128 SA). Experiments on the in vivo brain spectra of 7 healthy subjects, 2 brain tumor patients and 1 cerebral infarction patient showed that only using 20% repeated samples, the denoised spectra by ReLSTM could provide comparable estimated concentrations of metabolites to 128 SA. Compared with the state-of-the-art low-rank denoising method, the ReLSTM achieved the lower relative error and the Cramér-Rao lower bounds in quantifying some important biomarkers. In summary, ReLSTM can perform high-fidelity denoising of the spectra under fast acquisition (24 SA), which would be valuable to MRS clinical studies.
△ Less
Submitted 25 October, 2022; v1 submitted 26 January, 2021;
originally announced January 2021.
-
VC-Net: Deep Volume-Composition Networks for Segmentation and Visualization of Highly Sparse and Noisy Image Data
Authors:
Yifan Wang,
Guoli Yan,
Haikuan Zhu,
Sagar Buch,
Ying Wang,
Ewart Mark Haacke,
Jing Hua,
Zichun Zhong
Abstract:
The motivation of our work is to present a new visualization-guided computing paradigm to combine direct 3D volume processing and volume rendered clues for effective 3D exploration such as extracting and visualizing microstructures in-vivo. However, it is still challenging to extract and visualize high fidelity 3D vessel structure due to its high sparseness, noisiness, and complex topology variati…
▽ More
The motivation of our work is to present a new visualization-guided computing paradigm to combine direct 3D volume processing and volume rendered clues for effective 3D exploration such as extracting and visualizing microstructures in-vivo. However, it is still challenging to extract and visualize high fidelity 3D vessel structure due to its high sparseness, noisiness, and complex topology variations. In this paper, we present an end-to-end deep learning method, VC-Net, for robust extraction of 3D microvasculature through embedding the image composition, generated by maximum intensity projection (MIP), into 3D volume image learning to enhance the performance. The core novelty is to automatically leverage the volume visualization technique (MIP) to enhance the 3D data exploration at deep learning level. The MIP embedding features can enhance the local vessel signal and are adaptive to the geometric variability and scalability of vessels, which is crucial in microvascular tracking. A multi-stream convolutional neural network is proposed to learn the 3D volume and 2D MIP features respectively and then explore their inter-dependencies in a joint volume-composition embedding space by unprojecting the MIP features into 3D volume embedding space. The proposed framework can better capture small / micro vessels and improve vessel connectivity. To our knowledge, this is the first deep learning framework to construct a joint convolutional embedding space, where the computed vessel probabilities from volume rendering based 2D projection and 3D volume can be explored and integrated synergistically. Experimental results are compared with the traditional 3D vessel segmentation methods and the deep learning state-of-the-art on public and real patient (micro-)cerebrovascular image datasets. Our method demonstrates the potential in a powerful MR arteriogram and venogram diagnosis of vascular diseases.
△ Less
Submitted 14 September, 2020;
originally announced September 2020.
-
Inverse NN Modelling of a Piezoelectric Stage with Dominant Variable
Authors:
Gangfeng Yan,
Hang Jian Soo,
Khalid Abidi,
Jian-Xin Xu
Abstract:
This paper presents an approach for developing a neural network inverse model of a piezoelectric positioning stage, which exhibits rate-dependent, asymmetric hysteresis. It is shown that using both the velocity and the acceleration as inputs results in over-fitting. To overcome this, a rough analytical model of the actuator is derived and by measuring its response to excitation, the velocity signa…
▽ More
This paper presents an approach for developing a neural network inverse model of a piezoelectric positioning stage, which exhibits rate-dependent, asymmetric hysteresis. It is shown that using both the velocity and the acceleration as inputs results in over-fitting. To overcome this, a rough analytical model of the actuator is derived and by measuring its response to excitation, the velocity signal is identified as the dominant variable. By setting the input space of the neural network to only the dominant variable, an inverse model with good predictive ability is obtained. Training of the network is accomplished using the Levenberg-Marquardt algorithm. Finally, the effectiveness of the proposed approach is experimentally demonstrated.
△ Less
Submitted 30 July, 2020;
originally announced July 2020.
-
A Practical Application of Sliding Mode Control in the Motion Control of a High Precision Piezoelectric Motor
Authors:
Gangfeng Yan,
Khalid Abidi
Abstract:
This paper proposes a practical implementation of sliding mode control (SMC) that utilizes partial modeling compensation. Sliding mode control is well known for its effectiveness as a model free control approach, however, its effectiveness is degraded if there is a constraint on the control gain or limitation on the switching frequency in digital implementation. This is especially the case with sy…
▽ More
This paper proposes a practical implementation of sliding mode control (SMC) that utilizes partial modeling compensation. Sliding mode control is well known for its effectiveness as a model free control approach, however, its effectiveness is degraded if there is a constraint on the control gain or limitation on the switching frequency in digital implementation. This is especially the case with systems that involve static friction. This approach aims to enhance the effectiveness of SMC by partial model compensation. Rigorous stability proofs are presented to validate the approach. In addition, experiments are carried out on a piezoelectric motor driven linear stage and the control approach is compared with the Discrete-Time Integral Sliding Mode (DTISMC) approach proposed by Abidi et al. as well as conventional PI control. The results show that the proposed control approach has a superior performance in comparison to the other approaches tested.
△ Less
Submitted 30 July, 2020;
originally announced July 2020.
-
Spatiotemporal Flexible Sparse Reconstruction for Rapid Dynamic Contrast-enhanced MRI
Authors:
Yuhan Hu,
Xinlin Zhang,
Li Feng,
Dicheng Chen,
Zhiping Yan,
Xiaoyong Shen,
Gen Yan,
Lin Ou-yang,
Xiaobo Qu
Abstract:
Dynamic Contrast-enhanced magnetic resonance imaging (DCE-MRI) is a tissue perfusion imaging technique. Some versatile free-breathing DCE-MRI techniques combining compressed sensing (CS) and parallel imaging with golden-angle radial sampling have been developed to improve motion robustness with high spatial and temporal resolution. These methods have demonstrated good diagnostic performance in cli…
▽ More
Dynamic Contrast-enhanced magnetic resonance imaging (DCE-MRI) is a tissue perfusion imaging technique. Some versatile free-breathing DCE-MRI techniques combining compressed sensing (CS) and parallel imaging with golden-angle radial sampling have been developed to improve motion robustness with high spatial and temporal resolution. These methods have demonstrated good diagnostic performance in clinical setting, but the reconstruction quality will degrade at high acceleration rates and overall reconstruction time remains long. In this paper, we proposed a new parallel CS reconstruction model for DCE-MRI that enforces flexible weighted sparse constraint along both spatial and temporal dimensions. Weights were introduced to flexibly adjust the importance of time and space sparsity, and we derived a fast thresholding algorithm which was proven to be simple and efficient for solving the proposed reconstruction model. Results on in vivo liver DCE datasets show that the proposed method outperforms the state-of-the-art methods in terms of visual image quality assessment and reconstruction speed without introducing significant temporal blurring.
△ Less
Submitted 6 July, 2020;
originally announced July 2020.
-
AxTrain: Hardware-Oriented Neural Network Training for Approximate Inference
Authors:
Xin He,
Liu Ke,
Wenyan Lu,
Guihai Yan,
Xuan Zhang
Abstract:
The intrinsic error tolerance of neural network (NN) makes approximate computing a promising technique to improve the energy efficiency of NN inference. Conventional approximate computing focuses on balancing the efficiency-accuracy trade-off for existing pre-trained networks, which can lead to suboptimal solutions. In this paper, we propose AxTrain, a hardware-oriented training framework to facil…
▽ More
The intrinsic error tolerance of neural network (NN) makes approximate computing a promising technique to improve the energy efficiency of NN inference. Conventional approximate computing focuses on balancing the efficiency-accuracy trade-off for existing pre-trained networks, which can lead to suboptimal solutions. In this paper, we propose AxTrain, a hardware-oriented training framework to facilitate approximate computing for NN inference. Specifically, AxTrain leverages the synergy between two orthogonal methods---one actively searches for a network parameters distribution with high error tolerance, and the other passively learns resilient weights by numerically incorporating the noise distributions of the approximate hardware in the forward pass during the training phase. Experimental results from various datasets with near-threshold computing and approximation multiplication strategies demonstrate AxTrain's ability to obtain resilient neural network parameters and system energy efficiency improvement.
△ Less
Submitted 21 May, 2018;
originally announced May 2018.
-
Controlling complex networks: How much energy is needed?
Authors:
Gang Yan,
Jie Ren,
Ying-Cheng Lai,
Choy-Heng Lai,
Baowen Li
Abstract:
The outstanding problem of controlling complex networks is relevant to many areas of science and engineering, and has the potential to generate technological breakthroughs as well. We address the physically important issue of the energy required for achieving control by deriving and validating scaling laws for the lower and upper energy bounds. These bounds represent a reasonable estimate of the e…
▽ More
The outstanding problem of controlling complex networks is relevant to many areas of science and engineering, and has the potential to generate technological breakthroughs as well. We address the physically important issue of the energy required for achieving control by deriving and validating scaling laws for the lower and upper energy bounds. These bounds represent a reasonable estimate of the energy cost associated with control, and provide a step forward from the current research on controllability toward ultimate control of complex networked dynamical systems.
△ Less
Submitted 12 April, 2012; v1 submitted 11 April, 2012;
originally announced April 2012.