Skip to main content

Showing 1–36 of 36 results for author: Zuo, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.03063  [pdf, ps, other

    cs.IT eess.SP

    Joint Beamforming for NOMA Assisted Pinching Antenna Systems (PASS)

    Authors: Deqiao Gan, Xiaoxia Xu, Jiakuo Zuo, Xiaohu Ge, Yuanwei Liu

    Abstract: Pinching antenna system (PASS) configures the positions of pinching antennas (PAs) along dielectric waveguides to change both large-scale fading and small-scale scattering, which is known as pinching beamforming. A novel non-orthogonal multiple access (NOMA) assisted PASS framework is proposed for downlink multi-user multiple-input multiple-output (MIMO) communications. The transmit power minimiza… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2506.01014  [pdf, ps, other

    eess.AS cs.SD

    Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching

    Authors: Jialong Zuo, Shengpeng Ji, Minghui Fang, Mingze Li, Ziyue Jiang, Xize Cheng, Xiaoda Yang, Chen Feiyang, Xinyu Duan, Zhou Zhao

    Abstract: Zero-Shot Voice Conversion (VC) aims to transform the source speaker's timbre into an arbitrary unseen one while retaining speech content. Most prior work focuses on preserving the source's prosody, while fine-grained timbre information may leak through prosody, and transferring target prosody to synthesized speech is rarely studied. In light of this, we propose R-VC, a rhythm-controllable and eff… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: Accepted by ACL 2025 (Main Conference)

  3. arXiv:2505.24496  [pdf, other

    eess.AS

    Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation

    Authors: Wenrui Liu, Qian Chen, Wen Wang, Yafeng Chen, Jin Xu, Zhifang Guo, Guanrou Yang, Weiqin Li, Xiaoda Yang, Tao Jin, Minghui Fang, Jialong Zuo, Bai Jionghao, Zemin Liu

    Abstract: Neural audio codecs, used as speech tokenizers, have demonstrated remarkable potential in the field of speech generation. However, to ensure high-fidelity audio reconstruction, neural audio codecs typically encode audio into long sequences of speech tokens, posing a significant challenge for downstream language models in long-context modeling. We observe that speech token sequences exhibit short-r… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  4. arXiv:2505.09558  [pdf, other

    eess.AS cs.AI cs.LG cs.MM cs.SD

    WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

    Authors: Shengpeng Ji, Tianle Liang, Yangzhuo Li, Jialong Zuo, Minghui Fang, Jinzheng He, Yifu Chen, Zhengqing Liu, Ziyue Jiang, Xize Cheng, Siqi Zheng, Jin Xu, Junyang Lin, Zhou Zhao

    Abstract: End-to-end spoken dialogue models such as GPT-4o-audio have recently garnered significant attention in the speech domain. However, the evaluation of spoken dialogue models' conversational performance has largely been overlooked. This is primarily due to the intelligent chatbots convey a wealth of non-textual information which cannot be easily measured using text-based language models like ChatGPT.… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  5. arXiv:2504.20653  [pdf, other

    cs.SE eess.SY

    ComplexVCoder: An LLM-Driven Framework for Systematic Generation of Complex Verilog Code

    Authors: Jian Zuo, Junzhe Liu, Xianyong Wang, Yicheng Liu, Navya Goli, Tong Xu, Hao Zhang, Umamaheswara Rao Tida, Zhenge Jia, Mengying Zhao

    Abstract: Recent advances have demonstrated the promising capabilities of large language models (LLMs) in generating register-transfer level (RTL) code, such as Verilog. However, existing LLM-based frameworks still face significant challenges in accurately handling the complexity of real-world RTL designs, particularly those that are large-scale and involve multi-level module instantiations. To address this… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  6. arXiv:2502.18924  [pdf, other

    eess.AS cs.LG cs.SD

    MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

    Authors: Ziyue Jiang, Yi Ren, Ruiqi Li, Shengpeng Ji, Boyang Zhang, Zhenhui Ye, Chen Zhang, Bai Jionghao, Xiaoda Yang, Jialong Zuo, Yu Zhang, Rui Liu, Xiang Yin, Zhou Zhao

    Abstract: While recent zero-shot text-to-speech (TTS) models have significantly improved speech quality and expressiveness, mainstream systems still suffer from issues related to speech-text alignment modeling: 1) models without explicit speech-text alignment modeling exhibit less robustness, especially for hard sentences in practical applications; 2) predefined alignment-based models suffer from naturalnes… ▽ More

    Submitted 28 March, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  7. arXiv:2502.05471  [pdf, other

    cs.SD eess.AS

    Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model

    Authors: Jialong Zuo, Shengpeng Ji, Minghui Fang, Ziyue Jiang, Xize Cheng, Qian Yang, Wenrui Liu, Guangyan Zhang, Zehai Tu, Yiwen Guo, Zhou Zhao

    Abstract: This paper introduces PFlow-VC, a conditional flow matching voice conversion model that leverages fine-grained discrete pitch tokens and target speaker prompt information for expressive voice conversion (VC). Previous VC works primarily focus on speaker conversion, with further exploration needed in enhancing expressiveness (such as prosody and emotion) for timbre conversion. Unlike previous metho… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: Accepted by ICASSP 2025

  8. arXiv:2412.13917  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Speech Watermarking with Discrete Intermediate Representations

    Authors: Shengpeng Ji, Ziyue Jiang, Jialong Zuo, Minghui Fang, Yifu Chen, Tao Jin, Zhou Zhao

    Abstract: Speech watermarking techniques can proactively mitigate the potential harmful consequences of instant voice cloning techniques. These techniques involve the insertion of signals into speech that are imperceptible to humans but can be detected by algorithms. Previous approaches typically embed watermark messages into continuous space. However, intuitively, embedding watermark information into robus… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI 2025

  9. arXiv:2411.13577  [pdf, other

    eess.AS cs.CL cs.LG cs.MM cs.SD

    WavChat: A Survey of Spoken Dialogue Models

    Authors: Shengpeng Ji, Yifu Chen, Minghui Fang, Jialong Zuo, Jingyu Lu, Hanting Wang, Ziyue Jiang, Long Zhou, Shujie Liu, Xize Cheng, Xiaoda Yang, Zehan Wang, Qian Yang, Jian Li, Yidi Jiang, Jingzhen He, Yunfei Chu, Jin Xu, Zhou Zhao

    Abstract: Recent advancements in spoken dialogue models, exemplified by systems like GPT-4o, have captured significant attention in the speech domain. Compared to traditional three-tier cascaded spoken dialogue models that comprise speech recognition (ASR), large language models (LLMs), and text-to-speech (TTS), modern spoken dialogue models exhibit greater intelligence. These advanced spoken dialogue model… ▽ More

    Submitted 26 November, 2024; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: 60 papes, working in progress

  10. arXiv:2410.21269  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

    Authors: Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Ziyang Ma, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao

    Abstract: The scaling up has brought tremendous success in the fields of vision and language in recent years. When it comes to audio, however, researchers encounter a major challenge in scaling up the training data, as most natural audio contains diverse interfering signals. To address this limitation, we introduce Omni-modal Sound Separation (OmniSep), a novel framework capable of isolating clean soundtrac… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: Working in progress

  11. arXiv:2408.16532  [pdf, other

    eess.AS cs.LG cs.MM cs.SD eess.SP

    WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

    Authors: Shengpeng Ji, Ziyue Jiang, Wen Wang, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Xize Cheng, Zehan Wang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Zhou Zhao

    Abstract: Language models have been effectively applied to modeling natural signals, such as images, video, speech, and audio. A crucial component of these models is the codec tokenizer, which compresses high-dimensional natural signals into lower-dimensional discrete tokens. In this paper, we introduce WavTokenizer, which offers several advantages over previous SOTA acoustic codec models in the audio domai… ▽ More

    Submitted 25 February, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: Accepted by ICLR 2025

  12. arXiv:2407.14006  [pdf, other

    eess.AS cs.SD

    MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis

    Authors: Qian Yang, Jialong Zuo, Zhe Su, Ziyue Jiang, Mingze Li, Zhou Zhao, Feiyang Chen, Zhefeng Wang, Baoxing Huai

    Abstract: We introduce an open source high-quality Mandarin TTS dataset MSceneSpeech (Multiple Scene Speech Dataset), which is intended to provide resources for expressive speech synthesis. MSceneSpeech comprises numerous audio recordings and texts performed and recorded according to daily life scenarios. Each scenario includes multiple speakers and a diverse range of prosodic styles, making it suitable for… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted by INTERSPEECH 2024

  13. arXiv:2406.01205  [pdf, ps, other

    eess.AS cs.LG cs.SD

    ControlSpeech: Towards Simultaneous and Independent Zero-shot Speaker Cloning and Zero-shot Language Style Control

    Authors: Shengpeng Ji, Qian Chen, Wen Wang, Jialong Zuo, Minghui Fang, Ziyue Jiang, Hai Huang, Zehan Wang, Xize Cheng, Siqi Zheng, Zhou Zhao

    Abstract: In this paper, we present ControlSpeech, a text-to-speech (TTS) system capable of fully cloning the speaker's voice and enabling arbitrary control and adjustment of speaking style. Prior zero-shot TTS models only mimic the speaker's voice without further control and adjustment capabilities while prior controllable TTS models cannot perform speaker-specific voice generation. Therefore, ControlSpeec… ▽ More

    Submitted 4 June, 2025; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: ACL 2025 Main

  14. arXiv:2405.00842  [pdf, other

    math.ST cs.IT cs.LG eess.SP math.OC

    Quickest Change Detection with Confusing Change

    Authors: Yu-Zhen Janice Chen, Jinhang Zuo, Venugopal V. Veeravalli, Don Towsley

    Abstract: In the problem of quickest change detection (QCD), a change occurs at some unknown time in the distribution of a sequence of independent observations. This work studies a QCD problem where the change is either a bad change, which we aim to detect, or a confusing change, which is not of our interest. Our objective is to detect a bad change as quickly as possible while avoiding raising a false alarm… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  15. arXiv:2403.05557  [pdf, other

    eess.SP cs.HC cs.LG

    Re-thinking Human Activity Recognition with Hierarchy-aware Label Relationship Modeling

    Authors: Jingwei Zuo, Hakim Hacid

    Abstract: Human Activity Recognition (HAR) has been studied for decades, from data collection, learning models, to post-processing and result interpretations. However, the inherent hierarchy in the activities remains relatively under-explored, despite its significant impact on model performance and interpretation. In this paper, we propose H-HAR, by rethinking the HAR tasks from a fresh perspective by delvi… ▽ More

    Submitted 11 February, 2024; originally announced March 2024.

    Comments: Accepted by PAKDD 2024

  16. arXiv:2402.12208  [pdf, ps, other

    eess.AS cs.SD

    Language-Codec: Bridging Discrete Codec Representations and Speech Language Models

    Authors: Shengpeng Ji, Minghui Fang, Jialong Zuo, Ziyue Jiang, Dingdong Wang, Hanting Wang, Hai Huang, Zhou Zhao

    Abstract: In recent years, large language models have achieved significant success in generative tasks related to speech, audio, music, and other signal domains. A crucial element of these models is the discrete acoustic codecs, which serve as an intermediate representation replacing the mel-spectrogram. However, there exist several gaps between discrete codecs and downstream speech language models. Specifi… ▽ More

    Submitted 4 June, 2025; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: ACL 2025 Main

  17. arXiv:2402.09378  [pdf, other

    eess.AS cs.SD

    MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech

    Authors: Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao

    Abstract: Zero-shot text-to-speech (TTS) has gained significant attention due to its powerful voice cloning capabilities, requiring only a few seconds of unseen speaker voice prompts. However, all previous work has been developed for cloud-based systems. Taking autoregressive models as an example, although these approaches achieve high-fidelity voice cloning, they fall short in terms of inference speed, mod… ▽ More

    Submitted 2 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 (Main Conference)

  18. arXiv:2311.03175  [pdf

    eess.IV cs.CV

    Frequency Domain Decomposition Translation for Enhanced Medical Image Translation Using GANs

    Authors: Zhuhui Wang, Jianwei Zuo, Xuliang Deng, Jiajia Luo

    Abstract: Medical Image-to-image translation is a key task in computer vision and generative artificial intelligence, and it is highly applicable to medical image analysis. GAN-based methods are the mainstream image translation methods, but they often ignore the variation and distribution of images in the frequency domain, or only take simple measures to align high-frequency information, which can lead to d… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  19. TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models

    Authors: Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

    Abstract: Recently, there has been a growing interest in the field of controllable Text-to-Speech (TTS). While previous studies have relied on users providing specific style factor values based on acoustic knowledge or selecting reference speeches that meet certain requirements, generating speech solely from natural text prompts has emerged as a new challenge for researchers. This challenge arises due to th… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Journal ref: 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  20. arXiv:2308.11691  [pdf, other

    eess.SP cs.AI cs.LG

    Practical Insights on Incremental Learning of New Human Physical Activity on the Edge

    Authors: George Arvanitakis, Jingwei Zuo, Mthandazo Ndhlovu, Hakim Hacid

    Abstract: Edge Machine Learning (Edge ML), which shifts computational intelligence from cloud-based systems to edge devices, is attracting significant interest due to its evident benefits including reduced latency, enhanced data privacy, and decreased connectivity reliance. While these advantages are compelling, they introduce unique challenges absent in traditional cloud-based approaches. In this paper, we… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted by DSAA 2023 (Industrial Track)

  21. Intelligent Communication Planning for Constrained Environmental IoT Sensing with Reinforcement Learning

    Authors: Yi Hu, Jinhang Zuo, Bob Iannucci, Carlee Joe-Wong

    Abstract: Internet of Things (IoT) technologies have enabled numerous data-driven mobile applications and have the potential to significantly improve environmental monitoring and hazard warnings through the deployment of a network of IoT sensors. However, these IoT devices are often power-constrained and utilize wireless communication schemes with limited bandwidth. Such power constraints limit the amount o… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: To be published in the 20th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON 2023)

  22. arXiv:2306.12572  [pdf, ps, other

    cs.CV eess.SP

    Uniqueness of Iris Pattern Based on AR Model

    Authors: Katelyn M. Hampel, Jinyu Zuo, Priyanka Das, Natalia A. Schmid, Stephanie Schuckers, Joseph Skufca, Matthew C. Valenti

    Abstract: The assessment of iris uniqueness plays a crucial role in analyzing the capabilities and limitations of iris recognition systems. Among the various methodologies proposed, Daugman's approach to iris uniqueness stands out as one of the most widely accepted. According to Daugman, uniqueness refers to the iris recognition system's ability to enroll an increasing number of classes while maintaining a… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  23. arXiv:2305.13612  [pdf, other

    cs.SD eess.AS

    FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models

    Authors: Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao

    Abstract: Stutter removal is an essential scenario in the field of speech editing. However, when the speech recording contains stutters, the existing text-based speech editing approaches still suffer from: 1) the over-smoothing problem in the edited speech; 2) lack of robustness due to the noise introduced by stutter; 3) to remove the stutters, users are required to determine the edited region manually. To… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023 (Findings)

  24. arXiv:2304.13185  [pdf, ps, other

    cs.IT eess.SP

    Non-Orthogonal Multiple Access For Near-Field Communications

    Authors: Jiakuo Zuo, Xidong Mu, Yuanwei Liu

    Abstract: The novel concept of near-field non-orthogonal multiple access (NF-NOMA) communications is proposed. The near-filed beamfocusing enables NOMA to be carried out in both angular and distance domains. Two novel frameworks are proposed, namely, single-location-beamfocusing NF-NOMA (SLB-NF-NOMA) and multiple-location-beamfocusing NF-NOMA (MLB-NF-NOMA). 1) For SLB-NF-NOMA, two NOMA users in the same ang… ▽ More

    Submitted 18 May, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  25. Data-driven prognostics based on time-frequency analysis and symbolic recurrent neural network for fuel cells under dynamic load

    Authors: Chu Wang, Manfeng Dou, Zhongliang Li, Rachid Outbib, Dongdong Zhao, Jian Zuo, Yuanlin Wang, Bin Liang, Peng Wang

    Abstract: Data-centric prognostics is beneficial to improve the reliability and safety of proton exchange membrane fuel cell (PEMFC). For the prognostics of PEMFC operating under dynamic load, the challenges come from extracting degradation features, improving prediction accuracy, expanding the prognostics horizon, and reducing computational cost. To address these issues, this work proposes a data-driven PE… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

  26. arXiv:2302.04432  [pdf, ps, other

    eess.SP

    Active Simultaneously Transmitting and Reflecting (STAR)-RISs: Modelling and Analysis

    Authors: Jiaqi Xu, Jiakuo Zuo, Joey Tianyi Zhou, Yuanwei Liu

    Abstract: A hardware model for active simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs) is proposed consisting of reflection-type amplifiers. The amplitude gains of the STAR element are derived for both coupled and independent phase-shift scenarios. Based on the proposed hardware model, an active STAR-RIS-aided two-user downlink communication system is investigated.… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 13 pages

  27. arXiv:2210.02725  [pdf, ps, other

    cs.IT eess.SP

    Exploiting NOMA and RIS in Integrated Sensing and Communication

    Authors: Jiakuo Zuo, Yuanwei Liu, Chenming Zhu, Yixuan Zou, Dengyin Zhang, Naofal Al-Dhahir

    Abstract: A novel integrated sensing and communication (ISAC) system is proposed, where a dual-functional base station is utilized to transmit the superimposed non-orthogonal multiple access (NOMA) communication signal for serving communication users and sensing targets simultaneously. Furthermore, a new reconfigurable intelligent surface (RIS)-aided-sensing structure is also proposed to address the signifi… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2208.04786

  28. arXiv:2208.04786  [pdf, ps, other

    cs.IT eess.SP

    Reconfigurable Intelligent Surface Assisted NOMA Empowered Integrated Sensing and Communication

    Authors: Jiakuo Zuo, Yuanwei Liu

    Abstract: This paper exploits the potential of reconfigurable intelligent surface (RIS) to improve radar sensing in a non-orthogonal multiple access (NOMA) empowered integrated sensing and communication (NOMA-ISAC) network. The objective is to maximize the minimum radar beampattern gain by jointly optimizing the active beamforming, power allocation coefficients and passive beamforming. To tackle the formula… ▽ More

    Submitted 28 September, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

  29. arXiv:2112.05240  [pdf

    q-bio.QM cs.LG eess.IV physics.med-ph

    Label-free virtual HER2 immunohistochemical staining of breast tissue using deep learning

    Authors: Bijie Bai, Hongda Wang, Yuzhu Li, Kevin de Haan, Francesco Colonnese, Yujie Wan, Jingyi Zuo, Ngan B. Doan, Xiaoran Zhang, Yijie Zhang, Jingxi Li, Wenjie Dong, Morgan Angus Darrow, Elham Kamangar, Han Sung Lee, Yair Rivenson, Aydogan Ozcan

    Abstract: The immunohistochemical (IHC) staining of the human epidermal growth factor receptor 2 (HER2) biomarker is widely practiced in breast tissue analysis, preclinical studies and diagnostic decisions, guiding cancer treatment and investigation of pathogenesis. HER2 staining demands laborious tissue treatment and chemical processing performed by a histotechnologist, which typically takes one day to pre… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: 26 Pages, 5 Figures

    Journal ref: BME Frontiers (2022)

  30. Applications And Potentials Of Intelligent Swarms For Magnetospheric Studies

    Authors: Raj Thilak Rajan, Shoshana Ben-Maor, Shaziana Kaderali, Calum Turner, Mohammed Milhim, Catrina Melograna, Dawn Haken, Gary Paul, Vedant, Sreekumar V, Johannes Weppler, Yosephine Gumulya, Riccardo Bunt, Asia Bulgarini, Maurice Marnat, Kadri Bussov, Frederick Pringle, Jusha Ma, Rushanka Amrutkar, Miguel Coto, Jiang He, Zijian Shi, Shahd Hayder, Dina Saad Fayez Jaber, Junchao Zuo , et al. (10 additional authors not shown)

    Abstract: Earth's magnetosphere is vital for today's technologically dependent society. To date, numerous design studies have been conducted and over a dozen science missions have own to study the magnetosphere. However, a majority of these solutions relied on large monolithic satellites, which limited the spatial resolution of these investigations, as did the technological limitations of the past. To count… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: Accepted in Acta Astronautica

    Journal ref: Acta Astronautica, Elsevier, 2021

  31. Joint Design for Simultaneously Transmitting And Reflecting (STAR) RIS Assisted NOMA Systems

    Authors: Jiakuo Zuo, Yuanwei Liu, Zhiguo Ding, Lingyang Song, H. Vincent Poor

    Abstract: Different from traditional reflection-only reconfigurable intelligent surfaces (RISs), simultaneously transmitting and reflecting RISs (STAR-RISs) represent a novel technology, which extends the half-space coverage to full-space coverage by simultaneously transmitting and reflecting incident signals. STAR-RISs provide new degrees-of-freedom (DoF) for manipulating signal propagation. Motivated by t… ▽ More

    Submitted 17 September, 2022; v1 submitted 5 June, 2021; originally announced June 2021.

  32. arXiv:2012.10111  [pdf, ps, other

    cs.IT eess.SP

    Reconfigurable Intelligent Surface Enhanced NOMA Assisted Backscatter Communication System

    Authors: Jiakuo Zuo, Yuanwei Liu, Liang Yang, Lingyang Song, Ying-Chang Liang

    Abstract: A reconfigurable intelligent surface (RIS) enhanced non-orthogonal multiple access assisted backscatter communication (RIS-NOMABC) system is considered. A joint optimization problem over power reflection coefficients and phase shifts is formulated. To solve this non-convex problem, a low complexity algorithm is proposed by invoking the alternative optimization, successive convex approximation and… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

  33. arXiv:2011.08975  [pdf, ps, other

    eess.SP cs.IT math.OC

    Reconfigurable Intelligent Surface Assisted Cooperative Non-orthogonal Multiple Access Systems

    Authors: Jiakuo Zuo, Yuanwei Liu, Naofal Al-Dhahir

    Abstract: This paper considers downlink of reconfigurable intelligent surface (RIS) assisted cooperative non-orthogonal multiple access (CNOMA) systems. Our objective is to minimize the total transmit power by jointly optimizing the active beamforming vectors, transmit-relaying power, and RIS phase shifts. The formulated problem is a mixed-integer nonlinear programming (MINLP) problem. To tackle this proble… ▽ More

    Submitted 5 December, 2020; v1 submitted 17 November, 2020; originally announced November 2020.

  34. arXiv:2005.01562  [pdf, ps, other

    cs.IT eess.SP

    Intelligent Reflecting Surface Enhanced Millimeter-Wave NOMA Systems

    Authors: Jiakuo Zuo, Yuanwei Liu, Ertugrul Basar, Octavia A. Dobre

    Abstract: In this paper, a downlink intelligent reflecting surface (IRS) enhanced millimeter-wave (mmWave) non-orthogonal multiple access (NOMA) system is considered. A joint optimization problem over active beamforming, passive beamforming and power allocation is formulated. Due to the highly coupled variables, the formulated optimization problem is non-convex. To solve this problem, an alternative optimiz… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

  35. arXiv:2003.08923  [pdf, other

    eess.SP

    RF-Rhythm: Secure and Usable Two-Factor RFID Authentication

    Authors: Jiawei Li, Chuyu Wang, Ang Li, Dianqi Han, Yan Zhang, Jinhang Zuo, Rui Zhang, Lei Xie, Yanchao Zhang

    Abstract: Passive RFID technology is widely used in user authentication and access control. We propose RF-Rhythm, a secure and usable two-factor RFID authentication system with strong resilience to lost/stolen/cloned RFID cards. In RF-Rhythm, each legitimate user performs a sequence of taps on his/her RFID card according to a self-chosen secret melody. Such rhythmic taps can induce phase changes in the back… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: To appear at IEEE INFOCOM 2020

  36. arXiv:2002.01765  [pdf, ps, other

    cs.IT eess.SP

    Resource Allocation in Intelligent Reflecting Surface Assisted NOMA Systems

    Authors: Jiakuo Zuo, Yuanwei Liu, Zhijin Qin, Naofal Al-Dhahir

    Abstract: This paper investigates the downlink communications of intelligent reflecting surface (IRS) assisted non-orthogonal multiple access (NOMA) systems. To maximize the system throughput, we formulate a joint optimization problem over the channel assignment, decoding order of NOMA users, power allocation, and reflection coefficients. The formulated problem is proved to be NP-hard. To tackle this proble… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.