Showing 1–50 of 52 results for author: Weng, R

  1. arXiv:2505.21279  [pdf, ps, other]

    cs.AI

    XBOUND: Exploring the Capability Boundaries of Device-Control Agents through Trajectory Tree Exploration

    Authors: Shaoqing Zhang, Kehai Chen, Zhuosheng Zhang, Rumei Li, Rongxiang Weng, Yang Xiang, Liqiang Nie, Min Zhang

    Abstract: Recent advancements in vision-language models (VLMs) have spurred increased interest in Device-Control Agents (DC agents), such as utilizing in-the-wild device control to manage graphical user interfaces. Conventional methods for assessing the capabilities of DC agents, such as computing step-wise action accuracy and overall task success rates, provide a macroscopic view of DC agents' performance;…

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2505.10597  [pdf, other]

    cs.LG cs.AI cs.CL

    Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

    Authors: Jiazheng Zhang, Wenqing Jing, Zizhuo Zhang, Zhiheng Xi, Shihan Dou, Rongxiang Weng, Jiahuan Li, Jingang Wang, Mingxu Chai, Shibo Hong, Tao Gui, Qi Zhang

    Abstract: Reward models (RMs) play a pivotal role in aligning large language models (LLMs) with human values. However, noisy preferences in human feedback can lead to reward misgeneralization - a phenomenon where reward models learn spurious correlations or overfit to noisy preferences, which poses important challenges to the generalization of RMs. This paper systematically analyzes the characteristics of p…

    Submitted 18 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

  3. arXiv:2504.18818  [pdf, other]

    cs.LG

    Frequency-Integrated Transformer for Arbitrary-Scale Super-Resolution

    Authors: Xufei Wang, Fei Ge, Jinchen Zhu, Mingjian Zhang, Qi Wu, Jifeng Ren, Shizhuang Weng

    Abstract: Methods based on implicit neural representation have demonstrated remarkable capabilities in arbitrary-scale super-resolution (ASSR) tasks, but they neglect the potential value of the frequency domain, leading to sub-optimal performance. We propose a novel network called Frequency-Integrated Transformer (FIT) to incorporate and utilize frequency information to enhance ASSR performance. FIT employ…

    Submitted 26 April, 2025; originally announced April 2025.

    Comments: 11 pages, 8 figures

  4. arXiv:2504.01801  [pdf, other]

    cs.CL

    Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

    Authors: Zhijun Wang, Jiahuan Li, Hao Zhou, Rongxiang Weng, Jingang Wang, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang

    Abstract: Large language models (LLMs) exhibit remarkable multilingual capabilities despite the extreme language imbalance in the pre-training data. In this paper, we closely examine the reasons behind this phenomenon, focusing on the pre-training corpus. We find that the existence of code-switching, alternating between different languages within a context, is key to multilingual capabilities. We conduct an…

    Submitted 22 April, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

  5. arXiv:2502.05878  [pdf, ps, other]

    cs.CL

    Retrieval-augmented Large Language Models for Financial Time Series Forecasting

    Authors: Mengxi Xiao, Zihao Jiang, Lingfei Qian, Zhengyu Chen, Yueru He, Yijing Xu, Yuecheng Jiang, Dong Li, Ruey-Ling Weng, Min Peng, Jimin Huang, Sophia Ananiadou, Qianqian Xie

    Abstract: Accurately forecasting stock price movements is critical for informed financial decision-making, supporting applications ranging from algorithmic trading to risk management. However, this task remains challenging due to the difficulty of retrieving subtle yet high-impact patterns from noisy financial time-series data, where conventional retrieval methods, whether based on generic language models o…

    Submitted 6 June, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: 11 pages, 4 figures

  6. arXiv:2502.05559  [pdf, other]

    eess.SP

    Channel Estimation for RIS-Aided MU-MIMO mmWave Systems with Practical Hybrid Architecture

    Authors: Liuchang Zhuo, Cunhua Pan, Hong Ren, Ruisong Weng, Shi Jin, A. Lee Swindlehurst, Jiangzhou Wang

    Abstract: This paper proposes a correlation-based three-stage channel estimation strategy with low pilot overhead for reconfigurable intelligent surface (RIS)-aided millimeter wave (mmWave) multi-user (MU) MIMO systems, in which both users and base station (BS) are equipped with a hybrid RF architecture. In Stage I, all users jointly transmit pilots and recover the uncompressed received signals to estimate…

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 13 pages, 7 figures, 1 table

  7. arXiv:2502.05551  [pdf, ps, other]

    cs.CL

    FRAME: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy

    Authors: Xuemiao Zhang, Feiyu Duan, Liangyu Xu, Yongwei Zhou, Sirui Wang, Rongxiang Weng, Jingang Wang, Xunliang Cai

    Abstract: Large language models (LLMs) have significantly advanced human language understanding and generation, with pretraining data quality and organization being crucial to their performance. Multi-stage pretraining is a promising approach, but existing methods often lack quantitative criteria for data partitioning and instead rely on intuitive heuristics. In this paper, we propose the novel Four-quadRAn…

    Submitted 31 May, 2025; v1 submitted 8 February, 2025; originally announced February 2025.

  8. arXiv:2502.00761  [pdf, other]

    cs.CL

    FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training

    Authors: Liangyu Xu, Xuemiao Zhang, Feiyu Duan, Sirui Wang, Rongxiang Weng, Jingang Wang, Xunliang Cai

    Abstract: Selecting high-quality data can improve the pretraining efficiency of large language models (LLMs). Existing methods generally rely on heuristic techniques or single quality signals, limiting their ability to evaluate data quality comprehensively. In this work, we propose FIRE, a flexible and scalable framework for integrating multiple data quality raters, which allows for a comprehensive assessme…

    Submitted 22 May, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

    Comments: 21 pages, 11 figures

  9. arXiv:2501.13126  [pdf, other]

    cs.CL cs.AI

    Preference Curriculum: LLMs Should Always Be Pretrained on Their Preferred Data

    Authors: Xuemiao Zhang, Liangyu Xu, Feiyu Duan, Yongwei Zhou, Sirui Wang, Rongxiang Weng, Jingang Wang, Xunliang Cai

    Abstract: Large language models (LLMs) generally utilize a consistent data distribution throughout the pretraining process. However, as the model's capability improves, it is intuitive that its data preferences dynamically change, indicating the need for pretraining with different data at various training stages. To achieve it, we propose the Perplexity Difference (PD) based Preference Curriculum learning (…

    Submitted 17 February, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: 18 pages, 13 figures

  10. arXiv:2412.10423  [pdf, other]

    cs.CL cs.AI

    Look Before You Leap: Enhancing Attention and Vigilance Regarding Harmful Content with GuidelineLLM

    Authors: Shaoqing Zhang, Zhuosheng Zhang, Kehai Chen, Rongxiang Weng, Muyun Yang, Tiejun Zhao, Min Zhang

    Abstract: Despite being empowered with alignment mechanisms, large language models (LLMs) are increasingly vulnerable to emerging jailbreak attacks that can compromise their alignment mechanisms. This vulnerability poses significant risks to real-world applications. Existing work faces challenges in both training efficiency and generalization capabilities (i.e., Reinforcement Learning from Human Feedback an…

    Submitted 14 April, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: AAAI 2025

  11. arXiv:2412.00491  [pdf]

    cs.IR

    CDEMapper: Enhancing NIH Common Data Element Normalization using Large Language Models

    Authors: Yan Wang, Jimin Huang, Huan He, Vincent Zhang, Yujia Zhou, Xubing Hao, Pritham Ram, Lingfei Qian, Qianqian Xie, Ruey-Ling Weng, Fongci Lin, Yan Hu, Licong Cui, Xiaoqian Jiang, Hua Xu, Na Hong

    Abstract: Common Data Elements (CDEs) standardize data collection and sharing across studies, enhancing data interoperability and improving research reproducibility. However, implementing CDEs presents challenges due to the broad range and variety of data elements. This study aims to develop an effective and efficient mapping tool to bridge the gap between local data elements and National Institutes of Heal…

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: 11 pages, 4 figures

  12. arXiv:2411.16579  [pdf, other]

    cs.CL cs.AI cs.LG

    Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

    Authors: Zhiheng Xi, Dingwen Yang, Jixuan Huang, Jiafu Tang, Guanyu Li, Yiwen Ding, Wei He, Boyang Hong, Shihan Dou, Wenyu Zhan, Xiao Wang, Rui Zheng, Tao Ji, Xiaowei Shi, Yitao Zhai, Rongxiang Weng, Jingang Wang, Xunliang Cai, Tao Gui, Zuxuan Wu, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Yu-Gang Jiang

    Abstract: Training large language models (LLMs) to spend more time thinking and reflecting before responding is crucial for effectively solving complex reasoning tasks in fields such as science, coding, and mathematics. However, the effectiveness of mechanisms like self-reflection and self-correction depends on the model's capacity to accurately assess its own performance, which can be limited by factors su…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: Preprint

  13. arXiv:2411.10020  [pdf, other]

    cs.CL

    Information Extraction from Clinical Notes: Are We Ready to Switch to Large Language Models?

    Authors: Yan Hu, Xu Zuo, Yujia Zhou, Xueqing Peng, Jimin Huang, Vipina K. Keloth, Vincent J. Zhang, Ruey-Ling Weng, Qingyu Chen, Xiaoqian Jiang, Kirk E. Roberts, Hua Xu

    Abstract: Background: Information extraction (IE) is critical in clinical natural language processing (NLP). While large language models (LLMs) excel on generative tasks, their performance on extractive tasks remains debated. Methods: We investigated Named Entity Recognition (NER) and Relation Extraction (RE) using 1,588 clinical notes from four sources (UT Physicians, MTSamples, MIMIC-III, and i2b2). We d…

    Submitted 7 January, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

  14. arXiv:2410.23074  [pdf, other]

    cs.SE cs.CL

    Multi-Programming Language Sandbox for LLMs

    Authors: Shihan Dou, Jiazheng Zhang, Jianxiang Zang, Yunbo Tao, Weikang Zhou, Haoxiang Jia, Shichun Liu, Yuming Yang, Zhiheng Xi, Shenxi Wu, Shaoqing Zhang, Muling Wu, Changze Lv, Limao Xiong, Wenyu Zhan, Lin Zhang, Rongxiang Weng, Jingang Wang, Xunliang Cai, Yueming Wu, Ming Wen, Rui Zheng, Tao Ji, Yixin Cao, Tao Gui, et al. (3 additional authors not shown)

    Abstract: We introduce MPLSandbox, an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for Large Language Models (LLMs). It can automatically identify the programming language of the code, compiling and executing it within an isolated sub-sandbox to ensure safety and stability. In addition, MPLSandbox also integrates bo…

    Submitted 5 November, 2024; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: 25 pages, 14 figures

  15. arXiv:2409.06411  [pdf, other]

    cs.LG cs.CL

    Length Desensitization in Direct Preference Optimization

    Authors: Wei Liu, Yang Bai, Chengcheng Han, Rongxiang Weng, Jun Xu, Xuezhi Cao, Jingang Wang, Xunliang Cai

    Abstract: Direct Preference Optimization (DPO) is widely utilized in the Reinforcement Learning from Human Feedback (RLHF) phase to align Large Language Models (LLMs) with human preferences, thereby enhancing both their harmlessness and efficacy. However, it has been observed that DPO tends to over-optimize for verbosity, which can detrimentally affect both performance and user experience. In this paper, we…

    Submitted 27 November, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 21 pages, 9 figures

  16. arXiv:2408.11878  [pdf, ps, other]

    cs.CL cs.CE q-fin.CP

    Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

    Authors: Jimin Huang, Mengxi Xiao, Dong Li, Zihao Jiang, Yuzhe Yang, Yifei Zhang, Lingfei Qian, Yan Wang, Xueqing Peng, Yang Ren, Ruoyu Xiang, Zhengyu Chen, Xiao Zhang, Yueru He, Weiguang Han, Shunian Chen, Lihang Shen, Daniel Kim, Yangyang Yu, Yupeng Cao, Zhiyang Deng, Haohang Li, Duanyu Feng, Yongfu Dai, VijayaSai Somasundaram, et al. (19 additional authors not shown)

    Abstract: Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce Open-FinLLMs, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, t…

    Submitted 6 June, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: 33 pages, 13 figures

  17. arXiv:2407.06153  [pdf, other]

    cs.SE cs.CL

    What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

    Authors: Shihan Dou, Haoxiang Jia, Shenxi Wu, Huiyuan Zheng, Weikang Zhou, Muling Wu, Mingxu Chai, Jessica Fan, Caishuang Huang, Yunbo Tao, Yan Liu, Enyu Zhou, Ming Zhang, Yuhao Zhou, Yueming Wu, Rui Zheng, Ming Wen, Rongxiang Weng, Jingang Wang, Xunliang Cai, Tao Gui, Xipeng Qiu, Qi Zhang, Xuanjing Huang

    Abstract: The increasing development of large language models (LLMs) in code generation has drawn significant attention among researchers. To enhance LLM-based code generation ability, current efforts are predominantly directed towards collecting high-quality datasets and leveraging diverse training technologies. However, there is a notable lack of comprehensive studies examining the limitations and boundar…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 17 pages, 7 figures

  18. arXiv:2403.02942  [pdf, other]

    cs.IT eess.SP

    Channel Estimation for mmWave MIMO-OFDM Systems in High-Mobility Scenarios: Instantaneous Model or Statistical Model?

    Authors: Ruizhe Wang, Hong Ren, Cunhua Pan, Gui Zhou, Ruisong Weng, Jiangzhou Wang

    Abstract: Classical linear statistical models, like the first-order auto-regressive (AR) model, are commonly used as channel models in high-mobility scenarios. However, compared to sub-6G, the effect of Doppler frequency shifts is more significant at millimeter wave (mmWave) frequencies, and the effectiveness of the statistical channel model in high-mobility mmWave scenarios should be reconsidered. In this p…

    Submitted 27 August, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  19. arXiv:2402.05847  [pdf, other]

    eess.SP

    Reconfigurable Intelligent Surface-Aided Dual-Function Radar and Communication Systems With MU-MIMO Communication

    Authors: Yasheng Jin, Hong Ren, Cunhua Pan, Zhiyuan Yu, Ruisong Weng, Boshi Wang, Gui Zhou, Yongchao He, Maged Elkashlan

    Abstract: In this paper, we investigate a reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) system. Our objective is to maximize the achievable sum rate of the multi-antenna communication users through the joint active and passive beamforming. Specifically, the weighted minimum mean-square error (WMMSE) method is first used to reformulate the original problem i…

    Submitted 8 February, 2024; originally announced February 2024.

  20. arXiv:2402.04532  [pdf, other]

    eess.SP

    Joint Beamforming Design for Double Active RIS-assisted Radar-Communication Coexistence Systems

    Authors: Mengyu Liu, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Kangda Zhi, Yongchao He

    Abstract: Integrated sensing and communication (ISAC) technology has been considered as one of the key candidate technologies in the next-generation wireless communication systems. However, when radar and communication equipment coexist in the same system, i.e. radar-communication coexistence (RCC), the interference from communication systems to radar can be large and cannot be ignored. Recently, reconfigur…

    Submitted 6 February, 2024; originally announced February 2024.

  21. arXiv:2402.02122  [pdf, other]

    eess.SP

    Secure Wireless Communication in Active RIS-Assisted DFRC System

    Authors: Yang Zhang, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Tuo Wu, Yongchao He

    Abstract: This work considers a dual-functional radar and communication (DFRC) system with an active reconfigurable intelligent surface (RIS) and a potential eavesdropper. Our purpose is to maximize the secrecy rate (SR) of the system by jointly designing the beamforming matrix at the DFRC base station (BS) and the reflecting coefficients at the active RIS, subject to the signal-to-interference-plus-noise-r…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 13 pages, 9 figures

  22. arXiv:2310.10386  [pdf, other]

    stat.AP

    Rating of players by Laplace approximation and dynamic modeling

    Authors: Hsuan-Fu Hua, Ching-Ju Chang, Tse-Ching Lin, Ruby Chiu-Hsing Weng

    Abstract: The Elo rating system is a simple and widely used method for calculating players' skills from paired comparisons data. Many have extended it in various ways. Yet the question of updating players' variances remains to be further explored. In this paper, we address the issue of variance update by using the Laplace approximation for posterior distribution, together with a random walk model for the dy…

    Submitted 16 October, 2023; originally announced October 2023.

  23. arXiv:2309.07864  [pdf, other]

    cs.AI cs.CL

    The Rise and Potential of Large Language Model Based Agents: A Survey

    Authors: Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, et al. (4 additional authors not shown)

    Abstract: For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are artificial entities that sense their environment, make decisions, and take actions. Many efforts have been made to develop intelligent agents, but they mainly focus on advancement in algorithms or training stra…

    Submitted 19 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 86 pages, 12 figures

  24. arXiv:2307.04964  [pdf, other]

    cs.CL cs.AI cs.LG

    Secrets of RLHF in Large Language Models Part I: PPO

    Authors: Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, et al. (2 additional authors not shown)

    Abstract: Large language models (LLMs) have formulated a blueprint for the advancement of artificial general intelligence. Its primary objective is to function as a human-centric (helpful, honest, and harmless) assistant. Alignment with humans assumes paramount significance, and reinforcement learning with human feedback (RLHF) emerges as the pivotal technological paradigm underpinning this pursuit. Current…

    Submitted 18 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  25. arXiv:2303.10966  [pdf, other]

    cs.CL

    Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning

    Authors: Rongxiang Weng, Qiang Wang, Wensen Cheng, Changfeng Zhu, Min Zhang

    Abstract: Neural machine translation (NMT) has achieved remarkable success in producing high-quality translations. However, current NMT systems suffer from a lack of reliability, as their outputs are often affected by lexical or syntactic changes in inputs, resulting in large variations in quality. This limitation hinders the practicality and trustworthiness of NMT. A contributing factor to this proble…

    Submitted 19 September, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  26. arXiv:2209.08738  [pdf, other]

    cs.CL

    Learning Decoupled Retrieval Representation for Nearest Neighbour Neural Machine Translation

    Authors: Qiang Wang, Rongxiang Weng, Ming Chen

    Abstract: K-Nearest Neighbor Neural Machine Translation (kNN-MT) successfully incorporates external corpus by retrieving word-level representations at test time. Generally, kNN-MT borrows the off-the-shelf context representation in the translation task, e.g., the output of the last decoder layer, as the query vector of the retrieval task. In this work, we highlight that coupling the representations of these…

    Submitted 19 September, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: Accepted by COLING 2022

  27. arXiv:2209.01438  [pdf, other]

    eess.SP

    Active Reconfigurable Intelligent Surface for Mobile Edge Computing

    Authors: Zhangjie Peng, Ruisong Weng, Zhenkun Zhang, Cunhua Pan, Jiangzhou Wang

    Abstract: This paper investigates an active reconfigurable intelligent surface (RIS)-aided mobile edge computing (MEC) system. Compared with passive RIS, the active RIS is equipped with active reflective amplifier, which can effectively circumvent the "double path loss" attenuation. We propose a joint computing and communication design to minimize the maximum computational latency (MCL), subject to both the…

    Submitted 6 September, 2022; v1 submitted 3 September, 2022; originally announced September 2022.

    Comments: Accepted by IEEE Wireless Communications Letters. Keywords: Mobile edge computing (MEC), latency minimization, Internet of things, reconfigurable intelligent surface (RIS), active RIS

  28. arXiv:2205.15495  [pdf, other]

    cs.CV

    Joint Spatial-Temporal and Appearance Modeling with Transformer for Multiple Object Tracking

    Authors: Peng Dai, Yiqiang Feng, Renliang Weng, Changshui Zhang

    Abstract: The recent trend in multiple object tracking (MOT) is heading towards leveraging deep learning to boost the tracking performance. In this paper, we propose a novel solution named TransSTAM, which leverages Transformer to effectively model both the appearance features of each object and the spatial-temporal relationships among objects. TransSTAM consists of two major parts: (1) The encoder utilizes…

    Submitted 30 May, 2022; originally announced May 2022.

  29. arXiv:2205.02405  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci physics.optics

    Enhanced optoelectronic performance and photogating effect in quasi-one-dimensional BiSeI wires

    Authors: H. J. Hu, W. L. Zhen, S. R. Weng, Y. D. Li, R. Niu, Z. L. Yue, F. Xu, L. Pi, C. J. Zhang, W. K. Zhu

    Abstract: Quasi-one-dimensional (quasi-1D) materials are a newly arising topic in low-dimensional research. As a result of reduced dimensionality and enhanced anisotropy, the quasi-1D structure gives rise to novel properties and promising applications such as photodetectors. However, it remains an open question whether performance crossover will occur when the channel material is downsized. Here we report…

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: 23 pages, 4 figures and SI

    Journal ref: Appl. Phys. Lett. 120, 201101 (2022)

  30. arXiv:2204.06812  [pdf, other]

    cs.CL

    Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation

    Authors: Xiangpeng Wei, Heng Yu, Yue Hu, Rongxiang Weng, Weihua Luo, Jun Xie, Rong Jin

    Abstract: The principal task in supervised neural machine translation (NMT) is to learn to generate target sentences conditioned on the source inputs from a set of parallel sentence pairs, and thus produce a model capable of generalizing to unseen instances. However, it is commonly observed that the generalization performance of the model is highly influenced by the amount of parallel data used in training.…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by ACL 2022 (main conference)

  31. arXiv:2203.11471  [pdf, other]

    cs.CV

    Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization

    Authors: Yu Zhan, Fenghai Li, Renliang Weng, Wongun Choi

    Abstract: In this paper, we propose a novel monocular ray-based 3D (Ray3D) absolute human pose estimation with calibrated camera. Accurate and generalizable absolute 3D human pose estimation from monocular 2D pose input is an ill-posed problem. To address this challenge, we convert the input from pixel space to 3D normalized rays. This conversion makes our approach robust to camera intrinsic parameter chang…

    Submitted 27 October, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022

  32. arXiv:2203.04478  [pdf, other]

    cs.CV

    3SD: Self-Supervised Saliency Detection With No Labels

    Authors: Rajeev Yasarla, Renliang Weng, Wongun Choi, Vishal Patel, Amir Sadeghian

    Abstract: We present a conceptually simple self-supervised method for saliency detection. Our method generates and uses pseudo-ground truth labels for training. The generated pseudo-GT labels don't require any kind of human annotations (e.g., pixel-wise labels or weak labels like scribbles). Recent works show that features extracted from classification tasks provide important saliency cues like structure an…

    Submitted 8 March, 2022; originally announced March 2022.

  33. arXiv:2202.11860  [pdf, ps, other]

    cs.IT eess.SP

    Robust Transmission Design for RIS-assisted Secure Multiuser Communication Systems in the Presence of Hardware Impairments

    Authors: Zhangjie Peng, Ruisong Weng, Cunhua Pan, Gui Zhou, Marco Di Renzo, A. Lee Swindlehurst

    Abstract: This paper investigates reconfigurable intelligent surface (RIS)-assisted secure multiuser communication systems subject to hardware impairments (HIs). We jointly optimize the beamforming vectors at the base station (BS) and the phase shifts of the reflecting elements at the RIS so as to maximize the weighted minimum secrecy rate (WMSR), subject to both transmission power constraints at the BS and…

    Submitted 10 October, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: Revised version in IEEE TWC. Keywords: Reconfigurable intelligent surface (RIS), intelligent reflecting surface (IRS)

  34. IIP-Transformer: Intra-Inter-Part Transformer for Skeleton-Based Action Recognition

    Authors: Qingtian Wang, Jianlin Peng, Shuze Shi, Tingxi Liu, Jiabin He, Renliang Weng

    Abstract: Recently, Transformer-based networks have shown great promise on skeleton-based action recognition tasks. The ability to capture global and local dependencies is the key to success while it also brings quadratic computation and memory cost. Another problem is that previous studies mainly focus on the relationships among individual joints, which often suffers from the noisy skeleton joints introduc…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 10 pages, 7 figures

  35. arXiv:2103.07889  [pdf, other]

    cs.CV

    Learning a Proposal Classifier for Multiple Object Tracking

    Authors: Peng Dai, Renliang Weng, Wongun Choi, Changshui Zhang, Zhangping He, Wei Ding

    Abstract: The recent trend in multiple object tracking (MOT) is heading towards leveraging deep learning to boost the tracking performance. However, it is not trivial to solve the data-association problem in an end-to-end fashion. In this paper, we propose a novel proposal-based learnable framework, which models MOT as a proposal generation, proposal scoring and trajectory inference paradigm on an affinity…

    Submitted 25 March, 2021; v1 submitted 14 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021, Poster, IEEE/CVF Conference on Computer Vision and Pattern Recognition

  36. arXiv:2012.01915  [pdf, other]

    cs.AI cs.IR cs.LG cs.SI

    Origin-Aware Next Destination Recommendation with Personalized Preference Attention

    Authors: Nicholas Lim, Bryan Hooi, See-Kiong Ng, Xueou Wang, Yong Liang Goh, Renrong Weng, Rui Tan

    Abstract: Next destination recommendation is an important task in the transportation domain of taxi and ride-hailing services, where users are recommended with personalized destinations given their current origin location. However, recent recommendation works do not satisfy this origin-awareness property, and only consider learning from historical destination locations, without origin information. Thus, the…

    Submitted 11 January, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: To appear in the Proceedings of the 14th ACM International Conference on Web Search and Data Mining (WSDM), 2021

  37. arXiv:2010.07024  [pdf, other]

    cs.IR cs.LG cs.SI

    STP-UDGAT: Spatial-Temporal-Preference User Dimensional Graph Attention Network for Next POI Recommendation

    Authors: Nicholas Lim, Bryan Hooi, See-Kiong Ng, Xueou Wang, Yong Liang Goh, Renrong Weng, Jagannadan Varadarajan

    Abstract: Next Point-of-Interest (POI) recommendation is a longstanding problem across the domains of Location-Based Social Networks (LBSN) and transportation. Recent Recurrent Neural Network (RNN) based approaches learn POI-POI relationships in a local view based on independent user visit sequences. This limits the model's ability to directly connect and learn across users in a global view to recommend sem…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: To appear in Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM), 2020

  38. arXiv:2010.04411  [pdf, other]

    cs.CL

    Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

    Authors: Xiangpeng Wei, Heng Yu, Yue Hu, Rongxiang Weng, Luxi Xing, Weihua Luo

    Abstract: As a sequence-to-sequence generation task, neural machine translation (NMT) naturally contains intrinsic uncertainty, where a single sentence in one language has multiple valid counterparts in the other. However, the dominant methods for NMT only observe one of them from the parallel corpora for the model training but have to deal with adequate variations under the same meaning at inference. This…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: Accepted to EMNLP 2020, 12 pages, 2 figures, 9 tables

  39. arXiv:2007.15960  [pdf, other]

    cs.CL

    On Learning Universal Representations Across Languages

    Authors: Xiangpeng Wei, Rongxiang Weng, Yue Hu, Luxi Xing, Heng Yu, Weihua Luo

    Abstract: Recent studies have demonstrated the overwhelming advantage of cross-lingual pre-trained models (PTMs), such as multilingual BERT and XLM, on cross-lingual NLP tasks. However, existing approaches essentially capture the co-occurrence among tokens through involving the masked language model (MLM) objective with token-level cross entropy. In this work, we extend these approaches to learn sentence-le…

    Submitted 21 March, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

    Comments: Accepted to ICLR 2021

  40. arXiv:2004.14021  [pdf, other]

    cs.CL

    Multiscale Collaborative Deep Models for Neural Machine Translation

    Authors: Xiangpeng Wei, Heng Yu, Yue Hu, Yue Zhang, Rongxiang Weng, Weihua Luo

    Abstract: Recent evidence reveals that Neural Machine Translation (NMT) models with deeper neural networks can be more effective but are difficult to train. In this paper, we present a MultiScale Collaborative (MSC) framework to ease the training of NMT models that are substantially deeper than those used previously. We explicitly boost the gradient back-propagation from top to bottom levels by introducing…

    Submitted 10 May, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  41. arXiv:2004.02196  [pdf, other]

    cs.CL

    AR: Auto-Repair the Synthetic Data for Neural Machine Translation

    Authors: Shanbo Cheng, Shaohui Kuang, Rongxiang Weng, Heng Yu, Changfeng Zhu, Weihua Luo

    Abstract: Compared with only using limited authentic parallel data as training corpus, many studies have proved that incorporating synthetic parallel data, which generated by back translation (BT) or forward translation (FT, or selftraining), into the NMT training process can significantly improve translation quality. However, as a well-known shortcoming, synthetic parallel data is noisy because they are ge…

    Submitted 5 April, 2020; originally announced April 2020.

  42. arXiv:2002.10101  [pdf, other]

    cs.CL

    GRET: Global Representation Enhanced Transformer

    Authors: Rongxiang Weng, Haoran Wei, Shujian Huang, Heng Yu, Lidong Bing, Weihua Luo, Jiajun Chen

    Abstract: Transformer, based on the encoder-decoder framework, has achieved state-of-the-art performance on several natural language generation tasks. The encoder maps the words in the input sentence into a sequence of hidden states, which are then fed into the decoder to generate the output sentence. These hidden states usually correspond to the input words and focus on capturing local information. However…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted by AAAI 2020

  43. arXiv:1912.01774  [pdf, other]

    cs.CL

    Acquiring Knowledge from Pre-trained Model to Neural Machine Translation

    Authors: Rongxiang Weng, Heng Yu, Shujian Huang, Shanbo Cheng, Weihua Luo

    Abstract: Pre-training and fine-tuning have achieved great success in the natural language process field. The standard paradigm of exploiting them includes two steps: first, pre-training a model, e.g. BERT, with a large scale unlabeled monolingual data. Then, fine-tuning the pre-trained model with labeled data from downstream tasks. However, in neural machine translation (NMT), we address the problem that t…

    Submitted 3 December, 2019; originally announced December 2019.

  44. arXiv:1908.07688  [pdf, other]

    cs.CL

    Improving Neural Machine Translation with Pre-trained Representation

    Authors: Rongxiang Weng, Heng Yu, Shujian Huang, Weihua Luo, Jiajun Chen

    Abstract: Monolingual data has been demonstrated to be helpful in improving the translation quality of neural machine translation (NMT). The current methods stay at the usage of word-level knowledge, such as generating synthetic parallel data or extracting information from word embedding. In contrast, the power of sentence-level contextual knowledge which is more complex and diverse, playing an important ro…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: In Progress

  45. arXiv:1907.07328  [pdf, other]

    cs.CL

    Learning Representation Mapping for Relation Detection in Knowledge Base Question Answering

    Authors: Peng Wu, Shujian Huang, Rongxiang Weng, Zaixiang Zheng, Jianbing Zhang, Xiaohui Yan, Jiajun Chen

    Abstract: Relation detection is a core step in many natural language process applications including knowledge base question answering. Previous efforts show that single-fact questions could be answered with high accuracy. However, one critical problem is that current approaches only get high accuracy for questions whose relations have been seen in the training data. But for unseen relations, the performance…

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: 10 pages, 5 figures, accepted by ACL 2019

  46. arXiv:1907.03468  [pdf, other]

    cs.CL

    Correct-and-Memorize: Learning to Translate from Interactive Revisions

    Authors: Rongxiang Weng, Hao Zhou, Shujian Huang, Lei Li, Yifan Xia, Jiajun Chen

    Abstract: State-of-the-art machine translation models are still not on par with human translators. Previous work takes human interactions into the neural machine translation process to obtain improved results in target languages. However, not all model-translation errors are equal -- some are critical while others are minor. In the meanwhile, the same translation mistakes occur repeatedly in a similar conte…

    Submitted 13 August, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: Accepted at IJCAI 2019

  47. arXiv:1810.10317  [pdf, other]

    cs.CL

    Learning to Discriminate Noises for Incorporating External Information in Neural Machine Translation

    Authors: Zaixiang Zheng, Shujian Huang, Zewei Sun, Rongxiang Weng, Xin-Yu Dai, Jiajun Chen

    Abstract: Previous studies show that incorporating external information could improve the translation quality of Neural Machine Translation (NMT) systems. However, there are inevitably noises in the external information, severely reducing the benefit that the existing methods could receive from the incorporation. To tackle the problem, this study pays special attention to the discrimination of the noises du…

    Submitted 19 November, 2018; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: 8 pages

  48. arXiv:1809.01290  [pdf]

    cond-mat.mtrl-sci

    Origin of planar Hall effect in type-II Weyl semimetal MoTe2

    Authors: D. D. Liang, Y. J. Wang, W. L. Zhen, J. Yang, S. R. Weng, X. Yan, Y. Y. Han, W. Tong, L. Pi, W. K. Zhu, C. J. Zhang

    Abstract: Besides the negative longitudinal magnetoresistance (MR), planar Hall effect (PHE) is a newly emerging experimental tool to test the chiral anomaly or nontrivial Berry curvature in Weyl semimetals (WSMs). However, the origins of PHE in various systems are not fully distinguished and understood. Here we perform a systematic study on the PHE and anisotropic MR (AMR) of Td-MoTe2, a type-II WSM. Altho…

    Submitted 4 September, 2018; originally announced September 2018.

    Comments: 14 pages, 4 figures

    Journal ref: AIP Advances 9, 055015 (2019)

  49. Current jetting distorted planar Hall effect in a Weyl semimetal with ultrahigh mobility

    Authors: J. Yang, W. L. Zhen, D. D. Liang, Y. J. Wang, X. Yan, S. R. Weng, J. R. Wang, W. Tong, L. Pi, W. K. Zhu, C. J. Zhang

    Abstract: A giant planar Hall effect (PHE) and anisotropic magnetoresistance (AMR) is observed in TaP, a nonmagnetic Weyl semimetal with ultrahigh mobility. The perpendicular resistivity (i.e., the planar magnetic field applied normal to the current) far exceeds the zero-field resistivity, which thus rules out the possible origin of negative longitudinal magnetoresistance. The giant PHE/AMR is finally attri…

    Submitted 14 January, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: Physical Review Materials

    Journal ref: Phys. Rev. Materials 3, 014201 (2019)

  50. Non-stoichiometry effects on the extreme magnetoresistance in Weyl semimetal WTe2

    Authors: J. X. Gong, J. Yang, M. Ge, Y. J. Wang, D. D. Liang, L. Luo, X. Yan, W. L. Zhen, S. R. Weng, L. Pi, C. J. Zhang, W. K. Zhu

    Abstract: Non-stoichiometry effect on the extreme magnetoresistance is systematically investigated for the Weyl semimetal WTe2. Magnetoresistance and Hall resistivity are measured for the as-grown samples with a slight difference in Te vacancies and the annealed samples with increased Te vacancies. The fittings to a two-carrier model show that the magnetoresistance is strongly dependent on the residual resi…

    Submitted 29 April, 2018; v1 submitted 29 December, 2017; originally announced December 2017.

    Journal ref: Chin. Phys. Lett. 35, 097101 (2018)