Showing 1–13 of 13 results for author: You, R

  1. arXiv:2506.06757  [pdf, ps, other]

    cs.CV

    SAR2Struct: Extracting 3D Semantic Structural Representation of Aircraft Targets from Single-View SAR Image

    Authors: Ziyu Yue, Ruixi You, Feng Xu

    Abstract: To translate synthetic aperture radar (SAR) image into interpretable forms for human understanding is the ultimate goal of SAR advanced information retrieval. Existing methods mainly focus on 3D surface reconstruction or local geometric feature extraction of targets, neglecting the role of structural modeling in capturing semantic information. This paper proposes a novel task: SAR target structure…

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: 13 pages, 12 figures

  2. arXiv:2505.16994  [pdf, other]

    cs.IR cs.AI cs.CL

    $\text{R}^2\text{ec}$: Towards Large Recommender Models with Reasoning

    Authors: Runyang You, Yongqi Li, Xinyu Lin, Xin Zhang, Wenjie Wang, Wenjie Li, Liqiang Nie

    Abstract: Large recommender models have extended LLMs as powerful recommenders via encoding or item generation, and recent breakthroughs in LLM reasoning synchronously motivate the exploration of reasoning in recommendation. Current studies usually position LLMs as external reasoning modules to yield auxiliary thought for augmenting conventional recommendation pipelines. However, such decoupled designs are…

    Submitted 22 May, 2025; originally announced May 2025.

  3. arXiv:2503.19798  [pdf, other]

    cs.CV eess.IV

    Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models

    Authors: Ruixi You, Hecheng Jia, Feng Xu

    Abstract: Synthetic Aperture Radar (SAR) imagery provides all-weather, all-day, and high-resolution imaging capabilities but its unique imaging mechanism makes interpretation heavily reliant on expert knowledge, limiting interpretability, especially in complex target tasks. Translating SAR images into optical images is a promising solution to enhance interpretation and support downstream tasks. Most existin…

    Submitted 25 March, 2025; originally announced March 2025.

  4. arXiv:2503.16123  [pdf, other]

    math.OC cs.LG

    Distributed Learning over Arbitrary Topology: Linear Speed-Up with Polynomial Transient Time

    Authors: Runze You, Shi Pu

    Abstract: We study a distributed learning problem in which $n$ agents, each with potentially heterogeneous local data, collaboratively minimize the sum of their local cost functions via peer-to-peer communication. We propose a novel algorithm, Spanning Tree Push-Pull (STPP), which employs two spanning trees extracted from a general communication graph to distribute both model parameters and stochastic gradi…

    Submitted 20 March, 2025; originally announced March 2025.

  5. arXiv:2503.14189  [pdf, other]

    cs.CL cs.CV

    Towards Harmless Multimodal Assistants with Blind Preference Optimization

    Authors: Yongqi Li, Lu Yang, Jian Wang, Runyang You, Wenjie Li, Liqiang Nie

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated impressive capabilities in multimodal understanding, reasoning, and interaction. Given the extensive applications of MLLMs, the associated safety issues have become increasingly critical. Due to the effectiveness of preference optimization in aligning MLLMs with human preferences, there is an urgent need for safety-related preference data…

    Submitted 18 March, 2025; originally announced March 2025.

  6. arXiv:2410.00053  [pdf, other]

    cs.LG

    Frequency-adaptive Multi-scale Deep Neural Networks

    Authors: Jizu Huang, Rukang You, Tao Zhou

    Abstract: Multi-scale deep neural networks (MscaleDNNs) with downing-scaling mapping have demonstrated superiority over traditional DNNs in approximating target functions characterized by high frequency features. However, the performance of MscaleDNNs heavily depends on the parameters in the downing-scaling mapping, which limits their broader application. In this work, we establish a fitting error bound to…

    Submitted 28 September, 2024; originally announced October 2024.

  7. arXiv:2408.04963  [pdf, other]

    cs.LG

    LiD-FL: Towards List-Decodable Federated Learning

    Authors: Hong Liu, Liren Shan, Han Bao, Ronghui You, Yuhao Yi, Jiancheng Lv

    Abstract: Federated learning is often used in environments with many unverified participants. Therefore, federated learning under adversarial attacks receives significant attention. This paper proposes an algorithmic framework for list-decodable federated learning, where a central server maintains a list of models, with at least one guaranteed to perform well. The framework has no strict restriction on the…

    Submitted 26 February, 2025; v1 submitted 9 August, 2024; originally announced August 2024.

    Comments: 26 pages, 5 figures

  8. Near-Optimal Resilient Aggregation Rules for Distributed Learning Using 1-Center and 1-Mean Clustering with Outliers

    Authors: Yuhao Yi, Ronghui You, Hong Liu, Changxin Liu, Yuan Wang, Jiancheng Lv

    Abstract: Byzantine machine learning has garnered considerable attention in light of the unpredictable faults that can occur in large-scale distributed learning systems. The key to secure resilience against Byzantine machines in distributed learning is resilient aggregation mechanisms. Although abundant resilient aggregation rules have been proposed, they are designed in ad-hoc manners, imposing extra barri…

    Submitted 31 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 17 pages, 4 figures. Accepted by the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'24)

    Journal ref: AAAI 2024, 38, 16469-16477

  9. arXiv:2304.07511  [pdf, other]

    cs.HC

    Pilgrimage to Pureland: Art, Perception and the Wutai Mural VR Reconstruction

    Authors: Rongxuan Mu, Yuhe Nie, Kent Cao, Ruoxin You, Yinzong Wei, Xin Tong

    Abstract: Virtual reality (VR) supports audiences to engage with cultural heritage proactively. We designed an easy-to-access and guided Pilgrimage To Pureland VR reconstruction of Dunhuang Mogao Grottoes to offer the general public an accessible and engaging way to explore the Dunhuang murals. We put forward an immersive VR reconstruction paradigm that can efficiently convert complex 2D artwork into a VR e…

    Submitted 15 April, 2023; originally announced April 2023.

  10. arXiv:2210.16819  [pdf, other]

    cs.HC

    Relative Attention-based One-Class Adversarial Autoencoder for Continuous Authentication of Smartphone Users

    Authors: Mingming Hu, Kun Zhang, Ruibang You, Bibo Tu

    Abstract: Behavioral biometrics-based continuous authentication is a promising authentication scheme, which uses behavioral biometrics recorded by built-in sensors to authenticate smartphone users throughout the session. However, current continuous authentication methods suffer some limitations: 1) behavioral biometrics from impostors are needed to train continuous authentication models. Since the distribut…

    Submitted 1 November, 2022; v1 submitted 30 October, 2022; originally announced October 2022.

  11. arXiv:1912.07872  [pdf, other]

    cs.CV

    Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

    Authors: Renchun You, Zhiyao Guo, Lei Cui, Xiang Long, Yingze Bao, Shilei Wen

    Abstract: Multi-label image and video classification are fundamental yet challenging tasks in computer vision. The main challenges lie in capturing spatial or temporal dependencies between labels and discovering the locations of discriminative features for each class. In order to overcome these challenges, we propose to use cross-modality attention with semantic graph embedding for multi label classificatio…

    Submitted 27 March, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: Accepted by AAAI2020

  12. arXiv:1904.12578  [pdf, ps, other]

    cs.IR cs.LG stat.ML

    HAXMLNet: Hierarchical Attention Network for Extreme Multi-Label Text Classification

    Authors: Ronghui You, Zihan Zhang, Suyang Dai, Shanfeng Zhu

    Abstract: Extreme multi-label text classification (XMTC) addresses the problem of tagging each text with the most relevant labels from an extreme-scale label set. Traditional methods use bag-of-words (BOW) representations without context information as their features. The state-of-the-art deep learning-based method, AttentionXML, which uses a recurrent neural network (RNN) and the multi-label attention, can…

    Submitted 24 March, 2019; originally announced April 2019.

  13. arXiv:1811.01727  [pdf, other]

    cs.CL cs.LG

    AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification

    Authors: Ronghui You, Zihan Zhang, Ziye Wang, Suyang Dai, Hiroshi Mamitsuka, Shanfeng Zhu

    Abstract: Extreme multi-label text classification (XMTC) is an important problem in the era of big data, for tagging a given text with the most relevant multiple labels from an extremely large-scale label set. XMTC can be found in many applications, such as item categorization, web page tagging, and news annotation. Traditionally most methods used bag-of-words (BOW) as inputs, ignoring word context as well…

    Submitted 4 November, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Accepted by NeurIPS 2019