Showing 1–23 of 23 results for author: Leng, B

  1. arXiv:2505.19939

    cs.RO

    Uncertainty-Aware Safety-Critical Decision and Control for Autonomous Vehicles at Unsignalized Intersections

    Authors: Ran Yu, Zhuoren Li, Lu Xiong, Wei Han, Bo Leng

    Abstract: Reinforcement learning (RL) has demonstrated potential in autonomous driving (AD) decision tasks. However, applying RL to urban AD, particularly in intersection scenarios, still faces significant challenges. The lack of safety constraints makes RL vulnerable to risks. Additionally, cognitive limitations and environmental randomness can lead to unreliable decisions in safety-critical scenarios. The…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 8 pages, 6 figures

  2. arXiv:2505.15793

    cs.RO cs.LG

    HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving

    Authors: Zhiwen Chen, Bo Leng, Zhuoren Li, Hanming Deng, Guizhe Jin, Ran Yu, Huanxi Wen

    Abstract: Integrating Large Language Models (LLMs) with Reinforcement Learning (RL) can enhance autonomous driving (AD) performance in complex scenarios. However, current LLM-Dominated RL methods over-rely on LLM outputs, which are prone to hallucinations. Evaluations show that state-of-the-art LLM indicates a non-hallucination rate of only approximately 57.95% when assessed on essential driving-related tas…

    Submitted 22 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2504.02440

    cs.CV

    HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning

    Authors: Hao Wang, Shuo Zhang, Biao Leng

    Abstract: The computer vision community has witnessed an extensive exploration of vision transformers in the past two years. Drawing inspiration from traditional schemes, numerous works focus on introducing vision-specific inductive biases. However, the implicit modeling of permutation invariance and fully-connected interaction with individual tokens disrupts the regional context and spatial topology, furth…

    Submitted 8 April, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

  4. arXiv:2503.23650

    cs.LG cs.RO

    A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective

    Authors: Zhuoren Li, Guizhe Jin, Ran Yu, Zhiwen Chen, Nan Li, Wei Han, Lu Xiong, Bo Leng, Jia Hu, Ilya Kolmanovsky, Dimitar Filev

    Abstract: Reinforcement learning (RL), with its ability to explore and optimize policies in complex, dynamic decision-making tasks, has emerged as a promising approach to addressing motion planning (MoP) challenges in autonomous driving (AD). Despite rapid advancements in RL and AD, a systematic description and interpretation of the RL design process tailored to diverse driving tasks remains underdeveloped.…

    Submitted 30 March, 2025; originally announced March 2025.

    Comments: 21 pages, 5 figures

  5. arXiv:2503.19690

    cs.RO

    Risk-Aware Reinforcement Learning for Autonomous Driving: Improving Safety When Driving through Intersection

    Authors: Bo Leng, Ran Yu, Wei Han, Lu Xiong, Zhuoren Li, Hailong Huang

    Abstract: Applying reinforcement learning to autonomous driving has garnered widespread attention. However, classical reinforcement learning methods optimize policies by maximizing expected rewards but lack sufficient safety considerations, often putting agents in hazardous situations. This paper proposes a risk-aware reinforcement learning approach for autonomous driving to improve the safety performance w…

    Submitted 27 March, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

    Comments: 11 pages, 10 figures

  6. arXiv:2501.08096

    cs.RO cs.AI cs.ET cs.LG

    Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving

    Authors: Guizhe Jin, Zhuoren Li, Bo Leng, Wei Han, Lu Xiong, Chen Sun

    Abstract: Reinforcement Learning (RL) has shown excellent performance in solving decision-making and control problems of autonomous driving, which is increasingly applied in diverse driving scenarios. However, driving is a multi-attribute problem, leading to challenges in achieving multi-objective compatibility for current RL methods, especially in both policy execution and policy iteration. On the one hand…

    Submitted 28 March, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: 12 pages, 9 figures, 5 tables

  7. arXiv:2412.08949

    cs.CV

    Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation

    Authors: Xinyue Liu, Jianyuan Wang, Biao Leng, Shuo Zhang

    Abstract: Knowledge distillation (KD) has been widely studied in unsupervised Industrial Image Anomaly Detection (AD), but its application to unsupervised multimodal AD remains underexplored. Existing KD-based methods for multimodal AD that use fused multimodal features to obtain teacher representations face challenges. Anomalies in one modality may not be effectively captured in the fused teacher features,…

    Submitted 19 March, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

  8. arXiv:2412.07579

    cs.CV

    Unlocking the Potential of Reverse Distillation for Anomaly Detection

    Authors: Xinyue Liu, Jianyuan Wang, Biao Leng, Shuo Zhang

    Abstract: Knowledge Distillation (KD) is a promising approach for unsupervised Anomaly Detection (AD). However, the student network's over-generalization often diminishes the crucial representation differences between teacher and student in anomalous regions, leading to detection failures. To address this problem, the widely accepted Reverse Distillation (RD) paradigm designs the asymmetry teacher and stu…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 18 pages, 14 figures, AAAI 2025

  9. Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection

    Authors: Xinyue Liu, Jianyuan Wang, Biao Leng, Shuo Zhang

    Abstract: Knowledge distillation based on student-teacher network is one of the mainstream solution paradigms for the challenging unsupervised Anomaly Detection task, utilizing the difference in representation capabilities of the teacher and student networks to implement anomaly localization. However, over-generalization of the student network to the teacher network may lead to negligible differences in rep…

    Submitted 15 October, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: 10 pages, 8 figures, Accepted to ACM MM '24

  10. arXiv:2208.08630

    cs.CV

    Unifying Visual Perception by Dispersible Points Learning

    Authors: Jianming Liang, Guanglu Song, Biao Leng, Yu Liu

    Abstract: We present a conceptually simple, flexible, and universal visual perception head for variant visual tasks, e.g., classification, object detection, instance segmentation and pose estimation, and different frameworks, such as one-stage or two-stage pipelines. Our approach effectively identifies an object in an image while simultaneously generating a high-quality bounding box or contour-based segment…

    Submitted 12 September, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted by ECCV 2022. Code: https://github.com/Sense-X/UniHead

  11. arXiv:2111.12624

    cs.CV

    Self-slimmed Vision Transformer

    Authors: Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu

    Abstract: Vision transformers (ViTs) have become the popular structures and outperformed convolutional neural networks (CNNs) on various vision tasks. However, such powerful transformers bring a huge computation burden, because of the exhausting token-to-token comparison. The previous works focus on dropping insignificant tokens to reduce the computational cost of ViTs. But when the dropping ratio increases…

    Submitted 12 September, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted by ECCV 2022. Code is available at https://github.com/Sense-X/SiT

  12. RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection

    Authors: Zhuofan Zong, Qianggang Cao, Biao Leng

    Abstract: Feature pyramid networks (FPN) are widely exploited for multi-scale feature fusion in existing advanced object detection frameworks. Numerous previous works have developed various structures for bidirectional feature fusion, all of which are shown to improve the detection performance effectively. We observe that these complicated network structures require feature pyramids to be stacked in a fixed…

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: Accepted by ACM MM2021

  13. arXiv:2104.04606

    cs.CV

    RaidaR: A Rich Annotated Image Dataset of Rainy Street Scenes

    Authors: Jiongchao Jin, Arezou Fatemi, Wallace Lira, Fenggen Yu, Biao Leng, Rui Ma, Ali Mahdavi-Amiri, Hao Zhang

    Abstract: We introduce RaidaR, a rich annotated image dataset of rainy street scenes, to support autonomous driving research. The new dataset contains the largest number of rainy images (58,542) to date, 5,000 of which provide semantic segmentations and 3,658 provide object instance segmentations. The RaidaR images cover a wide range of realistic rain-induced artifacts, including fog, droplets, and road ref…

    Submitted 26 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: Presented in Second ICCV Workshop on Autonomous Vehicle Vision (AVVision), 2021. Website: https://raidar-dataset.com/

  14. arXiv:2005.11617

    cs.GR cs.CG

    MeshODE: A Robust and Scalable Framework for Mesh Deformation

    Authors: Jingwei Huang, Chiyu Max Jiang, Baiqiang Leng, Bin Wang, Leonidas Guibas

    Abstract: We present MeshODE, a scalable and robust framework for pairwise CAD model deformation without prespecified correspondences. Given a pair of shapes, our framework provides a novel shape feature-preserving mapping function that continuously deforms one model to the other by minimizing fitting and rigidity losses based on the non-rigid iterative-closest-point (ICP) algorithm. We address two challeng…

    Submitted 23 May, 2020; originally announced May 2020.

  15. arXiv:2003.07543

    cs.CV

    KPNet: Towards Minimal Face Detector

    Authors: Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng Yuan

    Abstract: The small receptive field and capacity of minimal neural networks limit their performance when using them to be the backbone of detectors. In this work, we find that the appearance feature of a generic face is discriminative enough for a tiny and shallow neural network to verify from the background. And the essential barriers behind us are 1) the vague definition of the face bounding box and 2) tr…

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: AAAI 2020

  16. arXiv:1906.00546

    cs.CV

    Rethinking Loss Design for Large-scale 3D Shape Retrieval

    Authors: Zhaoqun Li, Cheng Xu, Biao Leng

    Abstract: Learning discriminative shape representations is a crucial issue for large-scale 3D shape retrieval. In this paper, we propose the Collaborative Inner Product Loss (CIP Loss) to obtain ideal shape embedding that is discriminative among different categories and clustered within the same class. Utilizing simple inner product operation, CIP loss explicitly enforces the features of the same class to be c…

    Submitted 2 June, 2019; originally announced June 2019.

    Comments: Accepted by IJCAI2019

  17. arXiv:1811.08622

    cs.CV

    Angular Triplet-Center Loss for Multi-view 3D Shape Retrieval

    Authors: Zhaoqun Li, Cheng Xu, Biao Leng

    Abstract: How to obtain the desirable representation of a 3D shape, which is discriminative across categories and polymerized within classes, is a significant challenge in 3D shape retrieval. Most existing 3D shape retrieval methods focus on capturing strong discriminative shape representation with softmax loss for the classification task, while the shape feature learning with metric loss is neglected for 3…

    Submitted 22 January, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: Accepted by AAAI 2019

  18. arXiv:1808.03823

    cs.CV cs.GR

    Learning Discriminative 3D Shape Representations by View Discerning Networks

    Authors: Biao Leng, Cheng Zhang, Xiaocheng Zhou, Cheng Xu, Kai Xu

    Abstract: In view-based 3D shape recognition, extracting discriminative visual representation of 3D shapes from projected images is considered the core problem. Projections with low discriminative ability can adversely influence the final 3D shape representation. Especially under the real situations with background clutter and object occlusion, the adverse effect is even more severe. To resolve this problem…

    Submitted 20 August, 2018; v1 submitted 11 August, 2018; originally announced August 2018.

    Comments: Accepted by IEEE Transactions on Visualization and Computer Graphics. Corresponding Author: Kai Xu

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, 2018

  19. arXiv:1804.05197

    cs.CV

    Beyond Trade-off: Accelerate FCN-based Face Detector with Higher Accuracy

    Authors: Guanglu Song, Yu Liu, Ming Jiang, Yujie Wang, Junjie Yan, Biao Leng

    Abstract: Fully convolutional neural network (FCN) has been dominating the game of face detection task for a few years with its congenital capability of sliding-window-searching with shared kernels, which boiled down all the redundant calculation, and most recent state-of-the-art methods such as Faster-RCNN, SSD, YOLO and FPN use FCN as their backbone. So here comes one question: Can we find a universal str…

    Submitted 2 June, 2018; v1 submitted 14 April, 2018; originally announced April 2018.

    Comments: Accepted by CVPR2018

  20. arXiv:1711.08766

    cs.CV

    Region-based Quality Estimation Network for Large-scale Person Re-identification

    Authors: Guanglu Song, Biao Leng, Yu Liu, Congrui Hetang, Shaofan Cai

    Abstract: One of the major restrictions on the performance of video-based person re-id is partial noise caused by occlusion, blur and illumination. Since different spatial regions of a single frame have various quality, and the quality of the same region also varies across frames in a tracklet, a good way to address the problem is to effectively aggregate complementary information from all frames in a seque…

    Submitted 21 December, 2017; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: Accepted by AAAI 2018

  21. arXiv:1611.05603

    cs.CV

    Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization

    Authors: Kai Yu, Biao Leng, Zhang Zhang, Dangwei Li, Kaiqi Huang

    Abstract: State-of-the-art methods treat pedestrian attribute recognition as a multi-label image classification problem. The location information of person attributes is usually eliminated or simply encoded in the rigid splitting of whole body in previous work. In this paper, we formulate the task in a weakly-supervised attribute localization framework. Based on GoogLeNet, firstly, a set of mid-level attrib…

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: Containing 9 pages and 5 figures. Codes open-sourced on https://github.com/kyu-sz/WPAL-network

  22. arXiv:1606.01208

    physics.soc-ph cs.CY physics.data-an

    Gravitational scaling in Beijing Subway Network

    Authors: Biao Leng, Yali Cui, Jianyuan Wang, Zhang Xiong, Shlomo Havlin, Daqing Li

    Abstract: Recently, with the availability of various traffic datasets, human mobility has been studied in different contexts. Researchers attempt to understand the collective behaviors of human movement with respect to the spatio-temporal distribution in traffic dynamics, from which a gravitational scaling law characterizing the relation between the traffic flow, population and distance has been found. Howe…

    Submitted 3 June, 2016; originally announced June 2016.

  23. arXiv:1511.05618

    cs.SI

    Topic Model Based Behaviour Modeling and Clustering Analysis for Wireless Network Users

    Authors: Bingjie Leng, Jingchu Liu, Huimin Pan, Sheng Zhou, Zhisheng Niu

    Abstract: User behaviour analysis based on traffic log in wireless networks can be beneficial to many fields in real life: not only for commercial purposes, but also for improving network service quality and social management. We cluster users into groups marked by the most frequently visited websites to find their preferences. In this paper, we propose a user behaviour model based on Topic Model from docum…

    Submitted 17 November, 2015; originally announced November 2015.

    Comments: 6 pages, 6 figures. APCC'15