Showing 1–25 of 25 results for author: Xing, D

Searching in archive cs.
  1. arXiv:2505.11100  [pdf, other]

    cs.LG cs.AI

    Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors

    Authors: Lang Feng, Jiahao Lin, Dong Xing, Li Zhang, De Ma, Gang Pan

    Abstract: Population-population generalization is a challenging problem in multi-agent reinforcement learning (MARL), particularly when agents encounter unseen co-players. However, existing self-play-based methods are constrained by the limitation of inside-space generalization. In this study, we propose Bidirectional Distillation (BiDist), a novel mixed-play framework, to overcome this limitation in MARL.…

    Submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2505.01950  [pdf, other]

    cs.CV cs.AI

    Segment Any RGB-Thermal Model with Language-aided Distillation

    Authors: Dong Xing, Xianxun Zhu, Wei Zhou, Qika Lin, Hang Yang, Yuqing Wang

    Abstract: The recent Segment Anything Model (SAM) demonstrates strong instance segmentation performance across various downstream tasks. However, SAM is trained solely on RGB data, limiting its direct applicability to RGB-thermal (RGB-T) semantic segmentation. Given that RGB-T provides a robust solution for scene understanding in adverse weather and lighting conditions, such as low light and overexposure, w…

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: arXiv admin note: text overlap with arXiv:2412.04220 by other authors

  3. arXiv:2504.08181  [pdf, ps, other]

    cs.CV cs.AI

    TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation

    Authors: Ruineng Li, Daitao Xing, Huiming Sun, Yuanzhou Ha, Jinglin Shen, Chiuman Ho

    Abstract: Human-centric motion control in video generation remains a critical challenge, particularly when jointly controlling camera movements and human poses in scenarios like the iconic Grammy Glambot moment. While recent video diffusion models have made significant progress, existing approaches struggle with limited motion representations and inadequate integration of camera and human motion controls. I…

    Submitted 10 April, 2025; originally announced April 2025.

  4. arXiv:2504.03041  [pdf, other]

    cs.CV

    VIP: Video Inpainting Pipeline for Real World Human Removal

    Authors: Huiming Sun, Yikang Li, Kangning Yang, Ruineng Li, Daitao Xing, Yangbo Xie, Lan Fu, Kaiyu Zhang, Ming Chen, Jiaming Ding, Jiang Geng, Jie Cai, Zibo Meng, Chiuman Ho

    Abstract: Inpainting for real-world human and pedestrian removal in high-resolution video clips presents significant challenges, particularly in achieving high-quality outcomes, ensuring temporal consistency, and managing complex object interactions that involve humans, their belongings, and their shadows. In this paper, we introduce VIP (Video Inpainting Pipeline), a novel promptless video inpainting frame…

    Submitted 3 April, 2025; originally announced April 2025.

  5. arXiv:2405.12615  [pdf, other]

    cs.LG

    Learning Causal Dynamics Models in Object-Oriented Environments

    Authors: Zhongwei Yu, Jingqing Ruan, Dengpeng Xing

    Abstract: Causal dynamics models (CDMs) have demonstrated significant potential in addressing various challenges in reinforcement learning. To learn CDMs, recent studies have performed causal discovery to capture the causal dependencies among environmental variables. However, the learning of CDMs is still confined to small-scale environments due to computational complexity and sample efficiency constraints.…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024. 42 Pages

  6. arXiv:2403.15800  [pdf, other]

    cs.CL

    MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training

    Authors: Xiaojing Du, Hanjie Zhao, Danyan Xing, Yuxiang Jia, Hongying Zan

    Abstract: In medical information extraction, medical Named Entity Recognition (NER) is indispensable, playing a crucial role in developing medical knowledge graphs, enhancing medical question-answering systems, and analyzing electronic medical records. The challenge in medical NER arises from the complex nested structures and sophisticated medical terminologies, distinguishing it from its counterparts in tr…

    Submitted 23 March, 2024; originally announced March 2024.

  7. Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision Making

    Authors: Jingqing Ruan, Kaishen Wang, Qingyang Zhang, Dengpeng Xing, Bo Xu

    Abstract: Many complicated real-world tasks can be broken down into smaller, more manageable parts, and planning with prior knowledge extracted from these simplified pieces is crucial for humans to make accurate decisions. However, replicating this process remains a challenge for AI agents and naturally raises two questions: How to extract discriminative knowledge representation from priors? How to develop…

    Submitted 20 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by Machine Intelligence Research

  8. arXiv:2310.05053  [pdf, other]

    cs.LG cs.AI cs.MA

    FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility

    Authors: Lang Feng, Dong Xing, Junru Zhang, Gang Pan

    Abstract: Existing multi-agent PPO algorithms lack compatibility with different types of parameter sharing when extending the theoretical guarantee of PPO to cooperative multi-agent reinforcement learning (MARL). In this paper, we propose a novel and versatile multi-agent PPO algorithm for cooperative MARL to overcome this limitation. Our approach is achieved upon the proposed full-pipeline paradigm, which…

    Submitted 8 October, 2023; originally announced October 2023.

  9. arXiv:2307.12063  [pdf, other]

    cs.LG

    Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs

    Authors: Qingyang Zhang, Yiming Yang, Jingqing Ruan, Xuantang Xiong, Dengpeng Xing, Bo Xu

    Abstract: Goal-Conditioned Hierarchical Reinforcement Learning (GCHRL) is a promising paradigm to address the exploration-exploitation dilemma in reinforcement learning. It decomposes the source task into subgoal conditional subtasks and conducts exploration and exploitation in the subgoal space. The effectiveness of GCHRL heavily relies on subgoal representation functions and subgoal selection strategy. Ho…

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: Accepted by the International Joint Conference on Neural Networks (IJCNN) 2023

  10. arXiv:2306.10944  [pdf, other]

    cs.MA

    Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification

    Authors: Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan

    Abstract: Ad hoc teamwork requires an agent to cooperate with unknown teammates without prior coordination. Many works propose to abstract teammate instances into high-level representation of types and then pre-train the best response for each type. However, most of them do not consider the distribution of teammate instances within a type. This could expose the agent to the hidden risk of \emph{type confoun…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2023

  11. arXiv:2305.02749  [pdf, other]

    cs.LG

    Explainable Reinforcement Learning via a Causal World Model

    Authors: Zhongwei Yu, Jingqing Ruan, Dengpeng Xing

    Abstract: Generating explanations for reinforcement learning (RL) is challenging as actions may produce long-term effects on the future. In this paper, we develop a novel framework for explainable RL by learning a causal world model without prior knowledge of the causal structure of the environment. The model captures the influence of actions, allowing us to interpret the long-term effects of actions throug…

    Submitted 18 January, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted by IJCAI 2023

  12. arXiv:2302.06872  [pdf, other]

    cs.RO cs.AI cs.MA

    Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning

    Authors: Shanqi Liu, Yujing Hu, Runze Wu, Dong Xing, Yu Xiong, Changjie Fan, Kun Kuang, Yong Liu

    Abstract: Real-world cooperation often requires intensive coordination among agents simultaneously. This task has been extensively studied within the framework of cooperative multi-agent reinforcement learning (MARL), and value decomposition methods are among those cutting-edge solutions. However, traditional methods that learn the value function as a monotonic mixing of per-agent utilities cannot solve the…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: This paper is accepted by AAMAS 2023

  13. arXiv:2212.05729  [pdf, other]

    cs.CV

    ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation

    Authors: Daitao Xing, Jinglin Shen, Chiuman Ho, Anthony Tzes

    Abstract: The exploration of mutual-benefit cross-domains has shown great potential toward accurate self-supervised depth estimation. In this work, we revisit feature fusion between depth and semantic information and propose an efficient local adaptive attention method for geometric aware representation enhancement. Instead of building global connections or deforming attention across the feature space witho…

    Submitted 6 March, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: Camera Ready for AAAI 2023

  14. arXiv:2211.13508  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

    Authors: Benjamin Kiefer, Matej Kristan, Janez Perš, Lojze Žust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Höfer, Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-ik Jeon, Impyeong Lee, Luca Zedda, et al. (48 additional authors not shown)

    Abstract: The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicle (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detec…

    Submitted 28 November, 2022; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: MaCVi 2023 was part of WACV 2023. This report (38 pages) discusses the competition as part of MaCVi

  15. arXiv:2207.13321  [pdf, other]

    cs.CR cs.CV cs.LG

    DynaMarks: Defending Against Deep Learning Model Extraction Using Dynamic Watermarking

    Authors: Abhishek Chakraborty, Daniel Xing, Yuntao Liu, Ankur Srivastava

    Abstract: The functionality of a deep learning (DL) model can be stolen via model extraction where an attacker obtains a surrogate model by utilizing the responses from a prediction API of the original model. In this work, we propose a novel watermarking technique called DynaMarks to protect the intellectual property (IP) of DL models against such model extraction attacks in a black-box setting. Unlike exis…

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 7 pages, 2 figures

  16. arXiv:2205.00427  [pdf, other]

    cs.LG cs.AI

    TinyLight: Adaptive Traffic Signal Control on Devices with Extremely Limited Resources

    Authors: Dong Xing, Qian Zheng, Qianhui Liu, Gang Pan

    Abstract: Recent advances in deep reinforcement learning (DRL) have largely promoted the performance of adaptive traffic signal control (ATSC). Nevertheless, regarding the implementation, most works are cumbersome in terms of storage and computation. This hinders their deployment on scenarios where resources are limited. In this work, we propose TinyLight, the first DRL-based ATSC model that is designed for…

    Submitted 1 May, 2022; originally announced May 2022.

    Comments: Accepted by IJCAI 2022 (Long Oral)

  17. arXiv:2203.12505   

    cs.LG cs.AI

    A Spatial-Temporal Attention Multi-Graph Convolution Network for Ride-Hailing Demand Prediction Based on Periodicity with Offset

    Authors: Dong Xing, Chenguang Zhao, Gang Wang

    Abstract: Ride-hailing service is becoming a leading part in urban transportation. To improve the efficiency of ride-hailing service, accurate prediction of transportation demand is a fundamental challenge. In this paper, we tackle this problem from both aspects of network structure and data-set formulation. For network design, we propose a spatial-temporal attention multi-graph convolution network (STA-MGC…

    Submitted 8 April, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: We have found another earlier related work that uses a similar structure. Therefore, we hope to withdraw this version and will include more related work in a future version

  18. arXiv:2201.06257  [pdf, other]

    cs.MA

    GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning

    Authors: Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu

    Abstract: Many real-world scenarios involve a team of agents that have to coordinate their policies to achieve a shared goal. Previous studies mainly focus on decentralized control to maximize a common reward and barely consider the coordination among control policies, which is critical in dynamic and complicated environments. In this work, we propose factorizing the joint team policy into a graph generator…

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: Accepted by AAMAS 2022

  19. arXiv:2110.08822  [pdf, other]

    cs.CV cs.RO

    Siamese Transformer Pyramid Networks for Real-Time UAV Tracking

    Authors: Daitao Xing, Nikolaos Evangeliou, Athanasios Tsoukalas, Anthony Tzes

    Abstract: Recent object tracking methods depend upon deep networks or convoluted architectures. Most of those trackers can hardly meet real-time processing requirements on mobile platforms with limited computing resources. In this work, we introduce the Siamese Transformer Pyramid Network (SiamTPN), which inherits the advantages from both CNN and Transformer architectures. Specifically, we exploit the inher…

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: 10 pages, 8 figures, accepted by WACV 2022

  20. arXiv:2110.08126  [pdf, other]

    cs.MA cs.GT

    Learning Multi-agent Action Coordination via Electing First-move Agent

    Authors: Jingqing Ruan, Linghui Meng, Xuantang Xiong, Dengpeng Xing, Bo Xu

    Abstract: Learning to coordinate actions among agents is essential in complicated multi-agent systems. Prior works are constrained mainly by the assumption that all agents act simultaneously, and asynchronous action coordination between agents is rarely considered. This paper introduces a bi-level multi-agent decision hierarchy for coordinated behavior planning. We propose a novel election mechanism in whic…

    Submitted 24 February, 2022; v1 submitted 20 September, 2021; originally announced October 2021.

    Comments: Accepted by ICAPS 2022

  21. arXiv:2110.01668  [pdf, other]

    cs.LG cs.AI math.OC

    Learning to shortcut and shortlist order fulfillment deciding

    Authors: Brian Quanz, Ajay Deshpande, Dahai Xing, Xuan Liu

    Abstract: With the increase of order fulfillment options and business objectives taken into consideration in the deciding process, order fulfillment deciding is becoming more and more complex. For example, with the advent of ship from store retailers now have many more fulfillment nodes to consider, and it is now common to take into account many and varied business goals in making fulfillment decisions. Wit…

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: accepted and presented at 2015 INFORMS Workshop on Data Mining and Analytics

    ACM Class: I.2.6; I.5.2; H.4.2; I.5.4

  22. arXiv:2104.02402  [pdf, ps, other]

    cs.RO

    General Robot Dynamics Learning and Gen2Real

    Authors: Dengpeng Xing, Jiale Li, Yiming Yang, Bo Xu

    Abstract: Acquiring dynamics is an essential topic in robot learning, but up-to-date methods, such as dynamics randomization, need to restart to check nominal parameters, generate simulation data, and train networks whenever they face different robots. To improve it, we novelly investigate general robot dynamics, its inverse models, and Gen2Real, which means transferring to reality. Our motivations are to b…

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: On posting to RSS

  23. arXiv:2002.06199  [pdf, other]

    cs.NE

    Effective AER Object Classification Using Segmented Probability-Maximization Learning in Spiking Neural Networks

    Authors: Qianhui Liu, Haibo Ruan, Dong Xing, Huajin Tang, Gang Pan

    Abstract: Address event representation (AER) cameras have recently attracted more attention due to the advantages of high temporal resolution and low power consumption, compared with traditional frame-based cameras. Since AER cameras record the visual input as asynchronous discrete events, they are inherently suitable to coordinate with the spiking neural network (SNN), which is biologically plausible and e…

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: AAAI 2020 (Oral)

  24. arXiv:1911.08261  [pdf, other]

    cs.NE cs.CV eess.IV

    Unsupervised AER Object Recognition Based on Multiscale Spatio-Temporal Features and Spiking Neurons

    Authors: Qianhui Liu, Gang Pan, Haibo Ruan, Dong Xing, Qi Xu, Huajin Tang

    Abstract: This paper proposes an unsupervised address event representation (AER) object recognition approach. The proposed approach consists of a novel multiscale spatio-temporal feature (MuST) representation of input AER events and a spiking neural network (SNN) using spike-timing-dependent plasticity (STDP) for object recognition with MuST. MuST extracts the features contained in both the spatial and temp…

    Submitted 19 November, 2019; originally announced November 2019.

  25. arXiv:1711.11249  [pdf, other]

    cs.CV

    ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene

    Authors: Daitao Xing, Zichen Li, Xin Chen, Yi Fang

    Abstract: Arbitrary-oriented text detection in the wild is a very challenging task, due to the aspect ratio, scale, orientation, and illumination variations. In this paper, we propose a novel method, namely Arbitrary-oriented Text (or ArbText for short) detector, for efficient text detection in unconstrained natural scene images. Specifically, we first adopt the circle anchors rather than the rectangular on…

    Submitted 30 November, 2017; originally announced November 2017.

    Comments: 10 pages, 28 figures