Skip to main content

Showing 1–20 of 20 results for author: Diao, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.04290  [pdf, ps, other

    cs.CV

    MPQ-DMv2: Flexible Residual Mixed Precision Quantization for Low-Bit Diffusion Models with Temporal Distillation

    Authors: Weilun Feng, Chuanguang Yang, Haotong Qin, Yuqi Li, Xiangqi Li, Zhulin An, Libo Huang, Boyu Diao, Fuzhen Zhuang, Michele Magno, Yongjun Xu, Yingli Tian, Tingwen Huang

    Abstract: Diffusion models have demonstrated remarkable performance on vision generation tasks. However, the high computational complexity hinders its wide application on edge devices. Quantization has emerged as a promising technique for inference acceleration and memory reduction. However, existing quantization methods do not generalize well under extremely low-bit (2-4 bit) quantization. Directly applyin… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  2. arXiv:2505.22167  [pdf, other

    cs.CV

    Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers

    Authors: Weilun Feng, Chuanguang Yang, Haotong Qin, Xiangqi Li, Yu Wang, Zhulin An, Libo Huang, Boyu Diao, Zixiang Zhao, Yongjun Xu, Michele Magno

    Abstract: Diffusion transformers (DiT) have demonstrated exceptional performance in video generation. However, their large number of parameters and high computational complexity limit their deployment on edge devices. Quantization can reduce storage requirements and accelerate inference by lowering the bit-width of model parameters. Yet, existing quantization methods for image generation models do not gener… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML2025

  3. BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting Models

    Authors: Zezhi Shao, Yujie Li, Fei Wang, Chengqing Yu, Yisong Fu, Tangwen Qian, Bin Xu, Boyu Diao, Yongjun Xu, Xueqi Cheng

    Abstract: The advent of universal time series forecasting models has revolutionized zero-shot forecasting across diverse domains, yet the critical role of data diversity in training these models remains underexplored. Existing large-scale time series datasets often suffer from inherent biases and imbalanced distributions, leading to suboptimal model performance and generalization. To address this gap, we in… ▽ More

    Submitted 26 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: Accepted by SIGKDD 2025 (Research Track)

  4. arXiv:2505.08586  [pdf, ps, other

    cs.CV

    PrePrompt: Predictive prompting for class incremental learning

    Authors: Libo Huang, Zhulin An, Chuanguang Yang, Boyu Diao, Fei Wang, Yan Zeng, Zhifeng Hao, Yongjun Xu

    Abstract: Class Incremental Learning (CIL) based on pre-trained models offers a promising direction for open-world continual learning. Existing methods typically rely on correlation-based strategies, where an image's classification feature is used as a query to retrieve the most related key prompts and select the corresponding value prompts for training. However, these approaches face an inherent limitation… ▽ More

    Submitted 17 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 16 pages, 29 figures, conference

    ACM Class: I.5.4

  5. arXiv:2505.06272  [pdf, other

    cs.LG cs.AI

    A Sensitivity-Driven Expert Allocation Method in LoRA-MoE for Efficient Fine-Tuning

    Authors: Junzhou Xu, Boyu Diao

    Abstract: As deep learning models expand, the pre-training-fine-tuning paradigm has become the standard approach for handling various downstream tasks. However, shared parameters can lead to diminished performance when dealing with complex datasets involving multiple tasks. While introducing Mixture-of-Experts (MoE) methods has alleviated this issue to some extent, it also significantly increases the number… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  6. arXiv:2504.08860  [pdf, other

    cs.DC cs.AI

    A Nonlinear Hash-based Optimization Method for SpMV on GPUs

    Authors: Chen Yan, Boyu Diao, Hangda Liu, Zhulin An, Yongjun Xu

    Abstract: Sparse matrix-vector multiplication (SpMV) is a fundamental operation with a wide range of applications in scientific computing and artificial intelligence. However, the large scale and sparsity of sparse matrix often make it a performance bottleneck. In this paper, we highlight the effectiveness of hash-based techniques in optimizing sparse matrix reordering, introducing the Hash-based Partition… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: This article has been indexed by CCGrid2025

  7. arXiv:2503.22175  [pdf, other

    cs.CV

    Efficient Continual Learning through Frequency Decomposition and Integration

    Authors: Ruiqi Liu, Boyu Diao, Libo Huang, Hangda Liu, Chuanguang Yang, Zhulin An, Yongjun Xu

    Abstract: Continual learning (CL) aims to learn new tasks while retaining past knowledge, addressing the challenge of forgetting during task adaptation. Rehearsal-based methods, which replay previous samples, effectively mitigate forgetting. However, research on enhancing the efficiency of these methods, especially in resource-constrained environments, remains limited, hindering their application in real-wo… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  8. arXiv:2502.11407  [pdf, other

    cs.DC

    Gensor: A Graph-based Construction Tensor Compilation Method for Deep Learning

    Authors: Hangda Liu, Boyu Diao, Yu Yang, Wenxin Chen, Xiaohui Peng, Yongjun Xu

    Abstract: High-performance deep learning depends on efficient tensor programs. In recent years, automatic tensor program optimization, also known as tensor compilation, has emerged as the primary approach to generating efficient tensor programs. However, how to generate kernels with higher performance in a shorter time is still the key challenge. In this paper, we present Gensor, a graph-based construction… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

  9. arXiv:2412.11549  [pdf, other

    cs.CV

    MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models

    Authors: Weilun Feng, Haotong Qin, Chuanguang Yang, Zhulin An, Libo Huang, Boyu Diao, Fei Wang, Renshuai Tao, Yongjun Xu, Michele Magno

    Abstract: Diffusion models have received wide attention in generation tasks. However, the expensive computation cost prevents the application of diffusion models in resource-constrained scenarios. Quantization emerges as a practical solution that significantly saves storage and computation by reducing the bit-width of parameters. However, the existing quantization methods for diffusion models still cause se… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI 2025

  10. arXiv:2410.07679  [pdf, other

    cs.CV

    Relational Diffusion Distillation for Efficient Image Generation

    Authors: Weilun Feng, Chuanguang Yang, Zhulin An, Libo Huang, Boyu Diao, Fei Wang, Yongjun Xu

    Abstract: Although the diffusion model has achieved remarkable performance in the field of image generation, its high inference delay hinders its wide application in edge devices with scarce computing resources. Therefore, many training-free sampling methods have been proposed to reduce the number of sampling steps required for diffusion models. However, they perform poorly under a very small number of samp… ▽ More

    Submitted 15 December, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted by ACM MM 2024 Oral

  11. arXiv:2410.06645  [pdf, other

    cs.CV

    Continual Learning in the Frequency Domain

    Authors: Ruiqi Liu, Boyu Diao, Libo Huang, Zijia An, Zhulin An, Yongjun Xu

    Abstract: Continual learning (CL) is designed to learn new tasks while preserving existing knowledge. Replaying samples from earlier tasks has proven to be an effective method to mitigate the forgetting of previously acquired knowledge. However, the current research on the training efficiency of rehearsal-based methods is insufficient, which limits the practical application of CL systems in resource-limited… ▽ More

    Submitted 13 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024

  12. arXiv:2406.04829  [pdf, other

    cs.CV

    IOR: Inversed Objects Replay for Incremental Object Detection

    Authors: Zijia An, Boyu Diao, Libo Huang, Ruiqi Liu, Zhulin An, Yongjun Xu

    Abstract: Existing Incremental Object Detection (IOD) methods partially alleviate catastrophic forgetting when incrementally detecting new objects in real-world scenarios. However, many of these methods rely on the assumption that unlabeled old-class objects may co-occur with labeled new-class objects in the incremental data. When unlabeled old-class objects are absent, the performance of existing methods t… ▽ More

    Submitted 16 January, 2025; v1 submitted 7 June, 2024; originally announced June 2024.

  13. arXiv:2309.16117  [pdf, other

    cs.LG cs.AI

    E2Net: Resource-Efficient Continual Learning with Elastic Expansion Network

    Authors: RuiQi Liu, Boyu Diao, Libo Huang, Zhulin An, Yongjun Xu

    Abstract: Continual Learning methods are designed to learn new tasks without erasing previous knowledge. However, Continual Learning often requires massive computational power and storage capacity for satisfactory performance. In this paper, we propose a resource-efficient continual learning method called the Elastic Expansion Network (E2Net). Leveraging core subnet distillation and precise replay sample se… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  14. arXiv:2307.12732  [pdf, other

    cs.CV

    CLIP-KD: An Empirical Study of CLIP Model Distillation

    Authors: Chuanguang Yang, Zhulin An, Libo Huang, Junyu Bi, Xinqiang Yu, Han Yang, Boyu Diao, Yongjun Xu

    Abstract: Contrastive Language-Image Pre-training (CLIP) has become a promising language-supervised visual pre-training framework. This paper aims to distill small CLIP models supervised by a large teacher CLIP model. We propose several distillation strategies, including relation, feature, gradient and contrastive paradigms, to examine the effectiveness of CLIP-Knowledge Distillation (KD). We show that a si… ▽ More

    Submitted 7 May, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: CVPR-2024

  15. arXiv:2304.10103  [pdf, other

    cs.CV

    eTag: Class-Incremental Learning with Embedding Distillation and Task-Oriented Generation

    Authors: Libo Huang, Yan Zeng, Chuanguang Yang, Zhulin An, Boyu Diao, Yongjun Xu

    Abstract: Class-Incremental Learning (CIL) aims to solve the neural networks' catastrophic forgetting problem, which refers to the fact that once the network updates on a new task, its performance on previously-learned tasks drops dramatically. Most successful CIL methods incrementally train a feature extractor with the aid of stored exemplars, or estimate the feature distribution with the stored prototypes… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 12 pages, 12 figures

  16. arXiv:2108.06004  [pdf, ps, other

    cs.DC

    A Distributed SGD Algorithm with Global Sketching for Deep Learning Training Acceleration

    Authors: LingFei Dai, Boyu Diao, Chao Li, Yongjun Xu

    Abstract: Distributed training is an effective way to accelerate the training process of large-scale deep learning models. However, the parameter exchange and synchronization of distributed stochastic gradient descent introduce a large amount of communication overhead. Gradient compression is an effective method to reduce communication overhead. In synchronization SGD compression methods, many Top-k sparsif… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  17. arXiv:2108.05057  [pdf, other

    cs.NI

    A Channel-Aware Routing Protocol With Nearest Neighbor Regression For Underwater Sensor Networks

    Authors: Boyu Diao, Chao Li, Qi Wang, Zhulin An, Yongjun Xu

    Abstract: The underwater acoustic channel is one of the most challenging communication channels. Due to periodical tidal and daily climatic variation, underwater noise is periodically fluctuating, which result in the periodical changing of acoustic channel quality in long-term. Also, time-variant channel quality leads to routing failure. Routing protocols with acoustic channel estimation, namely underwater… ▽ More

    Submitted 14 August, 2021; v1 submitted 11 August, 2021; originally announced August 2021.

  18. arXiv:2006.12963  [pdf, other

    cs.CV

    PFGDF: Pruning Filter via Gaussian Distribution Feature for Deep Neural Networks Acceleration

    Authors: Jianrong Xu, Boyu Diao, Bifeng Cui, Kang Yang, Chao Li, Yongjun Xu

    Abstract: Deep learning has achieved impressive results in many areas, but the deployment of edge intelligent devices is still very slow. To solve this problem, we propose a novel compression and acceleration method based on data distribution characteristics for deep neural networks, namely Pruning Filter via Gaussian Distribution Feature (PFGDF). Compared with previous advanced pruning methods, PFGDF compr… ▽ More

    Submitted 26 May, 2022; v1 submitted 23 June, 2020; originally announced June 2020.

    Journal ref: 2022IJCNN

  19. arXiv:1906.00399  [pdf, ps, other

    cs.NE cs.LG

    Multi-Objective Pruning for CNNs Using Genetic Algorithm

    Authors: Chuanguang Yang, Zhulin An, Chao Li, Boyu Diao, Yongjun Xu

    Abstract: In this work, we propose a heuristic genetic algorithm (GA) for pruning convolutional neural networks (CNNs) according to the multi-objective trade-off among error, computation and sparsity. In our experiments, we apply our approach to prune pre-trained LeNet across the MNIST dataset, which reduces 95.42% parameter size and achieves 16$\times$ speedups of convolutional layer computation with tiny… ▽ More

    Submitted 4 July, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

    Comments: 6 pages,3 figures,Accepted as a conference paper at ICANN 2019

  20. arXiv:1404.3389  [pdf, other

    math.OC cs.GT eess.SY math.DS math.PR

    Mean-Field Games for Marriage

    Authors: Dario Bauso, Ben Mansour Dia, Boualem Djehiche, Hamidou Tembine, Raul Tempone

    Abstract: This article examines mean-field games for marriage. The results support the argument that optimizing the long-term well-being through effort and social feeling state distribution (mean-field) will help to stabilize marriage. However, if the cost of effort is very high, the couple fluctuates in a bad feeling state or the marriage breaks down. We then examine the influence of society on a couple us… ▽ More

    Submitted 13 April, 2014; originally announced April 2014.

    Comments: 22 figures. Accepted and to appear in PLoS One