Skip to main content

Showing 1–14 of 14 results for author: Zhai, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2504.19119  [pdf, other

    eess.IV

    MLICv2: Enhanced Multi-Reference Entropy Modeling for Learned Image Compression

    Authors: Wei Jiang, Yongqi Zhai, Jiayu Yang, Feng Gao, Ronggang Wang

    Abstract: Recent advancements in learned image compression (LIC) have yielded impressive performance gains. Notably, the learned image compression models with multi-reference entropy models (MLIC series) have significantly outperformed existing traditional image codecs such as the Versatile Video Coding (VVC) Intra. In this paper, we present MLICv2 and MLICv2$^+$, enhanced versions of the MLIC series, featu… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: Under Review

  2. arXiv:2412.00437  [pdf, other

    eess.IV cs.CV

    DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

    Authors: Yongqi Zhai, Yi Ma, Luyang Tang, Wei Jiang, Ronggang Wang

    Abstract: Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, most existing scalable compression methods face two challenges: reduced compression performance and insufficient scalability. To overcome the above problems, this paper proposes a learned fine-grained scalable image compression framework, namely DeepFGS. Specifically, we… ▽ More

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: Accepted to DCC 2025

  3. arXiv:2401.08154  [pdf, ps, other

    cs.CV eess.IV

    TLIC: Learned Image Compression with ROI-Weighted Distortion and Bit Allocation

    Authors: Wei Jiang, Yongqi Zhai, Hangyu Li, Ronggang Wang

    Abstract: This short paper describes our method for the track of image compression. To achieve better perceptual quality, we use the adversarial loss to generate realistic textures, use region of interest (ROI) mask to guide the bit allocation for different regions. Our Team name is TLIC.

    Submitted 23 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 2nd Place in the Image Compression Track, CLIC 2024, DCC 2024

  4. arXiv:2307.15421  [pdf, other

    eess.IV cs.CV

    MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression

    Authors: Wei Jiang, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang

    Abstract: The latent representation in learned image compression encompasses channel-wise, local spatial, and global spatial correlations, which are essential for the entropy model to capture for conditional entropy minimization. Efficiently capturing these contexts within a single entropy model, especially in high-resolution image coding, presents a challenge due to the computational complexity of existing… ▽ More

    Submitted 17 February, 2025; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted to ICML 2023 Neural Compression Workshop and ACM Transactions on Multimedia Computing, Communications, and Applications 2025

  5. arXiv:2306.15433  [pdf, other

    eess.SP

    Recursive LMMSE-Based Iterative Soft Interference Cancellation for MIMO Systems to Save Computations and Memories

    Authors: Hufei Zhu, Fuqin Deng, Yikui Zhai, Jiaming Zhong, Yanyang Liang

    Abstract: Firstly, a reordered description is given for the linear minimum mean square error (LMMSE)-based iterative soft interference cancellation (ISIC) detection process for Mutipleinput multiple-output (MIMO) wireless communication systems, which is based on the equivalent channel matrix. Then the above reordered description is applied to compare the detection process for LMMSE-ISIC with that for the ha… ▽ More

    Submitted 5 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  6. arXiv:2304.09571  [pdf, other

    cs.CV cs.MM eess.IV

    LLIC: Large Receptive Field Transform Coding with Adaptive Weights for Learned Image Compression

    Authors: Wei Jiang, Peirong Ning, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang

    Abstract: The effective receptive field (ERF) plays an important role in transform coding, which determines how much redundancy can be removed during transform and how many spatial priors can be utilized to synthesize textures during inverse transform. Existing methods rely on stacks of small kernels, whose ERFs remain insufficiently large, or heavy non-local attention mechanisms, which limit the potential… ▽ More

    Submitted 21 June, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Transactions on Multimedia 2024

  7. arXiv:2302.14570   

    math.OC eess.SY

    Byzantine-Resilient Multi-Agent Distributed Exact Optimization with Less Data

    Authors: Yang Zhai, Zhi-Wei Liu, Dong Yue, Songlin Hu, Xiangpeng Xie

    Abstract: This paper studies the distributed multi-agent resilient optimization problem under the f-total Byzantine attacks. Compared with the previous work on Byzantineresilient multi-agent exact optimization problems, we do not require the communication topology to be fully connected. Under the redundancy of cost functions, we propose the distributed comparative gradient elimination resilient optimization… ▽ More

    Submitted 28 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: There are some errors in the provement of this paper

  8. MLIC: Multi-Reference Entropy Model for Learned Image Compression

    Authors: Wei Jiang, Jiayu Yang, Yongqi Zhai, Peirong Ning, Feng Gao, Ronggang Wang

    Abstract: Recently, learned image compression has achieved remarkable performance. The entropy model, which estimates the distribution of the latent representation, plays a crucial role in boosting rate-distortion performance. However, most entropy models only capture correlations in one dimension, while the latent representation contain channel-wise, local spatial, and global spatial correlations. To tackl… ▽ More

    Submitted 13 September, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted to ACMMM 2023

    Journal ref: Proceedings of the 31st ACM International Conference on Multimedia, pp.7618--7627, 2023

  9. arXiv:2201.01173  [pdf, other

    eess.IV cs.CV

    DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

    Authors: Yi Ma, Yongqi Zhai, Ronggang Wang

    Abstract: Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, the existing scalable compression methods face two challenges: reduced compression performance and insufficient scalability. In this paper, we propose the first learned fine-grained scalable image compression model (DeepFGS) to overcome the above two shortcomings. Specifi… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

  10. arXiv:2103.00673  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training

    Authors: Sheng Liu, Xiao Li, Yuexiang Zhai, Chong You, Zhihui Zhu, Carlos Fernandez-Granda, Qing Qu

    Abstract: Normalization techniques have become a basic component in modern convolutional neural networks (ConvNets). In particular, many recent works demonstrate that promoting the orthogonality of the weights helps train deep models and improve robustness. For ConvNets, most existing methods are based on penalizing or normalizing weight matrices derived from concatenating or flattening the convolutional ke… ▽ More

    Submitted 3 January, 2022; v1 submitted 28 February, 2021; originally announced March 2021.

    Comments: SL and XL contributed equally to this work; 23 pages, 6 figures, 6 tables, published in NeurIPS'21

  11. Adaptive multi-channel event segmentation and feature extraction for monitoring health outcomes

    Authors: Xichen She, Yaya Zhai, Ricardo Henao, Christopher W. Woods, Christopher Chiu, Geoffrey S. Ginsburg, Peter X. K. Song, Alfred O. Hero

    Abstract: $\textbf{Objective}$: To develop a multi-channel device event segmentation and feature extraction algorithm that is robust to changes in data distribution. $\textbf{Methods}… ▽ More

    Submitted 19 November, 2020; v1 submitted 20 August, 2020; originally announced August 2020.

    Journal ref: IEEE Transactions on Biomedical Engineering, Nov. 17 2020

  12. arXiv:2001.06236  [pdf

    eess.IV cs.CV

    Detection Method Based on Automatic Visual Shape Clustering for Pin-Missing Defect in Transmission Lines

    Authors: Zhenbing Zhao, Hongyu Qi, Yincheng Qi, Ke Zhang, Yongjie Zhai, Wenqing Zhao

    Abstract: Bolts are the most numerous fasteners in transmission lines and are prone to losing their split pins. How to realize the automatic pin-missing defect detection for bolts in transmission lines so as to achieve timely and efficient trouble shooting is a difficult problem and the long-term research target of power systems. In this paper, an automatic detection model called Automatic Visual Shape Clus… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

  13. arXiv:1912.02427  [pdf, other

    cs.LG cs.IT eess.SP math.OC stat.ML

    Analysis of the Optimization Landscapes for Overcomplete Representation Learning

    Authors: Qing Qu, Yuexiang Zhai, Xiao Li, Yuqian Zhang, Zhihui Zhu

    Abstract: We study nonconvex optimization landscapes for learning overcomplete representations, including learning (i) sparsely used overcomplete dictionaries and (ii) convolutional dictionaries, where these unsupervised learning problems find many applications in high-dimensional data analysis. Despite the empirical success of simple nonconvex algorithms, theoretical justifications of why these methods wor… ▽ More

    Submitted 10 December, 2019; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: 68 pages, 5 figures

  14. arXiv:1906.02435  [pdf, other

    cs.LG eess.SP stat.CO stat.ML

    Complete Dictionary Learning via $\ell^4$-Norm Maximization over the Orthogonal Group

    Authors: Yuexiang Zhai, Zitong Yang, Zhenyu Liao, John Wright, Yi Ma

    Abstract: This paper considers the fundamental problem of learning a complete (orthogonal) dictionary from samples of sparsely generated signals. Most existing methods solve the dictionary (and sparse representations) based on heuristic algorithms, usually without theoretical guarantees for either optimality or complexity. The recent $\ell^1$-minimization based methods do provide such guarantees but the ass… ▽ More

    Submitted 6 April, 2021; v1 submitted 6 June, 2019; originally announced June 2019.