Showing 1–13 of 13 results for author: Sui, M

  1. arXiv:2507.08729  [pdf, ps, other]

    cs.CV

    RoundaboutHD: High-Resolution Real-World Urban Environment Benchmark for Multi-Camera Vehicle Tracking

    Authors: Yuqiang Lin, Sam Lockyer, Mingxuan Sui, Li Gan, Florian Stanek, Markus Zarbock, Wenbin Li, Adrian Evans, Nic Zhang

    Abstract: The multi-camera vehicle tracking (MCVT) framework holds significant potential for smart city applications, including anomaly detection, traffic density estimation, and suspect vehicle tracking. However, current publicly available datasets exhibit limitations, such as overly simplistic scenarios, low-resolution footage, and insufficiently diverse conditions, creating a considerable gap between aca…

    Submitted 21 July, 2025; v1 submitted 11 July, 2025; originally announced July 2025.

  2. arXiv:2410.19130  [pdf]

    cs.LG cs.AI cs.CR

    Research on Key Technologies for Cross-Cloud Federated Training of Large Language Models

    Authors: Haowei Yang, Mingxiu Sui, Shaobo Liu, Xinyue Qian, Zhaoyang Zhang, Bingying Liu

    Abstract: With the rapid development of natural language processing technology, large language models have demonstrated exceptional performance in various application scenarios. However, training these models requires significant computational resources and data processing capabilities. Cross-cloud federated training offers a new approach to addressing the resource bottlenecks of a single cloud platform, al…

    Submitted 22 December, 2024; v1 submitted 24 October, 2024; originally announced October 2024.

  3. arXiv:2410.09923  [pdf]

    cs.IR cs.AI

    Analysis and Design of a Personalized Recommendation System Based on a Dynamic User Interest Model

    Authors: Chunyan Mao, Shuaishuai Huang, Mingxiu Sui, Haowei Yang, Xueshe Wang

    Abstract: With the rapid development of the internet and the explosion of information, providing users with accurate personalized recommendations has become an important research topic. This paper designs and analyzes a personalized recommendation system based on a dynamic user interest model. The system captures user behavior data, constructs a dynamic user interest model, and combines multiple recommendat…

    Submitted 13 October, 2024; originally announced October 2024.

  4. arXiv:2409.13868  [pdf]

    eess.IV cs.CV cs.LG

    Deep Learning-Based Channel Squeeze U-Structure for Lung Nodule Detection and Segmentation

    Authors: Mingxiu Sui, Jiacheng Hu, Tong Zhou, Zibo Liu, Likang Wen, Junliang Du

    Abstract: This paper introduces a novel deep-learning method for the automatic detection and segmentation of lung nodules, aimed at advancing the accuracy of early-stage lung cancer diagnosis. The proposed approach leverages a unique "Channel Squeeze U-Structure" that optimizes feature extraction and information integration across multiple semantic levels of the network. This architecture includes three key…

    Submitted 20 September, 2024; originally announced September 2024.

  5. arXiv:2409.13701  [pdf, ps, other]

    cs.CL cs.AI

    CA-BERT: Leveraging Context Awareness for Enhanced Multi-Turn Chat Interaction

    Authors: Minghao Liu, Mingxiu Sui, Yi Nan, Cangqing Wang, Zhijie Zhou

    Abstract: Effective communication in automated chat systems hinges on the ability to understand and respond to context. Traditional models often struggle with determining when additional context is necessary for generating appropriate responses. This paper introduces Context-Aware BERT (CA-BERT), a transformer-based model specifically fine-tuned to address this challenge. CA-BERT innovatively applies deep l…

    Submitted 1 October, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

    Comments: This paper has been accepted by ICBASE 2024

  6. arXiv:2409.04977  [pdf]

    cs.LG cs.AI cs.CV

    Enhancing Convolutional Neural Networks with Higher-Order Numerical Difference Methods

    Authors: Qi Wang, Zijun Gao, Mingxiu Sui, Taiyuan Mei, Xiaohan Cheng, Iris Li

    Abstract: With the rise of deep learning technology in practical applications, Convolutional Neural Networks (CNNs) have been able to assist humans in solving many real-world problems. To enhance the performance of CNNs, numerous network architectures have been explored. Some of these architectures are designed based on the accumulated experience of researchers over time, while others are designed through n…

    Submitted 8 September, 2024; originally announced September 2024.

  7. arXiv:2405.13290  [pdf, other]

    cs.LG cs.AI

    Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees

    Authors: Cangqing Wang, Mingxiu Sui, Dan Sun, Zecheng Zhang, Yan Zhou

    Abstract: This research delves deeply into Meta Reinforcement Learning (Meta RL) through an exploration focusing on defining generalization limits and ensuring convergence. By employing an approach, this article introduces an innovative theoretical framework to meticulously assess the effectiveness and performance of Meta RL algorithms. We present an explanation of generalization limits measuring how well thes…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted by the 2024 International Conference on Modeling, Natural Language Processing and Machine Learning (CMNM 2024)

  8. arXiv:2206.04975  [pdf, other]

    cs.CV

    NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition

    Authors: Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

    Abstract: Dynamic facial expression recognition (DFER) in the wild is an extremely challenging task, due to a large number of noisy frames in the video sequences. Previous works focus on extracting more discriminative features, but ignore distinguishing the key frames from the noisy frames. To tackle this problem, we propose a noise-robust dynamic facial expression recognition network (NR-DFERNet), which ca…

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 10 pages

  9. arXiv:2205.11785  [pdf, other]

    cs.CV cs.AI

    AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition

    Authors: Mingzhe Sui, Hanting Li, Zhaoqing Zhu, Feng Zhao

    Abstract: 2D+3D facial expression recognition (FER) can effectively cope with illumination changes and pose variations by simultaneously merging 2D texture and more robust 3D depth information. Most deep learning-based approaches employ the simple fusion strategy that concatenates the multimodal features directly after fully-connected layers, without considering the different degrees of significance for eac…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 6 pages, 6 figures, 4 tables

  10. arXiv:2201.05297  [pdf, other]

    cs.CV

    MMNet: Muscle motion-guided network for micro-expression recognition

    Authors: Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

    Abstract: Facial micro-expressions (MEs) are involuntary facial motions revealing people's real feelings and play an important role in the early intervention of mental illness, national security, and many human-computer interaction systems. However, existing micro-expression datasets are limited and usually pose some challenges for training good classifiers. To model the subtle facial muscle motions, we…

    Submitted 19 August, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: 8 pages, 4 figures

    Journal ref: Proc. 31st Int'l Joint Conf. Artificial Intelligence (IJCAI), 2022

  11. arXiv:2109.13086  [pdf, other]

    cs.CV cs.AI

    MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition

    Authors: Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

    Abstract: Vision transformer (ViT) has been widely applied in many areas due to its self-attention mechanism that helps obtain the global receptive field since the first layer. It even achieves surprising performance exceeding CNN in some vision tasks. However, there exists an issue when leveraging vision transformer into 2D+3D facial expression recognition (FER), i.e., ViT training needs mass data. Nonethel…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 9 pages, 6 figures, 5 tables

  12. arXiv:2106.04520  [pdf, other]

    cs.CV

    MVT: Mask Vision Transformer for Facial Expression Recognition in the wild

    Authors: Hanting Li, Mingzhe Sui, Feng Zhao, Zhengjun Zha, Feng Wu

    Abstract: Facial Expression Recognition (FER) in the wild is an extremely challenging task in computer vision due to variant backgrounds, low-quality facial images, and the subjectiveness of annotators. These uncertainties make it difficult for neural networks to learn robust features on limited-scale datasets. Moreover, the networks can be easily disturbed by the above factors and make incorrect decis…

    Submitted 10 July, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: 11 pages, 6 figures, 5 tables, conference

  13. arXiv:1402.0273  [pdf]

    cs.CY cs.HC

    The Designing of Online Multiple Intelligence Tools for Lecturers at Polytechnic

    Authors: Sazilah Salam, Siti Nurul Mahfuzah Mohamad, Norasiken Bakar, Linda Khoo Mei Sui

    Abstract: This paper addresses the designing of Online Multiple Intelligence (MI) Teaching Tools for Polytechnic lecturers. These teaching tools can assist lecturers to create their own teaching materials without having any knowledge of Information Technology (IT) especially in programming. The theory of MI is used in this paper and this theory postulates that everybody has at least two or more intelligence…

    Submitted 2 February, 2014; originally announced February 2014.

    Comments: 7 pages, 4 figures, 1 table, International Journal of Soft Computing and Software Engineering [JSCSE], Vol. 3, No. 3