Skip to main content

Showing 1–50 of 55 results for author: Ba, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.00287  [pdf

    cs.LG

    A Deep Learning Approach to Anomaly Detection in High-Frequency Trading Data

    Authors: Qiuliuyang Bao, Jiawei Wang, Hao Gong, Yiwei Zhang, Xiaojun Guo, Hanrui Feng

    Abstract: This paper proposes an algorithm based on a staged sliding window Transformer architecture to detect abnormal behaviors in the microstructure of the foreign exchange market, focusing on high-frequency EUR/USD trading data. The method captures multi-scale temporal features through a staged sliding window, extracts global and local dependencies by combining the self-attention mechanism and weighted… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  2. arXiv:2503.06396  [pdf, other

    cs.AI

    Optimizing Minimum Vertex Cover Solving via a GCN-assisted Heuristic Algorithm

    Authors: Enqiang Zhu, Qiqi Bao, Yu Zhang, Chanjuan Liu

    Abstract: The problem of finding a minimum vertex cover (MVC) in a graph is a well-known NP-hard problem with significant practical applications in optimization and scheduling. Its complexity, combined with the increasing scale of problems, underscores the need for efficient and effective algorithms. However, existing heuristic algorithms for MVC often rely on simplistic initialization strategies and overlo… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

  3. arXiv:2503.03755  [pdf

    cs.SI

    Research on evolution and early warning model of network public opinion based on online Latent Dirichlet distribution model and BP neural network

    Authors: Qiaozhi Bao, Yanlin Chen, Xusheng Ji

    Abstract: Online public opinion is increasingly becoming a significant factor affecting the stability of the internet and society, particularly as the frequency of online public opinion crises has risen in recent years. Enhancing the capability for early warning of online public opinion crises is urgent. The most effective approach is to identify potential crises in their early stages and implement correspo… ▽ More

    Submitted 16 February, 2025; originally announced March 2025.

  4. arXiv:2503.03702  [pdf, other

    cs.CL

    Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models

    Authors: Jiyue Jiang, Alfred Kar Yin Truong, Yanyu Chen, Qinghang Bao, Sheng Wang, Pengan Chen, Jiuming Wang, Lingpeng Kong, Yu Li, Chuan Wu

    Abstract: High-quality data resources play a crucial role in learning large language models (LLMs), particularly for low-resource languages like Cantonese. Despite having more than 85 million native speakers, Cantonese is still considered a low-resource language in the field of natural language processing (NLP) due to factors such as the dominance of Mandarin, lack of cohesion within the Cantonese-speaking… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

  5. arXiv:2501.07849  [pdf, ps, other

    cs.SE cs.AI cs.CR

    The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation

    Authors: Xiaoyu Zhang, Juan Zhai, Shiqing Ma, Qingshuang Bao, Weipeng Jiang, Qian Wang, Chao Shen, Yang Liu

    Abstract: Large Language Models (LLMs) have emerged as the new recommendation engines, surpassing traditional methods in both capability and scope, particularly in code generation. In this paper, we reveal a novel provider bias in LLMs: without explicit directives, these models show systematic preferences for services from specific providers in their recommendations (e.g., favoring Google Cloud over Microso… ▽ More

    Submitted 3 June, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: 27 pages, 13 figures

  6. arXiv:2410.10874  [pdf

    cs.CL cs.AI

    Optimizing Transformer based on high-performance optimizer for predicting employment sentiment in American social media content

    Authors: Feiyang Wang, Qiaozhi Bao, Zixuan Wang, Yanlin Chen

    Abstract: This article improves the Transformer model based on swarm intelligence optimization algorithm, aiming to predict the emotions of employment related text content on American social media. Through text preprocessing, feature extraction, and vectorization, the text data was successfully converted into numerical data and imported into the model for training. The experimental results show that during… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 5 pages, 5 figures

  7. arXiv:2410.01459  [pdf

    cs.HC eess.SP

    A Smart Chair for Health Monitoring in Daily Life

    Authors: Nguyen Thi Minh Huong, Vo Quoc Bao, Nguyen Trung Hau, Huynh Quang Linh

    Abstract: Recent research has focused on the risks associated with poor sitting posture and the impact of sitting on biological parameters, such as heart rate because prolonged sitting is common across all ages and professions. In this work, we propose a novel approach that can display simultaneously posture and heart rate in real-time. In this device, pressure sensors are embedded into a flexible separate… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  8. arXiv:2409.17778  [pdf, other

    cs.CV

    Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs

    Authors: Qinpeng Cui, Yixuan Liu, Xinyi Zhang, Qiqi Bao, Qingmin Liao, Li Wang, Tian Lu, Zicheng Liu, Zhongdao Wang, Emad Barsoum

    Abstract: Diffusion-based image super-resolution (SR) models have attracted substantial interest due to their powerful image restoration capabilities. However, prevailing diffusion models often struggle to strike an optimal balance between efficiency and performance. Typically, they either neglect to exploit the potential of existing extensive pretrained models, limiting their generative capacity, or they n… ▽ More

    Submitted 10 December, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: This paper is accepted by NeurIPS 2024

  9. arXiv:2409.08588  [pdf

    eess.IV cs.CV

    Improved Unet model for brain tumor image segmentation based on ASPP-coordinate attention mechanism

    Authors: Zixuan Wang, Yanlin Chen, Feiyang Wang, Qiaozhi Bao

    Abstract: In this paper, we propose an improved Unet model for brain tumor image segmentation, which combines coordinate attention mechanism and ASPP module to improve the segmentation effect. After the data set is divided, we do the necessary preprocessing to the image and use the improved model to experiment. First, we trained and validated the traditional Unet model. By analyzing the loss curve of the tr… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 5 pages, 8 figures, accepted by ICBASE 2024

  10. arXiv:2409.02119  [pdf, other

    cs.LG cs.AI cs.CL

    CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models

    Authors: Xiaojun Xiao, Sen Shen, Qiming Bao, Hongfei Rong, Kairui Liu, Zhongsheng Wang, Jiamou Liu

    Abstract: In fine-tuning large language models (LLMs), conserving computational resources while maintaining effectiveness and improving outcomes within the same computational constraints is crucial. The Low-Rank Adaptation (LoRA) strategy balances efficiency and performance in fine-tuning large models by reducing the number of trainable parameters and computational costs. However, current advancements in Lo… ▽ More

    Submitted 31 August, 2024; originally announced September 2024.

  11. arXiv:2408.16756  [pdf, other

    cs.CL

    How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language Models

    Authors: Jiyue Jiang, Pengan Chen, Liheng Chen, Sheng Wang, Qinghang Bao, Lingpeng Kong, Yu Li, Chuan Wu

    Abstract: The rapid evolution of large language models (LLMs) has transformed the competitive landscape in natural language processing (NLP), particularly for English and other data-rich languages. However, underrepresented languages like Cantonese, spoken by over 85 million people, face significant development gaps, which is particularly concerning given the economic significance of the Guangdong-Hong Kong… ▽ More

    Submitted 17 February, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: Accepted by NAACL 2025

  12. arXiv:2408.02922  [pdf, other

    cs.CV

    Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network

    Authors: Xinyi Zhang, Qiqi Bao, Qinpeng Cui, Wenming Yang, Qingmin Liao

    Abstract: Current state-of-the-art (SOTA) methods in 3D Human Pose Estimation (HPE) are primarily based on Transformers. However, existing Transformer-based 3D HPE backbones often encounter a trade-off between accuracy and computational efficiency. To resolve the above dilemma, in this work, we leverage recent advances in state space models and utilize Mamba for high-quality and efficient long-range modelin… ▽ More

    Submitted 25 February, 2025; v1 submitted 5 August, 2024; originally announced August 2024.

    Comments: This work has been accepted by AAAI 2025

  13. arXiv:2407.16341  [pdf, other

    cs.CV

    Motion Capture from Inertial and Vision Sensors

    Authors: Xiaodong Chen, Wu Liu, Qian Bao, Xinchen Liu, Quanwei Yang, Ruoli Dai, Tao Mei

    Abstract: Human motion capture is the foundation for many computer vision and graphics tasks. While industrial motion capture systems with complex camera arrays or expensive wearable sensors have been widely adopted in movie and game production, consumer-affordable and easy-to-use solutions for personal applications are still far from mature. To utilize a mixture of a monocular camera and very few inertial… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 17 pages,9 figures

  14. arXiv:2407.10162  [pdf, other

    cs.AI

    ChatLogic: Integrating Logic Programming with Large Language Models for Multi-Step Reasoning

    Authors: Zhongsheng Wang, Jiamou Liu, Qiming Bao, Hongfei Rong, Jingfeng Zhang

    Abstract: Large language models (LLMs) such as ChatGPT and GPT-4 have demonstrated impressive capabilities in various generative tasks. However, their performance is often hampered by limitations in accessing and leveraging long-term memory, leading to specific vulnerabilities and biases, especially during long interactions. This paper introduces ChatLogic, an innovative framework specifically targeted at L… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures. This paper has been accepted by WCCI IJCNN 2024

  15. arXiv:2407.09521  [pdf, other

    cs.CV cs.NE

    Apprenticeship-Inspired Elegance: Synergistic Knowledge Distillation Empowers Spiking Neural Networks for Efficient Single-Eye Emotion Recognition

    Authors: Yang Wang, Haiyang Mei, Qirui Bao, Ziqi Wei, Mike Zheng Shou, Haizhou Li, Bo Dong, Xin Yang

    Abstract: We introduce a novel multimodality synergistic knowledge distillation scheme tailored for efficient single-eye motion recognition tasks. This method allows a lightweight, unimodal student spiking neural network (SNN) to extract rich knowledge from an event-frame multimodal teacher network. The core strength of this approach is its ability to utilize the ample, coarser temporal cues found in conven… ▽ More

    Submitted 20 June, 2024; originally announced July 2024.

    Comments: Accepted by IJCAI 2024

  16. arXiv:2404.08831  [pdf, other

    eess.IV cs.CV cs.LG

    Structured Model Pruning for Efficient Inference in Computational Pathology

    Authors: Mohammed Adnan, Qinle Ba, Nazim Shaikh, Shivam Kalra, Satarupa Mukherjee, Auranuch Lorsakul

    Abstract: Recent years have seen significant efforts to adopt Artificial Intelligence (AI) in healthcare for various use cases, from computer-aided diagnosis to ICU triage. However, the size of AI models has been rapidly growing due to scaling laws and the success of foundational models, which poses an increasing challenge to leverage advanced models in practical applications. It is thus imperative to devel… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  17. arXiv:2403.07905  [pdf

    cs.DC cs.AI cs.LG

    Enhancing Kubernetes Automated Scheduling with Deep Learning and Reinforcement Techniques for Large-Scale Cloud Computing Optimization

    Authors: Zheng Xu, Yulu Gong, Yanlin Zhou, Qiaozhi Bao, Wenpin Qian

    Abstract: With the continuous expansion of the scale of cloud computing applications, artificial intelligence technologies such as Deep Learning and Reinforcement Learning have gradually become the key tools to solve the automated task scheduling of large-scale cloud computing systems. Aiming at the complexity and real-time requirement of task scheduling in large-scale cloud computing system, this paper pro… ▽ More

    Submitted 26 February, 2024; originally announced March 2024.

  18. arXiv:2401.01078  [pdf, other

    cs.CL cs.AI

    Vietnamese Poem Generation & The Prospect Of Cross-Language Poem-To-Poem Translation

    Authors: Triet Minh Huynh, Quan Le Bao

    Abstract: Poetry generation has been a challenging task in the field of Natural Language Processing, as it requires the model to understand the nuances of language, sentiment, and style. In this paper, we propose using Large Language Models to generate Vietnamese poems of various genres from natural language prompts, thereby facilitating an intuitive process with enhanced content control. Our most efficacio… ▽ More

    Submitted 4 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  19. arXiv:2310.09430  [pdf, ps, other

    cs.CL cs.AI

    Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning

    Authors: Qiming Bao, Gael Gendron, Alex Yuxuan Peng, Wanjun Zhong, Neset Tan, Yang Chen, Michael Witbrock, Jiamou Liu

    Abstract: Large language models (LLMs), such as LLaMA, Alpaca, Vicuna, GPT-3.5 and GPT-4, have advanced the performance of AI systems on various natural language processing tasks to human-like levels. However, their generalisation and robustness when performing logical reasoning has not been sufficiently assessed. To comprehensively evaluate this ability, we develop three new logical reasoning datasets name… ▽ More

    Submitted 16 January, 2025; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: The short version (v3) was accepted for oral presentation at the first LLM@IJCAI 2023 non-archival symposium, and the full version was accepted by ICONIP 2024

  20. arXiv:2309.10444  [pdf, other

    cs.AI cs.CL

    Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models

    Authors: Qiming Bao, Juho Leinonen, Alex Yuxuan Peng, Wanjun Zhong, Gaël Gendron, Timothy Pistotti, Alice Huang, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Large language models exhibit superior capabilities in processing and understanding language, yet their applications in educational contexts remain underexplored. Learnersourcing enhances learning by engaging students in creating their own educational content. When learnersourcing multiple-choice questions, creating explanations for the solution of a question is a crucial step; it helps other stud… ▽ More

    Submitted 16 January, 2025; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: The short version (v4) has been accepted as a non-archival workshop paper at AGI@ICLR 2024, and the full version has been accepted by the main track of AAAI/EAAI 2025

  21. arXiv:2309.06169  [pdf, other

    cs.LG cs.CV

    Elucidating the solution space of extended reverse-time SDE for diffusion models

    Authors: Qinpeng Cui, Xinyi Zhang, Qiqi Bao, Qingmin Liao

    Abstract: Sampling from Diffusion Models can alternatively be seen as solving differential equations, where there is a challenge in balancing speed and image visual quality. ODE-based samplers offer rapid sampling time but reach a performance limit, whereas SDE-based samplers achieve superior quality, albeit with longer iterations. In this work, we formulate the sampling process as an Extended Reverse-Time… ▽ More

    Submitted 27 February, 2025; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by WACV 2025 (Oral). The official version lacked proper attribution to the co-authors, and this version has been updated accordingly

  22. Measures and Optimization for Robustness and Vulnerability in Disconnected Networks

    Authors: Liwang Zhu, Qi Bao, Zhongzhi Zhang

    Abstract: The function or performance of a network is strongly dependent on its robustness, quantifying the ability of the network to continue functioning under perturbations. While a wide variety of robustness metrics have been proposed, they have their respective limitations. In this paper, we propose to use the forest index as a measure of network robustness, which overcomes the deficiencies of existing… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 13 pages

    Journal ref: IEEE Transactions on Information Forensics and Security,pp:3350-3362,2023

  23. arXiv:2306.02850  [pdf, other

    cs.CV

    TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments

    Authors: Yu Sun, Qian Bao, Wu Liu, Tao Mei, Michael J. Black

    Abstract: Although the estimation of 3D human pose and shape (HPS) is rapidly progressing, current methods still cannot reliably estimate moving humans in global coordinates, which is critical for many applications. This is particularly challenging when the camera is also moving, entangling human and camera motion. To address these issues, we adopt a novel 5D representation (space, time, and identity) that… ▽ More

    Submitted 20 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Project page: https://www.yusun.work/TRACE/TRACE.html

  24. arXiv:2305.19555  [pdf, ps, other

    cs.CL cs.LG

    Large Language Models Are Not Strong Abstract Reasoners

    Authors: Gaël Gendron, Qiming Bao, Michael Witbrock, Gillian Dobbie

    Abstract: Large Language Models have shown tremendous performance on a large variety of natural language processing tasks, ranging from text comprehension to common sense reasoning. However, the mechanisms responsible for this success remain opaque, and it is unclear whether LLMs can achieve human-like cognitive capabilities or whether these models are still fundamentally circumscribed. Abstract reasoning i… ▽ More

    Submitted 2 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 50 pages, 14 pages for the main paper and 36 pages for the supplement, 35 figures, 17 tables. V3: performed additional experiments

    ACM Class: I.2.2; I.2.3; I.2.7; I.5.1

  25. arXiv:2305.12599  [pdf, other

    cs.CL cs.AI

    Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning

    Authors: Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gael Gendron, Timothy Pistotti, Neset Tan, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Combining large language models with logical reasoning enhances their capacity to address problems in a robust and reliable manner. Nevertheless, the intricate nature of logical reasoning poses challenges when gathering reliable data from the web to build comprehensive training datasets, subsequently affecting performance on downstream tasks. To address this, we introduce a novel logic-driven data… ▽ More

    Submitted 17 April, 2025; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, the Findings of ACL 2024

  26. arXiv:2303.07585  [pdf, other

    cs.CL

    Input-length-shortening and text generation via attention values

    Authors: Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock

    Abstract: Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformers models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention (i.e., relevance) scores to some words than others. Because of the attention mechanism's high computational cost, transformer models usually have an input-leng… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 7 pages, 4 figures. AAAI23-EMC2

  27. arXiv:2209.02431  [pdf, other

    cs.CV

    DPIT: Dual-Pipeline Integrated Transformer for Human Pose Estimation

    Authors: Shuaitao Zhao, Kun Liu, Yuhang Huang, Qian Bao, Dan Zeng, Wu Liu

    Abstract: Human pose estimation aims to figure out the keypoints of all people in different scenes. Current approaches still face some challenges despite promising results. Existing top-down methods deal with a single person individually, without the interaction between different people and the scene they are situated in. Consequently, the performance of human detection degrades when serious occlusion happe… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  28. arXiv:2209.01059  [pdf, other

    cs.HC

    In-Place Gestures Classification via Long-term Memory Augmented Network

    Authors: Lizhi Zhao, Xuequan Lu, Qianyue Bao, Meili Wang

    Abstract: In-place gesture-based virtual locomotion techniques enable users to control their viewpoint and intuitively move in the 3D virtual environment. A key research problem is to accurately and quickly recognize in-place gestures, since they can trigger specific movements of virtual viewpoints and enhance user experience. However, to achieve real-time experience, only short-term sensor sequence data (u… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: This paper is accepted to IEEE ISMAR2022

  29. WOC: A Handy Webcam-based 3D Online Chatroom

    Authors: Chuanhang Yan, Yu Sun, Qian Bao, Jinhui Pang, Wu Liu, Tao Mei

    Abstract: We develop WOC, a webcam-based 3D virtual online chatroom for multi-person interaction, which captures the 3D motion of users and drives their individual 3D virtual avatars in real-time. Compared to the existing wearable equipment-based solution, WOC offers convenient and low-cost 3D motion capture with a single camera. To promote the immersive chat experience, WOC provides high-fidelity virtual a… ▽ More

    Submitted 17 March, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

  30. arXiv:2208.03609  [pdf, other

    eess.IV cs.CV cs.LG

    Continual Learning for Tumor Classification in Histopathology Images

    Authors: Veena Kaustaban, Qinle Ba, Ipshita Bhattacharya, Nahil Sobh, Satarupa Mukherjee, Jim Martin, Mohammad Saleh Miri, Christoph Guetter, Amal Chaturvedi

    Abstract: Recent years have seen great advancements in the development of deep learning models for histopathology image analysis in digital pathology applications, evidenced by the increasingly common deployment of these models in both research and clinical settings. Although such models have shown unprecedented performance in solving fundamental computational tasks in DP applications, they suffer from cata… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: Accepted by MOVI, a MICCAI2022 workshop: https://sites.google.com/view/movi2022

  31. arXiv:2207.14000  [pdf, other

    cs.CL cs.AI cs.LG cs.LO

    Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

    Authors: Qiming Bao, Alex Yuxuan Peng, Tim Hartill, Neset Tan, Zhenyun Deng, Michael Witbrock, Jiamou Liu

    Abstract: Combining deep learning with symbolic logic reasoning aims to capitalize on the success of both fields and is drawing increasing attention. Inspired by DeepLogic, an end-to-end model trained to perform inference on logic programs, we introduce IMA-GloVe-GA, an iterative neural inference network for multi-step reasoning expressed in natural language. In our model, reasoning is performed using an it… ▽ More

    Submitted 17 April, 2025; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: 10 pages, 3 figures, The 2nd International Joint Conference on Learning & Reasoning and 16th International Workshop on Neural-Symbolic Learning and Reasoning (IJCLR-NeSy 2022)

  32. arXiv:2203.12186  [pdf, other

    cs.CL

    AbductionRules: Training Transformers to Explain Unexpected Inputs

    Authors: Nathan Young, Qiming Bao, Joshua Bensemann, Michael Witbrock

    Abstract: Transformers have recently been shown to be capable of reliably performing logical reasoning over facts and rules expressed in natural language, but abductive reasoning - inference to the best explanation of an unexpected observation - has been underexplored despite significant applications to scientific discovery, common-sense reasoning, and model interpretability. We present AbductionRules, a… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Findings of ACL 2022

  33. arXiv:2201.04024  [pdf, other

    cs.CV cs.MM

    Smart Director: An Event-Driven Directing System for Live Broadcasting

    Authors: Yingwei Pan, Yue Chen, Qian Bao, Ning Zhang, Ting Yao, Jingen Liu, Tao Mei

    Abstract: Live video broadcasting normally requires a multitude of skills and expertise with domain knowledge to enable multi-camera productions. As the number of cameras keep increasing, directing a live sports broadcast has now become more complicated and challenging than ever before. The broadcast directors need to be much more concentrated, responsive, and knowledgeable, during the production. To reliev… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

  34. Relating Blindsight and AI: A Review

    Authors: Joshua Bensemann, Qiming Bao, Gaël Gendron, Tim Hartill, Michael Witbrock

    Abstract: Processes occurring in brains, a.k.a. biological neural networks, can and have been modeled within artificial neural network architectures. Due to this, we have conducted a review of research on the phenomenon of blindsight in an attempt to generate ideas for artificial intelligence models. Blindsight can be considered as a diminished form of visual experience. If we assume that artificial network… ▽ More

    Submitted 8 December, 2021; originally announced January 2022.

    Comments: Preprint of an article published in Journal of Artificial Intelligence and Consciousness, 2021 doi.org/10.1142/S2705078521500156 \c{opyright} copyright World Scientific Publishing Company www.worldscientific.com/worldscinet/jaic

    Journal ref: Journal of Artificial Intelligence and Consciousness, 1-15 (2021)

  35. arXiv:2201.00466  [pdf, other

    eess.IV cs.CV

    RFormer: Transformer-based Generative Adversarial Network for Real Fundus Image Restoration on A New Clinical Benchmark

    Authors: Zhuo Deng, Yuanhao Cai, Lu Chen, Zheng Gong, Qiqi Bao, Xue Yao, Dong Fang, Shaochong Zhang, Lan Ma

    Abstract: Ophthalmologists have used fundus images to screen and diagnose eye diseases. However, different equipments and ophthalmologists pose large variations to the quality of fundus images. Low-quality (LQ) degraded fundus images easily lead to uncertainty in clinical screening and generally increase the risk of misdiagnosis. Thus, real fundus image restoration is worth studying. Unfortunately, real cli… ▽ More

    Submitted 3 August, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

    Comments: IEEE J-BHI 2022; The First Benchmark and First Transformer-based Method for Real Clinical Fundus Image Restoration

  36. arXiv:2112.08274  [pdf, other

    cs.CV

    Putting People in their Place: Monocular Regression of 3D People in Depth

    Authors: Yu Sun, Wu Liu, Qian Bao, Yili Fu, Tao Mei, Michael J. Black

    Abstract: Given an image with multiple people, our goal is to directly regress the pose and shape of all the people as well as their relative depth. Inferring the depth of a person in an image, however, is fundamentally ambiguous without knowing their height. This is particularly problematic when the scene contains people of very different sizes, e.g. from infants to adults. To solve this, we need several t… ▽ More

    Submitted 19 April, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: CVPR 2022; Code https://github.com/Arthur151/ROMP ; Dataset https://github.com/Arthur151/Relative_Human

  37. arXiv:2111.10058  [pdf, other

    cs.CL cs.AI cs.LG

    DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions

    Authors: Lin Ni, Qiming Bao, Xiaoxuan Li, Qianqian Qi, Paul Denny, Jim Warren, Michael Witbrock, Jiamou Liu

    Abstract: Automated question quality rating (AQQR) aims to evaluate question quality through computational means, thereby addressing emerging challenges in online learnersourced question repositories. Existing methods for AQQR rely solely on explicitly-defined criteria such as readability and word count, while not fully utilising the power of state-of-the-art deep-learning techniques. We propose DeepQR, a n… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: EAAI 22

  38. arXiv:2110.07872  [pdf, ps, other

    cs.SI cs.AI

    Role Similarity Metric Based on Spanning Rooted Forest

    Authors: Qi Bao, Zhongzhi Zhang, Haibin Kan

    Abstract: As a fundamental issue in network analysis, structural node similarity has received much attention in academia and is adopted in a wide range of applications. Among these proposed structural node similarity measures, role similarity stands out because of satisfying several axiomatic properties including automorphism conformation. Existing role similarity metrics cannot handle top-k queries on larg… ▽ More

    Submitted 1 April, 2024; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: 10 pages, 5 figures

  39. arXiv:2108.13246  [pdf, other

    cs.CV

    LUAI Challenge 2021 on Learning to Understand Aerial Images

    Authors: Gui-Song Xia, Jian Ding, Ming Qian, Nan Xue, Jiaming Han, Xiang Bai, Michael Ying Yang, Shengyang Li, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang, Qiang Zhou, Chao-hui Yu, Kaixuan Hu, Yingjia Bu, Wenming Tan, Zhe Yang, Wei Li, Shang Liu, Jiaxuan Zhao, Tianzhi Ma, Zi-han Gao, Lingqi Wang , et al. (11 additional authors not shown)

    Abstract: This report summarizes the results of Learning to Understand Aerial Images (LUAI) 2021 challenge held on ICCV 2021, which focuses on object detection and semantic segmentation in aerial images. Using DOTA-v2.0 and GID-15 datasets, this challenge proposes three tasks for oriented object detection, horizontal object detection, and semantic segmentation of common categories in aerial images. This cha… ▽ More

    Submitted 17 September, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 7 pages, 2 figures, accepted by ICCVW 2021

  40. arXiv:2104.11536  [pdf, other

    cs.CV

    Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective

    Authors: Wu Liu, Qian Bao, Yu Sun, Tao Mei

    Abstract: Estimation of the human pose from a monocular camera has been an emerging research topic in the computer vision community with many applications. Recently, benefited from the deep learning technologies, a significant amount of research efforts have greatly advanced the monocular human pose estimation both in 2D and 3D areas. Although there have been some works to summarize the different approaches… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  41. arXiv:2102.07087  [pdf, other

    cs.NI eess.SP

    Survey on Aerial Radio Access Networks: Toward a Comprehensive 6G Access Infrastructure

    Authors: Nhu-Ngoc Dao, Quoc-Viet Pham, Ngo Hoang Tu, Tran Thien Thanh, Vo Nguyen Quoc Bao, Demeke Shumeye Lakew, Sungrae Cho

    Abstract: Current network access infrastructures are characterized by heterogeneity, low latency, high throughput, and high computational capability, enabling massive concurrent connections and various services. Unfortunately, this design does not pay significant attention to mobile services in underserved areas. In this context, the use of aerial radio access networks (ARANs) is a promising strategy to com… ▽ More

    Submitted 27 February, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: Accepted by the IEEE Communications Surveys and Tutorials

  42. Fast Evaluation for Relevant Quantities of Opinion Dynamics

    Authors: Wanyue Xu, Qi Bao, Zhongzhi Zhang

    Abstract: One of the main subjects in the field of social networks is to quantify conflict, disagreement, controversy, and polarization, and some quantitative indicators have been developed to quantify these concepts. However, direct computation of these indicators involves the operations of matrix inversion and multiplication, which make it computationally infeasible for large-scale graphs with millions of… ▽ More

    Submitted 12 June, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

    Journal ref: Proceedings of The Web Conference 2021, pp.2037-2045

  43. LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification

    Authors: Jiangjie Chen, Qiaoben Bao, Changzhi Sun, Xinbo Zhang, Jiaze Chen, Hao Zhou, Yanghua Xiao, Lei Li

    Abstract: Given a natural language statement, how to verify its veracity against a large-scale textual knowledge source like Wikipedia? Most existing neural models make predictions without giving clues about which part of a false claim goes wrong. In this paper, we propose LOREN, an approach for interpretable fact verification. We decompose the verification of the whole claim at phrase-level, where the vera… ▽ More

    Submitted 9 December, 2021; v1 submitted 25 December, 2020; originally announced December 2020.

    Comments: Accepted to AAAI 2022

  44. arXiv:2012.04821  [pdf, ps, other

    cs.CL

    Complex Relation Extraction: Challenges and Opportunities

    Authors: Haiyun Jiang, Qiaoben Bao, Qiao Cheng, Deqing Yang, Li Wang, Yanghua Xiao

    Abstract: Relation extraction aims to identify the target relations of entities in texts. Relation extraction is very important for knowledge base construction and text understanding. Traditional binary relation extraction, including supervised, semi-supervised and distant supervised ones, has been extensively studied and significant results are achieved. In recent years, many complex relation extraction ta… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

    Comments: 7 pages

  45. arXiv:2010.14036  [pdf, other

    cs.CV

    Synthetic Training for Monocular Human Mesh Recovery

    Authors: Yu Sun, Qian Bao, Wu Liu, Wenpeng Gao, Yili Fu, Chuang Gan, Tao Mei

    Abstract: Recovering 3D human mesh from monocular images is a popular topic in computer vision and has a wide range of applications. This paper aims to estimate 3D mesh of multiple body parts (e.g., body, hands) with large-scale differences from a single RGB image. Existing methods are mostly based on iterative optimization, which is very time-consuming. We propose to train a single-shot model to achieve th… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

  46. arXiv:2008.12272  [pdf, other

    cs.CV

    Monocular, One-stage, Regression of Multiple 3D People

    Authors: Yu Sun, Qian Bao, Wu Liu, Yili Fu, Michael J. Black, Tao Mei

    Abstract: This paper focuses on the regression of multiple 3D people from a single RGB image. Existing approaches predominantly follow a multi-stage pipeline that first detects people in bounding boxes and then independently regresses their 3D body meshes. In contrast, we propose to Regress all meshes in a One-stage fashion for Multiple 3D People (termed ROMP). The approach is conceptually simple, bounding… ▽ More

    Submitted 16 September, 2021; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: ICCV 2021, Code https://github.com/Arthur151/ROMP

  47. arXiv:2008.06285  [pdf, other

    cs.CV cs.LG eess.IV

    Rb-PaStaNet: A Few-Shot Human-Object Interaction Detection Based on Rules and Part States

    Authors: Shenyu Zhang, Zichen Zhu, Qingquan Bao

    Abstract: Existing Human-Object Interaction (HOI) Detection approaches have achieved great progress on nonrare classes while rare HOI classes are still not well-detected. In this paper, we intend to apply human prior knowledge into the existing work. So we add human-labeled rules to PaStaNet and propose Rb-PaStaNet aimed at improving rare HOI classes detection. Our results show a certain improvement of the… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.

  48. arXiv:2006.13693  [pdf, other

    q-bio.PE cs.LG physics.soc-ph

    PECAIQR: A Model for Infectious Disease Applied to the Covid-19 Epidemic

    Authors: Richard Bao, August Chen, Jethin Gowda, Shiva Mudide

    Abstract: The Covid-19 pandemic has made clear the need to improve modern multivariate time-series forecasting models. Current state of the art predictions of future daily deaths and, especially, hospital resource usage have confidence intervals that are unacceptably wide. Policy makers and hospitals require accurate forecasts to make informed decisions on passing legislation and allocating resources. We us… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  49. HHH: An Online Medical Chatbot System based on Knowledge Graph and Hierarchical Bi-Directional Attention

    Authors: Qiming Bao, Lin Ni, Jiamou Liu

    Abstract: This paper proposes a chatbot framework that adopts a hybrid model which consists of a knowledge graph and a text similarity model. Based on this chatbot framework, we build HHH, an online question-and-answer (QA) Healthcare Helper system for answering complex medical questions. HHH maintains a knowledge graph constructed from medical data collected from the Internet. HHH also implements a novel t… ▽ More

    Submitted 8 February, 2020; originally announced February 2020.

    Comments: 10 pages, 9 figures, 3 tables. Proceedings of the Australasian Computer Science Week Multiconference (ACSW 2020)

  50. arXiv:1911.08045  [pdf, ps, other

    math.CO cs.DM cs.NI

    The k-Power Domination Number in Some Self-Similar Graphs

    Authors: Yulun Xu, Qi Bao, Zhongzhi Zhang

    Abstract: The $k$-power domination problem is a problem in graph theory, which has applications in many areas. However, it is hard to calculate the exact $k$-power domination number since determining k-power domination number of a generic graph is a NP-complete problem. We determine the exact $k$-power domination number in two graphs which have the same number of vertices and edges: pseudofractal scale-free… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.