Skip to main content

Showing 1–50 of 85 results for author: Qu, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.02980  [pdf, ps, other

    q-bio.GN cs.LG

    Modeling Gene Expression Distributional Shifts for Unseen Genetic Perturbations

    Authors: Kalyan Ramakrishnan, Jonathan G. Hedley, Sisi Qu, Puneet K. Dokania, Philip H. S. Torr, Cesar A. Prada-Medina, Julien Fauqueur, Kaspar Martens

    Abstract: We train a neural network to predict distributional responses in gene expression following genetic perturbations. This is an essential task in early-stage drug discovery, where such responses can offer insights into gene function and inform target identification. Existing methods only predict changes in the mean expression, overlooking stochasticity inherent in single-cell data. In contrast, we of… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  2. arXiv:2506.07591  [pdf, ps, other

    cs.AI q-bio.QM

    Automating Exploratory Multiomics Research via Language Models

    Authors: Shang Qu, Ning Ding, Linhai Xie, Yifei Li, Zaoqu Liu, Kaiyan Zhang, Yibai Xiong, Yuxin Zuo, Zhangren Chen, Ermo Hua, Xingtai Lv, Youbang Sun, Yang Li, Dong Li, Fuchu He, Bowen Zhou

    Abstract: This paper introduces PROTEUS, a fully automated system that produces data-driven hypotheses from raw data files. We apply PROTEUS to clinical proteogenomics, a field where effective downstream data analysis and hypothesis proposal is crucial for producing novel discoveries. PROTEUS uses separate modules to simulate different stages of the scientific process, from open-ended data exploration to sp… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  3. arXiv:2505.23434  [pdf, ps, other

    cs.CV

    UrbanCraft: Urban View Extrapolation via Hierarchical Sem-Geometric Priors

    Authors: Tianhang Wang, Fan Lu, Sanqing Qu, Guo Yu, Shihang Du, Ya Wu, Yuan Huang, Guang Chen

    Abstract: Existing neural rendering-based urban scene reconstruction methods mainly focus on the Interpolated View Synthesis (IVS) setting that synthesizes from views close to training camera trajectory. However, IVS can not guarantee the on-par performance of the novel view outside the training camera distribution (\textit{e.g.}, looking left, right, or downwards), which limits the generalizability of the… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  4. arXiv:2504.16084  [pdf, ps, other

    cs.CL cs.LG

    TTRL: Test-Time Reinforcement Learning

    Authors: Yuxin Zuo, Kaiyan Zhang, Li Sheng, Shang Qu, Ganqu Cui, Xuekai Zhu, Haozhan Li, Yuchen Zhang, Xinwei Long, Ermo Hua, Biqing Qi, Youbang Sun, Zhiyuan Ma, Lifan Yuan, Ning Ding, Bowen Zhou

    Abstract: This paper investigates Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in Large Language Models (LLMs). The core challenge of the problem is reward estimation during inference while not having access to ground-truth information. While this setting appears elusive, we find that common practices in Test-Time Scaling (TTS), such as majority voting, yield surprisingly… ▽ More

    Submitted 30 June, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

  5. arXiv:2503.11224  [pdf, other

    cs.LG cs.AI cs.CL

    Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

    Authors: Xingtai Lv, Youbang Sun, Kaiyan Zhang, Shang Qu, Xuekai Zhu, Yuchen Fan, Yi Wu, Ermo Hua, Xinwei Long, Ning Ding, Bowen Zhou

    Abstract: State Space Models (SSMs) have emerged as a promising alternative to the popular transformer-based models and have been increasingly gaining attention. Compared to transformers, SSMs excel at tasks with sequential data or longer contexts, demonstrating comparable performances with significant efficiency gains. In this survey, we provide a coherent and systematic overview for SSMs, including their… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  6. arXiv:2503.10080  [pdf, ps, other

    cs.CV

    Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection

    Authors: Zhen Qu, Xian Tao, Xinyi Gong, Shichen Qu, Qiyu Chen, Zhengtao Zhang, Xingang Wang, Guiguang Ding

    Abstract: Recently, vision-language models (e.g. CLIP) have demonstrated remarkable performance in zero-shot anomaly detection (ZSAD). By leveraging auxiliary data during training, these models can directly perform cross-category anomaly detection on target datasets, such as detecting defects on industrial product surfaces or identifying tumors in organ tissues. Existing approaches typically construct text… ▽ More

    Submitted 3 June, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  7. arXiv:2502.11742  [pdf, other

    cs.CV

    Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

    Authors: Jianyi Peng, Fan Lu, Bin Li, Yuan Huang, Sanqing Qu, Guang Chen

    Abstract: Image-to-point cloud cross-modal Visual Place Recognition (VPR) is a challenging task where the query is an RGB image, and the database samples are LiDAR point clouds. Compared to single-modal VPR, this approach benefits from the widespread availability of RGB cameras and the robustness of point clouds in providing accurate spatial geometry and distance information. However, current methods rely o… ▽ More

    Submitted 28 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  8. arXiv:2502.11715  [pdf, other

    cs.LG cs.AI

    Proactive Depot Discovery: A Generative Framework for Flexible Location-Routing

    Authors: Site Qu, Guoqiang Hu

    Abstract: The Location-Routing Problem (LRP), which combines the challenges of facility (depot) locating and vehicle route planning, is critically constrained by the reliance on predefined depot candidates, limiting the solution space and potentially leading to suboptimal outcomes. Previous research on LRP without predefined depots is scant and predominantly relies on heuristic algorithms that iteratively a… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  9. arXiv:2501.18362  [pdf, ps, other

    cs.AI cs.CL cs.CV cs.LG

    MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

    Authors: Yuxin Zuo, Shang Qu, Yifei Li, Zhangren Chen, Xuekai Zhu, Ermo Hua, Kaiyan Zhang, Ning Ding, Bowen Zhou

    Abstract: We introduce MedXpertQA, a highly challenging and comprehensive benchmark to evaluate expert-level medical knowledge and advanced reasoning. MedXpertQA includes 4,460 questions spanning 17 specialties and 11 body systems. It includes two subsets, Text for text evaluation and MM for multimodal evaluation. Notably, MM introduces expert-level exam questions with diverse images and rich clinical infor… ▽ More

    Submitted 6 June, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: ICML 2025

  10. arXiv:2501.09187  [pdf, other

    cs.CV cs.AI cs.LG

    Patch-aware Vector Quantized Codebook Learning for Unsupervised Visual Defect Detection

    Authors: Qisen Cheng, Shuhui Qu, Janghwan Lee

    Abstract: Unsupervised visual defect detection is critical in industrial applications, requiring a representation space that captures normal data features while detecting deviations. Achieving a balance between expressiveness and compactness is challenging; an overly expressive space risks inefficiency and mode collapse, impairing detection accuracy. We propose a novel approach using an enhanced VQ-VAE fram… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

    Comments: 7 pages, Accepted to 36th IEEE ICTAI 2024

  11. arXiv:2501.07048  [pdf, other

    cs.AI

    Unveiling the Potential of Text in High-Dimensional Time Series Forecasting

    Authors: Xin Zhou, Weiqing Wang, Shilin Qu, Zhiqiang Zhang, Christoph Bergmeir

    Abstract: Time series forecasting has traditionally focused on univariate and multivariate numerical data, often overlooking the benefits of incorporating multimodal information, particularly textual data. In this paper, we propose a novel framework that integrates time series models with Large Language Models to improve high-dimensional time series forecasting. Inspired by multimodal models, our method com… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

    Comments: Accepted by NeurIPS24 TSALM Workshop

  12. arXiv:2412.17623  [pdf, other

    math.OC cs.LG

    Towards An Unsupervised Learning Scheme for Efficiently Solving Parameterized Mixed-Integer Programs

    Authors: Shiyuan Qu, Fenglian Dong, Zhiwei Wei, Chao Shang

    Abstract: In this paper, we describe a novel unsupervised learning scheme for accelerating the solution of a family of mixed integer programming (MIP) problems. Distinct substantially from existing learning-to-optimize methods, our proposal seeks to train an autoencoder (AE) for binary variables in an unsupervised learning fashion, using data of optimal solutions to historical instances for a parametric fam… ▽ More

    Submitted 24 December, 2024; v1 submitted 23 December, 2024; originally announced December 2024.

  13. arXiv:2412.02267  [pdf, other

    cs.CV cs.RO

    GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos

    Authors: Zhiyuan Chen, Fan Lu, Guo Yu, Bin Li, Sanqing Qu, Yuan Huang, Changhong Fu, Guang Chen

    Abstract: Tracking the 6DoF pose of unknown objects in monocular RGB video sequences is crucial for robotic manipulation. However, existing approaches typically rely on accurate depth information, which is non-trivial to obtain in real-world scenarios. Although depth estimation algorithms can be employed, geometric inaccuracy can lead to failures in RGBD-based pose tracking methods. To address this challeng… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  14. arXiv:2411.12354  [pdf, other

    cs.IR

    Scalable and Effective Negative Sample Generation for Hyperedge Prediction

    Authors: Shilin Qu, Weiqing Wang, Yuan-Fang Li, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: Hyperedge prediction is crucial in hypergraph analysis for understanding complex multi-entity interactions in various web-based applications, including social networks and e-commerce systems. Traditional methods often face difficulties in generating high-quality negative samples due to the imbalance between positive and negative instances. To address this, we present the Scalable and Effective Neg… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 11

  15. arXiv:2411.03743  [pdf, other

    cs.AI q-bio.QM

    Automating Exploratory Proteomics Research via Language Models

    Authors: Ning Ding, Shang Qu, Linhai Xie, Yifei Li, Zaoqu Liu, Kaiyan Zhang, Yibai Xiong, Yuxin Zuo, Zhangren Chen, Ermo Hua, Xingtai Lv, Youbang Sun, Yang Li, Dong Li, Fuchu He, Bowen Zhou

    Abstract: With the development of artificial intelligence, its contribution to science is evolving from simulating a complex problem to automating entire research processes and producing novel discoveries. Achieving this advancement requires both specialized general models grounded in real-world scientific data and iterative, exploratory frameworks that mirror human scientific methodologies. In this paper,… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  16. FastSTI: A Fast Conditional Pseudo Numerical Diffusion Model for Spatio-temporal Traffic Data Imputation

    Authors: Shaokang Cheng, Nada Osman, Shiru Qu, Lamberto Ballan

    Abstract: High-quality spatiotemporal traffic data is crucial for intelligent transportation systems (ITS) and their data-driven applications. Inevitably, the issue of missing data caused by various disturbances threatens the reliability of data acquisition. Recent studies of diffusion probability models have demonstrated the superiority of deep generative models in imputation tasks by precisely capturing t… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: This paper has been accepted by IEEE Transactions on Intelligent Transportation Systems for publication. Permission from IEEE must be obtained for all other uses, in any current or future media

    Journal ref: IEEE Transactions on Intelligent Transportation Systems (Early Access) 2024

  17. arXiv:2410.07539  [pdf, other

    cond-mat.mtrl-sci cs.AI

    Efficient Generation of Molecular Clusters with Dual-Scale Equivariant Flow Matching

    Authors: Akshay Subramanian, Shuhui Qu, Cheol Woo Park, Sulin Liu, Janghwan Lee, Rafael Gómez-Bombarelli

    Abstract: Amorphous molecular solids offer a promising alternative to inorganic semiconductors, owing to their mechanical flexibility and solution processability. The packing structure of these materials plays a crucial role in determining their electronic and transport properties, which are key to enhancing the efficiency of devices like organic solar cells (OSCs). However, obtaining these optoelectronic p… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  18. arXiv:2410.03049  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Scalable Frame-based Construction of Sociocultural NormBases for Socially-Aware Dialogues

    Authors: Shilin Qu, Weiqing Wang, Xin Zhou, Haolan Zhan, Zhuang Li, Lizhen Qu, Linhao Luo, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Sociocultural norms serve as guiding principles for personal conduct in social interactions, emphasizing respect, cooperation, and appropriate behavior, which is able to benefit tasks including conversational information retrieval, contextual information retrieval and retrieval-enhanced machine learning. We propose a scalable approach for constructing a Sociocultural Norm (SCN) Base using Large La… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 17 pages

    Journal ref: TOMM 2024

  19. arXiv:2408.04245  [pdf, other

    cs.LG cs.AI cs.IR

    Scalable Transformer for High Dimensional Multivariate Time Series Forecasting

    Authors: Xin Zhou, Weiqing Wang, Wray Buntine, Shilin Qu, Abishek Sriramulu, Weicong Tan, Christoph Bergmeir

    Abstract: Deep models for Multivariate Time Series (MTS) forecasting have recently demonstrated significant success. Channel-dependent models capture complex dependencies that channel-independent models cannot capture. However, the number of channels in real-world applications outpaces the capabilities of existing channel-dependent models, and contrary to common expectations, some models underperform the ch… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    ACM Class: H.3

  20. arXiv:2407.19148  [pdf, other

    cs.CV

    Few-Shot Medical Image Segmentation with Large Kernel Attention

    Authors: Xiaoxiao Wu, Xiaowei Chen, Zhenguo Gao, Shulei Qu, Yuanyuan Qiu

    Abstract: Medical image segmentation has witnessed significant advancements with the emergence of deep learning. However, the reliance of most neural network models on a substantial amount of annotated data remains a challenge for medical image segmentation. To address this issue, few-shot segmentation methods based on meta-learning have been employed. Presently, the methods primarily focus on aligning the… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  21. arXiv:2407.17705  [pdf

    cs.CV

    ALMRR: Anomaly Localization Mamba on Industrial Textured Surface with Feature Reconstruction and Refinement

    Authors: Shichen Qu, Xian Tao, Zhen Qu, Xinyi Gong, Zhengtao Zhang, Mukesh Prasad

    Abstract: Unsupervised anomaly localization on industrial textured images has achieved remarkable results through reconstruction-based methods, yet existing approaches based on image reconstruction and feature reconstruc-tion each have their own shortcomings. Firstly, image-based methods tend to reconstruct both normal and anomalous regions well, which lead to over-generalization. Feature-based methods cont… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  22. arXiv:2407.12387  [pdf, other

    cs.CV

    HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation

    Authors: Tianpei Zou, Sanqing Qu, Zhijun Li, Alois Knoll, Lianghua He, Guang Chen, Changjun Jiang

    Abstract: 3D point cloud segmentation has received significant interest for its growing applications. However, the generalization ability of models suffers in dynamic scenarios due to the distribution shift between test and training data. To promote robustness and adaptability across diverse scenarios, test-time adaptation (TTA) has recently been introduced. Nevertheless, most existing TTA methods are devel… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Journal ref: ECCV 2024

  23. arXiv:2406.13419  [pdf, ps, other

    cs.RO

    An eight-neuron network for quadruped locomotion with hip-knee joint control

    Authors: Yide Liu, Xiyan Liu, Dongqi Wang, Wei Yang, shaoxing Qu

    Abstract: The gait generator, which is capable of producing rhythmic signals for coordinating multiple joints, is an essential component in the quadruped robot locomotion control framework. The biological counterpart of the gait generator is the Central Pattern Generator (abbreviated as CPG), a small neural network consisting of interacting neurons. Inspired by this architecture, researchers have designed a… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  24. arXiv:2406.09481  [pdf, other

    cs.CV cs.LG

    ELF-UA: Efficient Label-Free User Adaptation in Gaze Estimation

    Authors: Yong Wu, Yang Wang, Sanqing Qu, Zhijun Li, Guang Chen

    Abstract: We consider the problem of user-adaptive 3D gaze estimation. The performance of person-independent gaze estimation is limited due to interpersonal anatomical differences. Our goal is to provide a personalized gaze estimation model specifically adapted to a target user. Previous work on user-adaptive gaze estimation requires some labeled images of the target person data to fine-tune the model at te… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by IJCAI'24

  25. arXiv:2405.19327  [pdf, other

    cs.CL cs.AI cs.LG

    MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

    Authors: Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kaijing Ma, Minghao Liu, Morry Niu , et al. (20 additional authors not shown)

    Abstract: Large Language Models (LLMs) have made great strides in recent years to achieve unprecedented performance across different tasks. However, due to commercial interest, the most competitive models like GPT, Gemini, and Claude have been gated behind proprietary interfaces without disclosing the training details. Recently, many institutions have open-sourced several strong LLMs like LLaMA-3, comparabl… ▽ More

    Submitted 10 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: https://map-neo.github.io/

  26. arXiv:2405.07845  [pdf, other

    cs.CV

    Multi-Task Learning for Fatigue Detection and Face Recognition of Drivers via Tree-Style Space-Channel Attention Fusion Network

    Authors: Shulei Qu, Zhenguo Gao, Xiaowei Chen, Na Li, Yakai Wang, Xiaoxiao Wu

    Abstract: In driving scenarios, automobile active safety systems are increasingly incorporating deep learning technology. These systems typically need to handle multiple tasks simultaneously, such as detecting fatigue driving and recognizing the driver's identity. However, the traditional parallel-style approach of combining multiple single-task models tends to waste resources when dealing with similar task… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  27. arXiv:2405.07516  [pdf, other

    cs.CV

    Support-Query Prototype Fusion Network for Few-shot Medical Image Segmentation

    Authors: Xiaoxiao Wu, Zhenguo Gao, Xiaowei Chen, Yakai Wang, Shulei Qu, Na Li

    Abstract: In recent years, deep learning based on Convolutional Neural Networks (CNNs) has achieved remarkable success in many applications. However, their heavy reliance on extensive labeled data and limited generalization ability to unseen classes pose challenges to their suitability for medical image processing tasks. Few-shot learning, which utilizes a small amount of labeled data to generalize to unsee… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 19 pages, 7 figures, 4 tables

  28. arXiv:2404.15582  [pdf, other

    cs.CR

    Now Let's Make It Physical: Enabling Physically Trusted Certificate Issuance for Keyless Security in CAs

    Authors: Xiaolin Zhang, Chenghao Chen, Kailun Qin, Yuxuan Wang, Shipei Qu, Tengfei Wang, Chi Zhang, Dawu Gu

    Abstract: The signing key protection of Certificate Authorities (CAs) remains a critical challenge in PKI. Traditional approaches struggle to eliminate the risk of key exposure due to those (un)intentional human errors. This long-standing dilemma motivates us to propose Armored Core, a novel PKI security extension using the trusted binding of Physically Unclonable Function (PUF) for CAs. PUFs leverage manuf… ▽ More

    Submitted 13 January, 2025; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Under peer review

  29. Deep Reinforcement Learning Based Toolpath Generation for Thermal Uniformity in Laser Powder Bed Fusion Process

    Authors: Mian Qin, Junhao Ding, Shuo Qu, Xu Song, Charlie C. L. Wang, Wei-Hsin Liao

    Abstract: Laser powder bed fusion (LPBF) is a widely used metal additive manufacturing technology. However, the accumulation of internal residual stress during printing can cause significant distortion and potential failure. Although various scan patterns have been studied to reduce possible accumulated stress, such as zigzag scanning vectors with changing directions or a chessboard-based scan pattern with… ▽ More

    Submitted 16 February, 2024; originally announced April 2024.

    Journal ref: Additive Manufacturing, vol.79, 103937 (12 pages), January 2024

  30. arXiv:2403.14410  [pdf, other

    cs.CV cs.AI cs.LG

    GLC++: Source-Free Universal Domain Adaptation through Global-Local Clustering and Contrastive Affinity Learning

    Authors: Sanqing Qu, Tianpei Zou, Florian Röhrbein, Cewu Lu, Guang Chen, Dacheng Tao, Changjun Jiang

    Abstract: Deep neural networks often exhibit sub-optimal performance under covariate and category shifts. Source-Free Domain Adaptation (SFDA) presents a promising solution to this dilemma, yet most SFDA approaches are restricted to closed-set scenarios. In this paper, we explore Source-Free Universal Domain Adaptation (SF-UniDA) aiming to accurately classify "known" data belonging to common categories and… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: This is a substantial extension of the CVPR 2023 paper "Upcycling Models under Domain and Category Shift"

  31. arXiv:2403.04149  [pdf, other

    cs.CV

    MAP: MAsk-Pruning for Source-Free Model Intellectual Property Protection

    Authors: Boyang Peng, Sanqing Qu, Yong Wu, Tianpei Zou, Lianghua He, Alois Knoll, Guang Chen, changjun jiang

    Abstract: Deep learning has achieved remarkable progress in various applications, heightening the importance of safeguarding the intellectual property (IP) of well-trained models. It entails not only authorizing usage but also ensuring the deployment of models in authorized data domains, i.e., making models exclusive to certain target domains. Previous methods necessitate concurrent access to source trainin… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  32. arXiv:2403.03421  [pdf, other

    cs.CV cs.AI cs.LG

    LEAD: Learning Decomposition for Source-free Universal Domain Adaptation

    Authors: Sanqing Qu, Tianpei Zou, Lianghua He, Florian Röhrbein, Alois Knoll, Guang Chen, Changjun Jiang

    Abstract: Universal Domain Adaptation (UniDA) targets knowledge transfer in the presence of both covariate and label shifts. Recently, Source-free Universal Domain Adaptation (SF-UniDA) has emerged to achieve UniDA without access to source data, which tends to be more practical due to data protection policies. The main challenge lies in determining whether covariate-shifted samples belong to target-private… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: To appear in CVPR 2024

  33. arXiv:2402.18925  [pdf, other

    cs.CV

    PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds

    Authors: Haotian Liu, Sanqing Qu, Fan Lu, Zongtao Bu, Florian Roehrbein, Alois Knoll, Guang Chen

    Abstract: Event cameras can record scene dynamics with high temporal resolution, providing rich scene details for monocular depth estimation (MDE) even at low-level illumination. Therefore, existing complementary learning approaches for MDE fuse intensity information from images and scene details from event data for better scene understanding. However, most methods directly fuse two modalities at pixel leve… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Under Review

  34. arXiv:2402.11178  [pdf, other

    cs.CL

    RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations

    Authors: Haolan Zhan, Zhuang Li, Xiaoxi Kang, Tao Feng, Yuncheng Hua, Lizhen Qu, Yi Ying, Mei Rianto Chandra, Kelly Rosalin, Jureynolds Jureynolds, Suraj Sharma, Shilin Qu, Linhao Luo, Lay-Ki Soon, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Norm violations occur when individuals fail to conform to culturally accepted behaviors, which may lead to potential conflicts. Remediating norm violations requires social awareness and cultural sensitivity of the nuances at play. To equip interactive AI systems with a remediation ability, we offer ReNoVi - a large-scale corpus of 9,258 multi-turn dialogues annotated with social norms, as well as… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: work in progress. 15 pages, 7 figures

  35. arXiv:2402.08908  [pdf, other

    cs.CR

    Teamwork Makes TEE Work: Open and Resilient Remote Attestation on Decentralized Trust

    Authors: Xiaolin Zhang, Kailun Qin, Shipei Qu, Tengfei Wang, Chi Zhang, Dawu Gu

    Abstract: Remote Attestation (RA) enables the integrity and authenticity of applications in Trusted Execution Environment (TEE) to be verified. Existing TEE RA designs employ a centralized trust model where they rely on a single provisioned secret key and a centralized verifier to establish trust for remote parties. This model is however brittle and can be untrusted under advanced attacks nowadays. Besides,… ▽ More

    Submitted 9 August, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 18 pages, 9 figures. Under peer review of some IEEE Transaction Journal

  36. CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators

    Authors: Songyun Qu, Shixin Zhao, Bing Li, Yintao He, Xuyi Cai, Lei Zhang, Ying Wang

    Abstract: In recent years, various computing-in-memory (CIM) processors have been presented, showing superior performance over traditional architectures. To unleash the potential of various CIM architectures, such as device precision, crossbar size, and crossbar number, it is necessary to develop compilation tools that are fully aware of the CIM architectural details and implementation diversity. However, d… ▽ More

    Submitted 8 May, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 16 pages, 22 figures

    ACM Class: D.3.4

  37. arXiv:2312.17052  [pdf, other

    cs.CV

    Multi-Attention Fusion Drowsy Driving Detection Model

    Authors: Shulei QU, Zhenguo Gao, Xiaoxiao Wu, Yuanyuan Qiu

    Abstract: Drowsy driving represents a major contributor to traffic accidents, and the implementation of driver drowsy driving detection systems has been proven to significantly reduce the occurrence of such accidents. Despite the development of numerous drowsy driving detection algorithms, many of them impose specific prerequisites such as the availability of complete facial images, optimal lighting conditi… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 8 pages, 6 figures

  38. arXiv:2312.00336  [pdf, other

    cs.LG cs.IR

    Hypergraph Node Representation Learning with One-Stage Message Passing

    Authors: Shilin Qu, Weiqing Wang, Yuan-Fang Li, Xin Zhou, Fajie Yuan

    Abstract: Hypergraphs as an expressive and general structure have attracted considerable attention from various research domains. Most existing hypergraph node representation learning techniques are based on graph neural networks, and thus adopt the two-stage message passing paradigm (i.e. node -> hyperedge -> node). This paradigm only focuses on local information propagation and does not effectively take i… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 11 pages

  39. Abusing Processor Exception for General Binary Instrumentation on Bare-metal Embedded Devices

    Authors: Shipei Qu, Xiaolin Zhang, Chi Zhang, Dawu Gu

    Abstract: Analyzing the security of closed-source drivers and libraries in embedded systems holds significant importance, given their fundamental role in the supply chain. Unlike x86, embedded platforms lack comprehensive binary manipulating tools, making it difficult for researchers and developers to effectively detect and patch security issues in such closed-source components. Existing works either depend… ▽ More

    Submitted 23 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted by the 61st ACM/IEEE Design Automation Conference (DAC '24), June 23--27, 2024, San Francisco, CA, USA

  40. arXiv:2311.04418  [pdf, other

    cond-mat.mtrl-sci cs.AI physics.comp-ph

    AI-accelerated Discovery of Altermagnetic Materials

    Authors: Ze-Feng Gao, Shuai Qu, Bocheng Zeng, Yang Liu, Ji-Rong Wen, Hao Sun, Peng-Jie Guo, Zhong-Yi Lu

    Abstract: Altermagnetism, a new magnetic phase, has been theoretically proposed and experimentally verified to be distinct from ferromagnetism and antiferromagnetism. Although altermagnets have been found to possess many exotic physical properties, the limited availability of known altermagnetic materials hinders the study of such properties. Hence, discovering more types of altermagnetic materials with dif… ▽ More

    Submitted 13 May, 2025; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 46 pages; 23 figures; 4 tables

    Journal ref: National Science Review, Volume 12, Issue 4, April 2025

  41. arXiv:2309.16804  [pdf, other

    cs.CL

    Curriculum-Driven Edubot: A Framework for Developing Language Learning Chatbots Through Synthesizing Conversational Data

    Authors: Yu Li, Shang Qu, Jili Shen, Shangchao Min, Zhou Yu

    Abstract: Chatbots have become popular in educational settings, revolutionizing how students interact with material and how teachers teach. We present Curriculum-Driven EduBot, a framework for developing a chatbot that combines the interactive features of chatbots with the systematic material of English textbooks to assist students in enhancing their conversational skills. We begin by extracting pertinent t… ▽ More

    Submitted 3 August, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: SIGDIAL 2024, 23 pages

  42. arXiv:2309.10435  [pdf, other

    cs.IR cs.CL

    Reformulating Sequential Recommendation: Learning Dynamic User Interest with Content-enriched Language Modeling

    Authors: Junzhe Jiang, Shang Qu, Mingyue Cheng, Qi Liu, Zhiding Liu, Hao Zhang, Rujiao Zhang, Kai Zhang, Rui Li, Jiatong Li, Min Gao

    Abstract: Recommender systems are indispensable in the realm of online applications, and sequential recommendation has enjoyed considerable prevalence due to its capacity to encapsulate the dynamic shifts in user interests. However, previous sequential modeling methods still have limitations in capturing contextual information. The primary reason is the lack of understanding of domain-specific knowledge and… ▽ More

    Submitted 13 April, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  43. arXiv:2309.08799  [pdf, other

    cs.LG cs.AI

    SHAPNN: Shapley Value Regularized Tabular Neural Network

    Authors: Qisen Cheng, Shuhui Qu, Janghwan Lee

    Abstract: We present SHAPNN, a novel deep tabular data modeling architecture designed for supervised learning. Our approach leverages Shapley values, a well-established technique for explaining black-box models. Our neural network is trained using standard backward propagation optimization methods, and is regularized with realtime estimated Shapley values. Our method offers several advantages, including the… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 9 pages, 8 figures

  44. arXiv:2305.16598  [pdf, other

    cs.CL

    NormMark: A Weakly Supervised Markov Model for Socio-cultural Norm Discovery

    Authors: Farhad Moghimifar, Shilin Qu, Tongtong Wu, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Norms, which are culturally accepted guidelines for behaviours, can be integrated into conversational models to generate utterances that are appropriate for the socio-cultural context. Existing methods for norm recognition tend to focus only on surface-level features of dialogues and do not take into account the interactions within a conversation. To address this issue, we propose NormMark, a prob… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  45. PIE: Personalized Interest Exploration for Large-Scale Recommender Systems

    Authors: Khushhall Chandra Mahajan, Amey Porobo Dharwadker, Romil Shah, Simeng Qu, Gaurav Bang, Brad Schumitsch

    Abstract: Recommender systems are increasingly successful in recommending personalized content to users. However, these systems often capitalize on popular content. There is also a continuous evolution of user interests that need to be captured, but there is no direct way to systematically explore users' interests. This also tends to affect the overall quality of the recommendation pipeline as training data… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted by WWW'2023

  46. arXiv:2303.11629  [pdf, other

    cs.CV

    TMA: Temporal Motion Aggregation for Event-based Optical Flow

    Authors: Haotian Liu, Guang Chen, Sanqing Qu, Yanping Zhang, Zhijun Li, Alois Knoll, Changjun Jiang

    Abstract: Event cameras have the ability to record continuous and detailed trajectories of objects with high temporal resolution, thereby providing intuitive motion cues for optical flow estimation. Nevertheless, most existing learning-based approaches for event optical flow estimation directly remould the paradigm of conventional images by representing the consecutive event stream as static frames, ignorin… ▽ More

    Submitted 21 August, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted by ICCV2023

  47. arXiv:2303.10035  [pdf, other

    eess.SY cs.LG cs.MA cs.RO

    A Policy Iteration Approach for Flock Motion Control

    Authors: Shuzheng Qu, Mohammed Abouheaf, Wail Gueaieb, Davide Spinello

    Abstract: The flocking motion control is concerned with managing the possible conflicts between local and team objectives of multi-agent systems. The overall control process guides the agents while monitoring the flock-cohesiveness and localization. The underlying mechanisms may degrade due to overlooking the unmodeled uncertainties associated with the flock dynamics and formation. On another side, the effi… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 7 pages, 3 figures

    Journal ref: IEEE International Symposium on Robotic and Sensors Environments (ROSE) 2021

  48. arXiv:2303.09946  [pdf, ps, other

    eess.SY cs.LG cs.MA cs.RO

    An Adaptive Fuzzy Reinforcement Learning Cooperative Approach for the Autonomous Control of Flock Systems

    Authors: Shuzheng Qu, Mohammed Abouheaf, Wail Gueaieb, Davide Spinello

    Abstract: The flock-guidance problem enjoys a challenging structure where multiple optimization objectives are solved simultaneously. This usually necessitates different control approaches to tackle various objectives, such as guidance, collision avoidance, and cohesion. The guidance schemes, in particular, have long suffered from complex tracking-error dynamics. Furthermore, techniques that are based on li… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 7 pages, 2 figures

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA) 2021

  49. arXiv:2303.07123  [pdf, other

    cs.CV cs.AI cs.LG

    Modality-Agnostic Debiasing for Single Domain Generalization

    Authors: Sanqing Qu, Yingwei Pan, Guang Chen, Ting Yao, Changjun Jiang, Tao Mei

    Abstract: Deep neural networks (DNNs) usually fail to generalize well to outside of distribution (OOD) data, especially in the extreme case of single domain generalization (single-DG) that transfers DNNs from single domain to multiple unseen domains. Existing single-DG techniques commonly devise various data-augmentation algorithms, and remould the multi-source domain generalization methodology to learn dom… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: To appear in CVPR-2023

  50. arXiv:2303.07110  [pdf, other

    cs.CV cs.AI cs.LG

    Upcycling Models under Domain and Category Shift

    Authors: Sanqing Qu, Tianpei Zou, Florian Roehrbein, Cewu Lu, Guang Chen, Dacheng Tao, Changjun Jiang

    Abstract: Deep neural networks (DNNs) often perform poorly in the presence of domain shift and category shift. How to upcycle DNNs and adapt them to the target task remains an important open problem. Unsupervised Domain Adaptation (UDA), especially recently proposed Source-free Domain Adaptation (SFDA), has become a promising technology to address this issue. Nevertheless, existing SFDA methods require that… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: To appear in CVPR 2023. The code has been made public