Skip to main content

Showing 51–100 of 1,927 results for author: Zhou, W

.
  1. arXiv:2505.10125  [pdf, other

    cs.LG

    Enhancing the Performance of Global Model by Improving the Adaptability of Local Models in Federated Learning

    Authors: Wujun Zhou, Shu Ding, ZeLin Li, Wei Wang

    Abstract: Federated learning enables the clients to collaboratively train a global model, which is aggregated from local models. Due to the heterogeneous data distributions over clients and data privacy in federated learning, it is difficult to train local models to achieve a well-performed global model. In this paper, we introduce the adaptability of local models, i.e., the average performance of local mod… ▽ More

    Submitted 18 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:2505.09590  [pdf, ps, other

    cs.IR

    Distance-aware Self-adaptive Graph Convolution for Fine-grained Hierarchical Recommendation

    Authors: Tao Huang, Yihong Chen, Wei Fan, Wei Zhou, Junhao Wen

    Abstract: Graph Convolutional Networks (GCNs) are widely used to improve recommendation accuracy and performance by effectively learning the representations of user and item nodes. However, two major challenges remain: (1) the lack of further optimization in the graph representation structure and (2) insufficient attention given to the varying contributions of different convolutional layers.This paper propo… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  3. arXiv:2505.07611  [pdf

    cs.CV

    Deep Learning Advances in Vision-Based Traffic Accident Anticipation: A Comprehensive Review of Methods,Datasets,and Future Directions

    Authors: Yi Zhang, Wenye Zhou, Ruonan Lin, Xin Yang, Hao Zheng

    Abstract: Traffic accident prediction and detection are critical for enhancing road safety,and vision-based traffic accident anticipation (Vision-TAA) has emerged as a promising approach in the era of deep learning.This paper reviews 147 recent studies,focusing on the application of supervised,unsupervised,and hybrid deep learning models for accident prediction,alongside the use of real-world and synthetic… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  4. arXiv:2505.06679  [pdf, other

    cs.CV

    Jailbreaking the Text-to-Video Generative Models

    Authors: Jiayang Liu, Siyuan Liang, Shiqian Zhao, Rongcheng Tu, Wenbo Zhou, Xiaochun Cao, Dacheng Tao, Siew Kei Lam

    Abstract: Text-to-video generative models have achieved significant progress, driven by the rapid advancements in diffusion models, with notable examples including Pika, Luma, Kling, and Sora. Despite their remarkable generation ability, their vulnerability to jailbreak attack, i.e. to generate unsafe content, including pornography, violence, and discrimination, raises serious safety concerns. Existing effo… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  5. arXiv:2505.04028  [pdf

    cs.SI physics.soc-ph

    Appeal and Scope of Misinformation Spread by AI Agents and Humans

    Authors: Lynnette Hui Xian Ng, Wenqi Zhou, Kathleen M. Carley

    Abstract: This work examines the influence of misinformation and the role of AI agents, called bots, on social network platforms. To quantify the impact of misinformation, it proposes two new metrics based on attributes of tweet engagement and user network position: Appeal, which measures the popularity of the tweet, and Scope, which measures the potential reach of the tweet. In addition, it analyzes 5.8 mi… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted to AMCIS 2025

  6. arXiv:2505.03912  [pdf, other

    cs.RO cs.CV

    OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation

    Authors: Can Cui, Pengxiang Ding, Wenxuan Song, Shuanghao Bai, Xinyang Tong, Zirui Ge, Runze Suo, Wanqi Zhou, Yang Liu, Bofang Jia, Han Zhao, Siteng Huang, Donglin Wang

    Abstract: Dual-system VLA (Vision-Language-Action) architectures have become a hot topic in embodied intelligence research, but there is a lack of sufficient open-source work for further performance analysis and optimization. To address this problem, this paper will summarize and compare the structural designs of existing dual-system architectures, and conduct systematic empirical evaluations on the core de… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  7. arXiv:2505.03574  [pdf, other

    cs.CR cs.AI

    LlamaFirewall: An open source guardrail system for building secure AI agents

    Authors: Sahana Chennabasappa, Cyrus Nikolaidis, Daniel Song, David Molnar, Stephanie Ding, Shengye Wan, Spencer Whitman, Lauren Deason, Nicholas Doucette, Abraham Montilla, Alekhya Gampa, Beto de Paola, Dominik Gabi, James Crnkovich, Jean-Christophe Testud, Kat He, Rashnil Chaturvedi, Wu Zhou, Joshua Saxe

    Abstract: Large language models (LLMs) have evolved from simple chatbots into autonomous agents capable of performing complex tasks such as editing production code, orchestrating workflows, and taking higher-stakes actions based on untrusted inputs like webpages and emails. These capabilities introduce new security risks that existing security measures, such as model fine-tuning or chatbot-focused guardrail… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  8. arXiv:2505.03494  [pdf

    cs.CV

    UPMAD-Net: A Brain Tumor Segmentation Network with Uncertainty Guidance and Adaptive Multimodal Feature Fusion

    Authors: Zhanyuan Jia, Ni Yao, Danyang Sun, Chuang Han, Yanting Li, Jiaofen Nan, Fubao Zhu, Chen Zhao, Weihua Zhou

    Abstract: Background: Brain tumor segmentation has a significant impact on the diagnosis and treatment of brain tumors. Accurate brain tumor segmentation remains challenging due to their irregular shapes, vague boundaries, and high variability. Objective: We propose a brain tumor segmentation method that combines deep learning with prior knowledge derived from a region-growing algorithm. Methods: The propos… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 21 pages, 7 figures

  9. arXiv:2505.01950  [pdf, other

    cs.CV cs.AI

    Segment Any RGB-Thermal Model with Language-aided Distillation

    Authors: Dong Xing, Xianxun Zhu, Wei Zhou, Qika Lin, Hang Yang, Yuqing Wang

    Abstract: The recent Segment Anything Model (SAM) demonstrates strong instance segmentation performance across various downstream tasks. However, SAM is trained solely on RGB data, limiting its direct applicability to RGB-thermal (RGB-T) semantic segmentation. Given that RGB-T provides a robust solution for scene understanding in adverse weather and lighting conditions, such as low light and overexposure, w… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: arXiv admin note: text overlap with arXiv:2412.04220 by other authors

  10. arXiv:2505.01189  [pdf, ps, other

    math.CA

    Principal Non-singularity of Fourier Matrices on $\mathbb Z_p \times \mathbb Z_q$ and $\mathbb Z_2^k \times \mathbb Z_q$

    Authors: Weiqi Zhou

    Abstract: Let $F_n$ be the $n\times n$ Fourier matrix (on cyclic groups $\mathbb Z_n$), a reknowned theorem of Chebotarëv asserts that all minors in $F_n$ for prime $n$ are non-zero. In this short note it is shown that (i) all principal minors in the Kronecker product $F_p\otimes F_q$ are non-vanishing (principal non-singularity) for distinct odd primes $p,q$ if $q$ is large enough and generates the multipl… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    MSC Class: 42A99; 15A15

  11. arXiv:2505.00304  [pdf, other

    stat.ML cs.LG stat.ME

    Reinforcement Learning with Continuous Actions Under Unmeasured Confounding

    Authors: Yuhan Li, Eugene Han, Yifan Hu, Wenzhuo Zhou, Zhengling Qi, Yifan Cui, Ruoqing Zhu

    Abstract: This paper addresses the challenge of offline policy learning in reinforcement learning with continuous action spaces when unmeasured confounders are present. While most existing research focuses on policy evaluation within partially observable Markov decision processes (POMDPs) and assumes discrete action spaces, we advance this field by establishing a novel identification result to enable the no… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  12. arXiv:2504.19638  [pdf, other

    cs.LG cs.ET

    LODAP: On-Device Incremental Learning Via Lightweight Operations and Data Pruning

    Authors: Biqing Duan, Qing Wang, Di Liu, Wei Zhou, Zhenli He, Shengfa Miao

    Abstract: Incremental learning that learns new classes over time after the model's deployment is becoming increasingly crucial, particularly for industrial edge systems, where it is difficult to communicate with a remote server to conduct computation-intensive learning. As more classes are expected to learn after their execution for edge devices. In this paper, we propose LODAP, a new on-device incremental… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  13. arXiv:2504.19478  [pdf, other

    cs.CV

    CasaGPT: Cuboid Arrangement and Scene Assembly for Interior Design

    Authors: Weitao Feng, Hang Zhou, Jing Liao, Li Cheng, Wenbo Zhou

    Abstract: We present a novel approach for indoor scene synthesis, which learns to arrange decomposed cuboid primitives to represent 3D objects within a scene. Unlike conventional methods that use bounding boxes to determine the placement and scale of 3D objects, our approach leverages cuboids as a straightforward yet highly effective alternative for modeling objects. This allows for compact scene generation… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  14. arXiv:2504.19300  [pdf

    cs.CV

    Myocardial Region-guided Feature Aggregation Net for Automatic Coronary artery Segmentation and Stenosis Assessment using Coronary Computed Tomography Angiography

    Authors: Ni Yao, Xiangyu Liu, Danyang Sun, Chuang Han, Yanting Li, Jiaofen Nan, Chengyang Li, Fubao Zhu, Weihua Zhou, Chen Zhao

    Abstract: Coronary artery disease (CAD) remains a leading cause of mortality worldwide, requiring accurate segmentation and stenosis detection using Coronary Computed Tomography angiography (CCTA). Existing methods struggle with challenges such as low contrast, morphological variability and small vessel segmentation. To address these limitations, we propose the Myocardial Region-guided Feature Aggregation N… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: 31 pages, 12 figures

  15. arXiv:2504.18600  [pdf, other

    q-fin.CP cs.AI cs.CE

    QuantBench: Benchmarking AI Methods for Quantitative Investment

    Authors: Saizhuo Wang, Hao Kong, Jiadong Guo, Fengrui Hua, Yiyan Qi, Wanyun Zhou, Jiahao Zheng, Xinyu Wang, Lionel M. Ni, Jian Guo

    Abstract: The field of artificial intelligence (AI) in quantitative investment has seen significant advancements, yet it lacks a standardized benchmark aligned with industry practices. This gap hinders research progress and limits the practical application of academic innovations. We present QuantBench, an industrial-grade benchmark platform designed to address this critical need. QuantBench offers three ke… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  16. Hemispheric Distribution of Solar Active Regions During Solar Cycles 23-25

    Authors: Yuxia Liu, Tingting Xu, Miao Wan, Linhua Deng, Xinhua Zhao, Shiyang Qi, Nanbin Xiang, Weihong Zhou

    Abstract: Solar active regions (ARs) are crucial for understanding the long-term evolution of solar activities and predicting eruptive phenomena, including solar flares and coronal mass ejections. However, the cycle-dependent properties in the north-south asymmetry of ARs have not been fully understood. In this study, we investigate the hemispheric distribution of ARs from Carrington Rotation 1909 to 2278 (… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  17. arXiv:2504.17263  [pdf, other

    cs.CV cs.CC

    Precision Neural Network Quantization via Learnable Adaptive Modules

    Authors: Wenqiang Zhou, Zhendong Yu, Xinyu Liu, Jiaming Yang, Rong Xiao, Tao Wang, Chenwei Tang, Jiancheng Lv

    Abstract: Quantization Aware Training (QAT) is a neural network quantization technique that compresses model size and improves operational efficiency while effectively maintaining model performance. The paradigm of QAT is to introduce fake quantization operators during the training process, allowing the model to autonomously compensate for information loss caused by quantization. Making quantization paramet… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  18. arXiv:2504.16601  [pdf, other

    cs.CL cs.AI

    Comparing Large Language Models and Traditional Machine Translation Tools for Translating Medical Consultation Summaries: A Pilot Study

    Authors: Andy Li, Wei Zhou, Rashina Hoda, Chris Bain, Peter Poon

    Abstract: This study evaluates how well large language models (LLMs) and traditional machine translation (MT) tools translate medical consultation summaries from English into Arabic, Chinese, and Vietnamese. It assesses both patient, friendly and clinician, focused texts using standard automated metrics. Results showed that traditional MT tools generally performed better, especially for complex texts, while… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: 8 pages, 2 tables and 1 Figure

  19. arXiv:2504.16552  [pdf, ps, other

    cs.DC

    DTVM: Revolutionizing Smart Contract Execution with Determinism and Compatibility

    Authors: Wei Zhou, Xiong Xu, Changzheng Wei, Ying Yan, Wei Tang, Zhihao Chen, Xuebing Huang, Wengang Chen, Jie Zhang, Yang Chen, Xiaofu Zheng, Hanghang Wu, Shenglong Chen, Ermei Wang, Xiangfei Chen, Yang Yu, Meng Wu, Tao Zhu, Liwei Yuan, Feng Yu, Alex Zhang, Wei Wang, Ji Luo, Zhengyu He, Wenbiao Zhao

    Abstract: We introduce the DeTerministic Virtual Machine (DTVM) Stack, a next-generation smart contract execution framework designed to address critical performance, determinism, and ecosystem compatibility challenges in blockchain networks. Building upon WebAssembly (Wasm) while maintaining full Ethereum Virtual Machine (EVM) ABI compatibility, DTVM introduces a Deterministic Middle Intermediate Representa… ▽ More

    Submitted 9 June, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

  20. arXiv:2504.16511  [pdf, other

    cs.CL

    QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining

    Authors: Fengze Liu, Weidong Zhou, Binbin Liu, Zhimiao Yu, Yifan Zhang, Haobin Lin, Yifeng Yu, Bingni Zhang, Xiaohuan Zhou, Taifeng Wang, Yong Cao

    Abstract: Quality and diversity are two critical metrics for the training data of large language models (LLMs), positively impacting performance. Existing studies often optimize these metrics separately, typically by first applying quality filtering and then adjusting data proportions. However, these approaches overlook the inherent trade-off between quality and diversity, necessitating their joint consider… ▽ More

    Submitted 25 April, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

  21. arXiv:2504.16405  [pdf, other

    cs.MM

    EEmo-Bench: A Benchmark for Multi-modal Large Language Models on Image Evoked Emotion Assessment

    Authors: Lancheng Gao, Ziheng Jia, Yunhao Zeng, Wei Sun, Yiming Zhang, Wei Zhou, Guangtao Zhai, Xiongkuo Min

    Abstract: The furnishing of multi-modal large language models (MLLMs) has led to the emergence of numerous benchmark studies, particularly those evaluating their perception and understanding capabilities. Among these, understanding image-evoked emotions aims to enhance MLLMs' empathy, with significant applications such as human-machine interaction and advertising recommendations. However, current evaluation… ▽ More

    Submitted 7 May, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

  22. arXiv:2504.15710  [pdf

    physics.chem-ph

    Prediction of CO2 reduction reaction intermediates and products on transition metal-doped r-GeSe monolayers:A combined DFT and machine learning approach

    Authors: Xuxin Kang, Wenjing Zhou, Ziyuan Li, Zhaoqin Chu, Hanqin Yin, Shan Gao, Aijun Du, Xiangmei Duan

    Abstract: The electrocatalytic CO2 reduction reaction (CO2RR) is a complex multi-proton-electron transfer process that generates a vast network of reaction intermediates. Accurate prediction of free energy changes (G) of these intermediates and products is essential for evaluating catalytic performance. We combined density functional theory (DFT) and machine learning (ML) to screen 25 single-atom catalysts… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  23. arXiv:2504.15384  [pdf

    cs.CV

    ICGM-FRAX: Iterative Cross Graph Matching for Hip Fracture Risk Assessment using Dual-energy X-ray Absorptiometry Images

    Authors: Chen Zhao, Anjum Shaik, Joyce H. Keyak, Nancy E. Lane, Jeffrey D. Deng, Kuan-Jui Su, Qiuying Sha, Hui Shen, Hong-Wen Deng, Weihua Zhou

    Abstract: Hip fractures represent a major health concern, particularly among the elderly, often leading decreased mobility and increased mortality. Early and accurate detection of at risk individuals is crucial for effective intervention. In this study, we propose Iterative Cross Graph Matching for Hip Fracture Risk Assessment (ICGM-FRAX), a novel approach for predicting hip fractures using Dual-energy X-ra… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 23 pages, 4 figures

  24. arXiv:2504.15279  [pdf, other

    cs.CV

    VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

    Authors: Weiye Xu, Jiahao Wang, Weiyun Wang, Zhe Chen, Wengang Zhou, Aijun Yang, Lewei Lu, Houqiang Li, Xiaohua Wang, Xizhou Zhu, Wenhai Wang, Jifeng Dai, Jinguo Zhu

    Abstract: Visual reasoning is a core component of human intelligence and a critical capability for advanced multimodal models. Yet current reasoning evaluations of multimodal large language models (MLLMs) often rely on text descriptions and allow language-based reasoning shortcuts, failing to measure genuine vision-centric reasoning. To address this, we introduce VisuLogic: a benchmark of 1,000 human-verifi… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Code, data, and baselines are available at https://visulogic-benchmark.github.io/VisuLogic

  25. arXiv:2504.15146  [pdf, other

    cs.AI

    Behavioral Universe Network (BUN): A Behavioral Information-Based Framework for Complex Systems

    Authors: Wei Zhou, Ailiya Borjigin, Cong He

    Abstract: Modern digital ecosystems feature complex, dynamic interactions among autonomous entities across diverse domains. Traditional models often separate agents and objects, lacking a unified foundation to capture their interactive behaviors. This paper introduces the Behavioral Universe Network (BUN), a theoretical framework grounded in the Agent-Interaction-Behavior (AIB) formalism. BUN treats subject… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 17 pages, 1 figure

  26. arXiv:2504.14267  [pdf, other

    cs.CV

    Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

    Authors: Li Yu, Xuanzhe Sun, Wei Zhou, Moncef Gabbouj

    Abstract: Video saliency prediction is crucial for downstream applications, such as video compression and human-computer interaction. With the flourishing of multimodal learning, researchers started to explore multimodal video saliency prediction, including audio-visual and text-visual approaches. Auditory cues guide the gaze of viewers to sound sources, while textual cues provide semantic guidance for unde… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

  27. arXiv:2504.12711  [pdf, other

    cs.CV cs.AI eess.IV

    NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

    Authors: Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou , et al. (112 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images. This challenge received a wide range of impressive solutions, which are developed and evaluated using our collected real-world Raindrop Clarity dataset. Unlike existing deraining datasets, our Raindrop Clarity dataset is more diverse and challenging in degradation types and contents, which includ… ▽ More

    Submitted 19 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of CVPR NTIRE 2025; 26 pages; Methods from 32 teams

  28. arXiv:2504.12328  [pdf, other

    cs.CL cs.AI

    A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

    Authors: Jialun Zhong, Wei Shen, Yanzeng Li, Songyang Gao, Hua Lu, Yicheng Chen, Yang Zhang, Wei Zhou, Jinjie Gu, Lei Zou

    Abstract: Reward Model (RM) has demonstrated impressive potential for enhancing Large Language Models (LLM), as RM can serve as a proxy for human preferences, providing signals to guide LLMs' behavior in various tasks. In this paper, we provide a comprehensive overview of relevant research, exploring RMs from the perspectives of preference collection, reward modeling, and usage. Next, we introduce the appli… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  29. arXiv:2504.12276  [pdf, other

    cs.CV

    The Tenth NTIRE 2025 Image Denoising Challenge Report

    Authors: Lei Sun, Hang Guo, Bin Ren, Luc Van Gool, Radu Timofte, Yawei Li, Xiangyu Kong, Hyunhee Park, Xiaoxuan Yu, Suejin Han, Hakjae Jeon, Jia Li, Hyung-Ju Chun, Donghun Ryou, Inju Ha, Bohyung Han, Jingyu Ma, Zhijuan Huang, Huiyuan Fu, Hongyuan Yu, Boqi Zhang, Jiawei Shi, Heng Zhang, Huadong Ma, Deepak Kumar Tyagi , et al. (69 additional authors not shown)

    Abstract: This paper presents an overview of the NTIRE 2025 Image Denoising Challenge (σ = 50), highlighting the proposed methodologies and corresponding results. The primary objective is to develop a network architecture capable of achieving high-quality denoising performance, quantitatively evaluated using PSNR, without constraints on computational complexity or model size. The task assumes independent ad… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

  30. arXiv:2504.11854  [pdf, ps, other

    cs.GT

    Less-excludable Mechanism for DAOs in Public Good Auctions

    Authors: Jing Chen, Wentao Zhou

    Abstract: With the rise of smart contracts, decentralized autonomous organizations (DAOs) have emerged in public good auctions, allowing "small" bidders to gather together and enlarge their influence in high-valued auctions. However, models and mechanisms in the existing research literature do not guarantee non-excludability, which is a main property of public goods. As such, some members of the winning DAO… ▽ More

    Submitted 18 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

  31. arXiv:2504.11733  [pdf, other

    cs.CV

    DVLTA-VQA: Decoupled Vision-Language Modeling with Text-Guided Adaptation for Blind Video Quality Assessment

    Authors: Li Yu, Situo Wang, Wei Zhou, Moncef Gabbouj

    Abstract: Inspired by the dual-stream theory of the human visual system (HVS) - where the ventral stream is responsible for object recognition and detail analysis, while the dorsal stream focuses on spatial relationships and motion perception - an increasing number of video quality assessment (VQA) works built upon this framework are proposed. Recent advancements in large multi-modal models, notably Contras… ▽ More

    Submitted 19 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

  32. arXiv:2504.10078  [pdf, other

    cs.CE

    Unleashing Expert Opinion from Social Media for Stock Prediction

    Authors: Wanyun Zhou, Saizhuo Wang, Xiang Li, Yiyan Qi, Jian Guo, Xiaowen Chu

    Abstract: While stock prediction task traditionally relies on volume-price and fundamental data to predict the return ratio or price movement trend, sentiment factors derived from social media platforms such as StockTwits offer a complementary and useful source of real-time market information. However, we find that most social media posts, along with the public sentiment they reflect, provide limited value… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  33. arXiv:2504.09361  [pdf, other

    cs.CV

    PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking

    Authors: Jiahuan Long, Tingsong Jiang, Wen Yao, Shuai Jia, Weijia Zhang, Weien Zhou, Chao Ma, Xiaoqian Chen

    Abstract: Tracking multiple objects in a continuous video stream is crucial for many computer vision tasks. It involves detecting and associating objects with their respective identities across successive frames. Despite significant progress made in multiple object tracking (MOT), recent studies have revealed the vulnerability of existing MOT methods to adversarial attacks. Nevertheless, all of these attack… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

    Comments: Accepted by ECCV 2024

  34. arXiv:2504.08857  [pdf, other

    econ.GN

    Structural robustness of the international food supply network under external shocks and its determinants

    Authors: Han-Yu Zhu, Yin-Ting Zhang, Wen-Jie Xie, Wei-Xing Zhou

    Abstract: The stability of the global food supply network is critical for ensuring food security. This study constructs an aggregated international food supply network based on the trade data of four staple crops and evaluates its structural robustness through network integrity under accumulating external shocks. Network integrity is typically quantified in network science by the relative size of the larges… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

  35. arXiv:2504.05535  [pdf, other

    cs.CL

    COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

    Authors: M-A-P Team, Siwei Wu, Jincheng Ren, Xinrun Du, Shuyue Guo, Xingwei Qu, Yiming Liang, Jie Liu, Yunwen Li, Tianyu Zheng, Boyu Feng, Huaqing Yuan, Zenith Wang, Jiaheng Liu, Wenhao Huang, Chenglin Cai, Haoran Que, Jian Yang, Yuelin Bai, Zekun Moore Wang, Zhouliang Yu, Qunshu Lin, Ding Pan, Yuchen Jiang, Tiannan Wang , et al. (7 additional authors not shown)

    Abstract: Aligning large language models (LLMs) with human preferences has achieved remarkable success. However, existing Chinese preference datasets are limited by small scale, narrow domain coverage, and lack of rigorous data validation. Additionally, the reliance on human annotators for instruction and response labeling significantly constrains the scalability of human preference datasets. To address the… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  36. The Mini-SiTian Array: first-two-year operation

    Authors: Min He, Hong Wu, Liang Ge, Jian-feng Tian, Zheng Wang, Hai-yang Mu, Yu Zhang, Yang Huang, Jie Zheng, Zhou Fan, Zheng-yang Li, Hong-hui Gu, Heng-geng Han, Kai Xiao, Zhi-rui Li, Jun-jie Jin, Bei-chuan Wang, Jun Ma, Jin-hang Zou, Ying Wu, Jiu-peng Guo, Li-guo Fang, Zhi-gang Hou, Bo-wen Zhang, Yun-fei Xu , et al. (48 additional authors not shown)

    Abstract: The SiTian project, designed to utilize 60 telescopes distributed across multiple sites in China, is a next-generation time-domain survey initiative. As a pathfinder for the SiTian project, the Mini-SiTian (MST) has been proposed and implemented to test the SiTian's brain and data pipeline, and to evaluate the feasibility of its technology and science cases. Mounted at the Xinglong Observatory, th… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 10 pages, 11 figures, Accepted for publication in a special issue of Research in Astronomy and Astrophysics on the Mini-SiTian Array

  37. arXiv:2504.01025  [pdf

    eess.IV cs.AI cs.CV physics.med-ph

    Diagnosis of Pulmonary Hypertension by Integrating Multimodal Data with a Hybrid Graph Convolutional and Transformer Network

    Authors: Fubao Zhu, Yang Zhang, Gengmin Liang, Jiaofen Nan, Yanting Li, Chuang Han, Danyang Sun, Zhiguo Wang, Chen Zhao, Wenxuan Zhou, Jian He, Yi Xu, Iokfai Cheang, Xu Zhu, Yanli Zhou, Weihua Zhou

    Abstract: Early and accurate diagnosis of pulmonary hypertension (PH) is essential for optimal patient management. Differentiating between pre-capillary and post-capillary PH is critical for guiding treatment decisions. This study develops and validates a deep learning-based diagnostic model for PH, designed to classify patients as non-PH, pre-capillary PH, or post-capillary PH. This retrospective study ana… ▽ More

    Submitted 27 March, 2025; originally announced April 2025.

    Comments: 23 pages, 8 figures, 4 tables

  38. arXiv:2504.00882  [pdf, other

    cs.DB cs.AI cs.CL cs.IR cs.LG

    CrackSQL: A Hybrid SQL Dialect Translation System Powered by Large Language Models

    Authors: Wei Zhou, Yuyang Gao, Xuanhe Zhou, Guoliang Li

    Abstract: Dialect translation plays a key role in enabling seamless interaction across heterogeneous database systems. However, translating SQL queries between different dialects (e.g., from PostgreSQL to MySQL) remains a challenging task due to syntactic discrepancies and subtle semantic variations. Existing approaches including manual rewriting, rule-based systems, and large language model (LLM)-based tec… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: Extension of our SIGMOD 2025 paper. Please refer to source code available at: https://github.com/weAIDB/CrackSQL

  39. arXiv:2504.00786  [pdf, other

    cs.DB cs.LG

    FeatInsight: An Online ML Feature Management System on 4Paradigm Sage-Studio Platform

    Authors: Xin Tong, Xuanhe Zhou, Bingsheng He, Guoliang Li, Zirui Tang, Wei Zhou, Fan Wu, Mian Lu, Yuqiang Chen

    Abstract: Feature management is essential for many online machine learning applications and can often become the performance bottleneck (e.g., taking up to 70% of the overall latency in sales prediction service). Improper feature configurations (e.g., introducing too many irrelevant features) can severely undermine the model's generalization capabilities. However, managing online ML features is challenging… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  40. arXiv:2503.21837  [pdf

    physics.bio-ph physics.med-ph

    Impact of Oxygen on DNA Damage Distribution in 3D Genome and Its Correlation to Oxygen Enhancement Ratio under High LET Irradiation

    Authors: Ankang Hu, Wanyi Zhou, Xiyu Luo, Rui Qiu, Junli Li

    Abstract: The variation of the oxygen enhancement ratio (OER) across different values of Linear Energy Transfer (LET) currently lacks a comprehensive mechanistic interpretation and a mechanistic model. Our earlier research revealed a significant correlation between the distribution of double-strand breaks (DSBs) within the 3D genome and radiation-induced cell death, which offers valuable insights into the o… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: 14 pages, 6 figures

  41. arXiv:2503.21446  [pdf

    cond-mat.mtrl-sci

    No-drift phase-change memory alloy for neuromorphic computing

    Authors: Xiaozhe Wang, Ruobing Wang, Suyang Sun, Ding Xu, Chao Nie, Zhou Zhou, Chenyu Wen, Junying Zhang, Ruixuan Chu, Xueyang Shen, Wen Zhou, Zhitang Song, Jiang-Jing Wang, En Ma, Wei Zhang

    Abstract: Spontaneous structural relaxation is intrinsic to glassy materials due to their metastable nature. For phase-change materials (PCMs), the resultant temporal change in electrical resistance seriously hamper in-memory computing (IMC) applications. Here, we report an ab-initio-calculation-informed design of amorphous PCM composed of robust "molecule-like" motifs with minimal Peierls distortion, depri… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  42. arXiv:2503.21284  [pdf, other

    cs.CV cs.AI

    Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression

    Authors: Hanyue Tu, Siqi Wu, Li Li, Wengang Zhou, Houqiang Li

    Abstract: Autoencoder-based structures have dominated recent learned image compression methods. However, the inherent information loss associated with autoencoders limits their rate-distortion performance at high bit rates and restricts their flexibility of rate adaptation. In this paper, we present a variable-rate image compression model based on invertible transform to overcome these limitations. Specific… ▽ More

    Submitted 27 March, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

    Comments: Accepted for publication in IEEE Transactions on Multimedia 2025

  43. arXiv:2503.20628  [pdf, ps, other

    math.AP

    Carleman estimate for full-discrete approximations of the complex Ginzburg-Landau equation with dynamic boundary conditions and applications to controllability

    Authors: Xu Zhu, Wenwen Zhou, Bin Wu

    Abstract: In this paper, we investigate Carleman estimate and controllability result for the fully-discrete approximations of a one-dimensional Ginzburg-Landau equation with dynamic boundary conditions. We first establish a new discrete Carleman estimate for the corresponding adjoint system. Based on this Carleman estimate, we obtain a relaxed observability inequality for the adjoint system, and then a cont… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  44. arXiv:2503.20314  [pdf, other

    cs.CV

    Wan: Open and Advanced Large-Scale Video Generative Models

    Authors: Team Wan, Ang Wang, Baole Ai, Bin Wen, Chaojie Mao, Chen-Wei Xie, Di Chen, Feiwu Yu, Haiming Zhao, Jianxiao Yang, Jianyuan Zeng, Jiayu Wang, Jingfeng Zhang, Jingren Zhou, Jinkai Wang, Jixuan Chen, Kai Zhu, Kang Zhao, Keyu Yan, Lianghua Huang, Mengyang Feng, Ningyi Zhang, Pandeng Li, Pingyu Wu, Ruihang Chu , et al. (37 additional authors not shown)

    Abstract: This report presents Wan, a comprehensive and open suite of video foundation models designed to push the boundaries of video generation. Built upon the mainstream diffusion transformer paradigm, Wan achieves significant advancements in generative capabilities through a series of innovations, including our novel VAE, scalable pre-training strategies, large-scale data curation, and automated evaluat… ▽ More

    Submitted 18 April, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

    Comments: 60 pages, 33 figures

  45. arXiv:2503.18843  [pdf, other

    physics.plasm-ph physics.acc-ph physics.optics quant-ph

    Experimental Evidence of Vortex $γ$ Photons in All-Optical Inverse Compton Scattering

    Authors: Mingxuan Wei, Siyu Chen, Yu Wang, Xichen Hu, Mingyang Zhu, Hao Hu, Pei-Lun He, Weijun Zhou, Jiao Jia, Li Lu, Boyuan Li, Feng Liu, Min Chen, Liming Chen, Jian-Xing Li, Wenchao Yan, Jie Zhang

    Abstract: Vortex $γ$ photons carrying orbital angular momenta (OAM) hold great potential for various applications. However, their generation remains a great challenge. Here, we successfully generate sub-MeV vortex $γ$ photons via all-optical inverse Compton scattering of relativistic electrons colliding with a sub-relativistic Laguerre-Gaussian laser. In principle, directly measuring the OAM of $γ$ photons… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 8 pages, 4 figures

  46. arXiv:2503.18672  [pdf, other

    cs.CV

    Feature Calibration enhanced Parameter Synthesis for CLIP-based Class-incremental Learning

    Authors: Juncen Guo, Yang Liu, Xiaoguang Zhu, Lianlong Sun, Liangyu Teng, Jingyi Wu, Di Li, Wei Zhou, Liang Song

    Abstract: Class-Incremental Learning (CIL) enables models to continuously learn new class knowledge while retaining previous classes, facilitating adaptation and evolution in dynamic, real-world environments. Traditional CIL methods primarily rely on visual features, which limits their effectiveness in complex, multimodal scenarios. In contrast, VLMs show promising potential for enhancing CIL by leveraging… ▽ More

    Submitted 17 April, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

  47. arXiv:2503.18034  [pdf, ps, other

    cs.CV cs.CL

    Expanding the Boundaries of Vision Prior Knowledge in Multi-modal Large Language Models

    Authors: Qiao Liang, Yanjiang Liu, Weixiang Zhou, Ben He, Yaojie Lu, Hongyu Lin, Jia Zheng, Xianpei Han, Le Sun, Yingfei Sun

    Abstract: Does the prior knowledge of the vision encoder constrain the capability boundary of Multi-modal Large Language Models (MLLMs)? While most existing research treats MLLMs as unified systems optimized through end-to-end training, the impact of vision encoder's prior knowledge is seldom investigated. In this work, we introduce a novel metric, $Rank_e$, to quantify the effect of prior knowledge of the… ▽ More

    Submitted 30 May, 2025; v1 submitted 23 March, 2025; originally announced March 2025.

  48. arXiv:2503.18004  [pdf, other

    physics.soc-ph cs.IT

    Dynamic structural resilience of international staple food trade networks

    Authors: Si-Yao Wei, Wei-Xing Zhou

    Abstract: It is important to maintain the resilient international food trade network for food security. We have constructed the international trade networks of maize, rice, soybean, and wheat based on bilateral flows data between economies. Drawing on information theory, we have measured their dynamic resilience based on efficiency and redundancy during 1986 to 2022. We have also investigated the impact of… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

  49. arXiv:2503.17407  [pdf, other

    cs.CL cs.LG

    A Comprehensive Survey on Long Context Language Modeling

    Authors: Jiaheng Liu, Dawei Zhu, Zhiqi Bai, Yancheng He, Huanxuan Liao, Haoran Que, Zekun Wang, Chenchen Zhang, Ge Zhang, Jiebin Zhang, Yuanxing Zhang, Zhuo Chen, Hangyu Guo, Shilong Li, Ziqiang Liu, Yong Shan, Yifan Song, Jiayi Tian, Wenhao Wu, Zhejian Zhou, Ruijie Zhu, Junlan Feng, Yang Gao, Shizhu He, Zhoujun Li , et al. (12 additional authors not shown)

    Abstract: Efficient processing of long contexts has been a persistent pursuit in Natural Language Processing. With the growing number of long documents, dialogues, and other textual data, it is important to develop Long Context Language Models (LCLMs) that can process and analyze extensive inputs in an effective and efficient way. In this paper, we present a comprehensive survey on recent advances in long-c… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  50. arXiv:2503.16937  [pdf, other

    astro-ph.GA

    External tides: an important driver of velocity dispersion in molecular clouds

    Authors: J. W. Zhou

    Abstract: Using the 3D density distribution derived from the 3D dust map of the solar neighborhood, the gravitational potential is obtained by solving the Poisson equation, from which the tidal tensor is computed. In the optimal decomposition, the external tidal tensor follows the same formalism as that of a point mass. The average tidal strength of the clouds, derived from both tidal tensor analysis and pi… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: Accepted for publication in A&A Letter