Showing 1–50 of 181 results for author: Wen, Q

  1. arXiv:2505.00393  [pdf, other]

    cs.DB cs.SI

    S3AND: Efficient Subgraph Similarity Search Under Aggregated Neighbor Difference Semantics (Technical Report)

    Authors: Qi Wen, Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: For the past decades, the \textit{subgraph similarity search} over a large-scale data graph has become increasingly important and crucial in many real-world applications, such as social network analysis, bioinformatics network analytics, knowledge graph discovery, and many others. While previous works on subgraph similarity search used various graph similarity metrics such as the graph isomorphism…

    Submitted 1 May, 2025; originally announced May 2025.

  2. arXiv:2504.15585  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

    Authors: Kun Wang, Guibin Zhang, Zhenhong Zhou, Jiahao Wu, Miao Yu, Shiqian Zhao, Chenlong Yin, Jinhu Fu, Yibo Yan, Hanjun Luo, Liang Lin, Zhihao Xu, Haolang Lu, Xinye Cao, Xinyun Zhou, Weifei Jin, Fanci Meng, Junyuan Mao, Hao Wu, Minghe Wang, Fan Zhang, Junfeng Fang, Chengwei Liu, Yifan Zhang, Qiankun Li, et al. (57 additional authors not shown)

    Abstract: The remarkable success of Large Language Models (LLMs) has illuminated a promising pathway toward achieving Artificial General Intelligence for both academic and industrial communities, owing to their unprecedented performance across various applications. As LLMs continue to gain prominence in both research and commercial domains, their security and safety implications have become a growing concer…

    Submitted 22 April, 2025; originally announced April 2025.

  3. arXiv:2504.03965  [pdf, other]

    cs.IR

    Automating Personalization: Prompt Optimization for Recommendation Reranking

    Authors: Chen Wang, Mingdai Yang, Zhiwei Liu, Pan Li, Linsey Pang, Qingsong Wen, Philip Yu

    Abstract: Modern recommender systems increasingly leverage large language models (LLMs) for reranking to improve personalization. However, existing approaches face two key limitations: (1) heavy reliance on manually crafted prompts that are difficult to scale, and (2) inadequate handling of unstructured item metadata that complicates preference inference. We present AGP (Auto-Guided Prompt Refinement), a no…

    Submitted 4 April, 2025; originally announced April 2025.

  4. arXiv:2504.00032  [pdf, other]

    cs.CV cs.CG cs.RO

    Skeletonization Quality Evaluation: Geometric Metrics for Point Cloud Analysis in Robotics

    Authors: Qingmeng Wen, Yu-Kun Lai, Ze Ji, Seyed Amir Tafrishi

    Abstract: Skeletonization is a powerful tool for shape analysis, rooted in the inherent instinct to understand an object's morphology. It has found applications across various domains, including robotics. Although skeletonization algorithms have been studied in recent years, their performance is rarely quantified with detailed numerical evaluations. This work focuses on defining and quantifying geometric pr…

    Submitted 29 March, 2025; originally announced April 2025.

    Comments: 15 pages, 12 figures, under-review

  5. arXiv:2503.20701  [pdf, other]

    cs.CL

    UniEDU: A Unified Language and Vision Assistant for Education Applications

    Authors: Zhendong Chu, Jian Xie, Shen Wang, Zichao Wang, Qingsong Wen

    Abstract: Education materials for K-12 students often consist of multiple modalities, such as text and images, posing challenges for models to fully understand nuanced information in these materials. In this paper, we propose a unified language and vision assistant UniEDU designed for various educational applications, including knowledge recommendation, knowledge tracing, time cost prediction, and user answ…

    Submitted 26 March, 2025; originally announced March 2025.

  6. arXiv:2503.18132  [pdf, other]

    cs.CL

    MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection

    Authors: Yibo Yan, Shen Wang, Jiahao Huo, Philip S. Yu, Xuming Hu, Qingsong Wen

    Abstract: Mathematical error detection in educational settings presents a significant challenge for Multimodal Large Language Models (MLLMs), requiring a sophisticated understanding of both visual and textual mathematical content along with complex reasoning capabilities. Though effective in mathematical problem-solving, MLLMs often struggle with the nuanced task of identifying and categorizing student erro…

    Submitted 23 March, 2025; originally announced March 2025.

    Comments: Work In Progress

  7. arXiv:2503.14504  [pdf, ps, other]

    cs.CV

    Aligning Multimodal LLM with Human Preference: A Survey

    Authors: Tao Yu, Yi-Fan Zhang, Chaoyou Fu, Junkang Wu, Jinda Lu, Kun Wang, Xingyu Lu, Yunhang Shen, Guibin Zhang, Dingjie Song, Yibo Yan, Tianlong Xu, Qingsong Wen, Zhang Zhang, Yan Huang, Liang Wang, Tieniu Tan

    Abstract: Large language models (LLMs) can handle a wide variety of general tasks with simple prompts, without the need for task-specific training. Multimodal Large Language Models (MLLMs), built upon LLMs, have demonstrated impressive potential in tackling complex tasks involving visual, auditory, and textual data. However, critical issues related to truthfulness, safety, o1-like reasoning, and alignment w…

    Submitted 23 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: Project page: https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/tree/Alignment

  8. arXiv:2503.13502  [pdf, other]

    cs.DB cs.LG

    Foundation Models for Spatio-Temporal Data Science: A Tutorial and Survey

    Authors: Yuxuan Liang, Haomin Wen, Yutong Xia, Ming Jin, Bin Yang, Flora Salim, Qingsong Wen, Shirui Pan, Gao Cong

    Abstract: Spatio-Temporal (ST) data science, which includes sensing, managing, and mining large-scale data across space and time, is fundamental to understanding complex systems in domains such as urban computing, climate science, and intelligent transportation. Traditional deep learning approaches have significantly advanced this field, particularly in the stage of ST data mining. However, these models rem…

    Submitted 12 March, 2025; originally announced March 2025.

  9. arXiv:2503.11835  [pdf, other]

    cs.LG cs.CV

    How Can Time Series Analysis Benefit From Multiple Modalities? A Survey and Outlook

    Authors: Haoxin Liu, Harshavardhan Kamarthi, Zhiyuan Zhao, Shangqing Xu, Shiyu Wang, Qingsong Wen, Tom Hartvigsen, Fei Wang, B. Aditya Prakash

    Abstract: Time series analysis (TSA) is a longstanding research topic in the data mining community and has wide real-world significance. Compared to "richer" modalities such as language and vision, which have recently experienced explosive development and are densely connected, the time-series modality remains relatively underexplored and isolated. We notice that many recent TSA works have formed a new rese…

    Submitted 27 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: Github Repo: https://github.com/AdityaLab/MM4TSA

  10. arXiv:2503.11733  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC

    LLM Agents for Education: Advances and Applications

    Authors: Zhendong Chu, Shen Wang, Jian Xie, Tinghui Zhu, Yibo Yan, Jinheng Ye, Aoxiao Zhong, Xuming Hu, Jing Liang, Philip S. Yu, Qingsong Wen

    Abstract: Large Language Model (LLM) agents have demonstrated remarkable capabilities in automating tasks and driving innovation across diverse educational applications. In this survey, we provide a systematic review of state-of-the-art research on LLM agents in education, categorizing them into two broad classes: (1) \emph{Pedagogical Agents}, which focus on automating complex pedagogical tasks to support…

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 17 pages

  11. arXiv:2503.11411  [pdf, other]

    cs.LG

    Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models

    Authors: Xu Liu, Taha Aksu, Juncheng Liu, Qingsong Wen, Yuxuan Liang, Caiming Xiong, Silvio Savarese, Doyen Sahoo, Junnan Li, Chenghao Liu

    Abstract: Time series analysis is crucial for understanding dynamics of complex systems. Recent advances in foundation models have led to task-agnostic Time Series Foundation Models (TSFMs) and Large Language Model-based Time Series Models (TSLLMs), enabling generalized learning and integrating contextual information. However, their success depends on large, diverse, and high-quality datasets, which are cha…

    Submitted 14 March, 2025; originally announced March 2025.

  12. arXiv:2503.09648  [pdf, other]

    cs.MA cs.CY

    A Survey on Trustworthy LLM Agents: Threats and Countermeasures

    Authors: Miao Yu, Fanci Meng, Xinyun Zhou, Shilong Wang, Junyuan Mao, Linsey Pang, Tianlong Chen, Kun Wang, Xinfeng Li, Yongfeng Zhang, Bo An, Qingsong Wen

    Abstract: With the rapid evolution of Large Language Models (LLMs), LLM-based agents and Multi-agent Systems (MAS) have significantly expanded the capabilities of LLM ecosystems. This evolution stems from empowering LLMs with additional modules such as memory, tools, environment, and even other agents. However, this advancement has also introduced more complex issues of trustworthiness, which previous resea…

    Submitted 12 March, 2025; originally announced March 2025.

  13. arXiv:2503.06072  [pdf, other]

    cs.CL cs.AI

    A Survey on Post-training of Large Language Models

    Authors: Guiyao Tie, Zeli Zhao, Dingjie Song, Fuyang Wei, Rong Zhou, Yurou Dai, Wen Yin, Zhejian Yang, Jiangyue Yan, Yao Su, Zhenhan Dai, Yifeng Xie, Yihan Cao, Lichao Sun, Pan Zhou, Lifang He, Hechang Chen, Yu Zhang, Qingsong Wen, Tianming Liu, Neil Zhenqiang Gong, Jiliang Tang, Caiming Xiong, Heng Ji, Philip S. Yu, et al. (1 additional author not shown)

    Abstract: The emergence of Large Language Models (LLMs) has fundamentally transformed natural language processing, making them indispensable across domains ranging from conversational systems to scientific exploration. However, their pre-trained architectures often reveal limitations in specialized contexts, including restricted reasoning capacities, ethical uncertainties, and suboptimal domain-specific per…

    Submitted 8 March, 2025; originally announced March 2025.

    Comments: 87 pages, 21 figures, 9 tables

  14. arXiv:2503.04392  [pdf, other]

    cs.AI

    AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management

    Authors: Junyuan Mao, Fanci Meng, Yifan Duan, Miao Yu, Xiaojun Jia, Junfeng Fang, Yuxuan Liang, Kun Wang, Qingsong Wen

    Abstract: Large Language Model based multi-agent systems are revolutionizing autonomous communication and collaboration, yet they remain vulnerable to security threats like unauthorized access and data breaches. To address this, we introduce AgentSafe, a novel framework that enhances MAS security through hierarchical information management and memory protection. AgentSafe classifies information by security…

    Submitted 6 March, 2025; originally announced March 2025.

  15. arXiv:2503.04252  [pdf, other]

    cs.DB cs.LG

    RCRank: Multimodal Ranking of Root Causes of Slow Queries in Cloud Database Systems

    Authors: Biao Ouyang, Yingying Zhang, Hanyin Cheng, Yang Shu, Chenjuan Guo, Bin Yang, Qingsong Wen, Lunting Fan, Christian S. Jensen

    Abstract: With the continued migration of storage to cloud database systems, the impact of slow queries in such systems on services and user experience is increasing. Root-cause diagnosis plays an indispensable role in facilitating slow-query detection and revision. This paper proposes a method capable of both identifying possible root cause types for slow queries and ranking these according to their potenti…

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: Accepted by VLDB 2025

  16. arXiv:2503.01875  [pdf, other]

    cs.CL cs.AI cs.LG

    Time-MQA: Time Series Multi-Task Question Answering with Context Enhancement

    Authors: Yaxuan Kong, Yiyuan Yang, Yoontae Hwang, Wenjie Du, Stefan Zohren, Zhangyang Wang, Ming Jin, Qingsong Wen

    Abstract: Time series data are foundational in finance, healthcare, and energy domains. However, most existing methods and datasets remain focused on a narrow spectrum of tasks, such as forecasting or anomaly detection. To bridge this gap, we introduce Time Series Multi-Task Question Answering (Time-MQA), a unified framework that enables natural language queries across multiple time series tasks - numerical…

    Submitted 26 February, 2025; originally announced March 2025.

  17. arXiv:2503.00580  [pdf, other]

    cs.LG cs.AI eess.SP

    Brain Foundation Models: A Survey on Advancements in Neural Signal Processing and Brain Discovery

    Authors: Xinliang Zhou, Chenyu Liu, Zhisheng Chen, Kun Wang, Yi Ding, Ziyu Jia, Qingsong Wen

    Abstract: Brain foundation models (BFMs) have emerged as a transformative paradigm in computational neuroscience, offering a revolutionary framework for processing diverse neural signals across different brain-related tasks. These models leverage large-scale pre-training techniques, allowing them to generalize effectively across multiple scenarios, tasks, and modalities, thus overcoming the traditional limi…

    Submitted 1 March, 2025; originally announced March 2025.

  18. arXiv:2502.18209  [pdf, other]

    cs.CL cs.AI

    LAG: LLM agents for Leaderboard Auto Generation on Demanding

    Authors: Jian Wu, Jiayu Zhang, Dongyuan Li, Linyi Yang, Aoxiao Zhong, Renhe Jiang, Qingsong Wen, Yue Zhang

    Abstract: This paper introduces Leaderboard Auto Generation (LAG), a novel and well-organized framework for automatic generation of leaderboards on a given research topic in rapidly evolving fields like Artificial Intelligence (AI). Faced with a large number of AI papers updated daily, it becomes difficult for researchers to track every paper's proposed methods, experimental results, and settings, prompting…

    Submitted 25 February, 2025; originally announced February 2025.

  19. arXiv:2502.17055  [pdf, other]

    cs.LG cs.AI

    Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

    Authors: Tianjin Huang, Haotian Hu, Zhenyu Zhang, Gaojie Jin, Xiang Li, Li Shen, Tianlong Chen, Lu Liu, Qingsong Wen, Zhangyang Wang, Shiwei Liu

    Abstract: This paper comprehensively evaluates several recently proposed optimizers for 4-bit training, revealing that low-bit precision amplifies sensitivity to learning rates and often causes unstable gradient norms, leading to divergence at higher learning rates. Among these, SPAM, a recent optimizer featuring momentum reset and spike-aware gradient clipping, achieves the best performance across various…

    Submitted 11 April, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

  20. arXiv:2502.15261  [pdf, other]

    cs.CL cs.AI

    Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction

    Authors: Jingheng Ye, Shang Qin, Yinghui Li, Hai-Tao Zheng, Shen Wang, Qingsong Wen

    Abstract: Grammatical Error Correction (GEC) faces a critical challenge concerning explainability, notably when GEC systems are designed for language learners. Existing research predominantly focuses on explaining grammatical errors extracted in advance, thus neglecting the relationship between explanations and corrections. To address this gap, we introduce EXGEC, a unified explainable GEC framework that in…

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 19 pages, 2 figures, and 9 tables

  21. arXiv:2502.13789  [pdf, other]

    cs.CV

    From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education

    Authors: Yi-Fan Zhang, Hang Li, Dingjie Song, Lichao Sun, Tianlong Xu, Qingsong Wen

    Abstract: Large Language Models (LLMs), such as GPT-4, have demonstrated impressive mathematical reasoning capabilities, achieving near-perfect performance on benchmarks like GSM8K. However, their application in personalized education remains limited due to an overemphasis on correctness over error diagnosis and feedback generation. Current models fail to provide meaningful insights into the causes of stude…

    Submitted 19 February, 2025; originally announced February 2025.

  22. arXiv:2502.08114  [pdf, other]

    cs.HC stat.CO

    From Clicks to Conversations: Evaluating the Effectiveness of Conversational Agents in Statistical Analysis

    Authors: Qifu Wen, Prishita Kochhar, Sherif Zeyada, Tahereh Javaheri, Reza Rawassizadeh

    Abstract: The rapid proliferation of data science forced different groups of individuals with different backgrounds to adapt to statistical analysis. We hypothesize that conversational agents are better suited for statistical analysis than traditional graphical user interfaces (GUI). In this work, we propose a novel conversational agent, StatZ, for statistical analysis. We evaluate the efficacy of StatZ rel…

    Submitted 16 February, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: 20 pages, 6 figures. Under review

    MSC Class: 62-07 ACM Class: H.5.2; I.2.7

  23. arXiv:2502.05467  [pdf, other]

    cs.CL cs.AI

    Position: LLMs Can be Good Tutors in Foreign Language Education

    Authors: Jingheng Ye, Shen Wang, Deqing Zou, Yibo Yan, Kun Wang, Hai-Tao Zheng, Zenglin Xu, Irwin King, Philip S. Yu, Qingsong Wen

    Abstract: While recent efforts have begun integrating large language models (LLMs) into foreign language education (FLE), they often rely on traditional approaches to learning tasks without fully embracing educational methodologies, thus lacking adaptability to language learning. To address this gap, we argue that LLMs have the potential to serve as effective tutors in FLE. Specifically, LLMs can play three…

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 18 pages, 4 figures

  24. arXiv:2502.04395  [pdf, other]

    cs.CV cs.LG

    Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting

    Authors: Siru Zhong, Weilin Ruan, Ming Jin, Huan Li, Qingsong Wen, Yuxuan Liang

    Abstract: Recent advancements in time series forecasting have explored augmenting models with text or vision modalities to improve accuracy. While text provides contextual understanding, it often lacks fine-grained temporal details. Conversely, vision captures intricate temporal patterns but lacks semantic context, limiting the complementary potential of these modalities. To address this, we propose Time-VL…

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 19 pages

  25. arXiv:2502.02871  [pdf, other]

    cs.CL cs.AI

    Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

    Authors: Yibo Yan, Shen Wang, Jiahao Huo, Jingheng Ye, Zhendong Chu, Xuming Hu, Philip S. Yu, Carla Gomes, Bart Selman, Qingsong Wen

    Abstract: Scientific reasoning, the process through which humans apply logic, evidence, and critical thinking to explore and interpret scientific phenomena, is essential in advancing knowledge reasoning across diverse fields. However, despite significant progress, current scientific reasoning models still struggle with generalization across domains and often fall short of multimodal perception. Multimodal L…

    Submitted 4 February, 2025; originally announced February 2025.

  26. arXiv:2502.01477  [pdf, other]

    cs.LG cs.AI

    Position: Empowering Time Series Reasoning with Multimodal LLMs

    Authors: Yaxuan Kong, Yiyuan Yang, Shiyu Wang, Chenghao Liu, Yuxuan Liang, Ming Jin, Stefan Zohren, Dan Pei, Yan Liu, Qingsong Wen

    Abstract: Understanding time series data is crucial for multiple real-world applications. While large language models (LLMs) show promise in time series tasks, current approaches often rely on numerical data alone, overlooking the multimodal nature of time-dependent information, such as textual descriptions, visual data, and audio signals. Moreover, these methods underutilize LLMs' reasoning capabilities, l…

    Submitted 3 February, 2025; originally announced February 2025.

  27. arXiv:2502.00338  [pdf, other]

    cs.LG physics.ao-ph

    OneForecast: A Universal Framework for Global and Regional Weather Forecasting

    Authors: Yuan Gao, Hao Wu, Ruiqi Shu, Huanshuo Dong, Fan Xu, Rui Chen, Yibo Yan, Qingsong Wen, Xuming Hu, Kun Wang, Jiahao Wu, Qing Li, Hui Xiong, Xiaomeng Huang

    Abstract: Accurate weather forecasts are important for disaster prevention, agricultural planning, and water resource management. Traditional numerical weather prediction (NWP) methods offer physically interpretable high-accuracy predictions but are computationally expensive and fail to fully leverage rapidly growing historical data. In recent years, deep learning methods have made significant progress in w…

    Submitted 1 February, 2025; originally announced February 2025.

  28. Noise-Resilient Point-wise Anomaly Detection in Time Series Using Weak Segment Labels

    Authors: Yaxuan Wang, Hao Cheng, Jing Xiong, Qingsong Wen, Han Jia, Ruixuan Song, Liyuan Zhang, Zhaowei Zhu, Yang Liu

    Abstract: Detecting anomalies in temporal data has gained significant attention across various real-world applications, aiming to identify unusual events and mitigate potential hazards. In practice, situations often involve a mix of segment-level labels (detected abnormal events with segments of time points) and unlabeled data (undetected events), while the ideal algorithmic outcome should be point-level pr…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: Accepted by 2025 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'25)

  29. arXiv:2501.01282  [pdf, other]

    cs.AI cs.CL cs.CV

    CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries

    Authors: Shudong Liu, Yiqiao Jin, Cheng Li, Derek F. Wong, Qingsong Wen, Lichao Sun, Haipeng Chen, Xing Xie, Jindong Wang

    Abstract: Vision-language models (VLMs) have advanced human-AI interaction but struggle with cultural understanding, often misinterpreting symbols, gestures, and artifacts due to biases in predominantly Western-centric training data. In this paper, we construct CultureVerse, a large-scale multimodal benchmark covering 19,682 cultural concepts, 188 countries/regions, 15 cultural concepts, and 3 question typ…

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: Technical report; 26 pages

  30. arXiv:2501.00055  [pdf, other]

    cs.CR cs.AI cs.CL

    LLM-Virus: Evolutionary Jailbreak Attack on Large Language Models

    Authors: Miao Yu, Junfeng Fang, Yingjie Zhou, Xing Fan, Kun Wang, Shirui Pan, Qingsong Wen

    Abstract: While safety-aligned large language models (LLMs) are increasingly used as the cornerstone for powerful systems such as multi-agent frameworks to solve complex real-world problems, they still suffer from potential adversarial queries, such as jailbreak attacks, which attempt to induce harmful content. Researching attack methods allows us to better understand the limitations of LLM and make trade-o…

    Submitted 28 December, 2024; originally announced January 2025.

  31. arXiv:2412.16838  [pdf, other]

    cs.CL

    Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions

    Authors: Hang Li, Tianlong Xu, Kaiqi Yang, Yucheng Chu, Yanling Chen, Yichi Song, Qingsong Wen, Hui Liu

    Abstract: The rise of large language models (LLMs) offers new opportunities for automatic error detection in education, particularly for math word problems (MWPs). While prior studies demonstrate the promise of LLMs as error detectors, they overlook the presence of multiple valid solutions for a single MWP. Our preliminary analysis reveals a significant performance gap between conventional and alternative s…

    Submitted 21 December, 2024; originally announced December 2024.

    Comments: 12 pages, 4 figures

  32. arXiv:2412.11936  [pdf, other]

    cs.CL

    A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

    Authors: Yibo Yan, Jiamin Su, Jianxiang He, Fangteng Fu, Xu Zheng, Yuanhuiyi Lyu, Kun Wang, Shen Wang, Qingsong Wen, Xuming Hu

    Abstract: Mathematical reasoning, a core aspect of human cognition, is vital across many domains, from educational problem-solving to scientific advancements. As artificial general intelligence (AGI) progresses, integrating large language models (LLMs) with mathematical reasoning tasks is becoming increasingly significant. This survey provides the first comprehensive analysis of mathematical reasoning in th…

    Submitted 17 February, 2025; v1 submitted 16 December, 2024; originally announced December 2024.

  33. arXiv:2412.10430  [pdf, other]

    cs.CV cs.GR

    Unsupervised Cross-Domain Regression for Fine-grained 3D Game Character Reconstruction

    Authors: Qi Wen, Xiang Wen, Hao Jiang, Siqi Yang, Bingfeng Han, Tianlei Hu, Gang Chen, Shuang Li

    Abstract: With the rise of the ``metaverse'' and the rapid development of games, it has become more and more critical to reconstruct characters in the virtual world faithfully. The immersive experience is one of the most central themes of the ``metaverse'', while the reducibility of the avatar is the crucial point. Meanwhile, the game is the carrier of the metaverse, in which players can freely edit the fac…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 12 pages, 10 figures

  34. arXiv:2411.17218  [pdf, other]

    cs.LG cs.AI

    GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network

    Authors: Weiqi Chen, Zhiqiang Zhou, Qingsong Wen, Liang Sun

    Abstract: Time series subsequence anomaly detection is an important task in a large variety of real-world applications ranging from health monitoring to AIOps, and is challenging due to the following reasons: 1) how to effectively learn complex dynamics and dependencies in time series; 2) diverse and complicated anomalous subsequences as well as the inherent variance and noise of normal patterns; 3) how to…

    Submitted 26 November, 2024; originally announced November 2024.

  35. arXiv:2411.03033  [pdf, other]

    cs.CV cs.LG

    Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective

    Authors: Qishuai Wen, Chun-Guang Li

    Abstract: State-of-the-art methods for Transformer-based semantic segmentation typically adopt Transformer decoders that are used to extract additional embeddings from image embeddings via cross-attention, refine either or both types of embeddings via self-attention, and project image embeddings onto the additional embeddings via dot-product. Despite their remarkable success, these empirical designs still l…

    Submitted 14 January, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024. Code: https://github.com/QishuaiWen/DEPICT/

  36. arXiv:2411.02815  [pdf]

    eess.IV cs.CV

    Artificial Intelligence-Enhanced Couinaud Segmentation for Precision Liver Cancer Therapy

    Authors: Liang Qiu, Wenhao Chi, Xiaohan Xing, Praveenbalaji Rajendran, Mingjie Li, Yuming Jiang, Oscar Pastor-Serrano, Sen Yang, Xiyue Wang, Yuanfeng Ji, Qiang Wen

    Abstract: Precision therapy for liver cancer necessitates accurately delineating liver sub-regions to protect healthy tissue while targeting tumors, which is essential for reducing recurrence and improving survival rates. However, the segmentation of hepatic segments, known as Couinaud segmentation, is challenging due to indistinct sub-region boundaries and the need for extensive annotated datasets. This st…

    Submitted 5 November, 2024; originally announced November 2024.

  37. arXiv:2410.15686  [pdf, other]

    cs.MA cs.AI

    NetSafe: Exploring the Topological Safety of Multi-agent Networks

    Authors: Miao Yu, Shilong Wang, Guibin Zhang, Junyuan Mao, Chenlong Yin, Qijiong Liu, Qingsong Wen, Kun Wang, Yang Wang

    Abstract: Large language models (LLMs) have empowered nodes within multi-agent networks with intelligence, showing growing applications in both academia and industry. However, how to prevent these networks from generating malicious information remains unexplored with previous research on single LLM's safety be challenging to transfer. In this paper, we focus on the safety of multi-agent networks from a topo…

    Submitted 21 October, 2024; originally announced October 2024.

  38. arXiv:2410.12206  [pdf, other]

    cs.LG cs.AI

    Abnormality Forecasting: Time Series Anomaly Prediction via Future Context Modeling

    Authors: Sinong Zhao, Wenrui Wang, Hongzuo Xu, Zhaoyang Yu, Qingsong Wen, Gang Wang, xiaoguang Liu, Guansong Pang

    Abstract: Identifying anomalies from time series data plays an important role in various fields such as infrastructure security, intelligent operation and maintenance, and space exploration. Current research focuses on detecting the anomalies after they occur, which can lead to significant financial/reputation loss or infrastructure damage. In this work we instead study a more practical yet very challenging…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 11 pages, 5 figures, submitted to KDD conference

  39. arXiv:2410.11802  [pdf, other]

    cs.LG

    FoundTS: Comprehensive and Unified Benchmarking of Foundation Models for Time Series Forecasting

    Authors: Zhe Li, Xiangfei Qiu, Peng Chen, Yihang Wang, Hanyin Cheng, Yang Shu, Jilin Hu, Chenjuan Guo, Aoying Zhou, Qingsong Wen, Christian S. Jensen, Bin Yang

    Abstract: Time Series Forecasting (TSF) is key functionality in numerous fields, including in finance, weather services, and energy management. While TSF methods are emerging these days, many of them require domain-specific data collection and model training and struggle with poor generalization performance on new domains. Foundation models aim to overcome this limitation. Pre-trained on large-scale languag…

    Submitted 26 November, 2024; v1 submitted 15 October, 2024; originally announced October 2024.

  40. arXiv:2410.11273  [pdf, other]

    cs.SI cs.DB

    GCLS$^2$: Towards Efficient Community Detection Using Graph Contrastive Learning with Structure Semantics

    Authors: Qi Wen, Yiyang Zhang, Yutong Ye, Yingbo Zhou, Nan Zhang, Xiang Lian, Mingsong Chen

    Abstract: Due to the power of learning representations from unlabeled graphs, graph contrastive learning (GCL) has shown excellent performance in community detection tasks. Existing GCL-based methods on the community detection usually focused on learning attribute representations of individual nodes, which, however, ignores structural semantics of communities (e.g., nodes in the same community should be str…

    Submitted 2 December, 2024; v1 submitted 15 October, 2024; originally announced October 2024.

  41. arXiv:2410.09283  [pdf, other]

    cs.CL

    Comparative Analysis of Static and Contextual Embeddings for Analyzing Semantic Changes in Medieval Latin Charters

    Authors: Yifan Liu, Gelila Tilahun, Xinxiang Gao, Qianfeng Wen, Michael Gervers

    Abstract: The Norman Conquest of 1066 C.E. brought profound transformations to England's administrative, societal, and linguistic practices. The DEEDS (Documents of Early England Data Set) database offers a unique opportunity to explore these changes by examining shifts in word meanings within a vast collection of Medieval Latin charters. While computational linguistics typically relies on vector representa…

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 11 pages, 6 figures

  42. arXiv:2410.06652  [pdf, other]

    cs.LG cs.AI

    Task-oriented Time Series Imputation Evaluation via Generalized Representers

    Authors: Zhixian Wang, Linxiao Yang, Liang Sun, Qingsong Wen, Yi Wang

    Abstract: Time series analysis is widely used in many fields such as power energy, economics, and transportation, including different tasks such as forecasting, anomaly detection, classification, etc. Missing values are widely observed in these tasks, and often leading to unpredictable negative effects on existing methods, hindering their further application. In response to this situation, existing time ser…

    Submitted 10 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: 22 pages, 9 figures, 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  43. arXiv:2410.06651  [pdf, other]

    cs.LG cs.AI

    Toward Physics-guided Time Series Embedding

    Authors: Jiaxi Hu, Bowen Zhang, Qingsong Wen, Fugee Tsung, Yuxuan Liang

    Abstract: In various scientific and engineering fields, the primary research areas have revolved around physics-based dynamical systems modeling and data-driven time series analysis. According to the embedding theory, dynamical systems and time series can be mutually transformed using observation functions and physical reconstruction techniques. Based on this, we propose Embedding Duality Theory, where the…

    Submitted 9 October, 2024; originally announced October 2024.

  44. arXiv:2410.05298  [pdf, ps, other]

    cs.LG cs.AI

    How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension

    Authors: Xinnan Dai, Haohao Qu, Yifen Shen, Bohang Zhang, Qihao Wen, Wenqi Fan, Dongsheng Li, Jiliang Tang, Caihua Shan

    Abstract: Benchmarking the capabilities and limitations of large language models (LLMs) in graph-related tasks is becoming an increasingly popular and crucial area of research. Recent studies have shown that LLMs exhibit a preliminary ability to understand graph structures and node features. However, the potential of LLMs in graph pattern mining remains largely unexplored. This is a key component in fields…

    Submitted 20 April, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: The paper is published in ICLR 2025

  45. arXiv:2410.04509  [pdf, other]

    cs.CL

    ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection

    Authors: Yibo Yan, Shen Wang, Jiahao Huo, Hang Li, Boyan Li, Jiamin Su, Xiong Gao, Yi-Fan Zhang, Tianlong Xu, Zhendong Chu, Aoxiao Zhong, Kun Wang, Hui Xiong, Philip S. Yu, Xuming Hu, Qingsong Wen

    Abstract: As the field of Multimodal Large Language Models (MLLMs) continues to evolve, their potential to revolutionize artificial intelligence is particularly promising, especially in addressing mathematical reasoning tasks. Current mathematical benchmarks predominantly focus on evaluating MLLMs' problem-solving ability, yet there is a crucial gap in addressing more complex scenarios such as error detecti…

    Submitted 8 October, 2024; v1 submitted 6 October, 2024; originally announced October 2024.

  46. arXiv:2410.01677  [pdf, other]

    cs.AI

    Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia

    Authors: Miao Yu, Junyuan Mao, Guibin Zhang, Jingheng Ye, Junfeng Fang, Aoxiao Zhong, Yang Liu, Yuxuan Liang, Kun Wang, Qingsong Wen

    Abstract: Research into the external behaviors and internal mechanisms of large language models (LLMs) has shown promise in addressing complex tasks in the physical world. Studies suggest that powerful LLMs, like GPT-4, are beginning to exhibit human-like cognitive abilities, including planning, reasoning, and reflection. In this paper, we introduce a research line and methodology called LLM Psychology, lev…

    Submitted 23 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

  47. arXiv:2410.01598  [pdf, other]

    cs.IR cs.AI

    Elaborative Subtopic Query Reformulation for Broad and Indirect Queries in Travel Destination Recommendation

    Authors: Qianfeng Wen, Yifan Liu, Joshua Zhang, George Saad, Anton Korikov, Yury Sambale, Scott Sanner

    Abstract: In Query-driven Travel Recommender Systems (RSs), it is crucial to understand the user intent behind challenging natural language (NL) destination queries such as the broadly worded "youth-friendly activities" or the indirect description "a high school graduation trip". Such queries are challenging due to the wide scope and subtlety of potential user intents that confound the ability of retrieval m…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: 9 pages, 7 figures, The 1st Workshop on Risks, Opportunities, and Evaluation of Generative Models in Recommender Systems (ROEGEN@RecSys 2024), October 2024, Bari, Italy

  48. arXiv:2409.19718  [pdf, other]

    cs.LG stat.ML

    Evolving Multi-Scale Normalization for Time Series Forecasting under Distribution Shifts

    Authors: Dalin Qin, Yehui Li, Weiqi Chen, Zhaoyang Zhu, Qingsong Wen, Liang Sun, Pierre Pinson, Yi Wang

    Abstract: Complex distribution shifts are the main obstacle to achieving accurate long-term time series forecasting. Several efforts have been conducted to capture the distribution characteristics and propose adaptive normalization techniques to alleviate the influence of distribution shifts. However, these methods neglect the intricate distribution dynamics observed from various scales and the evolving fun…

    Submitted 29 September, 2024; originally announced September 2024.

  49. arXiv:2409.16040  [pdf, other]

    cs.LG cs.AI

    Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

    Authors: Xiaoming Shi, Shiyu Wang, Yuqi Nie, Dianqi Li, Zhou Ye, Qingsong Wen, Ming Jin

    Abstract: Deep learning for time series forecasting has seen significant advancements over the past decades. However, despite the success of large-scale pre-training in language and vision domains, pre-trained time series models remain limited in scale and operate at a high cost, hindering the development of larger capable forecasting models in real-world applications. In response, we introduce Time-MoE, a…

    Submitted 27 February, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: Accepted by the 13th International Conference on Learning Representations (ICLR 2025)

  50. arXiv:2409.12169  [pdf, other]

    cs.LG

    LogoRA: Local-Global Representation Alignment for Robust Time Series Classification

    Authors: Huanyu Zhang, Yi-Fan Zhang, Zhang Zhang, Qingsong Wen, Liang Wang

    Abstract: Unsupervised domain adaptation (UDA) of time series aims to teach models to identify consistent patterns across various temporal scenarios, disregarding domain-specific differences, which can maintain their predictive accuracy and effectively adapt to new domains. However, existing UDA methods struggle to adequately extract and align both global and local features in time series data. To address t…

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering