Skip to main content

Showing 1–23 of 23 results for author: Xing, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.12928  [pdf, ps, other

    cs.AI

    Scaling Test-time Compute for LLM Agents

    Authors: King Zhu, Hanhao Li, Siwei Wu, Tianshun Xing, Dehua Ma, Xiangru Tang, Minghao Liu, Jian Yang, Jiaheng Liu, Yuchen Eleanor Jiang, Changwang Zhang, Chenghua Lin, Jun Wang, Ge Zhang, Wangchunshu Zhou

    Abstract: Scaling test time compute has shown remarkable success in improving the reasoning abilities of large language models (LLMs). In this work, we conduct the first systematic exploration of applying test-time scaling methods to language agents and investigate the extent to which it improves their effectiveness. Specifically, we explore different test-time scaling strategies, including: (1) parallel sa… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  2. arXiv:2505.01438  [pdf

    cs.LG cond-mat.mtrl-sci cs.AI

    Global Stress Generation and Spatiotemporal Super-Resolution Physics-Informed Operator under Dynamic Loading for Two-Phase Random Materials

    Authors: Tengfei Xing, Xiaodan Ren, Jie Li

    Abstract: Material stress analysis is a critical aspect of material design and performance optimization. Under dynamic loading, the global stress evolution in materials exhibits complex spatiotemporal characteristics, especially in two-phase random materials (TRMs). Such kind of material failure is often associated with stress concentration, and the phase boundaries are key locations where stress concentrat… ▽ More

    Submitted 26 April, 2025; originally announced May 2025.

  3. arXiv:2504.18854  [pdf

    cond-mat.mtrl-sci cs.AI cs.LG

    Predicting Stress in Two-phase Random Materials and Super-Resolution Method for Stress Images by Embedding Physical Information

    Authors: Tengfei Xing, Xiaodan Ren, Jie Li

    Abstract: Stress analysis is an important part of material design. For materials with complex microstructures, such as two-phase random materials (TRMs), material failure is often accompanied by stress concentration. Phase interfaces in two-phase materials are critical for stress concentration. Therefore, the prediction error of stress at phase boundaries is crucial. In practical engineering, the pixels of… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

  4. arXiv:2503.11064  [pdf, other

    eess.SP cs.LG

    MobiVital: Self-supervised Time-series Quality Estimation for Contactless Respiration Monitoring Using UWB Radar

    Authors: Ziqi Wang, Derek Hua, Wenjun Jiang, Tianwei Xing, Xun Chen, Mani Srivastava

    Abstract: Respiration waveforms are increasingly recognized as important biomarkers, offering insights beyond simple respiration rates, such as detecting breathing irregularities for disease diagnosis or monitoring breath patterns to guide rehabilitation training. Previous works in wireless respiration monitoring have primarily focused on estimating respiration rate, where the breath waveforms are often gen… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  5. arXiv:2410.23692  [pdf, other

    cs.CL cs.CY

    Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction

    Authors: Peizhi Tang, Chuang Yang, Tong Xing, Xiaohang Xu, Renhe Jiang, Kaoru Sezaki

    Abstract: Human mobility prediction plays a critical role in applications such as disaster response, urban planning, and epidemic forecasting. Traditional methods often rely on designing crafted, domain-specific models, and typically focus on short-term predictions, which struggle to generalize across diverse urban environments. In this study, we introduce Llama-3-8B-Mob, a large language model fine-tuned w… ▽ More

    Submitted 31 October, 2024; originally announced October 2024.

  6. Multi-source Domain Adaptation for Panoramic Semantic Segmentation

    Authors: Jing Jiang, Sicheng Zhao, Jiankun Zhu, Wenbo Tang, Zhaopan Xu, Jidong Yang, Guoping Liu, Tengfei Xing, Pengfei Xu, Hongxun Yao

    Abstract: Unsupervised domain adaptation methods for panoramic semantic segmentation utilize real pinhole images or low-cost synthetic panoramic images to transfer segmentation models to real panoramic images. However, these methods struggle to understand the panoramic structure using only real pinhole images and lack real-world scene perception with only synthetic panoramic images. Therefore, in this paper… ▽ More

    Submitted 7 January, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: Accepted by Information Fusion 2025

    Journal ref: Information Fusion, 2025: 102909

  7. arXiv:2406.10125  [pdf, other

    cs.CV

    MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report

    Authors: Zhongyu Yang, Mai Liu, Jinluo Xie, Yueming Zhang, Chen Shen, Wei Shao, Jichao Jiao, Tengfei Xing, Runbo Hu, Pengfei Xu

    Abstract: Autonomous driving without high-definition (HD) maps demands a higher level of active scene understanding. In this competition, the organizers provided the multi-perspective camera images and standard-definition (SD) maps to explore the boundaries of scene reasoning capabilities. We found that most existing algorithms construct Bird's Eye View (BEV) features from these multi-perspective images and… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  8. arXiv:2403.14354  [pdf, other

    cs.CV

    LDTR: Transformer-based Lane Detection with Anchor-chain Representation

    Authors: Zhongyu Yang, Chen Shen, Wei Shao, Tengfei Xing, Runbo Hu, Pengfei Xu, Hua Chai, Ruini Xue

    Abstract: Despite recent advances in lane detection methods, scenarios with limited- or no-visual-clue of lanes due to factors such as lighting conditions and occlusion remain challenging and crucial for automated driving. Moreover, current lane representations require complex post-processing and struggle with specific instances. Inspired by the DETR architecture, we propose LDTR, a transformer-based model… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by CVM 2024 and CVMJ. 16 pages, 14 figures

  9. arXiv:2402.18700  [pdf, other

    cs.CL cs.AI cs.LG

    Learning to Compress Prompt in Natural Language Formats

    Authors: Yu-Neng Chuang, Tianwei Xing, Chia-Yuan Chang, Zirui Liu, Xun Chen, Xia Hu

    Abstract: Large language models (LLMs) are great at processing multiple natural language processing tasks, but their abilities are constrained by inferior performance with long context, slow inference speed, and the high cost of computing the results. Deploying LLMs with precise and informative context helps users process large-scale datasets more effectively and cost-efficiently. Existing works rely on com… ▽ More

    Submitted 1 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  10. Application of AI in Nutrition

    Authors: Ritu Ramakrishnan, Tianxiang Xing, Tianfeng Chen, Ming-Hao Lee, Jinzhu Gao

    Abstract: In healthcare, artificial intelligence (AI) has been changing the way doctors and health experts take care of people. This paper will cover how AI is making major changes in the health care system, especially with nutrition. Various machine learning and deep learning algorithms have been developed to extract valuable information from healthcare data which help doctors, nutritionists, and health ex… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Journal ref: Journal of Advances in Information Science and Technology, Volume 1, Issue 1, 2023, Pages 7-12

  11. arXiv:2311.18531  [pdf, other

    cs.CV cs.AI cs.LG

    Dataset Distillation via the Wasserstein Metric

    Authors: Haoyang Liu, Yijiang Li, Tiancheng Xing, Vibhu Dalal, Luwei Li, Jingrui He, Haohan Wang

    Abstract: Dataset Distillation (DD) emerges as a powerful strategy to encapsulate the expansive information of large datasets into significantly smaller, synthetic equivalents, thereby preserving model performance with reduced computational overhead. Pursuing this objective, we introduce the Wasserstein distance, a metric grounded in optimal transport theory, to enhance distribution matching in DD. Our appr… ▽ More

    Submitted 15 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 21 pages, 8 figures

  12. arXiv:2304.11546  [pdf, other

    cs.CV

    CANet: Curved Guide Line Network with Adaptive Decoder for Lane Detection

    Authors: Zhongyu Yang, Chen Shen, Wei Shao, Tengfei Xing, Runbo Hu, Pengfei Xu, Hua Chai, Ruini Xue

    Abstract: Lane detection is challenging due to the complicated on road scenarios and line deformation from different camera perspectives. Lots of solutions were proposed, but can not deal with corner lanes well. To address this problem, this paper proposes a new top-down deep learning lane detection approach, CANET. A lane instance is first responded by the heat-map on the U-shaped curved guide line at glob… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 5 pages, IEEE ICASSP 2023

  13. arXiv:2204.04492  [pdf, other

    cs.CV

    S4OD: Semi-Supervised learning for Single-Stage Object Detection

    Authors: Yueming Zhang, Xingxu Yao, Chao Liu, Feng Chen, Xiaolin Song, Tengfei Xing, Runbo Hu, Hua Chai, Pengfei Xu, Guoshan Zhang

    Abstract: Single-stage detectors suffer from extreme foreground-background class imbalance, while two-stage detectors do not. Therefore, in semi-supervised object detection, two-stage detectors can deliver remarkable performance by only selecting high-quality pseudo labels based on classification scores. However, directly applying this strategy to single-stage detectors would aggravate the class imbalance w… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

  14. arXiv:2110.14240  [pdf, other

    cs.CV

    3rd Place Solution for VisDA 2021 Challenge -- Universally Domain Adaptive Image Recognition

    Authors: Haojin Liao, Xiaolin Song, Sicheng Zhao, Shanghang Zhang, Xiangyu Yue, Xingxu Yao, Yueming Zhang, Tengfei Xing, Pengfei Xu, Qiang Wang

    Abstract: The Visual Domain Adaptation (VisDA) 2021 Challenge calls for unsupervised domain adaptation (UDA) methods that can deal with both input distribution shift and label set variance between the source and target domains. In this report, we introduce a universal domain adaptation (UniDA) method by aggregating several popular feature extraction and domain adaptation schemes. First, we utilize VOLO, a T… ▽ More

    Submitted 27 February, 2025; v1 submitted 27 October, 2021; originally announced October 2021.

  15. arXiv:2110.08090  [pdf, other

    cs.SD cs.AI eess.AS

    Using DeepProbLog to perform Complex Event Processing on an Audio Stream

    Authors: Marc Roig Vilamala, Tianwei Xing, Harrison Taylor, Luis Garcia, Mani Srivastava, Lance Kaplan, Alun Preece, Angelika Kimmig, Federico Cerutti

    Abstract: In this paper, we present an approach to Complex Event Processing (CEP) that is based on DeepProbLog. This approach has the following objectives: (i) allowing the use of subsymbolic data as an input, (ii) retaining the flexibility and modularity on the definitions of complex event rules, (iii) allowing the system to be trained in an end-to-end manner and (iv) being robust against noisily labelled… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: 8 pages, 3 figures

  16. arXiv:2106.08713  [pdf, ps, other

    cs.CV

    2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object Detection

    Authors: Yueming Zhang, Xiaolin Song, Bing Bai, Tengfei Xing, Chao Liu, Xin Gao, Zhihui Wang, Yawei Wen, Haojin Liao, Guoshan Zhang, Pengfei Xu

    Abstract: In an autonomous driving system, it is essential to recognize vehicles, pedestrians and cyclists from images. Besides the high accuracy of the prediction, the requirement of real-time running brings new challenges for convolutional network models. In this report, we introduce a real-time method to detect the 2D objects from images. We aggregate several popular one-stage object detectors and train… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  17. arXiv:2011.10794  [pdf, other

    cs.AI

    Spatially Correlated Patterns in Adversarial Images

    Authors: Nandish Chattopadhyay, Lionell Yip En Zhi, Bryan Tan Bing Xing, Anupam Chattopadhyay

    Abstract: Adversarial attacks have proved to be the major impediment in the progress on research towards reliable machine learning solutions. Carefully crafted perturbations, imperceptible to human vision, can be added to images to force misclassification by an otherwise high performing neural network. To have a better understanding of the key contributors of such structured attacks, we searched for and stu… ▽ More

    Submitted 21 November, 2020; originally announced November 2020.

    Comments: Submitted for review

  18. arXiv:2010.14388  [pdf, other

    cs.AI

    An Experimentation Platform for Explainable Coalition Situational Understanding

    Authors: Katie Barrett-Powell, Jack Furby, Liam Hiley, Marc Roig Vilamala, Harrison Taylor, Federico Cerutti, Alun Preece, Tianwei Xing, Luis Garcia, Mani Srivastava, Dave Braines

    Abstract: We present an experimentation platform for coalition situational understanding research that highlights capabilities in explainable artificial intelligence/machine learning (AI/ML) and integration of symbolic and subsymbolic AI/ML approaches for event processing. The Situational Understanding Explorer (SUE) platform is designed to be lightweight, to easily facilitate experiments and demonstrations… ▽ More

    Submitted 9 November, 2020; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Presented at AAAI FSS-20: Artificial Intelligence in Government and Public Sector, Washington, DC, USA

  19. arXiv:2009.03420  [pdf, ps, other

    cs.AI

    A Hybrid Neuro-Symbolic Approach for Complex Event Processing

    Authors: Marc Roig Vilamala, Harrison Taylor, Tianwei Xing, Luis Garcia, Mani Srivastava, Lance Kaplan, Alun Preece, Angelika Kimmig, Federico Cerutti

    Abstract: Training a model to detect patterns of interrelated events that form situations of interest can be a complex problem: such situations tend to be uncommon, and only sparse data is available. We propose a hybrid neuro-symbolic architecture based on Event Calculus that can perform Complex Event Processing (CEP). It leverages both a neural network to interpret inputs and logical rules that express the… ▽ More

    Submitted 13 October, 2020; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: Accepted as extended abstract at ICLP2020

  20. arXiv:2003.00832  [pdf, other

    cs.CV cs.HC cs.MM

    An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos

    Authors: Sicheng Zhao, Yunsheng Ma, Yang Gu, Jufeng Yang, Tengfei Xing, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer

    Abstract: Emotion recognition in user-generated videos plays an important role in human-centered computing. Existing methods mainly employ traditional two-stage shallow pipeline, i.e. extracting visual and/or audio features and training classifiers. In this paper, we propose to recognize video emotions in an end-to-end manner based on convolutional neural networks (CNNs). Specifically, we develop a deep Vis… ▽ More

    Submitted 12 February, 2020; originally announced March 2020.

    Comments: Accepted by AAAI 2020

  21. Time-Series Anomaly Detection Service at Microsoft

    Authors: Hansheng Ren, Bixiong Xu, Yujing Wang, Chao Yi, Congrui Huang, Xiaoyu Kou, Tony Xing, Mao Yang, Jie Tong, Qi Zhang

    Abstract: Large companies need to monitor various metrics (for example, Page Views and Revenue) of their applications and services in real time. At Microsoft, we develop a time-series anomaly detection service which helps customers to monitor the time-series continuously and alert for potential incidents on time. In this paper, we introduce the pipeline and algorithm of our anomaly detection service, which… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: KDD 2019

  22. arXiv:1707.04693  [pdf, other

    cs.CV

    Binarized Convolutional Neural Networks with Separable Filters for Efficient Hardware Acceleration

    Authors: Jeng-Hau Lin, Tianwei Xing, Ritchie Zhao, Zhiru Zhang, Mani Srivastava, Zhuowen Tu, Rajesh K. Gupta

    Abstract: State-of-the-art convolutional neural networks are enormously costly in both compute and memory, demanding massively parallel GPUs for execution. Such networks strain the computational capabilities and energy available to embedded and mobile processing platforms, restricting their use in many important applications. In this paper, we push the boundaries of hardware-effective CNN design by proposin… ▽ More

    Submitted 15 July, 2017; originally announced July 2017.

    Comments: 9 pages, 6 figures, accepted for Embedded Vision Workshop (CVPRW)

  23. Personalized Course Sequence Recommendations

    Authors: Jie Xu, Tianwei Xing, Mihaela van der Schaar

    Abstract: Given the variability in student learning it is becoming increasingly important to tailor courses as well as course sequences to student needs. This paper presents a systematic methodology for offering personalized course sequence recommendations to students. First, a forward-search backward-induction algorithm is developed that can optimally select course sequences to decrease the time required f… ▽ More

    Submitted 11 January, 2016; v1 submitted 30 December, 2015; originally announced December 2015.