Skip to main content

Showing 1–50 of 256 results for author: Sue, T

.
  1. arXiv:2506.17800  [pdf, ps, other

    cond-mat.mtrl-sci

    CLAMM: a spin CLuster expansion--Monte Carlo toolkit for Alloys and Magnetic Materials

    Authors: Brian Blankenau, Tianyu Su, Namhoon Kim, Elif Ertekin

    Abstract: Finite-temperature magnetism gives rise to many phenomena in alloy materials, such as magnetic phase transformations, short or medium range order in magnetic alloys, spin waves, critical phenomena, and the magnetocaloric effect. Lattice models, such as the Ising, Potts, cluster expansion, and magnetic cluster expansion models, are powerful tools for studying complex magnetic alloys and compounds.… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  2. arXiv:2506.17088  [pdf, ps, other

    cs.CL

    Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation

    Authors: Jiahao Cheng, Tiancheng Su, Jia Yuan, Guoxiu He, Jiawei Liu, Xinqi Tao, Jingwen Xie, Huaxia Li

    Abstract: Large Language Models (LLMs) often exhibit \textit{hallucinations}, generating factually incorrect or semantically irrelevant content in response to prompts. Chain-of-Thought (CoT) prompting can mitigate hallucinations by encouraging step-by-step reasoning, but its impact on hallucination detection remains underexplored. To bridge this gap, we conduct a systematic empirical evaluation. We begin wi… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  3. arXiv:2506.06710  [pdf, ps, other

    cs.CV eess.IV

    A Systematic Investigation on Deep Learning-Based Omnidirectional Image and Video Super-Resolution

    Authors: Qianqian Zhao, Chunle Guo, Tianyi Zhang, Junpei Zhang, Peiyang Jia, Tan Su, Wenjie Jiang, Chongyi Li

    Abstract: Omnidirectional image and video super-resolution is a crucial research topic in low-level vision, playing an essential role in virtual reality and augmented reality applications. Its goal is to reconstruct high-resolution images or video frames from low-resolution inputs, thereby enhancing detail preservation and enabling more accurate scene analysis and interpretation. In recent years, numerous i… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  4. arXiv:2505.23134  [pdf, ps, other

    cs.CV cs.AI

    Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing

    Authors: Tongtong Su, Chengyu Wang, Jun Huang, Dongming Lu

    Abstract: Appearance editing according to user needs is a pivotal task in video editing. Existing text-guided methods often lead to ambiguities regarding user intentions and restrict fine-grained control over editing specific aspects of objects. To overcome these limitations, this paper introduces a novel approach named {Zero-to-Hero}, which focuses on reference-based video editing that disentangles the edi… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  5. arXiv:2505.23014  [pdf, ps, other

    cs.LG

    Hyperbolic-PDE GNN: Spectral Graph Neural Networks in the Perspective of A System of Hyperbolic Partial Differential Equations

    Authors: Juwei Yue, Haikuo Li, Jiawei Sheng, Xiaodong Li, Taoyu Su, Tingwen Liu, Li Guo

    Abstract: Graph neural networks (GNNs) leverage message passing mechanisms to learn the topological features of graph data. Traditional GNNs learns node features in a spatial domain unrelated to the topology, which can hardly ensure topological features. In this paper, we formulates message passing as a system of hyperbolic partial differential equations (hyperbolic PDEs), constituting a dynamical system th… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 18 pages, 2 figures, published to ICML 2025

    Journal ref: International Conference on Machine Learning 2025

  6. arXiv:2505.15950  [pdf, ps, other

    eess.SY

    Gaussian Processes in Power Systems: Techniques, Applications, and Future Works

    Authors: Bendong Tan, Tong Su, Yu Weng, Ketian Ye, Parikshit Pareek, Petr Vorobev, Hung Nguyen, Junbo Zhao, Deepjyoti Deka

    Abstract: The increasing integration of renewable energy sources (RESs) and distributed energy resources (DERs) has significantly heightened operational complexity and uncertainty in modern power systems. Concurrently, the widespread deployment of smart meters, phasor measurement units (PMUs) and other sensors has generated vast spatiotemporal data streams, enabling advanced data-driven analytics and decisi… ▽ More

    Submitted 22 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

  7. arXiv:2505.14212  [pdf, ps, other

    cs.CL cs.AI

    Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks

    Authors: Sizhe Yuen, Ting Su, Ziyang Wang, Yali Du, Adam J. Sobey

    Abstract: A question-answering (QA) system is to search suitable answers within a knowledge base. Current QA systems struggle with queries requiring complex reasoning or real-time knowledge integration. They are often supplemented with retrieval techniques on a data source such as Retrieval-Augmented Generation (RAG). However, RAG continues to face challenges in handling complex reasoning and logical connec… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  8. arXiv:2505.11997  [pdf, ps, other

    cs.CV

    Multimodal Cancer Survival Analysis via Hypergraph Learning with Cross-Modality Rebalance

    Authors: Mingcheng Qu, Guang Yang, Donglin Di, Tonghua Su, Yue Gao, Yang Song, Lei Fan

    Abstract: Multimodal pathology-genomic analysis has become increasingly prominent in cancer survival prediction. However, existing studies mainly utilize multi-instance learning to aggregate patch-level features, neglecting the information loss of contextual and hierarchical details within pathology images. Furthermore, the disparity in data granularity and dimensionality between pathology and genomics lead… ▽ More

    Submitted 20 May, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

    Comments: accepted by IJCAI2025 Code: https://github.com/MCPathology/MRePath

  9. arXiv:2505.11010  [pdf, other

    cs.CL cs.AI

    Review-Instruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models

    Authors: Jiangxu Wu, Cong Wang, TianHuang Su, Jun Yang, Haozhi Lin, Chao Zhang, Ming Peng, Kai Shi, SongPan Yang, BinQing Pan, ZiXian Li, Ni Yang, ZhenYu Yang

    Abstract: The effectiveness of large language models (LLMs) in conversational AI is hindered by their reliance on single-turn supervised fine-tuning (SFT) data, which limits contextual coherence in multi-turn dialogues. Existing methods for generating multi-turn dialogue data struggle to ensure both diversity and quality in instructions. To address this, we propose Review-Instruct, a novel framework that sy… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: ACL2025 Accepted

  10. arXiv:2505.08838  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts

    Authors: Peixuan Ge, Tongkun Su, Faqin Lv, Baoliang Zhao, Peng Zhang, Chi Hong Wong, Liang Yao, Yu Sun, Zenan Wang, Pak Kin Wong, Ying Hu

    Abstract: Ultrasound (US) report generation is a challenging task due to the variability of US images, operator dependence, and the need for standardized text. Unlike X-ray and CT, US imaging lacks consistent datasets, making automation difficult. In this study, we propose a unified framework for multi-organ and multilingual US report generation, integrating fragment-based multilingual training and leveragi… ▽ More

    Submitted 19 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

  11. arXiv:2504.19458  [pdf, other

    cs.MM cs.CL cs.IR

    Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective

    Authors: Taoyu Su, Jiawei Sheng, Duohe Ma, Xiaodong Li, Juwei Yue, Mengxiao Song, Yingkai Tang, Tingwen Liu

    Abstract: Multi-Modal Entity Alignment (MMEA) aims to retrieve equivalent entities from different Multi-Modal Knowledge Graphs (MMKGs), a critical information retrieval task. Existing studies have explored various fusion paradigms and consistency constraints to improve the alignment of equivalent entities, while overlooking that the visual modality may not always contribute positively. Empirically, entities… ▽ More

    Submitted 15 May, 2025; v1 submitted 27 April, 2025; originally announced April 2025.

    Comments: Accepted by SIGIR 2025, 11 pages, 10 figures, 4 tables,

  12. arXiv:2504.18594  [pdf, other

    cs.LG cs.AI

    A Simple DropConnect Approach to Transfer-based Targeted Attack

    Authors: Tongrui Su, Qingbin Li, Shengyu Zhu, Wei Chen, Xueqi Cheng

    Abstract: We study the problem of transfer-based black-box attack, where adversarial samples generated using a single surrogate model are directly applied to target models. Compared with untargeted attacks, existing methods still have lower Attack Success Rates (ASRs) in the targeted setting, i.e., the obtained adversarial examples often overfit the surrogate model but fail to mislead other models. In this… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  13. arXiv:2504.12027  [pdf, other

    cs.CV

    Understanding Attention Mechanism in Video Diffusion Models

    Authors: Bingyan Liu, Chengyu Wang, Tongtong Su, Huan Ten, Jun Huang, Kailing Guo, Kui Jia

    Abstract: Text-to-video (T2V) synthesis models, such as OpenAI's Sora, have garnered significant attention due to their ability to generate high-quality videos from a text prompt. In diffusion-based T2V models, the attention mechanism is a critical component. However, it remains unclear what intermediate features are learned and how attention blocks in T2V models affect various aspects of video synthesis, s… ▽ More

    Submitted 16 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

  14. arXiv:2504.10214  [pdf, other

    cs.CV

    Balancing Stability and Plasticity in Pretrained Detector: A Dual-Path Framework for Incremental Object Detection

    Authors: Songze Li, Qixing Xu, Tonghua Su, Xu-Yao Zhang, Zhongjie Wang

    Abstract: The balance between stability and plasticity remains a fundamental challenge in pretrained model-based incremental object detection (PTMIOD). While existing PTMIOD methods demonstrate strong performance on in-domain tasks aligned with pretraining data, their plasticity to cross-domain scenarios remains underexplored. Through systematic component-wise analysis of pretrained detectors, we reveal a f… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  15. arXiv:2504.06521  [pdf, other

    cs.CV

    DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning

    Authors: Songze Li, Tonghua Su, Xu-Yao Zhang, Qixing Xu, Zhongjie Wang

    Abstract: Pre-trained model-based continual learning (PTMCL) has garnered growing attention, as it enables more rapid acquisition of new knowledge by leveraging the extensive foundational understanding inherent in pre-trained model (PTM). Most existing PTMCL methods use Parameter-Efficient Fine-Tuning (PEFT) to learn new knowledge while consolidating existing memory. However, they often face some challenges… ▽ More

    Submitted 14 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

  16. arXiv:2504.06342  [pdf, other

    astro-ph.GA astro-ph.HE

    MACER3D -- an upgrade of MACER2D with enhanced subgrid models and gas physics -- and its application to simulating AGN feedback in a massive elliptical galaxy

    Authors: Haoen Zhang, Haojie Xia, Suoqing Ji, Feng Yuan, Minhang Guo, Rui Zhang, Bocheng Zhu, Yihuan Di, Aoyun He, Tingfang Su, Yuxuan Zou

    Abstract: We present MACER3D (Multiscale AGN-regulated Cosmic Ecosystem Resolver in 3D), a new suite of three-dimensional hydrodynamic simulations that study active galactic nuclei (AGN) feedback on galactic scales over Gyr duration, with major enhancement in subgrid models and gas physics over its predecessor -- MACER (Massive AGN Controlled Ellipticals Resolved) which is in two dimensions (hereafter MACER… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 28 pages, 10 figures, accepted for publication in ApJ

  17. arXiv:2504.00521  [pdf, other

    cs.SE cs.AI

    Automated detection of atomicity violations in large-scale systems

    Authors: Hang He, Yixing Luo, Chengcheng Wan, Ting Su, Haiying Sun, Geguang Pu

    Abstract: Atomicity violations in interrupt-driven programs pose a significant threat to software safety in critical systems. These violations occur when the execution sequence of operations on shared resources is disrupted by asynchronous interrupts. Detecting atomicity violations is challenging due to the vast program state space, application-level code dependencies, and complex domain-specific knowledge.… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  18. arXiv:2504.00380  [pdf, other

    cs.CV

    Hierarchical Flow Diffusion for Efficient Frame Interpolation

    Authors: Yang Hai, Guo Wang, Tan Su, Wenjie Jiang, Yinlin Hu

    Abstract: Most recent diffusion-based methods still show a large gap compared to non-diffusion methods for video frame interpolation, in both accuracy and efficiency. Most of them formulate the problem as a denoising procedure in latent space directly, which is less effective caused by the large latent space. We propose to model bilateral optical flow explicitly by hierarchical diffusion models, which has m… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

    Comments: Accepted by CVPR 2025

  19. arXiv:2503.17638  [pdf, other

    stat.AP

    Collective Wisdom: Policy Averaging with an Application to the Newsvendor Problem

    Authors: Xiangyu Cui, Nicholas G. Hall, Yun Shi, Tianyuan Su

    Abstract: We propose a Policy Averaging Approach (PAA) that synthesizes the strengths of existing approaches to create more reliable, flexible and justifiable policies for stochastic optimization problems. An important component of the PAA is risk diversification to reduce the randomness of policies. A second component emulates model averaging from statistics. A third component involves using cross-validati… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  20. arXiv:2503.16522  [pdf, other

    cs.CV

    Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow

    Authors: Yongjia Ma, Donglin Di, Xuan Liu, Xiaokai Chen, Lei Fan, Wei Chen, Tonghua Su

    Abstract: Rectified flow models have achieved remarkable performance in image and video generation tasks. However, existing numerical solvers face a trade-off between fast sampling and high-accuracy solutions, limiting their effectiveness in downstream applications such as reconstruction and editing. To address this challenge, we propose leveraging the Adams-Bashforth-Moulton (ABM) predictor-corrector metho… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  21. arXiv:2502.19008  [pdf, other

    cs.CL cs.AI

    Binary Neural Networks for Large Language Model: A Survey

    Authors: Liangdong Liu, Zhitong Zheng, Cong Wang, Tianhuang Su, Zhenyu Yang

    Abstract: Large language models (LLMs) have wide applications in the field of natural language processing(NLP), such as GPT-4 and Llama. However, with the exponential growth of model parameter sizes, LLMs bring significant resource overheads. Low-bit quantization, as a key technique, reduces memory usage and computational demands by decreasing the bit-width of model parameters, activations, and gradients. P… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: 23 pages, 7 figures

  22. arXiv:2502.02457  [pdf, other

    cs.CE cs.LG

    Orientation-aware interaction-based deep material network in polycrystalline materials modeling

    Authors: Ting-Ju Wei, Tung-Huan Su, Chuin-Shan Chen

    Abstract: Multiscale simulations are indispensable for connecting microstructural features to the macroscopic behavior of polycrystalline materials, but their high computational demands limit their practicality. Deep material networks (DMNs) have been proposed as efficient surrogate models, yet they fall short of capturing texture evolution. To address this limitation, we propose the orientation-aware inter… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  23. arXiv:2501.14659  [pdf, other

    cs.CV

    Towards Unified Structured Light Optimization

    Authors: Tinglei Wan, Tonghua Su, Zhongjie Wang

    Abstract: Structured light (SL) 3D reconstruction captures the precise surface shape of objects, providing high-accuracy 3D data essential for industrial inspection and robotic vision systems. However, current research on optimizing projection patterns in SL 3D reconstruction faces two main limitations: each scene requires separate training of calibration parameters, and optimization is restricted to specif… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

  24. arXiv:2501.10793  [pdf, other

    astro-ph.GA astro-ph.HE

    Modeling the Spectral Energy Distribution of Active Galactic Nuclei: Implications for Cosmological Simulations of Galaxy Formation

    Authors: Tong Su, Qi Guo, Erlin Qiao, Wenxiang Pei, Luis C. Ho, Cedric G. Lacey

    Abstract: Modeling the spectral energy distribution (SED) of active galactic nuclei (AGN) plays a very important role in constraining modern cosmological simulations of galaxy formation. Here, we utilize an advanced supermassive black hole (SMBH) accretion disk model to compute the accretion flow structure and AGN SED across a wide range of black hole mass ($M_{\rm SMBH}$) and dimensionless accretion rates… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

    Comments: Submitted to ApJ, comments are welcomed

  25. arXiv:2501.03582  [pdf, other

    quant-ph cond-mat.stat-mech

    Exact Decoding of Repetition Code under Circuit Level Noise

    Authors: Hanyan Cao, Shoukuan Zhao, Dongyang Feng, Zisong Shen, Haisheng Yan, Tang Su, Weijie Sun, Huikai Xu, Feng Pan, Haifeng Yu, Pan Zhang

    Abstract: Repetition code forms a fundamental basis for quantum error correction experiments. To date, it stands as the sole code that has achieved large distances and extremely low error rates. Its applications span the spectrum of evaluating hardware limitations, pinpointing hardware defects, and detecting rare events. However, current methods for decoding repetition codes under circuit level noise are su… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  26. arXiv:2501.02863  [pdf, other

    cs.SE

    Beyond Pass or Fail: Multi-Dimensional Benchmarking of Foundation Models for Goal-based Mobile UI Navigation

    Authors: Dezhi Ran, Mengzhou Wu, Hao Yu, Yuetong Li, Jun Ren, Yuan Cao, Xia Zeng, Haochuan Lu, Zexin Xu, Mengqian Xu, Ting Su, Liangchao Yao, Ting Xiong, Wei Yang, Yuetang Deng, Assaf Marron, David Harel, Tao Xie

    Abstract: Recent advances of foundation models (FMs) have made navigating mobile applications (apps) based on high-level goal instructions within reach, with significant industrial applications such as UI testing. While existing benchmarks evaluate FM-based UI navigation using the binary pass/fail metric, they have two major limitations: they cannot reflect the complex nature of mobile UI navigation where F… ▽ More

    Submitted 11 February, 2025; v1 submitted 6 January, 2025; originally announced January 2025.

  27. arXiv:2501.00873  [pdf, other

    cs.CV cs.LG

    Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation

    Authors: Mingjia Li, Shuang Li, Tongrui Su, Longhui Yuan, Jian Liang, Wei Li

    Abstract: Capitalizing on the complementary advantages of generative and discriminative models has always been a compelling vision in machine learning, backed by a growing body of research. This work discloses the hidden semantic structure within score-based generative models, unveiling their potential as effective discriminative priors. Inspired by our theoretical findings, we propose DUSA to exploit the s… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

    Comments: Accepted by NeurIPS 2024. Project page: https://kiwixr.github.io/projects/dusa

  28. arXiv:2412.19522  [pdf, other

    cs.CL

    Exploiting Domain-Specific Parallel Data on Multilingual Language Models for Low-resource Language Translation

    Authors: Surangika Ranathungaa, Shravan Nayak, Shih-Ting Cindy Huang, Yanke Mao, Tong Su, Yun-Hsiang Ray Chan, Songchen Yuan, Anthony Rinaldi, Annie En-Shiun Lee

    Abstract: Neural Machine Translation (NMT) systems built on multilingual sequence-to-sequence Language Models (msLMs) fail to deliver expected results when the amount of parallel data for a language, as well as the language's representation in the model are limited. This restricts the capabilities of domain-specific NMT systems for low-resource languages (LRLs). As a solution, parallel data from auxiliary d… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

  29. arXiv:2412.18208  [pdf, other

    quant-ph cs.LG

    Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search

    Authors: Thet Htar Su, Shaswot Shresthamali, Masaaki Kondo

    Abstract: This paper introduces a quantum framework for addressing reinforcement learning (RL) tasks, grounded in the quantum principles and leveraging a fully quantum model of the classical Markov decision process (MDP). By employing quantum concepts and a quantum search algorithm, this work presents the implementation and optimization of the agent-environment interactions entirely within the quantum domai… ▽ More

    Submitted 28 May, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

    Journal ref: Physical Review A (2025)

  30. arXiv:2412.12862  [pdf, other

    physics.med-ph

    Scatter correction for photon-counting detector based CBCT imaging

    Authors: Xin Zhang, Ting Su, Jiongtao Zhu, Hairong Zheng, Dong Liang, Yongshuai Ge

    Abstract: Objective: The aim of this study is to validate the effectiveness of an energy-modulated scatter correction method in suppressing scatter in photon-counting detector (PCD)-based cone beam CT (CBCT) imaging. Approach: The scatter correction method, named e-Grid, which was initially applied to dual-layer flat-panel detector (DLFPD)-based CBCT imaging, was tested for its performance in PCD-CBCT imagi… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  31. arXiv:2412.08065  [pdf, other

    eess.SY

    A Survey of Open-Source Power System Dynamic Simulators with Grid-Forming Inverter for Machine Learning Applications

    Authors: Tong Su, Jiangkai Peng, Alaa Selim, Junbo Zhao, Jin Tan

    Abstract: The emergence of grid-forming (GFM) inverter technology and the increasing role of machine learning in power systems highlight the need for evaluating the latest dynamic simulators. Open-source simulators offer distinct advantages in this field, being both free and highly customizable, which makes them well-suited for scientific research and validation of the latest models and methods. This paper… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

  32. arXiv:2412.06521  [pdf

    q-bio.GN

    Ancient DNA from 120-Million-Year-Old Lycoptera Fossils Reveals Evolutionary Insights

    Authors: Wan-Qian Zhao, Zhan-Yong Guo, Zeng-Yuan Tian, Tong-Fu Su, Gang-Qiang Cao, Zi-Xin Qi, Tian-Cang Qin, Wei Zhou, Jin-Yu Yang, Ming-Jie Chen, Xin-Ge Zhang, Chun-Yan Zhou, Chuan-Jia Zhu, Meng-Fei Tang, Di Wu, Mei-Rong Song, Yu-Qi Guo, Li-You Qiu, Fei Liang, Mei-Jun Li, Jun-Hui Geng, Li-Juan Zhao, Shu-Jie Zhang

    Abstract: High quality ancient DNA (aDNA) is essential for molecular paleontology. Due to DNA degradation and contamination by environmental DNA (eDNA), current research is limited to fossils less than 1 million years old. The study successfully extracted DNA from Lycoptera davidi fossils from the Early Cretaceous period, dating 120 million years ago. Using high-throughput sequencing, 1,258,901 DNA sequence… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: 14 pages,3 Figures

  33. arXiv:2412.04072  [pdf, other

    cs.LG

    Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics

    Authors: Mingcheng Qu, Yuncong Wu, Donglin Di, Anyang Su, Tonghua Su, Yang Song, Lei Fan

    Abstract: Spatial transcriptomics (ST) has emerged as an advanced technology that provides spatial context to gene expression. Recently, deep learning-based methods have shown the capability to predict gene expression from WSI data using ST data. Existing approaches typically extract features from images and the neighboring regions using pretrained models, and then develop methods to fuse this information t… ▽ More

    Submitted 8 December, 2024; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: 8 pages, 5 figures

  34. arXiv:2411.09577  [pdf, other

    cs.HC

    SimTube: Generating Simulated Video Comments through Multimodal AI and User Personas

    Authors: Yu-Kai Hung, Yun-Chien Huang, Ting-Yu Su, Yen-Ting Lin, Lung-Pan Cheng, Bryan Wang, Shao-Hua Sun

    Abstract: Audience feedback is crucial for refining video content, yet it typically comes after publication, limiting creators' ability to make timely adjustments. To bridge this gap, we introduce SimTube, a generative AI system designed to simulate audience feedback in the form of video comments before a video's release. SimTube features a computational pipeline that integrates multimodal data from the vid… ▽ More

    Submitted 17 November, 2024; v1 submitted 14 November, 2024; originally announced November 2024.

  35. Neural Network Certification Informed Power System Transient Stability Preventive Control with Renewable Energy

    Authors: Tong Su, Junbo Zhao

    Abstract: Existing machine learning-based surrogate modeling methods for transient stability constrained-optimal power flow (TSC-OPF) lack certifications in the presence of unseen disturbances or uncertainties. This may lead to divergence of TSC-OPF or insecure control strategies. This paper proposes a neural network certification-informed power system transient stability preventive control method consideri… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Journal ref: IEEE Transactions on Power Systems, 2025

  36. arXiv:2410.19025  [pdf, other

    cs.LG cs.AI

    Large Language Models for Financial Aid in Financial Time-series Forecasting

    Authors: Md Khairul Islam, Ayush Karmacharya, Timothy Sue, Judy Fox

    Abstract: Considering the difficulty of financial time series forecasting in financial aid, much of the current research focuses on leveraging big data analytics in financial services. One modern approach is to utilize "predictive analysis", analogous to forecasting financial trends. However, many of these time series data in Financial Aid (FA) pose unique challenges due to limited historical datasets and h… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

    Comments: GitHub link https://github.com/UVA-MLSys/Financial-Time-Series

  37. arXiv:2410.12099  [pdf, ps, other

    nucl-ex

    The EMC Effect of Tritium and Helium-3 from the JLab MARATHON Experiment

    Authors: D. Abrams, H. Albataineh, B. S. Aljawrneh, S. Alsalmi, D. Androic, K. Aniol, W. Armstrong, J. Arrington, H. Atac, T. Averett, C. Ayerbe Gayoso, X. Bai, J. Bane, S. Barcus, A. Beck, V. Bellini, H. Bhatt, D. Bhetuwal, D. Biswas, D. Blyth, W. Boeglin, D. Bulumulla, J. Butler, A. Camsonne, M. Carmignotto , et al. (109 additional authors not shown)

    Abstract: Measurements of the EMC effect in the tritium and helium-3 mirror nuclei are reported. The data were obtained by the MARATHON Jefferson Lab experiment, which performed deep inelastic electron scattering from deuterium and the three-body nuclei, using a cryogenic gas target system and the High Resolution Spectrometers of the Hall A Facility of the Lab. The data cover the Bjorken $x$ range from 0.20… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2104.05850

  38. arXiv:2410.07151  [pdf, other

    cs.CV

    FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset

    Authors: Donglin Di, He Feng, Wenzhang Sun, Yongjia Ma, Hao Li, Wei Chen, Xiaofei Gou, Tonghua Su, Xun Yang

    Abstract: Generating talking face videos from various conditions has recently become a highly popular research area within generative tasks. However, building a high-quality face video generation model requires a well-performing pre-trained backbone, a key obstacle that universal models fail to adequately address. Most existing works rely on universal video or image generation models and optimize control me… ▽ More

    Submitted 23 September, 2024; originally announced October 2024.

  39. arXiv:2409.07144  [pdf, other

    eess.IV

    Dual channel CW nnU-Net for 3D PET-CT Lesion Segmentation in 2024 autoPET III Challenge

    Authors: Ching-Wei Wang, Ting-Sheng Su, Keng-Wei Liu

    Abstract: PET/CT is extensively used in imaging malignant tumors because it highlights areas of increased glucose metabolism, indicative of cancerous activity. Accurate 3D lesion segmentation in PET/CT imaging is essential for effective oncological diagnostics and treatment planning. In this study, we developed an advanced 3D residual U-Net model for the Automated Lesion Segmentation in Whole-Body PET/CT -… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

  40. arXiv:2409.07035   

    cs.CC math.CO

    Approximately counting maximal independent set is equivalent to #SAT

    Authors: Hao Zhang, Tonghua Su

    Abstract: A maximal independent set is an independent set that is not a subset of any other independent set. It is also the key problem of mathematics, computer science, and other fields. A counting problem is a type of computational problem that associated with the number of solutions. Besides, counting problems help us better understand several fields such as algorithm analysis, complexity theory, artific… ▽ More

    Submitted 13 September, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: After discussion, this is already known in JCSS (with the arXiv:1411.6829),proving that approximately counting MIS in bipartite graphs is equivalent to #SAT under AP-reductions, it is a stronger result if it restricts to bipartite graphs, which implies it for general graphs. Therefore, this paper tends to be more of a direct proof exercise

  41. arXiv:2409.01994  [pdf, other

    cs.SE cs.CR

    BinPRE: Enhancing Field Inference in Binary Analysis Based Protocol Reverse Engineering

    Authors: Jiayi Jiang, Xiyuan Zhang, Chengcheng Wan, Haoyi Chen, Haiying Sun, Ting Su

    Abstract: Protocol reverse engineering (PRE) aims to infer the specification of network protocols when the source code is not available. Specifically, field inference is one crucial step in PRE to infer the field formats and semantics. To perform field inference, binary analysis based PRE techniques are one major approach category. However, such techniques face two key challenges - (1) the format inference… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: Accepted by ACM Conference on Computer and Communications Security (CCS) 2024

  42. arXiv:2408.17301  [pdf, ps, other

    math.AG

    Integral cohomology of dual boundary complexes is motivic

    Authors: Tao Su

    Abstract: In this note, we give a motivic characterization of the integral cohomology of dual boundary complexes of smooth quasi-projective complex algebraic varieties. As a corollary, the dual boundary complex of any stably affine space (of positive dimension) is contractible. In a separate paper [Su23], this corollary has been used by the author in his proof of the weak geometric P=W conjecture for very g… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 8 pages; Following the anonymous referee's suggestion, the original paper arXiv:2307.16657 (v3) has been separated into two: v4 of that paper keeps the main result; this one deals with the motivic part

    MSC Class: 14C15 (Primary) 14F45; 14C30 (Secondary)

  43. arXiv:2408.13855  [pdf, other

    cs.SE

    An Empirical Study of False Negatives and Positives of Static Code Analyzers From the Perspective of Historical Issues

    Authors: Han Cui, Menglei Xie, Ting Su, Chengyu Zhang, Shin Hwei Tan

    Abstract: Static code analyzers are widely used to help find program flaws. However, in practice the effectiveness and usability of such analyzers is affected by the problems of false negatives (FNs) and false positives (FPs). This paper aims to investigate the FNs and FPs of such analyzers from a new perspective, i.e., examining the historical issues of FNs and FPs of these analyzers reported by the mainta… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

  44. arXiv:2408.04943  [pdf, other

    physics.med-ph

    CBCT scatter correction with dual-layer flat-panel detector

    Authors: Xin Zhang, Jixiong Xie, Ting Su, Jiongtao Zhu, Han Cui, Yuhang Tan, Dongmei Xia, Hairong Zheng, Dong Liang, Yongshuai Ge

    Abstract: Background: Recently, the popularity of dual-layer flat-panel detector (DL-FPD) based dual-energy cone-beam CT (DE-CBCT) imaging has been increasing. However, the image quality of DE-CBCT remains constrained by the Compton scattered X-ray photons. Purpose: The objective of this study is to develop an energy-modulated scatter correction method for DL-FPD based CBCT imaging. Methods: In DL-FPD,… ▽ More

    Submitted 27 October, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  45. Multi-Purpose Architecture for Fast Reset and Protective Readout of Superconducting Qubits

    Authors: Jiayu Ding, Yulong Li, He Wang, Guangming Xue, Tang Su, Chenlu Wang, Weijie Sun, Feiyu Li, Yujia Zhang, Yang Gao, Jun Peng, Zhi Hao Jiang, Yang Yu, Haifeng Yu, Fei Yan

    Abstract: The ability to fast reset a qubit state is crucial for quantum information processing. However, to actively reset a qubit requires engineering a pathway to interact with a dissipative bath, which often comes with the cost of reduced qubit protection from the environment. Here, we present a novel multi-purpose architecture that enables fast reset and protection of superconducting qubits during cont… ▽ More

    Submitted 8 January, 2025; v1 submitted 31 July, 2024; originally announced July 2024.

    Comments: 8 pages, 4 figures

    Journal ref: Phys. Rev. Applied 23, 014012 (2025)

  46. arXiv:2407.20773  [pdf

    cs.AR

    UpDown: Programmable fine-grained Events for Scalable Performance on Irregular Applications

    Authors: Andronicus Rajasukumar, Jiya Su, Yuqing, Wang, Tianshuo Su, Marziyeh Nourian, Jose M Monsalve Diaz, Tianchi Zhang, Jianru Ding, Wenyi Wang, Ziyi Zhang, Moubarak Jeje, Henry Hoffmann, Yanjing Li, Andrew A. Chien

    Abstract: Applications with irregular data structures, data-dependent control flows and fine-grained data transfers (e.g., real-world graph computations) perform poorly on cache-based systems. We propose the UpDown accelerator that supports fine-grained execution with novel architecture mechanisms - lightweight threading, event-driven scheduling, efficient ultra-short threads, and split-transaction DRAM acc… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 14 pages, 23 figures

  47. arXiv:2407.19625  [pdf, other

    cs.CL cs.MM

    LoginMEA: Local-to-Global Interaction Network for Multi-modal Entity Alignment

    Authors: Taoyu Su, Xinghua Zhang, Jiawei Sheng, Zhenyu Zhang, Tingwen Liu

    Abstract: Multi-modal entity alignment (MMEA) aims to identify equivalent entities between two multi-modal knowledge graphs (MMKGs), whose entities can be associated with relational triples and related images. Most previous studies treat the graph structure as a special modality, and fuse different modality information with separate uni-modal encoders, neglecting valuable relational associations in modaliti… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: Accepted by ECAI 2024

  48. arXiv:2407.19302  [pdf, other

    cs.CL cs.MM

    IBMEA: Exploring Variational Information Bottleneck for Multi-modal Entity Alignment

    Authors: Taoyu Su, Jiawei Sheng, Shicheng Wang, Xinghua Zhang, Hongbo Xu, Tingwen Liu

    Abstract: Multi-modal entity alignment (MMEA) aims to identify equivalent entities between multi-modal knowledge graphs (MMKGs), where the entities can be associated with related images. Most existing studies integrate multi-modal information heavily relying on the automatically-learned fusion module, rarely suppressing the redundant information for MMEA explicitly. To this end, we explore variational infor… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM MM 2024

  49. arXiv:2407.18955  [pdf, other

    cs.CV

    Real Face Video Animation Platform

    Authors: Xiaokai Chen, Xuan Liu, Donglin Di, Yongjia Ma, Wei Chen, Tonghua Su

    Abstract: In recent years, facial video generation models have gained popularity. However, these models often lack expressive power when dealing with exaggerated anime-style faces due to the absence of high-quality anime-style face training sets. We propose a facial animation platform that enables real-time conversion from real human faces to cartoon-style faces, supporting multiple models. Built on the Gra… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  50. arXiv:2407.08949  [pdf, other

    cs.CV

    One-Shot Pose-Driving Face Animation Platform

    Authors: He Feng, Donglin Di, Yongjia Ma, Wei Chen, Tonghua Su

    Abstract: The objective of face animation is to generate dynamic and expressive talking head videos from a single reference face, utilizing driving conditions derived from either video or audio inputs. Current approaches often require fine-tuning for specific identities and frequently fail to produce expressive videos due to the limited effectiveness of Wav2Pose modules. To facilitate the generation of one-… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.