Skip to main content

Showing 201–250 of 1,936 results for author: Zhou, W

.
  1. arXiv:2501.08279  [pdf, ps, other

    cs.CV

    SmartEraser: Remove Anything from Images using Masked-Region Guidance

    Authors: Longtao Jiang, Zhendong Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Lei Shi, Dong Chen, Houqiang Li

    Abstract: Object removal has so far been dominated by the mask-and-inpaint paradigm, where the masked region is excluded from the input, leaving models relying on unmasked areas to inpaint the missing region. However, this approach lacks contextual information for the masked area, often resulting in unstable performance. In this work, we introduce SmartEraser, built with a new removing paradigm called Maske… ▽ More

    Submitted 11 June, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: Project at: https://longtaojiang.github.io/smarteraser.github.io/

    Journal ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025

  2. arXiv:2501.07579  [pdf

    physics.med-ph physics.bio-ph

    Correlation Between DNA Double-Strand Break Distribution in 3D Genome and Radiation-Induced Cell Death

    Authors: Ankang Hu, Wanyi Zhou, Xiyu Luo, Rui Qiu, Junli Li

    Abstract: The target theory is the most classical hypothesis explaining radiation-induced cell death, the physical or biological nature of the "target" remains ambiguous. This study hypothesizes that the distribution of DNA double-strand breaks (DSBs) within the 3D genome is a pivotal factor affecting the probability of radiation-induced cell death. We propose that clustered DSBs in DNA segments with high i… ▽ More

    Submitted 9 June, 2025; v1 submitted 27 December, 2024; originally announced January 2025.

    Comments: 19 pages, 6 figures, 1 supplementary document

    Journal ref: Radiation Research. 2025, 203(6): 421-432

  3. arXiv:2501.06835  [pdf, other

    cs.CV

    X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding

    Authors: Wenqi Zhou, Kai Cao, Hao Zheng, Xinyi Zheng, Miao Liu, Per Ola Kristensson, Walterio Mayol-Cuevas, Fan Zhang, Weizhe Lin, Junxiao Shen

    Abstract: Long-form egocentric video understanding provides rich contextual information and unique insights into long-term human behaviors, holding significant potential for applications in embodied intelligence, long-term activity analysis, and personalized assistive technologies. However, existing benchmark datasets primarily focus on single, short-duration videos or moderately long videos up to dozens of… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

  4. arXiv:2501.06689  [pdf, other

    cs.CL

    TAPO: Task-Referenced Adaptation for Prompt Optimization

    Authors: Wenxin Luo, Weirui Wang, Xiaopeng Li, Weibo Zhou, Pengyue Jia, Xiangyu Zhao

    Abstract: Prompt engineering can significantly improve the performance of large language models (LLMs), with automated prompt optimization (APO) gaining significant attention due to the time-consuming and laborious nature of manual prompt design. However, much of the existing work in APO overlooks task-specific characteristics, resulting in prompts that lack domain specificity and are not well-suited for ta… ▽ More

    Submitted 26 February, 2025; v1 submitted 11 January, 2025; originally announced January 2025.

    Comments: Accepted to ICASSP 2025

  5. arXiv:2501.06645  [pdf, other

    cs.CL cs.AI

    FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings

    Authors: Tong Liu, Xiao Yu, Wenxuan Zhou, Jindong Gu, Volker Tresp

    Abstract: Efficient preference optimization algorithms such as Direct Preference Optimization (DPO) have become a popular approach in aligning large language models (LLMs) with human preferences. These algorithms implicitly treat the LLM as a reward model, and focus on training it to correct misranked preference pairs. However, recent work~\citep{chen2024preference} empirically finds that DPO training \text… ▽ More

    Submitted 3 June, 2025; v1 submitted 11 January, 2025; originally announced January 2025.

    Comments: ACL 2025

  6. arXiv:2501.06590  [pdf, other

    cs.CL cs.AI

    ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

    Authors: Xiangru Tang, Tianyu Hu, Muyang Ye, Yanjun Shao, Xunjian Yin, Siru Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan, Mark Gerstein

    Abstract: Chemical reasoning usually involves complex, multi-step processes that demand precise calculations, where even minor errors can lead to cascading failures. Furthermore, large language models (LLMs) encounter difficulties handling domain-specific formulas, executing reasoning steps accurately, and integrating code effectively when tackling chemical reasoning tasks. To address these challenges, we p… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

  7. arXiv:2501.05107  [pdf

    cs.RO physics.app-ph

    Harnessing the Power of Vibration Motors to Develop Miniature Untethered Robotic Fishes

    Authors: Chongjie Jiang, Yingying Dai, Jinyang Le, Xiaomeng Chen, Yu Xie, Wei Zhou, Fuzhou Niu, Ying Li, Tao Luo

    Abstract: Miniature underwater robots play a crucial role in the exploration and development of marine resources, particularly in confined spaces and high-pressure deep-sea environments. This study presents the design, optimization, and performance of a miniature robotic fish, powered by the oscillation of bio-inspired fins. These fins feature a rigid-flexible hybrid structure and use an eccentric rotating… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: 8 pages, 8 figures

  8. arXiv:2501.04945  [pdf, ps, other

    cs.CL cs.AI

    Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models

    Authors: Qingyu Ren, Jie Zeng, Qianyu He, Jiaqing Liang, Yanghua Xiao, Weikang Zhou, Zeye Sun, Fei Yu

    Abstract: It is crucial for large language models (LLMs) to follow instructions that involve multiple constraints. However, it is an unexplored area to enhance LLMs' ability to follow soft constraints. To bridge the gap, we initially design a pipeline to construct datasets with high-quality outputs automatically. Additionally, to fully utilize the positive and negative samples generated during the data cons… ▽ More

    Submitted 31 May, 2025; v1 submitted 8 January, 2025; originally announced January 2025.

  9. arXiv:2501.04907  [pdf, other

    physics.optics

    Optical skyrmion lattices accelerating in free space

    Authors: Haijun Wu, Weijie Zhou, Zhihan Zhu, Yijie Shen

    Abstract: Generation and propagation of optical skyrmions provide a versatile plalform for topologically nontrivial optical informatics and light-matter interactions, but their acceleration along curved trajectories is to be studied. In this study, we experimentally demonstrate the first accelerating skyrmion lattices conveyed by Airy structured light, characterized by topologically stable skyrmion textures… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

  10. arXiv:2501.03936  [pdf, other

    cs.AI cs.CL

    PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

    Authors: Hao Zheng, Xinyan Guan, Hao Kong, Jia Zheng, Weixiang Zhou, Hongyu Lin, Yaojie Lu, Ben He, Xianpei Han, Le Sun

    Abstract: Automatically generating presentations from documents is a challenging task that requires accommodating content quality, visual appeal, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, overlooking visual appeal and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAge… ▽ More

    Submitted 21 February, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

    Comments: 8 pages, 23 figures, see https://github.com/icip-cas/PPTAgent for details

  11. arXiv:2501.02732  [pdf, other

    cs.LG cs.AI

    AFed: Algorithmic Fair Federated Learning

    Authors: Huiqiang Chen, Tianqing Zhu, Wanlei Zhou, Wei Zhao

    Abstract: Federated Learning (FL) has gained significant attention as it facilitates collaborative machine learning among multiple clients without centralizing their data on a server. FL ensures the privacy of participating clients by locally storing their data, which creates new challenges in fairness. Traditional debiasing methods assume centralized access to sensitive information, rendering them impracti… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems

  12. arXiv:2412.20833  [pdf, ps, other

    cs.CV cs.MM

    Inclusion 2024 Global Multimedia Deepfake Detection Challenge: Towards Multi-dimensional Face Forgery Detection

    Authors: Yi Zhang, Weize Gao, Changtao Miao, Man Luo, Jianshu Li, Wenzhong Deng, Zhe Li, Bingyu Hu, Weibin Yao, Yunfeng Diao, Wenbo Zhou, Tao Gong, Qi Chu

    Abstract: In this paper, we present the Global Multimedia Deepfake Detection held concurrently with the Inclusion 2024. Our Multimedia Deepfake Detection aims to detect automatic image and audio-video manipulations including but not limited to editing, synthesis, generation, Photoshop,etc. Our challenge has attracted 1500 teams from all over the world, with about 5000 valid result submission counts. We invi… ▽ More

    Submitted 3 June, 2025; v1 submitted 30 December, 2024; originally announced December 2024.

    Comments: Inclusion 2024 Global Multimedia Deepfake Detection Competition Top Team Technical Report

  13. arXiv:2412.20413  [pdf, other

    cs.CV

    EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers

    Authors: Daiheng Gao, Shilin Lu, Shaw Walters, Wenbo Zhou, Jiaming Chu, Jie Zhang, Bang Zhang, Mengxi Jia, Jian Zhao, Zhaoxin Fan, Weiming Zhang

    Abstract: Removing unwanted concepts from large-scale text-to-image (T2I) diffusion models while maintaining their overall generative quality remains an open challenge. This difficulty is especially pronounced in emerging paradigms, such as Stable Diffusion (SD) v3 and Flux, which incorporate flow matching and transformer-based architectures. These advancements limit the transferability of existing concept-… ▽ More

    Submitted 2 January, 2025; v1 submitted 29 December, 2024; originally announced December 2024.

    Comments: 24 pages, 18 figures

  14. arXiv:2412.20145  [pdf, other

    cs.CL

    Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering

    Authors: Wei Zhou, Mohsen Mesgar, Annemarie Friedrich, Heike Adel

    Abstract: Complex table question answering (TQA) aims to answer questions that require complex reasoning, such as multi-step or multi-category reasoning, over data represented in tabular form. Previous approaches demonstrated notable performance by leveraging either closed-source large language models (LLMs) or fine-tuned open-weight LLMs. However, fine-tuning LLMs requires high-quality training data, which… ▽ More

    Submitted 8 February, 2025; v1 submitted 28 December, 2024; originally announced December 2024.

    Comments: Accepted at NAACL 2025 Findings

  15. arXiv:2412.18933  [pdf, other

    cs.CV cs.MM eess.IV

    TINQ: Temporal Inconsistency Guided Blind Video Quality Assessment

    Authors: Yixiao Li, Xiaoyuan Yang, Weide Liu, Xin Jin, Xu Jia, Yukun Lai, Haotao Liu, Paul L Rosin, Wei Zhou

    Abstract: Blind video quality assessment (BVQA) has been actively researched for user-generated content (UGC) videos. Recently, super-resolution (SR) techniques have been widely applied in UGC. Therefore, an effective BVQA method for both UGC and SR scenarios is essential. Temporal inconsistency, referring to irregularities between consecutive frames, is relevant to video quality. Current BVQA approaches ty… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

  16. arXiv:2412.18895  [pdf, other

    nucl-th hep-th

    Effects of chiral symmetry restoration on dilepton production in heavy ion collisions

    Authors: Wen-Hao Zhou, Che Ming Ko, Kai-Jia Sun

    Abstract: Because of their weak interactions with the strongly interacting matter produced in relativistic heavy-ion collisions, dileptons provide an ideal probe of the early dynamics of these collisions. Here, we study dilepton production using a partonic transport model that is based on an extended Nambu-Jona-Lasinio (NJL) model. In this model, the in-medium quark masses decrease with increasing temperatu… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

    Comments: 8 pages, 9 figures

  17. arXiv:2412.18132  [pdf, ps, other

    math.CA

    On Tiling and Spectral Sets in $\mathbb Z_{p^2}\times\mathbb Z_{p^2}$

    Authors: Weiqi Zhou

    Abstract: Let $p$ be a prime number, it is shown that tiling and spectral sets coincide in $\mathbb Z_{p^2}\times\mathbb Z_{p^2}$ by considering equivalently symplectic spectral pairs. Symplectic structures appear naturally in time-frequency analysis and provides a perspective to reveal patterns that may not be so evident in the Euclidean setting. The main approach here is however still to count the size of… ▽ More

    Submitted 22 February, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: Expository improvements: The proof of the main theorem is now focused on those difficult cases, other simpler cases are moved to earlier sections. More details and helpful comments are added at various places

    MSC Class: 42A99; 05B45

  18. arXiv:2412.17724  [pdf

    physics.med-ph physics.ins-det physics.optics

    Comprehensive Optimization of Interferometric Diffusing Wave Spectroscopy (iDWS)

    Authors: Mingjun Zhao, Leah Dickstein, Akshay S. Nadig, Wenjun Zhou, Santosh Aparanji, Hector Garcia Estrada, Shing-Jiuan Liu, Ting Zhou, Weijian Yang, Aaron Lord, Vivek J. Srinivasan

    Abstract: It has been shown that light speckle fluctuations provide a means for noninvasive measurements of cerebral blood flow index (CBFi). While conventional Diffuse Correlation Spectroscopy (DCS) provides marginal brain sensitivity for CBFi in adult humans, new techniques have recently emerged to improve diffuse light throughput and thus, brain sensitivity. Here we further optimize one such approach, in… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Comments: 12 pages, 15 figures, 4 tables

  19. arXiv:2412.17632  [pdf, other

    cs.AI cs.CV cs.MM

    D-Judge: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance

    Authors: Renyang Liu, Ziyu Lyu, Wei Zhou, See-Kiong Ng

    Abstract: In Artificial Intelligence Generated Content (AIGC), distinguishing AI-synthesized images from natural ones remains a key challenge. Despite advancements in generative models, significant discrepancies persist. To systematically investigate and quantify these discrepancies, we introduce an AI-Natural Image Discrepancy accessing benchmark (\textit{D-Judge}) aimed at addressing the critical question… ▽ More

    Submitted 29 March, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

  20. arXiv:2412.16865  [pdf, ps, other

    math.CA

    Mutual Annihilation of Tiles

    Authors: Weiqi Zhou

    Abstract: Given $A\subset\mathbb Z_n^2$, the purpose of this article is to investigate when is the difference set $ΔA$ disjoint with the zero set of the Fourier transform of $A$. In the study of tiles in $\mathbb Z_n^2$, the author observed an interesting phenomenon that if $(A,B)$ is a tiling pair with $|A|=|B|$, then sometimes $(A,B)$ is also a spectral pair and vice versa. Moreover, in such cases actuall… ▽ More

    Submitted 26 March, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

    Comments: Added an example at the end to illustrate the main theorem, removed the computation for n=p case (trivial)

    MSC Class: 42A99; 05B45

  21. arXiv:2412.16822  [pdf, other

    cs.CV cs.AI cs.LG

    Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers

    Authors: Haoran You, Connelly Barnes, Yuqian Zhou, Yan Kang, Zhenbang Du, Wei Zhou, Lingzhi Zhang, Yotam Nitzan, Xiaoyang Liu, Zhe Lin, Eli Shechtman, Sohrab Amirghodsi, Yingyan Celine Lin

    Abstract: Diffusion Transformers (DiTs) have achieved state-of-the-art (SOTA) image generation quality but suffer from high latency and memory inefficiency, making them difficult to deploy on resource-constrained devices. One major efficiency bottleneck is that existing DiTs apply equal computation across all regions of an image. However, not all image tokens are equally important, and certain localized are… ▽ More

    Submitted 27 March, 2025; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: Accepted by CVPR 2025

  22. arXiv:2412.16720  [pdf, other

    cs.AI

    OpenAI o1 System Card

    Authors: OpenAI, :, Aaron Jaech, Adam Kalai, Adam Lerer, Adam Richardson, Ahmed El-Kishky, Aiden Low, Alec Helyar, Aleksander Madry, Alex Beutel, Alex Carney, Alex Iftimie, Alex Karpenko, Alex Tachard Passos, Alexander Neitz, Alexander Prokofiev, Alexander Wei, Allison Tam, Ally Bennett, Ananya Kumar, Andre Saraiva, Andrea Vallone, Andrew Duberstein, Andrew Kondrich , et al. (238 additional authors not shown)

    Abstract: The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. These advanced reasoning capabilities provide new avenues for improving the safety and robustness of our models. In particular, our models can reason about our safety policies in context when responding to potentially unsafe prompts, through deliberative alignment. This leads to state-of-the-ar… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  23. arXiv:2412.15957  [pdf, other

    cs.CL cs.AI cs.IR

    From General to Specific: Tailoring Large Language Models for Personalized Healthcare

    Authors: Ruize Shi, Hong Huang, Wei Zhou, Kehan Yin, Kai Zhao, Yun Zhao

    Abstract: The rapid development of large language models (LLMs) has transformed many industries, including healthcare. However, previous medical LLMs have largely focused on leveraging general medical knowledge to provide responses, without accounting for patient variability and lacking true personalization at the individual level. To address this, we propose a novel method called personalized medical langu… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  24. arXiv:2412.15738  [pdf, other

    q-fin.RM

    Risk spillovers between the BRICS and the U.S. staple grain futures markets

    Authors: Ying-Hui Shao, Yan-Hong Yang, Wei-Xing Zhou

    Abstract: This study examines contemporaneous and lagged spillover effects in BRICS staple grain futures markets and their linkages with U.S. markets. The results show that contemporaneous spillovers dominate, while net spillovers are driven by lagged connectedness. Systemic risk is lower in intra-BRICS markets compared to those including the U.S., highlighting the U.S. grain market's significant influence.… ▽ More

    Submitted 25 December, 2024; v1 submitted 20 December, 2024; originally announced December 2024.

    Comments: 22 pages, 11 figures

  25. arXiv:2412.14528  [pdf, other

    cs.CL

    Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

    Authors: Xiao Cui, Mo Zhu, Yulei Qin, Liang Xie, Wengang Zhou, Houqiang Li

    Abstract: Knowledge distillation (KD) has become a prevalent technique for compressing large language models (LLMs). Existing KD methods are constrained by the need for identical tokenizers (i.e., vocabularies) between teacher and student models, limiting their versatility in handling LLMs of different architecture families. In this paper, we introduce the Multi-Level Optimal Transport (MultiLevelOT), a nov… ▽ More

    Submitted 18 January, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI 2025 (Oral)

  26. arXiv:2412.13551  [pdf, other

    cs.CR

    Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration

    Authors: Xuhan Zuo, Minghao Wang, Tianqing Zhu, Shui Yu, Wanlei Zhou

    Abstract: Large language models (LLMs) have transformed the way computers understand and process human language, but using them effectively across different organizations remains still difficult. When organizations work together to improve LLMs, they face several main challenges. First, organizations hesitate to share their valuable data with others. Second, competition between organizations creates trust p… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

  27. Mass Acquisition of Dirac Fermions in Bi4I4 by Spontaneous Symmetry Breaking

    Authors: Ming Yang, Wenxuan Zhao, Dan Mu, Zhijian Shi, Jingyuan Zhong, Yaqi Li, Yundan Liu, Jianxin Zhong, Ningyan Cheng, Wei Zhou, Jianfeng Wang, Yan Shi, Ying Sun, Weichang Hao, Lexian Yang, Jincheng Zhuang, Yi Du

    Abstract: Massive Dirac fermions, which are essential for realizing novel topological phenomena, are expected to be generated from massless Dirac fermions by breaking the related symmetry, such as time-reversal symmetry (TRS) in topological insulators or crystal symmetry in topological crystalline insulators. Here, we report scanning tunneling microscopy and angle-resolved photoemission spectroscopy studies… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Journal ref: Physical Review Letters 133, 256601 (2024)

  28. arXiv:2412.13420  [pdf, other

    cs.SI

    BotSim: LLM-Powered Malicious Social Botnet Simulation

    Authors: Boyu Qiao, Kun Li, Wei Zhou, Shilong Li, Qianqian Lu, Songlin Hu

    Abstract: Social media platforms like X(Twitter) and Reddit are vital to global communication. However, advancements in Large Language Model (LLM) technology give rise to social media bots with unprecedented intelligence. These bots adeptly simulate human profiles, conversations, and interactions, disseminating large amounts of false information and posing significant challenges to platform regulation. To b… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  29. arXiv:2412.13103  [pdf, other

    cs.CL cs.AI

    AI PERSONA: Towards Life-long Personalization of LLMs

    Authors: Tiannan Wang, Meiling Tao, Ruoyu Fang, Huilin Wang, Shuai Wang, Yuchen Eleanor Jiang, Wangchunshu Zhou

    Abstract: In this work, we introduce the task of life-long personalization of large language models. While recent mainstream efforts in the LLM community mainly focus on scaling data and compute for improved capabilities of LLMs, we argue that it is also very important to enable LLM systems, or language agents, to continuously adapt to the diverse and ever-changing profiles of every distinct user and provid… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: Work in progress

  30. arXiv:2412.12888  [pdf, other

    cs.CV cs.AI

    ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction

    Authors: Zhongjie Duan, Qianyi Zhao, Cen Chen, Daoyuan Chen, Wenmeng Zhou, Yaliang Li, Yingda Chen

    Abstract: The emergence of diffusion models has significantly advanced image synthesis. The recent studies of model interaction and self-corrective reasoning approach in large language models offer new insights for enhancing text-to-image models. Inspired by these studies, we propose a novel method called ArtAug for enhancing text-to-image models in this paper. To the best of our knowledge, ArtAug is the fi… ▽ More

    Submitted 18 December, 2024; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: 18 pages, 8 figures

  31. arXiv:2412.12839  [pdf, other

    cs.AI

    From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle

    Authors: Kaustubh Vyas, Damien Graux, Yijun Yang, Sébastien Montella, Chenxin Diao, Wendi Zhou, Pavlos Vougiouklis, Ruofei Lai, Yang Ren, Keshuang Li, Jeff Z. Pan

    Abstract: In response to the call for agent-based solutions that leverage the ever-increasing capabilities of the deep models' ecosystem, we introduce Hive -- a comprehensive solution for selecting appropriate models and subsequently planning a set of atomic actions to satisfy the end-users' instructions. Hive operates over sets of models and, upon receiving natural language instructions (i.e. user queries)… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: Under review

  32. arXiv:2412.11476  [pdf, other

    cs.LG

    Vertical Federated Unlearning via Backdoor Certification

    Authors: Mengde Han, Tianqing Zhu, Lefeng Zhang, Huan Huo, Wanlei Zhou

    Abstract: Vertical Federated Learning (VFL) offers a novel paradigm in machine learning, enabling distinct entities to train models cooperatively while maintaining data privacy. This method is particularly pertinent when entities possess datasets with identical sample identifiers but diverse attributes. Recent privacy regulations emphasize an individual's \emph{right to be forgotten}, which necessitates the… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  33. arXiv:2412.11417  [pdf, other

    cs.AI cs.LG

    RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement

    Authors: Junjie Lin, Jian Zhao, Lin Liu, Yue Deng, Youpeng Zhao, Lanxiao Huang, Xia Lin, Wengang Zhou, Houqiang Li

    Abstract: Traditionally, AI development for two-player zero-sum games has relied on two primary techniques: decision trees and reinforcement learning (RL). A common approach involves using a fixed decision tree as one player's strategy while training an RL agent as the opponent to identify vulnerabilities in the decision tree, thereby improving its strategic strength iteratively. However, this process often… ▽ More

    Submitted 16 December, 2024; v1 submitted 15 December, 2024; originally announced December 2024.

    Comments: Length:10 pages. Figures:10 figures. Additional Notes:In this paper, we have introduced a novel hybrid approach which leverages the strengths of both RL and LLMs to itera- tively refine decision tree tactics, enhancing their performance and adaptability

    MSC Class: 68T05 ACM Class: I.2.6; I.2.11

  34. arXiv:2412.09130  [pdf, other

    hep-ph nucl-th

    Exploring the chiral magnetic effect in isobar collisions through Chiral Anomaly Transport

    Authors: Zilin Yuan, Anping Huang, Guannan Xie, Wen-Hao Zhou, Guo-Liang Ma, Mei Huang

    Abstract: We investigate the signal of the chiral magnetic effect (CME) in Au+Au collisions and isobar collisions of $_{44}^{96}\text{Ru}+\rm{} _{44}^{96}Ru$ and $_{40}^{96}\text{Zr}+\rm{}_{40}^{96}Zr$ in the newly developed chiral anomaly transport (CAT) module based on the state-of-the-art model a multiphase transport (AMPT). Our numerical simulation results for the ratio charge correlation $Δγ$ in Ru+Ru… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: 13 pages, 15 figures

  35. arXiv:2412.08898  [pdf, ps, other

    eess.SY

    Updated version "Robust Voltage Regulation of DC-DC Buck Converter With ZIP Load via An Energy Shaping Control Approach"

    Authors: Wei He, Yanqin Zhang, Yukai Shang, Mohammad Masoud Namazi, Wangping Zhou, Josep M. Guerrero

    Abstract: ZIP loads (the parallel combination of constant impedance loads, constant current loads and constant power loads) exist widely in power system. In order to stabilize buck converter based DC distributed system with ZIP load, an adaptive energy shaping controller (AESC) is devised in this paper. Firstly, based on the assumption that lumped disturbances are known, a full information controller is des… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  36. arXiv:2412.08082  [pdf, other

    cs.CV

    FaceTracer: Unveiling Source Identities from Swapped Face Images and Videos for Fraud Prevention

    Authors: Zhongyi Zhang, Jie Zhang, Wenbo Zhou, Xinghui Zhou, Qing Guo, Weiming Zhang, Tianwei Zhang, Nenghai Yu

    Abstract: Face-swapping techniques have advanced rapidly with the evolution of deep learning, leading to widespread use and growing concerns about potential misuse, especially in cases of fraud. While many efforts have focused on detecting swapped face images or videos, these methods are insufficient for tracing the malicious users behind fraudulent activities. Intrusive watermark-based approaches also fail… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 17 pages, 18 figures, under review

  37. arXiv:2412.07575  [pdf, other

    cs.CR

    Defending Against Neural Network Model Inversion Attacks via Data Poisoning

    Authors: Shuai Zhou, Dayong Ye, Tianqing Zhu, Wanlei Zhou

    Abstract: Model inversion attacks pose a significant privacy threat to machine learning models by reconstructing sensitive data from their outputs. While various defenses have been proposed to counteract these attacks, they often come at the cost of the classifier's utility, thus creating a challenging trade-off between privacy protection and model utility. Moreover, most existing defenses require retrainin… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

  38. arXiv:2412.06521  [pdf

    q-bio.GN

    Ancient DNA from 120-Million-Year-Old Lycoptera Fossils Reveals Evolutionary Insights

    Authors: Wan-Qian Zhao, Zhan-Yong Guo, Zeng-Yuan Tian, Tong-Fu Su, Gang-Qiang Cao, Zi-Xin Qi, Tian-Cang Qin, Wei Zhou, Jin-Yu Yang, Ming-Jie Chen, Xin-Ge Zhang, Chun-Yan Zhou, Chuan-Jia Zhu, Meng-Fei Tang, Di Wu, Mei-Rong Song, Yu-Qi Guo, Li-You Qiu, Fei Liang, Mei-Jun Li, Jun-Hui Geng, Li-Juan Zhao, Shu-Jie Zhang

    Abstract: High quality ancient DNA (aDNA) is essential for molecular paleontology. Due to DNA degradation and contamination by environmental DNA (eDNA), current research is limited to fossils less than 1 million years old. The study successfully extracted DNA from Lycoptera davidi fossils from the Early Cretaceous period, dating 120 million years ago. Using high-throughput sequencing, 1,258,901 DNA sequence… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: 14 pages,3 Figures

  39. arXiv:2412.05830  [pdf, other

    cs.CR cs.AI

    Large Language Models Merging for Enhancing the Link Stealing Attack on Graph Neural Networks

    Authors: Faqian Guan, Tianqing Zhu, Wenhan Chang, Wei Ren, Wanlei Zhou

    Abstract: Graph Neural Networks (GNNs), specifically designed to process the graph data, have achieved remarkable success in various applications. Link stealing attacks on graph data pose a significant privacy threat, as attackers aim to extract sensitive relationships between nodes (entities), potentially leading to academic misconduct, fraudulent transactions, or other malicious activities. Previous studi… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: Link Stealing Attacks, Large Language Models, Graph Neural Networks, Privacy Attacks, Model Merging

  40. arXiv:2412.04606  [pdf, other

    cs.AI cs.CL

    Semantic Consistency-Based Uncertainty Quantification for Factuality in Radiology Report Generation

    Authors: Chenyu Wang, Weichao Zhou, Shantanu Ghosh, Kayhan Batmanghelich, Wenchao Li

    Abstract: Radiology report generation (RRG) has shown great potential in assisting radiologists by automating the labor-intensive task of report writing. While recent advancements have improved the quality and coherence of generated reports, ensuring their factual correctness remains a critical challenge. Although generative medical Vision Large Language Models (VLLMs) have been proposed to address this iss… ▽ More

    Submitted 16 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  41. arXiv:2412.03148  [pdf, other

    cs.CL cs.AI cs.CY

    Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media

    Authors: Kun Li, Chenwei Dai, Wei Zhou, Songlin Hu

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in role-playing tasks. However, there is limited research on whether LLMs can accurately simulate user behavior in real-world scenarios, such as social media. This requires models to effectively analyze a user's history and simulate their role. In this paper, we introduce \textbf{FineRob}, a novel fine-grained behavior simulati… ▽ More

    Submitted 4 December, 2024; originally announced December 2024.

  42. arXiv:2412.02721  [pdf, other

    physics.plasm-ph nucl-ex

    Advancing Tritium Self-Sufficiency in Fusion Power Plants: Insights from the BABY Experiment

    Authors: Remi Delaporte-Mathurin, Nikola Goles, John Ball, Collin Dunn, Emily Edwards, Sara Ferry, Edward Lamere, Andrew Lanzrath, Rick Leccacorvi, Samuele Meschini, Ethan Peterson, Stefano Segantin, Rui Vieira, Dennis Whyte, Weiyue Zhou, Kevin Woller

    Abstract: In the pursuit of fusion power, achieving tritium self-sufficiency stands as a pivotal challenge. Tritium breeding within molten salts is a critical aspect of next-generation fusion reactors, yet experimental measurements of \gls{tbr} have remained elusive. Here we present the results of the \gls{baby} experiment, which represents a pioneering effort in tritium research by utilizing high-energy (\… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  43. arXiv:2412.02685  [pdf, other

    cs.CL cs.AI cs.LG

    T-REG: Preference Optimization with Token-Level Reward Regularization

    Authors: Wenxuan Zhou, Shujian Zhang, Lingxiao Zhao, Tao Meng

    Abstract: Reinforcement learning from human feedback (RLHF) has been crucial in aligning large language models (LLMs) with human values. Traditionally, RLHF involves generating responses to a query and using a reward model to assign a reward to the entire response. However, this approach faces challenges due to its reliance on a single, sparse reward, which makes it challenging for the model to identify whi… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  44. arXiv:2412.00348  [pdf, other

    cs.CV

    Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey

    Authors: Wei Zhou, Lei Zhao, Runyu Zhang, Yifan Cui, Hongpu Huang, Kun Qie, Chen Wang

    Abstract: Traffic Surveillance Systems (TSS) have become increasingly crucial in modern intelligent transportation systems, with vision-based technologies playing a central role for scene perception and understanding. While existing surveys typically focus on isolated aspects of TSS, a comprehensive analysis bridging low-level and high-level perception tasks, particularly considering emerging technologies,… ▽ More

    Submitted 29 November, 2024; originally announced December 2024.

  45. arXiv:2411.19445  [pdf

    physics.optics

    Achromatic single-layer hologram

    Authors: Zhi Li, Wenhui Zhou, Xin Yuan, Weiwei Cai, Dongdong Teng, Qiang Song, Huigao Duan

    Abstract: Phase retrieval is a fundamental technique of advanced optical technologies, enabling precise control over wavefront properties. A persistent challenge in diffractive optical element (DOE) design is that a single hologram typically operates within a single wavelength or color channel, limiting it to monochromatic image generation. This limitation in channel capacity significantly restricts the app… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

  46. arXiv:2411.19194  [pdf

    physics.med-ph

    Influencing Factors of the FLASH Effect: Unveiling the Importance of Free Radicals

    Authors: Yan Zhang, Chenyang Huang, Ankang Hu, Yucheng Wang, Wanyi Zhou, Jiaqi Qiu, Jian Wang, Qibin Fu, Tuchen Huang, Hao Zha, Wei Wang, Xiaowu Deng, Junli Li

    Abstract: Purpose: Our aim was to elucidate the critical factors responsible for inducing the FLASH effect, focusing on the role of free radicals through simulation and experimental approaches. Methods and Materials: The whole abdomen of C57BL/6 mice was irradiated with 6 MeV electron beam. The endpoint was acute intestinal toxicity quantified by histological score. Total doses ranging from 6 to 15 Gy were… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

    Comments: 15 pages, 4 figures, 1 table

  47. arXiv:2411.19062  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Unveiling the anisotropy of linear and nonlinear charge-spin conversion in Weyl semimetal TaIrTe4

    Authors: Tao Tang, Mengzhou Li, Bin Lao, Xuan Zheng, Wei Zhou, Xiaofeng Xu, Jie Pang, You-guo Shi, Run-Wei Li, Zhiming Wang

    Abstract: In Weyl semimetals, the nonlinear planar Hall effect (NPHE) and spin-orbit torque (SOT) are prominent manifestations of nonlinear and linear charge-spin conversion, respectively. However, simultaneous investigations of these phenomena within a single material system are scarce, limiting our understanding of their intrinsic connection and underlying mechanisms. Here, we report the first simultaneou… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

  48. The binary Yarkovsky effect on the primary asteroid with applications to singly synchronous binary asteroids

    Authors: Wen-Han Zhou

    Abstract: The binary Yarkovsky effect on the secondary asteroid (BYS) was recently discovered to influence binary asteroid systems by pushing the secondary asteroid toward a synchronous orbit on a short timescale. However, the binary Yarkovsky effect on the primary (BYP) remains less understood, partly due to non-linear effects from partial eclipses, but could have significant implications for singly synchr… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

    Comments: 7 pages, 5 figures. Published in A&A Letters

  49. arXiv:2411.18197  [pdf, other

    cs.GR cs.CV

    Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

    Authors: Zhiyang Guo, Jinxu Xiang, Kai Ma, Wengang Zhou, Houqiang Li, Ran Zhang

    Abstract: 3D characters are essential to modern creative industries, but making them animatable often demands extensive manual work in tasks like rigging and skinning. Existing automatic rigging tools face several limitations, including the necessity for manual annotations, rigid skeleton topologies, and limited generalization across diverse shapes and poses. An alternative approach is to generate animatabl… ▽ More

    Submitted 11 March, 2025; v1 submitted 27 November, 2024; originally announced November 2024.

    Comments: CVPR 2025. Project page: https://jasongzy.github.io/Make-It-Animatable/

  50. arXiv:2411.15714  [pdf, other

    cs.CV

    ROOT: VLM based System for Indoor Scene Understanding and Beyond

    Authors: Yonghui Wang, Shi-Yong Chen, Zhenxing Zhou, Siyi Li, Haoran Li, Wengang Zhou, Houqiang Li

    Abstract: Recently, Vision Language Models (VLMs) have experienced significant advancements, yet these models still face challenges in spatial hierarchical reasoning within indoor scenes. In this study, we introduce ROOT, a VLM-based system designed to enhance the analysis of indoor scenes. Specifically, we first develop an iterative object perception algorithm using GPT-4V to detect object entities within… ▽ More

    Submitted 23 November, 2024; originally announced November 2024.