Skip to main content

Showing 1–50 of 405 results for author: Wei, M

.
  1. arXiv:2505.09527  [pdf

    q-bio.OT

    Artificial intelligence-enabled precision medicine for inflammatory skin diseases

    Authors: Alice Tang, Maria Wei, Anna Haemel, Cindy La, Marina Sirota, Ernest Y. Lee

    Abstract: Recent advances in artificial intelligence (AI) and multimodal data collection are revolutionizing dermatology. Generative AI and machine learning approaches offer opportunities to enhance the diagnosis and treatment of inflammatory skin diseases, including atopic dermatitis, psoriasis, hidradenitis suppurativa, and autoimmune connective tissue disease. This review examines the current landscape o… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  2. arXiv:2505.08712  [pdf, ps, other

    cs.RO

    NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance

    Authors: Wenzhe Cai, Jiaqi Peng, Yuqiang Yang, Yujian Zhang, Meng Wei, Hanqing Wang, Yilun Chen, Tai Wang, Jiangmiao Pang

    Abstract: Learning navigation in dynamic open-world environments is an important yet challenging skill for robots. Most previous methods rely on precise localization and mapping or learn from expensive real-world demonstrations. In this paper, we propose the Navigation Diffusion Policy (NavDP), an end-to-end framework trained solely in simulation and can zero-shot transfer to different embodiments in divers… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 14 pages, 6 figures

  3. arXiv:2505.06784  [pdf, ps, other

    q-bio.QM

    Hillclimb-Causal Inference: A Data-Driven Approach to Identify Causal Pathways Among Parental Behaviors, Genetic Risk, and Externalizing Behaviors in Children

    Authors: Mengman Wei, Qian Peng

    Abstract: Motivation: Externalizing behaviors in children, such as aggression, hyperactivity, and defiance, are influenced by complex interplays between genetic predispositions and environmental factors, particularly parental behaviors. Unraveling these intricate causal relationships can benefit from the use of robust data-driven methods. Methods: We developed a method called Hillclimb-Causal Inference, a… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  4. arXiv:2505.05766  [pdf, ps, other

    astro-ph.HE

    Measurement of separate electron and positron spectra from 10 GeV to 20GeV with the geomagnetic field on DAMPE

    Authors: DAMPE Collaboration, F. Alemanno, Q. An, P. Azzarello, F. C. T. Barbato, P. Bernardini, X. J. Bi, H. Boutin, I. Cagnoli, M. S. Cai, E. Casilli, E. Catanzani, J. Chang, D. Y. Chen, J. L. Chen, Z. F. Chen, Z. X. Chen, P. Coppin, M. Y. Cui, T. S. Cui, Y. X. Cui, I. DeMitri, F. dePalma, A. DiGiovanni, T. K. Dong , et al. (127 additional authors not shown)

    Abstract: The cosmic-ray (CR) electrons and positrons in space are of great significance for studying the origin and propagation of cosmic-rays. The satellite-borne experiment DArk Matter Particle Explorer (DAMPE) has been used to measure the separate electron and positron spectra, as well as the positron fraction. In this work, the Earth's magnetic field is used to distinguish CR electrons and positrons, a… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 18 pages, 5 figures

  5. arXiv:2505.04369  [pdf, other

    cs.CV

    WDMamba: When Wavelet Degradation Prior Meets Vision Mamba for Image Dehazing

    Authors: Jie Sun, Heng Liu, Yongzhen Wang, Xiao-Ping Zhang, Mingqiang Wei

    Abstract: In this paper, we reveal a novel haze-specific wavelet degradation prior observed through wavelet transform analysis, which shows that haze-related information predominantly resides in low-frequency components. Exploiting this insight, we propose a novel dehazing framework, WDMamba, which decomposes the image dehazing task into two sequential stages: low-frequency restoration followed by detail en… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  6. arXiv:2505.03981  [pdf, ps, other

    cs.AI cs.CL cs.LG

    X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

    Authors: Qianchu Liu, Sheng Zhang, Guanghui Qin, Timothy Ossowski, Yu Gu, Ying Jin, Sid Kiblawi, Sam Preston, Mu Wei, Paul Vozila, Tristan Naumann, Hoifung Poon

    Abstract: Recent proprietary models (e.g., o3) have begun to demonstrate strong multimodal reasoning capabilities. Yet, most existing open-source research concentrates on training text-only reasoning models, with evaluations limited to mainly mathematical and general-domain tasks. Therefore, it remains unclear how to effectively extend reasoning capabilities beyond text input and general domains. This paper… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  7. arXiv:2505.02332  [pdf, other

    physics.plasm-ph

    Record Magnetic Field Generation by Laser-Driven Capacitor-Coil Targets

    Authors: Lan Gao, Yang Zhang, Hantao Ji, Brandon K. Russell, Geoffrey Pomraning, Jesse Griff-McMahon, Sallee Klein, Carolyn Kuranz, Mingsheng Wei

    Abstract: Magnetic fields generated by capacitor-coil targets driven by intense short-pulse lasers have been characterized using ultrafast proton radiography. A 1-kJ, 15-ps laser at a center wavelength of 1053 nm irradiated the back plate of the capacitor with an intensity of $\sim$8.3 $\times$ 10$^{18}$ W$/$cm$^{2}$, creating ultra large currents in the connecting coils. High-quality proton data obtained i… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  8. arXiv:2505.02326  [pdf, other

    physics.plasm-ph

    Determining Magnetic and Electric Field Generations in Laser-Driven Coil Targets

    Authors: Yang Zhang, Lan Gao, Hantao Ji, Brandon K. Russell, Geoffrey Pomraning, Jesse Griff-McMahon, Sallee Klein, Carolyn Kuranz, Mingsheng Wei

    Abstract: Laser-driven capacitor coils are widely used to generate intense magnetic fields for various applications in high-energy-density physics research. Accurate measurement of the magnetic fields is essential but challenging, due to the overlapping contributions from magnetic and electric fields in proton radiography, which is the primary tool diagnosing the field generation around the coils. In this s… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  9. arXiv:2504.21497  [pdf, other

    cs.CV

    MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance

    Authors: Mengting Wei, Yante Li, Tuomas Varanka, Yan Jiang, Guoying Zhao

    Abstract: In this study, we propose a method for video face reenactment that integrates a 3D face parametric model into a latent diffusion framework, aiming to improve shape consistency and motion control in existing video-based face generation approaches. Our approach employs the FLAME (Faces Learned with an Articulated Model and Expressions) model as the 3D face parametric representation, providing a unif… ▽ More

    Submitted 10 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

  10. arXiv:2504.17414  [pdf, ps, other

    cs.CV

    3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models

    Authors: Min Wei, Chaohui Yu, Jingkai Zhou, Fan Wang

    Abstract: Video try-on replaces clothing in videos with target garments. Existing methods struggle to generate high-quality and temporally consistent results when handling complex clothing patterns and diverse body poses. We present 3DV-TON, a novel diffusion-based framework for generating high-fidelity and temporally consistent video try-on results. Our approach employs generated animatable textured 3D mes… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: Project page: https://2y7c3.github.io/3DV-TON/

  11. arXiv:2504.15223  [pdf

    cs.LG

    A Deep Learning Framework for Sequence Mining with Bidirectional LSTM and Multi-Scale Attention

    Authors: Tao Yang, Yu Cheng, Yaokun Ren, Yujia Lou, Minggu Wei, Honghui Xin

    Abstract: This paper addresses the challenges of mining latent patterns and modeling contextual dependencies in complex sequence data. A sequence pattern mining algorithm is proposed by integrating Bidirectional Long Short-Term Memory (BiLSTM) with a multi-scale attention mechanism. The BiLSTM captures both forward and backward dependencies in sequences, enhancing the model's ability to perceive global cont… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  12. arXiv:2504.14977  [pdf, other

    cs.CV

    RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild

    Authors: Jingkai Zhou, Yifan Wu, Shikai Li, Min Wei, Chao Fan, Weihua Chen, Wei Jiang, Fan Wang

    Abstract: Controllable character animation remains a challenging problem, particularly in handling rare poses, stylized characters, character-object interactions, complex illumination, and dynamic scenes. To tackle these issues, prior work has largely focused on injecting pose and appearance guidance via elaborate bypass networks, but often struggles to generalize to open-world scenarios. In this paper, we… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Project Page: https://thefoxofsky.github.io/project_pages_new/RealisDance-DiT/index

  13. arXiv:2504.08685  [pdf, other

    cs.CV cs.AI

    Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

    Authors: Team Seawead, Ceyuan Yang, Zhijie Lin, Yang Zhao, Shanchuan Lin, Zhibei Ma, Haoyuan Guo, Hao Chen, Lu Qi, Sen Wang, Feng Cheng, Feilong Zuo, Xuejiao Zeng, Ziyan Yang, Fangyuan Kong, Meng Wei, Zhiwu Qing, Fei Xiao, Tuyen Hoang, Siyu Zhang, Peihao Zhu, Qi Zhao, Jiangqiao Yan, Liangke Gui, Sheng Bi , et al. (30 additional authors not shown)

    Abstract: This technical report presents a cost-efficient strategy for training a video generation foundation model. We present a mid-sized research model with approximately 7 billion parameters (7B) called Seaweed-7B trained from scratch using 665,000 H100 GPU hours. Despite being trained with moderate computational resources, Seaweed-7B demonstrates highly competitive performance compared to contemporary… ▽ More

    Submitted 4 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: Technical report (some typos fixed)

  14. arXiv:2504.05135  [pdf, other

    cs.CV

    DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration

    Authors: Jiamei Xiong, Xuefeng Yan, Yongzhen Wang, Wei Zhao, Xiao-Ping Zhang, Mingqiang Wei

    Abstract: Image restoration under adverse weather conditions is a critical task for many vision-based applications. Recent all-in-one frameworks that handle multiple weather degradations within a unified model have shown potential. However, the diversity of degradation patterns across different weather conditions, as well as the complex and varied nature of real-world degradations, pose significant challeng… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  15. arXiv:2503.18843  [pdf, other

    physics.plasm-ph physics.acc-ph physics.optics quant-ph

    Experimental Evidence of Vortex $γ$ Photons in All-Optical Inverse Compton Scattering

    Authors: Mingxuan Wei, Siyu Chen, Yu Wang, Xichen Hu, Mingyang Zhu, Hao Hu, Pei-Lun He, Weijun Zhou, Jiao Jia, Li Lu, Boyuan Li, Feng Liu, Min Chen, Liming Chen, Jian-Xing Li, Wenchao Yan, Jie Zhang

    Abstract: Vortex $γ$ photons carrying orbital angular momenta (OAM) hold great potential for various applications. However, their generation remains a great challenge. Here, we successfully generate sub-MeV vortex $γ$ photons via all-optical inverse Compton scattering of relativistic electrons colliding with a sub-relativistic Laguerre-Gaussian laser. In principle, directly measuring the OAM of $γ$ photons… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 8 pages, 4 figures

  16. arXiv:2503.15949  [pdf, other

    cs.CV

    CausalCLIPSeg: Unlocking CLIP's Potential in Referring Medical Image Segmentation with Causal Intervention

    Authors: Yaxiong Chen, Minghong Wei, Zixuan Zheng, Jingliang Hu, Yilei Shi, Shengwu Xiong, Xiao Xiang Zhu, Lichao Mou

    Abstract: Referring medical image segmentation targets delineating lesions indicated by textual descriptions. Aligning visual and textual cues is challenging due to their distinct data properties. Inspired by large-scale pre-trained vision-language models, we propose CausalCLIPSeg, an end-to-end framework for referring medical image segmentation that leverages CLIP. Despite not being trained on medical data… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: MICCAI 2024

  17. arXiv:2503.15144  [pdf, other

    cs.CV

    PointSFDA: Source-free Domain Adaptation for Point Cloud Completion

    Authors: Xing He, Zhe Zhu, Liangliang Nan, Honghua Chen, Jing Qin, Mingqiang Wei

    Abstract: Conventional methods for point cloud completion, typically trained on synthetic datasets, face significant challenges when applied to out-of-distribution real-world scans. In this paper, we propose an effective yet simple source-free domain adaptation framework for point cloud completion, termed \textbf{PointSFDA}. Unlike unsupervised domain adaptation that reduces the domain gap by directly lever… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  18. arXiv:2503.11410  [pdf, ps, other

    quant-ph

    Remote preparation of motional Schrödinger cat states via dissipatively-driven non-Gaussian mechanical entanglement

    Authors: Zunbo Yu, Miaomiao Wei, Huatang Tan

    Abstract: In this paper, we propose a driven-dissipative scheme for generating non-Gaussian mechanical entangled states and remotely preparing mechanical Schrödinger cat states via the entanglement. The system under study consists of a cavity optomechanical setup with two frequency-mismatched mechanical oscillators coupled to a cavity field driven by a bichromatic pump. We show that under proper conditions,… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  19. arXiv:2503.11038  [pdf, other

    cs.CV

    ACMo: Attribute Controllable Motion Generation

    Authors: Mingjie Wei, Xuemei Xie, Guangming Shi

    Abstract: Attributes such as style, fine-grained text, and trajectory are specific conditions for describing motion. However, existing methods often lack precise user control over motion attributes and suffer from limited generalizability to unseen motions. This work introduces an Attribute Controllable Motion generation architecture, to address these challenges via decouple any conditions and control them… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  20. arXiv:2503.10592  [pdf, other

    cs.CV

    CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

    Authors: Hao He, Ceyuan Yang, Shanchuan Lin, Yinghao Xu, Meng Wei, Liangke Gui, Qi Zhao, Gordon Wetzstein, Lu Jiang, Hongsheng Li

    Abstract: This paper introduces CameraCtrl II, a framework that enables large-scale dynamic scene exploration through a camera-controlled video diffusion model. Previous camera-conditioned video generative models suffer from diminished video dynamics and limited range of viewpoints when generating videos with large camera movement. We take an approach that progressively expands the generation of dynamic sce… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Project page: https://hehao13.github.io/Projects-CameraCtrl-II/

  21. Steady-state tripartite non-Gaussian entanglement and steering in output field from intracavity triple-photon parametric downconversion

    Authors: Miaomiao Wei, Huatang Tan

    Abstract: Nondegenerate triple-photon parametric downconversion (NTPD) is a potential source for unconditional tripartite non-Gaussian entangled states of continuous variables. Recent experiment has demonstrated strong third-order correlations among bright photon triplets via microwave NTPD in a superconducting cavity [Phys. Rev. X 10, 011011 (2020)]. Previous theoretic works have revealed that only short-t… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Published by Phys. Rev. A

  22. arXiv:2503.07224  [pdf, ps, other

    quant-ph

    Optomechanical non-Gaussian quantum steering and remote preparation of large-size motional Schördinger cat states

    Authors: Miaomiao Wei, Huatang Tan

    Abstract: In this paper, we present a scheme for remotely generating large-size motional Schrödinger cat states in cavity optomechanical (OM) systems with non-Gaussian quantum steering of continuous variables. We consider that the output field from the OM cavity undergoes three typical kinds of multiphoton operations: multiphoton subtraction, multiphoton addition, or multiphoton catalysis, followed by homod… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: accepted by Physical Review A

  23. arXiv:2503.02841  [pdf, other

    cs.CV

    Boltzmann Attention Sampling for Image Analysis with Small Objects

    Authors: Theodore Zhao, Sid Kiblawi, Naoto Usuyama, Ho Hin Lee, Sam Preston, Hoifung Poon, Mu Wei

    Abstract: Detecting and segmenting small objects, such as lung nodules and tumor lesions, remains a critical challenge in image analysis. These objects often occupy less than 0.1% of an image, making traditional transformer architectures inefficient and prone to performance degradation due to redundant attention computations on irrelevant regions. Existing sparse attention mechanisms rely on rigid hierarchi… ▽ More

    Submitted 26 March, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  24. arXiv:2503.00801  [pdf, other

    cs.CV

    STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds

    Authors: Zikuan Li, Honghua Chen, Yuecheng Wang, Sibo Wu, Mingqiang Wei, Jun Wang

    Abstract: Extracting geometric edges from unstructured point clouds remains a significant challenge, particularly in thin-walled structures that are commonly found in everyday objects. Traditional geometric methods and recent learning-based approaches frequently struggle with these structures, as both rely heavily on sufficient contextual information from local point neighborhoods. However, 3D measurement d… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: Accepted at CVPR 2025

  25. arXiv:2502.19896  [pdf, other

    cs.CV

    GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors

    Authors: An Li, Zhe Zhu, Mingqiang Wei

    Abstract: Existing point cloud completion methods, which typically depend on predefined synthetic training datasets, encounter significant challenges when applied to out-of-distribution, real-world scans. To overcome this limitation, we introduce a zero-shot completion framework, termed GenPC, designed to reconstruct high-quality real-world scans by leveraging explicit 3D generative priors. Our key insight… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: Accepted by CVPR 2025

  26. arXiv:2502.17053  [pdf, other

    cs.CV

    PointSea: Point Cloud Completion via Self-structure Augmentation

    Authors: Zhe Zhu, Honghua Chen, Xing He, Mingqiang Wei

    Abstract: Point cloud completion is a fundamental yet not well-solved problem in 3D vision. Current approaches often rely on 3D coordinate information and/or additional data (e.g., images and scanning viewpoints) to fill in missing parts. Unlike these methods, we explore self-structure augmentation and propose PointSea for global-to-local point cloud completion. In the global stage, consider how we inspect… ▽ More

    Submitted 26 February, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: Accepted by International Journal of Computer Vision (IJCV). Extension of our ICCV 2023 work: arXiv:2307.08492

  27. arXiv:2502.15447  [pdf, other

    astro-ph.HE hep-ph

    Ultra-high-energy $γ$-ray emission associated with the tail of a bow-shock pulsar wind nebula

    Authors: Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, S. H. Chen, S. Z. Chen , et al. (274 additional authors not shown)

    Abstract: In this study, we present a comprehensive analysis of an unidentified point-like ultra-high-energy (UHE) $γ$-ray source, designated as 1LHAASO J1740+0948u, situated in the vicinity of the middle-aged pulsar PSR J1740+1000. The detection significance reached 17.1$σ$ (9.4$σ$) above 25$\,$TeV (100$\,$TeV). The source energy spectrum extended up to 300$\,$TeV, which was well fitted by a log-parabola f… ▽ More

    Submitted 24 February, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: Corrected spelling errors in several author names

    Journal ref: The Innovation (2025), 100802

  28. General method for calculating transport properties of disordered mesoscopic systems based on the nonequilibrium Green's function formalism

    Authors: Gaoyang Li, MiaoMiao Wei, Fuming Xu, Jian Wang

    Abstract: Disorder scattering plays important roles in quantum transport as well as various Hall effects, including the second-order nonlinear Hall effect induced by Berry curvature dipole. Calculation of disorder-averaged transport properties usually requires substantial computational resources, especially for higher-order effects. Existing methods are either limited by approximation conditions or constrai… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Journal ref: Phys. Rev. B 111, 035409 (2025)

  29. Spin separation and filtering assisted by topological corner states in the Kekulé lattice

    Authors: Kai-Tong Wang, Hui Wang, Shijie Liu, Miaomiao Wei, Fuming Xu

    Abstract: Higher-order topological corner states have been realized in two-dimensional Kekulé lattice, which can be further coupled with spin polarization through the implementation of local magnetization. In this work, we numerically investigate the spin-dependent transport properties assisted by topological corner states in the Kekulé lattice. By applying local magnetization and electric potential, the to… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Journal ref: Phys. Rev. B 110, 125433 (2024)

  30. arXiv:2502.04848  [pdf, other

    astro-ph.HE

    Broadband $γ$-ray spectrum of supernova remnant Cassiopeia A

    Authors: Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, S. H. Chen, S. Z. Chen , et al. (293 additional authors not shown)

    Abstract: The core-collapse supernova remnant (SNR) Cassiopeia A (Cas A) is one of the brightest galactic radio sources with an angular radius of $\sim$ 2.5 $\arcmin$. Although no extension of this source has been detected in the $γ$-ray band, using more than 1000 days of LHAASO data above $\sim 0.8$ TeV, we find that its spectrum is significantly softer than those obtained with Imaging Air Cherenkov Telesc… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  31. arXiv:2502.04377  [pdf, other

    cs.CV cs.AI

    MapFusion: A Novel BEV Feature Fusion Network for Multi-modal Map Construction

    Authors: Xiaoshuai Hao, Yunfeng Diao, Mengchuan Wei, Yifan Yang, Peng Hao, Rong Yin, Hui Zhang, Weiming Li, Shu Zhao, Yu Liu

    Abstract: Map construction task plays a vital role in providing precise and comprehensive static environmental information essential for autonomous driving systems. Primary sensors include cameras and LiDAR, with configurations varying between camera-only, LiDAR-only, or camera-LiDAR fusion, based on cost-performance considerations. While fusion-based methods typically perform best, existing approaches ofte… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  32. arXiv:2502.02465  [pdf, other

    cs.CV

    Towards Consistent and Controllable Image Synthesis for Face Editing

    Authors: Mengting Wei, Tuomas Varanka, Yante Li, Xingxun Jiang, Huai-Qian Khor, Guoying Zhao

    Abstract: Face editing methods, essential for tasks like virtual avatars, digital human synthesis and identity preservation, have traditionally been built upon GAN-based techniques, while recent focus has shifted to diffusion-based models due to their success in image reconstruction. However, diffusion models still face challenges in controlling specific attributes and preserving the consistency of other un… ▽ More

    Submitted 9 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

  33. arXiv:2502.00943  [pdf, other

    cs.CL

    Universal Abstraction: Harnessing Frontier Models to Structure Real-World Data at Scale

    Authors: Cliff Wong, Sam Preston, Qianchu Liu, Zelalem Gero, Jass Bagga, Sheng Zhang, Shrey Jain, Theodore Zhao, Yu Gu, Yanbo Xu, Sid Kiblawi, Roshanthi Weerasinghe, Rom Leidner, Kristina Young, Brian Piening, Carlo Bifulco, Tristan Naumann, Mu Wei, Hoifung Poon

    Abstract: The vast majority of real-world patient information resides in unstructured clinical text, and the process of medical abstraction seeks to extract and normalize structured information from this unstructured input. However, traditional medical abstraction methods can require significant manual efforts that can include crafting rules or annotating training labels, limiting scalability. In this paper… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  34. arXiv:2502.00637  [pdf

    cs.HC

    Constructing AI ethics narratives based on real-world data: Human-AI collaboration in data-driven visual storytelling

    Authors: Mengyi Wei, Chenjing Jiao, Chenyu Zuo, Lorenz Hurni, Liqiu Meng

    Abstract: AI ethics narratives have the potential to shape the public accurate understanding of AI technologies and promote communication among different stakeholders. However, AI ethics narratives are largely lacking. Existing limited narratives tend to center on works of science fiction or corporate marketing campaigns of large technology companies. Misuse of "socio-technical imaginary" can blur the line… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: 13 pages, 9 figures

  35. arXiv:2501.06367  [pdf, other

    cs.CR

    Resilient Endurance-Aware NVM-based PUF against Learning-based Attacks

    Authors: Hassan Nassar, Ming-Liang Wei, Chia-Lin Yang, Jörg Henkel, Kuan-Hsun Chen

    Abstract: Physical Unclonable Functions (PUFs) based on Non-Volatile Memory (NVM) technology have emerged as a promising solution for secure authentication and cryptographic applications. By leveraging the multi-level cell (MLC) characteristic of NVMs, these PUFs can generate a wide range of unique responses, enhancing their resilience to machine learning (ML) modeling attacks. However, a significant issue… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

  36. arXiv:2501.02540  [pdf, ps, other

    physics.chem-ph physics.atm-clus physics.comp-ph physics.optics

    Mapping Transient Structures of Cyclo[18]Carbon by Computational X-Ray Spectra

    Authors: Minrui Wei, Sheng-Yu Wang, Jun-Rong Zhang, Lu Zhang, Guoyan Ge, Zeyu Liu, Weijie Hua

    Abstract: The structure of cyclo[18]carbon (C$_{18}$), whether in its polyynic form with bond length alternation (BLA) or its cumulenic form without BLA, has long fascinated researchers, even prior to its successful synthesis. Recent studies suggest a polyynic ground state and a cumulenic transient state; however, the dynamics remain unclear and lack experimental validation. This study presents a first-prin… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 3 figures

  37. arXiv:2501.02260  [pdf, other

    cs.CV

    MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control

    Authors: Mengting Wei, Tuomas Varanka, Xingxun Jiang, Huai-Qian Khor, Guoying Zhao

    Abstract: We address the problem of facial expression editing by controling the relative variation of facial action-unit (AU) from the same person. This enables us to edit this specific person's expression in a fine-grained, continuous and interpretable manner, while preserving their identity, pose, background and detailed facial attributes. Key to our model, which we dub MagicFace, is a diffusion model con… ▽ More

    Submitted 9 January, 2025; v1 submitted 4 January, 2025; originally announced January 2025.

  38. arXiv:2501.02248  [pdf, other

    quant-ph physics.atom-ph

    Enhanced Atom-by-Atom Assembly of Defect-Free Two-Dimensional Mixed-Species Atomic Arrays

    Authors: Ming-Rui Wei, Kun-Peng Wang, Jia-Yi Hou, Yi Chen, Peng Xu, Jun Zhuang, Rui-Jun Guo, Min Liu, Jin Wang, Xiao-Dong He, Ming-Sheng Zhan

    Abstract: Defect-free single atom array in optical tweezers is a promising platform for scalable quantum computing, quantum simulation, and quantum metrology. Extending single-species array to mixed-species one promise to offer new possibilities. In our recent proof of principle realization of defect-free two-dimensional assembly of mixed-species $^{85}$Rb ($^{87}$Rb) atom arrays [C. Sheng et al.\href{https… ▽ More

    Submitted 9 January, 2025; v1 submitted 4 January, 2025; originally announced January 2025.

    Comments: 8 pages, 5 figures

  39. arXiv:2501.01320  [pdf, other

    cs.CV

    SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration

    Authors: Jianyi Wang, Zhijie Lin, Meng Wei, Yang Zhao, Ceyuan Yang, Fei Xiao, Chen Change Loy, Lu Jiang

    Abstract: Video restoration poses non-trivial challenges in maintaining fidelity while recovering temporally consistent details from unknown degradations in the wild. Despite recent advances in diffusion-based restoration, these methods often face limitations in generation capability and sampling efficiency. In this work, we present SeedVR, a diffusion transformer designed to handle real-world video restora… ▽ More

    Submitted 22 March, 2025; v1 submitted 2 January, 2025; originally announced January 2025.

    Comments: CVPR25 CR ver., add a co-author additionally. Project page: https://iceclear.github.io/projects/seedvr/

  40. arXiv:2501.01037  [pdf, other

    cs.RO cs.AI cs.CV

    MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception

    Authors: Xiaoshuai Hao, Guanqun Liu, Yuting Zhao, Yuheng Ji, Mengchuan Wei, Haimei Zhao, Lingdong Kong, Rong Yin, Yu Liu

    Abstract: Multi-sensor fusion models play a crucial role in autonomous driving perception, particularly in tasks like 3D object detection and HD map construction. These models provide essential and comprehensive static environmental information for autonomous driving systems. While camera-LiDAR fusion methods have shown promising results by integrating data from both modalities, they often depend on complet… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

  41. Improving Acoustic Scene Classification in Low-Resource Conditions

    Authors: Zhi Chen, Yun-Fei Shao, Yong Ma, Mingsheng Wei, Le Zhang, Wei-Qiang Zhang

    Abstract: Acoustic Scene Classification (ASC) identifies an environment based on an audio signal. This paper explores ASC in low-resource conditions and proposes a novel model, DS-FlexiNet, which combines depthwise separable convolutions from MobileNetV2 with ResNet-inspired residual connections for a balance of efficiency and accuracy. To address hardware limitations and device heterogeneity, DS-FlexiNet e… ▽ More

    Submitted 27 April, 2025; v1 submitted 30 December, 2024; originally announced December 2024.

    Comments: Copyright 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component

    Journal ref: ICASSP (2025)

  42. arXiv:2412.19065  [pdf, ps, other

    physics.chem-ph astro-ph.HE physics.atm-clus physics.comp-ph

    Predicting Accurate X-ray Absorption Spectra for CN$^+$, CN, and CN$^-$: Insights from Multiconfigurational and Density Functional Simulations

    Authors: Jinyu Li, Sheng-Yu Wang, Lu Zhang, Guoyan Ge, Minrui Wei, Junxiang Zuo, Weijie Hua

    Abstract: High-resolution X-ray spectroscopy is an essential tool in X-ray astronomy, enabling detailed studies of celestial objects and their physical and chemical properties. However, comprehensive mapping of high-resolution X-ray spectra for even simple interstellar and circumstellar molecules is still lacking. In this study, we conducted systematic quantum chemical simulations to predict the C1s X-ray a… ▽ More

    Submitted 27 March, 2025; v1 submitted 26 December, 2024; originally announced December 2024.

    Comments: 5 figures

    Journal ref: Phys. Rev. A 111, 052803 (2025)

  43. arXiv:2412.16085  [pdf, other

    eess.IV cs.CV

    Efficient MedSAMs: Segment Anything in Medical Images on Laptop

    Authors: Jun Ma, Feifei Li, Sumin Kim, Reza Asakereh, Bao-Hiep Le, Dang-Khoa Nguyen-Vu, Alexander Pfefferle, Muxin Wei, Ruochen Gao, Donghang Lyu, Songxiao Yang, Lennart Purucker, Zdravko Marinov, Marius Staring, Haisheng Lu, Thuy Thanh Dao, Xincheng Ye, Zhi Li, Gianluca Brugnara, Philipp Vollmuth, Martha Foltyn-Dumitru, Jaeyoung Cho, Mustafa Ahmed Mahmutoglu, Martin Bendszus, Irada Pflüger , et al. (57 additional authors not shown)

    Abstract: Promptable segmentation foundation models have emerged as a transformative approach to addressing the diverse needs in medical images, but most existing models require expensive computing, posing a big barrier to their adoption in clinical practice. In this work, we organized the first international competition dedicated to promptable medical image segmentation, featuring a large-scale dataset spa… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

    Comments: CVPR 2024 MedSAM on Laptop Competition Summary: https://www.codabench.org/competitions/1847/

  44. arXiv:2412.13693  [pdf, other

    cs.SE

    UITrans: Seamless UI Translation from Android to HarmonyOS

    Authors: Lina Gong, Chen Wang, Yujun Huang, Di Cui, Mingqiang Wei

    Abstract: Seamless user interface (i.e., UI) translation has emerged as a pivotal technique for modern mobile developers, addressing the challenge of developing separate UI applications for Android and HarmonyOS platforms due to fundamental differences in layout structures and development paradigms. In this paper, we present UITrans, the first automated UI translation tool designed for Android to HarmonyOS.… ▽ More

    Submitted 5 February, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: 5 pages

  45. arXiv:2412.13471  [pdf, other

    cs.AI cs.CL

    Gradual Vigilance and Interval Communication: Enhancing Value Alignment in Multi-Agent Debates

    Authors: Rui Zou, Mengqi Wei, Jintian Feng, Qian Wan, Jianwen Sun, Sannyuya Liu

    Abstract: In recent years, large language models have shown exceptional performance in fulfilling diverse human needs. However, their training data can introduce harmful content, underscoring the necessity for robust value alignment. Mainstream methods, which depend on feedback learning and supervised training, are resource-intensive and may constrain the full potential of the models. Multi-Agent Debate (MA… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  46. arXiv:2412.12770  [pdf, other

    cs.IR

    A Survey on Sequential Recommendation

    Authors: Liwei Pan, Weike Pan, Meiyan Wei, Hongzhi Yin, Zhong Ming

    Abstract: Different from most conventional recommendation problems, sequential recommendation focuses on learning users' preferences by exploiting the internal order and dependency among the interacted items, which has received significant attention from both researchers and practitioners. In recent years, we have witnessed great progress and achievements in this field, necessitating a new survey. In this s… ▽ More

    Submitted 13 March, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

  47. arXiv:2412.11460  [pdf, other

    astro-ph.HE hep-ex

    Observation of a spectral hardening in cosmic ray boron spectrum with the DAMPE space mission

    Authors: DAMPE Collaboration, F. Alemanno, C. Altomare, Q. An, P. Azzarello, F. C. T. Barbato, P. Bernardini, X. J. Bi, H. Boutin, I. Cagnoli, M. S. Cai, E. Casilli, E. Catanzani, J. Chang, D. Y. Chen, J. L. Chen, Z. F. Chen, Z. X. Chen, P. Coppin, M. Y. Cui, T. S. Cui, Y. X. Cui, I. De Mitri, F. de Palma, A. Di Giovanni , et al. (121 additional authors not shown)

    Abstract: Secondary cosmic ray fluxes are important probes of the propagation and interaction of high-energy particles in the Galaxy. Recent measurements of primary and secondary cosmic ray nuclei have revealed unexpected spectral features that demand a deeper understanding. In this work we report the direct measurement of the cosmic ray boron spectrum from 10 GeV/n to 8 TeV/n with eight years of data colle… ▽ More

    Submitted 18 December, 2024; v1 submitted 16 December, 2024; originally announced December 2024.

    Comments: 10 pages, 10 figures, submitted to PRL

  48. ESA: Example Sieve Approach for Multi-Positive and Unlabeled Learning

    Authors: Zhongnian Li, Meng Wei, Peng Ying, Xinzheng Xu

    Abstract: Learning from Multi-Positive and Unlabeled (MPU) data has gradually attracted significant attention from practical applications. Unfortunately, the risk of MPU also suffer from the shift of minimum risk, particularly when the models are very flexible as shown in Fig.\ref{moti}. In this paper, to alleviate the shifting of minimum risk problem, we propose an Example Sieve Approach (ESA) to select ex… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 12 pages, 6 figures

  49. Learning from Concealed Labels

    Authors: Zhongnian Li, Meng Wei, Peng Ying, Tongfeng Sun, Xinzheng Xu

    Abstract: Annotating data for sensitive labels (e.g., disease, smoking) poses a potential threats to individual privacy in many real-world scenarios. To cope with this problem, we propose a novel setting to protect privacy of each instance, namely learning from concealed labels for multi-class classification. Concealed labels prevent sensitive labels from appearing in the label set during the label collecti… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 12 pages, 2 figures

  50. arXiv:2412.00370  [pdf, other

    cs.GT

    Incentive-Driven Task Offloading and Collaborative Computing in Device-Assisted MEC Networks

    Authors: Yang Li, Xing Zhang, Bo Lei, Qianying Zhao, Min Wei, Zheyan Qu, Wenbo Wang

    Abstract: Edge computing (EC), positioned near end devices, holds significant potential for delivering low-latency, energy-efficient, and secure services. This makes it a crucial component of the Internet of Things (IoT). However, the increasing number of IoT devices and emerging services place tremendous pressure on edge servers (ESs). To better handle dynamically arriving heterogeneous tasks, ESs and IoT… ▽ More

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: Accepted to IEEE Internet of Things Journal