Skip to main content

Showing 51–100 of 506 results for author: Feng, B

.
  1. arXiv:2411.03088  [pdf, ps, other

    hep-th gr-qc hep-ph

    Multivariate hypergeometric solutions of cosmological (dS) correlators by $\text{d} \log$-form differential equations

    Authors: Jiaqi Chen, Bo Feng, Yi-Xiao Tao

    Abstract: In this paper, we give the analytic expression for the homogeneous part of solutions of arbitrary tree-level cosmological correlators, including massive propagators and time-derivative interaction cases. The solutions are given in the form of multivariate hypergeometric functions. It is achieved by two steps. Firstly, we indicate the factorization of the homogeneous part of solutions, i.e., the ho… ▽ More

    Submitted 13 March, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

    Comments: v1: 33 pages, 1 figure; v2: published version

  2. arXiv:2410.17922  [pdf, other

    cs.AI

    Dynamic Guided and Domain Applicable Safeguards for Enhanced Security in Large Language Models

    Authors: Weidi Luo, He Cao, Zijing Liu, Yu Wang, Aidan Wong, Bing Feng, Yuan Yao, Yu Li

    Abstract: With the extensive deployment of Large Language Models (LLMs), ensuring their safety has become increasingly critical. However, existing defense methods often struggle with two key issues: (i) inadequate defense capabilities, particularly in domain-specific scenarios like chemistry, where a lack of specialized knowledge can lead to the generation of harmful responses to malicious queries. (ii) ove… ▽ More

    Submitted 8 February, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

  3. arXiv:2410.05787  [pdf, other

    cs.NE

    An accelerate Prediction Strategy for Dynamic Multi-Objective Optimization

    Authors: Ru Lei, Lin Li, Rustam Stolkin, Bin Feng

    Abstract: This paper addresses the challenge of dynamic multi-objective optimization problems (DMOPs) by introducing novel approaches for accelerating prediction strategies within the evolutionary algorithm framework. Since the objectives of DMOPs evolve over time, both the Pareto optimal set (PS) and the Pareto optimal front (PF) are dynamic. To effectively track the changes in the PS and PF in both decisi… ▽ More

    Submitted 13 November, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

    Comments: Dynamic Multi-objective Optimization Problems

  4. arXiv:2410.04696  [pdf, other

    stat.ME

    Efficient Input Uncertainty Quantification for Ratio Estimator

    Authors: Linyun He, Ben Feng, Eunhye Song

    Abstract: We study the construction of a confidence interval (CI) for a simulation output performance measure that accounts for input uncertainty when the input models are estimated from finite data. In particular, we focus on performance measures that can be expressed as a ratio of two dependent simulation outputs' means. We adopt the parametric bootstrap method to mimic input data sampling and construct t… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

  5. arXiv:2410.02764  [pdf, other

    cs.CV cs.LG eess.IV

    Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats

    Authors: Mingyang Xie, Haoming Cai, Sachin Shah, Yiran Xu, Brandon Y. Feng, Jia-Bin Huang, Christopher A. Metzler

    Abstract: We introduce a simple yet effective approach for separating transmitted and reflected light. Our key insight is that the powerful novel view synthesis capabilities provided by modern inverse rendering methods (e.g.,~3D Gaussian splatting) allow one to perform flash/no-flash reflection separation using unpaired measurements -- this relaxation dramatically simplifies image acquisition over conventio… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  6. arXiv:2409.18026  [pdf, other

    cs.CV cs.RO

    ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning

    Authors: Song Wang, Zhongdao Wang, Jiawei Yu, Wentong Li, Bailan Feng, Junbo Chen, Jianke Zhu

    Abstract: Vision-centric semantic occupancy prediction plays a crucial role in autonomous driving, which requires accurate and reliable predictions from low-cost sensors. Although having notably narrowed the accuracy gap with LiDAR, there is still few research effort to explore the reliability in predicting semantic occupancy from camera. In this paper, we conduct a comprehensive evaluation of existing sema… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Technical report. Work in progress

  7. arXiv:2409.14072  [pdf, other

    cs.CV

    Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects

    Authors: Shuai Zhang, Guanjun Wu, Xinggang Wang, Bin Feng, Wenyu Liu

    Abstract: Reconstructing objects and extracting high-quality surfaces play a vital role in the real world. Current 4D representations show the ability to render high-quality novel views for dynamic objects but cannot reconstruct high-quality meshes due to their implicit or geometrically inaccurate representations. In this paper, we propose a novel representation that can reconstruct accurate meshes from spa… ▽ More

    Submitted 21 September, 2024; originally announced September 2024.

  8. arXiv:2409.12663  [pdf, ps, other

    hep-th hep-ph

    Notes on Selection Rules of Canonical Differential Equations and Relative Cohomology

    Authors: Jiaqi Chen, Bo Feng

    Abstract: We give an explanation of the $\mathrm{d}\log$-form of the coefficient matrix of canonical differential equations using the projection of ($n$+1)-$\mathrm{d}\log$ forms onto $n$-$\mathrm{d}\log$ forms. This projection is done using the leading-order formula for intersection numbers. This formula gives a simple way to compute the coefficient matrix. When combined with the relative twisted cohomolog… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 40 pages, 1 figure

  9. arXiv:2409.12356  [pdf, other

    hep-ph astro-ph.HE nucl-th

    Magnetized neutral 2SC color superconductivity and possible origin of the inner magnetic field of magnetars

    Authors: Shuai Yuan, Bo Feng, Efrain J. Ferrer, Alejandro Pinero

    Abstract: In this paper the neutral 2SC phase of color superconductivity is investigated in the presence of a magnetic field and for diquark coupling constants and baryonic densities that are expected to characterize neutron stars. Specifically, the behavior of the charged gluons Meissner masses is investigated in the parameter region of interest taking into account, in addition, the contribution of a rotat… ▽ More

    Submitted 20 December, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 14 pages, 7 figures; Version appeared in PRD

  10. arXiv:2409.07762  [pdf, ps, other

    cs.CV cs.LG

    Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment

    Authors: Shaode Yu, Ze Chen, Zhimu Yang, Jiacheng Gu, Bizu Feng

    Abstract: Score prediction is crucial in evaluating realistic image sharpness based on collected informative features. Recently, Kolmogorov-Arnold networks (KANs) have been developed and witnessed remarkable success in data fitting. This study introduces the Taylor series-based KAN (TaylorKAN). Then, different KANs are explored in four realistic image databases (BID2011, CID2013, CLIVE, and KonIQ-10k) to pr… ▽ More

    Submitted 4 December, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

  11. arXiv:2408.10870  [pdf

    physics.chem-ph

    Revisiting the measurements and interpretations of DLVO forces

    Authors: Bo Feng, Xiantang Liu, Xinmin Liu, Yingli Li, Hang Li

    Abstract: The DLVO theory and electrical double layer (EDL) theory are the foundation of colloid and interface science. With the invention and development of surface forces apparatus (SFA) and atomic force microscope (AFM), the measurements and interpretations of DLVO forces (i.e., mainly measuring the EDL force (electrostatic force) FEDL and van der Waals force FvdW, and interpreting the potential ψ, charg… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 71 pages, 18 figures

  12. arXiv:2407.20606  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Evidence for Two-dimensional Weyl Fermions in Air-Stable Monolayer PtTe$_{1.75}$

    Authors: Zhihao Cai, Haijun Cao, Haohao Sheng, Xuegao Hu, Zhenyu Sun, Qiaoxiao Zhao, Jisong Gao, Shin-ichiro Ideta, Kenya Shimada, Jiawei Huang, Peng Cheng, Lan Chen, Yugui Yao, Sheng Meng, Kehui Wu, Zhijun Wang, Baojie Feng

    Abstract: The Weyl semimetals represent a distinct category of topological materials wherein the low-energy excitations appear as the long-sought Weyl fermions. Exotic transport and optical properties are expected because of the chiral anomaly and linear energy-momentum dispersion. While three-dimensional Weyl semimetals have been successfully realized, the quest for their two-dimensional (2D) counterparts… ▽ More

    Submitted 12 December, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

    Journal ref: Nano Lett. 24, 10237-10243 (2024)

  13. arXiv:2407.12519  [pdf, other

    cs.CV

    Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition

    Authors: Haijun Xiong, Bin Feng, Xinggang Wang, Wenyu Liu

    Abstract: Gait recognition is a biometric technology that distinguishes individuals by their walking patterns. However, previous methods face challenges when accurately extracting identity features because they often become entangled with non-identity clues. To address this challenge, we propose CLTD, a causality-inspired discriminative feature learning module designed to effectively eliminate the influence… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  14. arXiv:2407.12294  [pdf, other

    cs.CV

    VEON: Vocabulary-Enhanced Occupancy Prediction

    Authors: Jilai Zheng, Pin Tang, Zhongdao Wang, Guoqing Wang, Xiangxuan Ren, Bailan Feng, Chao Ma

    Abstract: Perceiving the world as 3D occupancy supports embodied agents to avoid collision with any types of obstacle. While open-vocabulary image understanding has prospered recently, how to bind the predicted 3D occupancy grids with open-world semantics still remains under-explored due to limited open-world annotations. Hence, instead of building our model from scratch, we try to blend 2D foundation model… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV2024

  15. arXiv:2407.11382  [pdf, other

    cs.CV cs.AI cs.RO

    Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

    Authors: Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, Ping Luo

    Abstract: This paper proposes an algorithm for automatically labeling 3D objects from 2D point or box prompts, especially focusing on applications in autonomous driving. Unlike previous arts, our auto-labeler predicts 3D shapes instead of bounding boxes and does not require training on a specific dataset. We propose a Segment, Lift, and Fit (SLF) paradigm to achieve this goal. Firstly, we segment high-quali… ▽ More

    Submitted 17 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  16. arXiv:2407.08353  [pdf

    cond-mat.mtrl-sci

    1D flat bands in phosphorene nanoribbons with pentagonal nature

    Authors: Shuo Sun, Jing-Yang You, Zhihao Cai, Jie Su, Tong Yang, Xinnan Peng, Yihe Wang, Daiyu Geng, Jian Gou, Yuli Huang, Sisheng Duan, Lan Chen, Kehui Wu, Andrew T. S. Wee, Yuan Ping Feng, Jia Lin Zhang, Jiong Lu, Baojie Feng, Wei Chen

    Abstract: Materials with flat bands can serve as a promising platform to investigate strongly interacting phenomena. However, experimental realization of ideal flat bands is mostly limited to artificial lattices or moiré systems. Here we report a general way to construct one-dimensional (1D) flat bands in phosphorene nanoribbons (PNRs) with pentagonal nature: penta-hexa-PNRs and penta-dodeca-PNRs, wherein t… ▽ More

    Submitted 12 December, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures

  17. arXiv:2407.01029  [pdf, other

    cs.CV

    EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting

    Authors: Chenxin Li, Brandon Y. Feng, Yifan Liu, Hengyu Liu, Cheng Wang, Weihao Yu, Yixuan Yuan

    Abstract: 3D reconstruction of biological tissues from a collection of endoscopic images is a key to unlock various important downstream surgical applications with 3D capabilities. Existing methods employ various advanced neural rendering techniques for photorealistic view synthesis, but they often struggle to recover accurate 3D representations when only sparse observations are available, which is usually… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accpeted by MICCAI2024

  18. arXiv:2406.14746  [pdf, other

    cs.LG cs.RO

    Behavior-Inspired Neural Networks for Relational Inference

    Authors: Yulong Yang, Bowen Feng, Keqin Wang, Naomi Ehrich Leonard, Adji Bousso Dieng, Christine Allen-Blanchette

    Abstract: From pedestrians to Kuramoto oscillators, interactions between agents govern how dynamical systems evolve in space and time. Discovering how these agents relate to each other has the potential to improve our understanding of the often complex dynamics that underlie these systems. Recent works learn to categorize relationships between agents based on observations of their physical behavior. These a… ▽ More

    Submitted 11 March, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accept to The 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025)

  19. arXiv:2406.12816  [pdf, other

    cs.LG cs.CV eess.IV

    Neural Approximate Mirror Maps for Constrained Diffusion Models

    Authors: Berthy T. Feng, Ricardo Baptista, Katherine L. Bouman

    Abstract: Diffusion models excel at creating visually-convincing images, but they often struggle to meet subtle constraints inherent in the training data. Such constraints could be physics-based (e.g., satisfying a PDE), geometric (e.g., respecting symmetry), or semantic (e.g., including a particular number of objects). When the training data all satisfy a certain constraint, enforcing this constraint on a… ▽ More

    Submitted 9 April, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: ICLR 2025

  20. arXiv:2406.12355  [pdf, other

    cs.CV

    LiCAF: LiDAR-Camera Asymmetric Fusion for Gait Recognition

    Authors: Yunze Deng, Haijun Xiong, Bin Feng

    Abstract: Gait recognition is a biometric technology that identifies individuals by using walking patterns. Due to the significant achievements of multimodal fusion in gait recognition, we consider employing LiDAR-camera fusion to obtain robust gait representations. However, existing methods often overlook intrinsic characteristics of modalities, and lack fine-grained fusion and temporal modeling. In this p… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by ICIP2024

  21. arXiv:2406.08814  [pdf, other

    cs.CV

    Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting

    Authors: Zhengqi Zhao, Xiaohu Huang, Hao Zhou, Kun Yao, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng

    Abstract: The key to action counting is accurately locating each video's repetitive actions. Instead of estimating the probability of each frame belonging to an action directly, we propose a dual-branch network, i.e., SkimFocusNet, working in a two-step manner. The model draws inspiration from empirical observations indicating that humans typically engage in coarse skimming of entire sequences to grasp the… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 13 pages, 9 figures

  22. arXiv:2406.02785  [pdf, other

    astro-ph.IM cs.LG eess.IV

    Event-horizon-scale Imaging of M87* under Different Assumptions via Deep Generative Image Priors

    Authors: Berthy T. Feng, Katherine L. Bouman, William T. Freeman

    Abstract: Reconstructing images from the Event Horizon Telescope (EHT) observations of M87*, the supermassive black hole at the center of the galaxy M87, depends on a prior to impose desired image statistics. However, given the impossibility of directly observing black holes, there is no clear choice for a prior. We present a framework for flexibly designing a range of priors, each bringing different biases… ▽ More

    Submitted 9 November, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Journal ref: ApJ 975 201 (2024)

  23. arXiv:2406.00948  [pdf

    cond-mat.mtrl-sci

    Real-space tilting method for atomic resolution STEM imaging of nanocrystalline materials

    Authors: Jiake Wei, Zhangze Xu, Wenjie Shen, Bin Feng, Ryo Ishikawa, Naoya Shibata, Yuichi Ikuhara, Xuedong Bai

    Abstract: Atomic-resolution scanning transmission electron microscopy (STEM) characterization requires precise tilting of the specimen to high symmetric zone axis, which is usually processed in reciprocal space by following the diffraction patterns. However, for small-sized nanocrystalline materials, their diffraction patterns are too faint to guide the tilting process. Here, a simple and effective tilting… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  24. arXiv:2405.20334  [pdf, other

    cs.CV cs.GR

    VividDream: Generating 3D Scene with Ambient Dynamics

    Authors: Yao-Chih Lee, Yi-Ting Chen, Andrew Wang, Ting-Hsuan Liao, Brandon Y. Feng, Jia-Bin Huang

    Abstract: We introduce VividDream, a method for generating explorable 4D scenes with ambient dynamics from a single input image or text prompt. VividDream first expands an input image into a static 3D point cloud through iterative inpainting and geometry merging. An ensemble of animated videos is then generated using video diffusion models with quality refinement techniques and conditioned on renderings of… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Project page: https://vivid-dream-4d.github.io

  25. arXiv:2405.11485  [pdf

    cond-mat.mtrl-sci

    Evidence for Multiferroicity in Single-Layer CuCrSe$_2$

    Authors: Zhenyu Sun, Yueqi Su, Aomiao Zhi, Zhicheng Gao, Xu Han, Kang Wu, Lihong Bao, Yuan Huang, Youguo Shi, Xuedong Bai, Peng Cheng, Lan Chen, Kehui Wu, Xuezeng Tian, Changzheng Wu, Baojie Feng

    Abstract: Multiferroic materials, which simultaneously exhibit ferroelectricity and magnetism, have attracted substantial attention due to their fascinating physical properties and potential technological applications. With the trends towards device miniaturization, there is an increasing demand for the persistence of multiferroicity in single-layer materials at elevated temperatures. Here, we report high-t… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Journal ref: Nature Communications 15, 4252 (2024)

  26. arXiv:2405.10463  [pdf, other

    physics.optics eess.IV physics.bio-ph

    Single-shot volumetric fluorescence imaging with neural fields

    Authors: Oumeng Zhang, Haowen Zhou, Brandon Y. Feng, Elin M. Larsson, Reinaldo E. Alcalde, Siyuan Yin, Catherine Deng, Changhuei Yang

    Abstract: Single-shot volumetric fluorescence (SVF) imaging offers a significant advantage over traditional imaging methods that require scanning across multiple axial planes as it can capture biological processes with high temporal resolution. The key challenges in SVF imaging include requiring sparsity constraints, eliminating depth ambiguity in the reconstruction, and maintaining high resolution across a… ▽ More

    Submitted 21 January, 2025; v1 submitted 16 May, 2024; originally announced May 2024.

  27. Dual-Task Vision Transformer for Rapid and Accurate Intracerebral Hemorrhage CT Image Classification

    Authors: Jialiang Fan, Xinhui Fan, Chengyan Song, Xiaofan Wang, Bingdong Feng, Lucan Li, Guoyu Lu

    Abstract: Intracerebral hemorrhage (ICH) is a severe and sudden medical condition caused by the rupture of blood vessels in the brain, leading to permanent damage to brain tissue and often resulting in functional disabilities or death in patients. Diagnosis and analysis of ICH typically rely on brain CT imaging. Given the urgency of ICH conditions, early treatment is crucial, necessitating rapid analysis of… ▽ More

    Submitted 2 August, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  28. arXiv:2405.04531  [pdf, other

    stat.ME stat.CO

    Stochastic Gradient MCMC for Massive Geostatistical Data

    Authors: Mohamed A. Abba, Brian J. Reich, Reetam Majumder, Brandon Feng

    Abstract: Gaussian processes (GPs) are commonly used for prediction and inference for spatial data analyses. However, since estimation and prediction tasks have cubic time and quadratic memory complexity in number of locations, GPs are difficult to scale to large spatial datasets. The Vecchia approximation induces sparsity in the dependence structure and is one of several methods proposed to scale GP infere… ▽ More

    Submitted 3 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  29. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  30. arXiv:2405.04185  [pdf

    physics.app-ph

    Research on signalized intersection mixed traffic flow platoon control method considering Backward-looking effect

    Authors: Binghao Feng, Hui Guo, Minghui Ma, Yuepeng Wu, Shidong Liang, Yansong Wang

    Abstract: Connected and Autonomous Vehicles (CAVs) technology facilitates the advancement of intelligent transportation. However, intelligent control techniques for mixed traffic flow at signalized intersections involving both CAVs and Human-Driven Vehicles (HDVs) require further investigation into the impact of backward-looking effect. This paper proposes the concept of 1+n+1 mixed platoon considering the… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  31. arXiv:2404.19385  [pdf

    cond-mat.mtrl-sci

    High-performance solid-state electrochemical thermal switches with earth-abundant cerium oxide

    Authors: Ahrong Jeong, Mitsuki Yoshimura, Hyeonjun Kong, Zhiping Bian, Jason Tam, Bin Feng, Yuichi Ikuhara, Takashi Endo, Yasutaka Matsuo, Hiromichi Ohta

    Abstract: Thermal switches, which electrically turn heat flow on and off, have attracted attention as thermal management devices. Electrochemical reduction/oxidation switches the thermal conductivity (\k{appa}\) of active metal oxide films. The performance of the previously proposed electrochemical thermal switches is low; on/off \k{appa}\-ratio is mostly less than 5 and \k{appa}\-switching width is less th… ▽ More

    Submitted 22 August, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures with supporting information (12 pages, 11 figures, 1 table)

    Journal ref: Science Adv. 11, eads6137 (2025)

  32. Integrable Semi-Discretization for a Modified Camassa-Holm Equation with Cubic Nonlinearity

    Authors: Bao-Feng Feng, Heng-Chun Hu, Han-Han Sheng, Wei Yin, Guo-Fu Yu

    Abstract: In the present paper, an integrable semi-discretization of the modified Camassa-Holm (mCH) equation with cubic nonlinearity is presented. The key points of the construction are based on the discrete Kadomtsev-Petviashvili (KP) equation and appropriate definition of discrete reciprocal transformations. First, we demonstrate that these bilinear equations and their determinant solutions can be derive… ▽ More

    Submitted 12 October, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Journal ref: SIGMA 20 (2024), 091, 14 pages

  33. arXiv:2404.15761  [pdf, other

    eess.SP

    Rechargeable UAV Trajectory Optimization for Real-Time Persistent Data Collection of Large-Scale Sensor Networks

    Authors: Rui Wang, Deshi Li, Qingqing Wu, Kaitao Meng, Boning Feng, Lele Cong

    Abstract: Unmanned aerial vehicles (UAVs) have received plenty of attention due to their high flexibility and enhanced communication ability, nonetheless, the limited onboard energy restricts UAVs' application on persistent data collection missions in large areas. In this paper, we propose a rechargeable UAV-assisted periodic data collection scheme, where a UAV is dispatched to periodically collect data fro… ▽ More

    Submitted 9 October, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 15 pages, 17 figures, submitted to IEEE for possible publication

  34. arXiv:2404.15014  [pdf, other

    cs.CV

    OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving

    Authors: Guoqing Wang, Zhongdao Wang, Pin Tang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma

    Abstract: Existing solutions for 3D semantic occupancy prediction typically treat the task as a one-shot 3D voxel-wise segmentation perception problem. These discriminative methods focus on learning the mapping between the inputs and occupancy map in a single step, lacking the ability to gradually refine the occupancy map and the reasonable scene imaginative capacity to complete the local regions somewhere.… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  35. arXiv:2404.13026  [pdf, other

    cs.CV cs.AI

    PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

    Authors: Tianyuan Zhang, Hong-Xing Yu, Rundi Wu, Brandon Y. Feng, Changxi Zheng, Noah Snavely, Jiajun Wu, William T. Freeman

    Abstract: Realistic object interactions are crucial for creating immersive virtual experiences, yet synthesizing realistic 3D object dynamics in response to novel interactions remains a significant challenge. Unlike unconditional or text-conditioned dynamics generation, action-conditioned dynamics requires perceiving the physical material properties of objects and grounding the 3D motion prediction on these… ▽ More

    Submitted 7 October, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: Project website at: https://physdreamer.github.io/ Appear on ECCV 2024

  36. arXiv:2404.09734  [pdf, other

    cs.IT eess.SP

    Weighted Sum-Rate Maximization for Movable Antenna-Enhanced Wireless Networks

    Authors: Biqian Feng, Yongpeng Wu, Xiang-Gen Xia, Chengshan Xiao

    Abstract: This letter investigates the weighted sum rate maximization problem in movable antenna (MA)-enhanced systems. To reduce the computational complexity, we transform it into a more tractable weighted minimum mean square error (WMMSE) problem well-suited for MA. We then adopt the WMMSE algorithm and majorization-minimization algorithm to optimize the beamforming and antenna positions, respectively. Mo… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Wireless Communications Letters

  37. arXiv:2404.09502  [pdf, other

    cs.CV

    SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction

    Authors: Pin Tang, Zhongdao Wang, Guoqing Wang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma

    Abstract: Vision-based perception for autonomous driving requires an explicit modeling of a 3D space, where 2D latent representations are mapped and subsequent 3D operators are applied. However, operating on dense latent spaces introduces a cubic time and space complexity, which limits scalability in terms of perception range or spatial resolution. Existing approaches compress the dense representation using… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures, accepted by CVPR 2024

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024)

  38. arXiv:2404.07985  [pdf, other

    cs.CV eess.IV

    WaveMo: Learning Wavefront Modulations to See Through Scattering

    Authors: Mingyang Xie, Haiyun Guo, Brandon Y. Feng, Lingbo Jin, Ashok Veeraraghavan, Christopher A. Metzler

    Abstract: Imaging through scattering media is a fundamental and pervasive challenge in fields ranging from medical diagnostics to astronomy. A promising strategy to overcome this challenge is wavefront modulation, which induces measurement diversity during image acquisition. Despite its importance, designing optimal wavefront modulations to image through scattering remains under-explored. This paper introdu… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  39. arXiv:2404.00471  [pdf, other

    physics.med-ph cs.CV cs.LG eess.IV

    Score-Based Diffusion Models for Photoacoustic Tomography Image Reconstruction

    Authors: Sreemanti Dey, Snigdha Saha, Berthy T. Feng, Manxiu Cui, Laure Delisle, Oscar Leong, Lihong V. Wang, Katherine L. Bouman

    Abstract: Photoacoustic tomography (PAT) is a rapidly-evolving medical imaging modality that combines optical absorption contrast with ultrasound imaging depth. One challenge in PAT is image reconstruction with inadequate acoustic signals due to limited sensor coverage or due to the density of the transducer array. Such cases call for solving an ill-posed inverse reconstruction problem. In this work, we use… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 5 pages

    Journal ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, Republic of, 2024, pp. 2470-2474

  40. arXiv:2403.16095  [pdf, other

    cs.CV cs.RO

    CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field

    Authors: Jiarui Hu, Xianhao Chen, Boyin Feng, Guanglin Li, Liangjing Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

    Abstract: Recently neural radiance fields (NeRF) have been widely exploited as 3D representations for dense simultaneous localization and mapping (SLAM). Despite their notable successes in surface modeling and novel view synthesis, existing NeRF-based methods are hindered by their computationally intensive and time-consuming volume rendering pipeline. This paper presents an efficient dense RGB-D SLAM system… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Project Page: https://zju3dv.github.io/cg-slam

  41. arXiv:2403.16040  [pdf, ps, other

    hep-ph

    General One-loop Generating Function by IBP relations

    Authors: Bo Feng, Chang Hu, Jiyuan Shen, Yaobo Zhang

    Abstract: In this paper we have studied the most general generating function of reduction for one loop integrals with arbitrary tensor structure in numerator and arbitrary power distribution of propagators in denominator. Using IBP relations, we have established the partial differential equations for these generating functions and solved them analytically. These results provide useful guidance for applying… ▽ More

    Submitted 11 November, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: 50 pages

  42. arXiv:2403.13800  [pdf, other

    cs.CV

    TimeRewind: Rewinding Time with Image-and-Events Video Diffusion

    Authors: Jingxi Chen, Brandon Y. Feng, Haoming Cai, Mingyang Xie, Christopher Metzler, Cornelia Fermuller, Yiannis Aloimonos

    Abstract: This paper addresses the novel challenge of ``rewinding'' time from a single captured image to recover the fleeting moments missed just before the shutter button is pressed. This problem poses a significant challenge in computer vision and computational photography, as it requires predicting plausible pre-capture motion from a single static frame, an inherently ill-posed task due to the high degre… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  43. arXiv:2403.11050  [pdf, other

    cs.CV

    Endora: Video Generation Models as Endoscopy Simulators

    Authors: Chenxin Li, Hengyu Liu, Yifan Liu, Brandon Y. Feng, Wuyang Li, Xinyu Liu, Zhen Chen, Jing Shao, Yixuan Yuan

    Abstract: Generative models hold promise for revolutionizing medical education, robot-assisted surgery, and data augmentation for machine learning. Despite progress in generating 2D medical images, the complex domain of clinical video generation has largely remained untapped.This paper introduces \model, an innovative approach to generate medical videos that simulate clinical endoscopy scenes. We present a… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Project page: https://endora-medvidgen.github.io/

  44. Layer-dependent Raman spectroscopy of ultrathin Ta$_2$Pd$_3$Te$_5$

    Authors: Zhenyu Sun, Zhaopeng Guo, Dayu Yan, Peng Cheng, Lan Chen, Youguo Shi, Yuan Huang, Zhijun Wang, Kehui Wu, Baojie Feng

    Abstract: Two-dimensional topological insulators (2DTIs) or quantum spin Hall insulators are attracting increasing attention due to their potential applications in next-generation spintronic devices. Despite their promising prospects, realizable 2DTIs are still limited. Recently, Ta2Pd3Te5, a semiconducting van der Waals material, has shown spectroscopic evidence of quantum spin Hall states. However, achiev… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Journal ref: Phys. Rev. Materials 7, 094004 (2023)

  45. arXiv:2401.17213  [pdf

    physics.optics physics.ins-det

    Ptycho-endoscopy on a lensless ultrathin fiber bundle tip

    Authors: Pengming Song, Ruihai Wang, Lars Loetgering, Jia Liu, Peter Vouras, Yujin Lee, Shaowei Jiang, Bin Feng, Andrew Maiden, Changhuei Yang, Guoan Zheng

    Abstract: Synthetic aperture radar (SAR) utilizes an aircraft-carried antenna to emit electromagnetic pulses and detect the returning echoes. As the aircraft travels across a designated area, it synthesizes a large virtual aperture to improve image resolution. Inspired by SAR, we introduce synthetic aperture ptycho-endoscopy (SAPE) for micro-endoscopic imaging beyond the diffraction limit. SAPE operates by… ▽ More

    Submitted 6 July, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  46. arXiv:2401.06458  [pdf, other

    math.AP math-ph

    Asymptotic behavior for a new higher-order nonlinear Schrödinger equation

    Authors: Hongyi Zhang, Yufeng Zhang, Binlu Feng

    Abstract: We investigate the Cauchy problem of a new higher-order nonlinear Schrödinger equation (NHNSE) with weighted Sobolev initial data which is derived by ourselves. By applying $\bar{\partial}$-steepest descent method, we derive the long-time asymptotics of the NHNSE. Explicit steps are as follows: first of all, based on the spectral analysis of a Lax pair and scattering matrice, the solution of the N… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  47. arXiv:2401.00129  [pdf, other

    hep-th astro-ph.CO gr-qc hep-ph

    Towards Systematic Evaluation of de Sitter Correlators via Generalized Integration-By-Parts Relations

    Authors: Jiaqi Chen, Bo Feng

    Abstract: We generalize Integration-By-Parts (IBP) and differential equations methods to de Sitter correlators related to inflation. While massive correlators in de Sitter spacetime are usually regarded as highly intricate, we find they have remarkably hidden concise structures from the perspective of IBP. We find the factorization of the IBP relations of each vertex integral family corresponding to… ▽ More

    Submitted 27 October, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 26 pages, 2 figures. Revision after accepted: corrected formulas about remaining terms, fixed typos, and simplify formulas

    Journal ref: JHEP06(2024)199

  48. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  49. arXiv:2312.04679  [pdf, other

    eess.IV cs.CV

    ConVRT: Consistent Video Restoration Through Turbulence with Test-time Optimization of Neural Video Representations

    Authors: Haoming Cai, Jingxi Chen, Brandon Y. Feng, Weiyun Jiang, Mingyang Xie, Kevin Zhang, Ashok Veeraraghavan, Christopher Metzler

    Abstract: tmospheric turbulence presents a significant challenge in long-range imaging. Current restoration algorithms often struggle with temporal inconsistency, as well as limited generalization ability across varying turbulence levels and scene content different than the training data. To tackle these issues, we introduce a self-supervised method, Consistent Video Restoration through Turbulence (ConVRT)… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: https://convrt-2024.github.io/

  50. arXiv:2312.03788  [pdf, other

    cs.LG cs.CL

    SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM

    Authors: Jiayi Pan, Chengcan Wang, Kaifu Zheng, Yangguang Li, Zhenyu Wang, Bin Feng

    Abstract: Large language models (LLMs) have shown remarkable capabilities in various tasks. However their huge model size and the consequent demand for computational and memory resources also pose challenges to model deployment. Currently, 4-bit post-training quantization (PTQ) has achieved some success in LLMs, reducing the memory footprint by approximately 75% compared to FP16 models, albeit with some acc… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.