
Showing 201–250 of 833 results for author: Gao, P

  1. arXiv:2312.06462  [pdf, other]

    cs.CV cs.AI cs.SD eess.AS

    Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation

    Authors: Qi Yang, Xing Nie, Tong Li, Pengfei Gao, Ying Guo, Cheng Zhen, Pengfei Yan, Shiming Xiang

    Abstract: Recently, an audio-visual segmentation (AVS) task has been introduced, aiming to group pixels with sounding objects within a given video. This task necessitates a first-ever audio-driven pixel-level understanding of the scene, posing significant challenges. In this paper, we propose an innovative audio-visual transformer framework, termed COMBO, an acronym for COoperation of Multi-order Bilateral…

    Submitted 7 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Highlight. 13 pages, 10 figures

  2. arXiv:2312.06208  [pdf, other]

    nlin.PS

    Dark solitons and their bound states in a nonlinear fiber with second- and fourth-order dispersion

    Authors: Peng Gao, Li-Zheng Lv, Xin Li

    Abstract: We study the excitations of dark solitons in a nonlinear optical fiber with the second- and fourth-order dispersion, and find the emergence of striped dark solitons (SDSs) and some multi-dark-soliton bound states. The SDSs can exhibit time-domain oscillating structures on a plane wave, and they have two types: the ones with or without the total phase step, while the multi-dark-soliton bound states…

    Submitted 7 March, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures

  3. arXiv:2312.04547  [pdf, other]

    cs.CV cs.AI cs.GR cs.HC

    Digital Life Project: Autonomous 3D Characters with Social Intelligence

    Authors: Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

    Abstract: In this work, we present Digital Life Project, a framework utilizing language as the universal medium to build autonomous 3D characters, who are capable of engaging in social interactions and expressing with articulated body motions, thereby simulating life in a digital environment. Our framework comprises two primary components: 1) SocioMind: a meticulously crafted digital brain that models perso…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Homepage: https://digital-life-project.com/

  4. arXiv:2312.03700  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.MM

    OneLLM: One Framework to Align All Modalities with Language

    Authors: Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue

    Abstract: Multimodal large language models (MLLMs) have gained significant attention due to their strong multimodal understanding capability. However, existing works rely heavily on modality-specific encoders, which usually differ in architecture and are limited to common modalities. In this paper, we present OneLLM, an MLLM that aligns eight modalities to language using a unified framework. We achieve this…

    Submitted 9 January, 2025; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR 2024. Code: https://github.com/csuhan/OneLLM

  5. arXiv:2311.17963  [pdf, other]

    cs.CV

    M$^{2}$Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation

    Authors: Xiaowei Chi, Rongyu Zhang, Zhengkai Jiang, Yijiang Liu, Yatian Wang, Xingqun Qi, Wenhan Luo, Peng Gao, Shanghang Zhang, Qifeng Liu, Yike Guo

    Abstract: While current LLM chatbots like GPT-4V bridge the gap between human instructions and visual representations to enable text-image generations, they still lack efficient alignment methods for high-fidelity performance on multiple downstream tasks. In this paper, we propose M$^{2}$Chat, a novel unified multimodal LLM framework for generating interleaved text-image conversation across various…

    Submitted 13 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  6. arXiv:2311.16572   

    eess.SY physics.ao-ph physics.soc-ph

    Adapting to climate change: Long-term impact of wind resource changes on China's power system resilience

    Authors: Jiaqi Ruan, Xiangrui Meng, Yifan Zhu, Gaoqi Liang, Xianzhuo Sun, Huayi Wu, Huijuan Xiao, Mengqian Lu, Pin Gao, Jiapeng Li, Wai-Kin Wong, Zhao Xu, Junhua Zhao

    Abstract: Modern society's reliance on power systems is at risk from the escalating effects of wind-related climate change. Yet, failure to identify the intricate relationship between wind-related climate risks and power systems could lead to serious short- and long-term issues, including partial or complete blackouts. Here, we develop a comprehensive framework to assess China's power system resilience acro…

    Submitted 24 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Not suitable for publication

  7. arXiv:2311.12381  [pdf]

    physics.optics cond-mat.mes-hall cond-mat.mtrl-sci

    Room-temperature continuous-wave pumped exciton polariton condensation in a perovskite microcavity

    Authors: Jiepeng Song, Sanjib Ghosh, Xinyi Deng, Qiuyu Shang, Xinfeng Liu, Yubin Wang, Xiaoyue Gao, Wenkai Yang, Xianjin Wang, Qing Zhao, Kebin Shi, Peng Gao, Qihua Xiong, Qing Zhang

    Abstract: Microcavity exciton polaritons (polaritons) as part-light part-matter quasiparticles, garner significant attention for non-equilibrium Bose-Einstein condensation at elevated temperatures. Recently, halide perovskites have emerged as promising room-temperature polaritonic platforms thanks to their large exciton binding energies and superior optical properties. However, currently, inducing room-temp…

    Submitted 14 February, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: 16 pages, 4 figures

  8. arXiv:2311.08626  [pdf, ps, other]

    math.NT

    Ratios conjecture of cubic $L$-functions of prime moduli

    Authors: Peng Gao, Liangyi Zhao

    Abstract: We develop $L$-functions ratios conjecture with one shift in the numerator and denominator in certain ranges for the family of cubic Hecke $L$-functions of prime moduli over the Eisenstein field using multiple Dirichlet series under the generalized Riemann hypothesis. As applications, we evaluate asymptotically the first moment of central values as well as the one-level density of the same family…

    Submitted 26 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 14 pages

    MSC Class: 11M06; 11M41

  9. arXiv:2311.07575  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

    Authors: Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao

    Abstract: We present SPHINX, a versatile multi-modal large language model (MLLM) with a joint mixing of model weights, tuning tasks, and visual embeddings. First, for stronger vision-language alignment, we unfreeze the large language model (LLM) during pre-training, and introduce a weight mix strategy between LLMs trained by real-world and synthetic data. By directly integrating the weights from two domains…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Work in progress. Code and demos are released at https://github.com/Alpha-VLLM/LLaMA2-Accessory

  10. arXiv:2311.07023  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Photochemical Upcycling of Ultrastrong Polyethylene Nanomembranes into Fibrous Carbon at Ambient Conditions

    Authors: Yuexiang Sun, Xin Ma, Qiao Gu, Ping Gao

    Abstract: The escalating global issue of plastic waste accumulation, specifically polyolefins, necessitates an urgent solution for upcycling these materials into beneficial compounds. Yet, achieving such upcycling without introducing carbon dioxide into the environment remains a formidable challenge. In this study, we demonstrate an eco-friendly approach for the photochemical conversion of ultrastrong, ultr…

    Submitted 13 January, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

  11. arXiv:2311.05533  [pdf, ps, other]

    math.CO cs.DM

    Building Hamiltonian Cycles in the Semi-Random Graph Process in Less Than $2n$ Rounds

    Authors: Alan Frieze, Pu Gao, Calum MacRury, Paweł Prałat, Gregory Sorkin

    Abstract: The semi-random graph process is an adaptive random graph process in which an online algorithm is initially presented an empty graph on $n$ vertices. In each round, a vertex $u$ is presented to the algorithm independently and uniformly at random. The algorithm then adaptively selects a vertex $v$, and adds the edge $uv$ to the graph. For a given graph property, the objective of the algorithm is to…

    Submitted 20 December, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 29 pages. arXiv admin note: substantial text overlap with arXiv:2205.02350

  12. arXiv:2311.01778  [pdf, other]

    nlin.PS cond-mat.quant-gas

    Quantum scattering treatment on the time-domain diffraction of a matter-wave soliton

    Authors: Peng Gao, Jie Liu

    Abstract: We study the dynamics of the matter-wave soliton interacting with a vibrating mirror created by an evanescent light and provide a quantum scattering picture for the time-domain diffraction of the matter-wave soliton. Under Kramers-Henneberger (KH) transformation, i.e., in a vibrating coordinate, the vibration of the mirror can be cast to an effective gauge field. We then can exploit Dyson series a…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 10 pages, 4 figures

  13. arXiv:2310.20491  [pdf, other]

    cs.RO

    Collaborative Decision-Making Using Spatiotemporal Graphs in Connected Autonomy

    Authors: Peng Gao, Yu Shen, Ming C. Lin

    Abstract: Collaborative decision-making is an essential capability for multi-robot systems, such as connected vehicles, to collaboratively control autonomous vehicles in accident-prone scenarios. Under limited communication bandwidth, capturing comprehensive situational awareness by integrating connected agents' observation is very challenging. In this paper, we propose a novel collaborative decision-making…

    Submitted 31 October, 2023; originally announced October 2023.

  14. arXiv:2310.17043  [pdf, other]

    astro-ph.EP astro-ph.SR

    Quantifying the Transit Light Source Effect: Measurements of Spot Temperature and Coverage on the Photosphere of AU Microscopii with High-Resolution Spectroscopy and Multi-Color Photometry

    Authors: William Waalkes, Zachory Berta-Thompson, Elisabeth Newton, Andrew Mann, Peter Gao, Hannah Wakeford, Lili Alderson, Peter Plavchan

    Abstract: AU Mic is an active 24 Myr pre-main sequence M dwarf in the stellar neighborhood (d$=$9.7 pc) with a rotation period of 4.86 days. The two transiting planets orbiting AU Mic, AU Mic b and c, are warm sub-Neptunes on 8.5 and 18.9 day periods and are targets of interest for atmospheric observations of young planets. Here we study AU Mic's unocculted starspots using ground-based photometry and spectr…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 25 pages, 13 figures, Accepted to ApJ

  15. arXiv:2310.08358  [pdf, other]

    cs.LG

    Towards Demystifying the Generalization Behaviors When Neural Collapse Emerges

    Authors: Peifeng Gao, Qianqian Xu, Yibo Yang, Peisong Wen, Huiyang Shao, Zhiyong Yang, Bernard Ghanem, Qingming Huang

    Abstract: Neural Collapse (NC) is a well-known phenomenon of deep neural networks in the terminal phase of training (TPT). It is characterized by the collapse of features and classifier into a symmetrical structure, known as simplex equiangular tight frame (ETF). While there have been extensive studies on optimization characteristics showing the global optimality of neural collapse, little research has been…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 20 pages, 6 figures. arXiv admin note: substantial text overlap with arXiv:2304.08914

  16. arXiv:2310.06311  [pdf, other]

    cs.CV cs.MM

    Improving Compositional Text-to-image Generation with Large Vision-Language Models

    Authors: Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris Metaxas

    Abstract: Recent advancements in text-to-image models, particularly diffusion models, have shown significant promise. However, compositional text-to-image models frequently encounter difficulties in generating high-quality images that accurately align with input texts describing multiple objects, variable attributes, and intricate spatial relationships. To address this limitation, we employ large vision-lan…

    Submitted 10 October, 2023; originally announced October 2023.

  17. arXiv:2310.04180  [pdf, other]

    cs.CV

    Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution

    Authors: Qingguo Liu, Pan Gao, Kang Han, Ningzhong Liu, Wei Xiang

    Abstract: Compared to CNN-based methods, Transformer-based methods achieve impressive image restoration outcomes due to their abilities to model remote dependencies. However, how to apply Transformer-based methods to the field of blind super-resolution (SR) and further make an SR network adaptive to degradation information is still an open problem. In this paper, we propose a new degradation-aware self-atte…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 12 pages

  18. arXiv:2310.01443  [pdf, other]

    cs.LG cs.AI cs.ET quant-ph

    Quantum-Based Feature Selection for Multi-classification Problem in Complex Systems with Edge Computing

    Authors: Wenjie Liu, Junxiu Chen, Yuxiang Wang, Peipei Gao, Zhibin Lei, Xu Ma

    Abstract: The complex systems with edge computing require a huge amount of multi-feature data to extract appropriate insights for their decision making, so it is important to find a feasible feature selection method to improve the computational efficiency and save the resource consumption. In this paper, a quantum-based feature selection algorithm for the multi-classification problem, namely, QReliefF, is p…

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 22 pages, 11 figures

    Journal ref: Complexity, 2020.2020:p.8216874

  19. arXiv:2309.16917  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci

    Atomic-scale mechanism of enhanced electron-phonon coupling at the interface of MgB$_2$ thin film

    Authors: Xiaowen Zhang, Tiequan Xu, Ruochen Shi, Bo Han, Fachen Liu, Zhetong Liu, Xiaoyue Gao, Jinlong Du, Yue Wang, Peng Gao

    Abstract: In this study, we explore the heterointerface of MgB$_2$ film on SiC substrate at atomic scale using electron microscopy and spectroscopy. We detect ~1 nm MgO between MgB$_2$ and SiC. Atomic-level electron energy loss spectra (EELS) show MgB$_2$-E2g mode splitting and softening near the MgB$_2$/MgO interface. Orbital-resolved core-level EELS link the phonon softening to in-plane boron-atom electro…

    Submitted 28 September, 2023; originally announced September 2023.

  20. arXiv:2309.16583  [pdf, other]

    cs.CL

    GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond

    Authors: Shen Zheng, Yuyu Zhang, Yijie Zhu, Chenguang Xi, Pengyang Gao, Xun Zhou, Kevin Chen-Chuan Chang

    Abstract: With the rapid advancement of large language models (LLMs), there is a pressing need for a comprehensive evaluation suite to assess their capabilities and limitations. Existing LLM leaderboards often reference scores reported in other papers without consistent settings and prompts, which may inadvertently encourage cherry-picking favored settings and prompts for better results. In this work, we in…

    Submitted 1 April, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted by NAACL 2024

  21. arXiv:2309.14366  [pdf, other]

    quant-ph cs.AI cs.ET cs.LG

    A Unitary Weights Based One-Iteration Quantum Perceptron Algorithm for Non-Ideal Training Sets

    Authors: Wenjie Liu, Peipei Gao, Yuxiang Wang, Wenbin Yu, Maojun Zhang

    Abstract: In order to solve the problem of non-ideal training sets (i.e., the less-complete or over-complete sets) and implement one-iteration learning, a novel efficient quantum perceptron algorithm based on unitary weights is proposed, where the singular value decomposition of the total weight matrix from the training set is calculated to make the weight matrix to be unitary. The example validation of qua…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: 12 pages, 5 figures

    Journal ref: IEEE Access, 2019. 7: p. 36854-36865

  22. arXiv:2309.13811  [pdf, ps, other]

    math.NT

    Ratios conjecture for primitive quadratic Hecke $L$-functions

    Authors: Peng Gao, Liangyi Zhao

    Abstract: We develop the ratios conjecture with one shift in the numerator and denominator in certain ranges for families of primitive quadratic Hecke $L$-functions of imaginary quadratic number fields with class number one using multiple Dirichlet series under the generalized Riemann hypothesis. We also obtain unconditional asymptotic formulas for the first moments of central values of these families of…

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 16 pages

    MSC Class: 11M06; 11M41

  23. arXiv:2309.13193  [pdf, other]

    cs.HC

    SurrealDriver: Designing LLM-powered Generative Driver Agent Framework based on Human Drivers' Driving-thinking Data

    Authors: Ye Jin, Ruoxuan Yang, Zhijie Yi, Xiaoxi Shen, Huiling Peng, Xiaoan Liu, Jingli Qin, Jiayang Li, Jintao Xie, Peizhong Gao, Guyue Zhou, Jiangtao Gong

    Abstract: Leveraging advanced reasoning capabilities and extensive world knowledge of large language models (LLMs) to construct generative agents for solving complex real-world problems is a major trend. However, LLMs inherently lack embodiment as humans, resulting in suboptimal performance in many embodied decision-making tasks. In this paper, we introduce a framework for building human-like generative dri…

    Submitted 21 July, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures

    MSC Class: H.5.2

  24. An Efficient and Secure Arbitrary N-Party Quantum Key Agreement Protocol Using Bell States

    Authors: Wen-Jie Liu, Yong Xu, Ching-Nung Yang, Pei-Pei Gao, Wen-Bin Yu

    Abstract: Two quantum key agreement protocols using Bell states and Bell measurement were recently proposed by Shukla et al.(Quantum Inf. Process. 13(11), 2391-2405, 2014). However, Zhu et al. pointed out that there are some security flaws and proposed an improved version (Quantum Inf. Process. 14(11), 4245-4254, 2015). In this study, we will show Zhu et al.'s improvement still exists some security problems…

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 13 pages, 5 figures

    Journal ref: International Journal of Theoretical Physics, 2018. 57(1): p. 195-207

  25. arXiv:2309.12119  [pdf, other]

    stat.ME

    Pseudo-Bayesian unit level modeling for small area estimation under informative sampling

    Authors: Peter A. Gao, Jon Wakefield

    Abstract: When mapping subnational health and demographic indicators, direct weighted estimators of small area means based on household survey data can be unreliable when data are limited. If survey microdata are available, unit level models can relate individual survey responses to unit level auxiliary covariates and explicitly account for spatial dependence and between area variation using random effects.…

    Submitted 21 September, 2023; originally announced September 2023.

  26. arXiv:2309.10309  [pdf, other]

    cs.RO

    Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill

    Authors: Wenzhe Cai, Siyuan Huang, Guangran Cheng, Yuxing Long, Peng Gao, Changyin Sun, Hao Dong

    Abstract: Zero-shot object navigation is a challenging task for home-assistance robots. This task emphasizes visual grounding, commonsense inference and locomotion abilities, where the first two are inherent in foundation models. But for the locomotion part, most works still depend on map-based planning approaches. The gap between RGB space and map space makes it difficult to directly transfer the knowledge…

    Submitted 20 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures

  27. arXiv:2309.08365  [pdf, other]

    cs.CV cs.AI

    M$^3$Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection

    Authors: Yao Yuan, Pan Gao, XiaoYang Tan

    Abstract: Most existing salient object detection methods mostly use U-Net or feature pyramid structure, which simply aggregates feature maps of different scales, ignoring the uniqueness and interdependence of them and their respective contributions to the final prediction. To overcome these, we propose the M$^3$Net, i.e., the Multilevel, Mixed and Multistage attention network for Salient Object Detection (S…

    Submitted 15 September, 2023; originally announced September 2023.

  28. arXiv:2309.08214  [pdf, other]

    cs.RO

    MTG: Mapless Trajectory Generator with Traversability Coverage for Outdoor Navigation

    Authors: Jing Liang, Peng Gao, Xuesu Xiao, Adarsh Jagan Sathyamoorthy, Mohamed Elnoor, Ming C. Lin, Dinesh Manocha

    Abstract: We present a novel learning-based trajectory generation algorithm for outdoor robot navigation. Our goal is to compute collision-free paths that also satisfy the environment-specific traversability constraints. Our approach is designed for global planning using limited onboard robot perception in mapless environments while ensuring comprehensive coverage of all traversable directions. Our formulat…

    Submitted 4 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 9

  29. arXiv:2309.07688  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Nanoscale Cathodoluminescence Spectroscopy Probing the Nitride Quantum Wells in an Electron Microscope

    Authors: Zhetong Liu, Bingyao Liu, Dongdong Liang, Xiaomei Li, Xiaomin Li, Li Chen, Rui Zhu, Jun Xu, Tongbo Wei, Xuedong Bai, Peng Gao

    Abstract: To gain a deeper understanding of the luminescence of multiquantum wells and the factors affecting it on a microscopic level, cathodoluminescence combined with scanning transmission electron microscopy and spectroscopy was used to reveal the luminescence of In0.15Ga0.85N five-period multiquantum wells. The composition-wave-energy relationship was established in combination with energy-dispersive X…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 13 pages,4 figures

  30. arXiv:2309.06037  [pdf, other]

    astro-ph.HE astro-ph.GA gr-qc

    Fast resolving Galactic binaries in LISA data and its ability to study the Milky Way

    Authors: Pin Gao, Xi-Long Fan, Zhou-Jian Cao, Xue-Hao Zhang

    Abstract: Resolving individual gravitational waves from tens of millions of double white dwarf (DWD) binaries in the Milky Way is a challenge for future space-based gravitational wave detection programs. By using previous data to define the priors for the next search, we propose an accelerated approach of searching the DWD binaries and demonstrate its efficiency based on the GBSIEVER detection pipeline. Com…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 16 pages, 19 figures

    Journal ref: Phys. Rev. D 107, 123029, 2023

  31. arXiv:2309.05881  [pdf, ps, other]

    math.CO math.PR

    On the pre- and post-positional semi-random graph processes

    Authors: Pu Gao, Hidde Koerts

    Abstract: We study the semi-random graph process, and a variant process recently suggested by Nick Wormald. We show that these two processes are asymptotically equally fast in constructing a semi-random graph $G$ that has property ${\mathcal P}$, for the following examples of ${\mathcal P}$: - ${\mathcal P}$ is the set of graphs containing a $d$-degenerate subgraph, where $d\ge 1$ is fixed; -…

    Submitted 11 September, 2023; originally announced September 2023.

  32. arXiv:2309.03905  [pdf, other]

    cs.MM cs.CL cs.CV cs.LG cs.SD eess.AS

    ImageBind-LLM: Multi-modality Instruction Tuning

    Authors: Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao

    Abstract: We present ImageBind-LLM, a multi-modality instruction tuning method of large language models (LLMs) via ImageBind. Existing works mainly focus on language and image instruction tuning, different from which, our ImageBind-LLM can respond to multi-modality conditions, including audio, 3D point clouds, video, and their embedding-space arithmetic by only image-text alignment training. During training…

    Submitted 11 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Code is available at https://github.com/OpenGVLab/LLaMA-Adapter

  33. arXiv:2309.02714  [pdf]

    cond-mat.supr-con

    Atomic-scale observation of localized phonons at FeSe/SrTiO3 interface

    Authors: Ruochen Shi, Qize Li, Xiaofeng Xu, Bo Han, Ruixue Zhu, Fachen Liu, Ruishi Qi, Xiaowen Zhang, Jinlong Du, Ji Chen, Dapeng Yu, Xuetao Zhu, Jiandong Guo, Peng Gao

    Abstract: In single unit-cell FeSe grown on SrTiO3, the superconductivity transition temperature features a significant enhancement. Local phonon modes at the interface associated with electron-phonon coupling may play an important role in the interface-induced enhancement. However, such phonon modes have eluded direct experimental observations. Indeed, the complicated atomic structure of the interface brin…

    Submitted 6 September, 2023; originally announced September 2023.

    Journal ref: Nat Commun 15, 3418 (2024)

  34. arXiv:2309.00767  [pdf, other]

    physics.comp-ph cs.LG physics.chem-ph physics.flu-dyn

    Physics-informed machine learning of the correlation functions in bulk fluids

    Authors: Wenqian Chen, Peiyuan Gao, Panos Stinis

    Abstract: The Ornstein-Zernike (OZ) equation is the fundamental equation for pair correlation function computations in the modern integral equation theory for liquids. In this work, machine learning models, notably physics-informed neural networks and physics-informed neural operator networks, are explored to solve the OZ equation. The physics-informed machine learning models demonstrate great accuracy and…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 8 figures

    Report number: PNNL-SA-189736

  35. arXiv:2309.00615  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following

    Authors: Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng

    Abstract: We introduce Point-Bind, a 3D multi-modality model aligning point clouds with 2D image, language, audio, and video. Guided by ImageBind, we construct a joint embedding space between 3D and multi-modalities, enabling many promising applications, e.g., any-to-3D generation, 3D embedding arithmetic, and 3D open-world understanding. On top of this, we further present Point-LLM, the first 3D large lang…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: Work in progress. Code is available at https://github.com/ZiyuGuo99/Point-Bind_Point-LLM

  36. arXiv:2308.14482  [pdf, other]

    cs.CL

    An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation

    Authors: Pengzhi Gao, Ruiqing Zhang, Zhongjun He, Hua Wu, Haifeng Wang

    Abstract: Consistency regularization methods, such as R-Drop (Liang et al., 2021) and CrossConST (Gao et al., 2023), have achieved impressive supervised and zero-shot performance in the neural machine translation (NMT) field. Can we also boost end-to-end (E2E) speech-to-text translation (ST) by leveraging consistency regularization? In this paper, we conduct empirical studies on intra-modal and cross-modal…

    Submitted 28 August, 2023; originally announced August 2023.

  37. arXiv:2308.13137  [pdf, other]

    cs.LG cs.CL

    OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

    Authors: Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo

    Abstract: Large language models (LLMs) have revolutionized natural language processing tasks. However, their practical deployment is hindered by their immense memory and computation requirements. Although recent post-training quantization (PTQ) methods are effective in reducing memory footprint and improving the computational efficiency of LLM, they hand-craft quantization parameters, leading to low perform…

    Submitted 18 March, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 Camera Ready

  38. arXiv:2308.12961  [pdf, other]

    cs.CV

    Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

    Authors: Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao

    Abstract: To reduce the reliance on large-scale datasets, recent works in 3D segmentation resort to few-shot learning. Current 3D few-shot semantic segmentation methods first pre-train the models on `seen' classes, and then evaluate their generalization performance on `unseen' classes. However, the prior pre-training stage not only introduces excessive time overhead, but also incurs a significant domain gap…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Code is available at https://github.com/yangyangyang127/TFS3D

  39. arXiv:2308.11219  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Controlling the 2D magnetism of CrBr$_3$ by van der Waals stacking engineering

    Authors: Shiqi Yang, Xiaolong Xu, Bo Han, Pingfan Gu, Roger Guzman, Yiwen Song, Zhongchong Lin, Peng Gao, Wu Zhou, Jinbo Yang, Zuxin Chen, Yu Ye

    Abstract: The manipulation of two-dimensional (2D) magnetic order is of significant importance to facilitate future 2D magnets for low-power and high-speed spintronic devices. Van der Waals stacking engineering makes promises for controllable magnetism via interlayer magnetic coupling. However, directly examining the stacking order changes accompanying magnetic order transitions at the atomic scale and prep…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 7 pages, 4 figures

  40. arXiv:2308.11138  [pdf, ps, other]

    stat.ME cs.CL q-fin.RM stat.ML

    NLP-based detection of systematic anomalies among the narratives of consumer complaints

    Authors: Peiheng Gao, Ning Sun, Xuefeng Wang, Chen Yang, Ričardas Zitikis

    Abstract: We develop an NLP-based procedure for detecting systematic nonmeritorious consumer complaints, simply called systematic anomalies, among complaint narratives. While classification algorithms are used to detect pronounced anomalies, in the case of smaller and frequent systematic anomalies, the algorithms may falter due to a variety of reasons, including technical ones as well as natural limitations…

    Submitted 26 March, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

  41. First moment of central values of some primitive Dirichlet $L$-functions with fixed order characters

    Authors: Peng Gao, Liangyi Zhao

    Abstract: We evaluate asymptotically the smoothed first moment of central values of families of primitive cubic, quartic and sextic Dirichlet $L$-functions, using the method of double Dirichlet series. Quantitative non-vanishing results for these $L$-values are also proved.

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 11 pages. arXiv admin note: text overlap with arXiv:2306.10726

    MSC Class: 11M06; 11M41; 11N37; 11L05; 11L40

    Journal ref: J. Number Theory, vol. 261, 2024, pp. 125--142

  42. arXiv:2308.07583  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Atomic-Scale Tracking Phase Transition Dynamics of Berezinskii-Kosterlitz-Thouless Polar Vortex-Antivortex

    Authors: Ruixue Zhu, Sizheng Zheng, Xiaomei Li, Tao Wang, Congbing Tan, Tiancheng Yu, Zhetong Liu, Xinqiang Wang, Jiangyu Li, Jie Wang, Peng Gao

    Abstract: Particle-like topologies, such as vortex-antivortex (V-AV) pairs, have garnered significant attention in the field of condensed matter. However, the detailed phase transition dynamics of V-AV pairs, as exemplified by self-annihilation, motion, and dissociation, have yet to be verified in real space due to the lack of suitable experimental techniques. Here, we employ polar V-AV pairs as a model sys…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 19 pages and 4 figures

  43. arXiv:2308.04701  [pdf]

    cond-mat.mtrl-sci

    Direct and in situ examination of Li+ transport kinetics in isotope labelled solid electrolyte interphase

    Authors: Xiaofei Yu, Stefany Angarita-Gomez, Yaobin Xu, Peiyuan Gao, Jun-Gang Wang, Xin Zhang, Hao Jia, Wu Xu, Xiaolin Li, Yingge Du, Zhijie Xu, Janet S. Ho, Kang Xu, Perla B. Balbuena, Chongmin Wang, Zihua Zhu

    Abstract: Here, using unique in-situ liquid secondary ion mass spectroscopy on isotope-labelled solid-electrolyte-interphase (SEI), assisted by cryogenic transmission electron microscopy and constrained ab initio molecular dynamics simulation, for the first time we answer the question regarding Li+ transport mechanism across SEI, and quantitatively determine the Li+-mobility therein. We unequivocally unveil…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 25 pages, 4 figures

    MSC Class: None ACM Class: I.6.4

  44. arXiv:2308.03729  [pdf, other]

    cs.CV cs.AI

    TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models

    Authors: Wenqi Shao, Meng Lei, Yutao Hu, Peng Gao, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo

    Abstract: Recent advancements in Large Vision-Language Models (LVLMs) have demonstrated significant progress in tackling complex multimodal tasks. Among these cutting-edge developments, Google's Bard stands out for its remarkable multimodal capabilities, promoting comprehensive comprehension and reasoning across various domains. This work presents an early and holistic evaluation of LVLMs' multimodal abilit…

    Submitted 10 August, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: accepted to IEEE Transactions on Big Data. Project Page: http://lvlm-ehub.opengvlab.com/

  45. Model-independent search for the quasinormal modes of gravitational wave echoes

    Authors: Di Wu, Pengyuan Gao, Jing Ren, Niayesh Afshordi

    Abstract: Postmerger gravitational wave echoes provide a unique opportunity to probe the near-horizon structure of astrophysical black holes, which may be modified due to nonperturbative quantum gravity phenomena. However, since the waveform is subject to large theoretical uncertainties, it is necessary to develop search methods that are less reliant on specific models for detecting echoes from observationa…

    Submitted 20 December, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 46 pages, 19 figures, 4 tables. Python code to reproduce figures is available at the link http://github.com/hermione-evans/echomase; v2: typos fixed, matches published version in PRD

    Journal ref: Phys.Rev.D 108 (2023) 12, 124006

  46. arXiv:2307.16151  [pdf, other]

    cs.CV

    StylePrompter: All Styles Need Is Attention

    Authors: Chenyi Zhuang, Pan Gao, Aljosa Smolic

    Abstract: GAN inversion aims at inverting given images into corresponding latent codes for Generative Adversarial Networks (GANs), especially StyleGAN, where there exists a disentangled latent space that allows attribute-based image manipulation at the latent level. As most inversion methods build upon Convolutional Neural Networks (CNNs), we transfer a hierarchical vision Transformer backbone innovatively to predict…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Some figures in the appendix are compressed because of arXiv submission constraints

  47. arXiv:2307.16144  [pdf, other]

    cs.CV cs.MM

    Video Frame Interpolation with Flow Transformer

    Authors: Pan Gao, Haoyue Tian, Jie Qin

    Abstract: Video frame interpolation has been actively studied with the development of convolutional neural networks. However, due to the intrinsic limitations of kernel weight sharing in convolution, the interpolated frame generated by it may lose details. In contrast, the attention mechanism in Transformer can better distinguish the contribution of each pixel, and it can also capture long-range pixel depen…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Accepted to ACM MM23

  48. arXiv:2307.15685  [pdf, other]

    math.CO

    Minors of matroids represented by sparse random matrices over finite fields

    Authors: Pu Gao, Peter Nelson

    Abstract: Consider a random $n\times m$ matrix $A$ over the finite field of order $q$ where every column has precisely $k$ nonzero elements, and let $M[A]$ be the matroid represented by $A$. In the case that $q=2$, Cooper, Frieze and Pegden (RS\&A 2019) proved that given a fixed binary matroid $N$, if $k\ge k_N$ and $m/n\ge d_N$ where $k_N$ and $d_N$ are sufficiently large constants depending on $N$, then a.a.s…

    Submitted 18 January, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

  49. arXiv:2307.15024  [pdf, other]

    astro-ph.EP astro-ph.SR

    The Variable Detection of Atmospheric Escape around the young, Hot Neptune AU Mic b

    Authors: Keighley E. Rockcliffe, Elisabeth R. Newton, Allison Youngblood, Girish M. Duvvuri, Peter Plavchan, Peter Gao, Andrew W. Mann, Patrick J. Lowrance

    Abstract: Photoevaporation is a potential explanation for several features within exoplanet demographics. Atmospheric escape observed in young Neptune-sized exoplanets can provide insight into and characterize which mechanisms drive this evolution and at what times they dominate. AU Mic b is one such exoplanet, slightly larger than Neptune (4.19 Earth radii). It closely orbits a 23 Myr pre-Main Sequence M d…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 24 pages, 11 figures

    Journal ref: The Astronomical Journal, Volume 166, Number 2, 2023

  50. Probing reflection from aerosols with the near-infrared dayside spectrum of WASP-80b

    Authors: Bob Jacobs, Jean-Michel Désert, Peter Gao, Caroline V. Morley, Jacob Arcangeli, Saugata Barat, Mark S. Marley, Julianne I. Moses, Jonathan J. Fortney, Jacob L. Bean, Kevin B. Stevenson, Vatsal Panwar

    Abstract: The presence of aerosols is intimately linked to the global energy budget and the composition of a planet's atmospheres. Their ability to reflect incoming light prevents energy from being deposited into the atmosphere, and they shape spectra of exoplanets. We observed five near-infrared secondary eclipses of WASP-80b with the Wide Field Camera 3 (WFC3) aboard the \textit{Hubble Space Telescope} to…

    Submitted 26 October, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: Published in ApJ Letters (20 Oct 2023)

    Journal ref: ApJL 2023 Volume 956, Number 2, page L43