Skip to main content

Showing 1–50 of 67 results for author: Qin, D

.
  1. arXiv:2510.21858  [pdf, ps, other

    cs.LG cs.AI cs.CR

    Privacy-preserving Decision-focused Learning for Multi-energy Systems

    Authors: Yangze Zhou, Ruiyang Yao, Dalin Qin, Yixiong Jia, Yi Wang

    Abstract: Decision-making for multi-energy system (MES) dispatch depends on accurate load forecasting. Traditionally, load forecasting and decision-making for MES are implemented separately. Forecasting models are typically trained to minimize forecasting errors, overlooking their impact on downstream decision-making. To address this, decision-focused learning (DFL) has been studied to minimize decision-mak… ▽ More

    Submitted 23 October, 2025; originally announced October 2025.

    Comments: 10 pages, 7 figures

  2. arXiv:2510.20686  [pdf, ps, other

    quant-ph

    Classical Noise Inversion: A Practical and Optimal framework for Robust Quantum Applications

    Authors: Dayue Qin, Ying Li, You Zhou

    Abstract: Quantum error mitigation is a critical technology for extracting reliable computations from noisy quantum processors, proving itself essential not only in the near term but also as a valuable supplement to fully fault-tolerant systems in the future. However, its practical implementation is hampered by two major challenges: the expansive cost of sampling from quantum circuits and the reliance on un… ▽ More

    Submitted 23 October, 2025; originally announced October 2025.

  3. arXiv:2510.15086  [pdf, ps, other

    math.CO

    Stem-Symmetry, Comb Products, and their Relation to Amoeba Graphs

    Authors: Jillian Eddy, Ryan Pesak, Daniel Qin, Denae Ventura

    Abstract: Local and global amoebas are families of labeled graphs that satisfy interpolation properties on a fixed vertex set. A labeled graph $G$ on $n$ vertices is a local amoeba (resp. global amoeba) if there exists a sequence of feasible edge-replacements between any two labelled embeddings of $G$ into $K_n$ (resp. $K_{n+1}$). Here, a feasible edge-replacement removes an edge and reinserts it so that th… ▽ More

    Submitted 16 October, 2025; originally announced October 2025.

    MSC Class: 05C25

  4. arXiv:2510.07819  [pdf, ps, other

    math.CO

    Symmetric Lorentzian Polynomials

    Authors: Tracy Chin, Daniel Qin

    Abstract: We study the class of Lorentzian symmetric polynomials and Lorentzian symmetric functions, which are defined to be symmetric functions for which every truncation of variables is Lorentzian. Similar to the space of Lorentzian polynomials, we show that the space of Lorentzian symmetric polynomials is homeomorphic to a closed Euclidean ball. Our main result is a reduction scheme that significantly re… ▽ More

    Submitted 9 October, 2025; originally announced October 2025.

    Comments: 36 pages, 1 figure

  5. arXiv:2510.07704  [pdf, ps, other

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci

    Surface band-selective moiré effect induces flat band in mixed-dimensional heterostructures

    Authors: Shuming Yu, Zhentao Fu, Dingkun Qin, Enting Li, Hao Zhong, Xingzhe Wang, Keming Zhao, Shangkun Mo, Qiang Wan, Yiwei Li, Jie Li, Jianxin Zhong, Hong Ding, Nan Xu

    Abstract: In this work, we reveal a curious type of moiré effect that selectively modifies the surface states of bulk crystal. We synthesize mixed-dimensional heterostructures consisting of a noble gas monolayer grow on the surface of bulk Bi(111), and determine the electronic structure of the heterostructures using angle-resolved photoemission spectroscopy. We directly observe moiré replicas of the Bi(111)… ▽ More

    Submitted 8 October, 2025; originally announced October 2025.

    Comments: 5 pages, 4 figures

  6. arXiv:2509.18189  [pdf, ps, other

    cs.CV cs.AI

    Qianfan-VL: Domain-Enhanced Universal Vision-Language Models

    Authors: Daxiang Dong, Mingming Zheng, Dong Xu, Bairong Zhuang, Wenyu Zhang, Chunhua Luo, Haoran Wang, Zijian Zhao, Jie Li, Yuxuan Li, Hanjun Zhong, Mengyue Liu, Jieting Chen, Shupeng Li, Lun Tian, Yaping Feng, Xin Li, Donggang Jiang, Yong Chen, Yehua Xu, Duohao Qin, Chen Feng, Dan Wang, Henghua Zhang, Jingjing Ha , et al. (10 additional authors not shown)

    Abstract: We present Qianfan-VL, a series of multimodal large language models ranging from 3B to 70B parameters, achieving state-of-the-art performance through innovative domain enhancement techniques. Our approach employs multi-stage progressive training and high-precision data synthesis pipelines, which prove to be critical technologies for enhancing domain-specific capabilities while maintaining strong g… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: 12 pages

  7. arXiv:2509.05747  [pdf, ps, other

    cs.CV cs.AI cs.LG cs.MA cs.RO

    InterAct: A Large-Scale Dataset of Dynamic, Expressive and Interactive Activities between Two People in Daily Scenarios

    Authors: Leo Ho, Yinghao Huang, Dafei Qin, Mingyi Shi, Wangpok Tse, Wei Liu, Junichi Yamagishi, Taku Komura

    Abstract: We address the problem of accurate capture of interactive behaviors between two people in daily scenarios. Most previous works either only consider one person or solely focus on conversational gestures of two people, assuming the body orientation and/or position of each actor are constant or barely change over each interaction. In contrast, we propose to simultaneously model two people's activitie… ▽ More

    Submitted 6 September, 2025; originally announced September 2025.

    Comments: The first two authors contributed equally to this work

    ACM Class: I.5.4

    Journal ref: Proceedings of the ACM on Computer Graphics and Interactive Techniques 8.4 (2025) 53:1-27

  8. Realization of an untrusted intermediate relay architecture using a quantum dot single-photon source

    Authors: Mi Zou, Yu-Ming He, Yizhi Huang, Jun-Yi Zhao, Bin-Chen Li, Yong-Peng Guo, Xing Ding, Mo-Chi Xu, Run-Ze Liu, Geng-Yan Zou, Zhen Ning, Xiang You, Hui Wang, Wen-Xin Pan, Hao-Tao Zhu, Ming-Yang Zheng, Xiu-Ping Xie, Dandan Qin, Xiao Jiang, Yong-Heng Huo, Qiang Zhang, Chao-Yang Lu, Xiongfeng Ma, Teng-Yun Chen, Jian-Wei Pan

    Abstract: To fully exploit the potential of quantum technologies, quantum networks are needed to link different systems, significantly enhancing applications in computing, cryptography, and metrology. Central to these networks are quantum relays that can facilitate long-distance entanglement distribution and quantum communication. In this work, we present a modular and scalable quantum relay architecture us… ▽ More

    Submitted 29 August, 2025; originally announced August 2025.

    Comments: 29 pages,17 figures, 2 tables

  9. arXiv:2507.19050  [pdf, ps, other

    cs.NI

    Large Language Model-Based Task Offloading and Resource Allocation for Digital Twin Edge Computing Networks

    Authors: Qiong Wu, Yu Xie, Pingyi Fan, Dong Qin, Kezhi Wang, Nan Cheng, Khaled B. Letaief

    Abstract: In this paper, we propose a general digital twin edge computing network comprising multiple vehicles and a server. Each vehicle generates multiple computing tasks within a time slot, leading to queuing challenges when offloading tasks to the server. The study investigates task offloading strategies, queue stability, and resource allocation. Lyapunov optimization is employed to transform long-term… ▽ More

    Submitted 25 July, 2025; originally announced July 2025.

    Comments: This paper has been submitted to IEEE TMC

  10. arXiv:2507.14582  [pdf, ps, other

    cs.RO

    BT-TL-DMPs: A Novel Robot TAMP Framework Combining Behavior Tree, Temporal Logic and Dynamical Movement Primitives

    Authors: Zezhi Liu, Shizhen Wu, Hanqian Luo, Deyun Qin, Yongchun Fang

    Abstract: In the field of Learning from Demonstration (LfD), enabling robots to generalize learned manipulation skills to novel scenarios for long-horizon tasks remains challenging. Specifically, it is still difficult for robots to adapt the learned skills to new environments with different task and motion requirements, especially in long-horizon, multi-stage scenarios with intricate constraints. This paper… ▽ More

    Submitted 19 July, 2025; originally announced July 2025.

    Comments: 11 pages, 8 figures

  11. arXiv:2507.13237  [pdf, ps, other

    quant-ph

    Robust and efficient estimation of global quantum properties under realistic noise

    Authors: Qingyue Zhang, Dayue Qin, Zhou You, Feng Xu, Jens Eisert, You Zhou

    Abstract: Measuring global quantum properties -- such as the fidelity to complex multipartite states -- is both an essential and experimentally challenging task. Classical shadow estimation offers favorable sample complexity, but typically relies on many-qubit circuits that are difficult to realize on current platforms. We propose the robust phase shadow scheme, a measurement framework based on random circu… ▽ More

    Submitted 17 July, 2025; originally announced July 2025.

    Comments: 7+34 pages, 3+12 figures

  12. arXiv:2507.08419  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Observation of quasi-steady dark excitons and gap phase in a doped semiconductor

    Authors: Shangkun Mo, Yunfei Bai, Chunlong Wu, Xingxia Cui, Guangqiang Mei, Qiang Wan, Renzhe Li, Cao Peng, Keming Zhao, Dingkun Qin, Shuming Yu, Hao Zhong, Xingzhe Wang, Enting Li, Yiwei Li, Limin Cao, Min Feng, Sheng Meng, Nan Xu

    Abstract: Exciton plays an important role in optics and optics-related behaviors and leads to novel correlated phases like charge order, exciton insulator, and exciton-polariton condensation. Dark exciton shows distinct properties from bright one. However, it cannot be directly detected by conventional optic measurements. The electronic modulation effect of dark excitons in quasi-equilibrium distribution, c… ▽ More

    Submitted 11 July, 2025; originally announced July 2025.

    Comments: 16 pages, 5 figures

  13. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3410 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 16 October, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  14. arXiv:2505.16577  [pdf, other

    cs.LG

    Large Language Model-Empowered Interactive Load Forecasting

    Authors: Yu Zuo, Dalin Qin, Yi Wang

    Abstract: The growing complexity of power systems has made accurate load forecasting more important than ever. An increasing number of advanced load forecasting methods have been developed. However, the static design of current methods offers no mechanism for human-model interaction. As the primary users of forecasting models, system operators often find it difficult to understand and apply these advanced m… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  15. arXiv:2504.20839  [pdf, other

    cs.CL quant-ph

    Universal language model with the intervention of quantum theory

    Authors: D. -F. Qin

    Abstract: This paper examines language modeling based on the theory of quantum mechanics. It focuses on the introduction of quantum mechanics into the symbol-meaning pairs of language in order to build a representation model of natural language. At the same time, it is realized that word embedding, which is widely used as a basic technique for statistical language modeling, can be explained and improved by… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  16. arXiv:2504.07553  [pdf

    physics.bio-ph

    Single-Cell Trajectory Reconstruction Reveals Migration Potential of Cell Populations

    Authors: Yanping Liu, Dui Qin, Xinwei Li, Guoqiang Li, Zhichao Liu, Kena Song, Wei Wang, Zhangyong Li

    Abstract: Cell migration, which is strictly regulated by intracellular and extracellular cues, is crucial for normal physiological processes and the progression of certain diseases. However, there is a lack of an efficient approach to analyze super-statistical and time-varying characteristics of cell migration based on single trajectories. Here, we propose an approach to reconstruct single-cell trajectories… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  17. arXiv:2503.20743  [pdf, other

    math.DS

    Topology of The Polar Vortex and Montana Weather

    Authors: Joshua Dorrington, Sushovan Majhi, Atish Mitra, James Moukheiber, Demi Qin, Jacob Sriraman, Kristian Strommen

    Abstract: This paper explores the use of Topological Data Analysis (TDA) to investigate patterns in zonal-mean zonal winds of the Arctic, which make up the polar vortex, in order to better explain polar vortex dynamics. We demonstrate how TDA reveals significant topological features in this polar vortex data, and how they may relate these features to the collapse of the stratospheric vortex during the winte… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    MSC Class: 37N10

  18. arXiv:2412.02419  [pdf, other

    cs.SD cs.CV cs.GR cs.MM eess.AS

    It Takes Two: Real-time Co-Speech Two-person's Interaction Generation via Reactive Auto-regressive Diffusion Model

    Authors: Mingyi Shi, Dafei Qin, Leo Ho, Zhouyingcheng Liao, Yinghao Huang, Junichi Yamagishi, Taku Komura

    Abstract: Conversational scenarios are very common in real-world settings, yet existing co-speech motion synthesis approaches often fall short in these contexts, where one person's audio and gestures will influence the other's responses. Additionally, most existing methods rely on offline sequence-to-sequence frameworks, which are unsuitable for online applications. In this work, we introduce an audio-drive… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 15 pages, 10 figures

  19. arXiv:2412.01654  [pdf, other

    cs.LG

    FSMLP: Modelling Channel Dependencies With Simplex Theory Based Multi-Layer Perceptions In Frequency Domain

    Authors: Zhengnan Li, Haoxuan Li, Hao Wang, Jun Fang, Duoyin Li Yunxiao Qin

    Abstract: Time series forecasting (TSF) plays a crucial role in various domains, including web data analysis, energy consumption prediction, and weather forecasting. While Multi-Layer Perceptrons (MLPs) are lightweight and effective for capturing temporal dependencies, they are prone to overfitting when used to model inter-channel dependencies. In this paper, we investigate the overfitting problem in channe… ▽ More

    Submitted 2 December, 2024; v1 submitted 2 December, 2024; originally announced December 2024.

  20. arXiv:2411.15764  [pdf, other

    cs.LG eess.SP

    LLM Online Spatial-temporal Signal Reconstruction Under Noise

    Authors: Yi Yan, Dayu Qin, Ercan Engin Kuruoglu

    Abstract: This work introduces the LLM Online Spatial-temporal Reconstruction (LLM-OSR) framework, which integrates Graph Signal Processing (GSP) and Large Language Models (LLMs) for online spatial-temporal signal reconstruction. The LLM-OSR utilizes a GSP-based spatial-temporal signal handler to enhance graph signals and employs LLMs to predict missing values based on spatiotemporal patterns. The performan… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

  21. arXiv:2410.18718  [pdf, other

    cs.AI

    LLM-based Online Prediction of Time-varying Graph Signals

    Authors: Dayu Qin, Yi Yan, Ercan Engin Kuruoglu

    Abstract: In this paper, we propose a novel framework that leverages large language models (LLMs) for predicting missing values in time-varying graph signals by exploiting spatial and temporal smoothness. We leverage the power of LLM to achieve a message-passing scheme. For each missing node, its neighbors and previous estimates are fed into and processed by LLM to infer the missing observations. Tested on… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

  22. arXiv:2410.16874  [pdf, other

    q-bio.NC q-bio.QM

    Topological and Graph Theoretical Analysis of Dynamic Functional Connectivity for Autism Spectrum Disorder

    Authors: Yuzhe Chen, Dayu Qin, Ercan Engin Kuruoglu

    Abstract: Autism Spectrum Disorder (ASD) is a prevalent neurological disorder. However, the multi-faceted symptoms and large individual differences among ASD patients are hindering the diagnosis process, which largely relies on subject descriptions and lacks quantitative biomarkers. To remediate such problems, this paper explores the use of graph theory and topological data analysis (TDA) to study brain act… ▽ More

    Submitted 8 November, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

    Comments: Accepted by the Brain Informatics 2024 Conference. This is the final version of the paper for the conference. First author: Yuzhe Chen. Second author: Dayu Qin. Third & Corresponding author: Ercan Engin Kuruoglu

  23. arXiv:2410.14142  [pdf, ps, other

    cs.IT

    Secure Collaborative Computation Offloading and Resource Allocation in Cache-Assisted Ultra-Dense IoT Networks With Multi-Slope Channels

    Authors: Tianqing Zhou, Bobo Wang, Dong Qin, Xuefang Nie, Nan Jiang, Chunguo Li

    Abstract: Cache-assisted ultra-dense mobile edge computing (MEC) networks are a promising solution for meeting the increasing demands of numerous Internet-of-Things mobile devices (IMDs). To address the complex interferences caused by small base stations (SBSs) deployed densely in such networks, this paper explores the combination of orthogonal frequency division multiple access (OFDMA), non-orthogonal mult… ▽ More

    Submitted 21 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

  24. arXiv:2410.12186  [pdf, ps, other

    cs.IT

    Joint Data Compression, Secure Multi-Part Collaborative Task Offloading and Resource Assignment in Ultra-Dense Networks

    Authors: Tianqing Zhou, Kangle Liu, Dong Qin, Xuan Li, Nan Jiang, Chunguo Li

    Abstract: To enhance resource utilization and address interference issues in ultra-dense networks with mobile edge computing (MEC), a resource utilization approach is first introduced, which integrates orthogonal frequency division multiple access (OFDMA) and non-orthogonal multiple access (NOMA). Then, to minimize the energy consumed by ultra-densely deployed small base stations (SBSs) while ensuring propo… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  25. arXiv:2409.19718  [pdf, other

    cs.LG stat.ML

    Evolving Multi-Scale Normalization for Time Series Forecasting under Distribution Shifts

    Authors: Dalin Qin, Yehui Li, Weiqi Chen, Zhaoyang Zhu, Qingsong Wen, Liang Sun, Pierre Pinson, Yi Wang

    Abstract: Complex distribution shifts are the main obstacle to achieving accurate long-term time series forecasting. Several efforts have been conducted to capture the distribution characteristics and propose adaptive normalization techniques to alleviate the influence of distribution shifts. However, these methods neglect the intricate distribution dynamics observed from various scales and the evolving fun… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

  26. arXiv:2409.18018  [pdf

    cond-mat.mtrl-sci

    Molecular dynamics simulations of interaction between a super edge dislocation and interstitial dislocation loops in irradiated L12-Ni3Al

    Authors: Cheng Chen, Dongyang Qin, Yiding Wang, Fei Xu, Jun Song

    Abstract: The study employed MD simulations to investigate the interactions between a <110> super-edge dislocation, consisting of the four Shockley partials, and interstitial dislocation loops (IDLs) in irradiated L12-Ni3Al. Accounting for symmetry breakage in the L12 lattice, the superlattice planar faults with four distinct fault vectors have been considered for different IDL configurations. The detailed… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 25pages,10 figures

  27. arXiv:2409.07441  [pdf, other

    cs.GR

    Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering

    Authors: Dafei Qin, Hongyang Lin, Qixuan Zhang, Kaichun Qiao, Longwen Zhang, Zijun Zhao, Jun Saito, Jingyi Yu, Lan Xu, Taku Komura

    Abstract: We propose GauFace, a novel Gaussian Splatting representation, tailored for efficient animation and rendering of physically-based facial assets. Leveraging strong geometric priors and constrained optimization, GauFace ensures a neat and structured Gaussian representation, delivering high fidelity and real-time facial interaction of 30fps@1440p on a Snapdragon 8 Gen 2 mobile platform. Then, we in… ▽ More

    Submitted 30 September, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: Project Page: https://dafei-qin.github.io/TransGS.github.io/

  28. arXiv:2409.01039  [pdf, other

    q-bio.QM

    Exploring Neurofunctional Phase Transition Patterns in Autism Spectrum Disorder: A Thermodynamics Parameters Analysis Approach

    Authors: Dayu Qin, Yuzhe Chen, Ercan Engin Kuruoglu

    Abstract: Designing network parameters that can effectively represent complex networks is of significant importance for the analysis of time-varying complex networks. This paper introduces a novel thermodynamic framework for analyzing complex networks, focusing on Spectral Core Entropy (SCE), Node Energy, internal energy and temperature to measure structural changes in dynamic complex network. This framewor… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  29. arXiv:2407.13292  [pdf, other

    cs.SD cs.CL eess.AS

    Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training

    Authors: Lukuan Dong, Donghong Qin, Fengbo Bai, Fanhua Song, Yan Liu, Chen Xu, Zhijian Ou

    Abstract: The mainstream automatic speech recognition (ASR) technology usually requires hundreds to thousands of hours of annotated speech data. Three approaches to low-resourced ASR are phoneme or subword based supervised pre-training, and self-supervised pre-training over multilingual data. The Iu Mien language is the main ethnic language of the Yao ethnic group in China and is low-resourced in the sense… ▽ More

    Submitted 16 September, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted into ISCSLP 2024

  30. arXiv:2405.11690  [pdf, other

    cs.CV

    InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios

    Authors: Yinghao Huang, Leo Ho, Dafei Qin, Mingyi Shi, Taku Komura

    Abstract: We address the problem of accurate capture and expressive modelling of interactive behaviors happening between two persons in daily scenarios. Different from previous works which either only consider one person or focus on conversational gestures, we propose to simultaneously model the activities of two persons, and target objective-driven, dynamic, and coherent interactions which often span long… ▽ More

    Submitted 27 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: The first two authors contributed equally to this work

  31. arXiv:2404.13605  [pdf, other

    cs.CV eess.IV

    Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence

    Authors: Ripon Kumar Saha, Dehao Qin, Nianyi Li, Jinwei Ye, Suren Jayasuriya

    Abstract: Tackling image degradation due to atmospheric turbulence, particularly in dynamic environment, remains a challenge for long-range imaging systems. Existing techniques have been primarily designed for static scenes or scenes with small motion. This paper presents the first segment-then-restore pipeline for restoring the videos of dynamic scenes in turbulent environment. We leverage mean optical flo… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Paper

  32. arXiv:2404.10518  [pdf, other

    cs.CV

    MobileNetV4 -- Universal Models for the Mobile Ecosystem

    Authors: Danfeng Qin, Chas Leichner, Manolis Delakis, Marco Fornoni, Shixin Luo, Fan Yang, Weijun Wang, Colby Banbury, Chengxi Ye, Berkin Akin, Vaibhav Aggarwal, Tenghui Zhu, Daniele Moro, Andrew Howard

    Abstract: We present the latest generation of MobileNets, known as MobileNetV4 (MNv4), featuring universally efficient architecture designs for mobile devices. At its core, we introduce the Universal Inverted Bottleneck (UIB) search block, a unified and flexible structure that merges Inverted Bottleneck (IB), ConvNext, Feed Forward Network (FFN), and a novel Extra Depthwise (ExtraDW) variant. Alongside UIB,… ▽ More

    Submitted 29 September, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  33. arXiv:2404.10512  [pdf

    cs.LG

    Four-hour thunderstorm nowcasting using deep diffusion models of satellite

    Authors: Kuai Dai, Xutao Li, Junying Fang, Yunming Ye, Demin Yu, Hui Su, Di Xian, Danyu Qin, Jingsong Wang

    Abstract: Convection (thunderstorm) develops rapidly within hours and is highly destructive, posing a significant challenge for nowcasting and resulting in substantial losses to nature and society. After the emergence of artificial intelligence (AI)-based methods, convection nowcasting has experienced rapid advancements, with its performance surpassing that of physics-based numerical weather prediction and… ▽ More

    Submitted 1 June, 2025; v1 submitted 16 April, 2024; originally announced April 2024.

  34. arXiv:2403.14949  [pdf, other

    cs.LG

    Addressing Concept Shift in Online Time Series Forecasting: Detect-then-Adapt

    Authors: YiFan Zhang, Weiqi Chen, Zhaoyang Zhu, Dalin Qin, Liang Sun, Xue Wang, Qingsong Wen, Zhang Zhang, Liang Wang, Rong Jin

    Abstract: Online updating of time series forecasting models aims to tackle the challenge of concept drifting by adjusting forecasting models based on streaming data. While numerous algorithms have been developed, most of them focus on model design and updating. In practice, many of these methods struggle with continuous performance regression in the face of accumulated concept drifts over time. To address t… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 7 figures, 14 pages. arXiv admin note: text overlap with arXiv:2309.12659

  35. arXiv:2403.11416  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Surface region band enhancement in noble gas adsorption assisted ARPES on kagome superconductor RbV3Sb5

    Authors: Cao Peng, Yiwei Li, Xu Chen, Shenghao Dai, Zewen Wu, Chunlong Wu, Qiang Wan, Keming Zhao, Renzhe Li, Shangkun Mo, Dingkun Qin, Shuming Yu, Hao Zhong, Shengjun Yuan, Jiangang Guo, Nan Xu

    Abstract: Electronic states near surface regions can be distinct from bulk states, which are paramount in understanding various physical phenomena occurring at surfaces and in applications in semiconductors, energy, and catalysis. Here, we report an abnormal surface region band enhancement effect in angle-resolved photoemission spectroscopy on kagome superconductor RbV3Sb5, by depositing noble gases with fi… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 17 pages,4 figures

    Journal ref: Phys. Rev. B 109, 115415 (2024)

  36. arXiv:2402.16430  [pdf, other

    cs.CR cs.HC

    Improving behavior based authentication against adversarial attack using XAI

    Authors: Dong Qin, George Amariucai, Daji Qiao, Yong Guan

    Abstract: In recent years, machine learning models, especially deep neural networks, have been widely used for classification tasks in the security domain. However, these models have been shown to be vulnerable to adversarial manipulation: small changes learned by an adversarial attack model, when applied to the input, can cause significant changes in the output. Most research on adversarial attacks and cor… ▽ More

    Submitted 10 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  37. arXiv:2402.04671  [pdf, other

    cs.CV

    V2VSSC: A 3D Semantic Scene Completion Benchmark for Perception with Vehicle to Vehicle Communication

    Authors: Yuanfang Zhang, Junxuan Li, Kaiqing Luo, Yiying Yang, Jiayi Han, Nian Liu, Denghui Qin, Peng Han, Chengpei Xu

    Abstract: Semantic scene completion (SSC) has recently gained popularity because it can provide both semantic and geometric information that can be used directly for autonomous vehicle navigation. However, there are still challenges to overcome. SSC is often hampered by occlusion and short-range perception due to sensor limitations, which can pose safety risks. This paper proposes a fundamental solution to… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  38. Prospective Prediction of Body Mass Index Trajectories using Multi-task Gaussian Processes

    Authors: Arthur Leroy, Varsha Gupta, Mya Thway Tint, Delicia Ooi Shu Qin, Keith M. Godfrey, Fabian Yap, Leck Ngee, Yung Seng Lee, Johan G. Eriksson, Navin Michael, Mauricio A. Alvarez, Dennis Wang

    Abstract: Clinicians often investigate the body mass index (BMI) trajectories of children to assess their growth with respect to their peers, as well as to anticipate future growth and disease risk. While retrospective modelling of BMI trajectories has been an active area of research, prospective prediction of continuous BMI trajectories from historical growth data has not been well investigated. Using weig… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 17 pages, 9 figures, 5 tables

    Journal ref: International Journal of Obesity, 2025, volume 49

  39. arXiv:2401.15687  [pdf, other

    cs.CV cs.GR

    Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

    Authors: Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, Jingyi Yu, Lan Xu

    Abstract: The synthesis of 3D facial animations from speech has garnered considerable attention. Due to the scarcity of high-quality 4D facial data and well-annotated abundant multi-modality labels, previous methods often suffer from limited realism and a lack of lexible conditioning. We address this challenge through a trilogy. We first introduce Generalized Neural Parametric Facial Asset (GNPFA), an effic… ▽ More

    Submitted 30 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Project Page: https://sites.google.com/view/media2face

  40. arXiv:2311.03863  [pdf

    eess.SY cs.LG

    An Explainable Framework for Machine learning-Based Reactive Power Optimization of Distribution Network

    Authors: Wenlong Liao, Benjamin Schäfer, Dalin Qin, Gonghao Zhang, Zhixian Wang, Zhe Yang

    Abstract: To reduce the heavy computational burden of reactive power optimization of distribution networks, machine learning models are receiving increasing attention. However, most machine learning models (e.g., neural networks) are usually considered as black boxes, making it challenging for power system operators to identify and comprehend potential biases or errors in the decision-making process of mach… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: It was submitted to the 23rd Power Systems Computation Conference (PSCC 2024) on Sept.2023

  41. arXiv:2311.03572  [pdf, other

    cs.CV

    Unsupervised Region-Growing Network for Object Segmentation in Atmospheric Turbulence

    Authors: Dehao Qin, Ripon Saha, Suren Jayasuriya, Jinwei Ye, Nianyi Li

    Abstract: Moving object segmentation in the presence of atmospheric turbulence is highly challenging due to turbulence-induced irregular and time-varying distortions. In this paper, we present an unsupervised approach for segmenting moving objects in videos downgraded by atmospheric turbulence. Our key approach is a detect-then-grow scheme: we first identify a small set of moving object pixels with high con… ▽ More

    Submitted 4 August, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  42. arXiv:2310.17945  [pdf, other

    cs.LG cs.AI

    A Comprehensive and Reliable Feature Attribution Method: Double-sided Remove and Reconstruct (DoRaR)

    Authors: Dong Qin, George Amariucai, Daji Qiao, Yong Guan, Shen Fu

    Abstract: The limited transparency of the inner decision-making mechanism in deep neural networks (DNN) and other machine learning (ML) models has hindered their application in several domains. In order to tackle this issue, feature attribution methods have been developed to identify the crucial features that heavily influence decisions made by these black box models. However, many feature attribution metho… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 16 pages, 22 figures

  43. arXiv:2310.06851  [pdf, other

    cs.CV cs.AI cs.GR

    BodyFormer: Semantics-guided 3D Body Gesture Synthesis with Transformer

    Authors: Kunkun Pang, Dafei Qin, Yingruo Fan, Julian Habekost, Takaaki Shiratori, Junichi Yamagishi, Taku Komura

    Abstract: Automatic gesture synthesis from speech is a topic that has attracted researchers for applications in remote communication, video games and Metaverse. Learning the mapping between speech and 3D full-body gestures is difficult due to the stochastic nature of the problem and the lack of a rich cross-modal dataset that is needed for training. In this paper, we propose a novel transformer-based framew… ▽ More

    Submitted 6 September, 2023; originally announced October 2023.

    Comments: 12 pages, 13 figures

  44. arXiv:2305.08296  [pdf, other

    cs.GR cs.AI

    Neural Face Rigging for Animating and Retargeting Facial Meshes in the Wild

    Authors: Dafei Qin, Jun Saito, Noam Aigerman, Thibault Groueix, Taku Komura

    Abstract: We propose an end-to-end deep-learning approach for automatic rigging and retargeting of 3D models of human faces in the wild. Our approach, called Neural Face Rigging (NFR), holds three key properties: (i) NFR's expression space maintains human-interpretable editing parameters for artistic controls; (ii) NFR is readily applicable to arbitrary facial meshes with different connectivity and expr… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH 2023(Conference Track), 13 pages, 15 figures

  45. arXiv:2303.06353  [pdf, ps, other

    cs.IT

    Secure and Multi-Step Computation Offloading and Resource Allocation in Ultra-Dense Multi-Task NOMA-Enabled IoT Networks

    Authors: Tianqing Zhou, Yanyan Fu, Dong Qin, Xuefang Nie, Nan Jiang, Chunguo Li

    Abstract: Ultra-dense networks are widely regarded as a promising solution to explosively growing applications of Internet-of-Things (IoT) mobile devices (IMDs). However, complicated and severe interferences need to be tackled properly in such networks. To this end, both orthogonal multiple access (OMA) and non-orthogonal multiple access (NOMA) are utilized at first. Then, in order to attain a goal of green… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  46. arXiv:2302.09975  [pdf, other

    physics.flu-dyn

    Optimal energy harvesting efficiency from vortex-induced vibration of a circular cylinder under flow

    Authors: Peng Han, Qiaogao Huang, Guang Pan, Denghui Qin, Wei Wang, Rodolfo T. Gonçalves, Jisheng Zhao

    Abstract: This work applies a combined approach a reduced-order model (ROM) together with experiments and direct numerical simulations to investigate the optimal efficiency of fluid-flow energy harvesting from transverse vortex-induced vibration (VIV) of a circular cylinder. High resolution efficiency maps were predicted over wide ranges of flow reduced velocities and structural damping ratios, and the maxi… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  47. arXiv:2212.01336  [pdf

    physics.soc-ph

    The Influence of Cultural Distance on Settlement Intention of Floating Population in China

    Authors: Dan Qin

    Abstract: Based on a nationwide labour-force survey data, this paper investigates the influence of cultural variance on migrants' settlement intention in China. By using dialectal distance as a proxy for cultural distance, we find strong evidence for the negative effects of cultural distance on migrants' settlement intention. By further investigation into sub-samples separated by gender, generation and high… ▽ More

    Submitted 8 November, 2022; originally announced December 2022.

  48. arXiv:2211.04031  [pdf, other

    cs.CV cs.AI

    Hilbert Distillation for Cross-Dimensionality Networks

    Authors: Dian Qin, Haishuai Wang, Zhe Liu, Hongjia Xu, Sheng Zhou, Jiajun Bu

    Abstract: 3D convolutional neural networks have revealed superior performance in processing volumetric data such as video and medical imaging. However, the competitive performance by leveraging 3D networks results in huge computational costs, which are far beyond that of 2D networks. In this paper, we propose a novel Hilbert curve-based cross-dimensionality distillation approach that facilitates the knowled… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted at NeurIPS 2022

  49. Ultrafast reversible self-assembly of living tangled matter

    Authors: Vishal P. Patil, Harry Tuazon, Emily Kaufman, Tuhin Chakrabortty, David Qin, Jörn Dunkel, M. Saad Bhamla

    Abstract: Tangled active filaments are ubiquitous in nature, from chromosomal DNA and cilia carpets to root networks and worm blobs. How activity and elasticity facilitate collective topological transformations in living tangled matter is not well understood. Here, we report an experimental and theoretical study of California blackworms (Lumbriculus variegatus), which slowly form tangles over minutes but ca… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  50. arXiv:2207.06629  [pdf

    cond-mat.supr-con

    K-doped Ba122 epitaxial thin film on MgO substrate by buffer engineering

    Authors: Dongyi Qin, Kazumasa Iida, Zimeng Guo, Chao Wang, Hikaru Saito, Satoshi Hata, Michio Naito, Akiyasu Yamamoto

    Abstract: Molecular beam epitaxy of K-doped Ba122 (Ba$_{1-x}$K$_x$Fe$_\text{2}$As$_\text{2}$) superconductor was realized on a MgO substrate. Microstructural observation revealed that the undoped Ba122 served as a perfect buffer layer for epitaxial growth of the K-doped Ba122. The film exhibited a high critical temperature of 39.8 K and a high critical current density of 3.9 MA/cm$^\text{2}$ at 4 K. The suc… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: 5 pages, 4 figures, accepted manuscript Supercond. Sci. Technol 2022