Search | arXiv e-print repository

The Smith normal form of the Q-walk matrix of the Dynkin graph $A_n$

Abstract: In this paper, we give an explicit formula for the rank of the $Q$-walk matrix of the Dynkin graph $A_n$. Moreover, we prove that its Smith normal form is $$ \mathrm{diag}\left( \underset{r=\lceil \frac{n}{2} \rceil}{\underbrace{1,2,2,...,2}},0,...,0 \right), $$ where $r$ is the rank of the $Q$-walk matrix $W_Q\left( A_n \right) $ of the Dynkin graph $A_n$. In this paper, we give an explicit formula for the rank of the $Q$-walk matrix of the Dynkin graph $A_n$. Moreover, we prove that its Smith normal form is $$ \mathrm{diag}\left( \underset{r=\lceil \frac{n}{2} \rceil}{\underbrace{1,2,2,...,2}},0,...,0 \right), $$ where $r$ is the rank of the $Q$-walk matrix $W_Q\left( A_n \right) $ of the Dynkin graph $A_n$. △ Less

Submitted 23 November, 2024; originally announced November 2024.

Comments: All commets are welcome

arXiv:2411.17095 [pdf]

Finite Volume Physical Informed Neural Network (FV-PINN) with Reduced Derivative Order for Incompressible Flows

Authors: Zijie Su, Yunpu Liu, Sheng Pan, Zheng Li, Changyu Shen

Abstract: Physics-Informed Neural Networks (PINN) has evolved into a powerful tool for solving partial differential equations, which has been applied to various fields such as energy, environment, en-gineering, etc. When utilizing PINN to solve partial differential equations, it is common to rely on Automatic Differentiation (AD) to compute the residuals of the governing equations. This can lead to certain… ▽ More Physics-Informed Neural Networks (PINN) has evolved into a powerful tool for solving partial differential equations, which has been applied to various fields such as energy, environment, en-gineering, etc. When utilizing PINN to solve partial differential equations, it is common to rely on Automatic Differentiation (AD) to compute the residuals of the governing equations. This can lead to certain precision losses, thus affecting the accuracy of the network prediction. This paper pro-poses a Finite Volume Physics-Informed Neural Network (FV-PINN), designed to address steady-state problems of incompressible flow. This method divides the solution domain into mul-tiple grids. Instead of calculating the residuals of the Navier-Stokes equations at collocation points within the grid, as is common in traditional PINNs, this approach evaluates them at Gaussian in-tegral points on the grid boundaries using Gauss's theorem. The loss function is constructed using the Gaussian integral method, and the differentiation order for velocity is reduced. To validate the effectiveness of this approach, we predict the velocity and pressure fields for two typical examples in fluid topology optimization. The results are compared with commercial software COMSOL, which indicates that FVI-PINN significantly improves the prediction accuracy of both the velocity and pressure fields while accelerating the training speed of the network. △ Less

Submitted 25 November, 2024; originally announced November 2024.

arXiv:2411.16144 [pdf, other]

Using Drone Swarm to Stop Wildfire: A Predict-then-optimize Approach

Authors: Shijie Pan, Aoran Cheng, Yiqi Sun, Kai Kang, Cristobal Pais, Yulun Zhou, Zuo-Jun Max Shen

Abstract: Drone swarms coupled with data intelligence can be the future of wildfire fighting. However, drone swarm firefighting faces enormous challenges, such as the highly complex environmental conditions in wildfire scenes, the highly dynamic nature of wildfire spread, and the significant computational complexity of drone swarm operations. We develop a predict-then-optimize approach to address these chal… ▽ More Drone swarms coupled with data intelligence can be the future of wildfire fighting. However, drone swarm firefighting faces enormous challenges, such as the highly complex environmental conditions in wildfire scenes, the highly dynamic nature of wildfire spread, and the significant computational complexity of drone swarm operations. We develop a predict-then-optimize approach to address these challenges to enable effective drone swarm firefighting. First, we construct wildfire spread prediction convex neural network (Convex-NN) models based on real wildfire data. Then, we propose a mixed-integer programming (MIP) model coupled with dynamic programming (DP) to enable efficient drone swarm task planning. We further use chance-constrained robust optimization (CCRO) to ensure robust firefighting performances under varying situations. The formulated model is solved efficiently using Benders Decomposition and Branch-and-Cut algorithms. After 75 simulated wildfire environments training, the MIP+CCRO approach shows the best performance among several testing sets, reducing movements by 37.3\% compared to the plain MIP. It also significantly outperformed the GA baseline, which often failed to fully extinguish the fire. Eventually, we will conduct real-world fire spread and quenching experiments in the next stage for further validation. △ Less

Submitted 25 November, 2024; originally announced November 2024.

arXiv:2411.14507 [pdf, other]

FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers

Authors: Zehua Pei, Hui-Ling Zhen, Xianzhi Yu, Sinno Jialin Pan, Mingxuan Yuan, Bei Yu

Abstract: Generative Pre-trained Transformers (GPTs) have demonstrated remarkable performance across diverse domains, largely due to the extensive scaling of model parameters. Recent works have observed redundancy within transformer blocks and developed compression methods by structured pruning of less important blocks. However, such direct removal often leads to irreversible performance degradation. In thi… ▽ More Generative Pre-trained Transformers (GPTs) have demonstrated remarkable performance across diverse domains, largely due to the extensive scaling of model parameters. Recent works have observed redundancy within transformer blocks and developed compression methods by structured pruning of less important blocks. However, such direct removal often leads to irreversible performance degradation. In this paper, we propose FuseGPT, a novel methodology designed to recycle pruned transformer blocks, thereby recovering the model's performance. Firstly, we introduce a new importance detection metric, Macro Influence (MI), which evaluates the long-term impact of each transformer block by quantifying the information loss incurred upon its removal. Next, we propose group-level layer fusion, which leverages the parameters from layers of less important blocks and integrates them into the corresponding layers of neighboring blocks. This fusion process is not a one-time operation but is refined through iterative parameter updates by lightweight group-level fine-tuning. Specifically, the injected parameters are frozen but are weighted with learnable rank decomposition matrices to reduce the computational overhead during fine-tuning. Our approach not only works well for large language models but also for large multimodal models. Experimental results indicate that, even with modest amounts of data, FuseGPT surpasses previous methods in both perplexity and zero-shot task performance. △ Less

Submitted 24 May, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

arXiv:2411.14459 [pdf, other]

Unveiling User Preferences: A Knowledge Graph and LLM-Driven Approach for Conversational Recommendation

Authors: Zhangchi Qiu, Linhao Luo, Shirui Pan, Alan Wee-Chung Liew

Abstract: Conversational Recommender Systems (CRSs) aim to provide personalized recommendations through dynamically capturing user preferences in interactive conversations. Conventional CRSs often extract user preferences as hidden representations, which are criticized for their lack of interpretability. This diminishes the transparency and trustworthiness of the recommendation process. Recent works have ex… ▽ More Conversational Recommender Systems (CRSs) aim to provide personalized recommendations through dynamically capturing user preferences in interactive conversations. Conventional CRSs often extract user preferences as hidden representations, which are criticized for their lack of interpretability. This diminishes the transparency and trustworthiness of the recommendation process. Recent works have explored combining the impressive capabilities of Large Language Models (LLMs) with the domain-specific knowledge of Knowledge Graphs (KGs) to generate human-understandable recommendation explanations. Despite these efforts, the integration of LLMs and KGs for CRSs remains challenging due to the modality gap between unstructured dialogues and structured KGs. Moreover, LLMs pre-trained on large-scale corpora may not be well-suited for analyzing user preferences, which require domain-specific knowledge. In this paper, we propose COMPASS, a plug-and-play framework that synergizes LLMs and KGs to unveil user preferences, enhancing the performance and explainability of existing CRSs. To address integration challenges, COMPASS employs a two-stage training approach: first, it bridges the gap between the structured KG and natural language through an innovative graph entity captioning pre-training mechanism. This enables the LLM to transform KG entities into concise natural language descriptions, allowing them to comprehend domain-specific knowledge. Following, COMPASS optimizes user preference modeling via knowledge-aware instruction fine-tuning, where the LLM learns to reason and summarize user preferences from both dialogue histories and KG-augmented context. This enables COMPASS to perform knowledge-aware reasoning and generate comprehensive and interpretable user preferences that can seamlessly integrate with existing CRS models for improving recommendation performance and explainability. △ Less

Submitted 16 November, 2024; originally announced November 2024.

arXiv:2411.12791 [pdf, other]

Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality Assessment

Authors: Siyi Pan, Baoliang Chen, Danni Huang, Hanwei Zhu, Lingyu Zhu, Xiangjie Sui, Shiqi Wang

Abstract: Despite the impressive performance of large multimodal models (LMMs) in high-level visual tasks, their capacity for image quality assessment (IQA) remains limited. One main reason is that LMMs are primarily trained for high-level tasks (e.g., image captioning), emphasizing unified image semantics extraction under varied quality. Such semantic-aware yet quality-insensitive perception bias inevitabl… ▽ More Despite the impressive performance of large multimodal models (LMMs) in high-level visual tasks, their capacity for image quality assessment (IQA) remains limited. One main reason is that LMMs are primarily trained for high-level tasks (e.g., image captioning), emphasizing unified image semantics extraction under varied quality. Such semantic-aware yet quality-insensitive perception bias inevitably leads to a heavy reliance on image semantics when those LMMs are forced for quality rating. In this paper, instead of retraining or tuning an LMM costly, we propose a training-free debiasing framework, in which the image quality prediction is rectified by mitigating the bias caused by image semantics. Specifically, we first explore several semantic-preserving distortions that can significantly degrade image quality while maintaining identifiable semantics. By applying these specific distortions to the query or test images, we ensure that the degraded images are recognized as poor quality while their semantics remain. During quality inference, both a query image and its corresponding degraded version are fed to the LMM along with a prompt indicating that the query image quality should be inferred under the condition that the degraded one is deemed poor quality.This prior condition effectively aligns the LMM's quality perception, as all degraded images are consistently rated as poor quality, regardless of their semantic difference.Finally, the quality scores of the query image inferred under different prior conditions (degraded versions) are aggregated using a conditional probability model. Extensive experiments on various IQA datasets show that our debiasing framework could consistently enhance the LMM performance and the code will be publicly available. △ Less

Submitted 19 November, 2024; originally announced November 2024.

arXiv:2411.12219 [pdf]

Two-dimensional superconductivity and anomalous vortex dissipation in newly-discovered transition metal dichalcogenide-based superlattices

Authors: Mengzhu Shi, Kaibao Fan, Houpu Li, Senyang Pan, Jiaqiang Cai, Nan Zhang, Hongyu Li, Tao Wu, Jinglei Zhang, Chuanying Xi, Ziji Xiang, Xianhui Chen

Abstract: Properties of layered superconductors can vary drastically when thinned down from bulk to monolayer, owing to the reduced dimensionality and weakened interlayer coupling. In transition metal dichalcogenides (TMDs), the inherent symmetry breaking effect in atomically thin crystals prompts novel states of matter, such as Ising superconductivity with an extraordinary in-plane upper critical field. He… ▽ More Properties of layered superconductors can vary drastically when thinned down from bulk to monolayer, owing to the reduced dimensionality and weakened interlayer coupling. In transition metal dichalcogenides (TMDs), the inherent symmetry breaking effect in atomically thin crystals prompts novel states of matter, such as Ising superconductivity with an extraordinary in-plane upper critical field. Here, we demonstrate that two-dimensional (2D) superconductivity resembling those in atomic layers but with more fascinating behaviours can be realized in the bulk crystals of two new TMD-based superconductors Ba0.75ClTaS2 and Ba0.75ClTaSe2. They comprise an alternating stack of H-type TMD layers and Ba-Cl layers. In both materials, intrinsic 2D superconductivity develops below a Berezinskii-Kosterlitz-Thouless transition. The upper critical field along ab plane exceeds the Pauli limit (Hp); in particular, Ba0.75ClTaSe2 exhibits an extremely high in plane Hc2 (14Hp) and a colossal superconducting anisotropy of 150. Moreover, the temperature-field phase diagram of Ba0.75ClTaSe2 under an in-plane magnetic field contains a large phase regime of vortex dissipation, which can be ascribed to the Josephson vortex motion, signifying an unprecedentedly strong fluctuation effect in TMD-based superconductors. Our results provide a new path towards the establishment of 2D superconductivity and novel exotic quantum phases in bulk crystals of TMD-based superconductors. △ Less

Submitted 18 November, 2024; originally announced November 2024.

Comments: Accepted by JACS

arXiv:2411.09677 [pdf, other]

Effect of invisible neutrino decay on neutrino oscillation at long baselines

Authors: Animesh Chatterjee, Srubabati Goswami, Supriya Pan, Paras Thacker

Abstract: In this article, we study the effect of invisible neutrino decay of the third neutrino state for accelerator neutrino experiments at two different baselines, 1300 km with a liquid argon time projection chamber (LArTPC) detector (similar to DUNE) and 2588 km with a water Cherenkov detector (similar to P2O). For such baselines, the matter effect starts to become important. Our aim is to ascertain th… ▽ More In this article, we study the effect of invisible neutrino decay of the third neutrino state for accelerator neutrino experiments at two different baselines, 1300 km with a liquid argon time projection chamber (LArTPC) detector (similar to DUNE) and 2588 km with a water Cherenkov detector (similar to P2O). For such baselines, the matter effect starts to become important. Our aim is to ascertain the sensitivity to mass hierarchy and octant of $θ_{23}$ in these two experiments in the presence of a decaying neutrino state. We compare and contrast the results of the two experimental setups. We find that, in general, hierarchy sensitivity decreases in the presence of decay. However, if we consider decay only in the opposite hierarchy (test scenario), in the 2588 km setup, the hierarchy sensitivity with the true hierarchy as IH is larger than the no decay case. We also study the dependence of hierarchy sensitivity with true $θ_{23}$. We find that the dominant muon background in P2O plays an important role in how the hierarchy sensitivity depends on $θ_{23}$. The octant sensitivity for both setups increases in the presence of decay except for the LArTPC setup in case true $θ_{23}=49^\circ$. To understand the octant sensitivity results in the two setups, we check the synergy in sensitivity between electron and muon channels as a function of test $θ_{23}$. We also study the degeneracies in the test $θ_{23}-δ_{CP}$ plane and find that combined analysis of the two setups removes all the degeneracies in the test $θ_{23}-δ_{CP}$ plane at $5σ$ significance. △ Less

Submitted 25 November, 2024; v1 submitted 14 November, 2024; originally announced November 2024.

Comments: v1: 24 pages, 12 captioned figures, 3 tables

arXiv:2411.07535 [pdf, other]

Double-Signed Fragmented DNSSEC for Countering Quantum Threat

Authors: Syed W. Shah. Lei Pan, Din Duc Nha Nguyen, Robin Doss, Warren Armstrong, Praveen Gauravaram

Abstract: DNSSEC, a DNS security extension, is essential to accurately translating domain names to IP addresses. Digital signatures provide the foundation for this reliable translation, however, the evolution of 'Quantum Computers' has made traditional digital signatures vulnerable. In light of this, NIST has recently selected potential post-quantum digital signatures that can operate on conventional comput… ▽ More DNSSEC, a DNS security extension, is essential to accurately translating domain names to IP addresses. Digital signatures provide the foundation for this reliable translation, however, the evolution of 'Quantum Computers' has made traditional digital signatures vulnerable. In light of this, NIST has recently selected potential post-quantum digital signatures that can operate on conventional computers and resist attacks made with Quantum Computers. Since these post-quantum digital signatures are still in their early stages of development, replacing pre-quantum digital signature schemes in DNSSEC with post-quantum candidates is risky until the post-quantum candidates have undergone a thorough security analysis. Given this, herein, we investigate the viability of employing 'Double-Signatures' in DNSSEC, combining a post-quantum digital signature and a classic one. The rationale is that double-signatures will offer protection against quantum threats on conventional signature schemes as well as unknown non-quantum attacks on post-quantum signature schemes, hence even if one fails the other provides security guarantees. However, the inclusion of two signatures in the DNSSEC response message doesn't bode well with the maximum allowed size of DNSSEC responses (i.e., 1232B, a limitation enforced by MTU of physical links). To counter this issue, we leverage a way to do application-layer fragmentation of DNSSEC responses with two signatures. We implement our solution on top of OQS-BIND and through experiments show that the addition of two signatures in DNSSEC and application-layer fragmentation of all relevant resource records and their reassembly does not have any substantial impact on the efficiency of the resolution process and thus is suitable for the interim period at least until the quantum computers are fully realized. △ Less

Submitted 11 November, 2024; originally announced November 2024.

arXiv:2411.06190 [pdf, other]

Gravitational reheating formulas and bounds in oscillating backgrounds II: Constraints on the spectral index and gravitational dark matter production

Authors: Jaume de Haro, Supriya Pan

Abstract: The reheating temperature plays a crucial role in the early universe's evolution, marking the transition from inflation to the radiation-dominated era. It directly impacts the number of $e$-folds and, consequently, the observable parameters of inflation, such as the spectral index of scalar perturbations. By establishing a relationship between the gravitational reheating temperature and the spectr… ▽ More The reheating temperature plays a crucial role in the early universe's evolution, marking the transition from inflation to the radiation-dominated era. It directly impacts the number of $e$-folds and, consequently, the observable parameters of inflation, such as the spectral index of scalar perturbations. By establishing a relationship between the gravitational reheating temperature and the spectral index, we can derive constraints on inflationary models. Specifically, the range of viable reheating temperatures imposes bounds on the spectral index, which can then be compared with observational data, such as those from the Planck satellite, to test the consistency of various models with cosmological observations. Additionally, in the context of dark matter production, we demonstrate that gravitational reheating provides a viable mechanism when there is a relationship between the mass of the dark matter particles and the mass of the particles responsible for reheating. This connection offers a pathway to link dark matter genesis with inflationary and reheating parameters, allowing for a unified perspective on early universe dynamics. △ Less

Submitted 31 January, 2025; v1 submitted 9 November, 2024; originally announced November 2024.

Comments: 15 pages (including references), 2 figures, 6 tables; version accepted for publication by Phys. Rev. D

arXiv:2411.06174 [pdf, other]

State Chrono Representation for Enhancing Generalization in Reinforcement Learning

Authors: Jianda Chen, Wen Zheng Terence Ng, Zichen Chen, Sinno Jialin Pan, Tianwei Zhang

Abstract: In reinforcement learning with image-based inputs, it is crucial to establish a robust and generalizable state representation. Recent advancements in metric learning, such as deep bisimulation metric approaches, have shown promising results in learning structured low-dimensional representation space from pixel observations, where the distance between states is measured based on task-relevant featu… ▽ More In reinforcement learning with image-based inputs, it is crucial to establish a robust and generalizable state representation. Recent advancements in metric learning, such as deep bisimulation metric approaches, have shown promising results in learning structured low-dimensional representation space from pixel observations, where the distance between states is measured based on task-relevant features. However, these approaches face challenges in demanding generalization tasks and scenarios with non-informative rewards. This is because they fail to capture sufficient long-term information in the learned representations. To address these challenges, we propose a novel State Chrono Representation (SCR) approach. SCR augments state metric-based representations by incorporating extensive temporal information into the update step of bisimulation metric learning. It learns state distances within a temporal framework that considers both future dynamics and cumulative rewards over current and long-term future states. Our learning strategy effectively incorporates future behavioral information into the representation space without introducing a significant number of additional parameters for modeling dynamics. Extensive experiments conducted in DeepMind Control and Meta-World environments demonstrate that SCR achieves better performance comparing to other recent metric-based methods in demanding generalization tasks. The codes of SCR are available in https://github.com/jianda-chen/SCR. △ Less

Submitted 9 November, 2024; originally announced November 2024.

Journal ref: 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

arXiv:2411.04486 [pdf]

Multi-Wavelength Selective Thermal Emission Enabled by Dual-Layer Localized Surface Plasmon Polaritons

Authors: Shuang Pan, Shaoteng Wu, Huixue Ren, Jiarong Zhao, Yuanhao Zhu, Sailei Li, Li He, Jun-Wei Luo

Abstract: Thermal emission is a ubiquitous electromagnetic wave with an extreme broad spectrum in nature, and controlling thermal emission can be used to develop low-cost and convenient infrared light sources with wavelength tunable in a wide range that is currently difficult to other sources. Conventional metasurfaces are commonly used to control light but lack the flexibility to achieve complex emission s… ▽ More Thermal emission is a ubiquitous electromagnetic wave with an extreme broad spectrum in nature, and controlling thermal emission can be used to develop low-cost and convenient infrared light sources with wavelength tunable in a wide range that is currently difficult to other sources. Conventional metasurfaces are commonly used to control light but lack the flexibility to achieve complex emission spectral profiles and dynamic tuning. Here, we introduce a novel dual-layer metasurface structure with two completely independent layers to achieve a multi-peak thermal emission within the 5-8 μm wavelength range. Simulations and experiments show that this two-layer structure can achieve arbitrary spectral shapes without interfering with multiple resonant modes. This unique configuration presents a promising platform for further exploration in thermal emission engineering, enabling spectral control and dynamic tuning. △ Less

Submitted 7 November, 2024; originally announced November 2024.

Comments: 13 pages, 8 figures

arXiv:2411.03025 [pdf, other]

DA-MoE: Addressing Depth-Sensitivity in Graph-Level Analysis through Mixture of Experts

Authors: Zelin Yao, Chuang Liu, Xianke Meng, Yibing Zhan, Jia Wu, Shirui Pan, Wenbin Hu

Abstract: Graph neural networks (GNNs) are gaining popularity for processing graph-structured data. In real-world scenarios, graph data within the same dataset can vary significantly in scale. This variability leads to depth-sensitivity, where the optimal depth of GNN layers depends on the scale of the graph data. Empirically, fewer layers are sufficient for message passing in smaller graphs, while larger g… ▽ More Graph neural networks (GNNs) are gaining popularity for processing graph-structured data. In real-world scenarios, graph data within the same dataset can vary significantly in scale. This variability leads to depth-sensitivity, where the optimal depth of GNN layers depends on the scale of the graph data. Empirically, fewer layers are sufficient for message passing in smaller graphs, while larger graphs typically require deeper networks to capture long-range dependencies and global features. However, existing methods generally use a fixed number of GNN layers to generate representations for all graphs, overlooking the depth-sensitivity issue in graph structure data. To address this challenge, we propose the depth adaptive mixture of expert (DA-MoE) method, which incorporates two main improvements to GNN backbone: \textbf{1)} DA-MoE employs different GNN layers, each considered an expert with its own parameters. Such a design allows the model to flexibly aggregate information at different scales, effectively addressing the depth-sensitivity issue in graph data. \textbf{2)} DA-MoE utilizes GNN to capture the structural information instead of the linear projections in the gating network. Thus, the gating network enables the model to capture complex patterns and dependencies within the data. By leveraging these improvements, each expert in DA-MoE specifically learns distinct graph patterns at different scales. Furthermore, comprehensive experiments on the TU dataset and open graph benchmark (OGB) have shown that DA-MoE consistently surpasses existing baselines on various tasks, including graph, node, and link-level analyses. The code are available at \url{https://github.com/Celin-Yao/DA-MoE}. △ Less

Submitted 5 November, 2024; originally announced November 2024.

Comments: 8pages

arXiv:2411.01671 [pdf, other]

Gravitational reheating formulas and bounds in oscillating backgrounds

Authors: Jaume de Haro, Llibert Aresté Saló, Supriya Pan

Abstract: In this article we calculate the reheating temperature in the cosmological scenarios where heavy scalar particles are gravitationally produced, due to a conformally coupled interaction between a massive scalar quantum field and the Ricci scalar, during the oscillations of the inflaton field. We explore two distinct cases, namely the one in which these particles decay during the domination of the i… ▽ More In this article we calculate the reheating temperature in the cosmological scenarios where heavy scalar particles are gravitationally produced, due to a conformally coupled interaction between a massive scalar quantum field and the Ricci scalar, during the oscillations of the inflaton field. We explore two distinct cases, namely the one in which these particles decay during the domination of the inflaton's energy density and the other one where the decay occurs after this phase. For each scenario, we have derived formulas to calculate the reheating temperatures based on the energy density of the produced particles and their decay rate. We establish bounds for the maximum reheating temperature, defined as the temperature reached by the universe when the decay of gravitationally produced particles concludes at the onset of the radiation-dominated epoch. Finally, we use the Born approximation to find analytic formulas for the reheating temperature. △ Less

Submitted 3 November, 2024; originally announced November 2024.

Comments: 15 pages including bibliography and 1 figure; Accepted for publication in Phys. Rev. D

arXiv:2411.01598 [pdf, other]

doi 10.3390/sym16111434

Reheating constraints and the $H_0$ tension in Quintessential Inflation

Authors: Jaume de Haro, Supriya Pan

Abstract: In this work, we focus on two important aspects of modern cosmology: reheating and Hubble constant tension within the framework of a unified model, namely, quintessential inflation connecting the early inflationary era and late-time cosmic acceleration. In the context of reheating, we use instant preheating and gravitational reheating, two viable reheating mechanisms when the evolution of the univ… ▽ More In this work, we focus on two important aspects of modern cosmology: reheating and Hubble constant tension within the framework of a unified model, namely, quintessential inflation connecting the early inflationary era and late-time cosmic acceleration. In the context of reheating, we use instant preheating and gravitational reheating, two viable reheating mechanisms when the evolution of the universe is not affected by an oscillating regime. After obtaining the reheating temperature, we analyze the number of $e$-folds and establish its relationship with the reheating temperature. This allows us to connect, for different quintessential inflation models, the reheating temperature with the spectral index of scalar perturbations, thereby enabling us to constrain its values. In the second part of this article, we explore various alternatives to address the $H_0$ tension, a discrepancy which indicates a possible revision of the $Λ$CDM model. Initially, we establish that quintessential inflation alone cannot mitigate the Hubble tension by solely deviating from the concordance model at low redshifts. The introduction of a phantom fluid, capable of increasing the Hubble rate at the present time, becomes a crucial element in alleviating the Hubble tension, resulting in a deviation from the $Λ$CDM model only at low redshifts. On a different note, by utilizing quintessential inflation as a source of early dark energy, thereby diminishing the physical size of the sound horizon close to the baryon-photon decoupling redshift, we observe a reduction in the Hubble tension. This alternative avenue, which has the same effect of a cosmological constant changing its scale close to the recombination, sheds light on the nuanced interplay between the quintessential inflation and the Hubble tension, offering a distinct perspective on addressing this cosmological challenge. △ Less

Submitted 3 November, 2024; originally announced November 2024.

Comments: 13 pages including bibliography, 5 figures; Invited article from Symmetry; Abstract has been shortened, but the full abstract is kept in the journal version; published version

Journal ref: Symmetry 2024, 16(11), 1434

arXiv:2411.01237 [pdf, ps, other]

Sparse Linear Regression: Sequential Convex Relaxation, Robust Restricted Null Space Property, and Variable Selection

Authors: Shujun Bi, Yonghua Yang, Shaohua Pan

Abstract: For high dimensional sparse linear regression problems, we propose a sequential convex relaxation algorithm (iSCRA-TL1) by solving inexactly a sequence of truncated $\ell_1$-norm regularized minimization problems, in which the working index sets are constructed iteratively with an adaptive strategy. We employ the robust restricted null space property and sequential restricted null space property (… ▽ More For high dimensional sparse linear regression problems, we propose a sequential convex relaxation algorithm (iSCRA-TL1) by solving inexactly a sequence of truncated $\ell_1$-norm regularized minimization problems, in which the working index sets are constructed iteratively with an adaptive strategy. We employ the robust restricted null space property and sequential restricted null space property (rRNSP and rSRNSP) to provide the theoretical certificates of iSCRA-TL1. Specifically, under a mild rRNSP or rSRNSP, iSCRA-TL1 is shown to identify the support of the true $r$-sparse vector by solving at most $r$ truncated $\ell_1$-norm regularized problems, and the $\ell_1$-norm error bound of its iterates from the oracle solution is also established. As a consequence, an oracle estimator of high-dimensional linear regression problems can be achieved by solving at most $r\!+\!1$ truncated $\ell_1$-norm regularized problems. To the best of our knowledge, this is the first sequential convex relaxation algorithm to produce an oracle estimator under a weaker NSP condition within a specific number of steps, provided that the Lasso estimator lacks high quality, say, the supports of its first $r$ largest (in modulus) entries do not coincide with those of the true vector. △ Less

Submitted 2 November, 2024; originally announced November 2024.

Comments: 38 pages, 4 figures

arXiv:2411.00437 [pdf]

E2E-AFG: An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation

Authors: Yun Jiang, Zilong Xie, Wei Zhang, Yun Fang, Shuai Pan

Abstract: Retrieval-augmented generation methods often neglect the quality of content retrieved from external knowledge bases, resulting in irrelevant information or potential misinformation that negatively affects the generation results of large language models. In this paper, we propose an end-to-end model with adaptive filtering for retrieval-augmented generation (E2E-AFG), which integrates answer existe… ▽ More Retrieval-augmented generation methods often neglect the quality of content retrieved from external knowledge bases, resulting in irrelevant information or potential misinformation that negatively affects the generation results of large language models. In this paper, we propose an end-to-end model with adaptive filtering for retrieval-augmented generation (E2E-AFG), which integrates answer existence judgment and text generation into a single end-to-end framework. This enables the model to focus more effectively on relevant content while reducing the influence of irrelevant information and generating accurate answers. We evaluate E2E-AFG on six representative knowledge-intensive language datasets, and the results show that it consistently outperforms baseline models across all tasks, demonstrating the effectiveness and robustness of the proposed approach. △ Less

Submitted 8 May, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

Comments: 13 pages, 3 figures, 5 tables

arXiv:2411.00332 [pdf]

In-situ Self-optimization of Quantum Dot Emission for Lasers by Machine-Learning Assisted Epitaxy

Authors: Chao Shen, Wenkang Zhan, Shujie Pan, Hongyue Hao, Ning Zhuo, Kaiyao Xin, Hui Cong, Chi Xu, Bo Xu, Tien Khee Ng, Siming Chen, Chunlai Xue, Fengqi Liu, Zhanguo Wang, Chao Zhao

Abstract: Traditional methods for optimizing light source emissions rely on a time-consuming trial-and-error approach. While in-situ optimization of light source gain media emission during growth is ideal, it has yet to be realized. In this work, we integrate in-situ reflection high-energy electron diffraction (RHEED) with machine learning (ML) to correlate the surface reconstruction with the photoluminesce… ▽ More Traditional methods for optimizing light source emissions rely on a time-consuming trial-and-error approach. While in-situ optimization of light source gain media emission during growth is ideal, it has yet to be realized. In this work, we integrate in-situ reflection high-energy electron diffraction (RHEED) with machine learning (ML) to correlate the surface reconstruction with the photoluminescence (PL) of InAs/GaAs quantum dots (QDs), which serve as the active region of lasers. A lightweight ResNet-GLAM model is employed for the real-time processing of RHEED data as input, enabling effective identification of optical performance. This approach guides the dynamic optimization of growth parameters, allowing real-time feedback control to adjust the QDs emission for lasers. We successfully optimized InAs QDs on GaAs substrates, with a 3.2-fold increase in PL intensity and a reduction in full width at half maximum (FWHM) from 36.69 meV to 28.17 meV under initially suboptimal growth conditions. Our automated, in-situ self-optimized lasers with 5-layer InAs QDs achieved electrically pumped continuous-wave operation at 1240 nm with a low threshold current of 150 A/cm2 at room temperature, an excellent performance comparable to samples grown through traditional manual multi-parameter optimization methods. These results mark a significant step toward intelligent, low-cost, and reproductive light emitters production. △ Less

Submitted 31 October, 2024; originally announced November 2024.

Comments: 5 figures

arXiv:2410.23582 [pdf, other]

Patient-Specific CBCT Synthesis for Real-time Tumor Tracking in Surface-guided Radiotherapy

Authors: Shaoyan Pan, Vanessa Su, Junbo Peng, Junyuan Li, Yuan Gao, Chih-Wei Chang, Tonghe Wang, Zhen Tian, Xiaofeng Yang

Abstract: We present a new imaging system to support real-time tumor tracking for surface-guided radiotherapy (SGRT). SGRT uses optical surface imaging (OSI) to acquire real-time surface topography images of the patient on the treatment couch. However, OSI cannot visualize internal anatomy. This study proposes an Advanced Surface Imaging (A-SI) framework to address this issue. In the proposed A-SI framework… ▽ More We present a new imaging system to support real-time tumor tracking for surface-guided radiotherapy (SGRT). SGRT uses optical surface imaging (OSI) to acquire real-time surface topography images of the patient on the treatment couch. However, OSI cannot visualize internal anatomy. This study proposes an Advanced Surface Imaging (A-SI) framework to address this issue. In the proposed A-SI framework, a high-speed surface imaging camera consistently captures surface images during radiation delivery, and a CBCT imager captures single-angle X-ray projections at low frequency. The A-SI then utilizes a generative model to generate real-time volumetric images with full anatomy, referred to as Optical Surface-Derived cone beam computed tomography (OSD-CBCT), based on the real-time high-frequent surface images and the low-frequency collected single-angle X-ray projections. The generated OSD-CBCT can provide accurate tumor motion for precise radiation delivery. The A-SI framework uses a patient-specific generative model: physics-integrated consistency-refinement denoising diffusion probabilistic model (PC-DDPM). This model leverages patient-specific anatomical structures and respiratory motion patterns derived from four-dimensional CT (4DCT) during treatment planning. It then employs a geometric transformation module (GTM) to extract volumetric anatomy information from the single-angle X-ray projection. A simulation study with 22 lung cancer patients evaluated the A-SI framework supported by PC-DDPM. The results showed that the framework produced real-time OSD-CBCT with high reconstruction fidelity and precise tumor localization. This study demonstrates the potential of A-SI to enable real-time tumor tracking with minimal imaging dose, advancing SGRT for motion-associated cancers and interventional procedures. △ Less

Submitted 31 October, 2024; v1 submitted 30 October, 2024; originally announced October 2024.

arXiv:2410.22007 [pdf, ps, other]

Survey of Load-Altering Attacks Against Power Grids: Attack Impact, Detection and Mitigation

Authors: Sajjad Maleki, Shijie Pan, Subhash Lakshminarayana, Charalambos Konstantinou

Abstract: The growing penetration of IoT devices in power grids despite its benefits, raises cybersecurity concerns. In particular, load-altering attacks (LAAs) targeting high-wattage IoT-controllable load devices pose serious risks to grid stability and disrupt electricity markets. This paper provides a comprehensive review of LAAs, highlighting the threat model, analyzing their impact on transmission and… ▽ More The growing penetration of IoT devices in power grids despite its benefits, raises cybersecurity concerns. In particular, load-altering attacks (LAAs) targeting high-wattage IoT-controllable load devices pose serious risks to grid stability and disrupt electricity markets. This paper provides a comprehensive review of LAAs, highlighting the threat model, analyzing their impact on transmission and distribution networks, and the electricity market dynamics. We also review the detection and localization schemes for LAAs that employ either model-based or data-driven approaches, with some hybrid methods combining the strengths of both. Additionally, mitigation techniques are examined, focusing on both preventive measures, designed to thwart attack execution, and reactive methods, which aim to optimize responses to ongoing attacks. We look into the application of each study and highlight potential streams for future research. △ Less

Submitted 16 April, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

arXiv:2410.20321 [pdf, other]

Effective Instruction Parsing Plugin for Complex Logical Query Answering on Knowledge Graphs

Authors: Xingrui Zhuo, Jiapu Wang, Gongqing Wu, Shirui Pan, Xindong Wu

Abstract: Knowledge Graph Query Embedding (KGQE) aims to embed First-Order Logic (FOL) queries in a low-dimensional KG space for complex reasoning over incomplete KGs. To enhance the generalization of KGQE models, recent studies integrate various external information (such as entity types and relation context) to better capture the logical semantics of FOL queries. The whole process is commonly referred to… ▽ More Knowledge Graph Query Embedding (KGQE) aims to embed First-Order Logic (FOL) queries in a low-dimensional KG space for complex reasoning over incomplete KGs. To enhance the generalization of KGQE models, recent studies integrate various external information (such as entity types and relation context) to better capture the logical semantics of FOL queries. The whole process is commonly referred to as Query Pattern Learning (QPL). However, current QPL methods typically suffer from the pattern-entity alignment bias problem, leading to the learned defective query patterns limiting KGQE models' performance. To address this problem, we propose an effective Query Instruction Parsing Plugin (QIPP) that leverages the context awareness of Pre-trained Language Models (PLMs) to capture latent query patterns from code-like query instructions. Unlike the external information introduced by previous QPL methods, we first propose code-like instructions to express FOL queries in an alternative format. This format utilizes textual variables and nested tuples to convey the logical semantics within FOL queries, serving as raw materials for a PLM-based instruction encoder to obtain complete query patterns. Building on this, we design a query-guided instruction decoder to adapt query patterns to KGQE models. To further enhance QIPP's effectiveness across various KGQE models, we propose a query pattern injection mechanism based on compressed optimization boundaries and an adaptive normalization component, allowing KGQE models to utilize query patterns more efficiently. Extensive experiments demonstrate that our plug-and-play method improves the performance of eight basic KGQE models and outperforms two state-of-the-art QPL methods. △ Less

Submitted 26 October, 2024; originally announced October 2024.

arXiv:2410.19878 [pdf, other]

Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

Authors: Luping Wang, Sheng Chen, Linnan Jiang, Shu Pan, Runze Cai, Sen Yang, Fei Yang

Abstract: The large models, as predicted by scaling raw forecasts, have made groundbreaking progress in many fields, particularly in natural language generation tasks, where they have approached or even surpassed human levels. However, the unprecedented scale of their parameters brings significant computational and storage costs. These large models require substantial computational resources and GPU memory… ▽ More The large models, as predicted by scaling raw forecasts, have made groundbreaking progress in many fields, particularly in natural language generation tasks, where they have approached or even surpassed human levels. However, the unprecedented scale of their parameters brings significant computational and storage costs. These large models require substantial computational resources and GPU memory to operate. When adapting large models to specific downstream tasks, their massive parameter scale poses a significant challenge in fine-tuning on hardware platforms with limited computational power and GPU memory. To address this issue, Parameter-Efficient Fine-Tuning (PEFT) offers a practical solution by efficiently adjusting the parameters of large pre-trained models to suit various downstream tasks. Specifically, PEFT adjusts the parameters of pre-trained large models to adapt to specific tasks or domains, minimizing the introduction of additional parameters and the computational resources required. This review mainly introduces the preliminary knowledge of PEFT, the core ideas and principles of various PEFT algorithms, the applications of PEFT, and potential future research directions. By reading this review, we believe that interested parties can quickly grasp the PEFT methodology, thereby accelerating its development and innovation. △ Less

Submitted 24 April, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

arXiv:2410.18148 [pdf, other]

Beyond the Kolmogorov Barrier: A Learnable Weighted Hybrid Autoencoder for Model Order Reduction

Authors: Nithin Somasekharan, Shaowu Pan

Abstract: Representation learning for high-dimensional, complex physical systems aims to identify a low-dimensional intrinsic latent space, which is crucial for reduced-order modeling and modal analysis. To overcome the well-known Kolmogorov barrier, deep autoencoders (AEs) have been introduced in recent years, but they often suffer from poor convergence behavior as the rank of the latent space increases. T… ▽ More Representation learning for high-dimensional, complex physical systems aims to identify a low-dimensional intrinsic latent space, which is crucial for reduced-order modeling and modal analysis. To overcome the well-known Kolmogorov barrier, deep autoencoders (AEs) have been introduced in recent years, but they often suffer from poor convergence behavior as the rank of the latent space increases. To address this issue, we propose the learnable weighted hybrid autoencoder, a hybrid approach that combines the strengths of singular value decomposition (SVD) with deep autoencoders through a learnable weighted framework. We find that the introduction of learnable weighting parameters is essential -- without them, the resulting model would either collapse into a standard POD or fail to exhibit the desired convergence behavior. Interestingly, we empirically find that our trained model has a sharpness thousands of times smaller compared to other models. Our experiments on classical chaotic PDE systems, including the 1D Kuramoto-Sivashinsky and forced isotropic turbulence datasets, demonstrate that our approach significantly improves generalization performance compared to several competing methods. Additionally, when combining with time series modeling techniques (e.g., Koopman operator, LSTM), the proposed technique offers significant improvements for surrogate modeling of high-dimensional multi-scale PDE systems. △ Less

Submitted 28 February, 2025; v1 submitted 22 October, 2024; originally announced October 2024.

Comments: 31 pages

MSC Class: 68T07; 76F99

arXiv:2410.16601 [pdf, other]

doi 10.1016/j.ast.2024.109915

An implicit coupling framework for numerical simulations between hypersonic nonequilibrium flows and charring material thermal response in the presence of ablation

Authors: Jingchao Zhang, Chunsheng Nie, Jinsheng Cai, Shucheng Pan

Abstract: An implicit coupling framework between hypersonic nonequilibrium flows and material thermal response is proposed for the numerical simulation of ablative thermal protection materials during its flight trajectory. Charring ablative materials, when subjected to aerodynamic heating from hypersonic flows, undergo complex processes such as ablation and pyrolysis, involving heterogeneous and homogeneous… ▽ More An implicit coupling framework between hypersonic nonequilibrium flows and material thermal response is proposed for the numerical simulation of ablative thermal protection materials during its flight trajectory. Charring ablative materials, when subjected to aerodynamic heating from hypersonic flows, undergo complex processes such as ablation and pyrolysis, involving heterogeneous and homogeneous chemical reactions. These multi-physical phenomena are simulated by a multicomponent material thermal response (MTR) solver that takes into account the complexity of component of pyrolysis gases. The species concentrations are calculated to improve the accuracy of transport and thermophysical parameters of pyrolysis gases. The MTR solver implements implicit time integration on finite difference discretization form to achieve higher efficiency. The numerical solutions of hypersonic flows and material thermal response are coupled through a gas-surface interaction interface based on surface mass and energy balance on the ablating surface. The coupled simulation employs the dual time-step technique, which introduces pseudo time step to improve temporal accuracy. The explicit coupling mechanism updates the interfacial quantities at physical time steps, which achieves higher computational efficiency, but introduces time discretization errors and numerical oscillations of interfacial quantities. In contrast, the implicit coupling mechanism updates the interfacial quantities at pseudo time steps, which reduces the temporal discretization error and suppresses numerical oscillations, but is less efficient. In addition, a simplified ablation boundary based on steady-state ablation assumption or radiation-equilibrium assumption is proposed to approximate solid heat conduction without coupling the MTR solution, providing quasi-steady flow solutions in the presence of ablation. △ Less

Submitted 21 October, 2024; originally announced October 2024.

arXiv:2410.15359 [pdf, other]

A Survey of Hallucination in Large Visual Language Models

Authors: Wei Lan, Wenyi Chen, Qingfeng Chen, Shirui Pan, Huiyu Zhou, Yi Pan

Abstract: The Large Visual Language Models (LVLMs) enhances user interaction and enriches user experience by integrating visual modality on the basis of the Large Language Models (LLMs). It has demonstrated their powerful information processing and generation capabilities. However, the existence of hallucinations has limited the potential and practical effectiveness of LVLM in various fields. Although lots… ▽ More The Large Visual Language Models (LVLMs) enhances user interaction and enriches user experience by integrating visual modality on the basis of the Large Language Models (LLMs). It has demonstrated their powerful information processing and generation capabilities. However, the existence of hallucinations has limited the potential and practical effectiveness of LVLM in various fields. Although lots of work has been devoted to the issue of hallucination mitigation and correction, there are few reviews to summary this issue. In this survey, we first introduce the background of LVLMs and hallucinations. Then, the structure of LVLMs and main causes of hallucination generation are introduced. Further, we summary recent works on hallucination correction and mitigation. In addition, the available hallucination evaluation benchmarks for LVLMs are presented from judgmental and generative perspectives. Finally, we suggest some future research directions to enhance the dependability and utility of LVLMs. △ Less

Submitted 20 October, 2024; originally announced October 2024.

arXiv:2410.13080 [pdf, other]

Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models

Authors: Linhao Luo, Zicheng Zhao, Gholamreza Haffari, Yuan-Fang Li, Chen Gong, Shirui Pan

Abstract: Large language models (LLMs) have demonstrated impressive reasoning abilities, but they still struggle with faithful reasoning due to knowledge gaps and hallucinations. To address these issues, knowledge graphs (KGs) have been utilized to enhance LLM reasoning through their structured knowledge. However, existing KG-enhanced methods, either retrieval-based or agent-based, encounter difficulties in… ▽ More Large language models (LLMs) have demonstrated impressive reasoning abilities, but they still struggle with faithful reasoning due to knowledge gaps and hallucinations. To address these issues, knowledge graphs (KGs) have been utilized to enhance LLM reasoning through their structured knowledge. However, existing KG-enhanced methods, either retrieval-based or agent-based, encounter difficulties in accurately retrieving knowledge and efficiently traversing KGs at scale. In this work, we introduce graph-constrained reasoning (GCR), a novel framework that bridges structured knowledge in KGs with unstructured reasoning in LLMs. To eliminate hallucinations, GCR ensures faithful KG-grounded reasoning by integrating KG structure into the LLM decoding process through KG-Trie, a trie-based index that encodes KG reasoning paths. KG-Trie constrains the decoding process, allowing LLMs to directly reason on graphs and generate faithful reasoning paths grounded in KGs. Additionally, GCR leverages a lightweight KG-specialized LLM for graph-constrained reasoning alongside a powerful general LLM for inductive reasoning over multiple reasoning paths, resulting in accurate reasoning with zero reasoning hallucination. Extensive experiments on several KGQA benchmarks demonstrate that GCR achieves state-of-the-art performance and exhibits strong zero-shot generalizability to unseen KGs without additional training. △ Less

Submitted 28 May, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

Comments: Accepted by ICML 2025

arXiv:2410.12360 [pdf, other]

Towards Neural Scaling Laws for Time Series Foundation Models

Authors: Qingren Yao, Chao-Han Huck Yang, Renhe Jiang, Yuxuan Liang, Ming Jin, Shirui Pan

Abstract: Scaling laws offer valuable insights into the design of time series foundation models (TSFMs). However, previous research has largely focused on the scaling laws of TSFMs for in-distribution (ID) data, leaving their out-of-distribution (OOD) scaling behavior and the influence of model architectures less explored. In this work, we examine two common TSFM architectures, encoder-only and decoder-only… ▽ More Scaling laws offer valuable insights into the design of time series foundation models (TSFMs). However, previous research has largely focused on the scaling laws of TSFMs for in-distribution (ID) data, leaving their out-of-distribution (OOD) scaling behavior and the influence of model architectures less explored. In this work, we examine two common TSFM architectures, encoder-only and decoder-only Transformers, and investigate their scaling behavior on both ID and OOD data. These models are trained and evaluated across varying parameter counts, compute budgets, and dataset sizes. Our experiments reveal that the log-likelihood loss of TSFMs exhibits similar scaling behavior in both OOD and ID settings. We further compare the scaling properties across different architectures, incorporating two state-of-the-art TSFMs as case studies, showing that model architecture plays a significant role in scaling. The encoder-only Transformers demonstrate better scalability than the decoder-only Transformers, while the architectural enhancements in the two advanced TSFMs primarily improve ID performance but reduce OOD scalability. While scaling up TSFMs is expected to drive performance breakthroughs, the lack of a comprehensive understanding of TSFM scaling laws has hindered the development of a robust framework to guide model scaling. We fill this gap in this work by synthesizing our findings and providing practical guidelines for designing and scaling larger TSFMs with enhanced model capabilities. △ Less

Submitted 18 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

Comments: Accepted by the 13th International Conference on Learning Representations (ICLR 2025)

arXiv:2410.12252 [pdf]

doi 10.1016/j.matt.2024.09.018

Large Enhancement of Properties in Strained Lead-free Multiferroic Solid Solutions with Strong Deviation from Vegard's Law

Authors: Tao Wang, Mingjie Zou, Dehe Zhang, Yu-Chieh Ku, Yawen Zheng, Shen Pan, Zhongqi Ren, Zedong Xu, Haoliang Huang, Wei Luo, Yunlong Tang, Lang Chen, Cheng-En Liu, Chun-Fu Chang, Sujit Das, Laurent Bellaiche, Yurong Yang, Xiuliang Ma, Chang-Yang Kuo, Xingjun Liu, Zuhuang Chen

Abstract: Efforts to combine the advantages of multiple systems to enhance functionlities through solid solution design present a great challenge due to the constraint imposed by the classical Vegard law. Here, we successfully navigate this trade off by leveraging the synergistic effect of chemical doping and strain engineering in solid solution system of BiFeO3 BaTiO3. Unlike bulks, a significant deviation… ▽ More Efforts to combine the advantages of multiple systems to enhance functionlities through solid solution design present a great challenge due to the constraint imposed by the classical Vegard law. Here, we successfully navigate this trade off by leveraging the synergistic effect of chemical doping and strain engineering in solid solution system of BiFeO3 BaTiO3. Unlike bulks, a significant deviation from the Vegard law accompanying with enhanced multiferroism is observed in the strained solid solution epitaxial films, where we achieve a pronounced tetragonality, enhanced saturated magnetization, substantial polarization, high ferroelectric Curie temperature, all while maintaining impressively low leakage current. These characteristics surpass the properties of their parent BiFeO3 and BaTiO3 films. Moreover, the superior ferroelectricity has never been reported in corresponding bulks. These findings underscore the potential of strained BiFeO3 BaTiO3 films as lead-free, room-temperature multiferroics. △ Less

Submitted 16 October, 2024; originally announced October 2024.

Comments: 19pages, 5 figures

Journal ref: Matter 8, 1-11, 2025

arXiv:2410.12232 [pdf, other]

doi 10.1109/ICRA57147.2024.10610641

Improving the Generalization of Unseen Crowd Behaviors for Reinforcement Learning based Local Motion Planners

Authors: Wen Zheng Terence Ng, Jianda Chen, Sinno Jialin Pan, Tianwei Zhang

Abstract: Deploying a safe mobile robot policy in scenarios with human pedestrians is challenging due to their unpredictable movements. Current Reinforcement Learning-based motion planners rely on a single policy to simulate pedestrian movements and could suffer from the over-fitting issue. Alternatively, framing the collision avoidance problem as a multi-agent framework, where agents generate dynamic movem… ▽ More Deploying a safe mobile robot policy in scenarios with human pedestrians is challenging due to their unpredictable movements. Current Reinforcement Learning-based motion planners rely on a single policy to simulate pedestrian movements and could suffer from the over-fitting issue. Alternatively, framing the collision avoidance problem as a multi-agent framework, where agents generate dynamic movements while learning to reach their goals, can lead to conflicts with human pedestrians due to their homogeneity. To tackle this problem, we introduce an efficient method that enhances agent diversity within a single policy by maximizing an information-theoretic objective. This diversity enriches each agent's experiences, improving its adaptability to unseen crowd behaviors. In assessing an agent's robustness against unseen crowds, we propose diverse scenarios inspired by pedestrian crowd behaviors. Our behavior-conditioned policies outperform existing works in these challenging scenes, reducing potential collisions without additional time or travel. △ Less

Submitted 16 October, 2024; originally announced October 2024.

arXiv:2410.09816 [pdf]

Silicon modulator exceeding 110 GHz using tunable time-frequency equalization

Authors: Hengsong Yue, Jianbin Fu, Hengwei Zhang, Bo Xiong, Shilong Pan, Tao Chu

Abstract: Silicon modulators have garnered considerable attention owing to their potential applications in high-density integration and high-speed modulation. However, they are increasingly challenged by the limited 3 dB bandwidth as the demand for modulation speed in optical communications continues to rise, impeding their ability to compete with modulators made of thin-film lithium niobate. This bandwidth… ▽ More Silicon modulators have garnered considerable attention owing to their potential applications in high-density integration and high-speed modulation. However, they are increasingly challenged by the limited 3 dB bandwidth as the demand for modulation speed in optical communications continues to rise, impeding their ability to compete with modulators made of thin-film lithium niobate. This bandwidth limitation arises because of the parasitic resistance and capacitance in the PN junction of the silicon modulators. This study demonstrates the first silicon modulator exceeding 110 GHz without any resonant structure using a tunable time-frequency equalization technique. This substantial breakthrough enables on-off keying modulation at a rate of 140 Gbaud without digital signal processing. These accomplishments represent the highest bandwidth and maximum baud rate achieved without digital signal processing in an all-silicon modulator, reaching the testing limitations of the experimental system. This opens the possibility of attaining modulation rates of up to 200 or even 300 Gbaud by adopting design strategies such as slow light and technologies such as digital signal processing. This advancement extends the speed capabilities of silicon modulators to the level of thin-film lithium niobate modulators, thereby promoting their application in the broader array of fields, such as linear-drive pluggable transceivers. △ Less

Submitted 13 October, 2024; originally announced October 2024.

arXiv:2410.08522 [pdf, other]

Evaluating the effects of Data Sparsity on the Link-level Bicycling Volume Estimation: A Graph Convolutional Neural Network Approach

Authors: Mohit Gupta, Debjit Bhowmick, Meead Saberi, Shirui Pan, Ben Beck

Abstract: Accurate bicycling volume estimation is crucial for making informed decisions and planning about future investments in bicycling infrastructure. However, traditional link-level volume estimation models are effective for motorized traffic but face significant challenges when applied to the bicycling context because of sparse data and the intricate nature of bicycling mobility patterns. To the best… ▽ More Accurate bicycling volume estimation is crucial for making informed decisions and planning about future investments in bicycling infrastructure. However, traditional link-level volume estimation models are effective for motorized traffic but face significant challenges when applied to the bicycling context because of sparse data and the intricate nature of bicycling mobility patterns. To the best of our knowledge, we present the first study to utilize a Graph Convolutional Network (GCN) architecture to model link-level bicycling volumes and systematically investigate the impact of varying levels of data sparsity (0%--99%) on model performance, simulating real-world scenarios. We have leveraged Strava Metro data as the primary source of bicycling counts across 15,933 road segments/links in the City of Melbourne, Australia. To evaluate the effectiveness of the GCN model, we benchmark it against traditional machine learning models, such as linear regression, support vector machines, and random forest. Our results show that the GCN model outperforms these traditional models in predicting Annual Average Daily Bicycle (AADB) counts, demonstrating its ability to capture the spatial dependencies inherent in bicycle traffic networks. While GCN remains robust up to 80% sparsity, its performance declines sharply beyond this threshold, highlighting the challenges of extreme data sparsity. These findings underscore the potential of GCNs in enhancing bicycling volume estimation, while also emphasizing the need for further research on methods to improve model resilience under high-sparsity conditions. Our findings offer valuable insights for city planners aiming to improve bicycling infrastructure and promote sustainable transportation. △ Less

Submitted 27 March, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

arXiv:2410.00282 [pdf]

Smart Contract Vulnerability Detection based on Static Analysis and Multi-Objective Search

Authors: Dongcheng Li, W. Eric Wong, Xiaodan Wang, Sean Pan, Liang-Seng Koh

Abstract: This paper introduces a method for detecting vulnerabilities in smart contracts using static analysis and a multi-objective optimization algorithm. We focus on four types of vulnerabilities: reentrancy, call stack overflow, integer overflow, and timestamp dependencies. Initially, smart contracts are compiled into an abstract syntax tree to analyze relationships between contracts and functions, inc… ▽ More This paper introduces a method for detecting vulnerabilities in smart contracts using static analysis and a multi-objective optimization algorithm. We focus on four types of vulnerabilities: reentrancy, call stack overflow, integer overflow, and timestamp dependencies. Initially, smart contracts are compiled into an abstract syntax tree to analyze relationships between contracts and functions, including calls, inheritance, and data flow. These analyses are transformed into static evaluations and intermediate representations that reveal internal relations. Based on these representations, we examine contract's functions, variables, and data dependencies to detect the specified vulnerabilities. To enhance detection accuracy and coverage, we apply a multi-objective optimization algorithm to the static analysis process. This involves assigning initial numeric values to input data and monitoring changes in statement coverage and detection accuracy. Using coverage and accuracy as fitness values, we calculate Pareto front and crowding distance values to select the best individuals for the new parent population, iterating until optimization criteria are met. We validate our approach using an open-source dataset collected from Etherscan, containing 6,693 smart contracts. Experimental results show that our method outperforms state-of-the-art tools in terms of coverage, accuracy, efficiency, and effectiveness in detecting the targeted vulnerabilities. △ Less

Submitted 30 September, 2024; originally announced October 2024.

arXiv:2409.19938 [pdf, other]

Quantum-inspired Beamforming Optimization for Quantized Phase-only Massive MIMO Arrays

Authors: Yutong Jiang, Hangyu Ge, Bi-Ying Wang, Shuai S. A. Yuan, Shi-Jie Pan, Hongjing Xu, Xiaopeng Cui, Man-Hong Yung, Feng Liu, Wei E. I. Sha

Abstract: This paper introduces an innovative quantum-inspired method for beamforming (BF) optimization in multiple-input multiple-output (MIMO) arrays. The method leverages the simulated bifurcation (SB) algorithm to address the complex combinatorial optimization problem due to the quantized phase configuration. We propose novel encoding techniques for high-bit phase quantization, which are then mapped int… ▽ More This paper introduces an innovative quantum-inspired method for beamforming (BF) optimization in multiple-input multiple-output (MIMO) arrays. The method leverages the simulated bifurcation (SB) algorithm to address the complex combinatorial optimization problem due to the quantized phase configuration. We propose novel encoding techniques for high-bit phase quantization, which are then mapped into Ising spins. This enables efficient construction of the Hamiltonians and subsequent optimization of BF patterns. The results clearly demonstrate that the SB optimizer surpasses traditional schemes such as digital BF, holographic algorithms and genetic algorithms, offering faster convergence and higher solution quality. The impressive capability of the SB optimizer to handle complex BF scenarios, including sidelobe suppression and multiple beams with nulls, is undoubtedly demonstrated through several application cases. These findings strongly suggest that quantum-inspired methods have great potential to advance MIMO techniques in next-generation wireless communication. △ Less

Submitted 28 May, 2025; v1 submitted 30 September, 2024; originally announced September 2024.

arXiv:2409.18288 [pdf, other]

The track-length extension fitting algorithm for energy measurement of interacting particles in liquid argon TPCs and its performance with ProtoDUNE-SP data

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, N. S. Alex, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, C. Andreopoulos , et al. (1348 additional authors not shown)

Abstract: This paper introduces a novel track-length extension fitting algorithm for measuring the kinetic energies of inelastically interacting particles in liquid argon time projection chambers (LArTPCs). The algorithm finds the most probable offset in track length for a track-like object by comparing the measured ionization density as a function of position with a theoretical prediction of the energy los… ▽ More This paper introduces a novel track-length extension fitting algorithm for measuring the kinetic energies of inelastically interacting particles in liquid argon time projection chambers (LArTPCs). The algorithm finds the most probable offset in track length for a track-like object by comparing the measured ionization density as a function of position with a theoretical prediction of the energy loss as a function of the energy, including models of electron recombination and detector response. The algorithm can be used to measure the energies of particles that interact before they stop, such as charged pions that are absorbed by argon nuclei. The algorithm's energy measurement resolutions and fractional biases are presented as functions of particle kinetic energy and number of track hits using samples of stopping secondary charged pions in data collected by the ProtoDUNE-SP detector, and also in a detailed simulation. Additional studies describe the impact of the dE/dx model on energy measurement performance. The method described in this paper to characterize the energy measurement performance can be repeated in any LArTPC experiment using stopping secondary charged pions. △ Less

Submitted 26 December, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

Report number: FERMILAB-PUB-24-0561-LBNF-PPD, CERN-EP-2024-256

arXiv:2409.17389 [pdf, other]

Safe Leaf Manipulation for Accurate Shape and Pose Estimation of Occluded Fruits

Authors: Shaoxiong Yao, Sicong Pan, Maren Bennewitz, Kris Hauser

Abstract: Fruit monitoring plays an important role in crop management, and rising global fruit consumption combined with labor shortages necessitates automated monitoring with robots. However, occlusions from plant foliage often hinder accurate shape and pose estimation. Therefore, we propose an active fruit shape and pose estimation method that physically manipulates occluding leaves to reveal hidden fruit… ▽ More Fruit monitoring plays an important role in crop management, and rising global fruit consumption combined with labor shortages necessitates automated monitoring with robots. However, occlusions from plant foliage often hinder accurate shape and pose estimation. Therefore, we propose an active fruit shape and pose estimation method that physically manipulates occluding leaves to reveal hidden fruits. This paper introduces a framework that plans robot actions to maximize visibility and minimize leaf damage. We developed a novel scene-consistent shape completion technique to improve fruit estimation under heavy occlusion and utilize a perception-driven deformation graph model to predict leaf deformation during planning. Experiments on artificial and real sweet pepper plants demonstrate that our method enables robots to safely move leaves aside, exposing fruits for accurate shape and pose estimation, outperforming baseline methods. Project page: https://shaoxiongyao.github.io/lmap-ssc/. △ Less

Submitted 24 February, 2025; v1 submitted 25 September, 2024; originally announced September 2024.

Comments: Shaoxiong Yao and Sicong Pan have equal contributions. Publication to appear in IEEE International Conference on Robotics and Automation (ICRA), 2025

arXiv:2409.14855 [pdf]

doi 10.1063/5.0240434

Nonlinear field dependence of Hall effect and high-mobility multi-carrier transport in an altermagnet CrSb

Authors: Yuqing Bai, Xinji Xiang, Shuang Pan, Shichao Zhang, Haifeng Chen Xi Chen, Zhida Han, Guizhou Xu, Feng Xu

Abstract: As a promising candidate for altermagnet, CrSb possesses a distinctive compensated spin split band structure that could bring groundbreaking concepts to the field of spintronics. In this work, we have grown high-quality CrSb single crystals and comprehensively investigated their electronic and magneto-transport properties. We have observed large, positive, and non-saturated magnetoresistance (MR)… ▽ More As a promising candidate for altermagnet, CrSb possesses a distinctive compensated spin split band structure that could bring groundbreaking concepts to the field of spintronics. In this work, we have grown high-quality CrSb single crystals and comprehensively investigated their electronic and magneto-transport properties. We have observed large, positive, and non-saturated magnetoresistance (MR) in CrSb, which well obeys Kohler's rule, indicating its classic Lorentz scattering origins. Remarkably, a nonlinear magnetic field dependence of Hall effect resembling the spontaneous anomalous Hall is identified over a wide temperature range. After careful analysis of the transport data, we conclude the non-linearity mainly stems from the incorporation of different carriers in the magnetoconductivity. According to the Fermi surface analyses of CrSb, we applied the three-carrier model to fit the conductivity data, yielding good agreement. The extracted carrier concentration and mobility indicates that CrSb behaves more like a semimetal, with the highest mobility reaching 3*103 cm2V-1s-1. Furthermore, calculations using the semiclassical Boltzmann transport theory have successfully reproduced the main features of the experimental MR and Hall effect in CrSb. These exceptional transport properties make CrSb unique for applications in spintronics as an altermagnet. △ Less

Submitted 9 February, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

Journal ref: Appl. Phys. Lett. 126, 042402 (2025)

arXiv:2409.13343 [pdf, ps, other]

"I Don't Use AI for Everything": Exploring Utility, Attitude, and Responsibility of AI-empowered Tools in Software Development

Authors: Shidong Pan, Litian Wang, Tianyi Zhang, Zhenchang Xing, Yanjie Zhao, Qinghua Lu, Xiaoyu Sun

Abstract: AI-empowered tools have emerged as a transformative force, fundamentally reshaping the software development industry and promising far-reaching impacts across diverse sectors. This study investigates the adoption, impact, and security considerations of AI-empowered tools in the software development process. Through semi-structured interviews with 19 software practitioners from diverse backgrounds,… ▽ More AI-empowered tools have emerged as a transformative force, fundamentally reshaping the software development industry and promising far-reaching impacts across diverse sectors. This study investigates the adoption, impact, and security considerations of AI-empowered tools in the software development process. Through semi-structured interviews with 19 software practitioners from diverse backgrounds, we explore three key aspects: the utility of AI tools, developers' attitudes towards them, and security and privacy responsibilities. Our findings reveal widespread adoption of AI tools across various stages of software development. Developers generally express positive attitudes towards AI, viewing it as an efficiency-enhancing assistant rather than a job replacement threat. However, they also recognized limitations in AI's ability to handle complex, unfamiliar, or highly specialized tasks in software development. Regarding security and privacy, we found varying levels of risk awareness among developers, with larger companies implementing more comprehensive risk management strategies. Our study provides insights into the current state of AI adoption in software development and offers recommendations for practitioners, organizations, AI providers, and regulatory bodies to effectively navigate the integration of AI in the software industry. △ Less

Submitted 21 November, 2024; v1 submitted 20 September, 2024; originally announced September 2024.

Comments: Compared to the previous version, we remove the MathJax format in the title, as the Google Scholar cannot correctly recognise it

arXiv:2409.12207 [pdf, other]

Architectural Co-LOD Generation

Authors: Runze Zhang, Shanshan Pan, Chenlei Lv, Minglun Gong, Hui Huang

Abstract: Managing the level-of-detail (LOD) in architectural models is crucial yet challenging, particularly for effective representation and visualization of buildings. Traditional approaches often fail to deliver controllable detail alongside semantic consistency, especially when dealing with noisy and inconsistent inputs. We address these limitations with \emph{Co-LOD}, a new approach specifically desig… ▽ More Managing the level-of-detail (LOD) in architectural models is crucial yet challenging, particularly for effective representation and visualization of buildings. Traditional approaches often fail to deliver controllable detail alongside semantic consistency, especially when dealing with noisy and inconsistent inputs. We address these limitations with \emph{Co-LOD}, a new approach specifically designed for effective LOD management in architectural modeling. Co-LOD employs shape co-analysis to standardize geometric structures across multiple buildings, facilitating the progressive and consistent generation of LODs. This method allows for precise detailing in both individual models and model collections, ensuring semantic integrity. Extensive experiments demonstrate that Co-LOD effectively applies accurate LOD across a variety of architectural inputs, consistently delivering superior detail and quality in LOD representations. △ Less

Submitted 17 September, 2024; originally announced September 2024.

Comments: ACM Transactions on Graphics (SIGGRAPH Aisa 2024); Project page: https://vcc.tech/research/2024/CoLOD

arXiv:2409.09253 [pdf, other]

Unleash LLMs Potential for Recommendation by Coordinating Twin-Tower Dynamic Semantic Token Generator

Authors: Jun Yin, Zhengxin Zeng, Mingzheng Li, Hao Yan, Chaozhuo Li, Weihao Han, Jianjin Zhang, Ruochen Liu, Allen Sun, Denvy Deng, Feng Sun, Qi Zhang, Shirui Pan, Senzhang Wang

Abstract: Owing to the unprecedented capability in semantic understanding and logical reasoning, the pre-trained large language models (LLMs) have shown fantastic potential in developing the next-generation recommender systems (RSs). However, the static index paradigm adopted by current methods greatly restricts the utilization of LLMs capacity for recommendation, leading to not only the insufficient alignm… ▽ More Owing to the unprecedented capability in semantic understanding and logical reasoning, the pre-trained large language models (LLMs) have shown fantastic potential in developing the next-generation recommender systems (RSs). However, the static index paradigm adopted by current methods greatly restricts the utilization of LLMs capacity for recommendation, leading to not only the insufficient alignment between semantic and collaborative knowledge, but also the neglect of high-order user-item interaction patterns. In this paper, we propose Twin-Tower Dynamic Semantic Recommender (TTDS), the first generative RS which adopts dynamic semantic index paradigm, targeting at resolving the above problems simultaneously. To be more specific, we for the first time contrive a dynamic knowledge fusion framework which integrates a twin-tower semantic token generator into the LLM-based recommender, hierarchically allocating meaningful semantic index for items and users, and accordingly predicting the semantic index of target item. Furthermore, a dual-modality variational auto-encoder is proposed to facilitate multi-grained alignment between semantic and collaborative knowledge. Eventually, a series of novel tuning tasks specially customized for capturing high-order user-item interaction patterns are proposed to take advantages of user historical behavior. Extensive experiments across three public datasets demonstrate the superiority of the proposed methodology in developing LLM-based generative RSs. The proposed TTDS recommender achieves an average improvement of 19.41% in Hit-Rate and 20.84% in NDCG metric, compared with the leading baseline methods. △ Less

Submitted 13 September, 2024; originally announced September 2024.

arXiv:2409.04481 [pdf, other]

Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials

Authors: Yizhen Zheng, Huan Yee Koh, Maddie Yang, Li Li, Lauren T. May, Geoffrey I. Webb, Shirui Pan, George Church

Abstract: The integration of Large Language Models (LLMs) into the drug discovery and development field marks a significant paradigm shift, offering novel methodologies for understanding disease mechanisms, facilitating drug discovery, and optimizing clinical trial processes. This review highlights the expanding role of LLMs in revolutionizing various stages of the drug development pipeline. We investigate… ▽ More The integration of Large Language Models (LLMs) into the drug discovery and development field marks a significant paradigm shift, offering novel methodologies for understanding disease mechanisms, facilitating drug discovery, and optimizing clinical trial processes. This review highlights the expanding role of LLMs in revolutionizing various stages of the drug development pipeline. We investigate how these advanced computational models can uncover target-disease linkage, interpret complex biomedical data, enhance drug molecule design, predict drug efficacy and safety profiles, and facilitate clinical trial processes. Our paper aims to provide a comprehensive overview for researchers and practitioners in computational biology, pharmacology, and AI4Science by offering insights into the potential transformative impact of LLMs on drug discovery and development. △ Less

Submitted 5 September, 2024; originally announced September 2024.

arXiv:2408.15474 [pdf, other]

Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation

Authors: Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, Jie Ding, Lei Xie

Abstract: Rap, a prominent genre of vocal performance, remains underexplored in vocal generation. General vocal synthesis depends on precise note and duration inputs, requiring users to have related musical knowledge, which limits flexibility. In contrast, rap typically features simpler melodies, with a core focus on a strong rhythmic sense that harmonizes with accompanying beats. In this paper, we propose… ▽ More Rap, a prominent genre of vocal performance, remains underexplored in vocal generation. General vocal synthesis depends on precise note and duration inputs, requiring users to have related musical knowledge, which limits flexibility. In contrast, rap typically features simpler melodies, with a core focus on a strong rhythmic sense that harmonizes with accompanying beats. In this paper, we propose Freestyler, the first system that generates rapping vocals directly from lyrics and accompaniment inputs. Freestyler utilizes language model-based token generation, followed by a conditional flow matching model to produce spectrograms and a neural vocoder to restore audio. It allows a 3-second prompt to enable zero-shot timbre control. Due to the scarcity of publicly available rap datasets, we also present RapBank, a rap song dataset collected from the internet, alongside a meticulously designed processing pipeline. Experimental results show that Freestyler produces high-quality rapping voice generation with enhanced naturalness and strong alignment with accompanying beats, both stylistically and rhythmically. △ Less

Submitted 27 August, 2024; originally announced August 2024.

arXiv:2408.12725 [pdf, other]

DUNE Phase II: Scientific Opportunities, Detector Concepts, Technological Solutions

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, C. Andreopoulos, M. Andreotti , et al. (1347 additional authors not shown)

Abstract: The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy toward the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I… ▽ More The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy toward the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and Phase II, as did the European Strategy for Particle Physics. While the construction of the DUNE Phase I is well underway, this White Paper focuses on DUNE Phase II planning. DUNE Phase-II consists of a third and fourth far detector (FD) module, an upgraded near detector complex, and an enhanced 2.1 MW beam. The fourth FD module is conceived as a "Module of Opportunity", aimed at expanding the physics opportunities, in addition to supporting the core DUNE science program, with more advanced technologies. This document highlights the increased science opportunities offered by the DUNE Phase II near and far detectors, including long-baseline neutrino oscillation physics, neutrino astrophysics, and physics beyond the standard model. It describes the DUNE Phase II near and far detector technologies and detector design concepts that are currently under consideration. A summary of key R&D goals and prototyping phases needed to realize the Phase II detector technical designs is also provided. DUNE's Phase II detectors, along with the increased beam power, will complete the full scope of DUNE, enabling a multi-decadal program of groundbreaking science with neutrinos. △ Less

Submitted 22 August, 2024; originally announced August 2024.

Report number: FERMILAB-TM-2833-LBNF

arXiv:2408.11554 [pdf, other]

Differentiating Choices via Commonality for Multiple-Choice Question Answering

Authors: Wenqing Deng, Zhe Wang, Kewen Wang, Shirui Pan, Xiaowang Zhang, Zhiyong Feng

Abstract: Multiple-choice question answering (MCQA) becomes particularly challenging when all choices are relevant to the question and are semantically similar. Yet this setting of MCQA can potentially provide valuable clues for choosing the right answer. Existing models often rank each choice separately, overlooking the context provided by other choices. Specifically, they fail to leverage the semantic com… ▽ More Multiple-choice question answering (MCQA) becomes particularly challenging when all choices are relevant to the question and are semantically similar. Yet this setting of MCQA can potentially provide valuable clues for choosing the right answer. Existing models often rank each choice separately, overlooking the context provided by other choices. Specifically, they fail to leverage the semantic commonalities and nuances among the choices for reasoning. In this paper, we propose a novel MCQA model by differentiating choices through identifying and eliminating their commonality, called DCQA. Our model captures token-level attention of each choice to the question, and separates tokens of the question attended to by all the choices (i.e., commonalities) from those by individual choices (i.e., nuances). Using the nuances as refined contexts for the choices, our model can effectively differentiate choices with subtle differences and provide justifications for choosing the correct answer. We conduct comprehensive experiments across five commonly used MCQA benchmarks, demonstrating that DCQA consistently outperforms baseline models. Furthermore, our case study illustrates the effectiveness of the approach in directing the attention of the model to more differentiating features. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: 9 pages, accepted to ECAI 2024

arXiv:2408.11468 [pdf, ps, other]

doi 10.15302/frontphys.2025.014207

Electric-field-tunable topological phases in valley-polarized quantum anomalous Hall systems with inequivalent exchange fields

Authors: Shiyao Pan, Zeyu Li, Yulei Han

Abstract: Incorporating valley as a degree of freedom into quantum anomalous Hall systems offers a novel approach to manipulating valleytronics in electronic transport. Using the Kane-Mele monolayer as a concrete model, we comprehensively explore the various topological phases in the presence of inequivalent exchange fields and reveal the roles of the interfacial Rashba effect and external electric field in… ▽ More Incorporating valley as a degree of freedom into quantum anomalous Hall systems offers a novel approach to manipulating valleytronics in electronic transport. Using the Kane-Mele monolayer as a concrete model, we comprehensively explore the various topological phases in the presence of inequivalent exchange fields and reveal the roles of the interfacial Rashba effect and external electric field in tuning topological valley-polarized states. We find that valley-polarized states can be realized by introducing Kane-Mele spin-orbit coupling and inequivalent exchange fields. Further introducing Rashba spin-orbit coupling and an electric field into the system can lead to diverse topological states, such as the valley-polarized quantum anomalous Hall effect with $\mathcal{C}=~\pm 1,\pm 2$ and valley-contrasting states with $\mathcal{C}=0$. Remarkably, different valley-polarized topological states can be continuously tuned by varying the strength and direction of the external electric field in a fixed system. Our work demonstrates the tunability of topological states in valley-polarized quantum anomalous Hall systems and provides an ideal platform for applications in electronic transport devices in topological valleytronics. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Journal ref: Frontiers of Physics, 20, 14207 (2025)

arXiv:2408.10223 [pdf]

High order compact fully-discrete scheme for hyperbolic conversation laws

Authors: Tong Zhou, Haitao Dong, Shucheng Pan

Abstract: Based on the solution formula method, a series of one-step fully-discrete schemes, such as FWENO/Full-WENO has been proposed. Storing the by-products conservative variables at the half points (grid center) and using them as interpolation information to construct high-order schemes, we obtain a new class of one-step compact fully-discrete schemes. The new scheme can be associate with various non-os… ▽ More Based on the solution formula method, a series of one-step fully-discrete schemes, such as FWENO/Full-WENO has been proposed. Storing the by-products conservative variables at the half points (grid center) and using them as interpolation information to construct high-order schemes, we obtain a new class of one-step compact fully-discrete schemes. The new scheme can be associate with various non-oscillatory strategies. This paper takes state-of-the-art WENO-JS method as an example and proposes a family of compact fully-discrete WENO scheme. Detailed analysis is conducted on accuracy, errors, computational cost, efficiency and their connection with Hermite interpolation. Meanwhile, we design a new entropy flux linearization strategy for Euler equations to enhance its robustness, and also develop a multi-dimensional method for this compact fully-discrete framework. Due to the new scheme is one-step and utilizes stored by-products information for interpolation, it has a significant advantage in efficiency. For one-dimensional Euler equations, compared to the original FWENO, the computational cost only increases by 20-40%, while is approximately one-third of WENO+RK3. For two-dimensional case, a new special dimension-by-dimension strategy is applied. Although there is an additional computing cost, numerical experiments show that the new scheme only needs about 1/10 to 1/13 cost of that for WENO+RK3 when obtaining similar or even better resolution results, indicating that the new scheme is more efficient than semi-discrete schemes based on RK methods. △ Less

Submitted 31 July, 2024; originally announced August 2024.

arXiv:2408.09106 [pdf, other]

Fragment-Masked Diffusion for Molecular Optimization

Authors: Kun Li, Xiantao Cai, Jia Wu, Shirui Pan, Huiting Xu, Bo Du, Wenbin Hu

Abstract: Molecular optimization is a crucial aspect of drug discovery, aimed at refining molecular structures to enhance drug efficacy and minimize side effects, ultimately accelerating the overall drug development process. Many molecular optimization methods have been proposed, significantly advancing drug discovery. These methods primarily on understanding the specific drug target structures or their hyp… ▽ More Molecular optimization is a crucial aspect of drug discovery, aimed at refining molecular structures to enhance drug efficacy and minimize side effects, ultimately accelerating the overall drug development process. Many molecular optimization methods have been proposed, significantly advancing drug discovery. These methods primarily on understanding the specific drug target structures or their hypothesized roles in combating diseases. However, challenges such as a limited number of available targets and a difficulty capturing clear structures hinder innovative drug development. In contrast, phenotypic drug discovery (PDD) does not depend on clear target structures and can identify hits with novel and unbiased polypharmacology signatures. As a result, PDD-based molecular optimization can reduce potential safety risks while optimizing phenotypic activity, thereby increasing the likelihood of clinical success. Therefore, we propose a fragment-masked molecular optimization method based on PDD (FMOP). FMOP employs a regression-free diffusion model to conditionally optimize the molecular masked regions, effectively generating new molecules with similar scaffolds. On the large-scale drug response dataset GDSCv2, we optimize the potential molecules across all 985 cell lines. The overall experiments demonstrate that the in-silico optimization success rate reaches 95.4\%, with an average efficacy increase of 7.5\%. Additionally, we conduct extensive ablation and visualization experiments, confirming that FMOP is an effective and robust molecular optimization method. The code is available at: https://anonymous.4open.science/r/FMOP-98C2. △ Less

Submitted 14 May, 2025; v1 submitted 17 August, 2024; originally announced August 2024.

Comments: 12 pages, 9 figures, 4 tables

arXiv:2408.07505 [pdf, other]

Large Language Models Know What Makes Exemplary Contexts

Authors: Quanyu Long, Jianda Chen, Wenya Wang, Sinno Jialin Pan

Abstract: In-context learning (ICL) has proven to be a significant capability with the advancement of Large Language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without needing to update millions of parameters. This paper presents a unified framework for LLMs that allows them to self-select influential in-context examples to com… ▽ More In-context learning (ICL) has proven to be a significant capability with the advancement of Large Language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without needing to update millions of parameters. This paper presents a unified framework for LLMs that allows them to self-select influential in-context examples to compose their contexts; self-rank candidates with different demonstration compositions; self-optimize the demonstration selection and ordering through reinforcement learning. Specifically, our method designs a parameter-efficient retrieval head that generates the optimized demonstration after training with rewards from LLM's own preference. Experimental results validate the proposed method's effectiveness in enhancing ICL performance. Additionally, our approach effectively identifies and selects the most representative examples for the current task, and includes more diversity in retrieval. △ Less

Submitted 20 August, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

Comments: 12 pages, 3 figures

arXiv:2408.06966 [pdf, other]

DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs

Authors: Dongyuan Li, Shiyin Tan, Ying Zhang, Ming Jin, Shirui Pan, Manabu Okumura, Renhe Jiang

Abstract: Dynamic graph learning aims to uncover evolutionary laws in real-world systems, enabling accurate social recommendation (link prediction) or early detection of cancer cells (classification). Inspired by the success of state space models, e.g., Mamba, for efficiently capturing long-term dependencies in language modeling, we propose DyG-Mamba, a new continuous state space model (SSM) for dynamic gra… ▽ More Dynamic graph learning aims to uncover evolutionary laws in real-world systems, enabling accurate social recommendation (link prediction) or early detection of cancer cells (classification). Inspired by the success of state space models, e.g., Mamba, for efficiently capturing long-term dependencies in language modeling, we propose DyG-Mamba, a new continuous state space model (SSM) for dynamic graph learning. Specifically, we first found that using inputs as control signals for SSM is not suitable for continuous-time dynamic network data with irregular sampling intervals, resulting in models being insensitive to time information and lacking generalization properties. Drawing inspiration from the Ebbinghaus forgetting curve, which suggests that memory of past events is strongly correlated with time intervals rather than specific details of the events themselves, we directly utilize irregular time spans as control signals for SSM to achieve significant robustness and generalization. Through exhaustive experiments on 12 datasets for dynamic link prediction and dynamic node classification tasks, we found that DyG-Mamba achieves state-of-the-art performance on most of the datasets, while also demonstrating significantly improved computation and memory efficiency. △ Less

Submitted 13 August, 2024; originally announced August 2024.

arXiv:2408.06607 [pdf, other]

Learning Noise-Robust Stable Koopman Operator for Control with Hankel DMD

Authors: Shahriar Akbar Sakib, Shaowu Pan

Abstract: We propose a noise-robust learning framework for the Koopman operator of nonlinear dynamical systems, with guaranteed long-term stability and improved model performance for better model-based predictive control tasks. Unlike some existing approaches that rely on ad hoc observables or black-box neural networks in extended dynamic mode decomposition (EDMD), our framework leverages observables genera… ▽ More We propose a noise-robust learning framework for the Koopman operator of nonlinear dynamical systems, with guaranteed long-term stability and improved model performance for better model-based predictive control tasks. Unlike some existing approaches that rely on ad hoc observables or black-box neural networks in extended dynamic mode decomposition (EDMD), our framework leverages observables generated by the system dynamics, when the system dynamics is known, through a Hankel matrix, which shares similarities with discrete Polyflow. When system dynamics is unknown, we approximate them with a neural network while maintaining structural similarities to discrete Polyflow. To enhance noise robustness and ensure long-term stability, we developed a stable parameterization of the Koopman operator, along with a progressive learning strategy for rollout loss. To further improve the performance of the model in the phase space, a simple iterative data augmentation strategy was developed. Numerical experiments of prediction and control of classic nonlinear systems with ablation study showed the effectiveness of the proposed techniques over several state-of-the-art practices. △ Less

Submitted 21 March, 2025; v1 submitted 12 August, 2024; originally announced August 2024.

Comments: 15 pages

MSC Class: 37N35; 93B45 ACM Class: J.2

arXiv:2408.03508 [pdf]

SemiEpi: Self-driving, Closed-loop Multi-Step Growth of Semiconductor Heterostructures Guided by Machine Learning

Authors: Chao Shen, Wenkang Zhan, Kaiyao Xin, Shujie Pan, Xiaotian Cheng, Ruixiang Liu, Zhe Feng, Chaoyuan Jin, Hui Cong, Chi Xu, Bo Xu, Tien Khee Ng, Siming Chen, Chunlai Xue, Zhanguo Wang, Chao Zhao

Abstract: The semiconductor industry has prioritized automating repetitive tasks through closed-loop, self-driving experimentation, accelerating the optimization of complex multi-step processes. The emergence of machine learning (ML) has ushered in self-driving processes with minimal human intervention. This work introduces SemiEpi, a self-driving platform designed to execute molecular beam epitaxy (MBE) gr… ▽ More The semiconductor industry has prioritized automating repetitive tasks through closed-loop, self-driving experimentation, accelerating the optimization of complex multi-step processes. The emergence of machine learning (ML) has ushered in self-driving processes with minimal human intervention. This work introduces SemiEpi, a self-driving platform designed to execute molecular beam epitaxy (MBE) growth of semiconductor heterostructures through multi-step processes, in-situ monitoring, and on-the-fly feedback control. By integrating standard reactor, parameter initialization, and multiple ML models, SemiEpi identifies optimal initial conditions and proposes experiments for multi-step heterostructure growth, eliminating the need for extensive expertise in MBE processes. SemiEpi initializes material growth parameters tailored to specific material characteristics, and fine-tuned control over the growth process is then achieved through ML optimization. We optimize the growth for InAs quantum dots (QDs) heterostructures to showcase the power of SemiEpi, achieving a QD density of 5E10/cm2, 1.6-fold increased photoluminescence (PL) intensity and reduced full width at half maximum (FWHM) of 29.13 meV. This work highlights the potential of closed-loop, ML-guided systems to address challenges in multi-step growth. Our method is critical to achieve repeatable materials growth using commercially scalable tools. Furthermore, our strategy facilitates developing a hardware-independent process and enhancing process repeatability and stability, even without exhaustive knowledge of growth parameters. △ Less

Submitted 5 January, 2025; v1 submitted 6 August, 2024; originally announced August 2024.

Comments: 5 figures

Showing 101–150 of 868 results for author: Pan, S